chore: update Yi models params (#2895)

This commit is contained in:
Su Yang 2024-03-19 20:54:31 +08:00 committed by GitHub
parent fbbba6db92
commit 4419d357c4
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
3 changed files with 66 additions and 21 deletions

View File

@ -9,18 +9,33 @@ model_properties:
mode: chat mode: chat
context_size: 4096 context_size: 4096
parameter_rules: parameter_rules:
- name: temperature
use_template: temperature
type: float
default: 0.3
min: 0.0
max: 2.0
help:
zh_Hans: 控制生成结果的多样性和随机性。数值越小,越严谨;数值越大,越发散。
en_US: Control the diversity and randomness of generated results. The smaller the value, the more rigorous it is; the larger the value, the more divergent it is.
- name: max_tokens - name: max_tokens
use_template: max_tokens use_template: max_tokens
type: int type: int
default: 512 default: 512
min: 1 min: 1
max: 4096 max: 4000
- name: temperature help:
use_template: temperature zh_Hans: 指定生成结果长度的上限。如果生成结果截断,可以调大该参数。
en_US: Specifies the upper limit on the length of generated results. If the generated results are truncated, you can increase this parameter.
- name: top_p
use_template: top_p
type: float type: float
default: 0.7 default: 0.8
min: 0 min: 0.01
max: 2 max: 1.00
help:
zh_Hans: 控制生成结果的随机性。数值越小随机性越弱数值越大随机性越强。一般而言top_p 和 temperature 两个参数选择一个进行调整即可。
en_US: Control the randomness of generated results. The smaller the value, the weaker the randomness; the larger the value, the stronger the randomness. Generally speaking, you can adjust one of the two parameters top_p and temperature.
pricing: pricing:
input: '0.0025' input: '0.0025'
output: '0.0025' output: '0.0025'

View File

@ -9,18 +9,33 @@ model_properties:
mode: chat mode: chat
context_size: 200000 context_size: 200000
parameter_rules: parameter_rules:
- name: max_tokens
use_template: max_tokens
type: int
default: 1024
min: 1
max: 200000
- name: temperature - name: temperature
use_template: temperature use_template: temperature
type: float type: float
default: 0.7 default: 0.6
min: 0 min: 0.0
max: 2 max: 2.0
help:
zh_Hans: 控制生成结果的多样性和随机性。数值越小,越严谨;数值越大,越发散。
en_US: Control the diversity and randomness of generated results. The smaller the value, the more rigorous it is; the larger the value, the more divergent it is.
- name: max_tokens
use_template: max_tokens
type: int
default: 4096
min: 1
max: 199950
help:
zh_Hans: 指定生成结果长度的上限。如果生成结果截断,可以调大该参数。
en_US: Specifies the upper limit on the length of generated results. If the generated results are truncated, you can increase this parameter.
- name: top_p
use_template: top_p
type: float
default: 0.9
min: 0.01
max: 1.00
help:
zh_Hans: 控制生成结果的随机性。数值越小随机性越弱数值越大随机性越强。一般而言top_p 和 temperature 两个参数选择一个进行调整即可。
en_US: Control the randomness of generated results. The smaller the value, the weaker the randomness; the larger the value, the stronger the randomness. Generally speaking, you can adjust one of the two parameters top_p and temperature.
pricing: pricing:
input: '0.012' input: '0.012'
output: '0.012' output: '0.012'

View File

@ -9,18 +9,33 @@ model_properties:
mode: chat mode: chat
context_size: 4096 context_size: 4096
parameter_rules: parameter_rules:
- name: temperature
use_template: temperature
type: float
default: 0.3
min: 0.0
max: 2.0
help:
zh_Hans: 控制生成结果的多样性和随机性。数值越小,越严谨;数值越大,越发散。
en_US: Control the diversity and randomness of generated results. The smaller the value, the more rigorous it is; the larger the value, the more divergent it is.
- name: max_tokens - name: max_tokens
use_template: max_tokens use_template: max_tokens
type: int type: int
default: 512 default: 512
min: 1 min: 1
max: 4096 max: 4000
- name: temperature help:
use_template: temperature zh_Hans: 指定生成结果长度的上限。如果生成结果截断,可以调大该参数。
en_US: Specifies the upper limit on the length of generated results. If the generated results are truncated, you can increase this parameter.
- name: top_p
use_template: top_p
type: float type: float
default: 0.7 default: 0.8
min: 0 min: 0.01
max: 2 max: 1.00
help:
zh_Hans: 控制生成结果的随机性。数值越小随机性越弱数值越大随机性越强。一般而言top_p 和 temperature 两个参数选择一个进行调整即可。
en_US: Control the randomness of generated results. The smaller the value, the weaker the randomness; the larger the value, the stronger the randomness. Generally speaking, you can adjust one of the two parameters top_p and temperature.
pricing: pricing:
input: '0.01' input: '0.01'
output: '0.03' output: '0.03'