Zixuan Cheng
4fa3d78ed8
Revert "feat : add GPT4.1 in the model providers" ( #19002 )
2025-04-28 18:15:24 +08:00
-LAN-
559ab46ee1
fix: Removes redundant token calculations and updates dependencies
...
Eliminates unnecessary pre-calculation of token limits and recalculation of max tokens
across multiple app runners, simplifying the logic for prompt handling.
Updates tiktoken library from version 0.8.0 to 0.9.0 for improved tokenization performance.
Increases default token limit in TokenBufferMemory to accommodate larger prompt messages.
These changes streamline the token management process and leverage the latest
improvements in the tiktoken library.
Fixes potential token overflow issues and prepares the system for handling larger
inputs more efficiently.
Relates to internal optimization tasks.
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-04-28 15:39:12 +08:00
Zixuan Cheng
144f9507f8
feat : add GPT4.1 in the model providers ( #18912 )
2025-04-27 19:31:20 +08:00
kelvintsim
2e097a1ac0
add bedrock deepseek-r1 ( #18908 )
2025-04-27 19:30:42 +08:00
kelvintsim
024f242251
add bedrock claude-sonnet-3.7 ( #18788 )
2025-04-25 17:35:12 +08:00
kautsar_masuara
b26e20fe34
fix: fix vertex gemini 2.0 flash 001 schema ( #18405 )
...
Co-authored-by: achmad-kautsar <achmad.kautsar@insignia.co.id>
2025-04-19 22:04:13 +08:00
Alexi.F
fe1846c437
fix: change gemini-2.0-flash to validate google api #17082 ( #17115 )
2025-03-30 13:04:12 +08:00
-LAN-
413dfd5628
feat: add completion mode and context size options for LLM configuration ( #13325 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-02-07 15:08:53 +08:00
-LAN-
f9515901cc
fix: Azure AI Foundry model cannot be used in the workflow ( #13323 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-02-07 14:52:57 +08:00
呆萌闷油瓶
3f42fabff8
chore:improve thinking display for llm from xinference and ollama pro… ( #13318 )
2025-02-07 14:29:29 +08:00
-LAN-
1caa578771
chore(*): Update style of thinking ( #13319 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-02-07 14:06:35 +08:00
非法操作
3eb3db0663
chore: refactor the OpenAICompatible and improve thinking display ( #13299 )
2025-02-07 13:28:46 +08:00
sino
6e5c915f96
feat(model): add deepseek-r1 for openrouter ( #13312 )
2025-02-07 12:39:13 +08:00
Riddhimaan-Senapati
2348abe4bf
feat: added a couple of models not defined in vertex ai, that were already … ( #13296 )
2025-02-07 09:11:25 +08:00
呆萌闷油瓶
f7e7a399d9
feat:add think tag display for xinference deepseek r1 ( #13291 )
2025-02-06 22:04:58 +08:00
zhu-an
16865d43a8
feat: add deepseek models for volcengine provider ( #13283 )
...
Co-authored-by: zhaoqingyu.1075 <zhaoqingyu.1075@bytedance.com>
2025-02-06 18:20:03 +08:00
呆萌闷油瓶
0d13aee15c
feat:add deepseek r1 think display for ollama provider ( #13272 )
2025-02-06 15:32:10 +08:00
engchina
40dd63ecef
Upgrade oracle models ( #13174 )
...
Co-authored-by: engchina <atjapan2015@gmail.com>
2025-02-06 13:24:27 +08:00
-LAN-
6d66d6da15
feat(model_providers): Support deepseek-r1 for Nvidia Catalog ( #13269 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-02-06 13:03:19 +08:00
-LAN-
87763fc234
feat(model_providers): Support deepseek for Azure AI Foundry ( #13267 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-02-06 12:45:48 +08:00
JasonVV
f6c44cae2e
feat(model): add gemini-2.0 model ( #13266 )
2025-02-06 12:28:59 +08:00
xhe
da2ee04fce
fix: correct linewrap think display in generic openai api ( #13260 )
...
Signed-off-by: xhe <xw897002528@gmail.com>
2025-02-06 10:53:08 +08:00
JasonVV
7673c36af3
feat(model): add gemini-2.0-flash-thinking-exp-01-21 ( #13230 )
2025-02-06 10:01:00 +08:00
Riddhimaan-Senapati
9457b2af2f
feat: added models :gemini 2.0 flash 001 and gemini 2.0 pro exp 02-05 ( #13247 )
2025-02-06 09:58:39 +08:00
k-zaku
7203991032
feat: add parameter "reasoning_effort" and Openai o3-mini ( #13243 )
2025-02-06 09:29:48 +08:00
xhe
5a685f7156
feat: add think display for volcengine and generic openapi ( #13234 )
...
Signed-off-by: xhe <xw897002528@gmail.com>
2025-02-06 09:24:40 +08:00
Riddhimaan-Senapati
a6a25030ad
fix: updated _position.yaml to include the latest model already integ… ( #13245 )
2025-02-06 09:21:51 +08:00
Riddhimaan-Senapati
00458a31d5
feat: added deepseek r1 and v3 to siliconflow ( #13238 )
2025-02-05 21:59:18 +08:00
-LAN-
c6ddf6d6cc
feat(model_providers): Add Groq DeepSeek-R1-Distill-Llama-70b ( #13229 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-02-05 19:15:29 +08:00
Joshbly
34b21b3065
feat: Add o3-mini and o3-mini-2025-01-31 model variants ( #13129 )
...
Co-authored-by: crazywoola <427733928@qq.com>
2025-02-05 17:04:45 +08:00
-LAN-
59ca44f493
chore(model_runtime): Move deepseek ahead in the providers list. ( #13197 )
...
Signed-off-by: -LAN- <laipz8200@outlook.com>
2025-02-05 16:08:28 +08:00
MaFee921
1a2523fd15
feat: bedrock_endpoint_url ( #12838 )
2025-02-05 12:24:24 +08:00
Kei YAMAZAKI
7452032d81
add azure openai api version 2024-12-01-preview ( #13135 )
2025-02-03 11:04:20 +08:00
非法操作
840729afa5
feat: the think tag display of siliconflow's deepseek r1 ( #13153 )
2025-02-02 21:55:13 +08:00
Yingchun Lai
b09c39c8dc
refactor: avoid to use extra space when finding model by name ( #13043 )
2025-01-30 15:08:29 +08:00
heyszt
b4b09ddc3c
add tongyi qwen2.5-14b/7b-instruct-1m model ( #13089 )
2025-01-29 11:58:01 +08:00
Yingchun Lai
d44882c1b5
refactor: reduce duplciate code by inheritance ( #13073 )
2025-01-28 10:52:01 +08:00
Jason
560c5de1b7
Fixed Novita AI color and added DeepSeek R1 model ( #13074 )
2025-01-28 10:38:54 +08:00
heyszt
6c31ee36cd
fix qwen-vl blocking mode ( #13052 )
2025-01-27 11:35:23 +08:00
Jason
d4be5ef9de
Update Novita AI predefined models ( #13045 )
2025-01-26 09:25:29 +08:00
非法操作
59b3e672aa
feat: add agent thinking content display of deepseek R1 ( #12949 )
2025-01-24 20:13:42 +08:00
IWAI, Masaharu
a2f8bce8f5
chore: add Japanese translation: model_providers/bedrock ( #13016 )
2025-01-24 18:43:33 +08:00
IWAI, Masaharu
28067640b5
fix: wrong zh_Hans translation: Ohio ( #13006 )
2025-01-24 13:41:20 +08:00
lowell
da67916843
feat: add glm-4-air-0111 ( #12997 )
...
Co-authored-by: lowell <lowell.hu@zkteco.in>
2025-01-24 10:04:46 +08:00
sino
d167d5b1be
feat(ark): support doubao 1.5 series of models ( #12935 )
2025-01-22 15:25:57 +08:00
jiandanfeng
e23f4b0265
feat: add gemini-2.0-flash-thinking-exp-01-21 ( #12924 )
2025-01-22 10:14:37 +08:00
luckylhb90
3d1ce4c53f
bug: fixed bedrock rerank bug ( #12774 )
...
Co-authored-by: hobo.l <hobo.l@binance.com>
2025-01-21 19:09:36 +08:00
k-zaku
46e95e8309
fix: OpenAI o1 Bad Request Error ( #12839 )
2025-01-21 15:29:13 +08:00
JasonVV
a7b9375877
Update deepseek model configuration ( #12899 )
2025-01-21 15:28:11 +08:00
JasonVV
9903f1e703
add deepseek-reasoner ( #12898 )
2025-01-21 12:40:58 +08:00