615 Commits

Author SHA1 Message Date
orangeclk
317ae9233e
feat: add json response format for siliconflow models (#10657) 2024-11-14 08:58:22 +08:00
xiandan-erizo
5b8f03cd9d
add abab7-chat-preview model (#10654)
Co-authored-by: xiandan-erizo <xiandan-erizo@outlook.com>
2024-11-13 19:30:42 +08:00
方程
ef8022f715
Gitee AI Qwen2.5-72B model (#10595) 2024-11-12 21:40:32 +08:00
Kevin9703
e03ec0032b
fix: Azure OpenAI o1 max_completion_token error (#10593) 2024-11-12 21:40:13 +08:00
-LAN-
867bf70f1a
fix(model_runtime): ensure compatibility with O1 models by adjusting token parameters (#10537) 2024-11-11 16:06:53 +08:00
Jyong
0c1307b083
add jina rerank http timeout parameter (#10476) 2024-11-11 13:28:11 +08:00
fdb02983rhy
05d43a4074
Fix: Correct the max tokens of Claude-3.5-Sonnet-20241022 for Bedrock and VertexAI (#10508) 2024-11-11 08:41:43 +08:00
larcane97
aa895cfa9b
fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417)
Co-authored-by: moon <moon@vessl.ai>
2024-11-11 08:38:26 +08:00
非法操作
033ab5490b
feat: support LLM understand video (#9828) 2024-11-08 13:22:52 +08:00
Bowen Liang
574c4a264f
chore(lint): Use logging.exception instead of logging.error (#10415) 2024-11-07 21:13:02 +08:00
Matsuda
1e8457441d
fix(model_runtime): remove vision from features for Claude 3.5 Haiku (#10360) 2024-11-06 17:42:18 +08:00
Infinitnet
5a9448245b
fix: remove unsupported vision in OpenRouter Haiku 3.5 (#10364) 2024-11-06 17:41:48 +08:00
Bowen Liang
d45d90e8ae
chore: lazy import sagemaker (#10342) 2024-11-06 12:45:22 +08:00
Infinitnet
bdadca1a65
feat: add support for anthropic/claude-3-5-haiku through OpenRouter (#10331) 2024-11-06 08:26:44 +08:00
非法操作
bf9349c4dc
feat: add xAI model provider (#10272) 2024-11-05 14:42:47 +08:00
Matsuda
4847548779
feat(model_runtime): add new model 'claude-3-5-haiku-20241022' (#10285) 2024-11-05 14:41:39 +08:00
Matsuda
cb245b5435
fix(model_runtime): fix wrong max_tokens for Claude 3.5 Haiku on Amazon Bedrock (#10286) 2024-11-05 14:41:15 +08:00
Matsuda
9305ad2102
feat: support Claude 3.5 Haiku on Amazon Bedrock (#10265) 2024-11-05 09:42:51 +08:00
方程
2aa171c348
Using a dedicated interface to obtain the token credential for the gitee.ai provider (#10243) 2024-11-04 17:22:02 +08:00
Xiao Ley
b28cf68097
chore: enable vision support for models in OpenRouter that should have supported vision (#10191) 2024-11-02 19:45:20 +08:00
Lawrence Li
76b0328eb1
feat: add gpustack model provider (#10158) 2024-11-01 17:23:30 +08:00
larcane97
8d5456b6d0
Add VESSL AI OpenAI API-compatible model provider and LLM model (#9474)
Co-authored-by: moon <moon@vessl.ai>
2024-11-01 13:38:52 +08:00
Coal Pigeon
4d5546953a
add llm: ernie-4.0-turbo-128k of wenxin (#10135)
Co-authored-by: Pigeon姚宏锋 <pigeon.yhf@galaxyoversea.com>
2024-10-31 21:49:04 +08:00
Charlie.Wei
f6fecb957e
fix azure chatgpt o1 parameter error (#10067) 2024-10-30 22:08:56 +08:00
zhuhao
92a3898540
fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 2024-10-30 15:43:29 +08:00
非法操作
12adcf8925
fix: gemini model use some tools raise error (#9993) 2024-10-29 16:09:29 +08:00
方程
0ebd985672
feat: add models for gitee.ai (#9490) 2024-10-28 16:52:12 +08:00
ice yao
22776f24ab
chore: Extract common functions of the base model in Azure OpenAI Provider (#9907) 2024-10-27 11:56:17 +08:00
非法操作
1b5adf40da
fix: moonshot response_format raise error (#9847) 2024-10-25 14:59:55 +08:00
guogeer
70ddc0ce43
openai compatible api usage and id (#9800)
Co-authored-by: jinqi.guo <jinqi.guo@ubtrobot.com>
2024-10-24 21:51:36 +08:00
-LAN-
e11d5ac708
feat(model_runtime): add new model 'claude-3-5-sonnet-20241022' (#9708) 2024-10-23 17:03:30 +08:00
Pan, Wen-Ming
ecc8beef3f
feat: added claude 3.5 sonnet v2 model to Google Cloud Vertex AI (#9688) 2024-10-23 16:13:51 +08:00
ybalbert001
4989d0c904
add bedrock claude 3.5 v2 support (#9685)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-10-23 13:54:21 +08:00
ice yao
1e829ceaf3
chore: format get_customizable_model_schema return value (#9335) 2024-10-21 19:05:44 +08:00
AAEE86
9b32bfb3db
feat: Update tongyi models (#9552) 2024-10-21 19:04:45 +08:00
-LAN-
e61752bd3a
feat/enhance the multi-modal support (#8818) 2024-10-21 10:43:49 +08:00
chzphoenix
42fe208eda
refactor wenxin rerank (#9486)
Co-authored-by: cuihz <cuihz@knowbox.cn>
2024-10-21 09:03:25 +08:00
Ziyu Huang
660fc3bb34
Resolve 9508 openai compatible rerank (#9511) 2024-10-20 21:59:58 +08:00
Tao Wang
b92504bebc
Added Llama 3.2 Vision Models Speech2Text Models for Groq (#9479) 2024-10-18 18:10:33 +08:00
zhuhao
e0846792d2
feat: add yi custom llm integration (#9482) 2024-10-18 17:23:21 +08:00
zhuhao
b3cde9900c
feat: add parameter top-k for the llm model provided by openrouter and siliconflow (#9455) 2024-10-18 08:21:54 +08:00
zhuhao
3fc0ebdd51
feat: add yi-lightning llm model for yi (#9458) 2024-10-18 08:19:58 +08:00
chzphoenix
211f416806
feat:add wenxin rerank (#9431)
Co-authored-by: cuihz <cuihz@knowbox.cn>
Co-authored-by: crazywoola <427733928@qq.com>
2024-10-17 19:18:32 +08:00
zhuhao
b90ad587c2
refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 2024-10-17 19:12:42 +08:00
zhuhao
a45f8969a0
fix: remove the undefined variable line (#9446) 2024-10-17 17:25:14 +08:00
ybalbert001
fdcf87c70c
fix https://github.com/langgenius/dify/issues/9409 (#9433)
Co-authored-by: Yuanbo Li <ybalbert@amazon.com>
2024-10-17 10:47:56 +08:00
ice yao
dd22e78515
fix: Deprecated gemma2-9b model in Fireworks AI Provider (#9373) 2024-10-16 10:44:54 +08:00
crazywoola
423df67042
fix: use gpt-4o-mini for validating credentials (#9387) 2024-10-16 10:18:06 +08:00
非法操作
da25b91980
fix: remove the stream option of zhipu and gemini (#9319) 2024-10-15 19:13:43 +08:00
Jason Tan
9b8aa9b75d
feat: add minimax abab6.5t support (#9365) 2024-10-15 19:00:05 +08:00