AI/ragflow

mirror of https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git synced 2025-05-24 23:28:47 +08:00

Author	SHA1	Message	Date
Kevin Hu	daddfc9e1b	Remove dup gb2312, solve corrupt error. (#5326 ) ### What problem does this PR solve? #5252 #5325 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-25 12:22:37 +08:00
Kevin Hu	df3d0f61bd	Fix base url missing for deepseek from Tongyi. (#5294 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-24 15:43:32 +08:00
Kevin Hu	ec96426c00	Tongyi adapts deepseek. (#5285 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-24 14:04:25 +08:00
liwenju0	569e40544d	Refactor rerank model with dynamic batch processing and memory management (#5273 ) ### What problem does this PR solve? Issue: https://github.com/infiniflow/ragflow/issues/5262 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-02-24 11:32:08 +08:00
Omar Leonardo Sanchez Granados	4f2816c01c	Add support to boto3 default connection (#5246 ) ### What problem does this PR solve? This pull request includes changes to the initialization logic of the `ChatModel` and `EmbeddingModel` classes to enhance the handling of AWS credentials. Use cases: - Use env variables for credentials instead of managing them on the DB - Easy connection when deploying on an AWS machine ### Type of change - [X] New Feature (non-breaking change which adds functionality)	2025-02-24 11:01:14 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
Kevin Hu	1a755e75c5	Remove v1 (#5220 ) ### What problem does this PR solve? #5201 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 15:15:38 +08:00
saikidev	d2929e432e	Feat: add LLM provider PPIO (#5013 ) ### What problem does this PR solve? Add an LLM provider: PPIO ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2025-02-17 12:03:26 +08:00
Kevin Hu	b08bb56f6c	Display thinking for deepseek r1 (#4904 ) ### What problem does this PR solve? #4903 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-12 15:43:13 +08:00
Kevin Hu	2aa0cdde8f	Fix Gemini chat issue. (#4757 ) ### What problem does this PR solve? #4753 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-07 12:00:19 +08:00
Kyle	036f37a627	fix: err object has no attribute 'iter_lines' (#4686 ) ### What problem does this PR solve? ERROR: 'Stream' object has no attribute 'iter_lines' with reference to Claude/Anthropic chat streams ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Kyle Olmstead <k.olmstead@offensive-security.com>	2025-02-01 22:39:30 +08:00
Kevin Hu	4776fa5e4e	Refactor for total_tokens. (#4652 ) ### What problem does this PR solve? #4567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-26 13:54:26 +08:00
writinwaters	2cb8edc42c	Added GPUStack (#4649 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-01-26 12:25:02 +08:00
Kevin Hu	f1d9f4290e	Fix TogetherAIEmbed. (#4623 ) ### What problem does this PR solve? #4567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-24 10:29:30 +08:00
Kevin Hu	dd0ebbea35	Light GraphRAG (#4585 ) ### What problem does this PR solve? #4543 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-22 19:43:14 +08:00
Kevin Hu	3805621564	Fix xinference rerank issue. (#4499 ) ### What problem does this PR solve? #4495 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-16 11:35:51 +08:00
Kevin Hu	be5f830878	Truncate text for zhipu embedding. (#4490 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-15 14:36:27 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	b93c136797	Fix gemini embedding error. (#4356 ) ### What problem does this PR solve? #4314 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-06 14:41:29 +08:00
Yingfeng	50f209204e	Synchronize with enterprise version (#4325 ) ### Type of change - [x] Refactoring	2025-01-02 13:44:44 +08:00
Jin Hai	4abc144d3d	Fix error of changing embedding model (#4184 ) ### What problem does this PR solve? 1. Changing the embedding model of a knowledge base won't change the default embedding model. 2. Retrieval test bug ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-12-23 16:23:54 +08:00
Kevin Hu	cb45431412	Fix Voyage re-rank model. Limit file name length. (#4171 ) ### What problem does this PR solve? #4152 #4154 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-23 10:03:50 +08:00
Kevin Hu	d8fca43017	Make fast embed and default embed mutually exclusive. (#4121 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-12-19 17:27:09 +08:00
Kevin Hu	7474348394	Fix fastembed reloading issue. (#4117 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-19 16:18:18 +08:00
Kevin Hu	044afa83d1	Fix transformers dependencies for slim. (#3934 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-09 14:21:37 +08:00
Zhichang Yu	0d68a6cd1b	Fix errors detected by Ruff (#3918 ) ### What problem does this PR solve? Fix errors detected by Ruff ### Type of change - [x] Refactoring	2024-12-08 14:21:12 +08:00
Kevin Hu	593ffc4067	Fix HuggingFace model error. (#3870 ) ### What problem does this PR solve? #3865 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 13:28:42 +08:00
Kevin Hu	78601ee1bd	Fix open AI compatible rerank issue. (#3866 ) ### What problem does this PR solve? #3700 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 10:26:21 +08:00
Kevin Hu	3f3469130b	Fix preview issue in file manager. (#3846 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-04 11:53:23 +08:00
Jin Hai	6657ca7cde	Change default error message to English (#3838 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2024-12-04 09:34:49 +08:00
Zhichang Yu	92ab7ef659	Refactor embedding batch_size (#3825 ) ### What problem does this PR solve? Refactor embedding batch_size. Close #3657 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2024-12-03 16:22:39 +08:00
Kevin Hu	6a0583f5ad	Fix voyage embedding. (#3818 ) ### What problem does this PR solve? #3816 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-03 09:33:54 +08:00
Zhichang Yu	d19f059f34	Detect invalid response from api.siliconflow.cn (#3792 ) ### What problem does this PR solve? Detect invalid response from api.siliconflow.cn. Close #2643 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-02 12:55:05 +08:00
devMls	59a5813f1b	add jina new models in jina connector (#3770 ) ### What problem does this PR solve? Add new models to the Jina connector, to allow the use of models that support multiple languages ### Type of change - [X] Other (please describe): new connectors no breaking change	2024-12-02 10:06:39 +08:00
Zhichang Yu	d94386e00a	Pass top_p to ollama (#3744 ) ### What problem does this PR solve? Pass top_p to ollama. Close #1769 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-29 14:52:27 +08:00
Kevin Hu	91f1814a87	Fix error response (#3719 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2024-11-28 18:56:10 +08:00
Kevin Hu	57208d8e53	Fix batch size issue. (#3675 ) ### What problem does this PR solve? #3657 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-27 18:06:43 +08:00
liuhua	8b35776916	Fix a bug in VolcEngine (#3658 ) ### What problem does this PR solve? Fix a bug in VolcEngine #3553 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>	2024-11-27 09:30:49 +08:00
Kevin Hu	0891a393d7	Let ThreadPool exit gracefully. (#3653 ) ### What problem does this PR solve? #3646 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-26 16:31:07 +08:00
Kevin Hu	e5af18d5ea	Update docs for v0.14.0 (#3625 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2024-11-25 11:37:56 +08:00
liwenju0	875096384b	when qwen rerank model does not return ok, raise exception to notify user (#3593 ) ### What problem does this PR solve? When calling the Qwen rerank model, if the model does not return correctly, an exception should be raised to notify the user, rather than simply returning a value of 0, as this would be confusing to the user. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-22 22:34:34 +08:00
Kevin Hu	81c7b6afc5	Make spark model more robust to model name (#3514 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-20 20:53:44 +08:00
liuhua	d42362deb6	Add api for sessions and add max_tokens for tenant_llm (#3472 ) ### What problem does this PR solve? Add api for sessions and add max_tokens for tenant_llm ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>	2024-11-19 14:51:33 +08:00
Zhichang Yu	4413683898	Introduced beartype (#3460 ) ### What problem does this PR solve? Introduced [beartype](https://github.com/beartype/beartype) for runtime type-checking. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-18 17:38:17 +08:00
shizzgar	4b3eeaa6ef	Added LocalAI support for rerank models (#3446 ) ### What problem does this PR solve? Hi there! LocalAI added support for rerank models: https://localai.io/features/reranker/ I've implemented a LocalAIRerank class (largely copied from the OpenAI_APIRerank class). Also, LocalAI responds with a 500 error code if the length of "documents" is less than 2 in the similarity check, so I've added a second "document" to the RERANK model connection check in `api/apps/llm_app.py`. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-18 12:05:52 +08:00
Jin Hai	1e90a1bf36	Move settings initialization after module init phase (#3438 ) ### What problem does this PR solve? 1. Module init no longer connects to the database. 2. Config in settings needs to be accessed as settings.CONFIG_NAME ### Type of change - [x] Refactoring Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-11-15 17:30:56 +08:00
Zhichang Yu	30f6421760	Use consistent log file names, introduced initLogger (#3403 ) ### What problem does this PR solve? Use consistent log file names, introduced initLogger ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-14 17:13:48 +08:00
shijiefengjun	632b23486f	Fix the value issue of anthropic (#3351 ) ### What problem does this PR solve? This pull request fixes the issue mentioned in https://github.com/infiniflow/ragflow/issues/3263. 1. The response should be parsed as a dict, to prevent the following code from failing to read values: ans = response["content"][0]["text"] 2. The API model `claude-instant-1.2` has been retired (see [model-deprecations](https://docs.anthropic.com/en/docs/resources/model-deprecations)) and triggers errors in the code, so I deleted it from the conf/llm_factories.json file and added the latest API model `claude-3-5-sonnet-20241022` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: chenhaodong <chenhaodong@ctrlvideo.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-13 16:13:52 +08:00
roc king	fa54cd5f5c	extract model dir from model's full name (#3368 ) ### What problem does this PR solve? When a model's group name contains 0-9, we can't find the downloaded model, because we do not correctly extract the model dir's name from the model's full name ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: 王志鹏 <zhipeng3.wang@midea.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-13 14:10:16 +08:00
Zhichang Yu	a2a5631da4	Rework logging (#3358 ) Unified all log files into one. ### What problem does this PR solve? Unified all log files into one. ### Type of change - [x] Refactoring	2024-11-12 17:35:13 +08:00
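
Commit fa54cd5f5c in the table above describes a lookup failure when a model's group name contains digits, because the model directory name was not extracted correctly from the full name. The following is only a minimal sketch of that kind of bug and fix, assuming the full name has the form `group/model`; the function names and both regular expressions are hypothetical and are not taken from the RAGFlow source.

```python
import re


def model_dir_name_letters_only(full_name: str) -> str:
    """Hypothetical buggy variant: strips a group prefix only if it is all ASCII letters."""
    return re.sub(r"^[a-zA-Z]+/", "", full_name)


def model_dir_name(full_name: str) -> str:
    """Hypothetical fixed variant: the group prefix may contain letters and digits."""
    return re.sub(r"^[a-zA-Z0-9]+/", "", full_name)


# A group name with a digit defeats the letters-only pattern, so the prefix stays.
print(model_dir_name_letters_only("group2/some-model"))
# Allowing digits in the prefix strips it as intended.
print(model_dir_name("group2/some-model"))
```

A name without any `group/` prefix passes through both helpers unchanged, since neither pattern matches.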

239 Commits
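
Commit 4f2816c01c in this history adds support for the boto3 default connection, so that credentials can come from environment variables or the AWS machine itself rather than from the database. One way such a fallback could look is sketched below, assuming explicit keys are passed only when both are present; the helper name `aws_client_kwargs` and its parameters are hypothetical, and this is not the code from that pull request.

```python
from typing import Optional


def aws_client_kwargs(region: str,
                      access_key: Optional[str] = None,
                      secret_key: Optional[str] = None) -> dict:
    """Build keyword arguments for boto3.client() (hypothetical helper).

    When no explicit key pair is given, only the region is returned, which
    leaves credential lookup to boto3's default chain (environment variables,
    shared config files, instance role).
    """
    kwargs = {"region_name": region}
    if access_key and secret_key:
        kwargs["aws_access_key_id"] = access_key
        kwargs["aws_secret_access_key"] = secret_key
    return kwargs


# No stored keys: boto3 would resolve credentials on its own.
print(aws_client_kwargs("us-east-1"))
# Stored keys: they are passed through explicitly.
print(aws_client_kwargs("us-east-1", "AKIA-EXAMPLE", "example-secret"))
```

With boto3 installed, the result would be used as `boto3.client(service_name, **aws_client_kwargs(...))`, where the service name depends on the model provider being configured.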