ragflow

AI/ragflow

mirror of https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git synced 2025-07-12 11:21:51 +08:00

Author	SHA1	Message	Date
Zhichang Yu	c813c1ff4c	Made task_executor async to speedup parsing (#5530 ) ### What problem does this PR solve? Made task_executor async to speedup parsing ### Type of change - [x] Performance Improvement	2025-03-03 18:59:49 +08:00
Debug Doctor	76cb4cd174	Feat: add 'delete' for agent's sessions api and unify apis of agent sdk (#5525 ) ### What problem does this PR solve? Add sessions deletion support for agent in http and python api ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-03-03 17:15:16 +08:00
Kevin Hu	7a81fa00e9	Optimize prompt. (#5541 ) ### What problem does this PR solve? #5526 ### Type of change - [x] Performance Improvement	2025-03-03 13:12:38 +08:00
Kevin Hu	606ed0c8ab	Fix: in case running KG repeatly. (#5538 ) ### What problem does this PR solve? #5512 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-03 12:22:36 +08:00
Felipe Hertzer	8b1a4365ed	Fix email validation regex (#5533 ) ### What problem does this PR solve? This pull request aims to fix a bug that prevents certain email addresses from signing up. The affected TLDs were returning 'invalid email address' errors: .museum .software .photography .technology .marketing .education .international .community .construction .government .consulting .... ### Type of change - [X] Bug Fix (non-breaking change which fixes an issue)	2025-03-03 10:55:10 +08:00
yihong	8a2542157f	Fix: possible memory leaks close #5277 (#5500 ) ### What problem does this PR solve? close #5277 by make sure the file close ### Type of change - [x] Performance Improvement --------- Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2025-03-03 10:26:45 +08:00
yihong	622b72db4b	Fix: add ctrl+c signal for better exit (#5469 ) ### What problem does this PR solve? This patch add signal for ctrl + c that can exit the code friendly cause code base use thread daemon can not exit friendly for being started. how to reproduce 1. docker-compose -f docker/docker-compose-base.yml up 2. other window `bash docker/launch_backend_service.sh` 3. stop 1 first 4. try to stop 2 then two thread can not exit which must use `kill pid` This patch fix it and should fix most the related issues in the `issues` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2025-02-28 14:52:40 +08:00
Kevin Hu	5fdfb8d465	Fix: rm think if stream is Flase. (#5458 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-28 10:05:18 +08:00
hy89	651422127c	Feat: Accessing Alibaba Cloud OSS with Amazon S3 SDK (#5438 ) Accessing Alibaba Cloud OSS with Amazon S3 SDK	2025-02-27 17:02:42 +08:00
Kevin Hu	4c9a3e918f	Fix: add image2text issue. (#5431 ) ### What problem does this PR solve? #5356 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-27 14:06:49 +08:00
Kevin Hu	5beb022ee1	Fix: string format error. (#5422 ) ### What problem does this PR solve? #5404 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-27 12:01:46 +08:00
Kevin Hu	afaa7144a5	Fix: issue of no id for /datasets/<dataset_id>/documents (#5420 ) ### What problem does this PR solve? #5401 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-27 10:39:34 +08:00
Kevin Hu	fa76974e24	Fix issue of `ask` API. (#5400 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 19:45:22 +08:00
so95	fefea3a2a5	Fixed OpenAI compatibility stream [DONE] (#5389 ) Fixed OpenAI compatibility stream [DONE] - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-26 17:55:12 +08:00
Yongteng Lei	0e920a91dd	FIX: correct typo (#5387 ) ### What problem does this PR solve? Correct typo in supported_models file ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 17:21:09 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Read to add VLMM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
k	5cab6c4ccb	Fix:HTTP API -> Stop parsing documents(AttributeError: ‘list‘ object … (#5375 ) …has no attribute ‘id‘) ### What problem does this PR solve? No PR ![image](https://github.com/user-attachments/assets/988d31bc-6551-4bb8-846c-cbbc1883d804) ![image](https://github.com/user-attachments/assets/8b09681b-1239-4ed9-8bc3-11436c5e90bc) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-02-26 15:57:50 +08:00
Yongteng Lei	b3b341173f	DOCS: add OpenAI-compatible http and python api reference (#5374 ) ### What problem does this PR solve? Add OpenAI-compatible http and python api reference ### Type of change - [x] Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com> Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com>	2025-02-26 15:52:26 +08:00
liwenju0	a9e4695b74	Fix：validate knowledge base association before document upload (#5373 ) ### What problem does this PR solve? fix this bug: https://github.com/infiniflow/ragflow/issues/5368 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-02-26 15:47:34 +08:00
Kevin Hu	4f40f685d9	Code refactor (#5371 ) ### What problem does this PR solve? #5173 ### Type of change - [x] Refactoring	2025-02-26 15:40:52 +08:00
Yongteng Lei	5c6a7cb4b8	Added OpenAI-like completion api (#5351 ) ### What problem does this PR solve? Added OpenAI-like completion api, related to #4672, #4705 This function allows users to interact with a model to get responses based on a series of messages. If `stream` is set to True, the response will be streamed in chunks, mimicking the OpenAI-style API. #### Example usage: ```bash curl -X POST https://ragflow_address.com/api/v1/chats_openai/<chat_id>/chat/completions \ -H "Content-Type: application/json" \ -H "Authorization: Bearer $RAGFLOW_API_KEY" \ -d '{ "model": "model", "messages": [{"role": "user", "content": "Say this is a test!"}], "stream": true }' ``` Alternatively, you can use Python's `OpenAI` client: ```python from openai import OpenAI model = "model" client = OpenAI(api_key="ragflow-api-key", base_url=f"http://ragflow_address/api/v1/chats_openai/<chat_id>") completion = client.chat.completions.create( model=model, messages=[ {"role": "system", "content": "You are a helpful assistant."}, {"role": "user", "content": "Who you are?"}, {"role": "assistant", "content": "I am an AI assistant named..."}, {"role": "user", "content": "Can you tell me how to install neovim"}, ], stream=True ) stream = True if stream: for chunk in completion: print(chunk) else: print(completion.choices[0].message.content) ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Related Issues Related to #4672, #4705	2025-02-26 11:37:29 +08:00
Kevin Hu	4e2afcd3b8	Fix FlagRerank max_length issue. (#5366 ) ### What problem does this PR solve? #5352 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 11:01:13 +08:00
Zhenglin Dong	11e6d84d46	Fix: 'Chunk not found!' error in team-sharing knowledge base. (#5361 ) ### What problem does this PR solve? As issue #3268 mentioned, "Chun not found!" exception will occur, especially during the teamwork of knowledge bases. ### The reason of this bug "tenants" are the people on current_user's team, including the team owner itself. The old one only checks the first "tenant", tenants[0], which will cause error when anyone editing the chunk that is not in tenants[0]'s knowledge base. My modification won't introduce new errors while iterate all the tenant then retrieve knowledge bases of each. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 10:24:35 +08:00
Kevin Hu	53b9e7b52f	Add tavily as web searh tool. (#5349 ) ### What problem does this PR solve? #5198 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 10:21:04 +08:00
Kevin Hu	b3d579e2c1	Refine prompt of agentic search. (#5312 ) ### What problem does this PR solve? #5173 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-25 09:21:52 +08:00
Kevin Hu	9aa222f738	Let list_chat go without kb checking. (#5280 ) ### What problem does this PR solve? #5278 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-24 13:21:05 +08:00
Kevin Hu	605cfdb8dc	Refine error message for re-rank model. (#5278 ) ### What problem does this PR solve? #5261 ### Type of change - [x] Refactoring	2025-02-24 13:01:34 +08:00
Omar Leonardo Sanchez Granados	a0b461a18e	Add configuration to choose default llm models (#5245 ) ### What problem does this PR solve? This pull request includes changes to the `api/settings.py` and `docker/service_conf.yaml.template` files to add support for default models in the LLM configuration (specially for LIGHTEN builds). The most important changes include adding default model configurations and updating the initialization settings to use these defaults. For example: With this configuration Bedrock will be enable by default with claude and titan embeddings. ``` user_default_llm: factory: 'Bedrock' api_key: '{}' base_url: '' default_models: chat_model: 'anthropic.claude-3-5-sonnet-20240620-v1:0' embedding_model: 'amazon.titan-embed-text-v2:0' rerank_model: '' asr_model: '' image2text_model: '' ``` ### Type of change - [X] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:13:39 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
Kevin Hu	ef8847eda7	Double check error of adding llm. (#5237 ) ### What problem does this PR solve? #5227 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 19:09:49 +08:00
Kevin Hu	3444cb15e3	Refine search query. (#5235 ) ### What problem does this PR solve? #5173 #5214 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 18:32:32 +08:00
Kevin Hu	f5d63bb7df	Support chat solo. (#5218 ) ### What problem does this PR solve? #5216 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-21 12:24:02 +08:00
Kevin Hu	7b3d700d5f	Apply agentic searching. (#5196 ) ### What problem does this PR solve? #5173 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-20 17:41:01 +08:00
liwenju0	f298e55ded	Fix: Normalize embedding model ID comparison across datasets (#5169 ) Modify embedding model ID comparison to remove vendor suffixes, ensuring consistent model identification when working with multiple knowledge bases. This change affects dialog creation, chat operations, and document retrieval test functions. ### What problem does this PR solve? resolve this bug: https://github.com/infiniflow/ragflow/issues/5166 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-02-20 12:40:59 +08:00
liwenju0	3ced290eb5	Feat: Add support for document meta fields update through api (#5120 ) ### What problem does this PR solve? add support for update document meta data through api ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: wenju.li <wenju.li@deepctr.cn> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-19 13:39:31 +08:00
petertc	8525f55ad0	Fix: Option ineffective in Chat API (#5118 ) ### What problem does this PR solve? API options like `stream` was ignored when no session_id was provided. This PR fixes the issue. Test command and expected result: ``` curl --request POST \ --url http://:9222/api/v1/chats/2f2e1d30ee6111efafe211749b004925/completions \ --header 'Content-Type: application/json' \ --header 'Authorization: Bearer ragflow-xxx' \ --data '{ "question":"Who are you", "stream":false }' {"code":0,"data":"data:{\"code\": 0, \"message\": \"\", \"data\": {\"answer\": \"Hi! I'm your assistant, what can I do for you?\", \"reference\": {}, \"audio_binary\": null, \"id\": null, \"session_id\": \"82ceb0fcee7111efafe211749b004925\"}}\n\n"} ``` ### Type of change - [*] Bug Fix (non-breaking change which fixes an issue)	2025-02-19 13:18:51 +08:00
zhxlp	00c7ddbc9b	Fix: The max tokens defined by the tenant are not used (#4297 ) (#2817 ) (#5066 ) ### What problem does this PR solve? Fix: The max tokens defined by the tenant are not used (#4297) (#2817) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-18 13:42:22 +08:00
Kevin Hu	84b4b38cbb	Remove <think> for exeSql component. (#5069 ) ### What problem does this PR solve? #5061 #5067 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-18 13:39:37 +08:00
flygithub	409310aae9	Update agent session API, to support uploading files while create a new session (#5039 ) ### What problem does this PR solve? Update the agent session API "POST /api/v1/agents/{agent_id}/sessions", to support uploading files while create a new session: - currently, the API only supports requesting with a json body. If user wants to upload a doc or image when create session, like what is already supported on the web client, we need to update the API. - if upload an image, ragflow will call image2text, and a user_id is needed for the image2text model. So we need to send user_id in the API request. As form-data is needed to upload files, not json body, seems we need to put the user_id in the url as an optional parameter (currently user_id is an optional in json body). ### Type of change - [x] Documentation Update - [x] Other (please describe):	2025-02-18 09:45:40 +08:00
Kevin Hu	9ff825f39d	Ignore exceptions when no index ahead. (#5047 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-18 09:09:22 +08:00
hy89	7b5d831296	Fix: Starting the source code on Windows, the 'HTTP API' returns 404 (#5042 ) Fix: When starting the backend service from source code on Windows, the "HTTP API" no longer returns 404.	2025-02-17 19:33:49 +08:00
Kevin Hu	e4096fbc33	Add another decrypt function. (#5043 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-17 18:09:11 +08:00
Kevin Hu	3aa5c2a699	Ignore exception of empty index. (#5030 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-02-17 15:59:55 +08:00
kuschzzp	88daa349f9	Optimize conversation when uploading attachments (#4964 ) ### What problem does this PR solve? #4929 ### Type of change - [x] Performance Improvement	2025-02-17 12:03:04 +08:00
zhxlp	194e8ea696	Fix knowledge graph node not found (#4968 ) (#4970 ) ### What problem does this PR solve? Fix knowledge graph node not found (#4968) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-17 11:49:27 +08:00
Kevin Hu	810f997276	Fix <think> in keywords or question auto-generations. (#5021 ) ### What problem does this PR solve? #4983 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-17 11:20:57 +08:00
Kevin Hu	849d9eb463	Ignore tenant not found error while increasing token usage. (#4950 ) ### What problem does this PR solve? #4940 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-14 11:10:49 +08:00
Peterson Alves	042f4c90c6	Fixes KeyError: 'content' when using stream=False (#4944 ) ### 🛠 Fixes `KeyError: 'content'` when using `stream=False` #### 🔍 Problem When calling the chat API with `stream=False`, the code attempts to access `msg[-1]["content"]` without verifying if the key exists. This causes a `KeyError` when the message structure does not contain `"content"`. This issue was discussed in [#4885](https://github.com/infiniflow/ragflow/issues/4885), where we analyzed the root cause. The error does not occur with `stream=True`, as the response is processed differently. #### ✅ Solution - Logging Fix: - Before accessing `msg[-1]["content"]`, we check if the key exists. - If it does not exist, a default value (`"[content not available]"`) is used to prevent errors. - Structural Fix in `msg` Construction: - Ensured that every message in `msg` contains the `"content"` key, even if empty. - This fixes the issue at its root and ensures consistent behavior between `stream=True` and `stream=False`. #### 🔄 Impact - Prevents the `KeyError` without affecting normal application flow. - Ensures the integrity of the `msg` structure, avoiding future failures. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-14 10:27:01 +08:00
Kevin Hu	78982d88e0	Reformat error message. (#4829 ) ### What problem does this PR solve? #4828 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-10 16:47:53 +08:00
Kevin Hu	6fa34d5532	Fix KG circle. (#4823 ) ### What problem does this PR solve? #4760 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-10 11:02:29 +08:00

1 2 3 4 5 ...

661 Commits