ragflow

AI/ragflow

mirror of https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git synced 2025-04-19 12:39:59 +08:00

Author	SHA1	Message	Date
balibabu	3af1063737	Feat: Set the default value of Chunk token number to 512 #6016 (#6017 ) ### What problem does this PR solve? Feat: Set the default value of Chunk token number to 512 #6016 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-13 14:51:55 +08:00
writinwaters	9c8060f619	0.17.1 release notes (#6021 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-13 14:43:24 +08:00
Zhichang Yu	e213873852	Optimize graphrag cache get entity (#6018 ) ### What problem does this PR solve? Optimize graphrag cache get entity ### Type of change - [x] Performance Improvement	2025-03-13 14:37:59 +08:00
liu an	56acb340d2	Test: update test cases per issue #5920 #5923 (#6007 ) ### What problem does this PR solve? update test cases per issue #5920 #5923 ### Type of change - [x] update test case	2025-03-13 10:53:07 +08:00
Kevin Hu	e05cdc2f9c	Fix: encode detect error. (#6006 ) ### What problem does this PR solve? #5967 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-13 10:47:58 +08:00
Kevin Hu	3571270191	Refa: refine the context window size warning. (#5993 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-03-12 19:40:54 +08:00
liu an	bd5eb47441	TEST: Added test cases for Upload Documents HTTP API (#5991 ) ### What problem does this PR solve? cover upload docments endpoints ### Type of change - [x] add test cases	2025-03-12 19:38:52 +08:00
Yongteng Lei	7cd37c37cd	Feat: add CSV file parsing support (#5989 ) ### What problem does this PR solve? Add CSV file parsing support #4552, #5849, #5870 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-12 19:20:50 +08:00
Kevin Hu	d660f6b9a5	Feat: add use KG to retrieval component. (#5988 ) ### What problem does this PR solve? #5973 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-12 19:10:07 +08:00
balibabu	80389ae61e	Feat: Alter Item to TransferListItemType #3221 (#5986 ) ### What problem does this PR solve? Feat: Alter Item to TransferListItemType #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-12 18:54:41 +08:00
kuro5989	6e13922bdc	Feat: Add qwq model support to Tongyi-Qianwen factory (#5981 ) ### What problem does this PR solve? add qwq model support to Tongyi-Qianwen factory https://github.com/infiniflow/ragflow/issues/5869 ### Type of change - [x] New Feature (non-breaking change which adds functionality) ![image](https://github.com/user-attachments/assets/49f5c6a0-ecaf-41dd-a23a-2009f854d62c) ![image](https://github.com/user-attachments/assets/93ffa303-920e-4942-8188-bcd6b7209204) ![1741774779438](https://github.com/user-attachments/assets/25f2fd1d-8640-4df0-9a08-78ee9daaa8fe) ![image](https://github.com/user-attachments/assets/4763cf6c-1f76-43c4-80ee-74dfd666a184) Co-authored-by: zhaozhicheng <zhicheng.zhao@fastonetech.com>	2025-03-12 18:54:15 +08:00
balibabu	c57f16d16f	Feat: Why can't Retrieval component support internet web search. #5973 (#5978 ) ### What problem does this PR solve? Feat: Why can't Retrieval component support internet web search. #5973 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-12 18:47:22 +08:00
so95	3c43a7aee8	For an Agent with an Input Begin value, on the first call the return … (#5957 ) …session_id does not exist in the session For an Agent with an Input Begin value, on the first call the return session_id does not exist in the session ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-12 17:01:44 +08:00
Kevin Hu	dd8779b257	Feat: `Retrieval` supports internet search. (#5974 ) ### What problem does this PR solve? #5973 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-12 16:51:01 +08:00
liu an	46bdfb9661	TEST: Remove unstable assertion introduced in PR #5924 (#5968 ) ### What problem does this PR solve? Remove unstable assertion introduced in PR #5924 ### Type of change - [x] update test cases	2025-03-12 16:09:45 +08:00
liwenju0	e3ea4b7ec2	Fix: Add Knowledge Base Document Parsing Status Check (#5966 ) When creating and updating chats, add a check for the parsing status of knowledge base documents. Ensure that all documents have been parsed before allowing chat creation to improve user experience and system stability. Main Changes: - Add document parsing status check logic in `chat.py`. - Implement the `is_parsed_done` method in `knowledgebase_service.py`. - Prevent chat creation when documents are being parsed or parsing has failed. ### What problem does this PR solve? fix this bug：https://github.com/infiniflow/ragflow/issues/5960 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-03-12 16:07:45 +08:00
writinwaters	41c67ce8dd	Fixed a Docusaurus display issue. (#5969 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-12 16:07:22 +08:00
liwenju0	870a6e93da	Refactoring: Optimization of the Deep Research Module Code Structure (#5959 ) This commit refactors the deep research module (deep_research.py), with the following major improvements: The complex thinking and retrieval logic has been broken down into multiple independent private methods, enhancing code readability and maintainability. Static methods and class methods have been introduced to simplify the logic for tag processing. The search and reasoning processes have been optimized, increasing the modularity of the code. The flexibility of information retrieval and processing has been improved. The refactored code structure is now clearer, making it easier to understand and extend the functionality of the deep research module. ### What problem does this PR solve? increase the modularity of the code ### Type of change - [x] Refactoring Co-authored-by: wenju.li <wenju.li@deepctr.cn>	2025-03-12 15:34:52 +08:00
Kevin Hu	80f87913bb	Fix: empty value updating. (#5949 ) ### What problem does this PR solve? #5920 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-12 11:25:17 +08:00
Kevin Hu	45123dcc0a	Fix: ollama model add error. (#5947 ) ### What problem does this PR solve? #5944 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-12 10:56:05 +08:00
Raghav Patidar	49d560583f	Fix: HTTP API Updates Read-Only Dataset Fields During Modification #5923 (#5937 ) ### What problem does this PR solve? Fixes #5923 Fixes the readonly variables from payload at /datasets/<dataset_id> _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ Now if user tries to modify readonly values then it will show " The input parameters are invalid. " invalid_keys = {"id", "embd_id", "chunk_num", "doc_num", "parser_id", "create_date", "create_time", "created_by", "status","token_num","update_date","update_time"} if any(key in req for key in invalid_keys): return get_error_data_result(message="The input parameters are invalid.") i have include those readonly keys in invalid_keys ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Raghav <2020csb1115@iitrpr.ac.in>	2025-03-12 10:27:02 +08:00
donblack01	1c663b32b9	Fix:signal.SIGUSR1 and signal.SIGUSR2 can't use in window. so don't bind signal.SIGUSR1 and signal.SIGUSR2 in the windows env (#5941 ) ### What problem does this PR solve? Fix:signal.SIGUSR1 and signal.SIGUSR2 can't use in window. so don't bind signal.SIGUSR1 and signal.SIGUSR2 in the windows env ### Type of change - [✓ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): Co-authored-by: tangyu <1@1.com>	2025-03-12 09:43:18 +08:00
Kevin Hu	caecaa7562	Feat: apply LLM to optimize citations. (#5935 ) ### What problem does this PR solve? #5905 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-11 19:56:21 +08:00
任奇	ed11be23bf	Fix: When calling the Create chat completion API, the response data… (#5928 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: renqi <renqi08266@fxomail.com>	2025-03-11 19:56:07 +08:00
balibabu	7bd5a52019	Feat: Add Breadcrumb component #3221 (#5929 ) ### What problem does this PR solve? Feat: Add Breadcrumb component #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-11 18:55:25 +08:00
liu an	87763ef0a0	TEST: Added test cases for Update Dataset HTTP API (#5924 ) ### What problem does this PR solve? cover dataset update endpoints ### Type of change - [x] Add test cases	2025-03-11 18:55:11 +08:00
Zhichang Yu	939e668096	Optimized graphrag again (#5927 ) ### What problem does this PR solve? Optimized graphrag again ### Type of change - [x] Performance Improvement	2025-03-11 18:36:10 +08:00
Kevin Hu	45318e7575	Docs: updates. (#5921 ) ### What problem does this PR solve? ### Type of change - [x] Other (please describe):	2025-03-11 16:43:50 +08:00
Philipp Rien	8250b9f6b0	Feat: Add german translations (#5866 ) ### What problem does this PR solve? Add Support for german language ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-11 16:13:58 +08:00
Kevin Hu	1abf03351d	Docs: reformat. (#5914 ) ### What problem does this PR solve? ### Type of change - [x] Other (please describe):	2025-03-11 16:11:27 +08:00
writinwaters	46b95d5cfe	Reverted some of the version changes (#5908 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-11 16:03:11 +08:00
Kevin Hu	59ba4777ee	Docs: updates issue templates. (#5913 ) ### What problem does this PR solve? ### Type of change - [x] Other (please describe):	2025-03-11 16:02:28 +08:00
Kevin Hu	d44739283c	Docs: prepare docs for release v0.17.1 (#5900 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update v0.17.1	2025-03-11 14:39:41 +08:00
writinwaters	9c953a67a6	UI updates (#5899 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-11 14:14:37 +08:00
writinwaters	bd3fa317e7	Add docs for tag sets (#5890 ) ### What problem does this PR solve? #5716, #5529 ### Type of change - [x] Documentation Update	2025-03-11 13:57:36 +08:00
liu an	715e2b48ca	Test: Update test cases per PR #5748 #5878 (#5894 ) ### What problem does this PR solve? update test cases per PR #5748 #5878 issue #5709 ### Type of change - [x] update test cases	2025-03-11 13:35:28 +08:00
Kevin Hu	90d18143ba	Refa: add prompt to empty retrieved answwer. (#5892 ) ### What problem does this PR solve? #5883 ### Type of change - [x] Refactoring	2025-03-11 13:11:14 +08:00
Kevin Hu	4b6809b32d	Fix: docs updates. (#5889 ) ### What problem does this PR solve? #5852 ### Type of change - [x] Documentation Update	2025-03-11 11:55:39 +08:00
Kevin Hu	7b96146d3f	Fix: check `desc` parameter value. (#5884 ) ### What problem does this PR solve? #5851 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-11 11:43:21 +08:00
liu an	21c55a2e0f	Test: Update test cases per PR #5778 (#5880 ) ### What problem does this PR solve? update test cases per PR https://github.com/infiniflow/ragflow/pull/5778 ### Type of change - [x] update test cases	2025-03-11 11:07:09 +08:00
Kevin Hu	8e965040ce	Fix: rm <think> for ES sql generation. (#5881 ) ### What problem does this PR solve? #5850 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-11 10:41:19 +08:00
Kevin Hu	780ee2b2be	Fix: empty dataset parser id. (#5878 ) ### What problem does this PR solve? #5709 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-11 10:23:08 +08:00
Raghav Patidar	6f9cd96ec5	Fix: dataset_ids parameter (#5864 ) ### What problem does this PR solve? Fixed #5839 This PR fix error code 102, stating dataset_ids is required. curl --request POST \ --url http://{address}/api/v1/chats \ --header 'Content-Type: application/json' \ --header 'Authorization: Bearer <YOUR_API_KEY>' \ --data '{ "name": "test_chat" }' this is not getting datasetids , fix for it. file location : sdk\python\ragflow_sdk\ragflow.py added : "dataset_ids": dataset_list if dataset_list else [], ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Raghav <2020csb1115@iitrpr.ac.in>	2025-03-11 09:44:06 +08:00
liu an	47e244ee9f	Test: Update test cases per PR #5755 (#5857 ) ### What problem does this PR solve? Update test cases per PR #5755 ### Type of change - [x] update test cases	2025-03-10 19:04:39 +08:00
balibabu	df11fe75d3	Feat: Add AvatarGroup component. #3221 (#5858 ) ### What problem does this PR solve? Feat: Add AvatarGroup component. #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-03-10 19:03:48 +08:00
zhangcdian	bf0d516e49	Agent Update: Fix Role Issue and Enhance KB Search (#5842 ) ### What problem does this PR solve? generate.py 更新：问题：部分模型提供商对输入对话内容的格式有严格校验，要求第一条内容的 role 不能为 assistant，否则会报错。解决：删除了系统设置的 agent 开场白，确保传递给模型的对话内容中，第一条内容的 role 不为 assistant。 retrieval.py 更新：问题：当前知识库检索使用全部对话内容作为输入，可能导致检索结果不准确。解决：改为仅使用用户最后提出的一个问题进行知识库检索，提高检索的准确性。 Update generate.py: Issue: Some model providers have strict validation rules for the format of input conversation content, requiring that the role of the first content must not be assistant. Otherwise, an error will occur. Solution: Removed the system-set agent opening statement to ensure that the role of the first content in the conversation passed to the model is not assistant. Update retrieval.py: Issue: The current knowledge base retrieval uses the entire conversation content as input, which may lead to inaccurate retrieval results. Solution: Changed the retrieval logic to use only the last question asked by the user for knowledge base retrieval, improving retrieval accuracy. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Performance Improvement	2025-03-10 18:29:58 +08:00
liu an	b18da35da6	TEST: Added test cases for List Dataset HTTP API (#5856 ) ### What problem does this PR solve? cover dataset list endpoints ### Type of change - [x] Add test cases	2025-03-10 18:29:33 +08:00
hy89	8ba1e6c183	Feat: add `sync_dsl` parameter to support synchronizing modifications to existing sessions (#5843 ) When accessing the /api/v1/agents/{agent_id}/completions API, sessions created before agent modifications retain the old DSL data. To use the latest agent configuration (like new prompts) in historical sessions, I added the sync_dsl parameter. It defaults to False to maintain existing behavior and only synchronizes when set to True. If needed, a manual synchronization API can be created to trigger the sync explicitly.	2025-03-10 17:46:08 +08:00
balibabu	d4f84f0b54	Fix: keyword compont display issue #5794 (#5844 ) ### What problem does this PR solve? Fix: keyword compont display issue #5794 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-10 16:15:44 +08:00
Zhichang Yu	6ec6ca6971	Refactor graphrag to remove redis lock (#5828 ) ### What problem does this PR solve? Refactor graphrag to remove redis lock ### Type of change - [x] Refactoring	2025-03-10 15:15:06 +08:00

... 5 6 7 8 9 ...

2804 Commits