ragflow

AI/ragflow

mirror of https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git synced 2025-08-12 14:28:59 +08:00

Author	SHA1	Message	Date
balibabu	e97fd2b5e6	Feat: Add InnerBlurInput component to avoid frequent updates of zustand causing the input box to lose focus #3221 (#7955 ) ### What problem does this PR solve? Feat: Add InnerBlurInput component to avoid frequent updates of zustand causing the input box to lose focus #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-29 19:52:56 +08:00
Yongteng Lei	49ff1ca934	Fix: code debug (#7949 ) ### What problem does this PR solve? Fix code component debug issue. #7908. I delete the additions in #7933, there is no semantic meaning `output` for `parameters`. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-29 16:53:27 +08:00
Yongteng Lei	46963ab1ca	Fix: add advanced delimiter detection for naive merge (#7941 ) ### What problem does this PR solve? Add advanced delimiter detection for naive merge. #7824 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-05-29 16:17:22 +08:00
giiiiiithub	6ba5a4348a	set PARALLEL_DEVICES default value= 0 (#7935 ) ### What problem does this PR solve? it would be fail if PARALLEL_DEVICES = None in OCR class , because it pass 0 to TextDetector and TextRecognizer init method. and It would be simpler to set 0 as the default value for PARALLEL_DEVICES. ### Type of change - [x] Refactoring	2025-05-29 13:32:16 +08:00
天海蒼灆	f584f5c3d0	agents openai API add new way to get session_id (#7937 ) ### What problem does this PR solve? SpringAI can only add session_id in metadata。so add new way to get session_id from "id" or "metadata.id" ![image](https://github.com/user-attachments/assets/0c698ebb-2228-46d8-94c5-2a291b6f70bf) ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-29 13:31:17 +08:00
Stephen Hu	a0f76b7a4d	Fix: add default output method for ComponentParamBase (#7933 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/7908 For the code ` _, out = cpn.output(allow_partial=False)` ` def output(self, allow_partial=True) -> Tuple[str, Union[pd.DataFrame, partial]]: o = getattr(self._param, self._param.output_var_name)` need to call this method But I do not have a full context. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-29 11:50:01 +08:00
balibabu	3f695a542c	Feat: Use memo to wrap canvas nodes to improve fluency #3221 (#7929 ) ### What problem does this PR solve? Feat: Use memo to wrap canvas nodes to improve fluency #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-29 11:10:45 +08:00
Jason Li	64f930b1c5	Truncate long agent descriptions text (#7924 ) Truncate long agent descriptions to prevent overflow outside the agent card container ### What problem does this PR solve? Now the Long text of description will overflow from the agent card, should display the long text properly with truncate. <img width="275" alt="Screenshot 2025-05-28 220329" src="https://github.com/user-attachments/assets/954b3a48-bcab-4669-a42f-6981d4bf859f" /> <img width="275" alt="Screenshot 2025-05-28 220353" src="https://github.com/user-attachments/assets/f385d95a-3e40-4117-b412-ae6a4508e646" /> ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-05-29 11:10:02 +08:00
balibabu	81b306aac9	Feat:: Use useWatch to synchronize the form data to canvas zustand #3221 (#7926 ) ### What problem does this PR solve? Feat:: Use useWatch to synchronize the form data to canvas zustand #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-29 10:18:52 +08:00
Yongteng Lei	0c562f0a9f	Refa: change citation mark as [ID:n] (#7923 ) ### What problem does this PR solve? Change citation mark as [ID:n], it's easier for LLMs to follow the instruction :) #7904 ### Type of change - [x] Refactoring	2025-05-29 10:03:51 +08:00
balibabu	7c098f9fd1	Fix: Display bug in the early stage of conversation chat #7904 (#7922 ) ### What problem does this PR solve? Fix: Display bug in the early stage of conversation chat #7904 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-28 19:42:56 +08:00
Yongteng Lei	b95747be4c	Fix: early return when update doc in sdk (#7907 ) ### What problem does this PR solve? Fix early return when update doc. #7886 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-28 19:20:27 +08:00
TeslaZY	1239f5afc8	Fix: bad escape \P at position 374 (line 18, column 23) when using th… (#7909 ) …e graph feature (#1727) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-28 19:16:31 +08:00
sinopec	243ed4bc35	Feat: Surpport dynamically add knowledge basees for retrieval while u… (#7915 ) …sing the SDK chat API ### What problem does this PR solve? When using the SDK for chat, you can include the IDs of additional knowledge bases you want to use in the request. This way, you don’t need to repeatedly create new assistants to support various combinations of knowledge bases. This is especially useful when there are many knowledge bases with different content. If users clearly know which knowledge base contains the information they need and select accordingly, the recall accuracy will be greatly improved. Users only need to add an extra field, a kb_ids array, in the HTTP request. The content of this field can be determined by the client fetching the list of knowledge bases and letting the user select from it. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Li Ye <liye@unittec.com>	2025-05-28 19:16:16 +08:00
天海蒼灆	47d40806a4	doc related_question path changed (#7918 ) conversation change to sessions ### What problem does this PR solve? related_question interface has wrong uri in HTTP API doc ### Type of change - [x] Documentation Update	2025-05-28 18:36:42 +08:00
Kevin Hu	91df073653	Docs: about latest updates (#7902 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update --------- Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com>	2025-05-28 18:31:50 +08:00
liu an	20ab6aad4a	Fix: patch SSTI vulnerability in template rendering (#7905 ) ### What problem does this PR solve? [[Critical] RagFlow has a SSTI, which can lead to Remote Code Execution (RCE).](https://github.com/infiniflow/ragflow/security/advisories/GHSA-mrf5-7w8r-8x88#event-463508) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-28 11:47:22 +08:00
Stephen Hu	a71376ad6a	Fix: KeyError: 'method' when build run_graphrag (#7899 ) ### What problem does this PR solve? Close #7879 I checked the current master code, the kb_parser_config is join from knowledge table, so I think should be some edge cases due to history data ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-28 11:46:41 +08:00
Qidi Cao	4d835b7303	fix: resolve “has no attribute 'max_length'” error in keyword_extraction (#7903 ) ### What problem does this PR solve? Issue Description: When using the `/api/retrieval` endpoint with a POST request and setting the `keyword` parameter to `true`, the system invokes the `model_instance` method from `TenantLLMService` to create a `chat_mdl` instance. Subsequently, it calls the `keyword_extraction` method to extract keywords. However, within the `keyword_extraction` method, the `chat` function of the LLM attempts to access the `chat_mdl.max_length` attribute to validate input length. This results in the following error: ``` AttributeError: 'SILICONFLOWChat' object has no attribute 'max_length' ``` Proposed Solution: Upon reviewing other parts of the codebase where `chat_mdl` instances are created, it appears that utilizing `LLMBundle` for instantiation is more appropriate. `LLMBundle` includes the `max_length` attribute, which should resolve the encountered error. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-05-28 10:58:06 +08:00
Xiaodong Li	b922dd06a5	Update README.md (#7864 ) ### What problem does this PR solve? add DeepWiki Badge Maker ### Type of change - [x] Other (please describe):add DeepWiki Badge Maker --------- Co-authored-by: lixiaodong11 <lixiaodong11@hikvision.com.cn>	2025-05-28 09:29:33 +08:00
balibabu	84f5ae20be	Feat: Add the SelectWithSearch component #3221 (#7892 ) ### What problem does this PR solve? Feat: Add the SelectWithSearch component #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-28 09:22:09 +08:00
Stephen Hu	273f36cc54	Perf: reduce upload to minio limiter scope (#7878 ) ### What problem does this PR solve? reduce upload_to_minio limter scope ### Type of change - [x] Performance Improvement	2025-05-27 17:49:37 +08:00
Kevin Hu	28cb4df127	Fix: raptor overloading (#7889 ) ### What problem does this PR solve? #7840 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-27 17:41:35 +08:00
Sol	bc578e1e83	Removed the "USER:" in the question, reducing the accuracy of the search (#7852 ) ### What problem does this PR solve? ![85784793b445e081ea1c7524b568123f](https://github.com/user-attachments/assets/88748407-ea3d-445a-9dae-8f02cfdf78f3) ![77e59b94b621b3b6fdda654104f01d1a](https://github.com/user-attachments/assets/6531c691-a625-48c4-b05f-c64f8acd7c28) ![73e91d72114b905cfa39e804cd3240a3](https://github.com/user-attachments/assets/eb9d0bb2-4aac-40d8-8444-cdcbc0835568) ![45c8a52ecf5e1603354c4d0a814ecf06](https://github.com/user-attachments/assets/d56162a4-8168-4e7f-a113-17ec258b9539) user will be used as a common keyword to participate in the search, which may lead to the recall of irrelevant content and reduce the search accuracy. If user appears frequently in your knowledge base, it may affect relevance sorting and even recall some irrelevant FAQs or documents. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [x] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-05-27 16:58:18 +08:00
liu an	ff0e82988f	Fix: patch regex vulnerability in filename handling (#7887 ) ### What problem does this PR solve? [Regular Expression Injection leading to Denial of Service (ReDoS)](https://github.com/infiniflow/ragflow/security/advisories/GHSA-wqq6-x8g9-f7mh) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-27 16:35:37 +08:00
writinwaters	13528ec328	Docs: From v0.13.0 onwards, markdown chunking is added to the General chunking method. (#7883 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-05-27 16:33:14 +08:00
balibabu	590070e47d	Feat: Put buildSelectOptions to common-util.ts #3221 (#7875 ) ### What problem does this PR solve? Feat: Put buildSelectOptions to common-util.ts #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-27 11:44:54 +08:00
Kevin Hu	959793e83c	Fix: task limiter issue. (#7873 ) ### What problem does this PR solve? #7869 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-27 11:16:29 +08:00
He Wang	aaefc3f44c	update xgboost and dep scripts for local build on MacOS (#7857 ) ### What problem does this PR solve? There are two main changes: 1. Update xgboost to 1.6.0 to build the project on MacOS with Apple chips, this change refers to the issue: https://github.com/infiniflow/ragflow/issues/5114. 2. When `use_china_mirrors` is set in `download_deps.py`, the names of chrome files downloaded by the script will be different from the file names used in Dockerfile, so I added the file name in `get_urls` function to solve this problem. I think it's better to add testing for Docker image `infiniflow/ragflow_deps` to the test workflow, but since the workflow is currently running on a self-hosted runner, I'm not sure how to modify it. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-05-27 09:28:52 +08:00
balibabu	48294e624c	Feat: Add the WaitingDialogue operator. #3221 (#7862 ) ### What problem does this PR solve? Feat: Add the WaitingDialogue operator. #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-26 19:36:49 +08:00
writinwaters	add4b13856	Docs: Miscellaneous editorial updates (#7865 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-05-26 19:36:35 +08:00
pyyuhao	5d6bf2224a	Fix: Opensearch chunk management (#7802 ) ### What problem does this PR solve? This PR solve the problems metioned in the pr(https://github.com/infiniflow/ragflow/pull/7140) which is also submitted by me ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): ### Introduction I fixed the problems when using OpenSearch as the DOC_ENGINE, the failures of pytest and the wrong API's return. Mainly about delete chunk, list chunks, update chunk, retrieval chunk. The pytest comand "cd sdk/python && uv sync --python 3.10 --group test --frozen && source .venv/bin/activate && cd test/test_http_api && DOC_ENGINE=opensearch pytest test_chunk_management_within_dataset -s --tb=short " is finally successful. ###Others As some changes between Elasticsearch And Opensearch differ, some pytest results about OpenSearch are correct and resonable. However, some pytest params (skipif params) are incompatible. So I changed some pytest params about skipif. As a search engine programmer, I will still focus on the usage of vector databases (especially OpenSearch) for the RAG stuff. Thanks for your review	2025-05-26 16:57:58 +08:00
balibabu	c09bd9fe4a	Feat: Convert the data of the messge operator to a string array #3221 (#7853 ) ### What problem does this PR solve? Feat: Convert the data of the messge operator to a string array #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-26 16:56:50 +08:00
LUO huan huan	c7db0eaca6	Optimize Tag Removal Method (#7847 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-05-26 16:56:33 +08:00
balibabu	78fa37f8ae	Feat: Upgrade react-hook-form to the latest version to solve the problem that appending a useFieldArray entry cannot trigger the watch callback function #3221 (#7849 ) ### What problem does this PR solve? Feat: Upgrade react-hook-form to the latest version to solve the problem that appending a useFieldArray entry cannot trigger the watch callback function #3221 [issue: watch is not called when appending first item to Field Array #12370](https://github.com/react-hook-form/react-hook-form/issues/12370) ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-26 12:21:19 +08:00
Kevin Hu	be83074131	Fix: restore task limiter. (#7844 ) ### What problem does this PR solve? Close #7828 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-26 10:59:01 +08:00
writinwaters	1f756947da	Docs: Added code component reference (#7821 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-05-26 10:39:51 +08:00
Stephen Hu	ae171956e8	Fix:Setting the message_history_window_size to 0 does not take effect (#7842 ) ### What problem does this PR solve? Close #7830 The caller method should already have code to handle this. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-26 10:28:46 +08:00
Kevin Hu	1f32e6e4f4	Fix: list out of boundary (#7843 ) ### What problem does this PR solve? Close #7837 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-26 10:28:36 +08:00
Hao Zhang	2f4d803db1	Delete Corresponding Minio Bucket When Deleting a Knowledge Base (#7841 ) ### What problem does this PR solve? Delete Corresponding Minio Bucket When Deleting a Knowledge Base [issue #4113 ](https://github.com/infiniflow/ragflow/issues/4113) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-05-26 10:02:51 +08:00
Yongteng Lei	552023ee4b	Fix: catch non-begin component output (#7827 ) ### What problem does this PR solve? Catch non-begin component output ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) v0.19.x	2025-05-23 20:29:23 +08:00
Yongteng Lei	6c9b8ec860	Refa: update gemini2.5 (#7822 ) ### What problem does this PR solve? Update gemini2.5 ### Type of change - [x] Refactoring	2025-05-23 20:29:10 +08:00
balibabu	f9e6ad86b7	Fix: Fixed the issue that the script text of the code operator is not displayed after refreshing the page after saving the script text of the code operator #4977 (#7825 ) ### What problem does this PR solve? Fix: Fixed the issue that the script text of the code operator is not displayed after refreshing the page after saving the script text of the code operator #4977 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-23 18:57:45 +08:00
balibabu	e604634d2a	Feat: Refactor the MessageForm with shadcn #3221 (#7820 ) ### What problem does this PR solve? Feat: Refactor the MessageForm with shadcn #3221 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:45:13 +08:00
liu an	590b9dabab	Docs: update for v0.19.0 (#7823 ) ### What problem does this PR solve? update for v0.19.0 ### Type of change - [x] Documentation Update	2025-05-23 18:25:47 +08:00
writinwaters	c283ea57fd	Docs: Added v0.19.0 release notes (#7818 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-05-23 18:25:33 +08:00
Yongteng Lei	50ff16e7a4	Feat: add claude4 models (#7809 ) ### What problem does this PR solve? Add claude4 models. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:25:13 +08:00
Yongteng Lei	453287b06b	Feat: more robust fallbacks for citations (#7801 ) ### What problem does this PR solve? Add more robust fallbacks for citations ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:24:55 +08:00
liu an	e166f132b3	Feat: change default models (#7777 ) ### What problem does this PR solve? change default models to buildin models https://github.com/infiniflow/ragflow/issues/7774 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-05-23 18:21:25 +08:00
Yongteng Lei	42f4d4dbc8	Fix: wrong type hint (#7738 ) ### What problem does this PR solve? Wrong hint type. #7729. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-23 18:21:06 +08:00

1 2 3 4 5 ...

3075 Commits