ragflow

AI/ragflow

mirror of https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git synced 2025-07-13 16:31:46 +08:00

Author	SHA1	Message	Date
Kevin Hu	4776fa5e4e	Refactor for total_tokens. (#4652 ) ### What problem does this PR solve? #4567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-26 13:54:26 +08:00
Kevin Hu	3805621564	Fix xinference rerank issue. (#4499 ) ### What problem does this PR solve? #4495 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-01-16 11:35:51 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	cb45431412	Fix Voyage re-rank model. Limit file name length. (#4171 ) ### What problem does this PR solve? #4152 #4154 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-23 10:03:50 +08:00
Kevin Hu	593ffc4067	Fix HuggingFace model error. (#3870 ) ### What problem does this PR solve? #3865 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 13:28:42 +08:00
Kevin Hu	78601ee1bd	Fix open AI compatible rerank issue. (#3866 ) ### What problem does this PR solve? #3700 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-05 10:26:21 +08:00
Kevin Hu	3f3469130b	Fix preview issue in file manager. (#3846 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-04 11:53:23 +08:00
devMls	59a5813f1b	add jina new models in jina connector (#3770 ) ### What problem does this PR solve? add new models in jinna connector, to allow use models that support multilingual models ### Type of change - [X] Other (please describe): new connectors no breaking change	2024-12-02 10:06:39 +08:00
Kevin Hu	91f1814a87	Fix error response (#3719 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2024-11-28 18:56:10 +08:00
liwenju0	875096384b	when qwen rerank model not return ok, raise exception to notice user (#3593 ) ### What problem does this PR solve? When calling the Qwen rerank model, if the model does not return correctly, an exception should be raised to notify the user, rather than simply returning a value of 0, as this would be confusing to the user. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-22 22:34:34 +08:00
shizzgar	4b3eeaa6ef	Added LocalAI support for rerank models (#3446 ) ### What problem does this PR solve? Hi there! LocalAI added support of rerank models https://localai.io/features/reranker/ I've implemented LocalAIRerank class (typically copied it from OpenAI_APIRerank class). Also, LocalAI model response with 500 error code if len of "documents" is less than 2 in similarity check. So I've added the second "document" on RERANK model connection check in `api/apps/llm_app.py`. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-18 12:05:52 +08:00
Jin Hai	1e90a1bf36	Move settings initialization after module init phase (#3438 ) ### What problem does this PR solve? 1. Module init won't connect database any more. 2. Config in settings need to be used with settings.CONFIG_NAME ### Type of change - [x] Refactoring Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-11-15 17:30:56 +08:00
Zhichang Yu	30f6421760	Use consistent log file names, introduced initLogger (#3403 ) ### What problem does this PR solve? Use consistent log file names, introduced initLogger ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-14 17:13:48 +08:00
roc king	fa54cd5f5c	exstract model dir from model‘s full name (#3368 ) ### What problem does this PR solve? When model’s group name contains 0-9，we can't find downloaded model，because we do not correctly exstract model dir's name from model‘s full name ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: 王志鹏 <zhipeng3.wang@midea.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-13 14:10:16 +08:00
Zhichang Yu	a2a5631da4	Rework logging (#3358 ) Unified all log files into one. ### What problem does this PR solve? Unified all log files into one. ### Type of change - [x] Refactoring	2024-11-12 17:35:13 +08:00
Kevin Hu	4097912d59	add inputs to display to every components (#3242 ) ### What problem does this PR solve? #3240 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-06 18:47:53 +08:00
Kevin Hu	89d5b2414e	fix SILICONFLOW rerank error (#2980 ) ### What problem does this PR solve? #2977 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-23 10:12:39 +08:00
chongchuanbing	ac26d09a59	Feature/feat1017 (#2872 ) ### What problem does this PR solve? 1. fix: mid map show error in knowledge graph, juse because ```@antv/g6```version changed 2. feat: concurrent threads configuration support in graph extractor 3. fix: used tokens update failed for tenant 4. feat: timeout configuration support for llm 5. fix: regex error in graph extractor 6. feat: qwen rerank(```gte-rerank```) support 7. fix: timeout deal in knowledge graph index process. Now chat by stream output, also, it is configuratable. 8. feat: ```qwen-long``` model configuration ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: chongchuanbing <chongchuanbing@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-21 12:11:08 +08:00
Ziyu Huang	e5f7733b31	Resolves #2905 openai compatible model provider add llama.cpp rerank support (#2906 ) ### What problem does this PR solve? Resolve #2905 due to the in-consistent of token size, I make it safe to limit 500 in code, since there is no config param to control my llama.cpp run set -ub to 1024: ${llama_path}/bin/llama-server --host 0.0.0.0 --port 9901 -ub 1024 -ngl 99 -m $gguf_file --reranking "$@" ### Type of change - [x] New Feature (non-breaking change which adds functionality) Here is my test Ragflow use llama.cpp ``` lot update_slots: id 0 \| task 458 \| prompt done, n_past = 416, n_tokens = 416 slot release: id 0 \| task 458 \| stop processing: n_past = 416, truncated = 0 slot launch_slot_: id 0 \| task 459 \| processing task slot update_slots: id 0 \| task 459 \| tokenizing prompt, len = 2 slot update_slots: id 0 \| task 459 \| prompt tokenized, n_ctx_slot = 8192, n_keep = 0, n_prompt_tokens = 111 slot update_slots: id 0 \| task 459 \| kv cache rm [0, end) slot update_slots: id 0 \| task 459 \| prompt processing progress, n_past = 111, n_tokens = 111, progress = 1.000000 slot update_slots: id 0 \| task 459 \| prompt done, n_past = 111, n_tokens = 111 slot release: id 0 \| task 459 \| stop processing: n_past = 111, truncated = 0 srv update_slots: all slots are idle request: POST /rerank 172.23.0.4 200 ```	2024-10-21 10:06:29 +08:00
0000sir	4991107822	Fix keys of Xinference deployed models, especially has the same model name with public hosted models. (#2832 ) ### What problem does this PR solve? Fix keys of Xinference deployed models, especially has the same model name with public hosted models. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: 0000sir <0000sir@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-16 10:21:08 +08:00
Kevin Hu	5e7c1fb23a	reduce rerank batch size (#2801 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-10-11 11:29:19 +08:00
Sky Blue	2df15742fc	fix xinference add rerank model bug (#2758 ) ### What problem does this PR solve? Fix xinference add rerank model bug, https://github.com/infiniflow/ragflow/issues/2294#issue-2510788135 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-09 19:37:11 +08:00
liuhua	b68d349bd6	Fix: renrank_model and pdf_parser bugs \| Update: session API (#2601 ) ### What problem does this PR solve? Fix: renrank_model and pdf_parser bugs \| Update: session API #2575 #2559 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring --------- Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>	2024-09-26 16:05:25 +08:00
Kevin Hu	dda1367ab2	make it lighten (#2577 ) ### What problem does this PR solve? #2295 ### Type of change - [x] Refactoring	2024-09-25 13:38:40 +08:00
Kevin Hu	7bb28ca2bd	add lighten control (#2567 ) ### What problem does this PR solve? #2295 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-09-24 19:22:01 +08:00
Kevin Hu	4a6a2a0f1b	refine xinference (#2521 ) ### What problem does this PR solve? #1588 ### Type of change - [x] Refactoring	2024-09-20 18:37:01 +08:00
黄腾	99993e5026	add support for Voyage AI (#2159 ) ### What problem does this PR solve? #1853 #2138 add support for Voyage AI ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-29 16:14:49 +08:00
黄腾	733219cc3f	add support for Baidu yiyan (#2049 ) ### What problem does this PR solve? add support for Baidu yiyan ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-22 16:45:15 +08:00
黄腾	e013ac52af	add support for SILICONFLOW (#1926 ) ### What problem does this PR solve? #1853 add support for SILICONFLOW ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-13 16:09:10 +08:00
黄腾	94cb66ba80	add support for TogetherAI (#1890 ) ### What problem does this PR solve? #1853 add support for TogetherAI ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-12 10:15:21 +08:00
黄腾	e34817c2a9	add support for cohere (#1849 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-07 18:40:51 +08:00
黄腾	b67484e77d	add supprot for OpenAI-API-Compatible llm (#1787 ) ### What problem does this PR solve? #1771 add supprot for OpenAI-API-Compatible ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-06 16:20:21 +08:00
Kung Quang	32d5885b68	Fix api reference empty bug (#1655 ) ### What problem does this PR solve? fix api reference empty bug ``` for chunk_i in answer['reference'].get('chunks',[]): ^^^^^^^^^^^^^^^^^^^^^^^ AttributeError: 'list' object has no attribute 'get' ``` ``` return np.array([d["relevance_score"] for d in res["results"]]), res["meta"]["tokens"]["input_tokens"]+res["meta"]["tokens"]["output_tokens"] ~~~^^^^^^^^^^^ KeyError: 'results' ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-07-24 18:02:22 +08:00
黄腾	d96348eb22	add support for LM Studio (#1663 ) ### What problem does this PR solve? #1602 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-07-24 12:46:43 +08:00
黄腾	b4a281eca1	add support for NVIDIA llm (#1645 ) ### What problem does this PR solve? add support for NVIDIA llm ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-07-23 10:43:09 +08:00
黄腾	3fcdba1683	add support for LocalAI (#1608 ) ### What problem does this PR solve? #762 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-07-19 15:50:28 +08:00
zhuhao	3657b1f2a2	fix the tokens error that occurred when adding the xinference model (#1527 ) ### What problem does this PR solve? fix the tokens error that occurred when adding the xinference model #1522 root@pc-gpu-86-41:~# curl -X 'POST' 'http://127.0.0.1:9997/v1/rerank' -H 'accept: application/json' -H 'Content-Type: application/json' -d '{ "model": "bge-reranker-v2-m3", "query": "A man is eating pasta.", "return_documents":"true", "return_len":"true", "documents": [ "A man is eating food.", "A man is eating a piece of bread.", "The girl is carrying a baby.", "A man is riding a horse.", "A woman is playing violin." ] }' {"id":"610a8724-3e96-11ef-81ce-08bfb886c012","results":[{"index":0,"relevance_score":0.999574601650238,"document":{"text":"A man is eating food."}},{"index":1,"relevance_score":0.07814773917198181,"document":{"text":"A man is eating a piece of bread."}},{"index":3,"relevance_score":0.000017700713215162978,"document":{"text":"A man is riding a horse."}},{"index":2,"relevance_score":0.0000163753629749408,"document":{"text":"The girl is carrying a baby."}},{"index":4,"relevance_score":0.00001631895975151565,"document":{"text":"A woman is playing violin."}}],"meta":{"api_version":null,"billed_units":null,"tokens":{"input_tokens":38,"output_tokens":38},"warnings":null}} ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-07-16 15:08:51 +08:00
Kevin Hu	99f7bbaaa2	fix bugs of rerank model with xinference (#1481 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-07-12 12:33:37 +08:00
zhuhao	009e18f094	feat: support xinference rerank model (#1466 ) ### What problem does this PR solve? support xinference rerank model #1455 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-07-11 18:37:41 +08:00
KevinHuSh	0ce720a247	fix mem leak for local reranker (#1295 ) ### What problem does this PR solve? #1288 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-27 14:57:24 +08:00
zhuhao	47926a95ae	Fix ragflow may encounter an OOM (Out Of Memory) when there are a lot of conversations (#1292 ) ### What problem does this PR solve? Fix ragflow may encounter an OOM (Out Of Memory) when there are a lot of conversations. #1288 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: zhuhao <zhuhao@linklogis.com>	2024-06-27 14:48:49 +08:00
GYH	4fcd05ad23	fix Rerank Vector Similarity Score (#1249 ) ### What problem does this PR solve? #1243 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-24 12:25:50 +08:00
Wang Baoling	722c342d56	fix: bug similarity() in YoudaoRerank (#1084 ) ### What problem does this PR solve? bix fix ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-07 09:04:53 +08:00
KevinHuSh	4454ba7a1e	add self-rag (#1070 ) ### What problem does this PR solve? #1069 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-06-06 11:13:39 +08:00
KevinHuSh	b8eedbdd86	refine rerank (#1056 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-04 17:27:00 +08:00
KevinHuSh	cc064040a2	refine API request data processing (#1031 ) ### What problem does this PR solve? #1024 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-03 09:02:25 +08:00
Wang Baoling	c58a1c48eb	Fix: bug #991 (#1013 ) ### What problem does this PR solve? issue #991 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: KevinHuSh <kevinhu.sh@gmail.com>	2024-05-31 18:03:47 +08:00
KevinHuSh	dc7afe46fb	fix bug 994 ,991 (#1004 ) ### What problem does this PR solve? #994 #991 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-05-31 09:24:24 +08:00
KevinHuSh	77363a0875	fix bge rerank normalize issue (#988 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-05-30 12:55:17 +08:00
KevinHuSh	758eb03ccb	fix jina adding issure and term weight refinement (#974 ) ### What problem does this PR solve? #724 #162 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2024-05-29 19:38:57 +08:00

1 2

51 Commits