ragflow

AI/ragflow

mirror of https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git synced 2025-07-15 06:01:48 +08:00

Author	SHA1	Message	Date
Kevin Hu	89d5b2414e	fix SILICONFLOW rerank error (#2980 ) ### What problem does this PR solve? #2977 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-23 10:12:39 +08:00
chongchuanbing	ac26d09a59	Feature/feat1017 (#2872 ) ### What problem does this PR solve? 1. fix: mid map show error in knowledge graph, juse because ```@antv/g6```version changed 2. feat: concurrent threads configuration support in graph extractor 3. fix: used tokens update failed for tenant 4. feat: timeout configuration support for llm 5. fix: regex error in graph extractor 6. feat: qwen rerank(```gte-rerank```) support 7. fix: timeout deal in knowledge graph index process. Now chat by stream output, also, it is configuratable. 8. feat: ```qwen-long``` model configuration ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: chongchuanbing <chongchuanbing@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-21 12:11:08 +08:00
Ziyu Huang	e5f7733b31	Resolves #2905 openai compatible model provider add llama.cpp rerank support (#2906 ) ### What problem does this PR solve? Resolve #2905 due to the in-consistent of token size, I make it safe to limit 500 in code, since there is no config param to control my llama.cpp run set -ub to 1024: ${llama_path}/bin/llama-server --host 0.0.0.0 --port 9901 -ub 1024 -ngl 99 -m $gguf_file --reranking "$@" ### Type of change - [x] New Feature (non-breaking change which adds functionality) Here is my test Ragflow use llama.cpp ``` lot update_slots: id 0 \| task 458 \| prompt done, n_past = 416, n_tokens = 416 slot release: id 0 \| task 458 \| stop processing: n_past = 416, truncated = 0 slot launch_slot_: id 0 \| task 459 \| processing task slot update_slots: id 0 \| task 459 \| tokenizing prompt, len = 2 slot update_slots: id 0 \| task 459 \| prompt tokenized, n_ctx_slot = 8192, n_keep = 0, n_prompt_tokens = 111 slot update_slots: id 0 \| task 459 \| kv cache rm [0, end) slot update_slots: id 0 \| task 459 \| prompt processing progress, n_past = 111, n_tokens = 111, progress = 1.000000 slot update_slots: id 0 \| task 459 \| prompt done, n_past = 111, n_tokens = 111 slot release: id 0 \| task 459 \| stop processing: n_past = 111, truncated = 0 srv update_slots: all slots are idle request: POST /rerank 172.23.0.4 200 ```	2024-10-21 10:06:29 +08:00
0000sir	4991107822	Fix keys of Xinference deployed models, especially has the same model name with public hosted models. (#2832 ) ### What problem does this PR solve? Fix keys of Xinference deployed models, especially has the same model name with public hosted models. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: 0000sir <0000sir@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-16 10:21:08 +08:00
Kevin Hu	5e7c1fb23a	reduce rerank batch size (#2801 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-10-11 11:29:19 +08:00
Sky Blue	2df15742fc	fix xinference add rerank model bug (#2758 ) ### What problem does this PR solve? Fix xinference add rerank model bug, https://github.com/infiniflow/ragflow/issues/2294#issue-2510788135 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-09 19:37:11 +08:00
liuhua	b68d349bd6	Fix: renrank_model and pdf_parser bugs \| Update: session API (#2601 ) ### What problem does this PR solve? Fix: renrank_model and pdf_parser bugs \| Update: session API #2575 #2559 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring --------- Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>	2024-09-26 16:05:25 +08:00
Kevin Hu	dda1367ab2	make it lighten (#2577 ) ### What problem does this PR solve? #2295 ### Type of change - [x] Refactoring	2024-09-25 13:38:40 +08:00
Kevin Hu	7bb28ca2bd	add lighten control (#2567 ) ### What problem does this PR solve? #2295 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-09-24 19:22:01 +08:00
Kevin Hu	4a6a2a0f1b	refine xinference (#2521 ) ### What problem does this PR solve? #1588 ### Type of change - [x] Refactoring	2024-09-20 18:37:01 +08:00
黄腾	99993e5026	add support for Voyage AI (#2159 ) ### What problem does this PR solve? #1853 #2138 add support for Voyage AI ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-29 16:14:49 +08:00
黄腾	733219cc3f	add support for Baidu yiyan (#2049 ) ### What problem does this PR solve? add support for Baidu yiyan ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-22 16:45:15 +08:00
黄腾	e013ac52af	add support for SILICONFLOW (#1926 ) ### What problem does this PR solve? #1853 add support for SILICONFLOW ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-13 16:09:10 +08:00
黄腾	94cb66ba80	add support for TogetherAI (#1890 ) ### What problem does this PR solve? #1853 add support for TogetherAI ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-12 10:15:21 +08:00
黄腾	e34817c2a9	add support for cohere (#1849 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-07 18:40:51 +08:00
黄腾	b67484e77d	add supprot for OpenAI-API-Compatible llm (#1787 ) ### What problem does this PR solve? #1771 add supprot for OpenAI-API-Compatible ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-08-06 16:20:21 +08:00
Kung Quang	32d5885b68	Fix api reference empty bug (#1655 ) ### What problem does this PR solve? fix api reference empty bug ``` for chunk_i in answer['reference'].get('chunks',[]): ^^^^^^^^^^^^^^^^^^^^^^^ AttributeError: 'list' object has no attribute 'get' ``` ``` return np.array([d["relevance_score"] for d in res["results"]]), res["meta"]["tokens"]["input_tokens"]+res["meta"]["tokens"]["output_tokens"] ~~~^^^^^^^^^^^ KeyError: 'results' ``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-07-24 18:02:22 +08:00
黄腾	d96348eb22	add support for LM Studio (#1663 ) ### What problem does this PR solve? #1602 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-07-24 12:46:43 +08:00
黄腾	b4a281eca1	add support for NVIDIA llm (#1645 ) ### What problem does this PR solve? add support for NVIDIA llm ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-07-23 10:43:09 +08:00
黄腾	3fcdba1683	add support for LocalAI (#1608 ) ### What problem does this PR solve? #762 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-07-19 15:50:28 +08:00
zhuhao	3657b1f2a2	fix the tokens error that occurred when adding the xinference model (#1527 ) ### What problem does this PR solve? fix the tokens error that occurred when adding the xinference model #1522 root@pc-gpu-86-41:~# curl -X 'POST' 'http://127.0.0.1:9997/v1/rerank' -H 'accept: application/json' -H 'Content-Type: application/json' -d '{ "model": "bge-reranker-v2-m3", "query": "A man is eating pasta.", "return_documents":"true", "return_len":"true", "documents": [ "A man is eating food.", "A man is eating a piece of bread.", "The girl is carrying a baby.", "A man is riding a horse.", "A woman is playing violin." ] }' {"id":"610a8724-3e96-11ef-81ce-08bfb886c012","results":[{"index":0,"relevance_score":0.999574601650238,"document":{"text":"A man is eating food."}},{"index":1,"relevance_score":0.07814773917198181,"document":{"text":"A man is eating a piece of bread."}},{"index":3,"relevance_score":0.000017700713215162978,"document":{"text":"A man is riding a horse."}},{"index":2,"relevance_score":0.0000163753629749408,"document":{"text":"The girl is carrying a baby."}},{"index":4,"relevance_score":0.00001631895975151565,"document":{"text":"A woman is playing violin."}}],"meta":{"api_version":null,"billed_units":null,"tokens":{"input_tokens":38,"output_tokens":38},"warnings":null}} ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-07-16 15:08:51 +08:00
Kevin Hu	99f7bbaaa2	fix bugs of rerank model with xinference (#1481 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-07-12 12:33:37 +08:00
zhuhao	009e18f094	feat: support xinference rerank model (#1466 ) ### What problem does this PR solve? support xinference rerank model #1455 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-07-11 18:37:41 +08:00
KevinHuSh	0ce720a247	fix mem leak for local reranker (#1295 ) ### What problem does this PR solve? #1288 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-27 14:57:24 +08:00
zhuhao	47926a95ae	Fix ragflow may encounter an OOM (Out Of Memory) when there are a lot of conversations (#1292 ) ### What problem does this PR solve? Fix ragflow may encounter an OOM (Out Of Memory) when there are a lot of conversations. #1288 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: zhuhao <zhuhao@linklogis.com>	2024-06-27 14:48:49 +08:00
GYH	4fcd05ad23	fix Rerank Vector Similarity Score (#1249 ) ### What problem does this PR solve? #1243 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-24 12:25:50 +08:00
Wang Baoling	722c342d56	fix: bug similarity() in YoudaoRerank (#1084 ) ### What problem does this PR solve? bix fix ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-07 09:04:53 +08:00
KevinHuSh	4454ba7a1e	add self-rag (#1070 ) ### What problem does this PR solve? #1069 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-06-06 11:13:39 +08:00
KevinHuSh	b8eedbdd86	refine rerank (#1056 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-04 17:27:00 +08:00
KevinHuSh	cc064040a2	refine API request data processing (#1031 ) ### What problem does this PR solve? #1024 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-06-03 09:02:25 +08:00
Wang Baoling	c58a1c48eb	Fix: bug #991 (#1013 ) ### What problem does this PR solve? issue #991 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: KevinHuSh <kevinhu.sh@gmail.com>	2024-05-31 18:03:47 +08:00
KevinHuSh	dc7afe46fb	fix bug 994 ,991 (#1004 ) ### What problem does this PR solve? #994 #991 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-05-31 09:24:24 +08:00
KevinHuSh	77363a0875	fix bge rerank normalize issue (#988 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-05-30 12:55:17 +08:00
KevinHuSh	758eb03ccb	fix jina adding issure and term weight refinement (#974 ) ### What problem does this PR solve? #724 #162 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2024-05-29 19:38:57 +08:00
KevinHuSh	614defec21	add rerank model (#969 ) ### What problem does this PR solve? feat: add rerank models to the project #724 #162 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-05-29 16:50:02 +08:00

35 Commits