ragflow

AI/ragflow

mirror of https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git synced 2025-08-19 16:49:10 +08:00

Author	SHA1	Message	Date
Zhichang Yu	dec9b3e540	Fix logs. Use dict.pop instead of del. Close #3473 (#3484 ) ### What problem does this PR solve? Fix logs. Use dict.pop instead of del. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-19 14:15:25 +08:00
Zhichang Yu	4413683898	Introduced beartype (#3460 ) ### What problem does this PR solve? Introduced [beartype](https://github.com/beartype/beartype) for runtime type-checking. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-18 17:38:17 +08:00
shizzgar	4b3eeaa6ef	Added LocalAI support for rerank models (#3446 ) ### What problem does this PR solve? Hi there! LocalAI added support for rerank models: https://localai.io/features/reranker/ I've implemented a LocalAIRerank class (essentially copied from the OpenAI_APIRerank class). Also, the LocalAI model responds with a 500 error code if the length of "documents" is less than 2 in the similarity check, so I've added a second "document" to the RERANK model connection check in `api/apps/llm_app.py`. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-18 12:05:52 +08:00
Kevin Hu	a1d01a1b2f	enlarge the default token length of RAPTOR summarization (#3454 ) ### What problem does this PR solve? #3426 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-18 10:15:26 +08:00
Zhichang Yu	77bdeb32bd	Added current task into task executor's heartbeat (#3444 ) ### What problem does this PR solve? Added current task into task executor's heartbeat ### Type of change - [x] Refactoring	2024-11-15 22:55:41 +08:00
Zhichang Yu	4ed5ca2666	handle_task catches all exceptions (#3441 ) ### What problem does this PR solve? handle_task catches all exceptions. Reports heartbeats. ### Type of change - [x] Refactoring	2024-11-15 18:51:09 +08:00
Jin Hai	1e90a1bf36	Move settings initialization after module init phase (#3438 ) ### What problem does this PR solve? 1. Module init no longer connects to the database. 2. Config values in settings need to be accessed as settings.CONFIG_NAME ### Type of change - [x] Refactoring Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-11-15 17:30:56 +08:00
Kevin Hu	cb3b9d7ada	refine the message of queuing a task (#3437 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2024-11-15 15:59:54 +08:00
Kevin Hu	ca9e97d2f2	Enlarge the term weight difference (#3435 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-11-15 15:41:50 +08:00
Zhichang Yu	a854bc22d1	Rework task executor heartbeat (#3430 ) ### What problem does this PR solve? Rework task executor heartbeat, and print in console. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-15 14:43:55 +08:00
Kevin Hu	48e060aa53	rm es query escape chars (#3428 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-15 13:19:07 +08:00
Kevin Hu	a1ba228bc2	fix: empty token bug (#3424 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-15 10:33:03 +08:00
Jin Hai	996c94a8e7	Move cl100k_base tokenizer to docker image (#3411 ) ### What problem does this PR solve? Move the tiktoken of cl100k_base into docker image issue: #3338 ### Type of change - [x] Refactoring Signed-off-by: jinhai <haijin.chn@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-15 10:18:40 +08:00
Kevin Hu	220aaddc62	fix: synonym bug (#3423 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-15 10:14:51 +08:00
Zhichang Yu	30f6421760	Use consistent log file names, introduced initLogger (#3403 ) ### What problem does this PR solve? Use consistent log file names, introduced initLogger ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-14 17:13:48 +08:00
Kevin Hu	ab4384e011	Updates on parsing progress, including more detailed time cost inform… (#3402 ) ### What problem does this PR solve? #3401 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-14 16:28:10 +08:00
Kevin Hu	c5368c7745	resolve halt while starting up (#3397 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-14 13:20:17 +08:00
Kevin Hu	4caf932808	fix bug about fetching knowledge graph (#3394 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-14 12:29:15 +08:00
Zhichang Yu	9d395ab74e	Added doc for switching elasticsearch to infinity (#3370 ) ### What problem does this PR solve? Added doc for switching elasticsearch to infinity ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2024-11-14 00:08:55 +08:00
Kevin Hu	83c6b1f308	set DLA active for KG (#3386 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2024-11-13 16:59:19 +08:00
shijiefengjun	632b23486f	Fix the value issue of anthropic (#3351 ) ### What problem does this PR solve? This pull request fixes the issue mentioned in https://github.com/infiniflow/ragflow/issues/3263. 1. response should be parsed as dict, prevent the following code from failing to take values: ans = response["content"][0]["text"] 2. API Model ```claude-instant-1.2``` has retired (by [model-deprecations](https://docs.anthropic.com/en/docs/resources/model-deprecations)), it will trigger errors in the code, so I deleted it from the conf/llm_factories.json file and updated the latest API Model ```claude-3-5-sonnet-20241022``` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: chenhaodong <chenhaodong@ctrlvideo.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-13 16:13:52 +08:00
Kevin Hu	ccf189cb7f	mv service_conf.yaml to conf/ and fix: add 'answer' as a parameter to 'generate' (#3379 ) ### What problem does this PR solve? #3373 ### Type of change - [x] Refactoring - [x] Bug fix	2024-11-13 15:56:40 +08:00
roc king	fa54cd5f5c	extract model dir from model's full name (#3368 ) ### What problem does this PR solve? When a model's group name contains 0-9, we can't find the downloaded model, because we do not correctly extract the model dir's name from the model's full name ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: 王志鹏 <zhipeng3.wang@midea.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-13 14:10:16 +08:00
Kevin Hu	91332fa0f8	Refine english synonym (#3371 ) ### What problem does this PR solve? #3361 ### Type of change - [x] Performance Improvement	2024-11-13 12:58:37 +08:00
Zhichang Yu	a2a5631da4	Rework logging (#3358 ) Unified all log files into one. ### What problem does this PR solve? Unified all log files into one. ### Type of change - [x] Refactoring	2024-11-12 17:35:13 +08:00
Zhichang Yu	f4c52371ab	Integration with Infinity (#2894 ) ### What problem does this PR solve? Integration with Infinity - Replaced ELASTICSEARCH with dataStoreConn - Renamed deleteByQuery with delete - Renamed bulk to upsertBulk - getHighlight, getAggregation - Fix KGSearch.search - Moved Dealer.sql_retrieval to es_conn.py ### Type of change - [x] Refactoring	2024-11-12 14:59:41 +08:00
Kevin Hu	34d1daac67	fix: Anthropic param error (#3327 ) ### What problem does this PR solve? #3263 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-11 11:54:14 +08:00
Kevin Hu	5e5a35191e	fix benchmark issue (#3324 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-11 10:14:30 +08:00
Kevin Hu	004487cca0	fix term weight issue (#3306 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-08 18:25:23 +08:00
Kevin Hu	8b6e272197	fix: term weight issue (#3294 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-08 15:49:44 +08:00
Kevin Hu	d88f0d43ea	make language judgement more robust (#3287 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-11-08 12:48:11 +08:00
Kevin Hu	fbcc0bb408	accelerate tokenize (#3244 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-11-06 18:54:41 +08:00
Kevin Hu	4097912d59	add inputs to display to every component (#3242 ) ### What problem does this PR solve? #3240 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-11-06 18:47:53 +08:00
ksztone-huanggonghao	0dff64f6ad	fix: TypeError: only length-1 arrays can be converted to Python scalars (#3211 ) ### What problem does this PR solve? fix "TypeError: only length-1 arrays can be converted to Python scalars" while using cohere embedding model. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) ![image](https://github.com/user-attachments/assets/2c21a69f-cd76-4d25-b320-058964812db8)	2024-11-06 11:15:00 +08:00
Kevin Hu	55953819c1	accelerate term weight calculation (#3206 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-11-05 13:11:26 +08:00
Kevin Hu	677f02c2a7	rm unused file (#3205 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2024-11-05 11:56:09 +08:00
Zhichang Yu	37d71dfa90	Replaced redis with Valkey (#3164 ) ### What problem does this PR solve? Replaced redis with Valkey. Close #3070 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-02 20:05:12 +08:00
Kevin Hu	2d1fbefdb5	search between multiple indices for team function (#3079 ) ### What problem does this PR solve? #2834 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-29 13:19:01 +08:00
Kevin Hu	7e0148c058	fix local variable ans (#3077 ) ### What problem does this PR solve? #3064 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-29 10:42:45 +08:00
Kevin Hu	f86826b7a0	refactor error message of qwen (#3074 ) ### What problem does this PR solve? #3055 ### Type of change - [x] Refactoring	2024-10-29 10:08:08 +08:00
Kevin Hu	9457d20ef1	make gemini robust (#3012 ) ### What problem does this PR solve? #3003 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-25 10:50:44 +08:00
Kevin Hu	7f81fc8f9b	refactor auto keywords and auto question (#2990 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2024-10-23 17:00:56 +08:00
Kevin Hu	89d5b2414e	fix SILICONFLOW rerank error (#2980 ) ### What problem does this PR solve? #2977 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-23 10:12:39 +08:00
Yinquan WANG	445dce4363	[Bug]: unnecessary auto-increment calculations in the tokens statistics of the chat model (#2969 ) ### What problem does this PR solve? The details are shown in https://github.com/infiniflow/ragflow/issues/2968 ### Type of change - [X] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-22 16:26:04 +08:00
Kevin Hu	1fce6caf80	keep titles in markdown from being split from the following content (#2971 ) ### What problem does this PR solve? #2970 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2024-10-22 15:25:23 +08:00
Kevin Hu	226bdd6e99	add auto keywords and auto-question (#2965 ) ### What problem does this PR solve? #2687 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-10-22 13:12:49 +08:00
Yinquan WANG	5aa9d7787e	[Bug]: When using the OpenAI chat model, raises ERROR: 'CompletionUsage' object has no attribute 'get' #2948 (#2949 ) ### What problem does this PR solve? The details of this PR are shown at https://github.com/infiniflow/ragflow/issues/2948 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-22 11:40:05 +08:00
Kevin Hu	b2524eec49	fix sequence2txt error and usage total token issue (#2961 ) ### What problem does this PR solve? #1363 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-22 11:38:37 +08:00
chongchuanbing	ac26d09a59	Feature/feat1017 (#2872 ) ### What problem does this PR solve? 1. fix: mind map display error in knowledge graph, caused by a change in the ```@antv/g6``` version 2. feat: concurrent threads configuration support in graph extractor 3. fix: used tokens update failed for tenant 4. feat: timeout configuration support for llm 5. fix: regex error in graph extractor 6. feat: qwen rerank (```gte-rerank```) support 7. fix: timeout handling in knowledge graph index process. Chat now uses stream output, and this is configurable. 8. feat: ```qwen-long``` model configuration ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: chongchuanbing <chongchuanbing@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-21 12:11:08 +08:00
Ziyu Huang	e5f7733b31	Resolves #2905 openai compatible model provider add llama.cpp rerank support (#2906 ) ### What problem does this PR solve? Resolve #2905 Due to the inconsistency of token sizes, I limit it to 500 in code to be safe, since there is no config param to control it. My llama.cpp run sets -ub to 1024: ${llama_path}/bin/llama-server --host 0.0.0.0 --port 9901 -ub 1024 -ngl 99 -m $gguf_file --reranking "$@" ### Type of change - [x] New Feature (non-breaking change which adds functionality) Here is my test of RAGFlow using llama.cpp ``` lot update_slots: id 0 \| task 458 \| prompt done, n_past = 416, n_tokens = 416 slot release: id 0 \| task 458 \| stop processing: n_past = 416, truncated = 0 slot launch_slot_: id 0 \| task 459 \| processing task slot update_slots: id 0 \| task 459 \| tokenizing prompt, len = 2 slot update_slots: id 0 \| task 459 \| prompt tokenized, n_ctx_slot = 8192, n_keep = 0, n_prompt_tokens = 111 slot update_slots: id 0 \| task 459 \| kv cache rm [0, end) slot update_slots: id 0 \| task 459 \| prompt processing progress, n_past = 111, n_tokens = 111, progress = 1.000000 slot update_slots: id 0 \| task 459 \| prompt done, n_past = 111, n_tokens = 111 slot release: id 0 \| task 459 \| stop processing: n_past = 111, truncated = 0 srv update_slots: all slots are idle request: POST /rerank 172.23.0.4 200 ```	2024-10-21 10:06:29 +08:00


681 Commits