### What problem does this PR solve?
Move the tiktoken ```cl100k_base``` encoding into the Docker image.
issue: #3338
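A minimal sketch of pre-caching the encoding at image build time, assuming tiktoken's standard ```TIKTOKEN_CACHE_DIR``` mechanism; the cache path is illustrative:
```python
# Build-time step (e.g. invoked via RUN in the Dockerfile): download
# cl100k_base once and cache it inside the image so runtime containers
# need no network access.
import os

os.environ["TIKTOKEN_CACHE_DIR"] = "/ragflow/tiktoken_cache"  # illustrative path

import tiktoken

tiktoken.get_encoding("cl100k_base")  # fetches and caches the BPE file
```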
### Type of change
- [x] Refactoring
Signed-off-by: jinhai <haijin.chn@gmail.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
Use consistent log file names; introduce ```initLogger```.
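A minimal sketch of what such a shared initializer might look like; apart from the name ```initLogger```, everything here is an assumption rather than the actual implementation:
```python
import logging
import os

def initLogger(name: str, log_dir: str = "logs") -> logging.Logger:
    """Hypothetical shared initializer: each component derives its log
    file from its own `name`, keeping file names consistent."""
    os.makedirs(log_dir, exist_ok=True)
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    handler = logging.FileHandler(os.path.join(log_dir, f"{name}.log"))
    handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(name)s %(message)s"))
    logger.addHandler(handler)
    return logger
```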
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [x] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
### What problem does this PR solve?
Added documentation for switching the document engine from Elasticsearch to Infinity.
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
- [x] Documentation Update
### What problem does this PR solve?
This pull request fixes the issue described in
https://github.com/infiniflow/ragflow/issues/3263.
1. The response should be parsed as a dict; otherwise the following line fails to read values (see the sketch after this list):
```ans = response["content"][0]["text"]```
2. The API model ```claude-instant-1.2``` has been retired (see
[model-deprecations](https://docs.anthropic.com/en/docs/resources/model-deprecations)),
which triggers errors in the code, so I removed it from the
conf/llm_factories.json file and added the latest model
```claude-3-5-sonnet-20241022```
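A minimal sketch of the failure mode and the fix, assuming the Anthropic Python SDK; the ```model_dump()``` conversion step is illustrative and may differ from the actual change:
```python
import anthropic

client = anthropic.Anthropic()

# The SDK returns a Message (pydantic) object, not a dict, so dict-style
# subscripting fails until it is converted.
response = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[{"role": "user", "content": "hello"}],
)
response = response.model_dump()      # convert Message -> plain dict
ans = response["content"][0]["text"]  # now safe to subscript
```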
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
---------
Co-authored-by: chenhaodong <chenhaodong@ctrlvideo.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
When a model's group name contains digits (0-9), the downloaded model
cannot be found, because the model directory name is not correctly
extracted from the model's full name.
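An illustrative reproduction with a hypothetical model name; the actual extraction code in the repository may differ:
```python
import re

# A letters-only pattern drops the digits from the group name, so the
# directory lookup misses the downloaded model.
full_name = "maidalun1020/bce-embedding-base_v1"     # hypothetical example
broken = re.match(r"[a-zA-Z]+", full_name).group(0)  # -> "maidalun"
fixed = full_name.split("/")[0]                      # -> "maidalun1020"
assert fixed == "maidalun1020"
```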
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
Co-authored-by: 王志鹏 <zhipeng3.wang@midea.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
Integration with Infinity (a rough interface sketch follows the list):
- Replaced ```ELASTICSEARCH``` with ```dataStoreConn```
- Renamed ```deleteByQuery``` to ```delete```
- Renamed ```bulk``` to ```upsertBulk```
- Added ```getHighlight``` and ```getAggregation```
- Fixed ```KGSearch.search```
- Moved ```Dealer.sql_retrieval``` to ```es_conn.py```
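A minimal sketch of the shape such a storage abstraction might take; the method names follow the renames above, while the class name and signatures are assumptions:
```python
from abc import ABC, abstractmethod

class DataStoreConn(ABC):
    """Hypothetical interface implemented by both the Elasticsearch and
    Infinity backends; signatures are illustrative."""

    @abstractmethod
    def delete(self, condition: dict, index_name: str) -> int:
        """Remove documents matching `condition` (was deleteByQuery)."""

    @abstractmethod
    def upsertBulk(self, rows: list[dict], index_name: str) -> list[str]:
        """Insert-or-update a batch of documents (was bulk)."""

    @abstractmethod
    def getHighlight(self, res, keywords: list[str], field: str) -> dict:
        """Extract highlighted fragments from a search result."""

    @abstractmethod
    def getAggregation(self, res, field: str) -> list:
        """Extract aggregation buckets from a search result."""
```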
### Type of change
- [x] Refactoring
### What problem does this PR solve?
Replaced Redis with Valkey. Close #3070
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [x] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
### What problem does this PR solve?
Details are described in
https://github.com/infiniflow/ragflow/issues/2968
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
---------
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
#2970
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
[Bug]: When using the OpenAI chat model, ERROR: 'CompletionUsage'
object has no attribute 'get' is raised #2948
### What problem does this PR solve?
The details of this PR are described in
https://github.com/infiniflow/ragflow/issues/2948
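A minimal sketch of the failure and a defensive fix, assuming ```openai>=1.x```, where ```usage``` is a ```CompletionUsage``` object rather than a dict:
```python
# openai>=1.x returns usage as a CompletionUsage pydantic object, so
# dict-style access raises: 'CompletionUsage' object has no attribute 'get'.
def total_tokens(response) -> int:
    usage = response.usage
    # usage.get("total_tokens")  # <- the failing dict-style call
    return getattr(usage, "total_tokens", 0)  # attribute access works
```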
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
1. fix: mind map rendering error in knowledge graph, caused by an
```@antv/g6``` version change
2. feat: concurrent-threads configuration support in the graph extractor
(see the sketch after this list)
3. fix: used-tokens update failed for tenant
4. feat: timeout configuration support for the LLM
5. fix: regex error in the graph extractor
6. feat: Qwen rerank (```gte-rerank```) support
7. fix: timeout handling in the knowledge graph indexing process. Chat now
uses stream output, and this is configurable.
8. feat: ```qwen-long``` model configuration
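A minimal sketch of item 2, running extraction over chunks with a configurable worker count; all names here are assumptions, not the actual code:
```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def extract_concurrently(chunks, extract_fn, max_workers: int = 8):
    """Run the (hypothetical) per-chunk extractor across a thread pool;
    `max_workers` is the new configuration knob."""
    results = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(extract_fn, chunk) for chunk in chunks]
        for future in as_completed(futures):
            results.append(future.result())
    return results
```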
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
---------
Co-authored-by: chongchuanbing <chongchuanbing@gmail.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
Resolves #2905
Due to inconsistent token sizes, I cap the input at a safe 500 tokens in
code, since there is no config param to control it.
My llama.cpp run sets ```-ub``` to 1024:
```
${llama_path}/bin/llama-server --host 0.0.0.0 --port 9901 -ub 1024 -ngl 99 -m $gguf_file --reranking "$@"
```
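A minimal sketch of the 500-token cap described above; the encoder and the call site are assumptions:
```python
# Truncate each passage before sending it to the rerank endpoint, keeping
# requests within llama.cpp's batch size (-ub). Hard-coded because there
# is no config param yet.
MAX_RERANK_TOKENS = 500

def truncate_for_rerank(text: str, encoder) -> str:
    tokens = encoder.encode(text)
    return encoder.decode(tokens[:MAX_RERANK_TOKENS])
```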
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
Here is my test of RAGFlow using llama.cpp:
```
slot update_slots: id 0 | task 458 | prompt done, n_past = 416, n_tokens = 416
slot release: id 0 | task 458 | stop processing: n_past = 416, truncated = 0
slot launch_slot_: id 0 | task 459 | processing task
slot update_slots: id 0 | task 459 | tokenizing prompt, len = 2
slot update_slots: id 0 | task 459 | prompt tokenized, n_ctx_slot = 8192, n_keep = 0, n_prompt_tokens = 111
slot update_slots: id 0 | task 459 | kv cache rm [0, end)
slot update_slots: id 0 | task 459 | prompt processing progress, n_past = 111, n_tokens = 111, progress = 1.000000
slot update_slots: id 0 | task 459 | prompt done, n_past = 111, n_tokens = 111
slot release: id 0 | task 459 | stop processing: n_past = 111, truncated = 0
srv update_slots: all slots are idle
request: POST /rerank 172.23.0.4 200
```
### What problem does this PR solve?
Fix keys of Xinference-deployed models, especially those sharing a model
name with publicly hosted models.
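An illustrative sketch of the idea: qualify locally deployed model keys so they cannot collide with a public model of the same name. The key format is an assumption, not necessarily the one this PR uses:
```python
def model_key(model_name: str, factory: str) -> str:
    """Hypothetical: namespace keys by provider so a Xinference-deployed
    "bge-m3" stays distinct from a publicly hosted "bge-m3"."""
    return f"{model_name}@{factory}"

assert model_key("bge-m3", "Xinference") != model_key("bge-m3", "BAAI")
```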
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
---------
Co-authored-by: 0000sir <0000sir@gmail.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
### What problem does this PR solve?
#2701 #2712 #2749
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
---------
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>