Kevin Hu
220aaddc62
fix: synonym bug ( #3423 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-15 10:14:51 +08:00
Zhichang Yu
30f6421760
Use consistent log file names, introduced initLogger ( #3403 )
...
### What problem does this PR solve?
Use consistent log file names, introduced initLogger
### Type of change
- [ ] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Documentation Update
- [x] Refactoring
- [ ] Performance Improvement
- [ ] Other (please describe):
2024-11-14 17:13:48 +08:00
Kevin Hu
c5368c7745
resolve halt while starting up ( #3397 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-14 13:20:17 +08:00
Kevin Hu
91332fa0f8
Refine english synonym ( #3371 )
...
### What problem does this PR solve?
#3361
### Type of change
- [x] Performance Improvement
2024-11-13 12:58:37 +08:00
Zhichang Yu
a2a5631da4
Rework logging ( #3358 )
...
Unified all log files into one.
### What problem does this PR solve?
Unified all log files into one.
### Type of change
- [x] Refactoring
2024-11-12 17:35:13 +08:00
Zhichang Yu
f4c52371ab
Integration with Infinity ( #2894 )
...
### What problem does this PR solve?
Integration with Infinity
- Replaced ELASTICSEARCH with dataStoreConn
- Renamed deleteByQuery with delete
- Renamed bulk to upsertBulk
- getHighlight, getAggregation
- Fix KGSearch.search
- Moved Dealer.sql_retrieval to es_conn.py
### Type of change
- [x] Refactoring
2024-11-12 14:59:41 +08:00
Kevin Hu
004487cca0
fix term weight issue ( #3306 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-08 18:25:23 +08:00
Kevin Hu
8b6e272197
fix: term weight issue ( #3294 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-11-08 15:49:44 +08:00
Kevin Hu
d88f0d43ea
make language judgement robuster ( #3287 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-11-08 12:48:11 +08:00
Kevin Hu
fbcc0bb408
accelerate tokenize ( #3244 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-11-06 18:54:41 +08:00
Kevin Hu
55953819c1
accelerate term weight calculation ( #3206 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-11-05 13:11:26 +08:00
Kevin Hu
2d1fbefdb5
search between multiple indiices for team function ( #3079 )
...
### What problem does this PR solve?
#2834
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-10-29 13:19:01 +08:00
Kevin Hu
226bdd6e99
add auto keywords and auto-question ( #2965 )
...
### What problem does this PR solve?
#2687
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-10-22 13:12:49 +08:00
Kevin Hu
b164116277
refine token similarity ( #2824 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-10-14 13:33:18 +08:00
Kevin Hu
190eea7097
trival ( #2808 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-11 15:33:38 +08:00
Kevin Hu
2d1c83da59
fix LIGHTEN issue ( #2806 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-10-11 15:01:27 +08:00
Kevin Hu
04ff9cda7c
expand rerank range ( #2746 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-10-08 16:34:33 +08:00
lidp
08d5637770
Fix tokenizer bug ( #2573 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-09-25 10:30:49 +08:00
Kevin Hu
ecf441c830
refine using rerank model ( #2553 )
...
### What problem does this PR solve?
#2552
### Type of change
- [x] Performance Improvement
2024-09-24 12:38:18 +08:00
Kevin Hu
54342ae0a2
boost highlight performace ( #2419 )
...
### What problem does this PR solve?
#2415
### Type of change
- [x] Performance Improvement
2024-09-13 18:10:32 +08:00
Kevin Hu
9d4bb5767c
make highlight friendly to English ( #2417 )
...
### What problem does this PR solve?
#2415
### Type of change
- [x] Performance Improvement
2024-09-13 17:03:51 +08:00
Kevin Hu
4730145696
debug backend API for TAB 'search' ( #2389 )
...
### What problem does this PR solve?
#2247
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-09-12 17:51:20 +08:00
Kevin Hu
333608a1d4
add search TAB backend api ( #2375 )
...
### What problem does this PR solve?
#2247
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-09-11 19:49:18 +08:00
Kevin Hu
5a2c542ce2
make term similarity robust ( #2212 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-09-03 14:30:07 +08:00
Kevin Hu
6d232f1bdb
enable 3 char words to finegrind tokenize ( #2210 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-09-03 13:37:32 +08:00
Kevin Hu
642006c8e2
filter out + in es query ( #2046 )
...
### What problem does this PR solve?
#2028
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ]
2024-08-22 10:02:04 +08:00
Jin Hai
6b3a40be5c
Format file format from Windows/dos to Unix ( #1949 )
...
### What problem does this PR solve?
Related source file is in Windows/DOS format, they are format to Unix
format.
### Type of change
- [x] Refactoring
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2024-08-15 09:17:36 +08:00
Kevin Hu
78ed8fe9a5
add doc ids to chat ( #1944 )
...
### What problem does this PR solve?
### Type of change
- [x] Performance Improvement
2024-08-14 16:31:49 +08:00
Kevin Hu
43199c45c3
refine loginfo about graprag progress ( #1823 )
...
### What problem does this PR solve?
### Type of change
- [x] Refactoring
2024-08-06 16:01:43 +08:00
Kevin Hu
2452c5624f
remove duplicated key in mind map ( #1809 )
...
### What problem does this PR solve?
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-08-05 15:57:33 +08:00
Kevin Hu
152072f900
Add graphrag ( #1793 )
...
### What problem does this PR solve?
#1594
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-08-02 18:51:14 +08:00
H
79c873344b
Fix docs parser ( #1714 )
...
### What problem does this PR solve?
#1711
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-07-26 10:52:56 +08:00
Kevin Hu
c92d334b29
fix bug of regx ( #1703 )
...
### What problem does this PR solve?
#1689
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-07-25 14:30:58 +08:00
KevinHuSh
c889ef6363
examples empty in categorize ( #1422 )
...
### What problem does this PR solve?
Examples empty in categorize
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-07-08 17:40:50 +08:00
KevinHuSh
7c9ea5cad9
add interpreter to graph ( #1347 )
...
### What problem does this PR solve?
#918
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-07-03 12:15:15 +08:00
KevinHuSh
92e9320657
upgrade laws parser of docx ( #1332 )
...
### What problem does this PR solve?
### Type of change
- [x] Refactoring
2024-07-01 15:50:24 +08:00
Zhedong Cen
fc7cc1d36c
Optimize docx handle method in laws parser ( #1302 )
...
### What problem does this PR solve?
Optimize docx handle method in laws parser
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-06-28 17:42:59 +08:00
Zhedong Cen
38bd02f402
Support displaying images in the chunks of docx files when using general parser ( #1253 )
...
### What problem does this PR solve?
Support displaying images in chunks of docx files when using general
parser
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-06-24 16:29:36 +08:00
Wang Baoling
18f4a6b35c
feat: support json file ( #1217 )
...
### What problem does this PR solve?
feat: support json file.
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
---------
Co-authored-by: KevinHuSh <kevinhu.sh@gmail.com>
2024-06-21 10:42:29 +08:00
Zhedong Cen
3c1444ab19
Add docx support for manual parser ( #1227 )
...
### What problem does this PR solve?
Add docx support for manual parser
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-06-20 17:03:02 +08:00
KevinHuSh
e35f7610e7
fix too long query exception ( #1195 )
...
### What problem does this PR solve?
#1161
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-06-18 09:50:59 +08:00
Zhedong Cen
90975460af
Add pdf support for QA parser ( #1155 )
...
### What problem does this PR solve?
Support extracting questions and answers from PDF files
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-06-14 15:12:39 +08:00
KevinHuSh
4454ba7a1e
add self-rag ( #1070 )
...
### What problem does this PR solve?
#1069
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-06-06 11:13:39 +08:00
Jin Hai
9ed0e50f6b
Update info ( #1005 )
...
### What problem does this PR solve?
_Briefly describe what this PR aims to solve. Include background context
that will help reviewers understand the purpose of the PR._
### Type of change
- [x] Refactoring
Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2024-05-31 09:53:04 +08:00
KevinHuSh
758eb03ccb
fix jina adding issure and term weight refinement ( #974 )
...
### What problem does this PR solve?
#724 #162
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [x] New Feature (non-breaking change which adds functionality)
2024-05-29 19:38:57 +08:00
KevinHuSh
614defec21
add rerank model ( #969 )
...
### What problem does this PR solve?
feat: add rerank models to the project #724 #162
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-05-29 16:50:02 +08:00
KevinHuSh
7eee193956
fix #917 #915 ( #946 )
...
### What problem does this PR solve?
#917
#915
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-05-28 11:13:02 +08:00
GYH
be13429d05
Add api/list_kb_docs function and modify api/list_chunks ( #874 )
...
### What problem does this PR solve?
#717
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-05-22 14:58:56 +08:00
KevinHuSh
2b36283712
fix english query bug ( #840 )
...
### What problem does this PR solve?
#834
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-05-20 12:23:51 +08:00
KevinHuSh
648a2baaa9
fix disabled doc is still retreivalable ( #695 )
...
### What problem does this PR solve?
Fix that disabled doc is still retreivalable
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
2024-05-09 15:32:24 +08:00