101 Commits

Author SHA1 Message Date
Jin Hai
a6d85c6c2f
Provide detailed error information. ()
### What problem does this PR solve?

Most 'index failure' error is caused by ES.

### Type of change

- [x] Refactoring

Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2024-08-26 13:36:00 +08:00
Kevin Hu
cc6a48b128
support monitoring task executor ()
### What problem does this PR solve?


### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-08-23 15:55:58 +08:00
Kevin Hu
0f95086813
add taskexecutor status check ()
### What problem does this PR solve?


### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-08-21 17:48:00 +08:00
Kevin Hu
5efb3476f2
turn down es bulk size ()
### What problem does this PR solve?


### Type of change

- [x] Performance Improvement
2024-08-20 09:59:17 +08:00
Kevin Hu
da8802d010
refine error log while chunking ()
### What problem does this PR solve?



### Type of change

- [x] Refactoring
2024-08-14 11:09:07 +08:00
黄腾
ede733e130
add support for eml file parser ()
### What problem does this PR solve?

add support for eml file parser


### Type of change

- [x] New Feature (non-breaking change which adds functionality)

---------

Co-authored-by: Zhedong Cen <cenzhedong2@126.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-08-06 16:42:14 +08:00
Kevin Hu
152072f900
Add graphrag ()
### What problem does this PR solve?



### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-08-02 18:51:14 +08:00
H
ac7a0d4fbf
Add ParsertType Audio ()
### What problem does this PR solve?

 

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-07-22 19:17:30 +08:00
KevinHuSh
6f99bbbb08
add raptor ()
### What problem does this PR solve?

 

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-05-23 14:31:16 +08:00
KevinHuSh
95f809187e
add stream chat ()
### What problem does this PR solve?

 
### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-05-16 20:14:53 +08:00
KevinHuSh
4d47b2b459
fix a string format error ()
### What problem does this PR solve?


### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-05-15 13:02:31 +08:00
KevinHuSh
aa1c915d6e
support gpt-4o ()
### What problem does this PR solve?
 

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-05-15 11:16:08 +08:00
Jin Hai
77b1520b66
Refactor message output format ()
### What problem does this PR solve?

_Briefly describe what this PR aims to solve. Include background context
that will help reviewers understand the purpose of the PR._

### Type of change

- [x] Refactoring

Signed-off-by: Jin Hai <haijin.chn@gmail.com>
2024-05-15 10:48:42 +08:00
KevinHuSh
ffe5737f7d
let index be batchly. ()
### What problem does this PR solve?

let index be batchly.

### Type of change


- [x] Refactoring
2024-05-11 19:47:53 +08:00
KevinHuSh
04a9e95161
let file in knowledgebases visible in file manager ()
### What problem does this PR solve?

Let file in knowledgebases visible in file manager.
 

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-05-11 16:04:28 +08:00
KevinHuSh
c28f7b5d38
make sure the error will be recorded. ()
### What problem does this PR solve?


### Type of change

- [x] Refactoring
2024-05-08 13:58:41 +08:00
KevinHuSh
a6e4b74d94
remove unused dependency ()
### What problem does this PR solve?

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-05-07 19:46:17 +08:00
Fakai Zhao
de839fc3f0
optimize srv broker and executor logic ()
### What problem does this PR solve?

Optimize task broker and executor for reduce memory usage and deployment
complexity.

### Type of change
- [x] Performance Improvement
- [x] Refactoring

### Change Log
- Enhance redis utils for message queue(use stream)
- Modify task broker logic via message queue (1.get parse event from
message queue 2.use ThreadPoolExecutor async executor )
- Modify the table column name of document and task (process_duation ->
process_duration maybe just a spelling mistake)
- Reformat some code style(just what i see)
- Add requirement_dev.txt for developer
- Add redis container on docker compose

---------

Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-05-07 11:43:33 +08:00
KevinHuSh
5f03a4de11
remove redis ()
### What problem does this PR solve?

### Type of change

- [x] Refactoring
2024-04-30 19:00:41 +08:00
KevinHuSh
9d60a84958
refactor code ()
### What problem does this PR solve?

### Type of change

- [x] Refactoring
2024-04-28 13:19:54 +08:00
KevinHuSh
944776f207
fix bug about fetching file from minio ()
### What problem does this PR solve?


### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-04-28 09:57:40 +08:00
KevinHuSh
66f8d35632
Refactor ()
### What problem does this PR solve?

### Type of change

- [x] Refactoring
2024-04-25 14:14:28 +08:00
KevinHuSh
0dfc8ddc0f
enlarge docker memory usage ()
### What problem does this PR solve?

### Type of change

- [x] Refactoring
2024-04-23 14:41:10 +08:00
KevinHuSh
b8e58fe27a
add redis to accelerate access of minio ()
### What problem does this PR solve?

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-04-22 14:11:09 +08:00
KevinHuSh
ed6081845a
Fit a lot of encodings for text file. ()
### What problem does this PR solve?



### Type of change

- [x] Performance Improvement
2024-04-19 18:02:53 +08:00
KevinHuSh
890561703b
Add bce-embedding and fastembed ()
### What problem does this PR solve?


Issue link:#326

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-04-16 16:42:19 +08:00
KevinHuSh
91068edf16
Support Xinference ()
### What problem does this PR solve?

Issue link:#299

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-04-11 18:22:25 +08:00
KevinHuSh
923cbe488e
fix task_executor occupy cpu too much ()
### What problem does this PR solve?

Issue link:#285

### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
2024-04-10 10:11:22 +08:00
KevinHuSh
3708b97db9
Support Ollama ()
### What problem does this PR solve?

Issue link:#221

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
2024-04-08 19:20:57 +08:00
KevinHuSh
38e5737067
add base url for OpenAI () 2024-03-28 19:15:16 +08:00
KevinHuSh
fd7fcb5baf
apply pep8 formalize () 2024-03-27 11:33:46 +08:00
KevinHuSh
979b3a5b4b
support snapshot download from local ()
* support snapshot download from local

* let snapshot download from local
2024-03-27 09:53:42 +08:00
KevinHuSh
71fe314955
refine page ranges () 2024-03-25 13:11:57 +08:00
KevinHuSh
5875c8ba08
Add 'One' chunk method () 2024-03-20 18:57:22 +08:00
KevinHuSh
675a9f8d9a
add dockerfile for cuda envirement. Refine table search strategy, () 2024-03-14 19:45:29 +08:00
KevinHuSh
0feb085c88
refine table parser () 2024-03-12 18:56:04 +08:00
KevinHuSh
f1f09df901
add local llm implementation () 2024-03-12 11:57:08 +08:00
KevinHuSh
b89ac3c4be
chage tas execution logic () 2024-03-06 19:16:31 +08:00
KevinHuSh
602038ac49
fix task cancling bug () 2024-03-05 16:33:47 +08:00
KevinHuSh
8a57f2afd5
change callback strategy, add timezone to docker () 2024-03-05 12:08:41 +08:00
KevinHuSh
685b4d8a95
fix table desc bugs, add positions to chunks () 2024-03-04 14:42:26 +08:00
KevinHuSh
8a726fb04b
solve task execution issues () 2024-03-01 19:48:01 +08:00
KevinHuSh
0429107e80
fix user login issue () 2024-02-29 14:03:07 +08:00
KevinHuSh
320cbf545f
docker debugging modification () 2024-02-28 18:57:43 +08:00
KevinHuSh
7fd1eca582
init README of deepdoc, add picture processer. ()
* init README of deepdoc, add picture processer.

* add resume parsing
2024-02-23 18:28:12 +08:00
KevinHuSh
a8294f2168 Refine resume parts and fix bugs in retrival using sql () 2024-02-19 19:22:17 +08:00
KevinHuSh
5e0a689c43 refactor retieval_test, add SQl retrieval methods () 2024-02-08 17:01:01 +08:00
KevinHuSh
c5ea37cd30 Add resume parser and fix bugs ()
* Update .gitignore

* Update .gitignore

* Add resume parser and fix bugs
2024-02-07 19:27:23 +08:00
KevinHuSh
407b2523b6 remove unused codes, seperate layout detection out as a new api. Add new rag methed 'table' () 2024-02-05 18:08:17 +08:00
KevinHuSh
e6acaf6738 Add Q&A and Book, fix task running bugs () 2024-02-01 18:53:56 +08:00