KevinHuSh
b8e58fe27a
add redis to accelerate access of minio ( #482 )
...
### What problem does this PR solve?
### Type of change
- [x] New Feature (non-breaking change which adds functionality)
2024-04-22 14:11:09 +08:00
KevinHuSh
7e41b4bc94
change readme for 0.3.0 release ( #459 )
...
### What problem does this PR solve?
### Type of change
- [x] Documentation Update
2024-04-19 18:19:15 +08:00
KevinHuSh
ed6081845a
Fit a lot of encodings for text file. ( #458 )
...
### What problem does this PR solve?
#384
### Type of change
- [x] Performance Improvement
2024-04-19 18:02:53 +08:00
KevinHuSh
572e5b1ff1
Let task continue dispaching while meeting unexpected doc formats ( #199 )
...
### What problem does this PR solve?
_Briefly describe what this PR aims to solve. Include background context
that will help reviewers understand the purpose of the PR._
Issue link:#[[Link the issue
here](https://github.com/infiniflow/ragflow/issues/198 )]
### Type of change
- [x] Bug Fix (non-breaking change which fixes an issue)
- [ ] New Feature (non-breaking change which adds functionality)
- [ ] Breaking Change (fix or feature that could cause existing
functionality not to work as expected)
- [ ] Documentation Update
- [ ] Refactoring
- [ ] Performance Improvement
- [ ] Test cases
- [ ] Python SDK impacted, Need to update PyPI
- [ ] Other (please describe):
2024-04-02 11:39:01 +08:00
KevinHuSh
fd7fcb5baf
apply pep8 formalize ( #155 )
2024-03-27 11:33:46 +08:00
KevinHuSh
da21320b88
fix plainPdf bugs ( #152 )
2024-03-26 15:11:07 +08:00
KevinHuSh
71fe314955
refine page ranges ( #147 )
2024-03-25 13:11:57 +08:00
KevinHuSh
f6aee7f230
add use layout or not option ( #145 )
...
* add use layout or not option
* trival
2024-03-22 19:21:09 +08:00
KevinHuSh
6c6b144de2
refine manual parser ( #140 )
2024-03-21 18:17:32 +08:00
KevinHuSh
5875c8ba08
Add 'One' chunk method ( #137 )
2024-03-20 18:57:22 +08:00
KevinHuSh
6999598101
refine for English corpus ( #135 )
2024-03-20 16:56:16 +08:00
KevinHuSh
0feb085c88
refine table parser ( #120 )
2024-03-12 18:56:04 +08:00
KevinHuSh
f1f09df901
add local llm implementation ( #119 )
2024-03-12 11:57:08 +08:00
KevinHuSh
8a57f2afd5
change callback strategy, add timezone to docker ( #96 )
2024-03-05 12:08:41 +08:00
KevinHuSh
7bfaf0df29
fix position extraction bug ( #93 )
...
* fix position extraction bug
* remove delimiter for naive parser
2024-03-04 17:08:35 +08:00
KevinHuSh
8a726fb04b
solve task execution issues ( #90 )
2024-03-01 19:48:01 +08:00
KevinHuSh
d32322c081
rename vision, add layour and tsr recognizer ( #70 )
...
* rename vision, add layour and tsr recognizer
* trivial fixing
2024-02-22 19:11:37 +08:00
KevinHuSh
cacd36c5e1
use onnx models, new deepdoc ( #68 )
2024-02-21 16:32:38 +08:00
KevinHuSh
c5ea37cd30
Add resume parser and fix bugs ( #59 )
...
* Update .gitignore
* Update .gitignore
* Add resume parser and fix bugs
2024-02-07 19:27:23 +08:00
KevinHuSh
407b2523b6
remove unused codes, seperate layout detection out as a new api. Add new rag methed 'table' ( #55 )
2024-02-05 18:08:17 +08:00
KevinHuSh
e6acaf6738
Add Q&A and Book, fix task running bugs ( #50 )
2024-02-01 18:53:56 +08:00
KevinHuSh
6224edcd1b
Add task moduel, and pipline the task and every parser ( #49 )
2024-01-31 19:57:45 +08:00