Ademílson Tonato
|
d0a21086bd
|
refactor: Update Firecrawl API parameters and default settings (#13082)
|
2025-01-29 11:21:05 +08:00 |
|
Ademílson Tonato
|
6024d8a42d
|
refactor: Update Firecrawl to use v1 API (#12574)
Co-authored-by: Ademílson Tonato <ademilson.tonato@refurbed.com>
|
2025-01-23 11:14:48 +08:00 |
|
huangzhuo1949
|
4c3076f2a4
|
feat: add pg vector index (#12338)
Co-authored-by: huangzhuo <huangzhuo1@xiaomi.com>
|
2025-01-22 17:07:18 +08:00 |
|
Bowen Liang
|
166221d784
|
chore(lint): fix quotes for f-string formatting by bumping ruff to 0.9.x (#12702)
|
2025-01-21 10:12:29 +08:00 |
|
yihong
|
4e101604c3
|
fix: ruff check for True if ... else (#12576)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2025-01-13 09:38:48 +08:00 |
|
CN-P5
|
cd257b91c5
|
Fix pandas indexing method for knowledge base imports (#12637) (#12638)
Co-authored-by: CN-P5 <heibai2006@qq.com>
|
2025-01-13 09:06:59 +08:00 |
|
YoungLH
|
040a3b782c
|
FEAT: support milvus to full text search (#11430)
Signed-off-by: YoungLH <974840768@qq.com>
|
2025-01-08 17:39:53 +08:00 |
|
Yingchun Lai
|
53bb37b749
|
fix: fix the incorrect plaintext file key when saving (#10429)
|
2025-01-08 12:52:45 +08:00 |
|
Hiroshi Fujita
|
d2586278d6
|
Feat elasticsearch japanese (#12194)
|
2025-01-08 12:35:41 +08:00 |
|
Jyong
|
05bda6f38d
|
add tidb on qdrant redis lock (#12462)
|
2025-01-08 08:55:44 +08:00 |
|
huangzhuo1949
|
70698024f5
|
fix: empty delete bug (#12339)
Co-authored-by: huangzhuo <huangzhuo1@xiaomi.com>
|
2025-01-03 20:46:39 +08:00 |
|
Jyong
|
b873e6349c
|
add child chunk preview number limit (#12309)
|
2025-01-03 16:14:27 +08:00 |
|
-LAN-
|
8d15c8cfbf
|
fix: improve error handling in NotionExtractor data fetching (#12182)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-29 11:53:09 +08:00 |
|
-LAN-
|
dae1b5a619
|
fix: import jieba.analyse (#12133)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-27 11:37:55 +08:00 |
|
Jyong
|
811e4bd0cf
|
fix unstructured setting (#12116)
|
2024-12-26 12:08:36 +08:00 |
|
Jyong
|
84ac004772
|
py lint (#12102)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: -LAN- <laipz8200@outlook.com>
|
2024-12-26 00:16:35 +08:00 |
|
Jyong
|
9231fdbf4c
|
Feat/support parent child chunk (#12092)
|
2024-12-25 19:49:07 +08:00 |
|
yihong
|
56e15d09a9
|
feat: mypy for all type check (#10921)
|
2024-12-24 18:38:51 +08:00 |
|
-LAN-
|
599d410d99
|
fix: validate reranking model attributes before processing (#11930)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-21 21:23:12 +08:00 |
|
-LAN-
|
8c559d6231
|
fix(retrieval_service): avoid to use exception (#11925)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-21 21:19:46 +08:00 |
|
yihong
|
7b03a0316d
|
fix: better memory usage from 800+ to 500+ (#11796)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-20 14:51:43 +08:00 |
|
yihong
|
463fbe2680
|
fix: better guard nan value from numpy for issue #11827 (#11864)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-20 09:28:32 +08:00 |
|
yihong
|
5a8a901560
|
fix: float values are not json for nan value close #11827 (#11840)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-19 20:50:20 +08:00 |
|
Jiang
|
ad17ff9a92
|
Lindorm vdb bug-fix (#11790)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
|
2024-12-18 15:19:20 +08:00 |
|
Bowen Liang
|
924b4fe742
|
test: run vdb tests on TiDB Vector with docker in CI tests (#11645)
|
2024-12-15 17:16:40 +08:00 |
|
yihong
|
22258fb0bf
|
fix: filter bug for keyword cause code can not reach (#11666)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-15 17:12:06 +08:00 |
|
yihong
|
36cb25b341
|
fix: support mdx files close #11557 (#11565)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-12 13:37:56 +08:00 |
|
Jiang
|
0d04cdc323
|
Lindorm vdb (#11574)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
|
2024-12-12 09:43:27 +08:00 |
|
Jyong
|
9b7adcd4d9
|
update tidb batch get endpoint to basic mode (#11426)
|
2024-12-06 17:06:46 +08:00 |
|
Jyong
|
d7c1f43b49
|
fix tidb full-text-search vector missed (#11337)
|
2024-12-04 16:13:23 +08:00 |
|
Jyong
|
c58d2fce89
|
roll back rerank topn setting (#11297)
|
2024-12-03 17:34:56 +08:00 |
|
yihong
|
e686f12317
|
fix: better handle error (#11265)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-03 09:15:38 +08:00 |
|
-LAN-
|
9601102885
|
fix(word_extractor): Fix type error and remove stream in ssrf_proxy (#11241)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-02 10:24:03 +08:00 |
|
Cling_o3
|
f9c2aa7689
|
feat: add retireval_top_n to config in env (#11132)
|
2024-11-30 11:14:45 +08:00 |
|
kazuya-awano
|
2d6865d421
|
Ensure consistent float type for cached embedding return values (#10185)
|
2024-11-29 09:18:41 +08:00 |
|
yihong
|
d7160ee563
|
fix: typo in upstashVector if id is always true, also fix some type hint (#11183)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-11-28 14:05:25 +08:00 |
|
-LAN-
|
9789905a1f
|
chore(*): Removes debugging print statements (#11145)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-11-26 22:03:19 +08:00 |
|
Bowen Liang
|
6c8e208ef3
|
chore: bump minimum supported Python version to 3.11 (#10386)
|
2024-11-24 13:28:46 +08:00 |
|
yihong
|
ed55de888a
|
fix: rules should not be None for in (#10977)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-11-22 23:04:20 +08:00 |
|
AkisAya
|
cb0c55daa7
|
fix weight rerank of knowledge retrieval (#10931)
|
2024-11-21 17:53:20 +08:00 |
|
yihong
|
58a9d9eb9a
|
fix: better WeightRerankRunner run logic use O(1) and delete unused code (#10849)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-11-19 20:12:13 +08:00 |
|
Zane
|
14f3d44c37
|
refactor: improve handling of leading punctuation removal (#10761)
|
2024-11-18 21:32:33 +08:00 |
|
8bitpd
|
873e9720e9
|
feat: AnalyticDB vector store supports invocation via SQL. (#10802)
Co-authored-by: 璟义 <yangshangpo.ysp@alibaba-inc.com>
|
2024-11-18 19:29:54 +08:00 |
|
Bowen Liang
|
51db59622c
|
chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425)
|
2024-11-15 15:41:40 +08:00 |
|
Jyong
|
0b2d51d859
|
add the index field for elasticsearch (#10592)
|
2024-11-12 21:43:16 +08:00 |
|
-LAN-
|
a1543b7da0
|
fix(extractor): temporary file (#10543)
|
2024-11-11 17:31:27 +08:00 |
|
Leo.Wang
|
c9f785e00f
|
Feat/tools/gitlab (#10407)
|
2024-11-08 09:53:03 +08:00 |
|
Bowen Liang
|
574c4a264f
|
chore(lint): Use logging.exception instead of logging.error (#10415)
|
2024-11-07 21:13:02 +08:00 |
|
Jyong
|
1024fc623e
|
fix the ssrf of docx file extractor external images (#10237)
|
2024-11-04 15:22:07 +08:00 |
|
Jiang
|
0c9e79cd67
|
Add Lindorm as a VDB choice (#10202)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
|
2024-11-04 09:10:26 +08:00 |
|