337 Commits

Author SHA1 Message Date
Darlewo
8cb3b4aef2
fix: multiple retrieve reranking_enabled switch (#19958) 2025-05-20 15:22:03 +08:00
Amir Mohsen Asaran
c9ee60e197
Feat(WaterCrawl error handling): add custom exceptions and error handling (#19948) 2025-05-20 10:25:16 +08:00
-LAN-
4977bb21ec
feat(workflow): domain model for workflow node execution (#19430)
Signed-off-by: -LAN- <laipz8200@outlook.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-05-17 00:56:16 +08:00
k-kayashima
b292990075
Fix: Ensure unique index names for pgvector knowledge tables (#19672)
Co-authored-by: crazywoola <427733928@qq.com>
2025-05-15 11:43:44 +08:00
非法操作
085bd1aa93
chore: model.query change to db.session.query (#19551)
Co-authored-by: QuantumGhost <obelisk.reg+git@gmail.com>
2025-05-13 09:13:12 +08:00
非法操作
14cd71ed0a
chore: all model.query replace to db.session.query (#19521) 2025-05-12 15:19:41 +08:00
非法操作
b00f94df64
fix: replace all dataset.Model.query to db.session.query(Model) (#19509) 2025-05-12 13:52:33 +08:00
湛露先生
1119790b02
clean rag word_extractor. (#19397)
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
2025-05-09 16:39:16 +08:00
Will
bfa652f2d0
fix: metadata filtering condition variable unassigned; fix External K… (#19208) 2025-05-07 14:52:09 +08:00
Hao Cheng
9bcf837f17
fix: use only supported operators in metadata filter system prompts (#19195) 2025-05-03 20:08:08 +08:00
Will
a212a63e6a
fix: time type metadata filtering error (#19192) 2025-05-03 20:07:37 +08:00
Bowen Liang
12c96b93d9
immediately return initialed tiktokenizer instance and remove dead code in usage of tiktokenizer (#17957) 2025-04-30 16:07:20 +08:00
QuantumGhost
bd1bbfee4b
Enhance Code Consistency Across Repository with .editorconfig (#19023) 2025-04-29 18:04:33 +08:00
Ahmad Zidan
8266815cda
feat: add AWS Managed IAM auth for OpenSearch vector DB (#18963) 2025-04-29 15:10:08 +08:00
Ethan
8b4ea01810
feat: support access milvus with token (#19034) 2025-04-29 14:52:13 +08:00
Panpan
83187b30c0
fix: fix rerank model runner usage (#19008) 2025-04-29 14:51:21 +08:00
Wesley
b62eb61400
fix depth param issue for WaterCrawl (#18839) 2025-04-27 11:04:56 +08:00
Jiang
37e2f73909
[Lindorm VDB] Add the QUERY_TIMEOUT parameter to force the search query to fail. (#18613)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
2025-04-25 09:42:58 +08:00
王晓阳
0babdffe3e
feat: support vastbase vector database (#16308) 2025-04-24 18:04:57 +08:00
Jyong
e2cb7006c4
check metadata_filtering_conditions could be None in auto mode (#18548) 2025-04-22 17:09:33 +08:00
lauding
eb1ce3dd6b
feat: support huawei cloud vector database (#16141) 2025-04-22 13:03:35 +08:00
tmuife
7b6523e54d
Update Oracle db connection library and change connection pool to single connection (#18466) 2025-04-21 17:56:57 +08:00
Rain Wang
d2e3744ca3
Switching from CONSOLE_API_URL to FILES_URL in word_extractor.py (#18249) 2025-04-18 16:05:48 +08:00
Rain Wang
83f1aeec1d
Fix ORDER BY (score, id) error in api/core/rag/datasource/vdb/analyticdb/analyticdb_vector_sql.py line 249 (#18252) 2025-04-17 14:15:05 +08:00
Rain Wang
e8d98e3d89
Add analyzer_params config for milvus vectordb (#18180) 2025-04-17 10:38:56 +08:00
Jyong
95283b4dd3
Feat/change split length method (#18097)
Co-authored-by: JzoNg <jzongcode@gmail.com>
2025-04-16 12:28:22 +08:00
YANG
d119c7d629
ignore errors when creating duplicate indexes (#18069)
Co-authored-by: 璟义 <yangshangpo.ysp@alibaba-inc.com>
2025-04-15 15:48:16 +08:00
Jasonfish
1f722cde22
fix(api): Some params were ignored when creating empty Datasets through API (#17932) 2025-04-14 10:24:01 +08:00
Yongtao Huang
5d72003ebb
Remove dead code (#17899) 2025-04-11 20:33:52 +08:00
briqt
91cfa90503
Fix external knowledge Issues: (#17685) (#17843) 2025-04-11 15:37:27 +08:00
yihong
f04d52c044
fix: autocorrect everything in api (#17859)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
2025-04-11 15:24:39 +08:00
wlleiiwang
9d20561af4
create db if not exists (#17796)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
2025-04-10 18:03:22 +08:00
Jyong
8b3be4224d
revert batch query (#17707) 2025-04-09 20:25:36 +08:00
wlleiiwang
f148f1efa2
fix: Check collection exists before drop it. (#17692)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
2025-04-09 19:14:32 +08:00
Jyong
abfcd9d3b6
fix segment query index not effect (#17704) 2025-04-09 19:09:08 +08:00
Han
f1e4d5ed6c
Fix Performance Issues: (#17083)
Co-authored-by: Wang Han <wanghan@zhejianglab.org>
2025-04-09 11:22:53 +08:00
Steven Li
abead647e2
fix: Extract docx file fails when the file contains an invalid link (#17576) 2025-04-08 13:59:33 +08:00
Amir Mohsen Asaran
f54905e685
feat: Integrate WaterCrawl.dev as a new knowledge base provider (#16396)
Co-authored-by: crazywoola <427733928@qq.com>
2025-04-07 12:43:23 +08:00
wlleiiwang
42a42a7962
FEAT: support Tencent vectordb to full text search (#16865)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
2025-04-07 09:50:03 +08:00
crazywoola
3e698074e7
Fix/17466 cannot create a knowledge base by adding files (#17470) 2025-04-06 00:03:05 +08:00
Panpan
fc3f14c0ee
fix: keep image url (#17430) 2025-04-04 15:55:48 +08:00
Perfecto
16c722d1d8
fix: move hardcoded text to language settings (#16990) (#17133) 2025-04-02 22:35:51 +08:00
Jyong
6104b91d3f
add doc support in knowledge base for unstructured (#17352) 2025-04-02 21:35:01 +08:00
Jiang
fd1e40d22e
Lindorm VDB bugfix (#17357)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
2025-04-02 21:31:59 +08:00
Jiang
ff388fe3e6
optimize lindorm vdb add_texts (#17212)
Co-authored-by: jiangzhijie <jiangzhijie.jzj@alibaba-inc.com>
2025-04-01 11:06:35 +08:00
非法操作
44f911a0a8
chore: docstring not match the function parameter (#17162) 2025-03-31 13:19:15 +08:00
jiangbo721
a1aa325ce3
Chore/code format and Repair commit_id 3254018d more deleted codes and Fix naming error ambiguity between workflow_run_id and workflow_id (#17075)
Co-authored-by: 刘江波 <jiangbo721@163.com>
2025-03-29 14:15:53 +08:00
wanttobeamaster
7f70cadacb
feat: support Tablestore vector database (#16601)
Co-authored-by: xiaozhiqing.xzq <xiaozhiqing.xzq@alibaba-inc.com>
2025-03-27 15:53:33 +08:00
wlleiiwang
a743d5dc71
feat: tencent vectordb: use grpc client and set upsert batch size (#16016)
Co-authored-by: wlleiiwang <wlleiiwang@tencent.com>
2025-03-27 12:20:16 +08:00
Jyong
30792a1e1a
install pandoc (#16825) 2025-03-26 22:34:10 +08:00