From 3ae8a87986089b98486462ff468429a48c13ad53 Mon Sep 17 00:00:00 2001 From: writinwaters <93570324+writinwaters@users.noreply.github.com> Date: Mon, 27 May 2024 14:01:52 +0800 Subject: [PATCH] Expanded list of locally deployed embedding models (#930) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Documentation Update --- README.md | 4 ++++ README_ja.md | 4 ++++ README_zh.md | 4 ++++ docs/guides/configure_knowledge_base.md | 1 + docs/references/faq.md | 13 +++++++++++++ 5 files changed, 26 insertions(+) diff --git a/README.md b/README.md index 724b699ce..4985e0a13 100644 --- a/README.md +++ b/README.md @@ -28,6 +28,10 @@ [RAGFlow](https://ragflow.io/) is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. It offers a streamlined RAG workflow for businesses of any scale, combining LLM (Large Language Models) to provide truthful question-answering capabilities, backed by well-founded citations from various complex formatted data. +## 🎮 Demo + +Visit our demo at [https://demo.ragflow.io](https://demo.ragflow.io) + ## 📌 Latest Updates - 2024-05-23 Supports [RAPTOR](https://arxiv.org/html/2401.18059v1) for better text retrieval. diff --git a/README_ja.md b/README_ja.md index c1e06912a..ff1717310 100644 --- a/README_ja.md +++ b/README_ja.md @@ -28,6 +28,10 @@ [RAGFlow](https://ragflow.io/) は、深い文書理解に基づいたオープンソースの RAG (Retrieval-Augmented Generation) エンジンである。LLM(大規模言語モデル)を組み合わせることで、様々な複雑なフォーマットのデータから根拠のある引用に裏打ちされた、信頼できる質問応答機能を実現し、あらゆる規模のビジネスに適した RAG ワークフローを提供します。 +## 🎮 Demo + +デモをお試しください:[https://demo.ragflow.io](https://demo.ragflow.io)。 + ## 📌 最新情報 - 2024-05-23 より良いテキスト検索のために[RAPTOR](https://arxiv.org/html/2401.18059v1)をサポート。 diff --git a/README_zh.md b/README_zh.md index 7ef38ee43..337ad20ce 100644 --- a/README_zh.md +++ b/README_zh.md @@ -28,6 +28,10 @@ [RAGFlow](https://ragflow.io/) 是一款基于深度文档理解构建的开源 RAG(Retrieval-Augmented Generation)引擎。RAGFlow 可以为各种规模的企业及个人提供一套精简的 RAG 工作流程,结合大语言模型(LLM)针对用户各类不同的复杂格式数据提供可靠的问答以及有理有据的引用。 +## 🎮 Demo 试用 + +请登录网址 [https://demo.ragflow.io](https://demo.ragflow.io) 试用 demo。 + ## 📌 近期更新 - 2024-05-23 实现 [RAPTOR](https://arxiv.org/html/2401.18059v1) 提供更好的文本检索。 diff --git a/docs/guides/configure_knowledge_base.md b/docs/guides/configure_knowledge_base.md index c84df0e0d..536eaca05 100644 --- a/docs/guides/configure_knowledge_base.md +++ b/docs/guides/configure_knowledge_base.md @@ -62,6 +62,7 @@ An embedding model builds vector index on file chunks. Once you have chosen an e The following embedding models can be deployed locally: +- BAAI/bge-large-zh-v1.5 - BAAI/bge-base-en-v1.5 - BAAI/bge-large-en-v1.5 - BAAI/bge-small-en-v1.5 diff --git a/docs/references/faq.md b/docs/references/faq.md index 6e5c9f1b5..5c0bbde1b 100644 --- a/docs/references/faq.md +++ b/docs/references/faq.md @@ -18,6 +18,19 @@ The "garbage in garbage out" status quo remains unchanged despite the fact that English, simplified Chinese, traditional Chinese for now. +### 3. Which embedding models can be deployed locally? + +- BAAI/bge-large-zh-v1.5 +- BAAI/bge-base-en-v1.5 +- BAAI/bge-large-en-v1.5 +- BAAI/bge-small-en-v1.5 +- BAAI/bge-small-zh-v1.5 +- jinaai/jina-embeddings-v2-base-en +- jinaai/jina-embeddings-v2-small-en +- nomic-ai/nomic-embed-text-v1.5 +- sentence-transformers/all-MiniLM-L6-v2 +- maidalun1020/bce-embedding-base_v1 + ## Performance ### 1. Why does it take longer for RAGFlow to parse a document than LangChain?