mirror of
https://git.mirrors.martin98.com/https://github.com/infiniflow/ragflow.git
synced 2025-04-22 14:10:01 +08:00

### What problem does this PR solve? #5625 #5614 ### Type of change - [x] Documentation Update
19 lines
1.0 KiB
Plaintext
19 lines
1.0 KiB
Plaintext
---
|
|
sidebar_position: 9
|
|
slug: /accelerate_doc_indexing
|
|
---
|
|
|
|
# Accelerate indexing
|
|
import APITable from '@site/src/components/APITable';
|
|
|
|
A checklist to speed up document parsing and indexing.
|
|
|
|
---
|
|
|
|
Please note that some of your settings may consume a significant amount of time. If you often find that document parsing is time-consuming, here is a checklist to consider:
|
|
|
|
- Use GPU to reduce embedding time.
|
|
- On the configuration page of your knowledge base, switch off **Use RAPTOR to enhance retrieval**.
|
|
- Extracting knowledge graph (GraphRAG) is time-consuming.
|
|
- Disable **Auto-keyword** and **Auto-question** on the configuration page of yor knowledge base, as both depend on the LLM.
|
|
- **v0.17.0:** If your document is plain text PDF and does not require GPU-intensive processes like OCR (Optical Character Recognition), TSR (Table Structure Recognition), or DLA (Document Layout Analysis), you can choose **Naive** over **DeepDoc** or other time-consuming large model options in the **Document parser** dropdown. This will substantially reduce document parsing time. |