feat: Add html to the description text of the parsing method general #336 (#2432)

### What problem does this PR solve?

feat: Add html to the description text of the parsing method general
#336

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
This commit is contained in:
balibabu 2024-09-14 15:18:34 +08:00 committed by GitHub
parent 6a0702f55f
commit deeb950e1c
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
3 changed files with 3 additions and 3 deletions

View File

@ -200,7 +200,7 @@ export default {
We assume manual has hierarchical section structure. We use the lowest section titles as pivots to slice documents. We assume manual has hierarchical section structure. We use the lowest section titles as pivots to slice documents.
So, the figures and tables in the same section will not be sliced apart, and chunk size might be large. So, the figures and tables in the same section will not be sliced apart, and chunk size might be large.
</p>`, </p>`,
naive: `<p>Supported file formats are <b>DOCX, EXCEL, PPT, IMAGE, PDF, TXT, MD, JSON, EML</b>.</p> naive: `<p>Supported file formats are <b>DOCX, EXCEL, PPT, IMAGE, PDF, TXT, MD, JSON, EML, HTML</b>.</p>
<p>This method apply the naive ways to chunk files: </p> <p>This method apply the naive ways to chunk files: </p>
<p> <p>
<li>Successive text will be sliced into pieces using vision detection model.</li> <li>Successive text will be sliced into pieces using vision detection model.</li>

View File

@ -191,7 +191,7 @@ export default {
使 使
</p>`, </p>`,
naive: `<p>支持的文件格式為<b>DOCX、EXCEL、PPT、IMAGE、PDF、TXT、MD、JSON、EML</b>。</p> naive: `<p>支持的文件格式為<b>DOCX、EXCEL、PPT、IMAGE、PDF、TXT、MD、JSON、EML、HTML</b>。</p>
<p></p> <p></p>
<p> <p>
<li>使</li> <li>使</li>

View File

@ -192,7 +192,7 @@ export default {
使 使
</p>`, </p>`,
naive: `<p>支持的文件格式为<b>DOCX、EXCEL、PPT、IMAGE、PDF、TXT、MD、JSON、EML</b>。</p> naive: `<p>支持的文件格式为<b>DOCX、EXCEL、PPT、IMAGE、PDF、TXT、MD、JSON、EML、HTML</b>。</p>
<p></p> <p></p>
<p> <p>
<li>使</li> <li>使</li>