3460 Commits

Author SHA1 Message Date
Nicolas
e37ab8431a Update search.ts 2025-01-02 21:07:14 -03:00
Nicolas
8b64e915b3 Update search.ts 2025-01-02 21:02:55 -03:00
Nicolas
7ce780ac81 Update search.ts 2025-01-02 20:40:38 -03:00
Nicolas
b244afbc82 Update README.md 2025-01-02 20:26:51 -03:00
Nicolas
a4b6dfecd1 Nick: v1.8.0 - added /v1/search support v1.2.0 2025-01-02 20:02:07 -03:00
Nicolas
b61a1ccfd3
Merge pull request #1032 from mendableai/nsc/v1-search
(feat/v1) Search
2025-01-02 19:58:44 -03:00
Nicolas
21bf89b6cc Update search.ts 2025-01-02 19:57:51 -03:00
Nicolas
22ae1730bd Update search.ts 2025-01-02 19:57:41 -03:00
Nicolas
a0dbf20c40 Update types.ts 2025-01-02 19:55:28 -03:00
Nicolas
25da20efd2 Nick: e2e 2025-01-02 19:53:54 -03:00
Nicolas
eae393afb5 Nick: fixed js sdk 2025-01-02 19:52:50 -03:00
Nicolas
07a6ba5d91 Nick: 2025-01-02 19:34:37 -03:00
Nicolas
35d7202894 Update search.ts 2025-01-02 19:33:21 -03:00
Nicolas
d2742bec4d Nick: v1 search 2025-01-02 19:31:03 -03:00
rafaelmmiller
ef0fc8d0d3 broader search if didnt find results 2025-01-02 18:00:18 -03:00
Nicolas
c9d91af86f Merge branch 'main' into nsc/semantic-index-extract 2025-01-02 15:26:40 -03:00
Nicolas
c822e34d37 Nick: fixed extract schema 2025-01-02 14:03:23 -03:00
Nicolas
c3fd13a82b Nick: fixed re-ranker and enabled url cache of 2hrs 2024-12-31 18:06:07 -03:00
Nicolas
07f4b714af Update removeUnwantedElements.ts 2024-12-31 15:23:02 -03:00
Nicolas
33632d2fe3 Update extraction-service.ts 2024-12-31 15:22:50 -03:00
Nicolas
27cc3dba30 Nick: rm vscode settings 2024-12-31 12:53:07 -03:00
Nicolas
bd81b41d5f Update queue-worker.ts 2024-12-30 21:43:59 -03:00
Nicolas
e6da214aeb Nick: async background index 2024-12-30 21:42:01 -03:00
Nicolas
7a31306be5 Nick: url normalization + max metadata size 2024-12-30 20:04:22 -03:00
Nicolas
bf9d41d0b2 Nick: index exploration 2024-12-30 19:37:48 -03:00
Nicolas
0847a6038e
Merge pull request #1014 from mendableai/nsc/extract-url-trace
/extract URL trace
2024-12-30 19:00:58 -03:00
Gergő Móricz
71a8f7452c fix(WebScraper/sitemap): await urlsHandler to fix race condition v1.1.1 2024-12-30 16:09:22 +01:00
Nicolas
8ae34a0d31 Nick: rm .xml from isFile 2024-12-30 11:57:01 -03:00
Gergő Móricz
9005757de3 fix(queue-worker): do not follow redirect URLs if they are not allowed by the crawl options 2024-12-30 14:41:31 +01:00
Gergő Móricz
4d1f92f4c8 fix(scrapeURL/fetch): block loopback and link-local IPs 2024-12-29 17:35:14 +01:00
Nicolas
e255301005 Update index.ts 2024-12-27 21:31:29 -03:00
Nicolas
c1fa5a44ae
Merge pull request #1016 from mendableai/mog/mineru
feat(scrapeURL/pdf): switch to MU (FIR-356)
2024-12-27 21:19:48 -03:00
Nicolas
1eca61bffb Update index.ts 2024-12-27 20:59:18 -03:00
Nicolas
f9d55efba8 Update index.ts 2024-12-27 20:54:26 -03:00
Nicolas
b8d7f9f257 Nick: we are using runpod 2024-12-27 19:59:05 -03:00
Nicolas
5fcf3fa97e Merge branch 'main' into mog/mineru 2024-12-27 19:53:09 -03:00
Nicolas
a431cafa47
Merge pull request #991 from RutamBhagat/rust-sdk-conditionally-enforce-api-key
feat(rust-sdk): Make API key optional for self-hosted instances
2024-12-27 19:07:01 -03:00
Nicolas
65cf4cd74e
Merge pull request #1013 from yujunhui/main
fix: merge mock success data
2024-12-27 19:04:04 -03:00
Nicolas
05d5f84d87
Merge pull request #1018 from mendableai/feat/add-favicon-metadata
[FIR-37] feat: extract and return favicon URL during scraping
2024-12-27 17:44:03 -03:00
Nicolas
eba5fda9a1
Merge pull request #955 from mendableai/rafa/fix-default-on-schema-llm-extract
fixed optional+default bug on llm schema
2024-12-27 16:33:04 -03:00
Ademílson F. Tonato
a4cf814f70 feat: return favicon url when scraping 2024-12-27 19:18:53 +00:00
Gergő Móricz
0421f81020
Sitemap fixes (#1010)
* sitemap fixes iter 1

* feat(sitemap): dedupe improvements

---------

Co-authored-by: Nicolas <nicolascamara29@gmail.com>
2024-12-27 19:59:26 +01:00
Nicolas
6851281beb Update __init__.py 2024-12-27 15:46:00 -03:00
Nicolas
cd08be7f37
Merge pull request #990 from RutamBhagat/python-sdk-conditionally-enforce-api-key
feat(python-sdk): Make API key optional for self-hosted instances
2024-12-27 15:43:37 -03:00
Nicolas
c5b6495e48
Merge pull request #1015 from mendableai/nsc/improves-sitemap-fetching
Improves sitemap fetching
v1.1.0
2024-12-27 14:41:04 -03:00
Nicolas
2ea0e9a241
Merge pull request #1003 from RutamBhagat/credit-usage-api-docs
docs(credit-usage-api): add new endpoint documentation for credit usage
2024-12-27 13:59:54 -03:00
Nicolas
e8f0a22ebe Update v1-openapi.json 2024-12-27 13:59:43 -03:00
Nicolas
f7cfbba651 Merge branch 'main' into pr/1003 2024-12-27 13:59:24 -03:00
Nicolas
1abb544e3e Update index.test.ts 2024-12-27 13:59:09 -03:00
Gergő Móricz
4772951313 feat(scrapeURL/fire-engine): explicitly delete job after scrape 2024-12-27 16:44:41 +01:00