1213 Commits

Author SHA1 Message Date
rafaelsideguide
2d3d7c827a fix/added unkwown status to job filter 2024-10-11 15:40:29 -03:00
Rafael Miller
ca51521625
Merge pull request #761 from mendableai/fix/filter-status-unknown-jobs
[BUG] filters failed and unknown jobs now
2024-10-11 15:36:16 -03:00
Nicolas
0bff5b1a24 Update auth.ts 2024-10-11 15:29:25 -03:00
Nicolas
257a951132 Update auth.ts 2024-10-11 14:21:04 -03:00
rafaelsideguide
c1f98d0371 fixed developer.notion special case 2024-10-11 10:54:59 -03:00
rafaelsideguide
8cbd94ed2d fix/filters failed and unknown jobs now 2024-10-11 09:45:51 -03:00
rafaelsideguide
f113222829 fix: removing test teams concurrency limit 2024-10-10 09:46:25 -03:00
Nicolas
d410804348
Merge pull request #755 from busaud/main
bugfix: self-host crawling doesnt respect limit
2024-10-09 22:56:44 -03:00
Nicolas
abb5ec7439 Update playwright.ts 2024-10-09 22:55:01 -03:00
Nicolas
f6ec45f046
Merge pull request #747 from Harsh0707005/timeout-parameter-not-passed
Fixed Issue #734
2024-10-09 22:53:26 -03:00
Nicolas
222a34cae8
Update playwright.ts 2024-10-09 22:53:03 -03:00
busaud
c6ebbc6f6a bugfix: self-host crawling doesnt respect limit 2024-10-09 22:52:49 +00:00
Nicolas
52ec43aac3 Update index.ts 2024-10-09 19:42:25 -03:00
Nicolas
5ff6c64d77 Update index.ts 2024-10-09 19:30:14 -03:00
Gergő Móricz
17d0ed061e push 2024-10-09 23:13:26 +02:00
Gergő Móricz
b2ae1a52d5 fix(Dockerfile): remove chromium 2024-10-09 23:13:13 +02:00
busaud
237442fabb Make sure the entrypoint script has the correct line endings 2024-10-09 20:58:37 +02:00
rafaelsideguide
ae464ada60 tests: teamIds 2024-10-09 15:06:29 -03:00
Nicolas
1cd49a0a95 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-10-09 14:41:25 -03:00
Nicolas
064ce482c2 Update blocklist.ts 2024-10-09 14:41:23 -03:00
rafaelsideguide
4020a7d781 test: added test suite tokens 2024-10-08 15:11:08 -03:00
Harsh Master
aa3d4b8d6c
Fixed Issue #734 2024-10-08 11:36:12 +05:30
Nicolas
5c0c952a27 Update website_params.ts 2024-10-07 14:51:05 -03:00
Nicolas
1f1afeaac4 Update system-monitor.ts 2024-10-04 15:15:04 -03:00
Nicolas
dba96998e3 Update fetch.ts 2024-10-03 18:56:51 -03:00
Nicolas
668ff3c71b Update fetch.ts 2024-10-03 18:55:39 -03:00
Nicolas
25dd16bf2a Nick: removed 401 2024-10-03 18:52:17 -03:00
Nicolas
93657f6a44 Update queue-worker.ts 2024-10-03 18:44:40 -03:00
Thomas Kosmas
28b64fc704 Change the gracefull shutdown signal 2024-10-04 00:40:09 +03:00
Nicolas
497ac3328b
Merge pull request #732 from mendableai/fix/url-validation-params
[BUG] Fixed URLs with params
2024-10-03 17:43:37 -03:00
rafaelsideguide
cfd776a5de fix: now urls with params are passing validation
example: https://www.granitecreek.com?asljhda=akjshd
2024-10-03 17:37:04 -03:00
Nicolas
c6a29efbed Update crawl-status.ts 2024-10-03 17:33:38 -03:00
Nicolas
ddd774ed68 Nick: 2024-10-03 17:20:57 -03:00
Nicolas
82551bb6bc Update index.test.ts 2024-10-03 17:13:30 -03:00
Nicolas
49bd95327e Update types.ts 2024-10-03 17:00:33 -03:00
Nicolas
1a1ac9fd60 Nick: 2024-10-03 16:37:58 -03:00
Nicolas
a150aa820c Nick: shouldnt fallback on a 400 + error code should be correct on page status code 2024-10-03 15:21:42 -03:00
Gergő Móricz
26771e2e71 debug(zod): log unsupported protocol errors 2024-10-01 22:13:28 +02:00
Nicolas
d1b838322d
Merge pull request #721 from mendableai/feat/concurrency-limit
Concurrency limits
2024-10-01 16:15:05 -03:00
Nicolas
ac5e1fc194 Update sitemap.ts 2024-10-01 16:14:43 -03:00
Nicolas
c6717fecaa Nick: got rid of job interval sleep and math.min 2024-10-01 16:11:12 -03:00
Nicolas
18f9cd09e1 Nick: fixed more stuff 2024-10-01 16:04:39 -03:00
Gergő Móricz
fe721fffbe fix(crawl-redis): normalize URL before locking 2024-10-01 20:59:50 +02:00
Nicolas
c0541cc990 Update queue-worker.ts 2024-10-01 15:38:24 -03:00
Nicolas
37299fc035 Update types.ts 2024-10-01 15:18:11 -03:00
Nicolas
8aa07afb6d Nick: fixes 2024-10-01 15:15:49 -03:00
Nicolas
92dbd33e57 Update queue-worker.ts 2024-10-01 14:53:26 -03:00
Nicolas
4d5477f357 Nick: resolved conflicts 2024-10-01 14:39:57 -03:00
Nicolas
96245e387d Update crawl.ts 2024-10-01 14:29:53 -03:00
Nicolas
258c67ce67 Revert "feat(queue-worker): always crawl links from content even if sitemapped"
This reverts commit 3c045c43a446bb7895892338c881cd7bc4f77cbf.
2024-10-01 14:20:23 -03:00