Gergő Móricz
|
129a882bcc
|
fix(scrape): give scrapes their real job id
|
2024-08-15 19:29:47 +02:00 |
|
Gergő Móricz
|
965a5817d1
|
fix(queue-worker): log jobs correctly
|
2024-08-15 19:27:15 +02:00 |
|
Gergő Móricz
|
dad9d353d9
|
use thomas's url validation
|
2024-08-15 19:19:02 +02:00 |
|
Gergő Móricz
|
e3279274f1
|
fix: make playground crawl work
|
2024-08-15 19:14:32 +02:00 |
|
Gergő Móricz
|
c5597bc722
|
fix: robots.txt laoding
|
2024-08-15 19:11:07 +02:00 |
|
Gergő Móricz
|
29f0d9ec94
|
propagate priority to fire-engine
|
2024-08-15 19:04:46 +02:00 |
|
Gergő Móricz
|
b79d3d1754
|
fix
|
2024-08-15 19:02:05 +02:00 |
|
Gergő Móricz
|
57730f6a35
|
priority changes
|
2024-08-15 18:58:07 +02:00 |
|
Gergő Móricz
|
846610681b
|
fix: fix posthog, add dummy crawl DB items
|
2024-08-15 18:55:18 +02:00 |
|
Gergő Móricz
|
8a5cad72f6
|
fix(queue-worker): variable name collision
|
2024-08-14 22:02:05 +02:00 |
|
Gergő Móricz
|
b8ec40dd72
|
fix(crawl): submit sitemapped jobs in bulk
|
2024-08-14 20:34:19 +02:00 |
|
Gergő Móricz
|
2ca1017fc3
|
fix(crawl): make request 0 of crawl jobs higher priority
|
2024-08-14 19:34:18 +02:00 |
|
Gergő Móricz
|
cfad067a63
|
fix(fly): change proxy limits
|
2024-08-14 18:52:40 +02:00 |
|
Gergő Móricz
|
a6c81f9d62
|
fix: return all data when calling webhook
|
2024-08-14 17:53:47 +02:00 |
|
Gergo Moricz
|
2e5e480cc2
|
fix(crawl): call webhooks
|
2024-08-13 22:10:17 +02:00 |
|
Gergo Moricz
|
a33596de3c
|
fix(log_job): add crawl_id
|
2024-08-13 22:03:46 +02:00 |
|
Gergo Moricz
|
9252940b52
|
fix(crawl-status): sort data
|
2024-08-13 21:55:13 +02:00 |
|
Gergo Moricz
|
8dbac0268c
|
feat: offload crawl results to the DB
|
2024-08-13 21:40:59 +02:00 |
|
Gergo Moricz
|
4bbc9db1df
|
fix: prioritize scrape jobs over crawl jobs
|
2024-08-13 21:31:34 +02:00 |
|
Gergo Moricz
|
5f2af37880
|
fix(scrape): remove scrape job from queue after the job is done
|
2024-08-13 21:26:41 +02:00 |
|
Gergo Moricz
|
2413e33359
|
fix(queue-worker): remove console.log
|
2024-08-13 21:07:36 +02:00 |
|
Gergo Moricz
|
d7549d4dc5
|
feat: remove webScraperQueue
|
2024-08-13 21:03:24 +02:00 |
|
Gergő Móricz
|
4a2c37dcf5
|
Merge branch 'main' into feat/queue-scrapes
|
2024-08-13 20:53:49 +02:00 |
|
Gergo Moricz
|
86e136beca
|
feat: crawl to scrape conversion
|
2024-08-13 20:51:43 +02:00 |
|
rafaelsideguide
|
a4be95ac27
|
fixed tests
|
2024-08-13 13:42:26 -03:00 |
|
Nicolas
|
09ca165d2e
|
Merge pull request #531 from kevinswiber/fix/respect-docker-env-file-comments
Self-host fix: Moving comments of .env.example values from end-of-line to above-line.
|
2024-08-12 16:54:56 -04:00 |
|
Nicolas
|
d06f40810c
|
Merge pull request #515 from wahpiangle/main
Update redis urls in example .env
|
2024-08-12 16:51:59 -04:00 |
|
Nicolas
|
6810338271
|
Update search.ts
|
2024-08-12 16:51:43 -04:00 |
|
Thomas Kosmas
|
98be29c963
|
Update parameters for platform.openai.com
|
2024-08-12 22:49:28 +03:00 |
|
Kevin Swiber
|
33aa5cf0de
|
Moving comments of .env.example values from end-of-line to above-line. Self-host docs suggest using .env.example as a base. However, Docker doesn't respect end-of-line comments. It sets the comment as the actual value of the variable. This fix prevents that.
|
2024-08-12 12:24:46 -07:00 |
|
Nicolas
|
74a5125185
|
Nick: removed redlock
|
2024-08-12 15:07:30 -04:00 |
|
Nicolas
|
0bd1a820ee
|
Update auth.ts
|
2024-08-12 13:42:09 -04:00 |
|
Nicolas
|
25a899eae3
|
Nick: redlock cache in auth
|
2024-08-12 13:37:47 -04:00 |
|
Rafael Miller
|
36e4b2cf49
|
Update .env.example
|
2024-08-12 10:37:00 -03:00 |
|
Quan Ming
|
a96ad4b0e2
|
Update redis url to use comment
|
2024-08-10 12:33:26 +08:00 |
|
Nicolas
|
e28c415cf4
|
Nick:
|
2024-08-09 14:07:46 -04:00 |
|
rafaelsideguide
|
0591000b64
|
bugfix includes excludes
|
2024-08-09 14:30:41 -03:00 |
|
Quan Ming
|
0221872a70
|
Update redis urls in example .env
|
2024-08-10 00:40:11 +08:00 |
|
Nicolas
|
f1f5605010
|
Update website_params.ts
|
2024-08-08 12:31:58 -04:00 |
|
Nicolas
|
b0abad07da
|
Merge pull request #496 from tak-s/improve-logging-level
Improve logs
|
2024-08-07 22:01:12 -04:00 |
|
Gergo Moricz
|
920b7f2f44
|
fix(runWebScraper): don't filter empty docs
|
2024-08-07 21:00:22 +02:00 |
|
Gergo Moricz
|
55ec96c23f
|
fix(queue-worker): bad job lock extension time
|
2024-08-07 20:24:16 +02:00 |
|
Gergo Moricz
|
ab7a35c581
|
fix(queue-worker): log lock extensions
|
2024-08-07 19:49:48 +02:00 |
|
Gergo Moricz
|
a1c2ee5aa9
|
fix: always complete job, no try
|
2024-08-07 19:39:09 +02:00 |
|
Gergo Moricz
|
191dfbd9ca
|
fix: move to completed in one place
|
2024-08-07 18:49:58 +02:00 |
|
Nicolas
|
457c082ba1
|
Nick: fixed tests
|
2024-08-07 11:08:53 -04:00 |
|
Gergő Móricz
|
5fc7fcb77c
|
Merge branch 'main' into feat/queue-scrapes
|
2024-08-07 16:35:44 +02:00 |
|
Gergo Moricz
|
fe9fdb578b
|
revert bad hotfixes
|
2024-08-07 16:34:25 +02:00 |
|
Gergo Moricz
|
b7c01dcb9b
|
fix(webScraperQueue): reduce retries to 2
|
2024-08-07 16:31:50 +02:00 |
|
Gergo Moricz
|
cdf7bad5b4
|
fix(runWebScraper): don't move to completed
|
2024-08-07 15:20:56 +02:00 |
|