* fix(extract): construct OpenAI on demand
Fixes a hard crash when the API key is not specified in a self-hosted environment.
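A minimal sketch of the on-demand construction idea, not the actual change. `FakeClient` and `getClient` are hypothetical names; `FakeClient` stands in for an SDK client whose constructor throws when no key is set. The point is that the constructor runs on first use rather than at module load, so a missing key only fails the code path that needs it.

```typescript
// Hypothetical stand-in for an SDK client whose constructor throws without a key.
class FakeClient {
  constructor(public readonly apiKey: string) {
    if (!apiKey) {
      throw new Error("missing API key");
    }
  }
}

// Nothing is constructed at module load, so importing this file cannot crash.
let cached: FakeClient | undefined;

// Build the client the first time it is requested, then reuse it.
// The environment is passed in to keep the sketch self-contained.
function getClient(env: Record<string, string | undefined>): FakeClient {
  if (cached === undefined) {
    cached = new FakeClient(env.OPENAI_API_KEY ?? "");
  }
  return cached;
}
```

With this shape, a self-hosted server without a key still starts; only a call that reaches `getClient` raises the error.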
* fix(ci): try sleeping
* fix(ci): override host
* fix(ci): wait for server to start
* Support /extract and /crawl for self-hosted (FIR-1097) (#1137)
* Support /extract for self-hosted
Returns the job response from Redis rather than Supabase when DB auth is disabled (self-hosted mode).
* Use getJob for extract and use correct types
* fix(v1/crawl-status): only poll DB for total count if DB is enabled
* feat(snips): TEST_SUITE_SELF_HOSTED
* fix(ci/test-server-self-host): use pr trigger
* fix(scrapeURL): f-e mocking in selfhosted env
* fix(snips): do not try to eval json format on selfhost
* fix(scrapeURL): further f-e mocking
* fix(snips): don't timeout on hard fail polling
* fix(v1/extract-status): fix-up the db-agnostic impl
unfortunately had to separate the functions since the schema
was too divergent :(
* fix(snips): boost screenshot delay
* feat(ci): test with openai
* feat(ci): extract, search testing
* fix(ci): matrix
* fix(ci): bleh
* Update: fix default google search (#1174)
* fix log title
* search should always work
* asd
* fix ci
---------
Co-authored-by: Nick Roth <nlr06886@gmail.com>
Co-authored-by: William <sdustusun@gmail.com>
* fix: Quote variables in `docker-entrypoint.sh`
- Adds double quotes around variables in the docker-entrypoint.sh script to prevent word splitting and globbing, so the script behaves correctly when values contain spaces or glob characters.
* fix: Ensure worker/api deployment starts with `OPENAI_API_KEY`
* fix: Add missing `FLY_PROCESS_GROUP` env var to deployments
* fix: Correct `PLAYWRIGHT_MICROSERVICE_URL` in `firecrawl-config`
* fix: Update Docker build options for Apple Silicon compatibility
* fix: Correct `PLAYWRIGHT_MICROSERVICE_URL` in `firecrawl-config`
- Implement max actions limit (15 actions)
- Add total wait time calculation for actions
- Increase max waitFor time to 60 seconds
- Refactor scrape options schema to include validation
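The limits listed above could be checked roughly as follows. This is an illustrative sketch only: the `Action` and `ScrapeOptions` shapes, the helper names, and the assumption that `waitFor` is in milliseconds are all hypothetical; only the 15-action cap and the 60-second `waitFor` cap come from the commit text. The total wait is computed but not capped here, since the commit does not state a limit for it.

```typescript
// Hypothetical action and option shapes, for illustration only.
type Action =
  | { type: "wait"; milliseconds: number }
  | { type: "click"; selector: string };

interface ScrapeOptions {
  waitFor?: number; // assumed to be milliseconds
  actions?: Action[];
}

const MAX_ACTIONS = 15; // from the commit text
const MAX_WAIT_FOR_MS = 60000; // 60 seconds, from the commit text

// Sum waitFor plus every explicit wait action.
function totalWaitMs(opts: ScrapeOptions): number {
  const actionWait = (opts.actions ?? []).reduce(
    (sum, a) => (a.type === "wait" ? sum + a.milliseconds : sum),
    0,
  );
  return (opts.waitFor ?? 0) + actionWait;
}

// Return a list of validation errors; an empty list means the options pass.
function validateScrapeOptions(opts: ScrapeOptions): string[] {
  const errors: string[] = [];
  if ((opts.actions ?? []).length > MAX_ACTIONS) {
    errors.push(`at most ${MAX_ACTIONS} actions are allowed`);
  }
  if ((opts.waitFor ?? 0) > MAX_WAIT_FOR_MS) {
    errors.push(`waitFor must not exceed ${MAX_WAIT_FOR_MS} ms`);
  }
  return errors;
}
```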
* Update Dockerfile
Configure corepack and pnpm
* Update Dockerfile
Try using this approach: https://github.com/pnpm/pnpm/issues/9029#issuecomment-2631400936
* Add ulimits command to docker compose
* Remove ulimit from entrypoint
* Add ulimit back to docker-entrypoint.sh
Implements the proposed approach of not failing when the ulimit command cannot run in the current environment.
* Resolve signature mismatch
Resolves an issue encountered when running `docker compose up` in a clean environment on an Intel Mac. Adds the clean command to avoid using any cached data before updating apt-get and installing ca-certificates.
---------
Co-authored-by: Nick Roth <nroth@dealnews.com>
Co-authored-by: Gergő Móricz <mo.geryy@gmail.com>
* added cached scrapes to extract
* don't save if exists
* no duplicates
* experimental tag
* Update requests.http
---------
Co-authored-by: Nicolas <nicolascamara29@gmail.com>