Yanlong Wang
|
f8a0dbd0ac
|
deps: ditch firebase logger
|
2025-02-27 10:36:02 +08:00 |
|
Yanlong Wang
|
690984d44a
|
serving: switching to h2c
|
2025-02-27 10:11:04 +08:00 |
|
Yanlong Wang
|
20a7d4d538
|
fix: search return amount
|
2025-02-27 09:55:10 +08:00 |
|
Sha Zhou
|
1ca29b4b0e
|
fix: update charge amount for search v2
|
2025-02-27 09:43:38 +08:00 |
|
Aaron Ji
|
fcb59591c2
|
fix: always return 10 results (#1152)
* fix: always return 10 results
* chore: get 2 extra results
|
2025-02-25 18:11:08 +08:00 |
|
Sha Zhou
|
4d48797f17
|
switch to search serper
|
2025-02-25 16:36:28 +08:00 |
|
Sha Zhou
|
abd37e5266
|
feat: search v2 (#1151)
|
2025-02-25 16:33:23 +08:00 |
|
Yanlong Wang
|
29774ac637
|
fix: scrapMany and searcher
|
2025-02-24 23:44:29 +08:00 |
|
yanlong.wang
|
380bbffc0c
|
fix: formatting in some edge cases
|
2025-02-24 13:29:56 +08:00 |
|
yanlong.wang
|
ecb1af07f9
|
cleanup
|
2025-02-24 11:38:47 +08:00 |
|
yanlong.wang
|
6335c526ec
|
fine: stop worrying about the global environment
|
2025-02-24 11:31:38 +08:00 |
|
yanlong.wang
|
a2f810cce6
|
fix: allow redefine Function.prototype.toString
|
2025-02-24 11:05:36 +08:00 |
|
yanlong.wang
|
6a1f391339
|
ci: fix mmdb download
|
2025-02-20 18:46:23 +08:00 |
|
yanlong.wang
|
7340028c64
|
fix(stand-alone-server): set ip to local context
|
2025-02-20 18:36:04 +08:00 |
|
yanlong.wang
|
9dbecbbc9f
|
fix
|
2025-02-20 17:50:12 +08:00 |
|
yanlong.wang
|
ef2f87cd7d
|
fix: serper search caching
|
2025-02-20 17:49:24 +08:00 |
|
yanlong.wang
|
7c029bf429
|
search: new branch using serper as provider
|
2025-02-20 17:32:45 +08:00 |
|
yanlong.wang
|
b44247c008
|
deps: block avif images for now
Because it causes segfaults.
See https://github.com/Brooooooklyn/canvas/issues/996
|
2025-02-19 17:52:51 +08:00 |
|
yanlong.wang
|
dc1dd7e30d
|
deps: update @napi-rs/canvas
Getting segfault in some cases, try updating.
|
2025-02-19 15:38:17 +08:00 |
|
yanlong.wang
|
7f04a65548
|
fix: curl response headers
|
2025-02-19 15:18:38 +08:00 |
|
yanlong.wang
|
d749809966
|
deps: ditch puppeteer-extra-plugin-stealth
|
2025-02-19 14:48:12 +08:00 |
|
yanlong.wang
|
b6846ab4b6
|
curl-impersonate: add zstd compression options
|
2025-02-19 14:22:19 +08:00 |
|
yanlong.wang
|
2991e300d8
|
puppeteer: tweak the ua a bit
|
2025-02-19 14:05:26 +08:00 |
|
yanlong.wang
|
cf24d84e8a
|
puppeteer: ditch puppeteer-stealth and use the real stable chrome
|
2025-02-19 13:47:33 +08:00 |
|
yanlong.wang
|
e4bc29aab8
|
fix: expect malformed url in iframes
|
2025-02-17 18:53:55 +08:00 |
|
yanlong.wang
|
92f636474d
|
style: prefer const for originalSrc
|
2025-02-17 17:43:23 +08:00 |
|
yanlong.wang
|
008dcbaf22
|
fix: image in summary
|
2025-02-17 17:41:39 +08:00 |
|
yanlong.wang
|
fc2824b115
|
fix: bump deps
|
2025-02-17 13:34:24 +08:00 |
|
yanlong.wang
|
0e8308e627
|
fix: some invalid uriComponent case
|
2025-02-17 12:28:57 +08:00 |
|
Yanlong Wang
|
05df989202
|
fix: unhandled rejection case
|
2025-02-14 17:59:18 +08:00 |
|
yanlong.wang
|
0b93e7da53
|
fix
|
2025-02-10 15:05:58 +08:00 |
|
yanlong.wang
|
f7fd8132b8
|
bump: deps
|
2025-02-10 14:13:19 +08:00 |
|
Yanlong Wang
|
033a53af30
|
fix: options handling in stand-alone script
|
2025-02-07 11:14:55 +08:00 |
|
yanlong.wang
|
0f36fe81a6
|
fix: compressed response from curl
|
2025-02-05 16:09:54 +08:00 |
|
Yanlong Wang
|
6a58de590c
|
deployment: dedicated server script for cloud-run (#1139)
* refactor: domain profile and attempt direct engine
* fix: direct engine
* fix: abuse in background phase
* fix
* wip
* use curl-impersonate in custom image
* local pdf for curl
* listen port from env
* fix
* fix
* fix
* fix: ditch http2
* cd: using gh action
* ci: token for thinapps-shared
* ci: setup node lock file path
* ci: tweak
* ci: mmdb
* ci: docker build
* fix: ci
* fix: ci
|
2025-02-05 14:50:18 +08:00 |
|
Yanlong Wang
|
a453ab5f16
|
fix: content suffix for markdown respond format
|
2025-02-04 15:59:01 +08:00 |
|
Yanlong Wang
|
cc6d2f3e29
|
fix: search params
|
2025-01-26 21:21:48 +08:00 |
|
yanlong.wang
|
234f61d066
|
remove more attrs in readerlm preprocessing
|
2025-01-20 11:54:31 +08:00 |
|
Yanlong Wang
|
140a6f86ae
|
fix: tweak readerlm
|
2025-01-17 12:24:05 +08:00 |
|
Yanlong Wang
|
f95eb027d7
|
fix: tweak readerlm parameters
|
2025-01-17 11:42:36 +08:00 |
|
yanlong.wang
|
4e5729372e
|
fix: readerlm repetition_penalty
|
2025-01-16 19:20:44 +08:00 |
|
yanlong.wang
|
3e58afc2ba
|
fix: readerlm params
|
2025-01-16 18:46:14 +08:00 |
|
yanlong.wang
|
e23d9f30a6
|
fix: base parameter
|
2025-01-16 15:37:16 +08:00 |
|
yanlong.wang
|
53821d0105
|
fix: lm and related options
|
2025-01-16 15:11:32 +08:00 |
|
yanlong.wang
|
80b9a6a5a0
|
fix: curl with errors
|
2025-01-15 19:29:59 +08:00 |
|
yanlong.wang
|
6be6051aa7
|
fix
|
2025-01-15 17:50:03 +08:00 |
|
yanlong.wang
|
06f359309e
|
feat: new lm engine
|
2025-01-15 17:38:49 +08:00 |
|
Yanlong Wang
|
51a4877933
|
feat: gemini to replace blip2 (#1129)
* feat: domain profile
* fix
* fix
* fix
* fix
* fix
* refactor: curl as direct engine
* fix
* wip
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: Sha Zhou <sha.zhou@jina.ai>
|
2025-01-15 15:03:46 +08:00 |
|
Sha Zhou
|
c19ba65391
|
update scrapping options
|
2025-01-14 15:32:50 +08:00 |
|
Sha Zhou
|
8f25fe1d45
|
fix pageshot failure
|
2025-01-13 19:25:07 +08:00 |
|