Commit Graph

  • 33e14e5404
    feat: extract text from pdf (#70) Yanlong Wang 2024-05-30 20:21:33 +08:00
  • 7c5712363c
    feat: allow custom rate limit per uid yanlong.wang 2024-05-23 15:36:09 +08:00
  • 8eee95119d
    feat: index brief in JSON format yanlong.wang 2024-05-23 12:06:07 +08:00
  • 1c944562f7
    Update README.md Han Xiao 2024-05-21 18:49:08 +02:00
  • b7e5935e83
    Update README.md Han Xiao 2024-05-21 18:48:34 +02:00
  • 4f37de24f6
    fix: docs yanlong.wang 2024-05-21 17:35:16 +08:00
  • a8e0628460
    feat: links and images summary (#63) Yanlong Wang 2024-05-21 17:34:19 +08:00
  • df71c9a534
    fix: stop using pool Yanlong Wang 2024-05-20 01:12:22 +08:00
  • ba8ab88811
    Update README.md Han Xiao 2024-05-17 22:07:57 +02:00
  • 4077fa7040
    fix: geoip encoding Yanlong Wang 2024-05-17 09:31:22 +08:00
  • 2941be6096
    fix: potential unencoded query Yanlong Wang 2024-05-17 09:15:37 +08:00
  • ed9e9f43cf
    fix: block rough requests Yanlong Wang 2024-05-16 20:22:26 +08:00
  • 8ec8c1e718
    fix: logging for search error yanlong.wang 2024-05-16 19:01:30 +08:00
  • e0e37ad4d7
    fix: potential chargeAmount mismatch yanlong.wang 2024-05-16 18:43:41 +08:00
  • 8b0916f858
    fix: race condition while logging chargeAmount yanlong.wang 2024-05-16 18:26:18 +08:00
  • 6f4819bc49
    chore: tweak deployment yanlong.wang 2024-05-16 17:46:53 +08:00
  • 322cb86f21
    fix: on no results yanlong.wang 2024-05-16 17:30:47 +08:00
  • e2698b48bd
    fix: rate limit tag for search yanlong.wang 2024-05-16 16:58:10 +08:00
  • 72e1c46a6c
    fix: improve search responsiveness yanlong.wang 2024-05-16 15:47:49 +08:00
  • e100b257f4
    fix: typo in readme Yanlong Wang 2024-05-16 09:49:32 +08:00
  • 0583645613
    fix: noCache in search Yanlong Wang 2024-05-16 00:42:30 +08:00
  • 4556954d17
    fix: image url Yanlong Wang 2024-05-16 00:39:24 +08:00
  • 6f65083f8d
    feat: control cache tolerance and select target using headers Yanlong Wang 2024-05-16 00:10:13 +08:00
  • 029f568c38
    Update README.md Han Xiao 2024-05-15 17:37:15 +02:00
  • ffaf621b18
    Update README.md Han Xiao 2024-05-15 12:54:47 +02:00
  • 361f81effc
    Update README.md Han Xiao 2024-05-15 12:41:55 +02:00
  • 77fc500f41
    fix: allow x-return-format header alias yanlong.wang 2024-05-15 12:24:46 +08:00
  • 445624c405
    fix: early return for search Yanlong Wang 2024-05-15 08:47:16 +08:00
  • 1cf8e83857
    fix: add cache tolerance Yanlong Wang 2024-05-15 08:06:35 +08:00
  • d100c3fc5f
    fix: search result cache save Yanlong Wang 2024-05-14 19:57:49 +08:00
  • ec4ce4fef3
    chore: update rate limits Yanlong Wang 2024-05-14 19:44:35 +08:00
  • 2e3c217479
    feat: web search (#57) Yanlong Wang 2024-05-14 19:39:43 +08:00
  • f171e54ac9
    fix: log charge amount Yanlong Wang 2024-05-14 17:25:59 +08:00
  • 04eb6b3087
    Update README.md Han Xiao 2024-05-13 14:32:01 +02:00
  • ffc6899acd
    chore: reduce resource yanlong.wang 2024-05-13 18:35:11 +08:00
  • e417cd8a53
    fix: tidyMarkdown feature in turndown rues yanlong.wang 2024-05-09 15:15:15 +08:00
  • 36bf5d96b5
    fix: remove tidyMarkdown at all Yanlong Wang 2024-05-09 11:33:56 +08:00
  • 59f807cb7c
    fix: tidyMarkdown Yanlong Wang 2024-05-09 11:32:26 +08:00
  • 6b6774f43b
    fix: tidyMarkdown Yanlong Wang 2024-05-09 11:25:51 +08:00
  • 4bee36ed4a
    fix: patch tidyMarkdown Yanlong Wang 2024-05-09 11:06:20 +08:00
  • de22127d2f
    fix: leak of crippled listeners Yanlong Wang 2024-05-08 19:51:55 +08:00
  • b7523f0b2a
    Update README.md Han Xiao 2024-05-08 13:30:28 +02:00
  • 62dc75f78e
    fix: consider image data-src and make generated alt text optional (#50) Yanlong Wang 2024-05-08 18:29:11 +08:00
  • 8cfd0d67dc
    feat: jina paywall (#49) Yanlong Wang 2024-05-08 18:25:26 +08:00
  • 2e025d10cf
    fix: the complex regexp caused node.js process to hang Yanlong Wang 2024-05-05 16:29:39 +08:00
  • fef1d0faf1
    bump: deps Yanlong Wang 2024-05-05 10:54:11 +08:00
  • e0a9025b4f
    Update README.md Han Xiao 2024-05-03 20:41:51 +02:00
  • 3bb7bd1c29
    chore: bump deps Yanlong Wang 2024-05-04 00:46:15 +08:00
  • a0d1a7234b
    chore: tweak health check Yanlong Wang 2024-05-02 08:39:54 +08:00
  • 9e02080103
    fix: error on browser crashes Yanlong Wang 2024-05-02 03:23:57 +08:00
  • 55b954ffeb
    fix: tweak health check Yanlong Wang 2024-04-30 18:56:46 +08:00
  • 528b3e5fed
    fix: add health check to detect puppeteer stall Yanlong Wang 2024-04-30 18:30:31 +08:00
  • ae29055142
    chore: tweaks Yanlong Wang 2024-04-29 20:12:11 +08:00
  • 867636d037
    fix: apply rate limit to 100qpm per IP yanlong.wang 2024-04-29 18:54:51 +08:00
  • 15606f38d7
    fix: on null element yanlong.wang 2024-04-29 17:28:07 +08:00
  • 53a4361c23
    fix: block firebase runtime intrusion yanlong.wang 2024-04-29 17:21:34 +08:00
  • 059c8aa61e
    fix: remove exposed function before cleanup yanlong.wang 2024-04-29 15:51:23 +08:00
  • bfc6d678d8
    fix: split report handler from other page preps yanlong.wang 2024-04-29 15:19:05 +08:00
  • 036f6dc776
    chore: tweak runtime config Yanlong Wang 2024-04-29 09:49:29 +08:00
  • 6ac2863e89
    bump: deps Yanlong Wang 2024-04-28 22:28:24 +08:00
  • fcd471b099
    Update README.md Han Xiao 2024-04-25 20:35:28 +02:00
  • a6a5b7c530
    fix: respond with markdown yanlong.wang 2024-04-25 18:58:42 +08:00
  • 69231ad59e
    feat: full markdown mode yanlong.wang 2024-04-25 18:21:04 +08:00
  • 0f70723d19
    Update README.md Han Xiao 2024-04-25 10:10:32 +02:00
  • adc05fe20a
    fix yanlong.wang 2024-04-25 16:09:11 +08:00
  • c2cd71d6a5
    Update README.md Han Xiao 2024-04-25 10:06:24 +02:00
  • 39a446f5e7
    fix: root content yanlong.wang 2024-04-25 15:43:17 +08:00
  • f1016649ac
    fix: firebase limit on document size causing cache failures yanlong.wang 2024-04-25 12:24:05 +08:00
  • 9dd5af0cb5
    Update README.md Han Xiao 2024-04-24 17:30:08 +02:00
  • ae788c39c5
    docs: document header usage Han Xiao 2024-04-24 17:28:55 +02:00
  • 94a72052f4
    fix: reduce frequency of screenshot if possible yanlong.wang 2024-04-24 19:43:24 +08:00
  • ae99af50aa
    Merge branch 'main' of github.com:jina-ai/url2text yanlong.wang 2024-04-24 19:21:50 +08:00
  • 230388529e
    bump: deps yanlong.wang 2024-04-24 19:21:44 +08:00
  • 7ee2c327a3
    refactor: reorganize features (#37) Yanlong Wang 2024-04-24 19:21:12 +08:00
  • e36d3b0f24
    chore(deps): bump protobufjs and firebase-admin in /backend/functions (#35) dependabot[bot] 2024-04-24 16:37:38 +08:00
  • c5bc474964
    cleanup: remove top level package lock yanlong.wang 2024-04-24 16:33:25 +08:00
  • 4b208f44b5
    fix: process not quitting on errors Yanlong Wang 2024-04-21 10:17:05 +08:00
  • 17415ed1f1
    docs: fix readme image Han Xiao 2024-04-20 23:27:42 +02:00
  • 5d255dda3b
    chore: update deps Yanlong Wang 2024-04-19 09:30:19 +08:00
  • d47310a6f7
    fix: allocating incorrect max value due to missing parentheses (#26) Charuka Samarakoon 2024-04-19 06:31:23 +05:30
  • d4ca381c38
    fix: explicitly reject non http protocols yanlong.wang 2024-04-18 15:35:06 +08:00
  • abc817e960
    feat: block media resources to improve speed yanlong.wang 2024-04-18 15:06:28 +08:00
  • cbc13ecbbd
    fix: catch turndown errors yanlong.wang 2024-04-18 13:51:54 +08:00
  • 6ee0f2de75
    docs: update streaming mode Han Xiao 2024-04-17 21:53:20 -07:00
  • 3557cba48d
    docs: update explain of streaming mode Han Xiao 2024-04-17 21:48:42 -07:00
  • 0975b35ca2
    chore: turn up concurrency a bit base on analysis yanlong.wang 2024-04-18 11:53:55 +08:00
  • a211366501
    fix: expose publishedTime if possible yanlong.wang 2024-04-17 12:36:36 +08:00
  • 6e36f0a447
    fix: url wrong normalization Yanlong Wang 2024-04-17 09:55:41 +08:00
  • 781b835466
    fix: keep url details Yanlong Wang 2024-04-17 09:48:26 +08:00
  • 11a5a90611
    fix: favor nominal url over real url Yanlong Wang 2024-04-17 09:30:49 +08:00
  • bda7e76e50
    chore: increase max instances to target 10k concurrent requests Yanlong Wang 2024-04-17 09:22:26 +08:00
  • 50ed9cc248
    feat: fallback to google archive (#16) Yanlong Wang 2024-04-17 00:17:45 +08:00
  • 8a2b095bd7
    fix: give expireAt for image cache yanlong.wang 2024-04-16 15:20:46 +08:00
  • 4f284f51b6
    docs: update readme Han Xiao 2024-04-15 21:50:34 -07:00
  • b3fb4c5c57
    feat: add image captioning (#6) Han Xiao 2024-04-15 20:51:31 -07:00
  • 18373626b2
    fix: catch parse error Han Xiao 2024-04-15 19:27:40 -07:00
  • 3134a59d8f
    chore: update readme Han Xiao 2024-04-15 17:23:16 -07:00
  • 9b190127aa
    fix: clean broken markdown Han Xiao 2024-04-13 21:40:51 -07:00
  • 7fc30dd003
    docs: explain stream mode Han Xiao 2024-04-13 19:27:10 -07:00
  • af2775d1aa
    docs: explain stream mode Han Xiao 2024-04-13 19:25:51 -07:00