This commit is contained in:
Nicolas 2025-04-18 01:54:20 -07:00
commit bc5c5a31d4

View File

@ -481,17 +481,15 @@ app = FirecrawlApp(api_key="fc-YOUR_API_KEY")
# Scrape a website:
scrape_status = app.scrape_url(
'https://firecrawl.dev',
params={'formats': ['markdown', 'html']}
formats=["markdown", "html"]
)
print(scrape_status)
# Crawl a website:
crawl_status = app.crawl_url(
'https://firecrawl.dev',
params={
'limit': 100,
'scrapeOptions': {'formats': ['markdown', 'html']}
},
limit=100,
scrapeOptions={'formats': ['markdown', 'html']},
poll_interval=30
)
print(crawl_status)
@ -502,11 +500,6 @@ print(crawl_status)
With LLM extraction, you can easily extract structured data from any URL. We support pydantic schemas to make it easier for you too. Here is how to use it:
```python
from firecrawl.firecrawl import FirecrawlApp
app = FirecrawlApp(api_key="fc-YOUR_API_KEY")
class ArticleSchema(BaseModel):
title: str
points: int
@ -514,15 +507,13 @@ class ArticleSchema(BaseModel):
commentsURL: str
class TopArticlesSchema(BaseModel):
top: List[ArticleSchema] = Field(..., max_items=5, description="Top 5 stories")
top: List[ArticleSchema] = Field(..., description="Top 5 stories")
data = app.scrape_url('https://news.ycombinator.com', {
'formats': ['json'],
'jsonOptions': {
'schema': TopArticlesSchema.model_json_schema()
}
})
print(data["json"])
json_config = ExtractConfig(schema=TopArticlesSchema.model_json_schema())
llm_extraction_result = app.scrape_url('https://news.ycombinator.com', formats=["json"], json=json_config)
print(llm_extraction_result.json)
```
## Using the Node SDK