mirror of
https://git.mirrors.martin98.com/https://github.com/mendableai/firecrawl
synced 2025-04-18 12:09:42 +08:00
Web Information Extractor with Llama 4 Maverick
This tool uses Llama 4 Maverick (via Together AI), SerpAPI, and Firecrawl to automatically extract structured information about companies from the web. It performs intelligent URL selection and information extraction from web content.
Features
- Automated Google search using SerpAPI
- Intelligent URL selection using Llama 4 Maverick
- Structured data extraction using Firecrawl
- Color-coded console output for better readability
Prerequisites
- Python 3.8+
- Together AI API key
- SerpAPI API key
- Firecrawl API key
Installation
- Clone the repository:
git clone <your-repo-url>
cd <your-repo-name>
- Install dependencies:
pip install -r requirements.txt
- Copy the example environment file and fill in your API keys:
cp .env.example .env
- Edit the
.env
file with your API keys:
TOGETHER_API_KEY=your_together_ai_key
SERP_API_KEY=your_serpapi_key
FIRECRAWL_API_KEY=your_firecrawl_key
Usage
Run the script:
python llama-4-maverick-extractor.py
The script will:
- Prompt you for a company name
- Ask what information you want to extract
- Search for relevant URLs
- Extract and structure the requested information
- Display the results
Example
$ python llama-4-maverick-extractor.py
Enter the company name: Tesla
Enter what information you want about the company: latest electric vehicle models and their prices
Error Handling
The script includes comprehensive error handling for:
- Missing API keys
- API rate limits
- Network issues
- Invalid responses
- JSON parsing errors
License
MIT License - feel free to use and modify as needed.