Firecrawl
Scrape website content
Firecrawl is a powerful web scraping and content extraction API that integrates seamlessly into Sim Studio, enabling developers to extract clean, structured content from any website. This integration provides a simple way to transform web pages into usable data formats like Markdown and HTML while preserving the essential content.
With Firecrawl in Sim Studio, you can:
- Extract clean content: Remove ads, navigation elements, and other distractions to get just the main content
- Convert to structured formats: Transform web pages into Markdown, HTML, or JSON
- Capture metadata: Extract SEO metadata, Open Graph tags, and other page information
- Handle JavaScript-heavy sites: Process content from modern web applications that rely on JavaScript
- Filter content: Focus on specific parts of a page using CSS selectors
- Process at scale: Handle high-volume scraping needs with a reliable API
The Firecrawl integration allows your agents to access and process web content programmatically without leaving the Sim Studio environment. This enables scenarios like research, content aggregation, data extraction, and information analysis from across the web. Your agents can gather information from websites, extract structured data, and use that information to make decisions or generate insights - all without having to navigate the complexities of raw HTML parsing or browser automation. Simply configure the Firecrawl block with your API key, provide the target URL, and your agents can immediately begin working with web content in a clean, structured format.
Usage Instructions
Extract content from any website with advanced web scraping capabilities and content filtering. Retrieve clean, structured data from web pages with options to focus on main content.
Tools
firecrawl_scrape
Extract structured content from web pages with comprehensive metadata support. Converts content to markdown or HTML while capturing SEO metadata, Open Graph tags, and page information.
Input
Parameter | Type | Required | Description |
---|---|---|---|
apiKey | string | Yes | Firecrawl API key |
url | string | Yes | The URL to scrape content from |
scrapeOptions | json | No | Options for content scraping |
Output
Parameter | Type |
---|---|
markdown | string |
html | string |
metadata | string |
Block Configuration
Input
Parameter | Type | Required | Description |
---|---|---|---|
apiKey | string | Yes | API Key - Enter your Firecrawl API key |
Outputs
Output | Type | Description |
---|---|---|
response | object | Output from response |
↳ markdown | string | markdown of the response |
↳ html | any | html of the response |
↳ metadata | json | metadata of the response |
Notes
- Category:
tools
- Type:
firecrawl