Web Scraping & Search
Search Google, scrape websites, extract data from Google Maps, and run browsers at scale
Run Google searches programmatically and get structured results
google-search
SerpAPI
by serpapi.comMost reliable Google SERP API. Handles proxies, CAPTCHAs, and parsing. Returns structured JSON.
Example API call
GET https://serpapi.com/search.json?q=run+clubs+in+austin&api_key=YOUR_KEY&engine=google
// Response:
{
"organic_results": [
{
"title": "Austin Run Club",
"link": "https://austinrunclub.com",
"snippet": "Weekly group runs in downtown Austin..."
}
]
}Alternatives
Cheaper, simpler, good for high volume.
Scrape Google Maps for local businesses with reviews, ratings, and contact info
google-maps-scraping
Apify
by apify.comPre-built actors for Google Maps, Facebook, Instagram, Yelp. Returns structured data with name, address, phone, website, rating, reviews.
Example API call
POST https://api.apify.com/v2/acts/nwua9Gu5YrADL7ZDj/runs?token=YOUR_TOKEN
{
"searchStringsArray": [
"restaurants in Austin TX"
],
"countryCode": "us",
"maxCrawledPlacesPerSearch": 50
}
// Response:
[
{
"placeId": "ChIJ...",
"title": "Joe's BBQ",
"address": "123 Main St",
"phone": "+1-555-0100",
"totalScore": 4.5,
"reviewsCount": 234
}
]Extract content from any website as clean, LLM-ready markdown
website-scraping
FireCrawl
by firecrawl.devPurpose-built for LLM-ready web scraping. Returns clean markdown. Handles JavaScript rendering, pagination, and site maps.
Example API call
POST https://api.firecrawl.dev/v1/scrape
{
"url": "https://acmecorp.com"
}
// Response:
{
"success": true,
"data": {
"markdown": "# Acme Corp\n\nWe help businesses...",
"metadata": {
"title": "Acme Corp"
}
}
}Alternatives
Reliable, handles proxies and JS rendering. Premium proxy option.
Screenshot a website and extract text via OCR for JavaScript-heavy sites
screenshot-to-text
Image2Text
by Self-hosted (Playwright + Tesseract)The fallback scraping method that always works. Screenshot any site, run OCR. Handles JavaScript-heavy sites that resist traditional scraping.
Example API call
POST http://localhost:8000/extract/url
{
"url": "https://acmecorp.com",
"full_page": false
}
// Response:
{
"text": "Acme Corp - We help businesses scale...",
"characters": 1234,
"lines": 45
}Rent stealth cloud browsers with proxy rotation to bypass CAPTCHAs at scale
stealth-browsers
Kernel
by kernel.shStealth cloud browsers with residential proxies and fingerprint randomization. Connect Playwright via CDP. Each session looks like a different person.
Example API call
POST https://api.kernel.sh/browsers
{
"stealth": true,
"timeout": 120
}
// Response:
{
"id": "browser_abc123",
"cdp_ws_url": "wss://browsers.kernel.sh/abc123"
}