Skip to main content

scrape_url

Scrape one or more URLs and return content as markdown, raw HTML, or XHR data. Credits: 1.5 per URL

Parameters

ParameterTypeRequiredDescription
urlsarrayYes1-5 URLs to scrape
use_javascriptbooleanNoRender JavaScript before scraping (default: true)
outputstringNoOutput format: markdown (default), raw (HTML), or xhr
geostringNoGeographic location for proxy routing
localestringNoLocale code (e.g., en-gb)

Request

curl -X POST https://mcp.debytes.io/mcp \
  -H "Authorization: Bearer deb_your_api_key_here" \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
      "name": "scrape_url",
      "arguments": {
        "urls": ["https://example.com/job/12345"],
        "output": "markdown"
      }
    }
  }'

Response

{
  "https://example.com/job/12345": "# Senior ML Engineer\n\nTechCorp is looking for..."
}
A dictionary mapping each URL to its scraped content. Content is truncated at 30,000 characters per URL.

Notes

  • Maximum 5 URLs per call
  • Credits are charged per URL (e.g., 3 URLs = 4.5 credits)
  • JavaScript rendering is enabled by default for dynamic pages
  • Results are always fresh (not cached)