# web-crawl

`station__web-autonomy__web-crawl` · external (needs EXECUTION_BACKEND_URL configured) · domain `web-autonomy` · pv-relevance `non-pv`

Crawl a website from a seed URL. Breadth-first traversal with configurable depth, page limit, and same-domain filtering. Returns title and text preview per page.

> **Note:** This tool routes through an external execution backend. If `EXECUTION_BACKEND_URL` is unset on the server, calls return JSON-RPC error `-32603 "Tool execution backend not configured"`. Tools with `backend: native` execute in-process and are always callable.

## Agent metadata

- `idempotent`: unknown
- `read_only`: unknown
- `expected_latency_ms`: unknown (not yet contract-tested)
- `cost_tokens_estimate`: unknown

## Input schema

- `max_depth` *integer*
- `max_pages` *integer*
- `same_domain_only` *boolean*
- `url` *string* (required) — Seed URL to start crawling.

## Example call

```json
POST /api/mcp
Content-Type: application/json

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "station__web-autonomy__web-crawl",
    "arguments": {
      "url": ""
    }
  }
}
```

## Related

- [/tools](/tools) — all 3062 tools
- [/tools/web-autonomy__web-crawl](/tools/web-autonomy__web-crawl) — HTML page
- [/tools/web-autonomy__web-crawl/json](/tools/web-autonomy__web-crawl/json) — JSON form (agent-friendly)
- [/api/mcp](/api/mcp) — endpoint
- [/AGENTS.md](/AGENTS.md) — agent guide
