# Microgram: kokoro-text-to-speech

`station__algovigilance__microgram-kokoro-text-to-speech` · native (always callable) · domain `algovigilance` · pv-relevance `pv-core`

Synthesize speech from text using the Kokoro neural TTS model. Returns the audio inline as base64. Default voice is af_bella, default format mp3.

## Agent metadata

- `idempotent`: true
- `read_only`: true
- `expected_latency_ms`: unknown (not yet contract-tested)
- `cost_tokens_estimate`: unknown

## Input schema

- `text` *string* (required) — Text to synthesize. Max 4096 characters.
- `voice` *string* — Voice identifier (e.g. af_bella, af_kore, af_nicole). Call list-voices for the full set of 67. Default: af_bella.
- `model` *string* — Model name. One of: kokoro, tts-1, tts-1-hd. Default: kokoro.
- `format` *string* — Audio container. One of: mp3, wav, opus, flac, aac. Default: mp3.

## Example call

```json
POST /api/mcp
Content-Type: application/json

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "station__algovigilance__microgram-kokoro-text-to-speech",
    "arguments": {
      "text": ""
    }
  }
}
```

## Related

- [/tools](/tools) — all 7718 tools
- [/tools/algovigilance__microgram-kokoro-text-to-speech](/tools/algovigilance__microgram-kokoro-text-to-speech) — HTML page
- [/tools/algovigilance__microgram-kokoro-text-to-speech/json](/tools/algovigilance__microgram-kokoro-text-to-speech/json) — JSON form (agent-friendly)
- [/api/mcp](/api/mcp) — endpoint
- [/AGENTS.md](/AGENTS.md) — agent guide