Models

Every model this key can call, with metadata on context length, streaming, tool use, and hardware requirements.

Package availability

Wordcab SDKs, CLI tools, Helm charts, model weights, and deployment packages are delivered directly to each customer for self-hosted installation. They are not publicly published package-manager artifacts, so install commands in these docs are placeholders until your Wordcab team provides your private package source or offline bundle.

List models

GET/v1/models

Returns everything this key can call. Fields per model:

json

{
  "id": "qwen3.5-4b",
  "object": "model",
  "type": "llm",
  "context_length": 131072,
  "streaming": true,
  "tool_use": true,
  "modalities": ["text"],
  "requirements": {"min_gpu": "L40S", "vram_gb": 22},
  "license": "Apache-2.0"
}

Types

llm — chat completions.
stt — transcription (batch + streaming).
tts — speech generation.
embedding — embedding models.
diarization — speaker segmentation models.

Retrieve a single model

GET/v1/models/{model_id}

← Previous

Voices

Test suites