GET /v1/models
Returns a list of all models available for inference.
Example request
curl
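A minimal Python sketch of the same request, for readers who prefer it to curl. The base URL, the API key placeholder, and the bearer-token `Authorization` header are assumptions for illustration; only the `GET /v1/models` path comes from this page. The helper name `build_models_request` is hypothetical.

```python
import urllib.request

# Hypothetical placeholders: the base URL and the bearer-token scheme
# are assumptions, not values documented on this page.
BASE_URL = "https://api.example.com"
API_KEY = "YOUR_API_KEY"


def build_models_request(base_url: str = BASE_URL, api_key: str = API_KEY) -> urllib.request.Request:
    """Build (but do not send) a GET request for /v1/models."""
    url = base_url.rstrip("/") + "/v1/models"
    headers = {
        "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        "Accept": "application/json",
    }
    return urllib.request.Request(url, headers=headers, method="GET")


request = build_models_request()
print(request.get_method(), request.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send the request; the sketch stops short of that so it runs without network access or a real key.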
Example response
Response fields
| Field | Type | Description |
|---|---|---|
| id | string | The model identifier to use in API requests |
| architecture.modality | string | Input/output modality (e.g. text->text) |
| context_length | integer | Maximum context window in tokens |
| max_completion_tokens | integer | Maximum tokens the model can generate |
| pricing.prompt | string | Cost per 1M input tokens (in USD) |
| pricing.completion | string | Cost per 1M output tokens (in USD) |
| quantization | string | Quantization level (e.g. fp4, fp8) |
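
The fields above can be read as in the following sketch, which estimates the cost of a single call from the two pricing fields. Several things here are assumptions: the sample payload and all its values are made up, the top-level `data` list is a guessed envelope, and the dotted field names (`pricing.prompt`, `architecture.modality`) are taken to mean nested objects. Because the prices arrive as strings, the sketch parses them with `Decimal` rather than `float`. The helper name `estimate_cost_usd` is hypothetical.

```python
import json
from decimal import Decimal

# Made-up sample payload. The "data" envelope and every value below are
# assumptions for illustration, not an actual API response.
SAMPLE_RESPONSE = """
{
  "data": [
    {
      "id": "example-model",
      "architecture": {"modality": "text->text"},
      "context_length": 8192,
      "max_completion_tokens": 2048,
      "pricing": {"prompt": "0.50", "completion": "1.50"},
      "quantization": "fp8"
    }
  ]
}
"""

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def estimate_cost_usd(model: dict, prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one call from the per-1M-token prices."""
    prompt_price = Decimal(model["pricing"]["prompt"])
    completion_price = Decimal(model["pricing"]["completion"])
    prompt_cost = prompt_price * prompt_tokens / TOKENS_PER_PRICE_UNIT
    completion_cost = completion_price * completion_tokens / TOKENS_PER_PRICE_UNIT
    return prompt_cost + completion_cost


models = json.loads(SAMPLE_RESPONSE)["data"]
model = models[0]
cost = estimate_cost_usd(model, prompt_tokens=1000, completion_tokens=500)
print(model["id"], model["architecture"]["modality"], cost)
```

The same dictionary also carries `context_length` and `max_completion_tokens`, which a client could compare against a planned prompt size before sending a request.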