Build with Mira
Mira is a family of large language models exposed through an OpenAI-compatible HTTP API. If your code already talks to OpenAI's Chat Completions endpoint, point its base URL at https://api.vmira.ai/v1 and use an sk-mira-… key — the official Python and JavaScript SDKs work unchanged.
The model line
- mira — the default. A fast model for chat, generation, light coding, classification, and most tool-using agents.
- mira-thinking — a deeper, slower model that allocates extra compute to step-by-step reasoning. Use when correctness matters more than latency.
The API shape
One canonical endpoint: POST /v1/chat/completions. Send the usual { model, messages }payload, get the OpenAI-shape response back. Streaming is server-sent events, identical to OpenAI's.
Get started
Quickstart
Your first call in under a minute — curl, Python, JavaScript, any OpenAI SDK.
API reference
Base URL, authentication, request and response shape.
Models
When to pick mira vs mira-thinking. Latency, context limits, pricing.
Streaming
Server-sent events, partial messages, and how to consume the wire format.