This model is not currently supported on Together AI.
Visit our Models page to view all the latest models.
Introducing Mist v2
Mist v2 is a conversational text-to-speech model designed for real-time voice applications. With low latency and natural speech patterns, it enables developers to build responsive voice interfaces for customer support, IVR systems, and conversational AI in English and Spanish.
Mist v2 API Usage
Endpoint
curl -X POST "https://api.together.xyz/v1/chat/completions" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "rime-labs/rime-mist-v2",
"messages": [
{
"role": "user",
"content": "What are some fun things to do in New York?"
}
]
}'
curl -X POST "https://api.together.xyz/v1/images/generations" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "rime-labs/rime-mist-v2",
"prompt": "Draw an anime style version of this image.",
"width": 1024,
"height": 768,
"steps": 28,
"n": 1,
"response_format": "url",
"image_url": "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png"
}'
curl -X POST https://api.together.xyz/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-d '{
"model": "rime-labs/rime-mist-v2",
"messages": [{
"role": "user",
"content": [
{"type": "text", "text": "Describe what you see in this image."},
{"type": "image_url", "image_url": {"url": "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png"}}
]
}],
"max_tokens": 512
}'
curl -X POST https://api.together.xyz/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-d '{
"model": "rime-labs/rime-mist-v2",
"messages": [{
"role": "user",
"content": "Given two binary strings `a` and `b`, return their sum as a binary string"
}]
}'
curl -X POST https://api.together.xyz/v1/rerank \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-d '{
"model": "rime-labs/rime-mist-v2",
"query": "What animals can I find near Peru?",
"documents": [
"The giant panda (Ailuropoda melanoleuca), also known as the panda bear or simply panda, is a bear species endemic to China.",
"The llama is a domesticated South American camelid, widely used as a meat and pack animal by Andean cultures since the pre-Columbian era.",
"The wild Bactrian camel (Camelus ferus) is an endangered species of camel endemic to Northwest China and southwestern Mongolia.",
"The guanaco is a camelid native to South America, closely related to the llama. Guanacos are one of two wild South American camelids; the other species is the vicuña, which lives at higher elevations."
],
"top_n": 2
}'
curl -X POST https://api.together.xyz/v1/embeddings \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"input": "Our solar system orbits the Milky Way galaxy at about 515,000 mph.",
"model": "rime-labs/rime-mist-v2"
}'
curl -X POST https://api.together.xyz/v1/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-d '{
"model": "meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8",
"prompt": "A horse is a horse",
"max_tokens": 32,
"temperature": 0.1,
"safety_model": "rime-labs/rime-mist-v2"
}'
curl --location 'https://api.together.ai/v1/audio/generations' \
--header 'Content-Type: application/json' \
--header 'Authorization: Bearer $TOGETHER_API_KEY' \
--output speech.mp3 \
--data '{
"input": "Today is a wonderful day to build something people love!",
"voice": "helpful woman",
"response_format": "mp3",
"sample_rate": 44100,
"stream": false,
"model": "rime-labs/rime-mist-v2"
}'
curl -X POST "https://api.together.xyz/v1/audio/transcriptions" \
-H "Authorization: Bearer $TOGETHER_API_KEY" \
-F "model=rime-labs/rime-mist-v2" \
-F "language=en" \
-F "response_format=json" \
-F "timestamp_granularities=segment"
curl --request POST \
--url https://api.together.xyz/v2/videos \
--header "Authorization: Bearer $TOGETHER_API_KEY" \
--header "Content-Type: application/json" \
--data '{
"model": "rime-labs/rime-mist-v2",
"prompt": "some penguins building a snowman"
}'
curl --request POST \
--url https://api.together.xyz/v2/videos \
--header "Authorization: Bearer $TOGETHER_API_KEY" \
--header "Content-Type: application/json" \
--data '{
"model": "rime-labs/rime-mist-v2",
"frame_images": [{"input_image": "https://cdn.pixabay.com/photo/2020/05/20/08/27/cat-5195431_1280.jpg"}]
}'
from together import Together
client = Together()
response = client.chat.completions.create(
model="rime-labs/rime-mist-v2",
messages=[
{
"role": "user",
"content": "What are some fun things to do in New York?"
}
]
)
print(response.choices[0].message.content)
from together import Together
client = Together()
imageCompletion = client.images.generate(
model="rime-labs/rime-mist-v2",
width=1024,
height=768,
steps=28,
prompt="Draw an anime style version of this image.",
image_url="https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png",
)
print(imageCompletion.data[0].url)
from together import Together
client = Together()
response = client.chat.completions.create(
model="rime-labs/rime-mist-v2",
messages=[{
"role": "user",
"content": [
{"type": "text", "text": "Describe what you see in this image."},
{"type": "image_url", "image_url": {"url": "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png"}}
]
}]
)
print(response.choices[0].message.content)
from together import Together
client = Together()
response = client.chat.completions.create(
model="rime-labs/rime-mist-v2",
messages=[
{
"role": "user",
"content": "Given two binary strings `a` and `b`, return their sum as a binary string"
}
],
)
print(response.choices[0].message.content)
from together import Together
client = Together()
query = "What animals can I find near Peru?"
documents = [
"The giant panda (Ailuropoda melanoleuca), also known as the panda bear or simply panda, is a bear species endemic to China.",
"The llama is a domesticated South American camelid, widely used as a meat and pack animal by Andean cultures since the pre-Columbian era.",
"The wild Bactrian camel (Camelus ferus) is an endangered species of camel endemic to Northwest China and southwestern Mongolia.",
"The guanaco is a camelid native to South America, closely related to the llama. Guanacos are one of two wild South American camelids; the other species is the vicuña, which lives at higher elevations.",
]
response = client.rerank.create(
model="rime-labs/rime-mist-v2",
query=query,
documents=documents,
top_n=2
)
for result in response.results:
print(f"Relevance Score: {result.relevance_score}")
from together import Together
client = Together()
response = client.embeddings.create(
model = "rime-labs/rime-mist-v2",
input = "Our solar system orbits the Milky Way galaxy at about 515,000 mph"
)
from together import Together
client = Together()
response = client.completions.create(
model="meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8",
prompt="A horse is a horse",
max_tokens=32,
temperature=0.1,
safety_model="rime-labs/rime-mist-v2",
)
print(response.choices[0].text)
from together import Together
client = Together()
speech_file_path = "speech.mp3"
response = client.audio.speech.create(
model="rime-labs/rime-mist-v2",
input="Today is a wonderful day to build something people love!",
voice="helpful woman",
)
response.stream_to_file(speech_file_path)
from together import Together
client = Together()
response = client.audio.transcribe(
model="rime-labs/rime-mist-v2",
language="en",
response_format="json",
timestamp_granularities="segment"
)
print(response.text)
from together import Together
client = Together()
# Create a video generation job
job = client.videos.create(
prompt="A serene sunset over the ocean with gentle waves",
model="rime-labs/rime-mist-v2"
)
from together import Together
client = Together()
job = client.videos.create(
model="rime-labs/rime-mist-v2",
frame_images=[
{
"input_image": "https://cdn.pixabay.com/photo/2020/05/20/08/27/cat-5195431_1280.jpg",
}
]
)
import Together from 'together-ai';
const together = new Together();
const completion = await together.chat.completions.create({
model: 'rime-labs/rime-mist-v2',
messages: [
{
role: 'user',
content: 'What are some fun things to do in New York?'
}
],
});
console.log(completion.choices[0].message.content);
import Together from "together-ai";
const together = new Together();
async function main() {
const response = await together.images.create({
model: "rime-labs/rime-mist-v2",
width: 1024,
height: 1024,
steps: 28,
prompt: "Draw an anime style version of this image.",
image_url: "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png",
});
console.log(response.data[0].url);
}
main();
import Together from "together-ai";
const together = new Together();
const imageUrl = "https://huggingface.co/datasets/patrickvonplaten/random_img/resolve/main/yosemite.png";
async function main() {
const response = await together.chat.completions.create({
model: "rime-labs/rime-mist-v2",
messages: [{
role: "user",
content: [
{ type: "text", text: "Describe what you see in this image." },
{ type: "image_url", image_url: { url: imageUrl } }
]
}]
});
console.log(response.choices[0]?.message?.content);
}
main();
import Together from "together-ai";
const together = new Together();
async function main() {
const response = await together.chat.completions.create({
model: "rime-labs/rime-mist-v2",
messages: [{
role: "user",
content: "Given two binary strings `a` and `b`, return their sum as a binary string"
}]
});
console.log(response.choices[0]?.message?.content);
}
main();
import Together from "together-ai";
const together = new Together();
const query = "What animals can I find near Peru?";
const documents = [
"The giant panda (Ailuropoda melanoleuca), also known as the panda bear or simply panda, is a bear species endemic to China.",
"The llama is a domesticated South American camelid, widely used as a meat and pack animal by Andean cultures since the pre-Columbian era.",
"The wild Bactrian camel (Camelus ferus) is an endangered species of camel endemic to Northwest China and southwestern Mongolia.",
"The guanaco is a camelid native to South America, closely related to the llama. Guanacos are one of two wild South American camelids; the other species is the vicuña, which lives at higher elevations."
];
async function main() {
const response = await together.rerank.create({
model: "rime-labs/rime-mist-v2",
query: query,
documents: documents,
top_n: 2
});
for (const result of response.results) {
console.log(`Relevance Score: ${result.relevance_score}`);
}
}
main();
import Together from "together-ai";
const together = new Together();
const response = await client.embeddings.create({
model: 'rime-labs/rime-mist-v2',
input: 'Our solar system orbits the Milky Way galaxy at about 515,000 mph',
});
import Together from "together-ai";
const together = new Together();
async function main() {
const response = await together.completions.create({
model: "meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8",
prompt: "A horse is a horse",
max_tokens: 32,
temperature: 0.1,
safety_model: "rime-labs/rime-mist-v2"
});
console.log(response.choices[0]?.text);
}
main();
import Together from 'together-ai';
const together = new Together();
async function generateAudio() {
const res = await together.audio.create({
input: 'Today is a wonderful day to build something people love!',
voice: 'helpful woman',
response_format: 'mp3',
sample_rate: 44100,
stream: false,
model: 'rime-labs/rime-mist-v2',
});
if (res.body) {
console.log(res.body);
const nodeStream = Readable.from(res.body as ReadableStream);
const fileStream = createWriteStream('./speech.mp3');
nodeStream.pipe(fileStream);
}
}
generateAudio();
import Together from "together-ai";
const together = new Together();
const response = await together.audio.transcriptions.create(
model: "rime-labs/rime-mist-v2",
language: "en",
response_format: "json",
timestamp_granularities: "segment"
});
console.log(response)
import Together from "together-ai";
const together = new Together();
async function main() {
// Create a video generation job
const job = await together.videos.create({
prompt: "A serene sunset over the ocean with gentle waves",
model: "rime-labs/rime-mist-v2"
});
import Together from "together-ai";
const together = new Together();
const job = await together.videos.create({
model: "rime-labs/rime-mist-v2",
frame_images: [
{
input_image: "https://cdn.pixabay.com/photo/2020/05/20/08/27/cat-5195431_1280.jpg",
}
]
});
How to use Mist v2
Model details
Architecture Overview:
• Conversational TTS model optimized for low-latency real-time voice synthesis.
• Trained on conversational speech data with natural interaction patterns.
• Supports English and Spanish with accent and pronunciation diversity.
• Includes filler words, backchanneling, and breathing patterns for conversational realism.
Training Methodology:
• Trained on conversational speech dataset capturing natural dialogue patterns.
• Multi-lingual training for English and Spanish with authentic pronunciation.
• Optimized for fast synthesis while maintaining natural voice quality.
• Fine-tuned for controllable pronunciation of technical and brand-specific terminology.
Performance Characteristics:
• Low latency enables real-time responsiveness for live voice interactions.
• Natural Speech: Conversational voices with natural filler words and breaths
• Bilingual English and Spanish support for diverse user bases.
• Customizable pronunciation for domain-specific vocabulary and terminology.
Prompting Mist v2
Applications & Use Cases
Phone & IVR Systems:
• Building automated phone systems with natural voice for customer service.
• IVR (interactive voice response) for call centers and customer support lines.
• Appointment reminder and notification systems via phone calls.
Voice Agents:
• Conversational AI agents for e-commerce, booking, and scheduling.
• Customer support chatbots with voice output across phone and web channels.
• Virtual assistants requiring natural, responsive speech synthesis.
Real-Time Voice Applications:
• Live voice translation and interpretation services.
• Voice-enabled applications requiring immediate audio feedback.
• Accessibility tools with text-to-speech for visually impaired users.
Bilingual Services:
• Applications serving English and Spanish-speaking customers.
• Healthcare providers with multilingual patient communication systems.
• Government and public services requiring accessible language support.
Model Provider:
Rime
Type:
Audio
Variant:
Parameters:
Deployment:
Serverless
On-Demand Dedicated
Monthly Reserved
Quantization
Context length:
Resolution / Duration
Pricing:
Check pricing
Run in playground
Deploy model
Quickstart docs
Quickstart docs
On-Demand Dedicated
Monthly Reserved
Looking for production scale? Deploy on a dedicated endpoint
Deploy Mist v2 on a dedicated endpoint with custom hardware configuration, as many instances as you need, and auto-scaling.

