Arcana V3 Turbo API

Real-time bilingual TTS with native code-switching for production voice agents.

This model is not currently supported on Together AI.

Visit our Models page to view all the latest models.

Rime Arcana V3 supports 11 languages with native code-switching that preserves prosody and accent consistency across language boundaries. When a customer switches from French to English for a technical term and then back to French to clarify, V3 handles each transition while keeping natural cadence and emphasis, so the conversation doesn't sound stitched together. Teams can consolidate what used to require a separate model or vendor per language into a single endpoint for multilingual customers, co-located with LLM and STT workloads on Together AI's unified infrastructure.

  • 11 Languages: Native code-switching support
  • 1 Model: Consolidates multilingual infrastructure
  • 99.9% Uptime SLA: Production-ready infrastructure

Key Capabilities

  • ✓ Multilingual Code-Switching: 11 languages with natural transitions that preserve prosody and accent consistency across language boundaries
  • ✓ Single Model Deployment: Consolidate infrastructure that previously required separate models or vendors per language into a unified endpoint
  • ✓ Natural Prosody: Transitions between languages preserve cadence and emphasis rather than sounding mechanical or stitched together
  • ✓ Unified Infrastructure: Co-located with LLM and STT on Together AI, so you can track performance across all languages from a single dashboard

Arcana V3 Turbo API Usage

Endpoint

curl --location 'https://api.together.ai/v1/audio/generations' \
  --header 'Content-Type: application/json' \
  --header "Authorization: Bearer $TOGETHER_API_KEY" \
  --output speech.mp3 \
  --data '{
    "input": "Today is a wonderful day to build something people love!",
    "voice": "helpful woman",
    "response_format": "mp3",
    "sample_rate": 44100,
    "stream": false,
    "model": "rime-labs/rime-arcana-v3-turbo"
  }'

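from together import Together

# Reads the API key from the TOGETHER_API_KEY environment variable.
client = Together()

speech_file_path = "speech.mp3"

# Synthesize one utterance with Arcana V3 Turbo.
response = client.audio.speech.create(
  model="rime-labs/rime-arcana-v3-turbo",
  input="Today is a wonderful day to build something people love!",
  voice="helpful woman",
)

# Write the returned audio to disk.
response.stream_to_file(speech_file_path)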

import { createWriteStream } from 'fs';
import { Readable } from 'stream';
import Together from 'together-ai';

const together = new Together();

async function generateAudio() {
  // Request a single, non-streamed MP3 for the given text.
  const res = await together.audio.create({
    input: 'Today is a wonderful day to build something people love!',
    voice: 'helpful woman',
    response_format: 'mp3',
    sample_rate: 44100,
    stream: false,
    model: 'rime-labs/rime-arcana-v3-turbo',
  });

  if (res.body) {
    console.log(res.body);
    // Convert the response body to a Node stream and write it to disk.
    const nodeStream = Readable.from(res.body as ReadableStream);
    const fileStream = createWriteStream('./speech.mp3');

    nodeStream.pipe(fileStream);
  }
}

generateAudio();


How to use Arcana V3 Turbo
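
For teams that prefer not to add an SDK dependency, the curl request shown under Endpoint can be approximated with Python's standard library. This is a minimal sketch, not an official client: the URL, model ID, and JSON fields are copied from that curl example, while the function names `build_tts_request` and `synthesize_to_file` are hypothetical. The `voice` value is the page's placeholder and may need to be replaced with a voice the model actually offers.

```python
import json
import os
import urllib.request

# Endpoint and model ID copied from the curl example on this page.
TTS_URL = "https://api.together.ai/v1/audio/generations"
MODEL_ID = "rime-labs/rime-arcana-v3-turbo"


def build_tts_request(text, api_key, voice="helpful woman", sample_rate=44100):
    """Build (but do not send) the POST request for one utterance."""
    if not text.strip():
        raise ValueError("input text must not be empty")
    payload = {
        "input": text,
        "voice": voice,  # placeholder from the page; verify against the voice list
        "response_format": "mp3",
        "sample_rate": sample_rate,
        "stream": False,
        "model": MODEL_ID,
    }
    return urllib.request.Request(
        TTS_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize_to_file(text, path="speech.mp3"):
    """Send the request and write the returned audio bytes to `path`."""
    request = build_tts_request(text, os.environ["TOGETHER_API_KEY"])
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        out.write(response.read())
```

With `TOGETHER_API_KEY` set, calling `synthesize_to_file("Votre commande est en cours, le tracking number arrive par e-mail.")` would send one code-switched French and English sentence. Keeping request construction separate from the network call makes the payload easy to inspect or unit-test.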

Model details

Architecture Overview:
• High-performance TTS optimized for real-time bilingual voice agents
• Native code-switching trained on bilingual speech patterns preserving prosody across language boundaries
• ~120ms time-to-first-audio on Together AI dedicated endpoints
• Efficient concurrency enabling higher GPU utilization for high-volume deployments
• WebSocket streaming support for real-time voice applications
• Co-located with LLM and STT workloads on unified infrastructure
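
Time-to-first-audio figures like the one above are easiest to trust when measured in your own environment. The sketch below shows one way to do that for any streamed response that yields audio as byte chunks. It makes no assumption about the transport: `fake_stream` is a hypothetical stand-in for a real WebSocket or HTTP stream, and `time_to_first_audio` is an illustrative helper, not part of any SDK.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def time_to_first_audio(
    chunks: Iterable[bytes],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], int]:
    """Consume an audio stream; return (seconds to first non-empty chunk, total bytes).

    The first element is None if the stream produced no audio at all.
    """
    start = clock()
    first: Optional[float] = None
    total = 0
    for chunk in chunks:
        if chunk and first is None:
            first = clock() - start
        total += len(chunk)
    return first, total


def fake_stream(delay_s: float, n_chunks: int) -> Iterator[bytes]:
    """Stand-in for a real streaming response: waits, then yields fixed-size chunks."""
    time.sleep(delay_s)
    for _ in range(n_chunks):
        yield b"\x00" * 320


ttfa, size = time_to_first_audio(fake_stream(0.05, 3))
print(f"first audio after {ttfa * 1000:.0f} ms, {size} bytes total")
```

Start the clock immediately before the request is issued so the measurement includes connection and queueing time, not only synthesis.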

Training Methodology:
• Trained on native bilingual speech patterns including pause placement and stress shifts at language boundaries
• Prosody optimization for English-Spanish code-switching within sentences
• Performance tuning for sub-200ms time-to-first-audio in production environments
• Emphasis and cadence modeling matching natural bilingual speaker behavior

Key Capabilities:
• English-Spanish Code-Switching: Native transitions with consistent prosody when callers switch languages mid-sentence
• Real-Time Performance: ~120ms time-to-first-audio leaves headroom for full voice pipeline processing
• Production Efficiency: Higher concurrency per GPU reduces infrastructure costs for high-volume deployments
• Natural Prosody: Pauses and emphasis match how bilingual speakers actually talk, not mechanical language switching
• Infrastructure Integration: Same API, authentication, and observability as LLM and STT endpoints

Prompting Arcana V3 Turbo

Applications & Use Cases

Bilingual Voice Agents:
• Customer service agents handling English-Spanish code-switching in real-time conversations
• Contact centers in bilingual metro markets where callers naturally mix languages
• Voice assistants for bilingual communities maintaining natural speech patterns
• Automated phone systems responding to code-switched queries without latency spikes
• Reduces transfers to human agents by handling natural language mixing

Regulated Services in Bilingual Markets:
• Banking and financial services serving bilingual customer bases
• Healthcare providers handling mixed-language symptom descriptions
• Government services in bilingual jurisdictions maintaining accessibility
• Insurance claims processing with natural code-switching support
• Single compliance review covering LLM, STT, and TTS on unified infrastructure

High-Volume Contact Centers:
• Enterprise contact centers handling thousands of concurrent bilingual calls
• Customer support for brands serving English-Spanish markets at scale
• Appointment scheduling and confirmation systems in bilingual regions
• Order management and tracking for bilingual customer bases
• Efficient GPU utilization reducing total cost of ownership at production scale
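
At this volume, callers often cap the number of synthesis requests in flight so a traffic spike queues on the client instead of overwhelming the endpoint. The sketch below illustrates that pattern with `asyncio.Semaphore`. The limit of 8 and the `fake_synthesize` coroutine are hypothetical placeholders; the source does not state a per-endpoint concurrency figure.

```python
import asyncio


async def synthesize_all(texts, synthesize, max_in_flight=8):
    """Run `synthesize(text)` for every text, at most `max_in_flight` at a time.

    Results come back in the same order as `texts`.
    """
    gate = asyncio.Semaphore(max_in_flight)

    async def bounded(text):
        async with gate:
            return await synthesize(text)

    return await asyncio.gather(*(bounded(t) for t in texts))


async def fake_synthesize(text):
    """Hypothetical stand-in for a real TTS call; returns dummy audio bytes."""
    await asyncio.sleep(0.01)
    return text.encode("utf-8")


audio = asyncio.run(
    synthesize_all(["Hola", "Thanks for calling"], fake_synthesize, max_in_flight=1)
)
print([len(a) for a in audio])  # [4, 18]
```

Swapping `fake_synthesize` for a coroutine that calls the real endpoint keeps the same ordering and back-pressure behavior.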

Real-Time Voice Applications:
• Voice agents requiring sub-700ms end-to-end latency
• Interactive voice response (IVR) systems with natural bilingual flow
• Voice assistants co-located with LLM reasoning and STT processing
• Conversational AI maintaining natural cadence across language switches
• WebSocket-based streaming for low-latency voice synthesis
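
To see what the sub-700ms target leaves for the rest of the pipeline, subtract the ~120ms time-to-first-audio quoted under Model details. The arithmetic below uses only those two figures from this page; the per-stage timings in `assumed_stages` are hypothetical and should be replaced with your own measurements.

```python
# Figures quoted on this page, in milliseconds.
END_TO_END_TARGET_MS = 700   # "sub-700ms end-to-end latency"
TTS_FIRST_AUDIO_MS = 120     # "~120ms time-to-first-audio"


def remaining_budget_ms(target_ms, tts_ms, other_stages_ms):
    """Return the slack left after TTS and every other pipeline stage."""
    return target_ms - tts_ms - sum(other_stages_ms.values())


# Hypothetical stage timings, NOT from the source; replace with measurements.
assumed_stages = {"stt_final_transcript": 150, "llm_first_token": 300, "network": 60}

headroom = END_TO_END_TARGET_MS - TTS_FIRST_AUDIO_MS
slack = remaining_budget_ms(END_TO_END_TARGET_MS, TTS_FIRST_AUDIO_MS, assumed_stages)
print(f"budget left for non-TTS stages: {headroom} ms")  # 580
print(f"slack with assumed stages: {slack} ms")           # 70
```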

Multilingual Business Operations:
• International business operations in US Hispanic markets
• Cross-border commerce serving English and Spanish speakers
• Tourism and hospitality voice agents in bilingual destinations
• Educational platforms with bilingual voice instruction
• Technical support handling code-switched terminology

Looking for production scale? Deploy on a dedicated endpoint

Deploy Arcana V3 Turbo on a dedicated endpoint with custom hardware configuration, as many instances as you need, and auto-scaling.
