Wan 2.7 Text-to-Video
Text-to-video generation with native 1080p output, optional audio input, and multi-shot narrative control

About model
Wan 2.7 Text-to-Video is a dedicated generation model for creating 720P or native 1080P video directly from text prompts. It features optional audio input, multi-shot narrative control through prompt language, and output durations from 2 to 15 seconds.
- Supported generation length: 2-15s
- Supported output tiers: 720P / 1080P
- Audio: Optional audio input
- Text-to-Video: Generate 720P or 1080P video directly from text prompts
- Audio Support: Optional audio input; background audio is generated automatically when no audio is supplied
- Narrative Control: Multi-shot narrative control through prompt language
API usage
Endpoint:
Model card
Architecture Overview:
• Text-to-video (T2V) generation model
• 720P and native 1080P output
Performance Characteristics:
• Supports 2-15 second generation
Prompting
Together AI API Access:
• Access Wan 2.7 Text-to-Video via Together AI APIs using the endpoint Wan-AI/Wan2.7-t2v
• Authenticate using your Together AI API key in request headers
• Available on Together AI serverless infrastructure
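The access notes above can be sketched as a request builder. This is a minimal illustration, not the documented client: the model name `Wan-AI/Wan2.7-t2v` and the 2-15 second and 720P/1080P limits come from this page, while the URL `https://api.together.xyz/v2/videos` and the payload field names `seconds` and `resolution` are assumptions that should be checked against the Together AI API reference. The helper `build_video_request` is a hypothetical name, and it only builds the request without sending it.

```python
import json
import urllib.request

# Assumption: route for video generation; confirm in the Together AI API reference.
API_URL = "https://api.together.xyz/v2/videos"
# Endpoint name as given on this page.
MODEL = "Wan-AI/Wan2.7-t2v"

# Limits stated on this page.
ALLOWED_RESOLUTIONS = {"720P", "1080P"}
MIN_SECONDS, MAX_SECONDS = 2, 15


def build_video_request(api_key, prompt, seconds=5, resolution="1080P"):
    """Build (but do not send) a POST request for a text-to-video job."""
    if not MIN_SECONDS <= seconds <= MAX_SECONDS:
        raise ValueError(f"seconds must be between {MIN_SECONDS} and {MAX_SECONDS}")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(ALLOWED_RESOLUTIONS)}")

    payload = {
        "model": MODEL,
        "prompt": prompt,
        "seconds": seconds,        # hypothetical field name
        "resolution": resolution,  # hypothetical field name
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # The API key travels in the request headers as a bearer token.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would submit the job. Validating duration and resolution locally rejects out-of-range values before any network call is made.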
Applications & use cases
Creative Production:
• Generate 720P or native 1080P video directly from text prompts for campaign content, product videos, and creative prototyping
- Model provider: Wan-AI
- Type: Video
- Resolution/Duration: 720P, 1080P / 2-15s
- Deployment: Serverless
- Endpoint
- Price: $0.10 / video
- Input modalities: Text, Audio
- Output modalities: Video
- Category: Video
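For budgeting a batch, the listed price of $0.10 per video can be turned into a quick estimate. The sketch below assumes the price is flat per generated video regardless of resolution or duration, which this page does not confirm. The helper `estimate_cost_usd` is a hypothetical name, and the arithmetic is done in integer cents to avoid floating-point rounding.

```python
# $0.10 per video, as listed on this page, expressed in cents.
PRICE_PER_VIDEO_CENTS = 10


def estimate_cost_usd(num_videos):
    """Return the estimated cost of a batch as a dollar string."""
    if num_videos < 0:
        raise ValueError("num_videos must be non-negative")
    # Assumption: flat per-video pricing, independent of resolution and length.
    cents = num_videos * PRICE_PER_VIDEO_CENTS
    return f"${cents // 100}.{cents % 100:02d}"
```

Under that assumption, a batch of 25 videos comes to `estimate_cost_usd(25)`, which is `"$2.50"`.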