Cog Orpheus 3B
Discover Cog Orpheus 3B, a powerful multilingual text-to-speech model. Ready to experience the power of AI? Start your journey here!
🚀Function Overview
A multilingual text-to-speech model generating expressive speech with emotional tags and voice cloning capabilities.
Key Features
- Human-like speech with natural intonation
- Zero-shot voice cloning without prior training
- Emotion and intonation control via tags
- Low-latency streaming for real-time applications
Use Cases
- •Real-time voice streaming systems
- •Localized virtual assistants
- •Audiobook narration with emotional expression
- •Accessibility tools for speech generation
⚙️Input Parameters
text
stringText to convert to speech
voice
stringVoice to use
temperature
numberTemperature for generation
top_p
numberTop P for nucleus sampling
repetition_penalty
numberRepetition penalty
max_new_tokens
integerMaximum number of tokens to generate
💡Usage Examples
Quick Actions
Technical Specifications
- Hardware Type
- L40S
- Run Count
- 94
- Commercial Use
- Unknown/Restricted
- Platform
- Replicate
Related Keywords
Related Models
Minimax Speech-02-HD
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.
Spark TTS
A model for text-to-speech generation with voice cloning and adjustable vocal parameters.
Dia 1.6B
Dia 1.6B by Nari Labs, Generates realistic dialogue audio from text, including non-verbal cues and voice cloning