Sonic Talking Face Animation
Unleash the power of Sonic Talking Face Animation to convert any portrait image into a lifelike, audio-driven video. Try it now and see the results!
🚀Function Overview
Generates animated portrait videos synchronized with input audio by analyzing global audio perception for natural facial movements.
Key Features
- Creates expressive facial animations with head movements
- Synchronizes lip movements with audio input
- Processes single portrait images and common audio formats
- Includes automatic face detection and cropping
- Configurable intensity, resolution, and processing steps
Use Cases
- •Avatar animation for virtual assistants and gaming
- •Content creation from static images and voiceovers
- •Accessibility tools for communication
- •Creative video projects
⚙️Input Parameters
image
stringInput portrait image (will be cropped if face is detected).
audio
stringInput audio file (WAV, MP3, etc.) for the voice.
dynamic_scale
numberControls movement intensity. Increase/decrease for more/less movement.
min_resolution
integerMinimum image resolution for processing. Lower values use less memory but may reduce quality.
inference_steps
integerNumber of diffusion steps. Higher values may improve quality but take longer.
keep_resolution
booleanIf true, output video matches the original image resolution. Otherwise uses the min_resolution after cropping.
seed
integerRandom seed for reproducible results. Leave blank for a random seed.
💡Usage Examples
Example 1
Input Parameters
{ "seed": 42, "audio": "https://raw.githubusercontent.com/jixiaozhong/Sonic/main/examples/wav/sing_female_10s.wav", "image": "https://raw.githubusercontent.com/jixiaozhong/Sonic/main/examples/image/hair.png", "dynamic_scale": 1, "min_resolution": 512, "inference_steps": 25, "keep_resolution": false }
Quick Actions
Technical Specifications
- Hardware Type
- A100 (80GB)
- Run Count
- 5.7k
- Commercial Use
- Unknown/Restricted
- Platform
- Replicate
Related Keywords
Related Models
Luma Reframe Video
Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p
Frames to Video Merger
Convert a set of image frames (JPG or PNG) into a high-quality MP4 video. Automatically handles sorting and frame order for smooth playback.
twha Video Generation Model
A model for generating videos from text prompts and starting images.