GetLLMs

Sonic Talking Face Animation

Sonic Talking Face Animation converts a single portrait image into a lifelike, audio-driven video.

Platform: Replicate
Talking Face Animation, Audio-Driven Video, Facial Synthesis
5.7k runs
A100 (80GB)
License Check Required

🚀 Function Overview

Generates animated portrait videos synchronized with input audio, using global audio perception to drive natural facial movements.

Key Features

  • Creates expressive facial animations with head movements
  • Synchronizes lip movements with audio input
  • Processes single portrait images and common audio formats
  • Includes automatic face detection and cropping
  • Configurable intensity, resolution, and processing steps

Use Cases

  • Avatar animation for virtual assistants and gaming
  • Content creation from static images and voiceovers
  • Accessibility tools for communication
  • Creative video projects

⚙️ Input Parameters

image

string

Input portrait image (will be cropped if face is detected).

audio

string

Input audio file (WAV, MP3, etc.) for the voice.

dynamic_scale

number

Controls movement intensity. Higher values produce more movement; lower values produce less.

min_resolution

integer

Minimum image resolution for processing. Lower values use less memory but may reduce quality.

inference_steps

integer

Number of diffusion steps. Higher values may improve quality but take longer.

keep_resolution

boolean

If true, the output video matches the original image resolution; otherwise it uses min_resolution after cropping.

seed

integer

Random seed for reproducible results. Leave blank for a random seed.
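
The parameters above can be assembled into a single input dictionary. The following is a minimal Python sketch, not the model's official client: the helper name `build_sonic_input` is hypothetical, the default values are borrowed from Example 1 on this page rather than from the model's published schema, and the sanity checks are illustrative assumptions, not documented limits.

```python
from typing import Any, Dict, Optional


def build_sonic_input(
    image: str,
    audio: str,
    dynamic_scale: float = 1.0,     # assumed default, taken from Example 1
    min_resolution: int = 512,      # assumed default, taken from Example 1
    inference_steps: int = 25,      # assumed default, taken from Example 1
    keep_resolution: bool = False,  # assumed default, taken from Example 1
    seed: Optional[int] = None,
) -> Dict[str, Any]:
    """Assemble an input dictionary using the parameter names listed above."""
    if not image or not audio:
        raise ValueError("both image and audio are required")
    # Illustrative sanity checks only; the real limits are not stated on this page.
    if min_resolution <= 0 or inference_steps <= 0:
        raise ValueError("min_resolution and inference_steps should be positive")

    payload: Dict[str, Any] = {
        "image": image,
        "audio": audio,
        "dynamic_scale": dynamic_scale,
        "min_resolution": min_resolution,
        "inference_steps": inference_steps,
        "keep_resolution": keep_resolution,
    }
    # Omit the seed entirely when it is not given, so a random one is used.
    if seed is not None:
        payload["seed"] = seed
    return payload
```

Leaving `seed` out mirrors the "leave blank" behavior described above; passing `seed=42` should make repeated runs with the same inputs reproducible.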

💡 Usage Examples

Example 1

Input Parameters

{
  "seed": 42,
  "audio": "https://raw.githubusercontent.com/jixiaozhong/Sonic/main/examples/wav/sing_female_10s.wav",
  "image": "https://raw.githubusercontent.com/jixiaozhong/Sonic/main/examples/image/hair.png",
  "dynamic_scale": 1,
  "min_resolution": 512,
  "inference_steps": 25,
  "keep_resolution": false
}

Output Results

https://replicate.delivery/yhqm/BBIQVop2woqcFNiXmkKeDwu68U5oEc5qCrmLLhp726cS1iPKA/output.mp4
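
Since the model is hosted on Replicate, Example 1 could be submitted with the Replicate Python client roughly as sketched below. This is an assumption-laden sketch: the page does not give the model's Replicate identifier, so `MODEL_REF` is a hypothetical placeholder, and `run_sonic` is an illustrative helper name. The client call itself is `replicate.run(ref, input=...)`, which needs the `replicate` package installed and a `REPLICATE_API_TOKEN` environment variable.

```python
from typing import Any, Callable, Dict, Optional

# Hypothetical placeholder: replace with the model's real "owner/name" identifier.
MODEL_REF = "owner/sonic"

# Input values copied from Example 1 above.
EXAMPLE_INPUT: Dict[str, Any] = {
    "seed": 42,
    "audio": "https://raw.githubusercontent.com/jixiaozhong/Sonic/main/examples/wav/sing_female_10s.wav",
    "image": "https://raw.githubusercontent.com/jixiaozhong/Sonic/main/examples/image/hair.png",
    "dynamic_scale": 1,
    "min_resolution": 512,
    "inference_steps": 25,
    "keep_resolution": False,
}


def run_sonic(
    model_input: Dict[str, Any],
    model_ref: str = MODEL_REF,
    runner: Optional[Callable[..., Any]] = None,
) -> Any:
    """Submit the input to Replicate and return whatever the client returns.

    `runner` exists only so the call can be stubbed out in tests; by default
    the Replicate client's `run` function is imported lazily and used.
    """
    if runner is None:
        import replicate  # pip install replicate; reads REPLICATE_API_TOKEN

        runner = replicate.run
    return runner(model_ref, input=model_input)
```

With a valid token and the real identifier in place of `MODEL_REF`, `run_sonic(EXAMPLE_INPUT)` should yield a reference to the generated MP4; the exact return type (URL string or file-like object) depends on the client version.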

Technical Specifications

Hardware Type: A100 (80GB)
Run Count: 5.7k
Commercial Use: Unknown/Restricted
Platform: Replicate

Related Keywords

Audio-Driven Video Generation, Expressive Facial Animations, Lip Synchronization, Portrait to Video, Content Creation, Virtual Assistants, Accessibility Tools