Kling Lip Sync

Kling Lip Sync provides cutting-edge lip synchronization for your videos. Ready to experience the power of AI? Start your journey here!

Platform: Replicate

Lip SyncVideo EditingSpeech SynchronizationTalking Head

2.0k runs

0.014 per second of output video

License Check Required

🚀Function Overview

This model adds synchronized lip movements to existing videos using either audio input or text-to-speech synthesis.

Key Features

Lip synchronization for videos
Supports both audio files and text-to-speech input
Resolution support for 720p-1080p videos
Integrated speech rate control for text-based inputs

Use Cases

•Dubbing videos in different languages
•Creating personalized talking-head videos
•Generating synchronized promotional content
•AI-driven video content creation

⚙️Input Parameters

video_url

string

URL of a video for lip syncing. It can be an .mp4 or .mov file, should be less than 100MB, with a duration of 2-10 seconds, and a resolution of 720p-1080p (720-1920px dimensions). Cannot be used with video_id.

audio_file

string

Audio file for lip sync. Must be .mp3, .wav, .m4a, or .aac and less than 5MB.

text

string

Text content for lip sync (if not using audio)

voice_id

string

Voice ID for speech synthesis (if using text and not audio)

voice_speed

number

Speech rate (only used if using text and not audio)

video_id

string

ID of a video generated by Kling. Cannot be used with video_url.

💡Usage Examples

Example 1

Input Parameters

{
  "voice_id": "en_AOT",
  "video_url": "https://replicate.delivery/xezq/ipjGAn65es3cfkPhe7g3IvYQqDfbHeCfBof1ujrrMvb3rQAXKA/tmptucl_ok3.mp4",
  "audio_file": "https://replicate.delivery/pbxt/N245edsFrGTRuk6v5OWFet0nsqiahHTlSF8yRfZEbbZCxSzY/replicate-prediction-sz4ehr9vanrme0cpwnp9wr4g8c.mp3",
  "voice_speed": 1
}

Output Results

https://replicate.delivery/xezq/92rRwTYlfo0BLSIIcirPaRwPtJhN0lrl4ww79omyef38rCdpA/tmp2ni84f_5.mp4

Quick Actions

Use NowView Documentation

Technical Specifications

Hardware Type
Run Count: 2.0k
Commercial Use: Unknown/Restricted
Pricing: 0.014 per second of output video
Platform: Replicate

Related Keywords

lip synchronizationvideo dubbingtalking-head videospromotional contentAI video creationaudio inputtext-to-speech720p-1080p resolution

Related Models

Luma Reframe Video

Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p

Frames to Video Merger

Convert a set of image frames (JPG or PNG) into a high-quality MP4 video. Automatically handles sorting and frame order for smooth playback.

twha Video Generation Model

A model for generating videos from text prompts and starting images.