Kling Lip Sync
Kling Lip Sync provides lip synchronization for existing videos.
🚀 Function Overview
This model adds synchronized lip movements to existing videos using either audio input or text-to-speech synthesis.
Key Features
- Lip synchronization for videos
- Supports both audio files and text-to-speech input
- Supports video resolutions from 720p to 1080p
- Integrated speech rate control for text-based inputs
Use Cases
- Dubbing videos in different languages
- Creating personalized talking-head videos
- Generating synchronized promotional content
- AI-driven video content creation
⚙️ Input Parameters
video_url
string. URL of the video to lip sync. Must be an .mp4 or .mov file smaller than 100MB, 2-10 seconds long, with a resolution of 720p-1080p (720-1920px dimensions). Cannot be used with video_id.
audio_file
string. Audio file for lip sync. Must be .mp3, .wav, .m4a, or .aac and smaller than 5MB.
text
string. Text content for lip sync (if not using audio).
voice_id
string. Voice ID for speech synthesis (only used with text, not audio).
voice_speed
number. Speech rate (only used with text, not audio).
video_id
string. ID of a video generated by Kling. Cannot be used with video_url.
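The constraints above can be checked before a request is sent. The following is a minimal sketch, not part of the model's API: the helper name check_input is hypothetical, and it assumes that exactly one video source and at least one of audio_file or text is required, which the parameter descriptions imply but do not state outright. File size, duration, and resolution cannot be verified from a URL alone, so they are not checked here.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions taken from the parameter descriptions above.
VIDEO_EXTS = {".mp4", ".mov"}
AUDIO_EXTS = {".mp3", ".wav", ".m4a", ".aac"}


def _ext(url: str) -> str:
    """Lower-cased file extension of the URL path, ignoring any query string."""
    return PurePosixPath(urlparse(url).path).suffix.lower()


def check_input(params: dict) -> list[str]:
    """Hypothetical helper: return a list of problems; empty means consistent."""
    problems = []
    has_url = bool(params.get("video_url"))
    has_id = bool(params.get("video_id"))
    # Assumption: one video source is required, and the two are exclusive.
    if has_url == has_id:
        problems.append("provide exactly one of video_url or video_id")
    if has_url and _ext(params["video_url"]) not in VIDEO_EXTS:
        problems.append("video_url should point to an .mp4 or .mov file")
    audio = params.get("audio_file")
    text = params.get("text")
    # Assumption: lip sync needs either an audio file or text to synthesize.
    if not audio and not text:
        problems.append("provide audio_file or text")
    if audio and _ext(audio) not in AUDIO_EXTS:
        problems.append("audio_file should be .mp3, .wav, .m4a, or .aac")
    return problems


print(check_input({"video_id": "abc"}))
```

An empty list means the input passed these checks; it does not guarantee that the platform will accept the request.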
💡 Usage Examples
Example 1
Input Parameters
{
  "voice_id": "en_AOT",
  "video_url": "https://replicate.delivery/xezq/ipjGAn65es3cfkPhe7g3IvYQqDfbHeCfBof1ujrrMvb3rQAXKA/tmptucl_ok3.mp4",
  "audio_file": "https://replicate.delivery/pbxt/N245edsFrGTRuk6v5OWFet0nsqiahHTlSF8yRfZEbbZCxSzY/replicate-prediction-sz4ehr9vanrme0cpwnp9wr4g8c.mp3",
  "voice_speed": 1
}
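Example 1 drives the lip sync from an audio file, so by the parameter descriptions its voice_id and voice_speed values would presumably go unused. The sketch below shows how the same input might look in text-to-speech form. It rests on several assumptions: the helper names are hypothetical, the URLs are placeholders, the text is invented for illustration, and the model identifier "kwaivgi/kling-lip-sync" is a guess at the Replicate slug, since the page names only the platform.

```python
import json

# Assumed Replicate model identifier; verify it on the platform before use.
MODEL = "kwaivgi/kling-lip-sync"


def text_variant(example: dict, text: str) -> dict:
    """Hypothetical helper: copy an audio-driven input, swapping audio for text."""
    payload = {k: v for k, v in example.items() if k != "audio_file"}
    payload["text"] = text
    return payload


def run(payload: dict):
    """Submit the payload with the `replicate` client (pip install replicate)."""
    import replicate  # imported lazily so text_variant works without it
    return replicate.run(MODEL, input=payload)


example = {
    "voice_id": "en_AOT",
    "video_url": "https://example.com/clip.mp4",     # placeholder URL
    "audio_file": "https://example.com/speech.mp3",  # placeholder URL
    "voice_speed": 1,
}

payload = text_variant(example, "Hello from the lip sync model.")
print(json.dumps(payload, indent=2))

# Requires the REPLICATE_API_TOKEN environment variable to be set:
# output = run(payload)
```

In this form voice_id and voice_speed take effect, because the speech is synthesized from the text rather than read from an audio file.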
Technical Specifications
- Hardware Type: (not listed)
- Run Count: 2.0k
- Commercial Use: Unknown/Restricted
- Pricing: 0.014 per second of output video
- Platform: Replicate
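The per-second pricing makes the cost of a single run easy to estimate. This is a rough sketch under two assumptions: the output video is as long as the input (2-10 seconds), and the rate of 0.014 is in whatever currency the platform bills in, which the listing does not state. The function name estimate_cost is hypothetical.

```python
from decimal import Decimal

# Rate from the listing: 0.014 per second of output video (currency not stated).
RATE_PER_SECOND = Decimal("0.014")


def estimate_cost(output_seconds: int) -> Decimal:
    """Hypothetical helper: estimated cost of one run for a given output length."""
    if output_seconds < 0:
        raise ValueError("output_seconds must be non-negative")
    return RATE_PER_SECOND * Decimal(output_seconds)


# Assumption: output length matches the 2-10 second input range.
for seconds in (2, 5, 10):
    print(seconds, "s ->", estimate_cost(seconds))
```

Decimal is used instead of float so that small per-second rates multiply without rounding noise.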
Related Models
Luma Reframe Video
Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p
Frames to Video Merger
Convert a set of image frames (JPG or PNG) into a high-quality MP4 video. Automatically handles sorting and frame order for smooth playback.
twha Video Generation Model
A model for generating videos from text prompts and starting images.