MMAudio V2 Video to Audio
Add high-quality, synchronized soundscapes to your videos with MMAudio V2 Video to Audio. Discover how this AI model can fit into your workflow!
🚀 Function Overview
Generates high-quality audio from video content with temporal synchronization, optimized for cost efficiency on T4 GPUs.
Key Features
- Transforms visual content into contextually appropriate audio
- Maintains temporal consistency with video events
- Adjustable audio parameters via prompts and settings
- Supports environmental sound synthesis and action-to-sound mapping
- Cost-optimized for T4 hardware
Use Cases
- Film and video post-production
- Silent film restoration
- Educational content enhancement
- Gaming and VR sound design
- Accessibility improvements for videos
⚙️ Input Parameters
- prompt (string): Text prompt for the generated audio
- negative_prompt (string): Negative prompt to avoid certain sounds
- video (string): Optional video file for video-to-audio generation
- duration (number): Duration of the output in seconds
- num_steps (integer): Number of inference steps
- cfg_strength (number): Guidance strength (CFG)
- seed (integer): Random seed; use -1 or leave blank to randomize the seed
- image (string): Optional image file for image-to-audio generation (experimental)
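As a rough sketch of how these parameters map onto an API call, the snippet below uses the Replicate Python client. The model identifier and all input values here are illustrative placeholders, not values taken from this listing; substitute the actual owner/name (and version hash) shown on the model page, and set REPLICATE_API_TOKEN in your environment so the client can authenticate.

```python
import replicate

# Placeholder identifier: replace with the real owner/name(:version)
# from the MMAudio V2 listing on Replicate.
MODEL = "owner/mmaudio-v2"

inputs = {
    "prompt": "gentle rain on a tin roof",   # text prompt for the generated audio
    "negative_prompt": "music",              # sounds to steer away from
    "duration": 8,                           # output length in seconds
    "num_steps": 25,                         # number of inference steps
    "cfg_strength": 4.5,                     # classifier-free guidance strength
    "seed": -1,                              # -1 or omitted randomizes the seed
    # "video": "https://example.com/clip.mp4",   # optional: video-to-audio
    # "image": "https://example.com/frame.png",  # optional: experimental image-to-audio
}

output = replicate.run(MODEL, input=inputs)
print(output)  # typically a URL or file handle for the generated audio
```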
💡 Usage Examples
Example 1
Input Parameters
{ "video": "https://huggingface.co/hkchengrex/MMAudio/resolve/main/examples/sora_kraken.mp4", "prompt": "waves, storm", "duration": 10, "num_steps": 25, "cfg_strength": 4.5, "negative_prompt": "music" }
Technical Specifications
- Hardware Type: T4
- Run Count: 391
- Commercial Use: Unknown/Restricted
- Platform: Replicate
Related Models
DrumTest2 Rhythmic Audio Transformer
Transforms any rhythmic sound—a drum kit, beatboxing, a toy drum, even drumming on your belly—into a pro-quality performance on Zohar's studio drum kit.
Speaker Diarization
Speaker Diarization with "pyannote/speaker-diarization-3.1"
Resemble Enhance AI
Enhances and cleans up audio files containing speech