GetLLMs

MMAudio V2 Video to Audio

MMAudio V2 Video to Audio generates a soundtrack for a video from its visual content, optionally guided by a text prompt, and keeps the audio synchronized with on-screen events.

Platform: Replicate
Video-to-Audio Synthesis, Sound Generation, T4 GPU Optimized
391 runs
T4
License Check Required

🚀Function Overview

Generates high-quality audio from video content with temporal synchronization, optimized for cost efficiency on T4 GPUs.

Key Features

  • Transforms visual content into contextually appropriate audio
  • Maintains temporal consistency with video events
  • Adjustable audio parameters via prompts and settings
  • Supports environmental sound synthesis and action-to-sound mapping
  • Cost-optimized for T4 hardware

Use Cases

  • Film and video post-production
  • Silent film restoration
  • Educational content enhancement
  • Gaming and VR sound design
  • Accessibility improvements for videos

⚙️Input Parameters

  • prompt (string): Text prompt for the generated audio
  • negative_prompt (string): Negative prompt listing sounds to avoid
  • video (string): Optional video file for video-to-audio generation
  • duration (number): Duration of the output in seconds
  • num_steps (integer): Number of inference steps
  • cfg_strength (number): Guidance strength (CFG)
  • seed (integer): Random seed. Use -1 or leave blank to randomize the seed
  • image (string): Optional image file for image-to-audio generation (experimental)
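
The parameters above can be assembled into a request payload with a small helper. This is a minimal sketch, not part of the model's own tooling: the function name build_input and the range checks (positive duration, at least one step) are assumptions, and the listing does not state default values, so unset fields are simply omitted.

```python
from typing import Any, Dict, Optional


def build_input(
    prompt: str = "",
    negative_prompt: str = "",
    video: Optional[str] = None,
    image: Optional[str] = None,
    duration: Optional[float] = None,
    num_steps: Optional[int] = None,
    cfg_strength: Optional[float] = None,
    seed: Optional[int] = None,
) -> Dict[str, Any]:
    """Build an input dict using the parameter names from the listing.

    Hypothetical helper: the validation rules here are assumptions,
    not documented constraints of the model.
    """
    if duration is not None and duration <= 0:
        raise ValueError("duration must be a positive number of seconds")
    if num_steps is not None and num_steps < 1:
        raise ValueError("num_steps must be at least 1")

    payload: Dict[str, Any] = {}
    if prompt:
        payload["prompt"] = prompt
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    if video is not None:
        payload["video"] = video
    if image is not None:
        payload["image"] = image
    if duration is not None:
        payload["duration"] = duration
    if num_steps is not None:
        payload["num_steps"] = num_steps
    if cfg_strength is not None:
        payload["cfg_strength"] = cfg_strength
    # The listing says -1 or a blank seed both mean "randomize",
    # so both cases are treated as "leave the seed out".
    if seed is not None and seed != -1:
        payload["seed"] = seed
    return payload
```

Passing a fixed seed (any value other than -1) keeps it in the payload, which is the usual way to make a run repeatable.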

💡Usage Examples

Example 1

Input Parameters

{
  "video": "https://huggingface.co/hkchengrex/MMAudio/resolve/main/examples/sora_kraken.mp4",
  "prompt": "waves, storm",
  "duration": 10,
  "num_steps": 25,
  "cfg_strength": 4.5,
  "negative_prompt": "music"
}

Output Results

https://replicate.delivery/czjl/YiDWoy40aW5sC9T8lqKsldET8rW4DZ4vXGHhw6A7wELBdrHF/20250402_121809.mp4
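
The example above could be submitted from Python roughly as follows. This is a sketch under stated assumptions: it uses the replicate client package's replicate.run call, which reads the API token from the REPLICATE_API_TOKEN environment variable, and MODEL_REF is a hypothetical placeholder because this listing does not give the model's owner, name, or version on Replicate.

```python
# Hypothetical placeholder: replace with the real "owner/name:version"
# reference from the model's Replicate page (not given in this listing).
MODEL_REF = "owner/mmaudio-v2:version-id"

# Input copied from Example 1.
EXAMPLE_INPUT = {
    "video": "https://huggingface.co/hkchengrex/MMAudio/resolve/main/examples/sora_kraken.mp4",
    "prompt": "waves, storm",
    "duration": 10,
    "num_steps": 25,
    "cfg_strength": 4.5,
    "negative_prompt": "music",
}


def run_example(model_ref=MODEL_REF, run=None):
    """Send EXAMPLE_INPUT to the model and return whatever the runner returns.

    `run` defaults to replicate.run; a different callable can be passed
    in to exercise this function without network access.
    """
    if run is None:
        # Imported lazily so the module loads without the package installed.
        import replicate  # pip install replicate

        run = replicate.run
    return run(model_ref, input=EXAMPLE_INPUT)
```

Calling run_example() with a valid model reference and API token should return a reference to the generated file, comparable to the output URL shown above; the exact return type depends on the client version.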

Technical Specifications

Hardware Type: T4
Run Count: 391
Commercial Use: Unknown/Restricted
Platform: Replicate

Related Keywords

Video-to-Audio Synthesis, Sound Generation, Film and Video Post-Production, Silent Film Restoration, Gaming and VR Sound Design, Temporal Synchronization, Cost-optimized T4 GPU, Adjustable Audio Parameters