Carnegie Mellon University
Carnegie Mellon University
Runway
ACM UIST 2023
NeurIPS ML4CD 2021

Soundify: Matching Sound Effects to Video

1Carnegie Mellon University
2Runway
Soundify system pipeline
Soundify assists users in matching sound effects (in bold) and ambients (in italics) to video, and helps dynamically adjust panning and volume by localizing "sound emitters."

Summary

In the art of video editing, sound helps add character to an object and immerse the viewer within a space. Through formative interviews with professional editors (N=10), we found that the task of adding sounds to video can be challenging. This paper presents Soundify, a system that assists editors in matching sounds to video. Given a video, Soundify identifies matching sounds, synchronizes the sounds to the video, and dynamically adjusts panning and volume to create spatial audio. In a human evaluation study (N=889), we show that Soundify is capable of matching sounds to video out-of-the-box for a diverse range of audio categories. In a within-subjects expert study (N=12), we demonstrate the usefulness of Soundify in helping video editors match sounds to video with lighter workload, reduced task completion time, and improved usability.

Video

About 1 min read

Example Results

Cars

Cooking Fire Crackle

Ducks Water Ripple

Footsteps Tram

Helicopter

Lightning

Seagulls Crows

Washing Machine

Waterfall

Waves