
https://deepmind.google/discover/blog/generating-audio-for-video/

Turn on the sound for the videos below.

Video-to-audio research uses video pixels and text prompts to generate rich soundtracks

Video generation models are advancing at an incredible pace, but many current systems can only generate silent output. One of the next major steps toward bringing generated movies to life is creating soundtracks for these silent videos.

Today, we're sharing progress on our video-to-audio (V2A) technology, which makes synchronized audiovisual generation possible. V2A combines video pixels with natural language text prompts to generate rich soundscapes for the on-screen action.

Our V2A technology can be paired with video generation models like Veo to create shots with a dramatic score, realistic sound effects, or dialogue that matches the characters and tone of a video.

It can also generate soundtracks for a range of traditional footage, including archival material, silent films and more, opening a wider range of creative opportunities.

Prompt for audio: Cinematic, thriller, horror film, music, tension, ambience, footsteps on concrete

Prompt for audio: Cute baby dinosaur chirps, jungle ambience, egg cracking

Prompt for audio: jellyfish pulsating under water, marine life, ocean

Prompt for audio: A drummer on a stage at a concert surrounded by flashing lights and a cheering crowd

Prompt for audio: cars skidding, car engine throttling, angelic electronic music

Prompt for audio: a slow mellow harmonica plays as the sun goes down on the prairie

Google: Generating audio for video
