一区二区日本_久久久久久久国产精品_无码国模国产在线观看_久久99深爱久久99精品_亚洲一区二区三区四区五区午夜_日本在线观看一区二区

DreamTalk

Diffusion-based Expressive Talking Head
Generation Framework.
dreamtalk

When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Yifeng Ma1, Shiwei Zhang2, Jiayu Wang2, Xiang Wang3, Yingya Zhang2, Zhidong Deng1

1Tsinghua University, 2Alibaba Group, 3Huazhong University of Science and Technology

Diffusion models have shown remarkable success in a variety of downstream generative tasks, yet remain under-explored in the important and challenging expressive talking head generation. In this work, we propose a DreamTalk framework to fulfill this gap, which employs meticulous design to unlock the potential of diffusion models in generating expressive talking heads. Specifically, DreamTalk consists of three crucial components: a denoising network, a style-aware lip expert, and a style predictor. The diffusion-based denoising network is able to consistently synthesize high-quality audio-driven face motions across diverse expressions. To enhance the expressiveness and accuracy of lip motions, we introduce a style-aware lip expert that can guide lip-sync while being mindful of the speaking styles. To eliminate the need for expression reference video or text, an extra diffusion-based style predictor is utilized to predict the target expression directly from the audio. By this means, DreamTalk can harness powerful diffusion models to generate expressive faces effectively and reduce the reliance on expensive style references. Experimental results demonstrate that DreamTalk is capable of generating photo-realistic talking faces with diverse speaking styles and achieving accurate lip motions, surpassing existing state-of-the-art counterparts.

The code and checkpoints are released.

Overview

Generalization Capabilities: Songs
送別 Farewell (Chinese), Love Story (English)
More Songs
上海灘 The Bund (Cantonese), Lemon (Japanese), All For Love (English)
Generalization Capabilities: Out-of-domain Portraits

Generalization Capabilities: Speech in Multiple Languages
Speech in Chinese, French, German, Italian, Japanese, Korean, and Spanish
Generalization Capabilities: Noisy Audio

Speaking Style Manipulation
Adjusting the Scale of Classifier-free Guidance; Style Code Interpolation
Speaking Style Prediction

If you are seeking an exhilarating challenge and the chance to collaborate with AIGC and large-scale pretraining, then you have come to the right place. We are searching for talented, motivated, and imaginative researchers to join our team. If you are interested, please don't hesitate to send us your resume via email yingya.zyy@alibaba-inc.com

References

@article{ma2023dreamtalk,
title={DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models},
author={Ma, Yifeng and Zhang, Shiwei and Wang, Jiayu and Wang, Xiang and Zhang, Yingya and Deng, Zhidong},
journal={arXiv preprint arXiv:2312.09767},
year={2023}
}

主站蜘蛛池模板: 成人免费一级视频 | 亚洲人成人一区二区在线观看 | 欧洲精品久久久久毛片完整版 | 视频二区 | 久久久精选 | 一区二区三区四区视频 | 午夜免费网站 | 国户精品久久久久久久久久久不卡 | 国产99热在线 | 亚洲精品一区二区网址 | 国产精品九九九 | 国产精品一区一区 | 国产精品一区三区 | 中文字幕在线观看精品 | 亚洲视频在线看 | 天堂色| 亚洲福利在线视频 | 国产精品日韩一区二区 | 日韩在线 | 欧美一区二区三区四区视频 | 久久久久久久久久久久一区二区 | 免费看片国产 | 一片毛片 | 国产精品欧美一区二区三区不卡 | 欧美激情视频一区二区三区在线播放 | 亚洲精品乱码久久久久久蜜桃91 | 国产一区二区三区 | 午夜影院在线视频 | 国产精品视频一二三区 | 一区二区三区在线免费观看 | 国产精品视频一区二区三区不卡 | 久久久国产精品 | 日本精品一区二区三区在线观看视频 | 久久草视频 | 黄色精品| 国产欧美在线视频 | 日韩快播电影网 | 欧美午夜视频 | 精品国产青草久久久久福利 | 亚洲综合大片69999 | aaa大片免费观看 |