ABSTRACT
Creating a realistic talking head, which takes arbitrary text as input and generates a realistic-looking face speaking that text, has been a long-standing research challenge. Talking heads that cannot express emotion have been made to look very realistic using concatenative approaches [Wang et al. 2011]. Allowing the head to express emotion, however, is a much more challenging problem, and model-based approaches have shown promise in this area. While 2D talking heads currently look more realistic than their 3D counterparts, they are limited both in the range of poses they can express and in the lighting conditions under which they can be rendered. Previous attempts to produce videorealistic 3D expressive talking heads [Cao et al. 2005] have produced encouraging results but have not yet achieved the level of realism of their 2D counterparts.
References
- Anderson, R., Stenger, B., Wan, V., and Cipolla, R. 2013. Expressive visual text-to-speech using active appearance models. In CVPR.
- Cao, Y., Tien, W., Faloutsos, P., and Pighin, F. 2005. Expressive speech-driven facial animation. ACM TOG 24, 4, 1283--1302.
- Wang, L., Han, W., Soong, F., and Huo, Q. 2011. Text driven 3D photo-realistic talking head. In Interspeech, 3307--3308.