Deep Person Generation: A Survey from the Perspective of Face, Pose, and Cloth Synthesis

Tong Sha; Wei Zhang; Tong Shen; Zhoujun Li; Tao Mei

doi:10.1145/3575656

Loading next page...

References (263)

Kun Li, Jinsong Zhang, Yebin Liu, Yu-Kun Lai, Qionghai Dai (2020)
PoNA: Pose-Guided Non-Local Attention for Human Pose Transfer
IEEE Transactions on Image Processing, 29
Ruiyun Yu, Xiaoqi Wang, Xiaohui Xie (2019)
VTNFP: An Image-Based Virtual Try-On Network With Body and Clothing Feature Preservation
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Arnab Karmakar, Deepak Mishra (2020)
A Robust Pose Transformational GAN for Pose Guided Person Image Synthesis
ArXiv, abs/2001.01259
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei Efros (2016)
Image-to-Image Translation with Conditional Adversarial Networks
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Hyug-Jae Lee, Rokkyu Lee, Minseok Kang, Myounghoon Cho, Gunhan Park (2019)
LA-VITON: A Network for Looking-Attractive Virtual Try-On
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Wenbin Zhao, Qing Xie, Yanchun Ma, Yongjian Liu, Shengwu Xiong (2020)
Pose Guided Person Image Generation Based on Pose Skeleton Sequence and 3D Convolution
2020 IEEE International Conference on Image Processing (ICIP)
Bo Zhao, Xiao Wu, Zhi-Qi Cheng, Hao Liu, Jiashi Feng (2017)
Multi-View Image Generation from a Single-View
Proceedings of the 26th ACM international conference on Multimedia
Han Yang, Ruimao Zhang, Xiaobao Guo, Wei Liu, W. Zuo, P. Luo (2020)
Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Ruben Villegas, Jimei Yang, Yuliang Zou, Sungryull Sohn, Xunyu Lin, Honglak Lee (2017)
Learning to Generate Long-term Future via Hierarchical Prediction
Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, P. Luo (2021)
Parser-Free Virtual Try-on via Distilling Appearance Flows
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Prajwal R, Rudrabha Mukhopadhyay, Vinay Namboodiri, C. Jawahar (2020)
A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild
Proceedings of the 28th ACM International Conference on Multimedia
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, J. Kautz (2019)
Joint Discriminative and Generative Learning for Person Re-Identification
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Hyeongwoo Kim, Pablo Garrido, A. Tewari, Weipeng Xu, Justus Thies, M. Nießner, P. Pérez, Christian Richardt, M. Zollhöfer, C. Theobalt (2018)
Deep video portraits
ACM Transactions on Graphics (TOG), 37
Shizuma Kubo, Yusuke Iwasawa, Masahiro Suzuki, Y. Matsuo (2019)
UVTON: UV Mapping to Consider the 3D Structure of a Human in Image-Based Virtual Try-On Network
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Bin Ren, Hao Tang, Fanyang Meng, Runwei Ding, Ling Shao, Philip Torr, N. Sebe (2021)
Cloth Interactive Transformer for Virtual Try-On
ACM Transactions on Multimedia Computing, Communications and Applications
Aiyu Cui, Daniel McKee, S. Lazebnik (2021)
Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Zhenyu Xie, J. Lai, Xiaohua Xie (2020)
LG-VTON: Fashion Landmark Meets Image-Based Virtual Try-On
Kang Liu, J. Ostermann (2011)
Realistic facial expression synthesis for an image-based talking head
2011 IEEE International Conference on Multimedia and Expo
Xu Chen, Jie Song, Otmar Hilliges (2019)
Unpaired Pose Guided Human Image Generation
ArXiv, abs/1901.02284
Yipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara Berg (2019)
Dance Dance Generation: Motion Transfer for Internet Videos
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Tero Karras, S. Laine, Timo Aila (2018)
A Style-Based Generator Architecture for Generative Adversarial Networks
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
C. Xu, Yanwei Fu, Chao Wen, Ye Pan, Yu-Gang Jiang, X. Xue (2020)
Pose-Guided Person Image Synthesis in the Non-Iconic Views
IEEE Transactions on Image Processing, 29
(2019)
2020 . LG - VTON : Fashion Landmark Meets Image - Based Virtual TryOn . In Chinese Conference on Pattern Recognition and Computer Vision ( PRCV )
Shiry Ginosar, Amir Bar, Gefen Kohavi, Caroline Chan, Andrew Owens, Jitendra Malik (2019)
Learning Individual Styles of Conversational Gesture
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Xintong Han, Weilin Huang, Xiaojun Hu, Matthew Scott (2019)
ClothFlow: A Flow-Based Model for Clothed Person Generation
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Yang Zhou, Dingzeyu Li, Xintong Han, E. Kalogerakis, Eli Shechtman, J. Echevarria (2020)
MakeItTalk: Speaker-Aware Talking Head Animation
ArXiv, abs/2004.12992
Ceyuan Yang, Zhe Wang, Xinge Zhu, Chen Huang, Jianping Shi, Dahua Lin (2018)
Pose Guided Human Video Generation
Zhou Wang, A. Bovik, H. Sheikh, Eero Simoncelli (2004)
Image quality assessment: from error visibility to structural similarity
IEEE Transactions on Image Processing, 13
Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, C. Qian, R. He, Y. Qiao, Chen Loy (2020)
MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation
I. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio (2014)
Generative adversarial networks
Communications of the ACM, 63
Matiur Minar, T. Tuan, Heejune Ahn, Paul Rosin, Yu-Kun Lai (2020)
CP-VTON+: Clothing Shape and Texture Preserving Image-Based Virtual Try-On
L. Verdoliva (2020)
Media Forensics and DeepFakes: An Overview
IEEE Journal of Selected Topics in Signal Processing, 14
[ (2019)
Multi-view based pose alignment method for person re-identification
Chinese Intelligent Automation Conference. Springer
Frédéric Cordier, Won-Sook Lee, H. Seo, N. Magnenat-Thalmann (2001)
Virtual-Try-On on the Web
Haoye Dong, Xiaodan Liang, Chenxing Zhou, Hanjiang Lai, Jia Zhu, Jian Yin (2019)
Part-Preserving Pose Manipulation for Person Image Synthesis
2019 IEEE International Conference on Multimedia and Expo (ICME)
[ (2020)
What comprises a good talking-head video generation?
IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.
Hao Tang, Dan Xu, Gaowen Liu, Wei Wang, N. Sebe, Yan Yan (2019)
Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation
Proceedings of the 27th ACM International Conference on Multimedia
Nikolay Jetchev, Urs Bergmann (2017)
The Conditional Analogy GAN: Swapping Fashion Articles on People Images
2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
Hajer Ghodhbani, Mohamed Neji, Imran Razzak, A. Alimi (2021)
You can try without visiting: a comprehensive survey on virtually try-on outfits
Multimedia Tools and Applications, 81
Lingbo Yang, Zhenghui Zhao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao (2019)
Disentangled Human Action Video Generation via Decoupled Learning
2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
A. Jamaludin, Joon Chung, Andrew Zisserman (2019)
You Said That?: Synthesising Talking Faces from Audio
International Journal of Computer Vision, 127
Bochao Wang, Huabing Zhang, Xiaodan Liang, Yimin Chen, Liang Lin, Meng Yang (2018)
Toward Characteristic-Preserving Image-based Virtual Try-On Network
ArXiv, abs/1807.07688
[ (2019)
Deferred neural rendering: Image synthesis using neural textures
ACM Transactions on Graphics (TOG), 38
Shunsuke Saito, T. Simon, Jason Saragih, H. Joo (2020)
PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Zhuo Chen, Chaoyue Wang, Bo Yuan, D. Tao (2020)
PuppeteerGAN: Arbitrary Portrait Animation With Semantic-Aware Appearance Transformation
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
S. Eskimez, R. Maddox, Chenliang Xu, Z. Duan (2020)
End-To-End Generation of Talking Faces from Noisy Speech
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Tero Karras, Timo Aila, S. Laine, Antti Herva, J. Lehtinen (2017)
Audio-driven facial animation by joint end-to-end learning of pose and emotion
ACM Transactions on Graphics (TOG), 36
Kuan-Hsien Liu, Ting-Yen Chen, Chu-Song Chen (2016)
MVC: A Dataset for View-Invariant Clothing Retrieval and Attribute Prediction
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval
Xintong Han, Zuxuan Wu, Weilin Huang, Matthew Scott, L. Davis (2019)
Compatible and Diverse Fashion Image Inpainting
ArXiv, abs/1902.01096
Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei Efros (2017)
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks
2017 IEEE International Conference on Computer Vision (ICCV)
Wayne Wu, Yunxuan Zhang, Cheng Li, C. Qian, Chen Loy (2018)
ReenactGAN: Learning to Reenact Faces via Boundary Transfer
ArXiv, abs/1807.11079
Ming-Yu Liu, Xun Huang, Jiahui Yu, Ting-Chun Wang, Arun Mallya (2020)
Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications
Proceedings of the IEEE, 109
M. Cooke, J. Barker, S. Cunningham, Xu Shao (2006)
An audio-visual corpus for speech perception and automatic speech recognition.
The Journal of the Acoustical Society of America, 120 5 Pt 1
Lele Chen, Guofeng Cui, Ziyi Kou, Haitian Zheng, Chenliang Xu (2020)
What comprises a good talking-head video generation?: A Survey and Benchmark
ArXiv, abs/2005.03201
Amit Raj, Patsorn Sangkloy, Huiwen Chang, James Hays, Duygu Ceylan, Jingwan Lu (2018)
SwapNet: Image Based Garment Transfer
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, J. Kautz, Bryan Catanzaro (2019)
Few-shot Video-to-Video Synthesis
ArXiv, abs/1910.12713
Catalin Ionescu, Dragos Papava, Vlad Olaru, C. Sminchisescu (2014)
Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments
IEEE Transactions on Pattern Analysis and Machine Intelligence, 36
Amina Kammoun, Rim Slama, Hedi Tabia, T. Ouni, Mohmed Abid (2022)
Generative Adversarial Networks for Face Generation: A Survey
ACM Computing Surveys, 55
Lele Chen, Zhiheng Li, R. Maddox, Z. Duan, Chenliang Xu (2018)
Lip Movements Generation at a Glance
ArXiv, abs/1803.10404
A. Rössler, D. Cozzolino, L. Verdoliva, C. Riess, Justus Thies, M. Nießner (2019)
FaceForensics++: Learning to Detect Manipulated Facial Images
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Chaitanya Ahuja, Shugao Ma, Louis-Philippe Morency, Yaser Sheikh (2019)
To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations
2019 International Conference on Multimodal Interaction
Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, K. Grauman (2019)
Fashion++: Minimal Edits for Outfit Improvement
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Najmeh Sadoughi, C. Busso (2018)
Speech-Driven Expressive Talking Lips with Conditional Sequential Generative Adversarial Networks
IEEE Transactions on Affective Computing, 12
(2021)
Failure cases with wrong arm shape (b) Failure cases with occlusion
Hajer Ghodhbani, A. Alimi, Mohamed Neji (2021)
Image-Based Virtual Try-on System: A Survey of Deep Learning-Based Methods
Dong Liang, Rui Wang, Xiao‐Bo Tian, Cong Zou (2018)
PCGAN: Partition-Controlled Human Image Generation
ArXiv, abs/1811.09928
Mohammad Koujan, M. Doukas, A. Roussos, S. Zafeiriou (2020)
Head2Head: Video-based Neural Head Synthesis
2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020)
Debapriya Roy, Sanchayan Santra, B. Chanda (2020)
LGVTON: A Landmark Guided Approach to Virtual Try-On
ArXiv, abs/2004.00562
Ting-Chun Wang, Arun Mallya, Ming-Yu Liu (2020)
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
A. Tewari, Ohad Fried, Justus Thies, V. Sitzmann, Stephen Lombardi, Kalyan Sunkavalli, Ricardo Martin-Brualla, T. Simon, Jason Saragih, M. Nießner, Rohit Pandey, S. Fanello, Gordon Wetzstein, Jun-Yan Zhu, C. Theobalt, Maneesh Agrawala, Eli Shechtman, Dan Goldman, Michael Zollhofer (2020)
State of the Art on Neural Rendering
Computer Graphics Forum, 39
Yining Li, Chen Huang, Chen Loy (2019)
Dense Intrinsic Appearance Flow for Human Pose Transfer
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Bo Fan, Lijuan Wang, F. Soong, Lei Xie (2015)
Photo-real talking head with deep bidirectional LSTM
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[ (2020)
DeepFakes: Trick or treat?Business Horizons 63, 2 (2020), 135–146
DeepFakes: Trick or treat?Business Horizons
Y. Nirkin, Y. Keller, Tal Hassner (2019)
FSGAN: Subject Agnostic Face Swapping and Reenactment
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Xianfang Zeng, Yusu Pan, Mengmeng Wang, Jiangning Zhang, Yong Liu (2020)
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose
Justus Thies, Mohamed Elgharib, A. Tewari, C. Theobalt, M. Nießner (2019)
Neural Voice Puppetry: Audio-driven Facial Reenactment
Jan Kietzmann, Linda Lee, Ian McCarthy, Tim Kietzmann (2020)
Deepfakes: Trick or treat?
Business Horizons
Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, Hongsheng Li (2018)
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification
Li Yu, Y. Zhong, Xin Wang (2019)
Inpainting-Based Virtual Try-on Network for Selective Garment Transfer
IEEE Access, 7
S. Tripathy, Juho Kannala, Esa Rahtu (2020)
FACEGAN: Facial Attribute Controllable rEenactment GAN
2021 IEEE Winter Conference on Applications of Computer Vision (WACV)
Jacob Walker, Kenneth Marino, A. Gupta, M. Hebert (2017)
The Pose Knows: Video Forecasting by Generating Pose Futures
2017 IEEE International Conference on Computer Vision (ICCV)
James Charles, D. Magee, David Hogg (2016)
Virtual Immortality: Reanimating Characters from TV Shows
Houwei Cao, David Cooper, M. Keutmann, R. Gur, A. Nenkova, R. Verma (2014)
CREMA-D: Crowd-Sourced Emotional Multimodal Actors Dataset
IEEE Transactions on Affective Computing, 5
Supasorn Suwajanakorn, S. Seitz, Ira Kemelmacher-Shlizerman (2017)
Synthesizing Obama
ACM Transactions on Graphics (TOG), 36
(2020)
Ayush Tewari, Christian Theobalt, and Matthias Nießner
Hadar Averbuch-Elor, D. Cohen-Or, J. Kopf, Michael Cohen (2017)
Bringing portraits to life
ACM Transactions on Graphics (TOG), 36
Haoye Dong, Xiaodan Liang, Bochao Wang, Hanjiang Lai, Jia Zhu, Jian Yin (2019)
Towards Multi-Pose Guided Virtual Try-On Network
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Gökhan Yildirim, Nikolay Jetchev, Roland Vollgraf, Urs Bergmann (2019)
Generating High-Resolution Fashion Model Images Wearing Custom Outfits
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Shane Barratt, Rishi Sharma (2018)
A Note on the Inception Score
ArXiv, abs/1801.01973
Linsen Song, Wayne Wu, Chaoyou Fu, C. Qian, Chen Loy, R. He (2021)
Everything's Talkin': Pareidolia Face Reenactment
ArXiv, abs/2104.03061
Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, G. Henter, Hedvig Kjellstrom (2020)
Moving Fast and Slow: Analysis of Representations and Post-Processing in Speech-Driven Automatic Gesture Generation
International Journal of Human–Computer Interaction, 37
Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, P. Lio’, Yoshua Bengio (2017)
Graph Attention Networks
ArXiv, abs/1710.10903
[ (2013)
Human3
IEEE Transactions on Pattern Analysis and Machine Intelligence, 36
Ohad Fried, A. Tewari, M. Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan Goldman, Kyle Genova, Zeyu Jin, C. Theobalt, Maneesh Agrawala (2019)
Text-based editing of talking-head video
ACM Transactions on Graphics (TOG), 38
Xintong Han, Zuxuan Wu, Weilin Huang, Matthew Scott, L. Davis (2019)
FiNet: Compatible and Diverse Fashion Image Inpainting
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Hang Zhou, Yasheng Sun, Wayne Wu, Chen Loy, Xiaogang Wang, Ziwei Liu (2021)
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Justus Thies, M. Zollhöfer, M. Nießner, Levi Valgaerts, M. Stamminger, C. Theobalt (2015)
Real-time expression transfer for facial reenactment
ACM Transactions on Graphics (TOG), 34
Ran Yi, Zipeng Ye, Juyong Zhang, H. Bao, Yong-Jin Liu (2020)
Audio-driven Talking Face Video Generation with Natural Head Pose
ArXiv, abs/2002.10137
Justus Thies, M. Zollhöfer, M. Stamminger, C. Theobalt, M. Nießner (2016)
Face2Face: Real-Time Face Capture and Reenactment of RGB Videos
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Kunlin Liu, Ivan Perov, Daiheng Gao, Nikolay Chervoniy, Wenbo Zhou, Weiming Zhang (2020)
Deepfacelab: Integrated, flexible and extensible face-swapping framework
Pattern Recognit., 141
Egor Burkov, I. Pasechnik, A. Grigorev, V. Lempitsky (2020)
Neural Head Reenactment with Latent Pose Descriptors
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
B. Mildenhall, Pratul Srinivasan, Matthew Tancik, J. Barron, R. Ramamoorthi, Ren Ng (2020)
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Commun. ACM, 65
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, J. Kautz, Bryan Catanzaro (2018)
Video-to-Video Synthesis
Rubén Tolosana, R. Vera-Rodríguez, Julian Fierrez, A. Morales, J. Ortega-Garcia (2020)
DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection
ArXiv, abs/2001.00179
Aliaksandr Siarohin, Stéphane Lathuilière, S. Tulyakov, E. Ricci, N. Sebe (2018)
Animating Arbitrary Objects via Deep Motion Transfer
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Yurui Ren, Xiaoming Yu, Junming Chen, Thomas Li, Ge Li (2020)
Deep Image Spatial Transformation for Person Image Generation
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Chenyang Si, Wei Wang, Liang Wang, T. Tan (2018)
Multistage Adversarial Losses for Pose-Based Human Image Synthesis
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Kun Wu, Chengxiang Yin, Zhengping Che, Bo Jiang, Jian Tang, Zheng Guan, Gangyi Ding (2021)
Human Pose Transfer with Disentangled Feature Consistency
ArXiv, abs/2107.10984
Ji Liu, Heshan Liu, M. Chiu, Yu-Wing Tai, Chi-Keung Tang (2020)
Pose-Guided High-Resolution Appearance Transfer via Progressive Training
ArXiv, abs/2008.11898
Meichen Liu, Xin Yan, Chenhui Wang, Ke-jun Wang (2020)
Segmentation mask-guided person image generation
Applied Intelligence, 51
Linsen Song, Wayne Wu, C. Qian, R. He, Chen Loy (2020)
Everybody’s Talkin’: Let Me Talk as You Want
IEEE Transactions on Information Forensics and Security, 17
M. Doukas, Mohammad Koujan, V. Sharmanska, A. Roussos, S. Zafeiriou (2020)
Head2Head++: Deep Facial Attributes Re-Targeting
IEEE Transactions on Biometrics, Behavior, and Identity Science, 3
Max Jaderberg, K. Simonyan, Andrew Zisserman, K. Kavukcuoglu (2015)
Spatial Transformer Networks
ArXiv, abs/1506.02025
Pablo Garrido, Levi Valgaerts, Hamid Sarmadi, I. Steiner, Kiran Varanasi, P. Pérez, C. Theobalt (2015)
VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track
Computer Graphics Forum, 34
Yu Liu, Wei Chen, Li Liu, M. Lew (2019)
SwapGAN: A Multistage Generative Approach for Person-to-Person Fashion Style Transfer
IEEE Transactions on Multimedia, 21
C. Bregler, M. Covell, M. Slaney (1997)
Video Rewrite: driving visual speech with audio
Proceedings of the 24th annual conference on Computer graphics and interactive techniques
Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, P. Luo (2021)
Disentangled Cycle Consistency for Highly-realistic Virtual Try-On
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Lingjie Liu, Weipeng Xu, Michael Zollhoefer, Hyeongwoo Kim, Florian Bernard, Marc Habermann, Wenping Wang, C. Theobalt (2018)
Neural Rendering and Reenactment of Human Actor Videos
ACM Transactions on Graphics (TOG), 38
Najwa Alghamdi, Steve Maddock, R. Marxer, J. Barker, Guy Brown (2018)
A corpus of audio-visual Lombard speech with frontal and profile views.
The Journal of the Acoustical Society of America, 143 6
Ivan Petrov, Daiheng Gao, Nikolay Chervoniy, Kunlin Liu, Sugasa Marangonda, C. Umé, Mr. Dpfks, RP Luis, Jian Jiang, Sheng Zhang, Pingyu Wu, Bo Zhou, Weiming Zhang (2020)
DeepFaceLab: A simple, flexible and extensible face swapping framework
ArXiv, abs/2005.05535
Zhen Zhu, Tengteng Huang, Baoguang Shi, Miao Yu, Bofei Wang, X. Bai (2019)
Progressive Pose Attention Transfer for Person Image Generation
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Konstantinos Vougioukas, Stavros Petridis, M. Pantic (2019)
End-to-End Speech-Driven Realistic Facial Animation with Temporal GANs
[ (2020)
Audio-driven talking face video generation with learning-based personalized head pose
arXiv:2002.10137 (2020).
Weiyu Zhang, Menglong Zhu, K. Derpanis (2013)
From Actemes to Action: A Strongly-Supervised Representation for Detailed Action Understanding
2013 IEEE International Conference on Computer Vision
Lingjie Liu, Weipeng Xu, M. Zollhöfer, Hyeongwoo Kim, Florian Bernard, Marc Habermann, Wenping Wang, C. Theobalt (2018)
Neural Animation and Reenactment of Human Actor Videos
ArXiv, abs/1809.03658
Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes (2020)
Do Not Mask What You Do Not Need to Mask: a Parser-Free Virtual Try-On
ArXiv, abs/2007.02721
Stéphane Lathuilière, E. Sangineto, Aliaksandr Siarohin, N. Sebe (2019)
Attention-based Fusion for Multi-source Human Image Generation
2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
Fengiiao Sun, Jiaming Guo, Z. Su, Chengying Gao (2019)
Image-Based Virtual Try-on Network with Structural Coherence
2019 IEEE International Conference on Image Processing (ICIP)
Egor Zakharov, Aleksei Ivakhnenko, Aliaksandra Shysheya, V. Lempitsky (2020)
Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, Q. Tian (2015)
Scalable Person Re-identification: A Benchmark
2015 IEEE International Conference on Computer Vision (ICCV)
S. Ha, Martin Kersner, Beomsu Kim, Seokjun Seo, Dongyoung Kim (2019)
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets
Chia-Wei Hsieh, Chieh-Yun Chen, Chien-Lung Chou, Hong-Han Shuai, Wen-Huang Cheng (2019)
Fit-me: Image-Based Virtual Try-on With Arbitrary Poses
2019 IEEE International Conference on Image Processing (ICIP)
C. Busso, Srinivas Parthasarathy, Alec Burmania, Mohammed Abdel-Wahab, Najmeh Sadoughi, E. Provost (2017)
MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception
IEEE Transactions on Affective Computing, 8
Lingyun Yu, Jun Yu, Q. Ling (2019)
Mining Audio, Text and Visual Information for Talking Face Generation
2019 IEEE International Conference on Data Mining (ICDM)
Donggeun Yoo, Namil Kim, Sunggyun Park, Anthony Paek, In-So Kweon (2016)
Pixel-Level Domain Transfer
Kedan Li, Min Chong, Jingen Liu, David Forsyth (2020)
Toward Accurate and Realistic Virtual Try-on Through Shape Matching and Multiple Warps
ArXiv, abs/2003.10817
Joon Chung, Andrew Zisserman (2016)
Lip Reading in the Wild
Arsha Nagrani, Joon Chung, Andrew Zisserman (2017)
VoxCeleb: A Large-Scale Speaker Identification Dataset
Xuelin Qian, Yanwei Fu, T. Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, X. Xue (2017)
Pose-Normalized Image Generation for Person Re-identification
A. Czyżewski, B. Kostek, P. Bratoszewski, J. Kotus, Marcin Szczuka (2017)
An audio-visual corpus for multimodal automatic speech recognition
Journal of Intelligent Information Systems, 49
Dai Hasegawa, Naoshi Kaneko, S. Shirakawa, H. Sakuta, K. Sumi (2018)
Evaluation of Speech-to-Gesture Generation Using Bi-Directional LSTM Network
Proceedings of the 18th International Conference on Intelligent Virtual Agents
Polina Zablotskaia, Aliaksandr Siarohin, Bo Zhao, L. Sigal (2019)
DwNet: Dense warp-based network for pose-guided human video generation
ArXiv, abs/1910.09139
Surgan Jandial, Ayush Chopra, Kumar Ayush, Mayur Hemani, Abhijeet Kumar, Balaji Krishnamurthy (2020)
SieveNet: A Unified Framework for Robust Image-Based Virtual Try-On
2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
Chia-Wei Hsieh, Chieh-Yun Chen, Chien-Lung Chou, Hong-Han Shuai, Jiaying Liu, Wen-Huang Cheng (2019)
FashionOn: Semantic-guided Image-based Virtual Try-on with Detailed Human and Clothing Information
Proceedings of the 27th ACM International Conference on Multimedia
Yudong Guo, Keyu Chen, Sen Liang, Yongjin Liu, H. Bao, Juyong Zhang (2021)
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis
2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan (2015)
Human Parsing with Contextualized Convolutional Neural Network
IEEE Transactions on Pattern Analysis and Machine Intelligence, 39
Albert Pumarola, Antonio Agudo, A. Sanfeliu, F. Moreno-Noguer (2018)
Unsupervised Person Image Synthesis in Arbitrary Poses
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Haoye Dong, Xiaodan Liang, Xiaohui Shen, Bowen Wu, Bing-cheng Chen, Jian Yin (2019)
FW-GAN: Flow-Navigated Warping GAN for Video Virtual Try-On
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Antonín Vobecký, Michal Uřičář, David Hurych, R. Škoviera (2019)
Advanced Pedestrian Dataset Augmentation for Autonomous Driving
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Yang Zhou, Dingzeyu Li, Xintong Han, E. Kalogerakis, Eli Shechtman, J. Echevarria (2020)
MakeltTalk
ACM Transactions on Graphics (TOG), 39
Matiur Minar, T. Tuan, Heejune Ahn, Paul Rosin, Yu-Kun Lai (2020)
3D Reconstruction of Clothes using a Human Body Model and its Application to Image-based Virtual Try-On
Ylva Ferstl, Michael Neff, R. Mcdonnell (2019)
Multi-objective adversarial gesture generation
Proceedings of the 12th ACM SIGGRAPH Conference on Motion, Interaction and Games
Aliaksandr Siarohin, E. Sangineto, Stéphane Lathuilière, N. Sebe (2017)
Deformable GANs for Pose-Based Human Image Generation
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
[ (2014)
Cooper, Michael K
Keutmann, 5
Yurui Ren, Ge Li, Shan Liu, Thomas Li (2020)
Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation
IEEE Transactions on Image Processing, 29
Justus Thies, M. Zollhöfer, M. Nießner (2019)
Deferred neural rendering
ACM Transactions on Graphics (TOG), 38
Jinxian Liu, Bingbing Ni, Yichao Yan, P. Zhou, Shuo Cheng, Jianguo Hu (2018)
Pose Transferrable Person Re-identification
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Gemma Rotger, F. Lumbreras, F. Moreno-Noguer, Antonio Agudo (2018)
2D-to-3D Facial Expression Transfer
2018 24th International Conference on Pattern Recognition (ICPR)
Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai (2018)
M2E-Try On Net: Fashion from Model to Everyone
Proceedings of the 27th ACM International Conference on Multimedia
Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes (2019)
End-to-End Learning of Geometric Deformations of Feature Maps for Virtual Try-On
ArXiv, abs/1906.01347
Olivia Wiles, A. Koepke, Andrew Zisserman (2018)
X2Face: A network for controlling face generation by using images, audio, and pose codes
Geoffrey Hinton, O. Vinyals, J. Dean (2015)
Distilling the Knowledge in a Neural Network
ArXiv, abs/1503.02531
Wen-Huang Cheng, Sijie Song, Chieh-Yun Chen, S. Hidayati, Jiaying Liu (2020)
Fashion Meets Computer Vision
ACM Computing Surveys (CSUR), 54
Xingran Zhou, Siyu Huang, Bin Li, Yingming Li, Jiachen Li, Zhongfei Zhang (2019)
Text Guided Person Image Synthesis
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Richard Zhang, Phillip Isola, Alexei Efros, Eli Shechtman, Oliver Wang (2018)
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Meichen Liu, Ke-jun Wang, Ruihang Ji, S. Ge, Jingyi Chen (2021)
Pose transfer generation with semantic parsing attention network for person re-identification
Knowl. Based Syst., 223
M. Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, S. Hochreiter (2017)
GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium
A. Grigorev, A. Sevastopolsky, Alexander Vakhitov, V. Lempitsky (2019)
Coordinate-Based Texture Inpainting for Pose-Guided Human Image Generation
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris Metaxas (2018)
Learning to Forecast and Refine Residual Motion for Image-to-Video Generation
ArXiv, abs/1807.09951
Jiahang Wang, Tong Sha, Wei Zhang, Zhoujun Li, Tao Mei (2020)
Down to the Last Detail: Virtual Try-on with Fine-grained Details
Proceedings of the 28th ACM International Conference on Multimedia
Alec Radford, Luke Metz, Soumith Chintala (2015)
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
CoRR, abs/1511.06434
A. Raffiee, Michael Sollami (2020)
GarmentGAN: Photo-realistic Adversarial Fashion Transfer
2020 25th International Conference on Pattern Recognition (ICPR)
Jiahao Geng, Tianjia Shao, Youyi Zheng, Y. Weng, Kun Zhou (2018)
Warp-guided GANs for single-photo facial animation
ACM Transactions on Graphics (TOG), 37
Xin Gao, Zhenjiang Liu, Zunlei Feng, Chengji Shen, Kairi Ou, Haihong Tang, Mingli Song (2021)
Shape Controllable Virtual Try-on for Underwear Models
Proceedings of the 29th ACM International Conference on Multimedia
Lele Chen, R. Maddox, Z. Duan, Chenliang Xu (2019)
Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Yulei Zhang, Qingjie Zhao, You Li (2019)
Multi-view Based Pose Alignment Method for Person Re-identification
Lecture Notes in Electrical Engineering
R. Güler, N. Neverova, Iasonas Kokkinos (2018)
DensePose: Dense Human Pose Estimation in the Wild
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Ziwei Liu, Sijie Yan, Ping Luo, Xiaogang Wang, Xiaoou Tang (2016)
Fashion Landmark Detection in the Wild
ArXiv, abs/1608.03049
Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang (2016)
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Seunghwan Choi, Sunghyun Park, M. Lee, J. Choo (2021)
VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
V. Wan, Robert Anderson, A. Blokland, N. Braunschweiler, Langzhou Chen, B. Kolluru, Javier Latorre, R. Maia, B. Stenger, K. Yanagisawa, Y. Stylianou, M. Akamine, M. Gales, R. Cipolla (2013)
Photo-realistic expressive text to talking head synthesis
Soujanya Poria, Devamanyu Hazarika, Navonil Majumder, Gautam Naik, E. Cambria, Rada Mihalcea (2018)
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
ArXiv, abs/1810.02508
Guha Balakrishnan, Amy Zhao, Adrian Dalca, F. Durand, J. Guttag (2018)
Synthesizing Images of Humans in Unseen Poses
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Haitian Zheng, Lele Chen, Chenliang Xu, Jiebo Luo (2019)
Unsupervised Pose Flow Learning for Pose Guided Synthesis
ArXiv, abs/1909.13819
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. VOL. II. NO. 6. JUNE 1989 567 Principal Warps: Thin-Plate Splines and the Decomposition of Deformations
Lilin Cheng, Suzhe Wang, Zhimeng Zhang, Yu Ding, Yixing Zheng, Xin Yu, Changjie Fan (2021)
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation
Konstantinos Vougioukas, Stavros Petridis, M. Pantic (2018)
End-to-End Speech-Driven Facial Animation with Temporal GANs
Matteo Fincato, Federico Landi, M. Cornia, Fabio Cesari, R. Cucchiara (2021)
VITON-GT: An Image-based Virtual Try-On Model with Geometric Transformations
2020 25th International Conference on Pattern Recognition (ICPR)
A. Neuberger, Eran Borenstein, Bar Hilleli, Eduard Oks, Sharon Alpert (2020)
Image Based Virtual Try-On Network From Unpaired Data
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
[ (2015)
Human parsing with contextualized convolutional neural network
Proceedings of the IEEE International Conference on Computer Vision
Shizhan Zhu, S. Fidler, R. Urtasun, Dahua Lin, Chen Loy (2017)
Be Your Own Prada: Fashion Synthesis with Structural Coherence
2017 IEEE International Conference on Computer Vision (ICCV)
Yao-Jen Chang, Tony Ezzat (2005)
Transferable videorealistic speech animation
N. Harte, E. Gillen (2015)
TCD-TIMIT: An Audio-Visual Corpus of Continuous Speech
IEEE Transactions on Multimedia, 17
(2021)
Failure cases with inconsistent appearance
B. Chen, Yi Zhang, Hongchen Tan, Baocai Yin, Xiuping Liu (2021)
PMAN: Progressive Multi-Attention Network for Human Pose Transfer
IEEE Transactions on Circuits and Systems for Video Technology, 32
Daniel Vlasic, M. Brand, H. Pfister, J. Popović (2005)
Face transfer with multilinear models
ACM SIGGRAPH 2005 Papers
[ (2020)
Dpfks, Carl Shift Facenheim, Luis RP, Jian Jiang, etal
Zhedong Zheng, Liang Zheng, Yi Yang (2017)
Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro
2017 IEEE International Conference on Computer Vision (ICCV)
Patrick Esser, E. Sutter, B. Ommer (2018)
A Variational U-Net for Conditional Appearance and Shape Generation
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Triantafyllos Afouras, Joon Chung, Andrew Zisserman (2018)
LRS3-TED: a large-scale dataset for visual speech recognition
ArXiv, abs/1809.00496
Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis
Hanxiang Hao, Sriram Baireddy, A. Reibman, E. Delp (2020)
FaR-GAN for One-Shot Face Reenactment
ArXiv, abs/2005.06402
Ran Yi, Zipeng Ye, Juyong Zhang, H. Bao, Yong-Jin Liu (2020)
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose
arXiv: Computer Vision and Pattern Recognition
Shunsuke Saito, Zeng Huang, Ryota Natsume, S. Morishima, Angjoo Kanazawa, Hao Li (2019)
PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Wen Liu, Zhixin Piao, Jie Min, Wenhan Luo, Lin Ma, Shenghua Gao (2019)
Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
M. Turk (1999)
A morphable model for the synthesis of 3D faces
Triantafyllos Afouras, Joon Chung, A. Senior, O. Vinyals, Andrew Zisserman (2018)
Deep Audio-Visual Speech Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 44
Shizuma Kubo, Yusuke Iwasawa, Y. Matsuo (2018)
Generative Adversarial Network-Based Virtual Try-On with Clothing Region
Hyeongwoo Kim, Mohamed Elgharib, M. Zollhöfer, H. Seidel, T. Beeler, Christian Richardt, C. Theobalt (2019)
Neural style-preserving visual dubbing
ACM Transactions on Graphics (TOG), 38
Daniel Cudeiro, Timo Bolkart, Cassidy Laidlaw, Anurag Ranjan, Michael Black (2019)
Capture, Learning, and Synthesis of 3D Speaking Styles
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
[ (2020)
CloTH-VTON: Clothing three-dimensional reconstruction for hybrid image-based virtual try-ON
Proceedings of the Asian Conference on Computer Vision.
[ (2020)
NeRF: Representing scenes as neural radiance fields for view synthesis
European Conference on Computer Vision
Xi Ouyang, Yu Cheng, Yifan Jiang, Chun-Liang Li, Pan Zhou (2018)
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond
ArXiv, abs/1804.02047
Rithesh Kumar, Jose Sotelo, Kundan Kumar, A. Brébisson, Yoshua Bengio (2017)
ObamaNet: Photo-realistic lip-sync from text
ArXiv, abs/1801.01442
Sijie Song, Wei Zhang, Jiaying Liu, Tao Mei (2019)
Unsupervised Person Image Generation With Semantic Parsing Transformation
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Kuangxiao Gu, Yuqian Zhou, Thomas Huang (2019)
FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis
ArXiv, abs/1911.09224
Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, L. Davis (2017)
VITON: An Image-Based Virtual Try-on Network
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Mehdi Mirza, Simon Osindero (2014)
Conditional Generative Adversarial Nets
ArXiv, abs/1411.1784
[ (2021)
Fashion meets computer vision: A survey
ACM Computing Surveys (CSUR), 54
Yang-Tian Sun, Haozhi Huang, Xuan Wang, Yu-Kun Lai, Wei Liu, Lin Gao (2021)
Robust Pose Transfer With Dynamic Details Using Neural Video Rendering
IEEE Transactions on Pattern Analysis and Machine Intelligence, 45
Lingbo Yang, Pan Wang, Chang Liu, Zhanning Gao, Peiran Ren, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Xiansheng Hua, Wen Gao (2020)
Towards Fine-Grained Human Pose Transfer With Detail Replenishing Network
IEEE Transactions on Image Processing, 30
Romain Lopez, Pierre Boyeau, N. Yosef, Michael Jordan, J. Regier (2020)
AUTO-ENCODING VARIATIONAL BAYES
[ (2014)
Generative adversarial nets
Advances in Neural Information Processing Systems
Egor Zakharov, Aliaksandra Shysheya, Egor Burkov, V. Lempitsky (2019)
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Kyle Olszewski, Zimo Li, Chao Yang, Yi Zhou, Ronald Yu, Zeng Huang, Sitao Xiang, Shunsuke Saito, Pushmeet Kohli, Hao Li (2017)
Realistic Dynamic Facial Textures from a Single Image Using GANs
2017 IEEE International Conference on Computer Vision (ICCV)
Haoye Dong, Xiaodan Liang, Ke Gong, Hanjiang Lai, Jia Zhu, Jian Yin (2018)
Soft-Gated Warping-GAN for Pose-Guided Person Image Synthesis
S. Tripathy, Juho Kannala, Esa Rahtu (2019)
ICface: Interpretable and Controllable Face Reenactment Using GANs
2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
H. Tang, S. Bai, Li Zhang, Philip Torr, N. Sebe (2020)
XingGAN for Person Image Generation
ArXiv, abs/2007.09278
Jiahang Wang, Wei Zhang, Weizhong Liu, Tao Mei (2019)
Down to the Last Detail: Virtual Try-on with Detail Carving
ArXiv, abs/1912.06324
Aliaksandr Siarohin, Stéphane Lathuilière, S. Tulyakov, E. Ricci, N. Sebe (2020)
First Order Motion Model for Image Animation
Joon Chung, A. Jamaludin, Andrew Zisserman (2017)
You said that?
ArXiv, abs/1705.02966
Christoph Lassner, Gerard Pons-Moll, Peter Gehler (2017)
A Generative Model of People in Clothing
2017 IEEE International Conference on Computer Vision (ICCV)
Zerong Zheng, Tao Yu, Yebin Liu, Qionghai Dai (2020)
PaMIR: Parametric Model-Conditioned Implicit Representation for Image-Based Human Reconstruction
IEEE Transactions on Pattern Analysis and Machine Intelligence, 44
Jae Yoon, Lingjie Liu, Vladislav Golyanik, Kripasindhu Sarkar, H. Park, C. Theobalt (2020)
Pose-Guided Human Animation from a Single Image in the Wild
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Weidong Yin, Ziwei Liu, L. Sigal (2020)
Person-in-Context Synthesis with Compositional Structural Space
2021 IEEE Winter Conference on Applications of Computer Vision (WACV)
Dan Song, Tianbao Li, Zhendong Mao, Anan Liu (2019)
SP-VITON: shape-preserving image-based virtual try-on network
Multimedia Tools and Applications, 79
Tim Salimans, I. Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, Xi Chen (2016)
Improved Techniques for Training GANs
ArXiv, abs/1606.03498
N. Neverova, R. Güler, Iasonas Kokkinos (2018)
Dense Pose Transfer
Yang Song, Jingwen Zhu, Dawei Li, Xiaolong Wang, H. Qi (2018)
Talking Face Generation by Conditional Recurrent Adversarial Network
S. Tulyakov, Ming-Yu Liu, Xiaodong Yang, J. Kautz (2017)
MoCoGAN: Decomposing Motion and Content for Video Generation
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
S. Livingstone, F. Russo (2018)
The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English
PLoS ONE, 13
S. Eskimez, You Zhang, Z. Duan (2020)
Speech Driven Talking Face Generation From a Single Image and an Emotion Condition
IEEE Transactions on Multimedia, 24
Koki Nagano, Jaewoo Seo, Jun Xing, Lingyu Wei, Zimo Li, Shunsuke Saito, Aviral Agarwal, Jens Fursund, Hao Li (2019)
paGAN: real-time avatars using dynamic textures
ACM Trans. Graph., 37
Liqian Ma, Qianru Sun, Stamatios Georgoulis, L. Gool, B. Schiele, Mario Fritz (2017)
Disentangled Person Image Generation
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Meichen Liu, Ke-jun Wang, Juihang Ji, S. Ge (2020)
Person image generation with semantic attention network for person re-identification
ArXiv, abs/2008.07884
Konstantinos Vougioukas, Stavros Petridis, M. Pantic (2019)
Realistic Speech-Driven Facial Animation with GANs
International Journal of Computer Vision, 128
[ (2017)
Synthesizing Obama: Learning lip sync from audio
ACM Transactions on Graphics (TOG), 36
Joon Chung, Arsha Nagrani, Andrew Zisserman (2018)
VoxCeleb2: Deep Speaker Recognition
Na Zheng, Xuemeng Song, Zhaozheng Chen, Linmei Hu, Da Cao, Liqiang Nie (2019)
Virtually Trying on New Clothing with Arbitrary Poses
Proceedings of the 27th ACM International Conference on Multimedia
M. Zanfir, A. Popa, Andrei Zanfir, C. Sminchisescu (2018)
Human Appearance Transfer
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Yifang Men, Yiming Mao, Yuning Jiang, Wei-Ying Ma, Z. Lian (2020)
Controllable Person Image Synthesis With Attribute-Decomposed GAN
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Aayush Bansal, Shugao Ma, Deva Ramanan, Yaser Sheikh (2018)
Recycle-GAN: Unsupervised Video Retargeting
Lingbo Yang, Pan Wang, Xinfeng Zhang, Shanshe Wang, Zhanning Gao, Peiran Ren, Xuansong Xie, Siwei Ma, Wen Gao (2020)
Region-Adaptive Texture Enhancement For Detailed Person Image Synthesis
2020 IEEE International Conference on Multimedia and Expo (ICME)
Wei Sun, Jawadul Bappy, Shanglin Yang, Yi Xu, Tianfu Wu, Hui Zhou (2019)
Pose Guided Fashion Image Synthesis Using Deep Generative Model
ArXiv, abs/1906.07251
Yichao Yan, Jingwei Xu, Bingbing Ni, Wendong Zhang, Xiaokang Yang (2017)
Skeleton-Aided Articulated Motion Generation
Proceedings of the 25th ACM international conference on Multimedia
(2020)
Failure cases with complex or incorrect target pose Source Person
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan Gomez, Lukasz Kaiser, Illia Polosukhin (2017)
Attention is All you Need
Liqian Ma, Xu Jia, Qianru Sun, B. Schiele, T. Tuytelaars, L. Gool (2017)
Pose Guided Person Image Generation
Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei Efros (2018)
Everybody Dance Now
2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Mohan Zhou, Yalong Bai, Wei Zhang, T. Zhao, Tao Mei (2021)
Responsive Listening Head Generation: A Benchmark Dataset and Baseline
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, Z. Wojna (2015)
Rethinking the Inception Architecture for Computer Vision
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Matiur Minar, Heejune Ahn (2021)
CloTH-VTON+: Clothing Three-Dimensional Reconstruction for Hybrid Image-Based Virtual Try-ON
IEEE Access, 9
Hang Zhou, Yu Liu, Ziwei Liu, Ping Luo, Xiaogang Wang (2018)
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
ArXiv, abs/1807.07860

Publisher: Association for Computing Machinery
ISSN: 0360-0300
eISSN: 1557-7341
DOI: 10.1145/3575656
Publisher site: See Article on Publisher Site

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Deep Person Generation: A Survey from the Perspective of Face, Pose, and Cloth Synthesis

Deep Person Generation: A Survey from the Perspective of Face, Pose, and Cloth Synthesis

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Deep Person Generation: A Survey from the Perspective of Face, Pose, and Cloth Synthesis

Deep Person Generation: A Survey from the Perspective of Face, Pose, and Cloth Synthesis

References (263)

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies