Kun Li, Jinsong Zhang, Yebin Liu, Yu-Kun Lai, Qionghai Dai (2020)
PoNA: Pose-Guided Non-Local Attention for Human Pose Transfer. IEEE Transactions on Image Processing, 29
Ruiyun Yu, Xiaoqi Wang, Xiaohui Xie (2019)
VTNFP: An Image-Based Virtual Try-On Network With Body and Clothing Feature Preservation. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Arnab Karmakar, Deepak Mishra (2020)
A Robust Pose Transformational GAN for Pose Guided Person Image Synthesis. ArXiv, abs/2001.01259
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei Efros (2016)
Image-to-Image Translation with Conditional Adversarial Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Hyug-Jae Lee, Rokkyu Lee, Minseok Kang, Myounghoon Cho, Gunhan Park (2019)
LA-VITON: A Network for Looking-Attractive Virtual Try-On. 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Wenbin Zhao, Qing Xie, Yanchun Ma, Yongjian Liu, Shengwu Xiong (2020)
Pose Guided Person Image Generation Based on Pose Skeleton Sequence and 3D Convolution. 2020 IEEE International Conference on Image Processing (ICIP)
Bo Zhao, Xiao Wu, Zhi-Qi Cheng, Hao Liu, Jiashi Feng (2017)
Multi-View Image Generation from a Single-View. Proceedings of the 26th ACM international conference on Multimedia
Han Yang, Ruimao Zhang, Xiaobao Guo, Wei Liu, W. Zuo, P. Luo (2020)
Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Ruben Villegas, Jimei Yang, Yuliang Zou, Sungryull Sohn, Xunyu Lin, Honglak Lee (2017)
Learning to Generate Long-term Future via Hierarchical Prediction
Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, P. Luo (2021)
Parser-Free Virtual Try-on via Distilling Appearance Flows. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Prajwal R, Rudrabha Mukhopadhyay, Vinay Namboodiri, C. Jawahar (2020)
A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild. Proceedings of the 28th ACM International Conference on Multimedia
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, J. Kautz (2019)
Joint Discriminative and Generative Learning for Person Re-Identification. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Hyeongwoo Kim, Pablo Garrido, A. Tewari, Weipeng Xu, Justus Thies, M. Nießner, P. Pérez, Christian Richardt, M. Zollhöfer, C. Theobalt (2018)
Deep video portraits. ACM Transactions on Graphics (TOG), 37
Shizuma Kubo, Yusuke Iwasawa, Masahiro Suzuki, Y. Matsuo (2019)
UVTON: UV Mapping to Consider the 3D Structure of a Human in Image-Based Virtual Try-On Network. 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Bin Ren, Hao Tang, Fanyang Meng, Runwei Ding, Ling Shao, Philip Torr, N. Sebe (2021)
Cloth Interactive Transformer for Virtual Try-On. ACM Transactions on Multimedia Computing, Communications and Applications
Aiyu Cui, Daniel McKee, S. Lazebnik (2021)
Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Zhenyu Xie, J. Lai, Xiaohua Xie (2020)
LG-VTON: Fashion Landmark Meets Image-Based Virtual Try-On. Chinese Conference on Pattern Recognition and Computer Vision (PRCV)
Kang Liu, J. Ostermann (2011)
Realistic facial expression synthesis for an image-based talking head. 2011 IEEE International Conference on Multimedia and Expo
Xu Chen, Jie Song, Otmar Hilliges (2019)
Unpaired Pose Guided Human Image Generation. ArXiv, abs/1901.02284
Yipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara Berg (2019)
Dance Dance Generation: Motion Transfer for Internet Videos. 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Tero Karras, S. Laine, Timo Aila (2018)
A Style-Based Generator Architecture for Generative Adversarial Networks. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
C. Xu, Yanwei Fu, Chao Wen, Ye Pan, Yu-Gang Jiang, X. Xue (2020)
Pose-Guided Person Image Synthesis in the Non-Iconic Views. IEEE Transactions on Image Processing, 29
Shiry Ginosar, Amir Bar, Gefen Kohavi, Caroline Chan, Andrew Owens, Jitendra Malik (2019)
Learning Individual Styles of Conversational Gesture. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Xintong Han, Weilin Huang, Xiaojun Hu, Matthew Scott (2019)
ClothFlow: A Flow-Based Model for Clothed Person Generation. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Yang Zhou, Dingzeyu Li, Xintong Han, E. Kalogerakis, Eli Shechtman, J. Echevarria (2020)
MakeItTalk: Speaker-Aware Talking Head Animation. ArXiv, abs/2004.12992
Ceyuan Yang, Zhe Wang, Xinge Zhu, Chen Huang, Jianping Shi, Dahua Lin (2018)
Pose Guided Human Video Generation
Zhou Wang, A. Bovik, H. Sheikh, Eero Simoncelli (2004)
Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing, 13
Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, C. Qian, R. He, Y. Qiao, Chen Loy (2020)
MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation
I. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio (2014)
Generative adversarial networks. Communications of the ACM, 63
Matiur Minar, T. Tuan, Heejune Ahn, Paul Rosin, Yu-Kun Lai (2020)
CP-VTON+: Clothing Shape and Texture Preserving Image-Based Virtual Try-On
L. Verdoliva (2020)
Media Forensics and DeepFakes: An Overview. IEEE Journal of Selected Topics in Signal Processing, 14
Frédéric Cordier, Won-Sook Lee, H. Seo, N. Magnenat-Thalmann (2001)
Virtual-Try-On on the Web
Haoye Dong, Xiaodan Liang, Chenxing Zhou, Hanjiang Lai, Jia Zhu, Jian Yin (2019)
Part-Preserving Pose Manipulation for Person Image Synthesis. 2019 IEEE International Conference on Multimedia and Expo (ICME)
Hao Tang, Dan Xu, Gaowen Liu, Wei Wang, N. Sebe, Yan Yan (2019)
Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation. Proceedings of the 27th ACM International Conference on Multimedia
Nikolay Jetchev, Urs Bergmann (2017)
The Conditional Analogy GAN: Swapping Fashion Articles on People Images. 2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
Hajer Ghodhbani, Mohamed Neji, Imran Razzak, A. Alimi (2021)
You can try without visiting: a comprehensive survey on virtually try-on outfits. Multimedia Tools and Applications, 81
Lingbo Yang, Zhenghui Zhao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao (2019)
Disentangled Human Action Video Generation via Decoupled Learning. 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)
A. Jamaludin, Joon Chung, Andrew Zisserman (2019)
You Said That?: Synthesising Talking Faces from Audio. International Journal of Computer Vision, 127
Bochao Wang, Huabing Zhang, Xiaodan Liang, Yimin Chen, Liang Lin, Meng Yang (2018)
Toward Characteristic-Preserving Image-based Virtual Try-On Network. ArXiv, abs/1807.07688
Shunsuke Saito, T. Simon, Jason Saragih, H. Joo (2020)
PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Zhuo Chen, Chaoyue Wang, Bo Yuan, D. Tao (2020)
PuppeteerGAN: Arbitrary Portrait Animation With Semantic-Aware Appearance Transformation. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
S. Eskimez, R. Maddox, Chenliang Xu, Z. Duan (2020)
End-To-End Generation of Talking Faces from Noisy Speech. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Tero Karras, Timo Aila, S. Laine, Antti Herva, J. Lehtinen (2017)
Audio-driven facial animation by joint end-to-end learning of pose and emotion. ACM Transactions on Graphics (TOG), 36
Kuan-Hsien Liu, Ting-Yen Chen, Chu-Song Chen (2016)
MVC: A Dataset for View-Invariant Clothing Retrieval and Attribute Prediction. Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval
Xintong Han, Zuxuan Wu, Weilin Huang, Matthew Scott, L. Davis (2019)
Compatible and Diverse Fashion Image Inpainting. ArXiv, abs/1902.01096
Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei Efros (2017)
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. 2017 IEEE International Conference on Computer Vision (ICCV)
Wayne Wu, Yunxuan Zhang, Cheng Li, C. Qian, Chen Loy (2018)
ReenactGAN: Learning to Reenact Faces via Boundary Transfer. ArXiv, abs/1807.11079
Ming-Yu Liu, Xun Huang, Jiahui Yu, Ting-Chun Wang, Arun Mallya (2020)
Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications. Proceedings of the IEEE, 109
M. Cooke, J. Barker, S. Cunningham, Xu Shao (2006)
An audio-visual corpus for speech perception and automatic speech recognition. The Journal of the Acoustical Society of America, 120(5 Pt 1)
Lele Chen, Guofeng Cui, Ziyi Kou, Haitian Zheng, Chenliang Xu (2020)
What comprises a good talking-head video generation?: A Survey and Benchmark. ArXiv, abs/2005.03201
Amit Raj, Patsorn Sangkloy, Huiwen Chang, James Hays, Duygu Ceylan, Jingwan Lu (2018)
SwapNet: Image Based Garment Transfer
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, J. Kautz, Bryan Catanzaro (2019)
Few-shot Video-to-Video Synthesis. ArXiv, abs/1910.12713
Catalin Ionescu, Dragos Papava, Vlad Olaru, C. Sminchisescu (2014)
Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36
Amina Kammoun, Rim Slama, Hedi Tabia, T. Ouni, Mohmed Abid (2022)
Generative Adversarial Networks for Face Generation: A Survey. ACM Computing Surveys, 55
Lele Chen, Zhiheng Li, R. Maddox, Z. Duan, Chenliang Xu (2018)
Lip Movements Generation at a Glance. ArXiv, abs/1803.10404
A. Rössler, D. Cozzolino, L. Verdoliva, C. Riess, Justus Thies, M. Nießner (2019)
FaceForensics++: Learning to Detect Manipulated Facial Images. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Chaitanya Ahuja, Shugao Ma, Louis-Philippe Morency, Yaser Sheikh (2019)
To React or not to React: End-to-End Visual Pose Forecasting for Personalized Avatar during Dyadic Conversations. 2019 International Conference on Multimodal Interaction
Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, K. Grauman (2019)
Fashion++: Minimal Edits for Outfit Improvement. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Najmeh Sadoughi, C. Busso (2018)
Speech-Driven Expressive Talking Lips with Conditional Sequential Generative Adversarial Networks. IEEE Transactions on Affective Computing, 12
Hajer Ghodhbani, A. Alimi, Mohamed Neji (2021)
Image-Based Virtual Try-on System: A Survey of Deep Learning-Based Methods
Dong Liang, Rui Wang, Xiao‐Bo Tian, Cong Zou (2018)
PCGAN: Partition-Controlled Human Image Generation. ArXiv, abs/1811.09928
Mohammad Koujan, M. Doukas, A. Roussos, S. Zafeiriou (2020)
Head2Head: Video-based Neural Head Synthesis. 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020)
Debapriya Roy, Sanchayan Santra, B. Chanda (2020)
LGVTON: A Landmark Guided Approach to Virtual Try-On. ArXiv, abs/2004.00562
Ting-Chun Wang, Arun Mallya, Ming-Yu Liu (2020)
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
A. Tewari, Ohad Fried, Justus Thies, V. Sitzmann, Stephen Lombardi, Kalyan Sunkavalli, Ricardo Martin-Brualla, T. Simon, Jason Saragih, M. Nießner, Rohit Pandey, S. Fanello, Gordon Wetzstein, Jun-Yan Zhu, C. Theobalt, Maneesh Agrawala, Eli Shechtman, Dan Goldman, Michael Zollhofer (2020)
State of the Art on Neural Rendering. Computer Graphics Forum, 39
Yining Li, Chen Huang, Chen Loy (2019)
Dense Intrinsic Appearance Flow for Human Pose Transfer. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Bo Fan, Lijuan Wang, F. Soong, Lei Xie (2015)
Photo-real talking head with deep bidirectional LSTM. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Y. Nirkin, Y. Keller, Tal Hassner (2019)
FSGAN: Subject Agnostic Face Swapping and Reenactment. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Xianfang Zeng, Yusu Pan, Mengmeng Wang, Jiangning Zhang, Yong Liu (2020)
Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose
Justus Thies, Mohamed Elgharib, A. Tewari, C. Theobalt, M. Nießner (2019)
Neural Voice Puppetry: Audio-driven Facial Reenactment
Jan Kietzmann, Linda Lee, Ian McCarthy, Tim Kietzmann (2020)
Deepfakes: Trick or treat? Business Horizons, 63(2), 135–146
Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, Hongsheng Li (2018)
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification
Li Yu, Y. Zhong, Xin Wang (2019)
Inpainting-Based Virtual Try-on Network for Selective Garment Transfer. IEEE Access, 7
S. Tripathy, Juho Kannala, Esa Rahtu (2020)
FACEGAN: Facial Attribute Controllable rEenactment GAN. 2021 IEEE Winter Conference on Applications of Computer Vision (WACV)
Jacob Walker, Kenneth Marino, A. Gupta, M. Hebert (2017)
The Pose Knows: Video Forecasting by Generating Pose Futures. 2017 IEEE International Conference on Computer Vision (ICCV)
James Charles, D. Magee, David Hogg (2016)
Virtual Immortality: Reanimating Characters from TV Shows
Houwei Cao, David Cooper, M. Keutmann, R. Gur, A. Nenkova, R. Verma (2014)
CREMA-D: Crowd-Sourced Emotional Multimodal Actors Dataset. IEEE Transactions on Affective Computing, 5
Supasorn Suwajanakorn, S. Seitz, Ira Kemelmacher-Shlizerman (2017)
Synthesizing Obama. ACM Transactions on Graphics (TOG), 36
Hadar Averbuch-Elor, D. Cohen-Or, J. Kopf, Michael Cohen (2017)
Bringing portraits to life. ACM Transactions on Graphics (TOG), 36
Haoye Dong, Xiaodan Liang, Bochao Wang, Hanjiang Lai, Jia Zhu, Jian Yin (2019)
Towards Multi-Pose Guided Virtual Try-On Network. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Gökhan Yildirim, Nikolay Jetchev, Roland Vollgraf, Urs Bergmann (2019)
Generating High-Resolution Fashion Model Images Wearing Custom Outfits. 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Shane Barratt, Rishi Sharma (2018)
A Note on the Inception Score. ArXiv, abs/1801.01973
Linsen Song, Wayne Wu, Chaoyou Fu, C. Qian, Chen Loy, R. He (2021)
Everything's Talkin': Pareidolia Face Reenactment. ArXiv, abs/2104.03061
Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, G. Henter, Hedvig Kjellstrom (2020)
Moving Fast and Slow: Analysis of Representations and Post-Processing in Speech-Driven Automatic Gesture Generation. International Journal of Human–Computer Interaction, 37
Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, P. Liò, Yoshua Bengio (2017)
Graph Attention Networks. ArXiv, abs/1710.10903
Ohad Fried, A. Tewari, M. Zollhöfer, Adam Finkelstein, Eli Shechtman, Dan Goldman, Kyle Genova, Zeyu Jin, C. Theobalt, Maneesh Agrawala (2019)
Text-based editing of talking-head video. ACM Transactions on Graphics (TOG), 38
Xintong Han, Zuxuan Wu, Weilin Huang, Matthew Scott, L. Davis (2019)
FiNet: Compatible and Diverse Fashion Image Inpainting. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Hang Zhou, Yasheng Sun, Wayne Wu, Chen Loy, Xiaogang Wang, Ziwei Liu (2021)
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Justus Thies, M. Zollhöfer, M. Nießner, Levi Valgaerts, M. Stamminger, C. Theobalt (2015)
Real-time expression transfer for facial reenactment. ACM Transactions on Graphics (TOG), 34
Ran Yi, Zipeng Ye, Juyong Zhang, H. Bao, Yong-Jin Liu (2020)
Audio-driven Talking Face Video Generation with Natural Head Pose. ArXiv, abs/2002.10137
Justus Thies, M. Zollhöfer, M. Stamminger, C. Theobalt, M. Nießner (2016)
Face2Face: Real-Time Face Capture and Reenactment of RGB Videos. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Kunlin Liu, Ivan Perov, Daiheng Gao, Nikolay Chervoniy, Wenbo Zhou, Weiming Zhang (2020)
Deepfacelab: Integrated, flexible and extensible face-swapping framework. Pattern Recognit., 141
Egor Burkov, I. Pasechnik, A. Grigorev, V. Lempitsky (2020)
Neural Head Reenactment with Latent Pose Descriptors. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
B. Mildenhall, Pratul Srinivasan, Matthew Tancik, J. Barron, R. Ramamoorthi, Ren Ng (2020)
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. Commun. ACM, 65
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, J. Kautz, Bryan Catanzaro (2018)
Video-to-Video Synthesis
Rubén Tolosana, R. Vera-Rodríguez, Julian Fierrez, A. Morales, J. Ortega-Garcia (2020)
DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection. ArXiv, abs/2001.00179
Aliaksandr Siarohin, Stéphane Lathuilière, S. Tulyakov, E. Ricci, N. Sebe (2018)
Animating Arbitrary Objects via Deep Motion Transfer. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Yurui Ren, Xiaoming Yu, Junming Chen, Thomas Li, Ge Li (2020)
Deep Image Spatial Transformation for Person Image Generation. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Chenyang Si, Wei Wang, Liang Wang, T. Tan (2018)
Multistage Adversarial Losses for Pose-Based Human Image Synthesis. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Kun Wu, Chengxiang Yin, Zhengping Che, Bo Jiang, Jian Tang, Zheng Guan, Gangyi Ding (2021)
Human Pose Transfer with Disentangled Feature Consistency. ArXiv, abs/2107.10984
Ji Liu, Heshan Liu, M. Chiu, Yu-Wing Tai, Chi-Keung Tang (2020)
Pose-Guided High-Resolution Appearance Transfer via Progressive Training. ArXiv, abs/2008.11898
Meichen Liu, Xin Yan, Chenhui Wang, Ke-jun Wang (2020)
Segmentation mask-guided person image generation. Applied Intelligence, 51
Linsen Song, Wayne Wu, C. Qian, R. He, Chen Loy (2020)
Everybody’s Talkin’: Let Me Talk as You Want. IEEE Transactions on Information Forensics and Security, 17
M. Doukas, Mohammad Koujan, V. Sharmanska, A. Roussos, S. Zafeiriou (2020)
Head2Head++: Deep Facial Attributes Re-Targeting. IEEE Transactions on Biometrics, Behavior, and Identity Science, 3
Max Jaderberg, K. Simonyan, Andrew Zisserman, K. Kavukcuoglu (2015)
Spatial Transformer Networks. ArXiv, abs/1506.02025
Pablo Garrido, Levi Valgaerts, Hamid Sarmadi, I. Steiner, Kiran Varanasi, P. Pérez, C. Theobalt (2015)
VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track. Computer Graphics Forum, 34
Yu Liu, Wei Chen, Li Liu, M. Lew (2019)
SwapGAN: A Multistage Generative Approach for Person-to-Person Fashion Style Transfer. IEEE Transactions on Multimedia, 21
C. Bregler, M. Covell, M. Slaney (1997)
Video Rewrite: driving visual speech with audio. Proceedings of the 24th annual conference on Computer graphics and interactive techniques
Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, P. Luo (2021)
Disentangled Cycle Consistency for Highly-realistic Virtual Try-On. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Lingjie Liu, Weipeng Xu, Michael Zollhoefer, Hyeongwoo Kim, Florian Bernard, Marc Habermann, Wenping Wang, C. Theobalt (2018)
Neural Rendering and Reenactment of Human Actor Videos. ACM Transactions on Graphics (TOG), 38
Najwa Alghamdi, Steve Maddock, R. Marxer, J. Barker, Guy Brown (2018)
A corpus of audio-visual Lombard speech with frontal and profile views. The Journal of the Acoustical Society of America, 143(6)
Ivan Perov, Daiheng Gao, Nikolay Chervoniy, Kunlin Liu, Sugasa Marangonda, C. Umé, Mr. Dpfks, RP Luis, Jian Jiang, Sheng Zhang, Pingyu Wu, Bo Zhou, Weiming Zhang (2020)
DeepFaceLab: A simple, flexible and extensible face swapping framework. ArXiv, abs/2005.05535
Zhen Zhu, Tengteng Huang, Baoguang Shi, Miao Yu, Bofei Wang, X. Bai (2019)
Progressive Pose Attention Transfer for Person Image Generation. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Konstantinos Vougioukas, Stavros Petridis, M. Pantic (2019)
End-to-End Speech-Driven Realistic Facial Animation with Temporal GANs
Weiyu Zhang, Menglong Zhu, K. Derpanis (2013)
From Actemes to Action: A Strongly-Supervised Representation for Detailed Action Understanding. 2013 IEEE International Conference on Computer Vision
Lingjie Liu, Weipeng Xu, M. Zollhöfer, Hyeongwoo Kim, Florian Bernard, Marc Habermann, Wenping Wang, C. Theobalt (2018)
Neural Animation and Reenactment of Human Actor Videos. ArXiv, abs/1809.03658
Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes (2020)
Do Not Mask What You Do Not Need to Mask: a Parser-Free Virtual Try-On. ArXiv, abs/2007.02721
Stéphane Lathuilière, E. Sangineto, Aliaksandr Siarohin, N. Sebe (2019)
Attention-based Fusion for Multi-source Human Image Generation. 2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
Fengjiao Sun, Jiaming Guo, Z. Su, Chengying Gao (2019)
Image-Based Virtual Try-on Network with Structural Coherence. 2019 IEEE International Conference on Image Processing (ICIP)
Egor Zakharov, Aleksei Ivakhnenko, Aliaksandra Shysheya, V. Lempitsky (2020)
Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, Q. Tian (2015)
Scalable Person Re-identification: A Benchmark. 2015 IEEE International Conference on Computer Vision (ICCV)
S. Ha, Martin Kersner, Beomsu Kim, Seokjun Seo, Dongyoung Kim (2019)
MarioNETte: Few-shot Face Reenactment Preserving Identity of Unseen Targets
Chia-Wei Hsieh, Chieh-Yun Chen, Chien-Lung Chou, Hong-Han Shuai, Wen-Huang Cheng (2019)
Fit-me: Image-Based Virtual Try-on With Arbitrary Poses. 2019 IEEE International Conference on Image Processing (ICIP)
C. Busso, Srinivas Parthasarathy, Alec Burmania, Mohammed Abdel-Wahab, Najmeh Sadoughi, E. Provost (2017)
MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception. IEEE Transactions on Affective Computing, 8
Lingyun Yu, Jun Yu, Q. Ling (2019)
Mining Audio, Text and Visual Information for Talking Face Generation. 2019 IEEE International Conference on Data Mining (ICDM)
Donggeun Yoo, Namil Kim, Sunggyun Park, Anthony Paek, In-So Kweon (2016)
Pixel-Level Domain Transfer
Kedan Li, Min Chong, Jingen Liu, David Forsyth (2020)
Toward Accurate and Realistic Virtual Try-on Through Shape Matching and Multiple Warps. ArXiv, abs/2003.10817
Joon Chung, Andrew Zisserman (2016)
Lip Reading in the Wild
Arsha Nagrani, Joon Chung, Andrew Zisserman (2017)
VoxCeleb: A Large-Scale Speaker Identification Dataset
Xuelin Qian, Yanwei Fu, T. Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, X. Xue (2017)
Pose-Normalized Image Generation for Person Re-identification
A. Czyżewski, B. Kostek, P. Bratoszewski, J. Kotus, Marcin Szczuka (2017)
An audio-visual corpus for multimodal automatic speech recognition. Journal of Intelligent Information Systems, 49
Dai Hasegawa, Naoshi Kaneko, S. Shirakawa, H. Sakuta, K. Sumi (2018)
Evaluation of Speech-to-Gesture Generation Using Bi-Directional LSTM Network. Proceedings of the 18th International Conference on Intelligent Virtual Agents
Polina Zablotskaia, Aliaksandr Siarohin, Bo Zhao, L. Sigal (2019)
DwNet: Dense warp-based network for pose-guided human video generation. ArXiv, abs/1910.09139
Surgan Jandial, Ayush Chopra, Kumar Ayush, Mayur Hemani, Abhijeet Kumar, Balaji Krishnamurthy (2020)
SieveNet: A Unified Framework for Robust Image-Based Virtual Try-On. 2020 IEEE Winter Conference on Applications of Computer Vision (WACV)
Chia-Wei Hsieh, Chieh-Yun Chen, Chien-Lung Chou, Hong-Han Shuai, Jiaying Liu, Wen-Huang Cheng (2019)
FashionOn: Semantic-guided Image-based Virtual Try-on with Detailed Human and Clothing Information. Proceedings of the 27th ACM International Conference on Multimedia
Yudong Guo, Keyu Chen, Sen Liang, Yongjin Liu, H. Bao, Juyong Zhang (2021)
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis. 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan (2015)
Human Parsing with Contextualized Convolutional Neural Network. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39
Albert Pumarola, Antonio Agudo, A. Sanfeliu, F. Moreno-Noguer (2018)
Unsupervised Person Image Synthesis in Arbitrary Poses. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Haoye Dong, Xiaodan Liang, Xiaohui Shen, Bowen Wu, Bing-cheng Chen, Jian Yin (2019)
FW-GAN: Flow-Navigated Warping GAN for Video Virtual Try-On. 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Antonín Vobecký, Michal Uřičář, David Hurych, R. Škoviera (2019)
Advanced Pedestrian Dataset Augmentation for Autonomous Driving. 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Yang Zhou, Dingzeyu Li, Xintong Han, E. Kalogerakis, Eli Shechtman, J. Echevarria (2020)
MakeItTalk. ACM Transactions on Graphics (TOG), 39
Matiur Minar, T. Tuan, Heejune Ahn, Paul Rosin, Yu-Kun Lai (2020)
3D Reconstruction of Clothes using a Human Body Model and its Application to Image-based Virtual Try-On
Ylva Ferstl, Michael Neff, R. Mcdonnell (2019)
Multi-objective adversarial gesture generation. Proceedings of the 12th ACM SIGGRAPH Conference on Motion, Interaction and Games
Aliaksandr Siarohin, E. Sangineto, Stéphane Lathuilière, N. Sebe (2017)
Deformable GANs for Pose-Based Human Image Generation. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Yurui Ren, Ge Li, Shan Liu, Thomas Li (2020)
Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation. IEEE Transactions on Image Processing, 29
Justus Thies, M. Zollhöfer, M. Nießner (2019)
Deferred neural rendering. ACM Transactions on Graphics (TOG), 38
Jinxian Liu, Bingbing Ni, Yichao Yan, P. Zhou, Shuo Cheng, Jianguo Hu (2018)
Pose Transferrable Person Re-identification. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Gemma Rotger, F. Lumbreras, F. Moreno-Noguer, Antonio Agudo (2018)
2D-to-3D Facial Expression Transfer. 2018 24th International Conference on Pattern Recognition (ICPR)
Zhonghua Wu, Guosheng Lin, Qingyi Tao, Jianfei Cai (2018)
M2E-Try On Net: Fashion from Model to Everyone. Proceedings of the 27th ACM International Conference on Multimedia
Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes (2019)
End-to-End Learning of Geometric Deformations of Feature Maps for Virtual Try-On. ArXiv, abs/1906.01347
Olivia Wiles, A. Koepke, Andrew Zisserman (2018)
X2Face: A network for controlling face generation by using images, audio, and pose codes
Geoffrey Hinton, O. Vinyals, J. Dean (2015)
Distilling the Knowledge in a Neural Network. ArXiv, abs/1503.02531
Wen-Huang Cheng, Sijie Song, Chieh-Yun Chen, S. Hidayati, Jiaying Liu (2020)
Fashion Meets Computer Vision. ACM Computing Surveys (CSUR), 54
Xingran Zhou, Siyu Huang, Bin Li, Yingming Li, Jiachen Li, Zhongfei Zhang (2019)
Text Guided Person Image Synthesis. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Richard Zhang, Phillip Isola, Alexei Efros, Eli Shechtman, Oliver Wang (2018)
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Meichen Liu, Ke-jun Wang, Ruihang Ji, S. Ge, Jingyi Chen (2021)
Pose transfer generation with semantic parsing attention network for person re-identification. Knowl. Based Syst., 223
M. Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, S. Hochreiter (2017)
GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium
A. Grigorev, A. Sevastopolsky, Alexander Vakhitov, V. Lempitsky (2019)
Coordinate-Based Texture Inpainting for Pose-Guided Human Image Generation. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris Metaxas (2018)
Learning to Forecast and Refine Residual Motion for Image-to-Video Generation. ArXiv, abs/1807.09951
Jiahang Wang, Tong Sha, Wei Zhang, Zhoujun Li, Tao Mei (2020)
Down to the Last Detail: Virtual Try-on with Fine-grained Details. Proceedings of the 28th ACM International Conference on Multimedia
Alec Radford, Luke Metz, Soumith Chintala (2015)
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. CoRR, abs/1511.06434
A. Raffiee, Michael Sollami (2020)
GarmentGAN: Photo-realistic Adversarial Fashion Transfer. 2020 25th International Conference on Pattern Recognition (ICPR)
Jiahao Geng, Tianjia Shao, Youyi Zheng, Y. Weng, Kun Zhou (2018)
Warp-guided GANs for single-photo facial animation. ACM Transactions on Graphics (TOG), 37
Xin Gao, Zhenjiang Liu, Zunlei Feng, Chengji Shen, Kairi Ou, Haihong Tang, Mingli Song (2021)
Shape Controllable Virtual Try-on for Underwear Models. Proceedings of the 29th ACM International Conference on Multimedia
Lele Chen, R. Maddox, Z. Duan, Chenliang Xu (2019)
Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Yulei Zhang, Qingjie Zhao, You Li (2019)
Multi-view Based Pose Alignment Method for Person Re-identification. Lecture Notes in Electrical Engineering
R. Güler, N. Neverova, Iasonas Kokkinos (2018)
DensePose: Dense Human Pose Estimation in the Wild. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Ziwei Liu, Sijie Yan, Ping Luo, Xiaogang Wang, Xiaoou Tang (2016)
Fashion Landmark Detection in the Wild. ArXiv, abs/1608.03049
Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang (2016)
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Seunghwan Choi, Sunghyun Park, M. Lee, J. Choo (2021)
VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
V. Wan, Robert Anderson, A. Blokland, N. Braunschweiler, Langzhou Chen, B. Kolluru, Javier Latorre, R. Maia, B. Stenger, K. Yanagisawa, Y. Stylianou, M. Akamine, M. Gales, R. Cipolla (2013)
Photo-realistic expressive text to talking head synthesis
Soujanya Poria, Devamanyu Hazarika, Navonil Majumder, Gautam Naik, E. Cambria, Rada Mihalcea (2018)
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations. ArXiv, abs/1810.02508
Guha Balakrishnan, Amy Zhao, Adrian Dalca, F. Durand, J. Guttag (2018)
Synthesizing Images of Humans in Unseen Poses. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Haitian Zheng, Lele Chen, Chenliang Xu, Jiebo Luo (2019)
Unsupervised Pose Flow Learning for Pose Guided Synthesis. ArXiv, abs/1909.13819
F. L. Bookstein (1989)
Principal Warps: Thin-Plate Splines and the Decomposition of Deformations. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11
Lilin Cheng, Suzhe Wang, Zhimeng Zhang, Yu Ding, Yixing Zheng, Xin Yu, Changjie Fan (2021)
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation
Konstantinos Vougioukas, Stavros Petridis, M. Pantic (2018)
End-to-End Speech-Driven Facial Animation with Temporal GANs
Matteo Fincato, Federico Landi, M. Cornia, Fabio Cesari, R. Cucchiara (2021)
VITON-GT: An Image-based Virtual Try-On Model with Geometric Transformations2020 25th International Conference on Pattern Recognition (ICPR)
A. Neuberger, Eran Borenstein, Bar Hilleli, Eduard Oks, Sharon Alpert (2020)
Image Based Virtual Try-On Network From Unpaired Data2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
[ (2015)
Human parsing with contextualized convolutional neural networkProceedings of the IEEE International Conference on Computer Vision
Shizhan Zhu, S. Fidler, R. Urtasun, Dahua Lin, Chen Loy (2017)
Be Your Own Prada: Fashion Synthesis with Structural Coherence2017 IEEE International Conference on Computer Vision (ICCV)
Yao-Jen Chang, Tony Ezzat (2005)
Transferable videorealistic speech animation
N. Harte, E. Gillen (2015)
TCD-TIMIT: An Audio-Visual Corpus of Continuous SpeechIEEE Transactions on Multimedia, 17
(2021)
Failure cases with inconsistent appearance
B. Chen, Yi Zhang, Hongchen Tan, Baocai Yin, Xiuping Liu (2021)
PMAN: Progressive Multi-Attention Network for Human Pose TransferIEEE Transactions on Circuits and Systems for Video Technology, 32
Daniel Vlasic, M. Brand, H. Pfister, J. Popović (2005)
Face transfer with multilinear modelsACM SIGGRAPH 2005 Papers
[ (2020)
Dpfks, Carl Shift Facenheim, Luis RP, Jian Jiang, etal
Zhedong Zheng, Liang Zheng, Yi Yang (2017)
Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in Vitro2017 IEEE International Conference on Computer Vision (ICCV)
Patrick Esser, E. Sutter, B. Ommer (2018)
A Variational U-Net for Conditional Appearance and Shape Generation2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Triantafyllos Afouras, Joon Chung, Andrew Zisserman (2018)
LRS3-TED: a large-scale dataset for visual speech recognitionArXiv, abs/1809.00496
Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis
Deep person generation has attracted extensive research attention because of its wide applications in virtual agents, video conferencing, online shopping, and art and movie production. With the advancement of deep learning, the visual appearance of a person image (face, pose, cloth) can be readily generated on demand. In this survey, we first summarize the scope of person generation and then systematically review recent progress and technical trends in identity-preserving deep person generation, covering three major tasks: talking-head generation (face), pose-guided person generation (pose), and garment-oriented person generation (cloth). More than two hundred papers are covered for a thorough overview, and milestone works are highlighted to mark the major technical breakthroughs. Building on these fundamental tasks, we also investigate applications such as virtual fitting, digital humans, and generative data augmentation. We hope this survey sheds light on the future prospects of identity-preserving deep person generation and provides a helpful foundation for full applications toward the digital human.
ACM Computing Surveys (CSUR) – Association for Computing Machinery
Published: Mar 28, 2023
Keywords: Deep person generation