Access the full text.
Sign up today, get DeepDyve free for 14 days.
G. Bjøntegaard (2001)
Calculation of Average PSNR Differences between RD-curves
Lei Zhao, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao (2018)
Enhanced Ctu-Level Inter Prediction with Deep Frame Rate Up-Conversion for High Efficiency Video Coding2018 25th IEEE International Conference on Image Processing (ICIP)
G. Sullivan, J. Ohm, W. Han, T. Wiegand (2012)
Overview of the High Efficiency Video Coding (HEVC) StandardIEEE Transactions on Circuits and Systems for Video Technology, 22
[ (2017)
FlowNet 2Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. IEEE
Felix Haub, Thorsten Laude, J. Ostermann (2018)
HEVC Inter Coding using Deep Recurrent Neural Networks and Artificial Reference Pictures2019 Picture Coding Symposium (PCS)
[ (2018)
Enhanced intra prediction with recurrent neural network in video codingProceedings of the Data Compression Conference.413–413.
[ (2018)
PWC-Net: CNNs for optical flow using pyramid, warping, and cost volumeProceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. IEEE
Wenhan Yang, Jiaying Liu, Mading Li, Zongming Guo (2018)
Isophote-Constrained Autoregressive Model With Adaptive Window Extension for Image InterpolationIEEE Transactions on Circuits and Systems for Video Technology, 28
Mading Li, Jiaying Liu, Xiaoyan Sun, Zhiwei Xiong (2019)
Image/Video Restoration via Multiplanar Autoregressive Model and Low-Rank OptimizationACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 15
Tianfan Xue, Baian Chen, Jiajun Wu, D. Wei, W. Freeman (2017)
Video Enhancement with Task-Oriented FlowInternational Journal of Computer Vision
Wenhan Yang, Sifeng Xia, Jiaying Liu, Zongming Guo (2019)
Reference-Guided Deep Super-Resolution via Manifold Localized External CompensationIEEE Transactions on Circuits and Systems for Video Technology, 29
Simone Schaub-Meyer, Oliver Wang, H. Zimmer, Max Grosse, A. Sorkine-Hornung (2015)
Phase-based frame interpolation for video2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
W. Marsden (2012)
I and J
Aaas News, E. Lu, Min-Min Zhou, Rong Mocsai, A. Myers, E. Huang, B. Jackson, Davide Ferrari, V. Tybulewicz, V. Lowell, Clifford Lepore, J. Koretzky, Gary Kahn, M. L., F. Achard, H. Eva, Ernst-Detlef Schulze, J. Acharya, U. Acharya, U. Acharya, Shetal Patel, E. Koundakjian, K. Nagashima, Xianlin Han, J. Acharya, D. Adams, Jonathan Horton, Blood, M. Adams, M. McVey, J. Sekelsky, J. Adamson, G. Kochendoerfer, A. Adeleke, A. Kamdem-Toham, Alan Aderem, C. Picard, Aeschlimann, G. Haug, G. Agarwal, M. Scully, H. Aguilaniu, L. Gustafsson, M. Rigoulet, T. Nyström, Asymmetric Inheri, Ferhaan Ahmad, J. Schmitt, M. Aida, S. Ammal, J. Aizenberg, D. Muller, J. Grazul, D. Hamann, J. Ajioka, C. Su, A. Akella, M. Alam, F. Gao, A. Alatas, H. Sinn, Titus Albu, P. Zuev, M. Al-Dayeh, J. Dwyer, A. Al-ghonaium, Sami Al-Hajjar, S. Al-Jumaah, A. Allakhverdov, V. Pokrovsky, Allen, A. Brown, James Allen, A. Brown, James Gillooly, James (1893)
Book ReviewsBuffalo Medical and Surgical Journal, 33
Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Chunlei Cai, Zhiyong Gao (2018)
DVC: An End-To-End Deep Video Compression Framework2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
T. Wiegand, G. Sullivan, G. Bjøntegaard, A. Luthra (2003)
Overview of the H.264/AVC video coding standardIEEE Trans. Circuits Syst. Video Technol., 13
Yuzhang Hu, Sifeng Xia, Wenhan Yang, Jiaying Liu (2020)
Memory-Augmented Auto-Regressive Network for Frame Recurrent Inter Prediction2020 IEEE International Symposium on Circuits and Systems (ISCAS)
MD KINAMI, I. Miyazaki, Mdi
AND T
Chuanmin Jia, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Jiaying Liu, Shiliang Pu, Siwei Ma (2019)
Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video CodingIEEE Transactions on Image Processing, 28
S. Hochreiter, J. Schmidhuber (1997)
Long Short-Term MemoryNeural Computation, 9
[ (2020)
Learning enriched features for real image restoration and enhancementProceedings of the European Conference on Computer Vision
Dezhao Wang, Sifeng Xia, Wenhan Yang, Yueyu Hu, Jiaying Liu (2019)
Partition Tree Guided Progressive Rethinking Network for in-Loop Filtering of HEVC2019 IEEE International Conference on Image Processing (ICIP)
Jihong Kang, Sungjei Kim, Kyoung Lee (2017)
Multi-modal/multi-scale convolutional neural network based in-loop filter design for next generation video codec2017 IEEE International Conference on Image Processing (ICIP)
Anurag Ranjan, Michael Black (2016)
Optical Flow Estimation Using a Spatial Pyramid Network2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
William Lotter, Gabriel Kreiman, David Cox (2016)
Deep Predictive Coding Networks for Video Prediction and Unsupervised LearningArXiv, abs/1605.08104
Jiahao Li, Bin Li, Jizheng Xu, Ruiqin Xiong, Wen Gao (2018)
Fully Connected Network-Based Intra Prediction for Image CodingIEEE Transactions on Image Processing, 27
Emily Denton, Soumith Chintala, Arthur Szlam, R. Fergus (2015)
Deep Generative Image Models using a Laplacian Pyramid of Adversarial NetworksArXiv, abs/1506.05751
K. Rao, Humberto Dominguez (2022)
Versatile Video Coding
Yang Wang, Xiaopeng Fan, Chuanmin Jia, Debin Zhao, Wen Gao (2018)
Neural Network Based Inter Prediction for HEVC2018 IEEE International Conference on Multimedia and Expo (ICME)
Yueyu Hu, Wenhan Yang, Sifeng Xia, Jiaying Liu (2018)
Optimized Spatial Recurrent Network for Intra Prediction in Video Coding2018 IEEE Visual Communications and Image Processing (VCIP)
Yueyu Hu, Wenhan Yang, Mading Li, Jiaying Liu (2018)
Progressive Spatial Recurrent Neural Network for Intra PredictionIEEE Transactions on Multimedia, 21
Deqing Sun, Xiaodong Yang, Ming-Yu Liu, J. Kautz (2017)
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Xingjian Shi, Zhourong Chen, Hao Wang, D. Yeung, W. Wong, W. Woo (2015)
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
J. Ren, Jiaying Liu, Wei Bai, Zongming Guo (2011)
Similarity modulated block estimation for image interpolation2011 18th IEEE International Conference on Image Processing
Xin Jin, Zhibo Chen, Sen Liu, Wei Zhou (2018)
Augmented Coarse-to-Fine Video Frame Synthesis with Semantic Loss
Simon Niklaus, Long Mai, Feng Liu (2017)
Video Frame Interpolation via Adaptive Convolution2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Diederik Kingma, Jimmy Ba (2014)
Adam: A Method for Stochastic OptimizationCoRR, abs/1412.6980
Simon Niklaus, Long Mai, Feng Liu (2017)
Video Frame Interpolation via Adaptive Separable Convolution2017 IEEE International Conference on Computer Vision (ICCV)
Jianping Lin, Dong Liu, Houqiang Li, Feng Wu (2018)
Generative Adversarial Network-Based Frame Extrapolation for Video Coding2018 IEEE Visual Communications and Image Processing (VCIP)
Shuai Huo, Dong Liu, Bin Li, Siwei Ma, Feng Wu, Wen Gao (2020)
Deep Network-Based Frame Extrapolation With Reference Frame AlignmentIEEE Transactions on Circuits and Systems for Video Technology, 31
Jiaying Liu, Sifeng Xia, Wenhan Yang (2019)
Deep Reference Generation With Multi-Domain Hierarchical Constraints for Inter PredictionIEEE Transactions on Multimedia, 22
Ziwei Liu, Raymond Yeh, Xiaoou Tang, Yiming Liu, A. Agarwala (2017)
Video Frame Synthesis Using Deep Voxel Flow2017 IEEE International Conference on Computer Vision (ICCV)
[ (2003)
Overview of the HIEEE Transactions on Circuits and Systems for Video Technology, 13
Ping Hu, G. Wang, Yap-Peng Tan (2018)
Recurrent Spatial Pyramid CNN for Optical Flow EstimationIEEE Transactions on Multimedia, 20
F. Bossen (2010)
Common test conditions and software reference configurations
[ (2018)
A group variational transformation neural network for fractional interpolation of video codingProceedings of the Data Compression Conference.127–136.
Sifeng Xia, Wenhan Yang, Yueyu Hu, Siwei Ma, Jiaying Liu (2018)
A Group Variational Transformation Neural Network for Fractional Interpolation of Video Coding2018 Data Compression Conference
Xiaoshuai Zhang, Wenhan Yang, Yueyu Hu, Jiaying Liu (2018)
Dmcnn: Dual-Domain Multi-Scale Convolutional Neural Network for Compression Artifacts Removal2018 25th IEEE International Conference on Image Processing (ICIP)
O. Castro-Orgaz, W. Hager (2019)
and sShallow Water Hydraulics
Syed Zamir, Aditya Arora, Salman Khan, Munawar Hayat, F. Khan, Ming-Hsuan Yang, Ling Shao (2022)
Learning Enriched Features for Fast Image Restoration and EnhancementIEEE Transactions on Pattern Analysis and Machine Intelligence, 45
Hyomin Choi, I. Bajić (2018)
Deep Frame Prediction for Video CodingIEEE Transactions on Circuits and Systems for Video Technology, 30
Yueyu Hu, Wenhan Yang, Sifeng Xia, Wen-Huang Cheng, Jiaying Liu (2018)
Enhanced Intra Prediction with Recurrent Neural Network in Video Coding2018 Data Compression Conference
Eddy Ilg, N. Mayer, Tonmoy Saikia, M. Keuper, A. Dosovitskiy, T. Brox (2016)
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Jiaying Liu, Sifeng Xia, Wenhan Yang, Mading Li, Dong Liu (2019)
One-for-All: Grouped Variation Network-Based Fractional Interpolation in Video CodingIEEE Transactions on Image Processing, 28
Mading Li, Jiaying Liu, Zhiwei Xiong, Xiaoyan Sun, Zongming Guo (2016)
MARLow: A Joint Multiplanar Autoregressive and Low-Rank Approach for Image Completion
Mading Li, Jiaying Liu, J. Ren, Zongming Guo (2015)
Adaptive General Scale Interpolation Based on Weighted Autoregressive ModelsIEEE Transactions on Circuits and Systems for Video Technology, 25
[ (2020)
Versatile video coding (draft 9)Proceedings of the Document JVET-R2001.
Lei Zhao, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao (2019)
Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame GenerationIEEE Transactions on Image Processing, 28
F. Reda, Guilin Liu, Kevin Shih, Robert Kirby, Jon Barker, D. Tarjan, Andrew Tao, Bryan Catanzaro (2018)
SDC-Net: Video Prediction Using Spatially-Displaced Convolution
Modern codecs remove temporal redundancy of a video via inter prediction, i.e., searching previously coded frames for similar blocks and storing motion vectors to save bit-rates. However, existing codecs adopt block-level motion estimation, where a block is regressed by reference blocks linearly and is doomed to fail to deal with non-linear motions. In this article, we generate virtual reference frames (VRFs) with previously reconstructed frames via deep networks to offer an additional candidate, which is not constrained to linear motion structure and further significantly improves coding efficiency. More specifically, we propose a novel deep Auto-Regressive Moving-Average (ARMA) model, Error-Corrected Auto-Regressive Network (ECAR-Net), equipped with the powers of the conventional statistic ARMA models and deep networks jointly for reference frame prediction. Similar to conventional ARMA models, the ECAR-Net consists of two stages: Auto-Regression (AR) stage and Error-Correction (EC) stage, where the first part predicts the signal at the current time-step based on previously reconstructed frames, while the second one compensates for the output of the AR stage to obtain finer details. Different from the statistic AR models only focusing on short-term temporal dependency, the AR model of our ECAR-Net is further injected with the long-term dynamics mechanism, where long temporal information is utilized to help predict motions more accurately. Furthermore, ECAR-Net works in a configuration-adaptive way, i.e., using different dynamics and error definitions for the Low Delay B and Random Access configurations, which helps improve the adaptivity and generality in diverse coding scenarios. With the well-designed network, our method surpasses HEVC on average 5.0% and 6.6% BD-rate saving for the luma component under the Low Delay B and Random Access configurations and also obtains on average 1.54% BD-rate saving over VVC. Furthermore, ECAR-Net works in a configuration-adaptive way, i.e., using different dynamics and error definitions for the Low Delay B and Random Access configurations, which helps improve the adaptivity and generality in diverse coding scenarios.
ACM Transactions on Multimedia Computing Communications and Applications (TOMCCAP) – Association for Computing Machinery
Published: Jan 23, 2023
Keywords: High Efficient Video Coding (HEVC)
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.