Maria Meyer, M.Sc.
Researcher
Image and Video Coding Group
Research Topic
Video Coding
Contact Information
email: meyer@ient.rwth-aachen.de
Research Topic
Neuronal networks for intra-precision
A central part of image and video coding is the intra-prediction, which attempts to predict the content of an image block based on the already decoded environment as accurately as possible. In this way, redundancy can be removed from the image and the information content of the prediction difference can be reduced. In current standards, the adjacent pixels of the block are either repeated in a certain direction or summed up and weighted.
In recent years, machine learning techniques, especially neural networks, have achieved great success in both image analysis and classification, as well as in general prediction problems. Therefore, an attempt is made to use these methods also for the intra-prediction of video signals. First, networks must be trained to analyze the already encoded environment and generate a prediction of the block that is to be encoded. These must then be integrated into existing video codecs. Initial experiments have already shown that this can increase coding efficiency. Based on these initial results, it is now being investigated which networks and training methods are optimal for this use case in order to further increase the coding gain while also reducing the computational complexity in order to be able to decode in real time. |
One of the key points that need to be examined is the network architecture, as it not only has a significant influence on the quality of the prediction, but also determines the required computational complexity. A particularly interesting aspect is whether it is worthwhile to have such a network generate multiple predictions, of which the best can then be used, but must also be signaled. In addition, the optimal integration of such networks into the overall system of a video codec is an important aspect. In order limit the amount of additionally transmitted information as much as possible, it is particularly important to be able to estimate as precisely as possible in which cases the network-based prediction brings improvements and for which it is unsuitable. At the same time, however, the integration also determines what information and image areas are available for the prediction of a block.
Publications
There appears to be a problem with the publication server. Please try again later. If the problem persists, please contact our system administrator.
Standard Documents
There appears to be a problem with the publication server. Please try again later. If the problem persists, please contact our system administrator.
NOTICE FOR IEEE PUBLICATIONS: © IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from the IEEE. Contact: Manager, Copyrights and Permissions / IEEE Service Center / 445 Hoes Lane / P.O. Box 1331 / Piscataway, NJ 08855-1331, USA. Telephone: + Intl. 908-562-3966.