Abstract: Decoder for decoding a picture from a data stream, configured to check (106) whether a plurality (98) of coding parameters, which are contained in the data stream (14), relate to predetermined portion (100) of the picture (12) and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding. Responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding, the decoder is configured to set one or more predetermined coding options relating to one or more tools of the decoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion so that the one or more tools are disabled with respect to the predetermined portion.
Encoder, decoder, methods and computer programs for an improved lossless compression
Description
Technical Field
Embodiments according to the invention related to an encoder, a decoder, methods and computer programs for an improved lossless compression. The present invention relates to Extensions for lossless compression in modern hybrid image and video coding solutions.
Introductory remarks:
In the following, different inventive embodiments and aspects will be described. Also, further embodiments will be defined by the enclosed claims.
It should be noted that any embodiments as defined by the claims can be supplemented by any of the details (features and functionalities) described in the following different inventive embodiments and aspects.
Also, it should be noted that individual aspects described herein can be used Individually or in combination. Thus, details can be added to each of said individual aspects without adding details to another one of said aspects.
It should also be noted that the present disclosure describes, explicitly or implicitly, features usable in an encoder (apparatus for providing an encoded representation of an input signal) and in a decoder (apparatus for providing a decoded representation of a signal on the basis of an encoded representation), Thus, any of the features described herein can be used in the context of an encoder and in the context of a decoder.
Moreover, features and functionalities disclosed herein relating to a method can also be used in an apparatus (configured to perform such functionality). Furthermore, any features and functionalities disclosed herein with respect to an apparatus can also be used in a corresponding method. In other words, the methods disclosed herein can be supplemented by any of the features and functionalities described with respect to the apparatuses.
Also, any of the features and functionalities described herein can be implemented in hardware or in software, or using a combination of hardware and software, as will be described in the section “implementation alternatives".
Background of the invention
In state-of-the-art lossy video compression, the encoder quantizes the prediction residual or the transformed prediction residual using a specific quantization step size D. The smaller the step size, the finer the quantization and the smaller the error between original and reconstructed signal. Recent video coding standards (such as H.264 and H.265) derive that quantization step size D using an exponential function of a so-called quantization parameter (QP), e.g.:
The exponential relationship between quantization step size and quantization parameter allows a finer adjustment of the resulting bit rate. The decoder needs to know the quantization step size to perform the correct scaling of the quantized signal. This stage is sometimes referred to as "inverse quantization" although quantization is irreversible. That is why the decoder parses the scaling factor or QP from the bitstream. The QP signalling is typically performed hierarchically, i.e. a base QP is signalled at a higher level in the bitstream, e.g. at picture level. At sub-picture level, where a picture can consist of multiple slices, tiles or bricks, only a delta to the base QP is signalled. In order to adjust the bitrate at an even finer granularity, a delta QP can even be signalled per block or area of blocks, e.g. signaled in one transform unit within an NxN area of coding blocks in HEVC. Encoders usually use the delta QP technique for subjective optimization or rate-control algorithms. Without loss of generality, it is assumed in the following that the base unit in the presented invention is a picture, and hence, the base QP is signalled by the encoder for each picture consisting of a single slice. In addition to this base QP, also referred to as slice QP, a delta QP can be signalled for each transform block (or any union of transform block, also referred to as quantization group).
State-of-the-art video coding schemes, such as High Efficiency Video Coding (HEVC), or the upcoming Versatile Video Coding (WC) standard, optimize the energy compaction of various residual signal types by allowing additional transforms beyond widely used integer approximations of the type II discrete cosine transform (DCT-II). The HEVC standard further specifies an integer
approximation of the type-VII discrete sine transform (DST-VIt) for 4x4 transform blocks using specific intra directional modes. Due to this fixed mapping, there is no need to signal whether DCT-II or DST-VII is used. In addition to that, the identity transform can be selected for 4x4 transform blocks. Here the encoder needs to signal whether DCT-Il/DST-VII or identity transform is applied. Since the identity transform is the matrix equivalent to a multiplication with 1, it is also referred to as transform skip. Furthermore, the current VVC development allows the encoder to select more transforms of the DCT/DST family for the residual as well as additional non-separable transforms, which are applied after the DCT/DST transform at the encoder and before the inverse DCT/DST at the decoder. Both, the extended set of DCT/DST transforms and the additional non-separable transforms, require additional signalling per transform block.
Figure 1b illustrates the hybrid video coding approach with forward transform and subsequent quantization of the residual signal 24 at the encoder 10, and scaling of the quantized transform coefficients followed by inverse transform for the decoder 36. The transform and quantization related blocks 28/32 and 52/54 are highlighted.
Modern image and video coding solutions such as High Efficiency Video Coding (HEVC, H.265, ISO/IEC 23008-3) and the currently developed Versatile Video Coding (WC, H.266) allow to efficiently compress still or moving picture content even at very low bit-rates. The typical use case of these codec (coder-decoder) solutions is the lossy compression of high-resolution video material for broadcasting (e. g. , television) and streaming (e. g., video-over-IP) applications. Nonetheless, the codecs also support lossless compression, thus allowing mathematically perfect reconstruction of the coded input signals upon decoding. More specifically, HEVC provides several pulse code modulation (PCM) related coding tools as well as a so-called transquant bypass coding mode, which facilitates lossless coding by simplifying the entropy coding process and by disabling the quantization, transform (DCT or DST), and deblocking steps. Details can be found in the HEVC syntax and decoding specification which is publicly available [1].
In the current version of WC, the successor of HEVC under development, the lossless coding functionality of HEVC has largely been taken over, at least in the reference coding and decoding software [2], This means that both the PCM related coding tools as well as the transform quantization bypass coding mode are available for activation by both HEVC encoders and current WC reference encoder. Moreover, the transform quantization bypass coding flag (a 1-bit syntax element) is specified not only for an entire bit-stream or picture (frame) of a bit-stream but for individual subblocks (coding units, CUs or transform units, TUs) of said bit-stream or frame. In other words, in both HE VC and WC, transform quantization bypass coding can be enabled on a subblock basis, thus allowing to disable the quantization, transform, and deblocking coding tools individually per subblock.
Recently, a contribution to the WC standardization activity within JVET as described above has been introduced, which corrects a particular lossless coding related shortcoming of the transform skip coding functionality in the current WC draft, which also exists in HE VC and which specifies that, for a given coding subblock (CU or TU), the inverse residual coefficient transform operation (inverse DCT or DST) is bypassed. More specifically, the contribution proposes to restrict the quantization step-size, governed by a quantization parameter (QP), to a value greater than or equal to one (represented by a QP of 4) in case of activated transform skipping in a subblock. As a result, with a QP of 4 and disabled in-loop filtering in the spatial area covered by said subblock, lossless coding can be achieved when transform skipping is activated. This behavior, however, is identical to the use of the transform quantization bypass coding mode, as quantization with QP=4 (i. e., unity step-size) effectively represents the bypassing of the quantization algorithm.
Moreover, to reach lossless coding with acceptably low bit-rate using WC or any other video codec with similar loss coding functionality and tool sets, it may be necessary to alter the behavior of some other newly introduced coding tools, which are not available in HE VC and previous video coding standards. Specifically,
• it may be desirable to disable the newly introduced adaptive loop filter (ALF) and reshaper tools,
• the matrix of the joint inter-chroma transform [3] needs to be changed to allow lossless coding.
It is the objective of the present invention to provide a solution for the abovementioned two drawbacks of redundant lossless coding ability (regarding transform quantization bypass and transform skipping functionality) and the necessity of modifying behavioral details of some coding tools when lossless coding is desired.
Therefore, it is desired to provide concepts for lossless coding of a picture or a video with acceptably low bit-rate and/or an improved lossless compression.
This is achieved by the subject matter of the independent claims of the present application.
Further embodiments according to the invention are defined by the subject matter of the dependent claims of the present application.
Summary of the Invention
In accordance with a first aspect of the present invention, the inventors of the present application realized that one problem encountered when trying to improve a lossless coding of a picture or a video stems from the fact that some tools of a decoder or encoder, like adaptive loop filter (ALF) and reshaper tools, result in a lossy coding. According to the first aspect of the present application, this difficulty is overcome by disabling one or more tools for processing a prediction residual corrected predictive reconstruction to avoid a loss of information after a lossless reconstruction of a predetermined portion of a picture. In other words, post reconstruction modifications of samples of the predetermined portion of a picture are disabled. Additionally, the first aspect is based on the idea that the one or more tools can be disabled for lossless coding without the necessity of a syntax element indicating a lossless prediction residual coding for the predetermined portion of the picture. The decoder and encoder may be configured to infer that the lossless prediction residual coding is to be used for the predetermined portion of the picture based on the plurality of coding parameters contained in a data stream, whereby no explicit signaling of the lossless coding is necessary. The decoder is configured to advantageously switch between lossless coding and lossy coding for individual portions of a picture. No additional signaling indicating lossless coding for the individual portions of the picture is necessary. This results in a reduced amount of data to be encoded and in a reduced amount of data used to decode the predetermined portion of the picture. An improved lossless coding compression can be achieved. It is possible that the data stream comprises a lossless coding syntax element which indicates whether a predetermined region of the picture or the whole picture is coded using lossless coding, but it is no longer necessary that the data stream comprises for each portion of the picture the lossless coding syntax element. This is due to the ability of the decoder and/or encoder to check whether coding parameter indicate the lossless prediction residual coding. Furthermore, it was found, that a higher coding efficiency and a reduced bit rate can be achieved by combining the disabling of one or more tools of a decoder or encoder and the checking whether the plurality of coding parameters is indicative of a coding parameter setting corresponding to a lossless prediction residual coding.
Accordingly, in accordance with a first aspect of the present application, a decoder for decoding a picture from a data stream, is configured to check whether a plurality of coding parameters, e.g., a quantization parameter (QP) and/or a transform mode (TM), which are contained in the data stream, relate to a predetermined portion of the picture and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding. Such a coding parameter setting corresponding to a lossless prediction residual coding is, e.g., represented by (QP, TM) = (4, transform skip) or (QP, TM) = (1...4, transform skip). The coding parameter setting corresponds to the lossless prediction residual coding either because such a coding is immediately signaled by the coding parameter setting or by leading to such a coding by the decoder being configured to interpret or change such a coding parameter setting to the coding parameter setting leading to lossless residual coding, such as mapping of a quantization parameter smaller than four (QP<4) to a quantization parameter equal to four (QP=4) in case of the transform mode being transform skip. This interpretation or change of a coding parameter setting may be based on the plurality of coding parameters. The decoder may be configured to derive from the plurality of coding parameters whether lossy or lossless prediction residual coding is to be used for the predetermined portion of the picture. Responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding, the decoder is configured to set one or more predetermined coding options relating to one or more tools, e.g., deblocking, sample adaptive offset filtering (SAO) and/or adaptive loop filtering (ALF), of the decoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion so that the one or more tools are disabled with respect to the predetermined portion.
Parallel to the decoder an encoder for encoding a picture into a data stream, is configured to signal a plurality of coding parameters in the data stream and check whether the plurality of coding parameters, which relate to a predetermined portion of the picture and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding. Responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding, the encoder is configured to set one or more predetermined coding options relating to one or more tools of the encoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion so that the one or more tools are disabled with respect to the predetermined portion in a prediction-loop of the encoder.
In accordance with a second aspect of the present invention, the inventors of the present application realized that one problem encountered when trying to improve a lossless coding of a picture or a video stems from the fact that lossless coding typically produces significant higher bitrates compared to lossy coding. According to the second aspect of the present application, this difficulty is overcome by using different codes in binarization for lossless coding and lossy coding. It is possible to switch between different binarizations for lossless coding and lossy coding. It was found that a binarization with a reduced computational complexity with respect to the binarization in the lossy case reduces significantly the bitrates for the lossless coding. The computational complexity, for example, is reduced by coding more bins using an equi-probability bypass mode instead of a regular coding mode in CABAC that uses context modelling and binary arithmetic coding, in case of lossless coding compared to lossy coding. Thus processing bottlenecks can be prevented.
Accordingly, a second aspect of the present application is related to a decoder for decoding a picture from a data stream and an encoder for encoding a picture into a data stream. The decoder (encoder), is configured to determine for a predetermined portion of the picture, whether same is (to be) coded into the data stream using lossless coding or lossy coding. The decoder (encoder) is configured to decode (encode) a prediction residual from (Into) the data stream for the predetermined portion using binarization of prediction residual data info bin strings and context-adaptive binary entropy decoding (encoding) of the bin strings in a first manner, if the predetermined portion of the picture is (to be) coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is (to be) coded into the data stream using lossy coding. The first and second manners differ so that a computational complexity is reduced in the first manner compared to the second manner.
In accordance with a third aspect of the present invention, the inventors of the present application realized that one problem encountered when trying to improve a lossless coding of a picture or a video stems from the fact that a transformation by an encoder or an inverse transformation by a decoder may require rounding-to-integer steps forbidding a mathematically perfect reconstruction of a picture or a portion of the picture. Thus currently the transformation process is bypassed or disabled for lossless coding. According to the third aspect of the present application, this difficulty is overcome by using a perfectly invertible transform for lossless coding. The decoder/encoder is configured to switch between different transform modes for lossless coding and lossy coding. With the perfectly invertible transform a lossless coding can be achieved, since no rounding errors occur. The perfectly invertible transform is, for example, an inter-component transform or a spectrally decomposing intra-color-component transform. There might be additional syntax elements necessary in the data stream to indicate and/or control the perfectly invertible transform, but it was found that the bitrate can be significantly reduced, whereby the efficiency of the coding can be increased.
Accordingly, a third aspect of the present application is related to a decoder for decoding a picture from a data stream and an encoder for encoding a picture into a data stream. The decoder (encoder), is configured to determine for a predetermined portion of the picture, whether same is (to be) coded into the data stream using lossless coding or lossy coding. The decoder (encoder) is configured to perform on a prediction residual or a prediction residual corrected reconstruction (e.g. in a prediction-loop of the encoder) of the predetermined portion a perfectly invertible transform, if the predetermined portion of the picture is (to be) coded into the data stream using lossless coding, and a non-perfectly invertible transform, if the predetermined portion of the picture is (to be) coded into the data stream using lossy coding.
In accordance with a fourth aspect of the present invention, the inventors of the present application realized that one problem encountered when trying to improve a lossless coding of a picture or a video stems from the fact that smoothing filters result in a loss of information. According to the fourth aspect of the present application, this difficulty is overcome by disabling the filtering or by reducing a filtering used to obtain a prediction signal compared to a lossy coding for a portion of a picture which is to be decoded or encoded lossless. The disabling of the filtering for the portion results in no filtering related syntax elements in the data stream, whereby an improved bit stream compression can be achieved. Alternatively, a prediction signal for the portion of the picture is less filter in the lossless coding case compared to the lossy coding case. In case of lossless coding, for example, an interpolation filter can be applied, wherein a higher frequency half out of an overall spatial frequency spectrum of the prediction signal has higher energy in case of lossless coding than in case of lossy coding. This is based on the idea that it is advantageous to alter the prediction signal in case of lossless coding as little as possible at the derivation of the prediction signal. It was found that the special filtering option for lossless coding, resulting in a higher information content of the prediction signal, results in a reduced bitrate and thus in a high efficiency.
Accordingly, a fourth aspect of the present application is related to a decoder for decoding a picture from a data stream and an encoder for encoding a picture into a data stream. The decoder (encoder), is configured to determine for a predetermined portion of the picture, whether same is (to be) coded into the data stream using lossless coding or lossy coding. The decoder (encoder) is configured to derive a prediction signal for the predetermined portion in a first manner, if the predetermined portion of the picture is (to be) coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is (to be) coded into the data stream using lossy coding. The first and second manners differ so that the prediction signal is less filtered, e.g. by using an interpolation filter, in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner.
In accordance with a fifth aspect of the present invention, the inventors of the present application realized that one problem encountered when trying to improve a lossless coding of a picture or a video stems from the fact that in-loop filters and/or quantization result in a loss of information. According to the fifth aspect of the present application, this difficulty is overcome by disabling one or more tools of the decoder/encoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion and/or by disabling one or more tools of the decoder/encoder for prediction residual re-quantization/quantization. The disabling of the tools for the portion results in no or a reduced amount of postprocessing related syntax elements and/or (re-)quantization related syntax elements in the data stream, whereby an improved bit stream compression can be achieved.
Accordingly, a fifth aspect of the present application is related to a decoder for decoding a picture from a data stream and an encoder for encoding a picture into a data stream. The decoder (encoder), is configured to determine for a predetermined portion of the picture, whether same is (to be) coded into the data stream using lossless coding or lossy coding. The decoder (encoder) is configured to infer that one or more predetermined coding options relating to one or more tools of the decoder (encoder) for processing a prediction residual corrected predictive reconstruction (e.g. in a prediction-loop of the encoder) with respect to the predetermined portion and/or for prediction residual re-quantization (quantization) are to be set so that the one or more tools are disabled with respect to the predetermined portion, if the predetermined portion of the picture is (to be) coded into the data stream using lossless coding, and by deriving the one or more predetermined coding options from the plurality of coding parameters, if the predetermined portion of the picture is (to be) coded into the data stream using lossy coding.
In accordance with a sixth aspect of the present invention, the inventors of the present application realized that one problem encountered when trying to improve a lossless coding of a picture or a video stems from the fact that a difference between a bit-depth, i.e. an input bit-depth, associated with a video to be encoded or associated with a video to be output by a decoder and an internal bit-depth results in a loss of information. According to the sixth aspect of the present application, this difficulty is overcome by performing a bit-depth transition. Therefore, either the input bit-depth or the difference between the input bit-depth and the internal bit-depth is encoded into a data stream by an encoder and read from the data stream by a decoder. Alternatively, the input bit-depth or the aforementioned difference can be indicated by one or more coding parameters, like a quantization parameter (QP). With the information about the input bit-depth or the difference, the decoder is configured to output the video at the input bit-depth, whereby the bit depth of the video output equals the bit-depth of the video input encoded by the encoder. Thus lossless coding can be achieved. No additional syntax element is necessary in case of the bit-depth being indicated by the one or more parameters, whereby an improved bit stream compression can be achieved. Note that the input bit-depth - having its name from the fact that the input bit-depth is the one at which the encoder receives the video for encoding - which is the bit-depth at which the decoder is to output the video, needs not to be unequal to the internal bit-depth, but that the encoder and decoder allow for different settings in the data stream which may include one where both bit-depths are the same and one or more where this is not the case.
Accordingly, a sixth aspect of the present application is related to a video decoder and a video encoder. The decoder, is configured to perform video decoding from a data stream at an internal bit-depth and output a video at an input bit-depth or internal bit-depth. Parallel to the decoder, the encoder is configured to perform video encoding into the data stream at the internal bit-depth and receive video input at the input bit-depth or the internal bit-depth. The output video and/or the input video, for example, are associated with the internal bit-depth in case of the bit-depth of the input video and/or output video, i.e. the input bit-depth, being equal to the internal bit-depth. Additionally, the decoder (encoder) is configured to read (encode) from (into) the data stream a syntax element which indicates the input bit-depth and/or a difference between the internal bit-depth and the input bit-depth.
Accordingly, a sixth aspect of the present application is related to a video decoder configured to perform video decoding from a data stream at an internal bit-depth and output a video at an input bit-depth or the internal bit-depth. Parallel to the video decoder, a video encoder is configured to perform video encoding into the data stream at the internal bit-depth and receive video input at the input bit-depth or the internal bit-depth. The output video and/or the input video, for example, are associated with the internal bit-depth in case of the bit-depth of the input video and/or output video, i.e. the input bit-depth, being equal to the internal bit-depth. Additionally, the decoder (encoder) is configured to read (encode) from (into) the data stream a syntax element which indicates a minimum for a quantization step size scale parameter (QP minimum), i.e. an absolute or relative quantization step size, re-quantization step size or scaling parameter. The minimum quantization step size scale parameter may represent a unity-step size, i.e. a quantization parameter equal to four. The syntax element, for example, indicates the minimum for a quantization step size scale parameter owing to a difference of non-zero between the internal bit-depth and the input bit-depth. The decoder might be informed of this difference between the internal bit-depth and the input bit-depth as well, e.g., in addition to the QP minimum, either by way of transmitting the difference as described above, or by transmitting both values, i.e. the internal bit-depth and the input bit-depth. Another QP than four may result in, or almost in, lossless coding. A QP smaller than four may be adapted by the decoder to a QP equaling 4 if the QP minimum is set to four resulting in a lossless coding.
Accordingly, a sixth aspect of the present application is related to a video decoder (encoder) configured to decode (encode) from (into) a data stream an indication of an internal bit-depth and an input bit-depth or a difference between same and perform video decoding (encoding) from (into) the data stream at the internal bit-depth. Additionally, the decoder is configured to output a video at the input bit-depth and the encoder is configured to receive the video, i.e. video input, at the input bit-depth. The decoder (encoder) is configured to check whether the internal bit-depth falls below the input bit-depth and change the internal bit-depth to correspond to the input bit-depth. Thus a difference between the input bit-depth and the internal bit-depth is corrected and a lossless coding is achieved.
Embodiments are related to methods based on the same considerations as the above-described encoder and/or decoder. The methods can, by the way, be completed with all features and functionalities, which are also described with regard to the encoder and/or decoder.
An embodiment is related to a computer program having a program code for performing, when running on a computer, a herein described method.
An embodiment is related to a data stream obtained by a method for block-based encoding of a picture signal.
Brief Description of the Drawings
The drawings are not necessarily to scale, emphasis instead generaiiy being placed upon illustrating the principles of the invention. In the following description, various embodiments of the invention are described with reference to the following drawings, in which:
Fig. 1a shows a schematic view of an encoder;
Fig. 1b shows a schematic view of an alternative encoder;
Fig. 2 shows a schematic view of a decoder;
Fig. 3 shows a schematic view of a block-based coding;
Fig. 4 shows a schematic view of a decoder-side scaling and inverse transform in recent video coding standards;
Fig. 5 shows a schematic view of a decoder-side scaling and inverse transform according to an example;
Fig. 6 shows a schematic view of a lossless coding check performed by a decoder and/or encoder according to an embodiment;
Fig. 7 shows a block diagram of a method using dependent on lossless or lossy coding a different binarization;
Fig. 8 shows a block diagram of a method using a perfectly invertible transform for lossless coding;
Fig. 9 shows a block diagram of a method for a derivation of a prediction signal dependent on lossless or lossy coding;
Fig. 10 shows a block diagram of a method for setting one or more predetermined coding options dependent on lossless or lossy coding;
Fig. 11a and Fig. 11b shows a block diagram of a method considering an input bit-depth and an internal bitdepth, according to a first embodiment;
Fig. 12a and Fig. 12b shows a block diagram of a method considering an input bit-depth and an internal bitdepth, according to a second embodiment; and
Fig. 13a and Fig. 13b shows a block diagram of a method considering an input bit-depth and an internal bitdepth, according to a third embodiment.
Detailed Description of the Embodiments
Equal or equivalent elements or elements with equal or equivalent functionality are denoted in the following description by equal or equivalent reference numerals even if occurring in different figures.
In the following description, a plurality of details is set forth to provide a more throughout explanation of embodiments of the present invention. However, it will be apparent to those skilled in the art that embodiments of the present invention may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form rather than in detail in order to avoid obscuring embodiments of the present invention. In addition, features of the different embodiments described herein after may be combined with each other, unless specifically noted otherwise.
The following description of the figures starts with a presentation of a description of an encoder and a decoder of a block-based predictive codec for coding pictures of a video in order to form an example for a coding framework into which embodiments of the present invention may be built in.
The respective encoder and decoder are described with respect to Figures 1a to 3. The herein described embodiments of the concept of the present invention could be built into the encoder and decoder of Figures 1a, 1b and 2, respectively, although the embodiments described with Figures 4 to 6, may also be used to form encoders and decoders not operating according to the coding framework underlying the encoder and decoder of Figures 1a, 1b and 2.
Figure 1a shows an apparatus (e. g. a video encoder and/or a picture encoder) for predictively coding a picture 12 into a data stream 14 exemplarily using transform-based residual coding. The apparatus, or encoder, is indicated using reference sign 10. Figure 1 b shows also the apparatus for predictively coding a picture 12 into a data stream 14, wherein a possible prediction module 44 is shown in more detail. Figure 2 shows a corresponding decoder 20, i.e. an apparatus 20 configured to predictively decode the picture 12’ from the data stream 14 also using transform-based residual decoding, wherein the apostrophe has been used to indicate that the picture 12’ as reconstructed by the decoder 20 deviates from picture 12 originally encoded by apparatus 10 in terms of coding loss introduced by a quantization of the prediction residual signal. Figure 1a, 1b and Figure 2 exemplarily use transform based prediction residual coding, although embodiments of the present application are not restricted to this kind of prediction residual coding. This is true for other details described with respect to Figures 1a, 1b and 2, too, as will be outlined hereinafter.
The encoder 10 is configured to subject the prediction residual signal to spatial-to-spectral transformation and to encode the prediction residual signal, thus obtained, into the data stream 14. Likewise, the decoder 20 is configured to decode the prediction residual signal from the data stream 14 and subject the prediction residual signal, thus obtained, to spectral-to-spatial transformation.
Internally, the encoder 10 may comprise a prediction residual signal former 22 which generates a prediction residual 24 so as to measure a deviation of a prediction signal 26 from the original signal, i.e. from the picture 12, wherein the prediction signal 26 can be interpreted as a linear combination of a set of one or more predictor blocks according to an embodiment of the present invention. The prediction residual signal former 22 may, for instance, be a subtractor which subtracts the prediction signal from the original signal, i.e. from the picture 12. The encoder 10 then further comprises a transformer 28 which subjects the prediction residual signal 24 to a spatial-to-spectral transformation to obtain a spectral-domain prediction residual signal 24’ which is then subject to quantization by a quantizer 32, also comprised by the encoder 10. The thus quantized prediction residual signal 24” is coded into bitstream 14. To this end, encoder 10 may optionally comprise an entropy coder 34 which entropy codes the prediction residual signal as transformed and quantized into data stream 14.
The prediction signal 26 is generated by a prediction stage 36 of encoder 10 on the basis of the prediction residual signal 24" encoded into, and decodable from, data stream 14. To this end, the prediction stage 36 may internally, as is shown in Figure 1a, comprise a dequantizer 38 which dequantizes prediction residual signal 24” so as to gain spectral-domain prediction residua! signal 24’”, which corresponds to signal 24’ except for quantization loss, followed by an inverse transformer 40 which subjects the latter prediction residual signal 24”’ to an inverse transformation, i.e. a spectral-to-spatial transformation, to obtain prediction residual signal 24””, which corresponds to the original prediction residual signal 24 except for quantization loss. A combiner 42 of the prediction stage 36 then recombines, such as by addition, the prediction signal 26 and the prediction residual signal 24”” so as to obtain a reconstructed signal 46, i.e. a reconstruction of the original signal 12. Reconstructed signal 46 may correspond to signal 12’. A prediction module 44 of prediction stage 36 then generates the prediction signal 26 on the basis of signal 46 by using, for instance, spatial prediction, i.e. intra-picture prediction, and/or temporal prediction, i.e. inter-picture prediction, as shown in Figure 1b in more detail.
Likewise, decoder 20, as shown in Figure 2, may be internally composed of components corresponding to, and interconnected in a manner corresponding to, prediction stage 36. In particular, entropy decoder 50 of decoder 20 may entropy decode the quantized spectral-domain prediction residual signal 24” from the data stream, whereupon dequantizer 52, inverse transformer 54, combiner 56 and prediction module 58, interconnected and cooperating in the manner described above with respect to the modules of prediction stage 36, recover the reconstructed signal on the basis of prediction residual signal 24” so that, as shown in Figure 2, the output of combiner 56 results in the reconstructed signal, namely picture 12’.
Although not specifically described above, it is readily clear that the encoder 10 may set some coding parameters including, for instance, prediction modes, motion parameters and the like, according to some optimization scheme such as, for instance, in a manner optimizing some rate and distortion related criterion, i.e. coding cost. For example, encoder 10 and decoder 20 and the corresponding modules 44, 58, respectively, may support different prediction modes such as
intra-coding modes and inter-coding modes. The granularity at which encoder and decoder switch between these prediction mode types may correspond to a subdivision of picture 12 and 12’, respectively, into coding segments or coding blocks. In units of these coding segments, for instance, the picture may be subdivided into blocks being intra-coded and blocks being inter-coded.
Intra-coded blocks are predicted on the basis of a spatial, already coded/decoded neighborhood (e. g. a current template) of the respective block (e. g. a current block) as is outlined in more detail below. Several intra-coding modes may exist and be selected for a respective intra-coded segment including directional or angular intra-coding modes according to which the respective segment is filled by extrapolating the sample values of the neighborhood along a certain direction which is specific for the respective directional intra-coding mode, into the respective intra-coded segment. The intra-coding modes may, for instance, also comprise one or more further modes such as a DC coding mode, according to which the prediction for the respective intra-coded block assigns a DC value to all samples within the respective intra-coded segment, and/or a planar intra-coding mode according to which the prediction of the respective block is approximated or determined to be a spatial distribution of sample values described by a two-dimensional linear function over the sample positions of the respective intra-coded block with driving tilt and offset of the plane defined by the two-dimensional linear function on the basis of the neighboring samples.
Compared thereto, inter-coded blocks may be predicted, for instance, temporally. For inter-coded blocks, motion vectors may be signaled within the data stream 14, the motion vectors indicating the spatial displacement of the portion of a previously coded picture (e. g. a reference picture) of the video to which picture 12 belongs, at which the previously coded/decoded picture is sampled in order to obtain the prediction signal for the respective inter-coded block. This means, in addition to the residual signal coding comprised by data stream 14, such as the entropy-coded transform coefficient levels representing the quantized spectral-domain prediction residual signal 24”, data stream 14 may have encoded thereinto coding mode parameters for assigning the coding modes to the various blocks, prediction parameters for some of the blocks, such as motion parameters for inter-coded segments, and optional further parameters such as parameters for controlling and signaling the subdivision of picture 12 and 12’, respectively, into the segments. The decoder 20 uses these parameters to subdivide the picture in the same manner as the encoder did, to assign
the same prediction modes to the segments, and to perform the same prediction to result in the same prediction signal.
Figure 3 illustrates the relationship between the reconstructed signal, i.e. the reconstructed picture 12’, on the one hand, and the combination of the prediction residual signal 24”” as signaled in the data stream 14, and the prediction signal 26, on the other hand. As already denoted above, the combination may be an addition. The prediction signal 26 is illustrated in Figure 3 as a subdivision of the picture area into intra-coded blocks which are illustratively indicated using hatching, and inter-coded blocks which are illustratively indicated not-hatched. The subdivision may be any subdivision, such as a regular subdivision of the picture area into rows and columns of square blocks or non-square blocks, or a multi-tree subdivision of picture 12 from a tree root block into a plurality of leaf blocks of varying size, such as a quadtree subdivision or the like, wherein a mixture thereof is illustrated in Figure 3 in which the picture area is first subdivided into rows and columns of tree root blocks which are then further subdivided in accordance with a recursive multi-tree subdivisioning into one or more leaf blocks.
Again, data stream 14 may have an intra-coding mode coded thereinto for intra-coded blocks 80, which assigns one of several supported intra-coding modes to the respective intra-coded block 80. For inter-coded blocks 82, the data stream 14 may have one or more motion parameters coded thereinto. Generally speaking, inter-coded blocks 82 are not restricted to being temporally coded. Alternatively, inter-coded blocks 82 may be any block predicted from previously coded portions beyond the current picture 12 itself, such as previously coded pictures of a video to which picture 12 belongs, or picture of another view or an hierarchically lower layer in the case of encoder and decoder being scalable encoders and decoders, respectively.
The prediction residual signal 24”” in Figure 3 is also illustrated as a subdivision of the picture area into blocks 84. These blocks might be called transform blocks in order to distinguish same from the coding blocks 80 and 82. In effect, Figure 3 illustrates that encoder 10 and decoder 20 may use two different subdivisions of picture 12 and picture 12’, respectively, into blocks, namely one subdivisioning into coding blocks 80 and 82, respectively, and another subdivision into transform blocks 84. Both subdivisions might be the same, i.e. each coding block 80 and 82, may concurrently form a transform block 84, but Figure 3 illustrates the case where, for instance, a subdivision into transform blocks 84 forms an extension of the subdivision into coding blocks 80, 82 so that any border between two blocks of blocks 80 and 82 overlays a border between two
blocks 84, or alternatively speaking each block 80, 82 either coincides with one of the transform blocks 84 or coincides with a cluster of transform blocks 84. However, the subdivisions may also be determined or selected independent from each other so that transform blocks 84 could alternatively cross block borders between blocks 80, 82. As far as the subdivision into transform blocks 84 is concerned, similar statements are thus true as those brought forward with respect to the subdivision into blocks 80, 82, i.e. the blocks 84 may be the result of a regular subdivision of picture area into blocks (with or without arrangement into rows and columns), the result of a recursive multi-tree subdivisioning of the picture area, or a combination thereof or any other sort of blockation. Just as an aside, it is noted that blocks 80, 82 and 84 are not restricted to being of quadratic, rectangular or any other shape.
Figure 3 further illustrates that the combination of the prediction signal 26 and the prediction residual signal 24”” directly results in the reconstructed signal 12’. However, it should be noted that more than one prediction signal 26 may be combined with the prediction residual signal 24”” to result into picture 12’ in accordance with alternative embodiments.
In Figure 3, the transform blocks 84 shall have the following significance. Transformer 28 and inverse transformer 54 perform their transformations in units of these transform blocks 84. For instance, many codecs use some sort of DST (discrete sine transform) or DCT (discrete cosine transform) for all transform blocks 84. Some codecs allow for skipping the transformation so that, for some of the transform blocks 84, the prediction residual signal is coded in the spatial domain directly. However, in accordance with embodiments described below, encoder 10 and decoder 20 are configured in such a manner that they support several transforms. For example, the transforms supported by encoder 10 and decoder 20 could comprise:
• DCT-II (or DCT-III), where DCT stands for Discrete Cosine Transform
• DST-IV, where DST stands for Discrete Sine Transform
• DCT-IV
• DST-VII
• Identity Transformation (IT)
Naturally, while transformer 28 would support all of the forward transform versions of these transforms, the decoder 20 or inverse transformer 54 would support the corresponding backward or inverse versions thereof:
• Inverse DCT-II (or inverse DCT-III)
• Inverse DST-IV
• Inverse DCT-IV
• Inverse DST-VII
• Identity Transformation (IT)
The subsequent description provides more details on which transforms could be supported by encoder 10 and decoder 20. In any case, it should be noted that the set of supported transforms may comprise merely one transform such as one spectral-to-spatial or spatial-to-spectral transform, but it is also possible, that no transform is used by the encoder or decoder at all or for single blocks 80, 82, 84.
As already outlined above, Figures 1a to 3 have been presented as an example where the inventive concept described herein may be implemented in order to form specific examples for encoders and decoders according to the present application. Insofar, the encoder and decoder of Figures 1a, 1b and 2, respectively, may represent possible implementations of the encoders and decoders described herein before. Figures 1a, 1b and 2 are, however, only examples. An encoder according to embodiments of the present application may, however, perform block-based encoding of a picture 12 using the concept outlined in more detail before or hereinafter and being different from the encoder of Figure 1a or 1b such as, for instance, in that the sub-division into blocks 80 is performed in a manner different than exemplified in Figure 3 and/or in that no transform (e.g. transform skip/identity transform) is used at all or for single blocks. Likewise, decoders according to embodiments of the present application may perform block-based decoding of picture 12’ from data stream 14 using a coding concept further outlined below, but may differ, for instance, from the decoder 20 of Figure 2 in that same sub-divides picture 12’ into blocks in a manner different than described with respect to Figure 3 and/or in that same does not derive the prediction residual from the data stream 14 in transform domain, but in spatial domain, for instance and/or in that same does not use any transform at all or for single blocks.
According to an embodiment the inventive concept described in the following can concern the transformer 28/inverse transformer 40 and the quantizer 32/dequantizer 38 of the encoder or the inverse transformer 54 and the dequantizer 52 of the decoder. According to an embodiment, the transformer 28, the inverse transformer 40, 54, the quantizer 32 and/or the dequantizer 38, 52 can be disabled for lossless coding of a block of a picture, wherein lossless coding is indicated by coding parameters, like a quantization parameter and a transform mode. The inventive concept may also concern further processing of the prediction residual 24” and/or the prediction signal 26 and/or the prediction residual corrected predictive reconstruction 46, wherein the focus is on the function of the decoder/encoder at lossless coding.
The quantization step size, i.e. the quantization accuracy, can be varied depending on the selected transform and transform block size as described below. The description is written from the decoder perspective and the decoder-side scaling 52 (multiplication) with the quantization step size can be seen as being the inverse (non-reversible) of the encoder-side division by the step size.
On the decoder side, the scaling 52, i.e. the dequantization, of (quantized) transform coefficient levels in current video coding standards like H, 265/HE VC is designed for transform coefficients resulting from DCT/DST integer transforms with higher precision as illustrated in Figure 4. In there, the variable bitDepth specifies the bit depth of the image samples, e.g. 8 or 10-bit. The variables log2TbW and log2TbH specify the binary logarithm of the transform block width and height, respectively. Fig. 4 shows a decoder-side scaling 52 and inverse transform 54 in recent video coding standards such as H.265/HEVC.
It should be noted that, at the decoder, the two 1D DCT/DST -based integer transforms 128i introduce an additional factor of 212 which needs to be compensated by
scaling with the inverse. For non-square blocks with an odd log2TbH + log2TbW, the scaling includes a factor of V2. This can be taken into account by either adding a scale factor of 181/256 or using a different set of levelScale values that incorporate that factor for this case, e.g. levelScale[ ] = {29,32,36,40,45,51}. For the identity transform or transform skip case 1282, this does not apply.
It can be seen that the step size or scaling factor becomes smaller then
1 for QPs less than 4 because levelScale for these QPs is less than 64=26. For the transform coefficients, this is not a problem since the integer forward transform 1281 increases the precision of the residual signal and consequently the dynamic range. However, for the residual signal in case of the identity transform or transform skip 1282, there is no increase in dynamic range. In this case, the scaling factor less than 1 could introduce a distortion for the QPs < 4 which is not there for QP 4, which has a scaling factor of 1. This is contradictory to the quantizer design intent where decreasing the QP should decrease the distortion.
Varying the quantization step size depending on the selected transform, e.g. whether the transform is skipped or not, could be used to derive a different quantization step size for transform skip 1282. Especially for the lowest QPs 0,1,2 and 3, this would solve the problem of having a quantization step size / scaling factor less than 1 for the lowest QPs. In one example shown in Figure 5, the solution could be to clip 53 the quantization parameter to the minimum allowed value of four ( QP' ), resulting in a quantization step size that cannot be lower than one. In addition to that, the size-dependent normalization 541 with bdShift1 and the final rounding 542 to the bit depth with bdShift2, required by the transform, can be moved to the transform path 54. This would reduce the transform skip scaling to a downshift by 10-bit with rounding. In another example, a bitstream restriction can be defined that does not allow an encoder to use QP values that result in a scaling factor of less than 1 for transform skip instead of clipping the QP value to 4. Fig. 5 shows an improved decoder-side scaling 52 and inverse transform 54 according the present invention.
At the other end of the bit-rate range, i.e. for lower bit rates, the quantization step size for the identity transform 1282 may be decreased by an offset, resulting in a higher fidelity for block that does not apply a transform or that does apply the identity transform 1282. This would enable the encoder to select appropriate QP values for transform skip blocks to achieve higher compression efficiency. This aspect is not limited to the identity transform / transform skip 1282, it can also be used to modify the QP for other transform types 128i by an offset. An encoder would, e.g., determine this offset in a way that increases the coding efficiency, e.g. by maximizing perceived visual quality or minimizing objective distortion like a square error for a given bitrate, or by reducing the bitrate for a given quality/distortion. This (in terms of the applied criterion) optimal derivation from the slice QP depends, for example, on the content, bit-rate or complexity operation point, and further factors such as selected transform and transform block size. The present invention describes methods for signaling the QP offset for the case of multiple transforms. Without loss of generality, given two alternative transforms, a fixed QP offset may be transmitted by the encoder in a high-level syntax structure (such as sequence parameter set, picture parameter set, tile group header, slice header, or similar) for each of the two alternative transforms. Alternatively, the QP offset is, e.g., transmitted by the encoder for each transform block when the encoder has selected the alternative transform. A combination of the two approaches is the signaling of a basis QP offset in a high-level syntax structure and an additional offset for each transform block that uses the alternative transform. The offset can be a value that is added or subtracted to a basis QP or an index into a set of offset values. That set can be predefined or signaled in a high-level syntax structure.
• the QP offset relative to a basis QP for the identity transform may be signaled in a high- level syntax structure, e.g on sequence, picture, tile group, tile, or slice level.
• alternatively, the QP offset relative to a basis QP for the identity transform may be signaled for each coding unit or predefined set of coding units.
• alternatively, the QP offset relative to a basis QP, for the identity transform is signaled for each transform unit that applies the identity transform.
Fig. 6 shows features and functionalities of a decoder, for example, of a decoder 20 or 36 shown in Figs. 1a to 2. Also the following description is mainly written from the decoder perspective it is clear that an encoder may comprise parallel features.
A picture 12 is encoded into a data stream 14 by the encoder and the decoder is configured to provide a reconstructed picture based on the data stream 14, wherein the reconstructed picture equals the picture 12 or has no recognizable minimal differences to the picture 12 in case of lossless coding. The picture 12 can be divided into portions 100, i.e. blocks, and regions 104. A region 104 comprises a plurality of portions 100. A predetermined portion 100 is within a predetermined region 104. The data stream 14 may comprise portion individual information, like a plurality 98 of coding parameters, and region individual information, like optionally a lossless coding syntax element 102. The plurality 98 of coding parameters relate to a predetermined portion 100 of the picture 12 and control a prediction residual transform mode, e.g. within the transformer 28 and the inverse transformer 40, 54 shown in Figs. 1a to 2, and a quantization accuracy, e.g. within the quantizer 28 and the dequantizer 40, 54 shown in Figs. 1a to 2, with respect to the predetermined portion 100. The prediction residual transform mode may be controlled by a transform mode indicating syntax element 98i. The quantization accuracy may be controlled by a quantization parameter (QP) 982.
A decoder for decoding the picture 12 from the data stream 14 is configured to check 106 whether a plurality 98 of coding parameters, is indicative of a coding parameter setting corresponding, either because immediately signaling such a coding, or by leading to such a coding by the decoder being configured to interpret or change such a setting to a setting leading to lossless residual coding, such as mapping QP<4 to QP= 4 in case of TM = transform skip, to a lossless prediction
residual coding I061. An encoder for encoding the picture 12 into the data stream 14 may also be configured to check whether the plurality 98 of coding parameters, is indicative of a coding parameter setting corresponding to the lossless prediction residual coding 1061. (QP,TM)=(4,transform skip) or (QP, TM) = (1...4, transform skip) may represent a coding parameter setting corresponding to lossless prediction residual coding 1061. A coding parameter setting of a quantization parameter 982 to a quantization accuracy equal or finer than a predetermined quantization accuracy and/or a transform mode indicating syntax element 981 to a transform skip as transform mode may correspond to the lossless prediction residual coding 1061.
According to an embodiment, the decoder/encoder is configured to read/signal the plurality 98 of coding parameters from/into the data stream 14 and check whether the prediction residual transform mode indicated by the plurality 98 of coding parameters, e.g. indicated by the transform mode indicating syntax element 98i, corresponds to a transform skip mode and whether a quantization accuracy indicated by the plurality 98 of coding parameters, e.g. indicated by the quantization parameter 982, corresponds to a quantization step size finer than a predetermined quantization step size, e.g., corresponds to QP<4, corresponding to no quantization and, if yes, change the quantization step size to the predetermined quantization step size, e.g. QP=4. If the transform mode is set to transform skip, the decoder/encoder is configured to map a quantization parameter 982 finer than the predetermined quantization accuracy (QP<4) to a quantization parameter 982 equal the predetermined quantization accuracy to enable a lossless coding. The predetermined quantization accuracy might represent no quantization 32, 52 or a bypassing or disabling of a quantization 32, 52.
Responsive to the plurality 98 of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding 1061, the decoder/encoder is configured to set 110 one or more predetermined coding options relating to one or more tools of the decoder/encoder for processing a prediction residual corrected predictive reconstruction (e.g. in the prediction-loop of the encoder) with respect to the predetermined portion 100 so that the one or more tools are disabled with respect to the predetermined portion 100. The prediction residual corrected predictive reconstruction may represent the output, i.e. the reconstructed signal 46, of the combiner 56 or 42, respectively, as shown in Figs. 1a to 2. The one or more tools, e.g., relate to deblocking, SAO, ALF and may be positioned downstream the output 46 of the combiner 56 or 42, respectively.
According to an embodiment the decoder/encoder is configured to set the one or more predetermined coding options with respect to the predetermined portion 100 so that the one or more tools are disabled 110 with respect to the predetermined portion 100 if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding 1061 and to a predetermined tool state if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding, i.e. in a lossy coding case 1062.
The decoder is optionally configured to read from the data stream 14 a lossless coding syntax element 102 (e.g., determined and encoded by the encoder) which indicates whether a predetermined region 104 of the picture, which covers or contains the predetermined portion 100, is coded into the data stream 14 using, not exclusively, but for portions fulfilling the check 106, lossless coding or lossy coding. The decoder is configured to set 110 the one or more predetermined coding options so that the one or more tools are disabled with respect to the predetermined portion 100 if the lossless coding syntax element 102 indicates that the predetermined region 104 of the picture 12 is coded into the data stream 14 using lossless coding 1061, and if the plurality 98 of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding 1061. Additionally, the decoder is configured to set 120 the one or more predetermined coding options to a predetermined tool state if the plurality 98 of coding parameters do not indicate, i.e. are not equal to, the coding parameter setting corresponding to the lossless prediction residual coding 1061 or the lossless coding syntax element 102 indicates that the predetermined region 104 of the picture 12 is coded into the data stream 14 using lossy coding 1062. The lossless coding syntax element 102 is signaled for a region 104 of the picture 12 and for each portion 100 within the region 104 the decoder/encoder checks 106 whether the plurality 98 of coding parameters indicate for the individual portion 100 a lossless coding 1061 or a lossy coding 1062. Thus it is possible that some of the portions are decoded/encoded differently than indicated by the lossless coding syntax element 102 for the whole region 104.
According to an embodiment, the decoder is configured to determine the predetermined tool state depending on one or more syntax elements 108, e.g. syntax elements relating to SAO, ALF or the like, in the data stream 14. In case of lossless coding 1061 the decoder is configured to skip 122 reading the one or more tool syntax elements 108, since the one or more tools are disabled 110. Optionally, at least one of the one or more syntax elements 108 is absent from the data
stream 14 if the one or more predetermined coding options with respect to the predetermined portion 100 are set so that the one or more tools are disabled 110 with respect to the predetermined portion 100, compare aspect 3.4.
According to an embodiment, the decoder is configured to set 130 one or more further coding options with respect to the predetermined portion 100, e.g., described with respect to the following aspects 1 to 3.4 in the description below, to a default state responsive to the plurality 98 of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding 1061. The default state may represent a reduction of a filtering or a disabling of a filtering, cf. aspect 3.3, in terms of low-pass filtering for a derivation of a prediction signal, e.g. the prediction signal 26 shown in Figs. 1a to 2, for the predetermined portion 100 and/or a perfectly invertible transform, cf. aspect 3.2, to be performed on a prediction residual signal, e.g. the prediction residual signal 24 shown in Figs. 1a to 2. The further coding options may relate to a binarization, cf. aspect 3.1 , of prediction residual data into bin strings and context-adaptive binary entropy decoding of the bin strings and/or a usage of a perfectly invertible transform, cf. aspect 3.2, on a prediction residual 24’”, 24”" or a prediction residual corrected reconstruction 46 of the predetermined portion 100 and/or a disabling or reduction of a filtering, cf. aspect 3.3, for a derivation of the prediction signal 26 for the predetermined portion and/or a disabling of a processing of a prediction residual corrected predictive reconstruction 46 with respect to the predetermined portion 100 or a prediction residual re-quantization, cf. aspect 3.4. Similarly, for an encoder the further coding options may relate to a binarization of prediction residual data into bin strings and context-adaptive binary entropy encoding of the bin strings and/or a usage of a perfectly invertible transform on a prediction residual 24’”, 24”” or a prediction residual corrected reconstruction 46 of the predetermined portion in a prediction-loop of the encoder and/or a disabling or reduction of a filtering for a derivation of a prediction signal 26 for the predetermined portion and/or a disabling of a processing of a prediction residual corrected predictive reconstruction 46 with respect to the predetermined portion 100 or a prediction residual quantization.
An encoder can be configured to determine and encode the plurality 98 of coding parameters, lossless coding syntax element 102 and/or the tool syntax elements 108 in the data stream 14. The encoder can comprise parallel features and/or functionalities as described with regard to the decoder. This applies at least for the prediction-loop 36 of the encoder which has the same features and/or functionalities as the decoder. But it is clear, that features relating to the inverse transformer 54 or the dequantizer 52 of the decoder can similarly also be applied to the transformer 28 and/or the quantizer 32 of the encoder.
One or more of the following aspects may be integrated in the decoder/encoder described with regard to Fig. 6. Alternatively, the following aspects can be implemented individually in a decoder or encoder.
Aspect 1: Reduction of a number of syntax elements for lossless image or video coding
The basic approach of this aspect is to make a subblock-wise transform quantization bypass coding flag obsolete since, with the corrections described above, its functionality is completely provided by the subblock-wise transform skipping functionality when a coding quantization parameter (QP) of 4 (unity step-size) is employed and post reconstruction modifications of block samples by coding tools such as in-loop filters are disabled. The deactivation of such post filters can be achieved by the conditioning on the transform skip mode and a QP that is lower or equal than four.
In the following paragraphs, different possible implementations are specified.
An embodiment of aspect 1 is by means of a coding block-wise (here, transform block-wise and color component-wise) one-bit transform skipping indicator, called transform_skip_flag. This indicator is transmitted as part of the transform unit coding syntax, for example, as shown in table 1. The plurality 98 of coding parameters may comprise the transform_skip_flag, i.e. the transform mode indicating syntax element 98i, controlling the prediction residual transform mode with respect to the predetermined portion 100 of the picture 12. The transform_skip_flag defines a transform skip as the transform mode, i.e. the prediction residual transform mode, for the predetermined portion 100.
Claims
1. Decoder for decoding a picture from a data stream, configured to
check (106) whether a plurality (98) of coding parameters, which are contained in the data stream (14), relate to a predetermined portion (100) of the picture (12) and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding,
responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding, setting one or more predetermined coding options relating to one or more tools of the decoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion so that the one or more tools are disabled with respect to the predetermined portion.
2. Decoder of claim 1 , configured to
read from the data stream a lossless coding syntax element (102) which indicates whether a predetermined region (104) of the picture, which covers or contains the predetermined portion, is coded into the data stream using lossless coding or lossy coding, and
set the one or more predetermined coding options
so that the one or more tools are disabled with respect to the predetermined portion if the lossless coding syntax element indicates that the predetermined region of the picture is coded into the data stream using lossless coding, and if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding and
to a predetermined tool state if the plurality of coding parameters not equal the coding parameter setting corresponding to the lossless prediction residual coding or the lossless coding syntax element indicates that the predetermined region of the picture is coded into the data stream using lossy coding.
3. Decoder of claim 2, configured to determine the predetermined tool state depending on one or more syntax elements (108) in the data stream.
4. Decoder of any of claims 1 to 3, configured to set one or more further coding options with respect to the predetermined portion to a default state responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
5. Decoder of claim 4, wherein the default state represents
a reduction of a filtering or a disabling of a filtering in terms of low-pass filtering for a derivation of a prediction signal (26) for the predetermined portion; and/or
a perfectly invertible transform to be performed on the prediction residual signal.
6. Decoder of claim 4 or 5, wherein the further coding options relate to
a binarization of prediction residual data into bin strings and context-adaptive binary entropy decoding of the bin strings;
a usage of a perfectly invertible transform on a prediction residual (24’", 24””) or a prediction residual corrected reconstruction (46) of the predetermined portion;
a disabling or reduction of a filtering for a derivation of a prediction signal (26) for the predetermined portion; and/or
a disabling of a processing of a prediction residual corrected predictive reconstruction with respect to the predetermined portion or a prediction residual re-quantization.
7. Decoder of any of claims 1 to 6, configured to
Decode a prediction residual from the data stream for the predetermined portion using binarization of prediction residual data into bin strings and context-adaptive binary entropy decoding of the bin strings in a first manner, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and in a second manner, if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding, wherein the first and second manners differ so that a computational complexity is reduced in the first manner compared to the second manner.
8. Decoder of any of claims 1 to 7, configured to
perform on a prediction residual (24’”, 24’”’) or a prediction residual corrected reconstruction (46) of the predetermined portion a perfectly invertible transform, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and a non-perfectly invertible transform, if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
9. Decoder of any of claims 1 to 8, configured to
derive a prediction signal (26) for the predetermined portion in a first manner, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and in a second manner, if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding, wherein the first and second manners differ so that the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner
10. Decoder of any of claims 1 to 9, configured to
infer that one or more predetermined coding options relating to one or more tools of the decoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion or for prediction residual re-quantization are to be set so that the one or more tools are disabled with respect to the predetermined portion, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and by deriving the one or more predetermined coding options from the plurality of coding parameters, if the plurality of
coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
11. Decoder of any of claims 1 to 10, configured to
set the one or more predetermined coding options with respect to the predetermined portion
so that the one or more tools are disabled with respect to the predetermined portion if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding and
to a predetermined tool state if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
12. Decoder of claim 11, configured to determine the predetermined tool state depending on one or more syntax elements in the data stream.
13. Decoder of claim 3 or 12, where at least one of the one or more syntax elements is absent from the data stream if the one or more predetermined coding options with respect to the predetermined portion are set so that the one or more tools are disabled with respect to the predetermined portion.
14. Decoder of any of claims 1 to 13, configured to
read the plurality of coding parameters from the data stream,
check whether the prediction residual transform mode indicated by the plurality of coding parameters corresponds to a transform skip mode and to a quantization step size finer than a predetermined quantization step size corresponding to no quantization, and, if yes, change the quantization step size to the predetermined quantization step size.
15. Decoder for decoding a picture from a data stream, configured to
determine for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
Decode a prediction residual from the data stream for the predetermined portion using binarization of prediction residual data into bin strings and context-adaptive binary entropy decoding of the bin strings in a first manner, if the predetermined portion of the picture is coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is coded into the data stream using lossy coding, wherein the first and second manners differ so that a computational complexity is reduced in the first manner compared to the second manner.
16. Decoder of claim 15, configured so that the computational complexity is reduced in the first manner compared to the second manner by at least one of
a number of bins of the bin strings coded using equi-probability bypass mode is greater in the first manner than in the second manner, and
different binarization codes being used in the first and second manner.
17. Decoder of claim 16, configured so that
an amount of bins of the bin strings belonging to a unary code or truncated unary code of the bin strings is lower in the first manner than in the second manner.
18. Decoder for decoding a picture from a data stream, configured to
determine for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
perform on a prediction residual (24”’, 24””) or a prediction residual corrected reconstruction (46) of the predetermined portion a perfectly invertible transform, if the predetermined portion of the picture is coded into the data stream using lossless coding, and a non-perfectly invertible transform, if the predetermined portion of the picture is coded into the data stream using lossy coding.
19. Decoder of claim 18, wherein the perfectly invertible transform and the non-perfectly invertible transform are inter-color-component transforms or are spectrally decomposing intracolor-component transforms.
20. Decoder for decoding a picture from a data stream, configured to
determine, for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
derive a prediction signal (26) for the predetermined portion in a first manner, if the predetermined portion of the picture is coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is coded into the data stream using lossy coding, wherein the first and second manners differ so that the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner.
21. Decoder of claim 20, wherein the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner in terms of low-pass filtering.
22. Decoder for decoding a picture from a data stream, configured to
determine, for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
infer that one or more predetermined coding options relating to one or more tools of the decoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion or for prediction residual re-quantization are to be set so that the one or more tools are disabled with respect to the predetermined portion, if the predetermined portion of the picture is coded into the data stream using lossless coding, and by deriving the one or more predetermined coding options from a plurality of coding parameters, if the predetermined portion of the picture is coded into the data stream using lossy coding.
23. Decoder of any of claims 15 to 22, configured to perform the determining based on the data stream.
24. Decoder of any of claims 15 to 23, configured to perform the determining by
Reading from the data stream a lossless coding syntax element which indicates whether the predetermined portion of the picture, or a predetermined region containing the predetermined portion, is coded into the data stream using lossless coding or lossy coding, and
Performing the determination depending on the lossless coding syntax element.
25. Decoder of any of claims 15 to 24 configured to perform the determining by
checking (106) whether a plurality (98) of coding parameters, which are contained in the data stream (14), relate to the predetermined portion (100) of the picture (12) and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding,
determine whether the predetermined portion of the picture is coded into the data stream using lossless coding or lossy coding depending on whether the plurality (98) of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
26. Video decoder configured to
perform video decoding from a data stream at an internal bit-depth and video output at an input bit-depth or internal bit-depth,
read from the data stream a syntax element which indicates a difference between the internal bit-depth and the input bit-depth.
27. Video decoder of claim 26, configured to perform a bit-depth transition from an internally-reconstructed video version to the input bit-depth before the video output.
28. Video decoder of claim 26 or 27, configured to
deduce a minimum for a quantization step size scale parameter based on the difference, and
adhere to the minimum quantization step size scale parameter for portions of the video coded in a prediction residual transform skip mode.
29. Video decoder of any of claims 26 to 28 , configured to
in adhering to the minimum quantization step size scale parameter for video portions coded in a prediction residual transform skip mode, change a signaled quantization step size scale parameter signaled in the data stream for the video portions, to equal the minimum quantization step size scale parameter in case of the signaled quantization step size scale parameter falling below the minimum quantization step size scale parameter.
30. Video decoder of claim 28 or 29, configured to adhere to the minimum quantization step size scale parameter merely within video regions for which the data stream signals a lossless coding mode.
31. Video decoder configured to
perform video decoding from a data stream at an internal bit-depth and video output at an input bit-depth or interna! bit-depth,
read from the data stream a syntax element which indicates a minimum for a quantization step size scale parameter.
32. Video decoder of claim 31, configured to
adhere to the minimum quantization step size scale parameter for video portions coded in a prediction residual transform skip mode.
33. Video decoder of claim 31 or 32 , configured to
in adhering to the minimum quantization step size scale parameter for video portions coded in a prediction residual transform skip mode, change a signaled quantization step size scale parameter signaled in the data stream for the portions, to equal the minimum quantization step size scale parameter in case of the signaled quantization step size scale parameter falling below the minimum quantization step size scale parameter.
34. Video decoder of claim 32 or 33, configured to adhere to the minimum quantization step size scale parameter merely within video regions for which the data stream signals a lossless coding mode
35. Video decoder configured to
derive from a data stream video an indication of an internal bit-depth and an input bit-depth or a difference between same,
perform video decoding from the data stream at the internal bit-depth and video output at the input bit-depth,
check whether the internal bit-depth falls below the input bit-depth and change the internal bit-depth to correspond to the input bit-depth.
36. Video decoder of claim 35, configured to
derive from the data stream an indication of a lossless coded video portion, and
perform the checking and the changing within the lossless coded video portion and use the internal bit-depth as derived from the data stream for a lossy coded video portion.
37. Encoder for encoding a picture into a data stream, configured to
signal a plurality (98) of coding parameters in the data stream (14);
check (106) whether the plurality (98) of coding parameters, which relate to a predetermined portion (100) of the picture (12) and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding,
responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding, setting one or more predetermined coding options relating to one or more tools of the encoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion so that the one or more tools are disabled with respect to the predetermined portion in a prediction-loop of the encoder.
38. Encoder of claim 37, configured to
encode into the data stream a lossless coding syntax element (102) which indicates whether a predetermined region (104) of the picture, which covers or contains the predetermined portion, is to be coded into the data stream using lossless coding or lossy coding, and
set the one or more predetermined coding options
so that the one or more tools are disabled with respect to the predetermined portion if the lossless coding syntax element indicates that the predetermined region of the picture is to be coded into the data stream using lossless coding, and if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding and
to a predetermined tool state if the plurality of coding parameters not equal the coding parameter setting corresponding to the lossless prediction residual coding or the lossless coding syntax element indicates that the predetermined region of the picture is to be coded into the data stream using lossy coding.
39. Encoder of claim 38, configured to
determine and encode one or more syntax elements (108) in the data stream; and
determine the predetermined tool state depending on the one or more syntax elements (108).
40. Encoder of any of claims 37 to 39, configured to set one or more further coding options with respect to the predetermined portion to a default state responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
41. Encoder of claim 40, wherein the default state represents
a reduction of a filtering or a disabling of a filtering in terms of low-pass filtering for a derivation of a prediction signal (26) for the predetermined portion; and/or
a perfectly invertible transform to be performed on the prediction residual signal.
42. Encoder of claim 40 or 41 , wherein the further coding options relate to
a binarization of prediction residual data into bin strings and context-adaptive binary entropy encoding of the bin strings;
a usage of a perfectly invertible transform on a prediction residual (24’”, 24””) or a prediction residual corrected reconstruction (46) of the predetermined portion in a prediction-loop of the encoder;
a disabling or reduction of a filtering for a derivation of a prediction signal (26) for the predetermined portion; and/or
a disabling of a processing of a prediction residual corrected predictive reconstruction with respect to the predetermined portion or a prediction residual quantization.
43. Encoder of any of claims 37 to 42, configured to
encode a prediction residual into the data stream for the predetermined portion using binarization of prediction residual data into bin strings and context-adaptive binary entropy encoding of the bin strings in a first manner, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and in a second manner, if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding, wherein the first and second manners differ so that a computational complexity is reduced in the first manner compared to the second manner.
44. Encoder of any of claims 37 to 43, configured to
perform on a prediction residual (24’”, 24””) or a prediction residual corrected reconstruction (46) of the predetermined portion a perfectly invertible transform, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and a non-perfectly invertible transform, if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
45. Encoder of any of claims 37 to 44, configured to
encode a prediction signal (26) for the predetermined portion in a first manner, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and in a second manner, if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding, wherein the first and second manners differ so that the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner
46. Encoder of any of claims 37 to 45, configured such
infer that one or more predetermined coding options relating to one or more tools of the encoder for processing a prediction residual corrected predictive reconstruction with
respect to the predetermined portion or for prediction residual quantization are to be set so that the one or more tools are disabled with respect to the predetermined portion, if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding, and by deriving the one or more predetermined coding options from the plurality of coding parameters, if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
47. Encoder of any of claims 37 to 46, configured to
set the one or more predetermined coding options with respect to the predetermined portion
so that the one or more tools are disabled with respect to the predetermined portion if the plurality of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding and
to a predetermined tool state if the plurality of coding parameters are not indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
48. Encoder of claim 47, configured to
determine and encode one or more syntax elements (108) in the data stream; and
determine the predetermined tool state depending on the one or more syntax elements.
49. Encoder of claim 39 or 48, where at least one of the one or more syntax elements is not encoded into the data stream if the one or more predetermined coding options with respect to the predetermined portion are set so that the one or more tools are disabled with respect to the predetermined portion.
50. Encoder of any of claims 37 to 49, configured to
signal the plurality of coding parameters into the data stream
check whether the prediction residual transform mode indicated by the plurality of coding parameters corresponds to a transform skip mode and to a quantization step size finer than a predetermined quantization step size corresponding to no quantization, and, if yes, change the quantization step size to the predetermined quantization step size.
51. Encoder for encoding a picture into a data stream, configured to
determine for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
encode a prediction residual into the data stream for the predetermined portion using binarization of prediction residual data into bin strings and context-adaptive binary entropy encoding of the bin strings in a first manner, if the predetermined portion of the picture is to be coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is to be coded into the data stream using lossy coding, wherein the first and second manners differ so that a computational complexity is reduced in the first manner compared to the second manner.
52. Encoder of claim 51, configured so that the computational complexity is reduced in the first manner compared to the second manner by at least one of
a number of bins of the bin strings coded using equi-probability bypass mode is greater in the first manner than in the second manner, and
different binarization codes being used in the first and second manner,
53. Encoder of claim 52, configured so that
an amount of bins of the bin strings belonging to a unary code or truncated unary code of the bin strings is lower in the first manner than in the second manner.
54. Encoder for encoding a picture into a data stream, configured to
determine for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
perform on a prediction residual (24'”, 24"") or a prediction residual corrected reconstruction (46) of the predetermined portion a perfectly invertible transform, if the predetermined portion of the picture is to be coded into the data stream using lossless coding, and a non-perfectly invertible transform, if the predetermined portion of the picture is to be coded into the data stream using lossy coding.
55. Encoder of claim 54, wherein the perfectly invertible transform and the non-perfectly invertible transform are inter-color-component transforms or are spectrally decomposing intra-color-component transforms.
56. Encoder for encoding a picture into a data stream, configured to
determine, for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
derive a prediction signal (26) for the predetermined portion in a first manner, if the predetermined portion of the picture is to be coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is to be coded into the data stream using lossy coding, wherein the first and second manners differ so that the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner.
57. Encoder of claim 56, wherein the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner in terms of low-pass filtering.
58. Encoder for encoding a picture into a data stream, configured to
determine, for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
infer that one or more predetermined coding options relating to one or more tools of the encoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion or for prediction residual quantization are to be set so that the one or more tools are disabled with respect to the predetermined portion, if the predetermined portion of the picture is to be coded into the data stream using lossless coding, and by deriving the one or more predetermined coding options from a plurality of coding parameters, if the predetermined portion of the picture is to be coded into the data stream using lossy coding.
59. Encoder of any of claims 51 to 58, configured to signal the determining in the data stream.
60. Encoder of any of claims 51 to 59, configured to
encode into the data stream a lossless coding syntax element which indicates whether the predetermined portion of the picture, or a predetermined region containing the predetermined portion, is coded into the data stream using lossless coding or lossy coding, based on the determining.
61. Encoder of any of claims 51 to 60, configured to perform the determining by
checking (106) whether a plurality (98) of coding parameters relate to the predetermined portion (100) of the picture (12) and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding,
signal the plurality (98) of coding parameters in the data stream (14);
determine whether the predetermined portion of the picture is coded into the data stream using lossless coding or lossy coding depending on whether the plurality (98) of coding parameters are indicative of the coding parameter setting corresponding to the lossless prediction residual coding.
62. Video encoder configured to
perform video encoding into a data stream at an internal bit-depth and receive video input at an input bit-depth or internal bit-depth,
encode into the data stream a syntax element which indicates a difference between the internal bit-depth and the input bit-depth.
63. Video encoder of claim 62, configured to perform a bit-depth transition from an internally-reconstructed video version (46) to the input bit-depth.
64. Video encoder of claim 62 or 63, configured to
deduce a minimum for a quantization step size scale parameter based on the difference, and
adhere to the minimum quantization step size scale parameter for portions of the video to be coded in a prediction residual transform skip mode,
65. Video encoder of any of claims 62 to 64, configured to
in adhering to the minimum quantization step size scale parameter for video portions to be coded in a prediction residual transform skip mode, change a quantization step size scale parameter for the video portions, to equal the minimum quantization step size scale parameter in case of the signaled quantization step size scale parameter falling below the minimum quantization step size scale parameter; and
signal the quantization step size scale parameter in the data stream.
66. Video encoder of claim 64 or 65, configured to adhere to the minimum quantization step size scale parameter merely within video regions for which a lossless coding mode is applied; and
signal the lossless coding mode in the data stream.
67. Video encoder configured to
perform video encoding into a data stream at an internal bit-depth and receive video input at an input bit-depth or internal bit-depth,
encode into the data stream a syntax element which indicates a minimum for a quantization step size scale parameter.
68. Video encoder of claim 67, configured to
adhere to the minimum quantization step size scale parameter for video portions to be coded in a prediction residua! transform skip mode.
69. Video encoder of claim 67 or 68, configured to
in adhering to the minimum quantization step size scale parameter for video portions to be coded in a prediction residual transform skip mode, change a signaled quantization step size scale parameter, to equal the minimum quantization step size scale parameter in case of the signaled quantization step size scale parameter falling below the minimum quantization step size scale parameter; and
signal the quantization step size scale parameter in the data stream.
70. Video encoder of claim 68 or 69, configured to adhere to the minimum quantization step size scale parameter merely within video regions for which a lossless coding mode is applied; and
signal the lossless coding mode in the data stream.
71. Video encoder configured to
encode into a data stream video an indication of an internal bit-depth and an input bit-depth or a difference between same,
perform video encoding into the data stream at the internal bit-depth and receive video input at the input bit-depth,
check whether the internal bit-depth falls below the input bit-depth and change the internal bit-depth to correspond to the input bit-depth.
72. Video encoder of claim 71, configured to
encode into the data stream an indication of a lossless coded video portion, and
perform the checking and the changing within the lossless coded video portion and use the internal bit-depth for a lossy coded video portion; and
signal the internal bit-depth in the data stream.
73. Method for decoding a picture from a data stream, comprising:
checking (106) whether a plurality (98) of coding parameters, which are contained in the data stream (14), relate to a predetermined portion (100) of the picture (12) and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding,
responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding, setting one or more predetermined coding options relating to one or more tools of the decoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion so that the one or more tools are disabled with respect to the predetermined portion.
74. Method for decoding a picture from a data stream, comprising:
determining for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
decoding a prediction residual from the data stream for the predetermined portion using binarization of prediction residual data into bin strings and context-adaptive binary entropy decoding of the bin strings in a first manner, if the predetermined portion of the picture is coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is coded into the data stream using lossy coding, wherein the first and second manners differ so that a computational complexity is reduced in the first manner compared to the second manner.
75. Method for decoding a picture from a data stream, comprising:
determining for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
performing on a prediction residual (24'", 24””) or a prediction residua! corrected reconstruction (46) of the predetermined portion a perfectly invertible transform, if the predetermined portion of the picture is coded into the data stream using lossless coding, and a non-perfectly invertible transform, if the predetermined portion of the picture is coded into the data stream using lossy coding.
76. Method for decoding a picture from a data stream, comprising:
determining, for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
deriving a prediction signal (26) for the predetermined portion in a first manner, if the predetermined portion of the picture is coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is coded into the data stream using lossy coding, wherein the first and second manners differ so that the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner.
77. Method for decoding a picture from a data stream, comprising:
determining, for a predetermined portion of the picture, whether same is coded into the data stream using lossless coding or lossy coding, and
infering that one or more predetermined coding options relating to one or more tools of the decoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion or for prediction residual re-quantization are to be set so that the one or more tools are disabled with respect to the predetermined portion, if the predetermined portion of the picture is coded into the data stream using lossless coding, and by deriving the one or more predetermined coding options from the plurality of coding parameters, if the predetermined portion of the picture is coded into the data stream using lossy coding.
78. Method, comprising:
performing video decoding from a data stream at an internal bit-depth and video outputing at an input bit-depth or internal bit-depth,
reading from the data stream a syntax element which indicates a difference between the internal bit-depth and the input bit-depth.
79. Method, comprising:
performing video decoding from a data stream at an internal bit-depth and video outputing at an input bit-depth or internal bit-depth,
reading from the data stream a syntax element which indicates a minimum for a quantization step size scale parameter.
80. Method, comprising:
deriving from a data stream video an indication of an internal bit-depth and an input bit-depth or a difference between same,
performing video decoding from the data stream at the internal bit-depth and video outputing at the input bit-depth,
checking whether the internal bit-depth falls below the input bit-depth and changing the internal bit-depth to correspond to the input bit-depth.
81. Method for encoding a picture into a data stream, comprising:
signaling a plurality (98) of coding parameters in the data stream (14);
checking (106) whether the plurality (98) of coding parameters relate to a predetermined portion (100) of the picture (12) and control a prediction residual transform mode and a quantization accuracy with respect to the predetermined portion, are indicative of a coding parameter setting corresponding to a lossless prediction residual coding,
responsive to the plurality of coding parameters being indicative of the coding parameter setting corresponding to the lossless prediction residual coding, setting one or more predetermined coding options relating to one or more tools of the encoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion so that the one or more tools are disabled with respect to the predetermined portion in a prediction-loop of the encoder.
82. Method for encoding a picture into a data stream, comprising;
determining for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
encoding a prediction residual into the data stream for the predetermined portion using binarization of prediction residual data into bin strings and context-adaptive binary entropy encoding of the bin strings in a first manner, if the predetermined portion of the picture is to be coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is to be coded into the data stream using lossy coding, wherein the first and second manners differ so that a computational complexity is reduced in the first manner compared to the second manner.
83. Method for encoding a picture into a data stream, comprising:
determining for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
performing on a prediction residual (24’”, 24””) or a prediction residual corrected reconstruction (46) of the predetermined portion a perfectly invertible transform, if the predetermined portion of the picture is to be coded into the data stream using lossless coding, and a non-perfectly invertible transform, if the predetermined portion of the picture is to be coded into the data stream using lossy coding.
84. Method for encoding a picture into a data stream, comprising:
determining, for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
deriving a prediction signal (26) for the predetermined portion in a first manner, if the predetermined portion of the picture is to be coded into the data stream using lossless coding, and in a second manner, if the predetermined portion of the picture is to be coded into the data stream using lossy coding, wherein the first and second manners differ so that the prediction signal is less filtered in the first manner than in the second manner or unfiltered in the first manner while being filtered in the second manner.
85. Method for encoding a picture into a data stream, comprising:
determining, for a predetermined portion of the picture, whether same is to be coded into the data stream using lossless coding or lossy coding, and
infering that one or more predetermined coding options relating to one or more tools of the encoder for processing a prediction residual corrected predictive reconstruction with respect to the predetermined portion or for prediction residual quantization are to be set so that the one or more tools are disabled with respect to the predetermined portion, if the predetermined portion of the picture is coded into the data stream using lossless coding, and by deriving the one or more predetermined coding options from the plurality of coding parameters, if the predetermined portion of the picture is coded into the data stream using lossy coding.
86. Method, comprising:
performing video encoding into a data stream at an internal bit-depth and receiving video input at an input bit-depth or internal bit depth,
encoding into the data stream a syntax element which indicates a difference between the internal bit-depth and the input bit-depth.
87. Method, comprising:
performing video encoding into a data stream at an internal bit-depth and receiving video input at an input bit-depth or internal bit-depth,
encoding into the data stream a syntax element which indicates a minimum for a quantization step size scale parameter.
88. Method, comprising:
encoding into a data stream video an indication of an internal bit-depth and an input bit-depth or a difference between same,
performing video encoding into the data stream at the internal bit-depth and receiving video input at the input bit-depth,
checking whether the internal bit-depth falls below the input bit-depth and changing the internal bit-depth to correspond to the input bit-depth.
89. A computer program for implementing the method of at least one of claims 100 to 115.
90. A data stream obtainable by the method of at least one of claims 108 to 115.
| # | Name | Date |
|---|---|---|
| 1 | 202237000201.pdf | 2022-01-03 |
| 2 | 202237000201-STATEMENT OF UNDERTAKING (FORM 3) [03-01-2022(online)].pdf | 2022-01-03 |
| 3 | 202237000201-FORM 1 [03-01-2022(online)].pdf | 2022-01-03 |
| 4 | 202237000201-FIGURE OF ABSTRACT [03-01-2022(online)].pdf | 2022-01-03 |
| 5 | 202237000201-DRAWINGS [03-01-2022(online)].pdf | 2022-01-03 |
| 6 | 202237000201-DECLARATION OF INVENTORSHIP (FORM 5) [03-01-2022(online)].pdf | 2022-01-03 |
| 7 | 202237000201-COMPLETE SPECIFICATION [03-01-2022(online)].pdf | 2022-01-03 |
| 8 | 202237000201-FORM 18 [06-01-2022(online)].pdf | 2022-01-06 |
| 9 | 202237000201-POA [11-03-2022(online)].pdf | 2022-03-11 |
| 10 | 202237000201-FORM 13 [11-03-2022(online)].pdf | 2022-03-11 |
| 11 | 202237000201-AMENDED DOCUMENTS [11-03-2022(online)].pdf | 2022-03-11 |
| 12 | 202237000201-FORM 3 [25-06-2022(online)].pdf | 2022-06-25 |
| 13 | 202237000201-FER.pdf | 2022-06-28 |
| 14 | 202237000201-Proof of Right [01-07-2022(online)].pdf | 2022-07-01 |
| 15 | 202237000201-FORM 3 [30-11-2022(online)].pdf | 2022-11-30 |
| 16 | 202237000201-OTHERS [28-12-2022(online)].pdf | 2022-12-28 |
| 17 | 202237000201-FER_SER_REPLY [28-12-2022(online)].pdf | 2022-12-28 |
| 18 | 202237000201-DRAWING [28-12-2022(online)].pdf | 2022-12-28 |
| 19 | 202237000201-CLAIMS [28-12-2022(online)].pdf | 2022-12-28 |
| 20 | 202237000201-FORM 3 [08-09-2023(online)].pdf | 2023-09-08 |
| 21 | 202237000201-FORM 3 [06-02-2024(online)].pdf | 2024-02-06 |
| 22 | 202237000201-Response to office action [25-07-2025(online)].pdf | 2025-07-25 |
| 1 | SearchE_27-06-2022.pdf |