Abstract: When a coding image selected by an encoding controlling part (1) is an intra prediction mode, an intra prediction part (4) carries out an intra-frame prediction process using pixels adjacent to a partition Pi n which is generated through a division by a block dividing part (2) or pixels adjacent to a higher layer partition Pi n-1 to which the partition Pi n belongs to generate a prediction image (Pi n). [FIG. 1] 53
This application is a divisional application of 5320/CHENP/2013 filed on 05/07/2013
SPECIFICATION TITLE OF THE INVENTION
IMAGE ENCODING DEVICE AND IMAGE DECODING DEVICE
5
FIELD OF THE INVENTION [0001]
The present invention relates to an image coding device for and an image coding method of encoding a moving image with
10 a high degree of efficiency, and an image decoding device for and an image decoding method of decoding an encoded moving image with a high degree of efficiency. BACKBROUND OF THE INVENTION [0002]
15 For example, in an intra prediction mode for luminance in an AVC/H.264 (ISO/IEC 14496-10|ITU-T H.264) encoding method which is an international standard method, one prediction mode can be selected from among a plurality of prediction modes for each block (for example, refer to nonpatent reference 1) . Fig.
20 14 is an explanatory drawing showing intra prediction modes in the case of a 4x4 pixel block size for luminance. In the case of a 4x4 pixel block size for luminance, nine intra prediction modes (a mode 0, a mode 1, ..., and a mode 8) are defined. [0003]
25 In the example shown in Fig. 14, each white circle shows a pixel in a block to be encoded. Each black circle shows a pixel that is used for prediction, and that exists in an already-encoded adjacent block. The mode 2 is the one in which an average prediction is carried out in such a way that each
30 pixel in the block to be encoded is predicted by using the average
2
of adjacent pixels existing in the upper and left blocks. Each of the modes other than the mode 2 is the one in which a directional prediction .is.carried out. Particularly, the mode 0 is the one in which a vertical prediction is carried out in
5 such a way that adj acent pixels in the upper block are repeatedly replicated to create plural rows of pixels along a vertical direction to generate a prediction image. For example, the mode 0 is selected when the block to be encoded is a vertically striped pattern. The mode 1 is the one in which a horizontal prediction
10 is carried out in such a way that adjacent pixels in the left block are repeatedly replicated to create plural columns of pixels along a horizontal direction to generate a prediction image. Further, in each of the modes 3 to 8, interpolation pixels running in a predetermined direction (i.e., a direction
15 shown by arrows) are generated by using the adjacent pixels in the upper block or the left block to generate a prediction image. [0004]
The block size for luminance to which an intra prediction is applied can be selected from 4x4 pixels, 8*8 pixels, and 16x16
20 pixels. In the case in which the block size for luminance is 8x8 pixels, nine intra prediction modes are defined, like in the case in which the block size for luminance is 4x4 pixels. In contrast with this, in the case in which the block size for luminance is 16x16 pixels, four intra prediction modes which
25 are called plane predictions are defined in addition to an average prediction mode, a vertical prediction mode, and a horizontal prediction mode. A plane prediction mode is a mode in which pixels created by carrying out an interpolation in a diagonal direction on the adjacent pixels in the upper block
30 and the adjacent pixels in the left .block are provided as
3
predicted values. [0005]
Although in a directional prediction mode in the case in which the block size for luminance is 4x4 pixels or 8x8 pixels, 5 because predicted values are generated along a direction predetermined according to the mode (e.g., a direction of 45 degrees), the prediction efficiency increases and the code amount can be reduced when the direction of a boundary (edge) of an object in the block matches the direction shown by the
10 prediction mode, the prediction efficiency decreases when the direction of an edge of an object in the block does not match the direction shown by the prediction mode. In contrast with this, because an increase in the number of selectable directional prediction modes causes an increase in the
15 probability that the direction of an edge matches the direction shown by a prediction mode, it can be assumed that the prediction efficiency increases. Related art document Nonpatent reference
20 [0006]
Nonpatent reference 1: MPEG-4 AVC (ISO/IEC 14496-10)/H.ITU-T
264 standards
SUMMARY OF THE INVENTION
PROBLEMS TO BE SOLVED BY THE INVENTION
25 [0007]
Because the conventional image coding device is constructed as above, an increase in the number of selectable directional prediction modes increases the probability that the direction of an edge matches the direction shown by a prediction
30 mode, and hence the prediction efficiency can be improved. A
4
problem is, however, that because the number of choices of similar prediction images increases even if the number of selectable directional predictions is increased, an improvement in the encoding efficiency commensurate with an
5 increase in the amount of computation which is caused by the increase in the number of selectable directional predictions is not provided, and the increase in the number of selectable directional prediction has a limited contribution to improving the encoding efficiency.
10 [0008]
The present invention is made in order to solve the above-mentioned problem, and it is therefore an object of the present invention to provide an image coding device and an image coding method capable of increasing the degree of improvement
15 in the encoding efficiency which is caused by an increase in the number of directional prediction modes with a small amount of computation and a small code amount. It is another object of the present invention to provide an image decoding device and an image decoding method capable of correctly decoding coded *20 data in which an improvement in the encoding efficiency is provided to acquire a moving image. MEANS FOR SOLVING THE PROBLEM [0009]
In accordance with the present invention, there is
25 provided an image coding device in which when a coding mode selected by an encoding controlling unit is an intra prediction mode, a prediction image generating unit carries out an intra-frame prediction process using pixels adjacent to each coding block which is generated through a division by a block
30 dividing unit or pixels adjacent to a higher layer coding block
5
to which each coding block belongs to generate a prediction image. Further, when the coding mode selected by the encoding controlling unit is an intra prediction mode, a variable length encoding unit encodes a flag indicating whether an intra 5 prediction direction of a coding block adjacent to the coding block which is a target to be processed and which is generated through the division by the block dividing unit matches that of the target coding block, and, when the flag indicates that the intra prediction direction of an adjacent coding block
10 matches that of the coding block, also encodes information showing whether which one of adjacent coding blocks has an intra prediction direction matching that of the coding block. ADVANTAGES OF THE INVENTION [0010]
15 Because the image coding device in accordance with the present invention is constructed in such a way that when the coding mode selected by the encoding controlling unit is an intra prediction mode, the prediction image generating unit carries out an intra-frame prediction process using pixels
20 adjacent to each coding block which is generated through the division by the block dividing unit or pixels adjacent to a higher layer coding block to which each coding block belongs to generate a prediction image, there is provided an advantage of being able to increase the degree of improvement in the
25 encoding efficiency which is caused by an increase in the number of directional predictions with a small amount of computation and a small code amount. BRIEF DESCRIPTION OF THE FIGURES [0011]
30 [Fig. 1] Fig. 1 is a block diagram showing an image coding
6
device in accordance with Embodiment 1 of the present invention;
[Fig. 2] Fig. 2 is a flow chart showing processing carried out by the image coding device in accordance with Embodiment 1 of the present invention; 5 [Fig. 3] Fig. 3 is an explanatory drawing showing a state in which each coding block having a maximum size is hierarchically divided into a plurality of coding blocks;
[Fig. 4] Fig. 4(a) is an explanatory drawing showing a distribution of partitions into which a block to encoded is 10 divided, and Fig. 4 (b) is an explanatory drawing showing a state in which a coding mode m (Bn) is assigned to each of the partitions after a hierarchical layer division is performed by using a quadtree graph;
[Fig. 5] Fig. 5 is an explanatory drawing showing an example 15 of an intra prediction parameter (intra prediction mode) which can be selected for each partition Pin belonging to a coding block Bn;
[Fig. 6] Fig. 6 is an explanatory drawing showing an example of pixels which are used when generating a predicted value of 20 each pixel in a partition Pi" in the case of lin=min=4;
[Fig. 7] Fig. 7 is an explanatory drawing showing a relationship between a partition Pin and a higher layer partition Pi""1;
[Fig. 8] Fig. 8 is an explanatory drawing showing a partition Pin belonging to a coding block Bn; 25 [Fig. 9] Fig. 9 is an explanatory drawing showing an example of an intra prediction parameter (intra prediction mode) which can be selected for each partition Pin belonging to a coding block Bn;
[Fig. 10] Fig. 10 is a block diagram showing an image decoding 30 device in accordance with Embodiment 1 of the present
7
invention;
[Fig. 11] Fig. 11 is a flow chart showing processing carried
out by the image decoding device in accordance with Embodiment
1 of the present invention; 5 [Fig. 12] Fig. 12 is an explanatory drawing showing coded data
of each partition;
[Fig. 13] Fig. 13 is an explanatory drawing showing an adjacent
partition; and
[Fig. 14] Fig. 14 is an explanatory drawing showing intra 10 prediction modes in the case of a 4x4 pixel block size for
luminance.
EMBODIMENTS OF THE INVENTION
[0012]
Hereafter, in order to explain this invention in greater 15 detail, the preferred embodiments of the present invention will
be described with reference to the accompanying drawings.
Embodiment 1.
Fig. 1 is a block diagram showing a moving image encoding
device in accordance with Embodiment 1 of the present invention. 20 Referring to Fig. 1, an encoding controlling part 1 carries out
a process of determining a maximum size of each of coding blocks
which is a unit to be processed at a time when an intra prediction
process (intra-frame prediction process) or a
motion-compensated prediction process (inter-frame prediction 25 process) is carried out, and also determining an upper limit
on the number of hierarchical layers in a hierarchy in which
each of the coding blocks having the maximum size is
hierarchically divided into blocks. The encoding controlling
part 1 also carries out a process of selecting a coding mode 30 suitable for each of the coding blocks into which each coding
8
block having the maximum size is divided hierarchically from one or more available coding modes (one or more intra coding modes and one or more inter coding modes) . The encoding controlling part 1 further carries out a process of determining
5 a quantization parameter and a transformation block size which are used when a difference image is compressed for each coding block, and also determining an intra prediction parameter or an inter prediction parameter which is used when a prediction process is carried out for each coding block. The quantization
10 parameter and the transformation block size are included in prediction difference encoding parameters, and these prediction error encoding parameters are outputted to a transformation/quantization part 7, an inverse quantization/inverse transformation part 8, a variable length
15 encoding part 13, and so on. The encoding controlling part 1 constructs an encoding controlling unit. [0013]
A block dividing part 2 carries out a process of, when receiving a video signal showing an inputted image (current
20 picture), dividing the inputted image into coding blocks each having the maximum size determined by the encoding controlling part 1, and also dividing each of the coding blocks into blocks hierarchically until the number of hierarchical layers reaches the upper limit on the number of hierarchical layers which is
25 determined, by the encoding controlling part 1. The block dividing part 2 constructs a block dividing unit. A selection switch 3 carries out a process of, when the coding mode selected by the encoding controlling part 1 for a coding block, which is generated through the division by the block dividing part
30 2, is an intra coding mode, outputting'the coding block to an
9
intra prediction part 4, and, when the coding mode selected by the encoding controlling part 1 for the coding block, which is generated through the division by the block dividing part 2, is an inter coding mode, outputting the coding block to a
5 motion-compensated prediction part 5. [0014]
The intra prediction part 4 carries out a process of, when receiving the coding block, which is generated through the division by the block dividing part 2, from the selection switch
10 3, carrying out an intra prediction process based on the intra prediction parameter outputted thereto from the encoding controlling part on the coding block to generate a prediction image by using already-encoded pixels which are stored in a memory 10 for intra prediction and which are adjacent to the
15 coding block or already-encoded pixels which are adjacent to a higher layer coding block to which the coding block belongs. The motion-compensated prediction part 5 carries out a process of, when an inter coding mode is selected by the encoding controlling part 1 as a coding mode corresponding to the coding
20 block, which is generated through the division by the block dividing part 2, performing a motion-compensated prediction process on the coding block to generate a prediction image by using one or more frames of reference images stored in a motion-compensated prediction frame memory 12 on the basis of
25 the inter prediction parameter outputted thereto from the encoding controlling part 1. A prediction image generating unit is comprised of the selection switch 3, the intra prediction part 4, and the motion-compensated prediction part 5. _30 . [0015]
10
A subtracting part 6 carries out a process of subtracting the prediction image generated by the intra prediction part 4 or the motion-compensated prediction part 5 from the coding block, which is generated through the division by the block
5 dividing part 2, to generate a difference image (= the coding block - the prediction image). The subtracting part 6 constructs a difference image generating unit. The transformation/quantization part 7 carries out a process of performing an orthogonal transformation process (e.g., a DCT
10 (discrete cosine transform) or an orthogonal transformation process, such as a KL transform, in which bases are designed for a specific learning sequence in advance) on the difference image generated by the subtracting part 6 in units of a block having a transformation block size included in the prediction
15 difference encoding parameters outputted from the encoding controlling part 1, and also quantizing the transform coefficients of the difference image by using a quantization parameter included in the prediction difference encoding parameters to output the transform coefficients quantized
20 thereby as~ compressed data of the difference image. The transformation/quantization part 7 constructs an image compression unit. [0016]
The inverse quantization/inverse transformation part 8
25 carries out a process of inverse-quantizing the compressed data outputted thereto from the transformation/quantization part 7 by using the quantization parameter included in the prediction difference encoding parameters outputted from the encoding controlling part 1, and performing an inverse transformation
30 process (e.g., an inverse DCT (inverse discrete cosine
11
transform) or an inverse transformation process such as an inverse KL transform) on the compressed data inverse-quantized thereby to output the compressed data on which the inverse quantization/inverse transformation part has carried out the
5 inverse transformation process as a local decoded prediction difference signal. [0017]
An adding part 9 carries out a process of adding the local decoded prediction difference signal outputted thereto from the
10 inverse quantization/inverse transformation part 8 and the prediction signal showing the prediction image generated by the intra prediction part 4 or the motion-compensated prediction part 5 to generate a local decoded image signal showing a local decoded image. The memory 10 for intra prediction is a
15 recording medium, such as a RAM, for storing the local decoded image shown by the local decoded image signal generated by the adding part 9 as an image which the intra prediction part 4 will use when performing the intra prediction process the next time. [0018]
20 A loop filter part 11 carries out a process of compensating for an encoding distortion included in the local decoded image signal generated by the adding part 9, and outputting the local decoded image shown by the local decoded image signal on which the loop filter part has carried out the encoding distortion
25 compensation to the motion-compensated prediction frame memory 12 as a reference image. The motion-compensated prediction frame memory.12 is a recording medium, such as a RAM, for storing the local decoded image on which the loop filter part 11 has carried out the filtering process as a reference image which
30 the motion-compensated prediction part 5 will use when
12
performing the motion-compensated prediction process the next
time.
[0019]
The variable length encoding part 13 carries out a process
5 of variable-length-encoding the compressed data outputted thereto from the transformation/quantization part 7, the coding mode and the prediction difference encoding parameters which are outputted thereto from the encoding controlling part 1, and the intra prediction parameter outputted thereto from the intra
10 prediction part 4 or the inter prediction parameter outputted thereto from the motion-compensated prediction part 5 to generate a bitstream into which coded data of the compressed data, coded data of the coding mode, coded data of the prediction difference encoding parameters, and coded data of the intra
15 prediction parameter or the inter prediction parameter are multiplexed. The variable length encoding part 13 constructs a variable length encoding unit. [0020]
In the example of Fig. 1, the encoding controlling part
20 1, the block dividing part 2, the selection switch 3, the intra prediction part 4, the motion-compensated prediction part 5, the subtracting part 6, the transformation/quantization part 7, the inverse quantization/inverse transformation part 8, the adding part 9, the loop filter part 11, and the variable length
25 encoding part 13, which are the components of the moving image encoding device, can consist of pieces of hardware for exclusive use (e.g., integrated circuits in each of which a CPU is mounted, one chip microcomputers, or the like), respectively. As an alternative, the moving image encoding device can consist of
30 a computer, and part or all of a program in which the processes
13
carried out by - the encoding controlling part 1, the block dividing part 2, the selection switch 3, the intra prediction part 4, the motion-compensated prediction part 5, the subtracting part 6, the transformation/quantization part 7, the
5 inverse quantization/inverse transformation part 8, the adding part 9, the loop filter part 11, and the variable length encoding part 13 are described can be stored in a memory of the computer and the CPU of the computer can be made to execute the program stored in the memory. Fig. 2 is a flow chart showing the
10 processing carried out by the moving image encoding device in accordance with Embodiment 1 of the present invention. [0021]
Fig. 10 is a block diagram showing a moving image decoding device in accordance with Embodiment 1 of the present invention.
15 Referring to Fig. 10, a variable length decoding part 21 carries out a process of specifying the maximum size of each coding block which is a unit to be processed at a time when an intra prediction process or a motion-compensated prediction process is carried out, and the number of hierarchical layers in the hierarchy in
20 which each of the coding blocks having the maximum size is hierarchically divided into blocks to specify the coded data associated with each of the coding blocks having the maximum size and the coded data associated with each of the coding blocks into which each of the coding blocks having the maximum size
25 is divided hierarchically from the coded data multiplexed into the bitstream, and variable-length-decoding the coded data associated with each of the coding blocks to acquire the compressed data, the coding mode, the prediction difference encoding parameters, and the intra prediction parameter or the
30 inter prediction parameter, which are-associated with each of
14
the coding blocks, and outputting the compressed data and the prediction difference encoding parameters to an inverse quantization/inverse transformation part 25, and also outputting the coding mode, and the intra prediction parameter
5 or the inter prediction parameter to a selection switch 22. The variable length decoding part 21 constructs a variable length decoding unit. [0022]
The selection switch 22 carries out a process of, when
10 the coding mode associated with the coding block, which is outputted from the variable length decoding part 21, is an intra coding mode, outputting the intra prediction parameter outputted thereto from the variable length decoding part 21 to an intra prediction part 23, and, when the coding mode is an
15 inter coding mode, outputting the inter prediction parameter outputted thereto from the variable length decoding part 21 to a motion compensation part 24. [0023]
The intra prediction part 23 carries out a process of
20 performing an intra-frame prediction process on the coding block to generate a prediction image by using either already-decoded pixels which are stored in a memory 27 for intra prediction and which are adjacent to the coding block or already-decoded pixels which are adjacent to a higher layer
25 coding block to which the coding block belongs on the basis of the intra prediction parameter outputted thereto from the selection switch 22. The motion compensation part 24 carries out a process of performing a motion-compensated prediction process on the coding block to generate a prediction image by
30 using one or. more frames of reference images stored in a
15
motion-compensated prediction frame memory 29 on the basis of the inter prediction parameter outputted thereto from the selection switch 22. A prediction image generating unit is comprised of the selection switch 22, the intra prediction part
5 23, and the motion compensation part 24. [0024]
The inverse quantization/inverse transformation part 25 carries out a process of inverse-quantizing the compressed data associated with the coding block, which are outputted thereto
10 from the variable length decoding part 21, by using the quantization parameter included in the prediction difference encoding parameters outputted thereto from the variable length decoding part 21, and performing an inverse transformation process (e.g., an inverse DCT (inverse discrete cosine
15 transform) or an inverse transformation process such as an inverse KL transform) on the compressed data inverse-quantized thereby in units of a block having the transformation block size included in the prediction difference encoding parameters, and outputting the compressed data on which the inverse
20 quantization/inverse transformation part has carried out the inverse transformation process as a decoded prediction difference signal (signal showing a pre-compressed difference image) . The inverse quantization/inverse transformation part 26 constructs a difference image generating unit.
25 [0025]
An adding part 2 6 carries out a process of adding the decoded prediction difference signal outputted thereto from the inverse quantization/inverse transformation part 25 and the prediction signal showing the prediction image generated by the
30 intra prediction part 23 or the motion compensation part 24 to
16
generate a decoded image signal showing a decoded image. The adding part 26 constructs a decoded image generating unit. The memory 27 for intra prediction is a recording medium, such as a RAM, for storing the decoded image shown by the decoded image
5 signal generated by the adding part 26 as an image which the intra prediction part 23 will use when performing the intra prediction process the. next time. [0026]
A loop filter part 58 carries out a process of compensating
10 for an encoding distortion included in the decoded image signal generated by the adding part 26, and outputting the decoded image shown by the decoded image signal on which the loop filter part has carried out the encoding distortion compensation to the motion-compensated prediction frame memory 29 as a
15 reference image and also outputting the decoded image to outside the moving-image decoding device as a reproduced image. The motion-compensated prediction frame memory 29 is a recording medium, such as a RAM, for storing the decoded image on which the loop filter part 28 has carried out the filtering process
20 as a reference image which the motion compensation part 24 will use when performing the motion-compensated prediction process the next time. [0027]
In the example of Fig. 10, the variable length decoding
25 part 21, the selection switch 22, the intra prediction part 23, the motion compensation part 24, the inverse quantization/inverse transformation part 25, the adding part 26, and the loop filter part 28, which are the components of the moving image decoding device, can consist of pieces of
30 hardware for exclusive use (e.g., integrated circuits in each
17
of which a CPU is mounted, one chip microcomputers, or the like), respectively. As an alternative, the moving image decoding device can consist of a computer, and part or all of a program in which the processes carried out by the variable length
5 decoding part 21, the selection switch 22, the intra prediction part 23, the motion compensation part 24, the inverse quantization/inverse transformation part 25, the adding part 26, and the loop filter part 28 are described can be stored in a memory of the computer and the CPU of the computer can be made
10 to execute the program stored in the memory. Fig. 11 is a flow chart showing the processing carried out by the moving image decoding device in accordance with Embodiment 1 of the present invention. [0028]
15 The moving image encoding device in accordance with this Embodiment 1 is characterized in that the moving image encoding device adapts itself to a local change of a video signal in space and time directions to divide the video signal into regions of various sizes, and carries out intra-frame and inter-frame
20 adaptive encoding. In general, a video signal has a characteristic of its complexity varying locally in space and time. There can be a case in which a pattern having a uniform signal characteristic in a relatively large image area, such as a sky image or a wall image, or a pattern having a complicated
25 texture pattern in a small image area, such as a person image or a picture including a fine texture, also coexists on a certain video frame from the viewpoint of space. [0029]
Also from the viewpoint of time, a relatively large image
30 area, such as a sky image or a wall image, has a small local
18
change in a time direction in its pattern, while an image of a moving person or object has a larger temporal change because its outline has a movement of a rigid body and a movement of a non-rigid body with respect to time. Although a process of
5 generating a prediction difference signal having small signal power and small entropy by using temporal and spatial prediction, thereby reducing the whole code amount, is carried out in the encoding process, the code amount of parameters used for the prediction can be reduced as long as the parameters can
10 be applied uniformly to as large an image signal region as possible. On the other hand, because the amount of errors occurring in the prediction increases when the same prediction parameter is applied to a large image area in an image signal pattern having a large change in time and space, the code amount
15 of the prediction difference signal cannot be reduced. Therefore, it is desirable to reduce the size of a region which is subjected to the prediction process when performing the prediction process on an image area having a large change in time and space, thereby reducing the electric power and entropy
20 of the prediction difference signal, even though the data volume of the prediction parameter which is used for the prediction process is increased. In order to carry out an encoding process which is adapted for such the typical characteristics of a video signal, the moving image encoding device in accordance with this
25 Embodiment 1 adopts a structure of hierarchically dividing each region having a predetermined maximum block size of the video signal into blocks, and adapting the prediction process and the encoding process of encoding a prediction difference for each of the blocks into which each region is divided.
30 [0030] _ .:.
19
A video signal which is to be processed by the moving image
encoding device in accordance with this Embodiment 1 can be an
arbitrary video signal in which each video frame consists of
■a series of digital samples (pixels) in two dimensions,
5 horizontal and vertical, such as a YUV signal which consists
of a luminance signal and two color difference signals, a color
video image signal in arbitrary color space, such as an RGB
signal, outputted from a digital image sensor, a monochrome
image signal, or an infrared image signal. The gradation of
10 each pixel can be an 8-bit, 10-bit, or 12-bit one. In the
following explanation, the inputted video signal is a YUV signal
unless otherwise specified. It is further assumed that the two
color difference components U and V are signals having a 4:2:0
format which are subsampled with respect to the luminance
15 component Y. A data unit to be processed which corresponds to
each frame of the video signal is referred to as a "picture."
In this Embodiment 1, a "picture" is explained as a video frame
signal on which progressive scanning has been carried out. When
the video signal is an interlaced signal, a "picture" can be
20 alternatively a field image signal which is a unit which
constructs a video frame.
[0031]
Next, the operation of the moving image encoding device and the operation of the moving image decoding device will be 25 explained.
First, the processing carried out by the moving image encoding device shown in Fig. 1 will be explained. First, the encoding controlling part 1 determines a maximum size of each of coding blocks which is a unit to be processed at a time when an intra 30 prediction process (intra-frame prediction process) or a
20
motion-compensated prediction process (inter-frame prediction process) is carried out, and also determines an upper limit on the number of hierarchical layers in a hierarchy in which each of the coding blocks having the maximum size is hierarchically
5 divided into blocks (step STl of Fig. 2) . [0032]
- As a method of determining the maximum size of each of coding blocks, for example, there is considered a method of determining a maximum size for all the pictures according to
10 the resolution of the inputted image. Further, there can be considered a method of quantifying a variation in the complexity of a local movement of the inputted image as a parameter and then determining a small size for a picture having a large and vigorous movement while determining a large size for a picture
15 having a small movement. As a method of determining the upper limit on the number of hierarchical layers, for example, there can be considered a method of increasing the depth of the hierarchy, i.e., the number of hierarchical layers to make it possible to detect a finer movement as the inputted image has
20 a larger and more vigorous movement, or decreasing the depth of the hierarchy, i.e., the number of hierarchical layers as the inputted image has a smaller movement. [0033]
The encoding controlling part 1 also selects a coding mode
25 corresponding to each of the coding blocks into which each coding block having the maximum size is divided hierarchically from one or more available coding modes (M intra coding modes and N inter coding modes) (step ST2) . The M intra coding modes which are prepared in advance will be mentioned below. When
30 each of the coding block into which the inputted image is
21
hierarchically divided by the block dividing unit 2 which will be mentioned below is further divided into partitions, the encoding controlling part can select a coding mode corresponding to each of the partitions. Hereafter, an
5 explanation will be made in this Embodiment 1 by assuming that each of the coding blocks is further divided into partitions. Although a detailed explanation of a selection method of selecting a coding mode for use. in the encoding controlling part 1 will be omitted because the selection method is a known
10 technique, there is a method of carrying out an encoding process on each coding block by using an arbitrary available coding mode to examine the encoding efficiency and select a coding mode having the highest level of encoding efficiency from among a plurality of available coding modes, for example.
15 [0034]
The encoding controlling part 1 further determines a quantization parameter and a transformation block size which are used when a difference image is compressed for each partition belonging to each coding block, and also determines
20 an intra prediction parameter or an inter prediction parameter which is used when a prediction process is carried out. The encoding controlling part 1 outputs prediction difference encoding parameters including the quantization parameter and the transformation block size to the
25 transformation/quantization part 7, the inverse quantization/inverse transformation part 8, and the variable length encoding part 13. The encoding controlling part also outputs the prediction difference encoding parameters to the intra prediction part 4 as needed.
30 [0035]
22
When receiving the video signal showing the inputted image, the block dividing part 2 divides the inputted image into coding blocks each having the maximum size determined by the encoding controlling part 1, and also divides each of the coding
5 blocks into blocks hierarchically until the number of hierarchical layers reaches the upper limit on the number of hierarchical layers which is determined by the encoding controlling part 1. The block dividing part further divides each of the coding blocks into partitions (step ST3). Fig. 3
10 is an explanatory drawing showing a state in which each coding block having the maximum size is hierarchically divided into a plurality of coding blocks. In the example of Fig. 3, each coding block having the maximum size is a coding block B° in the 0th hierarchical layer, and its luminance component has a
15 size of (L°, M°) . Further, in the example of Fig. 3, by carrying out the hierarchical division with this coding block B° having the maximum size being set as a starting point until the depth of the hierarchy reaches a predetermined depth which is set separately according to a quadtree structure, coding blocks Bn
20 can be acquired. [0036]
At the depth of n, each coding block Bn is- an image area having a size of (Ln, Mn) . Although Ln can be the same as or differ fromMn, the case of Ln=Mn is shown in the example of Fig.
25 3. Hereafter, the size of each coding block Bn is defined as the size of (Ln, Mn) in the luminance component of the coding block Bn. [0037]
Because the block dividing part 2 carries out a quadtree
30 division, (Ln+1, Mn+1) = (Ln/2, Mn/2) is always established. In the
23
case of a color video image signal (4:4:4 format) in which all the color components have the same sample number, such as an RGB signal, all the color components have a size of (Ln, Mn), while in the case of handling a 4:2:0 format, a corresponding 5 color difference component has an encoding block size of (Ln/2, Mn/2). Hereafter, a coding mode selectable for each coding block Bn in the nth hierarchical layer is expressed as m(Bn) . [0038]
In the case of a color video signal which consists of a 10 plurality of color components, the coding mode m(Bn) can be formed in such a way that an individual mode is used for each color component. Hereafter, an explanation will be made by assuming that the coding mode m(Bn) indicates the one for the luminance component of each coding block having a 4:2:0 format 15 in a YUV signal unless otherwise specified. The coding mode m(Bn) can be one of one or more intra coding modes (generically referred to as "INTRA") or one or more inter coding modes (generically referred to as "INTER"), and the encoding controlling part 1 selects, as the coding mode m(Bn), a coding 20 mode with the highest degree of encoding efficiency for each coding block Bn from among all the coding modes available in the picture currently being processed or a subset of these coding modes, as mentioned above. [0039] 25 Each coding block Bn is further divided into one or more prediction units (partitions) by the block dividing part, as shown in Fig. 3. Hereafter, each partition belonging to each coding block Bn is expressed as F±n (i shows a partition number in the nth hierarchical layer) . Fig. 8 is an explanatory 30 drawing showing a partition Pin belonging to a coding block Bn.
24
How the division of each coding block Bn into partitions Pj.n belonging to the coding block Bn is carried out is included as information in the coding mode m(Bn) . While the prediction process is carried out on each of all the partitions Pj.n according 5 to the coding mode m(Bn), an individual prediction parameter can be selected for each partition Pj.n. [0040]
The encoding controlling part 1 produces such a block
division state as shown in, for example, Fig. 4 for a coding
10 block having the maximum size, and then determines coding blocks
Bn. Hatched shown in Fig. 4(a) show a distribution of
partitions into which the coding block having the maximum size
is divided, and Fig. 4 (b) shows a situation in which coding modes
m(Bn) are respectively assigned to the partitions generated
15 through the hierarchical layer division by using a quadtree
graph. Each node enclosed by □ shown in Fig. 4(b) is a node
(coding block Bn) to which a coding mode m(Bn) is assigned.
[0041]
When the encoding controlling part 1 selects an intra
20 coding mode (m (Bn) eINTRA) , the selection switch 3 outputs each partition Pi" belonging to the coding block Bn which is generated through the division by the block dividing unit 2 to the intra prediction part 4, whereas when the encoding controlling part
1 selects an inter coding mode (m(Bn)eINTER), the selection 25 switch 3 outputs each partition Pi" belonging to the coding block
Bn to the motion-compensated prediction part 5.
[0042]
When receiving each partition Pin belonging to the coding
block Bn from the selection switch 3 (step ST4), the intra 30 prediction part 4 carries out an intra prediction process on
25
each partition Pin to generate an intra prediction image (Pin) on the basis of the intra prediction parameter determined by the encoding controlling part 1 (step ST5) , as will be concretely described below. Hereafter, in this specif ication,
5 Pin denotes a partition and (Pin) denotes a prediction image of the partition Pin. [0043]
Because the moving image decoding device also needs to generate an intra prediction image (P±n) which is completely
10 the same as the intra prediction image, the intra prediction
parameter used for the generation of the intra prediction image
(Pin) are multiplexed into the bitstream by the variable length
encoding part 13. The number of intra prediction directions
each of which can be selected as an intra prediction parameter
15 can be configured in such a way as to differ according to the size of the target block to be processed. The number of intra prediction directions selectable for a large-size partition can be decreased because the efficiency of intra prediction decreases, while the number of intra prediction directions
20 selectable for a small-size partition can be increased. For example, 34 intra prediction directions can be provided for a 4*4-pixel partition and an 8*8-pixel partition, 17 intra prediction directions can be provided for a 16*16-pixel partition, and 9 intra prediction directions can be provided
25 for a 32*32-pixel partition. [0044]
When receiving each partition Pin belonging to the coding block Bn from the selection switch ~ 3 (step ST4), the motion-compensated prediction part 5 carries out an inter
30 prediction process on each partition Pin-to generate an inter
26
prediction image (Pin) on the basis of the inter prediction parameter determined by the encoding controlling part 1 (step ST6). More specifically, the motion-compensated prediction part 5 carries out a motion-compensated prediction process on
5 each partition Pin to generate an inter prediction image (Pin) by using one or more frames of reference images stored in the motion-compensated prediction frame memory 12 on the basis of the inter prediction parameter outputted thereto from the encoding controlling part 1. Because the moving image decoding
10 device also needs to generate an inter prediction image (Pin) which is completely the same as the inter prediction image, the inter prediction parameter used for the generation of the inter prediction image (P±n) are multiplexed into the bitstream by the variable length encoding part 13.
15 [0045]
When receiving the prediction image (Pin) from the intra prediction part 4 or the motion-compensated prediction part 5, the subtracting part 6 subtracts the prediction image (Pin) from the partition Pin belonging to the coding block Bn which is
20 generated through the division by the block dividing unit 2 to generate a prediction difference signal ein showing a difference image (step ST7). When the subtracting part 6 generates the prediction difference signal ein, the transformation/quantization part 7 carries out a
25 transformation process (e.g., a DCT (discrete cosine transform) or an orthogonal transformation process, such as a KL transform, in which bases are designed for a specific learning sequence in advance) on the prediction difference signal ein in units of a block having the transformation block size included in the
30 prediction difference encoding parameters outputted thereto
27
from the encoding controlling part 1, and also quantizes the transform coefficients of the prediction difference signal e±a by using the quantization parameter included in the prediction difference encoding parameters and outputs compressed data of
5 the difference image which are the transform coefficients quantized thereby to the inverse quantization/inverse transformation part 8 and the variable length encoding part 13 (step ST8). [0046]
10 When receiving the compressed data of the difference image from the transformation/quantization part 7, the inverse quantization/inverse transformation part 8 inverse-quantizes the compressed data of the difference image by using the quantization parameter included in the prediction difference
15 encoding parameters outputted thereto from the encoding controlling part 1, performs an inverse transformation process (e.g., an inverse DCT (inverse discrete cosine transform) or an inverse transformation process such as an inverse KL transform) on the compressed data inverse-quantized thereby in
20 units of a block having the transformation block size included in the prediction difference encoding parameters, and outputs the compressed data on which the inverse quantization/inverse transformation part has carried out the inverse transformation process to the adding part 9 as a local decoded prediction
25 difference signal (step ST9). [0047]
When receiving the local decoded prediction difference signal from the inverse quantization/inverse transformation part 8, the adding part 9 adds the local decoded prediction
30 difference signal and the prediction signal showing the
28
prediction image (Pin) generated by the intra prediction part 4 or the motion-compensated prediction part 5 to generate a local decoded image signal showing a local decoded partition image or a local decoded coding block image (referred to as a
5 local decoded image from here on) which is a group of local decoded partition images, and outputs the local decoded image signal to the loop filter unit 11 (step ST10) . Further, this local decoded image is stored in the memory 10 for intra prediction in order that it can be used for intra prediction.
10 [0048]
When receiving the local decoded image signal from the adding part 9, the loop filter part 11 compensates for an encoding distortion included in the local decoded image signal, and stores the local decoded image shown by the local decoded
15 image signal on which the loop filter part has carried out the encoding distortion compensation in the motion-compensated prediction frame memory 12 as a reference image (step ST11). The loop filter part 11 can carry out the filtering process for each coding block having the maximum size of the local decoded
20 image signal inputted thereto or for each coding block of the local decoded image signal. As an alternative, after the local decoded image signals corresponding to all the macroblocks of one screen are inputted thereto, the loop filter part can carry out the filtering process on all the macroblocks of the one
25 screen at a time. [0049]
The moving image encoding device repeatedly carries out the processes of steps ST4 to ST10 until the moving image encoding device completes the processing on each partition Pin
30 belonging to each of all the coding blocks Bu into which the
29
inputted image is divided by the block dividing part 2 (step ST12). The variable length encoding part 13 carries out a process of variable-length-encoding the compressed data outputted thereto from the transformation/quantization part 7,
5 the coding mode and the prediction difference encoding parameters which are outputted thereto from the encoding controlling part 1, and the intra prediction parameter outputted thereto from the intra prediction part 4 or the inter prediction parameter outputted thereto from the
10 motion-compensated prediction part 5 to generate a bitstream into which coded data of the compressed data, coded data of the coding mode, coded data of the prediction difference encoding parameters, and coded data of the intra prediction parameter or the inter prediction parameter are multiplexed (step ST13) .
15 [0050]
Next, the process carried out by the intra prediction part 4 will be explained concretely. Fig. 5 is an explanatory drawing showing an example of the intra prediction parameter (intra prediction mode) which can be selected for each partition
20 Pin belonging to the coding block Bn. In the example of Fig. 5, prediction direction vectors corresponding to intra prediction modes are shown, and it is designed that a relative angle between prediction direction vectors becomes small with increase in the number of selectable intra prediction modes.
25 [0051]
Hereafter, an intra process of generating an intra prediction signal of the luminance signal of a partition Pin on the basis of the intra prediction parameter (intra prediction mode) for the luminance signal will be explained. It is assumed
30 that the size of the partition Pin is linXmin pixels. Fig. 6
30
is an. explanatory drawing showing an example of pixels which are used when generating a predicted value of each pixel in the partition Pin in the case of lin=min=4. Although the (2X1^+1) pixels in an already-encoded upper partition which is adjacent
5 to the partition Pj.n and the (2 Xiriin) pixels in an already-encoded left partition which is adjacent to the partition Pj.n are defined as the pixels used for prediction in the example of Fig. 6, a smaller number of pixels than the pixels shown in Fig. 6 can be alternatively used for prediction. Although one row or
10 column of pixels which are adjacent to the partition Pj.n are used for prediction in the example of Fig. 6, two rows or columns of pixels or three or more rows or columns of pixels can be alternatively used for prediction. [0052]
15 When an index value indicating the intra prediction mode for the partition Pin is 2 (average prediction), the intra prediction part 4 generates a prediction image by using the average of the adjacent pixels in the upper partition and the adjacent pixels in the left partition as the predicted value
20 of each pixel in the partition Pin. When the index value indicating the intra prediction mode is other than 2 (average prediction) , the intra prediction part generates the predicted value of each pixel in the partition Pin on the basis of a prediction direction vector vp=(dx, dy) shown by the index
25 value. When relative coordinates in the partition Pin (the upper left pixel of the partition is defined as the point of origin) of each pixel (prediction target pixel) for which the predicted value is generated are expressed as (x, y), the position of a reference pixel which is used for prediction is
30 the point of intersection where the following L and one of the
31
lines of adjacent pixels intersect each other.
L"6)+to'
where k is a positive scalar value. [0053]
When a reference pixel is located at an integer pixel
5 position, the integer pixel is defined as the predicted value of the target pixel for prediction. In contrast, when a reference pixel is not located at an integer pixel position, an interpolation pixel which is generated from an integer pixel adjacent to the reference pixel is defined as the predicted
10 value of the target pixel for prediction. In the example of Fig. 6, because the reference pixel is not at an integer pixel position, the average of the values of the two pixels which are adjacent to the reference pixel is determined as the predicted value of the target pixel for prediction. The intra prediction
15 part can use not only the adjacent two pixels but also two or more adjacent pixels to generate an interpolation pixel and determine the value of this interpolation pixel as the predicted value. [0054]
20 According to the same procedure, the intra prediction part 4 generates prediction pixels for all the pixels of the luminance signal in the partition Pj.n and outputs an intra prediction image (Pin) generated thereby. The intra prediction parameter used for the generation of the intra prediction image
25 (Pin) are outputted to the variable length encoding part 13 in order to multiplex the intra prediction parameter into the bitstream, as mentioned above. While the intra prediction image (Pin) for the luminance signal in the partition Pin is
32
generated in the above-mentioned way, an intra prediction image (Pjn-1) is generated for the luminance signal in a partition Pjn_1 (j: the partition number of the higher layer partition in the (n-l)th hierarchical layer to which the partition Pj.n belongs)
5 in a similar manner, as shown in Fig. 7. [0055]
In this embodiment, as shown in Fig. 6, the intra prediction part is constructed in such a way as to be able to select a portion corresponding to the partition Pin, which is
10 extracted from the intra prediction image (Pjn-1) for the higher layer partition Pjn_1, as a candidate for the intra prediction image (Pin) for the luminance signal in the partition Pin, in addition to the one that is generated from adjacent pixels of reference pixels adjacent to the partition Pin. More
15 specifically, in a case in which nine directions of intra prediction mode including an average prediction are provided as the intra prediction modes, as shown in Fig. 9, nine prediction modes are further added when a higher layer partition exists.
20 [0056]
Although an intra prediction is a means of predicting an unknown area in the screen from a known area, the prediction efficiency can be improved by selecting a candidate for the intra prediction image from prediction images having different
25 properties because the image signal has a large local change in a space direction. Therefore, in the case in which the intra prediction part is constructed as above in such a way that the number of directions in the intra prediction modes is increased, the degree of similarity among prediction images generated is
30 reduced as compared with the case in which- the number of intra
33
prediction modes is simply increased, and hence a candidate for the intra prediction image can be selected from prediction images having many variations. Therefore, an increase in the number of directions in the intra prediction modes can increase
5 the degree of improvement in the encoding efficiency as compared with an increase in the number of intra prediction modes. Further, because a prediction image for a lower layer partition can be generated by extracting a part of a prediction image for a higher layer partition using the method in accordance with
10 the present invention, an increase in the number of prediction directions provides an advantage of being able to reduce the amount of computation as compared with the case of an increase in the number of prediction modes. [0.057]
15 The intra prediction part also carries out an intra
prediction process based on the intra prediction parameter
(intra prediction mode) on each of the color difference signals
of the partition Pin according to the same procedure as that
according to which the intra prediction part carries out the
20 intra prediction process on the luminance signal, and outputs the intra prediction parameter used for the generation of the intra prediction image to the variable length encoding part 13. [0058]
The variable length encoding part 13
25 variable-length-encodes the intra prediction parameter outputted thereto from the intra prediction part 4 and multiplexes the codeword of the intra prediction parameter into the bitstream, as mentioned above. When encoding the intra prediction parameter, the variable length encoding unit 13 can
30 select a representative prediction-- direction vector
34
(prediction- direction representative vector) from among the prediction direction vectors of a plurality of directional predictions, express the intra prediction parameter by using an index indicating the prediction direction representative
5 vector (prediction direction representative index) and an index
indicating the difference between the prediction direction
vector and the prediction direction representative vector
(prediction direction differential index), and carry out
Huffman encoding, such as arithmetic encoding according to a
10 probability model, for each of the indexes, thereby being able to reduce the code amount. [0059]
Next, the processing carried out by the moving image encoding device shown in Fig. 10 will be explained. When
15 receiving the bitstream generated by the moving image encoding device shown in Fig. 1, the variable length decoding part 21 carries out a variable length decoding process on the bitstream to decode a frame size in units of a sequence which consists of one or more frames of pictures or in units of a picture (step
20 ST21 of Fig. 11) . After decoding the frame size, the variable length decoding part 21 determines the maximum encoding block size determined by the moving image encoding device shown in Fig. 1 (the maximum size of each of the coding blocks which is a unit to be processed at a time when an intra prediction process
25 or a motion-compensated prediction process is carried out), and the upper limit on the hierarchy number (the number of hierarchical layers in the hierarchy in which each coding block having the maximum size is hierarchically divided into blocks) according to the same procedure as that which the moving image
30 encoding device uses (step ST22).
35
[0060]
For example, when the maximum size of each of the coding blocks is determined for all the pictures according to the resolution of the inputted image in the moving image encoding
5 device, the variable length decoding part determines the maximum size of each of the coding blocks on the basis of the frame size which the variable length decoding part has decoded previously according to the same procedure as that which the moving image encoding device shown in Fig. 1 uses. When both
10 the maximum size of each of the coding blocks and the number of hierarchical layers of the coding blocks are multiplexed into the bit stream by the moving image encoding device, the variable length decoding part decodes the bitstream to acquire both the maximum size of each of the coding blocks and the number of
15 hierarchical layers of the coding blocks. [0061]
After determining both the maximum size of each of the coding blocks and the number of hierarchical layers of the coding blocks, the variable length decoding part 21 specifies
20 the coded data associated with each coding block from the coded data multiplexed into the bitstream by grasping the state of the hierarchical division of the inputted image into the coding blocks with each coding block having the maximum size being set as a starting point, and decodes the coded data to acquire the
25 coding mode assigned to each of the coding blocks. The variable length decoding part 21 then refers to the division information about the division into partitions Pin belonging to the coding block Bn included in the coding mode to specify the coded data associated with each of the partitions Pin from the coded data
30 multiplexed into the bitstream (step ST23) . The variable
36
length decoding part 21 variable-length-decodes the coded data associated with each of the partitions Pin to acquire the compressed data, the prediction difference encoding parameters, and the intra prediction parameter or the inter
5 prediction parameter, and outputs the compressed data and the prediction difference encoding parameters to the inverse quantization/inverse transformation part 25 and also outputs the coding mode and the intra prediction parameter or the inter prediction parameter to the selection switch 22 (step ST24) .
10 [0062]
For example, when the prediction direction representative index and the prediction direction differential index are multiplexed into the bitstream, the variable length decoding part entropy-decodes the prediction direction
15 representative index and the prediction direction differential index by using, for example, arithmetic decoding methods according to their respective probability models, respectively, to specify the intra prediction parameter from the prediction direction representative index and the
20 prediction direction differential index. As a result, even when the code amount of the intra prediction parameter is reduced in the moving image encoding device, the moving image decoding device can decode the intra prediction parameter correctly.
25 [0063]
When the coding mode of each partition Pin belonging to the coding block Bn outputted thereto from the variable length decoding part 21 is an intra coding mode, the selection switch 22 outputs the intra prediction parameter outputted thereto
30 from the variable length decoding part 21 to the intra
37
prediction part 23, whereas when the coding mode is an inter coding mode, the selection switch outputs the inter prediction parameter outputted thereto from the variable length decoding part 21 to the motion compensation part. 24.
5 [0064]
When receiving the intra prediction parameter from the selection switch 22 (step ST25), the intra prediction part 23 carries out an intra prediction process on each partition Pj.n to generate an intra prediction image (Pj.n) on the basis of the
10 intra prediction parameter (step ST26), like the intra prediction part 4 shown in Fig. 1. More specifically, the intra prediction part 23 carries out an intra-frame prediction process on each partition Pin to generate a prediction image (Pin) by using already-decoded pixels which are stored in the
15 memory 27 for intra prediction and which are adjacent to the partition Pin or already-decoded pixels which are adjacent to a higher layer partition Pjn_1 to which the partition Pin belongs on the basis of the intra prediction parameter. [0065]
20 When receiving the inter prediction parameter from the selection switch 22 (step ST25), the motion compensation part 24 carries out an inter prediction process on each partition Pin to generate an inter prediction image (Pin) on the basis of the inter prediction parameter, like the motion-compensated
25 prediction part 5 shown in Fig. 1 (step ST27) . More specifically, the motion compensation part 24 carries out a motion-compensated prediction process on each partition l?±a to generate an inter prediction image (Pin) by using one or more frames of reference images stored in the motion-compensated
30 prediction frame memory 29 on the basis of the inter prediction
38
parameter. [0066]
When receiving the prediction difference encoding parameters from the variable length decoding part 21, the
5 inverse quantization/inverse transformation part 25 inverse-quantizes the compressed data associated with the coding block, which are outputted thereto from the variable length decoding part 21, by using the quantization parameter included in the prediction difference encoding parameters, and
10 carries out an inverse transformation process (e.g., an inverse DCT (inverse discrete cosine transform) or an inverse transformation process such as an inverse KL transform) on the compressed data inverse-quantized thereby in units of a block having the transformation block size included in the prediction
15 difference encoding parameters, and outputs the compressed data on which the inverse quantization/inverse transformation part has carried out the inverse transformation process to the adding part 26 as a decoded prediction difference signal (signal showing a pre-compressed difference image) (step ST28).
20 [0067]
The adding part 26 generates a decoded image signal showing a decoded partition image or a decoded image which is a group of decoded partition images by adding the decoded prediction difference signal and the prediction signal showing
25 the prediction image (Pin) generated by the intra prediction part 23 or the motion-compensated prediction part 24, and outputs the decoded image signal to the loop filter part 28 (step ST29) . Further, this decoded image is stored in the memory 27 for intra prediction in order that the decoded image can be used
30 for intra prediction. ■• —_.:■:"
39
[0068]
When receiving the decoded image signal from the adding part 2 6, the loop filter part 28 compensates for an encoding distortion included in the decoded image signal, and.stores the
5 decoded image shown by the decoded image signal on which the loop filter part has carried out the encoding distortion compensation in the motion-compensated prediction frame memory 29 as a reference image and also outputs the decoded image as a reproduced image (step ST30). The loop filter part 28 can
10 carry out the filtering process for each coding block having the maximum size of the decoded image signal inputted thereto or each coding block of the decoded image signal. As an alternative, after the decoded image signals corresponding to all the macroblocks of one screen are inputted, the loop filter
15 part can carry out the filtering process on all the macroblocks of the one screen at a time. The processes of steps ST23 to ST29 are repeatedly carried out until the processing on each partition Pin belonging to each of all the coding blocks Bn is completed (step ST31).
20 [0069]
As can be seen from the above description, because the intra prediction part 4 of the moving image encoding device in accordance with this Embodiment 1 is constructed in such a way as to, when the coding mode selected by the encoding controlling
25 unit 1 is an intra prediction mode, carry out an intra-frame prediction process using pixels adjacent to a partition Pj.n which is generated through the division by the block dividing unit 2 or pixels adjacent to a higher layer partition Pjn_1 to which the partition Pj.n belongs to generate a prediction image
30 (Pj.n), there is provided an advantage of being able to increase
40
the degree of improvement in the encoding efficiency with a
small amount of computation.
[0070]
Further, because the intra prediction part 23 of the
5 moving image decoding device in accordance with this Embodiment 1 is constructed in such a way as to, when the coding mode variable-length-decoded by the variable length decoding part 21 is an intra prediction mode, carry out an intra-frame prediction process using already-decoded pixels which are
10 stored in the memory 27 for intra prediction and which are adjacent to a partition Pin or already-decoded pixels adjacent to a higher layer partition Pjn_1 to which the partition Pj.n belongs to generate a prediction image (Pin) , there is provided an advantage of being able to correctly decode the coded data
15 in which an improvement in the encoding efficiency is provided to acquire the moving image. [0071] Embodiment 2.
Although the example in which the variable length
20 encoding part 13 of the moving image encoding device variable-length-encodes the intra prediction parameter for a partition to be encoded is shown in above-mentioned Embodiment 1, the variable length encoding part can variable-length-encode an intra merge flag indicating . whether or not the intra
25 prediction parameter for the partition to be encoded is identical to that for a partition adjacent to the partition, and can also variable-length-encode an intra merge direction for specifying the adjacent partition when the intra prediction parameter for the partition to be encoded is identical to that
30 for ..the adjacent partition, otherwise variable-length-encode
41
. the intra prediction parameter for the partition to be encoded. [0072]
Further, although the example in which the variable length decoding part 21 of the moving image decoding device
5 variable-length-decodes the coded data associated with a partition to be decoded to acquire the intra prediction parameter for the partition is shown in above-mentioned Embodiment 1, the variable length decoding part can variable-length-decode the coded data associated with the
10 partition to be decoded to acquire the intra merge flag indicating whether or not the intra prediction parameter for the partition to be decoded is identical to that for a partition adjacent to the partition, and can also variable-length-decode the coded data to acquire the intra merge direction for
15 specifying the adjacent partition when the intra prediction parameter for the partition to be decoded is identical to that for the adjacent partition, otherwise variable-length-decode the coded data to acquire the intra prediction parameter for the partition to be encoded.
20 [0073]
Because a moving image encoding device and a moving image decoding device in accordance with this embodiment have the same structures as those according to above-mentioned Embodiment 1 except for components corresponding to encoding and decoding
25 of an intra prediction parameter, only the encoding and decoding of the intra prediction parameter will be explained in this Embodiment 2. [0074]
When variable-length-encoding the intra prediction
30 parameter for a partition Pin to be encoded (target to be
42
processed), a variable length encoding -part 13 of the moving image encoding device in accordance with this Embodiment 2 variable-length-encodes an intra merge flag indicating whether the intra prediction parameter is identical to that for a
5 partition adjacent to the partition Pin, as shown in Fig. 12 (A) . When the intra merge flag indicates that the intra prediction parameter is identical to that for an adjacent partition, the variable length encoding part 13 also variable-length-encodes an intra merge direction showing which one of adjacent
10 partitions has the same intra prediction parameter as the partition to be encoded. In contrast, when the intra merge flag indicates that the intra prediction parameter is not identical to that for any adjacent partition, the variable length encoding part variable-length-encodes the intra prediction parameter
15 for the partition Pj.n to be encoded (in this case, the variable length encoding part carries out the same encoding as that shown in above-mentioned Embodiment 1), as shown in Fig. 12(B). [0075]
For example, when a partition adjacent to the left of the
20 partition to be encoded and a partition adjacent to the top of the partition to be encoded are listed as adjacent partition candidates, as shown in Fig. 13(A), a flag indicating whether the intra prediction parameter for the partition to be encoded is identical to that for the partition adjacent to the left of
25 the partition to be encoded or those for the partition adjacent to the top of the partition to be encoded is provided as the intra merge direction. As an alternative, a partition adjacent to the left of the partition to be encoded, a partition adjacent to the top left of the partition to be encoded, and a partition
30 adjacent to the top of the partition to be encoded are listed
43
can be listed as adjacent partition candidates, and a flag indicating which one of the three candidates has the same intra prediction parameter as the partition to be encoded can be provided as the intra merge direction, as shown in Fig. 13(B) .
5 In a case in which all adjacent partitions each of which can be the target for flag encoding have the same intra prediction parameter, it is not necessary to encode the intra merge direction as a matter of course. Therefore, the variable length encoding part can be constructed in such a way as not to encode 10 the intra merge direction in this case, as shown in Fig. 12 (D) . Further, the variable length encoding part can be constructed in such a way as to, when there are two or more partitions adjacent to the top or left of the partition to be encoded, scan the partitions in a direction away from a top left side of the
15 partition to be encoded, for example, and select the first partition which has been encoded in an intra prediction mode as the adjacent partition to the top or left of the partition to be encoded. [0076]
20 Further, when the number NumN of selectable intra prediction directions in the adjacent partition is not equal to the number NumC of selectable intra prediction directions in the partition to be encoded, the variable length encoding part can be constructed in the following way. More
25 specifically, when NumNNumC, the variable length encoding
part brings a representative prediction direction vector, among
the prediction direction vectors of a plurality of directional
predictions in the adjacent partition, into correspondence with
15 one of the selectable intra prediction directions in the
partition Pin to be encoded to determine whether the intra
prediction parameter for the partition Pin to be encoded matches
that for the adjacent partition. In this case, the variable
length encoding part does not have to encode the intra
20 prediction direction residual parameter. Because it can be
considered that an intra prediction mode has a direction
depending on the texture of the image to be encoded, a similar
prediction mode easily occurs locally. Therefore, the
variable length encoding part can encode the intra prediction
25 parameter with a smaller amount of information by encoding the
intra prediction parameter by using the intra merge flag and
the intra merge direction.
[0078]
When variable-length-decoding the intra prediction 30 parameter in a partition Pin to be decoded (target to be
45
processed), a variable length decoding part 21 of the moving image decoding device in accordance with this Embodiment 2 variable-length-decodes the intra merge flag indicating whether or not the intra prediction parameter is identical to 5 that for a partition adjacent to the partition Pj.n, as shown in Fig. 12(A). Further, when the intra merge flag indicates that the intra prediction parameter is identical to that for an adjacent partition, the variable length decoding part 21 variable-length-decodes the intra merge direction showing 10 which one of adjacent partitions has the same intra prediction parameter as the partition to be decoded. In contrast, when the intra merge flag indicates that the intra prediction parameter is not identical to that for an adjacent partition, the variable length decoding part variable-length-decodes the 15 intra prediction parameter in the partition Pin to be decoded (in this case, the variable length decoding part carries out the same decoding as that shown in above-mentioned Embodiment 1), as shown in Fig. 12(B). [0079] 20 For example, when a partition adjacent to the left of the partition to be decoded and a partition adjacent to the top of the partition to be decoded are listed as adjacent partition candidates, as shown in Fig. 13(A), a flag indicating whether the intra prediction parameter for the partition to be decoded 25 is identical to that for the partition adjacent to the left of the partition to be decoded or those for the partition adjacent to the top of the partition to be decoded is provided as the intra merge direction. As an alternative, a partition adjacent to the left of the partition to be decoded, a partition adjacent 30 to the top left of the partition to be decoded, and a partition
46
adjacent to the top of the partition to be decoded can be listed as adjacent partition candidates, and a flag indicating which one of the three candidates has the same intra prediction parameter as the partition to be decoded can be provided as the
5 intra merge direction, as shown in Fig. 13(B). In a case in which all adjacent partitions each of which can be the target for flag encoding have the same intra prediction parameter, it is not necessary to encode the intra merge direction as a matter of course. Therefore, the variable length decoding part can
10 be constructed in such a way as not to decode the intra merge direction in this case, as shown in Fig. 12(D). Further, the variable length decoding part can be constructed in such a way as to, when there are two or more partitions adjacent to the top or left of the partition to be decoded, scan the partitions
15 in a direction away from a top left side of the partition to
be decoded, for example, and select the first partition which
has been encoded in an intra prediction mode as the adjacent
partition to the top or left of the partition to be decoded.
[0080]
20 Further, when the number NumN of selectable intra prediction directions in the adjacent partition is not equal to the number NumC of selectable intra prediction directions in the partition to be decoded, the variable length decoding part can be constructed in the following way. More
25 specifically, when NumNNumC, the variable length decoding part brings a representative prediction direction vector, among the prediction direction vectors of a plurality of directional predictions in the adjacent partition, into correspondence with
15 one of the selectable intra prediction directions in the partition Pin to be decoded to determine whether the intra prediction parameter for the partition Pin to be decoded matches that for the adjacent partition. In this case, the variable length decoding part does not have to decode the intra
20 prediction direction residual parameter. Because the variable length decoding part is constructed in this way, the variable length decoding part can properly decode the intra prediction parameter which is encoded by the moving image encoding device in accordance with this Embodiment 2. As a matter of course,
25 the variable length encoding unit and the variable length decoding unit in accordance with this Embodiment 2 can also be applied to a case in which the partition to be processed and the adjacent partition are not placed in a prediction mode, as explained in this Embodiment 1, in which part of an intra
30 prediction image in a higher hierarchical "layer is extracted.
48
[0082]
While the invention has been described in its preferred embodiments, it is to be understood that an arbitrary combination of two or more of the above-mentioned embodiments 5 can be made, various changes can be made in an arbitrary-component according to any one of the above-mentioned embodiments, and an arbitrary component according to any one of the above-mentioned embodiments can be omitted within the scope of the invention.
10 INDUSTRIAL APPLICABILITY [0083]
As mentioned above, because the image coding device, the image decoding device, the image coding method, and the image decoding method in accordance with the present invention are
15 capable of increasing the degree of improvement in the encoding efficiency which is caused by an increase in the number of directional predictions with a small amount of computation and a small code amount, the image coding device and the image coding method are suitable for use as an image coding device for and
20 an image coding method of encoding a moving image with a high degree of efficiency, and so on, and the image decoding device and the image decoding method are suitable for use as an image decoding device for and an image decoding method of decoding an encoded moving image with a high degree of efficiency, and
25 so on. [0084]
1 encoding controlling part (encoding controlling unit) , 2 block dividing part (block dividing unit), 3 select switch (prediction image generating unit), 4 intra prediction part
49
(prediction image generating unit), 5 motion-compensated prediction part (prediction image generating unit), 6 subtracting part (difference image generating unit), 7 transformation/quantization part (image compression unit), 8
5 inverse quantization/inverse transformation part, 9 adding part, 10 memory for intra prediction, 11 loop filter part, 12 motion-compensated prediction frame memory, 13 variable length encoding part (variable length encoding unit), 21 variable length decoding part (variable length decoding unit), 22 select
10 switch (prediction image generating unit) , 23 intra prediction part (prediction image generating unit) , 24 motion compensation part (prediction image generating unit), 25 inverse quantization/inverse transformation part (difference image generating unit), 26 adding part (decoded image generating
15 unit), 27 memory for intra prediction, 28 loop filter part, 29 motion-compensated prediction frame memory.
50
We Claim:
1. A moving image encoding device comprising:
an encoding controlling unit (1) for selecting a coding mode for each of coding blocks obtained by hierarchically dividing a coding block having a maximum size which is a processing unit of a coding process;
a prediction image generating unit (3, 4, 5) for performing an intra prediction or an inter prediction on each of said coding blocks to generate an intra prediction image or an inter prediction image according to said coding mode; and
a variable length encoding unit (13) for performing a variable length coding on an intra prediction parameter used to perform an intra prediction on a partition of said coding block to generate a coded data;
wherein,
when there are two or more adjacent partitions located above or to the left of said partition, a first partition in a direction away from top left of said partition is regarded as said adjacent partition,
said variable length encoding unit (13) generates an intra merge flag indicating whether or not said intra prediction parameter for said partition is identical to an intra prediction parameter for an adjacent partition located above or to the left of said partition,
when said intra prediction parameter for said partition is identical to an intra parameter for said adjacent partition located above or to the left of said partition, said variable length encoding unit (13) generates an intra merge direction specifying one of adjacent partitions located above and to the left of said partition of which intra prediction parameter is identical to said intra prediction parameter, and
when said intra prediction parameter for said partition is not identical to an intra prediction parameter for an adjacent partition located above or to the left of said partition, said variable length coding unit (13) encodes said intra prediction parameter for said partition.
51
2. A moving image decoding device comprising:
a variable length decoding unit (21) for performing a variable length decoding on coded data associated with hierarchically divided coding blocks to obtain coding mode for each of said coding blocks; and
a prediction image generating unit (22, 23, 24) for performing an intra prediction or an inter prediction on the individual coding blocks to generate an intra prediction image or an inter prediction image according to said coding mode,
wherein,
when there are two or more adjacent partitions located above or to the left of said partition, a first partition in a direction away from top left of said partition is regarded as said adjacent partition,
said variable length decoding unit (21) decodes an intra merge flag indicating whether or not said intra prediction parameter for said partition is identical to an intra prediction parameter for an adjacent partition located above or to the left of said partition,
when said intra prediction parameter for said partition is identical to an intra parameter for said adjacent partition located above or to the left of said partition, said variable length decoding unit (21) decodes an intra merge direction specifying one of adjacent partitions located above and to the left of said partition of which intra prediction parameter is identical to said intra prediction parameter, and
when said intra prediction parameter for said partition is not identical to said intra prediction parameter for said adjacent partition located above or to the left of said partition, said variable length decoding unit (21) decodes said intra prediction parameter for said partition.
Dated this 31 day of October 2018
~Digitally signed~
Anandan S
REG.NO:IN/PA-1717
of De Penning & De Penning
Agent for the Applicants
52
| # | Name | Date |
|---|---|---|
| 1 | 201848041173-TRANSLATIOIN OF PRIOIRTY DOCUMENTS ETC. [31-10-2018(online)].pdf | 2018-10-31 |
| 2 | 201848041173-STATEMENT OF UNDERTAKING (FORM 3) [31-10-2018(online)].pdf | 2018-10-31 |
| 3 | 201848041173-REQUEST FOR EXAMINATION (FORM-18) [31-10-2018(online)].pdf | 2018-10-31 |
| 4 | 201848041173-PRIORITY DOCUMENTS [31-10-2018(online)].pdf | 2018-10-31 |
| 5 | 201848041173-POWER OF AUTHORITY [31-10-2018(online)].pdf | 2018-10-31 |
| 6 | 201848041173-FORM 18 [31-10-2018(online)].pdf | 2018-10-31 |
| 7 | 201848041173-FORM 1 [31-10-2018(online)].pdf | 2018-10-31 |
| 8 | 201848041173-DRAWINGS [31-10-2018(online)].pdf | 2018-10-31 |
| 9 | 201848041173-DECLARATION OF INVENTORSHIP (FORM 5) [31-10-2018(online)].pdf | 2018-10-31 |
| 10 | 201848041173-COMPLETE SPECIFICATION [31-10-2018(online)].pdf | 2018-10-31 |
| 11 | 201848041173-CLAIMS UNDER RULE 1 (PROVISIO) OF RULE 20 [31-10-2018(online)].pdf | 2018-10-31 |
| 12 | 201848041173-Proof of Right (MANDATORY) [07-11-2018(online)].pdf | 2018-11-07 |
| 13 | Correspondence by Agent_Proof of Right_09-11-2018.pdf | 2018-11-09 |
| 14 | 201848041173-FORM 3 [12-04-2019(online)].pdf | 2019-04-12 |
| 15 | 201848041173-FORM 3 [14-10-2019(online)].pdf | 2019-10-14 |
| 16 | 201848041173-FORM 3 [17-04-2020(online)].pdf | 2020-04-17 |
| 17 | 201848041173-FORM 3 [16-10-2020(online)].pdf | 2020-10-16 |
| 18 | 201848041173-FORM 3 [23-04-2021(online)].pdf | 2021-04-23 |
| 19 | 201848041173-FORM 3 [30-08-2021(online)].pdf | 2021-08-30 |
| 20 | 201848041173-FER_SER_REPLY [30-08-2021(online)].pdf | 2021-08-30 |
| 21 | 201848041173-FER.pdf | 2021-10-17 |
| 22 | 201848041173-FORM 3 [28-01-2022(online)].pdf | 2022-01-28 |
| 23 | 201848041173-FORM 3 [08-09-2022(online)].pdf | 2022-09-08 |
| 24 | 201848041173-FORM 3 [01-03-2023(online)].pdf | 2023-03-01 |
| 25 | 201848041173-PatentCertificate04-08-2023.pdf | 2023-08-04 |
| 26 | 201848041173-IntimationOfGrant04-08-2023.pdf | 2023-08-04 |
| 1 | SearchstrategyE_09-09-2020.pdf |