
Apparatus, Method And Computer Program For Encoding, Decoding, Scene Processing And Other Procedures Related To Dirac Based Spatial Audio Coding Using Diffuse Compensation

Abstract: An apparatus for generating a sound field description from an input signal comprising one or more channels, comprises: an input signal analyzer (600) for obtaining diffuseness data from the input signal; a sound component generator (650) for generating, from the input signal, one or more sound field components of a first group of sound field components having for each sound field component a direct component and a diffuse component, and for generating, from the input signal, a second group of sound field components having only a direct component, wherein the sound component generator is configured to perform an energy compensation when generating the first group of sound field components, the energy compensation depending on the diffuseness data and at least one of a number of sound field components in the second group, a number of diffuse components in the first group, a maximum order of sound field components of the first group and a maximum order of sound field components of the second group.


Patent Information

Application #:
Filing Date: 31 May 2021
Publication Number: 32/2021
Publication Type: INA
Invention Field: ELECTRONICS
Status:
Email: kolkatapatent@lsdavar.in
Parent Application:
Patent Number:
Legal Status:
Grant Date: 2024-03-04
Renewal Date:

Applicants

FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
Hansastraße 27c, 80686 München

Inventors

1. FUCHS, Guillaume
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
2. THIERGART, Oliver
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
3. KORSE, Srikanth
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
4. DÖHLA, Stefan
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
5. MULTRUS, Markus
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
6. KÜCH, Fabian
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
7. BOUTHÉON, Alexandre
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
8. EICHENSEER, Andrea
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen
9. BAYER, Stefan
c/o Fraunhofer-Institut für Integrierte Schaltungen IIS, Am Wolfsmantel 33, 91058 Erlangen

Specification

Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to DirAC based spatial audio coding using diffuse compensation

The present invention is directed to audio coding and, particularly, to the generation of a sound field description from an input signal using one or more sound component generators.

The Directional Audio Coding (DirAC) technique [1] is an efficient approach to the analysis and reproduction of spatial sound. DirAC uses a perceptually motivated representation of the sound field based on the direction of arrival (DOA) and the diffuseness measured per frequency band. It is built upon the assumption that, at one time instant and in one critical band, the spatial resolution of the auditory system is limited to decoding one cue for direction and another for inter-aural coherence. The spatial sound is then represented in the frequency domain by cross-fading two streams: a non-directional diffuse stream and a directional non-diffuse stream.

DirAC was originally intended for recorded B-format sound but can also be extended to microphone signals matching a specific loudspeaker setup like 5.1 [2] or to any configuration of microphone arrays [5]. In the latter case, more flexibility can be achieved by recording the signals not for a specific loudspeaker setup, but for an intermediate format instead. Such an intermediate format, which is well-established in practice, is represented by (higher-order) Ambisonics [3]. From an Ambisonics signal, one can generate the signals of every desired loudspeaker setup, including binaural signals for headphone reproduction. This requires a specific renderer applied to the Ambisonics signal, using either a linear Ambisonics renderer [3] or a parametric renderer such as Directional Audio Coding (DirAC).

An Ambisonics signal can be represented as a multi-channel signal where each channel (referred to as an Ambisonics component) is equivalent to the coefficient of a so-called spatial basis function. With a weighted sum of these spatial basis functions (with the weights corresponding to the coefficients), one can recreate the original sound field at the recording location [3]. Therefore, the spatial basis function coefficients (i.e., the Ambisonics components) represent a compact description of the sound field at the recording location. There exist different types of spatial basis functions, for example spherical harmonics (SHs) [3] or cylindrical harmonics (CHs) [3]. CHs can be used when describing the sound field in 2D space (for example for 2D sound reproduction), whereas SHs can be used to describe the sound field in 2D and 3D space (for example for 2D and 3D sound reproduction).

As an example, an audio signal f(t) which arrives from a certain direction (φ, θ) results in a spatial audio signal f(φ, θ, t) which can be represented in Ambisonics format by expanding the spherical harmonics up to a truncation order H:

$$f(\varphi, \theta, t) = \sum_{l=0}^{H} \sum_{m=-l}^{l} Y_l^m(\varphi, \theta)\,\phi_{lm}(t),$$

whereby $Y_l^m(\varphi, \theta)$ are the spherical harmonics of order l and mode m, and $\phi_{lm}(t)$ are the expansion coefficients. With increasing truncation order H, the expansion results in a more precise spatial representation. Spherical harmonics up to order H = 4 with Ambisonics Channel Numbering (ACN) indexing are illustrated in Fig. 1a for order n and mode m. DirAC was already extended for delivering higher-order Ambisonics signals from a first-order Ambisonics signal (FOA, also called B-format) or from different microphone arrays [5].
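By way of a non-limiting illustration, the following Python sketch evaluates this expansion for the special case H = 1, where the real SN3D spherical harmonics in ACN ordering (the AmbiX convention) have simple closed forms; all function and variable names are illustrative only:

```python
import numpy as np

def foa_directional_gains(azimuth, elevation):
    """First-order (H = 1) real spherical harmonics in ACN channel order with
    SN3D normalization (the AmbiX convention), evaluated at one direction.
    Angles are in radians; elevation is measured from the horizontal plane."""
    return np.array([
        1.0,                                      # l = 0, m =  0 (W)
        np.sin(azimuth) * np.cos(elevation),      # l = 1, m = -1 (Y)
        np.sin(elevation),                        # l = 1, m =  0 (Z)
        np.cos(azimuth) * np.cos(elevation),      # l = 1, m = +1 (X)
    ])

# Encoding a mono signal f(t) from direction (phi, theta): each Ambisonics
# component is the signal weighted by one spatial basis function value.
fs = 48000
t = np.arange(fs) / fs
f_t = np.sin(2 * np.pi * 440 * t)                 # illustrative source signal
gains = foa_directional_gains(np.radians(30), np.radians(10))
foa = gains[:, None] * f_t[None, :]               # shape: (4, num_samples)
```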
This document focuses on a more efficient way to synthesize higher-order Ambisonics signals from DirAC parameters and a reference signal. In this document, the reference signal, also referred to as the down-mix signal, is considered to be a subset of a higher-order Ambisonics signal or a linear combination of a subset of the Ambisonics components. In addition, the present invention considers the case in which DirAC is used for the transmission of the audio scene in parametric form. In this case, the down-mix signal is encoded by a conventional audio core encoder while the DirAC parameters are transmitted in a compressed manner as side information. An advantage of the present method is that it takes into account quantization errors occurring during audio coding.

In the following, an overview of a spatial audio coding system based on DirAC designed for Immersive Voice and Audio Services (IVAS) is presented. This represents one of several possible contexts and provides a system overview of a DirAC-based spatial audio coder. The objective of such a system is to be able to handle different spatial audio formats representing the audio scene, to code them at low bit-rates, and to reproduce the original audio scene as faithfully as possible after transmission.

The system can accept different representations of audio scenes as input. The input audio scene can be captured by multi-channel signals aimed to be reproduced at the different loudspeaker positions, audio objects along with metadata describing the positions of the objects over time, or a first-order or higher-order Ambisonics format representing the sound field at the listener or reference position. Preferably, the system is based on 3GPP Enhanced Voice Services (EVS), since the solution is expected to operate with low latency to enable conversational services on mobile networks.

As shown in Fig. 1b, the encoder (IVAS encoder) is capable of supporting different audio formats presented to the system separately or at the same time. Audio signals can be acoustic in nature, picked up by microphones, or electrical in nature, intended to be transmitted to loudspeakers. Supported audio formats can be multi-channel signals, first-order and higher-order Ambisonics components, and audio objects. A complex audio scene can also be described by combining different input formats. All audio formats are then passed to the DirAC analysis, which extracts a parametric representation of the complete audio scene. A direction of arrival and a diffuseness, measured per time-frequency unit, form the parameters. The DirAC analysis is followed by a spatial metadata encoder, which quantizes and encodes the DirAC parameters to obtain a low bit-rate parametric representation.

Along with the parameters, a down-mix signal derived from the different sources or audio input signals is coded for transmission by a conventional audio core-coder. In this case, an EVS-based audio coder is adopted for coding the down-mix signal. The down-mix signal consists of different channels, called transport channels: the signal can be, e.g., the four coefficient signals composing a B-format signal, a stereo pair, or a monophonic down-mix, depending on the targeted bit-rate. The coded spatial parameters and the coded audio bitstream are multiplexed before being transmitted over the communication channel. The encoder side of the DirAC-based spatial audio coding supporting different audio formats is illustrated in Fig. 1b.
An acoustic/electrical input 1000 is input into an encoder interface 1010, where the encoder interface has a specific functionality for first-order Ambisonics (FOA) or higher-order Ambisonics (HOA), illustrated at 1013. Furthermore, the encoder interface has a functionality for multichannel (MC) data such as stereo data, 5.1 data, or data having more than two or five channels. Furthermore, the encoder interface 1010 has a functionality for object coding such as, for example, SAOC (spatial audio object coding), illustrated at 1011. The IVAS encoder comprises a DirAC stage 1020 having a DirAC analysis block 1021 and a downmix (DMX) block 1022. The signal output by block 1022 is encoded by an IVAS core encoder 1040, such as an AAC or EVS encoder, and the metadata generated by block 1021 is encoded using a DirAC metadata encoder 1030.

In the decoder, shown in Fig. 2, the transport channels are decoded by the core-decoder, while the DirAC metadata is first decoded before being conveyed, together with the decoded transport channels, to the DirAC synthesis. At this stage, different options can be considered. It can be requested to play the audio scene directly on any loudspeaker or headphone configuration, as is usually possible in a conventional DirAC system (MC in Fig. 2). The decoder can also deliver the individual objects as they were presented at the encoder side (Objects in Fig. 2). Alternatively, it can be requested to render the scene to Ambisonics format for further manipulations, such as rotation, reflection or movement of the scene (FOA/HOA in Fig. 2), or for using an external renderer not defined in the original system.

The decoder of the DirAC spatial audio coding delivering different audio formats is illustrated in Fig. 2 and comprises an IVAS decoder 1045 and the subsequently connected decoder interface 1046. The IVAS decoder 1045 comprises an IVAS core-decoder 1060 configured to perform a decoding operation of content encoded by the IVAS core encoder 1040 of Fig. 1b. Furthermore, a DirAC metadata decoder 1050 is provided that delivers the decoding functionality for decoding content encoded by the DirAC metadata encoder 1030. A DirAC synthesizer 1070 receives data from blocks 1050 and 1060 and, with or without user interactivity, its output is input into the decoder interface 1046 that generates FOA/HOA data illustrated at 1083, multichannel data (MC data) as illustrated in block 1082, or object data as illustrated in block 1080.

A conventional HOA synthesis using the DirAC paradigm is depicted in Fig. 3. An input signal called the down-mix signal is time-frequency analyzed by a frequency filter bank. The frequency filter bank 2000 can be a complex-valued filter bank like a complex-valued QMF, or a block transform like an STFT. The HOA synthesis generates at the output an Ambisonics signal of order H containing (H + 1)² components. Optionally, it can also output the Ambisonics signal rendered on a specific loudspeaker layout. In the following, we will detail how to obtain the (H + 1)² components from the down-mix signal, accompanied in some cases by input spatial parameters. The down-mix signal can be the original microphone signals or a mixture of the original signals depicting the original audio scene. For example, if the audio scene is captured by a sound field microphone, the down-mix signal can be the omnidirectional component of the scene (W), a stereo down-mix (L/R), or the first-order Ambisonics signal (FOA).
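As a brief illustration of the time-frequency analysis stage, an STFT-based filter bank applied to a multichannel down-mix could look as follows (a minimal Python sketch; the sampling rate and window length are arbitrary example values):

```python
import numpy as np
from scipy.signal import stft

# Time-frequency analysis of a multichannel down-mix (filter bank 2000),
# here realized as an STFT block transform.
fs = 48000
downmix = np.random.randn(4, fs)        # e.g. four B-format transport channels
freqs, frames, tiles = stft(downmix, fs=fs, nperseg=1024)
# tiles has shape (channels, frequency bins k, time frames n); the per-tile
# DirAC processing described next operates on these complex coefficients.
```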
For each time-frequency tile, a sound direction, also called direction-of-arrival (DOA), and a diffuseness factor are estimated by the direction estimator 2020 and by the diffuseness estimator 2010, respectively, if the down-mix signal contains sufficient information for determining such DirAC parameters. This is the case, for example, if the down-mix signal is a first-order Ambisonics signal (FOA). Alternatively, or if the down-mix signal is not sufficient to determine such parameters, the parameters can be conveyed directly to the DirAC synthesis via an input bit-stream containing the spatial parameters. The bit-stream could consist, for example, of quantized and coded parameters received as side-information in the case of audio transmission applications. In this case, the parameters are derived outside the DirAC synthesis module from the original microphone signals or the input audio formats given to the DirAC analysis module at the encoder side, as illustrated by switch 2030 or 2040.

The sound directions are used by a directional gains evaluator 2050 for evaluating, for each time-frequency tile of the plurality of time-frequency tiles, one or more sets of (H + 1)² directional gains, where H is the order of the synthesized Ambisonics signal. The directional gains can be obtained by evaluating the spatial basis function for each estimated sound direction at the desired order (level) l and mode m of the Ambisonics signal to synthesize. The sound direction can be expressed, for example, in terms of a unit-norm vector n(k, n) or in terms of an azimuth angle φ(k, n) and/or an elevation angle θ(k, n), which are related for example as:

$$\mathbf{n}(k,n) = \begin{bmatrix} \cos\varphi(k,n)\,\cos\theta(k,n) \\ \sin\varphi(k,n)\,\cos\theta(k,n) \\ \sin\theta(k,n) \end{bmatrix}.$$

After estimating or obtaining the sound direction, a response of a spatial basis function of the desired order (level) l and mode m can be determined, for example, by considering real-valued spherical harmonics with SN3D normalization as the spatial basis function:

$$Y_l^m(\varphi, \theta) = N_l^{|m|}\, P_l^{|m|}(\sin\theta) \begin{cases} \cos(m\varphi) & m \ge 0 \\ \sin(|m|\varphi) & m < 0 \end{cases}$$

with the ranges 0 ≤ l ≤ H and −l ≤ m ≤ l. $P_l^{|m|}$ are the Legendre functions and $N_l^{|m|}$ is a normalization term for both the Legendre functions and the trigonometric functions, which takes the following form for SN3D:

$$N_l^{|m|} = \sqrt{(2 - \delta_m)\,\frac{(l - |m|)!}{(l + |m|)!}},$$

where the Kronecker delta δm is one for m = 0 and zero otherwise. The directional gains are then directly deduced for each time-frequency tile of indices (k, n) as:

$$G_l^m(k,n) = Y_l^m\big(\varphi(k,n), \theta(k,n)\big).$$

The direct sound Ambisonics components are computed by deriving a reference signal $P_{\mathrm{ref}}$ from the down-mix signal and multiplying it by the directional gains and a factor function of the diffuseness Ψ(k, n):

$$\hat B^{\mathrm{dir}}_{l,m}(k,n) = \sqrt{1 - \Psi(k,n)}\; G_l^m(k,n)\; P_{\mathrm{ref}}(k,n).$$

For example, the reference signal $P_{\mathrm{ref}}$ can be the omnidirectional component of the down-mix signal or a linear combination of the K channels of the down-mix signal. The diffuse sound Ambisonics component can be modelled by using a response of a spatial basis function for sounds arriving from all possible directions. One example is to define the average response $D_l$ by considering the integral of the squared magnitude of the spatial basis function over all possible angles φ and θ:

$$D_l = \frac{1}{4\pi} \int_0^{2\pi}\!\!\int_{-\pi/2}^{\pi/2} \big|Y_l^m(\varphi, \theta)\big|^2 \cos\theta \; d\theta\, d\varphi = \frac{1}{2l+1}.$$

The diffuse sound Ambisonics components are computed from a signal $P_{\mathrm{diff}}$ multiplied by the average response and a factor function of the diffuseness Ψ(k, n):

$$\hat B^{\mathrm{diff}}_{l,m}(k,n) = \sqrt{\Psi(k,n)\, D_l}\; P_{\mathrm{diff}}(k,n).$$

The signal $P_{\mathrm{diff}}$ can be obtained by applying different decorrelators to the reference signal $P_{\mathrm{ref}}$.
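By way of a non-limiting illustration, the per-tile computation of the direct and diffuse Ambisonics components just described can be sketched in Python as follows (names are illustrative; the SN3D average response D_l = 1/(2l + 1) from above is assumed):

```python
import numpy as np

def synthesize_components(p_ref, p_diff, gains, Psi):
    """Direct and diffuse Ambisonics components for one time-frequency tile.
    p_ref:  reference signal derived from the down-mix (complex scalar)
    p_diff: decorrelated version of p_ref (in practice, one decorrelator
            output per diffuse channel would be used)
    gains:  (H + 1)^2 directional gains G_l^m in ACN order
    Psi:    diffuseness value in [0, 1]"""
    acn = np.arange(len(gains))
    l = np.floor(np.sqrt(acn)).astype(int)   # Ambisonics order from ACN index
    D_l = 1.0 / (2 * l + 1)                  # SN3D average diffuse response
    direct = np.sqrt(1.0 - Psi) * gains * p_ref
    diffuse = np.sqrt(Psi * D_l) * p_diff
    return direct, diffuse
```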
Finally, the direct sound Ambisonics component and the diffuse sound Ambisonics component are combined (2060), for example via a summation, to obtain the final Ambisonics component of the desired order (level) l and mode m for the time-frequency tile (k, n), i.e.,

$$\hat B_{l,m}(k,n) = \hat B^{\mathrm{dir}}_{l,m}(k,n) + \hat B^{\mathrm{diff}}_{l,m}(k,n).$$

The obtained Ambisonics components may be transformed back into the time domain using an inverse filter bank 2080 or an inverse STFT, stored, transmitted, or used, for example, for spatial sound reproduction applications. Alternatively, a linear Ambisonics renderer 2070 can be applied for each frequency band to obtain signals to be played on a specific loudspeaker layout or over headphones, before transforming the loudspeaker signals or the binaural signals to the time domain.

It should be noted that [5] also taught the possibility that diffuse sound components could only be synthesized up to an order L, where L < H. The resulting vector then additionally contains the synthesized coefficients of orders L < l ≤ H, denoted by $\hat Y_{m,l}$. The HOA synthesis normally depends on the diffuseness Ψ (or a similar measure), which describes how diffuse the sound field is at the current time-frequency point. Normally, the coefficients above order L are only synthesized if the sound field becomes non-diffuse, whereas in diffuse situations these coefficients become zero. This prevents artifacts in diffuse situations, but also results in a loss of energy. Details on the HOA synthesis are explained later.

To compensate for the loss of energy in diffuse situations mentioned above, we apply an energy compensation to the coefficients up to order L in the energy compensation block 650, 750. The resulting signal has the same maximum order L as its input. The energy compensation depends on the diffuseness (or a similar measure) and increases the energy of the low-order coefficients in diffuse situations, such that the loss of energy of the higher-order coefficients is compensated. Details are explained later. In the combination block, the energy-compensated coefficients up to order L are combined (430) with the synthesized coefficients above order L to obtain the output Ambisonics signal containing all (H + 1)² coefficients.

Subsequently, a HOA synthesis is explained as an embodiment. There exist several state-of-the-art approaches to synthesize the HOA coefficients, e.g., a covariance-based rendering or a direct rendering using Directional Audio Coding (DirAC). In the simplest case, the coefficients are synthesized from the omnidirectional component P using

$$\hat Y_{m,l} = \sqrt{1 - \Psi}\; G_l^m(\varphi, \theta)\; P.$$

Here, (φ, θ) is the direction-of-arrival (DOA) of the sound and $G_l^m(\varphi, \theta)$ is the corresponding gain of the Ambisonics coefficient of order l and mode m. Normally, $G_l^m$ corresponds to the real-valued directivity pattern of the well-known spherical harmonic function of order l and mode m, evaluated at the DOA (φ, θ). The diffuseness Ψ becomes 0 if the sound field is non-diffuse, and 1 if the sound field is diffuse. Consequently, the coefficients computed above order L become zero in diffuse recording situations. Note that the parameters φ, θ and Ψ can be estimated from a first-order Ambisonics signal based on the active sound intensity vector, as explained in the original DirAC papers.

Subsequently, the energy compensation of the diffuse sound components is discussed. To derive the energy compensation, we consider a typical sound field model where the sound field is composed of a direct sound component and a diffuse sound component, i.e., the omnidirectional signal can be written as

$$P = P_s + P_d,$$

where $P_s$ is the direct sound (e.g., a plane wave) and $P_d$ is the diffuse sound.
Assuming this sound field model and an SN3D normalization of the Ambisonics coefficients, the expected power of the physically correct coefficients $B_{m,l}$ is given by

$$E\{|B_{m,l}|^2\} = \Phi_s\, |G_l^m(\varphi,\theta)|^2 + \Phi_d\, Q_l.$$

Here, $\Phi_s$ is the power of the direct sound and $\Phi_d$ is the power of the diffuse sound. Moreover, $Q_l$ is the directivity factor of the coefficients of the l-th order, which is given by $Q_l = 1/N$, where N = 2l + 1 is the number of coefficients per order l. To compute the energy compensation, we can either consider the DOA (φ, θ) (more accurate energy compensation) or assume that (φ, θ) is a uniformly distributed random variable (more practical approach). In the latter case, the expected power of $G_l^m$ is

$$E\{|G_l^m(\varphi,\theta)|^2\} = Q_l.$$

In the following, let $\mathbf{B}$ denote a physically correct Ambisonics signal of maximum order H. Using the equations above, the total expected power of $\mathbf{B}$ is given by

$$E\{\|\mathbf{B}\|^2\} = \sum_{l=0}^{H} (2l+1)\,(\Phi_s + \Phi_d)\, Q_l = (H+1)(\Phi_s + \Phi_d).$$

Similarly, when using the common diffuseness definition

$$\Psi = \frac{\Phi_d}{\Phi_s + \Phi_d},$$

the total expected power of the synthesized Ambisonics signal is given by

$$E\{\|\hat{\mathbf{Y}}\|^2\} = (\Phi_s + \Phi_d)\big[(L+1) + (H-L)(1-\Psi)\big].$$

The energy compensation is carried out by multiplying the coefficients up to order L by a factor g. The total expected power of the output Ambisonics signal is now given by

$$E\{\|\hat{\mathbf{Y}}'\|^2\} = (\Phi_s + \Phi_d)\big[g^2 (L+1) + (H-L)(1-\Psi)\big].$$

The total expected power of the output should match the total expected power of $\mathbf{B}$. Therefore, the squared compensation factor is computed as

$$g^2 = \frac{(H+1) - (H-L)(1-\Psi)}{L+1}.$$

This can be simplified to

$$g = \sqrt{1 + \Psi\,\frac{H-L}{L+1}},$$

where Ψ is the diffuseness, L is the maximum order of the input Ambisonics signal, and H is the maximum order of the output Ambisonics signal.

It is possible to adopt the same principle for K < (L + 1)² low-order components, where the (L + 1)² − K remaining diffuse sound Ambisonics components are synthesized using decorrelators and an average diffuse response. In certain cases, K < (L + 1)² and no diffuse sound components are synthesized at all. This is especially true for high frequencies, where absolute phases are inaudible and the usage of decorrelators is irrelevant. The diffuse sound components can then be modelled by the energy compensation, by computing the order $L_k$ and the number of modes $m_k$ corresponding to the K low-order components, wherein K represents the number of diffuse components in the first group; the compensating gain is then generalized accordingly.

Subsequently, embodiments of the energy normalization of direct sound components, corresponding to the second aspect generally illustrated in Fig. 7, are described. In the above, the input vector was assumed to be a physically correct Ambisonics signal of maximum order L. However, the down-mix input signal may be affected by quantization errors, which may break the energy relationship. This relationship can be restored by normalizing the down-mix input signal. Given the direction of sound and the diffuseness parameters, the direct and diffuse parts of each component can be expressed according to the sound field model, and the expected power according to the model can then be expressed for each component, for example as

$$E\{|B_{l,m}|^2\} = \big[(1-\Psi)\,|G_l^m|^2 + \Psi\, Q_l\big]\, E\{|P|^2\}.$$

The compensating gain then becomes

$$g_s = \sqrt{\frac{E\{|B_{l,m}|^2\}}{E\{|\hat B_{l,m}|^2\}}}, \qquad 0 \le l \le L,\; -l \le m \le l,$$

where $E\{|\hat B_{l,m}|^2\}$ is the measured power of the corresponding transmitted component. Alternatively, the expected powers according to the model can be expressed per component directly in terms of the model quantities, leading to a corresponding form of the compensating gain with the same ranges 0 ≤ l ≤ L and −l ≤ m ≤ l. The components are complex values and, for the calculation of $g_s$, the norm or magnitude or absolute value (or the radius of the polar coordinate representation) of the complex value is taken and squared to obtain the expected power or energy as the energy- or amplitude-related measure.
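A compact numerical sketch of both gains follows (Python). The closed form of g matches the derivation above; the model power inside g_s is one plausible reading of the sound field model and is labelled as an assumption in the code:

```python
import numpy as np

def diffuse_compensation_gain(Psi, L, H):
    """g: gain for the coefficients up to order L so that the total expected
    power matches that of a full order-H signal (closed form as reconstructed
    in the derivation above)."""
    return np.sqrt(1.0 + Psi * (H - L) / (L + 1.0))

def direct_normalization_gain(power_omni, power_dir, gain_lm, Psi, l):
    """g_s: restores the expected power relation between the omnidirectional
    signal and one directional component after quantization. The model power
    power_omni * ((1 - Psi)*|G_l^m|^2 + Psi*Q_l) is an assumption read off
    the sound field model above, not a verbatim formula."""
    Q_l = 1.0 / (2 * l + 1)
    model_power = power_omni * ((1.0 - Psi) * abs(gain_lm) ** 2 + Psi * Q_l)
    return np.sqrt(model_power / max(power_dir, 1e-12))

# Example: a fully diffuse tile (Psi = 1), FOA input (L = 1), order-3 output:
g = diffuse_compensation_gain(1.0, 1, 3)   # sqrt(1 + 1*(3-1)/(1+1)) ~= 1.414
```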
The energy compensation of diffuse sound components and the energy normalization of direct sound components can be achieved jointly by applying a gain of the form

$$g_{s,d} = g \cdot g_s.$$

In a real implementation, the obtained normalization gain, the compensation gain, or the combination of the two can be limited to avoid large gain factors resulting in severe equalization, which could lead to audio artefacts. For example, the gains can be limited to between −6 and +6 dB. Furthermore, the gains can be smoothed over time and/or frequency (by a moving average or a recursive average) to avoid abrupt changes and to stabilize the process.

Subsequently, some of the benefits and advantages of preferred embodiments over the state of the art are summarized:
• Simplified (less complex) HOA synthesis within DirAC:
  ◦ more direct synthesis without a full synthesis of all Ambisonics components;
  ◦ reduction of the number of decorrelators required and of their impact on the final quality.
• Reduction of the coding artefacts introduced into the down-mix signal during transmission.
• Separation of the processing for three different orders to obtain an optimum trade-off between quality and processing efficiency.

Subsequently, several inventive aspects partly or fully included in the above description are summarized. They can be used independently from each other, in combination with each other, or in a combination of only two arbitrarily selected aspects of the three.

First aspect: Energy compensation for the diffuse sound components

This invention starts from the fact that, when a sound field description is generated from an input signal comprising one or more signal components, the input signal can be analyzed for obtaining at least diffuseness data for the sound field represented by the input signal. The input signal analysis can be an extraction of diffuseness data associated as metadata with the one or more signal components, or it can be a real signal analysis when, for example, the input signal has two, three or even more signal components, such as a full first-order representation like a B-format or an A-format representation.

Now, there is a sound component generator that generates one or more sound field components of a first group, which have a direct component and a diffuse component. Additionally, one or more sound field components of a second group are generated, where, for this second group, each sound field component only has a direct component. In contrast to a full sound field generation, this will result in an energy error, provided that the diffuseness value for the current frame or the current time/frequency bin under consideration has a value different from zero.

In order to compensate for this energy error, an energy compensation is performed when generating the first group of sound field components. This energy compensation depends on the diffuseness data and a number of sound field components in the second group representing the energy loss due to the non-synthesis of diffuse components for the second group. In one embodiment, the sound component generator for the first group can be the low-order branch of Fig. 4, which extracts the sound field components of the first group by means of copying or performing a weighted addition, i.e., without performing a complex spatial basis function evaluation.
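For instance, for a stereo input the extraction by weighted addition could be as simple as the following sketch (the 0.5 scaling is an illustrative convention choice, not prescribed by the text):

```python
def stereo_to_low_order(left, right):
    """Weighted addition without any spatial basis function evaluation:
    an omnidirectional component W as the sum of the channels and a
    left/right dipole component Y as their difference."""
    W = 0.5 * (left + right)
    Y = 0.5 * (left - right)
    return W, Y
```

Note that W and Y obtained this way carry direct and diffuse sound mixed together.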
Thus, the sound field components of the first group are not separately available as a direct portion and a diffuse portion. However, increasing the energy of the whole sound field component of the first group automatically increases the energy of its diffuse portion.

Alternatively, the sound component generator for the one or more sound field components of the first group can also be the mid-order branch in Fig. 4, relying on a separate direct portion synthesis and diffuse portion synthesis. Here, the diffuse portion is separately available and, in one embodiment, the diffuse portion of the sound field component is increased but not the direct portion, in order to compensate for the energy loss due to the second group. Alternatively, however, one could, in this case, increase the energy of the resulting sound field component after having combined the direct portion and the diffuse portion. Alternatively, the sound component generator for the one or more sound field components of the first group can also comprise both the low- and mid-order component branches in Fig. 4. The energy compensation can then be applied only to the low-order components, or to both the low- and mid-order components.

Second aspect: Energy normalization of direct sound components

In this invention, one starts from the assumption that the generation of the input signal having two or more sound components was accompanied by some kind of quantization. Typically, when one considers two or more sound components, one sound component of the input signal can be an omnidirectional signal, such as the omnidirectional microphone signal W in a B-format representation, and the other sound components can be individual directional signals, such as the figure-of-eight microphone signals X, Y, Z in a B-format representation, i.e., a first-order Ambisonics representation.

When a signal encoder comes into a situation where the bitrate requirements are too high for a perfect encoding operation, a typical procedure is that the encoder encodes the omnidirectional signal as exactly as possible, but only spends a lower number of bits on the directional components, which can even be so low that one or more directional components are reduced to zero completely. This represents an energy mismatch and a loss of directional information.

Nevertheless, one may have the requirement, obtained for example via explicit parametric side information, that a certain frame or time/frequency bin has a certain diffuseness lower than one and a certain sound direction. Thus, the situation can arise that one has, in accordance with the parametric data, a certain non-diffuse component with a certain direction while, on the other hand, the transmitted omnidirectional and directional signals do not reflect this direction. For example, the omnidirectional signal could have been transmitted without any significant loss of information, while the directional signal Y, responsible for the left/right direction, could have been set to zero for lack of bits. In this scenario, even if in the original audio scene a direct sound component comes from the left, the transmitted signals will reflect an audio scene without any left/right directional characteristic.
Thus, in accordance with the second invention, an energy normalization is performed for the direct sound components in order to compensate for the broken energy relationship, with the help of direction/diffuseness data either explicitly included in the input signal or derived from the input signal itself. This energy normalization can be applied in the context of all the individual processing branches of Fig. 4, either altogether or only separately.

This invention allows using the additional parametric data, either received with the input signal or derived from non-compromised portions of the input signal; therefore, encoding errors included in the input signal for some reason can be reduced using the additional direction data and diffuseness data derived from the input signal. In this invention, an energy- or amplitude-related measure for an omnidirectional component derived from the input signal and a further energy- or amplitude-related measure for the directional component derived from the input signal are estimated and used for the energy compensation, together with the direction data and the diffuseness data. Such an energy- or amplitude-related measure can be the amplitude itself, the power (i.e., the squared and added amplitudes), the energy (such as power multiplied by a certain time period), or any other measure derived from the amplitude with an exponent different from one and a subsequent summation. Thus, a further energy- or amplitude-related measure might also be a loudness with an exponent of three, compared to the power having an exponent of two.

Third aspect: System implementation with different processing procedures for the different orders

In the third invention, which is illustrated in Fig. 4, a sound field is generated using an input signal comprising a mono-signal or a multi-component signal having two or more signal components. A signal analyzer derives direction data and diffuseness data from the input signal, either by an explicit signal analysis in case the input signal has two or more signal components, or by analyzing the input signal in order to extract direction data and diffuseness data included in the input signal as metadata. A low-order components generator generates the low-order sound description from the input signal up to a predetermined order, and performs this task for available modes which can be extracted from the input signal, by means of copying a signal component from the input signal or by means of performing a weighted combination of components of the input signal. The mid-order components generator generates a mid-order sound description having components at orders above the predetermined order, or at the predetermined order and above the predetermined mode, and lower than or equal to a first truncation order, using a synthesis of at least one direct component and a synthesis of at least one diffuse component using the direction data and the diffuseness data obtained from the analyzer, so that the mid-order sound description comprises a direct contribution and a diffuse contribution. Furthermore, a high-order components generator generates a high-order sound description having components at orders above the first truncation order and lower than or equal to a second truncation order, using a synthesis of at least one direct component without any diffuse component synthesis, so that the high-order sound description has a direct contribution only.
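A schematic of the three branches, reusing the `synthesize_components` sketch from above, might look as follows (the routing and all names are illustrative, not a definitive implementation):

```python
import numpy as np

def generate_tile(low_order_channels, p_ref, p_diff, gains, Psi, K, L):
    """All (H + 1)^2 output components for one tile, in ACN order:
    ACN index < K:         low-order branch (copy / weighted combination)
    K <= ACN < (L + 1)^2:  mid-order branch (direct + diffuse synthesis)
    ACN >= (L + 1)^2:      high-order branch (direct-only synthesis)"""
    out = np.zeros(len(gains), dtype=complex)
    out[:K] = low_order_channels                 # taken from the input signal
    direct, diffuse = synthesize_components(p_ref, p_diff, gains, Psi)
    mid = slice(K, (L + 1) ** 2)
    out[mid] = direct[mid] + diffuse[mid]        # direct and diffuse portions
    out[(L + 1) ** 2:] = direct[(L + 1) ** 2:]   # direct contribution only
    return out
```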
This system invention has significant advantages in that a low-order sound field generation that is as exact as possible is performed by utilizing the information included in the input signal as well as possible, while, at the same time, the processing operations required for the low-order sound description involve little effort, since only copy operations or weighted combination operations, such as weighted additions, are needed. Thus, a high-quality low-order sound description is obtained with a minimum amount of required processing power.

The mid-order sound description requires more processing power, but allows generating a very accurate mid-order sound description having direct and diffuse contributions using the analyzed direction data and diffuseness data, typically up to an order below which a diffuse contribution in a sound field description is still required from a perceptual point of view. Finally, the high-order components generator generates a high-order sound description by performing a direct synthesis only, without performing a diffuse synthesis. This, once again, reduces the amount of required processing power, because only the direct components are generated, while at the same time omitting the diffuse synthesis is not very problematic from a perceptual point of view.

Naturally, the third invention can be combined with the first invention and/or the second invention, but even when, for some reason, the compensation for not performing the diffuse synthesis in the high-order components generator is not applied, the procedure nevertheless results in an optimum compromise between processing power on the one hand and audio quality on the other hand. The same is true for performing the low-order energy normalization compensating for the encoding used to generate the input signal. In an embodiment, this compensation is additionally performed, but even without it, significant non-trivial advantages are obtained.

Fig. 4 illustrates, as a symbolic illustration of a parallel transmission, the number of components processed by each components generator. The low-order components generator 810 illustrated in Fig. 4 generates a low-order sound field description from the input signal up to a predetermined order and a predetermined mode, where the low-order components generator 810 is configured to derive the low-order sound field description by copying or taking the input signal as it is, or by performing a weighted combination of the channels of the input signal. As illustrated between the generate-low-order-components block and the mix-low-order-components block, K individual components are processed by this low-order components generator 810. The mid-order components generator 820 generates the reference signal; as an exemplary situation, it is outlined that the omnidirectional signal included in the down-mix signal at the input or the output of the filter bank 400 is used. However, when the input signal has a left channel and a right channel, the mono signal obtained by adding the left and the right channel is calculated by the reference signal generator 821. Furthermore, (L + 1)² − K components are generated by the mid-order components generator.
Furthermore, the high-order components generator generates (H + 1)² − (L + 1)² components, so that, in the end, at the output of the combiner, (H + 1)² components are available from the single component, or the several (small number of) components, at the input into the filter bank 400. The splitter is configured to provide the individual directional/diffuseness data to the corresponding components generators 810, 820, 830. Thus, the low-order components generator receives K data items. This is indicated by the line connecting the splitter 420 and the mixing low-order components block. Furthermore, the mixing mid-order components block 825 receives (L + 1)² − K data items, and the mixing high-order components block receives (H + 1)² − (L + 1)² data items. Correspondingly, the individual mixing components blocks provide a certain number of sound field components to the combiner 430.

Subsequently, a preferred implementation of the low-order components generator 810 of Fig. 4 is illustrated with respect to Fig. 9. The input signal is input into an input signal investigator 811, and the input signal investigator 811 provides the acquired information to a processing mode selector 812. The processing mode selector 812 is configured to select one of a plurality of different processing modes, which are schematically illustrated as a copying block 813 indicated by number 1, a taking (as it is) block 814 indicated by number 2, a linear combination (first mode) block 815 indicated by number 3, and a linear combination (second mode) block 816 indicated by number 4. For example, when the input signal investigator 811 determines a certain kind of input signal, the processing mode selector 812 selects one of the plurality of different processing modes as shown in the table of Fig. 9. For example, when the input signal is an omnidirectional signal W or a mono signal, then copying 813 or taking 814 is selected. However, when the input signal is a stereo signal with a left channel and a right channel, or a multichannel signal with 5.1 or 7.1 channels, then the linear combination block 815 is selected in order to derive, from the input signal, the omnidirectional signal W by adding left and right, and a directional component by calculating the difference between left and right.

However, when the input signal is a joint stereo signal, i.e., a mid/side representation, then either block 813 or block 814 is selected, since the mid signal already represents the omnidirectional signal and the side signal already represents the directional component. Similarly, when it is determined that the input signal is a first-order Ambisonics signal (FOA), then either block 813 or block 814 is selected by the processing mode selector 812. However, when it is determined that the input signal is an A-format signal, then the linear combination (second mode) block 816 is selected in order to perform a linear transformation on the A-format signal to obtain the first-order Ambisonics signal having the omnidirectional component and three directional components, representing the K low-order components generated by block 810 of Fig. 8 or Fig. 6. Furthermore, Fig. 9 illustrates an energy compensator 900 that is configured to perform an energy compensation on the output of one of the blocks 813 to 816, in order to perform the diffuse compensation and/or the direct compensation with the corresponding gain values g and gs.
Hence, the implementation of the energy compensator 900 corresponds to the procedure of the sound component generator 650 or the sound component generator 750 of Fig. 6 and Fig. 7, respectively.

Fig. 10 illustrates a preferred implementation of the mid-order components generator 820 of Fig. 8, or of the part of the sound component generator 650 relating to the direct/diffuse (lower) arrow of block 650 for the first group. In particular, the mid-order components generator 820 comprises the reference signal generator 821, which receives the input signal and generates the reference signal by copying it or taking it as it is when the input signal is a mono signal, or by deriving the reference signal from the input signal by calculation, as discussed before or as illustrated in WO 2017/157803 A1, incorporated herein by reference in its entirety. Furthermore, Fig. 10 illustrates the directional gain calculator 410, which is configured to calculate, from the DOA information (φ, θ) and from a certain mode number m and a certain order number l, the directional gain $G_l^m$. In the preferred embodiment, where the processing is done in the time/frequency domain for each individual tile referenced by (k, n), the directional gain is calculated for each such time/frequency tile. The (direct) weighter 822 receives the reference signal and the diffuseness data for the certain time/frequency tile, and the result of the weighter 822 is the direct portion. The diffuse portion is generated by the processing performed by the decorrelation filter 823 and the subsequent weighter 824, which receives the diffuseness value Ψ for the certain time frame and frequency bin and, in particular, the average response for a certain mode m and order l, indicated by Dl and generated by the average response provider 826, which receives, as an input, the required mode m and the required order l.

The result of the weighter 824 is the diffuse portion, and the diffuse portion is added to the direct portion by the adder 825, in order to obtain a certain mid-order sound field component for a certain mode m and a certain order l. It is preferred to apply the diffuse compensation gain discussed with respect to Fig. 6 only to the diffuse portion generated by block 823. This can advantageously be done within the procedure performed by the (diffuse) weighter. Thus, only the diffuse portion of the signal is enhanced, in order to compensate for the loss of diffuse energy incurred by the higher-order components that do not receive a full synthesis, as illustrated in Fig. 10.

A direct-portion-only generation is illustrated in Fig. 11 for the high-order components generator. Basically, the high-order components generator is implemented in the same way as the mid-order components generator with respect to the direct branch, but does not comprise blocks 823, 824, 825 and 826. Thus, the high-order components generator only comprises the (direct) weighter 822, receiving input data from the directional gain calculator 410 and a reference signal from the reference signal generator 821. Preferably, only a single reference signal for the high-order components generator and the mid-order components generator is generated. However, both blocks can also have individual reference signal generators, as the case may be. Nevertheless, it is preferred to have only a single reference signal generator.
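By way of illustration, the per-component operation of the two generators of Fig. 10 and Fig. 11 can be sketched as follows (Python; the variable names and the SN3D average response D_l = 1/(2l + 1) are assumptions carried over from the earlier sketches):

```python
import numpy as np

def mid_order_component(p_ref, p_decorr, gain_lm, Psi, l, g=1.0):
    """Mid-order component: direct weighter (822) plus diffuse weighter (824),
    combined by the adder (825); D_l = 1/(2l + 1) is the assumed SN3D average
    response from the average response provider (826). The diffuse compensation
    gain g is applied to the diffuse portion only, as preferred above."""
    direct = np.sqrt(1.0 - Psi) * gain_lm * p_ref
    diffuse = g * np.sqrt(Psi / (2 * l + 1)) * p_decorr
    return direct + diffuse

def high_order_component(p_ref, gain_lm, Psi):
    """High-order component: the (direct) weighter (822) only, without the
    decorrelator branch (blocks 823, 824, 825, 826)."""
    return np.sqrt(1.0 - Psi) * gain_lm * p_ref
```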
Thus, the processing performed by the high-order components generator is extremely efficient, since only a single weighting operation with a certain directional gain and a certain diffuseness-dependent factor Ψ is to be performed per time/frequency tile. Thus, the high-order sound field components can be generated extremely efficiently and promptly, and any error due to the non-generation of diffuse components, or their non-usage in the output signal, is easily compensated for by enhancing the low-order sound field components or, preferably, only the diffuse portion of the mid-order sound field components.

Typically, the diffuse portion will not be available separately within the low-order sound field components generated by copying or by performing a (weighted) linear combination. However, enhancing the energy of such components automatically enhances the energy of the diffuse portion. The concurrent enhancement of the energy of the direct portion is not problematic, as has been found out by the inventors.

Subsequently, reference is made to Figs. 12a to 12c in order to further illustrate the calculation of the individual compensation gains. Fig. 12a illustrates a preferred implementation of the sound component generator 650 of Fig. 6. The (diffuse) compensation gain is calculated, in one embodiment, using the diffuseness value, the maximum order H and the truncation order L. In another embodiment, the diffuse compensation gain is calculated using the parameter Lk derived from the number of components in the low-order processing branch 810. Furthermore, the parameter mk is used, depending on the parameter Lk and the number K of components actually generated by the low-order components generator, and the value N, depending on Lk, is used as well. The values H, L in the first embodiment, or H, Lk, mk in the other, generally represent the number of sound field components in the second group (in relation to the number of sound components in the first group). Thus, the more components there are for which no diffuse component is synthesized, the higher the energy compensation gain will be. On the other hand, the higher the number of low-order sound field components that can be compensated, i.e., multiplied by the gain factor, the lower the gain factor can be. Generally, the gain factor g will always be greater than 1.

Fig. 12a illustrates the calculation of the gain factor g by the (diffuse) compensation gain calculator 910 and the subsequent application of this gain factor to the (low-order) component to be "corrected", as done by the compensation gain applicator 900. In case of linear values, the compensation gain applicator will be a multiplier, and in case of logarithmic values, the compensation gain applicator will be an adder. However, other implementations of the compensation gain application are possible, depending on the specific nature and way of calculating the compensation gain in block 910. Thus, the gain does not necessarily have to be a multiplicative gain, but can also be any other gain.

Fig. 12b illustrates a further implementation for the (direct) compensation gain processing. A (direct) compensation gain calculator 920 receives, as an input, the energy- or amplitude-related measure for the omnidirectional component, indicated as "power omnidirectional" in Fig. 12b. Furthermore, the second energy- or amplitude-related measure for the directional component is also input into block 920 as "power directional".
Furthermore, the direct compensation gain calculator 920 additionally receives the information Ql or, alternatively, the information N. N = 2l + 1 is the number of coefficients per order l, and Ql is equal to 1/N. Furthermore, the directional gain $G_l^m$ for the certain time/frequency tile (k, n) is also required for the calculation of the (direct) compensation gain. The directional gain is the same data derived from the directional gain calculator 410 of Fig. 4, for example. The (direct) compensation gain gs is forwarded from block 920 to the compensation gain applicator 900, which can be implemented in a similar way as before, i.e., it receives the component(s) to be "corrected" and outputs the corrected component(s).

Fig. 12c illustrates a preferred implementation of the combination of the energy compensation of diffuse sound components and the energy normalization of direct sound components, performed jointly. To this end, the (diffuse) compensation gain g and the (direct) compensation gain gs are input into a gain combiner 930. The result of the gain combiner (i.e., the combined gain) is input into a gain manipulator 940 that is implemented as a post-processor and performs a limitation to a minimum or a maximum value, or applies a compression function in order to perform some kind of softer limitation, or performs a smoothing over time or frequency tiles. The manipulated gain, which may be limited, compressed, smoothed or processed in other post-processing ways, is then applied by the gain applicator to the low-order component(s) to obtain corrected low-order component(s). In case of linear gains g, gs, the gain combiner 930 is implemented as a multiplier. In case of logarithmic gains, the gain combiner is implemented as an adder.

Furthermore, regarding the implementation of the estimator of Fig. 7, indicated at reference number 620, it is noted that the estimator 620 can provide any energy- or amplitude-related measures for the omnidirectional and the directional component, as long as the power applied to the amplitude is greater than 1. In case of a power as the energy- or amplitude-related measure, the exponent is equal to 2. However, exponents between 1.5 and 2.5 are useful as well. Furthermore, even higher exponents or powers are useful, such as a power of 3 applied to the amplitude, corresponding to a loudness value rather than a power value. Thus, in general, powers of 2 or 3 are preferred for providing the energy- or amplitude-related measures, but powers between 1.5 and 4 are generally useful as well.

Subsequently, several examples for the aspects of the invention are summarized.

Main Example 1a for the first aspect (Energy Compensation for the Diffuse Sound Components)

1a.
Apparatus for generating a sound field description from an input signal comprising one or more channels, the apparatus comprising: an input signal analyzer for obtaining diffuseness data from the input signal; a sound component generator for generating, from the input signal, one or more sound field components of a first group of sound field components having, for each sound field component, a direct component and a diffuse component, and for generating, from the input signal, a second group of sound field components having only a direct component, wherein the sound component generator is configured to perform an energy compensation when generating the first group of sound field components, the energy compensation depending on the diffuseness data and a number of sound field components in the second group.

Main Example 1b for the second aspect (Energy Normalization for the Direct Signal Components)

1b. Apparatus for generating a sound field description from an input signal comprising at least two channels, the apparatus comprising: an input signal analyzer for obtaining direction data and diffuseness data from the input signal; an estimator for estimating a first amplitude-related measure for an omnidirectional component derived from the input signal and for estimating a second amplitude-related measure for a directional component derived from the input signal; and a sound component generator for generating sound field components of the sound field, wherein the sound component generator is configured to perform an energy compensation of the directional component using the first amplitude-related measure, the second amplitude-related measure, the direction data and the diffuseness data.

Main Example 1c for the third aspect (System Implementation with Different Generator Branches)

1c. Apparatus for generating a sound field description using an input signal comprising a mono-signal or a multi-channel signal, the apparatus comprising: an input signal analyzer for analyzing the input signal to derive direction data and diffuseness data; a low-order components generator for generating a low-order sound description from the input signal up to a predetermined order and mode, wherein the low-order components generator is configured to derive the low-order sound description by copying the input signal or performing a weighted combination of the channels of the input signal; a mid-order components generator for generating a mid-order sound description above the predetermined order, or at the predetermined order and above the predetermined mode, and below or at a first truncation order, using a synthesis of at least one direct portion and of at least one diffuse portion using the direction data and the diffuseness data, so that the mid-order sound description comprises a direct contribution and a diffuse contribution; and a high-order components generator for generating a high-order sound description having a component above the first truncation order, using a synthesis of at least one direct portion without any diffuse component synthesis, so that the high-order sound description comprises a direct contribution only.

2.
The apparatus according to example 1a, 1b or 1c, wherein the low-order sound description, the mid-order sound description or the high-order sound description contain sound field components of the output sound field which are orthogonal, so that no two sound descriptions contain one and the same sound field components, or wherein the mid-order components generator generates components below or at the first truncation order not used by the low-order components generator.

3. Apparatus of one of the preceding examples, comprising: receiving an input down-mix signal having one or more audio channels that represent the sound field; receiving or determining one or more sound directions that represent the sound field; evaluating one or more spatial basis functions using the one or more sound directions; deriving a first set of one or more sound field components from a first weighted combination of input down-mix signal channels; deriving a second set of one or more direct sound field components from a second weighted combination of input down-mix signal channels and the one or more evaluated spatial basis functions; and combining the first set of one or more sound field components and the second set of one or more sound field components.

4. Apparatus of one of the preceding examples, where the first and second sets of sound field components are orthogonal.

5. Apparatus of one of the preceding examples, where the sound field components are the coefficients of orthogonal basis functions.

6. Apparatus of one of the preceding examples, where the sound field components are the coefficients of spatial basis functions.

7. Apparatus of one of the preceding examples, where the sound field components are the coefficients of spherical or circular harmonics.

8. Apparatus of one of the preceding examples, where the sound field components are Ambisonics coefficients.

9. Apparatus of one of the preceding examples, where the input down-mix signal has fewer than three audio channels.

10. Apparatus of one of the preceding examples, further comprising: receiving or determining a diffuseness value; generating one or more diffuse sound components as a function of the diffuseness value; and combining the one or more diffuse sound components with the second set of one or more direct sound field components.

11. Apparatus of one of the preceding examples, wherein a diffuse component generator further comprises a decorrelator for decorrelating diffuse sound information.

12. Apparatus of one of the preceding examples, wherein the first set of one or more sound field components is derived from the diffuseness value.

13. Apparatus of one of the preceding examples, wherein the first set of one or more sound field components is derived from the one or more sound directions.

14. Apparatus of one of the preceding examples that derives time-frequency dependent sound directions.

15. Apparatus of one of the preceding examples that derives time-frequency dependent diffuseness values.

16. Apparatus of one of the preceding examples, further comprising: decomposing the plurality of channels of the time-domain down-mix signal into a frequency representation having a plurality of time-frequency tiles.
17. Method for generating a sound field description from an input signal comprising one or more channels, comprising: obtaining diffuseness data from the input signal; generating, from the input signal, one or more sound field components of a first group of sound field components having, for each sound field component, a direct component and a diffuse component; and generating, from the input signal, a second group of sound field components having only a direct component, wherein the generating comprises performing an energy compensation when generating the first group of sound field components, the energy compensation depending on the diffuseness data and a number of sound field components in the second group.

18. Method for generating a sound field description from an input signal comprising at least two channels, comprising: obtaining direction data and diffuseness data from the input signal; estimating a first amplitude-related measure for an omnidirectional component derived from the input signal and estimating a second amplitude-related measure for a directional component derived from the input signal; and generating sound field components of the sound field, wherein the generating comprises performing an energy compensation of the directional component using the first amplitude-related measure, the second amplitude-related measure, the direction data and the diffuseness data.

19. Method for generating a sound field description using an input signal comprising a mono-signal or a multi-channel signal, comprising: analyzing the input signal to derive direction data and diffuseness data; generating a low-order sound description from the input signal up to a predetermined order and mode, the low-order sound description being derived by copying the input signal or performing a weighted combination of the channels of the input signal; generating a mid-order sound description above the predetermined order, or at the predetermined order and above the predetermined mode, and below a high order, using a synthesis of at least one direct portion and of at least one diffuse portion using the direction data and the diffuseness data, so that the mid-order sound description comprises a direct contribution and a diffuse contribution; and generating a high-order sound description having a component at or above the high order, using a synthesis of at least one direct portion without any diffuse component synthesis, so that the high-order sound description comprises a direct contribution only.

20. Computer program for performing, when running on a computer or a processor, the method of one of examples 17, 18, or 19.

It is to be mentioned here that all alternatives or aspects as discussed before, and all aspects as defined by the independent claims in the following claims, can be used individually, i.e., without any other alternative or object than the contemplated alternative, object or independent claim. However, in other embodiments, two or more of the alternatives or the aspects or the independent claims can be combined with each other and, in other embodiments, all aspects, alternatives and independent claims can be combined with each other.

An inventively encoded audio signal can be stored on a digital storage medium or a non-transitory storage medium, or can be transmitted on a transmission medium such as a wireless transmission medium or a wired transmission medium such as the Internet.
Although some aspects have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus.

Depending on certain implementation requirements, embodiments of the invention can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed.

Some embodiments according to the invention comprise a data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed.

Generally, embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer. The program code may, for example, be stored on a machine-readable carrier. Other embodiments comprise the computer program for performing one of the methods described herein, stored on a machine-readable carrier or a non-transitory storage medium.

In other words, an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer. A further embodiment of the inventive methods is, therefore, a data carrier (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein. A further embodiment of the inventive method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein. The data stream or the sequence of signals may, for example, be configured to be transferred via a data communication connection, for example via the Internet.

A further embodiment comprises a processing means, for example a computer or a programmable logic device, configured to or adapted to perform one of the methods described herein. A further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein.

In some embodiments, a programmable logic device (for example a field programmable gate array) may be used to perform some or all of the functionalities of the methods described herein. In some embodiments, a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein. Generally, the methods are preferably performed by any hardware apparatus.

The above-described embodiments are merely illustrative of the principles of the present invention. It is understood that modifications and variations of the arrangements and the details described herein will be apparent to others skilled in the art.
It is the intent, therefore, to be limited only by the scope of the appended patent claims and not by the specific details presented by way of description and explanation of the embodiments herein.

References (all incorporated in their entirety by reference):

[1] V. Pulkki, M.-V. Laitinen, J. Vilkamo, J. Ahonen, T. Lokki and T. Pihlajamäki, "Directional audio coding - perception-based reproduction of spatial sound", International Workshop on the Principles and Applications of Spatial Hearing, Nov. 2009, Zao, Miyagi, Japan.

[2] M.-V. Laitinen and V. Pulkki, "Converting 5.1 audio recordings to B-format for directional audio coding reproduction," 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, 2011, pp. 61-64.

[3] R. K. Furness, "Ambisonics - An overview," in AES 8th International Conference, April 1990, pp. 181-189.

[4] C. Nachbar, F. Zotter, E. Deleflie, and A. Sontacchi, "AMBIX - A Suggested Ambisonics Format", Proceedings of the Ambisonics Symposium 2011.

[5] "APPARATUS, METHOD OR COMPUTER PROGRAM FOR GENERATING A SOUND FIELD DESCRIPTION" (corresponding to WO 2017/157803 A1).

Claims

1. Apparatus for generating a sound field description from an input signal comprising one or more channels, the apparatus comprising: an input signal analyzer (600) for obtaining diffuseness data from the input signal; a sound component generator (650) for generating, from the input signal, one or more sound field components of a first group of sound field components having for each sound field component a direct component and a diffuse component, and for generating, from the input signal, a second group of sound field components having only a direct component, wherein the sound component generator is configured to perform an energy compensation when generating the first group of sound field components, the energy compensation depending on the diffuseness data and at least one of a number of sound field components in the second group, a number of diffuse components in the first group, a maximum order of sound field components of the first group, and a maximum order of sound field components of the second group.

2. Apparatus of claim 1, wherein the sound component generator (650) comprises a mid-order components generator (820) comprising: a reference signal provider (821) for providing a reference signal for a sound field component of the first group of sound field components; a decorrelator (823, 824) for generating a decorrelated signal from the reference signal, wherein the direct component of the sound field component of the first group is derived from the reference signal, and wherein the diffuse component of the sound field component of the first group is derived from the decorrelated signal; and a mixer (825) for mixing the direct component and the diffuse component using at least one of the direction of arrival data provided by the input signal analyzer (600) and the diffuseness data (a sketch of such a generator follows claim 3).

3. Apparatus of claim 1 or 2, wherein the input signal comprises only a single mono channel, and wherein the sound field components of the first group of sound field components are sound field components of a first order or a higher order, or wherein the input signal comprises two or more channels, and wherein the sound field components of the first group of sound field components are sound field components of a second order or a higher order.
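The structure of claim 2's mid-order components generator can be pictured with the following hedged sketch, one small function per claimed unit (reference signal provider, decorrelator, mixer). The random-phase decorrelator and the sqrt-based gains are illustrative assumptions; the claim only requires that the direct part comes from the reference signal and the diffuse part from a decorrelated version, mixed using direction and diffuseness data.

```python
import numpy as np

_rng = np.random.default_rng(2)

def reference_signal(downmix, weights):
    # Reference signal provider (821): weighted combination of input channels.
    return weights @ downmix

def decorrelate(reference, n):
    # Decorrelator (823, 824): toy random-phase version of the reference.
    return reference * np.exp(1j * _rng.uniform(0.0, 2.0 * np.pi, n))

def mix(reference, sh_at_doa, psi):
    # Mixer (825): combine direct and diffuse parts using DOA and diffuseness data.
    direct = np.sqrt(1.0 - psi) * sh_at_doa * reference
    diffuse = np.sqrt(psi) * decorrelate(reference, sh_at_doa.shape[0])
    return direct + diffuse
```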
4. Apparatus of one of the preceding claims, wherein the input signal comprises a mono signal or at least two channels, and wherein the sound component generator (650) comprises a low-order components generator for generating the low-order sound field components by copying or taking the input signal, or by performing a weighted combination of the channels of the input signal.

5. Apparatus of claim 4, wherein the input signal comprises the mono signal, and wherein the low-order components generator (810) is configured to generate a zero-order Ambisonics signal by taking or copying the mono signal (813, 814), or wherein the input signal comprises at least two channels, and wherein the low-order components generator (810) is configured to generate a zero-order Ambisonics signal by adding the two channels and to generate a first-order Ambisonics signal based on a difference of the two channels (815), or wherein the input signal comprises a first-order Ambisonics signal with three or four channels, and wherein the low-order components generator (810) is configured to generate a first-order Ambisonics signal by taking or copying the three or four channels of the input signal (813, 814), or wherein the input signal comprises an A-format signal having four channels, and wherein the low-order components generator (810) is configured to calculate a first-order Ambisonics signal by performing a weighted linear combination of the four channels (816).

6. Apparatus of one of the preceding claims, wherein the sound component generator (650) comprises a high-order components generator (830) for generating the sound field components of the second group, the sound field components of the second group having an order higher than a truncation order used for generating the sound field components of the first group of sound field components.

7. Apparatus in accordance with one of the preceding claims, wherein the sound component generator (650) comprises an energy compensator (910, 900) for performing the energy compensation of the sound field components of the first group, the energy compensator comprising a compensation gain calculator (910) for calculating a compensation gain using the diffuseness data, a maximum order of the sound field components of the first group and a maximum order of the sound field components of the second group.

8. Apparatus of claim 7, wherein the compensation gain calculator (910) is configured to calculate a gain factor based on the following equation, wherein Ψ represents the diffuseness data, H represents the maximum order of the second group of sound field components, and L represents the maximum order of the first group, or wherein the following holds, wherein K represents the number of diffuse components in the first group.

9. Apparatus of claim 7 or 8, wherein the gain calculator (910) is configured to increase the compensation gain with an increasing number of sound field components of the second group, or to decrease the compensation gain with an increasing number of sound field components of the first group, or to increase the compensation gain with an increasing diffuseness value, or to increase the compensation gain with an increasing number of sound field components in the second group, or to decrease the compensation gain with an increasing number of diffuse components in the first group (an illustrative gain with these properties is sketched after this claim).
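The equations of claim 8 are rendered as figures in the source and are not reproduced here. The gain below is therefore only a hypothetical stand-in, chosen to exhibit the monotonic behaviour recited in claim 9: it grows with the diffuseness Ψ and with the maximum order H of the second group, and shrinks as the number K of diffuse components in the first group grows. It is not the claimed equation.

```python
import numpy as np

def compensation_gain(psi, order_l, order_h, k_diffuse=None):
    """Hypothetical stand-in gain; NOT the claimed equation (not reproduced in source)."""
    if k_diffuse is None:
        k_diffuse = (order_l + 1) ** 2   # assume all first-group components carry diffuse energy
    n_total = (order_h + 1) ** 2         # number of components up to order H
    # Compensates the diffuse energy missing from the (n_total - k_diffuse)
    # direct-only components of the second group.
    return np.sqrt(1.0 + psi * (n_total - k_diffuse) / k_diffuse)
```

For instance, with Ψ = 1, L = 1 and H = 3, this stand-in yields g = sqrt(1 + 12/4) = 2, i.e., the fully diffuse first-group components are boosted to carry the energy the twelve direct-only components omit.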
10. Apparatus of claim 8 or 9, wherein the gain calculator (920) is configured for calculating the compensation gain additionally using a first energy- or amplitude-related measure for an omnidirectional component derived from the input signal and using a second energy- or amplitude-related measure for a directional component derived from the input signal, the diffuseness data, and direction data obtained from the input signal.

11. Apparatus of one of claims 8 to 10, wherein the compensation gain calculator (910, 920) is configured to calculate (910) a first gain factor depending on the diffuseness data and at least one of the number of sound field components in the second group, the number of diffuse components in the first group, the maximum order of sound field components of the first group, and the maximum order of sound field components of the second group, to calculate (920) a second gain factor (gs) depending on a first energy- or amplitude-related measure for an omnidirectional component derived from the input signal, a second energy- or amplitude-related measure for a directional component derived from the input signal, the direction data and the diffuseness data, and to calculate (930) the compensation gain using the first gain factor and the second gain factor.

12. Apparatus of one of claims 7 to 11, wherein the compensation gain calculator (910, 920, 930, 940) is configured to perform a gain factor manipulation (940) using a limitation with a fixed maximum threshold or a fixed minimum threshold, or using a compression function for compressing low or high gain factors towards medium gain factors, to obtain the compensation gain (see the sketch following claim 17).

13. Apparatus of one of claims 7 to 12, wherein the energy compensator (910, 920, 930, 940) comprises a compensation gain applicator (900) for applying the compensation gain to at least one sound field component of the first group.

14. Apparatus of claim 13, wherein the compensation gain applicator (900) is configured to apply the compensation gain to each sound field component of the first group, or to only one or more sound field components of the first group with a diffuse portion, or to diffuse portions of the sound field components of the first group.

15. Apparatus of one of the preceding claims, wherein the input signal analyzer (600) is configured to extract the diffuseness data from metadata associated with the input signal, or to extract the diffuseness data from the input signal by a signal analysis (610, 620) of the input signal having two or more channels or components.

16. Apparatus of one of the preceding claims, wherein the input signal only comprises one or two sound field components up to an input order, and wherein the sound component generator (650) comprises a sound field components combiner (430) for combining the sound field components of the first group and the sound field components of the second group to obtain a sound field description up to an output order higher than the input order.

17. Apparatus of one of the preceding claims, further comprising: an analysis filter bank (400) for generating the one or more sound field components of the first group and the second group for a plurality of different time-frequency tiles, wherein the input signal analyzer (600) is configured to obtain a diffuseness data item for each time-frequency tile, and wherein the sound component generator (650) is configured to perform the energy compensation separately for each time-frequency tile.
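Claim 12's gain factor manipulation admits a simple sketch. The threshold values and the tanh-shaped compressor below are assumptions for illustration; the claim only requires fixed minimum/maximum limits or a compression of extreme gains towards medium gains.

```python
import numpy as np

def manipulate_gain(g, g_min=0.5, g_max=4.0, mode="limit", g_mid=1.0):
    if mode == "limit":
        # Limitation with fixed minimum/maximum thresholds.
        return np.clip(g, g_min, g_max)
    # Compression: pull low or high gain factors towards a medium gain factor.
    return g_mid + np.tanh(g - g_mid)
```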
18. Apparatus of one of the preceding claims, further comprising: a high-order decoder (440) for using the one or more sound field components of the first group and the one or more sound field components of the second group to generate a spectral-domain or time-domain representation of the sound field description generated from the input signal.

19. Apparatus of one of the preceding claims, wherein the first group of sound field components and the second group of sound field components are orthogonal to each other, or wherein the sound field components are at least one of coefficients of orthogonal basis functions, coefficients of spatial basis functions, coefficients of spherical or circular harmonics, and Ambisonics coefficients.

20. Method for generating a sound field description from an input signal comprising one or more channels, comprising: obtaining diffuseness data from the input signal; generating, from the input signal, one or more sound field components of a first group of sound field components having for each sound field component a direct component and a diffuse component, and generating, from the input signal, a second group of sound field components having only a direct component, wherein the generating comprises performing an energy compensation when generating the first group of sound field components, the energy compensation depending on the diffuseness data and at least one of a number of sound field components in the second group, a number of diffuse components in the first group, a maximum order of sound field components of the first group, and a maximum order of sound field components of the second group (an end-to-end sketch follows below).

21. Computer program for performing, when running on a computer or a processor, the method of claim 20.
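Finally, the steps of method 20 compose into the following end-to-end skeleton for one time-frequency tile. It reuses the hedged conventions of the earlier sketches (sqrt mixing rule, toy random-phase decorrelation, stand-in compensation gain); every name and the gain formula are illustrative assumptions, not the claimed processing.

```python
import numpy as np

_rng = np.random.default_rng(3)

def generate_sound_field(tile, psi, sh, n_first):
    """tile: complex mono down-mix value; sh: basis values up to the output order."""
    # First group: direct + diffuse components, with energy compensation.
    direct1 = np.sqrt(1.0 - psi) * sh[:n_first] * tile
    diffuse1 = np.sqrt(psi) * tile * np.exp(1j * _rng.uniform(0.0, 2.0 * np.pi, n_first))
    k, n = n_first, sh.shape[0]
    gain = np.sqrt(1.0 + psi * (n - k) / k)   # stand-in compensation gain (see above)
    first_group = gain * (direct1 + diffuse1)
    # Second group: direct contribution only, no diffuse synthesis.
    second_group = np.sqrt(1.0 - psi) * sh[n_first:] * tile
    # Combine both groups into the output sound field description.
    return np.concatenate([first_group, second_group])
```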

Documents

Orders

Section Controller Decision Date

Application Documents

# Name Date
1 202137024174-FORM 1 [31-05-2021(online)].pdf 2021-05-31
2 202137024174-COMPLETE SPECIFICATION [31-05-2021(online)].pdf 2021-05-31
3 202137024174-DRAWINGS [31-05-2021(online)].pdf 2021-05-31
4 202137024174-FIGURE OF ABSTRACT [31-05-2021(online)].pdf 2021-05-31
5 202137024174-DECLARATION OF INVENTORSHIP (FORM 5) [31-05-2021(online)].pdf 2021-05-31
6 202137024174-STATEMENT OF UNDERTAKING (FORM 3) [31-05-2021(online)].pdf 2021-05-31
7 202137024174-FORM 18 [10-06-2021(online)].pdf 2021-06-10
8 202137024174-Proof of Right [21-07-2021(online)].pdf 2021-07-21
9 202137024174-FORM-26 [27-07-2021(online)].pdf 2021-07-27
10 202137024174.pdf 2021-10-19
11 202137024174-Information under section 8(2) [23-10-2021(online)].pdf 2021-10-23
12 202137024174-FER.pdf 2022-03-10
13 202137024174-FORM 3 [12-04-2022(online)].pdf 2022-04-12
14 202137024174-Information under section 8(2) [12-04-2022(online)].pdf 2022-04-12
15 202137024174-FORM 4(ii) [24-08-2022(online)].pdf 2022-08-24
16 202137024174-FORM 3 [12-10-2022(online)].pdf 2022-10-12
17 202137024174-Information under section 8(2) [12-10-2022(online)].pdf 2022-10-12
18 202137024174-CLAIMS [08-12-2022(online)].pdf 2022-12-08
19 202137024174-FER_SER_REPLY [08-12-2022(online)].pdf 2022-12-08
20 202137024174-FORM 3 [13-04-2023(online)].pdf 2023-04-13
21 202137024174-Information under section 8(2) [13-04-2023(online)].pdf 2023-04-13
22 202137024174-Information under section 8(2) [19-05-2023(online)].pdf 2023-05-19
23 202137024174-Information under section 8(2) [26-06-2023(online)].pdf 2023-06-26
24 202137024174-Information under section 8(2) [27-07-2023(online)].pdf 2023-07-27
25 202137024174-FORM 3 [14-10-2023(online)].pdf 2023-10-14
26 202137024174-Information under section 8(2) [14-10-2023(online)].pdf 2023-10-14
27 202137024174-US(14)-HearingNotice-(HearingDate-16-01-2024).pdf 2024-01-03
28 202137024174-FORM-26 [11-01-2024(online)].pdf 2024-01-11
29 202137024174-REQUEST FOR ADJOURNMENT OF HEARING UNDER RULE 129A [11-01-2024(online)].pdf 2024-01-11
30 202137024174-US(14)-ExtendedHearingNotice-(HearingDate-20-02-2024).pdf 2024-01-11
31 202137024174-Correspondence to notify the Controller [15-02-2024(online)].pdf 2024-02-15
32 202137024174-Annexure [01-03-2024(online)].pdf 2024-03-01
33 202137024174-Information under section 8(2) [01-03-2024(online)].pdf 2024-03-01
34 202137024174-Written submissions and relevant documents [01-03-2024(online)].pdf 2024-03-01
35 202137024174-PatentCertificate04-03-2024.pdf 2024-03-04
36 202137024174-IntimationOfGrant04-03-2024.pdf 2024-03-04

Search Strategy

1 SearchHistory(4)E_09-03-2022.pdf

ERegister / Renewals

3rd: 29 Apr 2024 (From 06/12/2021 to 06/12/2022)
4th: 29 Apr 2024 (From 06/12/2022 to 06/12/2023)
5th: 29 Apr 2024 (From 06/12/2023 to 06/12/2024)
6th: 29 Apr 2024 (From 06/12/2024 to 06/12/2025)