Sign In to Follow Application
View All Documents & Correspondence

Apparatus And Method For Audio Rendering Employing A Geometric Distance Definition

Abstract: An apparatus (100) for playing back an audio object associated with a position is provided. The apparatus (100) comprises a distance calculator (110) for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator (110) is configured to take a solution with a smallest distance. The apparatus (100) is configured to play back the audio object using the speaker corresponding to the solution.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
20 September 2016
Publication Number
51/2016
Publication Type
INA
Invention Field
COMMUNICATION
Status
Email
Parent Application
Patent Number
Legal Status
Grant Date
2023-11-06
Renewal Date

Applicants

FRAUNHOFER GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
Hansastraße 27c 80686 München

Inventors

1. PLOGSTIES Jan
Sandleithe 39a 90768 Fürth
2. FÜG Simone
Weißgasse 24 90562 Kalchreuth
3. NEUENDORF Max
Paradiesstraße 20 90459 Nürnberg
4. HERRE Jürgen
Rathsberger Str. 10a 91054 Erlangen
5. GRILL Bernhard
Obere Bergstraße 14 90607 Rückersdorf

Specification

Apparatus and Method for Audio Rendering Employing a Geometric Distance Definition Description The present invention relates to audio signal processing, in particular, to an apparatus and a method for audio rendering, and, more particularly, to an apparatus and a method for audio rendering employing a geometric distance definition. With increasing multimedia content consumption in daily life, the demand for sophisticated multimedia solutions steadily increases. In this context, positioning of audio objects plays an important role. An optimal positioning of audio objects for an existing loudspeaker setup would be desirable. In the state of the art, audio objects are known. Audio objects may, e.g., be considered as sound tracks with associated metadata. The metadata may, e.g., describe the characteristics of the raw audio data, e.g. , the desired playback position or the volume level. An advantage of object-based audio is that a predefined movement can be reproduced by a special rendering process on the playback side in the best way possible for all reproduction loudspeaker layouts. Geometric metadata can be used to define where an audio object should be rendered, e.g. , angles in azimuth or elevation or absolute positions relative to a reference point, e.g., the listener. The metadata is stored or transmitted along with the object audio signals. In the context of MPEG-H, at the 105th MPEG meeting the audio group reviewed the requirements and timelines of different application standards (MPEG = Moving Picture Experts Group). According to that review, it would be essential to meet certain points in time and specific requirements for a next generation broadcast system. According to that, a system should be able to accept audio objects at the encoder input. Moreover, the system should support signaling, delivery and rendering of audio objects and should enable user control of objects, e.g. , for dialog enhancement, alternative language tracks and audio description language. In the state of the art. different concepts are known. A first concept is reflected sound rendering for object-based audio (see [2]). Snap to speaker location information is included in a metadata definition as useful rendering information. However, in [2], no information is provided how the information is used in the playback process. Moreover, no information is provided how a distance between two positions is determined. Another concept of the state of the art, system and tools for enhanced 3D audio authoring and rendering is described in [5]. Fig. 6B of document [5] is a diagram illustrating how a "snapping" to a speaker might be algorithmically realized. In detail, according to the document [5] if it is determined to snap the audio object position to a speaker location (see block 665 of Fig. 6B of document [5]), the audio object position will be mapped to a speaker location (see block 670 of Fig. 6B of document [5]), generally the one closest to the intended (x,y,z) position received for the audio object. According to [5], the snapping might be applied to a small group of reproduction speakers and/or to an individual reproduction speaker. However, [5] employs Cartesian (x,y,z) coordinates instead of spherical coordinates. Moreover, the renderer behavior is just described as map audio object position to a speaker location; if the snap flag is one, no detailed description is provided. Furthermore, no details are provided how the closest speaker is determined. According to another prior art, System and Method for Adaptive Audio Signal Generation, Coding and Rendering, described in document [1 ], metadata information (metadata elements) specify that "one or more sound components are rendered to a speaker feed for playback through a speaker nearest an intended playback location of the sound component, as indicated by the position metadata". However, no information is provided, how the nearest speaker is determined. In a further prior art, audio definition model, described in document [4], a metadata flag is defined called "channelLock". If set to 1 , a renderer can lock the object to the nearest channel or speaker, rather than normal rendering. However, no determination of the nearest channel is described. In another prior art, upmixing of object based audio is described (see [3]). Document [3] describes a method for the usage of a distance measure of speakers in a different field of application: Here it is used for upmixing object-based audio material. The rendering system is configured to determine, from an object based audio program (and knowledge of the positions of the speakers to be employed to play the program), the distance between each position of an audio source indicated by the program and the position of each of the speakers. Furthermore, the rendering system of [3] is configured to determine, for each actual source position (e.g , each source position along a source trajectory) indicated by the program, a subset of the full set of speakers (a "primary" subset) consisting of those speakers of the full set which are (or the speaker of the full set which is) closest to the actuai source position, where "closest" in this context is defined in some reasonably defined sense. However, no information is provided how the distance should be calculated. The object of the present invention is to provide improved concepts for audio rendering. The object of the present invention is solved by an apparatus according to claim 1 . by a decoder device according to claim 13, by a method according to claim 14 and by a computer program according to claim 15. An apparatus for playing back an audio object associated with a position is provided. The apparatus comprises a distance calculator for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator is configured to take a solution with a smallest distance. The apparatus is configured to play back the audio object using the speaker corresponding to the solution. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances of the position to the speakers or to read the distances of the position to the speakers only if a closest speaker playout flag (mdae_closestSpeakerPlayout), being received by the apparatus, is enabled. Moreover, the distance calculator may, e.g. , be configured to take a solution with a smallest distance only if the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled. Furthermore, the apparatus may, e.g. , be configured to play back the audio object using the speaker corresponding to the solution only of the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled. In an embodiment, the apparatus may, e.g , be configured to not conduct any rendering on the audio object, if the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled. According to an embodiment, the distance calculator may. e.g. , be configured to calculate the distances depending on a distance function which returns a weighted Euclidian distance or a great-arc distance. In an embodiment, the distance calculator may, e.g., be configured to calculate the distances depending on a distance function which returns weighted absolute differences in azimuth and elevation angles. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances depending on a distance function which returns weighted absolute differences to the power p, wherein p is a number. In an embodiment, p may, e.g., be set to p = 2. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances depending on a distance function which returns a weighted angular difference. In an embodiment, the distance function may. e.g.. be defined according to diffAngle = acos(cos(azDiff) * cos(elDiff)), wherein azDiff indicates a difference of two azimuth angles, wherein elDiff indicates a difference of two elevation angles, and wherein diffAngle indicates the weighted angular difference. According to an embodiment, the distance calculator may, e.g.. be configured to calculate the distances of the position to the speakers, so that each distance Δ (Ρ1 ( Ρ:) 0f the positon to one of the speakers is calculated according to oil indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, β indicates an elevation angle of the position, and β indicates an elevation angle of said one of the speakers. Or α-, indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, ?■, indicates an elevation angle of said one of the speakers, and β2 indicates an elevation angle of the position. In an embodiment, the distance calculator may, e.g.. be configured to calculate the distances of the position to the speakers, so that each distance of the positon to one of the speakers is calculated according to Δ (Px , P2 ) = I ¾ - β21 + 1 ax - a2 \ + 1 / Ί - r21 «! indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, β1 indicates an elevation angle of the position. β2 indicates an elevation angle of said one of the speakers, τ indicates a radius of the position and r2 indicates a radius of said one of the speakers. Or indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, β^ indicates an elevation angle of said one of the speakers, β2 indicates an elevation angle of the position, r-, indicates a radius of said one of the speakers and r2 indicates a radius of the position. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances of the position to the speakers, so that each distance Δ (Ρ., ,, Ρ2) 0f the positon to one of the speakers is calculated according to Δ(Ρ1, Ρ2) = b - \βχ - /?2 | + α · - a2 \ «1 indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, βλ indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, a is a first number, and b is a second number. Or cti indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, β indicates an elevation angle of said one of the speakers, β2 indicates an elevation angle of the position, a is a first number, and b is a second number. In an embodiment, the distance calculator may, e.g., be configured to calculate the distances of the position to the speakers, so that each distance Λ(Ρ1, Ρ2) 0f the positon to one of the speakers is calculated according to A(P P2) = b■ \βχ - β2 \ + a■ - a2 \ + c■ |n - r2 | a- indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, β^ indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, rA indicates a radius of the position, r2 indicates a radius of said one of the speakers, a is a first number, and h is a second number. Or, α indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, βλ indicates an elevation angle of said one of the speakers, and β2 indicates an elevation angle of the position, r-t indicates a radius of said one of the speakers, and r2 indicates a radius of the position, a is a first number, b is a second number, and c is a third number. According to an embodiment, a decoder device is provided. The decoder device comprises a USAC decoder for decoding a bitstream to obtain one or more audio input channels, to obtain one or more input audio objects, to obtain compressed object metadata and to obtain one or more SAOC transport channels. Moreover, the decoder device comprises an SAOC decoder for decoding the one or more SAOC transport channels to obtain a group of one or more rendered audio objects. Furthermore, the decoder device comprises an object metadata decoder for decoding the compressed object metadata to obtain uncompressed metadata. Moreover, the decoder device comprises a format converter for converting the one or more audio input channels to obtain one or more converted channels. Furthermore, the decoder device comprises a mixer for mixing the one or more rendered audio objects of the group of one or more rendered audio objects, the one or more input audio objects and the one or more converted channels to obtain one or more decoded audio channels. The object metadata decoder and the mixer together form an apparatus according to one of the above-described embodiments. The object metadata decoder comprises the distance calculator of the apparatus according to one of the above-described embodiments, wherein the distance calculator is configured, for each input audio object of the one or more input audio objects, to calculate distances of the position associated with said input audio object to speakers or for reading the distances of the position associated with said input audio object to the speakers, and to take a solution with a smallest distance. The mixer is configured to output each input audio object of the one or more input audio objects within one of the one or more decoded audio channels to the speaker corresponding to the solution determined by the distance calculator of the apparatus according to one of the above-described embodiments for said input audio object. A method for playing back an audio object associated with a position, comprising: Calculating distances of the position to speakers or reading the distances of the position to the speakers. Taking a solution with a smallest distance. And: Playing back the audio object using the speaker corresponding to the solution. Moreover, a computer program for implementing the above-described method when being executed on a computer or signal processor is provided. In the following, embodiments of the present invention are described in more detail with reference to the figures, in which: Fig. 1 is an apparatus according to an embodiment, Fig. 2 illustrates an object renderer according to an embodiment, Fig. 3 illustrates an object metadata processor according to an embodiment, Fig. 4 illustrates an overview of a 3D-audio encoder, Fig. 5 illustrates an overview of a 3D-Audio decoder according to an embodiment, and Fig. 6 illustrates a structure of a format converter. Fig. 1 illustrates an apparatus 100 for playing back an audio object associated with a position is provided. The apparatus 100 comprises a distance calculator 1 10 for calculating distances of the position to speakers or for reading the distances of the position to the speakers. The distance calculator 1 10 is configured to take a solution with a smallest distance. The apparatus 100 is configured to play back the audio object using the speaker corresponding to the solution. For example, for each loudspeaker, a distance between the position (the audio object position) and said loudspeaker (the location of said loudspeaker) is determined. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances of the position to the speakers or to read the distances of the position to the speakers only if a closest speaker playout flag (mdae_closestSpeakerPlayout). being received by the apparatus 100, is enabled. Moreover, the distance calculator may, e.g , be configured to take a solution with a smallest distance only if the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled. Furthermore, the apparatus 100 may. e.g., be configured to play back the audio object using the speaker corresponding to the solution only of the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled. In an embodiment, the apparatus 100 may. e.g., be configured to not conduct any rendering on the audio object, if the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances depending on a distance function which returns a weighted Euclidian distance or a great-arc distance. In an embodiment, the distance calculator may, e.g., be configured to calculate the distances depending on a distance function which returns weighted absolute differences in azimuth and elevation angles. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances depending on a distance function which returns weighted absolute differences to the power p, wherein p is a number. In an embodiment, p may, e.g., be set to p = 2. According to an embodiment, the distance calculator may, e.g., be configured to calculate the distances depending on a distance function which returns a weighted angular difference. In an embodiment, the distance function may. e.g.. be defined according to diffAngle = acos(cos(azDiff) * cos(elDiff)), wherein azDiff indicates a difference of two azimuth angles, wherein elDiff indicates a difference of two elevation angles, and wherein diffAngle indicates the weighted angular difference. According to an embodiment, the distance calculator may, e.g.. be configured to calculate the distances of the position to the speakers, so that each distance Λ .Ρ1,, Ρ2) 0f the positon to one of the speakers is calculated according to α-ι indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, β-ι indicates an elevation angle of the position, and β2 indicates an elevation angle of said one of the speakers. Or, cti indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position. β indicates an elevation angle of said one of the speakers, and β2 indicates an elevation angle of the position. In an embodiment, the distance calculator may, e.g., be configured to calculate the distances of the position to the speakers, so that each distance of the positon to one of the speakers is calculated according to Δ(/ , ; ) = | ?1 - /¾ ! + !«! - a2 \ +h - ri a indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, β indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, r-, indicates a radius of the position and r2 indicates a radius of said one of the speakers. Or «i indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, β indicates an elevation angle of said one of the speakers, β2 indicates an elevation angle of the position, r-, indicates a radius of said one of the speakers and r2 indicates a radius of the position. According to an embodiment, the distance calculator may, e.g. , be configured to calculate the distances of the position to the speakers, so that each distance ^ (Ρ^, Ρ.) 0f the positon to one of the speakers is calculated according to Δ(Ρ1, Ρ2) = b ■ |/¾ - β2 \ + a■ \ai - a2 o indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, β\ indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, a is a first number, and h is a second number. Or oA indicates an azimuth angle of said one of the speakers. a2 indicates an azimuth angle of the position, ?·, indicates an elevation angle of said one of the speakers, β2 indicates an elevation angle of the position, is a first number, and b is a second number. In an embodiment, the distance calculator may. e.g., be configured to calculate the distances of the position to the speakers, so that each distance M Pl s P2) of the positon to one of the speakers is calculated according to Δ(Ρ1, Ρ2) = b■ [ ?! - β2 \ + a■ \ax - a2 \ - c■ - r2 οΛ indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, β indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, νλ indicates a radius of the position, r2 indicates a radius of said one of the speakers, a is a first number, h is a second number, and c is a third number. Or, α indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, βΛ indicates an elevation angle of said one of the speakers, and /¾ indicates an elevation angle of the position, r-, indicates a radius of said one of the speakers, and r2 indicates a radius of the position, a is a first number, b is a second number, and c is a third number. In the following, embodiments of the present invention are described. The embodiments provide concepts for using a geometric distance definition for audio rendering. Object metadata can be used to define either: 1 ) where in space an object should be rendered, or 2) which loudspeaker should be used to play back the object. If the position of the object indicated in the metadata does not fall on a single speaker, the object renderer would create the output signal based by using multiple loudspeakers and defined panning rules. Panning is suboptimal in terms of localizing sounds or the sound color. Therefore, it may be desirable by the producer of object based content, to define that a certain sound should come from a single loudspeaker from a certain direction. It may happen that this loudspeaker does not exist in the users loudspeaker setup. Then a flag is set in the metadata that forces the sound to be played back by the nearest available loudspeaker without rendering. The invention describes how the closest loudspeaker can be found allowing for some weighting to account for a tolerable deviation from the desired object position. Fig. 2 illustrates an object renderer according to an embodiment. In object-based audio formats metadata are stored or transmitted along with object signals. The audio objects are rendered on the playback side using the metadata and information about the playback environment. Such information is e.g. the number of loudspeakers or the size of the screen. Table 1 - Example metadata: ObjectID Azimuth Dynamic Elevation OAM Gain Distance AllowOnOff AMowPositionlnteractivity AllowGainlnteractivity DefaultOnOff DefaultGain InteractivityMinGain Interactivity Interactivtiy axGain InteractivityMinAzOffset InteractivityMaxAzOffset InteractivityMinEIOffset InteractivityMaxEIOffset InteractivityMinDist InteractivityMaxDist IsSpeakerReiatedGroup SpeakerConfig3D AzimuthScreenRelated Playout ElevationScreenRelated ClosestSpeakerPlayout ContentKind Content ContentLanguage GroupID GroupDescription Group GroupNumMembers GroupMembers Priority SwitchGroupID SwitchGroupDescription Switch SwitchGroupDefault Group SwitchGroupNumMembers SwitchGroupMembers NumGroupsTotal Audio IsMainScene Scene NumGroupsPresent NumSwitchGroups For objects geometric metadata can be used to define how they should be rendered, e.g. angles in azimuth or elevation or absolute positions relative to a reference point, e.g. the listener. The renderer calculates loudspeaker signals on the basis of the geometric data and the available speakers and their position. If an audio-object (audio signal associated with a position in the 3D space, e.g. azimuth, elevation and distance given) should not be rendered to its associated position, but instead played back by a loudspeaker that exists in the local loudspeaker setup, one way would be to define the loudspeaker where the object should be played back by means of metadata. Nevertheless, there are cases where the producer does not want the object content to be played-back by a specific speaker, but rather by the next available speaker, i.e. the "geometrically nearest" speaker. This allows for a discrete playback without the necessity to define which speaker corresponds to which audio signal or to do rendering between multiple loudspeakers. Embodiments according to the present invention emerge from the above in the following manner. Metadata fields: object should be played back by geometrically nearest ClosestSpeakerPlayout speaker, no rendering (only for dynamic objects (IsSpeakerRelatedGroup == 0)) Table 2— Syntax of GroupDefinition(): Syntax No. of bits Mnemonic mdae_GroupDefinition( numGroups ) { grp = 0; grp < numGroups; grp++ ) { mdae grouplDfgrp]; 7 uimsbf mdae_groupPriority[grpj; uimsbf mdae_closestSpeakerPlayout[gi p]; bslbf } } mdae_closestSpeakerPlayout This flag defines that the members of the metadata element group should not be rendered but directly be played back by the speakers which are nearest to the geometric position of the members. The remapping is done in an object metadata processor that takes the local loudspeaker setup into account and performs a routing of the signals to the corresponding renderers with specific information by which loudspeaker or from which direction a sound should be rendered. Fig. 3 illustrates an object metadata processor according to an embodiment. A strategy for distance calculation is described as follows: if closest loudspeaker metadata flag is set, sound is played back over the closest speaker to this end, the distance to next speakers is calculated (or read from a pre-stored table) solution with smallest distance is taken distance function can be, for instance (but not limited to): weighted euclidian or great-arc distance weighted absolute differences in azimuth and elevation angle - weighted absolute differences to the power p (p=2 => Least Squares Solution) weighted angular difference, e.g. diffAngle = acos(cos(azDiff)*cos(elDiff)) Examples for closest speaker calculation are set out below. If the mdae_closestSpeakerPlayout flag of an audio element group is enabled, the members of the audio element group shall each be played back by the speaker that is nearest to the given position of the audio element. No rendering is applied. The distance of two positions and P2 in a spherical coordinate system is defined as the absolute difference of their azimuth angles a and elevation angles β . Δ(/> , R ) = | ¾ - β21 + - a21 + |/j - r2 This distance has to be calculated for all known positions to PN of the N output speakers with respect to the wanted position of the audio element PvmleJ . The nearest known loudspeaker position is the one. where the distance to the wanted position of the audio element gets minimal Pexl = min(A(Pwon,^ /> ), A(^ With this formula, it is possible to add weights to elevation, azimuth and/or radius. In that way it is possible to state that an azimuth deviation should be less tolerable than an elevation deviation by weighting the azimuth deviation by a high number: Δ(Ρ, , Ρ2) = b ■ l i - /?2 | + α · - a2 \ + c · |rx - r2 An example concerns a closest loudspeaker calculation for binaural rendering. If audio content should be played back as a binaural stereo signal over headphones or a stereo speaker setup, each channel of the audio content is traditionally mathematically combined with a binaural room impulse response or a head-related impulse response. The measuring position of this impulse response has to correspond to the direction from which the audio content of the associated channel should be perceived. In multi-channel audio systems or object-based audio there is the case that the number of definable positions (either by a speaker or by an object position) is larger than the number of available impulse responses. In that case, an appropriate impulse response has to be chosen if there is no dedicated one available for the channel position or the object position. To inflict only minimum positional changes in the perception, the chosen impulse response should be the "geometrically nearest" impulse response. It is in both cases needed to determine, which of the list of known positions (i.e. playback speakers or BRIRs) is the next to the wanted position (BRIR = Binaural Room Impulse Response). Therefore a "distance" between different positions has to be defined. The distance between different positions is here defined as the absolute difference of their azimuth and elevation angles. The following formula is used to calculate a distance of two positions Ρ^ Ρ2 in a coordinate system that is defined by elevation cr and azimuth β : Δ(Ρ1, Ρ2) = |/ ! - /i2 | + - a2 It is possible to add the radius r as a third variable: Δ(^, 2) = |/ί1 - ?:| + ι - ;| +|;Ι - Γ2| The nearest known position is the one, where the distance to the wanted position gets minimal P„gxt = min (a(Pwe^, P0,A(P^nt(,d, P2),. A(Pwe nf< Least Squares Solution) (Weighted) Pythagorean Theorem / Euclidean Distance The distance d for Cartesian coordinates may, e.g., be realized by employing the formula ci - V (*i - xz) z ÷ ( i + Oi - zzy- with x Ί , ζ being the x-, y- and z-coordinate values of a first position, with x2, y2, z2 being the X-, y- and z-coordinate values of a second position, and with d being the distance between the first and the second position. A distance measure d for polar coordinates may, e.g., be realized by employing the formula: d = a ' ( -!— 2)2 -f b · {βχ— β-2)2 - c ■ (ra - r2 ) - _ with a-,, β and ν being the polar coordinates of a first position, with a2, β2 and r2 being the polar coordinates of a second position, and with d being the distance between the first and the second position. The weighted angular difference may. e.g., be defined according to d iff Angle = acosicos^— a2)- cos^— β2)) Regarding the orthodromic distance, the Great-Arc Distance, or the Great-Circle Distance, the distance measured along the surface of a sphere (as opposed to a straight line through the sphere's interior). Square root operations and trigonometric functions may, e.g.. be employed. Coordinates may, e.g. , be transformed to latitude and longitude. Returning to the formula presented above: the formula can be seen as a modified Taxicab geometry using polar coordinates instead of Cartesian coordinates as in the original taxicab geometry definition A (Plt P2) = \ X, - x2 \ + \y - y2 \ . With this formula, it is possible to add weights to elevation, azimuth and/or radius. In that way it is possible to state that an azimuth deviation should be less tolerable than an elevation deviation by weighting the azimuth deviation by a high number: Δ(Ρ1 ( Ρ2) = b ■ \βτ - β2 \ + a · \ α - a2 \ + c ■ \ i - r2 | . As a further side remark, it should be noted, that in embodiments, the "rendered object audio" of Fig. 2 may, e.g.. be considered as "rendered object-based audio". In Fig. 2, the usacConfigExtention regarding static object metadata and the usacExtension are only used as examples of particular embodiments. Regarding Fig. 3. It should be noted that in some embodiments, the dynamic object metadata of Fig. 3 may, e.g., positional OAM (audio object metadata, positional data + gain). In some embodiments, the "route signals" may, e.g. , be conducted by routing signals to a format converter or to an object renderer. Although some aspects have been described in the context of an apparatus, it is clear that these aspects also represent a description of the corresponding method, where a block or device corresponds to a method step or a feature of a method step. Analogously, aspects described in the context of a method step also represent a description of a corresponding block or item or feature of a corresponding apparatus. The inventive decomposed signal can be stored on a digital storage medium or can be transmitted on a transmission medium such as a wireless transmission medium or a wired transmission medium such as the Internet. Depending on certain implementation requirements, embodiments of the invention can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a CD, a ROM. a PROM, an EPROM. an EEPROM or a FLASH memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed. Some embodiments according to the invention comprise a non-transitory data carrier having electronically readable control signals, which are capable of cooperating with a programmable computer system, such that one of the methods described herein is performed. Generally, embodiments of the present invention can be implemented as a computer program product with a program code, the program code being operative for performing one of the methods when the computer program product runs on a computer. The program code may for example be stored on a machine readable carrier. Other embodiments comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier. In other words, an embodiment of the inventive method is, therefore, a computer program having a program code for performing one of the methods described herein, when the computer program runs on a computer. A further embodiment of the inventive methods is, therefore, a data carrier (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein. A further embodiment of the inventive method is, therefore, a data stream or a sequence of signals representing the computer program for performing one of the methods described herein. The data stream or the sequence of signals may for example be configured to be transferred via a data communication connection, for example via the Internet. A further embodiment comprises a processing means, for example a computer, or a programmable logic device, configured to or adapted to perform one of the methods described herein. A further embodiment comprises a computer having installed thereon the computer program for performing one of the methods described herein. In some embodiments, a programmable logic device (for example a field programmable gate array) may be used to perform some or all of the functionalities of the methods described herein. In some embodiments, a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein. Generally, the methods are preferably performed by any hardware apparatus. The above described embodiments are merely illustrative for the principles of the present invention. It is understood that modifications and variations of the arrangements and the details described herein will be apparent to others skilled in the art. It is the intent. therefore, to be limited only by the scope of the impending patent claims and not by the specific details presented by way of description and explanation of the embodiments herein. Literature [1 ] "System and Method for Adaptive Audio Signal Generation, Coding and Rendering", Patent application number: US20140133683 A1 (Claim 48) [2] "Reflected sound rendering for object-based audio", Patent application number: WO2014036085 A1 (Chapter Playback Applications) [3] "Upmixing object based audio", Patent application number: US20140133682 A1 (BRIEF DESCRIPTION OF EXEMPLARY EMBODIMENTS + Claim 71 b)) [4] "Audio Definition Model", EBU-TECH 3364, https://tech.ebu.ch/docs/tech/tech3364.pdf [5] "System and Tools for Enhanced 3D Audio Authoring and Rendering", Patent application number: US201401 19581 A1 Claims An apparatus (100) for playing back an audio object associated with a position, comprising: a distance calculator (1 10) for calculating distances of the position to speakers or for reading the distances of the position to the speakers, wherein the distance calculator (1 10) is configured to take a solution with a smallest distance, and wherein the apparatus (100) is configured to play back the audio object using the speaker corresponding to the solution. An apparatus (100) according to claim 1 , wherein the distance calculator (1 10) is configured to calculate the distances of the position to the speakers or to read the distances of the position to the speakers only if a closest speaker playout flag (mdae_closestSpeakerPlayout), being received by the apparatus (100), is enabled, wherein the distance calculator (1 10) is configured to take a solution with a smallest distance only if the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled, and wherein the apparatus (100) is configured to play back the audio object using the speaker corresponding to the solution only of the closest speaker playout flag (mdae_closestSpeakerPlayout) is enabled. An apparatus (100) according to claim 2, wherein the apparatus (100) is configured to not conduct any rendering on the audio object, if the closest speaker playout flag (mdae closestSpeakerPlayout) is enabled. An apparatus (100) according to one of claims 1 to 3, wherein the distance calculator (1 10) is configured to calculate the distances depending on a distance function which returns a weighted Euclidian distance or a great-arc distance. An apparatus (100) according to one of claims 1 to 3, wherein the distance calculator (1 10) is configured to calculate the distances depending on a distance function which returns weighted absolute differences in azimuth and elevation angles. An apparatus (100) according to one of claims 1 to 3, wherein the distance calculator (1 10) is configured to calculate the distances depending on a distance function which returns weighted absolute differences to the power p. wherein p is a number. An apparatus (100) according to one of claims 1 to 3, wherein the distance calculator (1 10) is configured to calculate the distances depending on a distance function which returns a weighted angular difference. An apparatus (100) according to claim 7, wherein the distance function is defined according to diffAngle = acos(cos(azDiff) * cos(elDiff)), wherein azDiff indicates a difference of two azimuth angles. wherein elDiff indicates a difference of two elevation angles, and wherein diffAngle indicates the weighted angular difference. An apparatus (100) according to one of the preceding claims, wherein the distance calculator (1 10) is configured to calculate the distances of the position to the speakers, so that each distance of the positon to one of the speakers is calculated according to wherein indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, indicates an elevation angle of the position, and ¾ indicates an elevation angle of said one of the speakers, or wherein «1 indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, indicates an elevation angle of said one of the speakers, and β2 indicates an elevation angle of the position. 10. An apparatus (100) according to one of claims 1 to 8, wherein the distance calculator (1 10) is configured to calculate the distances of the position to the speakers, so that each distance Δίρ ρζ of the positon to one of the speakers is calculated according to wherein α indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, βΛ indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, r-, indicates a radius of the position and r2 indicates a radius of said one of the speakers, or wherein α indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, ?·, indicates an elevation angle of said one of the speakers, β2 indicates an elevation angle of the position, τ indicates a radius of said one of the speakers and r2 indicates a radius of the position. 1 1 . An apparatus (100) according to one of claims 1 to 8, wherein the distance calculator (1 10) is configured to calculate the distances of the position to the speakers, so that each distance of the positon to one of the speakers is calculated according to Δ(Λ , Ρ ) = b ■ \β, - β2 \ + α · \ αι - a2 wherein αλ indicates an azimuth angle of the position, a2 indicates an azimuth angle of said one of the speakers, ?·, indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, a is a first number, and b is a second number, or wherein oA indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, β indicates an elevation angle of said one of the speakers, β2 indicates an elevation angle of the position, a is a first number, and b is a second number. An apparatus (100) according to one of claims 1 to 8, wherein the distance calculator (1 10) is configured to calculate the distances of the position to the speakers, so that each distance A(PV PZ) of the positon to one of the speakers is calculated according to Δ(/ , Ρ2) = b■ |/¾ - β2 \ + a · - a2 \ + c · \r - r2 | wherein α indicates an azimuth angle of the position, <¾ indicates an azimuth angle of said one of the speakers, β indicates an elevation angle of the position, β2 indicates an elevation angle of said one of the speakers, indicates a radius of the position, r2 indicates a radius of said one of the speakers, a is a first number, b is a second number, and c is a third number, or wherein -ι indicates an azimuth angle of said one of the speakers, a2 indicates an azimuth angle of the position, ?■, indicates an elevation angle of said one of the speakers, and β2 indicates an elevation angle of the position, νΛ indicates a radius of said one of the speakers, and r2 indicates a radius of the position, a is a first number, b is a second number, and c is a third number. A decoder device comprising: a USAC decoder (910) for decoding a bitstream to obtain one or more audio input channels, to obtain one or more input audio objects, to obtain compressed object metadata and to obtain one or more SAOC transport channels, an SAOC decoder (915) for decoding the one or more SAOC transport channels to obtain a group of one or more rendered audio objects, an object metadata decoder (918), for decoding the compressed object metadata to obtain uncompressed metadata. a format converter (922) for converting the one or more audio input channels to obtain one or more converted channels, and a mixer (930) for mixing the one or more rendered audio objects of the group of one or more rendered audio objects, the one or more input audio objects and the one or more converted channels to obtain one or more decoded audio channels. wherein the object metadata decoder (918) and the mixer (930) together form an apparatus (100) according to one of the preceding claims. wherein the object metadata decoder (918) comprises the distance calculator (1 10) (1 10) of the apparatus (100) according to one of the preceding claims, wherein the distance calculator ( 10) (1 0) is configured, for each input audio object of the one or more input audio objects, to calculate distances of the position associated with said input audio object to speakers or for reading the distances of the position associated with said input audio object to the speakers, and to take a solution with a smallest distance, and wherein the mixer (930) is configured to output each input audio object of the one or more input audio objects within one of the one or more decoded audio channels to the speaker corresponding to the solution determined by the distance calculator (1 10) (1 10) of the apparatus (100) according to one of the preceding claims for said input audio object. 14. A method for playing back an audio object associated with a position, comprising: calculating distances of the position to speakers or reading the distances of the position to the speakers, taking a solution with a smallest distance, and playing back the audio object using the speaker corresponding to the solution. 15. A computer program for implementing the method of claim 14 when being executed on a computer or signal processor.

Documents

Application Documents

# Name Date
1 Form 5 [20-09-2016(online)].pdf 2016-09-20
2 Form 3 [20-09-2016(online)].pdf 2016-09-20
3 Form 20 [20-09-2016(online)].pdf 2016-09-20
4 Drawing [20-09-2016(online)].pdf 2016-09-20
5 Description(Complete) [20-09-2016(online)].pdf 2016-09-20
6 Form 18 [04-10-2016(online)].pdf 2016-10-04
7 Other Patent Document [03-11-2016(online)].pdf 2016-11-03
8 Other Patent Document [17-11-2016(online)].pdf 2016-11-17
9 Other Patent Document [07-12-2016(online)].pdf 2016-12-07
10 Other Patent Document [17-02-2017(online)].pdf 2017-02-17
11 Information under section 8(2) [15-06-2017(online)].pdf 2017-06-15
12 201637032003-Information under section 8(2) (MANDATORY) [10-08-2017(online)].pdf 2017-08-10
13 201637032003-Information under section 8(2) (MANDATORY) [23-10-2017(online)].pdf 2017-10-23
14 201637032003-Information under section 8(2) (MANDATORY) [19-12-2017(online)].pdf 2017-12-19
15 201637032003-Information under section 8(2) (MANDATORY) [12-02-2018(online)].pdf 2018-02-12
16 201637032003-Information under section 8(2) (MANDATORY) [14-03-2018(online)].pdf 2018-03-14
17 201637032003-Information under section 8(2) (MANDATORY) [06-07-2018(online)].pdf 2018-07-06
18 201637032003-Information under section 8(2) (MANDATORY) [18-08-2018(online)].pdf 2018-08-18
19 201637032003-Information under section 8(2) (MANDATORY) [31-10-2018(online)].pdf 2018-10-31
20 201637032003-Information under section 8(2) (MANDATORY) [22-02-2019(online)].pdf 2019-02-22
21 201637032003-Information under section 8(2) (MANDATORY) [19-07-2019(online)].pdf 2019-07-19
22 201637032003-Information under section 8(2) (MANDATORY) [19-08-2019(online)].pdf 2019-08-19
23 201637032003-FER.pdf 2019-09-12
24 201637032003-Information under section 8(2) (MANDATORY) [20-01-2020(online)].pdf 2020-01-20
25 201637032003-FORM 4(ii) [05-03-2020(online)].pdf 2020-03-05
26 201637032003-Information under section 8(2) [09-05-2020(online)].pdf 2020-05-09
27 201637032003-OTHERS [09-06-2020(online)].pdf 2020-06-09
28 201637032003-FER_SER_REPLY [09-06-2020(online)].pdf 2020-06-09
29 201637032003-DRAWING [09-06-2020(online)].pdf 2020-06-09
30 201637032003-CLAIMS [09-06-2020(online)].pdf 2020-06-09
31 201637032003-FORM 3 [03-07-2020(online)].pdf 2020-07-03
32 201637032003-Information under section 8(2) [25-09-2020(online)].pdf 2020-09-25
33 201637032003-Information under section 8(2) [12-01-2021(online)].pdf 2021-01-12
34 201637032003-Information under section 8(2) [14-01-2021(online)].pdf 2021-01-14
35 201637032003-Information under section 8(2) [19-03-2021(online)].pdf 2021-03-19
36 201637032003-Information under section 8(2) [21-07-2021(online)].pdf 2021-07-21
37 201637032003-Information under section 8(2) [30-10-2021(online)].pdf 2021-10-30
38 201637032003-FORM 3 [18-01-2022(online)].pdf 2022-01-18
39 201637032003-Information under section 8(2) [26-05-2022(online)].pdf 2022-05-26
40 201637032003-Information under section 8(2) [20-07-2022(online)].pdf 2022-07-20
41 201637032003-FORM 3 [20-07-2022(online)].pdf 2022-07-20
42 201637032003-FORM 3 [17-01-2023(online)].pdf 2023-01-17
43 201637032003-FORM 3 [07-07-2023(online)].pdf 2023-07-07
44 201637032003-Information under section 8(2) [17-10-2023(online)].pdf 2023-10-17
45 201637032003-PatentCertificate06-11-2023.pdf 2023-11-06
46 201637032003-IntimationOfGrant06-11-2023.pdf 2023-11-06

Search Strategy

1 search_11-09-2019.pdf

ERegister / Renewals

3rd: 25 Nov 2023

From 04/03/2017 - To 04/03/2018

4th: 25 Nov 2023

From 04/03/2018 - To 04/03/2019

5th: 25 Nov 2023

From 04/03/2019 - To 04/03/2020

6th: 25 Nov 2023

From 04/03/2020 - To 04/03/2021

7th: 25 Nov 2023

From 04/03/2021 - To 04/03/2022

8th: 25 Nov 2023

From 04/03/2022 - To 04/03/2023

9th: 25 Nov 2023

From 04/03/2023 - To 04/03/2024

10th: 25 Nov 2023

From 04/03/2024 - To 04/03/2025

11th: 28 Feb 2025

From 04/03/2025 - To 04/03/2026