Sign In to Follow Application
View All Documents & Correspondence

Spatially Unequal Streaming

Abstract: Various concepts for media content streaming are described. Some allow for streaming spatial scene content in a spatially unequal manner so that the visible quality for the user is increased, or the processing complexity or necessary bandwidth at the streaming retrieval site is decreased. Other allow for streaming spatial scene content in a manner enlarging the applicability to further application scenarios

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
16 February 2022
Publication Number
36/2023
Publication Type
INA
Invention Field
COMMUNICATION
Status
Email
Parent Application

Applicants

FRAUNHOFER GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
Hansastrasse 27c, 80686 M?nchen, GERMANY, a German Company

Inventors

1. SKUPIN, Robert
Naugarder straRe 42, 70409, Berlin Germany.
2. HELLGE, Cornelius
Erich-Weinert-StraRe 5, 10439, Berlin Germany
3. SCHIERL, Thomas
Boris-Pasternak-Weg 7b, 13156, Berlin, Germany.
4. SANCHEZ DE LA FUENTE, Yago
Warschauer Strasse 67, 10243 Berlin, Germany.
5. WIEGAND, Thomas
c/o Fraunhofer-Institut für Nachrichtentechnik, Heinrich-Hertz-Institut, HHI, Einsteinufer 37, 10587 Berlin, Germany
6. PODBORSKI, Dimitri
Christstrasse 33, 14059, Berlin, Germany.

Specification

Spatially Unequal Streaming

Description

The present application is concerned with spatially unequal streaming such as occurring in virtual reality (VR) streaming.

VR streaming typically involves transmission of a very high-resolution video. The resolving capacity of the human fovea is around 60 pixels per degree. If transmission of the full sphere with 360° x 180° is considered, one would end up by sending a resolution of around 22k x 11k pixels. Since, sending such high resolution would lead to tremendously high bandwidth requirements, another solution is to send only the viewport shown at the Head Mounted Displays (HMDs), which have FoV of 90° x 90°: leading to around a 6k x 6k pixels video. A trade-off between sending the whole video at the highest resolution and sending only the viewport is to send the viewport at high resolution and some neighboring data (or the rest of the spherical video) at lower resolution or lower quality.

In a DASH scenario, an omni-directional video (aka spherical video) can be offered in such a way that the mixed resolution or mixed quality video described before is controlled by the DASH client. The DASH client only needs to know information that describes how the content is offered.

One example could be to offer different representations with different projections that have asymmetric characteristics, such as different quality and distortion for different parts of the video. Each representation would correspond to a given viewport and would have the viewport encoded with a higher quality/resolution than the rest of the content. Knowing the orientation information (direction of the viewport for which the content has been encoded with a higher quality/resolution) the DASH client can chose one or another representation dynamically to match the viewing direction of the user at any time.

A more flexible option for a DASH client to select such asymmetric characteristic for the omni-directional video would be when the video is split into several spatial regions, with each region being available at different resolution or quality. One option could be to split it into rectangular regions (aka tiles) based on a grid, but other options could be foreseen. In such a case, the DASH client would need some signaling about the different qualities into which the different regions are offered and it could download the different regions at different qualities so that the viewport shown to the user is at a better quality than the other non-shown content.

In any of the previous cases, when user interaction happens and the viewport is changed, the DASH client will need some time to react to user movement and download the content in such a way that matches the new viewport. During the time between the user moves and the DASH client adapts its requests to match the new viewport, the user will see in the viewport some regions in high quality and low quality simultaneously. Though the acceptable quality/resolution difference is content dependent, the quality the user sees is in any case degraded.

Thus, it would be favorable to have a concept at hand which alleviates, or renders more efficient, or even increases the visible quality for the user with respect to partial presentation of spatial scene content streamed by adaptive streaming.

Thus, the object of the present invention to provide concepts for streaming spatial scene content in a spatially unequal manner so that the visible quality for the user is increased, or the processing complexity or necessary bandwidth at the streaming retrieval site is decreased, or to provide concepts for streaming spatial scene content in a manner enlarging the applicability to further application scenarios.

This object is achieved by the subject matter of the pending independent claims.

A first aspect of the present application is based on the finding that streaming media content pertaining to a temporally-varying spatial scene such as a video in a spatially unequal manner may be improved in terms of visible quality at comparable bandwidth consumption and/or computational complexity at a streaming reception site if the media segments selected and retrieved and/or a signalization obtained from the server, provides the retrieving device with hints on a predetermined relationship to be complied with by qualities at which different portions of the temporally-varying spatial scene are encoded into the selected and retrieved media segments. Otherwise, the retrieving device may not know beforehand as to which negative impact the juxtaposition of portions encoded at different quality into the selected and retrieved media segment may have on the overall visible quality experienced by the user. Information contained in the media segments and/or a signalization obtained from the server such as, for instance, within a manifest file (media presentation description) or additional streaming related control messages from server to client such as SAND messages, enable the retrieving device to appropriately select among the media segments offered at the server. In this manner, virtual reality streaming or partial streaming of video content may be made more robust against quality degradation as it could otherwise occur owing to an inadequate distribution of the available bandwidth on to this spatial section of the temporally-varying spatial scene presented to the user.

A further aspect of the present invention is based on the finding that streaming of media content pertaining to a temporally-varying spatial scene such as a video in a spatially unequal manner such as using a first quality at a first portion and a second, lower quality at a second portion or with leaving a second portion being non-streamed, may be improved in visible quality and/or may be made less complex in terms of bandwidth consumption and/or computational complexity at the streaming retrieval side, by determining a size and/or position of the first portion depending on information contained in the media segments and/or a signalization obtained from the server. Imagine, for instance, the temporally-varying spatial scene would be offered at the server at a tile-based manner for tile-based streaming, i.e. the media segments would represent spectral temporal portions of the temporally-varying spatial scene each of which would be a temporal segment of the spatial scene within a corresponding tile of a distribution of tiles into which the spatial scene is sub-divided. In such a case, it is up to the retrieving device (client) to decide as to how to distribute the available bandwidth and/or computational power over the spatial scene, namely, at the granularity of tiles. The retrieving device would perform the selection of the media segments to the extent that a first portion of the spatial scene which follows respectively tracks a temporally-varying view section of the spatial scene, is encoded into the selected and retrieved media segments in a predetermined quality which may, for instance, be the highest quality feasible at the current bandwidth and/or computational power conditions. A spatially neighboring second portion of the spatial scene may, for instance, not be encoded into the selected and retrieved media segments, or may be encoded there into at a further quality, reduced relative to the predetermined quality. In such a situation, it is a computationally complex matter, or even not feasible, to compute a number/count of neighboring tiles, the aggregation of which completely covers the temporally-varying view section irrespective of the view section's orientation. Depending on the projection chosen so as to map the spatial scene onto the individual tiles, the angular scene coverage per tile may vary over this scene and the fact that the individual tiles may mutually overlap, even renders a

computation of a count of neighboring tiles sufficient to cover the view section in spatial terms, irrespective of the view section's orientation, more difficult. Accordingly, in such a situation, the aforementioned information could indicate the size of the first portion as a count N of tiles or a number of tiles, respectively. By this measure, the device would be able to track the temporally-varying view section by selecting those media segments having the co-located aggregation of N tiles encoded there at the predetermined quality. The fact that the aggregation of these N tiles sufficiently covers the view section may be guaranteed by way of the information indicating N. Another example would be information contained in the media segments and/or a signalization obtained from the server, which is indicative of the size of the first portion relative to a size of the view section itself. For example, this information could somehow set a "safety zone" or prefetch zone around the actual view section in order to account for movements of the temporally-varying view section. The larger the speed at which the temporally-varying view section moves across the spatial scene, the larger the safety zone should be. Accordingly, the aforementioned information could be indicative of the size of the first portion in a manner relative to a size of the temporally-varying view section such as in an incremental or scaling manner. A retrieving device setting the size of the first portion according to such information would be able to avoid quality degradation which may otherwise occur owing to non-retrieved or low-quality portions of the spatial scene being visible in the view section. Here, it is irrelevant whether this scene is offered in a tile-based manner or in some other manner.

Related to the just-mentioned aspect of the present application, a video bit stream having a video encoded there into, may be made decodable at an increased quality if the video bit stream is provided with a signalization of a size of a focus area within the video onto which a decoding power for decoding the video should be focused. By this measure, a decoder which decodes the video from the bit stream, could focus, or even restrict, its decoding power onto the decoding of the video onto a portion having the size of the focus area signalized in the video bit stream thereby knowing, for instance, that the thus-decoded portion is decodable by the available decoding power, and spatially covers a wanted section of the video. For instance, the size of the focus area thus signalized could be selected to be large enough in order to cover the size of the view section and a movement of this view section taking the decoding latency in decoding the video into account. Or, put differently, a signalization of a recommended preferred view-section area of the video contained in the video bitstream could allow the decoder to treat this area in a preferred manner, thereby allowing the decoder to focus its decoding power accordingly. Irrespective of performing area-specific decoding power focusing, the area signalization

may be forwarded to a stage selecting on which media segments to download, i.e. where to place and how to dimension the portion of increased quality.

The first and second aspects of the present application are closely related to a third aspect of the present application according to which the fact that a vast number of retrieving devices stream media content from a server, is exploited, so as to gain information which may subsequently be used in order to appropriately set the aforementioned types of information allowing to set the size, or size and/or position, of the first portion and/or appropriately set the predetermined relationship between the first and second quality. Thus, in accordance with this aspect of the present application, the retrieving device (client) sends-out log messages logging one of a momentaneous measurement or a statistical value measuring a spatial position and/or movement of the first portion, a momentaneous measurement or a statistical value measuring a quality of the temporally-varying spatial scene as far as is encoded into the selected media segments and as far as is visible in a view section, and a momentaneous measurement or statistical value measuring the quality of the first portion or a quality of the temporally-varying spatial scene as far as is encoded into the selected media segments and as far as is visible in a view section. Momentaneous measurements and/or statistical values may be provided with time information concerning the time the respective momentaneous measurement or statistical value has been obtained. The log messages may be sent to the server where the media segments are offered, or to some other device evaluating the inbound log messages so as to update, based thereon, current settings of the aforementioned information used to set the size, or size and/or position, of the first portion and/or derive the predetermined relationship based thereon.

In accordance with a further aspect of the present application, streaming media content pertaining to a temporally-varying spatial scene such as a video, in particular in a tile-based manner, is made more effective in terms of avoidance of unavailing streaming trials by providing a media presentation description which comprises at least one version at which the temporally-varying spatial scene is offered for tile-based streaming, with an indication of benefitting requirements for benefitting from the tile-based streaming the respective version of the temporally-varying spatial scene for each of the at least one version. By this measure, the retrieving device is able to match the benefitting requirements of the at least one version with a device capability of the retrieving device itself or of another device interacting with the retrieving device with respect to tile-based streaming. For instance, the benefitting requirements could relate to decoding capability requirements. That is, if the decoding power for decoding the streamed/retrieved media content would not suffice to decode all media segments needed to cover a view section of the temporally-varying spatial scene, then trying to stream and present the media content would be a waste of time, bandwidth and computational power and accordingly, it would be more effective to not try it in any case. The decoding capability requirements could, for instance, indicate a number of decoder instantiations necessitated for a respective version if, for instance, the media segments relating to a certain tile form a media stream such as a video stream, separate from media segments pertaining to another tile. The decoding capability requirement could, for instance, also pertain to further information such as a certain fraction of decoder instantiations needed to fit to a predetermined decoding profile and/or level, or could indicate a certain minimum capability of a user input device to move in a sufficiently fast manner a viewport/section via which the user sees the scene. Depending on the scene content, a low movement capability may not suffice for the user to look onto the interesting portions of the scene.

A further aspect of the present invention pertains to an extension of streaming of media content pertaining to temporally-varying spatial scenes. In particular, the idea in accordance with this aspect is that a spatial scene may in fact not only vary temporally but also in terms of at least one further parameter suggest, for instance, views and a position, view depth or some other physical parameter. The retrieving device may use adaptive streaming in this context by, depending on a viewport direction and the at least one further parameter, computing addresses of media segments, the media segments describing a spatial scene varying in time and the at least one parameter, and retrieving the media segments using the computed addresses from a server.

The above-outlined aspects of the present application and their advantageous implementations which are the subject of the dependent claims, may be combined individually or all together.

Preferred embodiments of the present application are set forth below with respect to the figures among which

Fig. 1 shows a schematic diagram illustrating a system of client and server for virtual reality applications as an example as to where the embodiments set forth in the following figures may advantageously be used;

Fig. 2 shows a block diagram of a client device along with a schematic illustration of the media segment selection process in order to describe a possible mode of operation of the client device in accordance with an embodiment of the present application where the server 10 provides the device with information on acceptable or endurable quality variations within the media content presented to the user;

Fig. 3 shows a modification of Fig. 2, the portion of increase quality does not concern the portion tracking the view section of viewport, but a region of interest of the media scene content as signaled from server to client;

shows a block diagram of the client device along with a schematic illustration of the media segment selection process in accordance with an embodiment where the server provides information on how to set a size, or size and/or position, of the portion of increased quality or the size, or size and/or position, of the actually retrieved section of the media scene;

shows a variant of Fig. 5 in that information sent by the server directly indicates the size of portion 64, rather than scaling it depending on expected movements of the viewport;

shows a variant of Fig. 4 according to which the retrieved section has the predetermined quality and its size is determined by the information stemming from the server;

Fig. 7a to 7c show schematic diagrams illustrating the manner in which the information 74 according to Figs. 4 and 6 increases the size of the portion retrieved at the predetermined quality via a corresponding enlargement of the size of the viewport;

Fig. 8a shows a schematic diagram illustrating an embodiment where client device sends log messages to server or a certain evaluator for evaluating these log messages so as to derive thereof appropriate settings, for instance, for the types of information discussed with respect to Figs. 2 to 7c;

Fig. 8b shows a schematic diagram of a tile-based cubic projection of a 360 scene onto the tiles and an example of how some of the tiles are covered by an exemplary position of a viewport. The small circles indicate positions in the viewport equiangularly distributed, and hatched tiles are encoded at higher resolution in the downloaded segments than tiles without hatching;

Fig. 8c and d show a schematic diagram of a diagram showing along a temporal axis

(horizontal) as to how a buffer fullness (vertical axis) of different buffers of the client might develop, wherein Fig. 8c assumes the buffers to be used to buffer representations coding specific tiles, while Fig. 8d assumes the buffers to be used to buffer omnidirectional representations having the scene encoded thereinto at uneven quality, namely increased toward some direction specific for the respective buffer;

Fig. 8 e and f show a three-dimensional diagram of different pixel density measurements within the viewport 28, differing in terms of uniformity in spherical or viewplane sense;

Fig. 9 shows a block diagram of client device and a schematic illustration of the media segment selection process when the device inspects information stemming from the server in order to assess whether a certain version at which a tile-based streaming is offered by the server, is acceptable for the client device or not;

Fig. 10 shows a schematic diagram illustrating the plurality of media segments offered by a server in accordance with an embodiment allowing for a dependency of the media scene not only in time, but also in another non-temporal parameter, namely here, exemplarily, scene center position;

Fig. 1 1 shows a schematic diagram illustrating a video bit stream comprising information steering or controlling a size of a focus area within the video encoded into the bit stream along with an example for a video decoder able to take advantage of this information.

In order to ease the understanding of the description of embodiments of the present application with respect to the various aspects of the present application, Fig. 1 shows an example for an environment where the subsequently described embodiments of the

present application may be applied and advantageously used. In particular, Fig. 1 shows a system composed of client 10 and server 20 interacting via adaptive streaming. For instance, dynamic adaptive streaming over HTTP (DASH) may be used for the communication 22 between client 10 and server 20. However, the subsequently outlined embodiments should not be interpreted as being restricted to the usage of DASH and likewise, terms such as media presentation description (MPD) should be understand as being broad so as to also cover manifest files defined differently than in DASH.

Fig. 1 illustrates a system configured to implement a virtual reality application. That is, the system is configured to present to a user wearing a head up display 24, namely via an internal display 26 of head up display 24, a view section 28 out of a temporally-varying spatial scene 30 which section 28 corresponds to an orientation of the head up display 24 exemplarily measured by an internal orientation sensor 32 such as an inertial sensor of head up display 24. That is, the section 28 presented to the user forms a section of the spatial scene 30 the spatial position of which corresponds to the orientation of head up display 24. In case of Fig.1 , the temporally-varying spatial scene 30 is depicted as an omni-directional video or spherical video, but the description of Fig. 1 and the subsequently explained embodiments are readily transferrable to other examples as well, such as presenting a section out of a video with a spatial position of section 28 being determined by an intersection of a facial access or eye access with a virtual or real projector wall or the like. Further, sensor 32 and display 26 may, for instance, be comprised by different devices such as remote control and corresponding television, respectively, or they may be part of a hand-held device such as a mobile device such as a tablet or a mobile phone. Finally, it should be noted that some of the embodiments described later on, may also be applied to scenarios where the area 28 presented to the user constantly covers the whole temporally-varying spatial scene 30 with the unevenness in presenting the temporally-varying spatial scene relating, for instance, to an unequal distribution of quality over the spatial scene.

Further details with respect to server 20, client 10 and the way the spatial content 30 is offered at server 20 is illustrated in Fig. 1 and described in the following. These details should, however, also not be treated as limiting the subsequently explained embodiments, but should rather serve as an example of how to implement any of the subsequently explained embodiments.

ln particular, as shown in Fig. 1 , server 20 may comprise a storage 34 and a controller 36 such as an appropriately programmed computer, an application-specific integrated circuit or the like. The storage 34 has media segments stored thereon which represent the temporally-varying spatial scene 30. A specific example will be outlined in more detail below with respect to the illustration of Fig. 1. Controller 36 answers requests sent by client 10 by re-sending to client 10 requested media segments, a media presentation description and may send to client 10 further information on its own. Details in this regard are also set out below. Controller 36 may fetch requested media segments from storage 34. Within this storage, also other information may be stored such as the media presentation description or parts thereof, in the other signals sent from server 20 to client 10.

As shown in Fig.1 , server 20 may optionally in addition comprise a stream modifier 38 modifying the media segments sent from server 20 to client 10 responsive to the requests from the latter, so as to result at client 10 in a media data stream forming one single media stream decodable by one associated decoder although, for instance, the media segments retrieved by client 10 in this manner are actually aggregated from several media streams. However, the existence of such a stream modifier 38 is optional.

Client 10 of Fig. 1 is exemplarily depicted as comprising a client device or controller 40 or more decoders 42 and a reprojector 44. Client device 40 may be an appropriately programmed computer, a microprocessor, a programmed hardware device such as an FPGA or an application specific integrated circuit or the like. Client device 40 assumes responsibility for selecting segments to be retrieved from server 20 out of the plurality 46 of media segments offered at server 20. To this end, client device 40 retrieves a manifest or media presentation description from server 20 first. From the same, client device 40 obtains a computational rule for computing addresses of media segments out of plurality 46 which correspond to certain, needed spatial portions of the spatial scene 30. The media segments thus selected are retrieved by client device 40 from server 20 by sending respective requests to server 20. These requests contain computed addresses.

The media segments thus retrieved by client device 40 are forwarded by the latter to the one or more decoders 42 for decoding. In the example of Fig. 1 , the media segments thus retrieved and decoded represent, for each temporal time unit, merely a spatial section 48 out of the temporally-varying spatial scene 30, but as already indicated above, this may be different in accordance with other aspects, where, for instance, the view section 28 to be presented constantly covers the whole scene. Reprojector 44 may optionally re-project and cut-out the view section 28 to be displayed to the user out of the retrieved and decoded scene content of the selected, retrieved and decoded media segments. To this end, as shown in Fig. 1 , client device 40 may, for instance, continuously track and update a spatial position of view section 28 responsive to the user orientation data from sensor 32 and inform reprojector 44, for instance, on this current spatial position of scene section 28 as well as the reprojection mapping to be applied onto the retrieved and decoded media content so as to be mapped onto the area forming view section 28. Reprojector 44 may, accordingly, apply a mapping and an interpolation onto a regular grid of pixels, for instance, to be displayed on display 26.

Fig. 1 illustrates the case where a cubic mapping has been used to map the spatial scene 30 onto tiles 50. The tiles are, thus, depicted as rectangular sub-regions of a cube onto which scene 30 having the form of a sphere has been projected. Reprojector 44 reverses this projection. However, other examples may be applied as well. For instance, instead of a cubic projection, a projection onto a truncated pyramid or a pyramid without truncation may be used. Further, although the tiles of Fig. 1 are depicted as being non-overlapping in terms of coverage of the spatial scene 30, the subdivision into tiles may involve a mutual tile-overlapping. And as will be outlined in more detail below, the subdivision of scene 30 into tiles 50 spatially with each tile forming one representation as explained further below, is also not mandatory.

Thus, as depicted in Fig. 1 , the whole spatial scene 30 is spatially subdivided into tiles 50. In the example of Fig. 1 , each of the six faces of the cube is subdivided into 4 tiles. For illustration purposes, the tiles are enumerated. For each tile 50, server 20 offers a video 52 as depicted in Fig. 1. To be more precise, server 20 even offers more than one video 52 per tile 50, these videos differing in quality Q#. Even further, the videos 52 are temporally subdivided into temporal segments 54. The temporal segments 54 of all videos 52 of all tiles T# form, or are encoded into, respectively, one of the media segments of the plurality 46 of media segments stored in storage 34 of server 20.

It is again emphasized that even the example of a tile-based streaming illustrated in Fig. 1 merely forms an example from which many deviations are possible. For instance, although Fig. 1 seems to suggest that the media segments pertaining to a representation of the scene 30 at a higher quality relate to tiles coinciding to tiles to which media segments belong which have the scene 30 encoded thereinto at quality Q1 this

coincidence is not necessary and the tiles of different qualities may even correspond to tiles of a different projection of scene 30. Moreover, although not discussed so far, it may be that the media segments corresponding to different quality levels depicted in Fig. 1 differ in spatial resolution and/or signal to noise ratio and/or temporal resolution or the like.

Claims

1. Device for streaming media content pertaining a temporally varying spatial scene (30), configured to

select (56) media segments out of a plurality (46) of media segments (58) being available on a server (20),

(60) the selected media segments from the server (20), wherein the device is configured to

perform the selection so that the selected media segments have at least a spatial section (62) of the temporally varying spatial scene (30) encoded thereinto in manner according to which a first portion (64) of the spatial section is encoded into the selected media segments at a

predetermined quality, and according to which a second portion (66) of the temporally varying spatial scene, which spatially neighbors the first portion (64), is encoded into the selected media segments at a further quality fulfilling a predetermined relationship with respect to the predetermined quality, and

derive the predetermined relationship from information (68) contained in the selected media segments and/or a signalization obtained from the server (20).

2. Device according to claim 1, wherein each media segment of the plurality (46) of media segments has encoded thereinto an associated spatiotemporal portion of the temporally varying spatial scene (30) at an associated one of a set of quality levels.

3. Device according to claim 2, wherein each of the spatiotemporal portions of the temporally varying spatial scene (30) encoded into the plurality (46) of media segments are temporal segments (54) of the temporally varying spatial scene (30) at a respective one of tiles (50) into which the temporally varying spatial scene (30) is spatially subdivided.

4. Device according to any of claims 1 to 3, wherein the Information (64) indicates an endurable value for a measure of a difference between the further quality and the predetermined quality.

5. Device according to claim 4, wherein the information (68) indicates the endurable value for the measure of a difference between the further quality and the predetermined quality in a manner depending on a distance to the view section (28).

6. Device according to claim 4 or 5, wherein the information indicates the endurable value for the measure of a difference between the further quality and the predetermined quality in a manner depending on a distance to the view section (28) by way of a list of pairs of a respective distance to the view section (28) and a corresponding endurable value for the measure of the difference beyond the respective distance.

7. Device according to any of claims 4 to 6, wherein the information (64) indicates the endurable value for the measure of a difference between the further quality and the predetermined quality so that the endurable value increases with increasing distance to the view section.

8. Device according to any of claims 4 to 7, wherein the information (64) indicates the endurable value for the measure of the difference between the further quality and the predetermined quality along with an indication of a maximally allowed time interval for which the second portion may be within the view section along with the first portion.

9. Device according to claim 8, wherein the information (64) indicates a further endurable value for the measure of the difference between the further quality and the predetermined quality along with an indication of a further maximally allowed time interval for which the second portion may be within the view section along with the first portion.

10. Device according to any of claims 4 to 9, wherein the information (64) is time-varying and/or spatially varying.

11. Device according to any of claims 1 to 10, wherein the Information (64) indicates allowed pairs of concurrent settings for the further quality and the predetermined quality.

12. Device according to any of claims 1 to 11, wherein the device is configured to perform the selection so that the first portion (64) follows a temporally varying view section (28) of the temporally varying spatial scene (30).

13. Device according to claim 12, wherein the device is configured so that a spatial location of the temporally varying view section (28) changes according to user input.

14. Device according to any of claims 1 to 13, wherein the device is configured to determ first portion (64) so as to correspond to a region of interest.

15. Device according to claim 14, wherein the device is configured to retrieve information ROI from the server.

16. Streaming server for media content pertaining a temporally varying spatial scene, configured to

render available a plurality of media segments for retrieval by a device, thereby enabling the device to select media segments for retrieval which have at least a spatial section of the temporally varying spatial scene encoded thereinto in manner according to which a first portion of the spatial section is encoded into the selected media segments at a predetermined quality, and according to which a second portion of the temporally varying spatial scene, which spatially neighbors the first portion, is encoded into the selected media segments at a further quality, and signal information on a predetermined relationship in the media segments and/or by way of a signalization to the device, the predetermined relationship being to be fulfilled by the further quality with respect to the predetermined quality.

17. Streaming server according to claim 16, wherein each media segment of the plurality (46) of media segments has encoded thereinto an associated spatiotemporal portion of the temporally varying spatial scene (30) at an associated one of a set of quality levels.

18. Streaming server according to claim 17, wherein each of the spatiotemporal portions of the temporally varying spatial scene (30) encoded into the plurality (46) of media segments are temporal segments (54) of the temporally varying spatial scene (30) at a respective one of tiles (50) into which the temporally varying spatial scene (30) is spatially subdivided.

19. Streaming server according to any of claims 16 to 18, wherein the Information (64) indicates an endurable value for a measure of a difference between the further quality and the predetermined quality.

20. Streaming server according to claim 19, wherein the information (68) indicates the endurable value for the measure of a difference between the further quality and the predetermined quality in a manner depending on a distance to the view section (28).

21. Streaming server according to claim 19 or 20, wherein the information indicates the endurable value for the measure of a difference between the further quality and the predetermined quality in a manner depending on a distance to the view section (28) by way of a list of pairs of a respective distance to the view section (28) and a corresponding endurable value for the measure of the difference beyond the respective distance.

22. Streaming server according to any of claims 19 to 21, wherein the information (64) indicates the endurable value for the measure of a difference between the further quality and the predetermined quality so that the endurable value increases with increasing distance to the view section.

23. Streaming server according to any of claims 19 to 22, wherein the information (64) indicates the endurable value for the measure of the difference between the further quality and the predetermined quality along with an indication of a maximally allowed time interval for which the second portion may be within the view section along with the first portion.

24. Streaming server according to claim23, wherein the information (64) indicates a further endurable value for the measure of the difference between the further quality and the predetermined quality along with an indication of a further maximally allowed time interval for which the second portion may be within the view section along with the first portion.

25. Streaming server according to any of claims 19 to 24, wherein the information (64) is time-varying and/or spatially varying.

26. Streaming server according to any of claims 19 to 25, wherein the Information (64) indicates allowed pairs of concurrent settings for the further quality and the predetermined quality.

27. Streaming server according to any of claims 19 to 6, wherein the device is configured to send information on an ROI to the device.

28. Media presentation description comprising

Information on computing addresses of a plurality of media segments, so that a device, using the information, may select and retrieve media segments out of the plurality of media segments

which have at least a spatial section of the temporally varying spatial scene encoded thereinto in manner according to which a first portion of the spatial section is encoded into the selected media segments at a predetermined quality, and according to which a second portion of the temporally varying spatial scene, which spatially neighbors the first portion, is encoded into the selected media segments at a further quality, and

information (64) on a predetermined relationship to be fulfilled by the further quality with respect to the predetermined quality.

29. Media presentation description according to claim 28, wherein each media segment of the plurality (46) of media segments has encoded thereinto an associated spatiotemporal portion of the temporally varying spatial scene (30) at an associated one of a set of quality levels.

30. Media presentation description according to claim 29, wherein each of the spatiotemporal portions of the temporally varying spatial scene (30) encoded into the plurality (46) of media segments are temporal segments (54) of the temporally varying spatial scene (30) at a respective one of tiles (50) into which the temporally varying spatial scene (30) is spatially subdivided.

31. Media presentation description according to any of claims 28 to 30, wherein the Informatior (64) indicates an endurable value for a measure of a difference between the further quality and the predetermined quality.

32. Media presentation description according to claim 31, wherein the information (68) indicates the endurable value for the measure of a difference between the further quality and the predetermined quality in a manner depending on a distance to the view section (28).

33. Media presentation description according to claim 31 or 32, wherein the information indicates the endurable value for the measure of a difference between the further quality and the predetermined quality in a manner depending on a distance to the view section (28) by way of a list of pairs of a respective distance to the view section (28) and a corresponding endurable value for the measure of the difference beyond the respective distance.

34. Media presentation description according to any of claims 31 to 33, wherein the information (64) indicates the endurable value for the measure of a difference between the further quality and the predetermined quality so that the endurable value increases with increasing distance to the view section.

35. Media presentation description according to any of claims 31 to 34, wherein the information (64) indicates the endurable value for the measure of the difference between the further quality and the predetermined quality along with an indication of a maximally allowed time interval for which the second portion may be within the view section along with the first portion.

36. Media presentation description according to claim 35, wherein the information (64) indicates a further endurable value for the measure of the difference between the further quality and the predetermined quality along with an indication of a further maximally allowed time interval for which the second portion may be within the view section along with the first portion.

37. Media presentation description according to any of claims 31 to 36, wherein the information (64) is time-varying and/or spatially varying.

38. Media presentation description according to any of claims 31 to 37, wherein the Information (64) indicates allowed pairs of concurrent settings for the further quality and the predetermined quality.

39. Media presentation description according to any of claims 31 to 38, wherein the media presentation description comprises information on an ROI.

40. Device for streaming media content pertaining a temporally varying spatial scene (30), configured to

select media segments out of a plurality (46) of media segments (58) available on a server (20), (20), wherein the device is configured to

perform the selection so that the selected media segments have at least a spatial section (62) of the temporally varying spatial scene (30) encoded thereinto in a manner

according to which a first portion (64) of the spatial section (62) is encoded into the selected media segments at a predetermined quality, and according to which a second portion (66; 72) of the temporally varying spatial scene, which spatially neighbors the first portion (64), is not encoded into the selected media segments or encoded into the selected media segments at a further quality reduced relative to the predetermined quality, and

so that the first portion (64) follows a temporally varying view section (28) of the temporally varying spatial scene (30), and

set a size and/or position of the first portion (64) depending on information (74) contained in the selected media segments and/or a signalization obtained from the server.

41. Device according to claim 40, wherein the information (74) is indicative of the size in form of an increment relative to, or a scaling of, a size of the temporally varying view section.

42. Device according to claim 40 or 41, wherein the information (74) indicates a predetermined value for a measure of a spatial speed of the view section.

43. Device according to claim 42, wherein the information (74) indicates the predetermined value for the measure of the spatial speed of the view section

for a default percentile of users for which a measured spatial speed does not exceed the predetermined value, and/or

along with a percentile value indicting a percentile of users for which a measured spatial speed does not exceed the predetermined value, and/or

along with a percentile value indicting a percentile of users for which a the view section is in a predetermined area, and/or

along with a hint indicating one or more types of user inputs controlling a movement of the view section for which the predetermined value is applicable.

44. Device according to claim 42 or 43, configured to perform the setting so that

wherein the higher the predetermined value for the measure of the spatial speed of the section d is the larger the size is.

45. Device according to any of claims 40 or 44, wherein the information (74) indicates a predetermined value for a measure of a probability for a direction of movement of the section.

46. Device according to claim 45, configured to perform the setting so that

the higher the probability for a respective direction of movement is the more the first portion extends into the respective direction.

47. Device according to any of claims 40 or 46, wherein the information (74) indicates the predetermined spatial speed in a time-varying and/or spatially varying and/or directionally varying manner.

48. Device according to any of claims 40 or 47, configured so that the first portion (64) and the spatial section (62) coincide.

49. Device according to any of claims 40 or 48, wherein each of the spatiotemporal portions of the temporally varying spatial scene (30) encoded into the plurality of media segments are temporal segments (54) of the temporally varying spatial scene (30) at a respective one of tiles (50) into which the temporally varying spatial scene (30) is spatially subdivided.

50. Device according to claim 49, wherein each media segment of the plurality of media segments has encoded thereinto an associated spatiotemporal portion of the temporally varying spatial scene at an associated one of a set of quality levels.

51. Device according to claim 40, wherein the device is configured to set the size in a manner independent from a size of the temporally varying view section dependent on the information (74).

52. Device according to claim 40, wherein the information comprises different values for the size for different size options of the time-varying view section with the device using the value comprised by the information for a size option fitting to an actual size of the time-varying view section.

53. Device according to claim 51 or 52, wherein the information indicates the size in number of tiles.

54. Streaming server for media content pertaining a temporally varying spatial scene, configured to

render available a plurality of media segments for retrieval by a device, thereby enabling the device to select media segments for retrieval which have at least a spatial section of the temporally varying spatial scene encoded thereinto in manner according to which a first portion of the spatial section is encoded into the selected media segments at a predetermined quality, and according to which a second portion of the temporally varying spatial scene, which spatially neighbors the first portion, is not encoded into the selected media segments or encoded into the selected media segments at a further quality reduced relative to the predetermined quality, and so that the first portion follows a temporally varying view section of the temporally varying spatial scene, and

signal information on how to set a size and/or position of the first portion, in the media segments and/or by way of a signalization to the device.

55. Signal defining a media presentation description comprising

Information on computing addresses of a plurality of media segments, so that a device, using the information, may select and retrieve media segments out of the plurality of media segments which have at least a spatial section of the temporally varying spatial scene encoded thereinto in manner according to which a first portion of the spatial section is encoded into the selected media segments at a predetermined quality, and according to which a second portion of the temporally varying spatial scene, which spatially neighbors the first portion, is not encoded into the selected media segments or encoded into the selected media segments at a further quality reduced relative to the predetermined quality, and so that the first portion follows a temporally varying view section of the temporally varying spatial scene, and

information on how to set a size and/or of the first portion, in the media segments and/or by way of a signalization to the device.

56. Signal according to claim 55, wherein the information (74) is indicative of the size in form of an increment relative to, or a scaling of, a size of the temporally varying view section.

57. Signal according to claim 55 or 56, wherein the information (74) indicates a predetermined value for a measure of a spatial speed of the view section.

58. Signal according to claim 57, wherein the information (74) indicates the predetermined value for the measure of the spatial speed of the view section

for a default percentile of users for which a measured spatial speed does not exceed the predetermined value, and/or

along with a percentile value indicting a percentile of users for which a measured spatial speed does not exceed the predetermined value, and/or

along with a percentile value indicting a percentile of users for which the view section is in a predetermined area, and/or

along with a hint indicating one or more types of user inputs controlling a movement of the view section for which the predetermined value is applicable.

59. Signal according to claim 57 or 58, configured to perform the setting so that

wherein the higher the predetermined value for the measure of the spatial speed of the section d is the larger the size is.

60. Signal according to any of claims 55 or 59, wherein the information (74) indicates a predetermined value for a measure of a probability for a direction of movement of the view section.

61. Signal according to claim 60, configured to perform the setting so that

the higher the probability for a respective direction of movement is the more the first portion extends into the respective direction.

62. Signal according to any of claims 55 or 61, wherein the information (74) indicates the predetermined spatial speed in a time-varying and/or spatially varying and/or directionally varying manner.

63. Signal according to any of claims 55 or 62, configured so that the first portion (64) and the spatial section (62) coincide.

64. Signal according to any of claims 55 or 63, wherein each of the spatiotemporal portions of the temporally varying spatial scene (30 encoded into the plurality of media segments are temporal segments (54) of the temporally varying spatial scene (30) at a respective one of tiles (50) into which the temporally varying spatial scene (30) is spatially subdivided.

65. Signal according to claim 64, wherein each media segment of the plurality of media segments has encoded thereinto an associated spatiotemporal portion of the temporally varying spatial scene at an associated one of a set of quality levels.

66. Signal according to claim 55, wherein the device is configured to the size in a manner independent from a size of the temporally varying view section dependent on the information (74).

67. Signal according to claim 55, wherein the information comprises different values for the size for different size options of the time-varying view section with the device using the value comprised by the information for a size option fitting to an actual size of the time-varying view section.

68. Signal according to claim 65 or 66, wherein the information indicates the size in number of tiles.

69. Video bitstream having a video encoded thereinto, the video bitstream comprising a signaiization of one or more of a size of focus area within the video onto which a decoding power for decoding the video should be focused, and a recommended preferred view-section area of the video.

70. Decoder for decoding from a video bitstream a video, configured to

derive from the video bitstream a signaiization (74) of a size of a focus area within the video, and focus a decoding power for decoding the video onto the focus area (116).

71. Decoder according to claim 70, configured to decode the focus area exclusively.

72. Decoder according to claim 70, configured to decode start decoding each picture of the video at the focus area.

73. Decoder according to claim 70, configured to cease decoding each picture of the video upon decoding the focus area.

74. Decoder according to any of claims 70 to 73, wherein the signalization indicates the size absolutely or decoder is configured to scale a size of focus area by a parameter contained in the signalization.

75. Device for streaming media content pertaining a temporally varying spatial scene (30), configured to

derive (90) from a media presentation description

at least one version at which the temporally varying spatial scene is offered for tile-based streaming,

for each of the at least one version an indication of benefiting requirements for benefiting from the tile-based streaming the respective version of the temporally varying spatial scene,

match (92) the benefiting requirements of the at least one version with a device capability of the device or another device interacting with the device with respect to the tile-based streaming.

76. Device according to claim 75, wherein the benefiting requirements and the device capability pertain decoding capabilities.

77. Device according to claim 75 or 76, wherein the benefiting requirements and the d capability pertain numbers of available decoders.

78. Device according to any of claims 75 to 77, wherein the benefiting requirements and the device capability pertain level and/or profile descriptors.

79. Device according to any of claims 75 to 78, wherein the benefiting requirements and the device capability pertain a type of input device for moving a view section across the temporally varying spatial scene (30) or pertain a speed at which the view section is, using the input device, moved across the temporally varying spatial scene (30).

80. Device according to any of claims 75 to 79, wherein the device is configured to

select media segments out of a plurality (46) of media segments (58) available on the server (20) by computing addresses of the selected media segments using a computational rule comprised by the media presentation description ,

retrieve the selected media segments from the server (20) using the computed addresses, wherein the device is configured to perform the selection so that the selected media segments have at least a spatial section (62) of the temporally varying spatial scene (30) encoded thereinto in a manner

according to which a first portion (64) of the spatial section (62) is encoded into the selected media segments at a predetermined quality, and according to which a second portion (66; 72) of the temporally varying spatial scene, which spatially neighbors the first portion (64), is not encoded into the selected media segments or encoded into the selected media segments at a further quality reduced relative to the predetermined quality, and

so that the first portion (64) follows a temporally varying view section (28) of the temporally varying spatial scene (30).

81. Streaming server for streaming media content pertaining a temporally varying spatial scene (30), configured to

provide (90) a media presentation description from which

at least one version at which the temporally varying spatial scene is offered for tile-based streaming,

for each of the at least one version an indication of benefiting requirements for benefiting from the tile-based streaming the respective version of the temporally varying spatial scene,

is derivable, thereby enabling a device streaming the media content from the streaming server to

match (92) the benefiting requirements of the at least one version with a device capability of the device or another device interacting with the device with respect to the tile-based streaming.

82. A media presentation description comprising

Information on at least one version at which a temporally varying spatial scene is offered for tile-based streaming,

for each of the at least one version, an indication of benefiting requirements for benefiting from the tile-based streaming the respective version of the temporally varying spatial scene.

83. Media presentation description according to claim 82, wherein the benefiting requirements and the device capability pertain decoding capabilities.

84. Media presentation description according to claim 82 or 83, wherein the benefiting requirements and the device capability pertain numbers of available decoders.

85. Media presentation description according to any of claims 82 to 84, wherein the benefiting requirements and the device capability pertain level and/or profile descriptors.

86. Media presentation description according to any of claims 82 to 85, wherein the benefiting requirements and the device capability pertain a type of input device for moving a view section across the temporally varying spatial scene (30) or pertain a speed at which the view section is, using the input device, moved across the temporally varying spatial scene (30).

87. Media presentation description according to any of claims 82 to 86, further comprising a computational rule using which the device is enabled to

select media segments out of a plurality (46) of media segments (58) available on the server (20) by computing addresses of the selected media segments using the computational rule comprised.

88. Device for streaming media content pertaining a temporally varying spatial scene (30), configured to

Depending on a spatial viewport position and at least one parameter, compute addresses of media segments, the media segments describing a spatial scene (30) varying in time and the at least one parameter,

retrieve the media segments using the computed addresses.

89. Device according to claim 88, wherein the at least one parameter comprises one or more coordinates of a viewing center and/or a view depth.

90. Media presentation description comprising

a computation rule for, depending on a spatial viewport position and at least one parameter, computing addresses of media segments, the media segments describing a spatial scene (30)

varying in time and the at least one parameter so as to retrieve the media segments using the computed addresses.

91. Streaming server for allowing a device to stream media content pertaining a temporally varying spatial scene (30) from a server, configured to provide a media presentation descriptio according to claim 90.

92. Device for streaming media content pertaining a temporally varying spatial scene, configured to

select media segments out of a plurality of media segments available on a server,

wherein the device is configured to

have a first portion of the temporally varying spatial scene encoded thereinto at a quality increased compared to a spatial neighborhood of the first portion or in a manner so that the spatial neighborhood of the first portion is not encoded into the selected media segments, send-out log messages logging

a momentaneous measurement measuring a spatial position and/or movement of the first portion; and/or

a statistical value, such as a temporally average, measuring a spatial position and/or movement of the first portion; and/or

a momentaneous measurement measuring a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

an indication of a set of buffers (300) of the device involved in buffering the selected media segments, a description of a distribution rule applied in distributing the selected media segments onto the set of buffers, and a momentaneous buffer fullness of each of the set of buffers; and/or

a measurement of an amount of the selected media segments not having been output from a buffer of the device for being subject to decoding (42); and/or

a statistical value, such as a temporally average, measuring a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

a momentaneous measurement measuring the quality of the first portion or a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

a statistical value, such as an temporally average, measuring the quality of the first portion or a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

a field of view covered by the view section; and/or

a momentaneous measurement measuring a user position or view depth relative to a scene center (100); and/or

a statistical value, such as an temporally average, measuring a user position or view depth relative to a scene center (100).

93, Device according to claim 92, wherein the quality of the first portion or a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section is measured as a time duration at which a lower quality portion is visible in the view section along with a higher quality portion.

94. Device according to claim 92 or 93, configured to perform the selection such that the first portion (64) of the temporally varying spatial scene tacks the view section (28).

95. Device according to any of claims 92 to 94, configured the send-out log messages logging the momentaneous measurement measuring a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section as one of a measure measuring a mean density of pixels falling into the view section (28) at which the temporally varying spatial scene is encoded into the selected media segments.

96. Device according to claim 95, configured so that the measure measures the mean density of pixels by averaging the pixel density in a spatially uniform manner with respect to a pixel grid of pictures coded into the selected media segments.

97. Device according to claim 95, configured so that the measure measures the mean density of pixels by averaging the pixel density in a spatially non-uniform manner with respect to a pixel grid of pictures coded into the selected media segments.

98. Device according to claim 95, configured so that the send-out log messages indicate whether the measure measures the mean density of pixels by

averaging the pixel density in a spatially uniform manner with respect to a pixel grid of pictures coded into the selected media segments, or

averaging the pixel density in a spatially non-uniform manner with respect to a pixel grid of pictures coded into the selected media segments.

99. Device according to claim 97 or 98, wherein the averaging in pixel density in a spatially nonuniform manner corresponds to

averaging in a spherically uniform manner or

averaging spatially uniformly with respect to a viewport plane (310) which is perpendicular to a central view direction (312) of the view section (28).

100. Device according to any of claims 95 to 99, configured so that the measure measures the mean density of pixels by averaging the pixel density in a manner restricting the averaging to a central subsection of the view section (28), or applying a higher averaging weight, to the central subsection (202) compared to an edge portion (204) of the view section, surrounding the central subsection.

101. Device according to any of claims 95 to 100, configured so that the measure measures the mean density of pixels in a manner separately along a horizontal view section axis (204) and a vertical view section axis (206), respectively.

102. Device according to any of claims 95 to 101 configured to send-out log messages intermittently.

103. Device according to any of claims 95 to 102, configured to send-out log messages at a rate controlled by a manifest file based on which the device performs the selection of the media segments for download.

104. Device according to any of claims 95 to 103, wherein the plurality of media segments available on the server each belong to one of a plurality representations of the temporally varying spatial scene, the representations differing in one or more of

scene section (50) of the temporally varying spatial scene being encoded thereinto,

quality at which the temporally varying spatial scene is encoded thereinto,

spatial quality variation at which the temporally varying spatial scene is encoded thereinto, wherein the device is configured to send-out log messages logging the description of the distribution rule applied in distributing the selected media segments onto the set of buffers form of an association of each buffer to one of, or a combination or two or more of,

the scene section,

the quality,

the spatial quality distribution,

representation.

105. Device according to any of claims 95 to 104, wherein the representations are grouped into adaptation sets according to one or more of

the scene section of the temporally varying spatial scene being encoded thereinto,

the spatial quality variation at which the temporally varying spatial scene is encoded thereinto, wherein the device is configured to configured the send-out log messages logging the description of the distribution rule applied in distributing the selected media segments onto the set of buffers in form of an association of each buffer to one of the adaptation sets or logging the description of the distribution rule applied in distributing the selected media segments onto the set of buffers in form of an association of each buffer to one of the representations.

106. Device according to any of claims 95 to 105, wherein the device is configured to send-out log messages logging the measurement of the amount of the selected media segments not having been output from a buffer of the device for being subject to decoding (42) in form of a temporal measurement.

107. Device according to claim 106, wherein the device is configured so that the measurement of the amount of the selected media segments not having been output from a buffer of the device for being subject to decoding (42) in form of a temporal measurement in temporal units smaller than and/or being defined independent from a temporal length of the media segments (58) and/or in milliseconds.

108. Device according to any of claims 95 to 107, wherein the device is configured to send-out log messages logging the measurement of the amount of the selected media segments not having been output from a buffer of the device for being subject to decoding (42) in a format classified by one or more of

a buffer of the decoder within which the respective media segment has been buffered,

a scene section encoded into the respective media segment,

a quality at which the temporally varying spatial scene is encoded into the respective media segment,

a spatial quality distribution at which the temporally varying spatial scene is encoded into the respective media segment.

109. Method for streaming media content pertaining a temporally varying spatial scene (30), comprising

selecting (56) media segments out of a plurality (46) of media segments (58) available on a server (20),

retrieving (60) the selected media segments from the server (20), perform the selection is performed so that the selected media segments have at least a spatial section (62) of the temporally varying spatial scene (30) encoded thereinto in manner according to which a first portion (64) of the spatial section is encoded into the selected media segments at a

predetermined quality, and according to which a second portion (66) of the temporally varying spatial scene, which spatially neighbors the first portion (64), is encoded into the selected media segments at a further quality fulfilling a predetermined relationship with respect to the predetermined quality, and

the method further comprises deriving the predetermined relationship from information (68) contained in the selected media segments and/or a signalization obtained from the server (20).

110. Method for streaming media content pertaining a temporally varying spatial scene, comprising

rendering available a plurality of media segments for retrieval by a device, thereby enabling the device to select media segments for retrieval which have at least a spatial section of the temporally varying spatial scene encoded thereinto in manner according to which a first portion of the spatial section is encoded into the selected media segments at a predetermined quality, and according to which a second portion of the temporally varying spatial scene, which spatially neighbors the first portion, is encoded into the selected media segments at a further quality, and signaling information on a predetermined relationship in the media segments and/or by way of a signalization to the device, the predetermined relationship being to be fulfilled by the further quality with respect to the predetermined quality.

111. Method for streaming media content pertaining a temporally varying spatial scene (30), comprising

selecting media segments out of a plurality (46) of media segments (58) available on a server (20), retrieving the selected media segments from the server (20), wherein the selection is performed so that the selected media segments have at least a spatial section (62) of the temporally varying spatial scene (30) encoded thereinto in a manner

according to which a first portion (64) of the spatial section (62) is encoded into the selected media segments at a predetermined quality, and according to which a second portion (66; 72) of the temporally varying spatial scene, which spatially neighbors the first portion (64), is not encoded into the selected media segments or encoded into the selected media segments at a further quality reduced relative to the predetermined quality, and

so that the first portion (64) follows a temporally varying view section (28) of the temporally varying spatial scene (30), and

the method further comprising setting a size of the first portion (64) depending on information (74) contained in the selected media segments and/or a signalization obtained from the server.

112. Method for streaming media content pertaining a temporally varying spatial scene, comprising

rendering available a plurality of media segments for retrieval by a device, thereby enabling the device to select media segments for retrieval which have at least a spatial section of the temporally varying spatial scene encoded thereinto in manner according to which a first portion of the spatial section is encoded into the selected media segments at a predetermined quality, and according to which a second portion of the temporally varying spatial scene, which spatially neighbors the first portion, is not encoded into the selected media segments or encoded into the selected media segments at a further quality reduced relative to the predetermined quality, and so that the first portion follows a temporally varying view section of the temporally varying spatial scene, and

signaling information on how to set a size of the first portion, in the media segments and/or by way of a signalization to the device.

113. Method for decoding from a video bitstream a video, comprising

deriving from the video bitstream a signalization of a size of a focus area within the video, and focussing a decoding power for decoding the video onto the focus area.

114. Method, performed by a device, for streaming media content pertaining a temporally varying spatial scene (30), comprising

deriving (90) from a media presentation description

at least one version at which the temporally varying spatial scene is offered for tile-based streaming,

for each of the at least one version an indication of benefiting requirements for benefiting from the tile-based streaming the respective version of the temporally varying spatial scene,

matching (92) the benefiting requirements of the at least one version with a device capability of the device or another device interacting with the device with respect to the tile-based streaming.

115. Method for streaming media content pertaining a temporally varying spatial scene, comprising

selecting media segments out of a plurality of media segments available on a server,

retrieving the selected media segments from the server, perform wherein the selection is performed so that the selected media segments have a first portion of the temporally varying spatial scene encoded thereinto at a quality increased compared to a spatial neighborhood of the first portion or in a manner so that the spatial neighborhood of the first portion is not encoded into the selected media segments,

the method further comprising sending-out log messages logging

a momentaneous measurement measuring a spatial position and/or movement of the first portion; and/or

a statistical value, such as a temporally average, measuring a spatial position and/or movement of the first portion; and/or

a momentaneous measurement measuring a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

an indication of a set of buffers of the device involved in buffering the selected media segments, a description of a distribution rule applied in distributing the selected media segments onto the set of buffers, and a momentaneous buffer fullness of each of the set of buffers; and/or

a measurement of an amount of the selected media segments not having been output from a buffer of the device for being subject to decoding (42); and/or

a statistical value, such as a temporally average, measuring a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

a momentaneous measurement measuring the quality of the first portion or a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

a statistical value, such as an temporally average, measuring the quality of the first portion or a quality of the temporally varying spatial scene as far as encoded into the selected media segments and as far as visible in a view section; and/or

a field of view covered by the view section; and/or

a momentaneous measurement measuring a user position or view depth relative to a scene center (100); and/or

a statistical value, such as an temporally average, measuring a user position or view depth relative to a scene center (100).

116. Method for streaming media content pertaining a temporally varying spatial scene (30), comprising

providing (90) a media presentation description from which

at least one version at which the temporally varying spatial scene is offered for tile-based streaming,

for each of the at least one version an indication of benefiting requirements for benefiting from the tile-based streaming the respective version of the temporally varying spatial scene,

is derivable, thereby enabling a device streaming the media content from a streaming server to

match (92) the benefiting requirements of the at least one version with a device capability of the device or another device interacting with the device with respect to the tile-based streaming.

117. Method for allowing a device to stream media content pertaining a temporally varying spatial scene (30) from the server, comprising providing a media presentation description according to claim 90.

118. A computer program having a program code for executing the method according to any of claims 109 to 117, when the program is executed on a computer.

Documents

Application Documents

# Name Date
1 202238008167-TRANSLATIOIN OF PRIOIRTY DOCUMENTS ETC. [16-02-2022(online)].pdf 2022-02-16
2 202238008167-STATEMENT OF UNDERTAKING (FORM 3) [16-02-2022(online)].pdf 2022-02-16
3 202238008167-PROOF OF RIGHT [16-02-2022(online)].pdf 2022-02-16
4 202238008167-FORM 1 [16-02-2022(online)].pdf 2022-02-16
5 202238008167-DRAWINGS [16-02-2022(online)].pdf 2022-02-16
6 202238008167-DECLARATION OF INVENTORSHIP (FORM 5) [16-02-2022(online)].pdf 2022-02-16
7 202238008167-COMPLETE SPECIFICATION [16-02-2022(online)].pdf 2022-02-16
8 202238008167-Information under section 8(2) [19-04-2022(online)].pdf 2022-04-19
9 202238008167-FORM-26 [20-04-2022(online)].pdf 2022-04-20
10 202238008167-FORM 18 [22-04-2022(online)].pdf 2022-04-22
11 202238008167-Information under section 8(2) [23-05-2022(online)].pdf 2022-05-23
12 202238008167-Information under section 8(2) [17-06-2022(online)].pdf 2022-06-17
13 202238008167-Information under section 8(2) [04-07-2022(online)].pdf 2022-07-04
14 202238008167-Information under section 8(2) [07-07-2022(online)].pdf 2022-07-07
15 202238008167-FORM 3 [12-07-2022(online)].pdf 2022-07-12
16 202238008167-Information under section 8(2) [31-08-2022(online)].pdf 2022-08-31
17 202238008167-Information under section 8(2) [22-09-2022(online)].pdf 2022-09-22
18 202238008167-Information under section 8(2) [13-01-2023(online)].pdf 2023-01-13
19 202238008167-FORM 3 [14-01-2023(online)].pdf 2023-01-14
20 202238008167-Information under section 8(2) [31-01-2023(online)].pdf 2023-01-31
21 202238008167-Information under section 8(2) [21-02-2023(online)].pdf 2023-02-21
22 202238008167-Information under section 8(2) [25-03-2023(online)].pdf 2023-03-25
23 202238008167-Information under section 8(2) [05-06-2023(online)].pdf 2023-06-05
24 202238008167-Information under section 8(2) [20-07-2023(online)].pdf 2023-07-20
25 202238008167-FORM 3 [20-07-2023(online)].pdf 2023-07-20
26 202238008167-Information under section 8(2) [22-08-2023(online)].pdf 2023-08-22
27 202238008167-Information under section 8(2) [16-10-2023(online)].pdf 2023-10-16
28 202238008167-Information under section 8(2) [04-12-2023(online)].pdf 2023-12-04
29 202238008167-FORM 3 [11-01-2024(online)].pdf 2024-01-11
30 202238008167-Information under section 8(2) [29-01-2024(online)].pdf 2024-01-29
31 202238008167-Information under section 8(2) [29-02-2024(online)].pdf 2024-02-29
32 202238008167-Information under section 8(2) [13-03-2024(online)].pdf 2024-03-13
33 202238008167-FER.pdf 2024-09-26
34 202238008167-FORM 3 [20-11-2024(online)].pdf 2024-11-20
35 202238008167-FORM 4 [24-03-2025(online)].pdf 2025-03-24
36 202238008167-Form-4 u-r 138 [22-04-2025(online)].pdf 2025-04-22
37 202238008167-OTHERS [23-05-2025(online)].pdf 2025-05-23
38 202238008167-FER_SER_REPLY [23-05-2025(online)].pdf 2025-05-23
39 202238008167-CLAIMS [23-05-2025(online)].pdf 2025-05-23

Search Strategy

1 search8167E_09-09-2024.pdf