
Method And Apparatus For Session Bandwidth Estimation And Rate Control

Abstract: An intermediate device receives a content data message addressed to a receiving device for a communication session between a source device and the receiving device. The intermediate device substitutes adapted content data for content data of the content data message and then sends the adapted content data to the receiving device such that it appears to the receiving device that the adapted content data originated from the source device. The communication from the source device to the receiving device is intercepted by the intermediate device in a manner that is transparent to the source device and receiving device.


Patent Information

Application #
Filing Date
18 December 2013
Publication Number
52/2014
Publication Type
INA
Invention Field
COMMUNICATION
Status
Parent Application

Applicants

ALLOT COMMUNICATIONS LTD
22 HaNagar Street 45240 Hod HaSharon

Inventors

1. FOX Michael
P.O. Box 28279 San Diego California 92198
2. MUSHTAQ Faisal
3272 Nipoma Court San Jose California 95135
3. ARYA Ashwani
c/o Ortiva Wireless Inc. 4225 Executive Square Suite 700 La Jolla California 92037
4. GARRISON Ron
12440 Pathos Ln. San Diego California 92129

Specification

METHOD AND APPARATUS FOR SESSION BANDWIDTH ESTIMATION AND RATE CONTROL
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of priority of co-pending U.S. Provisional Patent Application Serial No. 61/497,458 entitled "Method And Apparatus For Session Bandwidth Estimation And Rate Control" to Michael Fox, Faisal Mushtaq, Ashwani Arya, Ronald Garrison filed June 15, 2011. Priority of the filing date of June 15, 2011 is hereby claimed, and the disclosure of the Provisional Patent Application is hereby incorporated by reference.
BACKGROUND
[0002] 1. Field of the Invention
[0003] The present invention relates to data communications and, more particularly, to managing download of progressive data for video and/or audio streams.
[0004] 2. Description of the Related Art
[0005] Video streaming over communication networks has become more and more popular. In network video streaming, a client machine (such as a desktop or laptop computer or a Web-enabled mobile phone) receives a video stream from a video source over a network connection. The video stream generally includes video format data comprising graphic or image data as well as audio data. The video stream may comprise a video clip having content of a predetermined length, such as a movie or presentation, or the video stream may comprise an ongoing video feed of undetermined length, such as output from a Web cam or some other live signal feed. Several communication protocols have been developed and are standardized to enable streaming video transfer between video source and client machine, for example, protocols such as RTSP, RTMP, HTTP progressive download, MMS, and custom protocols. Among these, progressive download streaming of videos has become very popular.
[0006] In HTTP progressive download, reproduction or playback of the video data stream begins after an initial file download using the HTTP protocol is received at the client end. The initial file comprises a portion of the video stream, and is followed by download of subsequent file content corresponding to subsequent portions of the video stream. As the file content is downloaded, the video playback proceeds after receiving a few seconds' worth of video data from the video stream, without waiting until the entire video stream has been received. The subsequent file content comprising the remaining video is downloaded, decoded, and rendered for playback. There has been tremendous demand for video viewing on the Internet, and that viewing demand has in turn increased demands on wireless network resources due to ubiquitous coverage and mobile users consuming video everywhere service is available. Consequently, the popularity of video streaming causes overloading of bandwidth-limited networks, especially radio frequency (RF) wireless networks such as, for example, cellular data networks, WiFi networks, satellite networks, and the like.
[0007] The underlying Internet network protocol used for video progressive streaming is Transmission Control Protocol (TCP) or User Datagram Protocol (UDP). In recent years, the network transfer protocol used for delivery of Internet traffic over all types of networks, including RF wireless networks, is TCP, which is used in conjunction with the Internet Protocol (IP) and is often jointly referred to as TCP/IP. TCP provides reliable, ordered, error-free delivery of a stream of bytes from a program on one computer to a program on another computer. The bytes being transferred are typically organized into packets that are routed over the network using the IP protocol. The TCP protocol has mechanisms for packet flow control, retransmission in case of packet loss, segment size, network congestion avoidance, and session control, e.g., establishment and termination.
[0008] Due to factors such as network congestion, traffic load balancing, switch memory overflow, physical link layer loss, or other unpredictable network behavior, IP packets can be lost, or delivered out of order at the receiving client. These add to processing operations at the client and can result in choppy video on playback. The receiving TCP stack detects data packet loss and/or delay problems, requests retransmission of lost packets, and rearranges out-of-order packets into the proper packet order for display. The sending TCP stack also tries to reduce network congestion, to reduce the occurrence of the other problems mentioned above, by packet flow control. Packets comprise collections of bytes of data in a contiguous PDU, or Protocol Data Unit. For TCP/IP, the PDU is defined as a TCP segment; this determines the maximum number of bytes in a PDU transported over the network. Once the receiving TCP stack, which is part of the machine's operating system kernel, has reassembled a perfect copy of the stream of data packets originally transmitted, it passes that stream data to the application program of the client machine for playback.
[0009] TCP is optimized for accurate delivery rather than for timely delivery, and therefore, TCP processing sometimes incurs relatively long delays (on the order of hundreds of milliseconds) while waiting for proper sequencing of out-of-order segments or retransmission of lost segments. Delays in the reception of packets can cause underflow of the video player at the client, resulting in stalled or choppy playback.
[0010] Wireless network links are known to experience sporadic and usually temporary packet losses due to communication artifacts such as fading, shadowing, hand-off, and other radio effects, as well as network congestion. The sending TCP will react to such losses with network back-off operations utilizing a congestion window. After the back-off of the congestion window size, TCP can enter a congestion avoidance phase with a conservative decrease in window size. This congestion avoidance phase results in reduced throughput. Because of the sporadic nature of TCP throughput in wireless networks, consistent delivery of video content requires the adaptation of the content rate to the effective network rate for stall-free playback.
[0011] Progressive download results in an aggressive (i.e., as fast as possible) download of video from the HTTP server over the network. This is another source of network inefficiency in the case of a user selecting a video for download, watching a short portion of the video, and then stopping the video. Since the progressive download transmits the video stream as quickly as possible, unviewed content may be transmitted over the network and accumulated at the client machine, only to be discarded if the user at the client device stops watching the video. This wastes valuable network bandwidth and resources.
[0012] HTTP progressive download using TCP is the predominant use case over the Internet because of pervasive support of this video delivery through computer players such as the Adobe FLASH™ player, Microsoft SILVERLIGHT™ player, and Apple QUICKTIME™ player, and through other playback devices. When transporting video or other streamed data over a network, TCP merely uses as much of the network's bandwidth as the TCP protocol allows. If the network has a bandwidth that is higher than the rate at which TCP is sending the data, no indication of available network bandwidth can be obtained using TCP.
[0013] Improved methods of estimating the bandwidth available over wired and/or wireless networks allow a variable rate download system to more effectively manage download of the content streams.
SUMMARY
[0014] A bandwidth estimation technique is used to estimate the bandwidth available from a midpoint in a network to a client device. In typical content streaming protocols, such as TCP, for example, a progressive download is used where all of the data is downloaded as fast as the channel can transmit. In contrast, download methods disclosed herein control the amount of data downloaded to maintain data comprising a certain display time of data in a client buffer, and the data is written in a way such that the bandwidth to the client can be estimated without feedback from the client device. Content data is grouped into large pulses or chunks and is downloaded quickly in order to measure the channel bandwidth. When the pulse takes too long to transmit, the system compresses or re-encodes the video at a lower data rate and transmits more frames into a block of a given size. Download techniques described herein measure network bandwidth by determining how long it takes to transmit a block of data of a predetermined size using an inelastic fixed output buffer and subsequently pacing the delivery of a content stream based on the determined bandwidth.
[0015] Other features and advantages of the present invention will be apparent from the following description of the embodiments, which illustrate, by way of example, the principles of the invention.
[0016] Additional details are provided by the attached appendices, which are incorporated herein.
BRIEF DESCRIPTION OF THE DRAWINGS
[0017] FIG. 1 is a high level functional block diagram of a system for managing download of streaming data.
[0018] FIG. 2 is a functional block diagram of the system of FIG. 1 illustrating subsystems of an adaptive progressive download (APD) server employed in the system.
[0019] FIG. 3 is a functional block diagram of a system for managing download of streaming data in a wireless operator network.
[0020] FIG. 4 is a functional block diagram of an illustration of subsystems of a video optimization gateway used in the system of FIG. 3.
[0021] FIG. 5 is a functional block diagram of subsystems of the video optimization gateway of FIG. 4.
[0022] FIG. 6 is a flow diagram of operations performed by the video optimization gateway of
FIG. 4.
[0023] FIG. 7 illustrates an example time history for pacing the writing of a content stream
using the disclosed process.
DETAILED DESCRIPTION
[0024] TCP (Transmission Control Protocol) is the most widely used transport layer protocol used for various Internet applications. Methods described herein provide a scheme to effectively estimate the end-to-end bandwidth of a TCP session without requiring any explicit feedback from a client who is receiving content associated with the TCP session. The session bandwidth estimate can then be used to provide effective rate control for high throughput applications such as video streaming and the like. The rate of content being sent can be paced dynamically to control the amount of data in a client buffer while sending data in a way that bandwidth to the client can be estimated. Maintaining a desired amount of data in the client buffer without over-trans-rating the data or sending too much data can be difficult to achieve. Methods and systems described herein are used to dynamically make the video pacing decisions at the same time that the bandwidth to the client is being estimated. The systems and methods described herein can be used with protocols other than TCP, such as the Real-Time Transport Control Protocol (RTCP). TCP is used to facilitate the description only as an example.
[0025] Methods and apparatus for bandwidth estimation and rate control during a session provide a means to improve TCP network bandwidth allocation for greater efficiency and can be used for content playback in networks with variable bandwidth such as, for example, wireless networks. Depending on the nature of encapsulation of the content, content can be adapted in various ways to accommodate the estimated network bandwidth. For content encapsulated in a non-secure, digital rights management (DRM) free encoding (e.g., YouTube Flash content or segmented MP4), the session bandwidth estimate can be used to adapt the content as necessary to a lower bandwidth encoding as the content passes through a rate adaptation element. Thus the content is delivered to the client at a bandwidth below the channel capacity for the client, and playback is smooth without stalls. For content encapsulated in a secure format that cannot be decoded by a content adaptation element, the content data rate can be paced relative to the measured network bandwidth. This provides improved network load sharing and a more consistent network performance by pacing the content delivery rate safely below the variance of the effective network rate. By pacing below the network variance, the client receives consistent video quality without stalling due to channel variability.
[0026] The bandwidth estimation scheme is used to characterize the channel by way of identifying choke points. Various impairments can cause a choke point. Packets being dropped for various reasons is the most common cause of a drop in bandwidth. The TCP protocol retransmits dropped packets. The bandwidth estimation scheme employs transmission of data pulses or blocks, and measurement of the network transmission time to estimate throughput of a TCP session. The data to be transmitted is collected together and sent as data pulses. The TCP transmission time is then measured for each data pulse and the bandwidth is measured as follows:
Session Bandwidth = (Data bytes sent) / (TCP transmission time)     (1)
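
As an illustrative sketch only, equation (1) can be evaluated directly from a measured blocked-write time; the pulse size and transmission time used below are hypothetical example values, not figures taken from this specification.

    # Hypothetical illustration of equation (1); the numbers are examples only.
    def session_bandwidth_bps(data_bytes_sent: int, tcp_transmission_time_s: float) -> float:
        """Session bandwidth in bits per second, per equation (1)."""
        return (data_bytes_sent * 8) / tcp_transmission_time_s

    # Example: a 512 KB data pulse whose blocked write completes in 2.0 seconds
    # corresponds to roughly 2.1 Mbps of estimated session bandwidth.
    print(session_bandwidth_bps(512 * 1024, 2.0))  # ~2097152 bits/s, about 2.1 Mbps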
[0027] A server employing the bandwidth estimation scheme described herein causes a user space process to write the pulse of content data to a limited size transmit (TX) buffer using, for example, an ASYNC operation. User space ASYNC writes require no special TCP kernel patches, and work with all congestion control algorithms. A limited size TX buffer is set with a socket option that blocks the writing thread until the TX buffer contains the write data. The TX buffer is modeled as a leaky bucket. With proper bucket sizing, the time to fill and empty the bucket can be used to determine the TCP session bandwidth. The time duration for the blocked write thread is used to measure the transmission time needed for the content block or pulse. Successive pulse transmissions result in successive bandwidth measurements which are utilized to improve the estimate of the available network bandwidth. Filtering (averaging) of the network bandwidth estimates improves the estimate accuracy. The content block or pulse can be adaptively sized for resolution of session bandwidth estimates. Lower network bandwidth conditions call for smaller pulses (less data) than high bandwidth network conditions.
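
A minimal user-space sketch of the write-and-time mechanism described in paragraph [0027], assuming a Linux-style TCP socket; the destination address, pulse size, and buffer size are illustrative assumptions rather than values from the specification, and error handling is omitted.

    import socket
    import time

    DEST = ("203.0.113.10", 8080)        # assumed example endpoint toward the client
    TX_BUFFER_BYTES = 32 * 1024          # small, fixed ("inelastic") send buffer
    PULSE = b"\x00" * (512 * 1024)       # one content pulse, much larger than the buffer

    sock = socket.create_connection(DEST)
    # Limit the kernel send buffer so the blocking write cannot return until most of
    # the pulse has drained toward the network (the leaky-bucket model of [0027]).
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_SNDBUF, TX_BUFFER_BYTES)

    start = time.monotonic()
    sock.sendall(PULSE)                  # blocks while the small buffer repeatedly fills and empties
    elapsed = time.monotonic() - start

    # Roughly one buffer's worth of data may still be in flight when sendall() returns,
    # so this slightly underestimates the true transmit time for small pulses.
    estimated_bw_bps = len(PULSE) * 8 / elapsed
    print(f"estimated session bandwidth: {estimated_bw_bps / 1e6:.2f} Mbps")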
[0028] A high level functional block diagram of a system 100 for managing progressive download of temporally ordered streaming data is illustrated in FIG. 1. In the illustrated system 100, a network 104 acts as a conduit of digital data, or backhaul, to a router 108 that receives the digital data. The network 104 could be a network such as the Internet, or could be a backhaul network for a cellular network, a satellite network, or other wireless network. The digital data received by the router 108 includes multiple types of data, including temporally-ordered content streams (referred to herein as content streams) such as video, audio, and combined audio/video. In this illustrated embodiment, the content streams are transported using an HTTP-based progressive download (PD) mechanism such as those used by FLASH™ by Adobe Systems Incorporated of San Jose, California, USA, or SILVERLIGHT™ by Microsoft Corporation of Redmond, Washington, USA, and the like.
[0029] The router 108, in this embodiment, is referred to as a Deep Packet Inspection (DPI) router. The DPI router 108 intercepts the digital traffic received from the network 104 and filters out the content streams from other types of traffic. All the content stream traffic and the other digital traffic, including, for example, HTML, JPEG, binary streams, and the like, is transferred over the network using the HTTP protocol. The DPI router 108 typically identifies and discerns the content streams from the other digital traffic based on MIME-type. The non-content stream traffic is forwarded from the DPI router 108 over a subnetwork 114 to user equipment 116. The user equipment 116 in this embodiment comprises a client machine, and could be a device such as a laptop computer, personal computer (PC), set-top box, netbook, cellular phone, smart phone, mobile Internet device (MID), and the like. The subnetwork 114 may include one or more wireless or wireline networks.
[0030] The DPI router 108 redirects the content stream traffic to one or more adaptive progressive download (APD) servers 112. The system 100 of FIG. 1 shows two APD servers 112, but systems may include more or fewer APD servers 112. For example, multiple APD servers 112 may be used to perform load balancing between them and to provide redundancy.
[0031] The APD servers 112 manage transfer and possible modification of the content streams over the subnetwork 114. Details of functions performed by the APD servers 112 are described, for example, in U.S. Patent Application No. 12/790,728, titled "ADAPTIVE PROGRESSIVE DOWNLOAD" and filed on May 28, 2010, assigned to the assignee of the present invention. Much of the traffic making up the content stream traffic is Internet video and is displayed on client devices using mechanisms such as Adobe FLASH™ or Microsoft SILVERLIGHT™ technology. Both of these technologies support several video codecs including well-known codecs such as H.264, VC-1, 0112, and VP6. For audio signals, these technologies are capable of supporting audio codecs such as AAC, AAC++, MP3, ADPCM, Windows Media Audio, and the like.
[0032] Content streams using Adobe FLASH™, MP4, or Microsoft SILVERLIGHT™ technologies utilize compressed data for both audio and video. The compressed audio and video data are encapsulated in file container formats such as those commonly known as Adobe Flash Video "FLV", MP4, or Windows Media Video "WMV" file formats. These file container formats provide time-stamps for rendering of audio and video data, and provide separate bins/packets to describe audio, video, or text packets.
[0033] In a typical delivery, FLV or WMV files are hosted on a web-server. Web browser plug-ins such as for FLASH™, SILVERLIGHT™, or WINDOWS MEDIA PLAYER™ are hosted in a Web browser at a client machine, and are provided with the relevant URL(s) of the content stream(s) embedded in Web pages as they are browsed by the end-users at the client machine. The hosting Web server also sets the appropriate MIME-type as video/x-flv or video/x-ms-wmv (see, e.g., the Web page at http://support.microsoft.com/kb/288102). In this way, a receiving browser knows to load the appropriate plugin to render the data which is delivered on the HTTP protocol.
[0034] Content streams directed at video players are typically transported over HTTP using TCP transport techniques. In addition, content streams that are transported over networks using HTTP progressive download typically use all the bandwidth available on the network without regard to whether or not the end user needs or wants all the content stream data as quickly as possible. The APD servers 112 estimate network conditions, estimate the temporal amount of content stored in a client's buffer, and manage transport of the content streams being transported over the subnetwork 114 using TCP.
[0035] FIG. 2 is a more detailed functional block diagram of the FIG. 1 system 100. In particular, FIG. 2 shows various subsystems of the APD server 112, including a content stream ingest and de-multiplexer (de-mux) subsystem 204, input audio first-in-first-out (FIFO) buffers 208, input video FIFO buffers 212, an APD controller 216, a multiplexer queue 224, a content stream multiplexer 228, a content stream output FIFO buffer 232, and a delivery interface 236.
[0036] The ingest/de-mux subsystem 204 receives content data streams that have been intercepted by the DPI router 108. The multiple content streams can be in one of a plurality of container formats such as Adobe FLV or Microsoft WMV. The ingest/de-mux subsystem 204 splits the individual content streams into audio and video substreams. The individual audio substreams are stored in corresponding buffers of the audio FIFO buffer 208. The audio substreams can be transcoded or re-encoded for bit rate reduction in some embodiments. The sampling rate of audio is determined at the beginning of content stream processing and is kept fixed for the duration of the content stream. However, the bits assigned per packet due to quantization can be changed.
[0037] The ingest/de-mux subsystem 204 splits the individual video substreams into epochs. In the illustrated system, the epochs are of about five seconds in length. An epoch length of about five seconds is a reasonable compromise that allows a sufficiently large piece of video to be sent to the client to have a reasonable impact on the amount of video stored in the client buffers, while at the same time not putting the APD server 112 into a situation where the adapted bitrates would be changed too frequently. The individual video epochs are stored in corresponding buffers of the video FIFO buffer 212.
[0038] While splitting the video of the content stream into epochs, the ingest/de-mux subsystem 204 looks for an intra-coded frame, or I-frame (also referred to as an IDR frame in H.264 codecs), which is at a GOP beginning boundary that will be the start of the next epoch. Those skilled in the art will understand that a "GOP" refers to a group of pictures comprising a collection of consecutive frames of video. The frames within a GOP typically comprise either I-frames, P-frames, or B-frames. According to the MPEG standard, as noted above, a GOP ordinarily begins with an I-frame. Video frames at a GOP boundary are not typically dependent on reference frames of a previous GOP. In this way, each epoch can be decoded independently of other epochs. That is, each epoch can be manipulated independently of the other epochs for transfer over the network. I-frames are typically encoded every 30 to 60 frames but could occur less frequently. Hence, the epochs are nominally about five seconds of viewing time and are typically under seven seconds.
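
One way to picture the epoch-splitting step of paragraph [0038] is the following sketch; the frame representation (a timestamp plus a keyframe flag) and the function name split_into_epochs are assumptions for illustration and do not correspond to the container parsing actually performed by the ingest/de-mux subsystem 204.

    from typing import List, Tuple

    # Toy frame model: (timestamp_seconds, is_keyframe). A real de-mux operates on
    # container packets (FLV/MP4/WMV) rather than on a flat frame list.
    Frame = Tuple[float, bool]

    def split_into_epochs(frames: List[Frame], target_len_s: float = 5.0) -> List[List[Frame]]:
        """Group frames into independently decodable epochs that begin on an
        I-frame (GOP boundary) and last roughly target_len_s seconds."""
        epochs: List[List[Frame]] = []
        current: List[Frame] = []
        epoch_start = 0.0
        for ts, is_key in frames:
            if current and is_key and (ts - epoch_start) >= target_len_s:
                epochs.append(current)   # close the epoch at the next GOP boundary
                current = []
            if not current:
                epoch_start = ts
            current.append((ts, is_key))
        if current:
            epochs.append(current)
        return epochs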
[0039] The APD controller 216 determines the rate at which to send the multiplexed stream (e.g., video and audio epochs) to the user equipment 116. The APD controller 216 also determines when to re-encode the video or audio epochs to adapt the bitrate, frame rate, or other characteristic of the video epoch to adapt to network conditions. The APD controller 216 uses two main measurements in determining when to send epochs and when to re-encode epochs. The two main measurements used by the APD controller 216 in managing the transport of the content streams are (1) an estimated network bandwidth of the subnetwork 114 as determined by the delivery interface 236 using bandwidth estimation methods described herein, and (2) an estimate of the temporal amount of an individual content stream stored at the user equipment 116.
[0040] To determine the temporal amount of a content stream stored at the user equipment 116, the APD controller 216 keeps track of the duration of the epochs (in seconds of viewing time) that have been delivered via the delivery interface 236. The APD server 112 also keeps track of the average video rate of the epochs, the estimated network bandwidth being utilized to transport the video, and previous estimates of the temporal amount of content stored at the user equipment 116 by knowing the timestamps of multiplexed audio/video being sent over the network.
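
The client-buffer bookkeeping of paragraph [0040] can be sketched roughly as follows; the specification does not give an explicit formula, so the accounting below (delivered playback seconds minus elapsed wall-clock time since the first delivery) is an assumption for illustration only.

    import time

    class ClientBufferEstimator:
        """Rough estimate of seconds of content buffered at the user equipment,
        assuming playback begins when the first epoch is delivered."""

        def __init__(self) -> None:
            self.delivered_s = 0.0       # total seconds of viewing time delivered so far
            self.first_delivery = None   # wall-clock time of the first delivery

        def on_epoch_delivered(self, epoch_duration_s: float) -> None:
            if self.first_delivery is None:
                self.first_delivery = time.monotonic()
            self.delivered_s += epoch_duration_s

        def buffered_seconds(self) -> float:
            if self.first_delivery is None:
                return 0.0
            elapsed = time.monotonic() - self.first_delivery
            return max(0.0, self.delivered_s - elapsed)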
[0041] The APD controller 216 is coupled to a bank of audio encoders 218 and a bank of video encoders 220. A "bank" of video encoders 220 is typically controlled by one APD controller 216 because video encoding is a much more computationally demanding task than the tasks performed by the APD controller 216. Similarly, audio encoding could also require a bank of audio encoders 218. If the APD controller 216 determines that a lower bit rate for the current epoch of video and/or audio is needed, the APD controller 216 triggers the video encoders 220 and/or the audio encoders 218 to re-encode the video stream and/or the audio stream, respectively, at specified bitrates. The APD controller 216 controls the audio encoders 218 and the video encoders 220 to re-encode portions of audio and/or video to maintain client buffers at or above a low buffer limit.
[0042] During low bandwidth conditions, it is desirable to reduce the data rate of audio and/or video streams. The APD controller 216 can decide, in extremely low network conditions, to re-rate or re-encode the audio from the input audio FIFO 208. In order to achieve audio encoding, the bank of audio encoders 218 is used. The output from the bank of audio encoders 218 is given to the stream multiplexer 228 input queue.
[0043] When the video encoders 220 finish re-encoding an epoch of a video stream, the video stream epoch is communicated to an input queue of the video interface 224 of the APD server 112. The video interface 224 also receives epochs that have not been re-encoded from the APD controller 216. The video interface 224 forwards the re-encoded and non-re-encoded epochs to the content stream multiplexer 228. The content stream multiplexer 228 reunites the video epochs received from the video interface 224 with the corresponding audio epochs that were stored in the audio FIFOs 208. The content stream multiplexer 228 creates new containers including synchronized audio and video. The containers can be in, for example, Adobe FLV or Microsoft WMV format. Upon reuniting the audio and video epochs, the content stream multiplexer 228 forwards the containers to the output FIFO buffer 232.
[0044] The content stream containers are stored in the output FIFO buffer 232 until the delivery interface 236 retrieves them for delivery to the router and subsequent delivery to the corresponding user equipment 116. The delivery interface 236 is controlled to deliver the content stream epochs as determined by the APD controller 216 to keep the temporal amount of content stream stored in the buffer of the user equipment 116 at a desired level, as discussed above.
[0045] Referring to FIG. 3, a functional block diagram of a system 300 for managing download of streaming data in a wireless operator network is shown. Wireless networks that can use methods and systems described herein include cellular, WiFi, WiMax, LAN, WAN, satellite, in-flight ISPs that provide video to aircraft and passengers, and others, including wired networks. The system 300 includes a content source 302, a main network 306 and a wireless operator network 310. The main network 306 can be, for example, the Internet. Located at an interface between the main network 306 and the wireless network 310 is one of the DPI routers 108 discussed above. The DPI router 108 controls the flow of data into a regional network 322 of the wireless operator network 310. The DPI router 108 and the regional network 322 can be located in what is referred to as a backbone of the wireless operator network 310. The regional network 322 is coupled to base stations 326, 328 and 330 of the wireless operator network 310. Three base stations 326-330 are illustrated in this example, but more or fewer base stations can be used. The base stations 326, 328 and 330 control the flow of data to mobile devices 334 and 336 that are located in sectors or cells of the base stations 326-330.
[0046] The DPI router 108 is coupled to an internet video optimization gateway (IVOG) system 318. The IVOG system 318 includes subsystems such as APD servers 112, and audio and video rate adaptors 218 and 220 discussed above. The DPI router 108 intercepts streamed content and forwards the intercepted data to the IVOG 318 for processing. The APD servers 112 and audio and video rate adaptors 218 and 220 of the IVOG 318 process the data and return the processed data to the DPI router 108 as discussed above in reference to FIG. 2.
[0047] FIG. 4 shows a functional block diagram of an illustration of subsystems of the IVOG system 318-1 used in the system 300 of FIG. 3. The IVOG system 318-1 is arranged in a stack with one or more data path processing (DPP) systems 402 on top, one or more APD servers 406 under a DPP system 402, and one or more video/audio rate adapters 410 below the APD servers 406. An example of the DPP system 402 is described in related U.S. Patent Application No. 13/466023 entitled "Data Path Processing" filed May 7, 2012. The DPP system 402 receives data intercepted by the DPI router 108. The DPP system 402 coordinates the processing of content streams to the one or more APD servers 406. The APD servers 406 then coordinate the re-encoding of video/audio and/or other content with the one or more rate adapters 410. Re-encoding of content can include one or more of changing frame rate, changing frame type, increasing and/or decreasing the quantization levels of the content, removing content and adding content.
[0048] The APD servers 406 can be configured similarly to the APD server 112 shown in FIG. 2. The rate adapters 410 can include audio and video rate adapters such as the audio rate adapters 218 and the video rate adapters 220 shown in FIG. 2.
[0049] The IVOG system 318-1 is illustrated in a divert mode in FIG. 3. In contrast to receiving content streams from the DPI router 108 in the divert mode, as illustrated in FIG. 3, the DPP system 402 can be configured in a bridge mode where all the data goes through the DPP system 402. In the bridge mode, the DPP system 402 communicates data directly to and from both the main network 306 and the regional network 322.
[0050] With reference to FIGS. 3 and 4, the DPP system 402 is configured to identify content streams such as multimedia (audio, video and/or text), video streams and/or audio streams that are being communicated from the main network 306 (e.g., from the content source 302) to recipient devices (e.g., the mobile devices 334 and 336). In the case of TCP, a connection between the recipient device and the content source is normally established by an exchange of signaling messages between a respective one of the recipient devices and the content source. When a content stream is identified, the DPP 402 diverts the content stream to one of the APD servers 406.
[0051] FIG. 5 is a functional block diagram of subsystems of an IVOG system 318-2. The IVOG system 318-2 includes one DPP system 402, one APD server 406 and one or more rate adapters 410. Other IVOG systems 318 can include more components than those in the IVOG system 318-2. As discussed above in reference to FIG. 3, the DPP 402 can be located between the network 306 (e.g., the Internet) and the regional network 322. The DPP 402 communicates data to and from the networks 306 and 322 (in the bridge mode) or to and from the DPI 108 (in the divert mode) through a respective network interface 501. When the DPP system 402 identifies a content stream of interest, e.g., a TCP stream that can be rate adapted, a TCP ingest buffer 504 in an operating system 502 stores the content data stream. The operating system 502 could be any one of a variety of operating systems such as Windows, Symbian, or another operating system. TCP controls the packet flow of multiple users. Using bandwidth estimation techniques in accordance with the disclosure, the available bandwidth of the lower rate networks at the ends of the network (e.g., the individual cells controlled by the base stations 326, 328, and 330 as shown in FIG. 3) can be estimated.
[0052] After content streams are stored in the TCP ingest buffer 504, the ingest/de-mux subsystem 204 receives the content data streams that have been intercepted. The ingest/de-mux subsystem 204 splits the individual content streams into audio and video substreams. The individual audio substreams are stored in corresponding buffers of the audio FIFO buffer 208 and the individual video substreams are stored in corresponding buffers of the video FIFO buffer 212 (audio buffers 208 and video buffers 212 are combined in FIG. 5).
[0053] The APD controller 216, the video interface 224, and the content stream multiplexer 228 are contained within a rate adaptation module 506 and perform similar functions as those discussed above in reference to FIG. 2. Content streams that are to be re-encoded are forwarded to the rate adapters 410. The rate adapters 410 can include the audio rate adapters 218 and the video rate adapters 220. The re-encoded video/audio streams and the non-re-encoded video/audio streams are stored in the FIFO buffer 232. The pacing rate estimate write module 508 is part of, or used instead of, the delivery interface 236 of FIG. 2. The rate adaptation module 506 determines whether or not to re-encode video and/or audio streams based on a bandwidth estimate 514 received from a pacing rate estimate write module 508. The rate adaptation module 506 determines whether or not to re-encode content streams periodically, e.g., every one second, every two seconds, every three seconds or more. The determination to re-encode is based on the bandwidth estimate 514 as well as an estimate of the amount of data estimated to be stored in the client buffers, as was discussed above in reference to FIG. 2.
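
The periodic re-encode decision of paragraph [0053] can be sketched as follows; the threshold names, the headroom factor, and the specific comparison are assumptions for illustration, since the specification states only that the decision is based on the bandwidth estimate 514 and the estimate of data stored in the client buffers.

    def should_reencode(content_rate_bps: float,
                        bw_estimate_bps: float,
                        client_buffer_s: float,
                        low_buffer_limit_s: float = 5.0,   # assumed threshold
                        headroom: float = 0.8) -> bool:    # assumed safety fraction
        """Return True if upcoming epochs should be re-encoded at a lower rate:
        either the encoded rate exceeds a safe fraction of the estimated channel
        bandwidth, or the client buffer is running low."""
        return (content_rate_bps > headroom * bw_estimate_bps
                or client_buffer_s < low_buffer_limit_s)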
[0054] The pacing rate estimate write module 508 (referred to herein as the write module 508) retrieves blocks of data from individual content streams that are to be transmitted over the networks 306 and/or 322 from the FIFO buffer 232. The size of the pulse data is related to the rate of the stream that is being streamed. The pulse data is simply a byte stream, and is not necessarily aligned to any particular container boundary. For typical video, the range of data rates covers a range on an order of about 2.5 or three times from the lowest data rates to the highest data rates. For example, complex scenes typically take about 2.5 times the bandwidth of simple scenes. For this reason, the write module 508 typically sends about three or four times the lowest bandwidth of a video in a single data block, but the write module 508 sends blocks of data about half as often. For example, the write module 508 may transmit about two seconds of media data in an 800 msec time period, depending on the bandwidth. If the data is completely sent out faster than the media duration, the write module 508 estimates the capacity of the channel, and then the write module 508 can (1) adjust the block size to take up a portion of the excess capacity, or (2) the rate change module 506 can transrate the video to take up less capacity if the estimated bandwidth of the channel is insufficient to support the non-re-encoded video data rate.
[0055] The bandwidth estimator of the write module 508 can be implemented as a user-space process, or a thread within a user space process. The user space process can be, for example, a process in the Linux operating system. The user-space process is above the kernel level. The kernel level contains the TCP functions. User space processes and threads are simple and do not require modification of the kernel layer.
[0056] Upon retrieving the block of data for an individual content stream from the FIFO buffer 232, the write module 508 writes the chunk of data to a TCP out-buffer 510. The TCP out-buffer 510 is part of the TCP stack in the application layer. The TCP out-buffer 510 includes multiple inelastic buffers 512. Two buffers 512-1 and 512-2 are shown, but there can be more buffers 512, one for each content stream being transmitted out of the IVOG 318-2. Oftentimes the buffers used for TCP are elastic, but the IVOG system 318-2 utilizes fixed or inelastic out-buffers 512.
[0057] The individual out-buffers 512 are small enough such that the chunks of data being written to the out-buffers 512 by the write module 508 will fill up the individual out-buffers 512. For example, the chunks of data can be about ten times as large as the individual out-buffers 512. The TCP application controls the transmission and re-transmission of data out of the individual out-buffers 512 based on feedback from the downstream networks 306 and/or 322. The data in the individual out-buffers 512 is communicated to the DPP 402 to be transmitted to the client device using a normal protocol such as TCP.
[0058] As a non-limiting example, if the average data rate that is being transmitted in one content stream is about 1 Mbps (125 KBps), then the out-buffer 512 should hold about 32 kbytes of data, and in order to transmit a 2 second pulse of data the buffer will be filled and emptied 8 times. The total time for the complete write of the data pulse is used for the network bandwidth estimate. Sizing the buffer to 32K allows efficient transport of content rates from 100 Kbps to 2 Mbps. Significantly lower content rates may be better suited to a smaller buffer. The out-buffer 512 fills up before the write completes, but the write still keeps writing to the buffer. The write module 508 is simply doing block-writes to the out-buffers 512 in the TCP stack. Since an individual out-buffer 512 fills up, the write module 508 just keeps writing. As long as the write module is writing faster than the individual out-buffer 512 is being emptied, the write module can determine how long it takes to completely write the entire chunk of data. The internal dynamics of the TCP application controls the emptying of the individual out-buffers 512.
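
The arithmetic behind the non-limiting example in paragraph [0058], restated as a sketch:

    # A ~1 Mbps stream corresponds to 125 KB of media per second, so a 2-second
    # pulse is roughly 250 KB; with a 32 KB out-buffer, writing that pulse fills
    # and drains the buffer about 8 times.
    stream_rate_bps = 1_000_000
    pulse_seconds = 2
    out_buffer_bytes = 32 * 1024

    pulse_bytes = stream_rate_bps / 8 * pulse_seconds    # 250,000 bytes
    fills = pulse_bytes / out_buffer_bytes               # ~7.6, i.e., about 8
    print(round(fills))                                  # 8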
[0059] Knowing the time to complete the writing of the chunk of data, and knowing the size of the pulse data chunk, the write module 508 computes the estimated bandwidth 514 by dividing the data chunk size by the time to complete the block write operation. The block write operation is a conventional TCP feature. The block write operation doesn't complete until the entire data block has been sent. A socket library of the operating system 502 provides a completion signal to the write module 508 when the block write operation has completed writing the data block to the out-buffer 512. Upon completion of writing a data block, the write module 508 determines the estimated bandwidth 514 and communicates the determined bandwidth 514 to the rate adaptation module 506 such that the rate adaptation module 506 can determine whether or not to re-encode the associated content stream.
[0060] If, while writing a data chunk to one of the individual output buffers 512, the time to transmit the data chunk exceeds a threshold amount of time (e.g., a fraction of the display time of the data chunk), the write module 508 stops writing data from the data chunk to the individual out-buffers 512 and the write module 508 makes an estimate of the bandwidth based on how much of the data chunk has been written in the threshold amount of time. The write module 508 can use a running average of a number of previous computed bandwidths to compute the bandwidth.
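
A sketch of the smoothing and timeout fallback described in paragraph [0060]; the use of an exponentially weighted moving average and the smoothing factor are assumptions for illustration (the specification says only that a running average of previous computed bandwidths can be used).

    class BandwidthFilter:
        """Smooths successive per-pulse bandwidth measurements."""

        def __init__(self, alpha: float = 0.3) -> None:   # assumed smoothing factor
            self.alpha = alpha
            self.estimate_bps = None

        def update(self, bytes_sent: int, elapsed_s: float) -> float:
            sample = bytes_sent * 8 / elapsed_s            # equation (1)
            if self.estimate_bps is None:
                self.estimate_bps = sample
            else:
                self.estimate_bps = ((1 - self.alpha) * self.estimate_bps
                                     + self.alpha * sample)
            return self.estimate_bps

    # Timeout fallback: if a write exceeds its threshold time, update the filter
    # from the portion of the chunk actually written rather than the full block:
    #     estimate = bw_filter.update(bytes_written_so_far, threshold_seconds)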
[0061] Another limiting factor that determines the size of the data chunks written to the individual out-buffers 512 is the size of the buffers in the TCP channel. There are buffers upon buffers in the TCP stack going to the destination device. The individual out-buffers 512 are large enough and being written to fast enough by the write module 508, such that the buffers in the TCP stack are being written to faster than the out-buffers 512 are being emptied. The estimated bandwidth does not need to be an accurate indication of data rates that are much larger than the data rate of the content being supplied. In other words, if the write module 508 is able to estimate bandwidths about 10 times greater than the average data rate of the content stream, then this is sufficient for present wireless network conditions. It is not necessary in most circumstances to be able to measure bandwidths that are more than 10 times the data rate of the content being written to the individual out-buffers 512. It is more important to be able to detect when the bandwidth of the channel is getting too small for the desired content data rate.
[0062] FIG. 6 is a flow diagram of operations performed in a process 600 by the video optimization gateway 318-2 of FIG. 5. With reference to FIGS. 5 and 6, the process 600 starts at step 605 where the write module 508 retrieves content data from the FIFO buffer 232. The FIFO buffer 232 contains multiple re-encoded and non-re-encoded content streams that have been processed by the rate adaptation engine 506. The multiple content streams are identified by content stream identification numbers, frame number, etc. At block 610, the writing module 508 breaks individual content streams into data blocks to be written to the individual out-buffers 512 and transmitted to the receiving devices via the DPP system 402. The size of the content blocks is determined by (1) the amount of content buffered at the client devices, and (2) by the estimated bandwidth of the transmit channel. Usually, each block will contain 1, 2, 3 or more epochs of content, where each epoch is decodable independently from other epochs. The rate adaptation module 506 provides epochs re-encoded to be compatible with past bandwidth estimates 514 that the write module 508 has communicated to the rate adaptation module 506.
[0063] Upon breaking the content stream(s) into block(s) at step 610, the process 600 continues at step 615 where the writing module 508 performs block writes of the data blocks to the individual out-buffers 512 in the TCP out-buffer 510. The data blocks are written to the out-buffers as fast as the operating system 502 allows such that the out-buffers 512 fill up while the TCP application controls transmission and re-transmission of data packets over the networks 306 and/or 322. At decision block 620, the writing module 508 determines whether or not a data block has finished being transmitted to the receiving device. If the data block has finished transmission, the process 600 proceeds to block 625, where the writing module 508 determines the amount of time that was needed to transmit the data block.
[0064] Upon determining the transmit time, the writing module determines an estimated bandwidth at step 630 by dividing the known size of the data block by the transmit time. Video data rate and channel bandwidth can vary significantly over time. The bandwidth determination at step 630 can include time-averaging (e.g., low-pass filtering) of a number of previous bandwidth estimates in order to smooth out sporadic changes in data rate and/or bandwidth.
[0065] If it is determined at step 620 that the data block is not finished being transmitted, the process 600 continues at block 635 where the writing module 508 determines if a maximum threshold transmission time for the data block has been exceeded. The maximum transmission time threshold can be a function of the display time of the data block. For example, if four seconds of video are contained in a data block, the maximum transmission threshold could be set to two, three or four seconds. If the maximum transmission threshold has been exceeded, the writing module temporarily stops and determines the estimated bandwidth at step 640 based on the amount of the data block that was transmitted and the maximum transmission threshold time. After the network bandwidth estimation update, the remaining bytes of the pulse are sent in the next pulse. If the maximum transmission threshold is determined not to be exceeded for the current data block at decision block 635, the process repeats steps 615, 620 and 635.
[0066] After the writing module 508 has determined the estimated bandwidth, either at step 630 or step 640, the process continues at step 645 where the writing module adjusts a size of a subsequent data block and/or the rate adaptation module 506 re-encodes subsequent epochs of video and/or audio to be compatible with the determined bandwidth estimate. At block 650, the writing module 508 determines if more content remains in a content stream to be transmitted. If more content exists, the steps 605, 610, 615, 620, 625, 630, 635, 640, 645, and 650 are repeated. If no more content exists in a present content stream, the process 600 terminates for the current content stream and continues for other content streams that have content to be transmitted.
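
Pulling the steps of FIG. 6 together, a highly simplified single-stream sketch of process 600 might look like the following; the helper names (has_more, next_block, blocking_write, reencode_if_needed, set_next_block_size) are hypothetical placeholders for illustration, not elements disclosed in the specification.

    import time

    def run_pacing_loop(content, sock, bw_filter, max_tx_fraction: float = 0.75):
        """Skeleton of process 600: write a block, time it, estimate bandwidth,
        then adapt the next block (steps 605-650)."""
        while content.has_more():                                   # step 650
            block = content.next_block()                            # steps 605-610 (hypothetical helper)
            threshold_s = max_tx_fraction * block.display_time_s    # step 635 threshold
            start = time.monotonic()
            sent_bytes = blocking_write(sock, block.data, timeout_s=threshold_s)  # step 615 (hypothetical)
            elapsed = time.monotonic() - start
            estimate = bw_filter.update(sent_bytes, min(elapsed, threshold_s))    # steps 625-640
            content.reencode_if_needed(estimate)                    # step 645 (hypothetical)
            content.set_next_block_size(estimate)                   # step 645 (hypothetical)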
[0067] FIG. 7 illustrates an example time history 700 for pacing the writing of a content stream using the process 600. The time history 700 includes a content stream data rate time history 705 that varies over time. The data rate time history 705 represents changing encoded data rates of video where low complexity video requires a small data rate and more complex video requires a higher data rate. The video data rate time history 705 varies in a range between R0 and about R6. At time t0, a first data block of size B0 is written to one of the out-buffers 512 by the writing module 508. Typically, the first data block of a content stream will be larger than subsequent data blocks in order to quickly build up the amount of content contained in the receiving device buffers for playback. For example, the first data block of size B0 could contain ten to fifteen seconds of playback time.
[0068] The first data block takes a time Δt0 to finish transmitting to the receiving device. Subsequent to the transmission of the first data block, the writing module 508 determines an estimated bandwidth by dividing the data block size B0 by the time Δt0. The estimated channel bandwidth is illustrated by the dashed line labeled BW Est. The first BW Est. is between a data rate of R5 and R6. The writing module 508 determines subsequent data block sizes B1, B2, B3 and B4 based on the estimated bandwidth and/or based on the amount of playback time contained in the receiving device buffers (not illustrated in FIG. 7).
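
One plausible way to size subsequent data blocks consistently with paragraph [0068]; FIG. 7 does not give an explicit formula, so the target-buffer parameter and the capping rule below are assumptions for illustration only.

    def next_block_bytes(bw_estimate_bps: float,
                         content_rate_bps: float,
                         buffered_playback_s: float,
                         target_buffer_s: float = 10.0) -> int:   # assumed target level
        """Send enough media to bring the client buffer back toward the target,
        capped at what the estimated channel could move in that playback time."""
        deficit_s = max(0.0, target_buffer_s - buffered_playback_s)
        wanted_bytes = deficit_s * content_rate_bps / 8
        ceiling_bytes = deficit_s * bw_estimate_bps / 8
        return int(min(wanted_bytes, ceiling_bytes))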
[0069] The time history 700 illustrates a situation where the rate adaptation module 506 determines a need to re-encode the content stream at a lower data rate. A spike in the data rate time history 705 is shown by reference numeral 710. The data rate spike 710 is greater than the estimated bandwidth of the channel (illustrated by the dashed lines labeled BW Est.). Because the content data rate is greater than the estimated channel bandwidth, the rate adaptation module 506 determines to re-encode the content at a lower data rate 715 of about R5 which is lower than the channel bandwidth estimate BW Est. When the content data rate falls below the channel bandwidth estimate, the rate adaptation module 506 stops re-encoding the content. The time history 700 of FIG. 7 is not drawn to scale and is used here for illustrative purposes only.
[0070] The time between the start of the transmission of data blocks is fairly periodic (about every 5 seconds), but it can vary somewhat due to re-encoding delays and the length of time to transmit the data blocks. When the available channel bandwidth is dropping, the time between transmission of data blocks basically disappears while the content is being re-encoded at lower data rates.
[0071] In one embodiment, a computer system (such as the IVOG systems 318 of FIGS. 3, 4 and 5) is used to perform methods as described herein. According to a set of embodiments, some or all of the procedures of such methods are performed by the computer system in response to a processor of the system executing one or more sequences of one or more instructions (which might be incorporated into the operating system and/or other code of the computer system, such as an application program) contained in working memory of the computer system. Such instructions may be read into the working memory from a machine-readable medium, such as one or more storage devices. Merely by way of example, execution of the sequences of instructions contained in the working memory might cause the IVOG system 318-2 to perform one or more procedures of the methods described herein.
[0072] The terms "machine readable medium" and "computer readable medium," as used herein, refer to any medium that participates in providing data that causes a machine to operate in a specific fashion. In an embodiment implemented using the IVOG system 318, various machine-readable media might be involved in providing instructions/code to processors for execution and/or might be used to store and/or carry such instructions/code (e.g., as signals). In many implementations, a computer readable medium is a physical and/or tangible storage medium. Such a medium may take many forms, including but not limited to, non-volatile media, volatile media, and transmission media. Non-volatile media includes, for example, optical or magnetic disks, such as storage devices. Volatile media includes, without limitation, dynamic memory, such as the working memory. Transmission media includes coaxial cables, copper wire, and fiber optics, including the wires that comprise a system bus of the IVOG system 318, as well as various components of subsystems such as a communications subsystem or network delivery interface (and/or the media by which the communications subsystem provides communication with other devices).
[0073] Common forms of physical and/or tangible computer readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD-ROM, any other optical medium, punch cards, paper tape, any other physical medium with patterns of holes, a RAM, a PROM, an EPROM, a FLASH-EPROM, any other memory chip or cartridge, or any other medium from which a computer can read instructions and/or code.
[0074] Various forms of machine-readable media may be involved in carrying one or more sequences of one or more instructions to the computer processor for execution. Merely by way of example, the instructions may initially be carried on a magnetic disk and/or optical disc of a remote computer. A remote computer might load the instructions into its dynamic memory and send the instructions as signals over a transmission medium to be received and/or executed by the IVOG system 318. These signals, which might be in the form of electromagnetic signals, acoustic signals, optical signals, and/or the like, are all examples of carrier waves on which instructions can be encoded, in accordance with various embodiments of the invention.
[0075] The present invention has been described above in terms of presently preferred embodiments so that an understanding of the present invention can be conveyed. There are, however, many configurations of systems for managing the delivery of progressively downloaded video data not specifically described herein but with which the present invention is applicable. The present invention should therefore not be seen as limited to the particular embodiments described herein, but rather, it should be understood that the present invention has wide applicability with respect to video data delivery systems generally. All modifications, variations, or equivalent arrangements and implementations that are within the scope of the attached claims should therefore be considered within the scope of the invention.

WE CLAIM:
1. A method of streaming data over a network, the method comprising:
sending a first portion of data via one or more data networks toward a first device,
the first portion of data comprising contiguous data of a content stream, wherein the first portion
of data has a first presentation time, and the first portion of data is of a first size that is greater
than a buffer size of an application layer write buffer;
determining a first transmit time to send the first portion of data based on an
operating system write completion signal;
determining a second presentation time to include in a second portion of data to
be sent via the one or more data networks towards the first device based on the determined first
transmit time and the size of the first portion of data, the second presentation time being
determined to be greater than an estimated second transmit time to send the second portion of
data.
2. The method of claim 1, further comprising:
determining a metric of a bandwidth capacity available on the one or more data
networks based on the size of the first portion of data and the first transmit time;
wherein the second presentation time is determined based on the determined
metric.
3. The method of claim 2, wherein the second portion of data is encoded at
an encoded bit rate, the method further comprising:
determining a bit rate at which to re-encode the second portion based on the
determined second presentation time and the determined metric.
4. The method of claim 2, further comprising:
determining that the first transmit time to send the first portion has exceeded a
threshold time; and
wherein determining the metric of the bandwidth capacity is based on an amount
of the first portion that has been transmitted and the threshold time.
5. The method of claim 1, further comprising determining the second
presentation time to be a predetermined multiple of the first transmit time.
6. A computer system comprising:
a network interface through which data is sent and received over one or more
computer networks;
a processor configured to send a first portion of data via the one or more data
networks toward a first device, the first portion of data comprising contiguous data of a content
stream, wherein the first portion of data has a first presentation time, and the first portion of data
is of a first size that is greater than a buffer size of an application layer write buffer, the processor
further configured to determine a first transmit time to send the first portion of data based on an
operating system write completion signal and to determine a second presentation time to include
in a second portion of data to be sent via the one or more data networks towards the first device
based on the determined first transmit time and the size of the first portion of data, the second
presentation time being determined to be greater than an estimated second transmit time to send
the second portion of data.
7. The system of claim 6, further comprising:
determining a metric of a bandwidth capacity available on the one or more data
networks based on the size of the first portion of data and the first transmit time;
wherein the second presentation time is determined based on the determined
metric.
8. The system of claim 7, wherein the second portion of data is encoded at an
encoded bit rate, and the processor is further configured to determine a bit rate at which to re-encode
the second portion based on the determined second presentation time and the determined
metric.
9. The system of claim 7, wherein the processor is further configured to
determine that the first transmit time to send the first portion has exceeded a threshold time,
wherein the processor determines the metric of the bandwidth capacity based on an amount of
the first portion that has been transmitted and the threshold time.
10. The system of claim 6, further comprising determining the second
presentation time to be a predetermined multiple of the first transmit time.
11. A computer method comprising:
writing a preceding data block to a fixed size out-buffer for transmission;
determining an estimated amount of time to transmit the preceding data block
over a communication network to a client device;
determining an estimated bandwidth for transmitting the preceding data block to
the client device over the communication network;
adjusting a subsequent data block for transmitting the subsequent data block over
the communication network within the estimated bandwidth.
12. The computer method of claim 11, wherein the estimated bandwidth for
transmitting the preceding data block is less than a channel capacity for the client device.
13. The computer method of claim 11, wherein adjusting comprises changing
a size of the subsequent data block, as compared with a size of the preceding data block, for
transmission within the estimated bandwidth.
14. The computer method of claim 11, wherein adjusting comprises changing
a data rate at which the subsequent data block is transmitted, as compared with a data rate at
which the preceding data block was transmitted, for transmission within the estimated
bandwidth.

Documents

Application Documents

# Name Date
1 10913-DELNP-2013-AbandonedLetter.pdf 2019-09-26
2 pct document.pdf 2013-12-18
3 form 5.pdf 2013-12-18
4 10913-DELNP-2013-FER.pdf 2018-10-30
5 form 3.pdf 2013-12-18
6 Form 3 [19-04-2017(online)].pdf 2017-04-19
7 10913-delnp-2013-Correspondence-Others-(17-06-2014).pdf 2014-06-17
8 drawings.pdf 2013-12-18
9 Complete specification.pdf 2013-12-18
10 10913-delnp-2013-Form-3-(17-06-2014).pdf 2014-06-17
11 10913-DELNP-2013.pdf 2014-01-09
12 10913-delnp-2013-GPA-(17-06-2014).pdf 2014-06-17
13 10913-delnp-2013-Correspondence-Others-(10-02-2014).pdf 2014-02-10

Search Strategy

1 10913DELNP2013Searchstratgy_26-10-2018.pdf