You are on page 1of 4

This article has been accepted for publication in a future issue of this journal, but has not been

fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LCOMM.2016.2628378, IEEE
Communications Letters
1

A Cross-Layer Bandwidth Allocation Scheme for


HTTP-based Video Streaming in LTE Cellular
Networks
Stefania Colonnese, Member, IEEE, Francesca Cuomo, Senior Member, IEEE, Tommaso Melodia, Senior
Member, IEEE, Izhak Rubin, Fellow, IEEE

AbstractThis paper investigates the benefits of flexible The proposed architectural model fits the standard LTE
resource allocation when performing HTTP-based Adaptive architecture [8]; the LTE Radio Resource Manager operates
Streaming (HAS) across cellular systems such as Long Term by collecting the CQIs reported by the UEs (A2), it inter-
Evolution (LTE). To guarantee video fluidity in the presence of
fluctuations of the instantaneous video source rate and channel acts with the upper layer to receive the bandwidth requests
capacity, we consider a HAS based proxy video manager and available at the proxy (A1), and then allocates the requested
resource controller located at the cellular base station. Based resources (number of Resource Blocks-RB) by also selecting
on the channel quality observed by mobile clients, the manager the proper Modulation and Coding Scheme (MCS) (A3). All
allocates the wireless bandwidth to mobile clients for transmitting these operations are only at the BS side, and apart from the
the video streams. We propose a cross-layer bandwidth allocation
scheme that takes into account the channel quality as well as interaction with the proxy that provides the actual chunk size
the video quality requirements and encoding rate fluctuations of per user, all the other operations are fully standard compliant
the HAS video stream and minimizes the transmission delays and both the server and the client are unaltered.
experienced by users. This cross-layer bandwidth allocation In this framework, we propose a new bandwidth allocation
achieves the optimum in terms of HAS streams delays and mechanism denoted as Dynamic Minimum Average Delay (D-
it outperforms different bandwidth allocations procedures and
state-of-the-art LTE schedulers. MAD) that periodically operates at a temporal scale of the
chunk duration (seconds), and jointly accounts for the users
fluctuating channel qualities to guarantee the targeted video
I. I NTRODUCTION QoE while minimizing the streams playout delay. We evaluate
Mobile streaming services will be pervasive in 4G and the D-MAD in case of a LTE cellular system and we show
future cellular platforms [1]. In the HTTP Adaptive Streaming that it outperforms different state-of-the art schedulers [8][9].
(HAS) client-server architecture, the server stores a few ver-
sions of the same video content encoded at different average
II. R ELATED W ORK
rates and parsed into short duration video chunks; the client
requests chunks at the average rate that matches the end-to-end Several recent works address the problem of video stream-
throughput [2][3] and/or playout buffer status [4]. Switching ing bandwidth allocation to avoid quality variations during
among different rates induces visual quality fluctuations that the session. Recent approaches use proxy based adaptation
affect the user Quality of Experience (QoE) [5][6]. To prevent mechanisms [10][11]. The work in [10] uses a DASH proxy
this effect, in this paper, we consider an architectural model to provide scalable video services, whereas [11] optimizes
under which a Base Station (BS) (e-NodeB in the LTE system) network resource allocation by accounting for contents and
i) acts as a proxy between the client (User Equipment-UE) channel characteristics and for client playout buffer levels.
and the Video Server, and ii) optimally allocates the users As for scheduling of video packets in new generation
bandwidth, so as to maintain the users selected average video cellular systems, in [12] the authors clarify key interactions
rate and the playout buffer status. between the application layer and the lower layers in 4G and
The basic system assumptions are: 5G networks and propose a resource allocator that optimizes
A1 the BS either partially pre-caches the video data in the the user video quality while limiting application layer quality
local servers proxy or it loads side information on the switches. The paper in [9] addresses progressive streaming via
average rates and the chunks sizes of the video streams; OFDMA systems, and takes into account the QoE of the end
A2 the BS estimates the average spectral efficiency of the users, as expressed by the data stored in the playout buffer, in
user on a time period of a chunk duration (e.g., 2 the user priority at the MAC scheduler. The procedure relies
seconds). This corresponds to several LTE Transmission on the accurate knowledge of the current player buffer status,
Time Intervals (TTI) where periodical channel reports which needs for periodical updating from the client to properly
are collected from the UEs, as defined in LTE [7]; account for encoding rate fluctuations and initial buffering and
A3 the BS optimally allocates the available bandwidth BWs rebuffering events. Still, the paper confirms that scheduling
[Hz], to the different UEs; the allocation acts as a can be optimized to reduce pauses at the UE. By sharing
constraint by the actual scheduling algorithms operating these principles (proxy based and QoE awareness), we herein
at the MAC layer. adopt the chunk delay as QoE parameter to be included in

1089-7798 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LCOMM.2016.2628378, IEEE
Communications Letters
2

0.5 1 To solve the D-MAD allocation problem we introduce


av. delay S-SUA
Lagrange multipliers and construct the following functional:
Average delay (s)

0.4 av. delay D-BWA 0.8


av. delay D-MAD
0.3 0.6 N 1 N 1

PTD
X    X
(i) (i) (i) .
0.2 0.4
= max Bk /B k 1 , 0 + B k (3)
PTDS-SUA
i=0 i=0
PTDD-BWA
0.1 0.2
PTDD-MAD
Differentiating w.r.t. B (i) , i = 0, . . . N 1, we obtain
0 0 k
2 4 6 8 10 4 6 8 10 12 (i) (i) (i) )2 + = 0 standing when
User index Number of users / B k = Bk /(B k q
(a) Average delay (b) PTD B (i) < B (i) for all i. Substituting B (i) = B (i) / into
PkN 1 (i)k k k
(i) =
B
i=0 q k BW sq we compute , so obtaining: B k
Fig. 1. (a) Average delay vs user index in case of c {7, 8, 9}; (b) PTD vs (i) PN 1 (i)
number of users for c = 7. BWs Bk / i=0 Bk . As the total bandwidth increases,
using the Kuhn-Tucker conditions leads to: / B (i) =
k

the resource allocation. The proposed approach operates on a 0 if B (i) < B (i) and < 0 if B (i) B (i) . The solution
k k k k
B (i)
chunk by chunk basis and leverages the bandwidth efficiency k
to the Kuhn-Tucker equations is then found as follows.
that can be achieved with an optimal dynamic allocation [13].
The minimal overall delay result is obtained by solving the
following bandwidth allocation problem:
III. R ESOURCE A LLOCATION S CHEME N 1 N 1 n  o
X X (i) (i)
i = max Bk /B k 1 , 0

We consider a system of N UEs served by the BS controller.



i=0 i=0
Each user selects video contents at quality q (i) associated q (4)
(i)
with a video stream whose net average rate is equal to R(q)
(i) (i)
Bk

p
[bps]. User i experiences channel quality levels c(i) , with i =

B = min Bk , P BWs
k
q
N 1 (i)

B
0, N 1. At a given time k, a user requests a chunk of video i=0 k
(i)
that contains k , i = 0, N 1 bytes. The chunk size is PN 1 (i)
where is such that i=0 B 1
k = BWs .
inherently random and it depends on the video content and on The main novelties with respect to other cross-layer alloca-
the actual encoder settings. The bandwidth requested by each tion procedures in the literature are as follows. Differently
(i) (i) (i)
user is then computed as Bk = k /((ck ) ) [Hz], where from progressive streaming approaches (e.g.,[9]), D-MAD
(i)
(ck ) represents the average spectral efficiency attained by performs an allocation jointly for all users demanding for the
(i)
user i under its reported channel quality level ck and , with bandwidth at a given time interval (chunk interval) by con-
< 1, accounts for protocol overhead. sidering their dynamic bandwidth demands, B (i) constrained
k
We propose a joint optimal allocation scheme, referred to as by their QoE target, i , and by the system bandwidth BWs .
the Dynamic Minimum Average Delay (D-MAD). According Our solution then optimizes the use of the overall bandwidth
to D-MAD, the allocated bandwidth, denoted by B (i) , is in a given chunk interval and at the same time assures a fair
k
selected dynamically chunk by chunk and jointly for all users. allocation by also considering QoE constraints. Furthermore,
The D-MAD scheduler targets HTTP streaming services with respect to [12], our solution presents three important
with application-layer paced chunk download every sec- differences: i) it does not interfere with the rate selection
onds. With respect to a progressive streaming as in [9], the which is left to the client server architecture, according to the
cumulative receiver buffer delay does not increase unless the DASH standard; ii) it explicitly accounts for the introduced
actual chunk experiences a delay larger than seconds. This chunk delays, which are a key factor affecting the QoE in
nonlinear behavior of delay accumulation is exploited by D- HAS streaming; iii) it guarantees that the assigned bandwidth
MAD, that does not target maximal video download rates but reaches the maximum available value, whereas in [12] the
chunk delays as close to as possible. In case of bandwidth bandwidth allocation is actually realized by the so called
requests overcoming the available bandwidth, D-MAD assigns enforcer stage, which does not exploit resources unused by
resources in an optimal way, so that the average of users a user to improve the rate of other users.
chunk delays is minimum. With this model the chunk delay
i can be expressed as follows: IV. N UMERICAL RESULTS
We first compare D-MAD with an allocation proce-
  
(i) (i)
i = max B k /B k 1 , 0 . (1)
dure named Dynamic, Bandwidth Weighted Allocation (D-
Let us denote by BWs [MHz] the level of available cell BWA); in this case, the allocated bandwidth B (i) is se-
k
bandwidth. Under D-MAD, the joint allocation process is lected dynamically chunk by chunk by assigning B (i) =
P 1 k
performed to minimize the following objective function: Bk
(i) (i)
BWs . D-BWA leads to equalization of
i=0 Bk
N
X 1 
(i) (i)
 NX1 
(i) (i)
  the delays perceived by the different users, i.e. the delays
i Bk , B k = max Bk /B k 1 , 0 (2)
i=0 i=0 1 The above formulas are implemented recursively until the condition
PN 1 (i) 2
i=0 Bk = BWs is met; D-MAD complexity can be shown to be O(N ),
while satisfying
P (i) BWs .
B but depending on the user bandwidth requests it boils down to O(N ).
i=0 k

1089-7798 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LCOMM.2016.2628378, IEEE
Communications Letters
3

35 35 35

500 30 500 30 500 30

25 25 25
1000 1000 1000
20 20 20
2

2
1500 1500 1500
15 15 15
D-MAD D-MAD D-MAD
2000 QD-MAD 10 2000 QD-MAD 10 2000 QD-MAD 10
D-BWA D-BWA D-BWA
1 +2 =BWs PF 1 +2 =BWs PF 1 +2 =BWs PF
5 5 5
2500 QoE-AW 2500 QoE-AW 2500 QoE-AW

500 1000 1500 2000 2500 500 1000 1500 2000 2500 500 1000 1500 2000 2500
1 1 1

(a) B1 = B2 = 1308, (b) B1 = 803, B2 = 3209 (c) B1 = 355, B2 = 5137

Fig. 2. Average delay per chunk (1 , 2 ; B1 , B2 ) (color bar expressed in seconds) versus allocated bandwidth (1 , 2 ) (kHz) for different bandwidth
requests of two users.

computed as in Eq.(1) are either 0 or equal for all users, for a aware (QoE-AW) scheduling in [9]. The former is a classical
given chunk index k. For reference sake, we also consider the layered scheme while the latter addresses progressive stream-
results achieved by statically and fairly dividing the bandwidth ing of video flows without considering the chunk timing and
among the users by setting B (i) = BWs /N ; we refer to this requires knowledge of the client playout buffer level2 . D-MAD
k
simple case as Static Single User Allocation (S-SUA) scheme. operates at a chunk temporal scale (s) while the two schedulers
We used H.265/HEVC 10.1 encoded videos with a spatial operate at the TTI scale (ms). For fair comparison, we con-
resolution of 1920x1080 [14][15]. Traces correspond to five sider the scheduling over one chunk, i.e. over NT T I = 2000
movies (Harry Potter,HP, Finding Neverland,FN, Lake House, TTIs. Firstly, we consider a toy case of N = 2 users, requiring
Speed, Blue Planet). They are K = 1200 chunk long. We each the HP video at encoding rate of 1351 kbps; we analyze
considered a video quality q at quantization level 25, resulting three reference spectral efficiency pairs, summarized by (a)
in video bit rates of 1.35, 1.34, 1.17, 1.20, 1.08 M bps each, (c(1) ) = (c(2) ) = 1.033, (b) (c(1) ) = 1.683, (c(2) ) = 0.421,
respectively. The video chunk duration is constant and equal (c) (c(1) )) = 3.883, (c(2) ) = 0.263. We implemented PF and
to = 2s; conversely chunk sizes in bits are variable and QoE-AW under the hypothesis that the video users share a
approximately Gamma distributed [16], with mean value equal fraction of the system bandwidth BWs = 0.6 5 M Hz, that
to 337, 334, 292, 299, 271 KByte, respectively. We select = corresponds to N RB = 15 RBs. Both the schedulers operate
0.64 to model the cascade of a net throughput of about 80% at the TTI/RB time/frequency scale, and we implemented a
both at the LTE bearer and at the TCP layer [16]. We consider Quantized version of D-MAD (QD-MAD) allocating to each
a LTE BS operating over a BWs = BLT E = 20M Hz, stream- user the integer number of RBs closest to the D-MAD solution.
ing video towards N users. Each user streams a randomly We plot in Fig.2(a)-2(c) P2 the average delay
selected video, randomly cyclically shifted. We use CQIs c in (1 , 2 ; B1 , B2 ) = i=1 max {(Bi /i 1) , 0}
7 9 and the associated net spectral efficiencies in [13]. experienced by 2 users that attain from the scheduler the
With these positions, we compare D-MAD, D-BWA and allocated bandwidths 1 , 2 when they request bandwidths
S-SUA by measuring the average delay per chunk and the B1 , B2 , for the three cases (a) B1 = B2 = 1308 kHz, (b)
Probability of Timely Delivery (PTD) of chunk packets for B1 = 803, B2 = 3209 kHz, (c) B1 = 355, B2 = 5137 kHz,
each user, computed as the fraction of chunks encountering respectively. The allocated bandwidth constraint is
delay events during the video streaming process. In Figure 1 + 2 = BWs , represented by the dashed slanted
1, we plot the aforementioned two metrics. The left figure straight-lines for different values of BWs . The bandwidth
represents the average delay versus the user index when the pairs allocated by the schedulers lie on the line
users have a spectral efficiency (c) with c randomly selected 1 + 2 = 15 180 = 2700 kHz. Points below this
in c {7, 8, 9}. Each point is obtained by averaging the line are not reachable; available bandwidth reduction due
numerical results over 50 Monte Carlo runs. We recognize to concurrent services just corresponds to different parallel
that D-MAD outperforms the other schemes in terms of the lines in the 1 , 2 plane. We observe that QD-MAD is
attained average delay per chunk. Since each chunk carries always almost perfectly overlapped to D-MAD, achieving
few seconds of video even reducing the delay of just 0.2 the minimum average delay (that is represented in the figure
s per chunk significantly reduces the depletion rate of the by a deep blue color on the plane); the PF, by allocating
client buffer, so increasing time of uninterrupted playout the bandwidth resources fairly to the users, like the S-SUA
and improving the QoE. Secondly, we observe that D-MAD 2 The QoE-AW scheduler is implemented by computing at each TTI the
outperforms D-BWA and S-SUA also in terms of PTD. As RB priorities as in Eq. (2) of [9], adopting the therein defined parameters
expected, under D-BWA, each user observes the same delay apart for the target buffer that is T B = 40s. This condition has been chosen
value. The S-SUA exhibits much lower performance. to observe buffer depletion without encountering rebuffering events, so as
to evaluate the scheduler performances independently from different client-
We then compare D-MAD with two state-of-the art LTE originated rebuffering policies. As for the PF, the Eq. (2) of [9] is modified
schedulers, namely Proportional Fair (PF) [8] and the QoE- by setting the buffer level weight to 1 for all users.

1089-7798 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/LCOMM.2016.2628378, IEEE
Communications Letters
4

0.1 sum up to 0.0063, 0.0938, and 0.0937 for D-MAD, PF and


D-MAD,CQI in [10,12]
PF,CQI in [10,12] QoE-AW, respectively.

Average per-chunk delay


0.08 QoE-AW,CQI in [10,12]
D-MAD,CQI in [9,13]
PF,CQI in [9,13]
0.06 QoE-AW,CQI in [9,13] V. C ONCLUSIONS
D-MAD,CQI in [8,14]
PF,CQI in [8,14]
0.04 QoE-AW,CQI in [8,14] We discussed a Dynamic Minimum Average Delay band-
D-MAD,CQI in [7,15]
PF,CQI in [7,15] width allocation scheme for HAS streaming over LTE. The
QoE-AW,CQI in [7,15]
0.02
D-MAD periodically operates at a temporal scale of the
0
video packets duration (seconds), and accounts for the users
16 17 18 19 20 21 22
N fluctuating channel qualities to guarantee the targeted video
QoE level while minimizing the streams playout delay. The
Fig. 3. Average per-chunk delay
PN 1
i vs number of users N . proposed architectural model requires changes only at the
0
BS side, while leaving both server and clients unaltered. In
reducing the video chunk transfer delay, the D-MAD reduces
approach, presents degrading performance as the users the user buffer depletion rate and the rate at which the
requests become unbalanced; the QoE-AW scheduler, being application layer is required to dynamically adapt the quality
dynamically adapted to the client buffer status, improves of the video stream, thus improving the QoE in terms of video
the performance with respect to PF, still it not reaches the fluidity and video quality smoothness.
optimal performance of D-MAD and QD-MAD, although
using a client side additional information; the D-BWA R EFERENCES
approach definitely over-rates the bandwidth requests from [1] H. Nam, K. H. Kim, B. H. Kim, D. Calin, and H. Schulzrinne, Towards
dynamic QoS-aware over-the-top video streaming, in IEEE WoWMoM
user with low channel quality. Furthermore, we consider an 2014, June 2014, pp. 19.
increasing number of users and dynamic channel conditions. [2] S. Colonnese, P. Frossard, S. Rinauro, L. Rossi, and G. Scarano, Joint
Specifically, we consider CQI uniformly randomly varying source and sending rate modeling in adaptive video streaming, Signal
Processing: Image Communication, vol. 28, no. 5, pp. 403 416, 2013.
both in time and frequency, as CQI 1, 2, 3, 4 around [3] S. Colonnese, F. Cuomo, T. Melodia, and R. Guida, Cloud-assisted
CQI = 11. This results in an average in 2.3 2.4. PF and Buffer Management for HTTP-based Mobile Video Streaming, in ACM
QoE-AW have full knowledge of CQI both in time (every PE-WASUN 13, 2013, pp. 18.
[4] P. Juluri, V. Tamarapalli, and D. Medhi, SARA: Segment aware rate
TTI) and frequency, while D-MAD relies only on the average adaptation algorithm for dynamic adaptive streaming over HTTP, in
spectral
PN 1 efficiency. We compare the average per chunk delay IEEE ICCW, June 2015, pp. 17651770.
i /N for D-MAD, PF and QoE-AW (Fig.3). For any [5] M. Seufert, S. Egger, M. Slanina, T. Zinner, T. Hossfeld, and P. Tran-
0
Gia, A Survey on Quality of Experience of HTTP Adaptive Streaming,
number of users, the average delay is minimized by the MAD Communications Surveys Tutorials, IEEE, vol. PP, no. 99, pp. 11, 2014.
approach; besides, the QoE aware scheduler outperforms the [6] C. Mueller, S. Lederer, and C. Timmerer, A proxy effect analyis and
PF scheduler. In fact, the PF scheduler is fair but it does fair adatpation algorithm for multiple competing Dynamic Adaptive
Streaming over HTTP clients, in IEEE VCIP, 2012.
not account any QoE parameter; the QoE-AW scheduler, [7] LTE; Evolved Universal Terrestrial Radio Access (E-UTRA); Physical
suited to HAS streaming and progressive download, accounts layer procedures (3GPP TS 36.213 version 8.8.0 Release 8), 2009.
for QoE by introducing heuristic weight of the client buffer [8] F. Capozzi, G. Piro, L. Grieco, G. Boggia, and P. Camarda, Downlink
Packet Scheduling in LTE Cellular Networks: Key Design Issues and a
occupancy; the MAD optimally allocates the bandwidth Survey, IEEE Comm. SurvTutorials, vol. 15, no. 2, pp. 678700, 2013.
exploiting the chunk-based HAS protocol structure. [9] J. Navarro-Ortiz, P. Ameigeiras, J. M. Lopez-Soler, J. Lorca-Hernando,
Finally, we have conducted some experiments considering Q. Perez-Tarrero, and R. Garcia-Perez, A QoE-Aware Scheduler for
HTTP Progressive Video in OFDMA Systems, IEEE Communications
10 users, divided in 2 groups at CQI randomly varying within Letters, vol. 17, no. 4, pp. 677680, April 2013.
4 6 (LQ users) and 11 13 (HQ users), streaming the [10] M. Zhao, X. Gong, J. Liang, W. Wang, X. Que, and S. Cheng,
video sequences HP and FN encoded at LQ average rates QoE-Driven Cross-Layer Optimization for Wireless Dynamic Adaptive
Streaming of Scalable Videos Over HTTP, Circuits and Syst. for Video
of 320, 247 kbps and HQ rates 609, 480 kbps, respectively. Technology, IEEE Trans. on, vol. 25, no. 3, pp. 451465, March 2015.
We implemented a D-MAD compliant Frequency Domain [11] A. El Essaili, D. Schroeder, E. Steinbach, D. Staehle, and M. Shehada,
Packet Scheduling (FDPS) algorithm, computed by resorting QoE-Based Traffic and Resource Management for Adaptive HTTP
Video Delivery in LTE, Circuits and Syst. for Video Technology, IEEE
to the same computational scheme as the QoE-AW scheduler, Trans. on, vol. 25, no. 6, pp. 9881001, June 2015.
but applying the nonlinear weight targeting the occupied [12] J. Chen, R. Mahindra, M. A. Khojastepour, S. Rangarajan, and M. Chi-
bandwidth level3 . Numerical simulations have been carried out ang, A scheduling framework for adaptive video delivery over cellular
networks, in ACM MobiCom 13, 2013, pp. 389400.
over 1190 chunks; the results, averaged over 10 runs, show [13] I. Rubin, S. Colonnese, F. Cuomo, F. Calanca, and T. Melodia, Mobile
that the percentage of timely delivered chunks using D-MAD, HTTP-based streaming using flexible LTE base station control, in IEEE
namely P T DDM AD = 97.9%, improves wrt to average WoWMoM 2015, June 2015, pp. 19.
[14] P. Seeling and M. Reisslein, Video Transport Evaluation With H.264
P T DP F = P T DQoEAW = 89.15% achieved by state of Video Traces, IEEE Communications Surveys Tutorials, vol. 14, no. 4,
the art FDPS scheduling; the same behaviorPN 1is observed on pp. 11421165, Fourth 2012.
the measured average delay per chunk i /N , which [15] Video Trace Files and Statistics, 2014. [Online]. Available:
0 http://trace.eas.asu.edu/videotraces2/h265/
[16] S. Colonnese, S. Russo, F. Cuomo, T. Melodia, and I. Rubin, Timely
3 The D-MAD compliant FDPS scheduler computes the RB priorities as Delivery Versus Bandwidth Allocation for DASH-Based Video Stream-
in Eq. (2) of [9], applying the nonlinearity f () to the actually allocated ing Over LTE, IEEE Communications Letters, vol. 20, no. 3, pp. 586
bandwidth with of f set = 1, priorityAtT arget = 20, a = 0.5, and T B 589, March 2016.
equal to the target D-MAD bandwidth.

1089-7798 (c) 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

You might also like