[go: up one dir, main page]

CN103313054A - Transferring and scheduling method of scalable video coding (SVC) video - Google Patents

Transferring and scheduling method of scalable video coding (SVC) video Download PDF

Info

Publication number
CN103313054A
CN103313054A CN201310191314XA CN201310191314A CN103313054A CN 103313054 A CN103313054 A CN 103313054A CN 201310191314X A CN201310191314X A CN 201310191314XA CN 201310191314 A CN201310191314 A CN 201310191314A CN 103313054 A CN103313054 A CN 103313054A
Authority
CN
China
Prior art keywords
nalu
video
quality
time
level
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310191314XA
Other languages
Chinese (zh)
Other versions
CN103313054B (en
Inventor
要瑞宵
刘延伟
赵平华
慈松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Acoustics CAS
Original Assignee
Institute of Acoustics CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Acoustics CAS filed Critical Institute of Acoustics CAS
Priority to CN201310191314.XA priority Critical patent/CN103313054B/en
Publication of CN103313054A publication Critical patent/CN103313054A/en
Application granted granted Critical
Publication of CN103313054B publication Critical patent/CN103313054B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

本发明涉及一种SVC视频的传输调度方法,其利用SVC的可伸缩特性对时变信道的适应性,通过协调信源,信道以及接收端的综合影响主观体验的因素,优化SVC视频在认知无线电这一信道时变环境中的传输调度,从而优化接收端视频的主观质量。本发明通过针对CR网络的特性,在SVC视频发送调度中考虑接收端缓冲区的状态,在资源受限的情况下优先发送那些对提高视频主观体验贡献较高的数据,降低了接收端缓冲区下溢概率;在接收端通过自适应视频播放机制降低了接收端缓冲区下溢概率,同时由于考虑了视频的内容特征,用户几乎察觉不到播放速率的调整。本发明通过在发送端和接收端对影响视频主观体验的因素的综合考虑和协调,使得SVC视频的主观体验得以优化。

Figure 201310191314

The invention relates to a transmission scheduling method of SVC video, which utilizes the adaptability of SVC's scalability to time-varying channels, and optimizes SVC video in cognitive radio by coordinating factors that comprehensively affect subjective experience of information sources, channels and receiving ends The scheduling of transmissions in this time-varying environment of the channel optimizes the subjective quality of the video at the receiving end. According to the characteristics of the CR network, the present invention considers the status of the receiving end buffer in the SVC video transmission scheduling, and in the case of limited resources, sends those data that contribute more to improving the subjective video experience first, reducing the receiving end buffer Probability of underflow: at the receiving end, the buffer underflow probability at the receiving end is reduced through an adaptive video playback mechanism. At the same time, due to the consideration of the content characteristics of the video, the user hardly notices the adjustment of the playback rate. The present invention optimizes the subjective experience of the SVC video by comprehensively considering and coordinating factors affecting the subjective experience of the video at the sending end and the receiving end.

Figure 201310191314

Description

可伸缩视频编码SVC视频的传输调度方法Transmission Scheduling Method for Scalable Video Coding SVC Video

技术领域technical field

本发明涉及视频编码与传输领域,尤其涉及一种SVC视频在认知无线电环境下的传输调度方法。The invention relates to the field of video coding and transmission, in particular to a transmission scheduling method of SVC video in a cognitive radio environment.

背景技术Background technique

随着无线技术在移动通信、公共安全、广播电视等方面的广泛应用,现代社会对无线电频谱资源的依赖程度越来越高。目前,对无线频谱资源的需求越来越高,而其利用率却非常低,兼之其分配也极不平衡。具体来说,目前的频谱分配政策将大部分的频谱资源分配给了授权频段应用(如电视、广播),相应地,非授权频段的频谱资源则要少的多。另外,许多授权频段上的频谱资源利用率非常低,而大部分的新兴无线应用却工作在非授权频段上,因此使得非授权频段上的频谱占用显得拥挤不堪。无线频谱是一种不可再生资源,如果想要提高频谱资源的利用率,只能通过对频谱资源的再利用这一途径。于是,认知无线电技术(Cognitive Radio,CR)这种环境下应运而生。With the wide application of wireless technology in mobile communications, public safety, broadcasting and television, etc., modern society relies more and more on radio spectrum resources. At present, the demand for wireless spectrum resources is getting higher and higher, but its utilization rate is very low, and its allocation is also extremely unbalanced. Specifically, the current spectrum allocation policy allocates most of the spectrum resources to licensed band applications (such as television and broadcasting), and correspondingly, the spectrum resources in unlicensed bands are much less. In addition, the utilization rate of spectrum resources in many licensed frequency bands is very low, while most emerging wireless applications work in unlicensed frequency bands, thus making the spectrum occupancy in unlicensed frequency bands extremely crowded. Wireless spectrum is a non-renewable resource. If you want to improve the utilization rate of spectrum resources, you can only reuse spectrum resources. Therefore, cognitive radio technology (Cognitive Radio, CR) came into being in this environment.

CR是一种通过频谱感知和系统智能学习来实现频谱动态分配和频谱共享的技术。通过该技术,可以在不影响授权频段正常通信的基础上发现其中的空闲频谱并对其加以合理利用。CR在一定程度上改善了目前因频谱的固定分配政策而导致的频谱资源利用率低下的情况。CR is a technology that realizes spectrum dynamic allocation and spectrum sharing through spectrum sensing and system intelligent learning. Through this technology, the vacant spectrum in the licensed frequency band can be found and rationally used without affecting the normal communication in the licensed frequency band. To a certain extent, CR has improved the current low utilization rate of spectrum resources caused by the fixed spectrum allocation policy.

CR网络中存在主用户和次级用户之分,次级用户检测多个信道的状态,并且只有在出现主用户空闲时才能接入信道并传输数据,这种“机会主义”式的接入方式使得次级用户的可用带宽变化很大。这种带宽的不稳定性会使CR信道上的应用出现丢包率较高、延时、抖动等情况,从而使CR网络中的次级用户的服务质量(Quality of Service,QoS)就很难得到保证。这类问题在对延时、抖动比较敏感的视频业务中显得尤为突出。There are primary users and secondary users in the CR network. The secondary user detects the status of multiple channels, and can only access the channel and transmit data when the primary user is idle. This "opportunistic" access method The available bandwidth of the secondary users varies greatly. The instability of this bandwidth will cause the application on the CR channel to have high packet loss rate, delay, jitter, etc., so that the quality of service (Quality of Service, QoS) of the secondary users in the CR network is difficult. Guaranteed. Such problems are particularly prominent in video services that are sensitive to delay and jitter.

可伸缩视频编码(Scalable Video Coding,SVC)视频与其他的不可伸缩视频编码(Non Scalable Video Coding)视频相比,更能适应这种状况极其不稳定的CR网络。采用分层编码结构的SVC在时间、空间和质量三个维度上都具有可伸缩性,它可以将一个视频源编码成一个包含多个子集的SVC视频流,其中每一个子集都是源视频的一个重构,各子集只是在时间、空间或质量的分辨率上有差别而已,这种特性使得SVC在网络状况较差情况下具有突出的抗丢包能力。Compared with other non-scalable video coding (Non Scalable Video Coding) videos, scalable video coding (Scalable Video Coding, SVC) video is more adaptable to the extremely unstable CR network. SVC with a layered coding structure is scalable in three dimensions of time, space, and quality, and it can encode a video source into an SVC video stream containing multiple subsets, each of which is the source video A reconstruction of SVC, each subset is only different in the resolution of time, space or quality. This characteristic makes SVC have outstanding anti-packet loss ability under poor network conditions.

就无线环境中的传输调度方法而言,当前的一些SVC视频多采用SVC视频源在发送端主动适配信道状况或者根据视频内容主动分配相应的可用信道资源的方法。然而,这些方法都只是在发送端考虑视频源与信道的适配,而没有从全局的角度考虑视频主观体验的影响因素。视频主观体验受多方面因素影响,其中包括接收端的缓冲区状态和视频播放策略等因素。As far as the transmission scheduling method in the wireless environment is concerned, some current SVC videos mostly use the method that the SVC video source actively adapts to the channel conditions at the sending end or actively allocates corresponding available channel resources according to the video content. However, these methods only consider the adaptation of the video source and the channel at the sending end, but do not consider the influencing factors of the subjective video experience from a global perspective. The subjective experience of video is affected by many factors, including buffer status at the receiving end and video playback strategy.

发明内容Contents of the invention

本发明的目的是提供一种能够克服上述缺陷的SVC视频在认知无线电环境下的传输调度方法,其利用SVC的可伸缩特性对时变信道的适应性,通过协调信源,信道以及接收端的综合影响主观体验的因素,优化SVC视频在认知无线电这一信道时变环境中的传输调度,从而优化接收端视频的主观质量。The purpose of the present invention is to provide a transmission scheduling method of SVC video in a cognitive radio environment that can overcome the above-mentioned defects. It utilizes the adaptability of the scalable characteristics of SVC to time-varying channels, and coordinates the information source, channel and receiving end. The factors that affect the subjective experience are integrated to optimize the transmission scheduling of SVC video in the time-varying environment of the cognitive radio channel, so as to optimize the subjective quality of the video at the receiving end.

在本发明的第一方面,提供了一种可伸缩视频编码SVC视频的传输调度方法,所述SVC视频由多个网络抽象层单元NALU组成,每个NALU在质量维度和时间维度上具有唯一的质量层级和唯一的时间层级,所述质量层级分为一个质量基本层和多个质量增强层,所述时间层级分为一个时间基本层和多个时间增强层,所述方法包括:根据SVC视频的接收端缓冲区的状态确定传输调度窗口的长度,所述传输调度窗口用于调度待传输的NALU;从所述SVC视频中选择第一部分NALU置于所述传输调度窗口中,所述第一部分NALU的质量层级为质量基本层并且其时间层级小于或等于第一时间层级阈值;如果所述第一部分NALU的数据量未达到当前可用的认知无线电信道带宽,则从所述SVC视频中选择第二部分NALU置于所述传输调度窗口中,所述第二部分NALU的质量层级等于质量基本层并且其时间层级小于或等于第二时间层级阈值;如果所述第一部分NALU和所述第二部分NALU的数据量未达到当前可用的认知无线电信道带宽,则从所述SVC视频中选择第三部分NALU置于所述传输调度窗口中,所述第三部分NALU的质量层级等于质量基本层,其中,所述第一部分NALU、所述第二部分NALU和所述第三部分NALU在时间维度上的跨度不超过所述传输调度窗口的长度;通过当前可用的认知无线电信道将所述传输调度窗口中的NALU传输给SVC视频的接收端。In a first aspect of the present invention, a transmission scheduling method of Scalable Video Coding SVC video is provided. The SVC video is composed of a plurality of network abstraction layer units NALU, and each NALU has a unique quality dimension and time dimension. Quality level and unique time level, the quality level is divided into a quality base layer and a plurality of quality enhancement layers, the time level is divided into a time base layer and a plurality of time enhancement layers, the method includes: according to the SVC video The state of the buffer at the receiving end determines the length of the transmission scheduling window, and the transmission scheduling window is used to schedule the NALUs to be transmitted; select the first part of the NALUs from the SVC video and place them in the transmission scheduling window, and the first part The quality level of the NALU is the quality basic level and its time level is less than or equal to the first time level threshold; if the data volume of the first part of the NALU does not reach the currently available cognitive radio channel bandwidth, select the first time level from the SVC video Two parts of NALU are placed in the transmission scheduling window, the quality level of the second part of NALU is equal to the quality basic layer and its time level is less than or equal to the second time level threshold; if the first part of NALU and the second part The amount of data of the NALU does not reach the currently available cognitive radio channel bandwidth, then select a third part of the NALU from the SVC video and place it in the transmission scheduling window, the quality level of the third part of the NALU is equal to the quality basic layer, Wherein, the span of the first part of NALU, the second part of NALU and the third part of NALU in the time dimension does not exceed the length of the transmission scheduling window; the transmission is scheduled through the currently available cognitive radio channel The NALU in the window is transmitted to the receiver of the SVC video.

在本发明的第二方面,提供了一种可伸缩视频编码SVC视频的传输调度控制器,所述SVC视频由多个网络抽象层单元NALU组成,每个NALU在质量维度和时间维度上具有唯一的质量层级和唯一的时间层级,所述质量层级分为一个质量基本层和多个质量增强层,所述时间层级分为一个时间基本层和多个时间增强层,所述控制器被配置用于:根据SVC视频的接收端缓冲区的状态确定传输调度窗口的长度,所述传输调度窗口用于调度待传输的NALU;从所述SVC视频中选择第一部分NALU置于所述传输调度窗口中,所述第一部分NALU的质量层级为质量基本层并且其时间层级小于或等于第一时间层级阈值;如果所述第一部分NALU的数据量未达到当前可用的认知无线电信道带宽,则从所述SVC视频中选择第二部分NALU置于所述传输调度窗口中,所述第二部分NALU的质量层级等于质量基本层并且其时间层级小于或等于第二时间层级阈值;如果所述第一部分NALU和所述第二部分NALU的数据量未达到当前可用的认知无线电信道带宽,则从所述SVC视频中选择第三部分NALU置于所述传输调度窗口中,所述第三部分NALU的质量层级等于质量基本层,其中,所述第一部分NALU、所述第二部分NALU和所述第三部分NALU在时间维度上的跨度不超过所述传输调度窗口的长度;通过当前可用的认知无线电信道将所述传输调度窗口中的NALU传输给SVC视频的接收端。In a second aspect of the present invention, a transmission scheduling controller for Scalable Video Coding SVC video is provided, the SVC video is composed of a plurality of network abstraction layer units NALU, and each NALU has a unique quality dimension and time dimension. The quality level and the unique time level, the quality level is divided into a quality basic layer and a plurality of quality enhancement layers, the time level is divided into a time basic layer and a plurality of time enhancement layers, the controller is configured to In: determining the length of the transmission scheduling window according to the state of the receiver buffer of the SVC video, the transmission scheduling window is used to schedule the NALUs to be transmitted; selecting the first part of the NALUs from the SVC video and placing them in the transmission scheduling window , the quality level of the first part of the NALU is the quality basic level and its time level is less than or equal to the first time level threshold; if the data volume of the first part of the NALU does not reach the currently available cognitive radio channel bandwidth, then from the Select the second part of NALUs in the SVC video to be placed in the transmission scheduling window, the quality level of the second part of the NALUs is equal to the quality basic layer and its time level is less than or equal to the second time level threshold; if the first part of the NALU and The data volume of the second part of NALU does not reach the currently available cognitive radio channel bandwidth, then select a third part of NALU from the SVC video and place it in the transmission scheduling window, and the quality level of the third part of NALU Equal to the quality base layer, wherein the span of the first partial NALU, the second partial NALU and the third partial NALU in the time dimension does not exceed the length of the transmission scheduling window; through the currently available cognitive radio channel Transmitting the NALUs in the transmission scheduling window to the receiving end of the SVC video.

在本发明的第三方面,提供了一种可伸缩视频编码SVC视频的自适应播放方法,包括:在播放缓冲区中的数据无法支撑按当前播放速率播放预定时间的情况下,确定帧率集合,所述帧率集合中的每一帧率不低于预定帧率并且与前一图像组播放帧率的差值的绝对值不超过预定阈值;计算当前图像组的运动强度和所述帧率集合中每一帧率的比值与前一图像组的运动强度及其播放帧率的比值之差的绝对值;在所述帧率集合中选择最小绝对值对应的帧率作为当前图像组的播放帧率。In a third aspect of the present invention, a method for adaptive playback of scalable video coding SVC video is provided, including: determining the set of frame rates when the data in the playback buffer cannot support playing for a predetermined time at the current playback rate , each frame rate in the frame rate set is not lower than a predetermined frame rate and the absolute value of the difference with the frame rate of the previous image group does not exceed a predetermined threshold; calculate the motion intensity of the current image group and the frame rate The absolute value of the difference between the ratio of each frame rate in the set and the ratio of the motion intensity of the previous image group and its playback frame rate; in the frame rate set, select the frame rate corresponding to the minimum absolute value as the playback of the current image group frame rate.

在本发明的第四方面,提供了一种可伸缩视频编码SVC视频的自适应播放器,所述播放器被配置用于:在播放缓冲区中的数据无法支撑按当前播放速率播放预定时间的情况下,确定帧率集合,所述帧率集合中的每一帧率不低于预定帧率并且与前一图像组播放帧率的差值的绝对值不超过预定阈值;计算当前图像组的运动强度和所述帧率集合中每一帧率的比值与前一图像组的运动强度及其播放帧率的比值之差的绝对值;在所述帧率集合中选择最小绝对值对应的帧率作为当前图像组的播放帧率。In a fourth aspect of the present invention, there is provided an adaptive player for Scalable Video Coding SVC video, the player is configured to: the data in the playback buffer cannot support playing the video for a predetermined time at the current playback rate In this case, determine the frame rate set, each frame rate in the frame rate set is not lower than the predetermined frame rate and the absolute value of the difference with the frame rate of the previous image group does not exceed the predetermined threshold; calculate the current image group The absolute value of the difference between the ratio of the motion intensity to each frame rate in the frame rate set and the ratio of the motion intensity of the previous image group to its playback frame rate; select the frame corresponding to the minimum absolute value in the frame rate set rate as the playback frame rate of the current image group.

针对CR网络特性,本发明将在SVC视频发送调度中考虑接收端缓冲区的状态,在资源受限的情况下优先发送那些对提高视频主观体验贡献较高的数据,这也降低了接收端缓冲区下溢概率;在接收端我们通过采用自适应视频播放机制降低了接收端缓冲区下溢概率,同时由于考虑了视频的内容特征,用户几乎察觉不到播放速率的调整。通过在发送端和接收端对影响视频主观体验的因素的综合考虑和协调,视频主观体验得以优化。Aiming at the characteristics of CR network, the present invention will consider the state of buffer at the receiving end in the scheduling of SVC video transmission, and give priority to sending those data that contribute more to improving the subjective experience of video in the case of limited resources, which also reduces the buffering at the receiving end Area underflow probability; at the receiving end, we reduce the buffer underflow probability at the receiving end by adopting an adaptive video playback mechanism. At the same time, due to the consideration of the content characteristics of the video, the user hardly notices the adjustment of the playback rate. Through the comprehensive consideration and coordination of the factors affecting the subjective experience of video at the sending end and receiving end, the subjective experience of video is optimized.

附图说明Description of drawings

图1是根据本发明实施例的SVC视频在认知无线电环境下的传输播放示意图;1 is a schematic diagram of transmission and playback of SVC video in a cognitive radio environment according to an embodiment of the present invention;

图2是传统的和根据本发明实施例的SVC传输调度方法的比较示意图。Fig. 2 is a schematic diagram of a comparison between traditional and SVC transmission scheduling methods according to an embodiment of the present invention.

具体实施方式Detailed ways

下面通过附图和实施例,对本发明的技术方案做进一步的详细描述。The technical solutions of the present invention will be described in further detail below with reference to the accompanying drawings and embodiments.

为了更透彻地理解本发明,下面先对接收端缓冲区、认知无线电环境和SVC视频的特性进行说明。In order to understand the present invention more thoroughly, the characteristics of the receiver buffer, the cognitive radio environment and the SVC video will be described below.

首先,介绍接收端的缓冲区的功能、状态以及与之相关的注意事项。First, the function, status and related precautions of the buffer at the receiving end are introduced.

在接收端,缓冲区的设置有助于平滑由于时变信道造成的包到达时间的抖动。对于缓冲区而言,应尽量避免缓冲区中积存的数据太多,以免造成较大的播放时延;同时更要避免出现缓冲区下溢的情况,否则将导致视频播放的中断。由于不断地接收来自于发送端的数据以及视频播放器从播放缓冲区中读取视频数据进行播放,接收端缓冲区中的数据量也随之变化。当缓冲的数据足够支撑视频播放器按正常播放速度(如30fps)连续播放一定时间(如1秒),可以称当前的接收端缓冲区是“充盈的”;当缓冲的数据无法支撑视频播放器以正常播放速度连续播放达到这一时间(如上述的1秒),则可以称当前的接收端缓冲区是“饥饿的”;当接收端缓冲区中没有数据时,这种情况被称之为缓冲区下溢。At the receiving end, the setting of the buffer helps to smooth out the jitter of packet arrival time caused by the time-varying channel. For the buffer, try to avoid too much data accumulated in the buffer, so as not to cause a large playback delay; at the same time, avoid buffer underflow, otherwise the video playback will be interrupted. Since the data from the sending end is continuously received and the video player reads video data from the playback buffer for playing, the amount of data in the buffer of the receiving end also changes accordingly. When the buffered data is enough to support the video player to play continuously for a certain period of time (such as 1 second) at the normal playback speed (such as 30fps), it can be said that the current receiving end buffer is "full"; when the buffered data cannot support the video player If the continuous playback at normal playback speed reaches this time (such as the above 1 second), it can be said that the current receiving end buffer is "hungry"; when there is no data in the receiving end buffer, this situation is called Buffer underflow.

接下来,介绍认知无线电的基本环境及其用户的关系。Next, the basic environment of cognitive radio and its user relationship are introduced.

在认知无线电环境中有主用户和次级用户之分。主用户有其各自的特定信道并且对其特定信道有接入优先权。只有当主用户处于空闲状态时,其特定信道才可以被次级用户以一种机会主义的方式接入并传输数据。当主用户重新夺回信道并传输数据的时候,其所能容忍的次级用户干扰的最大时间为Tts,Tts例如为50ms。因此,能够以Tts为单元为系统时间划分时隙,次级用户在每个时隙中都会对信道进行一次扫描,从而发现可用信道。In a cognitive radio environment, there are primary users and secondary users. Primary users have their own specific channels and have access priority to their specific channels. Only when the primary user is idle, its specific channel can be accessed and transmitted by the secondary user in an opportunistic manner. When the primary user regains the channel and transmits data, the maximum time that the primary user can tolerate interference from the secondary user is T ts , where T ts is, for example, 50 ms. Therefore, the system time can be divided into time slots with T ts as a unit, and the secondary user will scan the channel once in each time slot, so as to find available channels.

一个次级用户可能监视着认知无线电网络中的N个信道,例如N=10,并且在发现空闲信道时接入以传输数据。假设一个时间段跨越了Q个时隙,并且任意信道n(n=1,2,…,N)在第q(q=1,2,…,Q)个时隙中能为其用户提供的可用信道带宽为R(n,q)。那么,该时间段内一个次级用户所能获得的可用信道带宽Rc可以表示如公式(1)所示,其中R*(n,q)表示信道n在第q个时隙时对该次级用户可用带宽的贡献量。A secondary user may monitor N channels in the cognitive radio network, for example N=10, and access to transmit data when it finds a free channel. Suppose a time period spans Q slots, and any channel n (n=1, 2, ..., N) can provide its users in the qth (q = 1, 2, ..., Q) slot The available channel bandwidth is R(n,q). Then, the available channel bandwidth R c that a secondary user can obtain in this period of time can be expressed as shown in formula (1), where R * (n,q) represents the The contribution of the bandwidth available to level users.

R c = Σ q = 1 Q Σ n = 1 N R * ( n , q )      (1) R c = Σ q = 1 Q Σ no = 1 N R * ( no , q ) (1)

Figure BDA00003226567000062
Figure BDA00003226567000062

接下来,介绍SVC的特性及其传输调度原理。Next, the characteristics of SVC and its transmission scheduling principle are introduced.

SVC在时间(Temporal)、空间(Spatial)和质量(Fidelity)三个维度上都提供了可伸缩特性,SVC视频在每一个可伸缩维度上都有一个基本层和若干个增强层。优选地,可以在发送调度中只采用SVC在时间(Temporal)和质量(Fidelity)两个维度上的可伸缩特性。例如,SVC视频源在时间可伸缩维度上有4层(T0,T1,T2,T3,T0为基本层,其他3层为增强层),在质量可伸缩维度上有3层(F0,F1,F2,F0为基本层,其他2个为增强层)。SVC provides scalability in three dimensions of time (Temporal), space (Spatial) and quality (Fidelity). SVC video has a basic layer and several enhancement layers in each scalable dimension. Preferably, only the scalability characteristics of the SVC in the two dimensions of time (Temporal) and quality (Fidelity) can be used in sending scheduling. For example, an SVC video source has 4 layers in the time scalable dimension (T0, T1, T2, T3, T0 are base layers, and the other 3 layers are enhancement layers), and has 3 layers in the quality scalable dimension (F0, F1, F2, F0 are base layers, and the other 2 are enhancement layers).

编码后的SVC视频是以网络抽象层单元(Network Abstraction LayerUnit, NALU)的形式组织起来的,其中,时间分层标识符相同、空间分层标识符相同、质量分层标识符也相同的数据块被组织到一个NALU中。因此,对SVC视频的发送调度,其实就是对其中的NALU的发送调度。调度时间单元设为TDSU,它在数值上等于按正常帧率播放一个图像组(Group of Pictures,GOP)所需的时间。例如,设置一个GOP的大小(Size of GOP, SG)是8帧,其播放的正常帧率为30帧/秒,TDSU等于8/30秒。在以Tts=50ms划分时隙的情况下,TDSU所跨越的时隙数Q向下取整为5。无线电认知环境中的一个次级用户在一个时隙同时监视的信道总数为10。在上述条件下,将Q=5,N=10代入公式(1),即可计算出该TDSU中的一个次级用户所能获得的可用信道带宽。The encoded SVC video is organized in the form of a Network Abstraction Layer Unit (NALU), in which data blocks with the same temporal layer identifier, the same spatial layer identifier, and the same quality layer identifier are organized into a NALU. Therefore, the sending scheduling of the SVC video is actually the sending scheduling of the NALU in it. The scheduling time unit is set to T DSU , which is numerically equal to the time required to play a Group of Pictures (GOP) at a normal frame rate. For example, if the size of a GOP (Size of GOP, SG) is set to 8 frames, the normal frame rate of its playback is 30 frames per second, and T DSU is equal to 8/30 seconds. In the case of dividing time slots by T ts =50 ms, the number Q of time slots spanned by T DSU is rounded down to 5. A secondary user in a radio cognitive environment monitors a total of 10 channels simultaneously in one time slot. Under the above conditions, by substituting Q=5 and N=10 into formula (1), the available channel bandwidth that a secondary user in this T DSU can obtain can be calculated.

本发明提供了一种考虑用户主观体验的SVC视频在认知无线电环境中的传输调度方法。按照视频流的方向,该方法具体涉及:在发送端对SVC视频进行传输调度、传输调度后的视频经由认知无线电信道传输到接收端缓冲区,最后在自适应播放模块进行播放。本发明的改进主要是在满足视频播放的基本质量要求的前提下,避免缓冲区下溢的情形,降低视频质量突变的可能性。本发明的改进不考虑调整SVC视频的空间分辨率,只考虑其时间可伸缩性和空间可伸缩性,并且改进是从网络协议层的应用层为认知无线电信道进行建模,不具体考虑MAC层和物理层的操作。综上所述,根据本发明实施例的针对SVC视频的调度原则有三点,具体如下所述。The invention provides a method for scheduling transmission of SVC video in a cognitive radio environment considering user subjective experience. According to the direction of the video stream, the method specifically involves: scheduling the transmission of the SVC video at the sending end, transmitting the scheduled video to the buffer of the receiving end through the cognitive radio channel, and finally playing it in the adaptive playback module. The improvement of the present invention is mainly to avoid buffer underflow and reduce the possibility of video quality mutation on the premise of meeting the basic quality requirements of video playback. The improvement of the present invention does not consider adjusting the spatial resolution of the SVC video, but only considers its temporal scalability and spatial scalability, and the improvement is to model the cognitive radio channel from the application layer of the network protocol layer, without specifically considering the MAC layer and physical layer operations. To sum up, the scheduling principle for SVC video according to the embodiment of the present invention has three points, which are specifically described as follows.

1)视频播放满足基本的质量要求。1) The video playback meets the basic quality requirements.

在质量可伸缩维度上,质量基本层可以提供一个粗糙的,但又满足基本质量要求的视频重构。也就是说,质量可伸缩维度上的增强层可以被选择性传输而非必须被传输。因此,应该从质量基本层中的数据中选择传送数据来满足SVC视频的基本质量要求。为了满足视频播放的基本质量要求,可以根据视频场景的内容特征对质量基本层中数据的时间可伸缩维度进行选择。换言之,对特定的GOP选定对应的时间层级阈值,时间分层等级大于该时间层级阈值的数据将是被选择性发送的,而不是必须被发送的。In the dimension of quality scalability, the quality base layer can provide a rough video reconstruction that meets the basic quality requirements. That is, enhancement layers in the quality scalability dimension can be selectively transmitted but not necessarily transmitted. Therefore, the transmission data should be selected from the data in the quality base layer to meet the basic quality requirements of SVC video. In order to meet the basic quality requirements of video playback, the time scalable dimension of data in the quality basic layer can be selected according to the content characteristics of the video scene. In other words, a corresponding time level threshold is selected for a specific GOP, and data whose time level is greater than the time level threshold will be selectively sent, rather than necessarily sent.

因此,为了选定GOP的时间层级阈值,需要确定该GOP的内容特征。例如,可以用运动强度(Motion Intensity,MI)来衡量视频内容特征,运动强度MI的计算如公式(2)所示,其中NF表示当前GOP中的帧数目并且会随着GOP中帧的主动丢弃而变化,SPIC表示一帧中的像素数目,lume(f,p)表示当前GOP中第f帧的第p个像素的亮度值。例如,SVC视频的分辨率为352×288,可以得出SPIC=101376。Therefore, in order to select the time level threshold of a GOP, it is necessary to determine the content characteristics of the GOP. For example, Motion Intensity (MI) can be used to measure video content features, and the calculation of motion intensity MI is shown in formula (2), where NF represents the number of frames in the current GOP and will follow the active frame number in the GOP. Discarded and changed, S PIC represents the number of pixels in a frame, and lume(f,p) represents the brightness value of the p-th pixel of the f-th frame in the current GOP. For example, the resolution of SVC video is 352×288, which can give S PIC =101376.

MIMI == 11 NN Ff ΣΣ ff == 11 NN Ff ΣΣ pp == 11 SS PICPIC || lumelume (( ff ,, pp )) -- lumelume (( ff -- 11 ,, pp )) || -- -- -- (( 22 ))

MI较大处的视频场景运动剧烈,MI较小处的视频场景运动舒缓。如果主动丢掉一些运动剧烈处的视频帧,则接收到的视频内容的不连续特性很容易在播放中被察觉到;而如果主动丢掉一些运动舒缓处的视频帧,则造成的视频内容的不连续性就不会特别明显,甚至用户毫无察觉。基于此,可以在必要的时候有选择性地丢弃一些视频帧。对视频帧的丢弃是通过SVC的时间可伸缩性实现的。换言之,通过丢弃那些时间层级在时间层级阈值之上的视频帧实现的,而对这类视频帧的丢弃不会对接收到的视频造成不可接受的质量下降。The motion of the video scene where the MI is larger is severe, while the motion of the video scene where the MI is smaller is slow. If some video frames with intense motion are actively discarded, the discontinuity of the received video content is easy to be noticed during playback; and if some video frames with slow motion are actively discarded, the discontinuity of the video content will be caused. Sex is not particularly obvious, and even the user is not aware of it. Based on this, some video frames can be selectively discarded when necessary. Dropping of video frames is achieved through the temporal scalability of SVC. In other words, by discarding video frames whose temporal levels are above a temporal level threshold without causing unacceptable degradation of the quality of the received video.

一个视频序列中的第i(i=1,2,…,I)个GOP的时间层级阈值TLt-i的计算如公式(3)所示,其中,MIi(TL)是TL被选为第i个GOP的时间层级阈值时该GOP的运动强度,其可根据公式(2)进行计算。MIU是一个序列依赖的运动强度阈值,它可以通过提前测试进行设定。一旦因为主动丢弃视频帧造成的GOP运动强度大于MIU时,接收端的视频连续性就很容易被用户察觉到,从而造成视频主观体验的下降。另外,TLH是视频中最高的时间层级,TLt-i不应该超过TLHThe calculation of the time-level threshold TL ti of the i-th (i=1, 2, ..., I) GOP in a video sequence is shown in formula (3), where MI i (TL) is TL selected as the i-th The time level threshold of a GOP is the motion intensity of the GOP, which can be calculated according to formula (2). MI U is a sequence-dependent exercise intensity threshold that can be set by prior testing. Once the GOP motion intensity caused by actively discarding video frames is greater than the MIU , the video continuity at the receiving end is easily perceived by the user, resulting in a decline in the subjective experience of the video. In addition, TL H is the highest temporal level in the video, and TL ti should not exceed TL H .

Figure BDA00003226567000081
Figure BDA00003226567000081

2)减小接收端缓冲区出现下溢的概率,尽量避免视频播放的中断。2) Reduce the probability of buffer underflow at the receiving end, and try to avoid interruption of video playback.

3)降低视频发送调度造成的视频质量突变的可能性。3) Reduce the possibility of sudden changes in video quality caused by video transmission scheduling.

原则2)和3)所要满足的原则是在SVC视频的发送调度方案中体现的,调度方案分①、②、③来分开描述。The principles to be satisfied by principles 2) and 3) are reflected in the SVC video transmission scheduling scheme, and the scheduling scheme is divided into ①, ②, and ③ to describe separately.

图1是根据本发明实施例的SVC视频在认知无线电环境下的传输播放示意图。FIG. 1 is a schematic diagram of transmission and playback of SVC video in a cognitive radio environment according to an embodiment of the present invention.

下面,结合图1对根据本发明实施例的SVC视频在认知无线电环境中传输调度方法进行描述。在该调度方案中,采用了发送调度窗口这一概念,处于该发送调度窗口中的数据在调度时间单元中都将被发送。In the following, a method for scheduling transmission of SVC video in a cognitive radio environment according to an embodiment of the present invention will be described with reference to FIG. 1 . In this scheduling scheme, the concept of sending scheduling window is adopted, and the data in the sending scheduling window will be sent in the scheduling time unit.

①确定发送调度窗口的长度。① Determine the length of the sending scheduling window.

每一个调度时间单元中的发送调度窗口的初始化值都等于图像组尺寸(Size of GOP,SG)。SVC视频传输调度控制器接收来自接收端的反馈,获知接收缓冲区当前是处于充溢状态还是饥饿状态。当接收端缓冲区处于充盈状态时,SVC视频传输调度模块可以设定发送调度窗口长度的上边界为SG以保持接收端缓冲区的充盈状况;当接收端缓冲区已经处于饥饿状态时,可以设定发送调度窗口长度的上边界为2*SG,以使得在该调度时间单元中能尽量多发送一些帧,避免接收端缓冲区出现下溢。例如,SG=8,则发送调度窗口的上边界可以被设定为16。The initialization value of the sending scheduling window in each scheduling time unit is equal to the size of the group of pictures (Size of GOP, SG). The SVC video transmission scheduling controller receives feedback from the receiving end to know whether the receiving buffer is currently in a full state or a hungry state. When the receiving end buffer is in a full state, the SVC video transmission scheduling module can set the upper boundary of the sending scheduling window length to be SG to keep the receiving end buffer full; when the receiving end buffer is already in a hungry state, it can be set The upper boundary of the length of the sending scheduling window is set as 2*SG, so that as many frames as possible can be sent in the scheduling time unit, so as to avoid underflow of the buffer at the receiving end. For example, if SG=8, then the upper boundary of the sending scheduling window can be set to 16.

在发送调度窗口长度的上边界确定后,要尽量最大化发送调度窗口的长度。发送调度窗口越长,表示所发送的帧越多,意味着在接收端所能维持的持续播放时间越长。这样,就达到了减小接收端缓冲区下溢的概率的目的,从而有效地避免了发生播放中断。After the upper boundary of the length of the sending scheduling window is determined, the length of the sending scheduling window should be maximized as much as possible. The longer the sending scheduling window, the more frames are sent, which means the longer the continuous playback time can be maintained at the receiving end. In this way, the purpose of reducing the probability of buffer underflow at the receiving end is achieved, thereby effectively avoiding playback interruption.

最大化发送调度窗口是在考虑带宽限制和基本视频质量要求的条件下求得的。在第k个调度时间单元中的可用信道带宽Rc(k)可以通过公式(1)求得。在该调度时间单元中发送的数据不应该超过该可用带宽,否则就会发生严重的丢包,导致视频主观体验下降。为了能最大化发送调度窗口长度,同时满足基本的视频质量要求,先考虑将那些能满足基本质量要求的数据准备于发送调度窗口中,能满足基本质量要求的数据如1)中所述。The maximum sending scheduling window is obtained under the condition of considering bandwidth limitation and basic video quality requirement. The available channel bandwidth R c (k) in the kth scheduling time unit can be obtained by formula (1). The data sent in the scheduling time unit should not exceed the available bandwidth, otherwise serious packet loss will occur, resulting in a decline in the subjective experience of the video. In order to maximize the length of the sending scheduling window and meet the basic video quality requirements, first consider preparing the data that can meet the basic quality requirements in the sending scheduling window. The data that can meet the basic quality requirements are as described in 1).

如果这些数据的总数据量还未超过该调度时间单元中的可用带宽,则进一步考虑把一些时间可伸缩维度的增强层数据和质量可伸缩维度的增强层数据准备于发送调度窗口中,以备决策完成后发送,如步骤②和③所示。如果这些满足基本质量要求的数据总量超过了可用带宽,则优先将那些发送顺序靠前的数据置于发送调度窗口中,并且跳过步骤②和③,决策阶段结束,开始发送数据。If the total data volume of these data has not exceeded the available bandwidth in the scheduling time unit, it is further considered to prepare some enhancement layer data of time scalable dimension and enhancement layer data of quality scalable dimension in the sending scheduling window for preparation Send after the decision is made, as shown in steps ② and ③. If the total amount of data that meets the basic quality requirements exceeds the available bandwidth, the data sent earlier will be prioritized in the sending scheduling window, and steps ② and ③ will be skipped. The decision-making phase ends and the data starts to be sent.

②增加时间可伸缩维度的增强层。② Increase the enhancement layer of time scalable dimension.

为了用户对更平滑的视频播放的需求,可以用剩余的可用带宽发送更多的数据以使得视频播放更平滑,并且使得视频清晰度更高。平滑视频播放可以通过发送更多的时间可伸缩维度的增强层达到目的。In order to meet the user's demand for smoother video playback, more data can be sent with the remaining available bandwidth to make the video playback smoother and the video definition higher. Smooth video playback can be achieved by sending more enhancement layers with temporal scalability dimensions.

首先,在质量基本层增加时间可伸缩维度的增强层。在将那些尚未放置于发送调度窗口中的质量基本层数据准备于发送调度窗口中的过程中,时间分层等级越低的NALU优先级越高。如果有多个NALU具有相同的时间分层等级,则发送顺序靠前的NALU具有更高的优先级。如果所有的质量基本层数据都已被置于发送调度窗口中而仍有剩余的可用带宽,则在发送调度窗口中继续增加质量可伸缩维度的增强层如③所示,否则跳过步骤③,决策阶段结束,开始发送数据。First, an enhancement layer of temporal scalability is added to the quality base layer. In the process of preparing those quality base layer data that have not been placed in the sending scheduling window in the sending scheduling window, the NALU with a lower temporal hierarchical level has a higher priority. If there are multiple NALUs with the same temporal hierarchical level, the NALU sent earlier has higher priority. If all the quality basic layer data have been placed in the sending scheduling window and there is still remaining available bandwidth, then continue to increase the enhancement layer of the quality scalable dimension in the sending scheduling window as shown in ③, otherwise skip step ③, The decision phase ends and data is sent.

③增加质量可伸缩维度的增强层。③ Increase the enhancement layer of quality scalable dimension.

如果质量基本层数据都已经被置于发送调度窗口之中而仍有可用带宽剩余,则考虑进一步增加质量可伸缩维度的增强层数据。由于质量可伸缩维度各层之间存在依赖关系,所以质量增强层等级越低,其被选择的优先级越高。在某一个质量增强层,仍然有可能因为可用带宽限制而无法将该增强层的数据都置于发送调度窗口之中,那么该层中时间分层等级越低的NALU被选择的优先级越高。如果有多个NALU具有相同的时间分层等级,则发送顺序靠前的NALU有着更高的优先级。为了尽量避免视频播放的质量突变,可以限制相邻GOP之间的质量层数之差不超过2,这样用户就不易察觉到视频播放有明显的质量突变。If the quality base layer data has been placed in the sending scheduling window and there is still available bandwidth remaining, consider further adding the enhancement layer data of the quality scalability dimension. Since there is a dependency relationship between layers in the quality scalability dimension, the lower the level of the quality enhancement layer, the higher its selection priority. In a certain quality enhancement layer, it is still possible that the data of the enhancement layer cannot be placed in the sending scheduling window due to the limitation of the available bandwidth, so the NALU with the lower time hierarchical level in this layer is selected with a higher priority . If there are multiple NALUs with the same temporal hierarchical level, the NALU sent earlier has a higher priority. In order to avoid sudden changes in the quality of video playback, the difference in the number of quality layers between adjacent GOPs can be limited to no more than 2, so that users are less likely to perceive obvious quality changes in video playback.

通过步骤①、②和③,那些在可用带宽限制下最能优化视频主观体验的数据就都被选择于发送调度窗口之中,决策阶段完成。随后,发送调度窗口中的数据将按照各自的发送顺序通过可用信道1-10进行发送。Through steps ①, ② and ③, those data that can best optimize the subjective experience of video under the limitation of available bandwidth are all selected in the sending scheduling window, and the decision-making phase is completed. Subsequently, the data in the sending scheduling window will be sent through the available channels 1-10 according to their respective sending order.

当接收缓冲区中的数据处于充盈状态时,采用正常的播放速率当然是最好的选择。然而,当接收缓冲区中的数据处于饥饿状态时,如果仍采用正常的播放速率,则有可能会因为数据消耗过快而造成接收端缓冲区出现下溢情况,从而导致视频播放的中断。在后一种情况下,可以采用适当减小播放速率的方式,以使得该速率的视频播放既不会造成接收端缓冲区的下溢,又不会造成视频主观体验的明显下降。When the data in the receiving buffer is full, the normal playback rate is of course the best choice. However, when the data in the receiving buffer is in a hungry state, if the normal playback rate is still used, the buffer at the receiving end may underflow due to excessive data consumption, resulting in interruption of video playback. In the latter case, an appropriate method of reducing the playback rate can be adopted, so that video playback at this rate will neither cause underflow of the buffer at the receiving end nor cause a significant decrease in the subjective experience of the video.

在视频场景运动剧烈处减小播放速率会造成明显的画面迟滞,导致视频主观体验下降;在视频场景运动舒缓处减小播放速率,则用户对视频主观体验不会有明显的下降。因此,在接收端缓冲区中的数据处于饥饿状态时,基于内容的自适应播放模块需要根据视频场景的运动强度适当、适时地调整视频的播放速率。Reducing the playback rate at places where the video scene moves violently will cause obvious picture lag, resulting in a decline in the subjective experience of the video; reducing the playback rate at places where the video scene moves slowly, the user's subjective experience of the video will not be significantly reduced. Therefore, when the data in the buffer at the receiving end is in a hungry state, the content-based adaptive playback module needs to properly and timely adjust the video playback rate according to the motion intensity of the video scene.

举例来说,对于播放速率的调整以GOP为单位的情况,每一个GOP的播放速率与视频播放的帧率(Frame Rate,FR)相对应。为了体现在上一段中分析的视频场景的运动强度MI与帧率FR应该呈现正相关关系,在选择每个GOP对应的播放帧率时,可以采用两个相邻GOP之间的MI/FR值之差的绝对值应该尽量的小的原则。For example, when the playback rate is adjusted in units of GOP, the playback rate of each GOP corresponds to the frame rate (Frame Rate, FR) of video playback. In order to reflect that the motion intensity MI and frame rate FR of the video scene analyzed in the previous paragraph should show a positive correlation, when selecting the playback frame rate corresponding to each GOP, the MI/FR value between two adjacent GOPs can be used The absolute value of the difference should be as small as possible.

另外,为了避免由于视频播放的帧率突变或帧率过小而造成视频主观体验下降,可以设定相邻的两个GOP的播放帧率之差不应该超过阈值FDbench,每个GOP的播放帧率都不应该小于阈值FRlow。综上所述,对每个GOP的对应帧率FRc的选择如公式(4)所示。在公式(4)中,FRSET表示当前GOP可选用的帧率集合(15,16,…,30);MIc和MIp分别表示当前GOP的运动强度和前一GOP的运动强度,运动强度MI的值可以通过公示(2)进行计算;FRp表示前一个GOP所采用的播放帧率。In addition, in order to avoid the degradation of the subjective experience of the video due to sudden changes in the frame rate of the video playback or the frame rate is too small, it can be set that the difference between the playback frame rates of two adjacent GOPs should not exceed the threshold FD bench , and the playback of each GOP None of the frame rates should be less than the threshold FR low . In summary, the selection of the corresponding frame rate FR c for each GOP is shown in formula (4). In formula (4), FR SET represents the frame rate set (15, 16, ..., 30) available for the current GOP; MI c and MI p represent the motion intensity of the current GOP and the motion intensity of the previous GOP, and the motion intensity The value of MI can be calculated by publicity (2); FR p represents the playback frame rate adopted by the previous GOP.

FR c = arg min FR ∈ FR SET | MI c FR - MI p FR p |   (5) FR c = arg min FR ∈ FR SET | MI c FR - MI p FR p | (5)

sthe s .. tt .. {{ || FRFR -- FRFR pp || ≤≤ FDFD benchbench FRFR ≥&Greater Equal; FRFR lowlow

本发明综合考虑多个对视频主观体验的影响因素,优化了接收端视频的用户主观体验。本发明的方案主要应用于非直播的视频应用,视频源的所有信息可以提前准备好,因此该调度方案的计算复杂度较低。The present invention comprehensively considers multiple influencing factors on the subjective experience of the video, and optimizes the subjective experience of the user of the video at the receiving end. The solution of the present invention is mainly applied to non-live video applications, and all information of video sources can be prepared in advance, so the calculation complexity of the scheduling solution is low.

图2是传统和根据本发明实施例的SVC发送调度方法的比较示意图。Fig. 2 is a schematic diagram of a comparison between traditional and SVC transmission scheduling methods according to an embodiment of the present invention.

首先,假设在发送端进行发送调度之前,通过接收端的反馈已知接收缓冲区已经处于“饥饿”状态,并且用户在一个时隙中的可用信道带宽足够支撑发送SVC编码视频中的24个NALU。下面,通过图2的比较和具体实施来说明根据本发明的SVC发送调度方法。First, it is assumed that the receiving buffer is known to be in a "starved" state through the feedback of the receiving end before the sending end schedules the transmission, and the user's available channel bandwidth in one time slot is sufficient to support sending 24 NALUs in the SVC encoded video. Next, the SVC transmission scheduling method according to the present invention will be described through the comparison and specific implementation of FIG. 2 .

图2中的(a)部分示出了在缓冲区处于“饥饿”状态下仍采用传统方法进行发送调度。如(a)部分所示,发送端将数据至于发送调度窗口(黑框)中以备发送,发送调度窗口中的数据为第i个GOP。在这个待发送的GOP中,NALU数据在质量可伸缩维度上分属于质量基本层F0以及质量增强层F1和F2,在时间可伸缩维度上分属于时间基本层T0以及时间增强层T1、T2和T3。质量可伸缩维度的层级包含F0,F1,F2,其中较高的质量层级,即F1和F2,会使播放的视频画面更清晰;而其时间可伸缩维度的层级按照发送顺序包含T0,T1,T2,T3,T3,T2,T3,T3,即发送的数据在时间维度上只跨越8帧,不能保证能够较好地为接收端缓冲区补充数据而避免缓冲区下溢。Part (a) in Figure 2 shows that the traditional method is still used for sending scheduling when the buffer is in a "starved" state. As shown in part (a), the sender puts the data in the sending scheduling window (black box) for sending, and the data in the sending scheduling window is the i-th GOP. In this GOP to be sent, the NALU data belongs to the quality basic layer F0 and the quality enhancement layers F1 and F2 in the quality scalable dimension, and belongs to the time basic layer T0 and the time enhancement layers T1, T2 and F2 in the time scalable dimension. T3. The levels of the quality scalability dimension include F0, F1, and F2, and the higher quality levels, namely F1 and F2, will make the played video clearer; and the levels of the time scalability dimension include T0, T1, T2, T3, T3, T2, T3, T3, that is, the sent data only spans 8 frames in the time dimension, and there is no guarantee that the receiving end buffer can be supplemented with data to avoid buffer underflow.

图2中的(b)到(e)的部分采用了根据本发明实施例的SVC视频发送调度方法。Parts (b) to (e) in FIG. 2 adopt the SVC video transmission scheduling method according to the embodiment of the present invention.

在(b)部分中,先考虑只发送那些保证基本质量的数据。如前文所述,通过计算可知第i个GOP的时间层级阈值是TL2。如(b)部分所示,发送端将能够保证基本质量的基本层数据置于发送调度窗口中,即将时间维度的层级不超过TL2的基本层数据的都置于发送调度窗口中。由于在当前时隙中的用户可用带宽足以支撑24个NALU,在上述基本层数据被置于发送调度窗口中之后,当前的可用信道带宽尚可以支撑20个NALU。In part (b), consider sending only those data that guarantee basic quality. As mentioned above, it can be known through calculation that the time level threshold of the i-th GOP is TL2. As shown in part (b), the sender puts the basic layer data that can guarantee the basic quality in the sending scheduling window, that is, puts all the basic layer data whose time dimension level does not exceed TL2 in the sending scheduling window. Since the available bandwidth of the user in the current time slot is sufficient to support 24 NALUs, after the above-mentioned basic layer data is placed in the sending scheduling window, the current available channel bandwidth can still support 20 NALUs.

在(c)部分中,节省下来的可用信道带宽可用来增长发送调度窗口的长度。增长后的发送调度窗口为增长前的长度的两倍,待发送数据在时间维度上的跨度可达到16帧。另外,如前文所述,通过计算可知第i+1个GOP的时间层级阈值是TL1,于是将第i+1个GOP中时间层级不超过TL1的NALU都置于发送调度窗口中。发送调度窗口的数据仅能保证接收端视频最基本的清晰度和连续性要求。此时,发送调度窗口中的数据总量未超过当前调度时间单元中的可用信道带宽。In part (c), the saved available channel bandwidth can be used to increase the length of the sending scheduling window. The increased sending scheduling window is twice the length before the increasing, and the span of data to be sent in the time dimension can reach 16 frames. In addition, as mentioned above, it can be known through calculation that the time level threshold of the i+1th GOP is TL1, so all NALUs whose time level does not exceed TL1 in the i+1th GOP are placed in the sending scheduling window. Sending the data of the scheduling window can only guarantee the most basic clarity and continuity requirements of the video at the receiving end. At this time, the total amount of data in the sending scheduling window does not exceed the available channel bandwidth in the current scheduling time unit.

(c)和(d)之间的数字1-10标示了属于F0的NALU被选择到发送调度窗口中的优先级顺序,数字越小表示被选择的优先级越高。可以看出,时间层级越低的NALU的优先级越高,时间层级相同的NALU,其发送顺序靠前的优先级更高。The numbers 1-10 between (c) and (d) indicate the priority order in which the NALUs belonging to F0 are selected into the sending scheduling window, and the smaller the number, the higher the selected priority. It can be seen that NALUs with lower time levels have higher priorities, and NALUs with the same time level have higher priorities in sending order.

在(d)部分中,属于F0的多个具有各自时间层级的NALU被选择加入发送调度窗口,可见按照发送顺序从时间层级低且发送顺序靠前的NALU开始选择,直到将F0的NALU都加入发送调度窗口为止。无疑,这增加了SVC视频的时间分层数目,进而使得视频的视觉连续性得以优化。至此,所有质量基本层的NALU数据都已被准备于发送调度窗口中,而当前的调度时间单元的可用信道带宽尚可容纳8个NALU。In part (d), multiple NALUs with their own time levels belonging to F0 are selected to join the sending scheduling window. It can be seen that the NALUs with lower time levels and higher sending order are selected according to the sending order until all NALUs of F0 are added. until the dispatch window is sent. Undoubtedly, this increases the number of temporal layers of the SVC video, thereby optimizing the visual continuity of the video. So far, all NALU data of the basic quality layer have been prepared in the sending scheduling window, and the available channel bandwidth of the current scheduling time unit can accommodate 8 NALUs.

在(e)部分中,进一步向发送调度窗口中增加增强层的数据。如前文所述,质量可伸缩维度的各层之间存在依赖关系,层级越低的质量增强层被选择的优先级越高。因此,相对于F2层和F3层有更高的优先级的F1层的NALU添加到发送调度窗口中。也如前文所述,由于可用信道带宽的限制,质量增强层中时间层级越低的NALU被选择的优先级越高。因此,在尚余8个可选NALU的情况下,从质量增强层F1中的数据中顺序选择T0,T1和T2层的8个NALU加入发送调度窗口。通过增加发送调度窗口中纵向的质量层级,充分利用了当前时间调度单元中的可用信道带宽,同时也达到了优化视频清晰度的目的。In part (e), the enhancement layer data is further added to the sending scheduling window. As mentioned above, there is a dependency relationship between layers in the quality scalability dimension, and the quality enhancement layer with a lower layer has a higher priority for selection. Therefore, the NALUs of the F1 layer that have higher priority than the F2 layer and the F3 layer are added to the sending scheduling window. Also as mentioned above, due to the limitation of the available channel bandwidth, the NALU with the lower temporal level in the quality enhancement layer is selected with a higher priority. Therefore, in the case that there are still 8 optional NALUs, 8 NALUs of layers T0, T1 and T2 are sequentially selected from the data in the quality enhancement layer F1 to add to the sending scheduling window. By increasing the vertical quality level in the sending scheduling window, the available channel bandwidth in the current time scheduling unit is fully utilized, and the purpose of optimizing video clarity is also achieved.

经过图2中的(b)到(e),整个发送调度结束,发送调度窗口中的数据将按照其自身的发送顺序被发送到接收端。After (b) to (e) in Figure 2, the entire sending scheduling ends, and the data in the sending scheduling window will be sent to the receiving end according to its own sending order.

专业人员应该还可以进一步意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Professionals should further realize that the units and algorithm steps described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software, or a combination of the two. In order to clearly illustrate the relationship between hardware and software Interchangeability. In the above description, the composition and steps of each example have been generally described according to their functions. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present invention.

结合本文中所公开的实施例描述的方法或算法的步骤可以用硬件、处理器执行的软件模块,或者二者的结合来实施。软件模块可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的存储介质中。The steps of the methods or algorithms described in connection with the embodiments disclosed herein may be implemented by hardware, software modules executed by a processor, or a combination of both. Software modules can be placed in random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other Any other known storage medium.

以上所述的具体实施方式,对本发明的目的、技术方案和有益效果进行了进一步详细说明,所应理解的是,以上所述仅为本发明的具体实施方式而已,并不用于限定本发明的保护范围,凡在本发明的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The specific embodiments described above have further described the purpose, technical solutions and beneficial effects of the present invention in detail. It should be understood that the above descriptions are only specific embodiments of the present invention and are not intended to limit the scope of the present invention. Protection scope, within the spirit and principles of the present invention, any modification, equivalent replacement, improvement, etc., shall be included in the protection scope of the present invention.

Claims (10)

1. the transmission dispatching method of a scalable video SVC video, described SVC video is comprised of a plurality of network abstraction layer unit NALU, each NALU has unique quality level and unique time level in quality dimensions and time dimension, described quality layers level is divided into the basic layer of a quality and a plurality of quality enhancement layer, described time horizon level is divided into basic layer of a time and a plurality of time enhancement layer, and described method comprises:
Determine the length of transmitting and scheduling window according to the state in the rx-side buffering district of SVC video, described transmitting and scheduling window is used for dispatching NALU waiting for transmission;
Select the NALU of first to place described transmitting and scheduling window from described SVC video, the quality level of the described NALU of first is that the basic layer of quality and its time horizon level are less than or equal to very first time level threshold value;
If the data volume of the described NALU of first does not reach current available cognitive radio channel bandwidth, then select second portion NALU to place described transmitting and scheduling window from described SVC video, the quality level of described second portion NALU equals the basic layer of quality and its time horizon level is less than or equal to the second time level threshold value;
If the data volume of the described NALU of first and described second portion NALU does not reach current available cognitive radio channel bandwidth, then from described SVC video, select third part NALU to place described transmitting and scheduling window, the quality level of described third part NALU equals the basic layer of quality, wherein, the described NALU of first, described second portion NALU and the span of described third part NALU on time dimension are no more than the length of described transmitting and scheduling window;
The NALU in the described transmitting and scheduling window is transferred to the receiving terminal of SVC video by current available cognitive radio channel.
2. method according to claim 1 also comprises:
After the described NALU of first, described second portion NALU and described third part NALU are placed described transmitting and scheduling window, if also have remaining bandwidth in the current available cognitive radio channel, then select the 4th part NALU to place described transmitting and scheduling window from the SVC video, the quality level of described the 4th part NALU is quality enhancement layer and near the basic layer of described quality.
3. method according to claim 1, wherein, described very first time level threshold value and described the second time level threshold value are that the video content features of the present image group that adheres to separately by the described NALU of first and described second portion NALU calculates in advance.
4. method according to claim 1 and 2, wherein, in the process of selecting described second portion NALU, third part NALU and the 4th part NALU, the preferential relatively low NALU of select time level, the preferential forward NALU of sending order that selects in the identical situation of the time of NALU level.
5. the transmitting and scheduling controller of a scalable video SVC video, described SVC video is comprised of a plurality of network abstraction layer unit NALU, each NALU has unique quality level and unique time level in quality dimensions and time dimension, described quality layers level is divided into the basic layer of a quality and a plurality of quality enhancement layer, described time horizon level is divided into basic layer of a time and a plurality of time enhancement layer, and described controller is arranged to:
Determine the length of transmitting and scheduling window according to the state in the rx-side buffering district of SVC video, described transmitting and scheduling window is used for dispatching NALU waiting for transmission;
Select the NALU of first to place described transmitting and scheduling window from described SVC video, the quality level of the described NALU of first is that the basic layer of quality and its time horizon level are less than or equal to very first time level threshold value;
If the data volume of the described NALU of first does not reach current available cognitive radio channel bandwidth, then select second portion NALU to place described transmitting and scheduling window from described SVC video, the quality level of described second portion NALU equals the basic layer of quality and its time horizon level is less than or equal to the second time level threshold value;
If the data volume of the described NALU of first and described second portion NALU does not reach current available cognitive radio channel bandwidth, then from described SVC video, select third part NALU to place described transmitting and scheduling window, the quality level of described third part NALU equals the basic layer of quality, wherein, the described NALU of first, described second portion NALU and the span of described third part NALU on time dimension are no more than the length of described transmitting and scheduling window;
The NALU in the described transmitting and scheduling window is transferred to the receiving terminal of SVC video by current available cognitive radio channel.
6. controller according to claim 5 also is arranged to:
After the described NALU of first, described second portion NALU and described third part NALU are placed described transmitting and scheduling window, if also have remaining bandwidth in the current available cognitive radio channel, then select the 4th part NALU to place described transmitting and scheduling window from the SVC video, the quality level of described the 4th part NALU is quality enhancement layer and near the basic layer of described quality.
7. controller according to claim 5, wherein, described very first time level threshold value and described the second time level threshold value are that the video content features of the present image group that adheres to separately by the described NALU of first and described second portion NALU calculates in advance.
8. according to claim 5 or 6 described controllers, wherein, in the process of selecting described second portion NALU, third part NALU and the 4th part NALU, the preferential relatively low NALU of select time level, the preferential forward NALU of sending order that selects in the identical situation of the time of NALU level.
9. the self adaptation player method of a scalable video SVC video comprises:
Data in play buffer can't support by current playback rate and play in the situation of the scheduled time,
Determine frame per second set, each frame per second in the described frame per second set is not less than predetermined frame rate and is no more than predetermined threshold with absolute value that the previous image multicast is put the difference of frame per second;
The absolute value of the difference of the ratio of the exercise intensity of the ratio of each frame per second and previous image group and play frame rate thereof in the exercise intensity of calculating present image group and the set of described frame per second;
In described frame per second set, select frame per second corresponding to least absolute value as the play frame rate of present image group.
10. the self adaptation player of a scalable video SVC video, described player is arranged to:
Data in play buffer can't support by current playback rate and play in the situation of the scheduled time,
Determine frame per second set, each frame per second in the described frame per second set is not less than predetermined frame rate and is no more than predetermined threshold with absolute value that the previous image multicast is put the difference of frame per second;
The absolute value of the difference of the ratio of the exercise intensity of the ratio of each frame per second and previous image group and play frame rate thereof in the exercise intensity of calculating present image group and the set of described frame per second;
In described frame per second set, select frame per second corresponding to least absolute value as the play frame rate of present image group.
CN201310191314.XA 2013-05-22 2013-05-22 The transmission dispatching method of scalable video SVC video Expired - Fee Related CN103313054B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310191314.XA CN103313054B (en) 2013-05-22 2013-05-22 The transmission dispatching method of scalable video SVC video

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310191314.XA CN103313054B (en) 2013-05-22 2013-05-22 The transmission dispatching method of scalable video SVC video

Publications (2)

Publication Number Publication Date
CN103313054A true CN103313054A (en) 2013-09-18
CN103313054B CN103313054B (en) 2016-05-04

Family

ID=49137772

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310191314.XA Expired - Fee Related CN103313054B (en) 2013-05-22 2013-05-22 The transmission dispatching method of scalable video SVC video

Country Status (1)

Country Link
CN (1) CN103313054B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104683987A (en) * 2013-11-26 2015-06-03 中国科学院声学研究所 Transmission Scheduling Method of SVC Video in Cognitive Radio Multi-channel
CN105263067A (en) * 2014-07-17 2016-01-20 中国科学院声学研究所 Multi-network concurrent transmission method of scalable encoding video stream
CN105635738A (en) * 2014-09-05 2016-06-01 三星Sds株式会社 Video stream providing method and system and relay apparatus
WO2018145522A1 (en) * 2017-02-09 2018-08-16 中国移动通信有限公司研究院 Virtual reality service processing method, wireless access network element device and storage medium
CN110418143A (en) * 2019-07-19 2019-11-05 山西大学 A transmission method of SVC video in Internet of Vehicles
CN111601109A (en) * 2020-05-13 2020-08-28 西安万像电子科技有限公司 Image data processing method, device and system
CN113395522A (en) * 2021-05-12 2021-09-14 陕西师范大学 Video transmission method and device
CN117596425A (en) * 2023-10-24 2024-02-23 书行科技(北京)有限公司 Method and device for determining coding frame rate, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1726717A (en) * 2002-12-13 2006-01-25 皇家飞利浦电子股份有限公司 Transformation method for Multiple Description Coding/Scalable Coding
CN1738424A (en) * 2005-05-27 2006-02-22 上海大学 A rate control method for fine-grained scalable coding based on H.264
WO2007081148A1 (en) * 2006-01-09 2007-07-19 Electronics And Telecommunications Research Institute Transmitting/receiving method of svc file and apparatus thereof
CN101682766A (en) * 2007-06-05 2010-03-24 汤姆逊许可公司 Apparatus and method for encoding video content in hierarchical stream form
CN101707701A (en) * 2009-11-26 2010-05-12 上海交通大学 Multi-stream information transmission system and transmission method thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1726717A (en) * 2002-12-13 2006-01-25 皇家飞利浦电子股份有限公司 Transformation method for Multiple Description Coding/Scalable Coding
CN1738424A (en) * 2005-05-27 2006-02-22 上海大学 A rate control method for fine-grained scalable coding based on H.264
WO2007081148A1 (en) * 2006-01-09 2007-07-19 Electronics And Telecommunications Research Institute Transmitting/receiving method of svc file and apparatus thereof
CN101682766A (en) * 2007-06-05 2010-03-24 汤姆逊许可公司 Apparatus and method for encoding video content in hierarchical stream form
CN101707701A (en) * 2009-11-26 2010-05-12 上海交通大学 Multi-stream information transmission system and transmission method thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BIANCHI, G. ETAL.: "Application-aware H.264 scalable video coding delivery over wireless LAN: experimental assessment", 《2009 SECOND INTERNATIONAL WORKSHOP ON CROSS LAYER DESIGN (IWCLD 2009)》, 12 June 2009 (2009-06-12) *
DONGLIN HU ETAL.: "On medium grain scalable video streaming over cognitive radio femtocell networks", 《IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS》, vol. 30, no. 3, 30 April 2012 (2012-04-30), XP011437089, DOI: doi:10.1109/JSAC.2012.120413 *
牛建伟 等: "一种面向无线视频流的包调度算法", 《北京航空航天大学学报》, vol. 30, no. 11, 30 November 2004 (2004-11-30) *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104683987A (en) * 2013-11-26 2015-06-03 中国科学院声学研究所 Transmission Scheduling Method of SVC Video in Cognitive Radio Multi-channel
CN104683987B (en) * 2013-11-26 2018-02-16 中国科学院声学研究所 Transmission dispatching method of the SVC videos in cognitive radio multichannel
CN105263067A (en) * 2014-07-17 2016-01-20 中国科学院声学研究所 Multi-network concurrent transmission method of scalable encoding video stream
CN105263067B (en) * 2014-07-17 2018-02-16 中国科学院声学研究所 A kind of more net concurrent transmission methods of scalable coded video stream
CN105635738A (en) * 2014-09-05 2016-06-01 三星Sds株式会社 Video stream providing method and system and relay apparatus
CN105635738B (en) * 2014-09-05 2019-10-11 三星Sds株式会社 Method and system for providing video stream and relay device
CN108419268A (en) * 2017-02-09 2018-08-17 中国移动通信有限公司研究院 A virtual reality service processing method and wireless access network element equipment
WO2018145522A1 (en) * 2017-02-09 2018-08-16 中国移动通信有限公司研究院 Virtual reality service processing method, wireless access network element device and storage medium
CN108419268B (en) * 2017-02-09 2020-10-16 中国移动通信有限公司研究院 Virtual reality service processing method and wireless access network element equipment
CN110418143A (en) * 2019-07-19 2019-11-05 山西大学 A transmission method of SVC video in Internet of Vehicles
CN110418143B (en) * 2019-07-19 2021-05-14 山西大学 SVC video transmission method in Internet of vehicles
CN111601109A (en) * 2020-05-13 2020-08-28 西安万像电子科技有限公司 Image data processing method, device and system
CN113395522A (en) * 2021-05-12 2021-09-14 陕西师范大学 Video transmission method and device
CN117596425A (en) * 2023-10-24 2024-02-23 书行科技(北京)有限公司 Method and device for determining coding frame rate, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN103313054B (en) 2016-05-04

Similar Documents

Publication Publication Date Title
CN103313054B (en) The transmission dispatching method of scalable video SVC video
WO2016082220A1 (en) Method and device for transmitting multiple video streams
CN102104538B (en) Mapping parameter dynamic adaptive wireless streaming media transmission control method
TWI501592B (en) Systems and methods for prioritization of data for intelligent discard in a communication network
Yoon et al. MuVi: A multicast video delivery scheme for 4G cellular networks
Shehada et al. QoE-based cross-layer optimization for video delivery in long term evolution mobile networks
CN1523846A (en) Multimedia data transmission method and point coordinator in wireless local area network
Xing et al. Rate adaptation strategy for video streaming over multiple wireless access networks
Siekkinen et al. Optimized upload strategies for live scalable video transmission from mobile devices
Khorov et al. SEBRA: SAND-enabled bitrate and resource allocation algorithm for network-assisted video streaming
Liu et al. Smart downlink scheduling for multimedia streaming over LTE networks with hard handoff
US20110067072A1 (en) Method and apparatus for performing MPEG video streaming over bandwidth constrained networks
El Essaili et al. QoE-driven resource optimization for user generated video content in next generation mobile networks
Deng et al. Cross-layer DASH-based multipath video streaming over LTE and 802.11 ac networks
Ramamurthi et al. Video-QoE aware resource management at network core
CN100431362C (en) Method for packet service scheduling in mobile communication system
Xu et al. Resource allocation for scalable video streaming in highway VANET
Guo et al. Quality-aware streaming in heterogeneous wireless networks
Shehada et al. QoE-based resource reservation for unperceivable video quality fluctuation during Handover in LTE
CN103987133B (en) Resource allocation methods based on client video broadcasting condition under LTE system
CN105282848A (en) Resource scheduling method and device
Yao et al. Perceptual experience oriented transmission scheduling for scalable video streaming over cognitive radio networks
Khan et al. Opportunistic proportional fair downlink scheduling for scalable video transmission over lte systems
JP4227438B2 (en) Video transmission system
Luo et al. On using cooperative game theory to solve the wireless scalable video multicasting problem

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160504

Termination date: 20190522