[go: up one dir, main page]

CN1650626A - Method and apparatus for supporting AVC in MP4 - Google Patents

Method and apparatus for supporting AVC in MP4 Download PDF

Info

Publication number
CN1650626A
CN1650626A CN03809209.3A CN03809209A CN1650626A CN 1650626 A CN1650626 A CN 1650626A CN 03809209 A CN03809209 A CN 03809209A CN 1650626 A CN1650626 A CN 1650626A
Authority
CN
China
Prior art keywords
metadata
sample
multimedia data
data
parameter set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN03809209.3A
Other languages
Chinese (zh)
Other versions
CN100379290C (en
Inventor
M·Z·维沙拉姆
A·塔巴塔拜
T·瓦尔克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Electronics Inc
Original Assignee
Sony Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/371,438 external-priority patent/US20040199565A1/en
Application filed by Sony Electronics Inc filed Critical Sony Electronics Inc
Publication of CN1650626A publication Critical patent/CN1650626A/en
Application granted granted Critical
Publication of CN100379290C publication Critical patent/CN100379290C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8451Structuring of content, e.g. decomposing content into time segments using Advanced Video Coding [AVC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/23439Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4621Controlling the complexity of the content stream or additional data, e.g. lowering the resolution or bit-rate of the video stream for a mobile client with a small screen
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84Generation or processing of descriptive data, e.g. content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/85406Content authoring involving a specific file format, e.g. MP4 format

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Television Systems (AREA)
  • Storage Device Security (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Paramater set metadata identifying parameter sets for multiple portions of multimedia data is created. Further, a file associated with the multimedia data is formed. This file includes the parameter set metadata, as well as other information pertaining to the multimedia data.

Description

用于支持MP4中的AVC的方法和设备Method and apparatus for supporting AVC in MP4

相关申请related application

本申请涉及并要求了下列美国临时专利申请的权益:在2002年2月25日提交的60/359,606号专利申请、在2002年3月5日提交的60/361,773号专利申请、及2002年3月8日提交的60/363,643号专利申请,将这些临时专利申请引入于此,以供参考。This application is related to and claims the benefit of the following U.S. provisional patent applications: 60/359,606, filed February 25, 2002; 60/361,773, filed March 5, 2002; Patent Application No. 60/363,643, filed May 8, which is hereby incorporated by reference into these Provisional Patent Applications.

发明领域field of invention

本发明总体上涉及以多媒体文件格式存储和检索视听内容,具体而言涉及与ISO媒体文件格式相兼容的文件格式。The present invention relates generally to storing and retrieving audiovisual content in multimedia file formats, and in particular to file formats compatible with the ISO media file format.

版权标记/许可Copyright Notation/Permission

这篇专利文件的一部分公开包含了受版权保护的材料。版权所有者不反对专利文件或专利公开的任何人进行传真复制,因为在专利与商标局内它已经在专利文件或记录当中出版了,但是无论如何在别的方面却保留所有的版权权利。下列标记适用于如下所述的软件和数据,并且在图中附此标记:Copyright2001,Sony Electronics,Inc.,版权所有。Portions of the disclosure of this patent document contain material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it has been published in the patent document or records in the Patent and Trademark Office, but otherwise reserves all copyright rights whatsoever. The following marks apply to the software and data described below and are included in the drawings: Copyright2001, Sony Electronics, Inc. All rights reserved.

发明背景Background of the invention

随着对网络、多媒体、数据库及其它数字容量需要的快速增长,已经演进出许多多媒体编码和存储方案。其中一个众所周知的用于编码和存储视听数据的文件格式就是由苹果计算机公司开发的QuickTime文件格式。把QuickTime文件格式用作为创建国际标准化组织(ISO)多媒体文件格式的起点,ISO/IEC 14496-12,信息技术—视听对象编码——第12部分:ISO媒体文件格式(又名ISO文件格式),所述QuickTime文件格式又依次被用作为以下两种标准文件格式的模板:(1)用于由活动图像专家组开发的MPEG-4文件格式,通常所说的MP4(ISO/IEC 14496-14,信息技术——视听对象编码——第14部分:MP4文件格式);和(2)由联合图像专家组(JPEG)开发的JPEG 2000(ISO/IEC 15444-1)的文件格式。With the rapid growth of network, multimedia, database and other digital capacity needs, many multimedia encoding and storage schemes have evolved. One of the well-known file formats for encoding and storing audiovisual data is the QuickTime (R) file format developed by Apple Computer Corporation. Use of the QuickTime file format as a starting point for the creation of the International Organization for Standardization (ISO) multimedia file format, ISO/IEC 14496-12, Information technology — Coding of audiovisual objects — Part 12: ISO media file format (aka ISO file format), The QuickTime file format is used in turn as a template for the following two standard file formats: (1) for the MPEG-4 file format developed by the Moving Picture Experts Group, commonly known as MP4 (ISO/IEC 14496-14, Information technology—Coding of audiovisual objects—Part 14: MP4 file format); and (2) the file format of JPEG 2000 (ISO/IEC 15444-1), developed by the Joint Photographic Experts Group (JPEG).

ISO媒体文件格式由称为框(也称为原子或对象)的面向对象的结构组成。两个重要的顶层框包含媒体数据或元数据。大多数框都描述了元数据的层次,所述元数据提供了关于实际媒体数据的说明性的、结构上的和时间上的信息。这个框的集合是包含在通称为电影框的框中。媒体数据本身可以位于媒体数据框之中或之外。将每个媒体数据流称为轨道(又名基本流或简单称为流)。The ISO media file format consists of object-oriented structures called boxes (also known as atoms or objects). Two important top-level boxes contain media data or metadata. Most boxes describe a hierarchy of metadata that provides descriptive, structural and temporal information about the actual media data. This collection of boxes is contained in a box known as a movie box. The media data itself can be inside or outside the media data frame. Each stream of media data is called a track (aka elementary stream or simply stream).

最初的元数据是电影对象。电影框包括轨道框,所述轨道框描述临时显示的媒体数据。对应轨道的媒体数据可以具有各种类型(例如,视频数据、音频数据、二进制格式屏幕表示(BIFS)等等)。每个轨道都进一步划分为采样(又名访问单元或画面)。采样代表在特定时间点处的媒体数据的单元。采样元数据是包含在一组采样框中的。每个轨道框都包含采样表框元数据框,其包含为它的媒体数据等提供每个采样的时间、其字节大小及其位置(文件的外部或内部)等等的框。采样是最小的数据实体,它能够表示时间、位置及其它元数据信息。The original metadata is the movie object. The movie box includes a track box that describes temporarily displayed media data. The media data corresponding to the track may be of various types (eg, video data, audio data, binary format screen representation (BIFS), etc.). Each track is further divided into samples (aka access units or frames). A sample represents a unit of media data at a particular point in time. Sampling metadata is contained in a set of sampling boxes. Each track box contains a sample table box metadata box, which contains boxes giving its media data, etc. the time of each sample, its size in bytes, and its location (outside or inside the file), etc. A sample is the smallest data entity that can represent time, location, and other metadata information.

最近,MPEG视频组与国际电信联盟(ITU)的视频编码专家组(VCEG)作为联合视频组(JVT)一起开始工作,以开发称为ITURecommendation H.264或MPEG-4-Part 10的新视频编码/解码(编解码器)标准,高级视频编解码器(AVC)或JVT编解码器。在此,可互换地使用这些术语及其缩写,比如像H.264、JVT和AVC。Recently, the MPEG Video Group started work together with the Video Coding Experts Group (VCEG) of the International Telecommunication Union (ITU) as the Joint Video Team (JVT) to develop a new video coding called ITU Recommendation H.264 or MPEG-4-Part 10 /decode(codec) standard, Advanced Video Codec (AVC) or JVT codec. Herein, these terms and their abbreviations are used interchangeably, such as H.264, JVT and AVC.

JVT编解码器设计区分了两种不同的概念层:视频编码层(VCL)和网络抽象层(NAL)。VCL包含编解码器中有关编码的部分,比如像运动补偿、系数变换编码和熵编码。VCL的输出是时间片(slice),每个时间片都包含一系列的宏块以及相关联的首部信息。NAL从用于VCL数据的运输层的细节中抽象化出VCL。它为时间片层上的信息定义了一般的且运输独立的表示。NAL定义视频编解码器本身与外界之间的接口。在内部,NAL使用NAL分组。NAL分组包括表明净载荷类型的类型字段外加净载荷中的比特集。单个时间片内的数据可以进一步分成不同的数据分区。The JVT codec design distinguishes two different conceptual layers: Video Coding Layer (VCL) and Network Abstraction Layer (NAL). VCL contains codec related coding parts, such as motion compensation, coefficient transform coding and entropy coding. The output of VCL is time slice (slice), each time slice contains a series of macroblocks and associated header information. NAL abstracts VCL from the details of the transport layer used for VCL data. It defines a general and transport-independent representation for information on time slices. NAL defines the interface between the video codec itself and the outside world. Internally, NAL uses NAL grouping. A NAL packet includes a type field indicating the payload type plus a set of bits in the payload. Data within a single time slice can be further divided into different data partitions.

在许多现有的视频编码格式中,编码的数据流包括备类包含控制解码过程的参数的首部。例如,MPEG-2视频标准包括序列首部、增强式画面组(GOP)和对应于那些项的视频数据前面的画面首部。在JVT中,把解码VCL数据所需的信息集合成参数集。给每个参数集一个标识符,该标识符随后被用作为来自时间片的引用。可以在流外部(带外)发送参数集,而不是在流内部(带内)发送所述参数集。In many existing video encoding formats, the encoded data stream includes a header that contains parameters that control the decoding process. For example, the MPEG-2 video standard includes a sequence header, an enhanced group of pictures (GOP), and a picture header in front of video data corresponding to those items. In JVT, the information needed to decode VCL data is assembled into a parameter set. Each parameter set is given an identifier, which is then used as a reference from the timeslice. The parameter set may be sent outside the stream (out-of-band) instead of inside the stream (in-band).

现有的文件格式没有提供用于存储与已编码的媒体数据相关联的参数集的工具;它们也没有提供用于有效地将媒体数据(即,采样或子采样)链接到参数集以便能够有效地检索和发送参数集的装置。Existing file formats do not provide facilities for storing parameter sets associated with encoded media data; nor do they provide facilities for efficiently linking media data (i.e., samples or subsamples) to parameter sets in order to be able to efficiently A means of retrieving and sending parameter sets efficiently.

在ISO媒体文件格式中,在不解析媒体数据的情况下能够被访问的最小单位是采样,即AVC中的整个画面。在许多编码格式中,能够把采样进一步分成更小的单元,称为子采样(也称为采样片段或访问单元片段)。就AVC来说,子采样相当于时间片。然而,现有的文件格式不支持对采样的子部分的访问。对于需要灵活地将存储在文件中的数据形成用于流送(streaming)的系统而言,这种缺乏对子采样的访问,阻碍了用于流送的JVT媒体数据的灵活分组化。In the ISO media file format, the smallest unit that can be accessed without parsing the media data is the sample, that is, the entire picture in AVC. In many encoding formats, samples can be further divided into smaller units called subsamples (also known as sample slices or access unit slices). As far as AVC is concerned, subsampling is equivalent to time slices. However, existing file formats do not support access to sampled subsections. This lack of access to subsampling prevents flexible packetization of JVT media data for streaming for systems that require flexible shaping of data stored in files for streaming.

现有的存储格式的另一个限制与响应于当流送媒体数据时改变网络条件而在已存储的流和不同带宽之间进行切换有关。在典型的流送情形中,其中一个关键要求就是响应于改变网络条件来缩放压缩数据的比特率。典型情况下,这是通过对具有为典型的网络条件而设置的不同带宽和质量的多个流进行编码并将它们存储在一个或多个文件中来实现的。然后,服务器能够响应于网络条件而在这些预先编码的流当中进行切换。在现有的文件格式中,仅可以在不依赖于用来重构的早先采样的那些采样上进行流之间的切换。这类采样称为I帧。目前,在依赖于用来重构的早先采样的那些采样(即,依赖于用于参考的多个采样的P帧或B帧)上,没有为流之间的切换提供支持。Another limitation of existing storage formats relates to switching between stored streams and different bandwidths in response to changing network conditions while streaming media data. In a typical streaming scenario, one of the key requirements is to scale the bitrate of the compressed data in response to changing network conditions. Typically, this is accomplished by encoding multiple streams with different bandwidths and qualities set for typical network conditions and storing them in one or more files. The server can then switch among these pre-encoded streams in response to network conditions. In existing file formats, switching between streams is only possible on those samples that do not depend on previous samples used for reconstruction. Such samples are called I-frames. Currently, no support is provided for switching between streams on those samples that depend on previous samples for reconstruction (ie, P-frames or B-frames that depend on multiple samples for reference).

AVC标准提供通称切换画面(称为SI画面和SP画面)的工具,以实现流之间的高效切换、随机访问和错误恢复力以及其它的特征。切换画面是一种特殊类型的画面,这种画面的重构值刚好等于它应当切换成的画面的值。切换画面能够使用不同于那些用于预测它们匹配的画面的参考画面,由此比使用I帧提供了更高效的编码。为了有效地使用文件中存储的切换画面,必须知道哪些组画面是等同的,并且必须知道哪些画面被用于预测。现有的文件格式没有提供这种信息,因此必须通过解析编码的流来提取这些信息,这将是低效而缓慢的。The AVC standard provides facilities known as switching pictures (referred to as SI pictures and SP pictures) to enable efficient switching between streams, random access and error resilience, among other features. A cutscreen is a special type of screen whose reconstruction value is exactly equal to the value of the screen it should switch to. Switching pictures can use reference pictures different from those used to predict their matching pictures, thus providing more efficient coding than using I-frames. In order to effectively use cutaway pictures stored in a file, it must be known which group pictures are equivalent and which pictures are used for prediction. Existing file formats do not provide this information, so it must be extracted by parsing the encoded stream, which would be inefficient and slow.

因此,需要增强存储方法以便解决通过出现视频编码标准而提供的新能力,并且解决那些存储方法的现有限制。Accordingly, there is a need to enhance storage methods in order to address new capabilities provided by emerging video coding standards, and to address existing limitations of those storage methods.

发明概述Summary of the invention

创建标识多媒体数据的多个部分的参数集的参数集元数据。此外,形成与多媒体数据相关联的文件。这个文件包括参数集元数据以及与多媒体数据有关的其它信息。Parameter set metadata identifying parameter sets of portions of the multimedia data is created. In addition, a file associated with the multimedia data is formed. This file includes parameter set metadata and other information related to the multimedia data.

附图简述Brief description of the drawings

本发明是在附图中按照举例的方式而非限制的方式加以说明的,并且在图中相同的附图标记指代相似的元件,在图中:The present invention is illustrated in the accompanying drawings by way of example and not limitation, and like reference numerals refer to similar elements in the drawings, in which:

图1是编码系统的一个实施例的框图;Figure 1 is a block diagram of one embodiment of an encoding system;

图2是解码系统的一个实施例的框图;Figure 2 is a block diagram of one embodiment of a decoding system;

图3是适用于实践本发明的计算机环境的框图;Figure 3 is a block diagram of a computer environment suitable for practicing the invention;

图4是用于在编码系统上存储子采样元数据的方法的流程图;4 is a flowchart of a method for storing subsampled metadata on an encoding system;

图5是用于在解码系统上使用子采样元数据的方法的流程图;5 is a flowchart of a method for using subsampled metadata on a decoding system;

图6举例说明了具有子采样的扩展的MP4媒体流模型;Figure 6 illustrates an extended MP4 media stream model with subsampling;

图7A-7K举例说明了用于存储子采样元数据的示例性数据结构;7A-7K illustrate exemplary data structures for storing subsampling metadata;

图8是用于在编码系统上存储参数集元数据的方法的流程图;8 is a flowchart of a method for storing parameter set metadata on an encoding system;

图9是用于在解码系统上使用参数集元数据的方法的流程图;9 is a flowchart of a method for using parameter set metadata on a decoding system;

图10A-10E举例说明了用于存储参数集元数据的示例性数据结构;10A-10E illustrate exemplary data structures for storing parameter set metadata;

图11举例说明了示例性的增强式画面组(GOP);Figure 11 illustrates an exemplary enhanced group of pictures (GOP);

图12是用于在编码系统上存储序列元数据的流程图;Figure 12 is a flow diagram for storing sequence metadata on an encoding system;

图13是用于在解码系统上使用序列元数据的方法的流程图;13 is a flowchart of a method for using sequence metadata on a decoding system;

图14A-14E举例说明了用于存储序列元数据的示例性数据结构;14A-14E illustrate exemplary data structures for storing sequence metadata;

图15A和15B举例说明了用于比特流切换的切换采样集的使用;Figures 15A and 15B illustrate the use of switching sample sets for bitstream switching;

图15C是用于确定在其上执行两个比特流之间的切换的点的方法的一个实施例的流程图;Figure 15C is a flowchart of one embodiment of a method for determining a point at which to perform a switch between two bitstreams;

图16是用于在编码系统上存储切换采样元数据的方法的流程图;Figure 16 is a flowchart of a method for storing switched sample metadata on an encoding system;

图17是用于在解码系统上使用切换采样元数据的方法的流程图;17 is a flowchart of a method for using switched sample metadata on a decoding system;

图18举例说明了用于存储切换采样元数据的示例性数据结构;Figure 18 illustrates an exemplary data structure for storing switching sample metadata;

图19A和19B举例说明了用以简化到比特流中的随机访问入口点的切换采样集的使用;Figures 19A and 19B illustrate the use of switched sample sets to simplify random access entry points into the bitstream;

图19C是用于确定采样的随机访问点的方法的一个实施例的流程图;Figure 19C is a flowchart of one embodiment of a method for determining random access points for samples;

图20A和20B举例说明了用以简化错误恢复的切换采样集的使用;和Figures 20A and 20B illustrate the use of switched sample sets to simplify error recovery; and

图20C是当发送采样时用于简化错误恢复的方法的一个实施例的流程图。Figure 20C is a flowchart of one embodiment of a method for simplifying error recovery when sending samples.

发明详述Detailed description of the invention

在下文对本发明实施例的详细说明中,参照了附图,在这些图中相同的附图标记表示相似的元件,并且在这些图中通过举例说明的方式示出了具体实施例,在这些具体实施例当中可以实施本发明。足够详细地描述了这些实施例,以使得本领域的技术人员能够实施本发明,且将要理解的是,也可以采用其它实施例,并且在不背离本发明范围的情况下可以作出逻辑的、机械的、电的、功能性的及其它的改变。因此,不应把下列详细说明视作为限制意义,而应当仅仅由所附的权利要求来限定本发明的范围。In the following detailed description of embodiments of the invention, reference is made to the accompanying drawings, in which like reference numerals indicate like elements, and in which are shown by way of illustration specific embodiments in which The present invention can be implemented in the embodiments. These embodiments have been described in sufficient detail to enable those skilled in the art to practice the invention, and it will be understood that other embodiments may be utilized and that logical, mechanical, and other embodiments may be made without departing from the scope of the invention. physical, electrical, functional and other changes. Accordingly, the following detailed description should not be taken in a limiting sense, but the scope of the invention should be defined only by the appended claims.

概观Overview

从本发明的操作概观起,图1举例说明了编码系统100的一个实施例。编码系统100包括:媒体编码器104、元数据生成器106和文件创建器108。媒体编码器104接收可能包含视频数据(例如,从自然源视频场景中创建的视频对象及其它外部视频对象)、音频数据(例如,从自然源音频场景中创建的音频对象及其它外部音频对象)、合成对象或上述的任意组合的媒体数据。媒体编码器104可以由许多单独的编码器组成或包括子编码器,以处理各种类型的媒体数据。媒体编码器104对媒体数据进行编码并将其传递到元数据生成器106。元数据生成器106根据媒体文件格式来生成提供关于媒体数据的信息的元数据。媒体文件格式可能来源于ISO媒体文件格式(或任何其变型,比如MPEG-4、JPEG 2000等)、QuickTime或任何其它的媒体文件格式,并且还包括一些附加的数据结构。在一个实施例中,定义附加的数据结构以存储与媒体数据内的子采样有关的元数据。在另一个实施例中,定义附加数据结构以存储把部分媒体数据(例如,采样或子采样)链接到相应的参数集的元数据,所述参数集包含了传统上一直存储在媒体数据中的解码信息。在又一个实施例中,定义附加数据结构以存储与元数据内的各种采样组有关的元数据,所述元数据是根据媒体数据中采样的相互依赖性而创建的。在又一个实施例中,定义附加数据结构以存储与切换采样集有关的元数据,所述采样集与媒体数据相关联。切换采样集指的是这样一组采样,它们具有相同解码值但可以依赖于不同采样。在其它的实施例中,以正在使用的文件格式来定义附加数据结构的各种组合。这些附加数据结构及其功能将在下面更详细地描述。From an overview of the operation of the present invention, Figure 1 illustrates one embodiment of an encoding system 100 . The encoding system 100 includes: a media encoder 104 , a metadata generator 106 and a file creator 108 . The media encoder 104 receives data that may contain video data (e.g., video objects created from a natural source video scene and other external video objects), audio data (e.g., audio objects created from a natural source audio scene and other external audio objects) , composite objects, or any combination of the above. Media encoder 104 may consist of many individual encoders or include sub-encoders to process various types of media data. The media encoder 104 encodes the media data and passes it to the metadata generator 106 . The metadata generator 106 generates metadata providing information about media data according to a media file format. The media file format may be derived from ISO media file format (or any of its variants, such as MPEG-4, JPEG 2000, etc.), QuickTime or any other media file format, and also includes some additional data structures. In one embodiment, additional data structures are defined to store metadata related to sub-samples within the media data. In another embodiment, an additional data structure is defined to store metadata linking parts of the media data (e.g., samples or subsamples) to corresponding parameter sets containing parameters that have traditionally been stored in the media data Decode the message. In yet another embodiment, additional data structures are defined to store metadata related to various groups of samples within metadata created from interdependencies of samples in the media data. In yet another embodiment, an additional data structure is defined to store metadata related to switching sample sets associated with the media data. A switched sample set refers to a set of samples that have the same decoded value but can depend on different samples. In other embodiments, various combinations of additional data structures are defined in the file format being used. These additional data structures and their functions are described in more detail below.

文件创建器108在文件中存储元数据,所述文件的结构是按媒体文件格式定义的。在一个实施例中,所述文件既包含已编码的媒体数据也包含与那个媒体数据有关的元数据。作为选择,已编码的媒体数据是部分地或全部地包含在独立的文件中的,并且按元数据文件中所含的引用(例如,经由URL)而链接于元数据。文件创建器108所创建的文件在用于存储或发送的信道110上是可用的。The file creator 108 stores metadata in files whose structure is defined by the media file format. In one embodiment, the file contains both encoded media data and metadata related to that media data. Alternatively, the encoded media data is partially or fully contained in a separate file and linked to the metadata by reference (eg, via a URL) contained in the metadata file. Files created by file creator 108 are available on channel 110 for storage or transmission.

图2举例说明了解码系统200的一个实施例。解码系统200包括:元数据提取器204、媒体数据流处理器206、媒体解码器210、合成器212和再现器。解码系统200可以驻留在客户端设备上,并且用于本地重放。作为选择,解码系统200可以用来流送数据,并且具有彼此在网络(例如,因特网)208上相互通信的服务器部分和客户端部分。服务器部分可以包括元数据提取器204和媒体数据流处理器206。客户端部分可以包括媒体解码器210、合成器212和再现器214。FIG. 2 illustrates one embodiment of a decoding system 200 . The decoding system 200 includes: a metadata extractor 204, a media data stream processor 206, a media decoder 210, a synthesizer 212 and a renderer. Decoding system 200 may reside on a client device and be used for local playback. Alternatively, the decoding system 200 may be used to stream data and have a server portion and a client portion that communicate with each other over a network (eg, the Internet) 208 . The server portion may include a metadata extractor 204 and a media data stream processor 206 . The client portion may include a media decoder 210 , a compositor 212 and a renderer 214 .

元数据提取器204负责从存储在数据库216中的文件当中提取元数据或者在网络上(从编码系统100中)接收元数据。所述文件可以包括也可以不包括与正在提取的元数据相关联的媒体数据。从文件中提取的元数据包括上述的一个或多个附加数据结构。Metadata extractor 204 is responsible for extracting metadata from files stored in database 216 or receiving metadata over the network (from encoding system 100). The file may or may not include media data associated with the metadata being extracted. Metadata extracted from a file includes one or more of the additional data structures described above.

把提取出的元数据传递到媒体数据流处理器206,所述媒体数据流处理器206还接收相关联的已编码媒体数据。媒体数据流处理器206利用该元数据来形成将要发送到媒体解码器210的媒体数据流。在一个实施例中,媒体数据流处理器206利用与子采样有关的元数据来定位媒体数据中的子采样(例如,以供分组化)。在另一个实施例中,媒体数据流处理器206利用与参数集有关的元数据来把部分媒体数据链接到其相对应的参数集上。在又一个实施例中,媒体数据流处理器206利用定义元数据内的各种采样组的元数据来访问一定组中的采样(例如,用于通过丢弃包含如下采样的组来缩放比例,响应于传输条件,没有其它采样依赖于所述采样以降低发送的比特率)。在又一个实施例中,媒体数据流处理器206利用定义切换采样集的元数据来定位与应当切换成到采样具有相同解码值的切换采样,但是不依赖于这个结果采样将要依赖的那些采样(例如,以便允许在P帧或B帧上切换成具有不同比特率的流)。The extracted metadata is passed to the media data stream processor 206, which also receives the associated encoded media data. Media data stream processor 206 utilizes the metadata to form a media data stream to be sent to media decoder 210 . In one embodiment, the media data stream processor 206 utilizes metadata related to the subsamples to locate the subsamples in the media data (eg, for packetization). In another embodiment, the media data stream processor 206 utilizes metadata associated with parameter sets to link portions of media data to their corresponding parameter sets. In yet another embodiment, the media data stream processor 206 utilizes metadata defining various sample groups within the metadata to access samples in a certain group (e.g., for scaling by discarding groups containing samples in response to Depending on transmission conditions, no other samples depend on the sample to reduce the transmitted bit rate). In yet another embodiment, the media data stream processor 206 utilizes the metadata defining the set of switched samples to locate switched samples that have the same decoding value as the sample that should be switched to, but independent of those samples that this resulting sample will depend on ( For example, to allow switching to streams with different bitrates on P-frames or B-frames).

一旦形成媒体数据流,就直接地(例如,本地重放)或在网络208上(例如,用于流送数据)将它发送给媒体解码器210以供进行解码。合成器212接收媒体解码器210的输出,并且由再现器214把继而将在用户显示设备上再现的场景合成起来。Once the media data stream is formed, it is sent to the media decoder 210 for decoding, either directly (eg, for local playback) or over the network 208 (eg, for streaming data). A compositor 212 receives the output of the media decoder 210, and a renderer 214 composes the scene which will then be rendered on the user display device.

图3的下列说明意在提供适于实现本发明的计算机硬件及其它操作组件的概观,而不是意在限制可适用的环境。图3举例说明了适于用作为图1的元数据生成器106和/或文件创建器108或者图2的元数据提取器204和/或媒体数据流处理器206的计算机系统的一个实施例。The following description of FIG. 3 is intended to provide an overview of computer hardware and other operating components suitable for implementing the invention, and is not intended to limit the applicable environments. FIG. 3 illustrates one embodiment of a computer system suitable for use as metadata generator 106 and/or file creator 108 of FIG. 1 or metadata extractor 204 and/or media data stream processor 206 of FIG. 2 .

计算机系统340包括耦合于系统总线365的处理器350、存储器355和输入/输出能力360。将存储器355配置成能存储指令,当由处理器350执行所述指令时,执行这里所述的方法。输入/输出360也涵盖了各种类型的计算机可读媒体,包括可由处理器350访问的任意类型的存储装置。本领域技术人员将立刻认识到的是,术语“计算机可读介质/媒体”还涵盖对数据信号进行编码的载波。还应当认识到,系统340受存储器355中所执行的操作系统软件的控制。输入/输出及相关媒体360存储用于本发明的操作系统和方法的计算机可执行指令。图1和2中所示的元数据生成器106、文件创建器108、元数据提取器204和媒体数据流处理器206中的每一个都可以是耦合于处理器350的独立的组件,或者可以用处理器350所执行计算机可执行指令加以具体化。在一个实施例中,计算机系统340可以是ISP(因特网服务提供商)的一部分或者经输入/输出360耦合于ISP,以便在因特网上发送或接收媒体数据。显而易见的是,本发明不限于因特网访问和基于Web的因特网站点;还意在包括直接耦合的网络和专用网络。Computer system 340 includes processor 350 coupled to system bus 365 , memory 355 and input/output capabilities 360 . Memory 355 is configured to store instructions that, when executed by processor 350, perform the methods described herein. Input/output 360 also encompasses various types of computer-readable media, including any type of storage accessible by processor 350 . Those skilled in the art will immediately recognize that the term "computer-readable medium/media" also encompasses a carrier wave encoding a data signal. It should also be appreciated that system 340 is under the control of operating system software executing in memory 355 . Input/output and related media 360 store computer-executable instructions for the operating system and methods of the present invention. Each of the metadata generator 106, file creator 108, metadata extractor 204, and media stream processor 206 shown in FIGS. 1 and 2 may be separate components coupled to processor 350, or may It is embodied by computer-executable instructions executed by processor 350 . In one embodiment, computer system 340 may be part of or coupled to an ISP (Internet Service Provider) via input/output 360 for sending or receiving media data over the Internet. It will be apparent that the present invention is not limited to Internet access and Web-based Internet sites; it is also intended to include directly coupled networks and private networks.

将会认识到的是,计算机系统340只是许多可能的具有不同结构的计算机系统的一个示例。典型的计算机系统将通常至少包括处理器、存储器以及将存储器耦合到处理器的总线。本领域技术人员将立刻认识到的是,本发明能够利用其它的计算机系统配置加以实施,包括多处理器系统、微型计算机、大型计算机等。本发明还可以在分布式计算环境中加以实施,在所述分布计算环境中由经通信网络链接的远程处理装置来执行任务。It will be appreciated that computer system 340 is but one example of many possible computer systems having different configurations. A typical computer system will usually include at least a processor, memory, and a bus coupling the memory to the processor. Those skilled in the art will immediately recognize that the invention can be practiced using other computer system configurations, including multiprocessor systems, microcomputers, mainframe computers, and the like. The invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network.

子采样可访问性Subsampling for accessibility

图4和5举例说明了分别由编码系统100和200解码系统执行的、用于存储和检索子采样元数据的过程。该过程可以由如下处理逻辑来执行,所述处理逻辑可以包括:硬件(例如,电路、专用逻辑等)、软件(比如,在通用计算机系统或专用机器上运行的)或上述两者的结合。对于软件实现的过程来讲,流程图的说明使本领域技术人员能够开发此类包含指令的程序,以便在适当配置的计算机(执行来自于计算机可读媒体的指令的计算机的处理器,包括存储器)上执行该过程。计算机可执行指令可以用计算机程序设计语言来编写,或者可以用固件逻辑来具体化。如果用程序设计语言编写符合公认的标准,那么就能够在各种硬件平台和到各种操作系统的接口上执行这类指令。另外,本发明的实施例并不是参照任何特定的程序设计语言而加以描述的。将会认识到的是,各种程序设计语言都可以用来实现这里所述的教导。此外,谈到当采取动作或产生结果时处于一种形式或另一种形式(例如,程序、方法、过程、应用、模块、逻辑等)的软件,这都是本领域中常见的。这类表达只是一种表述由计算机执行软件令计算机的处理器执行动作或产生结果的简捷方式。将会认识到的是,在不背离本发明范围的情况下,可以将更多或更少的操作并入到图4和5中举例说明的过程中,而且这里所描述和所示出的块的布局方案并未暗示特定的顺序。4 and 5 illustrate the processes performed by the encoding system 100 and decoding system 200, respectively, for storing and retrieving sub-sampled metadata. The process may be performed by processing logic that may include: hardware (eg, circuitry, dedicated logic, etc.), software (eg, running on a general-purpose computer system or a dedicated machine), or a combination of both. For a software-implemented process, the illustration of the flowchart enables one skilled in the art to develop such a program containing instructions for processing on a suitably configured computer (a computer's processor that executes instructions from a computer-readable medium, including a memory ) to execute the process. Computer-executable instructions may be written in a computer programming language, or may be embodied in firmware logic. If written in a programming language conforming to recognized standards, such instructions can be executed on a variety of hardware platforms and interfaces to a variety of operating systems. In addition, embodiments of the present invention are not described with reference to any particular programming language. It will be appreciated that various programming languages can be used to implement the teachings described herein. Furthermore, it is common in the art to speak of software in one form or another (eg, program, method, process, application, module, logic, etc.) when taking an action or producing a result. Such expressions are simply a shorthand way of saying that execution of the software by a computer causes the processor of the computer to perform an action or produce a result. It will be appreciated that more or fewer operations may be incorporated into the processes illustrated in FIGS. 4 and 5 without departing from the scope of the invention, and that the blocks described and illustrated herein The layout scheme does not imply a particular order.

图4是用于在编码系统100上创建子采样元数据的方法400的一个实施例的流程图。最初,方法400始于如下处理逻辑,所述处理逻辑接收具有已编码的媒体数据的文件(处理块402)。接下来,处理逻辑提取标识媒体数据中的子采样的边界的信息(处理块404)。依据正在使用的文件格式,能将时间属性附加到其上的数据流的最小单位称为:采样(如ISO媒体文件格式或QuickTime定义的)、访问单元(如MPEG-4定义的)或画面(如JVT定义的)等等。子采样代表采样层下的数据流的连续部分。子采样的定义取决于编码格式,但是一般而言,子采样是有意义的采样子单元,可以将所述子单元作为单独实体或作为子单元组合来进行编码,以便获得采样的部分重构。子采样还可以称为访问单元片段。往往,子采样代表采样的数据流的划分,以便在相同个采样中每个子采样都具有对其它子采样的极小依赖性或者没有依赖性。例如,在JVT中,子采样是NAL分组。同样,对于MPEG-4视频而言,子采样将是视频分组。FIG. 4 is a flowchart of one embodiment of a method 400 for creating subsampled metadata on encoding system 100 . Initially, method 400 begins with processing logic that receives a file with encoded media data (processing block 402). Next, processing logic extracts information identifying boundaries of subsamples in the media data (processing block 404). Depending on the file format being used, the smallest unit of a data stream to which time attributes can be attached is called: a sample (as defined by the ISO media file format or QuickTime), an access unit (as defined by MPEG-4), or a picture (as defined by MPEG-4) as defined by the JVT), etc. Subsamples represent successive portions of the data stream below the sampling layer. The definition of a subsample depends on the encoding format, but in general, a subsample is a meaningful subunit of sampling that can be encoded as a single entity or as a combination of subunits in order to obtain a partial reconstruction of the sample. Subsampling may also be referred to as an access unit fragment. Often, a subsample represents a division of the sampled data stream such that each subsample has little or no dependence on other subsamples within the same sample. For example, in JVT, subsampling is a NAL packet. Likewise, for MPEG-4 video, the subsamples would be video packets.

在一个实施例中,编码系统100在上述JVT所定义的网络抽象层上进行操作。JVT媒体数据流由一系列NAL分组组成,其中每个NAL分组(也称为NAL单元)都包含首部部分和净载荷部分。其中一种类型的NAL分组用于包含每个时间片的已编码的VCL数据,或者包含时间片的单个数据分区(partition)。另外,NAL分组可以是包含补充增强信息(SEI)消息的信息分组。SEI消息代表将在对相应时间片进行解码时使用的任选数据。在JVT中,子采样可能是具有首部和净载荷两者的完整NAL分组。In one embodiment, encoding system 100 operates on the network abstraction layer defined by the JVT described above. A JVT media data stream consists of a series of NAL packets, where each NAL packet (also called a NAL unit) includes a header part and a payload part. One type of NAL packet is used to contain encoded VCL data per time slice, or to contain a single partition of data for a time slice. In addition, the NAL packet may be an information packet containing a Supplemental Enhancement Information (SEI) message. The SEI message represents optional data to be used when decoding the corresponding slot. In JVT, a subsample may be a complete NAL packet with both header and payload.

在处理块406中,处理逻辑创建定义媒体数据中的子采样的子采样元数据。在一个实施例中,把子采样元数据编组成一组预定的数据结构(例如,一组框)。预定的数据结构组可以包括:包含关于每个子采样的大小的信息的数据结构、包含关于每个采样中的子采样总数的信息的数据结构、包含描述每个子采样的信息(例如,把什么定义为子采样)的数据结构、或包含与子采样有关的数据的任何其它数据结构。In processing block 406, processing logic creates subsampling metadata that defines subsampling in the media data. In one embodiment, subsampling metadata is grouped into a set of predetermined data structures (eg, a set of boxes). The predetermined set of data structures may include: a data structure containing information about the size of each subsample, a data structure containing information about the total number of subsamples in each sample, a data structure containing information describing each subsample (e.g., what defines is a subsampling), or any other data structure containing subsampling-related data.

接下来,在一个实施例中,处理逻辑判断任一数据结构是否包含数据的重复序列(判断框408)。如果是肯定的判断,则处理逻辑就把数据的每个重复序列转换成对序列出现和重复序列出现次数的引用(处理块410)。Next, in one embodiment, processing logic determines whether any data structure contains a repeating sequence of data (decision block 408). If so, processing logic converts each repetition of the data into a reference to the occurrence of the sequence and the number of occurrences of the repetition (processing block 410).

而后,在处理块412中,处理逻辑利用特定的媒体文件格式(例如,JVT文件格式)将子采样元数据归入到与媒体数据相关联的文件当中。依据媒体文件格式,可以把子采样元数据与采样元数据存储在一起(例如,可以把子采样数据结构归入到包含采样数据结构的采样表框中),或者独立于采样元数据将其存储起来。Then, in processing block 412, processing logic incorporates the subsampled metadata into files associated with the media data using a particular media file format (eg, JVT file format). Depending on the media file format, the subsampling metadata can be stored together with the sampling metadata (for example, the subsampling data structure can be grouped into a sample table box containing the sampling data structure), or it can be stored separately from the sampling metadata stand up.

图5是用于在解码系统200上使用子采样元数据的方法500的一个实施例的流程图。最初,方法500始于如下处理逻辑,所述处理逻辑接收与已编码的媒体数据相关联的文件(处理块502)。可以从数据库(本地的或外部的)、编码系统100中或者从网络上的任何其它装置那里接收所述文件。所述文件包括定义媒体数据中的子采样的子采样元数据。FIG. 5 is a flowchart of one embodiment of a method 500 for using subsampled metadata on the decoding system 200 . Initially, method 500 begins with processing logic that receives a file associated with encoded media data (processing block 502). The file may be received from a database (local or external), from the encoding system 100, or from any other device on the network. The file includes subsampling metadata defining subsamples in the media data.

接下来,处理逻辑从文件中提取子采样元数据(处理块504)。正如上面所论述的那样,子采样元数据可以存储在一组数据结构中(例如,一组框)。Next, processing logic extracts subsampling metadata from the file (processing block 504). As discussed above, subsampling metadata can be stored in a set of data structures (eg, a set of boxes).

此外,在处理块506中,处理逻辑利用提取出的元数据来标识已编码的媒体数据中的子采样(存储在相同文件中或者存储在不同文件中),并且把各种子采样组合成将要发送给媒体解码器的分组,由此实现了用于流送的媒体数据的灵活分组化(例如,支持错误恢复力、可缩放性等等)。Additionally, in processing block 506, processing logic utilizes the extracted metadata to identify sub-samples in the encoded media data (stored in the same file or in different files) and combine the various sub-samples into Packets sent to a media decoder, thereby enabling flexible packetization of media data for streaming (eg, to support error resilience, scalability, etc.).

现在,将参照扩展的ISO媒体文件格式(称为扩展的MP4)来描述示例性的子采样元数据结构。将对本领域熟练技术人员显而易见的是,其它的媒体文件格式也易于被扩展以并入类似的用于存储子采样元数据的数据结构中。An exemplary subsampling metadata structure will now be described with reference to an extended ISO media file format, referred to as extended MP4. It will be apparent to those skilled in the art that other media file formats are readily extended to incorporate similar data structures for storing sub-sampled metadata.

图6举例说明了具有子采样的扩展的MP4媒体流模型。用电影602来代表显示数据(例如,包含同步音频和视频的显示)。所述电影602包括一组轨道604。每个轨道604都代表一个媒体数据流。每个轨道604都被分成采样606。每个采样606都代表在特定时点上的媒体数据的单元。采样606还被分成子采样608。在JVT标准中,子采样608可以代表NAL分组或单元,比如,画面的单个时间片、具有多个数据分区的时间片的一个数据分区、带内参数集、或SEI信息分组。作为选择,子采样606可以代表采样的任何其它的结构元素,比如,代表媒体中的空间或时间区域的已编码数据。在一个实施例中,能够把根据一些结构或语义准则的已编码媒体数据的任何分区都当作子采样对待。Figure 6 illustrates an extended MP4 media stream model with subsampling. Display data (eg, a display including simultaneous audio and video) is represented by movie 602 . The movie 602 includes a set of tracks 604 . Each track 604 represents a stream of media data. Each track 604 is divided into samples 606 . Each sample 606 represents a unit of media data at a particular point in time. Sample 606 is also divided into sub-samples 608 . In the JVT standard, subsamples 608 may represent NAL packets or units, such as a single time slice of a picture, one data partition of a time slice with multiple data partitions, an in-band parameter set, or a packet of SEI information. Alternatively, subsamples 606 may represent any other structural elements of sampling, such as encoded data representing spatial or temporal regions in the media. In one embodiment, any partition of the encoded media data according to some structural or semantic criteria can be treated as a sub-sample.

图7A-7L举例说明了用于存储子采样元数据的示例性数据结构。7A-7L illustrate exemplary data structures for storing subsampling metadata.

参照图7A,扩展含有按ISO媒体文件格式定义的采样元数据框的采样表框700,以便包含诸如子采样大小框702、子采样描述关联框704、子采样-采样框706和子采样描述框708之类的子采样访问框。在一个实施例中,子采样访问框的使用是任意的。Referring to FIG. 7A, the sample table box 700 containing the sample metadata box defined by the ISO media file format is extended to include items such as a subsample size box 702, a subsample description association box 704, a subsample-sample box 706, and a subsample description box 708. Subsampling access boxes like . In one embodiment, the use of subsampling access boxes is optional.

参照图7B,例如,可以把采样710分成诸如时间片712之类的时间片、诸如分区714之类的数据分区和诸如ROI 716之类的感兴趣区域(ROI)。这些示例中的每一个都代表不同种类的采样到子采样的划分。单个采样内的子采样可以具有不同的大小。Referring to FIG. 7B , for example, samples 710 may be divided into time slices such as time slice 712 , data partitions such as partition 714 , and regions of interest (ROIs) such as ROI 716 . Each of these examples represents a different kind of sampling-to-subsampling division. Subsamples within a single sample can have different sizes.

子采样大小框718包含:规定子采样大小框718的版本的版本字段、规定缺省子采样大小的子采样大小字段、用于提供轨道中的子采样数目的子采样计数字段和规定每个子采样大小的条目大小字段。如果将子采样大小字段设置成0,那么子采样就具有存储在子采样大小表720中的不同大小。如果不将子采样大小字段设置为0,那么它就规定表明子采样大小表720为空的恒定子采样大小。表720可以具有32位的固定大小或用于表示子采样大小的可变长字段。如果字段是长度变化的,那么子采样表就包含表明子采样大小字段字节长度的字段。The subsample size box 718 contains: a version field specifying the version of the subsample size box 718, a subsample size field specifying the default subsample size, a subsample count field providing the number of subsamples in a track, and a subsample count field specifying the The size of the entry size field. If the subsample size field is set to 0, then the subsample has a different size stored in the subsample size table 720 . If the subsample size field is not set to 0, it specifies a constant subsample size indicating that the subsample size table 720 is empty. Table 720 may have a fixed size of 32 bits or a variable length field for representing the subsample size. If the field is variable length, the subsample table contains a field indicating the byte length of the subsample size field.

参照图7C,子采样-采样框722包括:规定子采样-采样框722的版本的版本字段和提供表723中的条目数目的条目计数字段。子采样-采样表中的每个条目都包含:提供共享相同数目的子采样-每采样的那些采样的流送过程中的第一采样的索引的第一采样字段,和提供采样的流送过程内每个采样中的子采样数目的子采样-每采样字段。Referring to FIG. 7C , the subsampling-sampling box 722 includes a version field specifying the version of the subsampling-sampling box 722 and an entry count field providing the number of entries in the table 723 . Each entry in the subsample-sample table contains: a first-sample field that provides the index of the first sample in the streaming process for those samples that share the same number of subsample-per-samples, and the streaming process that provides the sample The number of subsamples in each sample within the subsample-per-sample field.

通过计算多少采样正在流送,用适当子采样-每采样乘上这个数值,并且再把所有流送的结果加起来,就能利用表723来找出轨道中的子采样的总数。Table 723 can be used to find the total number of subsamples in a track by counting how many samples are being streamed, multiplying this value by the appropriate subsamples-per-sample, and adding up the results for all streams.

参照图7D,子采样描述关联框724包括:规定子采样描述关联框724的版本的版本字段、表明正在描述的子采样(例如,NAL分组、感兴趣区域等)类型的描述类型标识符和提供表726中的条目数目的条目计数字段。表726中的每个条目都包括:表明子采样描述ID的子采样描述类型标识符字段,和给出在共享相同子采样描述ID的那些子采样的流送过程中的第一子采样的索引的第一子采样字段。Referring to FIG. 7D, the subsample description association box 724 includes a version field specifying the version of the subsample description association box 724, a description type identifier indicating the type of subsample (e.g., NAL packet, region of interest, etc.) being described, and a provision An entry count field for the number of entries in table 726. Each entry in table 726 includes: a subsample description type identifier field indicating the subsample description ID, and an index giving the first subsample in the streaming of those subsamples sharing the same subsample description ID The first subsample field of .

子采样描述类型标识符控制子采样描述ID字段的使用。也就是说,取决于描述类型标识符中规定的类型,子采样描述ID字段本身可以规定直接对ID本身内部的子采样描述进行编码的描述ID,或者子采样描述ID字段可以充当到不同表(即,如下所述的子采样描述表)的索引?例如,如果描述类型标识符表示JVT描述,则子采样描述ID标识符字段就可以包括规定JVT子采样的特性的代码。在这种情况下,子采样描述ID标识符字段可以是32位字段,具有用作为位屏蔽的最低有效的8位以表示子采样内的预定数据分区的存在,还具有用以表示NAL分组类型或用于将来扩展的高阶的24位。The subsample description type identifier controls the use of the subsample description ID field. That is, depending on the type specified in the description type identifier, the subsample description ID field itself may specify a description ID directly encoding the subsample description inside the ID itself, or the subsample description ID field may serve as a link to a different table ( i.e., the index of the subsampling description table) as described below? For example, if the description type identifier represents a JVT description, the subsample description ID identifier field may include a code specifying the nature of the JVT subsample. In this case, the Subsample Description ID Identifier field may be a 32-bit field with the least significant 8 bits used as a bitmask to indicate the presence of a predetermined data partition within the subsample, and a field to indicate the NAL packet type Or higher order 24 bits for future extensions.

参照图7E,子采样描述框728包括:规定子采样描述框728的版本的版本字段;提供表730中的条目数目的条目计数字段;提供子采样描述字段的描述类型的描述类型标识符字段,所述子采样描述字段提供了关于子采样的特性的信息;和包含一个或多个子采样描述条目730的表。子采样描述类型标识描述性信息涉及到的类型,并且对应于子采样描述关联表724中的相同字段。表730中的每个条目都包含具有关于子采样的特性的信息的子采样描述条目,所述子采样与这个描述条目相关联。描述条目的信息和格式取决于描述类型字段。例如,当描述类型是参数集时,那么每个描述条目都将包含该参数集的值。Referring to FIG. 7E, the subsampling description box 728 includes: a version field specifying the version of the subsampling description box 728; an entry count field providing the number of entries in the table 730; a description type identifier field providing the description type of the subsampling description field, The subsample description field provides information about the characteristics of the subsample; and a table containing one or more subsample description entries 730 . The subsample description type identifies the type to which the descriptive information relates, and corresponds to the same field in the subsample description association table 724 . Each entry in table 730 contains a subsample description entry with information about the characteristics of the subsample that is associated with this description entry. The information and format of the description entry depends on the description type field. For example, when the description type is a parameter set, then each description entry will contain the value of that parameter set.

描述性信息可以涉及参数集信息、与ROI有关的信息或刻画子采样特性所需的任何其它信息。对于参数集来讲,子采样描述关联表724表明与每个子采样相关联的参数集。在这种情况下子采样描述ID对应于参数集标识符。同样,如下述那样,子采样能够表示不同的感兴趣区域。把子采样定义为一个或多个已编码的宏块,然后利用子采样描述关联表来表示已编码宏块到不同区域的图像帧或图像的划分。例如,帧中的已编码宏块能够被分成具有两个子采样描述ID(例如,子采样描述ID 1和2)的前景宏块和背景宏块,以分别表明给前景区域和背景区域的赋值。The descriptive information may relate to parameter set information, ROI related information or any other information needed to characterize the subsampling. For parameter sets, the subsample description association table 724 indicates the parameter set associated with each subsample. In this case the subsample description ID corresponds to the parameter set identifier. Also, subsampling can represent different regions of interest as described below. The sub-sampling is defined as one or more coded macroblocks, and then the sub-sampling description association table is used to represent the division of the coded macroblocks into image frames or images in different regions. For example, coded macroblocks in a frame can be divided into foreground and background macroblocks with two subsample description IDs (e.g., subsample description IDs 1 and 2) to indicate assignments to foreground and background regions, respectively.

图7F举例说明了不同类型的子采样。子采样可以表示:不带分区的时间片732、具有多个数据分区的时间片734、时间片内的首部736、时间片中间的数据分区738、时间片末尾的数据分区740、SEI信息分组742等等。这些子采样类型中的每一种都可以与图7G所示出的8位屏蔽744的特定值相关联。正如上面所论述的那样,8位屏蔽可以形成32位子采样描述ID字段的8个最低有效位。图7H举例说明了具有等于“jvtd的描述类型标识符的子采样描述关联框724。表726包括存储图7G中的举例说明的值的32位子采样描述ID标识符字段。Figure 7F illustrates different types of subsampling. Subsampling can represent: time slice without partition 732, time slice with multiple data partitions 734, header within a time slice 736, data partition in the middle of a time slice 738, data partition at the end of a time slice 740, SEI information packet 742 etc. Each of these subsampling types may be associated with a particular value of the 8-bit mask 744 shown in FIG. 7G. As discussed above, an 8-bit mask can form the 8 least significant bits of the 32-bit subsample description ID field. Figure 7H illustrates a subsample description association box 724 with a description type identifier equal to "jvtd. Table 726 includes a 32-bit subsample description ID identifier field that stores the value illustrated in Fig. 7G.

图7H-7K举例说明了子采样描述关联表中的数据压缩。7H-7K illustrate data compression in subsampling description association tables.

参照图7I,未压缩的表726包括重复序列748的子采样描述ID的序列750。在已压缩的表746中,重复的序列750已经被压缩成对序列748的引用以及这个序列出现的次数。Referring to FIG. 71 , the uncompressed table 726 includes a sequence 750 of subsample description IDs that repeat the sequence 748 . In compressed table 746, repeated sequence 750 has been compressed into a reference to sequence 748 and the number of occurrences of this sequence.

在图7J中举例说明的一个实施例中,能够通过把序列出现的最高有效位用作为序列标志754的游程,把它的下一个23位用作为出现索引756,且把它的最低有效位用作为出现长度758,来在子采样描述ID标识符字段中对序列出现进行编码。如果将标志754设置成1,那么就表示这个条目是重复的序列出现。否则,这个条目是就子采样描述ID。发生索引756是序列第一次发生的子采样描述关联框724中的索引,而长度758表示重复的序列出现的长度。In one embodiment illustrated in FIG. 7J , it is possible to use the most significant bit of a sequence occurrence as the run of the sequence flag 754, use its next 23 bits as the occurrence index 756, and use its least significant bit as As occurrence length 758, the sequence occurrence is encoded in the subsample description ID identifier field. If flag 754 is set to 1, it means that this entry is a repeated sequence. Otherwise, this entry is the subsampling description ID. Occurrence index 756 is the subsample description index in association box 724 for the first occurrence of the sequence, and length 758 indicates the length of the repeated sequence occurrence.

在图7K中举例说明的另一个实施例中,使用重复序列出现表760来表示重复的序列出现。将子采样描述ID字段的最高有效位用作为序列标志762的游程,表明所述条目是否是子采样描述ID,或者用作为重复序列出现表760中的条目序列索引,所述重复序列出现表760是子采样描述关联框724的一部分。重复序列出现表760包括:规定重复序列中第一项的子采样描述关联框724中的索引的发生索引字段,和规定重复序列的长度的长度字段。In another embodiment, illustrated in FIG. 7K , repeated sequence occurrences are represented using a repeat sequence occurrence table 760 . The most significant bit of the subsample description ID field is used as the run of the sequence flag 762, indicating whether the entry is a subsample description ID, or as the sequence index of an entry in a repeating sequence occurrence table 760, which is part of the subsampling description association box 724. The repeat sequence occurrence table 760 includes an occurrence index field specifying the index in the subsample description association box 724 of the first item in the repeat sequence, and a length field specifying the length of the repeat sequence.

参数集parameter set

在某些媒体格式中,诸如像JVT,将“首部”信息包含媒体数据的适当解码所需的临界控制值从已编码的数据的剩余部分中分离出来/去耦合,并且将其存储在参数集当中。然后,已编码的数据能够使用诸如唯一标识符之类的机制来指代必要的参数集,而不是将流中的这些控制值与已编码的数据混合在一起。这种方法使得高层编码参数的发送与已编码的数据去耦合。同时,也通过将控制值的共用集共享为参数集而减少了冗余。In some media formats, such as JVT, the "header" information containing the critical control values needed for proper decoding of the media data is separated/decoupled from the remainder of the encoded data and stored in the parameter set among. The encoded data is then able to use mechanisms such as unique identifiers to refer to the necessary set of parameters, rather than mixing these control values with the encoded data in the stream. This approach decouples the sending of high-level encoding parameters from the encoded data. At the same time, redundancy is also reduced by sharing a common set of control values as a parameter set.

为了支持使用参数集的已存储媒体流的有效传输,发送器或播放器必须能快速地将已编码数据链接于相应的参数,以便了解参数集必须被发送或访问的时间和地点。本发明的一个实施例通过存储把参数集与相应的部分媒体数据之间的关联性指定为媒体文件格式中的参数集元数据的数据,而提供了这种能力。To support efficient transmission of stored media streams using parameter sets, a sender or player must be able to quickly link the encoded data to the corresponding parameters in order to know when and where the parameter sets must be sent or accessed. One embodiment of the present invention provides this capability by storing data specifying the association between parameter sets and corresponding portions of media data as parameter set metadata in a media file format.

图8和9举例说明了分别由编码系统100和解码系统200执行的用于存储和检索参数集元数据的过程。所述过程可以由如下处理逻辑来执行,所述处理逻辑可以包括硬件(例如,电路、专用逻辑等)、软件(比如运行在通用计算机系统或专用机器上的)或者上述两者的结合。8 and 9 illustrate the processes performed by encoding system 100 and decoding system 200 for storing and retrieving parameter set metadata, respectively. The processes may be performed by processing logic that may comprise hardware (eg, circuitry, dedicated logic, etc.), software (eg, run on a general purpose computer system or a dedicated machine), or a combination of both.

图8是用于在编码系统100创建参数集元数据的方法800的一个实施例的流程图。最初,方法800始于如下处理逻辑,所述处理逻辑接收具有已编码的媒体数据的文件(处理块802)。所述文件包括规定如何对部分媒体数据进行解码的编码参数集。接下来,处理逻辑检查称为参数集的编码参数集与相对应的部分媒体数据之间的关系(处理块804),并利用媒体数据部分来创建定义参数集及其关联的参数集元数据(处理块806)。所述媒体数据部分可以用采样或子采样来表示。FIG. 8 is a flowchart of one embodiment of a method 800 for creating parameter set metadata at encoding system 100 . Initially, method 800 begins with processing logic that receives a file with encoded media data (processing block 802). The file includes a set of encoding parameters specifying how portions of the media data are to be decoded. Next, processing logic examines the relationship between a set of encoding parameters, called a parameter set, and the corresponding portion of the media data (processing block 804), and utilizes the media data portion to create the defined parameter set and its associated parameter set metadata ( Processing block 806). The media data portions may be represented in samples or subsamples.

在一个实施例中,将参数集元数据编组成一组预定的数据结构(例如,一组框)。该预定的数据结构组可以包括:包含关于参数集的描述性信息的数据结构,和包含定义采样与相应参数集之间的关联的信息的数据结构。在一个实施例中,该预定的数据结构组还包括:包含定义子采样与对应参数集之间的关联的信息的数据结构。包含子采样与参数集之间关联的信息的数据结构可以或可以不必重载(override)包含采样与参数集之间关联的信息的数据结构。In one embodiment, parameter set metadata is organized into a set of predetermined data structures (eg, a set of boxes). The predetermined set of data structures may include data structures containing descriptive information about the parameter sets, and data structures containing information defining associations between samples and corresponding parameter sets. In one embodiment, the predetermined set of data structures further comprises: data structures containing information defining associations between subsamples and corresponding parameter sets. The data structure containing information on associations between subsamples and parameter sets may or may not override the data structure containing information on associations between samples and parameter sets.

接下来,在一个实施例中,处理逻辑判断任何参数集数据结构是否包含数据的重复序列(判断框808)。如果这一判断是肯定的,那么处理逻辑就把数据的每个重复序列转换成对序列出现的引用和序列出现的次数(处理块810)。Next, in one embodiment, processing logic determines whether any parameter set data structures contain repeating sequences of data (decision block 808). If this determination is positive, processing logic converts each repeated sequence of data into a reference to the sequence occurrence and the number of occurrences of the sequence (processing block 810).

然后,在处理块812中,处理逻辑利用特定的媒体文件格式(例如,JVT文件格式)将参数集元数据归入到与媒体数据相关联的文件中。取决于媒体文件格式,可以将参数集元数据与轨道元数据和/或采样元数据存储在一起(例如,可以将包含关于参数集的描述性信息的数据结构归入到轨道框中,且可以将包含关联信息的数据结构归入到采样表框中),或者独立于轨道元数据和/或采样元数据而存储所述参数集元数据。Then, in processing block 812, processing logic incorporates the parameter set metadata into a file associated with the media data using a particular media file format (eg, the JVT file format). Depending on the media file format, parameter set metadata may be stored together with track metadata and/or sample metadata (e.g., data structures containing descriptive information about parameter sets may be grouped into track boxes, and A data structure containing associated information is subsumed into a sample table box), or the parameter set metadata is stored independently of the track metadata and/or sample metadata.

图9是用于在解码系统200上使用参数集元数据的方法900的一个实施例的流程图。最初,方法900始于如下处理逻辑,所述处理逻辑接收与已编码的媒体数据相关联的文件(处理块902)。可以从数据库(本地的或外部的)、编码系统100中接收所述文件,或者从网络上的任何其它装置那里接收所述文件。所述文件包括定义媒体数据的参数集以及参数集与相对应的部分媒体数据(例如,相对应的采样或子采样)之间的关联的参数集元数据。FIG. 9 is a flowchart of one embodiment of a method 900 for using parameter set metadata on the decoding system 200 . Initially, method 900 begins with processing logic that receives a file associated with encoded media data (processing block 902). The file may be received from a database (local or external), from the encoding system 100, or from any other device on the network. The file includes parameter set metadata defining parameter sets of media data and associations between parameter sets and corresponding portions of media data (eg, corresponding samples or subsamples).

接下来,处理逻辑从文件中提取参数集元数据(处理块904)。正如上面所论述的那样,参数集元数据可以存储在一组数据结构(例如,一组框)中。Next, processing logic extracts parameter set metadata from the file (processing block 904). As discussed above, parameter set metadata can be stored in a set of data structures (eg, a set of boxes).

此外,在处理块906中,处理逻辑利用提取出的元数据来判断哪个参数集与特定的媒体数据部分相关联(例如,采样或子采样)。然后,可以使用这一信息来控制媒体数据部分以及相应参数集的传输时间。也就是说,必须在包含采样或子采样的分组前面或者与包含采样或子采样的分组一起发送将被用来对特定采样或子采样进行解码的参数集。Additionally, in processing block 906, processing logic utilizes the extracted metadata to determine which parameter set is associated with a particular media data portion (eg, sample or sub-sample). This information can then be used to control the transmission time of media data parts and corresponding parameter sets. That is, the set of parameters to be used to decode a particular sample or subsample must be sent before or with the packet containing the sample or subsample.

因此,参数集元数据的使用实现了参数集在更可靠的信道上的独立发送,减少了错误几率或令媒体流的一部分丢失的数据损失几率。Thus, the use of parameter set metadata enables independent transmission of parameter sets on a more reliable channel, reducing the chance of errors or data loss where part of the media stream is lost.

现在,将参照扩展的ISO媒体文件格式(称为扩展的ISO)来描述示例性的参数集元数据结构。然而,应当指出的是,其它的媒体文件格式也可以被扩展,以便并入用于存储参数集元数据的各种数据结构。An exemplary parameter set metadata structure will now be described with reference to the extended ISO media file format (referred to as extended ISO). It should be noted, however, that other media file formats may also be extended to incorporate various data structures for storing parameter set metadata.

图10A-10E举例说明了用于存储参数集元数据的示例性数据结构。10A-10E illustrate exemplary data structures for storing parameter set metadata.

参照图10A,扩展包含按ISO文件格式定义的轨道元数据框的轨道框1002,以便包括参数集描述框1004。另外,扩展包含按ISO文件格式定义的采样元数据框的采样表框1006,以便包括采样到参数集框1008。在一个实施例中,采样表框1006包括子采样到参数集框,该子采样到本参数集框可以重载如下面将更详细论述的采样到参数集框1008。Referring to FIG. 10A , a track box 1002 containing a track metadata box defined in the ISO file format is extended to include a parameter set description box 1004 . Additionally, the Sample Table box 1006 containing the Sample Metadata box defined in the ISO file format is extended to include a Sample to Parameter Set box 1008 . In one embodiment, the sample list box 1006 includes a subsample to parameter set box, which may override the sample to parameter set box 1008 as will be discussed in more detail below.

在一个实施例中,参数集元数据框1004和1008是强制性的。在另一个实施例中,只有参数集描述框1004是强制性的。在又一个实施例中,所有参数集元数据框都是任意的。In one embodiment, parameter set metadata boxes 1004 and 1008 are mandatory. In another embodiment, only the parameter set description box 1004 is mandatory. In yet another embodiment, all parameter set metadata boxes are arbitrary.

参照图10B,参数集描述框1010包含:规定参数集描述框1010的版本的版本字段、用以提供表1012中的条目数目的参数集描述计数字段和包含对应参数集本身的条目的参数集条目字段。Referring to Figure 10B, the parameter set description box 1010 contains: a version field specifying the version of the parameter set description box 1010, a parameter set description count field to provide the number of entries in the table 1012, and a parameter set entry containing an entry for the corresponding parameter set itself field.

可以从采样层或子采样层那里引用参数集。参照图10C,采样到参数集框1014提供了从采样层对参数集的引用。采样到参数集框1014包括:规定采样到参数集框1014的版本的版本字段、规定缺省参数设置ID的缺省参数设置ID字段、提供表1016中的条目数目的条目计数字段。表1016中的每个条目都包含:提供共享相同参数集的那些采样的游程中第一采样的索引的第一采样字段,和指定到参数集描述框1010的索引的参数集索引。如果缺省参数设置ID等于0,那么采样就具有存储在表1016中的不同参数集。否则,使用恒定参数设置并且无数组跟随。Parameter sets can be referenced from a sampling layer or a subsampling layer. Referring to FIG. 10C , a sample to parameter set box 1014 provides a reference to a parameter set from the sampling layer. Sampled to parameter set box 1014 includes a version field specifying the version of sampled to parameter set box 1014 , a default parameter set ID field specifying a default parameter set ID, an entry count field providing the number of entries in table 1016 . Each entry in table 1016 contains a first sample field that provides the index of the first sample in the run of those samples that share the same parameter set, and a parameter set index that specifies the index to parameter set description box 1010 . If the default parameter set ID is equal to 0, then the sample has a different parameter set stored in table 1016. Otherwise, constant parameter settings are used and no array follows.

在一个实施例中,通过把每个重复序列转换成初始序列到的引用和这个序列出现的次数,来压缩表1016中的数据,正如上面结合子采样描述关联表更详细论述的那样。In one embodiment, the data in table 1016 is compressed by converting each repeating sequence into a reference to the original sequence and the number of occurrences of this sequence, as discussed in more detail above in connection with subsampling description association tables.

可以通过定义参数集与子采样之间的关联,来从子采样层引用参数集。在一个实施例中,参数集与子采样之间的关联是利用上述的子采样描述关联框来定义的。图10D举例说明了具有引用参数集的描述类型标识符(例如,描述类型标识符等于“解析”)的子采样描述关联框1018。根据这个描述类型标识符,表1020中的子采样描述ID表明参数集描述框1010中的索引。A parameter set can be referenced from a subsampling layer by defining an association between the parameter set and the subsampling. In one embodiment, the association between the parameter set and the sub-sampling is defined using the above-mentioned sub-sampling description association box. FIG. 10D illustrates a subsampling description association box 1018 with a description type identifier referencing a parameter set (eg, description type identifier equal to "parse"). The subsample description ID in table 1020 indicates the index in parameter set description box 1010 according to this description type identifier.

在一个实施例中,当具有引用参数集的描述类型标识符的子采样描述关联框1018存在时,它重载采样到参数集框1014。In one embodiment, the subsample description association box 1018 reloads the sample to parameter set box 1014 when present with a description type identifier referencing the parameter set.

参数集可以在创建参数集时与使用参数集来对相对应的部分媒体数据进行解码时之间变化。如果这类变化发生的话,解码系统200就接收规定对参数集的改变的参数更新分组。参数集元数据包括标识更新前后的参数集状态的数据。A parameter set may vary between when the parameter set is created and when the parameter set is used to decode the corresponding portion of media data. If such changes occur, decoding system 200 receives parameter update packets specifying changes to the parameter set. The parameter set metadata includes data identifying the state of the parameter set before and after the update.

参照图10E,参数集描述框1010包括:在t0时创建的初始参数设置1022的条目、和响应于在时间t1时接收的参数更新分组1026而创建的更新参数设置1024的条目。子采样描述关联框1018把两个参数集与相对应的子采样关联起来。Referring to FIG. 10E , parameter set description box 1010 includes an entry for initial parameter set 1022 created at time t0 and an entry for updated parameter set 1024 created in response to parameter update packet 1026 received at time t1 . A subsample description association block 1018 associates two parameter sets with corresponding subsamples.

采样组sampling group

虽然轨道内的采样可能具有编组成如下序列的采样的各种逻辑编组(分区),其中所述序列表示媒体数据中的高层结构,但是现有的文件格式未提供用于表示和存储这类编组的便利机制。例如,高级的编码格式(诸如JVT)根据单个轨道内采样的相互依赖性而将这些采样编组成组群。当网络条件需要时,这些组群(这里称为序列或采样组)可以用来标识可任意处理的采样链,由此支持时间的可缩放性。对定义文件格式中的采样组的元数据进行存储,使媒体的发送器能够容易而有效地实现上述特征。While samples within a track may have various logical groupings (partitions) of samples grouped into sequences that represent high-level structures in the media data, existing file formats provide no means for representing and storing such groupings convenience mechanism. For example, advanced encoding formats such as JVT group samples within a single track into groups according to their interdependence. These groups (referred to herein as sequences or sample groups) can be used to identify chains of samples that can be processed arbitrarily when network conditions require it, thereby supporting temporal scalability. Storing metadata defining groups of samples in a file format enables senders of media to implement the above features easily and efficiently.

采样组的一个示例是这样的一组采样,它们的帧间依赖性允许独立于其它采样而对它们进行解码。在JVT中,这类采样组称为增强式画面组(增强式GOP)。在增强式GOP中,可以把采样分成子序列。每个子序列都包括这样的一组采样,它们相互依赖并且可以把它们作为单元来处理。另外,可以把增强式GOP的采样分层地构造成层,以便仅仅从较低层中的采样预测较高层中的采样,由此允许在不影响解码其它采样的能力的情况下处理最高层的采样。包括不依赖于任何其它层中的采样的那些采样的最低层称为基层。非基层的任何其它层都称为增强层。An example of a sample group is a set of samples whose inter-frame dependencies allow them to be decoded independently of other samples. In JVT, such groups of samples are called Enhanced Group of Pictures (Enhanced GOP). In an enhanced GOP, samples may be divided into subsequences. Each subsequence consists of a set of samples that are interdependent and can be treated as a unit. In addition, the samples of an enhanced GOP can be structured hierarchically into layers such that samples in higher layers are only predicted from samples in lower layers, thereby allowing processing of the highest layer's sampling. The lowest layer that includes those samples that do not depend on samples in any other layer is called the base layer. Any other layer than the base layer is called an enhancement layer.

图11举例说明了示例性的增强式GOP,在其中采样被分成两层—基层1102和增强层1104,以及两个子序列1106和1108。两个子序列1106和1108中的每一个能够被彼此独立地丢弃。FIG. 11 illustrates an exemplary enhanced GOP in which samples are divided into two layers—a base layer 1102 and an enhancement layer 1104 , and two subsequences 1106 and 1108 . Each of the two subsequences 1106 and 1108 can be discarded independently of each other.

图12和13举例说明了分别由编码系统100和解码系统200执行的用于存储和检索采样组元数据的过程。所述过程可以由如下处理逻辑,所述处理逻辑可以包括硬件(例如,电路、专用逻辑等)、软件(比如运行在通用计算机系统或专用机器上的)或者上述两者的结合。12 and 13 illustrate the processes performed by encoding system 100 and decoding system 200, respectively, for storing and retrieving sample group metadata. The process may be comprised of processing logic that may include hardware (eg, circuitry, dedicated logic, etc.), software (eg, run on a general-purpose computer system or a dedicated machine), or a combination of both.

图12是用于在编码系统100上创建采样组元数据的方法1200的一个实施例的流程图。最初,方法1200始于如下处理逻辑,所述处理逻辑接收具有已编码媒体数据的文件(处理块1202)。媒体数据的轨道内的采样具有一定的相互依赖性。例如,所述轨道可以包括:不依赖于任何其它采样的I帧、依赖于单个在前采样的P帧、以及依赖于两个在前采样的B帧,还包含I帧、P帧和B帧的任意组合。根据它们的相互依赖性,能够把轨道中的采样逻辑上组合成采样组(例如,增强式GOP、层、子序列等等)。FIG. 12 is a flowchart of one embodiment of a method 1200 for creating sample group metadata on encoding system 100 . Initially, method 1200 begins with processing logic that receives a file having encoded media data (processing block 1202). The samples within a track of media data have certain interdependencies. For example, the track may include: I-frames that do not depend on any other samples, P-frames that depend on a single previous sample, and B-frames that depend on two previous samples, including I-frames, P-frames, and B-frames any combination of . Samples in a track can be logically grouped into sample groups (eg, enhanced GOPs, layers, subsequences, etc.) according to their interdependencies.

接下来,处理逻辑检查媒体数据以便标识每个轨道中的采样组(处理块1204),并创建描述所述采样组的采样组元数据,并且定义将在每个采样组中包含哪些采样(处理块1206)。在一个实施例中,把采样组元数据编组成一组预定数据结构(例如,一组框)。预定的数据结构组可以包括:包含关于每个采样组的描述性信息的数据结构和包含标识每个采样组中所含的信息的数据结构。Next, processing logic examines the media data to identify sample groups in each track (processing block 1204), and creates sample group metadata describing the sample groups, and defines which samples will be included in each sample group (processing block 1204). block 1206). In one embodiment, sample group metadata is grouped into a set of predetermined data structures (eg, a set of boxes). The predetermined set of data structures may include: a data structure containing descriptive information about each sample group and a data structure containing information identifying what is contained in each sample group.

接下来,在一个实施例中,处理逻辑判断任何采样分组数据结构是否包含数据的重复序列(判断框1208)。如果这个判断是肯定的,则处理逻辑就把数据的每个重复序列转换成到序列出现的引用和序列出现的次数(处理块1210)。Next, in one embodiment, processing logic determines whether any of the sample packet data structures contain repeating sequences of data (decision block 1208). If this determination is positive, then processing logic converts each repeated sequence of data into a reference to the sequence occurrence and the number of sequence occurrences (processing block 1210).

然后,在处理块1212上,处理逻辑利用特定的媒体文件格式(例如,JVT文件格式)将采样组元数据归入到与媒体数据相关联的文件中。取决于媒体文件格式,可以将采样组元数据与采样元数据(例如,可以把采样分组数据结构包含到采样表框中)存储在一起,或者独立于采样元数据将所述采样组元数据存储起来。Then, at processing block 1212, processing logic incorporates the sample group metadata into a file associated with the media data using a particular media file format (eg, the JVT file format). Depending on the media file format, the sample group metadata can be stored together with the sample metadata (for example, a sample group data structure can be included into the sample table box), or it can be stored independently of the sample metadata stand up.

图13是用于在解码系统200上使用采样组元数据的方法1300的一个实施例的流程图。最初,方法1300始于如下处理逻辑,所述处理逻辑接收与已编码的媒体数据相关联的文件(处理块1302)。可以从数据库(本地的或外部的)、编码系统100中或者从网络上的任何其它装置那里接收所述文件。所述文件包括定义媒体数据中的采样组的采样组元数据。FIG. 13 is a flowchart of one embodiment of a method 1300 for using sample group metadata on the decoding system 200 . Initially, method 1300 begins with processing logic that receives a file associated with encoded media data (processing block 1302). The file may be received from a database (local or external), from the encoding system 100, or from any other device on the network. The file includes sample group metadata defining sample groups in the media data.

接下来,处理逻辑从文件中提取采样组元数据(处理块1304)。正如上面所论述的那样,可以在数据结构组(例如,一组框)中存储采样组元数据。Next, processing logic extracts sample group metadata from the file (processing block 1304). As discussed above, sampling group metadata can be stored in a data structure group (eg, a group of boxes).

此外,在处理块1306上,处理逻辑利用提取出的采样组元数据来标识采样链,可以在不影响解码其它采样的能力的情况下处理所述采样链。在一个实施例中,这个信息可以用来访问特定采样组中的采样,并用来判断响应于网络能力方面的变化能丢弃哪些采样。在其它实施例中,利用采样组元数据来过滤采样,以便仅仅处理或再现轨道中的一部分采样。Furthermore, at processing block 1306, processing logic utilizes the extracted sample group metadata to identify chains of samples that can be processed without affecting the ability to decode other samples. In one embodiment, this information can be used to access samples in a particular sample group and to determine which samples can be discarded in response to changes in network capabilities. In other embodiments, the samples are filtered using the sample group metadata so that only a portion of the samples in the track are processed or rendered.

因此,采样组元数据方便了对采样的选择性访问和可缩放性。Thus, sample group metadata facilitates selective access and scalability to samples.

现在,将参照扩展的ISO媒体文件格式(称为扩展的MP4)来描述示例性采样组元数据结构。然而,应当指出的是,其它媒体文件格式也可以被扩展,以便并入用于存储采样组元数据的各种数据结构。An exemplary sample group metadata structure will now be described with reference to an extended ISO media file format, referred to as extended MP4. It should be noted, however, that other media file formats may also be extended to incorporate various data structures for storing sample group metadata.

图14A-14E举例说明了用于存储采样组元数据的示例性数据结构。14A-14E illustrate exemplary data structures for storing sample group metadata.

参照图14A,扩展包含按MP4定义的采样元数据框的采样表框1400,以便包含采样组框1402和采样组描述框1404。在一个实施例中,采样组元数据框1402和1404是任意的。Referring to FIG. 14A , the sample table box 1400 containing the sample metadata box defined by MP4 is expanded to include a sample group box 1402 and a sample group description box 1404 . In one embodiment, sample group metadata boxes 1402 and 1404 are arbitrary.

参照图14B,使用采样组框1406来找出特定采样组中所含的一组采样。允许采样组框1406的多个实例,以便对应于采样组的不同类型(例如,增强式GOP、子序列、层、参数集等)。采样组框1406包含:规定采样组框1406版本的版本字段、用以提供表1408中的条目数目的条目计数字段、用以标识采样组类型的采样组标识符字段、提供相同采样组中所含的那些采样的流送过程中的第一采样的索引的第一采样字段、和规定到采样组描述框的索引的采样组描述索引。Referring to Figure 14B, the sample group box 1406 is used to find the set of samples contained in a particular sample group. Multiple instances of the sample group box 1406 are allowed to correspond to different types of sample groups (eg, enhanced GOP, subsequence, layer, parameter set, etc.). The sample group box 1406 contains: a version field specifying the version of the sample group box 1406, an entry count field to provide the number of entries in the table 1408, a sample group identifier field to identify the sample group type, The first sample field of the index of the first sample in the streaming process of those samples, and the sample group description index specifying the index to the sample group description box.

参照图14C,采样组描述框1410提供了关于采样组的特性的信息。采样组描述框1410包含:规定采样组描述框1410的版本的版本字段、用以提供表1412中的条目数目的条目计数字段、用以标识采样组类型的采样组标识符字段、和用以提供采样组描述符的采样组描述字段。Referring to Figure 14C, the sample group description box 1410 provides information about the characteristics of the sample group. The sample group description box 1410 contains: a version field specifying the version of the sample group description box 1410, an entry count field to provide the number of entries in the table 1412, a sample group identifier field to identify the sample group type, and a sample group identifier field to provide The sample group description field of the sample group descriptor.

参照图14D,举例说明了层(“layr”)采样组类型的采样组框1416的使用。根据采样的相互依赖性将采样1至11分成三层。在第0层(基层)中,采样(采样1、6和11)都只彼此依赖,而不是依赖于任何其它层中的采样。在第1层中,采样(采样2、5、7、10)依赖于较低层(即,第0层)中的采样和此第1层内的采样。在第2层中,采样(采样3、4、8、9)依赖于较低层(第0和1层)中的采样和此第2层内的采样。因此,能够在不影响对来自于较低的第0和1层的采样解码的能力的情况下安排第2层的采样。Referring to Figure 14D, the use of the sample group box 1416 of the layer ("layr") sample group type is illustrated. Samples 1 to 11 are divided into three layers according to their interdependence. In layer 0 (the base layer), samples (samples 1, 6, and 11) all only depend on each other, not on samples in any other layer. In layer 1, samples (samples 2, 5, 7, 10) depend on samples in lower layers (ie, layer 0) and samples within this layer 1. In layer 2, samples (samples 3, 4, 8, 9) depend on samples in lower layers (layers 0 and 1) and samples within this layer 2. Thus, layer 2 samples can be arranged without affecting the ability to decode samples from lower layers 0 and 1 .

采样组框1416中的数据说明了采样与所述层之间的上述关联。如图所示,这个数据包括重复的层模式1414,可以通过把每个重复的层模式转换成对初始层模式的引用和这个模式出现的次数,来压缩所述重复的层模式,正如上面详细论述的那样。The data in the Samples group box 1416 illustrates the aforementioned association between samples and the layer. As shown, this data includes repeated layer patterns 1414 that can be compressed by converting each repeated layer pattern into a reference to the initial layer pattern and the number of occurrences of the pattern, as detailed above as discussed.

参照图14E,举例说明了子序列(“sseq”)采样组类型的采样组框1418的使用。根据采样的相互依赖性将采样1至11分成四个子序列。除在第0层上的子序列0之外,每个子序列都包括无其它的子序列依赖于它的采样。因此,必要时可以把子序列中的采样作为单元来安排。Referring to Figure 14E, the use of the sample group box 1418 of the subsequence ("sseq") sample group type is illustrated. Samples 1 to 11 are divided into four subsequences according to their interdependencies. With the exception of subsequence 0 at level 0, each subsequence includes samples on which no other subsequence depends. Therefore, samples in subsequences can be arranged as units if necessary.

采样组框1418中的数据说明了采样与子序列之间的关联性。这个数据允许在相对应的子序列的起始处随机访问采样。The data in Samples group box 1418 describes the association between samples and subsequences. This data allows random access to samples at the beginning of the corresponding subsequence.

流切换stream switching

在典型的流送的情形下,其中一个关键要求就是:响应于改变网络条件而缩放压缩数据的比特率。实现这个的简单方法就是:对具有用于典型网络条件的不同比特率和质量设置的多个流进行编码。然后,能够响应于网络条件而在这些预先编码的流当中切换服务器。In a typical streaming situation, one of the key requirements is to scale the bitrate of the compressed data in response to changing network conditions. A simple way to achieve this is to encode multiple streams with different bitrate and quality settings for typical network conditions. Servers can then be switched among these pre-encoded streams in response to network conditions.

JVT标准提供了称为切换画面的新型画面,在不需要两个画面都使用用于预测的相同帧的情况下,所述切换画面允许一个画面同等地重构另一个。具体来讲,JVT提供了两种类型的切换画面:类似于I帧的SI画面,独立于任何其它画面而对其进行编码;和SP画面,参照其它画面来对其进行编码。响应于改变传送条件,可以使用切换画面来实现在具有不同的比特率和质量设置的流当中进行切换,以便提供错误恢复力,并且实现像快速进带和倒带一样的变速杆模式(trick mode)。The JVT standard provides a new type of picture called a cutout picture that allows one picture to reconstruct the other equally without requiring both pictures to use the same frame for prediction. Specifically, JVT provides two types of switching pictures: SI pictures, which are like I-frames, are coded independently of any other pictures; and SP pictures, which are coded with reference to other pictures. Switching between streams with different bitrate and quality settings in response to changing delivery conditions can be achieved using a toggle screen to provide error resilience and enable trick mode like fast forward and rewind. ).

然而,为了当实现流切换、错误恢复力、变速杆模式及其它特征时有效地使用JVT切换画面,播放器必须知道所存储的媒体数据中的哪些采样具有可选的表示并且它们的依赖物是什么。现有的文件格式没有提供这种能力。However, in order to effectively switch scenes using the JVT when implementing stream switching, error resilience, stick mode, and other features, the player must know which samples in the stored media data have alternative representations and their dependencies are What. Existing file formats do not provide this capability.

本发明的一个实施例通过定义切换采样集而解决了上述限制。切换采样集表示这样的一组采样,它们的解码值相等,但是它们可以使用不同的参考采样。参考采样是用于预测另一个采样的值的采样。切换采样集的每个成员称为切换采样。图15A举例说明了用于比特流切换的切换采样集的使用。One embodiment of the present invention addresses the above limitations by defining a switching sample set. A switched sample set represents a set of samples whose decoded values are equal, but which may use different reference samples. A reference sample is a sample used to predict the value of another sample. Each member of a switched sample set is called a switched sample. Figure 15A illustrates the use of switched sample sets for bitstream switching.

参照图15A,流1和流2是两个具有不同质量和比特率参数的相同内容的编码。采样S12是不出现在每一个流中的SP画面,它用于实现从流1到流2的切换(切换是方向性特性)。采样S12和S2是包含在切换采样集中的。S1和S12两者都是根据轨道1中的采样P12预测的,而S2是根据轨道2中的采样P22预测的。尽管采样S12和S2使用不同的参考采样,但是它们的解码值是相等的。因此,通过切换采样S12能够实现从流1到流2的切换(在流1中的采样1处和流2中的S2处)。Referring to FIG. 15A, stream 1 and stream 2 are two encodings of the same content with different quality and bit rate parameters. Sample S12 is an SP picture that does not appear in every stream, and it is used to realize switching from stream 1 to stream 2 (switching is a directional property). Samples S12 and S2 are included in the switching sample set. Both S1 and S12 are predicted from sample P12 in track 1, while S2 is predicted from sample P22 in track 2. Although samples S12 and S2 use different reference samples, their decoded values are equal. Thus, switching from stream 1 to stream 2 can be achieved by switching sample S12 (at sample 1 in stream 1 and at S2 in stream 2).

图16和17举例说明了分别由编码系统100和解码系统200执行的用于存储和检索切换采样元数据的过程。所述过程可以由如下处理逻辑来执行,所述处理逻辑可以包括硬件(例如,电路、专用逻辑等)、软件(比如在通用计算机系统或专用机上执行的)或上述两者的结合。16 and 17 illustrate the processes performed by the encoding system 100 and the decoding system 200, respectively, for storing and retrieving switching sample metadata. The process may be performed by processing logic that may include hardware (eg, circuitry, dedicated logic, etc.), software (eg, executed on a general-purpose computer system or a dedicated machine), or a combination of both.

图16是用于在编码系统100上创建切换采样元数据的方法1600的一个实施例的流程图。最初,方法1600始于如下处理逻辑,所述处理逻辑接收具有已编码的媒体数据的文件(处理块1602)。所述文件包括用于媒体数据的一个或多个可选的编码(例如,用于典型网络条件的不同带宽和质量设置)。所述可选的编码包括一个或多个切换画面。这类画面可以包含在可选的媒体数据流之内,或者作为实现比如错误恢复力或变速杆模式之类的特定特征的独立实体。用于创建这些轨道和切换画面的方法并不是本发明指定的,而是各种可能性都对于本领域熟练技术人员而言将是显而易见的。例如,每对包含可选编码的轨道之间的切换采样的定期(例如,每一秒)设置。FIG. 16 is a flowchart of one embodiment of a method 1600 for creating switching sample metadata on encoding system 100 . Initially, method 1600 begins with processing logic that receives a file with encoded media data (processing block 1602). The file includes one or more optional encodings for the media data (eg, different bandwidth and quality settings for typical network conditions). The optional encoding includes one or more switching pictures. Such scenes may be included within optional media data streams, or as separate entities implementing specific features such as error resilience or gearshift modes. The method for creating these tracks and cutscenes is not specified by the present invention, but various possibilities will be apparent to those skilled in the art. For example, a periodic (eg, every second) setting for switching samples between each pair of tracks containing optional encoding.

接下来,当使用不同的参考采样时,处理逻辑检查文件以创建切换采样集(处理块1604),所述切换采样集包括那些具有相同解码值的采样,并且创建定义媒体数据的切换采样集的切换采样元数据并描述切换采样集内的采样(处理块1606)。在一个实施例中,把切换采样元数据编组成预定的数据结构,比如包含一组嵌套表的表框。Next, processing logic examines the file to create a switched sample set (processing block 1604) that includes those samples that have the same decoded value when a different reference sample is used, and creates the Switch sample metadata and describe the samples within the switch sample set (processing block 1606). In one embodiment, the switching sample metadata is organized into a predetermined data structure, such as a table box containing a set of nested tables.

接下来,在一个实施例中,处理逻辑判断切换采样元数据结构是否包含数据的重复序列(判断框1608)。如果这一判断是肯定的,则处理逻辑就把数据的每个重复序列转换成到序列出现的引用和序列出现的次数(处理块1610)。Next, in one embodiment, processing logic determines whether the toggle sample metadata structure contains a repeating sequence of data (decision block 1608). If this determination is positive, then processing logic converts each repeated sequence of data into a reference to the sequence occurrence and the number of sequence occurrences (processing block 1610).

然后,在处理块1612中,处理逻辑利用特定的媒体文件格式(例如,JVT文件格式)将切换采样元数据包含到与媒体数据相关联的文件中。在一个实施例中,切换采样元数据可以存储在为流切换而标明的独立轨道中。在另一个实施例中,将切换采样元数据与采样元数据一起存储起来(例如,可以把序列数据结构包含在采样表框中)。Then, in processing block 1612, processing logic includes switched sample metadata into the file associated with the media data using a particular media file format (eg, the JVT file format). In one embodiment, switch sample metadata may be stored in a separate track marked for stream switching. In another embodiment, switching sample metadata is stored with the sample metadata (eg, a sequence data structure may be included in a sample table box).

图17是用于在解码系统200上使用切换采样元数据的方法1700的一个实施例的流程图。最初,方法1700始于如下处理逻辑,所述处理逻辑接收与已编码的媒体数据相关联的文件(处理块1702)。可以从数据库(本地的或外部的)、编码系统100中接收所述文件,或者从网络上的任何其它装置那里接收所述文件。所述文件包括定义与媒体数据相关联的切换采样集的切换采样元数据。FIG. 17 is a flowchart of one embodiment of a method 1700 for using switching sample metadata on the decoding system 200 . Initially, method 1700 begins with processing logic that receives a file associated with encoded media data (processing block 1702). The file may be received from a database (local or external), from the encoding system 100, or from any other device on the network. The file includes switch sample metadata defining a set of switch samples associated with the media data.

接下来,处理逻辑从文件中提取切换采样元数据(处理块1704)。正如上面所论述的那样,切换采样元数据可以存储在诸如包含一组嵌套表的表框之类的数据结构中。Next, processing logic extracts switched sample metadata from the file (processing block 1704). As discussed above, toggle sampling metadata may be stored in a data structure such as a table box containing a set of nested tables.

此外,在处理块1706中,处理逻辑利用提取出的元数据来找出包含特定采样的切换采样集,并从所述切换采样集中选择可选的采样。响应于改变网络条件,可以使用作为和初始采样具有相同解码值的可选采样来在两个不同编码的比特流之间进行切换,以便提供到比特流中的随机访问入口点,从而便于错误恢复等等。Additionally, in processing block 1706, processing logic utilizes the extracted metadata to find a switch sample set that contains a particular sample and select an alternative sample from the switch sample set. In response to changing network conditions, an optional sample that is the same decoded value as the original sample can be used to switch between two differently encoded bitstreams, in order to provide a random access entry point into the bitstream to facilitate error recovery etc.

现在,将参照扩展的ISO媒体文件格式(称为扩展的MP4)来描述示例性切换采样元数据结构。然而,应当指出的是,其它媒体文件格式也能被扩展,以便并入用于存储切换采样元数据的各种数据结构。An exemplary switched sample metadata structure will now be described with reference to an extended ISO media file format, referred to as extended MP4. It should be noted, however, that other media file formats can also be extended to incorporate various data structures for storing switched sample metadata.

图18举例说明了用于存储切换采样元数据的示例性数据结构。所述示例性数据结构是包括一组嵌套表的切换采样表框的形式。表1802中的每个条目都标识一个切换采样集。每个切换采样集都由一组切换采样组成,所述切换采样组的重构客观上是等同的(或感性上是等同的),但是能够根据可以或可以不必作为切换采样而处于相同轨道(流)中的不同参考采样来预测所述切换采样组。表1802中的每个条目都链接于相对应的表1804。表1804标识了切换采样集中所含的每个切换采样。表1804中的每个条目还都链接于相对应的表1806,该表定义了切换采样的位置(即,它的轨道号和采样号),所述轨道包含:切换采样所使用的参考采样、切换采样所使用的参考采样的总数以及切换采样所使用的每个参考采样。Figure 18 illustrates an exemplary data structure for storing switching sample metadata. The exemplary data structure is in the form of a toggle sample table box comprising a set of nested tables. Each entry in table 1802 identifies a switch sample set. Each switched sample set consists of a set of switched samples whose reconstructions are objectively equivalent (or perceptually equivalent), but capable of being on the same orbit as may or may not necessarily be switched samples ( stream) to predict the switch sample set. Each entry in table 1802 is linked to a corresponding table 1804 . Table 1804 identifies each toggle sample contained in the toggle sample set. Each entry in table 1804 is also linked to a corresponding table 1806, which defines the position of the switch sample (i.e., its track number and sample number), said track contains: the reference sample used by the switch sample, The total number of reference samples used to switch samples and each reference sample used to switch samples.

正如图15A中举例说明的那样,在一个实施例中,可以使用切换采样元数据来在相同内容的不同编码版本之间进行切换。在MP4中,把每个可选的编码存为独立的MP4轨道,并且轨道首部中的“可选组”表明它是特定内容的可选编码。As illustrated in Figure 15A, in one embodiment, switching sample metadata may be used to switch between different encoded versions of the same content. In MP4, each optional encoding is stored as an independent MP4 track, and the "optional group" in the track header indicates that it is an optional encoding for a specific content.

图15B举例说明了根据图15A的、包含定义切换采样集1502的元数据的表,所述切换采样集1502由采样S2和S12组成。FIG. 15B illustrates a table according to FIG. 15A containing metadata defining a switched sample set 1502 consisting of samples S2 and S12.

图15C是用于判断如下点的方法1510的一个实施例的流程图,其中在所述点处将执行两个比特流之间的切换。假定将要从流1到流2执行切换,方法1510始于搜索切换采样元数据,以找出所有包含具有流1的参考轨道的切换采样和具有流2的切换采样轨道的切换采样的切换采样集(处理块1512)。接下来,评估得到的切换采样集,以选择其中具有流1的参考轨道的切换采样的所有参考采样都可用的切换采样集(处理块1514)。例如,如果具有流1的参考轨道的切换采样是P帧,那么在切换之前要求一个采样是可用的。此外,利用所选的切换采样集中的采样来确定切换点(处理块1516)。也就是说,切换点被认定为是将经由具有流1的参考轨道的切换采样而紧接在具有流1的参考轨道的切换采样的最高参考采样之后,且直到紧跟具有流2的切换采样轨道的切换采样的采样那里。Figure 15C is a flowchart of one embodiment of a method 1510 for determining the point at which switching between two bitstreams is to be performed. Assuming a switch is to be performed from stream 1 to stream 2, method 1510 begins by searching the switch sample metadata to find all switch sample sets that contain switch samples of the reference track with stream 1 and switch samples of the switch sample track with stream 2 (processing block 1512). Next, the resulting switched sample set is evaluated to select a switched sample set in which all reference samples with switched samples of the reference track of stream 1 are available (processing block 1514). For example, if the switch sample of the reference track with stream 1 is a P frame, then one sample is required to be available before switching. Additionally, a switch point is determined using the samples in the selected switch sample set (processing block 1516). That is, the switch point is considered to be the highest reference sample that will immediately follow the switch samples of the reference track with stream 1 via the switch sample of the reference track with stream 1, and until immediately following the switch sample with stream 2 The track's toggle sample-to-sample there.

在另一个实施例,可以使用切换采样元数据来便于随机访问到比特流中的入口点,正如在图19A-19C中举例说明的那样。In another embodiment, switching sample metadata may be used to facilitate random access to entry points in the bitstream, as illustrated in Figures 19A-19C.

参照图19A和19B,切换采样1902由采样S2和S12组成。S2是根据P22预测的P帧,并且在普通的流重放期间使用所述S2。S12用作为随机访问点(用于编接)。一旦S12被解码,流重放就继续进行P24的解码,就好似P24是在S2之后被解码一样。Referring to Figures 19A and 19B, switch sample 1902 consists of samples S2 and S12. S2 is a P frame predicted from P22 and is used during normal stream playback. S12 is used as a random access point (for splicing). Once S12 is decoded, stream playback proceeds with decoding of P24 as if P24 was decoded after S2.

图19C是用于确定采样(例如,轨道T上的采样S)的随机访问点的方法1910的一个实施例的流程图。方法1910始于搜索切换采样元数据以找出包含具有切换采样轨道T的切换采样的所有切换采样集(处理块1912)。接下来,评估得到的切换采样集,以便选择这样的切换采样集,在所述切换采样集中,具有切换采样轨道T的切换采样在解码顺序上是在采样S之前的最邻近的采样(处理块1914)。此外,从所选的切换采样集中选出除具有切换采样轨道T的切换采样之外的切换采样(采样SS),以作为到采样S的随机访问点(处理块1916)。在流重放期间,对采样SS进行解码(继之以对在对应采样SS的条目中指定的任何参考采样进行解码),而不是对采样S进行解码。Figure 19C is a flowchart of one embodiment of a method 1910 for determining a random access point for a sample (eg, sample S on track T). Method 1910 begins by searching the switch sample metadata to find all switch sample sets that contain switch samples with switch sample track T (processing block 1912). Next, the resulting switched sample set is evaluated in order to select the switched sample set in which the switched sample with switched sample track T is the nearest sample preceding sample S in decoding order (processing block 1914). Additionally, a switched sample (sample SS) other than the switched sample with switched sample track T is selected from the selected set of switched samples as a random access point to sample S (processing block 1916). During stream playback, a sample SS (followed by any reference samples specified in the entry for the corresponding sample SS) is decoded instead of a sample S.

在又一个实施例中,可以使用切换采样元数据来便于错误恢复,正如图20A-20C中所举例说明的那样。In yet another embodiment, switching sample metadata may be used to facilitate error recovery, as exemplified in Figures 20A-20C.

参照图20A和20B,切换采样2002由采样S2、S12和S22组成。采样S2是根据采样P4预测的。采样S12是根据采样S1预测的。如果在采样P2和P4之间发生错误,那么就可以对切换采样S12进行解码,而不是对采样S2进行解码。那么,流送照常继续采样P6。如果错误也影响了采样S1,则就可以对切换采样S22进行解码而不是对采样S2进行解码,然后流送将照常继续采样P6。Referring to FIGS. 20A and 20B , the switching sample 2002 is composed of samples S2 , S12 and S22 . Sample S2 is predicted from sample P4. Sample S12 is predicted from sample S1. If an error occurs between samples P2 and P4, then switching sample S12 may be decoded instead of sample S2. Streaming then continues with sampling P6 as usual. If the error also affected sample S1, then switching sample S22 could be decoded instead of sample S2, and streaming would then continue with sample P6 as usual.

图20c是用于当发送采样(例如,采样S)时便于错误恢复的方法2010的一个实施例的流程图。方法2010始于搜索切换采样元数据以找出包含等于采样S或在按解码顺序上紧随采样S的切换采样的所有切换采样集(处理块2012)。接下来,评估得到的切换采样集以选择具有切换采样SS的切换采样集,所述切换采样SS最接近于采样S并且(经由反馈或其它信息源)已知它的参考采样将是正确的(处理块2014)。此外,发送切换采样SS而不是发送采样S(处理块2016)。Figure 20c is a flowchart of one embodiment of a method 2010 for facilitating error recovery when sending samples (eg, sample S). Method 2010 begins by searching the switched sample metadata to find all switched sample sets that contain switched samples equal to or immediately following sample S in decoding order (processing block 2012). Next, the resulting switched sample set is evaluated to select the switched sample set with switched sample SS that is closest to sample S and for which it is known (via feedback or other information source) that its reference sample will be correct ( processing block 2014). Also, instead of sending samples S, switch samples SS are sent (processing block 2016).

已经描述了视听元数据的存储和检索。尽管这里已经举例说明并描述了特定的实施例,但是本领域普通技术人员将会认识到的是,适于实现相同目的的任何布局方案都可以代替这里所示的特定实施例。本申请是用来涵盖本发明的任何修改或变形。Storage and retrieval of audiovisual metadata has been described. Although specific embodiments have been illustrated and described herein, those of ordinary skill in the art will recognize that any layout scheme suitable for accomplishing the same purpose may be substituted for the specific embodiments shown. This application is intended to cover any adaptations or variations of the present invention.

Claims (85)

1.一种方法,包括:1. A method comprising: 为创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据;和parameter set metadata for creating one or more parameter sets identifying portions of the multimedia data; and 形成与多媒体数据相关联的文件,所述文件包括参数集元数据。A file is formed associated with the multimedia data, the file including parameter set metadata. 2.如权利要求1所述的方法,其中多媒体数据的多个部分中的每一部分都是多媒体数据内的采样。2. The method of claim 1, wherein each of the plurality of portions of the multimedia data is a sample within the multimedia data. 3.如权利要求1所述的方法,其中多媒体数据的多个部分中的每一部分都是一部分多媒体数据内的子采样。3. The method of claim 1, wherein each of the plurality of portions of the multimedia data is a sub-sample within a portion of the multimedia data. 4.如权利要求1所述的方法,其中创建参数集元数据包括:4. The method of claim 1, wherein creating parameter set metadata comprises: 接收具有已编码的多媒体数据的文件;receiving a file with encoded multimedia data; 检查一个或多个参数集与多媒体数据的多个部分之间的关系;以及Examining relationships between one or more parameter sets and portions of multimedia data; and 根据检查出的关系来定义参数集元数据。Define parameter set metadata based on checked out relationships. 5.如权利要求1所述的方法,其中创建参数集元数据包括:把参数集元数据编组成一组预定的数据结构。5. The method of claim 1, wherein creating parameter set metadata includes organizing parameter set metadata into a set of predetermined data structures. 6.如权利要求5所述的方法,其中创建参数集元数据进一步包括:6. The method of claim 5, wherein creating parameter set metadata further comprises: 把预定数据结构组内的数据的每个重复序列转换成对序列出现的引用和出现次数。Each repeating sequence of data within a predetermined set of data structures is converted into a reference to the sequence occurrence and the number of occurrences. 7.如权利要求5所述的方法,其中预定的数据结构组包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。7. The method of claim 5, wherein the predetermined set of data structures comprises: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing Information about the association between a parameter set and multiple parts of multimedia data. 8.如权利要求1所述的方法,进一步包括:8. The method of claim 1, further comprising: 将与多媒体数据相关联的文件发送到解码系统;sending the files associated with the multimedia data to the decoding system; 在解码系统上接收与多媒体数据相关联的文件;以及receiving a file associated with the multimedia data on the decoding system; and 在解码系统上从与多媒体数据相关联的文件中提取参数集元数据,所提取出的参数集元数据随后被用于标识解码至少一部分的多媒体数据所需的一个或多个参数集中的任一个。Parameter set metadata is extracted from a file associated with the multimedia data at the decoding system, the extracted parameter set metadata is then used to identify any of one or more parameter sets required to decode at least a portion of the multimedia data . 9.一种方法,包括:9. A method comprising: 接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据;以及receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data; and 从所述文件中提取参数集元数据,所提取出的参数集元数据随后被用于判断一个或多个参数集与多媒体数据的多个部分之间的关系。Parameter set metadata is extracted from the file, and the extracted parameter set metadata is then used to determine a relationship between one or more parameter sets and portions of the multimedia data. 10.如权利要求9所述的方法,其中多媒体数据的多个部分中的每一部分是多媒体数据内的采样。10. The method of claim 9, wherein each of the plurality of portions of multimedia data is a sample within the multimedia data. 11.如权利要求9所述的方法,其中多媒体数据的多个部分中的每一部分是一部分多媒体数据内的子采样。11. The method of claim 9, wherein each of the plurality of portions of multimedia data is a sub-sample within a portion of the multimedia data. 12.如权利要求9所述的方法,进一步包括:12. The method of claim 9, further comprising: 利用判定的关系来控制多媒体数据的多个部分和一个或多个参数集的传输时间。The determined relationship is utilized to control the timing of transmission of portions of multimedia data and one or more sets of parameters. 13.如权利要求9所述的方法,其中将提取出的参数集元数据编组成一组预定的数据结构。13. The method of claim 9, wherein the extracted parameter set metadata is organized into a set of predetermined data structures. 14.如权利要求13所述的方法,其中预定的数据结构组包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。14. The method of claim 13, wherein the predetermined set of data structures comprises: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing Information about the association between a parameter set and multiple parts of multimedia data. 15.一种方法,包括:15. A method comprising: 创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据;creating parameter set metadata identifying one or more parameter sets of portions of the multimedia data; 创建定义多媒体数据内的多个采样的编组的采样组元数据;以及creating sample group metadata defining a grouping of a plurality of samples within the multimedia data; and 形成与多媒体数据相关联的文件,所述文件包括参数集元数据和采样组元数据。A file is formed associated with the multimedia data, the file including parameter set metadata and sample group metadata. 16.如权利要求15所述的方法,其中多媒体数据的多个部分中的每一部分是多媒体数据内的采样和子采样中的任一个。16. The method of claim 15, wherein each of the plurality of portions of the multimedia data is any one of a sample and a sub-sample within the multimedia data. 17.如权利要求15所述的方法,其中创建参数集元数据包括:17. The method of claim 15, wherein creating parameter set metadata comprises: 把参数集元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。organizing parameter set metadata into a predetermined set of data structures comprising: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing Information about associations between parts of multimedia data. 18.如权利要求15所述的方法,其中所述编组是以多个采样的相互依赖性为基础的。18. The method of claim 15, wherein the grouping is based on interdependencies of a plurality of samples. 19.如权利要求15所述的方法,其中创建采样组元数据包括:19. The method of claim 15, wherein creating sample group metadata comprises: 把采样组元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于多媒体数据内的多个采样组的描述性信息;和第二数据结构,其包含标识多个采样组中的每一个中的采样的信息。grouping sample group metadata into a predetermined set of data structures comprising: a first data structure containing descriptive information about a plurality of sample groups within the multimedia data; and a second data structure containing The sampled information in each of the . 20.一种方法,包括:20. A method comprising: 接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据和定义多媒体数据内的多个采样的编组的采样组元数据;以及receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data and sample group metadata defining groupings of a plurality of samples within the multimedia data; and 从文件中提取参数集元数据和采样组元数据,所提取出的参数集元数据随后被用于判断一个或多个参数集与多媒体数据的多个部分之间的关系,而所提取出的采样组元数据随后被用于标识能在将来的处理过程中处理的采样。Extract parameter set metadata and sample group metadata from the file, the extracted parameter set metadata is then used to determine the relationship between one or more parameter sets and multiple parts of the multimedia data, and the extracted The sample group metadata is then used to identify samples that can be processed in future processing. 21.如权利要求20所述的方法,其中多媒体数据的多个部分中的每一部分都是多媒体数据内的采样和子采样中的任一个。21. The method of claim 20, wherein each of the plurality of portions of the multimedia data is any one of a sample and a sub-sample within the multimedia data. 22.如权利要求20所述的方法,进一步包括:22. The method of claim 20, further comprising: 利用判定的关系来控制多媒体数据的多个部分和一个或多个参数集的传输时间。The determined relationship is utilized to control the timing of transmission of portions of multimedia data and one or more sets of parameters. 23.如权利要求20所述的方法,其中将提取出的参数集元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。23. The method of claim 20, wherein grouping the extracted parameter set metadata into a set of predetermined data structures comprises: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing information defining associations between one or more parameter sets and portions of the multimedia data. 24.如权利要求20所述的方法,其中所述编组是以多个采样的相互依赖性为基础的。24. The method of claim 20, wherein the grouping is based on interdependencies of a plurality of samples. 25.如权利要求20所述的方法,进一步包括:25. The method of claim 20, further comprising: 响应于网络能力方面的变化,找出能在不影响多媒体数据的剩余采样的解码的情况下处理的一个或多个采样。In response to a change in network capabilities, one or more samples are found that can be processed without affecting decoding of remaining samples of the multimedia data. 26.如权利要求20所述的方法,进一步包括:26. The method of claim 20, further comprising: 根据提取出的采样组元数据来过滤多个采样以减少将要再现的采样数目。Multiple samples are filtered according to the extracted sample group metadata to reduce the number of samples to be rendered. 27.如权利要求20所述的方法,其中将提取出的采样组元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于多媒体数据内的多个采样组的描述性信息;和第二数据结构,其包含标识多个采样组中的每一个中的采样的信息。27. The method of claim 20, wherein grouping the extracted sample group metadata into a predetermined set of data structures comprises: a first data structure containing descriptive information about a plurality of sample groups within the multimedia data information; and a second data structure containing information identifying samples in each of the plurality of sample groups. 28.一种方法,包括:28. A method comprising: 创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据;creating parameter set metadata identifying one or more parameter sets of portions of the multimedia data; 创建定义多媒体数据内的多个采样的编组的采样组元数据;creating sample group metadata defining a grouping of a plurality of samples within the multimedia data; 创建定义与多媒体数据相关联的多个切换采样集的切换采样元数据,所述多个切换采样集中的每一个都包含具有相同解码值的采样;以及creating switched sample metadata defining a plurality of switched sample sets associated with the multimedia data, each of the multiple switched sample sets comprising samples having the same decoded value; and 形成与多媒体数据相关联的文件,所述文件包括参数集元数据、采样组元数据和切换采样元数据。A file is formed associated with the multimedia data, the file including parameter set metadata, sample group metadata, and switch sample metadata. 29.如权利要求28所述的方法,其中多媒体数据的多个部分中的每一部分都是多媒体数据内的采样和子采样中的任一个。29. The method of claim 28, wherein each of the plurality of portions of the multimedia data is any one of a sample and a sub-sample within the multimedia data. 30.如权利要求28所述的方法,其中创建参数集元数据包括:30. The method of claim 28, wherein creating parameter set metadata comprises: 把参数集元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。organizing parameter set metadata into a predetermined set of data structures comprising: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing Information about associations between parts of multimedia data. 31.如权利要求28所述的方法,其中所述编组是以多个采样的相互依赖性为基础的。31. The method of claim 28, wherein the grouping is based on interdependencies of a plurality of samples. 32.如权利要求28所述的方法,其中创建采样组元数据包括:32. The method of claim 28, wherein creating sample group metadata comprises: 把采样组元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于多媒体数据内的多个采样组的描述性信息;和第二数据结构,其包含标识多个采样组中的每一个中的采样的信息。grouping sample group metadata into a predetermined set of data structures comprising: a first data structure containing descriptive information about a plurality of sample groups within the multimedia data; and a second data structure containing The sampled information in each of the . 33.如权利要求28所述的方法,其中多个切换采样集中的每一个中的采样使用了不同的参考采样。33. The method of claim 28, wherein samples in each of the plurality of switched sample sets use different reference samples. 34.如权利要求28所述的方法,其中创建切换采样元数据包括:34. The method of claim 28, wherein creating switching sample metadata comprises: 把切换采样元数据编组成表示为包含一组嵌套表的表框的预定数据结构。The switch sampling metadata is organized into a predetermined data structure represented as a table box containing a set of nested tables. 35.一种方法,包括:35. A method comprising: 接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据、定义多媒体数据内的多个采样的编组的采样组元数据和定义与多媒体数据相关联的多个切换采样集的切换采样元数据;以及Receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data, sample group metadata defining groupings of a plurality of samples within the multimedia data, and defining parameters associated with the multimedia data switch sample metadata for multiple switch sample sets linked together; and 从文件中提取参数集元数据、采样组元数据和切换采样元数据,所提取出的参数集元数据随后被用于判断一个或多个参数集与多媒体数据的多个部分之间的关系,所提取出的采样组元数据随后被用于标识能在将来的处理过程中处理的采样,而所提取出的切换采样元数据随后被用于找出特定采样的代替物。extracting parameter set metadata, sample group metadata, and switching sample metadata from the file, the extracted parameter set metadata is then used to determine the relationship between one or more parameter sets and multiple parts of the multimedia data, The extracted sample group metadata is then used to identify samples that can be processed in future processing, while the extracted switch sample metadata is then used to find a surrogate for a particular sample. 36.如权利要求35所述的方法,其中多媒体数据的多个部分中的每一部分都是多媒体数据内的采样和子采样中的任一个。36. The method of claim 35, wherein each of the plurality of portions of the multimedia data is any one of a sample and a sub-sample within the multimedia data. 37.如权利要求35所述的方法,进一步包括:37. The method of claim 35, further comprising: 利用判定的关系来控制多媒体数据的多个部分和一个或多个参数集的传输时间。The determined relationship is utilized to control the timing of transmission of portions of multimedia data and one or more sets of parameters. 38.如权利要求35所述的方法,其中将提取出的参数集元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。38. The method of claim 35, wherein grouping the extracted parameter set metadata into a predetermined set of data structures comprises: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing information defining associations between one or more parameter sets and portions of the multimedia data. 39.如权利要求35所述的方法,其中所述编组是以多个采样的相互依赖性为基础的。39. The method of claim 35, wherein the grouping is based on interdependencies of multiple samples. 40.如权利要求35所述的方法,进一步包括:40. The method of claim 35, further comprising: 响应于网络能力方面的变化,找出能在不影响多媒体数据的剩余采样的解码的情况下处理的一个或多个采样。In response to a change in network capabilities, one or more samples are found that can be processed without affecting decoding of remaining samples of the multimedia data. 41.如权利要求35所述的方法,进一步包括:41. The method of claim 35, further comprising: 根据所提取出的采样组元数据来过滤多个采样以减少将要再现的采样数目。The plurality of samples are filtered according to the extracted sample group metadata to reduce the number of samples to be rendered. 42.如权利要求35所述的方法,其中将提取出的采样组元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于多媒体数据内的多个采样组的描述性信息;和第二数据结构,其包含标识多个采样组中的每一个中的采样的信息。42. The method of claim 35, wherein grouping the extracted sample group metadata into a predetermined set of data structures comprises: a first data structure containing descriptive information about a plurality of sample groups within the multimedia data information; and a second data structure containing information identifying samples in each of the plurality of sample groups. 43.如权利要求35所述的方法,其中当使用不同的参考采样时,多个切换采样集中的每一个都包含具有相同解码值的采样。43. The method of claim 35, wherein each of the plurality of switched sample sets contains samples having the same decoded value when different reference samples are used. 44.如权利要求35所述的方法,进一步包括:44. The method of claim 35, further comprising: 在多个切换采样集中找出包含特定采样的切换采样集;以及finding a switched sample set containing a particular sample among multiple switched sample sets; and 从找出的切换采样集中选择可选的采样。Select an optional sample from the found switch sample set. 45.如权利要求35所述的方法,其中将提取出的切换采样元数据编组成表示为包含一组嵌套表的表框的预定数据结构。45. The method of claim 35, wherein the extracted switched sample metadata is organized into a predetermined data structure represented as a table box containing a set of nested tables. 46.一种方法,包括:46. A method comprising: 创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据;creating parameter set metadata identifying one or more parameter sets of portions of the multimedia data; 创建定义与多媒体数据相关联的多个切换采样集的切换采样元数据,所述多个切换采样集中的每一个都包含具有相同解码值的采样;以及creating switched sample metadata defining a plurality of switched sample sets associated with the multimedia data, each of the multiple switched sample sets comprising samples having the same decoded value; and 形成与多媒体数据相关联的文件,所述文件包括参数集元数据和切换采样元数据。A file is formed associated with the multimedia data, the file including parameter set metadata and switching sample metadata. 47.如权利要求46所述的方法,其中多媒体数据的多个部分中的每一个都是多媒体数据内的采样和子采样中的任一个。47. The method of claim 46, wherein each of the plurality of portions of the multimedia data is any one of a sample and a sub-sample within the multimedia data. 48.如权利要求46所述的方法,其中创建参数集元数据包括:48. The method of claim 46, wherein creating parameter set metadata comprises: 把参数集元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。organizing parameter set metadata into a predetermined set of data structures comprising: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing Information about associations between parts of multimedia data. 49.如权利要求46所述的方法,其中多个切换采样集中的每一个中的采样都使用了不同的参考采样。49. The method of claim 46, wherein samples in each of the plurality of switched sample sets use different reference samples. 50.如权利要求46所述的方法,其中创建切换采样元数据包括:50. The method of claim 46, wherein creating switching sample metadata comprises: 把切换采样元数据编组成表示为包含一组嵌套表的表框的预定数据结构。The switch sampling metadata is organized into a predetermined data structure represented as a table box containing a set of nested tables. 51.一种方法,包括:51. A method comprising: 接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据和定义与多媒体数据相关联的多个切换采样集的切换采样元数据;以及receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data and switching sample metadata defining a plurality of switching sample sets associated with the multimedia data; and 从文件中提取参数集元数据和切换采样元数据,所提取出的参数集元数据随后被用于判断一个或多个参数集与多媒体数据的多个部分之间的关系,而所提取出的切换采样元数据随后被用于找出特定采样的代替物。Extract parameter set metadata and switching sampling metadata from the file, the extracted parameter set metadata is then used to determine the relationship between one or more parameter sets and multiple parts of the multimedia data, and the extracted Switching sample metadata is then used to find substitutes for a particular sample. 52.如权利要求51所述的方法,其中多媒体数据的多个部分中的每一个都是多媒体数据内的采样和子采样中的任一个。52. The method of claim 51, wherein each of the plurality of portions of the multimedia data is any one of a sample and a sub-sample within the multimedia data. 53.如权利要求51所述的方法,进一步包括:53. The method of claim 51 , further comprising: 利用判定的关系来控制多媒体数据的多个部分和一个或多个参数集的传输时间。The determined relationship is utilized to control the timing of transmission of portions of multimedia data and one or more sets of parameters. 54.如权利要求51所述的方法,其中将提取出的参数集元数据编组成一组预定的数据结构,包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。54. The method of claim 51 , wherein grouping the extracted parameter set metadata into a set of predetermined data structures comprises: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing information defining associations between one or more parameter sets and portions of the multimedia data. 55.如权利要求51所述的方法,其中当使用不同的参考采样时,多个切换采样集中的每一个都包含具有相同解码值的采样。55. The method of claim 51, wherein each of the plurality of switched sample sets contains samples having the same decoded value when different reference samples are used. 56.如权利要求51所述的方法,进一步包括:56. The method of claim 51 , further comprising: 在多个切换采样集中找出包含特定采样的切换采样集;以及finding a switched sample set containing a particular sample among multiple switched sample sets; and 从找出的切换采样集中选择可选的采样。Select an optional sample from the found switch sample set. 57.如权利要求51所述的方法,其中将提取出的切换采样元数据编组成表示为包含一组嵌套表的表框的预定数据结构。57. The method of claim 51, wherein the extracted switched sample metadata is organized into a predetermined data structure represented as a table box containing a set of nested tables. 58.一种用于存储由在数据处理系统上执行的应用程序来访问的数据的存储器,包括:58. A memory for storing data accessed by an application program executing on a data processing system, comprising: 存储在所述存储器中的多个数据结构,所述多个数据结构驻留在所述应用程序所使用的文件中,所述文件与多媒体数据相关联并且包括定义多媒体数据的多个部分的一个或多个参数集的参数集元数据。a plurality of data structures stored in the memory, the plurality of data structures residing in files used by the application, the files being associated with the multimedia data and including a or parameter set metadata for multiple parameter sets. 59.如权利要求58所述的存储器,其中包括参数集元数据的文件还包括相关联的多媒体数据。59. The memory of claim 58, wherein the file including parameter set metadata further includes associated multimedia data. 60.如权利要求58所述的存储器,其中包括参数集元数据的文件包括对包含相关联的多媒体数据的文件的引用。60. The memory of claim 58, wherein the file including parameter set metadata includes a reference to a file containing associated multimedia data. 61.如权利要求58所述的存储器,其中所述多个数据结构包括:第一数据结构,其包含关于一个或多个参数集的描述性信息;和第二数据结构,其包含定义一个或多个参数集与多媒体数据的多个部分之间的关联的信息。61. The memory of claim 58, wherein the plurality of data structures includes: a first data structure containing descriptive information about one or more parameter sets; and a second data structure containing Information on the association between the plurality of parameter sets and the plurality of parts of the multimedia data. 62.一种用于存储由在数据处理系统上执行的应用程序来访问的数据的存储器,所述存储器包括:62. A memory for storing data accessed by an application program executing on a data processing system, the memory comprising: 存储在所述存储器中的多个数据结构,所述多个数据结构驻留在所述应用程序所使用的文件中,所述文件与多媒体数据相关联并且包括:A plurality of data structures stored in the memory, the plurality of data structures residing in files used by the application, the files being associated with multimedia data and comprising: 定义多媒体数据的多个部分的一个或多个参数集的参数集元数据,和parameter set metadata defining one or more parameter sets of parts of the multimedia data, and 定义多媒体数据内的多个采样的编组的采样组元数据。Sample group metadata defining a grouping of a plurality of samples within the multimedia data. 63.一种用于存储由在数据处理系统上执行的应用程序来访问的数据的存储器,所述存储器包括:63. A memory for storing data accessed by an application program executing on a data processing system, the memory comprising: 存储在所述存储器中的多个数据结构,所述多个数据结构驻留在所述应用程序所使用的文件中,所述文件与多媒体数据相关联并且包括:A plurality of data structures stored in the memory, the plurality of data structures residing in files used by the application, the files being associated with multimedia data and comprising: 定义多媒体数据的多个部分的一个或多个参数集的参数集元数据,parameter set metadata defining one or more parameter sets of parts of the multimedia data, 定义多媒体数据内的多个采样的编组的采样组元数据,和sample group metadata defining a grouping of multiple samples within the multimedia data, and 定义与多媒体数据相关联的多个切换采样集的切换采样元数据。Switching sample metadata defining a plurality of switching sample sets associated with the multimedia data. 64.一种用于存储由在数据处理系统上执行的应用程序来访问的数据的存储器,所述存储器包括:64. A memory for storing data accessed by an application program executing on a data processing system, the memory comprising: 存储在所述存储器中的多个数据结构,所述多个数据结构驻留在所述应用程序所使用的文件中,所述文件与多媒体数据相关联并且包括:A plurality of data structures stored in the memory, the plurality of data structures residing in files used by the application, the files being associated with multimedia data and comprising: 定义多媒体数据的多个部分的一个或多个参数集的参数集元数据,和parameter set metadata defining one or more parameter sets of parts of the multimedia data, and 定义与多媒体数据相关联的多个切换采样集的切换采样元数据。Switching sample metadata defining a plurality of switching sample sets associated with the multimedia data. 65.一种设备,包括:65. A device comprising: 元数据生成器,用于创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据;和a metadata generator for creating parameter set metadata identifying one or more parameter sets of portions of the multimedia data; and 文件创建器,用于形成与多媒体数据相关联的文件,所述文件包括参数集元数据。A file creator for forming a file associated with the multimedia data, the file including parameter set metadata. 66.如权利要求65所述的设备,其中多媒体数据的多个部分中的每一个都是多媒体数据内的采样和子采样中的任一个。66. The apparatus of claim 65, wherein each of the plurality of portions of the multimedia data is any one of a sample and a sub-sample within the multimedia data. 67.如权利要求65所述的设备,其中所述元数据生成器用于通过接收具有已编码的多媒体数据的文件、检查一个或多个参数集与多媒体数据的多个部分之间的关系并根据检查的关系定义参数集元数据,来创建参数集元数据。67. The apparatus as claimed in claim 65, wherein the metadata generator is configured to receive a file with encoded multimedia data, check the relationship between one or more parameter sets and a plurality of parts of the multimedia data and based on Check the relationship definition parameter set metadata to create parameter set metadata. 68.如权利要求65所述的设备,进一步包括:68. The device of claim 65, further comprising: 元数据提取器,用于在解码系统上接收与多媒体数据相关联的文件,并且用于从与多媒体数据相关联的文件中提取参数集元数据;和a metadata extractor for receiving a file associated with the multimedia data at the decoding system and for extracting parameter set metadata from the file associated with the multimedia data; and 媒体数据流处理器,用于使用提取出的参数集元数据以标识解码至少一部分多媒体数据所需的一个或多个参数集中的任一个。A media data stream processor configured to use the extracted parameter set metadata to identify any one of the one or more parameter sets required to decode at least a portion of the multimedia data. 69.一种设备,包括:69. A device comprising: 元数据提取器,用于接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据,并且用于从文件中提取参数集元数据;和a metadata extractor for receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data, and for extracting the parameter set metadata from the file; and 媒体数据流处理器,用于使用提取出的参数集元数据以判断一个或多个参数集与多媒体数据的多个部分之间的关系。The media data stream processor is configured to use the extracted parameter set metadata to determine the relationship between one or more parameter sets and multiple parts of the multimedia data. 70.如权利要求69所述的设备,其中多媒体数据的多个部分中的每一部分都是多媒体数据内的采样或子采样。70. The apparatus of claim 69, wherein each of the plurality of portions of the multimedia data is a sample or sub-sample within the multimedia data. 71.如权利要求69所述的设备,其中所述媒体数据流处理器进一步用于利用判定的关系来控制多媒体数据的多个部分和一个或多个参数集的传输时间。71. The apparatus of claim 69, wherein the media data stream processor is further configured to utilize the determined relationship to control transmission times of portions of multimedia data and one or more parameter sets. 72.一种设备,包括:72. A device comprising: 元数据生成器,用于创建标识多媒体数据的多个部分一个或多个参数集的参数集元数据,并且用于创建定义多媒体数据内的多个采样的编组的采样组元数据;和a metadata generator for creating parameter set metadata identifying one or more parameter sets of portions of the multimedia data, and for creating sample group metadata defining groupings of samples within the multimedia data; and 文件创建器,用于形成与多媒体数据相关联的文件,所述文件包括参数集元数据和采样组元数据。A file creator for forming a file associated with the multimedia data, the file including parameter set metadata and sample group metadata. 73.一种设备,包括:73. A device comprising: 元数据提取器,用于接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据和定义多媒体数据内的多个采样的编组的采样组元数据,并且用于从文件中提取参数集元数据和采样组元数据;和a metadata extractor for receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data and a sample component defining a grouping of a plurality of samples within the multimedia data data, and used to extract parameter set metadata and sample group metadata from the file; and 媒体数据流处理器,用于使用提取出的参数集元数据以供判断一个或多个参数集与多媒体数据的多个部分之间的关系,并且用于使用提取出的采样组元数据以标识能在将来的处理过程中处理的采样。a media data stream processor, configured to use the extracted parameter set metadata to determine the relationship between one or more parameter sets and portions of the multimedia data, and to use the extracted sample group metadata to identify Samples that can be processed in future processing. 74.一种设备,包括:74. A device comprising: 元数据生成器,用于创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据,用于创建定义多媒体数据内的多个采样的编组的采样组元数据,并且用于创建定义与多媒体数据相关联的多个切换采样集的切换采样元数据;和a metadata generator for creating parameter set metadata identifying one or more parameter sets of portions of the multimedia data, for creating sample group metadata defining groupings of a plurality of samples within the multimedia data, and for creating switch sample metadata defining a plurality of switch sample sets associated with the multimedia data; and 文件创建器,用于形成与多媒体数据相关联的文件,所述文件包括参数集元数据、采样组元数据和切换采样元数据。A file creator for forming a file associated with the multimedia data, the file including parameter set metadata, sample group metadata, and switch sample metadata. 75.一种设备,包括:75. A device comprising: 元数据提取器,用于接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据、定义多媒体数据内的多个采样的编组的采样组元数据和定义与多媒体数据相关联的多个切换采样集的切换采样元数据,并且用于从文件中提取参数集元数据、采样组元数据和切换采样元数据;和a metadata extractor for receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data, a sample component defining a grouping of a plurality of samples within the multimedia data data and toggle sample metadata defining a plurality of toggle sample sets associated with the multimedia data and for extracting parameter set metadata, sample group metadata, and toggle sample metadata from the file; and 媒体数据流处理器,用于使用提取出的参数集元数据以判断一个或多个参数集与多媒体数据的多个部分之间的关系,用于使用提取出的采样组元数据以标识能在将来的处理过程中处理的采样,并且用于使用提取出的切换采样元数据以找出特定采样的代替物。A media data stream processor, configured to use the extracted parameter set metadata to determine the relationship between one or more parameter sets and multiple parts of the multimedia data, and to use the extracted sample group metadata to identify the relationship between Samples processed during future processing and used to find substitutes for specific samples using the extracted toggle sample metadata. 76.一种设备,包括:76. A device comprising: 元数据生成器,用于创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据,并且用于创建定义与多媒体数据相关联的多个切换采样集的切换采样元数据;和a metadata generator for creating parameter set metadata identifying one or more parameter sets of portions of the multimedia data, and for creating switch sample metadata defining a plurality of switch sample sets associated with the multimedia data; and 文件创建器,用于形成与多媒体数据相关联的文件,所述文件包括参数集元数据和切换采样元数据。A file creator for forming a file associated with the multimedia data, the file including parameter set metadata and switching sample metadata. 77.一种设备,包括:77. A device comprising: 元数据提取器,用于接收与多媒体数据相关联的文件,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据和定义与多媒体数据相关联的多个切换采样集的切换采样元数据,并且用于从文件中提取参数集元数据和切换采样元数据;和a metadata extractor for receiving a file associated with the multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data and switches defining a plurality of switch sample sets associated with the multimedia data sample metadata, and is used to extract parameter set metadata and switch sample metadata from the file; and 媒体数据流处理器,用于使用提取出的参数集元数据以判断一个或多个参数集与多媒体数据的多个部分之间的关系,并且用于使用提取出的切换采样元数据以找出特定采样的代替物。a media data stream processor configured to use the extracted parameter set metadata to determine a relationship between one or more parameter sets and portions of the multimedia data, and to use the extracted switch sample metadata to find Surrogates for Specific Sampling. 78.一种设备,包括:78. A device comprising: 用于创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据的装置;和means for creating parameter set metadata identifying one or more parameter sets of portions of multimedia data; and 用于形成与多媒体数据相关联的文件的装置,所述文件包括参数集元数据。Means for forming a file associated with multimedia data, the file including parameter set metadata. 79.一种设备,包括:79. A device comprising: 用于接收与多媒体数据相关联的文件的装置,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据;和means for receiving a file associated with multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data; and 用于从文件中提取参数集元数据的装置,所提取出的参数集元数据随后被用于判断一个或多个参数集与多媒体数据的多个部分之间的关系。Means for extracting parameter set metadata from a file, the extracted parameter set metadata being subsequently used to determine a relationship between one or more parameter sets and portions of multimedia data. 80.一种设备,包括:80. A device comprising: 用于创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据的装置;means for creating parameter set metadata identifying one or more parameter sets of portions of multimedia data; 用于创建定义多媒体数据内的多个采样的编组的采样组元数据的装置;和means for creating sample group metadata defining a grouping of a plurality of samples within the multimedia data; and 用于形成与多媒体数据相关联的文件的装置,所述文件包括参数集元数据和采样组元数据。Means for forming a file associated with multimedia data, the file including parameter set metadata and sample group metadata. 81.一种设备,包括:81. A device comprising: 用于接收与多媒体数据相关联的文件的装置,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据和定义多媒体数据内的多个采样的编组的采样组元数据;和means for receiving a file associated with multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data and sample group metadata defining groupings of a plurality of samples within the multimedia data; and 用于从文件中提取参数集元数据和采样组元数据的装置,所提取出的参数集元数据随后被用于判断一个或多个参数集与多媒体数据的多个部分之间的关系,而所提取出的采样组元数据随后被用于标识能在将来的处理过程中处理的采样。means for extracting parameter set metadata and sample group metadata from a file, the extracted parameter set metadata is then used to determine the relationship between one or more parameter sets and a plurality of parts of the multimedia data, and The extracted sample group metadata is then used to identify samples that can be processed in future processing. 82.一种设备,包括:82. A device comprising: 用于创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据的装置;means for creating parameter set metadata identifying one or more parameter sets of portions of multimedia data; 用于创建定义多媒体数据内的多个采样的编组的采样组元数据的装置;means for creating sample group metadata defining a grouping of a plurality of samples within multimedia data; 用于创建定义与多媒体数据相关联的多个切换采样集的切换采样元数据的装置,所述多个切换采样集中的每一个都包含具有相同解码值的采样;和means for creating switching sample metadata defining a plurality of switching sample sets associated with the multimedia data, each of the plurality of switching sample sets comprising samples having the same decoded value; and 用于形成与多媒体数据相关联的文件的装置,所述文件包括参数集元数据、采样组元数据和切换采样元数据。Means for forming a file associated with multimedia data, the file including parameter set metadata, sample group metadata, and switch sample metadata. 83.一种设备,包括:83. A device comprising: 用于接收与多媒体数据相关联的文件的装置,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据、定义多媒体数据内的多个采样的编组的采样组元数据和定义与多媒体数据相关联的多个切换采样集的切换采样元数据;和means for receiving a file associated with multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data, sample group metadata defining groupings of a plurality of samples within the multimedia data, and definitions switching sample metadata for a plurality of switching sample sets associated with the multimedia data; and 用于从文件中提取参数集元数据、采样组元数据和切换采样元数据的装置,所提取出的参数集元数据随后被用于判断一个或多个参数与多媒体数据的多个部分之间的关系,所提取出的采样组元数据随后被用于标识能在将来的处理过程中处理的采样,而所提取出的切换采样元数据随后被用于找出特定采样的代替物。Means for extracting parameter set metadata, sample group metadata and switching sample metadata from a file, the extracted parameter set metadata being used to determine the relationship between one or more parameters and portions of multimedia data , the extracted sample group metadata is then used to identify samples that can be processed in future processing, while the extracted switch sample metadata is then used to find a surrogate for a particular sample. 84.一种设备,包括:84. A device comprising: 用于创建标识多媒体数据的多个部分的一个或多个参数集的参数集元数据的装置;means for creating parameter set metadata identifying one or more parameter sets of portions of multimedia data; 用于创建定义与多媒体数据相关联的多个切换采样集的切换采样元数据的装置,所述多个切换采样集中的每一个都包含具有相同解码值的采样;和means for creating switching sample metadata defining a plurality of switching sample sets associated with the multimedia data, each of the plurality of switching sample sets comprising samples having the same decoded value; and 用于形成与多媒体数据相关联的文件的装置,所述文件包括参数集元数据和切换采样元数据。Means for forming a file associated with multimedia data, the file including parameter set metadata and switching sample metadata. 85.一种设备,包括:85. A device comprising: 用于接收与多媒体数据相关联的文件的装置,所述文件包括标识多媒体数据的一个或多个参数集的参数集元数据和定义与多媒体数据相关联的多个切换采样集的切换采样元数据;和means for receiving a file associated with multimedia data, the file including parameter set metadata identifying one or more parameter sets of the multimedia data and switching sample metadata defining a plurality of switching sample sets associated with the multimedia data ;and 用于从文件中提取参数集元数据和切换采样元数据的装置,所提取出的参数集元数据随后被用于判断一个或多个参数集与多媒体数据的多个部分之间的关系,而所提取出的切换采样元数据随后被用于找出特定采样的代替物。means for extracting parameter set metadata and switching sampling metadata from a file, the extracted parameter set metadata is then used to determine the relationship between one or more parameter sets and multiple parts of the multimedia data, and The extracted switching sample metadata is then used to find a surrogate for a particular sample.
CNB038092093A 2002-02-25 2003-02-24 Method and apparatus for supporting AVC in MP4 Expired - Lifetime CN100379290C (en)

Applications Claiming Priority (9)

Application Number Priority Date Filing Date Title
US35960602P 2002-02-25 2002-02-25
US60/359,606 2002-02-25
US36177302P 2002-03-05 2002-03-05
US60/361,773 2002-03-05
US36364302P 2002-03-08 2002-03-08
US60/363,643 2002-03-08
US10/371,438 US20040199565A1 (en) 2003-02-21 2003-02-21 Method and apparatus for supporting advanced coding formats in media files
US10/371,438 2003-02-21
PCT/US2003/005636 WO2003073770A1 (en) 2002-02-25 2003-02-24 Method and apparatus for supporting avc in mp4

Publications (2)

Publication Number Publication Date
CN1650626A true CN1650626A (en) 2005-08-03
CN100379290C CN100379290C (en) 2008-04-02

Family

ID=27767924

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB038092093A Expired - Lifetime CN100379290C (en) 2002-02-25 2003-02-24 Method and apparatus for supporting AVC in MP4

Country Status (7)

Country Link
EP (1) EP1481555A1 (en)
JP (1) JP2006507553A (en)
CN (1) CN100379290C (en)
AU (1) AU2003219877B2 (en)
DE (1) DE10392284T5 (en)
GB (1) GB2402248B (en)
WO (1) WO2003073770A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101690218B (en) * 2007-06-28 2014-02-19 汤姆逊许可公司 Method, apparatus and system for providing display device specific content over a network architecture
CN103716649A (en) * 2007-06-28 2014-04-09 汤姆逊许可公司 Method, equipment and system for providing content special for display device through network structure

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BRPI0418464B1 (en) 2004-02-10 2019-02-05 Thompson Licensing storing advanced video encoding (avc) parameter sets in avc file format
US7725593B2 (en) * 2005-07-15 2010-05-25 Sony Corporation Scalable video coding (SVC) file format
EP2041974A4 (en) 2006-07-12 2014-09-24 Lg Electronics Inc Method and apparatus for encoding/decoding signal
EP2084708A4 (en) * 2006-10-19 2010-11-24 Lg Electronics Inc Encoding method and apparatus and decoding method and apparatus
WO2008085150A1 (en) 2006-12-21 2008-07-17 Thomson Licensing Method, apparatus and system for providing color grading for displays
US9247276B2 (en) 2008-10-14 2016-01-26 Imagine Communications Corp. System and method for progressive delivery of media content
JP5652642B2 (en) 2010-08-02 2015-01-14 ソニー株式会社 Data generation apparatus, data generation method, data processing apparatus, and data processing method
US20130195171A1 (en) 2012-01-30 2013-08-01 Qualcomm Incorporated Method of coding video and storing video content
JP2014017003A (en) * 2013-08-27 2014-01-30 Thomson Licensing Method, apparatus and system for providing display device specific content over network architecture
WO2025037566A1 (en) * 2023-08-14 2025-02-20 ソニーセミコンダクタソリューションズ株式会社 Information processing device, solid-state imaging device, display device, and transmission/reception system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101690218B (en) * 2007-06-28 2014-02-19 汤姆逊许可公司 Method, apparatus and system for providing display device specific content over a network architecture
CN103716649A (en) * 2007-06-28 2014-04-09 汤姆逊许可公司 Method, equipment and system for providing content special for display device through network structure

Also Published As

Publication number Publication date
AU2003219877A1 (en) 2003-09-09
GB2402248A (en) 2004-12-01
GB0421328D0 (en) 2004-10-27
JP2006507553A (en) 2006-03-02
CN100379290C (en) 2008-04-02
WO2003073770A1 (en) 2003-09-04
GB2402248B (en) 2005-10-12
AU2003219877B2 (en) 2008-09-04
DE10392284T5 (en) 2005-04-07
EP1481555A1 (en) 2004-12-01

Similar Documents

Publication Publication Date Title
US7613727B2 (en) Method and apparatus for supporting advanced coding formats in media files
US20040167925A1 (en) Method and apparatus for supporting advanced coding formats in media files
CN1653818A (en) Method and apparatus for supporting avc in mp4
AU2003237120B2 (en) Supporting advanced coding formats in media files
US20040006575A1 (en) Method and apparatus for supporting advanced coding formats in media files
AU2003213555B2 (en) Method and apparatus for supporting AVC in MP4
CA2730543C (en) Method and apparatus for track and track subset grouping
CN1751517A (en) image decoding method
CN1781315A (en) Method for encoding image sequences
CN1820494A (en) Stream switching based on gradual decoder refresh
CN1650626A (en) Method and apparatus for supporting AVC in MP4
JP2010104030A (en) Method and apparatus for supporting avc in mp4

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CX01 Expiry of patent term
CX01 Expiry of patent term

Granted publication date: 20080402