
HK1220059B - Method of video layout and processing in mcu stacking - Google Patents


Info

Publication number
HK1220059B
Authority
HK
Hong Kong
Prior art keywords
control unit
multipoint control
endpoint
video
conference
Prior art date
Application number
HK16107880.0A
Other languages
Chinese (zh)
Other versions
HK1220059A1 (en)
Inventor
Frank YUAN
Bidong Yan
Original Assignee
Polycom, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Polycom, Inc.
Publication of HK1220059A1
Publication of HK1220059B


Description

Method of video layout and processing in MCU stacking

Technical Field

The present invention relates to multipoint conferencing, and more particularly to video layout in a conference conducted by stacking two or more multipoint control units (MCUs).

Background Art

Companies and organizations increasingly use audio/video conferencing and multipoint conferencing to improve communication and efficiency within the organization. Large organizations distribute a large number of multimedia endpoints throughout the organization. Typically, one or more multipoint control units (MCUs) serve the internal multipoint multimedia conferencing needs of these endpoints.

A multimedia endpoint is a networked terminal that can provide real-time, two-way audiovisual communication with other terminals or an MCU (e.g., a VSX8000). An endpoint may also include an MCU. An MCU is a conference control entity located at a network node or in a terminal, and it can receive multiple media channels from endpoints via access ports. The MCU processes audiovisual and data signals according to certain criteria and distributes the processed signals to the connected channels. More detailed descriptions of endpoints and MCUs can be found in International Telecommunication Union ("ITU") standards, such as, but not limited to, H.320, H.324, and H.323.

Several techniques have been used to improve the use and efficiency of multipoint communication systems. Some of these techniques improve the process of establishing a communication session by enabling reservationless calls, ad-hoc calls, virtual meeting rooms, and the like. Some of these techniques are disclosed in U.S. Patent Nos. 7,085,243, 7,310,320, and 7,830,824, each of which is incorporated herein by reference.

Other techniques improve control of multipoint communication networks by providing a web server that monitors and controls multiple MCUs. An example of a web control technique using rapid updates is disclosed in U.S. Patent No. 6,760,750, which is incorporated herein by reference. Additional techniques have been disclosed for utilizing the resources of one or more MCUs, for cascading one or more MCUs, and generally for improving the resource utilization of one or more MCUs. See, for example, U.S. Patent Nos. 8,542,266, 7,800,642, 7,492,730, 7,174,365, and 7,113,992, and U.S. Patent Publication No. 2012-0236111 A1, each of which is incorporated herein by reference.

Referring to FIG. 1, an MCU 116 generally has limited hardware and/or software resources. For example, MCU 116 may be able to encode/decode video signals from only a certain number of endpoints 114. This can happen when MCU 116 has limited processing capacity (hardware or software) to support encoding or decoding of video signals. In another example, MCU 116 may have a limited number of video input ports. In either case, a cascaded conference using cascaded MCUs (116a, 116b) can be used to overcome the resource limitation.

In a cascaded conference, the conference participants can be divided into two or more groups in two or more networks, with each group associated with a different MCU 116. One MCU 116 is designated as the conference master MCU 116a (MMCU), and the other MCU 116 is designated as the conference slave MCU 116b (SMCU). This technique can also be used in conferences between participants located at different sites, where each site has its own MCU 116. Each participant can use the local MCU 116, and the whole conference can be conducted by cascading the different local MCUs 116.

FIG. 1 is a block diagram of a multipoint conferencing communication system 100. For example, system 100 may be used by a company with multiple regional networks 110 or by a global conferencing service provider with multiple regional networks 110. The regional networks 110 may correspond to individual conferencing sites and may communicate with each other via a packet-switched network 120 and/or a circuit-switched network 130. Although a regional network 110 may correspond to a specific packet-based network segment or domain, this is not required, and a regional network 110 may span multiple network segments or domains. Each regional network 110 may have multiple multimedia endpoints 114 and one or more local or site-specific MCUs 116.

One or more control servers 112 (CS) can be used in each regional network 110. Within each regional network 110, each of the local endpoints 114 can communicate with its associated local MCU 116 via a packet-switched network and/or a circuit-switched network (not shown). In one example of communication system 100, control server 112 is a web server that can communicate with each local MCU 116 over network 120 using the Internet Protocol (IP). Communication with MCU 116 can be accomplished via an application program interface (API) module (not shown), which can be part of MCU 116. Control server 112 can be a dedicated server for performing cascading of multi-site conferences, but it can also be embedded in MCU 116 or can include applications that share a single web server. For example, as disclosed in the patents and patent applications incorporated by reference above, a single web server can execute additional conferencing applications and can be used to manage connections, calls, and virtual meeting rooms, to monitor and control MCUs 116, and so on.

In an alternative example, in addition to communicating with MCU 116, control server 112 may communicate with a management server (not shown), such as a server of a company or of a global service provider. The management server may include a management database (not shown) of potential endpoint users, such as company employees or customers of a global service provider. The management database may include information such as names, different types of addresses (e.g., email, phone), ID numbers (e.g., employee ID number, customer billing number, or customer ID number), authentication numbers, and conference room numbers. In another embodiment, control server 112 and/or MCU 116 may include the management server.

The video of a conference is organized according to the type of conference being conducted. One type of conference is called a video-switched conference, in which each conferee sees one selected participant (the video source of one endpoint 114). The selected participant can remain the same during the conference or can change according to the dynamics of the conference. For example, the currently active speaker can be shown as the conference video to all participants (i.e., at all endpoints 114). Once the active speaker changes, the new active speaker can be shown.

Another type of conference is a continuous presence (CP) conference, in which video from one or more selected endpoints 114 can be shown continuously throughout the conference. In a continuous presence cascaded conference, SMCU 116b can compose a continuous presence video from the video signals of selected participants in its associated regional network 110, according to the conference layout associated with SMCU 116b. The mixed audio and video of the associated conference are delivered to MMCU 116a in the same way as the video and audio of a single participant. A common continuous presence process involves scaling the video data from the various source endpoints to change the frame resolution so that it can later be incorporated into the continuous presence layout and video mix.

MMCU 116a can mix the audio and video received from one or more SMCUs 116b with the audio and video of selected participants from the group of participants associated with MMCU 116a. The result is the mixed audio and video of the cascaded conference. MMCU 116a can then deliver the mixed audio and video of the cascaded conference to each of its own associated endpoints 114a and to the connected one or more SMCUs 116b. Each of the one or more SMCUs 116b can distribute the mixed audio and video of the cascaded conference to its associated endpoints 114b.

One challenge in managing a cascaded conference is that each MCU 116 (MMCU 116a and SMCU 116b) selects the participants (endpoints 114) to be mixed and displayed from its own associated group, regardless of how the selected participants compare with participants in the other associated groups. In addition, the images of participants associated with MMCU 116a often differ in size from the images of participants associated with SMCU 116b.

As shown in FIG. 2, a continuous presence layout 200 is used as the layout of a conference between four endpoints 114a1-114a4 associated with MMCU 116a and four endpoints 114b1-114b4 associated with SMCU 116b. Participants AM, BM, CM, and DM may be associated with endpoints 114a1-114a4, respectively, and participants AS, BS, CS, and DS may be associated with endpoints 114b1-114b4, respectively. The most active speakers may be participants AM, BM, DM, and AS, and it is desirable to select and show the video of these most active participants in a 2x2 continuous presence layout. In a cascaded conference, however, the video from SMCU 116b is treated like the video from any other endpoint 114a. Layout 200 shows how the composed video from SMCU 116b (AS, BS, CS, and DS) takes the tile position of a single participant in the cascaded layout. As a result, each participant associated with SMCU 116b gets a smaller screen area (e.g., one quarter of the space normally assigned to a participant in a 2x2 layout).
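The size mismatch described above can be checked with a little arithmetic. The following is a minimal sketch, assuming equal-sized tiles and a 2x2 composite produced by the SMCU; the helper name `tile_fraction` is hypothetical and does not come from the patent.

```python
from fractions import Fraction


def tile_fraction(rows: int, cols: int) -> Fraction:
    """Fraction of the frame covered by one tile in an equal rows x cols grid."""
    return Fraction(1, rows * cols)


# A participant connected directly to the MMCU in a 2x2 layout.
direct = tile_fraction(2, 2)

# A participant inside the SMCU's own 2x2 composite, which itself
# occupies a single tile of the MMCU's 2x2 layout (assumed nesting).
nested = tile_fraction(2, 2) * tile_fraction(2, 2)

print(direct, nested)
```

Under these assumptions a directly connected participant covers one quarter of the frame, while a participant behind the SMCU covers one sixteenth, that is, one quarter of a normal tile.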

One way to correct the difference between the image size of participants associated with MMCU 116a and the image size of participants associated with SMCU 116b is to force SMCU 116b to use a video-switched layout and deliver the video of a single selected participant. The image of the single selected participant is placed in the layout of the cascaded conference. Layout 220 illustrates a snapshot of a 2x2 continuous presence cascaded conference in which SMCU 116b is forced to work in switching mode. In switching mode, SMCU 116b delivers the image of a single selected participant that covers the whole frame. Therefore, when MMCU 116a scales the image down to place it in the continuous presence layout of the cascaded video, the scaled-down image has the same size as the images of the participants associated with MMCU 116a.

This approach corrects the size problem, but it prevents viewing the other participants associated with SMCU 116b, even when their audio energy is higher than that of AM and/or BM and/or DM. There is therefore a need for a system and method for composing video for a cascaded conference in which every participant is evaluated under the same criteria for display in the layout, regardless of whether the participant is associated with the MMCU or with an SMCU.

Summary of the Invention

A system and method for providing a continuous presence layout in a stacked conference is disclosed. According to one embodiment, two or more MCUs are used to create a continuous presence layout in an MCU stacked video conference having one or more MCU connection controller (MCC) modules. During a continuous presence stacked conference, the MCC determines which participant endpoints are presented in the continuous presence layout and in which window each participant should appear. The selection can change dynamically during the conference. The MCC receives information such as the audio energy of each participant in the conference. This information can be delivered automatically and periodically by each MCU associated with the conference, or it can be retrieved by the MCC from each MCU. Based on this information, the MCC determines which endpoints will be connected to which MCU and which endpoint's video will be displayed in each window of the continuous presence layout for each endpoint. The decision is communicated to each MCU and each endpoint involved in the stacked conference.

Brief Description of the Drawings

The exemplary embodiments will be more readily understood by reading the following description and referring to the accompanying drawings, in which:

FIG. 1 illustrates a basic block diagram of a prior art cascaded conference;

FIG. 2 illustrates layouts for displaying continuous presence participants according to the prior art;

FIG. 3 is a block diagram of a stack master MCU with an MCU connection controller (MCC);

FIG. 4 is a block diagram of an MCC; and

FIG. 5 is a flow chart illustrating a process for managing a continuous presence stacked layout using an MCC.

Detailed Description

FIG. 3 is a simplified block diagram of a stack master MMCU 116a that can be used to conduct a multimedia stacked conference with one or more continuous presence layouts according to the present disclosure. MMCU 116a can include a network interface 420, an audio module 430, a control module 440, and a video module 450. Network interface 420 can receive communications from endpoints 114 (FIG. 1) via the associated regional network 110 (FIG. 1) and process them according to a variety of communication standards.

Network interface 420 can be used to receive control and data information from, and transmit control and data information to, other MCUs 116b and/or one or more control servers 112 (FIG. 1). More information about communication between endpoints and/or MCUs over different networks, and information describing signaling, control, compression, and how to set up and carry a video call, can be found, for example, in International Telecommunication Union standards H.320, H.321, H.323, H.261, H.263, and H.264.

Video module 450 receives compressed video from the multiple endpoints 114 associated with MCU 116a via network interface 420. In addition, video module 450 can receive one or more associated continuous presence layouts from the other MCUs 116b involved in the stacked conference and/or can create one or more associated continuous presence layouts for the other MCUs 116b involved in the stacked conference. The video input is processed, composed, and encoded by video module 450. Video module 450 can have multiple input modules 452, multiple output modules 456, and a common interface; in the example shown there are four input modules 452a-452d and five output modules 456a-456e. Some of the input modules 452 and output modules 456 can be associated with endpoints 114, and some can be associated with other MCUs 116b. Each input module 452 can include a video decoder, and each output module 456 can include a video encoder. During a stacked conference, a group of I/O modules (input modules 452 and output modules 456) can be associated with the other MCUs 116b involved in the stacked conference.

The number of groups associated with an MCU 116 depends on the architecture of the conference. For example, if the conference uses a master/slave architecture, one of the MCUs 116 is designated as MMCU 116a, and one or more MCUs 116 are designated as SMCUs 116b. In one embodiment, control module 440 of MMCU 116a includes a control server 112a, which includes an MCU connection controller (MCC) 460 (described in more detail below) for controlling the connection of endpoints 114 to MCUs 116. Although shown here as integrated, MCC 460 can be separate from MMCU 116a.

In an exemplary stacked MCU architecture, MMCU 116a has at least one output module 456 for distributing a bitstream to each of the SMCUs 116b. The at least one output from MMCU 116a to an SMCU 116b can be encoded with the highest fidelity and can include at least one more endpoint video stream than the number to be displayed in the CP layout. This allows input module 452 of SMCU 116b to decode the bitstream received from conference master MMCU 116a, mix the audio and video for each endpoint (e.g., minus the audio and/or video belonging to the receiving endpoint 114), and re-encode the result for delivery to the endpoints 114 associated with conference slave SMCU 116b.

The general functionality of the various components of video module 450 is known in the art and is not described exhaustively herein. A more detailed description can be found in U.S. Patent Publication No. 2002/0188731 and U.S. Patent No. 6,300,973, the contents of which are incorporated herein by reference. This disclosure describes the operation of video module 450 in preparing a stacked continuous presence layout, as described below in conjunction with FIG. 4.

Audio module 430 can receive compressed audio streams from multiple endpoints 114 or SMCUs 116b via network interface 420. Audio module 430 can process the compressed audio streams (including mixing the relevant audio streams) and send the compressed mixed signals back to the endpoints 114 and SMCUs 116b through network interface 420. In an embodiment, the audio stream sent to each endpoint 114 or SMCU 116b can be different. For example, the audio streams sent to different endpoints 114 can be formatted according to different communication standards or according to the needs of the individual endpoints 114. As another example, an audio stream may exclude the voice of the user associated with the endpoint 114 to which the stream is sent, while that voice is included in all other audio streams.
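The per-endpoint mixing described above, in which each endpoint receives every voice except its own, can be sketched as follows. This is a minimal illustration only: it assumes the streams have already been decoded into equal-length lists of samples, and the function name `mix_minus` is hypothetical rather than taken from the patent.

```python
def mix_minus(samples: dict[str, list[float]]) -> dict[str, list[float]]:
    """Return, for each endpoint, the sum of all other endpoints' samples.

    Assumes every stream is already decoded and has the same length.
    """
    if not samples:
        return {}
    length = len(next(iter(samples.values())))
    # Full mix of every endpoint, computed once.
    total = [sum(stream[i] for stream in samples.values()) for i in range(length)]
    # Subtract each endpoint's own contribution from the full mix.
    return {
        endpoint: [total[i] - own[i] for i in range(length)]
        for endpoint, own in samples.items()
    }


streams = {"A": [1.0, 0.0], "B": [2.0, 1.0], "C": [4.0, 0.5]}
mixes = mix_minus(streams)
print(mixes["A"])
```

Computing the full mix once and subtracting each endpoint's own signal avoids re-summing the streams for every recipient; a real mixer would also handle gain control and re-encoding, which this sketch omits.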

Audio module 430 can be adapted to analyze the audio signals received from the endpoints 114 and determine the audio signal energy of each endpoint 114. Information about the signal energy can be passed to control module 440. The energy level can be used as a selection parameter for choosing one or more appropriate endpoints 114 as the source of the audio and/or video to be mixed for the conference; such endpoints are sometimes called "presented endpoints."

MMCU 116a and control server 112a can communicate with each SMCU 116b over an IP network using an API. Control server 112a can execute conferencing applications and can be used to manage reserved calls, ad-hoc calls, call transfer, and virtual meeting rooms, to monitor and control MCUs (e.g., a "web commander"), and so on, as disclosed in the patents and patent applications incorporated by reference above.

Control server 112a can include MCC 460 for managing the composition of the different windows in one or more continuous presence layouts of a stacked conference by managing which endpoints 114 are connected to which MCU 116. In one embodiment, MCC 460 resides within control server 112. In other embodiments, MCC 460 can be a standalone device/module and/or can be associated with more than one MCU 116, with only one MCC 460 active during a stacked conference. MCC 460 is discussed in more detail with reference to FIGS. 4 and 5.

Control module 440 can be a logic unit that controls the operation of MMCU 116a. In addition to the general operations of a typical MCU 116, MMCU 116a can perform additional operations because it has control module 440. Specifically, control module 440 can include a logic module (not shown) for controlling the composition of the continuous presence layout. MCC 460 can process information from the other MCUs 116b involved in the stacked conference as well as information from the endpoints 114 associated with MCU 116a. This information can be used to determine which endpoints 114 will be displayed in the continuous presence layout, by selecting which endpoints 114 will be connected into the regional network 110 of MMCU 116a.

FIG. 4 is a block diagram showing various components of MCC 460. Among other elements, MCC 460 can include a communication module 462, a stacked conference parameter logger 464, a decision module 466, and a stacked architecture database 468. MCC 460 communicates with each MCU 116 involved in the stacked conference via communication module 462 and, if present, control server 112a (FIG. 3). MCC 460 can receive information (parameters) associated with the composition of one or more stacked continuous presence layouts. This information can be collected from the different MCUs 116. The parameters can include the audio energy of the different participants associated with a particular MCU 116 and the window position of each participant in the associated composed continuous presence video generated by that MCU 116. The information from the different MCUs 116 can be collected periodically by MCC 460. In an alternative embodiment, sending the information can be automatic and can be initiated by an MCU 116 upon detecting, for example, a change in one of the parameters.

MCC 460 can send instructions to each SMCU 116b involved in the stacked conference session via communication module 462. The instructions can include selection instructions specifying which endpoints 114 should be transferred to SMCU 116b or which endpoints 114 should be transferred from MMCU 116a. More information detailing the transfer of endpoints 114 can be found in ITU standard H.450.2, which is incorporated herein by reference. Communication module 462 can receive instructions from control server 112a (FIG. 3) regarding the architecture, the MCUs 116b, the endpoints 114, and so on involved in the conference, and can send status information about the stacked conference to control server 112a.

Conference parameter logger 464 is a module in which dynamic parameters of the stacked conference and its associated endpoints 114 and MCUs 116 can be stored. Conference parameter logger 464 can be a cyclic data structure organized by sampling period, and it can hold the last "T" periods, where "T" is in the range of, for example, one to a few tens of periods. Each period's section can include information about the audio energy of each participant, whether the participant's image is included in the current one or more composed associated continuous presence layouts, and so on. The sampling period can be in the range of, for example, a few tens of milliseconds to a few seconds. In one embodiment, sampling of parameters from the different MCUs 116 can be managed by conference parameter logger 464 and can run in parallel with the activity of MCC 460. In an alternative embodiment, conference parameter logger 464 can be managed by decision module 466.
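A cyclic structure that keeps only the last "T" sampling periods can be sketched with a bounded deque. This is an illustrative sketch, not the patent's implementation: the class name, method names, and the choice of fields stored per period are all assumptions.

```python
from collections import deque


class ConferenceParameterLogger:
    """Keeps the most recent T sampling periods; older periods are overwritten."""

    def __init__(self, t_periods: int) -> None:
        # A deque with maxlen discards the oldest entry when full.
        self._periods: deque = deque(maxlen=t_periods)

    def record(self, energies: dict[str, float], displayed: set[str]) -> None:
        """Store one sampling period: per-participant energy and who is on screen."""
        self._periods.append({"energies": dict(energies), "displayed": set(displayed)})

    def last(self, j: int) -> list[dict]:
        """Return up to the j most recent periods, oldest first."""
        if j <= 0:
            return []
        return list(self._periods)[-j:]


log = ConferenceParameterLogger(t_periods=3)
for k in range(4):
    log.record({"AM": float(k)}, {"AM"})
print(len(log.last(3)))
```

With T set to 3, the fourth call to `record` pushes out the first period, so only the three most recent samples remain available to whatever logic consumes them.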

To avoid frequent changes in endpoint connections and in the video images of the presented endpoints 114, parameter logger 464 can collect parameters from "J" sampling periods stored in conference parameter logger 464 and can select the endpoints 114 whose participants are frequently louder than the others. Even if those participants were not the loudest speakers in the last one or two sampling periods, they can be designated as the currently presented endpoints. The value of "J" can be less than or equal to "T." Embodiments of the invention can also use other methods for selecting the presented endpoints.
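One way to pick endpoints that are frequently louder than the others over "J" periods is to count, per period, which endpoints rank among the loudest, and then rank endpoints by that count. The sketch below assumes this counting rule and an alphabetical tie-break; neither is specified by the patent, and `select_presented` is a hypothetical name.

```python
from collections import Counter


def select_presented(periods: list[dict[str, float]], j: int, slots: int) -> list[str]:
    """Pick the endpoints most often among the loudest over the last j periods.

    periods: per-period mapping of endpoint name to audio energy, oldest first.
    slots:   number of windows in the layout (assumed selection size).
    """
    recent = periods[-j:] if j > 0 else []
    wins: Counter = Counter()
    for energies in recent:
        # Loudest endpoints in this period; ties broken by name for determinism.
        loudest = sorted(energies, key=lambda ep: (-energies[ep], ep))[:slots]
        wins.update(loudest)
    ranked = sorted(wins, key=lambda ep: (-wins[ep], ep))
    return ranked[:slots]


history = [
    {"AM": 0.9, "BM": 0.8, "AS": 0.1},
    {"AM": 0.9, "BM": 0.7, "AS": 0.2},
    {"AM": 0.1, "BM": 0.2, "AS": 0.9},
]
print(select_presented(history, j=3, slots=2))
print(select_presented(history, j=1, slots=2))
```

With J = 3, AM keeps its window even though AS was louder in the most recent period; with J = 1 the selection follows the latest period alone, which is the kind of rapid switching the longer window is meant to dampen.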

Architecture database 468 is a database for storing information about each of the endpoints 114 and MCUs 116 in the stacked conference. This information can include the associations between endpoints 114 and MCUs 116 and the associations between the different MCUs 116 used for the current stacked conference. Information about the different stacked layouts that can be requested by the endpoints 114, as well as the selection rules for selecting the appropriate endpoints 114, can also be stored in architecture database 468. In addition, address and alias information for each endpoint 114 can be stored, for example the IP address of each endpoint 114 and MCU 116. Information about the architecture can also be received from control server 112a. Furthermore, architecture database 468 can include dynamic information about the current layout, such as, but not limited to, which endpoints 114 are currently connected and to which MCU 116, and when a new endpoint joins or leaves the conference. This information can be delivered from the relevant MCUs 116.

Decision module 466 manages the composition of the different stacked conference layouts involved in the current stacked continuous presence conference by managing the connections of the different MCUs 116. Based on the conference parameters stored in conference parameter recorder 464 and the layout requirements stored in architecture database 468, decision module 466 may determine which endpoints 114 should be connected to MMCU 116a, and therefore which endpoints 114 should be mixed in each stacked conference layout.

FIG. 5 is a flowchart illustrating a process 600 for establishing and controlling a stacked continuous presence video conference according to the disclosed embodiments. For clarity and simplicity, process 600 is described as being implemented by the active MCC 460 as part of control server 112a in control module 440 (as shown in FIG. 3). Process 600 may be initiated (602) by control server 112a of the stack master MCU 116 at an appropriate start time, for example at the start time of a scheduled conference or when an ad hoc stacked continuous presence conference is started. The stack master MCU 116 manages all of the resources in the stacked conference and may be any one of the MCUs 116 connected to the stacked conference. For clarity and simplicity, and although this is not required, the stack master MCU 116 is assumed to also be the conference master MMCU 116a.

At initiation, the stack master MCU 116 may poll the other MCUs 116 in the conference to determine the amount of resources (for example, the number of video encoders and decoders) that each MCU 116 has. Alternatively, after startup, each MCU 116 will itself report to the stack master MCU 116 the amount of resources it has available.

The stack master MCU 116 then selects (604) the conference master MMCU 116a, which may have the basic resources (that is, the minimum number of encoders and decoders) needed to support the number of endpoints 114 to be displayed in the continuous presence (CP) layout. For example, if a 2x2 CP layout is desired, the number of endpoint video streams to be presented ("N") is four, and "K" may be the total number of endpoints that are to participate in the conference. In one embodiment, the MCU 116 with the largest amount of resources is selected as the conference master MMCU 116a. In another embodiment, the first MCU 116 found to have the minimum amount of resources is selected. The minimum amount of resources may be defined, for example, as N decoders and N+1 encoders (the "basic resources"). If no MCU 116 is determined to have at least the basic resources, a second MCU 116 will be selected as the master conference slave and configured as a stacked or cascaded SMCU 116b.
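
The two selection embodiments can be sketched in Python as follows. This is an illustration under stated assumptions, not the disclosed implementation: the `McuResources` type and the function names are hypothetical, and measuring "largest amount of resources" as decoders plus encoders is an assumption, since the text does not say how resources are compared.

```python
from dataclasses import dataclass


@dataclass
class McuResources:
    """Resources reported by one MCU (hypothetical record type)."""
    name: str
    decoders: int
    encoders: int


def has_basic_resources(mcu, n):
    # "Basic resources" as defined in the text: N decoders and N+1 encoders.
    return mcu.decoders >= n and mcu.encoders >= n + 1


def select_master_largest(mcus, n):
    """First embodiment: the qualifying MCU with the most resources."""
    candidates = [m for m in mcus if has_basic_resources(m, n)]
    if not candidates:
        return None
    # Assumption: resource amount is measured as decoders + encoders.
    return max(candidates, key=lambda m: m.decoders + m.encoders)


def select_master_first_fit(mcus, n):
    """Second embodiment: the first MCU found to have the basic resources."""
    return next((m for m in mcus if has_basic_resources(m, n)), None)
```

Both functions return `None` when no MCU has the basic resources; handling that case (configuring an additional stacked or cascaded SMCU) is left to the caller in this sketch.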

The stack master MMCU 116a may then poll each SMCU 116 for its endpoint information. This information, which may include the IP address of each endpoint, is stored (606) in the stacked architecture database 468 discussed above with reference to FIG. 4. The stack master thus keeps track of each endpoint 114 and of where each endpoint 114 is connected. The endpoints 114 may be ordered as E(1) to E(K).

MCC 460 then directs each SMCU 116b to transfer endpoint connections (608) according to the desired continuous presence layout. For example, if two endpoints 114, E(1) and E(2), are initially connected to MMCU 116a, MCC 460 may direct the MCU 116b controlling endpoints 114 E(3) and E(4) to transfer their connections to MMCU 116a, so that MMCU 116a has at least N endpoints connected to its network area 110. The endpoints 114 initially selected for display in the CP layout may be chosen according to predetermined parameters. In this respect, certain endpoints 114 may be designated as the endpoints 114 to be shown initially in the CP layout. In one embodiment, this is achieved by seeding the value of the audio energy associated with those endpoints 114.

A loop begins at step 612, runs for as long as the stacked continuous presence conference is active, and terminates when the stacked continuous presence conference ends. At step 612, information about each endpoint 114 is retrieved from the different MCUs 116. The information about each endpoint 114 may include its current audio energy and its window position in the relevant layout, if any. The information may be stored in conference parameter recorder 464 (FIG. 4). In embodiments in which conference parameter recorder 464 is a circular buffer, the set of information belonging to the oldest sampling period may be deleted. The information stored in conference parameter recorder 464 may be retrieved and processed (616) by decision module 466 (FIG. 4) to determine whether the layout and/or the connections of the endpoints 114 need to be changed.
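
The circular-buffer behavior of the recorder can be sketched with Python's `collections.deque`, which discards the oldest entry once `maxlen` is reached. The class name `ParameterRecorder`, its methods, and the shape of a sample are assumptions made for this illustration only.

```python
from collections import deque


class ParameterRecorder:
    """Keeps the samples of the most recent sampling periods (illustrative)."""

    def __init__(self, max_periods):
        if max_periods < 1:
            raise ValueError("max_periods must be positive")
        # A deque with maxlen drops the oldest sample automatically.
        self._periods = deque(maxlen=max_periods)

    def record(self, sample):
        # sample: mapping of endpoint id to its parameters for one period.
        self._periods.append(dict(sample))

    def recent(self, count):
        """Return the last `count` samples, oldest first."""
        if count < 1:
            return []
        return list(self._periods)[-count:]

    def __len__(self):
        return len(self._periods)
```

A decision step could then call `recent(j)` once per iteration to obtain the samples it needs without managing deletion itself.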

At step 616, after the presented endpoints to be displayed have been selected, the current mix of each stacked conference layout is examined and a decision is made as to whether the mix of one or more MCUs 116b needs to change. If not, decision module 466 may wait for the next iteration. If a change in the video mix is needed, decision module 466 may determine which MCUs 116 are involved in the current change and in which window of the CP layout the video image of the new endpoint or endpoints is to be placed. Appropriate instructions are sent to the relevant MCUs 116, that is, the MCUs 116 connected to the relevant endpoints. Once the required changes have been made, decision module 466 may wait for the next iteration.

Different methods may be used to determine whether a change is needed. For example, one method may rank the participants from highest to lowest audio energy (the number of participants depends on the number of possible windows in the CP layout, N in this example) and select as presented endpoints the endpoints with the higher audio energy over the last "t0" sampling periods ("t0" may be any number from one to the maximum number of sampling periods stored in conference parameter recorder 464). For example, if the audio energy of endpoints E(1) to E(N), which are connected to MMCU 116a, is greater than the audio energy of endpoints E(N+1) to E(K), no change is needed. Other determinations are possible. Another method may select the N most frequently loudest speakers over all of the sampling periods stored in conference parameter recorder 464; yet another method may add a new participant in place of the weakest speaker selected in the last period. In an alternative embodiment, the presented endpoint that is removed may be the less frequent speaker, and so on. Process 600 may then compare the newly selected participants with the currently displayed participants to determine (620) whether a change is needed in the current layout. If, at step 620, nothing in the current CP layout needs to change, process 600 may return to step 612 and wait for the next iteration.
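
The first method above (rank by audio energy over the last "t0" periods, then compare with what is currently displayed) can be sketched in Python. The function names and data shapes are hypothetical, and summing the energy across the "t0" periods is an assumption; the text does not specify how energy is aggregated over several periods.

```python
def total_energy(history, t0):
    """Sum each endpoint's audio energy over the last t0 sampling periods."""
    if t0 < 1:
        raise ValueError("t0 must be positive")
    totals = {}
    for sample in history[-t0:]:
        for endpoint, energy in sample.items():
            totals[endpoint] = totals.get(endpoint, 0.0) + energy
    return totals


def select_presented(energies, n):
    """Rank endpoints from highest to lowest energy and keep the top n."""
    return sorted(energies, key=lambda ep: (-energies[ep], ep))[:n]


def change_needed(energies, currently_presented, n):
    """True when the newly selected set differs from the displayed set."""
    return set(select_presented(energies, n)) != set(currently_presented)
```

Comparing sets rather than ordered lists means a mere reordering among the N presented endpoints does not trigger a connection change in this sketch; a variant that also tracks window positions would compare ordered lists instead.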

If, at step 620, a change in the CP layout is needed (either in the layout itself or in the selection of one or more participants), and the endpoint that is to be presented (newly displayed) is not connected to MMCU 116a, decision module 466 may determine (624) the changes required in each MCU 116. The changes may include instructions for transferring the newly presented endpoint 114 to network area 110a of MMCU 116a, and for transferring endpoint E(N) (for example, the endpoint 114 among endpoints E(1) to E(N) with the least audio energy). The new organization may be communicated to each MCU 116 over the signaling and control connections. On receiving the new settings, each MCU 116 updates (626) its internal resources accordingly so as to provide the required new audio and video mix. Process 600 then returns to step 612 and performs the next iteration.
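
The transfer decision at step 624 can be sketched as a small Python function. This is only an illustration: the name `plan_transfer`, the returned dictionary, and the choice of the lowest-energy endpoint as the one to move away from the master are assumptions that follow the example given in the text rather than a prescribed rule.

```python
def plan_transfer(energies, connected_to_master, incoming):
    """Return which endpoint joins and which leaves the master's area.

    energies:            mapping of endpoint id to audio energy.
    connected_to_master: endpoint ids currently connected to the master MCU.
    incoming:            endpoint id that should now be presented.
    """
    if incoming in connected_to_master:
        # Already connected to the master; no connection transfer is needed.
        return None
    # Assumption: the endpoint with the least audio energy is moved away.
    outgoing = min(connected_to_master, key=lambda ep: (energies[ep], ep))
    return {"to_master": incoming, "from_master": outgoing}
```

The returned plan would then be turned into instructions for the MCUs involved; how those instructions are encoded on the signaling and control connections is outside this sketch.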

The conference master MCU 116a encodes the CP layout with the additional encoder and sends the encoded audio/video stream to all conference slave MCUs 116b. The encoder uses the highest parameters of the conference for the encoding. A slave MCU 116b may then decode the stream received from the master MCU 116a using one of the decoders allocated to the endpoints 114 in that slave MCU 116b. The slave MCU 116b then re-encodes the stream using the encoder allocated to each endpoint 114 associated with the slave MCU 116b, producing a stream suited to that endpoint 114.

In this disclosure, the words "unit", "element" and "module" may be used interchangeably. Anything designated as a unit or module may be a stand-alone unit or a dedicated or integrated module. A unit or module may be modular or have modular aspects that allow it to be easily removed and replaced with another similar unit or module. Each unit or module may be any one of, or any combination of, software, hardware and/or firmware.

In the description and claims of this disclosure, "comprise", "include", "have" and their conjugates are used to indicate that the object or objects of the verb are not necessarily a complete listing of the members, components, elements or parts of the subject or subjects of the verb. Those skilled in the art will appreciate that the subject matter of this disclosure may be implemented as software for an MCU 116, as additional hardware added to an MCU 116, or as additional software/hardware distributed among the MCUs 116.

It will be appreciated that the apparatus, systems and methods described above may be varied in many ways, including changing the order of steps and the exact implementation used. The described embodiments include different features, not all of which are required in all embodiments of this disclosure. Moreover, some embodiments of this disclosure use only some of the features or possible combinations of the features. Different combinations of the features noted in the described embodiments will readily occur to those skilled in the art. Furthermore, some embodiments of this disclosure may be implemented by combining features and elements that have been described in association with different embodiments of this disclosure.

Claims (18)

1. A method of preparing a video layout for a video conference, comprising:
selecting a first multipoint control unit of a plurality of multipoint control units as a master multipoint control unit;
receiving, from a second multipoint control unit of the plurality of multipoint control units, information associated with a plurality of endpoints;
assigning a first endpoint of the plurality of endpoints to the first multipoint control unit;
assigning a second endpoint of the plurality of endpoints to the second multipoint control unit;
receiving, by the first multipoint control unit, parameters regarding the plurality of endpoints, the parameters being audio parameters and associated with the composition of the video layout; and
in response to a change in the parameters, reassigning the first endpoint to the second multipoint control unit and reassigning the second endpoint to the first multipoint control unit, thereby changing the video layout.

2. The method of claim 1, further comprising:
analyzing the parameters and determining an order of the audio energy of the first endpoint and the second endpoint from highest to lowest; and
determining a change in the order of the audio energy from highest to lowest.

3. The method of claim 1, further comprising:
in response to reassigning the second endpoint to the first multipoint control unit, determining instructions for mixing audio and video for the first multipoint control unit.

4. The method of claim 1, further comprising:
in response to reassigning the first endpoint to the second multipoint control unit, determining instructions for mixing audio and video for the second multipoint control unit; and
sending the instructions for mixing audio and video from the first multipoint control unit to the second multipoint control unit.

5. The method of claim 1, wherein reassigning, in response to a change in the parameters, the first endpoint to the second multipoint control unit and the second endpoint to the first multipoint control unit comprises:
sending, from the first multipoint control unit to the second multipoint control unit, an instruction to transfer an endpoint connection.

6. The method of claim 1, wherein reassigning, in response to a change in the parameters, the first endpoint to the second multipoint control unit and the second endpoint to the first multipoint control unit comprises:
sending, from the first multipoint control unit to the second multipoint control unit, instructions for mixing audio and video.

7. A multipoint control unit for video conferencing, comprising:
means for identifying the multipoint control unit as a master multipoint control unit of a video conference;
means for receiving information associated with a plurality of endpoints from a second multipoint control unit identified as a slave multipoint control unit;
means for assigning a first endpoint of the plurality of endpoints to the master multipoint control unit;
means for assigning a second endpoint of the plurality of endpoints to the slave multipoint control unit;
means for receiving, by the master multipoint control unit, parameters regarding the plurality of endpoints, the parameters being audio parameters and associated with the composition of the video layout; and
means for reassigning, in response to a change in the parameters, the first endpoint to the slave multipoint control unit and the second endpoint to the master multipoint control unit, thereby changing the video layout of the video conference.

8. The multipoint control unit of claim 7, further comprising:
means for analyzing the parameters and determining an order of the audio energy of the first endpoint and the second endpoint from highest to lowest; and
means for determining a change in the order of the audio energy from highest to lowest.

9. The multipoint control unit of claim 7, further comprising:
means for determining, in response to reassigning the second endpoint to the master multipoint control unit, instructions for mixing audio and video for the master multipoint control unit.

10. The multipoint control unit of claim 7, further comprising:
means for determining, in response to reassigning the first endpoint to the slave multipoint control unit, instructions for mixing audio and video for the slave multipoint control unit; and
means for sending the instructions for mixing audio and video from the master multipoint control unit to the slave multipoint control unit.

11. The multipoint control unit of claim 7, wherein the means for reassigning, in response to a change in the parameters, the first endpoint to the slave multipoint control unit and the second endpoint to the master multipoint control unit, thereby changing the video layout of the video conference, is further configured to:
send, from the master multipoint control unit to the slave multipoint control unit, an instruction to transfer an endpoint connection.

12. The multipoint control unit of claim 7, wherein the means for reassigning, in response to a change in the parameters, the first endpoint to the slave multipoint control unit and the second endpoint to the master multipoint control unit, thereby changing the video layout of the video conference, is further configured to:
send, from the master multipoint control unit to the slave multipoint control unit, instructions for mixing audio and video.

13. A multipoint control unit connection controller for video conferencing, comprising:
a communication module configured to:
receive instructions associated with a video conference and with the endpoints and multipoint control units involved in the video conference; and
transmit status information about the video conference;
a decision module configured to:
assign a first endpoint of a plurality of endpoints in the video conference to a master multipoint control unit of a plurality of multipoint control units of the video conference,
assign a second endpoint of the plurality of endpoints to a second multipoint control unit of the plurality of multipoint control units, and
in response to a change in parameters received by the master multipoint control unit from the plurality of endpoints, reassign the first endpoint to the second multipoint control unit and reassign the second endpoint to the master multipoint control unit, thereby changing the video layout of the video conference, the parameters being audio parameters and associated with the composition of the video layout;
a stacked conference parameter recorder configured to:
collect parameters of the video conference; and
store the parameters; and
a stacked conference architecture database configured to:
store information about the endpoints and multipoint control units involved in the video conference.

14. The multipoint control unit connection controller of claim 13, wherein the stacked conference parameter recorder is further configured to:
sample conference parameters during a sampling period.

15. The multipoint control unit connection controller of claim 14, wherein the sampled conference parameters include the audio energy of each participant in the video conference.

16. The multipoint control unit connection controller of claim 13, wherein the stacked conference parameter recorder is further configured to:
select endpoints whose associated participants in the video conference are frequently louder than other participants.

17. The multipoint control unit connection controller of claim 16, wherein the selected endpoint is selected even if it was not a louder speaker in the most recent sampling period.

18. The multipoint control unit connection controller of claim 13, wherein the stacked conference architecture database includes:
information defining the associations between the endpoints and the multipoint control units of the video conference; and
information defining the associations between the multipoint control units of the video conference.

HK16107880.0A 2014-06-30 2016-07-07 Method of video layout and processing in mcu stacking HK1220059B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201462019269P 2014-06-30 2014-06-30
US62/019269 2014-06-30

Publications (2)

Publication Number Publication Date
HK1220059A1 HK1220059A1 (en) 2017-04-21
HK1220059B true HK1220059B (en) 2021-06-25
