[go: up one dir, main page]

CN110475094B - Video conference processing method, device and readable storage medium - Google Patents

Video conference processing method, device and readable storage medium Download PDF

Info

Publication number
CN110475094B
CN110475094B CN201910770400.3A CN201910770400A CN110475094B CN 110475094 B CN110475094 B CN 110475094B CN 201910770400 A CN201910770400 A CN 201910770400A CN 110475094 B CN110475094 B CN 110475094B
Authority
CN
China
Prior art keywords
video
stream
signaling
networking
double
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910770400.3A
Other languages
Chinese (zh)
Other versions
CN110475094A (en
Inventor
乔金龙
杨春晖
王艳辉
沈军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Visionvera Information Technology Co Ltd
Original Assignee
Visionvera Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Visionvera Information Technology Co Ltd filed Critical Visionvera Information Technology Co Ltd
Priority to CN201910770400.3A priority Critical patent/CN110475094B/en
Publication of CN110475094A publication Critical patent/CN110475094A/en
Application granted granted Critical
Publication of CN110475094B publication Critical patent/CN110475094B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

本发明实施例提供了一种视频会议处理方法、装置及可读存储介质,方法应用于视频会议系统,视频会议系统包括互联网终端、会话初始协议SIP协转服务器、视联网服务器及视联网终端,方法包括:互联网终端向SIP协转服务器发送双流BFCP信令;SIP协转服务器将双流BFCP信令转换为视联网双流请求信令,并将视联网双流请求信令发送给视联网服务器;视联网服务器将视联网双流请求信令转发给视联网终端;并根据视联网终端返回的视联网双流回复信令,开启双视频流传输模式,双视频流传输模式用于将互联网终端采集的主视频流及辅视频流分别独立传输至视联网终端。实现了视联网终端和支持SIP协议的互联网终端在视频会议中进行双视频流的传输。

Figure 201910770400

Embodiments of the present invention provide a video conference processing method, device, and readable storage medium. The method is applied to a video conference system. The video conference system includes an Internet terminal, a session initiation protocol SIP protocol transfer server, a video networking server, and a video networking terminal. The method includes: the Internet terminal sends dual-stream BFCP signaling to the SIP protocol transfer server; the SIP protocol transfer server converts the dual-stream BFCP signaling into the video networking dual-stream request signaling, and sends the video networking dual-stream request signaling to the video networking server; the video networking server; The server forwards the video networking dual-stream request signaling to the video networking terminal; and according to the video networking dual-stream reply signaling returned by the video networking terminal, the dual video stream transmission mode is enabled, and the dual video stream transmission mode is used for the main video stream collected by the Internet terminal. and auxiliary video streams are independently transmitted to the video network terminal. It realizes the transmission of dual video streams in the video conference between the video network terminal and the Internet terminal supporting the SIP protocol.

Figure 201910770400

Description

Video conference processing method and device and readable storage medium
Technical Field
The present invention relates to the field of data processing technologies, and in particular, to a video conference processing method, a video conference processing apparatus, and a computer-readable storage medium.
Background
Currently, with the popularization and development of the video networking service in the whole country, the video networking high-definition video networking interaction technology plays a significant role in other industries already in government departments. The video networking adopts the most advanced worldwide Vision Vera real-time high-definition video exchange technology, realizes the real-time transmission of the whole-network high-definition video which cannot be realized by the current Internet, integrates dozens of services such as high-definition video conferences, video monitoring, remote training, intelligent monitoring analysis, emergency command, video telephone, live broadcast, television mails, information distribution and the like into a system platform, and realizes the real-time interconnection and intercommunication of high-definition quality video communication through various terminal devices.
At present, the applicant realizes that a video network Protocol terminal watches H239 double-stream video of an internet terminal of an H323 Protocol by using a Protocol conversion server of an H323 Protocol version, but there are many internet terminals, and some internet terminals are not compatible with the H323 Protocol, so that transmission of double-stream video between the video network terminal and an internet terminal of an SIP (Session Initiation Protocol) Protocol cannot be realized in the video network. Therefore, in the current video conference, the H323 protocol can only be installed on the internet terminal of the SIP protocol to implement the dual video stream transmission, which increases the cost of the video conference, and the transmission process of the video stream is more complicated because the H323 protocol performs the dual video stream transmission in a mixed screen manner.
Disclosure of Invention
In view of the above problems, embodiments of the present invention are proposed to provide a video conference processing method, a video conference processing apparatus, and a computer-readable storage medium that overcome or at least partially solve the above problems.
In a first aspect of the embodiments of the present invention, a video conference processing method is provided, where the method is applied to a video conference system, the video conference system includes an internet terminal, a session initiation protocol SIP protocol conversion server, a video networking server, and a video networking terminal, and the method includes:
the internet terminal sends a double-flow binary speech right control protocol BFCP signaling to the SIP protocol conversion server according to the double-flow operation of the user in the current video conference;
the SIP protocol conversion server converts the double-flow BFCP signaling into a video networking double-flow request signaling which can be analyzed by the video networking server and the video networking terminal, and sends the video networking double-flow request signaling to the video networking server;
the video networking server forwards the video networking double-flow request signaling to the video networking terminal;
the video networking terminal sends a video networking double-flow reply signaling to the video networking server aiming at the video networking double-flow request signaling;
and the video networking server starts a dual-video stream transmission mode according to the video networking dual-stream reply signaling, wherein the dual-video stream transmission mode is used for independently transmitting the main video stream and the auxiliary video stream acquired by the Internet terminal to the video networking terminal respectively.
In a second aspect of the embodiments of the present invention, another video conference processing method is provided, where the method is applied to an SIP protocol conversion server in a video conference system, the video conference system further includes a video networking terminal, a video networking server, and an internet terminal, and the method includes:
receiving a double-flow binary speech right control protocol (BFCP) signaling sent by the Internet terminal, wherein the double-flow BFCP signaling is generated by the Internet terminal according to double-flow operation of a user in a currently-performed video conference;
converting the double-flow BFCP signaling into a video networking double-flow request signaling which can be analyzed by the video networking server and the video networking terminal;
sending the video networking double-flow request signaling to the video networking server; the video networking server is used for forwarding the video networking double-stream request signaling to the video networking terminal, and starting a double-video stream transmission mode according to a video networking double-stream reply signaling returned by the video networking terminal aiming at the video networking double-stream request signaling, wherein the double-video stream mode is used for independently transmitting a main video stream and an auxiliary video stream acquired by the internet terminal to the video networking terminal respectively.
In a third aspect of the embodiments of the present invention, a video conference processing apparatus is provided, where the apparatus is applied to a video conference system, the video conference system includes a video networking terminal, a video networking server, an SIP protocol conversion server, and an internet terminal, and the apparatus includes:
a double-flow BECP signaling sending module, configured to send a double-flow binary floor control protocol BFCP signaling to the SIP protocol server according to a double-flow operation performed by a user in a currently performed video conference;
the signaling conversion module is used for converting the double-flow BFCP signaling into a video networking double-flow request signaling which can be analyzed by the video networking server and the video networking terminal, and sending the video networking double-flow request signaling to the video networking server;
the signaling forwarding module is used for forwarding the video networking double-flow request signaling to the video networking terminal;
the reply signaling module is used for sending a video networking double-flow reply signaling to the video networking server aiming at the video networking double-flow request signaling;
and the double video stream transmission mode starting module is used for starting a double video stream transmission mode according to the video networking double-stream reply signaling, and the double video stream transmission mode is used for respectively and independently transmitting the main video stream and the auxiliary video stream acquired by the internet terminal to the video networking terminal.
In a fourth aspect of the embodiments of the present invention, a computer-readable storage medium is provided, on which a computer program is stored, which when executed by a processor implements the steps in the video processing method.
The embodiment of the invention has the following advantages:
the internet terminal sends the double-flow BFCP signaling to the SIP protocol conversion server, and the SIP protocol conversion server converts the double-flow BFCP signaling into the video network double-flow request signaling, so that the video network double-flow request signaling can be analyzed by the video network server and the video network terminal in the video network, the video network terminal can reply the video network double-flow reply signaling to the video network server according to the video network double-flow request signaling, and the video network server can start a double-video-stream transmission mode according to the video network double-flow reply signaling, so that the internet terminal can independently send the collected main video stream and the collected auxiliary video stream to the video network terminal. The internet terminal and the video network terminal adopting the SIP protocol can also transmit double video streams in the video conference without being compatible with the H323 protocol, thereby reducing the cost of the video conference, leading the transmission process of the video streams to be more concise and widening the application range of the video network.
Drawings
FIG. 1 is a schematic networking diagram of a video network of the present invention;
FIG. 2 is a schematic diagram of a hardware architecture of a node server according to the present invention;
fig. 3 is a schematic diagram of a hardware structure of an access switch of the present invention;
fig. 4 is a schematic diagram of a hardware structure of an ethernet protocol conversion gateway according to the present invention;
fig. 5 is an environment architecture diagram of a video conference system to which a video conference processing method according to an embodiment of the present invention is applied;
FIG. 6 is a flow chart of the steps of a method of video conference processing according to an embodiment of the present invention;
fig. 7 is a flowchart illustrating an example of a video conference processing method according to an embodiment of the present invention;
fig. 8 is a schematic diagram of a dual video stream transmission mode of an example of a video conference processing method according to an embodiment of the present invention;
FIG. 9 is a flow chart of steps in another video conference processing method in accordance with an embodiment of the present invention;
fig. 10 is a schematic structural diagram of an embodiment of a video conference processing apparatus according to the present invention;
FIG. 11 is a schematic structural diagram of another embodiment of a video conference processing apparatus according to the present invention
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
The video networking is an important milestone for network development, is a real-time network, can realize high-definition video real-time transmission, and pushes a plurality of internet applications to high-definition video, and high-definition faces each other.
The video networking adopts a real-time high-definition video exchange technology, can integrate required services such as dozens of services of video, voice, pictures, characters, communication, data and the like on a system platform on a network platform, such as high-definition video conference, video monitoring, intelligent monitoring analysis, emergency command, digital broadcast television, delayed television, network teaching, live broadcast, VOD on demand, television mail, Personal Video Recorder (PVR), intranet (self-office) channels, intelligent video broadcast control, information distribution and the like, and realizes high-definition quality video broadcast through a television or a computer.
To better understand the embodiments of the present invention, the following description refers to the internet of view:
some of the technologies applied in the video networking are as follows:
network technology (network technology)
Network technology innovation in video networking has improved the traditional Ethernet (Ethernet) to face the potentially huge first video traffic on the network. Unlike pure network Packet Switching (Packet Switching) or network Circuit Switching (Circuit Switching), the Packet Switching is adopted by the technology of the video networking to meet the Streaming requirement. The video networking technology has the advantages of flexibility, simplicity and low price of packet switching, and simultaneously has the quality and safety guarantee of circuit switching, thereby realizing the seamless connection of the whole network switching type virtual circuit and the data format.
Switching Technology (Switching Technology)
The video network adopts two advantages of asynchronism and packet switching of the Ethernet, eliminates the defects of the Ethernet on the premise of full compatibility, has end-to-end seamless connection of the whole network, is directly communicated with a user terminal, and directly bears an IP data packet. The user data does not require any format conversion across the entire network. The video networking is a higher-level form of the Ethernet, is a real-time exchange platform, can realize the real-time transmission of the whole-network large-scale high-definition video which cannot be realized by the existing Internet, and pushes a plurality of network video applications to high-definition and unification.
Server technology (Servertechnology)
The server technology on the video networking and unified video platform is different from the traditional server, the streaming media transmission of the video networking and unified video platform is established on the basis of connection orientation, the data processing capacity of the video networking and unified video platform is independent of flow and communication time, and a single network layer can contain signaling and data transmission. For voice and video services, the complexity of video networking and unified video platform streaming media processing is much simpler than that of data processing, and the efficiency is greatly improved by more than one hundred times compared with that of a traditional server.
Storage Technology (Storage Technology)
The super-high speed storage technology of the unified video platform adopts the most advanced real-time operating system in order to adapt to the media content with super-large capacity and super-large flow, the program information in the server instruction is mapped to the specific hard disk space, the media content is not passed through the server any more, and is directly sent to the user terminal instantly, and the general waiting time of the user is less than 0.2 second. The optimized sector distribution greatly reduces the mechanical motion of the magnetic head track seeking of the hard disk, the resource consumption only accounts for 20% of that of the IP internet of the same grade, but concurrent flow which is 3 times larger than that of the traditional hard disk array is generated, and the comprehensive efficiency is improved by more than 10 times.
Network Security Technology (Network Security Technology)
The structural design of the video network completely eliminates the network security problem troubling the internet structurally by the modes of independent service permission control each time, complete isolation of equipment and user data and the like, generally does not need antivirus programs and firewalls, avoids the attack of hackers and viruses, and provides a structural carefree security network for users.
Service Innovation Technology (Service Innovation Technology)
The unified video platform integrates services and transmission, and is not only automatically connected once whether a single user, a private network user or a network aggregate. The user terminal, the set-top box or the PC are directly connected to the unified video platform to obtain various multimedia video services in various forms. The unified video platform adopts a menu type configuration table mode to replace the traditional complex application programming, can realize complex application by using very few codes, and realizes infinite new service innovation.
Networking of the video network is as follows:
the video network is a centralized control network structure, and the network can be a tree network, a star network, a ring network and the like, but on the basis of the centralized control node, the whole network is controlled by the centralized control node in the network.
As shown in fig. 1, the video network is divided into an access network and a metropolitan network.
The devices of the access network part can be mainly classified into 3 types: node server, access switch, terminal (including various set-top boxes, coding boards, memories, etc.). The node server is connected to an access switch, which may be connected to a plurality of terminals and may be connected to an ethernet network.
The node server is a node which plays a centralized control function in the access network and can control the access switch and the terminal. The node server can be directly connected with the access switch or directly connected with the terminal.
Similarly, devices of the metropolitan network portion may also be classified into 3 types: a metropolitan area server, a node switch and a node server. The metro server is connected to a node switch, which may be connected to a plurality of node servers.
The node server is a node server of the access network part, namely the node server belongs to both the access network part and the metropolitan area network part.
The metropolitan area server is a node which plays a centralized control function in the metropolitan area network and can control a node switch and a node server. The metropolitan area server can be directly connected with the node switch or directly connected with the node server.
Therefore, the whole video network is a network structure with layered centralized control, and the network controlled by the node server and the metropolitan area server can be in various structures such as tree, star and ring.
The access network part can form a unified video platform (the part in the dotted circle), and a plurality of unified video platforms can form a video network; each unified video platform may be interconnected via metropolitan area and wide area video networking.
Video networking device classification
1.1 devices in the video network of the embodiment of the present invention can be mainly classified into 3 types: server, exchanger (including Ethernet protocol conversion gateway), terminal (including various set-top boxes, code board, memory, etc.). The video network as a whole can be divided into a metropolitan area network (or national network, global network, etc.) and an access network.
1.2 wherein the devices of the access network part can be mainly classified into 3 types: node server, access exchanger (including Ethernet protocol conversion gateway), terminal (including various set-top boxes, coding board, memory, etc.).
The specific hardware structure of each access network device is as follows:
a node server:
as shown in fig. 2, the system mainly includes a network interface module 201, a switching engine module 202, a CPU module 203, and a disk array module 204;
the network interface module 201, the CPU module 203, and the disk array module 204 all enter the switching engine module 202; the switching engine module 202 performs an operation of looking up the address table 205 on the incoming packet, thereby obtaining the direction information of the packet; and stores the packet in a queue of the corresponding packet buffer 206 based on the packet's steering information; if the queue of the packet buffer 206 is nearly full, it is discarded; the switching engine module 202 polls all packet buffer queues for forwarding if the following conditions are met: 1) the port send buffer is not full; 2) the queue packet counter is greater than zero. The disk array module 204 mainly implements control over the hard disk, including initialization, read-write, and other operations on the hard disk; the CPU module 203 is mainly responsible for protocol processing with an access switch and a terminal (not shown in the figure), configuring an address table 205 (including a downlink protocol packet address table, an uplink protocol packet address table, and a data packet address table), and configuring the disk array module 204.
The access switch:
as shown in fig. 3, the network interface module mainly includes a network interface module (a downlink network interface module 301 and an uplink network interface module 302), a switching engine module 303 and a CPU module 304;
wherein, the packet (uplink data) coming from the downlink network interface module 301 enters the packet detection module 305; the packet detection module 305 detects whether the Destination Address (DA), the Source Address (SA), the packet type, and the packet length of the packet meet the requirements, and if so, allocates a corresponding stream identifier (stream-id) and enters the switching engine module 303, otherwise, discards the stream identifier; the packet (downstream data) coming from the upstream network interface module 302 enters the switching engine module 303; the incoming data packet of the CPU module 304 enters the switching engine module 303; the switching engine module 303 performs an operation of looking up the address table 306 on the incoming packet, thereby obtaining the direction information of the packet; if the packet entering the switching engine module 303 is from the downstream network interface to the upstream network interface, the packet is stored in the queue of the corresponding packet buffer 307 in association with the stream-id; if the queue of the packet buffer 307 is nearly full, it is discarded; if the packet entering the switching engine module 303 is not from the downlink network interface to the uplink network interface, the data packet is stored in the queue of the corresponding packet buffer 307 according to the guiding information of the packet; if the queue of the packet buffer 307 is nearly full, it is discarded.
The switching engine module 303 polls all packet buffer queues and may include two cases:
if the queue is from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) the port send buffer is not full; 2) the queued packet counter is greater than zero; 3) obtaining a token generated by a code rate control module;
if the queue is not from the downlink network interface to the uplink network interface, the following conditions are met for forwarding: 1) The port send buffer is not full; 2) the queue packet counter is greater than zero.
The rate control module 308 is configured by the CPU module 304, and generates tokens for packet buffer queues from all downstream network interfaces to upstream network interfaces at programmable intervals to control the rate of upstream forwarding.
The CPU module 304 is mainly responsible for protocol processing with the node server, configuration of the address table 306, and configuration of the code rate control module 308.
Ethernet protocol conversion gateway
As shown in fig. 4, the apparatus mainly includes a network interface module (a downlink network interface module 401 and an uplink network interface module 402), a switching engine module 403, a CPU module 404, a packet detection module 405, a rate control module 408, an address table 406, a packet buffer 407, a MAC adding module 409, and a MAC deleting module 410.
Wherein, the data packet coming from the downlink network interface module 401 enters the packet detection module 405; the packet detection module 405 detects whether the ethernet MAC DA, the ethernet MAC SA, the ethernet length or frame type, the video network destination address DA, the video network source address SA, the video network packet type, and the packet length of the packet meet the requirements, and if so, allocates a corresponding stream identifier (stream-id); then, the MAC deletion module 410 subtracts MAC DA, MAC SA, length or frame type (2byte) and enters the corresponding receiving buffer, otherwise, discards it;
the downlink network interface module 401 detects the sending buffer of the port, and if there is a packet, obtains the ethernet MAC DA of the corresponding terminal according to the destination address DA of the packet, adds the ethernet MAC DA of the terminal, the MAC SA of the ethernet protocol gateway, and the ethernet length or frame type, and sends the packet.
The other modules in the ethernet protocol gateway function similarly to the access switch.
A terminal:
the system mainly comprises a network interface module, a service processing module and a CPU module; for example, the set-top box mainly comprises a network interface module, a video and audio coding and decoding engine module and a CPU module; the coding board mainly comprises a network interface module, a video and audio coding engine module and a CPU module; the memory mainly comprises a network interface module, a CPU module and a disk array module.
1.3 devices of the metropolitan area network part can be mainly classified into 2 types: node server, node exchanger, metropolitan area server. The node switch mainly comprises a network interface module, a switching engine module and a CPU module; the metropolitan area server mainly comprises a network interface module, a switching engine module and a CPU module.
2. Video networking packet definition
2.1 Access network packet definition
The data packet of the access network mainly comprises the following parts: destination Address (DA), Source Address (SA), reserved bytes, payload (pdu), CRC.
As shown in the following table, the data packet of the access network mainly includes the following parts:
DA SA Reserved Payload CRC
wherein:
the Destination Address (DA) is composed of 8 bytes (byte), the first byte represents the type of the data packet (such as various protocol packets, multicast data packets, unicast data packets, etc.), there are 256 possibilities at most, the second byte to the sixth byte are metropolitan area network addresses, and the seventh byte and the eighth byte are access network addresses;
the Source Address (SA) is also composed of 8 bytes (byte), defined as the same as the Destination Address (DA);
the reserved byte consists of 2 bytes;
the payload part has different lengths according to different types of datagrams, and is 64 bytes if the datagram is various types of protocol packets, and is 32+1024 or 1056 bytes if the datagram is a unicast packet, of course, the length is not limited to the above 2 types;
the CRC consists of 4 bytes and is calculated in accordance with the standard ethernet CRC algorithm.
2.2 metropolitan area network packet definition
The topology of a metropolitan area network is a graph and there may be 2, or even more than 2, connections between two devices, i.e., there may be more than 2 connections between a node switch and a node server, a node switch and a node switch, and a node switch and a node server. However, the metro network address of the metro network device is unique, and in order to accurately describe the connection relationship between the metro network devices, parameters are introduced in the embodiment of the present invention: a label to uniquely describe a metropolitan area network device.
In this specification, the definition of the Label is similar to that of the Label of MPLS (Multi-Protocol Label Switch), and assuming that there are two connections between the device a and the device B, there are 2 labels for the packet from the device a to the device B, and 2 labels for the packet from the device B to the device a. The label is classified into an incoming label and an outgoing label, and assuming that the label (incoming label) of the packet entering the device a is 0x0000, the label (outgoing label) of the packet leaving the device a may become 0x 0001. The network access process of the metro network is a network access process under centralized control, that is, address allocation and label allocation of the metro network are both dominated by the metro server, and the node switch and the node server are both passively executed, which is different from label allocation of MPLS, and label allocation of MPLS is a result of mutual negotiation between the switch and the server.
As shown in the following table, the data packet of the metro network mainly includes the following parts:
DA SA Reserved label (R) Payload CRC
Namely Destination Address (DA), Source Address (SA), Reserved byte (Reserved), tag, payload (pdu), CRC. The format of the tag may be defined by reference to the following: the tag is 32 bits with the upper 16 bits reserved and only the lower 16 bits used, and its position is between the reserved bytes and payload of the packet.
Based on the characteristics of the video network, the applicant firstly realizes the high-definition video conference among a plurality of video network terminals in the video network, and then realizes the high-definition video conference between the video network terminals in the video network and the internet terminals in the internet based on the increase of the demands of vast users on the high-definition video conference, and further constructs a realization scheme for the video conference of the video network terminals and the internet terminals to carry out double video streams based on some application scenes, for example, the video network terminals watch the double video streams of the internet terminals.
However, in the actually performed dual-video-stream video conference service, the applicant finds that the constructed technical solution is only applicable to internet terminals supporting the H323 protocol. The H323 protocol is an audio-video communication protocol, is a standard protocol cluster specified by the International Telecommunications Union (ITU), can support point-to-point or point-to-multipoint communication of audio, video and data, and can implement a video conference on an existing communication network. In practice, if a plurality of video streams need to be transmitted simultaneously in a video conference, the internet terminal supporting H323 mixes two video streams and transmits the mixed video streams through one video stream transmission channel, so that the video networking terminal performing the video conference with the internet terminal receives the mixed video streams.
With the continuous expansion of the service scope, the following application scenarios are encountered in the application process: the video networking terminal needs to be docked with the internet terminal supporting the SIP protocol double-stream, the video networking terminal can watch the double-stream video sent by the internet terminal based on the SIP protocol conveniently in the video conference, and the interactivity and flexibility of the video conference are improved by watching two different video streams sent by the internet terminal. Since the technical solution currently constructed by the applicant supports the H323 protocol, in order to implement this application scenario, the H323 protocol needs to be deployed on the internet terminal as well. This process is complex and time consuming for the user, making the user experience poor and also increasing the cost of conducting the video conference.
Therefore, the applicant improves the existing technical scheme, so that when a video conference is carried out between a video networking terminal and an internet terminal only supporting the SIP protocol, the transmission of double video streams with the internet terminal can be realized.
Based on the above, the applicant proposes one of the core concepts of the embodiments of the present invention, in a video conference system, a BFCP signaling sent by an internet terminal is converted into a video networking dual-stream request signaling that can be analyzed by a video networking server and a video networking terminal, so that the video networking terminal and the internet terminal of an SIP protocol can perform negotiation and transmission of dual video streams, thereby implementing dual video stream transmission in the video conference between the video networking terminal and the internet terminal of the SIP protocol, and further solving the problem that dual video stream transmission cannot be performed by the video networking terminal and the internet terminal of the SIP protocol in the current video networking video conference.
Referring to fig. 5, an environment architecture diagram of a video conference system to which a video conference processing method according to an embodiment of the present invention is applied is shown, where the video conference system includes an internet terminal, a session initiation protocol SIP protocol conversion server, a video networking server, and a video networking terminal.
In the embodiment of the present invention, the internet terminal may be, but is not limited to, the following devices: smart phones, tablets, personal computers, and the like, the video networking terminal may be, but is not limited to, the following devices: the eye is an aurora series device. The video network aurora series equipment is used for 1080P double-current video conference and command, and supports all services of the video network, such as monitoring video services, video conference services and the like. The internet terminal is a terminal supporting the SIP protocol.
The video networking server can provide video networking technical service for users and support the operation of audio and video services in the video networking. The sip protocol conversion server is responsible for communication, conversion of streaming media protocol, audio and video data packet encapsulation and other services in the video network.
In the video conference system, the video network terminal, the video network server and the SIP protocol conversion server are communicated through a video network protocol, and the SIP protocol conversion server and the Internet terminal are communicated through an Internet protocol. The SIP protocol conversion server can be understood as a bridging server between the video network and the Internet, and realizes the intercommunication between the video network and the Internet.
It should be noted that the video networking protocol described in the embodiment of the present invention is different from the internet protocol, and is a communication protocol which is independently developed by visual linkage and operates in the video networking, and is mainly a protocol developed based on the data link information exchange function of the two-layer switch.
Referring to fig. 6, a flowchart illustrating steps of a video conference processing method according to an embodiment of the present invention is shown, which may specifically include the following steps:
step 601, the internet terminal sends a dual-stream binary speech right control protocol BFCP signaling to the SIP protocol server according to the dual-stream operation performed by the user in the current video conference.
In this embodiment, the currently performed video conference is an audio and video conference in which the video networking terminal and the internet terminal currently participate, the dual-stream operation performed by the user may be a mouse click operation or a finger touch operation performed in a video conference interface displayed by the internet terminal of the user, and the dual-stream operation may be understood as an operation performed when the user wants to send a dual video stream.
In practice, when a user performs a dual-stream operation, the internet terminal may send a dual-stream BFCP signaling to the SIP coordination server, where the dual-stream BFCP signaling is a signaling conforming to an internet protocol, and in practice, the dual-stream BFCP signaling may be understood as a signaling for controlling the starting of the SIP dual stream in a video stream sent by the SIP protocol. The BFCP is applied to the SIP dual stream, and the opening and closing of the SIP dual stream are mainly controlled by the BFCP message.
Step 602, the SIP protocol conversion server converts the dual-stream BFCP signaling into a dual-stream request signaling for the video network, which can be analyzed by the video network server and the video network terminal, and sends the dual-stream request signaling for the video network to the video network server.
In this embodiment, the SIP protocol conversion server converts the dual-stream BFCP signaling into a dual-stream request signaling for video networking, which may be understood as converting the dual-stream BFCP signaling of the internet protocol into a dual-stream request signaling for video networking conforming to the video networking protocol. If the dual-stream BFCP signaling is the signaling of the BFCPFloorrequest message, the BFCPFloorrequest message indicates a request to send the dual-stream video, then the dual-stream request signaling 8e01, 8e01 of the video network obtained after the conversion by the SIP negotiation server is the signaling for sending the dual-stream video for the common representation request in the video network, so that the signaling can be identified by the video network server and the video network terminal, and the video network terminal of the video network protocol can perform communication interaction on the signaling with the internet terminal supporting the SIP protocol, thereby realizing the sending of the dual-stream video.
Step 603, the video network server forwards the video network double-flow request signaling to the video network terminal.
In practice, because the video networking terminal and the internet terminal are in a video conference, the video networking server can perform background service for the video conference, after the dual-stream BFCP signaling sent by the internet terminal is converted into the video networking dual-stream request signaling, a destination address in the video networking dual-stream request signaling can be an MAC address of the video networking server, and a source address can be an MAC address of the internet terminal, the video networking server can determine the video conference in which the internet terminal participates according to the MAC address of the internet terminal, and then can determine the video networking terminal in the video conference, and further can update the destination address of the video networking dual-stream request signaling to the MAC address of the video networking terminal, so as to send the video networking dual-stream request signaling to the video networking terminal.
Step 604, the video networking terminal sends a video networking double-flow reply signaling to the video networking server aiming at the video networking double-flow request signaling.
In practice, after receiving the video networking dual-stream request signaling, the video networking terminal may perform video networking protocol analysis on the video networking dual-stream request signaling, and automatically generate a video networking dual-stream reply signaling according to an analysis result, where the video networking dual-stream reply signaling may use a video networking server as a destination device.
If the video network terminal analyzes the 8e01 signaling to obtain the result of sending the dual-stream video by the internet terminal, the video network dual-stream reply signaling is automatically generated according to the result to indicate that the dual-stream video is agreed to be received.
Step 605, the video networking server starts a dual video stream transmission mode according to the video networking dual stream reply signaling, where the dual video stream transmission mode is used to separately transmit the main video stream and the auxiliary video stream acquired by the internet terminal to the video networking terminal.
In the embodiment of the invention, when receiving a video networking double-stream reply signaling sent by a video networking terminal, a video networking server can control the video networking terminal and an SIP protocol conversion server to start a double-video stream mode so as to establish two video stream transmission channels between the video networking terminal and the Internet terminal, and respectively transmit a main video stream and an auxiliary video stream acquired by the Internet terminal through the two video stream transmission channels. The main video stream and the auxiliary video stream of the internet terminal can be different video streams acquired by a camera of the internet terminal aiming at different objects, for example, the main video stream is a character video acquired by the camera, and the auxiliary video stream is an environment video acquired by another camera. Or the video collected by the internet terminal and the monitoring video played by the internet terminal. For example, the main video stream is a video acquired by a camera of the internet terminal, and the auxiliary video stream is a monitoring video being played or stored by the internet terminal.
In the embodiment of the invention, the SIP protocol conversion server converts the double-flow BFCP signaling sent by the Internet terminal into the video network double-flow signaling which can be analyzed by the video network terminal and the video network server, the video network server can start a double-video-stream transmission mode according to the video network double-flow reply signaling returned by the video network terminal aiming at the video network double-flow signaling, and then successfully establish the transmission basis of the double video streams between the Internet terminal supporting the SIP protocol and the video network terminal through the double-video-stream transmission mode, so as to realize the transmission of the double video streams between the video network terminal and the Internet terminal supporting the SIP protocol.
When the technical scheme is adopted, the following two advantages exist:
firstly, because the video network server and the video network terminal can send signals to the internet terminal supporting the SIP protocol for interaction in the signaling through the signaling conversion of the SIP protocol conversion server, the technical barrier that the equipment in the video network cannot be compatible with the communication protocol of the terminal supporting the SIP protocol in the prior art is broken, and the internet terminal supporting the SIP protocol is not required to be additionally provided with the H323 protocol during the communication of the two parties, so that the user experience is optimized, and the cost is saved.
Secondly, when the H323 protocol performs dual video stream transmission, two video streams are mixed to form one video stream transmission. Therefore, in the existing technical architecture, when the internet terminal supporting the SIP protocol is additionally installed with the H323 protocol, when a dual-stream video is transmitted and when the internet terminal supporting the SIP protocol is installed and transmitted using the H323 protocol, the internet terminal originally supporting the SIP protocol needs to mix two video streams according to the H323 protocol and then transmit the mixed two video streams. Therefore, the memory resource of the internet terminal is occupied, and the problem of low sending efficiency of the double-stream video in the video conference is caused. The embodiment of the invention realizes that the video network server and the video network terminal can send signals to the Internet terminal supporting the SIP protocol for interaction, so that the Internet terminal can directly adopt the SIP protocol for double-stream transmission, the started double-video-stream transmission mode can respectively and independently transmit the main video stream and the auxiliary video stream, the Internet terminal can directly and independently transmit the main video stream and the auxiliary video stream, and the internal memory of the Internet terminal can choose not to carry out secondary screen mixing processing on the main video stream and the auxiliary video stream, thereby reducing the internal memory occupation of the Internet terminal and further improving the sending efficiency of the double-stream video in the video conference.
In combination with the above embodiments, in a possible implementation manner, the method may further include the following steps:
step 605', the video network server sends the video network double-flow reply signaling to the SIP corollary server.
Step 606', the SIP protocol conversion server converts the dual-stream reply signaling of the video network into a dual-stream BFCP reply signaling that can be analyzed by the internet terminal, and sends the dual-stream BFCP reply signaling to the internet terminal.
In this embodiment, after receiving the dual stream reply signaling of the video network, the video network server may further send the dual stream reply signaling of the video network to the SIP protocol server, so that the SIP protocol server finally transmits the content of the dual stream reply signaling of the video network to the internet terminal. Specifically, the SIP protocol conversion server converts the video networking double-flow reply signaling into a double-flow BFCP signaling which accords with the Internet protocol and can be analyzed by the Internet terminal so that the Internet terminal can identify the signaling. The internet terminal knows that the video network terminal agrees to receive the double-stream video, and then sends the main video stream and the auxiliary video stream.
If the dual-stream reply signaling of the video network is 8e01Ack signaling, the SIP negotiation server converts the 8e01Ack signaling into BFCPFloorRequestStatus, and the internet terminal can know that the video network terminal has agreed to receive the dual-stream video according to the BFCPFloorRequestStatus.
And step 607', the internet terminal sends the main video stream and the auxiliary video stream separately based on the SIP protocol for the dual-stream BFCP reply signaling.
In this embodiment, when receiving the dual-stream BFCP reply signaling, the internet terminal may independently send the main video stream and the auxiliary video stream acquired by the internet terminal to the SIP coordination server according to the SIP protocol. Specifically, the internet terminal may send both the main video stream and the auxiliary video stream to the protocol conversion server through independent channels.
By adopting the technical scheme, the SIP protocol conversion server can convert the video network double-flow reply signaling into the double-flow BFCP reply signaling which can be analyzed by the Internet terminal, so that the Internet terminal can start sending the double video streams according to the double-flow BFCP reply signaling, the SIP protocol conversion server can realize the conversion between the double-flow BFCP signaling and the video network double-flow request signaling, and between the video network double-flow reply signaling and the double-flow BFCP reply signaling, namely, the SIP protocol conversion server can realize the conversion from the BFCP message to the video network protocol signaling, and also can realize the conversion from the video network protocol signaling to the BFCP message, thereby widening the functions of the SIP protocol conversion server, and further satisfying the interaction of the signaling between the video network terminal and the Internet terminal supporting the SIP protocol.
With reference to the foregoing embodiment, in a possible implementation manner, step 605 may specifically include the following steps to implement the dual video streaming mode:
step 6051, the video networking server generates a double-stream transmission instruction for the double-stream reply signaling of the video networking, and sends the double-stream transmission instruction to the SIP corotation server and the video networking terminal respectively.
In practice, the dual stream transmission instruction may be used to instruct the video networking terminal to start two codecs to receive the dual video stream, and may be used to instruct the SIP collaboration server to establish two video transmission channels to receive and process the two video streams.
Step 6052, the SIP collaboration server opens two video transmission channels for the dual-stream transmission instruction, and separately receives the main video stream and the auxiliary video stream sent by the internet terminal based on the SIP protocol through the two video transmission channels.
In practice, the SIP protocol conversion server opens two video transmission channels, which may be understood as two video transmission channels established between the internet terminal and the SIP protocol conversion server and between the SIP protocol conversion server and the video network server, wherein one video transmission channel is used for receiving a main video stream sent by the internet terminal, and the other video transmission channel is used for receiving an auxiliary video stream sent by the internet terminal.
Step 6053, the SIP collaboration server converts the primary video stream to a video networking primary video stream conforming to a video networking protocol and converts the secondary video stream to a video networking secondary video stream conforming to a video networking protocol.
In the embodiment of the present invention, the process of converting the main video stream into the main video stream of the video network conforming to the video network protocol at the SIP protocol server may be: the method comprises the steps of extracting bare video data in a main video stream, adding a video networking protocol packet header for the bare video data, and transmitting the bare video data added with the video networking protocol packet header in the video networking so as to reach a video networking terminal. The same may be used for the conversion of the secondary video stream.
Step 6054, the SIP collaboration server sends the video networking main video stream and the video networking auxiliary video stream to the video networking server through the two video transmission channels.
In the embodiment of the invention, the SIP protocol conversion server can send the main video stream of the video network through the channel for sending the main video stream, and can send the auxiliary video stream of the video network through the channel for sending the auxiliary video stream.
Step 6055, the video networking terminal starts two video networking codecs according to the dual-stream transmission instruction, and receives and decodes the video networking main video stream and the video networking auxiliary video stream forwarded by the video networking server through the two video networking codecs respectively.
In this embodiment, the video networking server may also use two video transmission channels to respectively transmit the video networking main video stream and the video networking auxiliary video stream, and the video networking terminal may enable two video networking codecs to respectively receive the video networking main video stream for decoding and the video networking auxiliary video stream. Specifically, one of the video networking codecs decodes the video networking primary video stream, and the other video networking codec decodes the video networking secondary video stream.
In practice, the video networking terminal may update the current display mode in the video conference to a dual-split-screen mode, so as to display the decoded video networking main video stream and the decoded video networking auxiliary video stream on the display screen respectively.
With reference to the foregoing embodiment, in a possible implementation manner, the video conference system further includes a display terminal connected to the internet terminal, and the video conference processing method may further include the following steps:
step 605', the internet terminal obtains the screen video stream collected by the display terminal from the display terminal.
Wherein the screen video stream is generated by the display terminal from the plurality of screen display images captured.
In this embodiment, the internet terminal may be wired to a display terminal via a Video Graphics Array (VGA) interface, where the display terminal may be a computer or a smart phone. Therefore, a monitoring video or a demonstration PPT document can be played in the display terminal, the screen display image of the display terminal can be captured periodically by using desktop capture software of the display terminal to obtain a plurality of screen display pictures, and the plurality of screen display pictures are coded into a screen video stream. The internet terminal can acquire the screen video stream through the auxiliary stream VGA interface, and then can independently send the screen video stream when the dual video stream transmission mode is started.
It should be noted that, in step 601, the dual-stream BFCP signaling may be sent according to the dual-stream operation of the user after the internet terminal is connected to the display terminal.
Correspondingly, step 607' may specifically include the following steps:
and step 607', the Internet terminal respectively sends the auxiliary video stream formed by the screen video stream and the main video stream formed by the monitoring video stream collected by the camera of the Internet terminal based on the SIP protocol.
In this embodiment, the internet terminal can regard screen video stream as the auxiliary video stream, regard the surveillance video stream that self camera was gathered as main video stream, then can send screen video stream and surveillance video stream alone based on SIP protocol, in order to satisfy the video conference of scenes such as some distance education study and emergency dispatch, for example, in distance education study, the user can gather the video that self lectured through internet terminal, show the manuscript demonstration process that the meeting was aimed at for the video conference terminal of video conference through display terminal, and then strengthen the lecture effect of the video conference of distance education study. For another example, in a video conference of emergency scheduling, a user can acquire a commanded video through an internet terminal, and a display screen of a flood levee breaking monitoring video targeted by emergency command is displayed to a video networking terminal of the video conference through a display terminal, so that the effect of the emergency command is enhanced, and members in the video conference can accurately grasp command intentions and strategies.
By adopting the technical scheme, the internet terminal is connected with the display terminal, and the screen video stream of the display terminal can be acquired, so that the screen video stream and the monitoring video stream acquired by the camera of the internet terminal are independently sent to the video networking terminal. Therefore, in the video conference, the internet terminal can play the file or the video on other equipment without influencing the original business operation of the user on the internet terminal, so that the video conference mode is more flexible, and the application scene is more diversified.
With reference to the foregoing embodiment, in a possible implementation manner, a conference configuration document is preset in the SIP collaboration server, where the conference configuration document includes a video networking number, and a binding relationship between an SIP conference address and a conference number.
Specifically, the SIP protocol conversion server may allocate a conference number according to a video conference creation operation of a user, and bind the respective video networking numbers of the plurality of video networking virtual terminals with the conference number, and because the SIP protocol conversion server may communicate with the internet, the SIP protocol conversion server itself has an IP address, and the video conference has an SIP conference address in the internet. For example, the conference number is 132, the video network number of the bound video network virtual terminal is 11155, and the IP address of the SIP protocol server is: 171.214.219.109, the SIP conferencing address may be 132@ 171.214.219.109. Thus, the SIP collaboration server generates a conference profile according to the configuration for the video conference creation, and the conference profile can be understood as conference configuration information set by the SIP collaboration server for the video conference.
Accordingly, prior to step 601, the video networking terminal and the internet terminal may be joined to the video conference by:
step 6001, the video networking terminal generates a video networking conference call signaling according to a video conference call operation performed by a user, and sends the video networking conference call signaling to the SIP coordination server through the video networking server.
In practice, after configuring a conference configuration document for a video conference, the SIP protocol conversion server may notify a video networking terminal user of a video networking number bound to the video conference, and specifically, may notify in a manner of sending conference information including the video networking number to the video networking terminal. The video conference call operation performed by the user may refer to an operation of inputting the video networking number on the video networking terminal by the user, so that the video networking terminal may generate a video networking conference call signaling conforming to a video networking protocol according to the video networking number input by the user, wherein the video networking conference call signaling includes the video networking number.
If the user inputs the video networking number 11155 in the video networking number input field of the video networking terminal and dials the video conference, the video networking terminal generates video networking conference call signaling including 11155 to request the SIP collaboration server to join the video conference with the conference number 132.
And step 6002, the internet terminal generates an internet conference call signaling according to a video conference address input operation performed by a user, and sends the internet conference call signaling to the SIP coordination server.
In practice, after configuring a conference configuration document for a video conference, the SIP collaboration server may also notify an internet terminal user of an SIP conference address bound to the video conference, specifically, may notify the internet terminal user by sending a short message including the SIP conference address. The video conference address input operation performed by the user may specifically be an input operation of an SIP conference address. Therefore, the internet terminal can generate internet conference call signaling according with the internet protocol according to the SIP conference address input by the user so as to request to join the video conference created by the SIP protocol conversion server. Wherein, the internet conference call signaling comprises the SIP conference address.
If the user inputs the SIP conference address 132@171.214.219.109 in the conference address input field of the internet terminal to dial a video conference, the internet terminal generates an internet conference call signaling including 132@171.214.219.109 to request the SIP collaboration server to join the video conference with the conference number 132.
Step 6003, the SIP collaboration server converts the video networking conference call signaling into an SIP conference call signaling based on the conference configuration document, generates a first successful conference entry signaling for the SIP conference call signaling, and sends the first successful conference entry signaling to the video networking terminal through the video networking server through the virtual terminal corresponding to the video networking number. Wherein the first conference successful signaling comprises a conference number bound with the video networking number.
In this embodiment, the SIP protocol conversion server may parse the video conference call signaling, read a video networking number, determine an SIP conference address bound to the video networking number based on the binding relationship included in the conference configuration document, map the video networking conference number as the SIP conference address, add a packet header conforming to an SIP protocol to the SIP conference address, and generate the SIP conference call signaling. Therefore, the SIP protocol conversion server can determine the video conference to be added to the video networking terminal according to the converted SIP call signaling, and further generate a first conference successful signaling which accords with the video networking protocol so as to represent that the video networking terminal is allowed to be added to the video conference. Wherein the first conference successful signaling includes a conference number to inform the video networking terminal which video conference to join.
The SIP protocol conversion server binds the video conference with the video network virtual terminal, the video network virtual terminal corresponds to a calling channel, and the SIP protocol conversion server can return a first conference successful signaling to the video network terminal through the video network virtual terminal. In practice, since the virtual terminal of the video network corresponds to a calling channel, the SIP coordination server may also receive a video network call signaling through the virtual terminal of the video network.
Illustratively, after parsing the video conference call signaling, the SIP protocol translation server reads the video conference number 11155, determines, based on the conference configuration document, that the conference number bound to the 11155 is 132, and the SIP conference address bound to the conference number 132 is 132@171.214.219.109, adds an SIP protocol header to the SIP conference address 132@171.214.219.109, converts the SIP protocol header into the SIP call signaling, and then the SIP protocol translation server can know, according to the SIP call signaling, that the video conference call signaling is to request to add to the video conference with the conference number 132, and then returns a first successful conference signaling 200 ok.
Step 6004, the SIP collaboration server generates a second successful conference entering signaling according to the conference configuration document for the internet conference call signaling, and sends the second successful conference entering signaling to the internet terminal.
In this embodiment, since the internet conference call signaling sent by the internet terminal conforms to the internet protocol and includes the SIP conference address, the SIP negotiation server can analyze the internet conference call signaling, directly read the SIP conference address, determine the video conference to be added by the internet terminal according to the conference configuration document, and then return a second successful conference entry signaling conforming to the internet protocol to the internet terminal. Wherein the second conference successful signaling comprises a conference number bound with the SIP conference address.
Illustratively, if the SIP conference address included in the internet conference call signaling sent by the internet terminal is 132@171.214.219.109, the SIP collaboration server determines that the video conference requested to be joined by the internet terminal is 132 according to the binding relationship in the conference configuration document, and then returns a second conference entering signaling 180Ring to the internet terminal.
And step 6005, the video networking terminal joins the video conference corresponding to the conference number for the first successful conference entering signaling.
And step 6006, the internet terminal joins the video conference corresponding to the conference number for the second successful conference entering signaling.
When the video network terminal receives the first conference successful signaling, the video network terminal can join the video conference, and meanwhile, the conference successful signaling can be returned to the SIP protocol conversion server through the video network server so as to inform the SIP protocol conversion server that the conference is successfully entered. When the internet terminal receives the second conference successful signaling, the internet terminal can join the video conference, and simultaneously, a conference successful signaling can be returned to the SIP protocol conversion server so as to inform the SIP protocol conversion server that the conference is successfully entered.
Correspondingly, after the above steps 6003 and 6004, the SIP negotiation Server may also negotiate BFCP roles of the virtual internet terminal and the internet terminal according to the internet call request signaling sent by the internet terminal and the video internet call request signaling sent by the video internet terminal, where the BFCP roles include BFCP Server and BFCP Client. In the embodiment of the invention, the BFCP role of the virtual terminal of the video network can be a BFCP Server (BFCP Server), and the BFCP role of the internet terminal can be a BFCP Client (BFCP Client). In this way, the virtual terminal of the video network, which serves as a BFCP Server, can listen to the dual-stream BFCP signaling sent by the internet terminal in step 602. The internet terminal serving as the BFCP Client can establish TCP connection with the virtual terminal of the video network to send the double-flow BFCP signaling to the virtual terminal of the video network, and further transmission of the double-flow BFCP signaling between the internet terminal and the SIP protocol conversion server is realized.
By adopting the technical scheme, the video networking terminal and the Internet terminal supporting the SIP protocol are added into the video conference without the limitation brought by the difference of the protocol layers of the two parties, and the application of the video networking is expanded.
With reference to the foregoing embodiment, in a possible implementation manner, the dual-stream BFCP signaling includes signaling data, and step 602 may specifically include the following steps:
step 6021, the SIP protocol server analyzes the signaling data from the dual-flow BFCP signaling.
In this embodiment, the dual-flow BFCP signaling may be a signaling data packet, and the signaling data packet may include signaling data and a protocol header encapsulating the signaling data. The signaling data is a BFCP message, and the protocol header comprises a SIP protocol (application layer protocol) and a TCP/IP protocol (transport layer protocol). The protocol header may indicate a transmission mode of the dual-stream BFCP signaling in the internet, a target device, and an application program of the target device. The signaling data may indicate matters or content in which the signaling is specifically involved.
Illustratively, the signaling data in the dual-stream BFCP signaling is, upon parsing, a bfcplorerrequest.
Step 6022, the SIP protocol conversion server converts the signaling data into video network signaling data which is matched with the signaling data and can be identified by the video network server and the video network terminal.
In the embodiment of the invention, because the video networking protocol is a protocol different from the internet protocol, in order to realize that the double-flow BFCP signaling can be transmitted in the video networking and can be analyzed by a video networking terminal and a video networking server so as to overcome the barrier that the existing video networking protocol and the SIP protocol cannot be communicated, the SIP protocol conversion server can convert the signaling data into the video networking signaling data, specifically, can determine the character strings included in the signaling data, and converts the character strings in the signaling data into the character strings in the video networking according to the pre-defined conversion program so as to obtain the video networking signaling data. Wherein the video networking signaling data is consistent with the item or content characterized by the signaling data.
For example, the signaling data is BFCPFloorrequest, and the characterization to be performed is: requesting to send the dual video streams, converting the loorrequest into sipdouble according to the character string included in the signaling data and a self-defined program, wherein the content to be performed represented by the sipdouble in the video network is as follows: the SIP terminal sends the double video stream.
Step 6023, the SIP protocol conversion server adds a video networking protocol header to the video networking signaling data, and the video networking signaling data added with the video networking protocol header is the video networking dual-flow request signaling.
In this embodiment, after the SIP protocol conversion server obtains the video networking signaling data, a video networking protocol header may be added to the video networking signaling data to obtain a video networking dual-stream request signaling, where the video networking protocol header represents a transmission mode of the video networking dual-stream request signaling in the video networking.
For example, the video network dual stream request signaling obtained by adding the video network protocol header to the sippouble is 8e 01. Then, after obtaining the 8e01, the video network terminal analyzes the 8e01, and can obtain video network signaling data sipouble carried by the video network terminal, and further reply the video network dual-stream reply signaling for the sipouble.
By adopting the technical scheme, the signaling data and the protocol packet header of the double-flow BFCP signaling can be converted into the video networking signaling data and the video networking protocol packet header to obtain the video networking double-flow request signaling, so that the embodiment of the invention overcomes the barrier that the video networking protocol and the SIP protocol cannot be intercommunicated in the existing video networking technical architecture, and further realizes that the video networking terminal and the Internet terminal of the SIP protocol transmit double video streams in a video conference.
In combination with the above embodiments, in an implementation manner, after step 605, the following steps may be further included:
and step s1, the internet terminal sends a dual-flow stopping BFCP signaling to the SIP protocol conversion server according to the dual-flow stopping operation of the user.
And step s2, the SIP protocol conversion server stops the dual-flow BFCP signaling to be converted into a video network stopping dual-flow request signaling which can be identified by the video network server and the video network terminal, and the signaling is sent to the video network terminal through the video network server.
And step s3, the video network terminal sends a video network double-flow stopping reply signaling to the video network server for the video network double-flow stopping request signaling.
And step s4, the video network server closes the dual video streaming mode according to the dual stream reply signaling of the video network stop.
Steps s1 to s4 are similar to steps 601 to 604, respectively, and are not described in detail herein.
And step s5, the SIP cooperative conversion server closes the video transmission channel for the auxiliary video stream in the two video transmission channels based on the operation of closing the dual video stream transmission mode.
And then, the internet terminal stops sending the auxiliary video stream according to the operation of closing the double video stream transmission mode.
Referring to fig. 7 and 8, fig. 7 is a schematic flowchart of a video conference processing method shown in combination with a specific example, and fig. 8 is a schematic diagram of a dual video stream transmission mode in the specific example, in the example, a video networking terminal is an aurora terminal, an internet terminal is te30 (integrated high definition video conference terminal), where te30 has a VGA interface, te30 and the aurora terminal joins an emergency command teleconference, and the aurora terminal is connected with a large-size high definition display screen. The VGA interface of Hua Te30 is connected with a personal computer, a monitoring video of a site where a debris flow disaster happens is played on the personal computer, and commanders pause operation and mark key areas on the monitoring video through a mouse so as to determine an emergency scheme. The method specifically comprises the following steps:
step 701, the user clicks to initiate a double-flow operation, and Huawei te30 sends a double-flow binary floor control protocol BFCP signaling to the SIP corotation server according to the double-flow operation of the user.
Step 702, the SIP protocol conversion server converts the dual-stream BFCP signaling into a dual-stream request signaling of the video network, which can be analyzed by the video network server and the aurora terminal, and sends the dual-stream request signaling of the video network to the video network server.
Step 703, the video network server forwards the video network double-flow request signaling to the aurora terminal.
Step 704, the polar light terminal sends a video network double-stream reply signaling to the video network server according to the video network double-stream request signaling.
Step 705, the video network server starts a dual video stream transmission mode according to the video network dual stream reply signaling, where the dual video stream transmission mode is used to separately and independently transmit the main video stream and the auxiliary video stream collected by huawei te30 to the aurora terminal.
Step 706, acquiring a video stream of a camera of the mobile terminal te30 as a main video stream, acquiring a screen video stream acquired by a personal computer when a monitoring video of a site where a debris flow disaster occurs is played in a player through a VGA interface, and taking the screen video stream as an auxiliary video stream.
Step 706, hua is te30 transmits the main video stream and the auxiliary video stream to the SIP corotation server through the main video channel and the auxiliary video channel, respectively, based on the SIP protocol. The SIP protocol conversion server also receives the main video stream through the main video channel and receives the auxiliary video stream through the auxiliary video channel.
Step 707, the SIP protocol conversion server converts the main video stream into a video network main video stream conforming to the video network protocol, and converts the auxiliary video stream into a video network auxiliary video stream conforming to the video network protocol.
And 708, the SIP protocol conversion server sends the main video stream to the video network server through the main video channel, and sends the auxiliary video stream to the video network server through the auxiliary video channel.
And step 709, the aurora terminal receives the main video stream sent by the video network server through the opened main video network codec, and receives the auxiliary video stream sent by the video network server through the opened auxiliary video network codec. And the decoded main video stream and the decoded auxiliary video stream are respectively played on the connected large-size high-definition display screen, namely the video commanded by the commander and the screen video stream of the monitoring video of the site where the debris flow disaster happens are played on the personal computer. The key area marked by the commander aiming at the site where the debris flow disaster occurs can be obtained through the displayed screen video stream, so that the emergency scheme can be more accurately comprehended.
Referring to fig. 9, a flowchart of steps of another video conference processing method according to an embodiment of the present invention is shown, where the method is applied to an SIP collaboration server in a video conference system, where the video conference system further includes a video networking terminal, a video networking server, and an internet terminal, and specifically includes the following steps:
step 901, receiving a dual-stream binary floor control protocol BFCP signaling sent by the internet terminal, where the dual-stream BFCP signaling is generated by the internet terminal according to a dual-stream operation performed by a user in a currently performed video conference;
step 902, converting the dual-stream BFCP signaling into a video networking dual-stream request signaling capable of being analyzed by the video networking server and the video networking terminal;
step 903, sending the video networking double-flow request signaling to the video networking server; the video networking server is used for forwarding the video networking double-stream request signaling to the video networking terminal, and starting a double-video stream transmission mode according to a video networking double-stream reply signaling returned by the video networking terminal aiming at the video networking double-stream request signaling, wherein the double-video stream mode is used for independently transmitting a main video stream and an auxiliary video stream acquired by the internet terminal to the video networking terminal respectively.
Steps 901 to 903 are similar to steps 601 to 604, and reference may be made to the above steps for relevant points, which are not described in detail herein.
In combination with the above embodiments, in a possible implementation manner, the method may further include the following steps:
step 904, receiving a double-stream transmission instruction sent by the video network server for the double-stream reply signaling; the double-stream transmission instruction is generated by the video networking server according to the double-stream reply signaling, and the double-stream transmission instruction is used for starting the double-video stream transmission mode;
step 905, for the double-stream transmission instruction, opening two video transmission channels, and independently receiving the main video stream and the auxiliary video stream sent by the internet terminal based on the SIP protocol through the two video transmission channels, respectively;
step 906, converting the main video stream into a video networking main video stream conforming to a video networking protocol, and converting the auxiliary video stream into a video networking auxiliary video stream conforming to the video networking protocol;
step 907, sending the video networking main video stream and the video networking auxiliary video stream to the video networking terminal through the video networking server through the two video transmission channels; and the video networking terminal is used for respectively receiving the video networking main video stream and the video networking auxiliary video stream through two video networking codecs.
Steps 904 to 907 are similar to steps 6051 to 6055, and reference may be made to the above steps for relevant points, which are not described in detail herein.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the illustrated order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no particular act is required to implement the invention.
Referring to fig. 10, a video conference processing apparatus according to an embodiment of the present invention is shown, where the apparatus is applied to a video conference system, where the video conference system includes a video network terminal, a video network server, an SIP collaboration server, and an internet terminal, and the apparatus may specifically include the following modules:
a double-stream BECP signaling sending module 1001, configured to send a double-stream binary floor control protocol BFCP signaling to the SIP negotiation server according to a double-stream operation performed by a user in a currently performed video conference;
a signaling conversion module 1002, configured to convert the dual-stream BFCP signaling into a video networking dual-stream request signaling that can be analyzed by the video networking server and the video networking terminal, and send the video networking dual-stream request signaling to the video networking server;
a signaling forwarding module 1003, configured to forward the dual-stream request signaling of the video networking to the video networking terminal;
a reply signaling module 1004, configured to send a video networking dual-stream reply signaling to the video networking server according to the video networking dual-stream request signaling;
a dual video stream transmission mode starting module 905, configured to start a dual video stream transmission mode according to the video networking dual stream reply signaling, where the dual video stream transmission mode is used to separately and independently transmit a main video stream and an auxiliary video stream acquired by the internet terminal to the video networking terminal.
In combination with the above embodiment, in an optional implementation manner, the apparatus may further include the following modules:
the first signaling forwarding module is used for sending the video networking double-flow reply signaling to the SIP protocol forwarding server;
the first signaling conversion module is used for converting the video networking double-flow reply signaling into a double-flow BFCP reply signaling which can be analyzed by the Internet terminal and sending the double-flow BFCP reply signaling to the Internet terminal;
and the video stream sending module is used for independently sending the main video stream and the auxiliary video stream respectively based on an SIP (session initiation protocol) for the double-stream BFCP reply signaling.
In combination with the above embodiment, in an optional implementation manner, the dual video streaming mode starting module may include the following modules:
the double-current transmission instruction module is used for responding to a double-current reply signaling of the video network, generating a double-current transmission instruction and respectively sending the double-current transmission instruction to the SIP protocol conversion server and the video network terminal;
the double-stream video receiving module is used for starting two video transmission channels according to the double-stream transmission instruction, and respectively and independently receiving the main video stream and the auxiliary video stream which are sent by the internet terminal based on the SIP protocol through the two video transmission channels;
the double-current video conversion module is used for converting the main video stream into a video networking main video stream conforming to a video networking protocol and converting the auxiliary video stream into a video networking auxiliary video stream conforming to the video networking protocol;
the double-stream video sending module is used for sending the video network main video stream and the video network auxiliary video stream to the video network server through the two video transmission channels;
and the double-stream video playing module is used for starting two video networking codecs according to the double-stream transmission instruction, and respectively receiving and decoding the video networking main video stream and the video networking auxiliary video stream forwarded by the video networking server through the two video networking codecs.
With reference to the foregoing embodiment, in an optional implementation manner, the video conference system further includes a display terminal connected to the internet terminal, and the apparatus may further include the following modules:
the screen video stream acquisition module is used for acquiring the screen video stream acquired by the display terminal from the display terminal; wherein the screen video stream is generated by the display terminal from the plurality of captured screen display images:
the video stream sending module is specifically configured to send, based on an SIP protocol, an auxiliary video stream formed by the screen video stream and a main video stream formed by a monitoring video stream acquired by a camera of the video stream sending module.
With reference to the foregoing embodiment, in an optional implementation manner, a conference configuration document is preset in the SIP coordination server, where the conference configuration document includes a video networking number, a binding relationship between an SIP conference address and a conference number; the apparatus may further include the following modules:
the first calling module is used for generating a video networking conference calling signaling according to the video conference calling operation performed by a user and sending the video networking conference calling signaling to the SIP protocol conversion server through the video networking server; wherein the video networking conference call signaling comprises the video networking number;
the second calling module is used for generating an internet conference calling signaling according to the video conference address input operation performed by the user and sending the internet conference calling signaling to the SIP protocol conversion server; wherein, the internet conference call signaling comprises the SIP conference address;
the first call return module is used for converting the video networking conference call signaling into SIP conference call signaling based on the conference configuration document, generating a first successful conference call signaling aiming at the SIP conference call signaling, and sending the first successful conference call signaling to the video networking terminal through a virtual terminal corresponding to the video networking number via the video networking server; wherein the first conference successful signaling comprises a conference number bound with the video networking number;
a second call return module, configured to generate a second successful conference entering signaling according to the conference configuration document for the internet conference call signaling, and send the second successful conference entering signaling to the internet terminal; the second conference successful signaling comprises a conference number bound with the SIP conference address;
a first joining module, configured to join the video conference corresponding to the conference number according to the first conference successful signaling;
and the second joining module is used for joining the video conference corresponding to the conference number aiming at the second conference successful entering signaling.
With reference to the foregoing embodiment, in an optional implementation manner, the signaling conversion module may specifically include the following modules:
the signaling data analyzing module is used for analyzing the signaling data from the double-flow BFCP signaling;
the signaling data conversion module is used for converting the signaling data into video networking signaling data which is matched with the signaling data and can be identified by the video networking server and the video networking terminal;
and the request signaling generation module is used for adding a video networking protocol header to the video networking signaling data, and the video networking signaling data added with the video networking protocol header is the video networking double-flow request signaling.
Referring to fig. 11, another video conference processing apparatus according to an embodiment of the present invention is shown, where the apparatus is applied to an SIP collaboration server in a video conference system, where the video conference system further includes a video network terminal, a video network server, and an internet terminal, and the apparatus may specifically include the following modules:
a signaling receiving module 1110, configured to receive a dual-stream binary floor control protocol BFCP signaling sent by the internet terminal, where the dual-stream BFCP signaling is generated by the internet terminal according to a dual-stream operation performed by a user in a currently performed video conference;
a signaling conversion module 1120, configured to convert the dual-stream BFCP signaling into a dual-stream request signaling for the video network, which can be analyzed by the video network server and the video network terminal;
a signaling sending module 1130, configured to send the dual-stream request signaling of the video networking to the video networking server; the video networking server is used for forwarding the video networking double-stream request signaling to the video networking terminal, and starting a double-video stream transmission mode according to a video networking double-stream reply signaling returned by the video networking terminal aiming at the video networking double-stream request signaling, wherein the double-video stream mode is used for independently transmitting a main video stream and an auxiliary video stream acquired by the internet terminal to the video networking terminal respectively.
In combination with the above embodiment, in an optional implementation manner, the apparatus may further include the following modules:
a double-stream transmission instruction receiving module, configured to receive a double-stream transmission instruction sent by the video networking server for the double-stream reply signaling; the double-stream transmission instruction is generated by the video networking server according to the double-stream reply signaling, and the double-stream transmission instruction is used for starting the double-video stream transmission mode;
the video transmission channel starting module is used for starting two video transmission channels according to the double-stream transmission instruction and respectively and independently receiving the main video stream and the auxiliary video stream which are sent by the internet terminal based on the SIP protocol through the two video transmission channels;
the double-video-stream receiving module is used for converting the main video stream into a video networking main video stream conforming to a video networking protocol and converting the auxiliary video stream into a video networking auxiliary video stream conforming to the video networking protocol;
the dual video stream sending module is used for sending the video networking main video stream and the video networking auxiliary video stream to the video networking terminal through the video networking server through the two video transmission channels; and the video networking terminal is used for respectively receiving the video networking main video stream and the video networking auxiliary video stream through two video networking codecs.
For the embodiment of the video conference processing apparatus, since it is basically similar to the embodiment of the video conference processing method, the description is relatively simple, and for relevant points, reference may be made to part of the description of the embodiment of the video conference processing method.
An embodiment of the present invention further provides an apparatus, including: one or more processors; and one or more machine readable media having instructions stored thereon that, when executed by the one or more processors, cause the apparatus to perform one or more video conference processing methods according to embodiments of the invention.
Embodiments of the present invention further provide a computer-readable storage medium, where a stored computer program causes a processor to execute the video conference processing method according to the embodiments of the present invention.
The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The video conference processing method, the video conference processing device and the computer-readable storage medium provided by the present invention are described in detail above, and specific examples are applied in the present document to explain the principle and the implementation of the present invention, and the description of the above embodiments is only used to help understanding the method and the core idea of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (8)

1. A video conference processing method is applied to a video conference system, the video conference system comprises an internet terminal, a Session Initiation Protocol (SIP) protocol conversion server, a video network server and a video network terminal, and the method comprises the following steps:
the internet terminal sends a double-flow binary speech right control protocol BFCP signaling to the SIP protocol conversion server according to the double-flow operation of the user in the current video conference;
the SIP protocol conversion server converts the double-flow BFCP signaling into a video networking double-flow request signaling which can be analyzed by the video networking server and the video networking terminal, and sends the video networking double-flow request signaling to the video networking server;
the video networking server forwards the video networking double-flow request signaling to the video networking terminal;
the video networking terminal sends a video networking double-flow reply signaling to the video networking server aiming at the video networking double-flow request signaling;
the video networking server starts a dual-video stream transmission mode according to the video networking dual-stream reply signaling, wherein the dual-video stream transmission mode is used for independently transmitting a main video stream and an auxiliary video stream which are acquired by the Internet terminal to the video networking terminal respectively;
wherein, the video network server starts a dual video stream transmission mode according to the video network dual stream reply signaling, and the method comprises the following steps:
the video network server generates a double-stream transmission instruction aiming at a video network double-stream reply signaling, and sends the double-stream transmission instruction to the SIP protocol server and the video network terminal respectively;
the SIP protocol conversion server starts two video transmission channels according to the double-stream transmission instruction, and respectively and independently receives the main video stream and the auxiliary video stream which are sent by the Internet terminal based on the SIP protocol through the two video transmission channels;
the SIP protocol conversion server converts the main video stream into a video network main video stream conforming to a video network protocol, and converts the auxiliary video stream into a video network auxiliary video stream conforming to the video network protocol;
the SIP protocol conversion server sends the video network main video stream and the video network auxiliary video stream to the video network server through the two video transmission channels;
and the video networking terminal starts two video networking codecs according to the double-stream transmission instruction, and respectively receives and decodes the video networking main video stream and the video networking auxiliary video stream forwarded by the video networking server through the two video networking codecs.
2. The method of claim 1, further comprising:
the video networking server sends the video networking double-flow reply signaling to the SIP protocol conversion server;
the SIP protocol conversion server converts the video networking double-flow reply signaling into a double-flow BFCP reply signaling which can be analyzed by the Internet terminal and sends the double-flow BFCP reply signaling to the Internet terminal;
and the internet terminal respectively and independently sends the main video stream and the auxiliary video stream based on an SIP (session initiation protocol) aiming at the double-flow BFCP reply signaling.
3. The method of claim 2, wherein the video conference system further comprises a display terminal connected to the internet terminal, the method further comprising:
the internet terminal acquires a screen video stream collected by the display terminal from the display terminal; wherein the screen video stream is generated by the display terminal from the plurality of captured screen display images:
the internet terminal sends the main video stream and the auxiliary video stream respectively based on an SIP protocol aiming at the double-flow BFCP reply signaling, and the method comprises the following steps:
and the Internet terminal respectively sends an auxiliary video stream formed by the screen video stream and a main video stream formed by the monitoring video stream collected by a camera of the Internet terminal based on an SIP protocol.
4. The method according to claim 1, wherein a conference profile is preset in the SIP collaboration server, and the conference profile comprises a video networking number, a binding relationship between a SIP conference address and a conference number; before the internet terminal sends a dual-stream binary floor control protocol BFCP signaling to the SIP negotiation server according to a dual-stream operation performed by a user in a currently performed video conference, the method further includes:
the video networking terminal generates a video networking conference call signaling according to the video conference call operation performed by the user, and sends the video networking conference call signaling to the SIP protocol conversion server through the video networking server; wherein the video networking conference call signaling comprises the video networking number;
the Internet terminal generates an Internet conference call signaling according to the video conference address input operation performed by the user and sends the Internet conference call signaling to the SIP protocol conversion server; wherein, the internet conference call signaling comprises the SIP conference address;
the SIP protocol conversion server converts the video networking conference call signaling into SIP conference call signaling based on the conference configuration document, generates a first successful conference call signaling aiming at the SIP conference call signaling, and sends the first successful conference call signaling to the video networking terminal through a virtual terminal corresponding to the video networking number via the video networking server; wherein the first conference successful signaling comprises a conference number bound with the video networking number;
the SIP protocol server generates a second successful conference entering signaling according to the conference configuration document aiming at the internet conference call signaling, and sends the second successful conference entering signaling to the internet terminal; wherein the second conference successful signaling comprises a conference number bound with the SIP conference address;
the video networking terminal joins the video conference corresponding to the conference number aiming at the first conference successful entering signaling;
and the Internet terminal joins the video conference corresponding to the conference number aiming at the second successful conference joining signaling.
5. The method of claim 1, wherein the dual-stream BFCP signaling includes signaling data, and wherein the SIP protocol conversion server converts the dual-stream BFCP signaling into video networking dual-stream request signaling that can be parsed by the video networking server and the video networking terminal, comprising:
the SIP protocol conversion server analyzes the signaling data from the double-flow BFCP signaling;
the SIP protocol conversion server converts the signaling data into video networking signaling data which is matched with the signaling data and can be identified by the video networking server and the video networking terminal;
and the SIP protocol conversion server adds a video networking protocol head to the video networking signaling data, and the video networking signaling data added with the video networking protocol head is the video networking double-flow request signaling.
6. A video conference processing method is applied to an SIP protocol conversion server in a video conference system, the video conference system further comprises a video network terminal, a video network server and an Internet terminal, and the method comprises the following steps:
receiving a double-flow binary speech right control protocol (BFCP) signaling sent by the Internet terminal, wherein the double-flow BFCP signaling is generated by the Internet terminal according to double-flow operation of a user in a currently-performed video conference;
converting the double-flow BFCP signaling into a video networking double-flow request signaling which can be analyzed by the video networking server and the video networking terminal;
sending the video networking double-flow request signaling to the video networking server; the video networking server is used for forwarding the video networking double-stream request signaling to the video networking terminal, and starting a double-video stream transmission mode according to a video networking double-stream reply signaling returned by the video networking terminal aiming at the video networking double-stream request signaling, wherein the double-video stream mode is used for independently transmitting a main video stream and an auxiliary video stream acquired by the internet terminal to the video networking terminal respectively;
wherein the method further comprises:
receiving a double-stream transmission instruction sent by the video network server aiming at the double-stream reply signaling; the double-stream transmission instruction is generated by the video networking server according to the double-stream reply signaling, and the double-stream transmission instruction is used for starting the double-video stream transmission mode;
starting two video transmission channels aiming at the double-stream transmission instruction, and respectively and independently receiving the main video stream and the auxiliary video stream which are sent by the internet terminal based on the SIP protocol through the two video transmission channels;
converting the main video stream into a video networking main video stream conforming to a video networking protocol, and converting the auxiliary video stream into a video networking auxiliary video stream conforming to the video networking protocol;
sending the video networking main video stream and the video networking auxiliary video stream to the video networking terminal through the video networking server through the two video transmission channels; and the video networking terminal is used for respectively receiving the video networking main video stream and the video networking auxiliary video stream through two video networking codecs.
7. The video conference processing device is applied to a video conference system, the video conference system comprises a video network terminal, a video network server, an SIP protocol conversion server and an Internet terminal, and the device comprises:
a double-flow BECP signaling sending module, configured to send a double-flow binary floor control protocol BFCP signaling to the SIP protocol server according to a double-flow operation performed by a user in a currently performed video conference;
the signaling conversion module is used for converting the double-flow BFCP signaling into a video networking double-flow request signaling which can be analyzed by the video networking server and the video networking terminal, and sending the video networking double-flow request signaling to the video networking server;
the signaling forwarding module is used for forwarding the video networking double-flow request signaling to the video networking terminal;
the reply signaling module is used for sending a video networking double-flow reply signaling to the video networking server aiming at the video networking double-flow request signaling;
a dual video stream transmission mode starting module, configured to start a dual video stream transmission mode according to the video networking dual stream reply signaling, where the dual video stream transmission mode is used to separately and independently transmit a main video stream and an auxiliary video stream acquired by the internet terminal to the video networking terminal;
wherein, the dual video stream transmission mode starting module comprises:
a double-stream transmission instruction generation submodule, configured to generate a double-stream transmission instruction for a double-stream reply signaling of the video networking, and send the double-stream transmission instruction to the SIP corotation server and the video networking terminal, respectively;
the channel opening submodule is used for opening two paths of video transmission channels aiming at the double-stream transmission instruction and respectively and independently receiving the main video stream and the auxiliary video stream which are sent by the internet terminal based on the SIP protocol through the two paths of video transmission channels;
the protocol conversion submodule is used for converting the main video stream into a video networking main video stream conforming to a video networking protocol and converting the auxiliary video stream into a video networking auxiliary video stream conforming to the video networking protocol;
the sending submodule is used for sending the video network main video stream and the video network auxiliary video stream to the video network server through the two video transmission channels;
and the video networking codec starting submodule is used for starting two video networking codecs according to the double-stream transmission instruction, and respectively receiving and decoding the video networking main video stream and the video networking auxiliary video stream forwarded by the video networking server through the two video networking codecs.
8. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1-5 or 6.
CN201910770400.3A 2019-08-20 2019-08-20 Video conference processing method, device and readable storage medium Active CN110475094B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910770400.3A CN110475094B (en) 2019-08-20 2019-08-20 Video conference processing method, device and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910770400.3A CN110475094B (en) 2019-08-20 2019-08-20 Video conference processing method, device and readable storage medium

Publications (2)

Publication Number Publication Date
CN110475094A CN110475094A (en) 2019-11-19
CN110475094B true CN110475094B (en) 2021-07-20

Family

ID=68512987

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910770400.3A Active CN110475094B (en) 2019-08-20 2019-08-20 Video conference processing method, device and readable storage medium

Country Status (1)

Country Link
CN (1) CN110475094B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111131251B (en) * 2019-12-24 2022-12-23 杭州视联动力技术有限公司 Video networking-based conference data transmission method, system, equipment and medium
CN111541857B (en) * 2020-04-01 2024-12-31 视联动力信息技术股份有限公司 A membership method, device, electronic device and storage medium
CN111614927B (en) * 2020-04-09 2024-11-26 视联动力信息技术股份有限公司 Video session establishment method, device, electronic device and storage medium
CN113163231B (en) * 2021-03-10 2024-03-15 视联动力信息技术股份有限公司 Data sharing method, device, server and media
CN113271425A (en) * 2021-04-19 2021-08-17 瑞芯微电子股份有限公司 Interaction system and method based on virtual equipment
CN113347391B (en) * 2021-05-31 2022-12-06 北京字跳网络技术有限公司 Data transmission method, data transmission interruption method and device
CN114189649A (en) * 2021-11-17 2022-03-15 海南乾唐视联信息技术有限公司 Video conference live broadcasting method and device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2737690B2 (en) * 1995-04-14 1998-04-08 日本電気株式会社 Video communication system
CN108881792A (en) * 2017-11-23 2018-11-23 北京视联动力国际信息技术有限公司 A kind of method and apparatus of double fluid communication
CN110062191B (en) * 2019-04-17 2021-03-02 视联动力信息技术股份有限公司 A method and server for multi-party group meeting based on Internet of Views
CN110072115A (en) * 2019-04-17 2019-07-30 视联动力信息技术股份有限公司 Data processing method, device and storage medium
CN110121059B (en) * 2019-05-08 2021-06-18 视联动力信息技术股份有限公司 Monitoring video processing method, device and storage medium

Also Published As

Publication number Publication date
CN110475094A (en) 2019-11-19

Similar Documents

Publication Publication Date Title
CN110475094B (en) Video conference processing method, device and readable storage medium
CN108965776B (en) A kind of communication means and communication system
CN110062191B (en) A method and server for multi-party group meeting based on Internet of Views
CN109640028B (en) Method and device for carrying out conference combining on multiple video networking terminals and multiple Internet terminals
CN110049271B (en) Video networking conference information display method and device
CN110166728B (en) Video networking conference opening method and device
CN110493554B (en) Method and system for switching speaking terminal
CN108462589B (en) A kind of method for connecting network and device of terminal
CN109120879B (en) Video conference processing method and system
CN110049273B (en) Video networking-based conference recording method and transfer server
CN110138728B (en) Video data sharing method and device
CN111131754A (en) Control split screen method and device of conference management system
CN110191304B (en) Data processing method, device and storage medium
CN109587433B (en) Point adjusting method and point adjusting device
CN109862307B (en) Video conference initiating method and device
CN111131743A (en) Video call method and device based on browser, electronic equipment and storage medium
CN111614927B (en) Video session establishment method, device, electronic device and storage medium
CN111541857B (en) A membership method, device, electronic device and storage medium
CN113194278A (en) Conference control method and device and computer readable storage medium
CN110830750A (en) Data transmission method and device based on video network
CN112751849A (en) Conference control method, device, equipment and medium based on video networking
CN110392275B (en) A method and device for sharing document presentation, and a video network soft terminal
CN110149305B (en) A method and transfer server for multi-party playing audio and video based on video networking
CN110430385B (en) Video conference processing method, device and storage medium
CN111641800A (en) Method and device for realizing conference

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address

Address after: 33rd Floor, No.1 Huasheng Road, Yuzhong District, Chongqing 400013

Patentee after: VISIONVERA INFORMATION TECHNOLOGY Co.,Ltd.

Country or region after: China

Address before: 100000 Beijing Dongcheng District Qinglong Hutong 1 Song Hua Building A1103-1113

Patentee before: VISIONVERA INFORMATION TECHNOLOGY Co.,Ltd.

Country or region before: China

CP03 Change of name, title or address