WO2018000953A1 - Audio and video processing method, apparatus and microphone - Google Patents

Audio and video processing method, apparatus and microphone

Info

Publication number
WO2018000953A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
audio
microphone
channels
channel
Prior art date
Application number
PCT/CN2017/083816
Other languages
French (fr)
Chinese (zh)
Inventor
丁鹏
李靖
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2018000953A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems

Definitions

  • the present invention relates to the field of communications, and in particular to an audio and video processing method, apparatus, and microphone.
  • A split-screen device is a simple input device that outputs one selected channel to a display device.
  • In a conference television system, however, there is more than one input source: multiple channels must be synthesized and must also be selectable. A traditional split-screen device cannot accept a mobile device or a data file as an input source. A conference television terminal also has many input devices; in addition to video sources there are audio sources. No product on the market combines audio, video, and multi-channel video.
  • The video source can be a mobile device, a computer, a data source, and so on.
  • The auxiliary stream of a traditional conference television system can be connected to only one channel. When many people take part in a discussion and several of them need access, switching the auxiliary stream is very troublesome.
  • No effective solution has yet been proposed for the problem that a conventional video access device in the related art cannot meet requirements because its input source interfaces are limited.
  • the embodiment of the invention provides an audio and video processing method, device and a microphone, so as to at least solve the problem that the conventional video access device in the related art cannot meet the requirement due to the limited input source interface.
  • An audio and video processing method is provided, including: a microphone receiving one or more channels of audio and video; the microphone synthesizing the one or more channels of video into one channel of video, and encoding the one channel of audio or an audio channel selected from the multiple channels of audio; and the microphone transmitting the synthesized video and the encoded audio to a video and audio device.
  • The method further includes: the microphone broadcasting its audio and video access capability through a universal protocol, where the universal protocol includes Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display.
  • The microphone receiving one or more channels of audio and video includes: receiving the one or more channels of audio and video through a physical port, a wireless local area network (WLAN), Bluetooth, or near field communication (NFC).
  • The method further includes: the microphone decoding the received one or more channels of video, and encoding the decoded one or more channels of video according to an encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, Moving Picture Experts Group (MPEG), MP4, VP8, and VP9.
  • The microphone synthesizing the one or more channels of video into one channel of video includes: the microphone receiving information on the selected input source and the synthesis mode sent by the video and audio device; and, according to that information, selecting the corresponding one or more channels of video and the corresponding synthesis mode, and combining the selected one or more channels of video into one channel of video.
  • The synthesis mode includes one of the following: a font-shaped layout and a left-right symmetric layout.
  • The method further includes: the microphone selecting, under the control of the video and audio device, the video to be played from the one or more channels of video.
  • An audio and video processing apparatus is also provided, including: a receiving module configured to receive one or more channels of audio and video; a synthesizing module configured to synthesize the one or more channels of video into one channel of video and to encode the one channel of audio or an audio channel selected from multiple channels of audio; and a sending module configured to send the synthesized video and the encoded audio to the video and audio device.
  • The apparatus further includes: a broadcast module configured to broadcast audio and video access capability through a universal protocol, where the universal protocol includes Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display.
  • The receiving module includes: a receiving unit configured to receive the one or more channels of audio and video through a physical port, a Wi-Fi wireless local area network, Bluetooth, or near field communication (NFC).
  • The apparatus further includes: a decoding module configured to decode the received one or more channels of video; and an encoding module configured to encode the decoded one or more channels of video according to an encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, MPEG, MP4, VP8, and VP9.
  • a microphone is also provided, including the above device.
  • a computer storage medium is further provided, and the computer storage medium may store an execution instruction for performing the implementation of the audio and video processing method in the foregoing embodiment.
  • The microphone receives one or more channels of audio and video; the microphone synthesizes the one or more channels of video into one channel of video and encodes the one channel of audio or an audio channel selected from the multiple channels of audio; and the microphone sends the synthesized video and the encoded audio to the video and audio device. This solves the problem that a traditional video access device in the related art cannot meet requirements because its input source interfaces are limited, and it makes collaborative interaction more convenient.
  • FIG. 1 is a flowchart of an audio and video processing method according to an embodiment of the present invention.
  • FIG. 2 is a block diagram of an audio and video processing apparatus according to an embodiment of the present invention.
  • FIG. 3 is a first block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention.
  • FIG. 4 is a second block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention.
  • FIG. 5 is a structural block diagram of a new type of microphone according to a preferred embodiment of the present invention.
  • FIG. 6 is a first schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 7 is a second schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 8 is a third schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 9 is a fourth schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 10 is a fifth schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 11 is a sixth schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 12 is a seventh schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 13 is an eighth schematic diagram of an audio and video access process according to a preferred embodiment of the present invention.
  • FIG. 1 is a flowchart of an audio and video processing method according to an embodiment of the present invention. As shown in FIG. 1, the process includes the following steps:
  • Step S102: the microphone receives one or more channels of audio and video;
  • Step S104: the microphone synthesizes the one or more channels of video into one channel of video, and encodes the one channel of audio or an audio channel selected from the multiple channels of audio;
  • Step S106: the microphone sends the synthesized video and the encoded audio to the video and audio device.
  • Through the above steps, the microphone receives one or more channels of audio and video; the microphone combines the one or more channels of video into one channel of video and encodes the one channel of audio or an audio channel selected from the multiple channels of audio, where the audio to be encoded is selected from the one or more channels of audio; and the microphone sends the synthesized video and the encoded audio to the video and audio device. This solves the problem that a traditional video access device in the related art cannot meet requirements because its input source interfaces are limited, and it makes collaborative interaction more convenient.
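  • Steps S102 to S106 can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Source` structure, the function names, and the use of strings as stand-ins for video frames and audio buffers are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Source:
    """One accessed input channel (hypothetical structure, not from the patent)."""
    name: str
    video: Optional[str] = None  # stand-in for a decoded video frame
    audio: Optional[str] = None  # stand-in for an audio buffer


def synthesize_video(sources: List[Source]) -> str:
    """S104, video part: combine one or more video channels into one channel."""
    frames = [s.video for s in sources if s.video is not None]
    return "+".join(frames)


def encode_audio(sources: List[Source], selected: Optional[str] = None) -> str:
    """S104, audio part: encode the only audio channel, or the selected one."""
    with_audio = [s for s in sources if s.audio is not None]
    if not with_audio:
        raise ValueError("no audio channel available")
    if selected is None:
        chosen = with_audio[0]
    else:
        chosen = next((s for s in with_audio if s.name == selected), None)
        if chosen is None:
            raise ValueError(f"unknown audio source: {selected}")
    return f"enc({chosen.audio})"


def process(sources: List[Source], selected_audio: Optional[str] = None) -> Tuple[str, str]:
    """S102 -> S104 -> S106: return what would be sent to the video and audio device."""
    return synthesize_video(sources), encode_audio(sources, selected_audio)
```

  • With two accessed sources, `process(sources, "B")` would return the combined video together with the encoded audio of source B only, which mirrors the "one channel of video plus one selected audio" output described above.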
  • the microphone broadcasts audio and video access capabilities through a universal protocol before receiving one or more audio and video.
  • The universal protocol includes Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display. It should be noted that the universal protocol is not limited to these protocols.
  • The microphone receiving one or more channels of audio and video may include: the microphone receiving the one or more channels of audio and video through a physical port, a Wi-Fi wireless local area network, Bluetooth, or near field communication (NFC).
  • The microphone decodes the received one or more channels of video, and then encodes the decoded one or more channels of video according to the encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, MPEG, MP4, VP8, VP9, and the like.
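  • The text does not say how the encoding format is negotiated. One simple possibility, shown here purely as an assumption, is for the microphone to pick the first format in its own preference list that the video and audio device also supports. The preference order and the function name `negotiate_format` are hypothetical.

```python
from typing import Optional, Sequence

# Formats named in the text; the preference order is an assumption for this sketch.
MIC_FORMATS = ("H265", "H264", "H263", "MPEG", "MP4", "VP9", "VP8")


def negotiate_format(
    device_formats: Sequence[str],
    mic_formats: Sequence[str] = MIC_FORMATS,
) -> Optional[str]:
    """Return the first microphone-preferred format the device supports, else None."""
    device = {fmt.upper() for fmt in device_formats}
    for fmt in mic_formats:
        if fmt in device:
            return fmt
    return None
```

  • Returning `None` when there is no common format leaves the caller free to decide what to do, for example to refuse the connection or to fall back to a default.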
  • Synthesizing the one or more channels of video into one channel of video may include: the microphone receiving information on the selected input source and the synthesis mode sent by the video and audio device; selecting the corresponding one or more channels of video according to that information; and combining the selected one or more channels of video into one channel of video using the corresponding synthesis mode.
  • The above synthesis mode includes one of the following: a font-shaped layout and a left-right symmetric layout. It should be noted that the synthesis mode is not limited to these two.
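  • As an illustration of a left-right symmetric layout, the sketch below divides an output canvas into equal-width columns, one per selected video channel. The function name `side_by_side`, the rectangle convention (x, y, width, height), and the canvas sizes are assumptions for the example; the patent does not specify how tiles are computed.

```python
from typing import List, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def side_by_side(n: int, width: int, height: int) -> List[Rect]:
    """Split a width x height canvas into n full-height columns, left to right."""
    if n < 1:
        raise ValueError("at least one video channel is required")
    # Integer column edges; any rounding remainder is absorbed by individual columns.
    edges = [i * width // n for i in range(n + 1)]
    return [(edges[i], 0, edges[i + 1] - edges[i], height) for i in range(n)]
```

  • With a single selected channel the function returns one rectangle covering the whole canvas, which matches the case in which only one video source is selected.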
  • Under the control of the video and audio device, the microphone can select the video to be played from the one or more channels of video, synthesize the selected video, and transmit the synthesized video to the video and audio device for playing.
  • FIG. 2 is a block diagram of an audio and video processing apparatus according to an embodiment of the present invention. As shown in FIG. 2, the apparatus includes:
  • the receiving module 22 is configured to receive one or more audio and video
  • the synthesizing module 24 is configured to combine the one or more channels of video into one channel of video, and encode one channel of audio or audio selected from the plurality of channels of audio;
  • the sending module 26 is configured to send the combined video and the encoded audio to the audio and video device.
  • FIG. 3 is a first block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention. As shown in FIG. 3, the apparatus further includes:
  • the broadcast module 32, configured to broadcast audio and video access capability through a universal protocol, where the universal protocol includes Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display.
  • The receiving module includes: a receiving unit configured to receive the one or more channels of audio and video through a physical port, a Wi-Fi wireless local area network, Bluetooth, or near field communication (NFC).
  • FIG. 4 is a second block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention. As shown in FIG. 4, the apparatus further includes:
  • the decoding module 42, configured to decode the received one or more channels of video;
  • the encoding module 44, configured to encode the decoded one or more channels of video according to an encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, MPEG, MP4, VP8, VP9, and the like.
  • Embodiments of the present invention also provide a microphone including the above device.
  • Embodiments of the present invention also provide a storage medium.
  • the storage medium may be configured to store program code set to perform the following steps:
  • Step S1 the microphone receives one or more audio and video
  • Step S2 the microphone synthesizes one or more channels of video into one channel of video, and encodes one channel of audio or audio selected from the plurality of channels of audio;
  • step S3 the microphone sends the synthesized video and the encoded audio to the audio and video device.
  • The foregoing storage medium may include, but is not limited to, a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, and a magnetic memory.
  • the processor performs the above steps S1, S2 and S3 according to the stored program code in the storage medium.
  • The embodiment of the invention centers on an audio collection device such as a microphone and adds video access to the microphone in order to overcome the drawbacks of the current video and audio field. The technical solution is as follows:
  • FIG. 5 is a structural block diagram of a novel microphone according to a preferred embodiment of the present invention. As shown in FIG. 5, the following modules are mainly included:
  • The capability notification module 52 is configured to announce that the microphone can be accessed externally, so that external sources can connect to the device.
  • The video capture module 54 is configured to collect video data that arrives through physical access, for example through common physical interfaces such as VGA, HDMI, or DVI.
  • The audio collection module 56 is the module with which the microphone captures sound.
  • The data receiving module 58 handles video and audio data that arrives through non-physical interfaces, as opposed to audio and video input through physical interfaces.
  • The received data includes data carried over wireless interconnection protocols such as Wi-Fi, Miracast, Wi-Fi Display, AirPlay, and DLNA, as well as other data, such as audio, delivered over NFC or Bluetooth.
  • The media negotiation module 510 is responsible for negotiating with the remote device the media capabilities to be used between the two parties.
  • The media processing module 512 is configured to process the collected and received video and audio data, including superimposing or synthesizing the video data from multiple accesses and generating data in the corresponding format through compression encoding.
  • The media sending module 514 is configured to send the superimposed or synthesized data to an external video and audio device, such as a conference television terminal, as needed. The superimposed or synthesized data may be one of the channels accessed by the system, all of the accessed channels, or several of them superimposed or synthesized as needed.
  • The input source control module 516 is configured to receive control signaling sent by the video and audio device. Based on that signaling, it selects which of the video and audio sources collected by the new microphone to use and which synthesis mode to apply before the result is sent to the video and audio device.
  • The method by which the new type of microphone supports video input and output includes the following. The new microphone announces its video and audio access capability through the capability notification module 52 using a universal protocol. Universal protocols include, but are not limited to, DLNA, AirPlay, and Wi-Fi Display.
  • The communication carrier includes, but is not limited to, Wi-Fi, Bluetooth, and NFC. If the external video source is a physical video signal, it is connected directly to the new microphone and handled by the video capture module 54.
  • If the external source is a wireless video input source, such as a mobile phone or a tablet, the external video source searches for the new microphone through a universal protocol, and the new microphone accepts the wireless video source through the data receiving module 58.
  • If the external source is a wireless audio input source, such as music on a mobile phone, the external audio source searches for the new microphone through a universal protocol, and the new microphone accepts the wireless audio source through the data receiving module 58.
  • In both wireless cases, the universal protocol includes, but is not limited to, DLNA, AirPlay, and Wi-Fi Display, and the wireless method includes, but is not limited to, Wi-Fi, Bluetooth, and NFC.
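  • The split between physical and wireless access described above can be sketched as a small routing function. The interface names come from the text; the returned module labels, the normalization of the interface string, and the function name `route_source` are assumptions for illustration only.

```python
# Physical interfaces and wireless carriers named in the text.
PHYSICAL_INTERFACES = {"VGA", "HDMI", "DVI"}
WIRELESS_CARRIERS = {"WIFI", "BLUETOOTH", "NFC"}


def route_source(interface: str) -> str:
    """Return the (hypothetically labeled) module that handles a given interface."""
    key = interface.strip().upper().replace("-", "")
    if key in PHYSICAL_INTERFACES:
        return "video_capture_module_54"
    if key in WIRELESS_CARRIERS:
        return "data_receiving_module_58"
    raise ValueError(f"unsupported interface: {interface}")
```

  • Because the text says the lists of interfaces are not exhaustive, a real device would presumably extend both sets rather than reject unknown interfaces outright.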
  • The media processing module 512 processes the video and audio data collected and received by the system as follows. The collected physical video signal is decoded and then encoded according to the capability negotiated by the media negotiation module 510; the formats include, but are not limited to, H264, Moving Picture Experts Group (MPEG), and MP4.
  • Received data that is not video or audio, such as file data, is presented as a folder or file, decoded, and then encoded according to the negotiated capability.
  • The encoding format includes, but is not limited to, H264, MPEG, and MP4.
  • The video collected through physical and non-physical means is superimposed or synthesized into one channel of video.
  • The synthesis mode includes, but is not limited to, layouts such as a font-shaped layout and a left-right symmetric layout.
  • Audio captured physically, as well as audio arriving over NFC or Bluetooth, is encoded as needed.
  • The superimposed or synthesized video and the encoded audio data are transmitted to the external video and audio device through the sending module.
  • The new microphone communicates with the video and audio device through the input source control module 516, receives the selected input source and synthesis mode information sent by the video and audio device, selects the corresponding input source according to that information, applies the corresponding synthesis mode, and sends the video and audio data to the video and audio device through the media sending module 514.
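  • The control exchange described above can be sketched as follows. The message fields, the class names `ControlMessage` and `InputSourceControl`, and the rule that a single selected source is shown full screen are assumptions for illustration; the patent does not define a message format.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ControlMessage:
    """Hypothetical control signaling sent by the video and audio device."""
    selected: List[str]
    layout: str = "left-right"


@dataclass
class InputSourceControl:
    """Tracks accessed sources and applies the device's selection."""
    accessed: Dict[str, str] = field(default_factory=dict)  # source name -> interface

    def apply(self, msg: ControlMessage) -> Dict[str, object]:
        if not msg.selected:
            raise ValueError("at least one input source must be selected")
        unknown = [name for name in msg.selected if name not in self.accessed]
        if unknown:
            raise KeyError(f"sources not accessed: {unknown}")
        # With one selected source the output is simply that source's content.
        layout = "single" if len(msg.selected) == 1 else msg.layout
        return {"sources": list(msg.selected), "layout": layout}
```

  • The returned dictionary stands in for the instruction handed to the media processing and sending modules; a new `ControlMessage` would be applied whenever the user changes the selection.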
  • FIG. 6 is a first schematic diagram of the audio and video access process according to a preferred embodiment of the present invention. As shown in FIG. 6, the process includes:
  • Step 1: The new microphone broadcasts its own video and audio access capability through the capability notification module 52.
  • Step 2: Notebook A accesses the new microphone. With physical access, the access mode can be HDMI, VGA, and so on, and the new microphone collects the notebook's media signal through the video capture module 54.
  • Alternatively, notebook A searches for the new microphone through a protocol such as Wi-Fi Display, DLNA, or AirPlay, communicates with the data receiving module 58 of the new microphone, and transmits the media data to the new microphone to complete the access.
  • Step 3: The new microphone negotiates the video and audio format to be encoded with the conference television terminal through the media negotiation module 510;
  • Step 4: The input source control module 516 obtains from the external conference television terminal which video source is to be selected and which synthesis mode is to be used. Since there is only one video source, notebook A is selected;
  • Step 5: The media processing module 512 performs encoding according to the synthesis mode, the selected video source, and the negotiated encoding format;
  • Step 6: The media sending module 514 sends the encoded media data to the conference television terminal.
  • Step 7: The user can see the processed video of the notebook through the output of the AV processing device;
  • Step 8: When the video selected by the user changes, the input source control module 516 selects the corresponding video source and synthesis mode, and the result is sent to the conference television terminal.
  • the first step the new microphone broadcasts its own video and audio access capability through the capability notification module 52;
  • Step 2 Notebook A accesses the new mic, including: physically accessing the new mic,
  • the access mode may be HDMI, VGA, etc.
  • the new microphone collects the media signal of the notebook through the video capture module 54.
  • the notebook A searches for a new type of microphone through a protocol such as wifi display or DLNA, airplay, etc., communicates with the data receiving module 58 of the new microphone, and transmits the media data to the new microphone to complete the access.
  • the third step the notebook B accesses the new type of microphone, including: physical access to the new type of microphone, the access mode can be HDMI, VGA and other signal access, the new microphone collects the media signal of the notebook through the video capture module 54.
  • the notebook B searches for a new type of microphone through a protocol such as wifi display or DLNA, airplay, etc., communicates with the data receiving module 58 of the new microphone, and transmits the media data to the new microphone to complete the access.
  • the fourth step the new microphone negotiates the video and audio format to be encoded through the media negotiation module 510 and the conference television terminal;
  • Step 5 The input source control module 516 obtains which video source needs to be selected for the external conference television terminal, and the synthesis mode.
  • FIG. 7 is a second schematic diagram of an audio video access process according to a preferred embodiment of the present invention. As shown in FIG. 7, notebook A and notebook B are simultaneously selected.
  • the synthesis method may be that the notebook A and the notebook B are stacked on the left or right, or may be vertically symmetrical, and is not limited to a specific screen layout.
  • FIG. 8 is a third schematic diagram of an audio video access process according to a preferred embodiment of the present invention. As shown in FIG. 8, a notebook A is selected accordingly. Since only one video source is selected, the synthesis method is the content of the notebook A.
  • FIG. 9 is a schematic diagram 4 of an audio video access process according to a preferred embodiment of the present invention. As shown in FIG. 9, a notebook B is selected correspondingly. Since only one video source is selected, the synthesis method is the content of the notebook B.
  • the fifth step the media processing module 512 performs encoding according to the synthesized mode, the selected video source, and the negotiated encoding format;
  • Step 6 The media sending module 514 sends the encoded media data to the conference television terminal.
  • Step 7 The user can see the processed video through the output of the AV processing device.
  • the eighth step the video selected by the user changes, and the corresponding video source and the synthesized mode selected by the input source control module 516 are sent to the conference television terminal.
  • the first step the new microphone broadcasts its own video and audio access capability through the capability notification module 52.
  • the second step the notebook A accesses the new type of microphone, including: physically accessing the new type of microphone, the access mode can be HDMI, VGA, etc., and the new microphone collects the media signal of the notebook through the video acquisition module 54.
  • the notebook A searches for a new type of microphone through a protocol such as wifi display or DLNA, airplay, etc., communicates with the data receiving module 58 of the new microphone, and transmits the media data to the new microphone to complete the access.
  • the third step the notebook B accesses the new type of microphone, including: physical access to the new type of microphone, the access mode can be HDMI, VGA and other signal access, the new microphone collects the media signal of the notebook through the video capture module 54.
  • the notebook B searches for a new type of microphone through a protocol such as wifi display or DLNA, airplay, etc., communicates with the data receiving module 58 of the new microphone, and transmits the media data to the new microphone to complete the access.
  • the third step the notebook C accesses the new type of microphone, including: physical access to the new type of microphone, the access mode can be HDMI, VGA and other signal access, the new microphone collects the media signal of the notebook through the video capture module 54.
  • the notebook C searches for a new type of microphone through a protocol such as wifi display or DLNA, airplay, etc., communicates with the data receiving module 58 of the new microphone, and transmits the media data to the new microphone to complete the access.
  • the fourth step the new microphone negotiates the video and audio format to be encoded through the media negotiation module 510 and the conference television terminal.
  • Step 5 The input source control module 516 obtains which video source needs to be selected for the external conference television terminal, and the synthesis mode.
  • FIG. 10 is a schematic diagram 5 of an audio video access process according to a preferred embodiment of the present invention. As shown in FIG. 10, notebook A and notebook B, notebook C are simultaneously selected.
  • the synthesis method can be notebook A and notebook B, and notebook C accounts for one-third of each, and is not limited to a specific screen layout.
  • FIG. 11 is a schematic diagram 6 of an audio video access process according to a preferred embodiment of the present invention. As shown in FIG. 11, notebook A and notebook B are selected accordingly.
  • the composition method can be half of the contents of the notebook A and the notebook B, and is not limited to the layout of the screen.
  • FIG. 12 is a schematic diagram 7 of an audio video access process according to a preferred embodiment of the present invention. As shown in FIG. 12, a notebook C is selected correspondingly. Since only one video source is selected, the synthesis method is the content of the notebook C. You can choose any of the input sources.
  • Step 6 The media processing module 512 encodes according to the synthesized mode, the selected video source, and the negotiated encoding format.
  • Step 7 The media sending module 514 sends the encoded media data to the conference television terminal.
  • Step 8 The user can see the processed video through the output of the AV processing device.
  • the ninth step the video selected by the user changes, and the corresponding video source and the synthesized mode selected by the input source control module 516 are sent to the conference television terminal.
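  • FIGS. 10 to 12 show three of the possible selections when notebooks A, B, and C are all accessed. As a small illustration, and assuming that any non-empty subset of the accessed sources may be chosen (the text says only that any of the input sources can be chosen), the selectable views can be enumerated as follows. The function name `selectable_views` is hypothetical.

```python
from itertools import combinations
from typing import List, Sequence, Tuple


def selectable_views(sources: Sequence[str]) -> List[Tuple[str, ...]]:
    """List every non-empty combination of accessed sources, smallest first."""
    views: List[Tuple[str, ...]] = []
    for size in range(1, len(sources) + 1):
        views.extend(combinations(sources, size))
    return views
```

  • For three notebooks this yields seven views, including the three shown in the figures: all of A, B, and C together, A with B, and C alone.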
  • FIG. 13 is an eighth schematic diagram of the audio and video access process according to a preferred embodiment of the present invention. As shown in FIG. 13, the process includes:
  • Step 1: Notebook A, notebook B, and notebook C access the new microphone as in the access steps of the previous example.
  • Step 2: The media processing module 512 encodes the signals and processes the video signals of notebook A, notebook B, and notebook C; an NFC/Bluetooth device transmits a file to the new microphone over NFC/Bluetooth, and the new microphone displays the contents of the folder;
  • Step 3: The new microphone and the conference television terminal negotiate the coding capability and the synthesis mode;
  • Step 4: According to the result negotiated in Step 3, the video content to be displayed from A, B, and C and the received file content are superimposed or synthesized and then encoded;
  • Step 5: The new microphone negotiates the video and audio format to be encoded with the conference television terminal through the media negotiation module 510;
  • Step 6: The media sending module 514 sends all the processed data to the conference television terminal;
  • Step 7: Through the input source control module 516 of the new microphone, the user can choose to view the content of notebook A, notebook B, notebook C, or the NFC/Bluetooth information individually, or to view the video content of notebook A, notebook B, notebook C, and the NFC/Bluetooth information at the same time.
  • The number of devices that access the new microphone, including NFC/Bluetooth devices, is not limited to three, and the output is not limited to a conference television device; any video and audio device capable of output can be used.
  • Step 8: When the video selected by the user changes, the input source control module 516 selects the corresponding video source and synthesis mode, and the result is sent to the conference television terminal.
  • The modules or steps of the present invention described above can be implemented by a general-purpose computing device, and they can be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by a computing device, so that they may be stored in a storage device and executed by the computing device. In some cases the steps shown or described may be performed in an order different from the one given here, or the modules or steps may be fabricated as individual integrated circuit modules, or several of them may be fabricated as a single integrated circuit module.
  • the invention is not limited to any specific combination of hardware and software.
  • The microphone receives one or more channels of audio and video; the microphone synthesizes the one or more channels of video into one channel of video, and encodes the single channel of audio or the audio selected from the multiple channels of audio; the microphone sends the synthesized video and the encoded audio to the video and audio device. This solves the problem in the related art that a traditional video access device cannot meet requirements because its input source interfaces are limited, and it improves the convenience of collaborative interaction.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The present invention provides an audio and video processing method, an apparatus, and a microphone. The method comprises: a microphone receiving one or more channels of audio and video; the microphone synthesizing the one or more channels of video into one channel of video, and encoding the single channel of audio or an audio channel selected from the multiple channels of audio; and the microphone sending the synthesized video and the encoded audio to a video and audio device. The present invention solves the problem in the related art that a traditional video access device cannot meet requirements because its input source interfaces are limited, thereby increasing the convenience of collaborative interaction.

Description

Audio and video processing method, apparatus, and microphone
Technical field
The present invention relates to the field of communications, and in particular to an audio and video processing method, an apparatus, and a microphone.
Background
With the development of video and audio technology and the spread of its applications, more and more video products have appeared, such as wireless speakers, wireless microphones, High Definition Multimedia Interface (HDMI) screen splitters, and conference television terminals. These products make communication easier. In the field of multimedia communication, however, these devices are used independently: mobile devices, wireless microphones, screen splitters, terminals, and the like are not used together as an integrated whole, so the overall integration capability is weak. Take the screen splitter as an example. A screen splitter on the market is a simple input device that outputs one of its connected inputs to a display device. In multimedia communication fields such as conference television, however, there is more than one input source; multiple inputs need to be synthesized, and the user should be able to select among them. A traditional screen splitter also offers no access for mobile devices or data files. A conference television terminal has many input devices, including audio sources in addition to video sources. No product on the market yet combines audio, video, and multi-channel video synthesis while accepting video sources such as mobile devices, computers, and data sources. With the development of multimedia communication, and of cloud conferencing in particular, convenient collaborative interaction will greatly improve product competitiveness.
In the conference television field, most microphones today are purely audio capture devices, while video input is mostly handled by other dedicated video and audio devices, such as traditional conference television terminals and set-top boxes. The input source interfaces of these dedicated devices are limited and cannot meet the needs of scenarios that call for convenient collaborative interaction. Traditional video access is handled on devices such as conference television terminals and set-top boxes, which has the following drawbacks:
1) As video and audio access capabilities grow, the variety of access interfaces, such as the serial digital interface (SDI), HDMI, the Video Graphics Array (VGA) analog output interface of graphics cards, and DVI, leaves conference television devices with many peripheral interfaces and complicated wiring.
2) As video and audio transmission access methods multiply, such as AirPlay, Digital Living Network Alliance (DLNA), Miracast, NFC, and Bluetooth, traditional conference television devices, which involve hardware and software development, release builds, and long production cycles, have difficulty switching quickly to the latest access methods.
3) The auxiliary stream of a traditional conference television system can usually carry only one input, so switching the auxiliary stream is very cumbersome when several people join a discussion and connect their devices.
No effective solution has yet been proposed for the problem in the related art that a traditional video access device cannot meet requirements because its input source interfaces are limited.
Summary of the invention
Embodiments of the present invention provide an audio and video processing method, an apparatus, and a microphone, to at least solve the problem in the related art that a traditional video access device cannot meet requirements because its input source interfaces are limited.
According to an embodiment of the present invention, an audio and video processing method is provided, including: a microphone receives one or more channels of audio and video; the microphone synthesizes the one or more channels of video into one channel of video, and encodes the single channel of audio or the audio selected from the multiple channels of audio; the microphone sends the synthesized channel of video and the encoded audio to a video and audio device.
Preferably, before the microphone receives the one or more channels of audio and video, the method further includes: the microphone broadcasts its audio and video access capability over a general protocol, where the general protocol includes DLNA, AirPlay wireless transmission, and Wi-Fi Display wireless display.
Preferably, the microphone receiving one or more channels of audio and video includes: receiving the one or more channels of audio and video through a physical port, a Wi-Fi wireless local area network, Bluetooth, or near field communication (NFC).
Preferably, before the microphone synthesizes the one or more channels of video into one channel of video, the method further includes: the microphone decodes the received one or more channels of video, and encodes the decoded video according to an encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, Moving Picture Experts Group (MPEG), MP4, VP8, and VP9.
Preferably, the microphone synthesizing the one or more channels of video into one channel of video includes: the microphone receives, from the video and audio device, information on the selected input source and the synthesis mode; according to the information, it selects the corresponding one or more channels of video and synthesizes them into one channel of video using the corresponding synthesis mode.
Preferably, the synthesis mode includes one of the following: a 品-shaped layout (one pane on top and two below), or a left-right symmetric layout.
Preferably, before the microphone synthesizes the one or more channels of video into one channel of video, the method further includes: the microphone selects, through the video and audio device, the video to be played from the one or more channels of video.
According to another aspect of the embodiments of the present invention, an audio and video processing apparatus is also provided, including: a receiving module, configured to receive one or more channels of audio and video; a synthesizing module, configured to synthesize the one or more channels of video into one channel of video and to encode the single channel of audio or the audio selected from the multiple channels of audio; and a sending module, configured to send the synthesized channel of video and the encoded audio to a video and audio device.
Preferably, the apparatus further includes: a broadcast module, configured to broadcast the audio and video access capability over a general protocol, where the general protocol includes Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display wireless display.
Preferably, the receiving module includes: a receiving unit, configured to receive the one or more channels of audio and video through a physical port, a Wi-Fi wireless local area network, Bluetooth, or near field communication (NFC).
Preferably, the apparatus further includes: a decoding module, configured to decode the received one or more channels of video; and an encoding module, configured to encode the decoded one or more channels of video according to an encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, MPEG, MP4, VP8, and VP9.
An embodiment of the present invention also provides a microphone that includes the above apparatus.
An embodiment of the present invention also provides a computer storage medium that may store execution instructions for carrying out the audio and video processing method of the foregoing embodiments.
Through the embodiments of the present invention, the microphone receives one or more channels of audio and video; the microphone synthesizes the one or more channels of video into one channel of video, and encodes the single channel of audio or the audio selected from the multiple channels of audio; the microphone sends the synthesized channel of video and the encoded audio to the video and audio device. This solves the problem in the related art that a traditional video access device cannot meet requirements because its input source interfaces are limited, and it improves the convenience of collaborative interaction.
Brief description of the drawings
The drawings described here provide a further understanding of the present invention and form a part of this application. The illustrative embodiments of the present invention and their descriptions are used to explain the present invention and do not unduly limit it. In the drawings:
FIG. 1 is a flowchart of an audio and video processing method according to an embodiment of the present invention;
FIG. 2 is a block diagram of an audio and video processing apparatus according to an embodiment of the present invention;
FIG. 3 is a first block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention;
FIG. 4 is a second block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention;
FIG. 5 is a structural block diagram of a new microphone according to a preferred embodiment of the present invention;
FIG. 6 is a first schematic diagram of audio and video access processing according to a preferred embodiment of the present invention;
FIG. 7 is a second schematic diagram of audio and video access processing according to a preferred embodiment of the present invention;
FIG. 8 is a third schematic diagram of audio and video access processing according to a preferred embodiment of the present invention;
FIG. 9 is a fourth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention;
FIG. 10 is a fifth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention;
FIG. 11 is a sixth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention;
FIG. 12 is a seventh schematic diagram of audio and video access processing according to a preferred embodiment of the present invention;
FIG. 13 is an eighth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention.
Detailed description
The present invention is described in detail below with reference to the drawings and in conjunction with the embodiments. Note that, where no conflict arises, the embodiments in this application and the features in the embodiments may be combined with one another.
Note that the terms "first", "second", and the like in the specification, the claims, and the above drawings are used to distinguish similar objects and are not necessarily used to describe a particular order or sequence.
This embodiment provides an audio and video processing method. FIG. 1 is a flowchart of the audio and video processing method according to an embodiment of the present invention. As shown in FIG. 1, the flow includes the following steps:
Step S102: the microphone receives one or more channels of audio and video;
Step S104: the microphone synthesizes the one or more channels of video into one channel of video, and encodes the single channel of audio or the audio selected from the multiple channels of audio;
Step S106: the microphone sends the synthesized channel of video and the encoded audio to the video and audio device.
Through the above steps, the microphone receives one or more channels of audio and video; the microphone synthesizes the one or more channels of video into one channel of video, and encodes the single channel of audio or the audio selected from the multiple channels of audio, where one or more channels of audio may be selected from the multiple channels for encoding; the microphone sends the synthesized channel of video and the encoded audio to the video and audio device. This solves the problem in the related art that a traditional video access device cannot meet requirements because its input source interfaces are limited, and it improves the convenience of collaborative interaction.
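Steps S102 to S106 can be sketched as a small pipeline. The following Python is a minimal illustration only: the `Source` type, the `process` function, and the string stand-ins for composed video and encoded audio are hypothetical names chosen for this sketch and do not appear in the application, and no real codec is invoked.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Source:
    """One input connected to the microphone (hypothetical type)."""
    name: str
    video: Optional[str] = None   # stand-in for a video stream
    audio: Optional[str] = None   # stand-in for an audio stream


def process(sources: List[Source], audio_choice: Optional[str] = None) -> Tuple[str, str]:
    """S104: combine all video into one channel and encode one audio channel."""
    videos = [s.video for s in sources if s.video is not None]
    audios = {s.name: s.audio for s in sources if s.audio is not None}
    if not videos or not audios:
        raise ValueError("need at least one video and one audio channel")
    # Synthesize one or more video channels into a single channel.
    composed = "compose(" + ",".join(videos) + ")"
    # With several audio channels, encode only the selected one;
    # fall back to the first channel if no valid choice was made.
    chosen = audio_choice if audio_choice in audios else next(iter(audios))
    encoded = "encode(" + audios[chosen] + ")"
    return composed, encoded   # S106 would send this pair to the AV device


# S102: the microphone has received two video inputs and two audio inputs.
inputs = [
    Source("notebook_a", video="vA", audio="aA"),
    Source("mic", audio="aMic"),
    Source("notebook_b", video="vB"),
]
result = process(inputs, audio_choice="mic")
print(result)
```

The sketch encodes a single selected audio channel; the text also allows more than one channel to be selected, which would turn `chosen` into a list.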
So that other devices can discover that they can connect to the microphone, the microphone broadcasts its audio and video access capability over a general protocol before receiving the one or more channels of audio and video. The general protocol includes Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display wireless display; note that it is not limited to these protocols.
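As a rough illustration of what such a capability broadcast might carry, the sketch below builds a JSON announcement. The message layout, the field names, and the function `build_capability_announcement` are assumptions made for this example; DLNA, AirPlay, and Wi-Fi Display each define their own discovery mechanisms, none of which is implemented here.

```python
import json
from typing import List

# Protocol names come from the text; the message layout is an assumption.
SUPPORTED_PROTOCOLS = ["DLNA", "airplay", "WIFI display"]


def build_capability_announcement(device_name: str, protocols: List[str]) -> str:
    """Return a JSON announcement that the microphone could broadcast."""
    message = {
        "device": device_name,
        "accepts": ["audio", "video"],   # kinds of media the device takes in
        "protocols": list(protocols),    # ways an external source may connect
    }
    return json.dumps(message, sort_keys=True)


announcement = build_capability_announcement("new-microphone", SUPPORTED_PROTOCOLS)
print(announcement)
```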
In an optional embodiment, the microphone receiving one or more channels of audio and video may include: the microphone receives the one or more channels of audio and video through a physical port, a Wi-Fi wireless local area network, Bluetooth, or near field communication (NFC).
Preferably, before synthesizing the one or more channels of video into one channel of video, the microphone decodes the received one or more channels of video and then encodes the decoded video according to the encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, MPEG, MP4, VP8, VP9, and the like.
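The negotiate-then-transcode order described here can be sketched as follows. This is a minimal illustration under stated assumptions: the preference order in `MIC_FORMATS`, the terminal's capability list, and the functions `negotiate_format` and `transcode` are hypothetical, and strings stand in for real decoded and encoded streams.

```python
from typing import List, Optional

# Formats listed in the text, in an assumed preference order for the microphone.
MIC_FORMATS = ["H265", "H264", "H263", "VP9", "VP8", "MPEG", "MP4"]


def negotiate_format(local: List[str], remote: List[str]) -> Optional[str]:
    """Return the first locally preferred format that the remote side supports."""
    remote_set = set(remote)
    for fmt in local:
        if fmt in remote_set:
            return fmt
    return None


def transcode(channels: List[str], fmt: str) -> List[str]:
    """Decode every received channel, then re-encode it in the negotiated format."""
    decoded = [f"decoded({c})" for c in channels]
    return [f"{fmt}[{d}]" for d in decoded]


terminal_formats = ["H263", "H264"]   # hypothetical capability of the terminal
chosen = negotiate_format(MIC_FORMATS, terminal_formats)
print(chosen)
print(transcode(["notebook_a", "phone"], chosen))
```

Picking the first mutually supported entry from a preference list is one common way to settle a format; the application does not state which rule the negotiation uses.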
Preferably, the microphone synthesizing the one or more channels of video into one channel of video may include: the microphone receives, from the video and audio device, information on the selected input source and the synthesis mode; according to the information, it selects the corresponding one or more channels of video and synthesizes them into one channel of video using the corresponding synthesis mode.
The synthesis mode includes one of the following: a 品-shaped layout (one pane on top and two below), or a left-right symmetric layout. Note that the synthesis mode is not limited to these two implementations.
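The two layouts named above can be expressed as pane rectangles on an output canvas. The sketch below is an assumption-laden illustration: the mode names `"left_right"` and `"pin"`, the function `layout_rects`, and the exact pane proportions are chosen for this example and are not specified in the application.

```python
from typing import List, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def layout_rects(mode: str, width: int, height: int) -> List[Rect]:
    """Return the pane rectangles for a synthesis mode on a width x height canvas."""
    half_w, half_h = width // 2, height // 2
    if mode == "left_right":
        # Two equal panes side by side.
        return [(0, 0, half_w, height), (half_w, 0, width - half_w, height)]
    if mode == "pin":
        # 品-shaped: one pane centred on top, two panes side by side below.
        top = (width // 4, 0, half_w, half_h)
        bottom_left = (0, half_h, half_w, height - half_h)
        bottom_right = (half_w, half_h, width - half_w, height - half_h)
        return [top, bottom_left, bottom_right]
    raise ValueError(f"unknown synthesis mode: {mode}")


print(layout_rects("left_right", 1920, 1080))
print(layout_rects("pin", 1920, 1080))
```

Each decoded source frame would then be scaled into its rectangle before the composed frame is encoded.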
To make video selection easier, the microphone can select, via control exercised through the video and audio device, the video to be played from the one or more channels of video, synthesize the selected video, and pass it to the video and audio device for playback.
An embodiment of the present invention also provides an audio and video processing apparatus. FIG. 2 is a block diagram of the audio and video processing apparatus according to an embodiment of the present invention. As shown in FIG. 2, the apparatus includes:
a receiving module 22, configured to receive one or more channels of audio and video;
a synthesizing module 24, configured to synthesize the one or more channels of video into one channel of video, and to encode the single channel of audio or the audio selected from the multiple channels of audio;
a sending module 26, configured to send the synthesized channel of video and the encoded audio to the video and audio device.
FIG. 3 is a first block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention. As shown in FIG. 3, the apparatus further includes:
a broadcast module 32, configured to broadcast the audio and video access capability over a general protocol, where the general protocol includes Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display wireless display.
Preferably, the receiving module includes: a receiving unit, configured to receive the one or more channels of audio and video through a physical port, a Wi-Fi wireless local area network, Bluetooth, or near field communication (NFC).
FIG. 4 is a second block diagram of an audio and video processing apparatus according to a preferred embodiment of the present invention. As shown in FIG. 4, the apparatus further includes:
a decoding module 42, configured to decode the received one or more channels of video;
an encoding module 44, configured to encode the decoded one or more channels of video according to the encoding format negotiated in advance with the video and audio device, where the encoding format includes H263, H264, H265, MPEG, MP4, VP8, VP9, and the like.
An embodiment of the present invention also provides a microphone that includes the above apparatus.
An embodiment of the present invention also provides a storage medium. Optionally, in this embodiment, the storage medium may be configured to store program code for performing the following steps:
Step S1: the microphone receives one or more channels of audio and video;
Step S2: the microphone synthesizes the one or more channels of video into one channel of video, and encodes the single channel of audio or the audio selected from the multiple channels of audio;
Step S3: the microphone sends the synthesized channel of video and the encoded audio to the video and audio device.
Optionally, in this embodiment, the storage medium may include, but is not limited to, any medium that can store program code, such as a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
Optionally, in this embodiment, a processor performs steps S1, S2, and S3 above according to the program code stored in the storage medium.
Optionally, for specific examples in this embodiment, refer to the examples described in the foregoing embodiments and optional implementations; details are not repeated here.
In the conference television field, the user is usually closest to the sound capture device. Embodiments of the present invention therefore build on the audio capture device, such as a microphone, and add video access to the microphone to address the drawbacks of the current video and audio field. The present invention adopts the following technical solution:
FIG. 5 is a structural block diagram of the new microphone supporting video input and output according to a preferred embodiment of the present invention. As shown in FIG. 5, the new microphone mainly includes the following modules:
Capability notification module 52: configured to report the device's capability of being accessed externally, so that external sources can connect to the device.
Video capture module 54: configured to capture video data arriving over a physical connection, for example through common physical interfaces such as VGA, HDMI, and DVI.
Audio capture module 56: the module through which the microphone captures sound.
Data receiving module 58: configured to receive video and audio data that arrives through non-physical interfaces, that is, data other than the video input over physical interfaces and the captured audio. The received data includes data, audio, and the like received over Wi-Fi using interconnection protocols such as Miracast, Wi-Fi Display, AirPlay, and DLNA, or through other means such as NFC and Bluetooth.
Media negotiation module 510: configured to negotiate with the remote device the media capabilities used between the two sides.
Media processing module 512: configured to process the captured and received video and audio data, including superimposing or synthesizing the video data from multiple inputs and compression-encoding it into data of the corresponding format.
Media sending module 514: configured to send the superimposed or synthesized data, as needed, to an external video and audio device such as a conference television terminal. The superimposed or synthesized data may be a single one of the inputs connected to the system, or a superposition or synthesis of several inputs chosen as needed from all the inputs connected to the system.
Input source control module 516: configured to receive control signaling from the video and audio device and use it to select among the video and audio sources captured by the new microphone, determining which video and audio sources are selected and which synthesis mode is used for the data sent to the video and audio device.
The method for applying the new microphone supporting video input and output according to the present invention includes the following:
The new microphone publishes its video and audio access capability through the capability notification module 52 over a general protocol. General protocols include, but are not limited to, DLNA, AirPlay, and Wi-Fi Display. Communication carriers include, but are not limited to, Wi-Fi, Bluetooth, and NFC.
If the external video source is a physical video signal, the external video source is connected directly to the new microphone and handled by the video capture module 54.
If the external source is a wireless video input source, such as a mobile phone or a tablet (PAD), the external video source finds the new microphone through a general protocol, and the new microphone accepts the wireless video source through the data receiving module 58. General protocols include, but are not limited to, DLNA, AirPlay, and Wi-Fi Display. Wireless modes include, but are not limited to, Wi-Fi, Bluetooth, and NFC.
If the external source is a wireless audio input source, such as music on a mobile phone, the external audio source finds the new microphone through a general protocol, and the new microphone accepts the wireless audio source through the data receiving module 58. General protocols include, but are not limited to, DLNA, AirPlay, and Wi-Fi Display. Wireless modes include, but are not limited to, Wi-Fi, Bluetooth, and NFC.
The media negotiation module 510 negotiates the video and audio codec format with the video and audio processing device that the new microphone is to connect to, such as a conference television terminal.
The media processing module 512 processes the video and audio data captured and received by the system as follows:
The captured physical video signal is decoded and then encoded according to the capability negotiated by the negotiation module. Formats include, but are not limited to, H264, Moving Picture Experts Group (MPEG), and MP4.
Received data that is not video or audio, such as file data, is decoded as folders and files and then encoded according to the negotiated capability. Encoding formats include, but are not limited to, H264, MPEG, and MP4.
The video information captured physically and the video information received non-physically are superimposed or synthesized into one channel of video. Synthesis modes include, but are not limited to, layouts such as the 品-shaped layout and the left-right symmetric layout.
Of the physically captured audio and the audio captured at close range over NFC or Bluetooth, one is encoded as needed.
The superimposed or synthesized video and the encoded audio data are sent to the external video and audio device through the data module.
The new microphone communicates with the video and audio device through the input source control module 516 and receives the input source selection and synthesis mode information sent by the video and audio device. According to this information, the new microphone selects the corresponding input sources, applies the corresponding synthesis mode, and sends the video and audio data to the video and audio device through the media sending module 514.
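The handling of a selection message from the video and audio device can be sketched as follows. The message fields `mode` and `sources`, the mode names, and the function `apply_selection` are hypothetical; the application does not define the format of the control signaling.

```python
from typing import Any, Dict, List, Tuple

# Number of panes each synthesis mode can show (mode names are assumptions).
PANES = {"single": 1, "left_right": 2, "pin": 3}


def apply_selection(available: Dict[str, str], message: Dict[str, Any]) -> Tuple[str, List[str]]:
    """Pick the streams named in a control message and check them against the mode."""
    mode = str(message["mode"])
    wanted = list(message["sources"])
    if mode not in PANES:
        raise ValueError(f"unknown synthesis mode: {mode}")
    missing = [name for name in wanted if name not in available]
    if missing:
        raise KeyError(f"sources not connected: {missing}")
    if len(wanted) != PANES[mode]:
        raise ValueError(f"mode {mode} needs {PANES[mode]} source(s)")
    # The returned streams would be handed to the media processing step.
    return mode, [available[name] for name in wanted]


connected = {"notebook_a": "vA", "notebook_b": "vB", "phone": "vP"}
control_message = {"mode": "left_right", "sources": ["notebook_a", "phone"]}
selection = apply_selection(connected, control_message)
print(selection)
```

Validating the message before composing means a stale or malformed selection fails early instead of producing a partially filled frame.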
Example 1
Notebook A accesses the conference television. FIG. 6 is a first schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. As shown in FIG. 6, the process includes:
Step 1: The new microphone broadcasts its video and audio access capability through the capability notification module 52.
Step 2: Notebook A accesses the new microphone. Access can be physical: the notebook connects over a signal interface such as HDMI or VGA, and the new microphone captures the notebook's media signal through the video capture module 54. Notebook A can also discover the new microphone through a protocol such as, but not limited to, Wi-Fi Display, DLNA, or AirPlay, communicate with the data receiving module 58 of the new microphone, and send its media data to the new microphone to complete the access.
Step 3: The new microphone negotiates the video and audio format to be encoded with the conference television terminal through the media negotiation module 510.
Step 4: The input source control module 516 obtains from the external conference television terminal which video source to select and which synthesis mode to use. Because there is only one video source, notebook A is selected.
Step 5: The media processing module 512 encodes according to the synthesis mode, the selected video source, and the negotiated encoding format.
Step 6: The media sending module 514 sends the encoded media data to the conference television terminal.
Step 7: The user can view the processed video of the notebook on the output of the video and audio processing device.
Step 8: When the video selected by the user changes, the video source and synthesis mode selected accordingly by the input source control module 516 are sent to the conference television terminal.
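The source selection in this example, where a single connected source is chosen regardless of the request, can be sketched as follows. This is only an illustration: the `Selection` type, the source identifiers, the layout names, and the fallback to the first connected source are all assumptions not taken from the description.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Selection:
    sources: List[str]  # hypothetical source identifiers, e.g. "A"
    layout: str         # hypothetical layout name, e.g. "left-right"


def resolve_selection(connected: List[str], requested: Selection) -> Selection:
    """Reconcile the terminal's request with the sources actually connected."""
    if not connected:
        raise ValueError("no input source is connected")
    # With a single connected source there is nothing to choose from.
    if len(connected) == 1:
        return Selection(sources=[connected[0]], layout="single")
    # Keep only requested sources that are really connected.
    chosen = [s for s in requested.sources if s in connected]
    if not chosen:
        # Assumption: fall back to the first connected source.
        chosen = [connected[0]]
    # A layout only matters when more than one source is shown.
    layout = requested.layout if len(chosen) > 1 else "single"
    return Selection(sources=chosen, layout=layout)
```

When the user's selection changes, calling `resolve_selection` again with the new request yields the source list and layout that the encoder would use next.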
Example 2
Notebook A and notebook B access the conference television through the new microphone. The process includes:
Step 1: The new microphone broadcasts its video and audio access capability through the capability notification module 52.
Step 2: Notebook A accesses the new microphone. Access can be physical: the notebook connects over a signal interface such as HDMI or VGA, and the new microphone captures the notebook's media signal through the video capture module 54. Notebook A can also discover the new microphone through a protocol such as, but not limited to, Wi-Fi Display, DLNA, or AirPlay, communicate with the data receiving module 58 of the new microphone, and send its media data to the new microphone to complete the access.
Step 3: Notebook B accesses the new microphone. Access can be physical: the notebook connects over a signal interface such as HDMI or VGA, and the new microphone captures the notebook's media signal through the video capture module 54. Notebook B can also discover the new microphone through a protocol such as, but not limited to, Wi-Fi Display, DLNA, or AirPlay, communicate with the data receiving module 58 of the new microphone, and send its media data to the new microphone to complete the access.
Step 4: The new microphone negotiates the video and audio format to be encoded with the conference television terminal through the media negotiation module 510.
Step 5: The input source control module 516 obtains from the external conference television terminal which video source or sources to select and which synthesis mode to use.
FIG. 7 is a second schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. FIG. 7 corresponds to selecting notebook A and notebook B at the same time. The synthesis mode may place notebook A and notebook B side by side (left-right symmetric) or one above the other (top-bottom symmetric), and is not limited to a specific picture layout.
FIG. 8 is a third schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. FIG. 8 corresponds to selecting notebook A. Because only one video source is selected, the synthesized picture is simply the content of notebook A.
FIG. 9 is a fourth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. FIG. 9 corresponds to selecting notebook B. Because only one video source is selected, the synthesized picture is simply the content of notebook B.
Step 6: The media processing module 512 encodes according to the synthesis mode, the selected video source or sources, and the negotiated encoding format.
Step 7: The media sending module 514 sends the encoded media data to the conference television terminal.
Step 8: The user can view the processed video on the output of the video and audio processing device.
Step 9: When the video selected by the user changes, the video source and synthesis mode selected accordingly by the input source control module 516 are sent to the conference television terminal.
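The left-right and top-bottom arrangements in this example amount to dividing the output picture into equal tiles. A minimal sketch of that geometry follows; the function name `tile_rects`, the mode strings, and the `(x, y, width, height)` rectangle convention are assumptions chosen for illustration, and the description itself does not restrict the layout.

```python
from typing import Dict, List, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height) in output pixels


def tile_rects(sources: List[str], width: int, height: int,
               mode: str = "left-right") -> Dict[str, Rect]:
    """Split the output picture into equal tiles, one per selected source."""
    n = len(sources)
    if n == 0:
        raise ValueError("at least one source must be selected")
    rects: Dict[str, Rect] = {}
    for i, name in enumerate(sources):
        if mode == "left-right":
            # Integer boundaries so the tiles cover the full width exactly.
            x0, x1 = i * width // n, (i + 1) * width // n
            rects[name] = (x0, 0, x1 - x0, height)
        elif mode == "top-bottom":
            y0, y1 = i * height // n, (i + 1) * height // n
            rects[name] = (0, y0, width, y1 - y0)
        else:
            raise ValueError(f"unknown layout mode: {mode}")
    return rects
```

With a single selected source the function returns one full-picture rectangle, which matches the cases where only notebook A or only notebook B is selected.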
Example 3
Notebook A, notebook B, and notebook C access the conference television through the new microphone. The process includes:
Step 1: The new microphone broadcasts its video and audio access capability through the capability notification module 52.
Step 2: Notebook A accesses the new microphone. Access can be physical: the notebook connects over a signal interface such as HDMI or VGA, and the new microphone captures the notebook's media signal through the video capture module 54. Notebook A can also discover the new microphone through a protocol such as, but not limited to, Wi-Fi Display, DLNA, or AirPlay, communicate with the data receiving module 58 of the new microphone, and send its media data to the new microphone to complete the access.
Step 3: Notebook B accesses the new microphone. Access can be physical: the notebook connects over a signal interface such as HDMI or VGA, and the new microphone captures the notebook's media signal through the video capture module 54. Notebook B can also discover the new microphone through a protocol such as, but not limited to, Wi-Fi Display, DLNA, or AirPlay, communicate with the data receiving module 58 of the new microphone, and send its media data to the new microphone to complete the access.
Step 4: Notebook C accesses the new microphone. Access can be physical: the notebook connects over a signal interface such as HDMI or VGA, and the new microphone captures the notebook's media signal through the video capture module 54. Notebook C can also discover the new microphone through a protocol such as, but not limited to, Wi-Fi Display, DLNA, or AirPlay, communicate with the data receiving module 58 of the new microphone, and send its media data to the new microphone to complete the access.
Step 5: The new microphone negotiates the video and audio format to be encoded with the conference television terminal through the media negotiation module 510.
Step 6: The input source control module 516 obtains from the external conference television terminal which video source or sources to select and which synthesis mode to use.
FIG. 10 is a fifth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. FIG. 10 corresponds to selecting notebook A, notebook B, and notebook C at the same time. The synthesis mode may give notebook A, notebook B, and notebook C one third of the picture each, arranged from left to right, and is not limited to a specific picture layout.
FIG. 11 is a sixth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. FIG. 11 corresponds to selecting notebook A and notebook B. The synthesis mode may give notebook A and notebook B half of the picture each, and is not limited to a specific picture layout.
FIG. 12 is a seventh schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. FIG. 12 corresponds to selecting notebook C. Because only one video source is selected, the synthesized picture is simply the content of notebook C. Any one of the input sources may be selected in this way.
Step 7: The media processing module 512 encodes according to the synthesis mode, the selected video source or sources, and the negotiated encoding format.
Step 8: The media sending module 514 sends the encoded media data to the conference television terminal.
Step 9: The user can view the processed video on the output of the video and audio processing device.
Step 10: When the video selected by the user changes, the video source and synthesis mode selected accordingly by the input source control module 516 are sent to the conference television terminal.
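Synthesizing several sources into one picture, as in the three-way case of this example, means scaling each decoded frame into its share of the output and pasting it there. The sketch below illustrates this with plain Python lists standing in for decoded frames. Nearest-neighbour scaling, the `Frame` representation, and the function names are assumptions for illustration; a real device would operate on decoded video buffers, and the description does not prescribe a scaling method.

```python
from typing import List

# A frame is a list of rows of pixel values: a stand-in for a decoded
# video frame, used only for this sketch.
Frame = List[List[int]]


def scale_nearest(frame: Frame, out_w: int, out_h: int) -> Frame:
    """Resize a frame to out_w x out_h using nearest-neighbour sampling."""
    in_h, in_w = len(frame), len(frame[0])
    return [
        [frame[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
        for y in range(out_h)
    ]


def compose_columns(frames: List[Frame], width: int, height: int) -> Frame:
    """Place the frames side by side, each in an equal-width column."""
    n = len(frames)
    canvas = [[0] * width for _ in range(height)]
    for i, frame in enumerate(frames):
        # Column boundaries; the last column absorbs any rounding remainder.
        x0, x1 = i * width // n, (i + 1) * width // n
        tile = scale_nearest(frame, x1 - x0, height)
        for y in range(height):
            canvas[y][x0:x1] = tile[y]
    return canvas
```

Passing two frames gives the half-and-half picture, and passing one frame gives the full-picture case, so the same routine covers each of the selections shown in the figures.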
Example 4
Notebook A, notebook B, notebook C, and an NFC/Bluetooth device access the conference television through the new microphone. FIG. 13 is an eighth schematic diagram of audio and video access processing according to a preferred embodiment of the present invention. As shown in FIG. 13, the process includes:
Step 1: Notebook A, notebook B, and notebook C access the new microphone as described in the first, second, third, and subsequent steps of the preceding examples.
Step 2: The media processing module 512 encodes the signals and processes the video signals of notebook A, notebook B, and notebook C. The NFC/Bluetooth device transfers files to the new microphone over NFC/Bluetooth, and the new microphone presents them as folder contents.
Step 3: The new microphone and the conference television terminal negotiate the encoding capability and the synthesis mode.
Step 4: The video content of A, B, and C and the content to be displayed from the received files are superimposed or synthesized according to the result negotiated in Step 3, and then encoded.
Step 5: The new microphone negotiates the video and audio format to be encoded with the conference television terminal through the media negotiation module 510.
Step 6: The media sending module 514 sends all of the processed data to the conference television terminal.
Step 7: According to the user's needs, the user can use the input source control module 516 of the new microphone to view the content of notebook A, notebook B, notebook C, or the NFC/Bluetooth information individually, or to view the video content of notebook A, notebook B, and notebook C together with the content presented from the NFC/Bluetooth information at the same time. Note that access by NFC/Bluetooth devices through the new microphone for output on the conference television device is not limited to three connected devices, nor to sending to a conference television device; any video and audio device with an output can be used.
Step 8: When the video selected by the user changes, the video source and synthesis mode selected accordingly by the input source control module 516 are sent to the conference television terminal.
Compared with the prior art, this improves video and audio communication and delivery in the video and audio field, simplifies line access, and greatly enhances the effectiveness of communication.
It will be apparent to those skilled in the art that the modules or steps of the present invention described above can be implemented with a general-purpose computing device. They can be concentrated on a single computing device or distributed across a network of multiple computing devices. Optionally, they can be implemented with program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device. In some cases, the steps shown or described can be performed in an order different from the one given here, or the modules or steps can each be fabricated as an individual integrated circuit module, or several of them can be fabricated as a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.
The above describes only preferred embodiments of the present invention and is not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.
Industrial Applicability
According to the embodiments of the present invention, a microphone receives one or more channels of audio and video; the microphone synthesizes the one or more channels of video into one channel of video, and encodes the one channel of audio or audio selected from the multiple channels of audio; and the microphone sends the synthesized channel of video and the encoded audio to a video and audio device. This solves the problem in the related art that a traditional video access device cannot meet requirements because of its limited input source interfaces, and improves the convenience of collaborative interaction.

Claims (11)

  1. An audio and video processing method, comprising:
    a microphone receiving one or more channels of audio and video;
    the microphone synthesizing the one or more channels of video into one channel of video, and encoding the one channel of audio or audio selected from the multiple channels of audio;
    the microphone sending the synthesized channel of video and the encoded audio to a video and audio device.
  2. The method according to claim 1, wherein before the microphone receives the one or more channels of audio and video, the method further comprises:
    the microphone broadcasting its audio and video access capability through a general-purpose protocol, wherein the general-purpose protocol comprises Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display wireless display.
  3. The method according to claim 2, wherein the microphone receiving the one or more channels of audio and video comprises:
    the microphone receiving the one or more channels of audio and video through a physical port, wireless local area network (Wi-Fi), Bluetooth, or near field communication (NFC).
  4. The method according to claim 1, wherein before the microphone synthesizes the one or more channels of video into one channel of video, the method further comprises:
    the microphone decoding the received one or more channels of video;
    the microphone encoding the decoded one or more channels of video according to an encoding format negotiated in advance with the video and audio device, wherein the encoding format comprises H263, H264, H265, Moving Picture Experts Group (MPEG), MP4, VP8, and VP9.
  5. The method according to claim 4, wherein the microphone synthesizing the one or more channels of video into one channel of video comprises:
    the microphone receiving information, sent by the video and audio device, on the selected input source and the synthesis mode;
    the microphone selecting the corresponding one or more channels of video according to the information, and synthesizing the selected one or more channels of video into one channel of video using the corresponding synthesis mode.
  6. The method according to claim 5, wherein the synthesis mode comprises one of the following: a 品-shaped layout, a left-right symmetric layout.
  7. The method according to any one of claims 1 to 6, wherein before the microphone synthesizes the one or more channels of video into one channel of video, the method further comprises:
    the microphone selecting, through the video and audio device, the video to be played from the one or more channels of video.
  8. An audio and video processing apparatus, applied to a microphone, comprising:
    a receiving module, configured to receive one or more channels of audio and video;
    a synthesis module, configured to synthesize the one or more channels of video into one channel of video, and to encode the one channel of audio or audio selected from the multiple channels of audio;
    a sending module, configured to send the synthesized channel of video and the encoded audio to a video and audio device.
  9. The apparatus according to claim 8, wherein the apparatus further comprises:
    a broadcast module, configured to broadcast the audio and video access capability through a general-purpose protocol, wherein the general-purpose protocol comprises Digital Living Network Alliance (DLNA), AirPlay wireless transmission, and Wi-Fi Display wireless display.
  10. The apparatus according to claim 8, wherein the apparatus further comprises:
    a decoding module, configured to decode the received one or more channels of video;
    an encoding module, configured to encode the decoded one or more channels of video according to an encoding format negotiated in advance with the video and audio device, wherein the encoding format comprises H263, H264, H265, Moving Picture Experts Group (MPEG), MP4, VP8, and VP9.
  11. A microphone, comprising the apparatus according to any one of claims 8 to 10.
PCT/CN2017/083816 2016-06-29 2017-05-10 Audio and video processing method, apparatus and microphone WO2018000953A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610495723.2 2016-06-29
CN201610495723.2A CN107547824A (en) 2016-06-29 2016-06-29 Audio/video processing method, device and Mike

Publications (1)

Publication Number Publication Date
WO2018000953A1 true WO2018000953A1 (en) 2018-01-04

Family

ID=60785831

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/083816 WO2018000953A1 (en) 2016-06-29 2017-05-10 Audio and video processing method, apparatus and microphone

Country Status (2)

Country Link
CN (1) CN107547824A (en)
WO (1) WO2018000953A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114157828A (en) * 2021-12-03 2022-03-08 北京达佳互联信息技术有限公司 Device audio adjustment method, device, electronic device, medium and program product
CN116962790A (en) * 2023-08-02 2023-10-27 深圳市辉宏科技有限公司 A video interactive system, method and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101309390A (en) * 2007-05-17 2008-11-19 华为技术有限公司 Visual communication system, apparatus and subtitle displaying method
US20100157016A1 (en) * 2008-12-23 2010-06-24 Nortel Networks Limited Scalable video encoding in a multi-view camera system
CN102404547A (en) * 2011-11-24 2012-04-04 中兴通讯股份有限公司 Method and terminal for realizing video conference cascade
CN103841360A (en) * 2013-12-11 2014-06-04 三亚中兴软件有限责任公司 Distributed video conference achieving method and system, video conference terminal and audio and video integrated device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100505864C (en) * 2005-02-06 2009-06-24 中兴通讯股份有限公司 A multi-point video conferencing system and its media processing method
CN103888488A (en) * 2012-12-20 2014-06-25 三星电子(中国)研发中心 Method for sharing data based on WIFI
CN104010155B (en) * 2013-02-27 2017-12-22 联芯科技有限公司 Video telephone realization method and mobile terminal
CN103426431B (en) * 2013-07-24 2016-08-10 阳光凯讯(北京)科技有限公司 The converged communication system of satellite network and terrestrial network system and dynamic acoustic code conversion method
CN104994247A (en) * 2015-05-19 2015-10-21 苏州方位通讯科技有限公司 Communication access method of SIP terminals serving as VoIP hot spots

Also Published As

Publication number Publication date
CN107547824A (en) 2018-01-05

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17818950

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17818950

Country of ref document: EP

Kind code of ref document: A1