CN103390410A - Telephone conference system and method - Google Patents
Telephone conference system and method Download PDFInfo
- Publication number
- CN103390410A CN103390410A CN2012101442289A CN201210144228A CN103390410A CN 103390410 A CN103390410 A CN 103390410A CN 2012101442289 A CN2012101442289 A CN 2012101442289A CN 201210144228 A CN201210144228 A CN 201210144228A CN 103390410 A CN103390410 A CN 103390410A
- Authority
- CN
- China
- Prior art keywords
- sound
- sources
- far
- remote phone
- conference system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Telephonic Communication Services (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
Description
技术领域 technical field
本发明涉及电话会议技术。The invention relates to teleconferencing technology.
背景技术 Background technique
远程电话会议系统是一种商务办公常见的通信手段,其能够使双方、三方、甚至多方人员不受地域限制的进行沟通。The teleconferencing system is a common means of communication in business offices, which enables two parties, three parties, or even multiple parties to communicate without geographical restrictions.
在远程电话会议中,就通话的双方而言,远端或近端的与会人员皆可能不只一人。某些会议系统会分别为各个与会人员配置专用的麦克风,如此虽可确保每个与会人员的发言可被确实接收,但其身份验证程序及会议管理机制较为复杂;除此之外,当与会人员增加时,对麦克风数量的需求即随之增加,而相邻麦克风间声音干扰的情形亦会变得更加严重。为了方便电话会议系统的架设,多数的电话会议不会为每个与会人员配置专用的麦克风,而是让各方所有的与会人员共享相同的麦克风。然而,受限于座位的安排,当与会人员距离麦克风的远近有所不同时,麦克风的收音效果也会随之有所不同,如此即减损了双方通话的质量。In a teleconference, as far as both parties are concerned, there may be more than one participant at the far or near end. Some conferencing systems will configure dedicated microphones for each participant. Although this can ensure that the speech of each participant can be received, the authentication procedure and conference management mechanism are more complicated; in addition, when the participants When the number of microphones increases, the demand for the number of microphones will increase, and the sound interference between adjacent microphones will become more serious. In order to facilitate the erection of the conference call system, most conference calls do not configure a dedicated microphone for each participant, but let all participants share the same microphone. However, limited by the arrangement of the seats, when the distance between the participants and the microphone is different, the sound collection effect of the microphone will also be different, which will impair the quality of the conversation between the two parties.
因此需要一种更方便好用的远程电话会议系统及方法。Therefore, a more convenient and easy-to-use teleconferencing system and method are needed.
发明内容 Contents of the invention
为了克服现有技术的缺陷,本发明提供一种远程电话会议系统,其包括:一远端麦克风数组,设置于远端,用以接收远端声音;一声音辨识模块,耦接至该远端麦克风数组,用以从远端声音中辨识出多个音源;一近端显示界面,设置于近端,耦接至该声音辨识模块,用以显示该声音辨识模块所辨识出的所述多个音源;一声音调整模块,耦接至该声音辨识模块,用以分别针对各该音源的一声音特征进行调整。In order to overcome the defects of the prior art, the present invention provides a teleconferencing system, which includes: a remote microphone array, arranged at the far end, to receive the sound from the far end; a sound recognition module, coupled to the far end A microphone array, used to identify multiple sound sources from far-end sounds; a near-end display interface, set at the near end, coupled to the sound recognition module, for displaying the multiple sound sources identified by the sound recognition module a sound source; a sound adjustment module, coupled to the sound recognition module, for adjusting a sound feature of each of the sound sources.
本发明另提供一种远程电话会议方法,其包括:以一远端麦克风数组接收远端声音;从远端声音中辨识出多个音源;以一近端显示界面显示所辨识出的所述多个音源;分别针对各该音源的至少一声音特征进行调整。The present invention also provides a method for teleconferencing, which includes: receiving far-end sound with a far-end microphone array; identifying multiple sound sources from the far-end sound; and displaying the identified multiple sound sources with a near-end display interface. sound sources; adjusting at least one sound characteristic of each of the sound sources.
本发明可将远端与会人员的空间位置予以视觉化,相对于现有技术而言,更有助于近端与会人员了解远端与会人员的座位关系,并借此提供调整声音参数的基础,达到提升远程电话会议质量的目的。The present invention can visualize the spatial position of the far-end participants. Compared with the prior art, it is more helpful for the near-end participants to understand the seat relationship of the far-end participants, thereby providing a basis for adjusting sound parameters. To achieve the purpose of improving the quality of remote teleconferencing.
附图说明 Description of drawings
图1是依据本发明一实施例的远程电话会议系统架构示意图。FIG. 1 is a schematic diagram of a teleconferencing system architecture according to an embodiment of the present invention.
图2为依据本发明一实施例的远程电话会议方法流程图。FIG. 2 is a flow chart of a teleconferencing method according to an embodiment of the present invention.
其中,附图标记说明如下:Wherein, the reference signs are explained as follows:
100~远程电话会议系统;100~ remote teleconferencing system;
102~远端麦克风数组;102~far-end microphone array;
104~声音辨识模块;104~sound recognition module;
106~近端显示界面;106~proximal display interface;
108~近端控制界面;108~ near-end control interface;
110~声音调整模块;110~sound adjustment module;
112~声音播放模块;112~sound playback module;
S202~S210~步骤。S202~S210~steps.
具体实施方式 Detailed ways
下文为介绍本发明的最佳实施例。各实施例用以说明本发明的原理,但非用以限制本发明。本发明的范围当以所附的权利要求书为准。The following describes the preferred embodiment of the present invention. Each embodiment is used to illustrate the principles of the present invention, but not to limit the present invention. The scope of the present invention should be determined by the appended claims.
为了使远程电话会议系统更易于使用,本发明提供一种新式远程电话会议系统。下文将配合附图说明本发明的远程电话会议系统的各种实施例。In order to make the teleconferencing system easier to use, the present invention provides a novel teleconferencing system. Various embodiments of the teleconferencing system of the present invention will be described below with reference to the accompanying drawings.
远程电话会议系统teleconferencing system
图1是依据本发明一实施例的远程电话会议系统架构示意图。本发明的远程电话会议系统100至少包括:一远端麦克风数组102、一声音辨识模块104、一近端显示界面106、一近端控制界面108、一声音调整模块110以及一声音播放模块112。为方便说明,下文的实施例皆以单向通信为例(即远端使用者说话、近端使用者收听),然而,本发明当然不必以此为限,本领域普通技术人员可轻易将本发明应用在双向通信上。同理,本发明不限定于双方通话的类型,多方通话的类型也在本发明所涵盖范围之内。FIG. 1 is a schematic diagram of a teleconferencing system architecture according to an embodiment of the present invention. The
本发明的远端麦克风数组102设置于远端,可用以接收远端声音。一般而言,麦克风数组102通常包括两个或两个以上的麦克风。本发明的麦克风不限于动圈式、电容式或其它各种类型的麦克风。本领域普通技术人员可依麦克风数量、各个麦克风的指向性以及会议空间将麦克风数组102设置于适当位置。举例而言,在圆桌会议中可采用具有全指向性声场灵敏度的麦克风数组,并将其设置于圆桌中心位置。The far-
本发明的声音辨识模块104不限定设置于远端或近端,只要能通过有线或无线通信方式连接至前述远端麦克风数组102即可。值得注意的是,本发明的重要特征即在于本发明的声音辨识模块104可依据各种既有的声学演算技术,从麦克风数组102所取得的混杂的远端声音中辨识及分离出多个各自不同的音源。举例而言,这些音源即包括各个与会人员的语音,以及各种非语音的杂音。大体来说,声学演算技术主要可分为声音方向辨识技术以及音质辨识技术。声音方向辨识技术可利用麦克风数组102中各麦克风的位置及灵敏度,计算出各个音源的方向及距离(即音源在空间中的位置);而音质辨识技术则可对各音源的音压、频谱及波形进行分析,借以取得各个音源诸如音量、清晰度、音频及音质(或称音色)等声音特征,甚至从中判断各个音源是否为语音、是否为杂音、对说话者的概略性别及年纪加以估测。更详细地说,由于语音并非持续不断的声音,且其音量及音频皆可能发生变化,因此,在更佳的实施例中,本发明的声音辨识模块104可持续交叉比对一音源在空间中的位置以及其音质,达到追踪锁定该音源的目的。除此之外,在某些实施例中,声音辨识模块104亦可进行一般性的噪声过滤及回声消除的动作。然而,由于前述声音处理技术细节非本发明欲强调的重点,且其可由各种既有技术达成,因此,本文不再加以赘述以节省篇幅。The
本发明的近端显示界面106(即屏幕)设置于近端,其耦接至该声音辨识模块104,可用以向近端使用者显示该声音辨识模块104所辨识出的各个音源,甚至,在某些实施例中,显示所述多个音源的各项声音特征。举例而言,在一最简单的实施例中,近端显示界面106仅以文字显示声音辨识模块104所辨识出的远端音源,并分别赋与各个既存的音源如“与会者1”、“与会者2”等名称。每当声音辨识模块104检测到远端有新成员加入时,近端显示界面106即可将其以醒目文字予以标注。在一较佳的实施例中,近端显示界面106可以二维或三维画面模拟远端会议空间,并依照声音辨识模块104所检测到各个音源的所在空间位置的坐标,将其标注在虚拟画面的对应位置之上。其中,各个音源除了有“与会者1”、“与会者2”等名称之外,尚可附注各种声音特征,例如:音量、清晰度、音频、音质、是否为语音、说话者的性别年纪等相关估测信息,本领域普通技术人员可依据本发明的精神自行设计近端显示界面106所显示的信息项目及其显示风格。值得注意的是,本发明的电话会议技术亦可进一步应用在视讯会议中,而近端显示界面106亦可同步显示远端传来的实际画面以代替前述虚拟画面。通过本发明的近端显示界面106,近端使用者可轻易掌握远端的与会情况。The near-end display interface 106 (i.e. screen) of the present invention is arranged at the near-end, and it is coupled to the
本发明的近端控制界面108耦接至本发明的声音调整模块110,可用以接收使用者对声音调整模块110的控制,而本发明的声音调整模块110可依据使用者的控制而针对声音辨识模块104所辨识出的各个音源分别调整其声音特征,而声音特征即包括:音量、清晰度、音频和/或音质。举例而言,近端使用者可通过控制声音调整模块110而增加某些远端重要与会人员的音量,或提升其清晰度;同样的,可降低、甚至滤除某些杂音或非与会人员所发出的语音,借此强化会议的通话质量。在某些特殊的实施例中,声音调整模块110甚至可对各个音源进行各种音效处理,包括改变其音频或音质,达到隐匿说话者身份的目的。本发明的声音调整模块110不限于设置在近端或远端,只要能通过有线或无线方式连接至该声音辨识模块104即可。在较佳的实施例中,声音调整模块110与声音辨识模块104可整合于一处理器之中,达到强化声音处理效能的目的。The near-
最后,本发明的声音播放模块112耦接至近端喇叭,可用以播放前述调整声音特征后的各个音源。本发明的声音播放模块112同样不限于设置在近端或远端,只要能通过有线或无线方式连接至该声音调整模块110即可。在较佳的实施例中,声音播放模块112亦可与声音调整模块110及声音辨识模块104整合于一处理器之中。本领域普通技术人员可了解到,声音辨识模块104、声音调整模块110及声音播放模块112的区别仅为方便说明,任何处理器具有前述模块的功能者皆属于本发明所涵盖的范围之内。Finally, the
远程电话会议方法Teleconferencing method
除了前述的远程电话会议系统之外,本发明另提供一种远程电话会议方法。图2为依据本发明一实施例的远程电话会议方法流程图。该方法200包括:在步骤S202中,以一远端麦克风数组接收远端声音;在步骤S204中,从远端声音中辨识出多个音源;在步骤S206中,以一近端显示界面显示所辨识出的所述多个音源及其声音特征;在步骤S208中,分别针对各该音源的至少一声音特征进行调整;以及在步骤S210中,播放调整声音特征后的所述多个音源。其中,步骤S204可通过声音方向辨识技术和/或音质辨识技术而从远端声音中辨识出所述多个音源,而这些声音特征即各个音源的方向、距离、音量、清晰度、音频和/或音质。由于本领域普通技术人员可参照前述关于远程电话会议系统的各个实施例中了解本发明的远程电话会议方法,故此处将不再赘述其相关细节以节省篇幅。In addition to the aforementioned teleconferencing system, the present invention further provides a teleconferencing method. FIG. 2 is a flow chart of a teleconferencing method according to an embodiment of the present invention. The method 200 includes: in step S202, receiving far-end sound with a far-end microphone array; in step S204, identifying multiple sound sources from the far-end sound; in step S206, displaying the The identified plurality of sound sources and their sound characteristics; in step S208 , adjusting at least one sound characteristic of each of the sound sources; and in step S210 , playing the plurality of sound sources after adjusting the sound characteristics. Wherein, step S204 can identify the multiple sound sources from the remote sound through the sound direction recognition technology and/or sound quality recognition technology, and these sound features are the direction, distance, volume, clarity, audio and/or sound quality of each sound source. or sound quality. Since those skilled in the art can refer to the aforementioned various embodiments of the teleconferencing system to understand the teleconferencing method of the present invention, relevant details will not be repeated here to save space.
本发明虽以较佳实施例揭示如上,然其并非用以限定本发明的范围,任何本领域普通技术人员,在不脱离本发明的精神和范围内,当可做些许的更动与润饰,因此本发明的保护范围当视所附的权力要求所界定的范围为准。Although the present invention is disclosed above with preferred embodiments, it is not intended to limit the scope of the present invention. Anyone skilled in the art may make some changes and modifications without departing from the spirit and scope of the present invention. Therefore, the protection scope of the present invention should be determined by the scope defined by the appended claims.
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012101442289A CN103390410A (en) | 2012-05-10 | 2012-05-10 | Telephone conference system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012101442289A CN103390410A (en) | 2012-05-10 | 2012-05-10 | Telephone conference system and method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103390410A true CN103390410A (en) | 2013-11-13 |
Family
ID=49534656
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2012101442289A Pending CN103390410A (en) | 2012-05-10 | 2012-05-10 | Telephone conference system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103390410A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105991854A (en) * | 2014-09-29 | 2016-10-05 | 上海兆言网络科技有限公司 | System and method for visualizing VOIP teleconference on intelligent terminal |
WO2016176951A1 (en) * | 2015-05-06 | 2016-11-10 | 小米科技有限责任公司 | Method and device for optimizing sound signal |
CN106210365A (en) * | 2015-05-28 | 2016-12-07 | 仁宝电脑工业股份有限公司 | Method and system for adjusting volume of teleconference |
CN108922538A (en) * | 2018-05-29 | 2018-11-30 | 平安科技(深圳)有限公司 | Conferencing information recording method, device, computer equipment and storage medium |
CN112148182A (en) * | 2019-06-28 | 2020-12-29 | 华为技术服务有限公司 | Interaction control method, terminal and storage medium |
CN115361474A (en) * | 2022-08-18 | 2022-11-18 | 上海复旦通讯股份有限公司 | A method for auxiliary identification of sound source in teleconference |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030185411A1 (en) * | 2002-04-02 | 2003-10-02 | University Of Washington | Single channel sound separation |
US20090015651A1 (en) * | 2007-07-11 | 2009-01-15 | Hitachi, Ltd. | Voice Communication Device, Voice Communication Method, and Voice Communication Program |
US20090220065A1 (en) * | 2008-03-03 | 2009-09-03 | Sudhir Raman Ahuja | Method and apparatus for active speaker selection using microphone arrays and speaker recognition |
CN101690149A (en) * | 2007-05-22 | 2010-03-31 | 艾利森电话股份有限公司 | Methods and arrangements for group sound telecommunication |
-
2012
- 2012-05-10 CN CN2012101442289A patent/CN103390410A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030185411A1 (en) * | 2002-04-02 | 2003-10-02 | University Of Washington | Single channel sound separation |
CN101690149A (en) * | 2007-05-22 | 2010-03-31 | 艾利森电话股份有限公司 | Methods and arrangements for group sound telecommunication |
US20090015651A1 (en) * | 2007-07-11 | 2009-01-15 | Hitachi, Ltd. | Voice Communication Device, Voice Communication Method, and Voice Communication Program |
US20090220065A1 (en) * | 2008-03-03 | 2009-09-03 | Sudhir Raman Ahuja | Method and apparatus for active speaker selection using microphone arrays and speaker recognition |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105991854A (en) * | 2014-09-29 | 2016-10-05 | 上海兆言网络科技有限公司 | System and method for visualizing VOIP teleconference on intelligent terminal |
CN105991854B (en) * | 2014-09-29 | 2020-03-13 | 上海兆言网络科技有限公司 | System and method for visualizing VoIP (Voice over Internet protocol) teleconference on intelligent terminal |
WO2016176951A1 (en) * | 2015-05-06 | 2016-11-10 | 小米科技有限责任公司 | Method and device for optimizing sound signal |
CN106205628A (en) * | 2015-05-06 | 2016-12-07 | 小米科技有限责任公司 | Acoustical signal optimization method and device |
CN106205628B (en) * | 2015-05-06 | 2018-11-02 | 小米科技有限责任公司 | Voice signal optimization method and device |
US10499156B2 (en) | 2015-05-06 | 2019-12-03 | Xiaomi Inc. | Method and device of optimizing sound signal |
CN106210365B (en) * | 2015-05-28 | 2019-06-21 | 仁宝电脑工业股份有限公司 | Method and system for adjusting volume of teleconference |
CN106210365A (en) * | 2015-05-28 | 2016-12-07 | 仁宝电脑工业股份有限公司 | Method and system for adjusting volume of teleconference |
CN108922538A (en) * | 2018-05-29 | 2018-11-30 | 平安科技(深圳)有限公司 | Conferencing information recording method, device, computer equipment and storage medium |
CN108922538B (en) * | 2018-05-29 | 2023-04-07 | 平安科技(深圳)有限公司 | Conference information recording method, conference information recording device, computer equipment and storage medium |
CN112148182A (en) * | 2019-06-28 | 2020-12-29 | 华为技术服务有限公司 | Interaction control method, terminal and storage medium |
CN112148182B (en) * | 2019-06-28 | 2022-10-04 | 华为技术服务有限公司 | Interaction control method, terminal and storage medium |
CN115361474A (en) * | 2022-08-18 | 2022-11-18 | 上海复旦通讯股份有限公司 | A method for auxiliary identification of sound source in teleconference |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11991315B2 (en) | Audio conferencing using a distributed array of smartphones | |
US8503653B2 (en) | Method and apparatus for active speaker selection using microphone arrays and speaker recognition | |
US8606249B1 (en) | Methods and systems for enhancing audio quality during teleconferencing | |
US10491643B2 (en) | Intelligent augmented audio conference calling using headphones | |
US8315366B2 (en) | Speaker identification and representation for a phone | |
JP4255461B2 (en) | Stereo microphone processing for conference calls | |
JP5857674B2 (en) | Image processing apparatus and image processing system | |
US10079941B2 (en) | Audio capture and render device having a visual display and user interface for use for audio conferencing | |
US9973561B2 (en) | Conferencing based on portable multifunction devices | |
CN103390410A (en) | Telephone conference system and method | |
CN103581608A (en) | Spokesman detecting system, spokesman detecting method and audio/video conference system | |
US10978085B2 (en) | Doppler microphone processing for conference calls | |
WO2004010414A1 (en) | Method and apparatus for improving listener differentiation of talkers during a conference call | |
CA3228068A1 (en) | Multi-source audio processing systems and methods | |
CN113203988A (en) | Sound source positioning method and device | |
WO2022253003A1 (en) | Speech enhancement method and related device | |
CN110035372A (en) | Output control method and device of sound amplification system, sound amplification system and computer equipment | |
US10192566B1 (en) | Noise reduction in an audio system | |
US20100266112A1 (en) | Method and device relating to conferencing | |
EP4184507A1 (en) | Headset apparatus, teleconference system, user device and teleconferencing method | |
CN112543302B (en) | Intelligent noise reduction method and equipment in multi-person teleconference | |
US10580410B2 (en) | Transcription of communications | |
TW201347507A (en) | Remote conference system and method | |
CN205921750U (en) | Sound image localization trails round table conference system | |
CN117319879A (en) | Method, apparatus, device and storage medium for processing audio data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20131113 |