[go: up one dir, main page]

CN108154880A - The robot that environmental noise carries out speech recognition can be differentiated in real time - Google Patents

The robot that environmental noise carries out speech recognition can be differentiated in real time Download PDF

Info

Publication number
CN108154880A
CN108154880A CN201611103797.3A CN201611103797A CN108154880A CN 108154880 A CN108154880 A CN 108154880A CN 201611103797 A CN201611103797 A CN 201611103797A CN 108154880 A CN108154880 A CN 108154880A
Authority
CN
China
Prior art keywords
peak value
environmental noise
robot
audio data
threshold values
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611103797.3A
Other languages
Chinese (zh)
Inventor
胡扬
邬惠林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Big Warehouse Robot Technology Co Ltd
Original Assignee
Guangdong Big Warehouse Robot Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Big Warehouse Robot Technology Co Ltd filed Critical Guangdong Big Warehouse Robot Technology Co Ltd
Priority to CN201611103797.3A priority Critical patent/CN108154880A/en
Publication of CN108154880A publication Critical patent/CN108154880A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Manipulator (AREA)

Abstract

The invention discloses a kind of robot that can be differentiated environmental noise in real time and carry out speech recognition, it is characterized in that:The robot analyzes the peak value that volume is represented in audio data by real-time sampling, and peak value for a long time, regular is included by environmental noise threshold values;The robot by real-time sampling analyze audio data in occur new long-time, regularity existing for peak value when, new peak value is updated to new environmental noise threshold values.It is found when the robot samples, when there is the audio data fragment than environmental noise threshold values more peak value, it segment will just be sampled is sent into sound identification module and carry out formal speech recognition, until the peak value of newest audio data is lower than noise threshold values, is considered as segment and terminates.The invention has the advantages that can adapt to environmental change, environmental noise is differentiated in real time, is solved influence of the invalid audio data to sound identification module, is improved machine human efficiency.

Description

The robot that environmental noise carries out speech recognition can be differentiated in real time
Technical field
The present invention relates to a kind of robot more particularly to a kind of machines that can be differentiated environmental noise in real time and carry out speech recognition People.
Background technology
The sound identification module and order word identification module of existing robot are difficult to correctly identify under noisy environment.Though Right above-mentioned module has carried the processing such as noise reduction, extraction phonetic feature in itself, but once enter identification state, and module is in itself Real-time will decline to a great extent or even identification be caused to malfunction because meaningless noise is excessive.And audio is carried out in itself The method of identifying system excessively redundancy again is re-fed into after noise reduction, the extraction processing such as phonetic feature, computer performance is required it is very high, And also there is audio distortion after noise reduction in this method.
Invention content
To overcome the above disadvantages, the present invention provides a kind of machines that can be differentiated environmental noise in real time and carry out speech recognition Device people.
Present invention technical solution used for the above purpose is:
A kind of robot that can be differentiated environmental noise in real time and carry out speech recognition, it is characterized in that:
The robot is analyzed by real-time sampling and the peak value of volume is represented in audio data, by for a long time, regularity Peak value is included by environmental noise threshold values;The robot, which is analyzed by real-time sampling in audio data, there are new long-time, rule Existing for rule property during peak value, new peak value is updated to new environmental noise threshold values.Find occur when the robot samples Than environmental noise threshold values more peak value audio data fragment when, will just sample segment be sent into sound identification module carry out it is formal Speech recognition until the peak value of newest audio data is lower than noise threshold values, is considered as segment and terminates.
The invention has the advantages that can adapt to environmental change, environmental noise is differentiated in real time, solves invalid audio Influence of the data to sound identification module, improves machine human efficiency.
Description of the drawings
The present invention is further described with implementation below in conjunction with the accompanying drawings.Fig. 1 is the block diagram of the present invention.
In Fig. 1,1 is sampled audio data, and 2 be to judge to determine environmental noise threshold values, and 3 be determining environmental noise valve again Value,4 be compared with threshold values, and 5 be to sample segment to be sent into the formal speech recognition of sound identification module progress, and 6 be environmental noise valve Value.
Specific embodiment
In Fig. 1, the robot analyzes the peak value that volume is represented in audio data by real-time sampling audio data 1, For example peak value existing for 2 second time, regularity is 35, is just determined as environmental noise threshold values 2 35;The robot passes through real-time Occur in sampling analysis audio data the 2 second new time, regularity existing for peak value 45 when, new peak value 45 is updated to new Environmental noise threshold values.It is found when the robot samples, during than the more audio data fragment of peak value 55 of environmental noise threshold values 45, This sampling segment is just sent into sound identification module and carries out formal speech recognition 5, until the peak value 42 of newest audio data compares Noise threshold values 45 is low, is considered as segment and terminates.
The above described is only a preferred embodiment of the present invention, not making limitation in any form to the present invention, appoint What without departing from the present invention program content, any simple modification that technical spirit according to the present invention makees above example, etc. With variation and modification, in the range of still falling within technical solution of the present invention.

Claims (2)

1. the robot that environmental noise carries out speech recognition can be differentiated in real time, it is characterized in that:
The robot analyzes the peak value that volume is represented in audio data by real-time sampling, and long-time, regularity are deposited Peak value be included by environmental noise threshold values;Find occur than environmental noise threshold values more peak value when the robot samples Audio data fragment when, will just sample segment and be sent into sound identification module and carry out formal speech recognition, until newest audio The peak value of data is lower than noise threshold values, is considered as segment and terminates.
2. the robot according to claim 1 that environmental noise can be differentiated in real time and carry out speech recognition, it is characterized in that:It is described Robot by real-time sampling analyze audio data in occur new long-time, regularity existing for peak value when, by new peak value It is updated to new environmental noise threshold values.
CN201611103797.3A 2016-12-05 2016-12-05 The robot that environmental noise carries out speech recognition can be differentiated in real time Pending CN108154880A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611103797.3A CN108154880A (en) 2016-12-05 2016-12-05 The robot that environmental noise carries out speech recognition can be differentiated in real time

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611103797.3A CN108154880A (en) 2016-12-05 2016-12-05 The robot that environmental noise carries out speech recognition can be differentiated in real time

Publications (1)

Publication Number Publication Date
CN108154880A true CN108154880A (en) 2018-06-12

Family

ID=62469920

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611103797.3A Pending CN108154880A (en) 2016-12-05 2016-12-05 The robot that environmental noise carries out speech recognition can be differentiated in real time

Country Status (1)

Country Link
CN (1) CN108154880A (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5983186A (en) * 1995-08-21 1999-11-09 Seiko Epson Corporation Voice-activated interactive speech recognition device and method
CN101821971A (en) * 2007-08-22 2010-09-01 杜比实验室特许公司 Systems and methods for noisy activity detection
CN101859583A (en) * 2009-04-13 2010-10-13 索尼公司 Noise reducing apparatus and noise reducing method
CN102541509A (en) * 2012-01-13 2012-07-04 河南科技大学 True random number generation method based on chaotic encryption
CN103117067A (en) * 2013-01-19 2013-05-22 渤海大学 Voice endpoint detection method under low signal-to-noise ratio
CN104519447A (en) * 2013-10-08 2015-04-15 三星电子株式会社 Apparatus and method of reducing noise and audio playing apparatus with non-magnet speaker
CN104751845A (en) * 2015-03-31 2015-07-01 江苏久祥汽车电器集团有限公司 Voice recognition method and system used for intelligent robot
CN104942810A (en) * 2015-04-29 2015-09-30 成都陌云科技有限公司 Intelligent robot capable of denoising
CN105611461A (en) * 2016-01-04 2016-05-25 浙江宇视科技有限公司 Noise suppression method, apparatus and system for voice application system of front-end device
CN105850154A (en) * 2013-12-20 2016-08-10 微软技术许可有限责任公司 Adapting audio based upon detected environmental acoustics
CN105845135A (en) * 2015-01-12 2016-08-10 芋头科技(杭州)有限公司 Sound recognition system and method for robot system

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5983186A (en) * 1995-08-21 1999-11-09 Seiko Epson Corporation Voice-activated interactive speech recognition device and method
CN101821971A (en) * 2007-08-22 2010-09-01 杜比实验室特许公司 Systems and methods for noisy activity detection
CN101859583A (en) * 2009-04-13 2010-10-13 索尼公司 Noise reducing apparatus and noise reducing method
CN102541509A (en) * 2012-01-13 2012-07-04 河南科技大学 True random number generation method based on chaotic encryption
CN103117067A (en) * 2013-01-19 2013-05-22 渤海大学 Voice endpoint detection method under low signal-to-noise ratio
CN104519447A (en) * 2013-10-08 2015-04-15 三星电子株式会社 Apparatus and method of reducing noise and audio playing apparatus with non-magnet speaker
CN105850154A (en) * 2013-12-20 2016-08-10 微软技术许可有限责任公司 Adapting audio based upon detected environmental acoustics
CN105845135A (en) * 2015-01-12 2016-08-10 芋头科技(杭州)有限公司 Sound recognition system and method for robot system
CN104751845A (en) * 2015-03-31 2015-07-01 江苏久祥汽车电器集团有限公司 Voice recognition method and system used for intelligent robot
CN104942810A (en) * 2015-04-29 2015-09-30 成都陌云科技有限公司 Intelligent robot capable of denoising
CN105611461A (en) * 2016-01-04 2016-05-25 浙江宇视科技有限公司 Noise suppression method, apparatus and system for voice application system of front-end device

Similar Documents

Publication Publication Date Title
WO2021128741A1 (en) Voice emotion fluctuation analysis method and apparatus, and computer device and storage medium
CN108962227B (en) Voice starting point and end point detection method and device, computer equipment and storage medium
WO2018145584A1 (en) Voice activity detection method and voice recognition method
JP6229046B2 (en) Speech data recognition method, device and server for distinguishing local rounds
CN108022587B (en) Speech recognition method, speech recognition device, computer equipment and storage medium
CN105913849B (en) A kind of speaker's dividing method based on event detection
JP2014510942A5 (en)
CN110060693A (en) Model training method and device, electronic equipment and storage medium
GB201005913D0 (en) Method for real-time data compression and transmission
WO2006007290B1 (en) Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system
CN103400576B (en) Based on speech model update method and the device of User action log
CN106098079B (en) Method and device for extracting audio signal
CN106448659B (en) A kind of sound end detecting method based on short-time energy and fractal dimension
WO2018095167A1 (en) Voiceprint identification method and voiceprint identification system
WO2021127990A1 (en) Voiceprint recognition method based on voice noise reduction and related apparatus
CN107132311A (en) A kind of fast gas recognizer extracted based on frequency domain character
CN110689885A (en) Machine-synthesized speech recognition method, device, storage medium and electronic equipment
Sharma et al. Automatic identification of silence, unvoiced and voiced chunks in speech
CN107767860A (en) A kind of voice information processing method and device
CN108154880A (en) The robot that environmental noise carries out speech recognition can be differentiated in real time
CN107121530B (en) The electronic nose response spectra feature extracting method that a kind of part and global characteristics combine
CN117935862A (en) A system for extracting key information from speech
TW200811833A (en) Detection method for voice activity endpoint
CN103438983B (en) Data processing method of signal random average spectrum
CN107993666B (en) Speech recognition method, speech recognition device, computer equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180612