CN108154880A

CN108154880A - Robot capable of distinguishing environmental noise in real time for speech recognition

Info

Publication number: CN108154880A
Application number: CN201611103797.3A
Authority: CN
Inventors: 胡扬; 邬惠林
Original assignee: Guangdong Big Warehouse Robot Technology Co Ltd
Current assignee: Guangdong Big Warehouse Robot Technology Co Ltd
Priority date: 2016-12-05
Filing date: 2016-12-05
Publication date: 2018-06-12

Abstract

The invention discloses a robot that distinguishes environmental noise in real time in order to perform speech recognition. The robot samples audio in real time and analyzes the peak values that represent volume in the audio data; a peak that persists regularly over a long period is taken as the environmental noise threshold. When real-time sampling shows that a new peak persists regularly over a long period, the robot updates the environmental noise threshold to that new peak. When the robot detects an audio segment whose peak is higher than the environmental noise threshold, it sends the sampled segment to the speech recognition module for formal speech recognition; the segment is considered to have ended once the peak of the latest audio data falls below the noise threshold. The invention adapts to changes in the environment, distinguishes environmental noise in real time, resolves the effect of invalid audio data on the speech recognition module, and improves the robot's efficiency.

Description

Robot capable of distinguishing environmental noise in real time for speech recognition

Technical field

The present invention relates to a robot, and more particularly to a robot that distinguishes environmental noise in real time in order to perform speech recognition.

Background art

The speech recognition module and command word recognition module of existing robots have difficulty recognizing speech correctly in noisy environments. Although these modules already include processing such as noise reduction and speech feature extraction, once a module enters the recognition state, an excess of meaningless noise sharply degrades its real-time performance and can even cause recognition errors. Applying noise reduction, speech feature extraction and similar processing to the audio first and then feeding it into the recognition system is overly redundant, places high demands on computing performance, and also leaves the audio distorted after noise reduction.

Summary of the invention

To overcome the above disadvantages, the present invention provides a robot that distinguishes environmental noise in real time in order to perform speech recognition.

The technical solution adopted by the present invention to achieve this purpose is:

A robot that distinguishes environmental noise in real time in order to perform speech recognition, characterized in that:

The robot samples audio in real time and analyzes the peak values that represent volume in the audio data; a peak that persists regularly over a long period is taken as the environmental noise threshold. When real-time sampling shows that a new peak persists regularly over a long period, the robot updates the environmental noise threshold to that new peak. When the robot detects an audio segment whose peak is higher than the environmental noise threshold, it sends the sampled segment to the speech recognition module for formal speech recognition; the segment is considered to have ended once the peak of the latest audio data falls below the noise threshold.
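One possible reading of the threshold rule above can be sketched in Python. The patent does not define what makes a peak "long-duration and regular"; this sketch assumes a peak level qualifies when the per-frame peaks stay within a small tolerance of each other for a fixed number of consecutive frames. The names `frame_peak`, `update_noise_threshold`, `hold_frames` and `tolerance` are hypothetical and do not come from the patent.

```python
from typing import Optional, Sequence


def frame_peak(samples: Sequence[int]) -> int:
    """Peak volume of one frame: the largest absolute sample value."""
    return max((abs(s) for s in samples), default=0)


def update_noise_threshold(
    recent_peaks: Sequence[int],
    current: Optional[int],
    hold_frames: int = 20,  # assumed: number of frames that counts as "long-duration"
    tolerance: int = 2,     # assumed: maximum spread that counts as "regular"
) -> Optional[int]:
    """Return a new threshold if the last `hold_frames` peaks are steady, else `current`."""
    if len(recent_peaks) < hold_frames:
        return current
    window = recent_peaks[-hold_frames:]
    if max(window) - min(window) <= tolerance:
        # The peak level has been stable long enough: treat it as ambient noise.
        return max(window)
    return current
```

With these assumed defaults, a steady run of peaks replaces the previous threshold, while an irregular run leaves it unchanged.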

The advantages of the invention are that it adapts to changes in the environment, distinguishes environmental noise in real time, resolves the effect of invalid audio data on the speech recognition module, and improves the robot's efficiency.

Description of the drawings

The present invention is further described below with reference to the accompanying drawing and an embodiment. Fig. 1 is a block diagram of the present invention.

In Fig. 1, 1 is sampling the audio data, 2 is determining the environmental noise threshold, 3 is re-determining the environmental noise threshold, 4 is comparing with the threshold, 5 is sending the sampled segment to the speech recognition module for formal speech recognition, and 6 is the environmental noise threshold.

Detailed description of the embodiment

As shown in Fig. 1, the robot samples audio data 1 in real time and analyzes the peak values that represent volume in the audio data. For example, if a peak of 35 persists regularly for 2 seconds, 35 is determined to be the environmental noise threshold 2. When real-time sampling shows that a new peak of 45 persists regularly for 2 seconds, the robot updates the environmental noise threshold to the new peak of 45. When the robot detects an audio segment with a peak of 55, which is higher than the environmental noise threshold of 45, it sends this sampled segment to the speech recognition module for formal speech recognition 5; the segment is considered to have ended once the peak of the latest audio data, 42, falls below the noise threshold of 45.
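The segment gating in this embodiment can be illustrated with a minimal Python sketch that uses the example's own numbers (threshold 45, speech peak 55, closing peak 42). The function name `extract_segments` and the representation of audio as a list of per-frame peak values are assumptions made for illustration; the patent does not specify a frame size or a data format.

```python
from typing import Iterable, List, Tuple


def extract_segments(peaks: Iterable[int], threshold: int) -> List[Tuple[int, int]]:
    """Return (start, end) frame index pairs, end exclusive, for segments to recognize.

    A segment opens when a peak rises above the threshold and closes
    when a later peak falls below the threshold.
    """
    segments: List[Tuple[int, int]] = []
    start = None
    index = -1
    for index, peak in enumerate(peaks):
        if start is None and peak > threshold:
            start = index                    # peak exceeds the noise threshold: segment begins
        elif start is not None and peak < threshold:
            segments.append((start, index))  # peak dropped below the threshold: segment ends
            start = None
    if start is not None:
        # The stream ended while a segment was still open.
        segments.append((start, index + 1))
    return segments


# Per-frame peaks: ambient noise at 45, speech at 55 and 50, then a drop to 42.
peaks = [45, 45, 55, 55, 50, 42, 45]
print(extract_segments(peaks, threshold=45))  # [(2, 5)]
```

Only frames 2 to 4 would be passed to the speech recognition module here; frames at or below the ambient level never reach it, which is the saving the description claims.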

The above is only a preferred embodiment of the present invention and does not limit the invention in any form. Any simple modification, equivalent change or adaptation made to the above embodiment in accordance with the technical essence of the present invention, without departing from the content of the technical solution of the present invention, still falls within the scope of the technical solution of the present invention.

Claims

1. A robot that distinguishes environmental noise in real time in order to perform speech recognition, characterized in that:

The robot samples audio in real time and analyzes the peak values that represent volume in the audio data, and a peak that persists regularly over a long period is taken as the environmental noise threshold; when the robot detects an audio segment whose peak is higher than the environmental noise threshold, it sends the sampled segment to the speech recognition module for formal speech recognition, and the segment is considered to have ended once the peak of the latest audio data falls below the noise threshold.

2. The robot according to claim 1 that distinguishes environmental noise in real time in order to perform speech recognition, characterized in that: when real-time sampling shows that a new peak persists regularly over a long period in the audio data, the robot updates the environmental noise threshold to the new peak.