CN101859565A - System and method for realizing voice recognition on television - Google Patents
System and method for realizing voice recognition on television Download PDFInfo
- Publication number
- CN101859565A CN101859565A CN201010198592A CN201010198592A CN101859565A CN 101859565 A CN101859565 A CN 101859565A CN 201010198592 A CN201010198592 A CN 201010198592A CN 201010198592 A CN201010198592 A CN 201010198592A CN 101859565 A CN101859565 A CN 101859565A
- Authority
- CN
- China
- Prior art keywords
- data
- sound
- thread
- data analysis
- recording
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Machine Translation (AREA)
Abstract
The invention relates to the technical field of consumer electronics, and provides a system for realizing voice recognition on a television. The system comprises a voice input system, a data analysis system and a coordinating system for connecting the voice input system and the data analysis system, wherein the voice input system is used for sampling human voice, converting an analog signal into a digital signal and performing data caching to finish the preparation work of initial data; the data analysis system analyzes spectral characteristics according to human voice characteristics and is used for extracting effective voice, removing noise and further analyzing the voice content; and the coordinating system is mainly used for coordinating the work of the voice input system and the data analysis system. Voice recognition technology is introduced into the television; the operation of a remote controller of the television can be simplified by using the voice recognition technology; song order, movie on demand, channel selection and the like by voice can be easily realized; and the functions of the television are more abundant.
Description
Technical field
The invention belongs to the consumption electronic products technical field, relate in particular to a kind of system and method thereof that on televisor, realizes speech recognition.
Background technology
Speech recognition technology is that voice signal with the people is as input, be converted into digital signal, pass through Computer Processing again, sound and implication thereof that identification is human, make corresponding reaction, reach the purpose that exchanges or communicate by letter, promptly allow machine pass through identification and understanding process, voice signal is changed into corresponding text or order.But voice technology can not get using widely so far, is because itself exist three big gordian techniquies ripe not enough.The first, to the identification of band accent sound, in different areas different pronunciations is arranged with a kind of language, interpersonal sound also all varies, and is difficult to realize compatible identification; The second, environmental noise problem, environmental noise has increased the identification difficulty, is difficult to realize accurate data collection and analysis; Three, spoken problem, spoken language does not also meet normal syntactic structure, and the abnormal characteristics of word order lack of standardization of grammer bring difficulty can for semantic analysis and understanding.This three big problem always is that speech recognition technology moves towards the stumbling-block of commercialization, if can address the above problem, speech recognition technology is incorporated in the televisor, utilize speech recognition technology can simplify the operation of TV remote controller, and can easily be achieved as follows function: ordering song by voice, playing speech on demand film, voice channel selection etc.The user only needs to say the name of song, TV under certain scene, the platform of TV number can replace the telepilot operation, thereby makes television function abundanter.
Summary of the invention
The purpose of the embodiment of the invention is to provide a kind of system and method thereof that realizes speech recognition on televisor.
The embodiment of the invention is achieved in that a kind of system that realizes speech recognition on televisor, comprises sound input system, data analysis system and the coherent system that connects sound input system and data analysis system; Wherein, the responsible sampling to human sound of sound input system, simulating signal are finished the preliminary work of primary data to the conversion of digital signal and the operation of metadata cache; Data analysis system is according to people's characteristic voice analysis spectrum characteristic, be responsible for extraction to effective sound, the removal of noise, and further analyze the content of sound, be translated into the people's of mating most intention, thereby reach the effect that machine can be discerned people's sound with sound; And coherent system mainly is the work of coordinating sound input system and data analysis system.
A kind of method that realizes speech recognition on televisor comprises the steps:
The sound input system is sampled to sound, converts digital signal then to and is sent to data analysis system, is handled by the MCU of data subsystem;
Data analysis system is according to people's characteristic voice analysis spectrum characteristic, be responsible for extraction to effective sound, the removal of noise, and further analyze the content of sound, be translated into the people's of mating most intention, thereby reach the effect that machine can be discerned people's sound with sound;
Coherent system is coordinated the work of sound input system and data analysis system, and synchronous processing is done in the data acquisition of sound input system and analyzing and processing two parts of data analysis system, and it is divided into recording thread and two thread modules of data analysis thread.
Compared to prior art, the present invention is incorporated into speech recognition technology in the televisor, utilizes speech recognition technology can simplify the operation of TV remote controller, and can easily be achieved as follows function: ordering song by voice, playing speech on demand film, voice channel selection etc.The user only needs to say the name of song, TV under certain scene, the platform of TV number can replace the telepilot operation, thereby makes television function abundanter.
Description of drawings
Fig. 1 is a theory structure block diagram of the present invention.
Fig. 2 is the data analysis flow process diagram of data analysis system of the present invention.
Fig. 3 is the workflow diagram of coherent system of the present invention.
Fig. 4 is the flow chart of data processing diagram of coherent system of the present invention.
Embodiment
In order to make purpose of the present invention, technical scheme and advantage clearer,, the present invention is further elaborated below in conjunction with drawings and Examples.Should be appreciated that specific embodiment described herein only in order to explanation the present invention, and be not used in qualification the present invention.
Fig. 1 shows the present invention realizes speech recognition on televisor system, comprises sound input system, data analysis system and the coherent system that connects sound input system and data analysis system.Wherein, the responsible sampling to human sound of sound input system, simulating signal are finished the preliminary work of primary data to the conversion of digital signal and the operation of metadata cache; Data analysis system is according to people's characteristic voice analysis spectrum characteristic, be responsible for extraction to effective sound, the removal of noise, and further analyze the content of sound, be translated into the people's of mating most intention, thereby reach the effect that machine can be discerned people's sound with sound; And coherent system mainly is the work of coordinating sound input system and data analysis system, as when beginning to carry out the input of sound, when send raw data to data analysis system, how to discern, and when discerns success, and how to operate after the success etc.
The sound input system includes the input system of sound mainly by a miaow head, and the circuit that A/D converter is formed is sampled to sound, converts digital signal then to and is sent to data analysis system, is handled by the MCU of data subsystem.The data that MCU obtains AD are carried out the eigenwert extraction, and cepstral mean subtracts, acoustic layer identification, and acoustic layer is known aftertreatment, the conversion of sound speech, speech figure retrieves beta pruning etc., and then the output control information is to corresponding control module.
Data analysis system is to be finished by speech recognition engine, with reference to shown in Figure 2, after voice signal is converted to speech data, at first carries out feature extraction, and according to the effective frequency speech data of feature extraction of speech data, these frequencies relatively meet people's sounding characteristics; Use the method for " cepstral mean subtracts " to carry out the noise abatement processing then, this method speed is fast, and real-time high-efficiency satisfies service condition in the televisor; Then do the identification and the identification aftertreatment of acoustic layer, according to existing acoustic model, comprise a series of speech parameter such as word speed, intonation, tone colors etc. are analyzed the content of recognizing voice, can obtain the content that the people speaks after analysis is finished, further do the conversion of sound speech again, can obtain the word content of voice.For the coupling of further accurately word content and order, but the words and phrases after continuing comparison utility command keyword and changing find the order of mating most as recognition result, send to televisor then and do next step processing.Adopt " cepstral mean subtracts " that the speech data that extracts is carried out noise abatement in the data analysis system and handle, tentatively discern according to the feature of sound then, then do the conversion of sound speech aspect, obtain exporting the result.
Coherent system is done synchronous processing to the data acquisition of sound input system and analyzing and processing two parts of data analysis system, with reference to shown in Figure 3, is divided into recording thread and two thread modules of data analysis thread.In the recording thread, at first start the recording thread, wherein can initiating hardware equipment, distribute the required memory source of thread and some initial parameter settings, after finishing initialization, just enter the thread circulation, at first judged whether the speech data input, if have then the data of recording are deposited in the buffer area of this thread, and judge then whether recording finishes; If not then judge directly whether recording finishes.If recording does not finish, then continue the execution thread circulation, if finish recording, then close recording module.And in the data analysis thread, at first log-on data is analyzed thread, after foundation and distribute data are analyzed required system resource, enters the thread circulation.Whether elder generation's judgment data buffer area has is upgraded the data of coming, if having, then takes out one section content analysis process from the data analysis district, and whether the judgment data analysis bears results then; If whether no, then directly enter the judgment data analysis bears results.If bear results then stop the thread of recording, stop data analysis, the result is preserved, if do not bear results, then proceed the thread circulation.
The present invention creates multithreading and the recycling technology of buffer area of having adopted, recording and these two steps of data analysis start simultaneously, recording thread independent operating audio frequency acquiring data, with the deposit data of gathering at recording buffer memory block, with the interruption form data supplementing is arrived data field to be analyzed again then, notice that this recording buffer memory block is the circular buffer block, distribute suitable number and size, otherwise just be not capped when data also are not appended to data field to be analyzed, caused losing of data.The back judges whether recording needs to stop, if then stop recording.Data analysis thread independent operating is handled the content of buffer area to be analyzed, when existing content, buffer area just takes out one section contents processing analysis from buffer zone, take out one section content from buffer zone at every turn, the data field to be analyzed corresponding size that just moves up, append with regard to leaving the end of being close to data field to be analyzed in the time of data supplementing, so data have been recorded in reception that just can be rationally correct and processing, stop data analysis after producing recognition result.
In the coherent system of the present invention's creation software is realized optimizing, to recording data buffer area piecemeal, set a block size, when the data of gathering are filled full data block, produces an interruption, in this interruption, the data block that obtains moved and be appended to the language data process data buffer area, and meanwhile recording module can continue recording, in next data block, data block circulation storage is appended, and just can make the length of record length unrestricted with its deposit data that obtains.Because at the beginning in whole work, recording thread and data analysis thread just begin to have moved together, when obtaining a data block, the recording thread just passed to the data analysis thread immediately, when recording is not finished, the data analysis thread just can carry out the analysis of data, do not need to wait for that recording stops just to begin then analyzing and processing, improved efficient.Its part key code is as follows:
Two worker threads of // establishment
RockCreateThread(ProcGetProcGuid(GUID_EXE_VOICERECOG),AitalkWorkThread,
TPRI_LOW);
RockCreateThread(ProcGetProcGuid(GUID_EXE_VOICERECOG),
VoiceRecogHighTaskMsgCallBack,TPRI_HIGH);
// recording datacycle divides block cache
pRxBuf=&gAudioData.PCMdata[gAudioData.RxBuflndex*AUDIO_BUF_LEN];
DmaTransmit(AUDIO_DMACHANNEL,
(UINT32)RegI2s_RXR,
(UINT32)pRxBuf,
(UINT32)gAudioData.nPCMlength,
(UINT32)DmaI2sRecordCopy,
(DMACallBack)Voice_RecISR);
if(++gAudioData.RxBufIndex>=AUDIO_BUF_NUM)
gAudioData.RxBufIndex=0;
// supplemental data is to data field to be analyzed
EsrAppendData(g_hESRObj,lpBuf,nSample);
// taking-up data analysis
EsrRunStep(g_hESRObj);
// judge whether to produce recognition result
IStatus=EsrGetResultParameterA (g_hESRObj , ﹠amp; PCmdID , ﹠amp; NSame, (ivCStrA) " song title
″);
if(pCmdID[0]>0?&&?pCmdID[0]<nFileNum)
return?TRUE;
In addition, coherent system also be responsible for initialization sound pick-up outfit and speech parameter and scene setting, speech processes result obtain and obtain the operation that will carry out the back etc., with reference to shown in Figure 4, the work sequence of coherent system is divided two lines, article one, be the sound pick-up outfit operating path, second is the voice recognition processing operating path.From left to right increase progressively line for the time, flow chart of data processing can be divided 3 time periods according to treatment step and time corresponding point, recording and speech processes are provided with the time period, the processing procedure time period, the identification back time period, and require these 3 time periods synchronous in two lines.Be provided with in the time period, sound pick-up outfit needs initialization, and voice recognition processing need be created instance objects, and scene and dictionary (being the established command words and phrases) are discerned in initialization then, and send the startup recognition command.In processing procedure in the time period, need in the sound pick-up outfit operating path, start recording earlier, gather the recording data then, when gathering, the voice recognition processing route is accepted the recording data of gathering and is attempted bearing results, if bear results then enter the identification back time period, if not then continue collection analysis.After identification in the time period, in the sound pick-up outfit operating path, stop recording earlier, in the speech recognition operating path, then obtain recognition result earlier, operate according to result's (promptly ordering words and phrases) then, for example " play xxx.mp3 ", then begin to play the xxx.mp3 music file.
Article one, the startup of the sound pick-up outfit of route and stop to be subjected to the influence of second route identifying, when identifying initialization object, after finishing the input of dictionary and scene, can start the speech recognition of a scene, just start recording this time, when the recording data were constantly imported, the identification route was also handled the recording data always, up to manual termination or there is recognition result to produce, recording afterwards is stopped.At this moment just can obtain recognition result, and carry out corresponding operation, as according to the song title playing back music etc. according to the result.Wherein, the part key code is as follows:
The recording route:
// initialization sound pick-up outfit
Codec_SetSampleRate (8000); // sampling rate is set
CodecGainSet (5); // gain is set
SetAudioDataBuf(&gAudioData.pLeft,&gAudioData.pRight,&gAudioData.nPCMleng
Th); // buffer is set
PMU_EnterModule(PMU_RECORDADPCM);
Codec_SetMode (Codec_MICAdc); // MIC is set import
// start and record
I2sStart (I2S_Start_Rx); //I2S interface configuration
AudioInputBuffSwitch (); // being provided with and opening the DMA transmission, buffer is related with buffer memory
// collection recording data
FlushRecData();
// stop to record
I2sStop();
HDMA_Stop(0);
The identification route:
// establishment object
TUserSys.pWorkBuffer=WorkBuff; // distribute data is analyzed buffer memory
tUserSys.nWorkBufferBytes=USER_WORKBUFFER_BYTES;
// data analysis cache size
iStatus=EsrCreate(&g_hESRObj,&tUserSys,&tResPackDesc,1);
// establishment recognition engine
// input dictionary and scene
iStatus=EsrSetACP(g_hESRObj,ivESR_CP_GBK);
IStatus=EsrBeginLexiconA (g_hESRObj, (ivCStrA) " song title ");
for(i=0;i<nFileNum;i++)
{
iStatus=EsrAddLexiconItemW(g_hESRObj,\
(ivCStrW)pFileUnit[i].LongFileName,i+1);
}
iStatus=EsrEndLexicon(g_hESRObj);
IStatus=EsrBeginSceneA (g_hESRObj (ivCStrA) " plays scene ");
IStatus=EsrAddSyntaxA (g_hESRObj (ivCStrA) " opens { song title } ", 1);
iStatus=EsrEndScene(g_hESRObj);
The identification of a scene of // startup
EsrStartA (g_hESRObj (ivCStrA) " plays scene ");
// processing recording data
EsrRunStep(g_hESRObj);
// obtain recognition result
IStatus=EsrGetResultParameterA (g_hESRObj , ﹠amp; PCmdID , ﹠amp; NSame, (ivCStrA) " song
Name ");
// playing back music
PlayMusic(pCmdID[0]-1);
After coherent system is finished, indicating that whole process finishes substantially, the back only need define how opening voice identification gets final product.Solve problems such as noise, accent identification, spoken identification, improved accuracy of identification.
The above only is preferred embodiment of the present invention, not in order to restriction the present invention, all any modifications of being done within the spirit and principles in the present invention, is equal to and replaces and improvement etc., all should be included within protection scope of the present invention.
Claims (10)
1. a system that realizes speech recognition on televisor is characterized in that: comprise sound input system, data analysis system and the coherent system that connects sound input system and data analysis system; Wherein, the responsible sampling to human sound of sound input system, simulating signal are finished the preliminary work of primary data to the conversion of digital signal and the operation of metadata cache; Data analysis system is according to people's characteristic voice analysis spectrum characteristic, be responsible for extraction to effective sound, the removal of noise, and further analyze the content of sound, be translated into the people's of mating most intention, thereby reach the effect that machine can be discerned people's sound with sound; And coherent system mainly is the work of coordinating sound input system and data analysis system.
2. the system that on televisor, realizes speech recognition as claimed in claim 1, it is characterized in that, described sound input system includes miaow head and the A/D converter that sound is sampled, miaow head and A/D converter are sampled to sound, convert digital signal then to and be sent to data analysis system, handle by the MCU of data subsystem.
3. the method that on televisor, realizes speech recognition as claimed in claim 1 or 2, it is characterized in that, described data analysis system is to be finished by speech recognition engine, the data that the MCU of data subsystem obtains A/D converter are carried out the eigenwert extraction, and cepstral mean subtracts, acoustic layer identification, acoustic layer is known aftertreatment, the conversion of sound speech, speech figure retrieves beta pruning, exports control information then to control module.
4. the method that on televisor, realizes speech recognition as claimed in claim 3, it is characterized in that, described coherent system is done synchronous processing to the data acquisition of sound input system and analyzing and processing two parts of data analysis system, and it is divided into recording thread and two thread modules of data analysis thread.
5. a method that realizes speech recognition on televisor is characterized in that, comprises the steps:
The sound input system is sampled to sound, converts digital signal then to and is sent to data analysis system, is handled by the MCU of data subsystem;
Data analysis system is according to people's characteristic voice analysis spectrum characteristic, be responsible for extraction to effective sound, the removal of noise, and further analyze the content of sound, be translated into the people's of mating most intention, thereby reach the effect that machine can be discerned people's sound with sound;
Coherent system is coordinated the work of sound input system and data analysis system, and synchronous processing is done in the data acquisition of sound input system and analyzing and processing two parts of data analysis system, and it is divided into recording thread and two thread modules of data analysis thread.
6. the method that on televisor, realizes speech recognition as claimed in claim 5, it is characterized in that, data analysis system is to be finished by speech recognition engine, after voice signal is converted to speech data, at first carry out feature extraction, according to the effective frequency speech data of feature extraction of speech data, use the method for " cepstral mean subtracts " to carry out the noise abatement processing then; Then do the identification and the identification aftertreatment of acoustic layer,, analyze the content of identification voice, can obtain the content that the people speaks after analysis is finished, further do the conversion of sound speech again, can obtain the word content of voice according to existing acoustic model.
7. the method that on televisor, realizes speech recognition as claimed in claim 6, it is characterized in that, described coherent system is in the recording thread, at first start the recording thread, initiating hardware equipment is distributed the required memory source of thread and some initial parameter settings, after finishing initialization, just enter the thread circulation, judged whether the speech data input, the data of recording are deposited in the buffer area of this thread.
8. the method that on televisor, realizes speech recognition as claimed in claim 7, it is characterized in that, described coherent system is in the data analysis thread, at first log-on data is analyzed thread, after foundation and distribute data are analyzed required system resource, enter the thread circulation, whether the judgment data buffer area has is upgraded the data of coming, take out one section content analysis process from the data analysis district, whether the judgment data analysis bears results then; If bear results then stop the thread of recording, stop data analysis, the result is preserved.
9. the method that on televisor, realizes speech recognition as claimed in claim 8, it is characterized in that, described recording thread independent operating audio frequency acquiring data, with the deposit data of gathering at recording buffer memory block, with the interruption form data supplementing is arrived data field to be analyzed again then, and data analysis thread independent operating is handled the content of buffer area to be analyzed, when existing content, buffer area just takes out one section contents processing analysis from buffer zone, take out one section content from buffer zone at every turn, the data field to be analyzed corresponding size that just moves up is appended with regard to leaving the end of being close to data field to be analyzed in the time of data supplementing.
10. the method that on televisor, realizes speech recognition as claimed in claim 9, it is characterized in that in the coherent system to recording data buffer area piecemeal, set a block size, when a data block is expired in the data filling of gathering, produce an interruption, in this interrupts, the data block that obtains moved and be appended to the language data process data buffer area, and meanwhile recording module can continue recording, in next data block, data block circulation storage is appended with its deposit data that obtains.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010198592A CN101859565A (en) | 2010-06-11 | 2010-06-11 | System and method for realizing voice recognition on television |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010198592A CN101859565A (en) | 2010-06-11 | 2010-06-11 | System and method for realizing voice recognition on television |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101859565A true CN101859565A (en) | 2010-10-13 |
Family
ID=42945420
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201010198592A Pending CN101859565A (en) | 2010-06-11 | 2010-06-11 | System and method for realizing voice recognition on television |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101859565A (en) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102036033A (en) * | 2010-12-31 | 2011-04-27 | Tcl集团股份有限公司 | Method for remotely controlling television with voice and remote voice control |
CN102075797A (en) * | 2010-12-29 | 2011-05-25 | 深圳市同洲电子股份有限公司 | Channel or program voice browsing method and digital television receiving terminal |
CN102469363A (en) * | 2010-11-11 | 2012-05-23 | Tcl集团股份有限公司 | Television system with speech comment function and speech comment method |
CN102710968A (en) * | 2012-05-22 | 2012-10-03 | 袁华安 | Method for synchronizing video streams in cloud television system |
CN103116976A (en) * | 2012-11-09 | 2013-05-22 | 魏显辉 | Method and system for wirelessly controlling smart phone or tablet computer based on audio signal |
CN103220563A (en) * | 2013-03-25 | 2013-07-24 | 苏州德鲁克供应链管理有限公司 | Automatic channel selection software of television programs |
CN103474065A (en) * | 2013-09-24 | 2013-12-25 | 贵阳世纪恒通科技有限公司 | Method for determining and recognizing voice intentions based on automatic classification technology |
CN104658546A (en) * | 2013-11-19 | 2015-05-27 | 腾讯科技(深圳)有限公司 | Method and device for processing recorded voice |
CN105334471A (en) * | 2015-10-21 | 2016-02-17 | 中国人民解放军海军航空工程学院青岛校区 | Portable airplane power supply quality test system and method |
CN106328146A (en) * | 2016-08-22 | 2017-01-11 | 广东小天才科技有限公司 | Video subtitle generating method and device |
CN106531168A (en) * | 2016-11-18 | 2017-03-22 | 北京云知声信息技术有限公司 | Voice recognition method and voice recognition device |
CN107276777A (en) * | 2017-07-27 | 2017-10-20 | 苏州科达科技股份有限公司 | The audio-frequency processing method and device of conference system |
CN108877781A (en) * | 2018-06-13 | 2018-11-23 | 东方梦幻文化产业投资有限公司 | A kind of method and system of intelligent sound search film |
CN110351445A (en) * | 2019-06-19 | 2019-10-18 | 成都康胜思科技有限公司 | A kind of high concurrent VOIP recording service system based on intelligent sound identification |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1429019A (en) * | 2001-12-18 | 2003-07-09 | 松下电器产业株式会社 | TV set with sound discrimination function and its control method |
CN101211504A (en) * | 2006-12-31 | 2008-07-02 | 康佳集团股份有限公司 | Method, system and apparatus for remote control for TV through voice |
CN101521722A (en) * | 2008-02-27 | 2009-09-02 | 深圳Tcl新技术有限公司 | Speech recognition television and realization method thereof |
-
2010
- 2010-06-11 CN CN201010198592A patent/CN101859565A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1429019A (en) * | 2001-12-18 | 2003-07-09 | 松下电器产业株式会社 | TV set with sound discrimination function and its control method |
CN101211504A (en) * | 2006-12-31 | 2008-07-02 | 康佳集团股份有限公司 | Method, system and apparatus for remote control for TV through voice |
CN101521722A (en) * | 2008-02-27 | 2009-09-02 | 深圳Tcl新技术有限公司 | Speech recognition television and realization method thereof |
Non-Patent Citations (2)
Title |
---|
《中国优秀硕士学位论文全文数据库-信息科技辑》 20070915 祁玉林 基于PC的无线语音控制系统设计 1,5-6,16,24-36 1-10 , 第3期 2 * |
《微计算机应用》 20060331 何成林等 电话语音监控系统的设计与实现 174-176页 1-10 第27卷, 第2期 2 * |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102469363A (en) * | 2010-11-11 | 2012-05-23 | Tcl集团股份有限公司 | Television system with speech comment function and speech comment method |
CN102075797A (en) * | 2010-12-29 | 2011-05-25 | 深圳市同洲电子股份有限公司 | Channel or program voice browsing method and digital television receiving terminal |
CN102036033A (en) * | 2010-12-31 | 2011-04-27 | Tcl集团股份有限公司 | Method for remotely controlling television with voice and remote voice control |
CN102710968A (en) * | 2012-05-22 | 2012-10-03 | 袁华安 | Method for synchronizing video streams in cloud television system |
CN103116976B (en) * | 2012-11-09 | 2015-08-12 | 魏显辉 | Method and system for wirelessly controlling smart phone or tablet computer based on audio signal |
CN103116976A (en) * | 2012-11-09 | 2013-05-22 | 魏显辉 | Method and system for wirelessly controlling smart phone or tablet computer based on audio signal |
CN103220563A (en) * | 2013-03-25 | 2013-07-24 | 苏州德鲁克供应链管理有限公司 | Automatic channel selection software of television programs |
CN103474065A (en) * | 2013-09-24 | 2013-12-25 | 贵阳世纪恒通科技有限公司 | Method for determining and recognizing voice intentions based on automatic classification technology |
CN104658546B (en) * | 2013-11-19 | 2019-02-01 | 腾讯科技(深圳)有限公司 | Recording treating method and apparatus |
CN104658546A (en) * | 2013-11-19 | 2015-05-27 | 腾讯科技(深圳)有限公司 | Method and device for processing recorded voice |
CN105334471A (en) * | 2015-10-21 | 2016-02-17 | 中国人民解放军海军航空工程学院青岛校区 | Portable airplane power supply quality test system and method |
CN106328146A (en) * | 2016-08-22 | 2017-01-11 | 广东小天才科技有限公司 | Video subtitle generating method and device |
CN106531168A (en) * | 2016-11-18 | 2017-03-22 | 北京云知声信息技术有限公司 | Voice recognition method and voice recognition device |
CN106531168B (en) * | 2016-11-18 | 2020-04-28 | 北京云知声信息技术有限公司 | Voice recognition method and device |
CN107276777A (en) * | 2017-07-27 | 2017-10-20 | 苏州科达科技股份有限公司 | The audio-frequency processing method and device of conference system |
CN107276777B (en) * | 2017-07-27 | 2020-05-29 | 苏州科达科技股份有限公司 | Audio processing method and device of conference system |
CN108877781A (en) * | 2018-06-13 | 2018-11-23 | 东方梦幻文化产业投资有限公司 | A kind of method and system of intelligent sound search film |
CN108877781B (en) * | 2018-06-13 | 2021-07-13 | 东方梦幻文化产业投资有限公司 | Method and system for searching film through intelligent voice |
CN110351445A (en) * | 2019-06-19 | 2019-10-18 | 成都康胜思科技有限公司 | A kind of high concurrent VOIP recording service system based on intelligent sound identification |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101859565A (en) | System and method for realizing voice recognition on television | |
CN105245917B (en) | A kind of system and method for multi-media voice subtitle generation | |
CN101001294B (en) | Intelligent household voice recording and prompt system based on voice recognition technology | |
CN110473546B (en) | Media file recommendation method and device | |
TWI425500B (en) | Indexing digitized speech with words represented in the digitized speech | |
CN102543073B (en) | Shanghai dialect phonetic recognition information processing method | |
CN107293293A (en) | A kind of voice instruction recognition method, system and robot | |
CN103336773B (en) | System and method for audio and video speech processing and retrieval | |
CN103491429A (en) | Audio processing method and audio processing equipment | |
CN103198829A (en) | Method, device and equipment of reducing interior noise and improving voice recognition rate | |
CN111489743B (en) | Operation management analysis system based on intelligent voice technology | |
CN102862587B (en) | A kind of railway vehicle machine joint control speech analysis method and equipment | |
CN109147820A (en) | Vehicle audio control method, device, electronic equipment and storage medium | |
CN103491406A (en) | Android intelligent television system based on voice recognition | |
CN110223677A (en) | Spatial audio signal filtering | |
CN201869323U (en) | Stb | |
CN101833977A (en) | Court trial video real-time indexing method triggered by specific voice | |
CN107688571A (en) | The video retrieval method of diversification | |
CN101833982A (en) | Special sound-triggered court trial audio file real-time indexing method | |
CN105869636A (en) | Speech recognition apparatus and method thereof, smart television set and control method thereof | |
CN101299332B (en) | Method for implementing speech synthesis function by GSM mobile phone | |
CN106550268B (en) | Video processing method and video processing device | |
US12154545B2 (en) | Audio information processing method, audio information processing apparatus, electronic device, and storage medium | |
US20080167879A1 (en) | Speech delimiting processing system and method | |
CN201509246U (en) | Voice channel-selecting device based on DVB |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20101013 |