CN102592628A - Play control method of audio and video play file - Google Patents
Play control method of audio and video play file
- Publication number: CN102592628A
- Application number: CN2012100337158A
- Authority: CN (China)
- Prior art keywords: text, time, audio, search
- Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to the technical field of audio and video playback, and in particular to a playback control method for an audio/video file. The method comprises: separating the audio from the video of the audio/video file to obtain an audio file; performing speech recognition on the audio file based on speech resources to obtain speech data; converting the speech data into one or more pieces of text, associating each piece of text with its text occurrence time, and storing the text and the times in a database, where the text occurrence time is the time at which that text occurs in the audio/video file; when a text search request is received from a user, searching the database for text matching the request; and starting playback of the audio/video file from the text occurrence time of the matching text. Searching by spoken text lets the user locate content of interest more precisely.
Description
Technical field
The present invention relates to the technical field of audio and video playback, and in particular to a playback control method for an audio/video file.
Background art
People are now accustomed to playing audio/video files, such as movies and television programs, on a computer. Usually, however, not every part of a movie or television program interests the user. Most audio/video players therefore provide a progress bar, and a user who wants to watch only certain parts typically drags the progress bar and searches by eye.
This way of searching is imprecise. While dragging, the user cannot be sure of seeing every video frame or hearing all the audio, so the user often has to drag back and forth repeatedly to find the part of interest.
Summary of the invention
The present invention provides a playback control method for an audio/video file, to solve the technical problem that, in the prior art, an audio/video file can be searched only by dragging the progress bar, which makes the search imprecise.
The technical solution adopted is as follows:
A playback control method for an audio/video file, comprising:
separating the audio from the video of the audio/video file to obtain an audio file;
performing speech recognition on the audio file according to speech resources to obtain speech data;
converting the speech data into one or more pieces of text, associating each piece of text with its text occurrence time, and storing them in a database, where the text occurrence time is the time at which each piece of text occurs in the audio/video file;
on receiving a text search request from the user, searching the database for text that matches the text search request;
if text matching the text search request is found, playing the audio/video file starting from the text occurrence time of the matching text.
The audio/video file here refers to a file that contains human speech and continuously played video, such as a movie or a television program.
Further:
If more than one piece of text matching the text search request is found, a priority playback time is selected, according to a priority playback time selection rule, from the text occurrence times of all matching text, and the audio/video file starts playing from the priority playback time. The priority playback time selection rule is:
determine the time at which the audio/video file is currently playing as the playback time;
the text occurrence time that is after the playback time and closest to it is the priority playback time.
Further:
If more than one piece of text matching the text search request is found, the user is prompted to select a playback time;
on receiving the user's time selection, the audio/video file starts playing from the playback time the user selected.
Further, when the user is prompted to select a playback time, a video screenshot of the audio/video file at each matching text occurrence time is displayed.
Further, the speech resources comprise a language model, an acoustic model and/or a dictionary.
By searching spoken text, the present invention lets the user locate content of interest more precisely and reach the desired program content quickly, without repeatedly dragging the progress bar.
Description of drawings
Fig. 1 is a flowchart of an embodiment of the invention;
Fig. 2 is a schematic diagram of searching during playback in an embodiment of the invention;
Fig. 3 is a schematic diagram of displaying search results in an embodiment of the invention;
Fig. 4 is a schematic diagram of displaying the user's selection result in an embodiment of the invention.
Embodiment
The present invention is described in further detail below with reference to the accompanying drawings and a specific embodiment.
In this embodiment the audio/video file is a movie file. In practice, it can be any audio/video file that contains human speech and continuously played video, such as a movie or a television program.
In this embodiment a computer plays the audio/video file. In practice, other playback devices can be used, such as a DVD player or a portable electronic device (tablet (PAD), mobile phone, etc.).
Fig. 1 shows the flowchart of this embodiment:
Step S101: while the playback software on the computer plays the movie file, the computer separates the audio from the video of the movie file in the background to obtain an audio file.
Any existing audio/video separation technique can be used. For example, the simplest and most common method is to record the movie file, store the audio data in a buffer, and thereby obtain the audio file.
Because audio/video separation is a mature technique, it is not described further here.
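As one possible illustration of step S101, the sketch below extracts the audio track with the ffmpeg command-line tool instead of the recording approach mentioned above. This is a swapped-in technique, not the method of the embodiment. The file names, the 16 kHz mono WAV output format, and the helper names `build_extract_command` and `extract_audio` are assumptions made for the example.

```python
import subprocess
from pathlib import Path


def build_extract_command(movie_path: str, audio_path: str) -> list[str]:
    """Build an ffmpeg command that drops the video stream and writes a WAV file."""
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it already exists
        "-i", movie_path,         # input movie file
        "-vn",                    # discard the video stream
        "-acodec", "pcm_s16le",   # 16-bit PCM audio
        "-ar", "16000",           # 16 kHz sample rate (assumed; adjust for the recognizer)
        "-ac", "1",               # mono
        audio_path,
    ]


def extract_audio(movie_path: str, audio_path: str) -> Path:
    """Run ffmpeg to obtain the audio file; requires ffmpeg on the PATH."""
    subprocess.run(build_extract_command(movie_path, audio_path), check=True)
    return Path(audio_path)
```

Calling `extract_audio("movie.mp4", "movie.wav")` would produce the audio file that step S102 consumes, provided ffmpeg is installed.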
Step S102: speech recognition is performed on the audio file according to speech resources to obtain speech data.
The speech resources comprise a language model, an acoustic model and/or a dictionary. For example, the corresponding acoustic feature vectors are derived from the language model, acoustic model and/or dictionary, and speech is then extracted from the movie file according to those acoustic feature vectors.
Because speech recognition is a mature technique, it is not described further here.
Step S103: the speech data is converted into one or more pieces of text, each piece of text is associated with its text occurrence time, and both are stored in a database, where the text occurrence time is the time at which each piece of text occurs in the movie file.
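The association of step S103 can be sketched as a two-column table. The example below uses Python's built-in sqlite3 module with an in-memory database. The table name `text_index`, the column names, the helper names, and the sample recognizer output are all hypothetical; the embodiment does not prescribe a particular database or schema.

```python
import sqlite3


def create_index(conn: sqlite3.Connection) -> None:
    """Create the table that links each piece of text to its occurrence time."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS text_index ("
        " spoken_text TEXT NOT NULL,"
        " occurrence_seconds INTEGER NOT NULL)"
    )


def store_recognized_text(conn: sqlite3.Connection, items) -> None:
    """Store (spoken_text, occurrence_seconds) pairs produced by the recognizer."""
    conn.executemany(
        "INSERT INTO text_index (spoken_text, occurrence_seconds) VALUES (?, ?)",
        items,
    )
    conn.commit()


# Hypothetical recognizer output: text plus seconds from the start of the movie.
recognized = [
    ("today", 19 * 60 + 32),
    ("weather", 19 * 60 + 35),
    ("today", 23 * 60 + 3),
]
conn = sqlite3.connect(":memory:")
create_index(conn)
store_recognized_text(conn, recognized)
```

Storing the time as whole seconds keeps later comparisons simple; a real recognizer may report finer-grained timestamps.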
Step S104: a text search request is received from the user, and the database is searched for text matching the request:
if text matching the text search request is found, step S105 is executed; otherwise step S106 is executed.
In this step, the user's text search request can be received by adding a search bar to the playback software. Any text the user enters in the search bar is treated as the user's text search request. The database search can use any existing database text search method, for example a full scan of the database from beginning to end, or a search using a bubble algorithm. Because such search methods are mature, they are not described further here.
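The full scan mentioned for step S104 might look like the following sketch, which walks every stored record from first to last. The `TextRecord` type, the function name `search_records`, and the choice of substring matching are assumptions; the embodiment leaves the matching criterion open.

```python
from typing import NamedTuple


class TextRecord(NamedTuple):
    text: str                 # recognized text
    occurrence_seconds: int   # time at which the text occurs in the movie


def search_records(records, query: str) -> list[int]:
    """Scan all records in order and return occurrence times of matching text."""
    query = query.strip()
    if not query:
        return []  # an empty search bar matches nothing
    return [r.occurrence_seconds for r in records if query in r.text]


records = [
    TextRecord("today", 19 * 60 + 32),
    TextRecord("weather", 19 * 60 + 35),
    TextRecord("today", 23 * 60 + 3),
    TextRecord("today", 40 * 60 + 27),
]
matches = search_records(records, "today")
```

An empty result corresponds to the branch to step S106, and a non-empty result to step S105.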
Step S105: the movie file starts playing from the text occurrence time of the text matching the text search request.
If more than one piece of text matching the text search request is found, a priority playback time is selected, according to a priority playback time selection rule, from the text occurrence times of all matching text, and the movie file starts playing from the priority playback time. The priority playback time selection rule is:
determine the time at which the movie file is currently playing as the playback time;
the text occurrence time that is after the playback time and closest to it is the priority playback time.
For example, when the user searches for "today" at playback time 20:46 of the movie, the computer searches the database in the background and finds three occurrences of "today", at playback times 19:32, 23:03 and 40:27. Playback then starts from 23:03, because 23:03 is after the current playback time of 20:46 and is the closest such time.
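The selection rule and the worked example above can be sketched as follows. The helper names are hypothetical, and returning `None` when no match lies after the current playback time is an assumption, since the text does not say what happens in that case.

```python
def to_seconds(mmss: str) -> int:
    """Convert a "MM:SS" playback time to seconds."""
    minutes, seconds = mmss.split(":")
    return int(minutes) * 60 + int(seconds)


def to_mmss(total_seconds: int) -> str:
    """Convert seconds back to "MM:SS"."""
    return f"{total_seconds // 60:02d}:{total_seconds % 60:02d}"


def select_priority_time(match_times, current_time):
    """Return the earliest match strictly after current_time, or None if there is none."""
    later = [t for t in match_times if t > current_time]
    return min(later) if later else None


matches = [to_seconds(t) for t in ("19:32", "23:03", "40:27")]
chosen = select_priority_time(matches, to_seconds("20:46"))
print(to_mmss(chosen))  # prints 23:03
```

Among the times after the current position, the closest one is simply the smallest, which is why `min` suffices here.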
Step S106: the movie file continues playing from its current playback time.
For step S105, if more than one piece of text matching the text search request is found, the following more user-friendly method can be used instead:
video screenshots of the movie file at the matching text occurrence times are displayed, and the user is prompted to select a playback time;
on receiving the user's time selection, the movie file starts playing from the playback time the user selected.
For example, as shown in Fig. 2, while playback software 1 is playing the movie file, the user searches for "today" in search bar 2 at playback time 20:46. The computer searches the database in the background and finds three occurrences of "today", at playback times 19:32, 23:03 and 40:27.
Then, as shown in Fig. 3, playback software 1 plays the current picture in current-picture area 3 on the left, and display bar 4 on the right shows the three times 19:32, 23:03 and 40:27 together with their corresponding pictures: picture 41, picture 42 and picture 43.
The user selects the "today" at 40:27, and then, as shown in Fig. 4, playback software 1 starts playing from 40:27.
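The user-selection variant can be sketched as building a list of candidates for the display bar and resolving the user's choice into a seek position. The `Candidate` type, the screenshot file naming, and the index-based selection are assumptions for illustration; capturing the screenshots themselves is outside this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    label: str        # display time, e.g. "23:03"
    seconds: int      # seek position in seconds
    screenshot: str   # hypothetical screenshot file name for this time


def build_candidates(match_times) -> list[Candidate]:
    """Create one display entry per matching text occurrence time, in time order."""
    candidates = []
    for t in sorted(match_times):
        label = f"{t // 60:02d}:{t % 60:02d}"
        candidates.append(Candidate(label, t, f"frame_{t}.png"))
    return candidates


def resolve_selection(candidates, chosen_index: int) -> int:
    """Return the seek position for the entry the user picked."""
    return candidates[chosen_index].seconds


candidates = build_candidates([2427, 1172, 1383])
seek_to = resolve_selection(candidates, 2)  # user picks the third entry, 40:27
```

The player would then seek to `seek_to` seconds, matching the Fig. 4 example in which playback starts from 40:27.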
Claims (5)
1. A playback control method for an audio/video file, characterized by comprising:
separating the audio from the video of the audio/video file to obtain an audio file;
performing speech recognition on the audio file according to speech resources to obtain speech data;
converting the speech data into one or more pieces of text, associating each piece of text with its text occurrence time, and storing them in a database, where the text occurrence time is the time at which each piece of text occurs in the audio/video file;
on receiving a text search request from the user, searching the database for text that matches the text search request;
if text matching the text search request is found, playing the audio/video file starting from the text occurrence time of the matching text.
2. The playback control method for an audio/video file according to claim 1, characterized in that:
if more than one piece of text matching the text search request is found, a priority playback time is selected, according to a priority playback time selection rule, from the text occurrence times of all matching text, and the audio/video file starts playing from the priority playback time, the priority playback time selection rule being:
determine the time at which the audio/video file is currently playing as the playback time;
the text occurrence time that is after the playback time and closest to it is the priority playback time.
3. The playback control method for an audio/video file according to claim 1, characterized in that:
if more than one piece of text matching the text search request is found, the user is prompted to select a playback time;
on receiving the user's time selection, the audio/video file starts playing from the playback time the user selected.
4. The playback control method for an audio/video file according to claim 3, characterized in that, when the user is prompted to select a playback time, a video screenshot of the audio/video file at each matching text occurrence time is displayed.
5. The playback control method for an audio/video file according to claim 1, characterized in that the speech resources comprise a language model, an acoustic model and/or a dictionary.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012100337158A CN102592628A (en) | 2012-02-15 | 2012-02-15 | Play control method of audio and video play file |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012100337158A CN102592628A (en) | 2012-02-15 | 2012-02-15 | Play control method of audio and video play file |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102592628A true CN102592628A (en) | 2012-07-18 |
Family
ID=46481150
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2012100337158A Pending CN102592628A (en) | 2012-02-15 | 2012-02-15 | Play control method of audio and video play file |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102592628A (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103581694A (en) * | 2012-07-19 | 2014-02-12 | 冠捷投资有限公司 | Smart TV with human voice search function, intelligent audio-visual system and method for human voice search |
CN103838723A (en) * | 2012-11-20 | 2014-06-04 | 联想(北京)有限公司 | Data association method and electronic device |
CN103885693A (en) * | 2012-12-20 | 2014-06-25 | 联想(北京)有限公司 | Method for processing information and electronic equipment |
CN104284219A (en) * | 2013-07-11 | 2015-01-14 | Lg电子株式会社 | Mobile terminal and method of controlling the mobile terminal |
CN104572714A (en) * | 2013-10-18 | 2015-04-29 | 英业达科技有限公司 | Learning video inquiring system and learning video inquiring method |
CN104572716A (en) * | 2013-10-18 | 2015-04-29 | 英业达科技有限公司 | System and method for playing video files |
CN104572712A (en) * | 2013-10-18 | 2015-04-29 | 英业达科技有限公司 | Multimedia file browsing system and multimedia file browsing method |
CN104809220A (en) * | 2015-04-30 | 2015-07-29 | 努比亚技术有限公司 | Audio playing method and device |
CN103581694B (en) * | 2012-07-19 | 2016-11-30 | 冠捷投资有限公司 | Smart TV with human voice search function, intelligent audio-visual system and method for human voice search |
WO2018027730A1 (en) * | 2016-08-11 | 2018-02-15 | 张婧 | Method and system for synchronisation in piano video teaching |
WO2018027729A1 (en) * | 2016-08-11 | 2018-02-15 | 张婧 | Method and system for teaching video synchronisation in music course |
WO2018027731A1 (en) * | 2016-08-11 | 2018-02-15 | 张婧 | Method and system for video synchronisation in english learning |
CN108806692A (en) * | 2018-05-29 | 2018-11-13 | 深圳市云凌泰泽网络科技有限公司 | A kind of audio content is searched and visualization playback method |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1662053A (en) * | 2004-02-24 | 2005-08-31 | 皇家飞利浦电子股份有限公司 | Program content positioning method and device |
CN101281534A (en) * | 2008-05-28 | 2008-10-08 | 叶睿智 | Method for searching multimedia resource based on audio content retrieval |
US20090222442A1 (en) * | 2005-11-09 | 2009-09-03 | Henry Houh | User-directed navigation of multimedia search results |
CN101908053A (en) * | 2009-11-27 | 2010-12-08 | 新奥特(北京)视频技术有限公司 | Voice retrieval method and device |
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1662053A (en) * | 2004-02-24 | 2005-08-31 | 皇家飞利浦电子股份有限公司 | Program content positioning method and device |
US20090222442A1 (en) * | 2005-11-09 | 2009-09-03 | Henry Houh | User-directed navigation of multimedia search results |
CN101281534A (en) * | 2008-05-28 | 2008-10-08 | 叶睿智 | Method for searching multimedia resource based on audio content retrieval |
CN101908053A (en) * | 2009-11-27 | 2010-12-08 | 新奥特(北京)视频技术有限公司 | Voice retrieval method and device |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103581694B (en) * | 2012-07-19 | 2016-11-30 | 冠捷投资有限公司 | Smart TV with human voice search function, intelligent audio-visual system and method for human voice search |
CN103581694A (en) * | 2012-07-19 | 2014-02-12 | 冠捷投资有限公司 | Smart TV with human voice search function, intelligent audio-visual system and method for human voice search |
CN103838723A (en) * | 2012-11-20 | 2014-06-04 | 联想(北京)有限公司 | Data association method and electronic device |
CN103838723B (en) * | 2012-11-20 | 2017-04-19 | 联想(北京)有限公司 | Data association method and electronic device |
CN103885693A (en) * | 2012-12-20 | 2014-06-25 | 联想(北京)有限公司 | Method for processing information and electronic equipment |
CN104284219A (en) * | 2013-07-11 | 2015-01-14 | Lg电子株式会社 | Mobile terminal and method of controlling the mobile terminal |
US9639251B2 (en) | 2013-07-11 | 2017-05-02 | Lg Electronics Inc. | Mobile terminal and method of controlling the mobile terminal for moving image playback |
CN104572712A (en) * | 2013-10-18 | 2015-04-29 | 英业达科技有限公司 | Multimedia file browsing system and multimedia file browsing method |
CN104572716A (en) * | 2013-10-18 | 2015-04-29 | 英业达科技有限公司 | System and method for playing video files |
CN104572714A (en) * | 2013-10-18 | 2015-04-29 | 英业达科技有限公司 | Learning video inquiring system and learning video inquiring method |
CN104809220A (en) * | 2015-04-30 | 2015-07-29 | 努比亚技术有限公司 | Audio playing method and device |
WO2018027730A1 (en) * | 2016-08-11 | 2018-02-15 | 张婧 | Method and system for synchronisation in piano video teaching |
WO2018027729A1 (en) * | 2016-08-11 | 2018-02-15 | 张婧 | Method and system for teaching video synchronisation in music course |
WO2018027731A1 (en) * | 2016-08-11 | 2018-02-15 | 张婧 | Method and system for video synchronisation in english learning |
CN108806692A (en) * | 2018-05-29 | 2018-11-13 | 深圳市云凌泰泽网络科技有限公司 | A kind of audio content is searched and visualization playback method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102592628A (en) | Play control method of audio and video play file | |
US9942599B2 (en) | Methods and apparatus to synchronize second screen content with audio/video programming using closed captioning data | |
KR101992475B1 (en) | Using an audio stream to identify metadata associated with a currently playing television program | |
EP3175442B1 (en) | Systems and methods for performing asr in the presence of heterographs | |
US9804729B2 (en) | Presenting key differences between related content from different mediums | |
JP2014132464A (en) | Interactive type interface device and control method of the same | |
CN103686200A (en) | Intelligent television video resource searching method and system | |
US9704536B2 (en) | Video display device and method for operating the same | |
US9158435B2 (en) | Synchronizing progress between related content from different mediums | |
US11803589B2 (en) | Systems, methods, and media for identifying content | |
CN103414948A (en) | Method and device for playing video | |
TW201206166A (en) | Linking real time media context to related applications and services | |
CN106210901A (en) | Display device | |
CN104410924B (en) | A kind of multimedia titles display methods and device | |
US10911831B2 (en) | Information processing apparatus, information processing method, program, and information processing system | |
JP5209129B1 (en) | Information processing apparatus, broadcast receiving apparatus, and information processing method | |
KR20200008341A (en) | Media play device and method for controlling screen and server for analyzing screen | |
JP5703321B2 (en) | Information processing apparatus and information processing method | |
KR100944958B1 (en) | Device and server that provide multimedia data and caption data of specific section | |
JP2014207619A (en) | Video recording and reproducing device and control method of video recording and reproducing device | |
JP5840026B2 (en) | Content storage apparatus and content storage method | |
CN102572534A (en) | System and method for synchronizing with multimedia broadcast program | |
KR20150078930A (en) | Method of providing content and apparatus therefor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20120718 |