[go: up one dir, main page]

CN102592628A - Play control method of audio and video play file - Google Patents

Play control method of audio and video play file Download PDF

Info

Publication number
CN102592628A
CN102592628A CN2012100337158A CN201210033715A CN102592628A CN 102592628 A CN102592628 A CN 102592628A CN 2012100337158 A CN2012100337158 A CN 2012100337158A CN 201210033715 A CN201210033715 A CN 201210033715A CN 102592628 A CN102592628 A CN 102592628A
Authority
CN
China
Prior art keywords
literal
time
audio frequency
search
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012100337158A
Other languages
Chinese (zh)
Inventor
张群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN2012100337158A priority Critical patent/CN102592628A/en
Publication of CN102592628A publication Critical patent/CN102592628A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the relevant technical field of audio and video play and in particular to a play control method of an audio and video play file. The play control method comprises the steps of: performing audio and video separation on the audio and video play files to obtain an audio file; performing voice recognition on the audio file based on voice resource so as to obtain voice data; converting the voice data to one or multiple characters and associating each character with the time when the character corresponding to each character appears and storing the characters and the time in a database, wherein the character appearing time is the time when each character appears in the audio and video play file; when a character search demand from a user is received, searching a character matching with the character search demand in the database; and starting playing the audio and video play file at the character appearing time corresponding to the character which is matched with the character search demand. Through the voice character search, the user can search the concerned contents more exactly.

Description

A kind of control method for playing back of audio frequency and video played file
Technical field
The present invention relates to audio frequency and video and play correlative technology field, particularly a kind of control method for playing back of audio frequency and video played file.
Background technology
Present people have got used to playing audio-video document on computers, for example various movie or television programs etc.But not all in film or TV programme usually all is the user's interest part.Therefore, most audio frequency and video playout softwares all can add progress bar, and when the user only wanted to see some part wherein, general way was to drag progress bar, searches for through naked eyes.
But this way of search not science very be difficult to guarantee when dragging that the user can see all video pictures, hears all audio frequency, so the user drags before and after often will be repeatedly, just can find its interested part.
Summary of the invention
The present invention provides a kind of control method for playing back of audio frequency and video played file, can only search for through the mode that drags progress bar the search of audio frequency and video played file to solve prior art, causes searching for coarse technical matters.
The technical scheme that adopts is following:
A kind of control method for playing back of audio frequency and video played file comprises:
Carry out audio frequency and video to the audio frequency and video played file and separate, obtain audio file;
Audio file is carried out speech recognition according to voice resource, obtain speech data;
Convert speech data into one or more literal, each literal and carry out relatedly, and be stored in the database time that said literal time of occurrence occurs in the audio frequency and video played file for each literal with the pairing literal time of occurrence of each literal;
When the text search requirement that receives the user, the literal that search and the requirement of said text search are complementary in database:
If search with said text search and require the literal that is complementary, then the audio frequency and video played file begins to play from the literal corresponding character time of occurrence that requires to be complementary with said text search.
Described audio frequency and video played file refers to has human speech utterance, and has the audio-video document that continuous video is play, for example film, TV programme etc.
Further:
If search the literal that requires to be complementary with said text search above one; Then from the pairing literal time of occurrence of literal that all and said text search requires to be complementary, select preferential reproduction time according to preferential reproduction time selective rule; Said audio frequency and video played file begins to play from preferential reproduction time, and said preferential reproduction time selective rule is:
Confirm that the in progress time of audio frequency and video played file is reproduction time;
After said reproduction time, and be preferential reproduction time near the literal time of occurrence of reproduction time.
Further:
Surpass one if search the literal that requires to be complementary with said text search, then point out the user to select reproduction time;
When the selection of time that receives the user, the audio frequency and video played file begins to play according to user-selected reproduction time.
Further, when the prompting user selects reproduction time, show the video interception of audio frequency and video played file when the pairing literal time of occurrence of literal is play.
Further, said voice resource comprises: language model, acoustic model and/or dictionary.
The present invention makes the user can search the content that it is paid close attention to more accurately through the language and characters search, can see the programme content that it is concerned about fast, and need not the progress bar that drags repeatedly.
Description of drawings
Fig. 1 is the process flow diagram of the embodiment of the invention.
Fig. 2 plays the search synoptic diagram for the embodiment of the invention;
Fig. 3 is an embodiment of the invention display of search results synoptic diagram;
Fig. 4 is an embodiment of the invention explicit user selection result synoptic diagram.
Embodiment
Below in conjunction with accompanying drawing and specific embodiment the present invention is done further detailed explanation.
In the present embodiment, the audio frequency and video played file is a movie file, and in reality, the audio frequency and video played file can be variously to have human speech utterance, and has the audio-video document that continuous video is play, for example film, TV programme etc.
In the present embodiment, adopt computer to carry out the broadcast of audio frequency and video played file, but in the reality, can adopt other various playback equipments, DVD player for example, portable electronic equipment (PAD, mobile phone etc.).
Be illustrated in figure 1 as the process flow diagram of present embodiment:
In the time of step S101, playout software movie file through computer, the computer backstage is carried out audio frequency and video to movie file and is separated, and obtains audio file.
Audio frequency and video are separated can adopt existing various audio frequency and video isolation technics.For example, simply the most also be modal method, be movie file is recorded, be stored in buffer zone to voice data, then obtain audio file.
Because the audio frequency and video isolation technics has been ripe mode, therefore no longer details here.
Step S102 carries out speech recognition to audio file according to voice resource, obtains speech data.
Shown in voice resource comprise: language model, acoustic model and/or dictionary.For example draw corresponding sound acoustic feature vector, then movie file is extracted according to sound acoustic feature vector according to language model, acoustic model and/or dictionary.
Because speech recognition technology has been ripe mode, therefore no longer details here.
Step S103 converts speech data into one or more literal, each literal and carry out relatedly with the pairing literal time of occurrence of each literal, and is stored in the database time that said literal time of occurrence occurs in movie file for each literal;
Step S104 receives user's text search requirement, and search and said text search require the literal that is complementary in database:
If search with said text search and require the literal that is complementary, execution in step S105 then, otherwise execution in step S016.
In this step, receive user's text search requirement, can be through in playout software, increasing a search column can realize.All literal that the user is imported in search column all can be considered to user's text search requirement.And in database, search for, can adopt existing various database text search mode, for example carry out full database search from the beginning of database always backward, perhaps adopt bubbling algorithm etc. to search for.Because way of search has been ripe mode, therefore no longer details here.
Step S105, movie file begins to play from the literal corresponding character time of occurrence that requires to be complementary with said text search.
If search the literal that requires to be complementary with said text search above one; Then from the pairing literal time of occurrence of literal that all and said text search requires to be complementary, select preferential reproduction time according to preferential reproduction time selective rule; Said movie file begins to play from preferential reproduction time, and said preferential reproduction time selective rule is:
Confirm that the in progress time of movie file is reproduction time;
After said reproduction time, and be preferential reproduction time near the literal time of occurrence of reproduction time.
For example, when the reproduction time 20:46 of user at film, search " today "; The computer backstage is searched for from database; And obtaining 3 " today ", its corresponding respectively reproduction time is: 19:32,23:03,40:27, then play from 23:03; Because 23:03 is a movie file the in progress time: after the 20:46, and the immediate time.
Step S106 continued to play in the in progress time of current movie file.
For step S105, surpass one if search the literal that requires to be complementary with said text search, can also adopt following more convenient user's method:
Show the video interception of movie file when the pairing literal time of occurrence of literal is play, and the prompting user selects reproduction time;
When the selection of time that receives the user, movie file begins to play according to user-selected reproduction time.
For example; As shown in Figure 2, when user's playout software 1 movie file, when the reproduction time 20:46 of movie file; In search column 2 search " today "; The computer backstage is searched for from database, and obtains 3 " today ", and its corresponding respectively reproduction time is: 19:32,23:03,40:27
Then as shown in Figure 3, playout software 13 is play current picture in the current picture district in left side, and the display field 4 on the right side shows three time 19:32,23:03,40:27, with and three corresponding respectively pictures, picture 41, picture 42, picture 43.
The user select 40:27's " today ", then as shown in Figure 4, playout software 1 begins to play from 40:27.

Claims (5)

1. the control method for playing back of an audio frequency and video played file is characterized in that, comprising:
Carry out audio frequency and video to the audio frequency and video played file and separate, obtain audio file;
Audio file is carried out speech recognition according to voice resource, obtain speech data;
Convert speech data into one or more literal, each literal and carry out relatedly, and be stored in the database time that said literal time of occurrence occurs in the audio frequency and video played file for each literal with the pairing literal time of occurrence of each literal;
When the text search requirement that receives the user, the literal that search and the requirement of said text search are complementary in database:
If search with said text search and require the literal that is complementary, then the audio frequency and video played file begins to play from the literal corresponding character time of occurrence that requires to be complementary with said text search.
2. the control method for playing back of audio frequency and video played file according to claim 1 is characterized in that:
If search the literal that requires to be complementary with said text search above one; Then from the pairing literal time of occurrence of literal that all and said text search requires to be complementary, select preferential reproduction time according to preferential reproduction time selective rule; Said audio frequency and video played file begins to play from preferential reproduction time, and said preferential reproduction time selective rule is:
Confirm that the in progress time of audio frequency and video played file is reproduction time;
After said reproduction time, and be preferential reproduction time near the literal time of occurrence of reproduction time.
3. the control method for playing back of audio frequency and video played file according to claim 1 is characterized in that:
Surpass one if search the literal that requires to be complementary with said text search, then point out the user to select reproduction time;
When the selection of time that receives the user, the audio frequency and video played file begins to play according to user-selected reproduction time.
4. the control method for playing back of audio frequency and video played file according to claim 3 is characterized in that, when the prompting user selects reproduction time, shows the video interception of audio frequency and video played file when the pairing literal time of occurrence of literal is play.
5. the control method for playing back of audio frequency and video played file according to claim 1 is characterized in that, said voice resource comprises: language model, acoustic model and/or dictionary.
CN2012100337158A 2012-02-15 2012-02-15 Play control method of audio and video play file Pending CN102592628A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012100337158A CN102592628A (en) 2012-02-15 2012-02-15 Play control method of audio and video play file

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012100337158A CN102592628A (en) 2012-02-15 2012-02-15 Play control method of audio and video play file

Publications (1)

Publication Number Publication Date
CN102592628A true CN102592628A (en) 2012-07-18

Family

ID=46481150

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012100337158A Pending CN102592628A (en) 2012-02-15 2012-02-15 Play control method of audio and video play file

Country Status (1)

Country Link
CN (1) CN102592628A (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103581694A (en) * 2012-07-19 2014-02-12 冠捷投资有限公司 Smart TV with human voice search function, intelligent audio-visual system and method for human voice search
CN103838723A (en) * 2012-11-20 2014-06-04 联想(北京)有限公司 Data association method and electronic device
CN103885693A (en) * 2012-12-20 2014-06-25 联想(北京)有限公司 Method for processing information and electronic equipment
CN104284219A (en) * 2013-07-11 2015-01-14 Lg电子株式会社 Mobile terminal and method of controlling the mobile terminal
CN104572714A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 Learning video inquiring system and learning video inquiring method
CN104572716A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 System and method for playing video files
CN104572712A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 Multimedia file browsing system and multimedia file browsing method
CN104809220A (en) * 2015-04-30 2015-07-29 努比亚技术有限公司 Audio playing method and device
CN103581694B (en) * 2012-07-19 2016-11-30 冠捷投资有限公司 Smart TV with human voice search function, intelligent audio-visual system and method for human voice search
WO2018027730A1 (en) * 2016-08-11 2018-02-15 张婧 Method and system for synchronisation in piano video teaching
WO2018027729A1 (en) * 2016-08-11 2018-02-15 张婧 Method and system for teaching video synchronisation in music course
WO2018027731A1 (en) * 2016-08-11 2018-02-15 张婧 Method and system for video synchronisation in english learning
CN108806692A (en) * 2018-05-29 2018-11-13 深圳市云凌泰泽网络科技有限公司 A kind of audio content is searched and visualization playback method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1662053A (en) * 2004-02-24 2005-08-31 皇家飞利浦电子股份有限公司 Program content positioning method and device
CN101281534A (en) * 2008-05-28 2008-10-08 叶睿智 Method for searching multimedia resource based on audio content retrieval
US20090222442A1 (en) * 2005-11-09 2009-09-03 Henry Houh User-directed navigation of multimedia search results
CN101908053A (en) * 2009-11-27 2010-12-08 新奥特(北京)视频技术有限公司 Voice retrieval method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1662053A (en) * 2004-02-24 2005-08-31 皇家飞利浦电子股份有限公司 Program content positioning method and device
US20090222442A1 (en) * 2005-11-09 2009-09-03 Henry Houh User-directed navigation of multimedia search results
CN101281534A (en) * 2008-05-28 2008-10-08 叶睿智 Method for searching multimedia resource based on audio content retrieval
CN101908053A (en) * 2009-11-27 2010-12-08 新奥特(北京)视频技术有限公司 Voice retrieval method and device

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103581694B (en) * 2012-07-19 2016-11-30 冠捷投资有限公司 Smart TV with human voice search function, intelligent audio-visual system and method for human voice search
CN103581694A (en) * 2012-07-19 2014-02-12 冠捷投资有限公司 Smart TV with human voice search function, intelligent audio-visual system and method for human voice search
CN103838723A (en) * 2012-11-20 2014-06-04 联想(北京)有限公司 Data association method and electronic device
CN103838723B (en) * 2012-11-20 2017-04-19 联想(北京)有限公司 Data association method and electronic device
CN103885693A (en) * 2012-12-20 2014-06-25 联想(北京)有限公司 Method for processing information and electronic equipment
CN104284219A (en) * 2013-07-11 2015-01-14 Lg电子株式会社 Mobile terminal and method of controlling the mobile terminal
US9639251B2 (en) 2013-07-11 2017-05-02 Lg Electronics Inc. Mobile terminal and method of controlling the mobile terminal for moving image playback
CN104572712A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 Multimedia file browsing system and multimedia file browsing method
CN104572716A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 System and method for playing video files
CN104572714A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 Learning video inquiring system and learning video inquiring method
CN104809220A (en) * 2015-04-30 2015-07-29 努比亚技术有限公司 Audio playing method and device
WO2018027730A1 (en) * 2016-08-11 2018-02-15 张婧 Method and system for synchronisation in piano video teaching
WO2018027729A1 (en) * 2016-08-11 2018-02-15 张婧 Method and system for teaching video synchronisation in music course
WO2018027731A1 (en) * 2016-08-11 2018-02-15 张婧 Method and system for video synchronisation in english learning
CN108806692A (en) * 2018-05-29 2018-11-13 深圳市云凌泰泽网络科技有限公司 A kind of audio content is searched and visualization playback method

Similar Documents

Publication Publication Date Title
CN102592628A (en) Play control method of audio and video play file
US9942599B2 (en) Methods and apparatus to synchronize second screen content with audio/video programming using closed captioning data
KR101992475B1 (en) Using an audio stream to identify metadata associated with a currently playing television program
EP3175442B1 (en) Systems and methods for performing asr in the presence of heterographs
US9804729B2 (en) Presenting key differences between related content from different mediums
JP2014132464A (en) Interactive type interface device and control method of the same
CN103686200A (en) Intelligent television video resource searching method and system
US9704536B2 (en) Video display device and method for operating the same
US9158435B2 (en) Synchronizing progress between related content from different mediums
US11803589B2 (en) Systems, methods, and media for identifying content
CN103414948A (en) Method and device for playing video
TW201206166A (en) Linking real time media context to related applications and services
CN106210901A (en) Display device
CN104410924B (en) A kind of multimedia titles display methods and device
US10911831B2 (en) Information processing apparatus, information processing method, program, and information processing system
JP5209129B1 (en) Information processing apparatus, broadcast receiving apparatus, and information processing method
KR20200008341A (en) Media play device and method for controlling screen and server for analyzing screen
JP5703321B2 (en) Information processing apparatus and information processing method
KR100944958B1 (en) Device and server that provide multimedia data and caption data of specific section
JP2014207619A (en) Video recording and reproducing device and control method of video recording and reproducing device
JP5840026B2 (en) Content storage apparatus and content storage method
CN102572534A (en) System and method for synchronizing with multimedia broadcast program
KR20150078930A (en) Method of providing content and apparatus therefor

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120718