
CN106909638A - Method and apparatus for discovering hot videos in real time based on user query logs - Google Patents

Method and apparatus for discovering hot videos in real time based on user query logs

Info

Publication number
CN106909638A
Authority
CN
China
Prior art keywords
word
user
atom
video
inquiry log
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710083558.4A
Other languages
Chinese (zh)
Inventor
李力行
姚键
潘柏宇
卢述奇
尹玉宗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
1Verge Internet Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 1Verge Internet Technology Beijing Co Ltd
Priority to CN201710083558.4A
Publication of CN106909638A
Legal status: Pending

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method and apparatus for discovering hot videos in real time based on user query logs. First, the user video query logs collected over a period of time are segmented into words to obtain atomic words. Next, the method counts how many times each atomic word occurs in the user video query logs of that period, and how many times any two atomic words occur together in the same user query. From these counts, the degree of association between any two atomic words in the user video query logs is calculated using pointwise mutual information (PMI), and any two atomic words whose degree of association exceeds a certain threshold are merged into a compound word and put into a compound-word lexicon. Finally, the compound words are sorted in descending order, and a certain proportion of the top-ranked compound words are used as the keywords for discovering hot videos in real time.

Description

Method and apparatus for discovering hot videos in real time based on user query logs
This application is a divisional application of Chinese patent application 201210525735.7, filed on December 7, 2012 and entitled "Method and apparatus for discovering hot videos in real time based on user query logs".
Technical field
The invention belongs to the technical field of Internet data statistics and analysis, and in particular relates to a method and apparatus for discovering hot videos in real time based on user query logs.
Background
With the rapid development of the Internet, users place higher demands on video search results: the results must not only be relevant but also timely, which makes real-time search increasingly important. Real-time video search means searching the information in a video library immediately and quickly, so that content can be found as soon as it is searched for. Through real-time search, users can obtain first-hand information about a hot event at the earliest moment. Compared with conventional search, however, real-time search also brings great challenges. Because hot events are sudden and unpredictable, the related videos are likely to be few in number and to have few clicks, so they rank low in the search results and the user's demand for timeliness cannot really be met.
Because a real-time hotspot is characterized by a large search volume within a short time, analyzing the latest user query logs is the most likely way to find new hot words and hot events, and thereby to make the search ranking more responsive to real-time demand. The difficulty in analyzing user logs is that new words and new hotspots, such as "Europe Cup" and "corridor Wang Fei Liu Mei unicorns", keep emerging in each day's user query logs, and the existing word segmentation program cannot reflect the semantic association of these new words: a character string that semantically forms a single word may be split into several words. Word segmentation programs generally use a lexicon-based method, that is, they scan the character string against a predefined lexicon and find the most suitable segmentation through some matching strategy (forward maximum, reverse maximum, bidirectional matching, and so on). The drawback of this method is that it cannot segment out words that are not in the original lexicon, that is, new words. This defect may make the results of fuzzy matching (matching only part of the query words) unsatisfactory at search time.
Summary of the invention
In view of the problems in the prior art, an object of the invention is to provide a method for discovering hot videos in real time based on user query logs, characterized by comprising the following steps:
S1. Input the user video query logs of a period of time into a word segmentation program, segment each user video query log to obtain the segmentation result of every user query, and take the distinct words that make up the segmentation results as atomic words;
S2. Count the number of times each atomic word occurs in the user video query logs of the period of time;
S3. Based on the segmentation results obtained in step S1, count the number of times any two atomic words occur together in the same user query;
S4. Based on the counts obtained in steps S2 and S3, calculate the degree of association between any two atomic words in the user video query logs using pointwise mutual information (PMI);
S5. Merge any two atomic words whose degree of association calculated in step S4 exceeds a certain threshold into a compound word and put it into a compound-word lexicon;
S6. Sort the compound words in the compound-word lexicon in descending order by the number of times the atomic words constituting each compound word occur in the user video query logs of the period of time, and return a certain proportion of the top-ranked compound words as the keywords for discovering hot videos in real time.
Further, in the method for discovering hot videos in real time based on user query logs according to the invention, the pointwise mutual information (PMI) in step S4 is calculated as follows:
For any two atomic words A and B, their degree of association is expressed as
$$\mathrm{PMI}(A,B) = \log_2 \frac{P(A,B)}{P(A)\,P(B)} \tag{1}$$
where P(A, B) denotes the number of times A and B occur in the same user video query log, and P(A) and P(B) denote the number of times A and B, respectively, occur in the user video query logs of the period of time.
Further, in the method for discovering hot videos in real time based on user query logs according to the invention, the numbers of times are calculated using maximum likelihood estimation.
In addition, the invention provides an apparatus for discovering hot videos in real time based on user query logs, characterized by comprising the following modules:
A word segmentation module, configured to input the user video query logs of a period of time into a word segmentation program, segment each user video query log to obtain the segmentation result of every user query, and take the distinct words that make up the segmentation results as atomic words;
An atomic-word occurrence counting module, configured to count the number of times each atomic word occurs in the user video query logs of the period of time;
An atomic-word co-occurrence counting module, configured to count, based on the segmentation results obtained by the atomic-word occurrence counting module, the number of times any two atomic words occur together in the same user query;
A degree-of-association calculation module, configured to calculate, based on the counts obtained by the atomic-word occurrence counting module and the atomic-word co-occurrence counting module, the degree of association between any two atomic words in the user video query logs using pointwise mutual information (PMI);
A compound-word generation module, configured to merge any two atomic words whose degree of association calculated by the degree-of-association calculation module exceeds a certain threshold into a compound word and put it into a compound-word lexicon;
A hot-keyword determination module, configured to sort the compound words in the compound-word lexicon in descending order by the number of times the atomic words constituting each compound word occur in the user video query logs of the period of time, and to return a certain proportion of the top-ranked compound words as the keywords for discovering hot videos in real time.
Further, in the apparatus for discovering hot videos in real time based on user query logs according to the invention, the pointwise mutual information (PMI) in the degree-of-association calculation module is calculated as follows:
For any two atomic words A and B, their degree of association is expressed as
$$\mathrm{PMI}(A,B) = \log_2 \frac{P(A,B)}{P(A)\,P(B)} \tag{1}$$
where P(A, B) denotes the number of times A and B occur in the same user video query log, and P(A) and P(B) denote the number of times A and B, respectively, occur in the user video query logs of the period of time.
Further, in the apparatus for discovering hot videos in real time based on user query logs according to the invention, the numbers of times are calculated using maximum likelihood estimation.
By applying pointwise mutual information, a concept from information theory, to the analysis of user query logs, the invention addresses both the inaccurate word segmentation caused by the emergence of new words and the discovery of real-time video hotspots. The invention not only has a rigorous theoretical basis but is also simple and efficient to implement in engineering, and it effectively avoids the combinatorial explosion caused by a concatenation approach (exhaustively enumerating all two-word or multi-word combinations). The method makes video hotspot discovery fully automatic, requiring no manual involvement, and maintains a relatively high accuracy while greatly improving efficiency.
Brief description of the drawings
Fig. 1 is a schematic diagram of the principle of the method of the invention for discovering hot videos in real time based on user query logs;
Fig. 2 is a flow chart of the method of the invention for discovering hot videos in real time based on user query logs;
Fig. 3 is a functional block diagram of the apparatus of the invention for discovering hot videos in real time based on user query logs.
Detailed description
To make the above objects, features and advantages of the invention clearer and easier to understand, the invention is described in further detail below with reference to the accompanying drawings and specific embodiments:
Because a real-time hotspot is characterized by a large search volume within a short time, analyzing the latest user query logs is the most likely way to find new hot words and hot events, and thereby to make the search ranking more responsive to real-time demand. Fig. 1 is a schematic diagram of the principle of the method of the invention for discovering hot videos in real time based on user query logs. As shown in Fig. 1, the invention inputs the user query logs of a period of time into the word segmentation program to obtain the segmentation result of every user query; the words extracted here are called atomic words. On this basis, the word frequencies of the atomic words and their co-occurrence counts (two words occurring together in the same user query) are counted, and, using pointwise mutual information (PMI), two or more semantically closely associated atomic words are combined into a compound word, so that a new lexicon is generated iteratively. Finally, the words in the new lexicon are sorted by word frequency, and hot words and hot events are found automatically.
Fig. 2 is a flow chart of the method of the invention for discovering hot videos in real time based on user query logs. As shown in the figure, the method of the invention for discovering real-time video hotspots based on user query logs comprises the following steps:
S1. Input the user video query logs of a period of time into a word segmentation program, segment each user video query log to obtain the segmentation result of every user query, and take the distinct words that make up the segmentation results as atomic words;
The word segmentation program segments each query by forward maximum matching against the existing lexicon.
For example, for the user query "also pearl sound of laughing theme song", the word segmentation program returns "also pearl | sound of laughing | theme song"; that is, the query contains three atomic words: "also pearl", "sound of laughing" and "theme song".
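Forward maximum matching, as named in step S1, can be sketched as follows. This is a minimal illustration, not the segmentation program used in the patent: the function name `forward_max_match`, the `max_len` parameter and the tiny lexicon are hypothetical, and the Chinese strings are an assumed original-language form of the example query, chosen only because this kind of matching operates on unsegmented text.

```python
def forward_max_match(query, lexicon, max_len=4):
    """Greedily take the longest lexicon entry starting at each position."""
    words = []
    i = 0
    while i < len(query):
        # Try the longest candidate first and shrink until the lexicon matches.
        # A single character is always accepted as a fallback, so the loop
        # advances even when the lexicon has no entry at this position.
        for size in range(min(max_len, len(query) - i), 0, -1):
            piece = query[i:i + size]
            if size == 1 or piece in lexicon:
                words.append(piece)
                i += size
                break
    return words


# Hypothetical lexicon and query (assumed form of the example in the text).
lexicon = {"还珠", "格格", "主题曲"}
atoms = forward_max_match("还珠格格主题曲", lexicon)
print(atoms)  # ['还珠', '格格', '主题曲']
```

Because the lexicon has no entry for the full four-character title, the title is split into two atomic words, which is exactly the weakness that the later PMI-based merging is meant to compensate for.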
S2. Count the number of times each atomic word occurs in the user video query logs of the period of time;
For example, statistics from one day of user query logs give: "also pearl" occurs 61,661 times, "sound of laughing" occurs 65,564 times, and "theme song" occurs 306,050 times.
S3. Based on the segmentation results obtained in step S1, count the number of times any two atomic words occur together in the same user query;
For example, statistics from one day of user query logs give: "also pearl sound of laughing" occurs 60,245 times and "sound of laughing theme song" occurs 1,505 times.
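The counting in steps S2 and S3 can be sketched with standard-library counters. This is only an illustration under stated assumptions: the function name `count_words_and_pairs` is hypothetical, each word is counted at most once per query (so the counts are numbers of queries), and pairs are treated as unordered, whereas a real implementation would probably also keep word order so that compound words can be assembled correctly.

```python
from collections import Counter
from itertools import combinations


def count_words_and_pairs(segmented_queries):
    """Return (word_freq, pair_freq) as numbers of queries containing each item."""
    word_freq = Counter()
    pair_freq = Counter()
    for words in segmented_queries:
        # Deduplicate and sort so each word and each unordered pair is
        # counted once per query and always stored under the same key.
        unique = sorted(set(words))
        word_freq.update(unique)
        pair_freq.update(combinations(unique, 2))
    return word_freq, pair_freq


# Toy input: three already-segmented queries.
word_freq, pair_freq = count_words_and_pairs([
    ["a", "b", "c"],
    ["a", "b"],
    ["b", "b"],
])
print(word_freq["b"], pair_freq[("a", "b")])  # 3 2
```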
S4. Based on the counts obtained in steps S2 and S3, calculate the degree of association between any two atomic words in the user video query logs using pointwise mutual information (PMI);
Pointwise mutual information (PMI) is used to characterize the degree of association between two words in the user query logs. The basic idea of the method is described below.
Calculation of pointwise mutual information
PMI is a classical concept in information theory, used to measure the correlation between two random events. We consider PMI equally applicable to calculating the degree of association between two words in video search. Intuitively, if analysis of the user query logs shows that two words co-occur in the same query many times, the two words are very likely to be mergeable into a compound word. The calculation of PMI is given below.
For any two words A and B, their degree of association is expressed as
$$\mathrm{PMI}(A,B) = \log_2 \frac{P(A,B)}{P(A)\,P(B)} \tag{1}$$
where P(A, B) denotes the number of times A and B co-occur, and P(A) and P(B) denote the numbers of times A and B occur, respectively.
From the formula above, if A and B are independent, the value of PMI(A, B) is 0; if A and B are associated (here meaning that they co-occur), then PMI(A, B) > 0, and the higher the degree of association, the larger the PMI value.
When maximum likelihood estimation is used to estimate the count parameters, formula (1) is equivalent to
$$\mathrm{PMI}(A,B) = \log_2 \frac{\mathrm{freq}(A,B)\cdot|Q|}{\mathrm{freq}(A)\cdot\mathrm{freq}(B)} \tag{2}$$
where freq(A, B) denotes the number of user queries containing both A and B, freq(A) and freq(B) denote the numbers of user queries containing A and B, respectively, and |Q| denotes the total number of user queries in the period of time.
By calculating PMI, a numerical value representing the degree of association can be assigned to any two words; on this basis, the associations between words can easily be compared and compound words generated.
For example, if statistics give a total of 42,567,550 user queries in one day, the PMI of each of the two word pairs above can be obtained from formula (2).
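The numerical results of this example are not reproduced in this copy of the text, so the sketch below recomputes them from the counts quoted above. It assumes the standard maximum-likelihood form of PMI, in which each probability is estimated as a query count divided by the total number of queries; the function name `pmi` and its parameter names are hypothetical.

```python
import math


def pmi(freq_ab, freq_a, freq_b, total_queries):
    """PMI in bits, with probabilities estimated as count / total_queries."""
    return math.log2(freq_ab * total_queries / (freq_a * freq_b))


total = 42_567_550  # total user queries in one day, as quoted in the text

# "also pearl" (61,661) and "sound of laughing" (65,564), co-occurring 60,245 times.
pmi_title = pmi(60_245, 61_661, 65_564, total)

# "sound of laughing" (65,564) and "theme song" (306,050), co-occurring 1,505 times.
pmi_song = pmi(1_505, 65_564, 306_050, total)

print(round(pmi_title, 2), round(pmi_song, 2))  # 9.31 1.67
```

Under these assumptions the first pair scores far higher than the second, which matches the merge decision described for the 3.83 threshold in the following step.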
S5. Merge any two atomic words whose degree of association calculated in step S4 exceeds a certain threshold into a compound word and put it into a compound-word lexicon;
For example, taking the mean PMI calculated from one day of user query logs, 3.83, as the threshold, "also pearl" and "sound of laughing" in the example above have a degree of association above the threshold and can therefore be merged into the compound word "also pearl sound of laughing", whereas "sound of laughing" and "theme song" cannot be merged because their degree of association is too low.
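The threshold test of step S5 can be sketched as a simple filter. The function name `merge_pairs`, the `joiner` parameter and the fallback to the mean PMI of the supplied pairs are assumptions made for illustration; the two scores in the usage lines are approximate values that the counts quoted in the text yield under the maximum-likelihood form of PMI, not figures stated in the text.

```python
def merge_pairs(pair_pmi, threshold=None, joiner=" "):
    """Return ({compound: score}, threshold) for pairs scoring above the threshold."""
    if not pair_pmi:
        return {}, threshold
    if threshold is None:
        # Assumed default: use the mean PMI of all candidate pairs.
        threshold = sum(pair_pmi.values()) / len(pair_pmi)
    compounds = {
        joiner.join(pair): score
        for pair, score in pair_pmi.items()
        if score > threshold
    }
    return compounds, threshold


# Illustrative scores (approximate, recomputed from the quoted counts).
scores = {
    ("also pearl", "sound of laughing"): 9.31,
    ("sound of laughing", "theme song"): 1.67,
}
compounds, used_threshold = merge_pairs(scores, threshold=3.83)
print(sorted(compounds))  # ['also pearl sound of laughing']
```

Running the same filter again on a log re-segmented with the enlarged lexicon would give the iterative behavior the description mentions, since a compound word can then pair with a further atomic word.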
S6. Sort the compound words in the compound-word lexicon in descending order by the number of times the atomic words constituting each compound word occur in the user video query logs of the period of time, and return a certain proportion of the top-ranked compound words as the keywords for discovering hot videos in real time.
For example, analysis of one day of user query logs discovered about 150,000 new hot words, of which the top-ranked include "distorting the truth by despicable means" (584,435 times), "happy base camp" (485,773 times) and "Must Be yours" (476,852 times).
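The ranking in step S6 can be sketched as a sort followed by a cut at a chosen proportion. The text does not fix how the ranking count is derived or what the proportion is, so this sketch assumes that each compound word already carries a single frequency and uses a hypothetical `proportion` parameter; the function name `top_compounds` and the fourth, low-frequency entry are made up for illustration.

```python
def top_compounds(compound_freq, proportion=0.1):
    """Return the top `proportion` of compound words by descending frequency."""
    ranked = sorted(compound_freq.items(), key=lambda item: item[1], reverse=True)
    # Keep at least one entry so that a small lexicon still yields a keyword.
    keep = max(1, int(len(ranked) * proportion))
    return [word for word, _ in ranked[:keep]]


# Frequencies for the first three entries are the ones quoted in the text.
freqs = {
    "happy base camp": 485_773,
    "hypothetical rare compound": 10,
    "distorting the truth by despicable means": 584_435,
    "Must Be yours": 476_852,
}
hot_keywords = top_compounds(freqs, proportion=0.5)
print(hot_keywords)
# ['distorting the truth by despicable means', 'happy base camp']
```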
Fig. 3 is a functional block diagram of the apparatus of the invention for discovering hot videos in real time based on user query logs. As shown in the figure, the apparatus of the invention for discovering hot videos in real time based on user query logs comprises the following modules:
A word segmentation module 1, configured to input the user video query logs of a period of time into a word segmentation program, segment each user video query log to obtain the segmentation result of every user query, and take the distinct words that make up the segmentation results as atomic words;
An atomic-word occurrence counting module 2, configured to count the number of times each atomic word occurs in the user video query logs of the period of time;
An atomic-word co-occurrence counting module 3, configured to count, based on the segmentation results obtained by the atomic-word occurrence counting module, the number of times any two atomic words occur together in the same user query;
A degree-of-association calculation module 4, configured to calculate, based on the counts obtained by the atomic-word occurrence counting module and the atomic-word co-occurrence counting module, the degree of association between any two atomic words in the user video query logs using pointwise mutual information (PMI);
A compound-word generation module 5, configured to merge any two atomic words whose degree of association calculated by the degree-of-association calculation module exceeds a certain threshold into a compound word and put it into a compound-word lexicon;
A hot-keyword determination module 6, configured to sort the compound words in the compound-word lexicon in descending order by the number of times the atomic words constituting each compound word occur in the user video query logs of the period of time, and to return a certain proportion of the top-ranked compound words as the keywords for discovering hot videos in real time.
By applying pointwise mutual information, a concept from information theory, to the analysis of user query logs, the invention addresses both the inaccurate word segmentation caused by the emergence of new words and the discovery of real-time video hotspots. The invention not only has a rigorous theoretical basis but is also simple and efficient to implement in engineering, and it effectively avoids the combinatorial explosion caused by a concatenation approach (exhaustively enumerating all two-word or multi-word combinations). The method makes video hotspot discovery fully automatic, requiring no manual involvement, and maintains a relatively high accuracy while greatly improving efficiency. In a test of the proposed method on about 50,000,000 user video query logs from one day on a video website, six iterations of PMI calculation automatically produced a total of 150,000 compound words, with an accuracy above 85%.
The above is a detailed description of the preferred embodiments of the invention, but a person of ordinary skill in the art should appreciate that, within the scope of the invention and guided by its spirit, various improvements, additions and substitutions are possible, for example adjusting the order of interface calls, changing message formats and contents, or implementing the invention in a different programming language (such as C, C++ or Java). All of these fall within the scope of protection defined by the claims of the invention.

Claims (6)

1. A method for discovering hot videos in real time based on user query logs, characterized by comprising the following steps:
S1. Input the user video query logs of a period of time into a word segmentation program, segment each user video query log to obtain the segmentation result of every user query, and take the distinct words that make up the segmentation results as atomic words;
S2. Count the number of times each atomic word occurs in the user video query logs of the period of time;
S3. Based on the segmentation results obtained in step S1, count the number of times any two atomic words occur together in the same user query;
S4. Based on the counts obtained in steps S2 and S3, calculate the degree of association between any two atomic words in the user video query logs using pointwise mutual information;
S5. Merge any two atomic words whose degree of association calculated in step S4 exceeds a certain threshold into a compound word and put it into a compound-word lexicon;
S6. Sort the compound words in the compound-word lexicon in descending order by the number of times the atomic words constituting each compound word occur in the user video query logs of the period of time, and return a certain proportion of the top-ranked compound words as the keywords for discovering hot videos in real time.
2. The method for discovering hot videos in real time based on user query logs according to claim 1, characterized in that the pointwise mutual information in step S4 is calculated as follows:
For any two atomic words A and B, their degree of association is expressed as
$$\mathrm{PMI}(A,B) = \log_2 \frac{P(A,B)}{P(A)\,P(B)} \tag{1}$$
where P(A, B) denotes the number of times A and B occur in the same user video query log, and P(A) and P(B) denote the number of times A and B, respectively, occur in the user video query logs of the period of time.
3. The method for discovering hot videos in real time based on user query logs according to claim 2, characterized in that the numbers of times are calculated using maximum likelihood estimation.
4. An apparatus for discovering hot videos in real time based on user query logs, characterized by comprising the following modules:
A word segmentation module, configured to input the user video query logs of a period of time into a word segmentation program, segment each user video query log to obtain the segmentation result of every user query, and take the distinct words that make up the segmentation results as atomic words;
An atomic-word occurrence counting module, configured to count the number of times each atomic word occurs in the user video query logs of the period of time;
An atomic-word co-occurrence counting module, configured to count, based on the segmentation results obtained by the atomic-word occurrence counting module, the number of times any two atomic words occur together in the same user query;
A degree-of-association calculation module, configured to calculate, based on the counts obtained by the atomic-word occurrence counting module and the atomic-word co-occurrence counting module, the degree of association between any two atomic words in the user video query logs using pointwise mutual information;
A compound-word generation module, configured to merge any two atomic words whose degree of association calculated by the degree-of-association calculation module exceeds a certain threshold into a compound word and put it into a compound-word lexicon;
A hot-keyword determination module, configured to sort the compound words in the compound-word lexicon in descending order by the number of times the atomic words constituting each compound word occur in the user video query logs of the period of time, and to return a certain proportion of the top-ranked compound words as the keywords for discovering hot videos in real time.
5. The apparatus for discovering hot videos in real time based on user query logs according to claim 4, characterized in that the pointwise mutual information in the degree-of-association calculation module is calculated as follows:
For any two atomic words A and B, their degree of association is expressed as
$$\mathrm{PMI}(A,B) = \log_2 \frac{P(A,B)}{P(A)\,P(B)} \tag{1}$$
where P(A, B) denotes the number of times A and B occur in the same user video query log, and P(A) and P(B) denote the number of times A and B, respectively, occur in the user video query logs of the period of time.
6. The apparatus for discovering hot videos in real time based on user query logs according to claim 5, characterized in that the numbers of times are calculated using maximum likelihood estimation.
CN201710083558.4A 2012-12-07 2012-12-07 Method and apparatus for discovering hot videos in real time based on user query logs Pending CN106909638A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710083558.4A CN106909638A (en) 2012-12-07 2012-12-07 Method and apparatus for discovering hot videos in real time based on user query logs

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210525735.7A CN103020212B (en) 2012-12-07 2012-12-07 A method and device for discovering hot videos in real time based on user query logs
CN201710083558.4A CN106909638A (en) 2012-12-07 2012-12-07 Method and apparatus for discovering hot videos in real time based on user query logs

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201210525735.7A Division CN103020212B (en) 2012-12-07 2012-12-07 A method and device for discovering hot videos in real time based on user query logs

Publications (1)

Publication Number Publication Date
CN106909638A true CN106909638A (en) 2017-06-30

Family

ID=47968816

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201210525735.7A Expired - Fee Related CN103020212B (en) 2012-12-07 2012-12-07 A method and device for discovering hot videos in real time based on user query logs
CN201710083558.4A Pending CN106909638A (en) 2012-12-07 2012-12-07 Method and apparatus for discovering hot videos in real time based on user query logs

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN201210525735.7A Expired - Fee Related CN103020212B (en) 2012-12-07 2012-12-07 A method and device for discovering hot videos in real time based on user query logs

Country Status (1)

Country Link
CN (2) CN103020212B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107832444A (en) * 2017-11-21 2018-03-23 北京百度网讯科技有限公司 Event based on search daily record finds method and device
CN108681564A (en) * 2018-04-28 2018-10-19 北京京东尚科信息技术有限公司 The determination method, apparatus and computer readable storage medium of keyword and answer

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103593469B (en) * 2013-11-30 2016-04-20 合一网络技术(北京)有限公司 A kind of association keyword calculation method and device adopting complementary information
CN104199898B (en) * 2014-08-26 2018-05-15 北京小度互娱科技有限公司 Acquisition methods and device, the method for pushing and device of a kind of attribute information
CN105677664B (en) * 2014-11-19 2019-11-19 腾讯科技(深圳)有限公司 Method and device is determined based on the tightness of web search
CN105025339A (en) * 2015-07-22 2015-11-04 天脉聚源(北京)传媒科技有限公司 Classified search method and device for programs in TV box
CN105898425A (en) * 2015-12-14 2016-08-24 乐视网信息技术(北京)股份有限公司 Video recommendation method and system and server
CN108170693B (en) * 2016-12-07 2020-07-31 北京国双科技有限公司 Hot word pushing method and device
CN106682411B (en) * 2016-12-22 2019-04-16 浙江大学 A method for converting physical examination diagnostic data into disease labels
CN107992473B (en) * 2017-11-24 2021-04-27 国家计算机网络与信息安全管理中心 Fraud information feature word extraction method and system based on point-to-point mutual information technology
CN108509555B (en) * 2018-03-22 2021-07-23 武汉斗鱼网络科技有限公司 Search term determination method, device, device and storage medium
CN108763196A (en) * 2018-05-03 2018-11-06 上海海事大学 A kind of keyword extraction method based on PMI
CN110188261A (en) * 2019-05-30 2019-08-30 江苏神州信源系统工程有限公司 A kind of hot spot object determines method and device
CN116028558A (en) * 2023-01-10 2023-04-28 上海中通吉网络技术有限公司 Method, system, storage medium and equipment for displaying association words

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101046809A (en) * 2006-03-28 2007-10-03 吴风勇 New word identification method based on association rule model
CN101187921A (en) * 2007-12-20 2008-05-28 腾讯科技(深圳)有限公司 Chinese compound words extraction method and system
CN101833587A (en) * 2010-05-28 2010-09-15 上海交通大学 Internet Video Search System
CN102043845A (en) * 2010-12-08 2011-05-04 百度在线网络技术(北京)有限公司 Method and equipment for extracting core keywords based on query sequence cluster
CN102200984A (en) * 2010-03-24 2011-09-28 深圳市腾讯计算机系统有限公司 Search method based on compound words and search engine server

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102253982B (en) * 2011-06-24 2013-03-20 北京理工大学 Query suggestion method based on query semantics and click-through data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101046809A (en) * 2006-03-28 2007-10-03 吴风勇 New word identification method based on association rule model
CN101187921A (en) * 2007-12-20 2008-05-28 腾讯科技(深圳)有限公司 Chinese compound words extraction method and system
CN102200984A (en) * 2010-03-24 2011-09-28 深圳市腾讯计算机系统有限公司 Search method based on compound words and search engine server
CN101833587A (en) * 2010-05-28 2010-09-15 上海交通大学 Internet Video Search System
CN102043845A (en) * 2010-12-08 2011-05-04 百度在线网络技术(北京)有限公司 Method and equipment for extracting core keywords based on query sequence cluster

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张锋 等: "基于互信息的中文术语抽取系统", 《计算机应用研究》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107832444A (en) * 2017-11-21 2018-03-23 北京百度网讯科技有限公司 Event discovery method and device based on search log
CN107832444B (en) * 2017-11-21 2021-08-13 北京百度网讯科技有限公司 Event discovery method and device based on search log
CN108681564A (en) * 2018-04-28 2018-10-19 北京京东尚科信息技术有限公司 Method, device and computer-readable storage medium for determining keywords and answers
CN108681564B (en) * 2018-04-28 2021-06-29 北京京东尚科信息技术有限公司 Method, device and computer-readable storage medium for determining keywords and answers

Also Published As

Publication number Publication date
CN103020212A (en) 2013-04-03
CN103020212B (en) 2017-05-10

Similar Documents

Publication Publication Date Title
CN106909638A (en) A kind of method and apparatus for finding hot video in real time based on user's inquiry log
US9529895B2 (en) Method and system for discovering dynamic relations among entities
CN106557558B (en) Data analysis method and device
US9317550B2 (en) Query expansion
CN101119326B (en) A management method and device for instant messaging session records
JP6355840B2 (en) Stopword identification method and apparatus
CN103258025B (en) Method for generating co-occurrence keywords, and method and system for providing associated search words
CN103593336B (en) Knowledge pushing system and method based on semantic analysis
US9665561B2 (en) System and method for performing analysis on information, such as social media
CN104216954A (en) Prediction device and prediction method for state of emergency topic
CN108304444A (en) Information query method and device
JP2009151760A (en) Method and system for calculating competitiveness metric between objects
CN114357159B (en) Event clustering/context construction method and related devices, equipment and storage medium
CN105956197A (en) Social media graph representation model-based social risk event extraction method
Huang et al. Kb-enabled query recommendation for long-tail queries
CN104503991B (en) Information search method and device
CN106682049B (en) Issue display system and issue display method
CN110096646A (en) The generation of category related information and its video pushing method and relevant device
Plummer et al. Analysing the Sentiment Expressed by Political Audiences on Twitter: The case of the 2017 UK general election
CN108197335B (en) A method and device for personalized query recommendation based on user behavior
Hongwimol et al. ESRA: Explainable scientific research assistant
Lee et al. Exploiting online social data in ontology learning for event tracking and emergency response
Nguyen et al. Pagerank-based approach on ranking social events: a case study with flickr
CN118132818B (en) Tourist area resource assessment method based on image difference
CN119961380B (en) Intelligent searching method and system for government affair data based on Internet of things

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100080 Areas A and C, 5/F, Block A, Sinosteel International Plaza, No. 8, Haidian Street, Haidian District, Beijing

Applicant after: Youku network technology (Beijing) Co.,Ltd.

Address before: 100080 Areas A and C, 5/F, Block A, Sinosteel International Plaza, No. 8, Haidian Street, Haidian District, Beijing

Applicant before: 1VERGE INTERNET TECHNOLOGY (BEIJING) Co.,Ltd.

TA01 Transfer of patent application right

Effective date of registration: 20200615

Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Applicant after: Alibaba (China) Co.,Ltd.

Address before: 100080 Areas A and C, 5/F, Block A, Sinosteel International Plaza, No. 8, Haidian Street, Haidian District, Beijing

Applicant before: Youku network technology (Beijing) Co.,Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20170630
