CN102004772A - Method and equipment for sequencing search results according to terms - Google Patents
- Publication number
- CN102004772A
- Authority
- CN
- China
- Prior art keywords
- internet resources
- relevant information
- user
- message
- unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Information Transfer Between Computers (AREA)
Abstract
The invention relates to a method and equipment for ranking search results according to search terms. A network device first acquires an input sequence entered by a user through user equipment and divides the input sequence into a plurality of information units. It then acquires, based on those information units, the relevant information of a plurality of network resources associated with each information unit. Finally, it sorts the relevant information of the network resources according to the number of information units associated with each item of relevant information, so that the search result items of the sorted network resources are provided to the user equipment in order. Compared with the prior art, the invention can provide the user with the search results most relevant to the input sequence, which spares the user from repeatedly changing search terms, improves the user experience, and also reduces the retrieval load on the network device.
Description
Technical field
The present invention relates to the field of computers, and in particular to a method and apparatus for ranking search results according to search terms.
Background
When prior-art search schemes sort the multiple retrieval results obtained for a search term entered by a user, the ordering is often determined by how many times users have selected each result, that is, by click-through rate. As a result, retrieval results that some users do not actually need are easily ranked near the top, while the results the user really wants are ranked lower, which degrades the user experience.
Summary of the invention
The object of the invention is to provide a method and apparatus for ranking search results according to search terms.
According to one aspect of the invention, a method for ranking search results according to search terms is provided, the method comprising the following steps:
A. acquiring an input sequence entered by a user via user equipment;
B. dividing the input sequence into a plurality of information units;
C. acquiring, based on the plurality of information units, the relevant information of a plurality of network resources associated with each information unit;
D. sorting the relevant information of the plurality of network resources according to the number of information units associated with each item of relevant information, so that the search result items of the sorted network resources are provided to the user equipment in order.
According to another aspect of the invention, a network device for ranking search results according to search terms is also provided, the network device comprising:
a first acquiring means for acquiring an input sequence entered by a user via user equipment;
a segmenting means for dividing the input sequence into a plurality of information units;
a second acquiring means for acquiring, based on the plurality of information units, the relevant information of a plurality of network resources associated with each information unit;
a sorting means for sorting the relevant information of the plurality of network resources according to the number of information units associated with each item of relevant information, so that the search result items of the sorted network resources are provided to the user equipment in order.
Compared with the prior art, the invention has the following advantages: the search results most relevant to the user's input sequence can be provided to the user, which spares the user from repeatedly changing search terms, improves the user experience, and also reduces the retrieval load on the network device.
Brief description of the drawings
Other features, objects, and advantages of the invention will become more apparent from the following detailed description of non-limiting embodiments, given with reference to the accompanying drawings:
Fig. 1 is a flowchart of a method, in a network device, for ranking search results according to search terms, in accordance with one aspect of the invention;
Fig. 2 is a flowchart of a method, in a network device, for ranking search results according to search terms, in accordance with another aspect of the invention;
Fig. 3 is a flowchart of a method, in a network device, for ranking search results according to search terms, in accordance with another aspect of the invention;
Fig. 4 is a schematic structural diagram of a network device for ranking search results according to search terms, in accordance with one aspect of the invention;
Fig. 5 is a schematic structural diagram of a network device for ranking search results according to search terms, in accordance with another aspect of the invention;
Fig. 6 is a schematic structural diagram of a network device for ranking search results according to search terms, in accordance with another aspect of the invention.
The same or similar reference numerals in the drawings denote the same or similar parts.
Detailed description
The invention is described in further detail below with reference to the accompanying drawings.
Fig. 1 shows a flowchart of a method, in a network device, for ranking search results according to search terms, in accordance with one aspect of the invention. The network device includes, but is not limited to: 1) a set of multiple network servers; 2) a distributed network device; 3) a cloud, based on cloud computing, made up of a large number of computers or network servers. Cloud computing is a form of distributed computing: a super virtual computer composed of a group of loosely coupled computers.
Specifically, in step S1, the network device acquires the input sequence entered by the user via the user equipment. The user equipment may be any electronic product capable of human-machine interaction with the user through a keyboard, mouse, remote control, touch pad, or voice-control device, including but not limited to a computer, smartphone, PDA, or IPTV. Taking the keyboard as an example, suppose the user types the input sequence "World Expo history introduction" into the search box displayed by the user equipment. The user equipment sends this input sequence to the network device over a network, and the network device thereby acquires the input sequence entered by the user. The network connecting the user equipment and the network device includes, but is not limited to: the Internet, a wide area network, a metropolitan area network, a local area network, a VPN, or a wireless ad hoc network. Those skilled in the art should understand that the way the network device acquires the input sequence is not limited to the above; for example, the user equipment may first send the input sequence to a relay device, which then forwards it to the network device.
Next, in step S2, the network device divides the input sequence into a plurality of information units. The network device may divide the input sequence based on a second preset rule, which divides the input sequence into information units with reference to at least one of the following factors:
1. Semantic analysis. The modes of analysis include, but are not limited to:
1) Comparing the input sequence with the independent entries contained in a dictionary to obtain the information units contained in the input sequence. For example, if the dictionary contains "World Expo", "history", and "introduction" as independent entries, the network device compares the input sequence "World Expo history introduction" with each independent entry in the dictionary and, based on this semantic analysis, divides the input sequence into three information units: "World Expo", "history", and "introduction";
2) Using specific words, such as particles, to obtain the information units contained in the input sequence. For example, for the input sequence "history of World Expo", the particle "of" is used to divide the input sequence into three information units: "World Expo", "of", and "history";
3) Using syntactic analysis to determine the sentence pattern of the input sequence and dividing it into information units accordingly. For example, a "noun + verb + noun" structure is judged to be a subject-predicate-object pattern, and the sequence is divided according to that structure. For the input sequence "World Expo closing date is which day", the network device divides it by subject, predicate, and object into three information units: "World Expo closing date", "is", and "which day". Those skilled in the art should understand that sentence patterns are not limited to the above and differ between languages; they are not enumerated in detail here.
In addition, those skilled in the art should also understand that semantics-based division of the input sequence by the network device is not limited to the three modes shown above.
2. Dividing the input sequence according to user-related information. User-related information includes, but is not limited to:
A) The user's preference settings. The user can configure several division modes according to personal preference, including but not limited to: dividing the input sequence by single characters, by short words, or by long words. If the user's preference is set to division by short words, the network device divides the input sequence "World Expo history introduction" into three information units: "World Expo", "history", and "introduction". If the preference is set to division by long words, the network device divides the same input sequence into two information units: "World Expo history" and "introduction";
B) The user's personal attributes, including but not limited to sex, age, education level, income, and occupation. The network device may use different division modes for users with different attributes. For example, for users whose education level is "university" or above, the network device divides the input sequence by long words; for users whose education level is "junior middle school" or below, it divides the input sequence by single characters. The user's personal attributes may be provided by the user or inferred from the user's historical behavior;
C) The user's historical behavior. For example, if the user previously entered "World Expo history", the network device uses this historical input to divide the input sequence "World Expo history introduction" into two information units: "World Expo history" and "introduction".
The above describes in detail how the network device divides the input sequence based on the second preset rule, but those skilled in the art should understand that the ways of dividing the input sequence are not limited to those described.
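The dictionary comparison and the short-word/long-word preference described above can be illustrated with a minimal sketch. It assumes a whitespace-separated English query and a small hypothetical dictionary; the function name `segment`, the `prefer` parameter, and the greedy matching strategy are illustrative assumptions, not something the patent text specifies.

```python
# Hypothetical dictionary of independent entries (an assumption for illustration).
DICTIONARY = {"world expo", "history", "introduction", "world expo history"}

def segment(input_sequence, dictionary, prefer="long"):
    """Split an input sequence into information units by dictionary matching.

    prefer="long" tries the longest dictionary entry first (long-word preference);
    prefer="short" tries the shortest entry first (short-word preference).
    """
    words = input_sequence.split()
    units, i = [], 0
    while i < len(words):
        remaining = len(words) - i
        # Candidate span lengths at position i, ordered by the chosen preference.
        lengths = range(remaining, 0, -1) if prefer == "long" else range(1, remaining + 1)
        for n in lengths:
            candidate = " ".join(words[i:i + n])
            if candidate in dictionary:
                units.append(candidate)
                i += n
                break
        else:
            # No dictionary entry starts here: keep the single word as its own unit.
            units.append(words[i])
            i += 1
    return units

short_units = segment("world expo history introduction", DICTIONARY, prefer="short")
long_units = segment("world expo history introduction", DICTIONARY, prefer="long")
```

With this dictionary, the short-word setting yields three units and the long-word setting yields two, mirroring the preference example in the text. A production segmenter for Chinese would operate on characters rather than whitespace-separated words.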
Next, in step S3, the network device acquires, based on the plurality of information units, the relevant information of a plurality of network resources associated with each information unit. The network resources include, but are not limited to, web pages, pictures, video, and audio. The relevant information of a network resource includes, but is not limited to: a title, keywords, an abstract, a brief introduction to a picture, a song list, a list of song authors, a brief introduction to video content, or the full content of a web page. The ways in which the network device acquires the relevant information of the network resources associated with each information unit include, but are not limited to:
1. Acquiring the relevant information of the plurality of network resources from an index database based on the plurality of information units. This kind of index database contains the titles, keywords, abstracts, picture introductions, song lists, song author lists, video content introductions, or even the full content of the web pages, pictures, video, or audio associated with various information units, so the network device can acquire the relevant information of network resources directly from it. For example, for the information unit "World Expo", the relevant information acquired from the index database includes the title of the web page at URL A and the abstract of the web page at URL B; for the information unit "history", it includes the abstract of the web page at URL B and the author list at URL D. Those skilled in the art should understand that relevant information acquired from the index database for one information unit may also be associated with other information units. For example, the abstract of the web page at URL B mentioned above is associated with both the information unit "World Expo" and the information unit "history".
2. Based on the plurality of information units, the network device first acquires from an index database the links to the relevant information of the network resources associated with each information unit, and then acquires the relevant information of the network resources through those links. For example, the network device first acquires from the index database the links associated with the information units "World Expo" and "introduction" respectively. The links related to "World Expo" in the index database include 202.198.2.2 and 202.198.2.3, and the links related to "introduction" include 230.10.10.10. Link 202.198.2.2 corresponds to web page E; link 202.198.2.3 corresponds to the keywords of the web page at URL F; link 230.10.10.10 corresponds to the video content introduction at URL I. The network device then acquires the relevant information of the network resources through each link: through link 202.198.2.2 it obtains web page E, through link 202.198.2.3 it obtains the keywords of the web page at URL F, and through link 230.10.10.10 it obtains the video content introduction at URL I. Those skilled in the art should understand that, although the network device obtains a link from the index database based on one information unit and then obtains relevant information through that link, this does not mean the relevant information contains only that information unit; it may also contain other information units. For example, the relevant information associated with the information unit "World Expo" includes the keywords of the web page at URL F, and those keywords contain not only the information unit "World Expo" but also the information unit "history".
In addition, those skilled in the art should understand that the above is given only to better explain the technical solution of the invention and not to limit it; the ways in which the network device acquires the relevant information of network resources are not limited to acquisition from an index database.
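The first acquisition mode above behaves like a lookup in an inverted index. The following is a minimal sketch under that assumption: the `INDEX` contents, the `(URL label, kind)` tuples, and the function name `lookup` are hypothetical and only reproduce the "World Expo"/"history" example from the text.

```python
from collections import defaultdict

# Hypothetical index database: information unit -> relevant information of
# network resources, each identified by (URL label, kind of relevant information).
INDEX = {
    "World Expo": [("A", "title"), ("B", "abstract")],
    "history": [("B", "abstract"), ("D", "author list")],
}

def lookup(units, index):
    """Return {relevant information: set of information units associated with it}."""
    matches = defaultdict(set)
    for unit in units:
        # Units missing from the index simply contribute nothing.
        for info in index.get(unit, []):
            matches[info].add(unit)
    return dict(matches)

found = lookup(["World Expo", "history"], INDEX)
```

Here the abstract of the page at URL B ends up associated with both "World Expo" and "history", which is the overlap the text points out. Recording the set of associated units per item is a design choice that makes the later count-based sorting straightforward.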
Preferably, before acquiring the relevant information of the network resources associated with each information unit, the network device may first examine each information unit to identify any invalid information units. For an invalid information unit, the network device does not acquire the relevant information of the network resources associated with it. The network device may identify invalid information units based on an invalid-information list: an information unit whose part of speech or sentence element appears in the list is regarded as invalid. The parts of speech in the invalid-information list include, but are not limited to, particles, interrogatives, and verbs; the sentence elements in the list include, but are not limited to, predicates. For example, the input sequence "history of World Expo" is divided into three information units, "World Expo", "of", and "history". The part of speech of the information unit "of" is particle, so the network device identifies "of" as an invalid information unit and does not acquire from the index database the relevant information of network resources associated with "of". As another example, the network device divides the input sequence "World Expo closing date is which day" by its subject-predicate-object structure into three information units, "World Expo closing date", "is", and "which day". The information unit "is" is a predicate, which appears in the invalid-information list, so the network device identifies it as invalid and likewise does not acquire from the index database the relevant information of network resources associated with "is". Those skilled in the art should understand that the way the network device identifies invalid information units is not limited to the invalid-information list; identification may also be based on a preset rule. Such preset rules include, but are not limited to, treating the last information unit in the input sequence as invalid according to the order of the information units. For example, the information unit "which day" is the last information unit in the input sequence "World Expo closing date is which day", so the network device identifies it as an invalid information unit.
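The filtering of invalid information units can be sketched as follows. The sketch assumes each information unit already carries a part-of-speech tag and a sentence-element tag (how those tags are produced is outside this passage); the tag names, the `filter_invalid` function, and the `drop_last` flag are hypothetical.

```python
# Assumed contents of the invalid-information list, following the examples in the text.
INVALID_PARTS_OF_SPEECH = {"particle", "interrogative", "verb"}
INVALID_SENTENCE_ELEMENTS = {"predicate"}

def filter_invalid(tagged_units, drop_last=False):
    """Keep only valid units.

    tagged_units: list of (unit, part_of_speech, sentence_element) tuples.
    drop_last: if True, also apply the preset rule that treats the last
               information unit of the input sequence as invalid.
    """
    valid = []
    last = len(tagged_units) - 1
    for position, (unit, part_of_speech, element) in enumerate(tagged_units):
        if part_of_speech in INVALID_PARTS_OF_SPEECH:
            continue
        if element in INVALID_SENTENCE_ELEMENTS:
            continue
        if drop_last and position == last:
            continue
        valid.append(unit)
    return valid

# Hypothetical tagging of the two example queries.
query_1 = [("World Expo", "noun", None), ("of", "particle", None), ("history", "noun", None)]
query_2 = [
    ("World Expo closing date", "noun", "subject"),
    ("is", "verb", "predicate"),
    ("which day", "pronoun", "object"),
]
kept_1 = filter_invalid(query_1)
kept_2 = filter_invalid(query_2, drop_last=True)
```

Only the units that survive this filter would be looked up in the index database, so particles such as "of" never trigger a retrieval.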
Next, in step S4, the network device sorts the acquired relevant information of the plurality of network resources according to the number of information units associated with each item of relevant information, so that the search result items of the sorted network resources are provided to the user equipment in order. The search result item of a network resource includes, but is not limited to, a link to the network resource and/or a description of it. For example, if the relevant information is the abstract of the web page at URL A, the corresponding search result item may be the link to URL A, a brief introduction to URL A, or multimedia files such as pictures, audio, or video associated with URL A. For brevity, the examples below use the link to URL A as the search result item, but those skilled in the art should understand that search result items are not limited to this.
In more detail, suppose the relevant information acquired by the network device comprises the abstract of the web page at URL A, the title of the web page at URL B, and the keywords of the web page at URL C. The abstract of page A is associated with three information units ("World Expo", "history", and "introduction"), the title of page B is associated with one information unit ("World Expo"), and the keywords of page C are associated with two information units ("history" and "introduction"). The network device therefore sorts the relevant information by the number of associated information units as follows:
1) the abstract of the web page at URL A;
2) the keywords of the web page at URL C;
3) the title of the web page at URL B.
Accordingly, the network device provides the search result items of the sorted network resources, namely
1) URL A;
2) URL C;
3) URL B,
to the user equipment in that order.
In addition, if during sorting two or more items of relevant information are associated with the same number of information units, the network device may order those items in various ways, including but not limited to random ordering. Preferably, other ordering methods may be used; these are described in detail in the preferred embodiment below with reference to Fig. 2.
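The count-based ordering in this example can be reproduced with a short sketch. The data structure and the function name `rank_by_unit_count` are illustrative assumptions; only the three items and their associated information units come from the text.

```python
# Relevant information (URL label, kind) -> information units associated with it,
# taken from the example above.
related = {
    ("A", "abstract"): {"World Expo", "history", "introduction"},
    ("B", "title"): {"World Expo"},
    ("C", "keywords"): {"history", "introduction"},
}

def rank_by_unit_count(related):
    """Sort relevant information by the number of associated information units, descending."""
    # sorted() is stable, so items with equal counts keep their input order;
    # that tie handling is an assumption of this sketch, not a rule from the text.
    return sorted(related, key=lambda info: len(related[info]), reverse=True)

ranked = rank_by_unit_count(related)
# The search result items are the links of the ranked resources.
result_items = [url for url, _kind in ranked]
```

The resulting order of search result items is URL A, URL C, URL B, matching the lists above.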
Fig. 2 shows a flowchart of a method, in a network device, for ranking search results according to search terms, in accordance with a preferred embodiment of the invention.
Specifically, steps S1 to S3 shown in Fig. 2 have been described in detail in the embodiment of Fig. 1; for brevity, they are incorporated here by reference and are not repeated.
In step S4', the network device evaluates each information unit according to a first predetermined rule to obtain an evaluation result for each information unit, and sorts the relevant information of the plurality of network resources in combination with those evaluation results, so that the search result items of the sorted network resources are provided to the user equipment in order. The first predetermined rule evaluates each information unit with reference to at least one of the following factors:
1. The number of items of relevant information of network resources associated with the information unit. For example, for the information unit "World Expo", the relevant information acquired by the network device comprises the web pages at URL A, URL B, and URL C; for the information unit "history", it comprises the web pages at URL A and URL D; for the information unit "introduction", it comprises the web page at URL B. The network device thus acquires three items of relevant information for "World Expo", two for "history", and one for "introduction". Accordingly, the weight the network device assigns to "World Expo" may be 3, the weight for "history" may be 2, and the weight for "introduction" may be 1. Those skilled in the art should understand that the weight obtained by evaluating an information unit in this way need not equal the number of items of relevant information. For example, although the network device acquires three items of relevant information for "World Expo", the weight assigned to "World Expo" may be another value, such as a multiple of 3.
2. The number of times the search result items of the network resources associated with the information unit have been selected by users. For example, for the information unit "World Expo", the relevant information acquired by the network device comprises the web pages at URL A, URL B, and URL C. The search result item for the page at URL A is the link to URL A, which has been selected by users 2 times; the search result item for the page at URL B is the link to URL B, which has been selected 2 times; the search result item for the page at URL C is the link to URL C, which has been selected 1 time. The search result items of the network resources associated with "World Expo" have therefore been selected 2+2+1=5 times in total, and the weight the network device assigns to "World Expo" may be 5. Those skilled in the art should understand that the weight obtained in this way need not equal the number of selections.
3. The frequency with which the search result items of the network resources associated with the information unit are selected by users. For example, for the information unit "introduction", the associated relevant information comprises the song list of the songs at URL C and the introduction to the video at URL D, and the corresponding search result items are the links to URL C and URL D. If the link to URL C was selected 100 times in one month and the link to URL D was selected 200 times in one month, the weight the network device assigns to "introduction" is 300. Those skilled in the art should understand that the weight obtained in this way need not equal the selection frequency. Nor is the frequency limited to selections per month; it may also be computed per day or over another period.
4. The time at which the search result items of the network resources associated with the information unit were selected by users. For example, for the information unit "World Expo", the associated relevant information comprises the web pages at URL A and URL B, and the corresponding search result items, the links to URL A and URL B, were all selected between January and October 2010. For the information unit "history", the associated relevant information comprises the web pages at URL C and URL D, and the corresponding search result items, the links to URL C and URL D, were all selected between August 2009 and May 2010. Because the results for "World Expo" were selected more recently, the network device rates "World Expo" higher than "history".
5, user related information, which includes but is not limited to:
A) the user's preference setting. For example, the user's preference setting is to evaluate each message unit according to its order in the list entries. The list entries "World Expo history introduction" are divided into message units "World Expo", "history" and "introduction", and the network equipment, following the user's preference setting, evaluates the message units as: weighted value of message unit "World Expo" > weighted value of message unit "history" > weighted value of message unit "introduction";
B) the user's personal attributes, including but not limited to: sex, age, education level, income, occupation, etc. For example, for a user whose education level is below senior high school, the network equipment evaluates message unit "history" higher than message unit "World Expo"; for a user whose education level is university or above, it evaluates message unit "World Expo" higher than message unit "history", and so on;
C) the user's historical behavior. For example, if user 1 often visits webpages relating to history, the network equipment evaluates message unit "history" higher than message unit "World Expo", and so on.
Those skilled in the art should understand that the above is shown only to better illustrate the technical scheme of the present invention and not to limit it. In fact, the user's preference setting can also include the linguistic properties of a message unit as one of the evaluation factors; for example, a message unit serving as the subject is evaluated higher than a message unit serving as the object.
In addition, the ways in which the network equipment obtains user related information include but are not limited to: obtaining it from the registration information left when the user logs in to a webpage via the subscriber equipment, or obtaining it from the user's historical behavior information extracted from the cookies recorded at the subscriber equipment end or the network end while the user browses webpages via the subscriber equipment, and so on.
Furthermore, the network equipment is not limited to evaluating each message unit by weighted values as above; in fact, it can also evaluate the message units by ranking them, and so on.
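The frequency-based evaluation and the ranking alternative described above can be sketched as follows. This is a minimal illustration and not the claimed implementation: the click-log layout, the function names `frequency_weights` and `rank_units`, and the `scale` parameter are assumptions introduced here. The `scale` parameter only reflects the remark that the weighted value need not equal the raw frequency.

```python
from collections import defaultdict

# Hypothetical click log: (message unit, link, selections within the chosen period).
CLICKS = [
    ("introduction", "C", 100),
    ("introduction", "D", 200),
]


def frequency_weights(clicks, scale=1.0):
    """Sum the per-link selection counts for each message unit.

    `scale` is an assumed knob: the weight may be any function of the
    frequency, here simply a multiple of it.
    """
    totals = defaultdict(float)
    for unit, _link, count in clicks:
        totals[unit] += count * scale
    return dict(totals)


def rank_units(weights):
    """Evaluate by ranking instead of by weight: best unit first."""
    return sorted(weights, key=weights.get, reverse=True)


weights = frequency_weights(CLICKS)
ranking = rank_units({"World Expo": 3, "history": 2, "introduction": 1})
```

With the figures from the example, `weights` maps "introduction" to 300, and `ranking` lists "World Expo" before "history" before "introduction".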
As a preferred mode, when sorting, the network equipment can first perform a preliminary sort according to the quantity of message units associated with the relevant information of each of the plurality of Internet resources. Then, for the relevant information of Internet resources that have the same quantity of associated message units, it evaluates the associated message units and sorts again in combination with the evaluation results. For example, the quantity of message units associated with relevant information A of Internet resources is 2, the quantity associated with relevant information B is 2, and the quantity associated with relevant information C is 1, so the preliminary sort places relevant information C after relevant information A and relevant information B. Subsequently, if the 2 message units associated with relevant information A are "World Expo" and "history", and the 2 message units associated with relevant information B are "World Expo" and "introduction", then, because message unit "World Expo" is associated with both A and B, the network equipment only needs to evaluate message units "history" and "introduction". For example, if the evaluation result ranks message unit "history" first and message unit "introduction" second, then in combination with this result the network equipment ranks relevant information A first and relevant information B second. In sum, the final ordering of the relevant information of each Internet resource by the network equipment is:
1) the relevant information A of Internet resources;
2) the relevant information B of Internet resources;
3) the relevant information C of Internet resources.
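The two-stage ordering in this example can be sketched as below. It is a minimal sketch under stated assumptions: the function name `two_stage_sort` is hypothetical, the message unit attached to relevant information C is not given in the text and is chosen arbitrarily, and breaking ties by the best-ranked non-shared unit is one possible reading of "in combination with the evaluation results".

```python
# Relevant information -> associated message units (from the example).
ASSOCIATED = {
    "A": {"World Expo", "history"},
    "B": {"World Expo", "introduction"},
    "C": {"World Expo"},  # assumed: the text only says C has 1 associated unit
}

# Evaluation result expressed as ranks: 1 is best.
UNIT_RANK = {"history": 1, "introduction": 2}


def two_stage_sort(associated, unit_rank):
    # Stage 1: group relevant information by the number of associated units.
    groups = {}
    for name, units in associated.items():
        groups.setdefault(len(units), []).append(name)

    ordered = []
    for count in sorted(groups, reverse=True):
        members = groups[count]
        # Units shared by every member of a group cannot break the tie,
        # so only the remaining units need to be evaluated.
        shared = set.intersection(*(associated[m] for m in members))

        def best_rank(name):
            distinct = associated[name] - shared
            return min(
                (unit_rank.get(u, float("inf")) for u in distinct),
                default=float("inf"),
            )

        # Stage 2: re-sort within the group by the evaluation result.
        ordered.extend(sorted(members, key=best_rank))
    return ordered


final_order = two_stage_sort(ASSOCIATED, UNIT_RANK)
```

Here `final_order` is A, B, C, which matches the ordering listed above.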
As another preferred mode of the present invention, the network equipment can also first evaluate all message units and then sort the relevant information of each Internet resource in combination with the evaluation results of all message units. For example, all message units comprise "World Expo", "history" and "introduction", and the evaluation results obtained by the network equipment are: the weighted value of message unit "World Expo" is 3, the weighted value of message unit "history" is 2, and the weighted value of message unit "introduction" is 1. The network equipment then sorts the relevant information of the plurality of Internet resources. Suppose the relevant information of the Internet resources is: the webpage corresponding to network address A, whose 2 associated message units are "World Expo" and "history"; the webpage corresponding to network address B, whose 2 associated message units are "World Expo" and "introduction"; and the webpage corresponding to network address C, whose 2 associated message units are "history" and "introduction". In combination with the evaluation results of all message units, the network equipment sorts these three webpages as:
1) webpage of network address A correspondence;
2) webpage of network address B correspondence;
3) webpage of network address C correspondence.
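One way to reproduce this ordering is to score each webpage by the sum of the weighted values of its associated message units. The text does not say how the evaluation results are combined, so the summation, like the name `score`, is an assumption made for this sketch.

```python
# Evaluation results from the example.
WEIGHTS = {"World Expo": 3, "history": 2, "introduction": 1}

# Webpage (by network address) -> associated message units.
PAGES = {
    "A": ["World Expo", "history"],
    "B": ["World Expo", "introduction"],
    "C": ["history", "introduction"],
}


def score(units, weights):
    """Assumed combination rule: add up the weights of the associated units."""
    return sum(weights.get(unit, 0) for unit in units)


scores = {page: score(units, WEIGHTS) for page, units in PAGES.items()}
ranking = sorted(PAGES, key=scores.get, reverse=True)
```

Under this rule the scores are 5 for A, 4 for B and 3 for C, which gives the order A, B, C.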
As a further example, suppose the relevant information of the Internet resources is: the webpage corresponding to network address A, whose 2 associated message units are "World Expo" and "history"; the webpage corresponding to network address B, whose 2 associated message units are "World Expo" and "introduction"; the webpage corresponding to network address C, whose 1 associated message unit is "World Expo"; and the webpage corresponding to network address D, whose 1 associated message unit is "introduction". In combination with the evaluation result of each message unit, the network equipment sorts the webpages that have 2 associated message units as: the webpage corresponding to network address A first, the webpage corresponding to network address B second. Likewise, it sorts the webpages that have 1 associated message unit as: the webpage corresponding to network address C first, the webpage corresponding to network address D second. Meanwhile, because the webpages corresponding to network addresses A and B are each associated with 2 message units, while the webpages corresponding to network addresses C and D are each associated with only 1, the network equipment, based on the quantity of message units associated with the relevant information of the Internet resources and on the evaluation result of each message unit, obtains the overall ordering of the relevant information of each Internet resource:
1) webpage of network address A correspondence;
2) webpage of network address B correspondence;
3) webpage of network address C correspondence;
4) webpage of network address D correspondence.
Accordingly, the network equipment provides the search result items of the sorted Internet resources, that is:
1) the link of network address A;
2) the link of network address B;
3) the link of network address C;
4) the link of network address D,
to the subscriber equipment in order.
Fig. 3 shows a flowchart of a method for sorting search results according to terms in the network equipment, according to another aspect of the present invention.
Specifically, steps S1 to S3 have been described in detail in the embodiment described with reference to Fig. 1; they are incorporated here by reference and are not repeated.
In step S4″, the network equipment sorts the relevant information of the plurality of Internet resources according to the quantity of message units contained in the relevant information of each Internet resource and in combination with user related information, so as to provide the search result items of the sorted Internet resources to the subscriber equipment in order. As previously mentioned, user related information includes but is not limited to:
1) the user's preference setting. For example, the preference setting is to rank first the relevant information of Internet resources that are frequently selected by users, or to rank first the relevant information of Internet resources relating to a certain celebrity, and so on;
2) the user's personal attributes, including but not limited to: sex, age, education level, income, occupation, etc. For example, for a user under 15 years of age, the relevant information of Internet resources relating to learning and education is ranked first, and so on;
3) the user's historical behavior. For example, if the user often shops online, the network equipment ranks first the relevant information of Internet resources relating to sales information, and so on.
Fig. 4 shows a structural schematic of network equipment for sorting search results according to terms, according to one aspect of the invention. The network equipment comprises: first deriving means 11, segmenting device 12, second deriving means 13 and collator 14.
Specifically, the first deriving means 11 obtains the list entries that the user inputs via the subscriber equipment. The subscriber equipment can be any electronic product capable of man-machine interaction with the user through a keyboard, mouse, remote control, touch pad or voice-control device, including but not limited to a computer, smartphone, PDA or IPTV. Taking the keyboard as an example, if the list entries that the user types into the search box displayed by the subscriber equipment are "World Expo history introduction", the subscriber equipment sends the list entries "World Expo history introduction" to the network equipment over the network, so that the first deriving means 11 obtains the list entries input by the user. The network connecting the subscriber equipment and the first deriving means 11 includes but is not limited to: the Internet, a wide area network, a metropolitan area network, a local area network, a VPN network, a wireless self-organizing network (Ad Hoc network), etc. Those skilled in the art should understand that the way in which the first deriving means 11 obtains the list entries is not limited to the above; in fact, the subscriber equipment can also first send the list entries input by the user to a relay device, which then forwards them to the first deriving means 11.
Then, the segmenting device 12 divides the list entries into a plurality of message units. The segmenting device 12 can divide the list entries based on a second preset rule, which divides the list entries into a plurality of message units with reference to at least one of the following factors:
1, semantic analysis. The ways of analysis include but are not limited to:
1) comparing the list entries with the independent entries contained in a dictionary, thereby obtaining the message units contained in the list entries. For example, the dictionary contains "World Expo", "history", "introduction" and so on as independent entries. The segmenting device 12 compares the list entries "World Expo history introduction" with each independent entry in the dictionary and thus, based on semantic analysis, divides the list entries into 3 message units: "World Expo", "history" and "introduction";
2) using specific words, for example auxiliary words, to obtain the message units contained in the list entries. For example, the list entries are "history of World Expo"; by means of the auxiliary word "of", the segmenting device 12 divides the list entries into 3 message units: "World Expo", "of", "history";
3) using sentence structure analysis to judge the sentence pattern of the list entries and dividing the message units according to that pattern. For example, from the structure "noun + verb + noun", the sentence pattern is judged to be subject, predicate and object, and the list entries are divided according to this structure. For example, for the list entries "which day the World Expo cut-off date is", the segmenting device 12 divides them by subject, predicate and object into 3 message units: "World Expo cut-off date", "is", "which day". Those skilled in the art should understand that sentence patterns are not limited to the above and differ between languages; they are not enumerated one by one here.
In addition, those skilled in the art should also understand that the ways in which the segmenting device 12 divides the list entries based on semantics are not limited to the 3 ways shown above.
2, dividing the list entries according to user related information. User related information includes but is not limited to:
A) the user's preference setting. The user can choose among multiple division settings according to personal preference; the preference settings include but are not limited to: dividing the list entries by single characters, by short words, or by long words. If the user's preference is set to short words, the segmenting device 12 divides the list entries "World Expo history introduction" into three message units: "World Expo", "history", "introduction". If the user's preference is set to long words, the segmenting device 12 divides the list entries "World Expo history introduction" into two message units: "World Expo history", "introduction";
B) the user's personal attributes, including but not limited to: sex, age, education level, income, occupation, etc. For users with different attributes, the segmenting device 12 can adopt different division modes. For example, for a user whose education level is "university" or above, the segmenting device 12 divides the list entries by long words; for a user whose education level is "junior middle school" or below, the segmenting device 12 divides the list entries by words. The user's personal attributes can be actively provided by the user or inferred from the user's historical behavior;
C) the user's historical behavior. For example, if the user once input "World Expo history", the segmenting device 12, according to this historical input behavior, divides the list entries "World Expo history introduction" into two message units: "World Expo history" and "introduction".
The above describes in detail how the network equipment divides the list entries based on the second preset rule, but those skilled in the art should understand that the ways of dividing the list entries are not limited to the above.
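The dictionary comparison and the long-word and short-word preferences described above can be illustrated with a small greedy matcher. This is only a sketch under assumptions: it works on the whitespace-separated tokens of the translated example, the name `segment` and the `prefer_long` flag are hypothetical, and greedy matching is a swapped-in, commonly used technique rather than something the text prescribes.

```python
# Independent dictionary entries from the example.
DICTIONARY = {"World Expo", "history", "introduction"}

# Assumed extra long-word entry, e.g. learned from the user's earlier input.
LONG_ENTRIES = {"World Expo history"}


def segment(query, dictionary, prefer_long=True):
    """Greedily split `query` into dictionary entries.

    prefer_long=True tries the longest candidate first (long-word preference);
    prefer_long=False tries the shortest first (short-word preference).
    """
    tokens = query.split()
    units, i = [], 0
    while i < len(tokens):
        remaining = len(tokens) - i
        lengths = range(remaining, 0, -1) if prefer_long else range(1, remaining + 1)
        for n in lengths:
            candidate = " ".join(tokens[i:i + n])
            if candidate in dictionary:
                units.append(candidate)
                i += n
                break
        else:
            # No entry starts at this token: keep it as its own unit.
            units.append(tokens[i])
            i += 1
    return units


query = "World Expo history introduction"
by_dictionary = segment(query, DICTIONARY)
long_words = segment(query, DICTIONARY | LONG_ENTRIES, prefer_long=True)
short_words = segment(query, DICTIONARY | LONG_ENTRIES, prefer_long=False)
```

With the plain dictionary, or with the short-word preference, the result is the three units "World Expo", "history" and "introduction"; with the long-word preference and the extra entry it is the two units "World Expo history" and "introduction".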
Then, the second deriving means 13 obtains, based on the plurality of message units, the relevant information of a plurality of Internet resources associated with each message unit. The Internet resources include but are not limited to: webpages, pictures, video, audio, etc. The relevant information of Internet resources includes but is not limited to: title, keywords, summary, picture introduction, song list, songwriter list, the full content of a webpage, etc.
As a preferred embodiment of the present invention, the second deriving means 13 comprises a first sub-acquiring unit (not shown), which obtains the relevant information of the plurality of Internet resources from an index database based on the plurality of message units. The index database contains the titles, keywords, summaries, picture introductions, song lists, songwriter lists, video content introductions and even the full content of the webpages, pictures, video or audio associated with the various message units, so the first sub-acquiring unit can obtain the relevant information of Internet resources directly from it. For example, the relevant information of Internet resources that the first sub-acquiring unit obtains from the index database for message unit "World Expo" comprises: the title information of the webpage at network address A, the summary information of the webpage at network address B, etc. The relevant information it obtains from the index database for message unit "history" comprises: the summary information of the webpage at network address B, the author list information at network address D, etc. Those skilled in the art should understand that the relevant information of Internet resources that the first sub-acquiring unit obtains from the index database based on one message unit may also be associated with other message units. For example, the summary information of the webpage at network address B mentioned above is associated both with message unit "World Expo" and with message unit "history".
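The direct lookup in the index database can be pictured as a mapping from message units to relevant information. The sketch below is illustrative only; the dictionary `INDEX`, its string labels and the function `lookup` are assumptions, and a real index database would be a persistent store rather than an in-memory dict.

```python
from collections import defaultdict

# Hypothetical index database: message unit -> relevant information entries.
INDEX = {
    "World Expo": ["title of webpage A", "summary of webpage B"],
    "history": ["summary of webpage B", "author list of D"],
}


def lookup(units, index):
    """Return {relevant information: set of message units that retrieved it}."""
    hits = defaultdict(set)
    for unit in units:
        for info in index.get(unit, []):
            hits[info].add(unit)
    return dict(hits)


hits = lookup(["World Expo", "history", "introduction"], INDEX)
```

Keeping the set of retrieving units per entry makes the overlap visible: "summary of webpage B" is reached through both "World Expo" and "history", and the size of each set is the quantity used later for sorting.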
As another preferred embodiment of the present invention, the second deriving means 13 can also comprise a second sub-acquiring unit (not shown) and a third sub-acquiring unit (not shown). The second sub-acquiring unit obtains from the index database, based on the plurality of message units, the links to the relevant information of the plurality of Internet resources associated with each message unit. The third sub-acquiring unit obtains the relevant information of the plurality of Internet resources based on those links. For example, the second sub-acquiring unit first obtains from the index database the links to the relevant information of the Internet resources associated with message units "World Expo" and "introduction" respectively: the links relating to "World Expo" in the index database comprise 202.198.2.2 and 202.198.2.3, and the links relating to "introduction" comprise 230.10.10.10. Link 202.198.2.2 corresponds to webpage E; link 202.198.2.3 corresponds to the keyword information of the webpage at network address F; link 230.10.10.10 corresponds to the video content introduction at network address I. The third sub-acquiring unit then obtains the relevant information of the Internet resources based on each link: based on link 202.198.2.2 it obtains webpage E; based on link 202.198.2.3 it obtains the keyword information of the webpage at network address F; based on link 230.10.10.10 it obtains the video content introduction at network address I. Those skilled in the art should understand that, although the second sub-acquiring unit obtains a link from the index database based on one message unit and the third sub-acquiring unit then obtains the relevant information of the Internet resource based on that link, this does not mean that the relevant information is associated only with that message unit; in fact, it can also be associated with other message units. For example, the keyword information of the webpage at network address F, which the third sub-acquiring unit obtains as relevant information associated with message unit "World Expo", may, besides being associated with message unit "World Expo", also be associated with message unit "history".
In addition, those skilled in the art should understand that the above is shown only to better illustrate the technical scheme of the present invention and not to limit it; in fact, the second deriving means 13 is not limited to obtaining the relevant information of Internet resources from the index database.
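The two-step acquisition (message units to links, then links to relevant information) can be sketched with two lookups. The data structures and the function names `get_links` and `fetch` are assumptions for illustration; in particular, `fetch` reads from a local dict where an actual third sub-acquiring unit would presumably retrieve the content over the network.

```python
# Hypothetical index database holding links only (values from the example).
LINK_INDEX = {
    "World Expo": ["202.198.2.2", "202.198.2.3"],
    "introduction": ["230.10.10.10"],
}

# Hypothetical stand-in for the resources reachable through each link.
RESOURCES = {
    "202.198.2.2": "webpage E",
    "202.198.2.3": "keyword information of webpage F",
    "230.10.10.10": "video content introduction of I",
}


def get_links(units, link_index):
    """Second sub-acquiring unit: collect links per message unit, without duplicates."""
    links = []
    for unit in units:
        for link in link_index.get(unit, []):
            if link not in links:
                links.append(link)
    return links


def fetch(links, resources):
    """Third sub-acquiring unit: resolve each link to its relevant information."""
    return {link: resources[link] for link in links if link in resources}


links = get_links(["World Expo", "introduction"], LINK_INDEX)
related = fetch(links, RESOURCES)
```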
As a preferred mode, the segmenting device 12 can also comprise a recognition unit (not shown) for examining the plurality of message units to identify invalid information units, so that the second deriving means 13 does not obtain the relevant information of Internet resources associated with invalid information units. The recognition unit can identify message units based on an invalid information list. For example, a message unit belonging to a part of speech or sentence element listed in the invalid information list is regarded as an invalid information unit. The parts of speech listed in the invalid information list include but are not limited to: auxiliary words, interrogatives, verbs, etc.; the listed sentence elements include but are not limited to: predicates, etc. For example, the list entries "history of World Expo" are divided into 3 message units, "World Expo", "of" and "history". The part of speech of message unit "of" is auxiliary word, so the recognition unit identifies message unit "of" as an invalid information unit, and the second deriving means 13 does not obtain from the index database the relevant information of Internet resources associated with "of". As a further example, for the list entries "which day the World Expo cut-off date is", the segmenting device 12 divides the list entries by the subject-predicate-object sentence pattern into 3 message units: "World Expo cut-off date", "is", "which day". Message unit "is" is a predicate, which is in the invalid information list, so the recognition unit identifies it as an invalid information unit, and correspondingly the second deriving means 13 does not obtain from the index database the relevant information of Internet resources associated with "is". Those skilled in the art should understand that the recognition unit is not limited to identifying invalid information units based on the invalid information list; in fact, it can also do so based on a preset rule. Such preset rules include but are not limited to: according to the order of the message units in the list entries, regarding the last message unit as an invalid information unit, etc. For example, message unit "which day" is the last message unit in the list entries "which day the World Expo cut-off date is", so the recognition unit identifies it as an invalid information unit.
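The identification of invalid information units can be sketched as a filter over tagged message units. The tags below are written by hand for the two examples; how parts of speech and sentence elements are actually determined is not specified in the text, and the names `INVALID_POS`, `INVALID_ROLES`, `valid_units` and `drop_last` are hypothetical.

```python
# Invalid information list from the text: parts of speech and sentence elements.
INVALID_POS = {"auxiliary", "interrogative", "verb"}
INVALID_ROLES = {"predicate"}


def valid_units(tagged_units, drop_last=False):
    """Keep the message units that are not identified as invalid.

    tagged_units: list of (text, part_of_speech, sentence_element) tuples.
    drop_last: apply the assumed preset rule that the last unit is invalid.
    """
    candidates = tagged_units[:-1] if drop_last else tagged_units
    return [
        text
        for text, pos, role in candidates
        if pos not in INVALID_POS and role not in INVALID_ROLES
    ]


# Hand-tagged examples; the tags are assumptions, not tagger output.
example_1 = [
    ("World Expo", "noun", None),
    ("of", "auxiliary", None),
    ("history", "noun", None),
]
example_2 = [
    ("World Expo cut-off date", "noun", "subject"),
    ("is", "verb", "predicate"),
    ("which day", "interrogative", "object"),
]

kept_1 = valid_units(example_1)
kept_2 = valid_units(example_2, drop_last=True)
```

Only the units that survive this filter would be passed on to the second deriving means 13 for lookup.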
Then, the collator 14 sorts the relevant information of the plurality of Internet resources according to the quantity of message units contained in the relevant information of each Internet resource, so as to provide the search result items of the sorted Internet resources to the subscriber equipment in order. The search result items of the Internet resources include but are not limited to: the link of the Internet resource and/or descriptive content of the Internet resource, etc. For example, if the relevant information of an Internet resource is the summary information of the webpage corresponding to network address A, the corresponding search result item comprises the link of network address A, a brief introduction of network address A, or multimedia files such as pictures or audio and video corresponding to network address A. For simplicity, the parts below that relate to search result items all use the link of network address A as the example, but those skilled in the art should understand that search result items are not limited thereto.
In more detail, for example, the relevant information of the Internet resources obtained by the second deriving means 13 comprises: the summary information of the webpage at network address A, the title information of the webpage at network address B, and the keyword information of the webpage at network address C. The summary information of the webpage at network address A is associated with 3 message units in total, "World Expo", "history" and "introduction"; the title information of the webpage at network address B is associated with 1 message unit, "World Expo"; the keyword information of the webpage at network address C is associated with 2 message units in total, "history" and "introduction". Therefore, according to the quantity of message units associated with the relevant information of each Internet resource, the collator 14 sorts the relevant information of each Internet resource as:
1) the summary information of the webpage at network address A;
2) the keyword information of the webpage at network address C;
3) the title information of the webpage at network address B.
Accordingly, the network equipment provides the search result items of the sorted Internet resources, namely:
1) network address A;
2) network address C;
3) network address B,
to the subscriber equipment in order.
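The basic ordering performed by the collator 14 in this example amounts to sorting by the number of associated message units. The following sketch shows that step; the tuple keys and variable names are assumptions, and mapping each entry to its network address stands in for producing the search result items.

```python
# Relevant information, keyed as (network address, kind), with associated units.
ASSOCIATED = {
    ("A", "summary"): {"World Expo", "history", "introduction"},
    ("B", "title"): {"World Expo"},
    ("C", "keywords"): {"history", "introduction"},
}

# More associated message units means an earlier position.
ranked_info = sorted(ASSOCIATED, key=lambda info: len(ASSOCIATED[info]), reverse=True)

# Search result items offered in order: here simply the network addresses.
result_items = [address for address, _kind in ranked_info]
```

`result_items` comes out as A, C, B, matching the order given above. Python's `sorted` is stable, so entries with equal counts keep their original relative order unless a further tie-breaking key is added.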
In addition, during sorting, if the relevant information of two or more Internet resources is associated with the same quantity of message units, the collator 14 can sort that relevant information in multiple ways, including but not limited to random ordering. Preferably, other sorting modes can also be adopted; these are described in detail in the preferred embodiment described below with reference to Fig. 5.
Fig. 5 shows a structural schematic of network equipment for sorting search results according to terms, according to a preferred embodiment of the present invention.
Specifically, the first deriving means 11, the segmenting device 12 and the second deriving means 13 shown in Fig. 5 have been described in detail in the embodiment described with reference to Fig. 4; for simplicity, they are incorporated here by reference and are not repeated. The collator 14 comprises: an evaluation unit 141 and a first sub-sequencing unit 142.
The evaluation unit 141 evaluates each message unit according to a first pre-defined rule to obtain the evaluation result of each message unit, and the first sub-sequencing unit 142 sorts the relevant information of the plurality of Internet resources in combination with the evaluation results, so as to provide the search result items of the sorted Internet resources to the subscriber equipment in order. The first pre-defined rule evaluates each message unit with reference to at least one of the following factors:
1, the quantity of the relevant information of the Internet resources related with message unit.For example, for message unit " World Expo ", the relevant information that second deriving means 13 obtains Internet resources associated therewith comprises: the webpage of the webpage of the webpage of network address A correspondence, network address B correspondence and network address C correspondence adds up to the relevant information of 3 Internet resources; For message unit " history ", the relevant information of the Internet resources associated therewith that second deriving means 13 obtains comprises: the webpage of the webpage of network address A correspondence and network address D correspondence adds up to the relevant information of 2 Internet resources; For message unit " introduction ", the relevant information of the Internet resources associated therewith that second deriving means 13 obtains comprises: the webpage of network address B correspondence adds up to the relevant information of 1 Internet resources.Therefore, the weighted value that 141 pairs of message units of evaluation unit " World Expo " are estimated can be 3, and the weighted value that message unit " history " is estimated can be 2, and the weighted value that message unit " introduction " is estimated can be 1.Those skilled in the art should understand that, above-mentioned evaluation unit 141 comes message unit is estimated according to the quantity of the relevant information of Internet resources, the weighted value that obtains not is to exceed with the quantity of the relevant information that equals Internet resources, for example, for message unit " World Expo ", the quantity of the relevant information of the Internet resources that second deriving means 13 obtains is 3, and 141 pairs of message units of evaluation unit " World Expo " weighted value can be worth for other, for example, multiple of 3 or the like.
2, the number of times selected by the user of the search result items of the Internet resources related with message unit.For example, for message unit " World Expo ", the relevant information that second deriving means 13 obtains Internet resources associated therewith comprises: the webpage of the webpage of the webpage of network address A correspondence, network address B correspondence and network address C correspondence, accordingly, the relevant information of Internet resources is the webpage of network address A correspondence, the search result items of these Internet resources is the link of network address A, and the number of times that this link is selected by the user is 2 times; The relevant information of Internet resources is the webpage of network address B correspondence, and the search result items of these Internet resources is the link of network address B, and the number of times that this link is selected by the user is 2 times; The relevant information of Internet resources is the webpage of network address C correspondence, the search result items of these Internet resources is the link of network address C, the number of times that this link is selected by the user is 1 time, this shows that the number of times that the search result items of the Internet resources that are associated with message unit " World Expo " is selected by the user is 2+2+1=5 time.Thus, the weighted value of 141 pairs of message units of evaluation unit " World Expo " evaluation can be 5.Those skilled in the art should understand that, the number of times that above-mentioned evaluation unit 141 is selected by the user according to the search result items of Internet resources comes message unit is estimated, and the weighted value of acquisition is not to be exceeded by the number of times that the user selects with the search result items that equals Internet resources yet.
3) The frequency with which the search result items of the network resources associated with the information unit are selected by users. For example, for the information unit "introduction", the relevant information of the associated network resources comprises a song-list introduction at URL C and an introductory video at URL D, and the corresponding search result items are the link to URL C and the link to URL D. If the link to URL C was selected by users 100 times in one month and the link to URL D was selected 200 times in one month, the evaluation unit 141 assigns "introduction" a weight of 300. Those skilled in the art will understand that when the evaluation unit 141 evaluates an information unit according to selection frequency, the resulting weight need not equal that frequency. Nor must the frequency be computed from the number of selections in one month; it may also be computed over one day or some other period.
4) The time at which the search result items of the network resources associated with the information unit were selected by users. For example, for the information unit "World Expo", the relevant information of the associated network resources comprises the web page at URL A and the web page at URL B, and the corresponding search result items, the links to URL A and URL B, were all selected by users between January and October 2010. For the information unit "history", the relevant information comprises the web page at URL C and the web page at URL D, and the corresponding search result items, the links to URL C and URL D, were all selected by users between August 2009 and May 2010. The evaluation unit 141 therefore rates "World Expo", whose links were selected more recently, higher than "history".
5) User-related information, which includes but is not limited to:
A) The user's preference settings. For example, the user's preference setting may be to evaluate each information unit according to its position in the input sequence. The input sequence "World Expo history introduction" is split into the information units "World Expo", "history", and "introduction", and the evaluation unit 141, following the user's preference setting, evaluates them as: weight of "World Expo" > weight of "history" > weight of "introduction";
B) The user's personal attributes, including but not limited to gender, age, education level, income, and occupation. For example, for a user whose education level is below high school, the evaluation unit 141 rates "history" higher than "World Expo"; for a user educated at university level or above, it rates "World Expo" higher than "history";
C) The user's historical behavior. For example, if user 1 often visits web pages related to history, the evaluation unit 141 rates "history" higher than "World Expo".
Those skilled in the art will understand that the above is given only to better illustrate the technical solution of the present invention and is not intended to limit it. In practice, the user's preference settings may also include the linguistic properties of the information units as an evaluation factor; for example, an information unit serving as the subject may be rated higher than one serving as the object.
In addition, the ways in which the network device obtains user-related information include but are not limited to: obtaining it from the registration information left when the user registers on a web page via the user equipment, or obtaining it from the user's historical behavior information extracted from cookies recorded at the user equipment or on the network side while the user browses web pages via the user equipment.
Furthermore, the way in which the evaluation unit 141 evaluates each information unit is not limited to the weight values described above; each information unit may also be evaluated, for example, by ranking.
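As an illustration only, the first two evaluation factors can be sketched in a few lines of Python. The dictionaries below are hypothetical data mirroring the "World Expo" examples, and the function names and the `scale` parameter are assumptions introduced here, not part of the patent text.

```python
# Hypothetical data mirroring the examples: pages (by URL label)
# associated with each information unit.
associated_pages = {
    "World Expo": ["A", "B", "C"],
    "history": ["A", "D"],
    "introduction": ["B"],
}

# Hypothetical selection counts per search result link (factor 2 example).
click_counts = {"A": 2, "B": 2, "C": 1}


def weight_by_resource_count(unit, scale=1):
    """Factor 1: weight derived from the number of associated resources.

    `scale` reflects the remark that the weight need not equal the raw
    count and may, for instance, be a multiple of it.
    """
    return scale * len(associated_pages.get(unit, []))


def weight_by_clicks(unit):
    """Factor 2: total number of times the associated links were selected."""
    return sum(click_counts.get(url, 0) for url in associated_pages.get(unit, []))


weights = {unit: weight_by_resource_count(unit) for unit in associated_pages}
print(weights)
print(weight_by_clicks("World Expo"))
```

With this data the count-based weights come out as 3, 2, and 1, and the click-based weight of "World Expo" is 2+2+1=5, matching the figures given in the examples.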
As one preferred approach, when sorting, the first sub-sorting unit 142 may first perform a preliminary sort according to the number of information units associated with the relevant information of each of the plurality of network resources. For relevant information of network resources associated with the same number of information units, the evaluation unit 141 evaluates the corresponding information units, and the first sub-sorting unit 142 then sorts again in combination with the evaluation result for those information units. For example, suppose relevant information A of a network resource is associated with 2 information units, relevant information B is associated with 2 information units, and relevant information C is associated with 1 information unit. The preliminary sort by the first sub-sorting unit 142 then places relevant information C after relevant information A and relevant information B. Next, if the 2 information units associated with relevant information A are "World Expo" and "history" and the 2 information units associated with relevant information B are "World Expo" and "introduction", then, because "World Expo" is associated with both A and B, the evaluation unit 141 only needs to evaluate "history" and "introduction". If, for example, the evaluation unit 141 ranks "history" first and "introduction" second, the first sub-sorting unit 142, in combination with this evaluation result, re-sorts relevant information A and relevant information B so that A ranks first and B ranks second. In summary, the final ordering of the relevant information of the network resources by the first sub-sorting unit 142 is:
1) relevant information A of the network resources;
2) relevant information B of the network resources;
3) relevant information C of the network resources.
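The two-stage procedure described above could be sketched as follows. This is a minimal illustration under stated assumptions: the function name `two_stage_sort` is hypothetical, the unit associated with C is not given in the text and is assumed here to be "history", and unranked units are treated as ranking last.

```python
from itertools import groupby


def two_stage_sort(units_of, unit_rank):
    """Sort resources by unit count, then break ties by unit ranking.

    units_of:  resource label -> set of associated information units
    unit_rank: information unit -> rank (1 is best); only the units that
               differ between tied resources need an entry.
    """
    # Stage 1: preliminary sort by number of associated units, descending.
    prelim = sorted(units_of, key=lambda r: -len(units_of[r]))
    result = []
    # Stage 2: within each group of equal count, compare only the units
    # that are not shared by every member of the group.
    for _, group in groupby(prelim, key=lambda r: len(units_of[r])):
        group = list(group)
        shared = set.intersection(*(units_of[r] for r in group))

        def tie_key(r):
            return sorted(unit_rank.get(u, float("inf"))
                          for u in units_of[r] - shared)

        result.extend(sorted(group, key=tie_key))
    return result


units_of = {
    "A": {"World Expo", "history"},
    "B": {"World Expo", "introduction"},
    "C": {"history"},  # assumed; the text only says C has 1 unit
}
unit_rank = {"history": 1, "introduction": 2}
print(two_stage_sort(units_of, unit_rank))
```

Because "World Expo" is shared by A and B, only "history" and "introduction" enter the tie-break, which is the saving the text points out.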
As another preferred approach of the present invention, the evaluation unit 141 may first evaluate all information units, after which the first sub-sorting unit 142 sorts the relevant information of each network resource in combination with the evaluation results for all information units. For example, suppose the information units are "World Expo", "history", and "introduction", and the evaluation unit 141 assigns "World Expo" a weight of 3, "history" a weight of 2, and "introduction" a weight of 1. The first sub-sorting unit 142 then sorts the relevant information of the plurality of network resources. If the web page at URL A is associated with the 2 information units "World Expo" and "history", the web page at URL B is associated with the 2 information units "World Expo" and "introduction", and the web page at URL C is associated with the 2 information units "history" and "introduction", the ordering produced by the first sub-sorting unit 142 in combination with the evaluation results for all information units is:
1) webpage of network address A correspondence;
2) webpage of network address B correspondence;
3) webpage of network address C correspondence.
As a further example, suppose the web page at URL A is associated with the 2 information units "World Expo" and "history", the web page at URL B is associated with the 2 information units "World Expo" and "introduction", the web page at URL C is associated with the 1 information unit "World Expo", and the web page at URL D is associated with the 1 information unit "introduction". In combination with the evaluation result for each information unit, the first sub-sorting unit 142 orders the two web pages associated with 2 information units, those at URL A and URL B, so that the web page at URL A ranks first and the web page at URL B ranks second. In the same way, it orders the two web pages associated with 1 information unit, those at URL C and URL D, so that the web page at URL C ranks first and the web page at URL D ranks second. Since the web pages at URL A and URL B are each associated with 2 information units, while the web pages at URL C and URL D are each associated with only 1, the first sub-sorting unit 142, based on the number of information units associated with the relevant information of each network resource and in combination with the evaluation results for the information units, produces the following overall ordering of the relevant information of the network resources:
1) webpage of network address A correspondence;
2) webpage of network address B correspondence;
3) webpage of network address C correspondence;
4) webpage of network address D correspondence.
Accordingly, the network device provides the search result items of the sorted network resources to the user equipment in order, namely:
1) the link to URL A;
2) the link to URL B;
3) the link to URL C;
4) the link to URL D.
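The combined ordering in this last example, where the number of associated information units dominates and the evaluation results break ties, can be sketched with a tuple sort key. The function name `rank_links` and the use of a weight sum as the tie-breaker are assumptions for illustration; the weights reuse the values 3, 2, and 1 from the earlier example.

```python
def rank_links(page_units, unit_weight):
    """Order pages by (unit count, summed unit weight), both descending,
    and return the corresponding search result items (links)."""

    def key(page):
        units = page_units[page]
        return (len(units), sum(unit_weight[u] for u in units))

    ordered = sorted(page_units, key=key, reverse=True)
    return [f"link to URL {page}" for page in ordered]


unit_weight = {"World Expo": 3, "history": 2, "introduction": 1}
page_units = {
    "D": ["introduction"],
    "C": ["World Expo"],
    "B": ["World Expo", "introduction"],
    "A": ["World Expo", "history"],
}
for item in rank_links(page_units, unit_weight):
    print(item)
```

Python compares tuples element by element, so a page with more matched units always precedes one with fewer, regardless of weights, and weights only decide the order among pages with equal counts.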
Fig. 6 shows a schematic structural diagram of a network device for sorting search results according to search terms, in accordance with another aspect of the present invention.
Specifically, the first acquisition device 11, the segmentation device 12, and the second acquisition device 13 are described in detail in the embodiment with reference to Fig. 4; that description is incorporated here by reference and is not repeated. The sorting device 14 comprises a second sub-sorting unit 143.
The second sub-sorting unit 143 sorts the relevant information of the plurality of network resources according to the number of information units contained in the relevant information of each network resource, in combination with user-related information, so that the search result items of the sorted network resources are provided to the user equipment in order. As described above, the user-related information includes but is not limited to:
1) The user's preference settings. For example, the user's preference setting may be to rank the relevant information of frequently selected network resources first, or to rank the relevant information of network resources relating to a particular celebrity first;
2) The user's personal attributes, including but not limited to gender, age, education level, income, and occupation. For example, for a user below the age of 15, the second sub-sorting unit 143 ranks the relevant information of learning-related and educational network resources first;
3) The user's historical behavior. For example, if the user frequently shops online, the second sub-sorting unit 143 ranks the relevant information of network resources relating to sales information first.
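A minimal sketch of such a user-aware sort is shown below. The text does not specify how the unit count and the user-related information are combined; this sketch assumes the count is the primary key and the overlap with the user's preferred topics is the tie-breaker. The `Resource` class, its `topics` field, and `sort_for_user` are all hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class Resource:
    url: str
    matched_units: int                          # information units it contains
    topics: set = field(default_factory=set)    # hypothetical topic labels


def sort_for_user(resources, preferred_topics):
    """Sort by matched unit count, then by overlap with the user's
    preferred topics (both descending). The combination rule is assumed."""

    def key(r):
        return (r.matched_units, len(r.topics & preferred_topics))

    return sorted(resources, key=key, reverse=True)


resources = [
    Resource("X", 2, {"news"}),
    Resource("Y", 2, {"education"}),
    Resource("Z", 1, {"education"}),
]
# Hypothetical profile: a young user for whom educational content is preferred.
ordered = sort_for_user(resources, {"education", "learning"})
print([r.url for r in ordered])
```

Here Y moves ahead of X because both match 2 information units but only Y is educational, while Z stays last because it matches fewer units.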
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing exemplary embodiments, and that it may be embodied in other specific forms without departing from its spirit or essential characteristics. The embodiments are therefore to be regarded in every respect as illustrative and not restrictive. The scope of the invention is defined by the appended claims rather than by the foregoing description, and all changes that fall within the meaning and range of equivalency of the claims are intended to be embraced by the invention. No reference sign in a claim should be regarded as limiting the claim concerned. Moreover, the word "comprising" does not exclude other units or steps, and the singular does not exclude the plural. A plurality of units or devices recited in a system claim may also be implemented by a single unit or device through software or hardware. Words such as "first" and "second" are used to denote names and do not denote any particular order.
Claims (22)
1. A method for sorting search results according to search terms in a network device, the method comprising the following steps:
a. obtaining an input sequence entered by a user via user equipment;
b. dividing the input sequence into a plurality of information units;
c. obtaining, based on the plurality of information units, relevant information of a plurality of network resources associated with each information unit;
d. sorting the obtained plurality of network resources according to the number of information units associated with the relevant information of each of the plurality of network resources, so that the search result items of the sorted network resources are provided to the user equipment in order.
2. The method according to claim 1, wherein step c further comprises the following step:
- obtaining the relevant information of the plurality of network resources from an index database based on the plurality of information units.
3. The method according to claim 1, wherein step c further comprises the following steps:
- obtaining, from an index database and based on the plurality of information units, links to the relevant information of the plurality of network resources associated with each information unit;
- obtaining the relevant information of the plurality of network resources based on the links.
4. The method according to any one of claims 1 to 3, wherein step d further comprises the following steps:
- evaluating each information unit according to a first predetermined rule to obtain an evaluation result for each information unit;
- sorting the relevant information of the plurality of network resources in combination with the evaluation result and the number of information units, so that the search result items of the sorted network resources are provided to the user equipment in order.
5. The method according to claim 2, wherein the first predetermined rule evaluates each information unit with reference to at least one of the following factors:
- the quantity of relevant information of network resources associated with the information unit;
- the number of times the search result items of the network resources associated with the information unit have been selected by users;
- the frequency with which the search result items of the network resources associated with the information unit are selected by users;
- the time at which the search result items of the network resources associated with the information unit were selected by users;
- user-related information.
6. The method according to any one of claims 1 to 5, wherein step d further comprises the following step:
- sorting the relevant information of the plurality of network resources according to the number of information units contained in the relevant information of each network resource and in combination with user-related information, so that the search result items of the sorted network resources are provided to the user equipment in order.
7. The method according to any one of claims 1 to 6, wherein step b further comprises the step of:
- dividing the input sequence into a plurality of information units according to a second preset rule.
8. The method according to claim 7, wherein the second preset rule divides the input sequence into a plurality of information units with reference to at least one of the following factors:
- semantic analysis;
- user-related information.
9. The method according to any one of claims 1 to 8, wherein step c further comprises the step of:
- identifying the plurality of information units to obtain invalid information units, and, for the invalid information units, not obtaining the relevant information of network resources associated with them.
10. The method according to claim 5, 6, or 8, wherein the user-related information comprises at least one of the following:
- the user's preference settings;
- the user's personal attributes;
- the user's historical behavior.
11. The method according to any one of claims 1 to 10, wherein the network device comprises: a single network server, a network server group consisting of a plurality of network servers, or a cloud consisting of a collection of computers.
12. A network device for sorting search results according to search terms, wherein the device comprises:
a first acquisition device, configured to obtain an input sequence entered by a user via user equipment;
a segmentation device, configured to divide the input sequence into a plurality of information units;
a second acquisition device, configured to obtain, based on the plurality of information units, relevant information of a plurality of network resources associated with each information unit;
a sorting device, configured to sort the obtained relevant information of the plurality of network resources according to the number of information units associated with the relevant information of each of the plurality of network resources, so that the search result items of the sorted network resources are provided to the user equipment in order.
13. The device according to claim 12, wherein the second acquisition device further comprises:
a first sub-acquisition unit, configured to obtain the relevant information of the plurality of network resources from an index database based on the plurality of information units.
14. The device according to claim 12, wherein the second acquisition device further comprises:
a second sub-acquisition unit, configured to obtain, from an index database and based on the plurality of information units, links to the relevant information of the plurality of network resources associated with each information unit;
a third sub-acquisition unit, configured to obtain the relevant information of the plurality of network resources based on the links.
15. The device according to any one of claims 12 to 14, wherein the sorting device further comprises:
an evaluation unit, configured to evaluate each information unit according to a first predetermined rule to obtain an evaluation result for each information unit;
a first sub-sorting unit, configured to sort the relevant information of the plurality of network resources in combination with the evaluation result and the number of information units, so that the search result items of the sorted network resources are provided to the user equipment in order.
16. The device according to claim 15, wherein the first predetermined rule evaluates each information unit with reference to at least one of the following factors:
- the quantity of relevant information of network resources associated with the information unit;
- the number of times the search result items of the network resources associated with the information unit have been selected by users;
- the frequency with which the search result items of the network resources associated with the information unit are selected by users;
- the time at which the search result items of the network resources associated with the information unit were selected by users;
- user-related information.
17. The device according to any one of claims 12 to 16, wherein the sorting device further comprises:
a second sub-sorting unit, configured to sort the relevant information of the plurality of network resources according to the number of information units contained in the relevant information of each network resource and in combination with user-related information, so that the search result items of the sorted network resources are provided to the user equipment in order.
18. The device according to any one of claims 12 to 17, wherein the segmentation device is further configured to:
- divide the input sequence into a plurality of information units according to a second preset rule.
19. The device according to claim 18, wherein the second preset rule divides the input sequence into a plurality of information units with reference to at least one of the following factors:
- semantic analysis;
- user-related information.
20. The device according to any one of claims 12 to 19, wherein the second acquisition device further comprises:
a recognition unit, configured to identify the plurality of information units to obtain invalid information units, so that the second acquisition device does not obtain the relevant information of network resources associated with the invalid information units.
21. The device according to claim 16, 17, or 19, wherein the user-related information comprises at least one of the following:
- the user's preference settings;
- the user's personal attributes;
- the user's historical behavior.
22. The device according to any one of claims 12 to 21, wherein the network device is included in: a single network server, a network server group consisting of a plurality of network servers, or a cloud consisting of a collection of computers.
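For orientation, steps a to d of claim 1, together with the invalid-unit filtering of claim 9, could be sketched end to end as follows. This is an illustrative sketch only: the in-memory `INDEX`, the `INVALID_UNITS` set, the greedy longest-match splitter, and the alphabetical tie-break are all assumptions made here and are not taken from the claims.

```python
# Hypothetical inverted index: information unit -> URLs of matching pages.
INDEX = {
    "World Expo": ["A", "B", "C"],
    "history": ["A", "D"],
    "introduction": ["B"],
}
# Hypothetical invalid information units (stop-word-like terms).
INVALID_UNITS = {"the", "of"}


def split_units(input_sequence, vocabulary):
    """Step b: greedy longest-match split of the input into units."""
    tokens = input_sequence.split()
    units, i = [], 0
    while i < len(tokens):
        # Try the longest phrase starting at position i first; a single
        # token is always accepted as a fallback.
        for j in range(len(tokens), i, -1):
            phrase = " ".join(tokens[i:j])
            if phrase in vocabulary or j == i + 1:
                units.append(phrase)
                i = j
                break
    return units


def search(input_sequence):
    """Steps a to d: split, filter invalid units, look up, and sort."""
    units = split_units(input_sequence, INDEX.keys() | INVALID_UNITS)
    valid = [u for u in units if u not in INVALID_UNITS]  # claim 9 filter
    matched = {}
    for unit in valid:                                     # step c
        for url in INDEX.get(unit, []):
            matched.setdefault(url, set()).add(unit)
    # Step d: pages matching more units first; ties broken by URL label
    # purely to make the output deterministic.
    return sorted(matched, key=lambda url: (-len(matched[url]), url))


print(search("World Expo history introduction"))
print(search("the history of World Expo"))
```

In the second query the units "the" and "of" are recognized as invalid and never looked up, so only "history" and "World Expo" contribute to the match counts.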
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN 201010545582 CN102004772A (en) | 2010-11-15 | 2010-11-15 | Method and equipment for sequencing search results according to terms |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN 201010545582 CN102004772A (en) | 2010-11-15 | 2010-11-15 | Method and equipment for sequencing search results according to terms |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN102004772A true CN102004772A (en) | 2011-04-06 |
Family
ID=43812134
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN 201010545582 Pending CN102004772A (en) | 2010-11-15 | 2010-11-15 | Method and equipment for sequencing search results according to terms |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN102004772A (en) |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102214207A (en) * | 2011-04-27 | 2011-10-12 | 百度在线网络技术(北京)有限公司 | Method and equipment for sorting attribute sets in information entities |
| CN102760156A (en) * | 2012-06-05 | 2012-10-31 | 百度在线网络技术(北京)有限公司 | Method, device and equipment used for generating release information corresponding to key words |
| CN102833594A (en) * | 2012-08-14 | 2012-12-19 | 中兴通讯股份有限公司 | Method, device and system for searching IPTV (internet protocol television) programs |
| CN103152621A (en) * | 2013-02-27 | 2013-06-12 | 四三九九网络股份有限公司 | Configuration method, displaying method and playing method of recommended video |
| CN103365858A (en) * | 2012-03-28 | 2013-10-23 | 百度在线网络技术(北京)有限公司 | Method and device for acquiring searching results from multiple source devices and based on one inquiry sequence |
| CN103389974A (en) * | 2012-05-07 | 2013-11-13 | 腾讯科技(深圳)有限公司 | Method and server for searching information |
| CN104090981A (en) * | 2014-07-24 | 2014-10-08 | 山东大学 | Method for rapidly searching PHP variable keywords and pushing interested contents |
| CN106484766A (en) * | 2016-09-07 | 2017-03-08 | 北京百度网讯科技有限公司 | Searching method based on artificial intelligence and device |
| CN107430615A (en) * | 2015-09-30 | 2017-12-01 | 谷歌公司 | Deep linking to multiple native apps |
| CN110297971A (en) * | 2019-05-30 | 2019-10-01 | 百度在线网络技术(北京)有限公司 | Personalized resource retrieval method, device, equipment and computer readable storage medium |
| CN110765356A (en) * | 2019-10-23 | 2020-02-07 | 绍兴柯桥浙工大创新研究院发展有限公司 | Industrial design man-machine data query system for retrieving and sorting according to user habits |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101158971A (en) * | 2007-11-15 | 2008-04-09 | 深圳市迅雷网络技术有限公司 | Method and device for sorting search results based on search engine |
| CN101233513A (en) * | 2005-07-29 | 2008-07-30 | 雅虎公司 | Systems and methods for reordering result sets |
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101233513A (en) * | 2005-07-29 | 2008-07-30 | 雅虎公司 | Systems and methods for reordering result sets |
| CN101158971A (en) * | 2007-11-15 | 2008-04-09 | 深圳市迅雷网络技术有限公司 | Method and device for sorting search results based on search engine |
Non-Patent Citations (1)
| Title |
|---|
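| Journal of Liming Vocational University (《黎明职业大学学报》) 20091231 Wen Yunhui (温云辉), "An implementation of multi-keyword search for related products" (多关键词查找相关产品的一种实现), pp. 27-30 1-22 vol. 4, no. 4 * |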
Cited By (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102214207A (en) * | 2011-04-27 | 2011-10-12 | 百度在线网络技术(北京)有限公司 | Method and equipment for sorting attribute sets in information entities |
| CN103365858A (en) * | 2012-03-28 | 2013-10-23 | 百度在线网络技术(北京)有限公司 | Method and device for acquiring searching results from multiple source devices and based on one inquiry sequence |
| CN103365858B (en) * | 2012-03-28 | 2017-11-03 | 百度在线网络技术(北京)有限公司 | The method and apparatus of search result is obtained by multiple source devices based on a search sequence |
| US9454613B2 (en) | 2012-05-07 | 2016-09-27 | Tencent Technology (Shenzhen) Company Limited | Method and server for searching information |
| CN103389974B (en) * | 2012-05-07 | 2017-12-08 | 深圳市世纪光速信息技术有限公司 | Carry out the method and server of information search |
| CN103389974A (en) * | 2012-05-07 | 2013-11-13 | 腾讯科技(深圳)有限公司 | Method and server for searching information |
| WO2013166916A1 (en) * | 2012-05-07 | 2013-11-14 | 深圳市世纪光速信息技术有限公司 | Information search method and server |
| CN102760156B (en) * | 2012-06-05 | 2016-01-13 | 百度在线网络技术(北京)有限公司 | A kind of for generating the method that release news, device and the equipment corresponding with keyword |
| CN102760156A (en) * | 2012-06-05 | 2012-10-31 | 百度在线网络技术(北京)有限公司 | Method, device and equipment used for generating release information corresponding to key words |
| CN102833594A (en) * | 2012-08-14 | 2012-12-19 | 中兴通讯股份有限公司 | Method, device and system for searching IPTV (internet protocol television) programs |
| CN102833594B (en) * | 2012-08-14 | 2017-11-24 | 中兴通讯股份有限公司 | A kind of network protocol television IPTV program searching methods, apparatus and system |
| CN103152621B (en) * | 2013-02-27 | 2016-03-23 | 四三九九网络股份有限公司 | Recommend the collocation method of video, display packing and player method |
| CN103152621A (en) * | 2013-02-27 | 2013-06-12 | 四三九九网络股份有限公司 | Configuration method, displaying method and playing method of recommended video |
| CN104090981B (en) * | 2014-07-24 | 2017-11-14 | 山东大学 | It is a kind of to PHP variable keyword fast searchs and content of interest method for pushing |
| CN104090981A (en) * | 2014-07-24 | 2014-10-08 | 山东大学 | Method for rapidly searching PHP variable keywords and pushing interested contents |
| CN107430615B (en) * | 2015-09-30 | 2021-02-02 | 谷歌有限责任公司 | Deep linking to multiple native applications |
| CN107430615A (en) * | 2015-09-30 | 2017-12-01 | 谷歌公司 | Deep linking to multiple native apps |
| CN106484766A (en) * | 2016-09-07 | 2017-03-08 | 北京百度网讯科技有限公司 | Searching method based on artificial intelligence and device |
| CN106484766B (en) * | 2016-09-07 | 2019-10-22 | 北京百度网讯科技有限公司 | Search method and device based on artificial intelligence |
| CN110297971A (en) * | 2019-05-30 | 2019-10-01 | 百度在线网络技术(北京)有限公司 | Personalized resource retrieval method, device, equipment and computer readable storage medium |
| CN110297971B (en) * | 2019-05-30 | 2022-09-20 | 百度在线网络技术(北京)有限公司 | Personalized resource retrieval method, device, equipment and computer readable storage medium |
| CN110765356A (en) * | 2019-10-23 | 2020-02-07 | 绍兴柯桥浙工大创新研究院发展有限公司 | Industrial design man-machine data query system for retrieving and sorting according to user habits |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN102004772A (en) | Method and equipment for sequencing search results according to terms | |
| CN105893609B (en) | A mobile APP recommendation method based on weighted mixture | |
| US8650198B2 (en) | Systems and methods for facilitating the gathering of open source intelligence | |
| US8620849B2 (en) | Systems and methods for facilitating open source intelligence gathering | |
| CN103914478B (en) | Webpage training method and system, webpage Forecasting Methodology and system | |
| US9864803B2 (en) | Method and system for multimodal clue based personalized app function recommendation | |
| CN108073568A (en) | keyword extracting method and device | |
| Chen | RETRACTED ARTICLE: Research on personalized recommendation algorithm based on user preference in mobile e-commerce | |
| CN102236710A (en) | Method and equipment for displaying news information in query result | |
| EP2307951A1 (en) | Method and apparatus for relating datasets by using semantic vectors and keyword analyses | |
| CN101639857A (en) | Method, device and system for establishing knowledge questioning and answering sharing platform | |
| US20170235836A1 (en) | Information identification and extraction | |
| CN110110225A (en) | Online education recommended models and construction method based on user behavior data analysis | |
| US10339191B2 (en) | Method of and a system for processing a search query | |
| US20170235835A1 (en) | Information identification and extraction | |
| US10698888B1 (en) | Answer facts from structured content | |
| WO2016137690A1 (en) | Efficient retrieval of fresh internet content | |
| JP2008171395A (en) | Analysis and processing method of request applied to search engine | |
| CN102231147A (en) | Method, equipment and system for displaying associational words in real time | |
| CN113569118A (en) | Self-media pushing method and device, computer equipment and storage medium | |
| Lee et al. | Web document classification using topic modeling based document ranking | |
| CN106874368B (en) | RTB bidding advertisement position value analysis method and system | |
| CN105117438A (en) | Information processing method and electronic equipment | |
| CN103399879A (en) | Method and device for obtaining interest entities based on user search logs | |
| Gali et al. | Extracting representative image from web page |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C12 | Rejection of a patent application after its publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20110406 |