CN103235796B - A search method and system based on user click behavior - Google Patents

A search method and system based on user click behavior

Info

Publication number
CN103235796B
Authority
CN
China
Prior art keywords
mouse
search result
search
mouse clicks
search results
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310117725.4A
Other languages
Chinese (zh)
Other versions
CN103235796A (en)
Inventor
程昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201310117725.4A
Publication of CN103235796A
Application granted
Publication of CN103235796B
Legal status: Expired - Fee Related (current)
Anticipated expiration

Landscapes

  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a search method based on user click behavior, comprising: obtaining search results according to a query word input by a user; counting the number of mouse clicks on each search result, where the number of mouse clicks equals the number of mouse clicks on the Uniform Resource Locator (URL) plus the number of mouse clicks on the summary text; and sorting the search results according to the number of mouse clicks and providing the sorted search results to the user. The present invention also provides a search system based on user click behavior. With the technical solution provided by the invention, valuable search results on the Internet can be obtained directly, simply and conveniently.

Description

A search method and system based on user click behavior

【Technical Field】

The present invention relates to the field of Internet applications, and in particular to a search method and system based on user click behavior.

【Background Art】

At present, before a search engine provides search results for a query word (query) input by a user, it sorts the search results in order to optimize them and improve the user experience. In the prior art, search results are generally sorted according to user click behavior: the more users select a search result, the better that result is taken to satisfy users' search needs, and the higher it is ranked among all search results. Currently, the weight of a search result is obtained from whether the user clicked the Uniform Resource Locator (URL) of the search result and whether that URL satisfied the user's search need; the search results are then sorted by weight, and only the sorted search results are pushed to the user.

However, besides the URL, a search result page also provides summary text that can directly satisfy the user's search need, for example the definition of a word, the answer to a question, or a specific picture shown directly on the search result page. For such search results the user can get a satisfactory answer without clicking the URL of the search result. If search results are sorted only by whether the user clicked the URL of a search result, satisfactory search results cannot be provided to the user: the user has to browse and filter the search result page to find a satisfactory result. This lowers search efficiency, places an unnecessary burden on the search engine, and degrades the user experience.

【Summary of the Invention】

The present invention provides a search method and system based on user click behavior, with which valuable search results on the Internet can be obtained directly, simply and conveniently.

The specific technical solution of the present invention is as follows:

According to a preferred embodiment of the present invention, a search method based on user click behavior comprises:

obtaining search results according to a query word input by a user;

counting the number of mouse clicks on each search result, where the number of mouse clicks equals the number of mouse clicks on the Uniform Resource Locator (URL) plus the number of mouse clicks on the summary text;

sorting the search results according to the number of mouse clicks, and providing the sorted search results to the user.

In the above method, the number of mouse clicks on the URL is obtained as follows:

obtaining from a database the mouse click log for a period of time, the mouse click log including the URLs clicked by users and the times at which the URLs were clicked;

counting the number of mouse clicks on the URL of each search result according to the URL of the search result and the mouse click log.

In the above method, the number of mouse clicks on the summary text is obtained as follows:

obtaining the summary text of the search result under the element id of the search result according to a preset summary text path;

counting the number of mouse clicks on the summary text according to the summary text and a preset mouse track log, the mouse track log being used to record the summary text clicked by the user's mouse.

In the above method, sorting the search results according to the number of mouse clicks specifically comprises:

finding, according to the number of mouse clicks, the search results that satisfy a preset adjustment condition, and moving those search results forward in the ranking;

the preset adjustment condition being that the number of mouse clicks on a search result is greater than the number of mouse clicks on the adjacent preceding search result, and the result type of the search result is a result type with rich summary content.

In the above method, sorting the search results according to the number of mouse clicks specifically comprises:

calculating the weight value of each search result with a weighting formula (not reproduced in this text), and sorting the search results in descending order of weight value; in the formula, Ψ_i is the weight value of search result i, C_i is the counted number of mouse clicks on search result i, S_i is the number of times search result i was the last search result clicked by the mouse, and α and β are weighting factors.

A search system based on user click behavior comprises a search unit, a statistics unit, a sorting unit and a push unit, wherein:

the search unit is configured to obtain search results according to a query word input by a user;

the statistics unit is configured to count the number of mouse clicks on each search result, where the number of mouse clicks equals the number of mouse clicks on the Uniform Resource Locator (URL) plus the number of mouse clicks on the summary text;

the sorting unit is configured to sort the search results according to the number of mouse clicks;

the push unit is configured to provide the sorted search results to the user.

In the above system, when obtaining the number of mouse clicks on the URL, the statistics unit specifically:

obtains from a database the mouse click log for a period of time, the mouse click log including the URLs clicked by users and the times at which the URLs were clicked;

counts the number of mouse clicks on the URL of each search result according to the URL of the search result and the mouse click log.

In the above system, when obtaining the number of mouse clicks on the summary text, the statistics unit specifically:

obtains the summary text of the search result under the element id of the search result according to a preset summary text path;

counts the number of mouse clicks on the summary text according to the summary text and a preset mouse track log, the mouse track log being used to record the summary text clicked by the user's mouse.

In the above system, when sorting the search results according to the number of mouse clicks, the sorting unit specifically:

finds, according to the number of mouse clicks, the search results that satisfy a preset adjustment condition, and moves those search results forward in the ranking;

the preset adjustment condition being that the number of mouse clicks on a search result is greater than the number of mouse clicks on the adjacent preceding search result, and the result type of the search result is a result type with rich summary content.

In the above system, when sorting the search results according to the number of mouse clicks, the sorting unit specifically:

calculates the weight value of each search result with a weighting formula (not reproduced in this text), and sorts the search results in descending order of weight value; in the formula, Ψ_i is the weight value of search result i, C_i is the counted number of mouse clicks on search result i, S_i is the number of times search result i was the last search result clicked by the mouse, and α and β are weighting factors.

As can be seen from the above technical solution, the present invention has the following beneficial effects:

Mouse clicks on the summary text of a search result are treated as valid data for judging whether the search result is valuable to the user, so valuable search results can be identified simply and conveniently. The ranking of valuable search results is adjusted according to mouse click behavior so that they are displayed first. Users can therefore obtain valuable search results on the Internet promptly and directly, which lowers the cost of obtaining information, improves search efficiency, and provides a good user experience.

【Description of the Drawings】

Fig. 1 is a schematic flowchart of a preferred embodiment of the search method based on user click behavior according to the present invention;

Fig. 2 is an example of displaying valuable search results first according to the present invention;

Fig. 3 is a schematic structural diagram of a preferred embodiment of the search system based on user click behavior according to the present invention.

【Detailed Description】

The basic idea of the present invention is: obtain search results according to a query word input by a user; count the number of mouse clicks on each search result, where the number of mouse clicks equals the number of mouse clicks on the Uniform Resource Locator (URL) plus the number of mouse clicks on the summary text; sort the search results according to the number of mouse clicks, and provide the sorted search results to the user.

To make the objectives, technical solutions and advantages of the present invention clearer, the present invention is described in detail below with reference to the accompanying drawings and specific embodiments.

The present invention provides a search method based on user click behavior. Fig. 1 is a schematic flowchart of a preferred embodiment of this method. As shown in Fig. 1, the preferred embodiment comprises the following steps:

Step 101: receive the query input by the user and obtain the search results for the query.

Specifically, the search engine receives the query input by the user and obtains the search results for the query in their default order.

Step 102: count the number of mouse clicks on the search results, where the number of mouse clicks equals the number of mouse clicks on the URL plus the number of mouse clicks on the summary text.

Specifically, for the search results obtained for the query, the number of mouse clicks is counted for each search result; this number equals the number of mouse clicks on the URL plus the number of mouse clicks on the summary text.
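The per-result total described in step 102 can be sketched as follows. This is a minimal illustration, assuming the two counts are already available as dictionaries keyed by a result identifier; the function name `total_clicks` and the example keys are hypothetical and do not come from the patent.

```python
def total_clicks(url_clicks, summary_clicks):
    """Per-result total = clicks on the URL + clicks on the summary text."""
    results = set(url_clicks) | set(summary_clicks)
    return {r: url_clicks.get(r, 0) + summary_clicks.get(r, 0) for r in results}


# Hypothetical counts for two results of the same query.
url_clicks = {"dict.example/doll": 3, "news.example/doll": 7}
summary_clicks = {"dict.example/doll": 6}

totals = total_clicks(url_clicks, summary_clicks)
```

A result that is never clicked on its URL but often clicked on its summary text still accumulates a count this way, which is the point of adding the second term.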

In this preferred embodiment, the number of mouse clicks on the URL is counted as follows. The search engine can obtain in advance from a database the mouse click log for a period of time; the mouse click log includes the user identifier, the clicked URL, and the time of the click. The number of mouse clicks on the URL of each search result is then counted from the URL of the search result and the mouse click log. The period of time can be configured as required, for example one day, one week, or one month.
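As a rough sketch of this counting step, assume each log entry is a `(user_id, url, clicked_at)` tuple and that the configurable period is expressed as a `timedelta`. The record layout, the function name `count_url_clicks`, and the sample data are assumptions made for illustration only.

```python
from collections import Counter
from datetime import datetime, timedelta


def count_url_clicks(click_log, result_urls, now, window=timedelta(days=7)):
    """Count clicks on each result URL within the last `window` of time."""
    wanted = set(result_urls)
    start = now - window
    counts = Counter({url: 0 for url in wanted})  # keep zero entries visible
    for _user_id, url, clicked_at in click_log:
        if url in wanted and start <= clicked_at <= now:
            counts[url] += 1
    return counts


# Hypothetical log: "b" was clicked outside the window, "c" is not a result.
log = [
    ("u1", "a", datetime(2013, 4, 1)),
    ("u2", "a", datetime(2013, 4, 5)),
    ("u1", "b", datetime(2013, 3, 1)),
    ("u3", "c", datetime(2013, 4, 6)),
]
counts = count_url_clicks(log, ["a", "b"], now=datetime(2013, 4, 7))
```

Changing `window` to `timedelta(days=1)` or `timedelta(days=30)` corresponds to the one-day and one-month configurations mentioned in the text.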

In this preferred embodiment, the number of mouse clicks on the summary text is counted as follows. Every component element on the search result page of a query has a unique path; for example, the URL and the summary text of a search result each have a corresponding unique path. Each search result on the search result page also has a corresponding element id, for example:

Here, id="1" indicates that the element id of the first search result on the search result page is 1. Specific elements of that search result can be found under this element id, for example:

Following the preset summary text path 1→tbody→tr→td→h3, the summary text <font size="-1">…</font> of the search result can be found.
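The lookup by element id and preset path can be illustrated with Python's standard `xml.etree.ElementTree`. The markup below is entirely hypothetical (the patent's own page snippet is not available here) and is written as well-formed XML, which `ElementTree` requires; a real result page would normally need an HTML parser instead. The names `PAGE`, `SUMMARY_PATH` and `summary_text` are illustrative.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified result page; not taken from the patent.
PAGE = """<div>
  <table id="1"><tbody><tr><td>
    <h3><font size="-1">doll: a small model of a human figure</font></h3>
  </td></tr></tbody></table>
  <table id="2"><tbody><tr><td>
    <h3><font size="-1">another result</font></h3>
  </td></tr></tbody></table>
</div>"""

# Preset path of the summary text, relative to the element carrying the id.
SUMMARY_PATH = "tbody/tr/td/h3"


def summary_text(page_root, element_id, path=SUMMARY_PATH):
    """Return the summary text found under the result with `element_id`."""
    result = page_root.find(f".//*[@id='{element_id}']")
    if result is None:
        return None
    node = result.find(path)
    if node is None:
        return None
    return "".join(node.itertext()).strip()


root = ET.fromstring(PAGE)
first_summary = summary_text(root, "1")
missing_summary = summary_text(root, "9")
```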

Preset mouse script code records, in real time, the actions of the user's mouse on the summary text and the times at which they occur, and saves the correspondence between summary text, mouse click action and time in the mouse track log. Once the summary text of a search result has been obtained by the method above, the number of mouse clicks on that summary text is counted from the summary text and the mouse track log.

For example, if the summary text of a search result is found in a mouse track log and the action recorded for that summary text is a mouse click action (mousedown), the summary text has been clicked once.
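A minimal sketch of this count, assuming each mouse track log entry is a `(summary_text, action, timestamp)` tuple and that only `mousedown` actions count as clicks, as in the example above. The tuple layout, the function name `count_summary_clicks`, and the sample entries are assumptions.

```python
from collections import Counter


def count_summary_clicks(track_log, summaries):
    """Count mousedown events recorded on each known summary text."""
    wanted = set(summaries)
    counts = Counter({text: 0 for text in wanted})
    for text, action, _timestamp in track_log:
        if action == "mousedown" and text in wanted:
            counts[text] += 1
    return counts


# Hypothetical track log: moves over the text are recorded but not counted.
track_log = [
    ("doll: a small model", "mousemove", 1),
    ("doll: a small model", "mousedown", 2),
    ("doll: a small model", "mousedown", 9),
    ("unrelated text", "mousedown", 4),
]
summary_counts = count_summary_clicks(track_log, ["doll: a small model", "never clicked"])
```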

Step 103: sort the search results according to the number of mouse clicks, and provide the sorted search results to the user.

Specifically, after the number of mouse clicks on each search result has been counted, the search results that satisfy a preset adjustment condition are found according to the number of mouse clicks and moved forward in the ranking, and the sorted search results are then pushed to the user. In this preferred embodiment, the preset adjustment condition is that the number of mouse clicks on a search result is greater than the number of mouse clicks on the adjacent preceding search result, and the result type of the search result is a result type with rich summary content.

The search results that satisfy the preset adjustment condition are found as follows. The search results are traversed and the numbers of mouse clicks on each pair of adjacent search results are compared. If the latter of two adjacent search results has more mouse clicks than the former, it is further determined whether the result type of the latter search result is a result type with rich summary content. If so, the latter search result satisfies the preset adjustment condition and its ranking position is swapped with that of the former search result; otherwise, the latter search result does not satisfy the preset adjustment condition and the traversal continues with the other search results.

Whether the result type of the latter search result is a result type with rich summary content is determined as follows: a correspondence between result types with rich summary content and main domain names is configured in advance, and the result type of each search result of the query is judged from its main domain name. The result types with rich summary content differ between search engines; in general they include dictionaries, question-and-answer pages, maps and the like. Correspondingly, the main domain name of a dictionary is dict, that of a question-and-answer page is zhidao, and that of a map is map.

In other words, if a search result has a large number of mouse clicks but a low ranking position, the search result ranked just ahead of it has fewer mouse clicks, and the search result also has rich summary content, then the search result should be moved forward, so that search results with many mouse clicks and rich summary content are ranked near the top.
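The traversal and swap described above can be sketched as a single left-to-right pass. Two points are assumptions rather than statements of the patent: that one pass is made, so a result moves up at most one position per pass, and that a "main domain name" such as dict is matched against any label of the URL's host name. The host names and helper names are hypothetical.

```python
from urllib.parse import urlparse

# Preconfigured main domain names of result types with rich summary content.
RICH_SUMMARY_LABELS = {"dict", "zhidao", "map"}


def is_rich_summary(url):
    """Assumed check: some label of the host name is a configured main domain."""
    host = urlparse(url).hostname or ""
    return any(label in RICH_SUMMARY_LABELS for label in host.split("."))


def promote(results, clicks):
    """One pass: swap a result with its predecessor when the condition holds."""
    ranked = list(results)
    for i in range(1, len(ranked)):
        prev, cur = ranked[i - 1], ranked[i]
        if clicks.get(cur, 0) > clicks.get(prev, 0) and is_rich_summary(cur):
            ranked[i - 1], ranked[i] = cur, prev
    return ranked


# Hypothetical default ranking and click counts for the query "doll".
default_order = [
    "http://news.example.com/doll",
    "http://dict.example.com/doll",
    "http://shop.example.com/doll",
]
clicks = {default_order[0]: 4, default_order[1]: 9, default_order[2]: 12}
adjusted = promote(default_order, clicks)
```

The shop result has the most clicks but is not a rich-summary type, so only the dictionary result is promoted.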

For example, when the query entered in the input box is doll, the search result page shown in Fig. 2 is obtained. The dictionary search result is moved forward according to its number of mouse clicks, so the user can get a satisfactory answer from the summary text on the first search result page, without browsing later search result pages or clicking the URL of a search result.

Alternatively, after the number of mouse clicks on each search result has been counted, a weight value is calculated for each search result from its number of mouse clicks, the search results are sorted in descending order of weight value, and the sorted search results are provided to the user.

In this preferred embodiment, the weight value of a search result can be calculated with a weighting formula (not reproduced in this text).

In the formula, Ψ_i is the weight value of search result i, C_i is the counted number of mouse clicks on search result i, and S_i is the number of times search result i was the last search result clicked by the mouse. Here, the last search result clicked refers only to clicks on the URL of a search result and does not include clicks on its summary text. α and β are preset weighting factors; preferably, α equals 0.5 and β equals 0.82. The mouse click log records all URLs clicked by a user under a given query, so the URL last clicked by the user under each query can be obtained from the log, and the number of times S_i that search result i was the last search result clicked can therefore be counted from the mouse click log.
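The sketch below shows how S_i could be derived from a click log and how results could then be ordered by weight. Because the weighting formula itself is not available here, the weight function is passed in as a parameter; `placeholder_weight` is only a stand-in linear combination chosen for illustration and is not the patent's formula. The log layout `(user_id, query, url, clicked_at)` is likewise an assumption.

```python
from collections import Counter

ALPHA, BETA = 0.5, 0.82  # preferred weighting factors stated in the text


def last_click_counts(click_log):
    """S_i: how often each URL was the last one clicked for a (user, query)."""
    latest = {}
    for user_id, query, url, clicked_at in click_log:
        key = (user_id, query)
        if key not in latest or clicked_at >= latest[key][0]:
            latest[key] = (clicked_at, url)
    return Counter(url for _, url in latest.values())


def rank_by_weight(results, clicks, last_clicks, weight_fn):
    """Sort results in descending order of weight_fn(C_i, S_i)."""
    return sorted(
        results,
        key=lambda r: weight_fn(clicks.get(r, 0), last_clicks.get(r, 0)),
        reverse=True,
    )


def placeholder_weight(c, s):
    # Stand-in only: NOT the patent's formula, which is not reproduced here.
    return ALPHA * c + BETA * s


# Hypothetical log with integer timestamps.
log = [
    ("u1", "doll", "a", 1),
    ("u1", "doll", "b", 2),
    ("u2", "doll", "b", 1),
    ("u3", "doll", "a", 5),
]
s_counts = last_click_counts(log)
ranked = rank_by_weight(["a", "b", "c"], {"a": 4, "b": 3, "c": 5}, s_counts, placeholder_weight)
```

Keeping the weight function as a parameter means the real formula can be substituted without touching the counting or sorting code.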

In this preferred embodiment, to improve search efficiency, only the top-ranked search results in the default order may be adjusted. For example, weight values may be calculated from the number of mouse clicks only for the top 10 search results in the default order, and those 10 search results re-sorted by the calculated weight values while the order of the other search results stays unchanged. Alternatively, only the top 10 search results are checked for results that satisfy the preset adjustment condition, and the other search results are not checked. The number of search results to adjust can be configured dynamically as required.
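Restricting the adjustment to the head of the list can be sketched as below, assuming `score` is any callable that maps a result to its weight value; the function name `rerank_top` and the sample scores are hypothetical.

```python
def rerank_top(results, score, top_n=10):
    """Re-sort only the first `top_n` results; leave the tail in default order."""
    head = sorted(results[:top_n], key=score, reverse=True)
    return head + list(results[top_n:])


# Hypothetical scores; only the first three positions are re-sorted.
results = ["a", "b", "c", "d", "e"]
scores = {"a": 1, "b": 5, "c": 3, "d": 9, "e": 7}
reranked = rerank_top(results, scores.get, top_n=3)
```

Here `d` and `e` keep their default positions even though they score highest, because they fall outside the configured head.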

To implement the above method, the present invention also provides a search system based on user click behavior. Fig. 3 is a schematic structural diagram of a preferred embodiment of this system. As shown in Fig. 3, the system comprises a search unit 301, a statistics unit 302, a sorting unit 303 and a push unit 304, wherein:

the search unit 301 is configured to obtain search results according to a query word input by a user;

the statistics unit 302 is configured to count the number of mouse clicks on each search result, where the number of mouse clicks equals the number of mouse clicks on the Uniform Resource Locator (URL) plus the number of mouse clicks on the summary text;

the sorting unit 303 is configured to sort the search results according to the number of mouse clicks;

the push unit 304 is configured to provide the sorted search results to the user.
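One way to picture how the four units cooperate is a small pipeline object in which each unit is a pluggable callable. This is only an illustrative sketch: the class name `ClickSearchSystem`, its field names, and the stub units in the usage example are assumptions, not the patent's implementation.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class ClickSearchSystem:
    search: Callable[[str], List[str]]                       # search unit 301
    count_clicks: Callable[[List[str]], Dict[str, int]]      # statistics unit 302
    sort: Callable[[List[str], Dict[str, int]], List[str]]   # sorting unit 303
    push: Callable[[List[str]], None]                        # push unit 304

    def handle(self, query: str) -> List[str]:
        results = self.search(query)
        clicks = self.count_clicks(results)
        ranked = self.sort(results, clicks)
        self.push(ranked)
        return ranked


# Stub units standing in for real search, log analysis and delivery.
delivered: List[str] = []
system = ClickSearchSystem(
    search=lambda query: ["x", "y"],
    count_clicks=lambda results: {"x": 1, "y": 4},
    sort=lambda results, clicks: sorted(results, key=lambda r: clicks[r], reverse=True),
    push=delivered.extend,
)
ranked = system.handle("doll")
```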

When obtaining the number of mouse clicks on the URL, the statistics unit 302 specifically: obtains from a database the mouse click log for a period of time, the mouse click log including the URLs clicked by users and the times at which the URLs were clicked; and counts the number of mouse clicks on the URL of each search result according to the URL of the search result and the mouse click log.

When obtaining the number of mouse clicks on the summary text, the statistics unit 302 specifically: obtains the summary text of the search result under the element id of the search result according to a preset summary text path; and counts the number of mouse clicks on the summary text according to the summary text and a preset mouse track log, the mouse track log being used to record the summary text clicked by the user's mouse.

When sorting the search results according to the number of mouse clicks, the sorting unit 303 specifically: finds, according to the number of mouse clicks, the search results that satisfy a preset adjustment condition, and moves those search results forward in the ranking; the preset adjustment condition being that the number of mouse clicks on a search result is greater than the number of mouse clicks on the adjacent preceding search result, and the result type of the search result is a result type with rich summary content.

Alternatively, when sorting the search results according to the number of mouse clicks, the sorting unit 303 specifically: calculates the weight value of each search result with a weighting formula (not reproduced in this text), and sorts the search results in descending order of weight value; in the formula, Ψ_i is the weight value of search result i, C_i is the counted number of mouse clicks on search result i, S_i is the number of times search result i was the last search result clicked by the mouse, and α and β are weighting factors.

In the technical solution of the preferred embodiment of the present invention, mouse clicks on the summary text of a search result are treated as valid data for judging whether the search result is valuable to the user, so valuable search results can be identified simply and conveniently. The ranking of valuable search results is adjusted according to mouse click behavior so that they are displayed first. Users can obtain valuable search results on the Internet promptly and directly, without browsing further search result pages or clicking the URL of any search result, which lowers the cost of obtaining information, improves search efficiency, and provides a good user experience.

The above are only preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (6)

1. A search method based on user click behavior, characterized in that the method comprises:
obtaining search results according to a query word input by a user;
counting the number of mouse clicks for each search result, the number of mouse clicks being equal to the number of mouse clicks on the Uniform Resource Locator (URL) plus the number of mouse clicks on the summary text;
wherein the number of mouse clicks on the summary text is obtained by: obtaining the summary text of the search result under the element id of the search result according to a preset summary-text path; and counting, according to the summary text and a preset mouse track log, the number of mouse clicks on the summary text, the mouse track log being used to record the summary text clicked by the user's mouse;
sorting the search results according to the number of mouse clicks,
wherein sorting the search results according to the number of mouse clicks specifically comprises: finding, according to the number of mouse clicks, search results that satisfy a preset adjustment condition, and moving the search results that satisfy the preset adjustment condition forward in the ranking; the preset adjustment condition being that the number of mouse clicks of the search result is greater than the number of mouse clicks of the adjacent preceding search result and the result type of the search result is a result type with rich summary content; and
providing the sorted search results to the user.

2. The method according to claim 1, characterized in that the number of mouse clicks on the URL is obtained by:
obtaining from a database the mouse click log for a period of time, the mouse click log including the URLs clicked by users and the times at which the URLs were clicked; and
counting, according to the URL of the search result and the mouse click log, the number of mouse clicks on the URL of the search result.

3. The method according to claim 1, characterized in that sorting the search results according to the number of mouse clicks specifically comprises:
calculating a weight value for each search result using [formula omitted in source], and sorting the search results in descending order of weight value; where Ψi is the weight value of search result i, Ci is the counted number of mouse clicks of search result i, Si is the number of times search result i was the last search result clicked by the mouse, and α and β are weighting factors.

4. A search system based on user click behavior, characterized in that the system comprises a search unit, a statistics unit, a sorting unit, and a push unit, wherein:
the search unit is configured to obtain search results according to a query word input by a user;
the statistics unit is configured to count the number of mouse clicks for each search result, the number of mouse clicks being equal to the number of mouse clicks on the Uniform Resource Locator (URL) plus the number of mouse clicks on the summary text;
wherein, when obtaining the number of mouse clicks on the summary text, the statistics unit specifically: obtains the summary text of the search result under the element id of the search result according to a preset summary-text path; and counts, according to the summary text and a preset mouse track log, the number of mouse clicks on the summary text, the mouse track log being used to record the summary text clicked by the user's mouse;
the sorting unit is configured to sort the search results according to the number of mouse clicks,
wherein, when sorting the search results according to the number of mouse clicks, the sorting unit specifically: finds, according to the number of mouse clicks, search results that satisfy a preset adjustment condition, and moves the search results that satisfy the preset adjustment condition forward in the ranking; the preset adjustment condition being that the number of mouse clicks of the search result is greater than the number of mouse clicks of the adjacent preceding search result and the result type of the search result is a result type with rich summary content; and
the push unit is configured to provide the sorted search results to the user.

5. The system according to claim 4, characterized in that, when obtaining the number of mouse clicks on the URL, the statistics unit specifically:
obtains from a database the mouse click log for a period of time, the mouse click log including the URLs clicked by users and the times at which the URLs were clicked; and
counts, according to the URL of the search result and the mouse click log, the number of mouse clicks on the URL of the search result.

6. The system according to claim 4, characterized in that, when sorting the search results according to the number of mouse clicks, the sorting unit specifically:
calculates a weight value for each search result using [formula omitted in source], and sorts the search results in descending order of weight value; where Ψi is the weight value of search result i, Ci is the counted number of mouse clicks of search result i, Si is the number of times search result i was the last search result clicked by the mouse, and α and β are weighting factors.
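
For illustration only, the counting and adjustment steps recited in claims 1 and 4 might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the `Result` class, the `rich_summary` flag, the two log lists, and the choice to move a qualifying result forward by exactly one position in a single pass are all hypothetical, since the claims do not say how far a result is moved or how the logs are stored.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Result:
    url: str
    summary: str
    rich_summary: bool  # hypothetical flag: result type has rich summary content


def count_clicks(results, url_click_log, summary_click_log):
    """Clicks per result = URL clicks + summary-text clicks (claim 1).

    Both logs are assumed to be plain lists: one clicked URL or one
    clicked summary text per entry (timestamps are left out here).
    """
    url_hits = Counter(url_click_log)
    summary_hits = Counter(summary_click_log)
    return [url_hits[r.url] + summary_hits[r.summary] for r in results]


def promote(results, clicks):
    """Single pass over the ranking; a result that has more clicks than its
    immediate predecessor and a rich summary swaps places with it.
    Moving by one position is an assumption of this sketch."""
    ranked = list(zip(results, clicks))
    for i in range(1, len(ranked)):
        (res, c), (_, prev_c) = ranked[i], ranked[i - 1]
        if c > prev_c and res.rich_summary:
            ranked[i - 1], ranked[i] = ranked[i], ranked[i - 1]
    return [r for r, _ in ranked]


# Made-up sample data, in the initial ranking order a, b, c.
results = [
    Result("a", "summary a", False),
    Result("b", "summary b", True),
    Result("c", "summary c", False),
]
url_click_log = ["a", "b", "b"]
summary_click_log = ["summary b", "summary c", "summary c", "summary c"]

clicks = count_clicks(results, url_click_log, summary_click_log)
ranked = promote(results, clicks)
print(clicks)                    # [1, 3, 3]
print([r.url for r in ranked])   # ['b', 'a', 'c']
```

In this sample, result b overtakes a because it has more clicks and a rich summary, while c stays in place despite its click count because its result type is not flagged as having rich summary content.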
CN201310117725.4A 2013-04-07 2013-04-07 A search method and system based on user click behavior Expired - Fee Related CN103235796B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310117725.4A CN103235796B (en) 2013-04-07 2013-04-07 A search method and system based on user click behavior

Publications (2)

Publication Number Publication Date
CN103235796A CN103235796A (en) 2013-08-07
CN103235796B true CN103235796B (en) 2019-12-24

Family

ID=48883837

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310117725.4A Expired - Fee Related CN103235796B (en) 2013-04-07 2013-04-07 A search method and system based on user click behavior

Country Status (1)

Country Link
CN (1) CN103235796B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9626439B2 (en) * 2013-05-31 2017-04-18 Xilopix Method for searching in a database
CN105224567B (en) * 2014-06-26 2020-08-14 腾讯科技(深圳)有限公司 Keyword searching method and device
US11636120B2 (en) * 2014-11-21 2023-04-25 Microsoft Technology Licensing, Llc Offline evaluation of ranking functions
CN104657415A (en) * 2014-11-28 2015-05-27 百度在线网络技术(北京)有限公司 Method and device for displaying click result pages
CN105683966A (en) * 2016-01-30 2016-06-15 深圳市博信诺达经贸咨询有限公司 Searching method and searching system based on big data
CN106528892A (en) * 2016-12-23 2017-03-22 北京奇虎科技有限公司 Searching-based data recommendation method and device
CN106777201B (en) * 2016-12-23 2021-01-08 北京奇元科技有限公司 Method and device for sorting recommended data on search result page
CN107450808B (en) * 2017-09-22 2020-09-01 北京知道创宇信息技术股份有限公司 Mouse pointer positioning method of browser and computing device
CN107977452A (en) * 2017-12-15 2018-05-01 金陵科技学院 A kind of information retrieval system and method based on big data
CN109034867B (en) * 2018-06-21 2022-10-25 腾讯科技(深圳)有限公司 Click traffic detection method and device and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN1963816A (en) * 2006-12-01 2007-05-16 清华大学 Automatization processing method of rating of merit of search engine
CN101263493A (en) * 2005-06-20 2008-09-10 谷歌公司 System and method for providing search results
CN101828185A (en) * 2007-10-18 2010-09-08 微软公司 Ranking and providing search results based in part on a number of click-through features

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2438540A1 (en) * 2009-06-01 2012-04-11 AOL Inc. Providing suggested web search queries based on click data of stored search queries

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN101263493A (en) * 2005-06-20 2008-09-10 谷歌公司 System and method for providing search results
CN1963816A (en) * 2006-12-01 2007-05-16 清华大学 Automatization processing method of rating of merit of search engine
CN101828185A (en) * 2007-10-18 2010-09-08 微软公司 Ranking and providing search results based in part on a number of click-through features

Also Published As

Publication number Publication date
CN103235796A (en) 2013-08-07

Similar Documents

Publication Publication Date Title
CN103235796B (en) A search method and system based on user click behavior
CN100440224C (en) An automatic processing method for search engine performance evaluation
CN102426610B (en) Microblog rank searching method and microblog searching engine
US8688667B1 (en) Providing intent sensitive search results
AU2009276354B2 (en) Providing posts to discussion threads in response to a search query
CN103186574B (en) Method and device for generating search results
CN102663054B (en) A kind of method and device determining weight of website
CN101986297B (en) Accessibility web browsing method based on linkage cluster
CN104268142B (en) Based on the Meta Search Engine result ordering method for being rejected by strategy
WO2009000174A1 (en) Method and device of web page rank
US10685073B1 (en) Selecting textual representations for entity attribute values
US11789946B2 (en) Answer facts from structured content
CN102932206A (en) Method and system for monitoring website access information
US10417334B2 (en) Systems and methods for providing a microdocument framework for storage, retrieval, and aggregation
CN102880647A (en) Method and device for acquiring another name of organization
CN103559203A (en) Method, device and system for web page sorting
CN103020141B (en) It is a kind of for the method and apparatus of Search Results is provided
CN103049497A (en) Method and device for website navigation
CN107977452A (en) A kind of information retrieval system and method based on big data
CN104008213B (en) A kind of more new discovery of info web and the method and apparatus of statistics
CN104794209B (en) Chinese microblogging mood sorting technique based on Markov logical network and system
CN105808605B (en) A search log merging method and system
CN102945272B (en) The treatment process of Information on Collection, equipment and server
CN103714093A (en) Method and device for mining key pages of website
CN103646034B (en) One kind is based on content believable Web search automotive engine system and searching method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20191224
