CN103455524B - Method and device for displaying and acquiring entry information - Google Patents
Method and device for displaying and acquiring entry information Download PDFInfo
- Publication number
- CN103455524B CN103455524B CN201210183870.8A CN201210183870A CN103455524B CN 103455524 B CN103455524 B CN 103455524B CN 201210183870 A CN201210183870 A CN 201210183870A CN 103455524 B CN103455524 B CN 103455524B
- Authority
- CN
- China
- Prior art keywords
- information
- entry
- page
- category
- entry information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 74
- 230000008569 process Effects 0.000 claims abstract description 13
- 230000011218 segmentation Effects 0.000 claims description 13
- 238000012545 processing Methods 0.000 claims description 11
- 235000012813 breadcrumbs Nutrition 0.000 claims description 6
- 238000010586 diagram Methods 0.000 description 12
- 235000013305 food Nutrition 0.000 description 9
- 235000015241 bacon Nutrition 0.000 description 8
- 238000004590 computer program Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 5
- 235000013622 meat product Nutrition 0.000 description 5
- 238000010422 painting Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 230000003203 everyday effect Effects 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- 235000013372 meat Nutrition 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 230000001755 vocal effect Effects 0.000 description 2
- 239000002699 waste material Substances 0.000 description 2
- 241000167854 Bourreria succulenta Species 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 241000282312 Proteles Species 0.000 description 1
- 235000009776 Rathbunia alamosensis Nutrition 0.000 description 1
- 244000097202 Rathbunia alamosensis Species 0.000 description 1
- 240000005572 Syzygium cordatum Species 0.000 description 1
- 235000006650 Syzygium cordatum Nutrition 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 210000001520 comb Anatomy 0.000 description 1
- 230000002354 daily effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 239000013049 sediment Substances 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 230000009182 swimming Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Landscapes
- User Interface Of Digital Computer (AREA)
- Information Transfer Between Computers (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The application provides a method and a device for displaying and acquiring entry information, wherein the method for displaying the entry information specifically comprises the following steps: sending the page information of the current browsing page; receiving entry information of encyclopedia entries corresponding to the current browsing page; the term information is returned after analyzing and selecting each term information when the term information corresponding to the encyclopedic term is more than one term; and displaying the entry information. According to the method and the device, the entry information related to the currently browsed page can be automatically displayed in the process that a user browses the webpage by using the browser, and the information access efficiency of the browser can be improved.
Description
Technical Field
The present application relates to the field of network technologies, and in particular, to a method and an apparatus for displaying and acquiring entry information.
Background
At present, with the popularization of the internet, a great amount of news, events and other messages are rapidly spread through the network every day, and the spreading enthusiasm and the sharing degree of various information of people reach unprecedented height. In order to facilitate the continuous accumulation of sediment and the propagation of civilization, the method of recording encyclopedia entries through an electronic big encyclopedia is developed. People can sort and accumulate the existing knowledge or the intellectual achievement just formed, or edit and perfect the related knowledge theme of the interested field by the knowledge ability of the people. The continuously perfected electronic encyclopedia not only effectively combs and stores a knowledge system, but also is beneficial to retrieval or reference of knowledge.
For example, when a user opens a browser to read a news item, if the user encounters a new or unknown word, the user needs to go to an electronic large encyclopedia to search for a corresponding encyclopedia definition; a typical search in the prior art is a search box + keyword search, and generally, a user needs to open a web page of an electronic large encyclopedia, input an unfamiliar word in a search box of the web page, and obtain an encyclopedia meaning corresponding to the unfamiliar word by browsing a search result. Therefore, when more than one strange word exists in one news, especially when the strange word has a plurality of entry information, multiple times of retrieval are needed, and the entry information is analyzed and judged, so that a large amount of resources are consumed, and the information access efficiency of the browser is influenced.
In summary, one of the technical problems that needs to be urgently solved by those skilled in the art is: how the information access efficiency of the browser can be improved.
Disclosure of Invention
The technical problem to be solved by the application is to provide a method and a device for displaying and acquiring entry information, which can automatically display the entry information related to a currently browsed page in the process that a user browses a webpage by using a browser, and can improve the information access efficiency of the browser.
In order to solve the above problem, the present application discloses a method for acquiring entry information, including:
analyzing page information of a currently browsed page to obtain corresponding encyclopedia entries;
retrieving according to the encyclopedic entry to obtain corresponding entry information;
when the entry information corresponding to the encyclopedic entry is more than one item, analyzing the entry information and selecting the entry information corresponding to one item;
and returning the selected entry information.
Preferably, the step of analyzing the entry information and selecting the entry information further includes:
and selecting one item of entry information from the entry information according to the category of the current browsing page and/or the category of the user information of the current user.
Preferably, the method further comprises:
obtaining entry information types of entry information of the encyclopedic entries;
the step of selecting an item of entry information from the entry information according to the category of the currently browsed page and/or the user information category of the current user specifically is to select an item of entry information of which the entry information category is most matched with the category of the currently browsed page and/or the user information category of the current user from more than one item of entry information of the encyclopedic entry.
Preferably, the method further comprises:
and analyzing the page information of the current browsing page to obtain the category of the corresponding current browsing page.
Preferably, the method further comprises:
and analyzing the use history information of the current user to obtain the corresponding user information category.
Preferably, the step of analyzing the page information of the current browsing page to obtain the category of the corresponding current browsing page includes:
analyzing the uniform resource locator information of the current browsing page to obtain the category of the corresponding current browsing page; and/or
Analyzing the breadcrumbs of the current browsed page, and navigating to obtain the corresponding category of the current browsed page; and/or
And analyzing the weight of the encyclopedic entry in the current browsing page in each set type respectively to obtain the total weight of each set type of the current browsing page, and taking the set type with the maximum total weight as the type of the current browsing page.
Preferably, the usage history information includes: the browser of the current user accesses the history and/or inputs the history.
Preferably, the step of analyzing the usage history information of the current user to obtain a corresponding user information category includes:
analyzing and acquiring the page category of the corresponding page in the browser access history record of the current user, and taking the page category with the highest frequency as the user information category of the current user; and/or
Analyzing and counting the input history of the current user to obtain the vocabulary category corresponding to the input history, and taking the vocabulary category with the highest frequency as the user information category of the current user.
Preferably, the page information of the current browsing page includes content of the current browsing page;
the step of analyzing the page information of the current browsing page to obtain the corresponding encyclopedia entry comprises the following steps:
performing word segmentation processing on the content of the current browsing page to obtain corresponding entries;
analyzing and matching the entries with a word bank stored at a server side;
and taking the successfully matched corresponding entry as an encyclopedic entry.
Preferably, the method further comprises:
counting the frequency of each entry appearing in the current browsing page;
the step of using the successfully matched corresponding entry as an encyclopedic entry specifically includes using the successfully matched corresponding entry whose statistical frequency exceeds a predetermined frequency threshold as an encyclopedic entry.
On the other hand, the application also discloses a method for showing entry information, which comprises the following steps:
sending the page information of the current browsing page;
receiving entry information of encyclopedia entries corresponding to the current browsing page; when the entry information corresponding to the encyclopedic entry is more than one item, the entry information is returned after being analyzed and selected;
and displaying the entry information.
Preferably, when the entry information is displayed, the entry information is displayed in a pop-up window mode.
Preferably, when the entry information is displayed, the length of the entry information is estimated, and the entry information is displayed by combining an estimation result.
On the other hand, the application also discloses a device for acquiring entry information, which comprises:
the encyclopedia entry acquisition module is used for analyzing the page information of the current browsed page to obtain corresponding encyclopedia entries;
the encyclopedic entry retrieval module is used for retrieving corresponding entry information according to the encyclopedic entries;
the information selection module is used for analyzing all entry information and selecting corresponding entry information when the entry information corresponding to the encyclopedic entry is more than one entry; and
and the returning module is used for returning the selected entry information.
Preferably, the information selection module is specifically configured to select entry information from the entry information according to the category of the currently browsed page and/or the category of user information of the current user.
Preferably, the apparatus further comprises:
the entry information category acquisition module is used for acquiring entry information categories of entry information of the encyclopedic entries;
the information selection module is specifically configured to select, from the one or more items of entry information of the encyclopedic entry, one item of entry information whose entry information category is most matched with the category of the currently browsed page and/or the user information category of the current user.
On the other hand, the application also discloses a device for showing entry information, which comprises:
the sending module is used for sending the page information of the current browsing page;
the receiving module is used for receiving entry information of encyclopedia entries corresponding to the current browsing page; when the entry information corresponding to the encyclopedic entry is more than one item, the entry information is returned after being analyzed and selected; and
and the display module is used for displaying the entry information.
Preferably, the display module is specifically configured to display the entry information in a pop-up window manner.
Compared with the prior art, the method has the following advantages:
when a user encounters strange or unknown words and the like in the process of browsing a webpage by using a browser, encyclopedic entries are used for representing some key words appearing in the current browsing page, and entry information of the encyclopedic entries corresponding to the current browsing page is automatically displayed at a browser client; for a user, corresponding entry information can be directly obtained without opening a search box and inputting keywords for retrieval; therefore, the method and the device for displaying the current browsing page provide the entry information of the corresponding encyclopedia entry in the page while displaying the current browsing page, enrich the information content provided for the user, and improve the information access efficiency of the browser.
In addition, the method and the device can also select one item of entry information from more than one item of entry information of the encyclopedic entries according to the category of the current browsing page and/or the category of the user information of the current user when the entry information corresponding to the encyclopedic entries is more than one item; the selection can select one item of entry information most relevant to the current browsing page and/or the current user from more than one item of entry information of encyclopedic entries, so that the entry information most close to the user requirement can be displayed in an area as small as possible, the corresponding entry information in the polysemous words can be accurately displayed for the user, and the information access efficiency of the browser is improved.
Drawings
FIG. 1 is a flowchart of an embodiment of a method for presenting entry information according to the present application;
fig. 2 is a flowchart of an embodiment 1 of a method for obtaining entry information according to the present application;
fig. 3 is a flowchart of embodiment 2 of a method for obtaining entry information according to the present application;
FIG. 4 is a block diagram of an embodiment of an apparatus for presenting entry information according to the present application;
fig. 5 is a block diagram of an embodiment of an apparatus for acquiring entry information according to the present application.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present application more comprehensible, the present application is described in further detail with reference to the accompanying drawings and the detailed description.
In the method, the device and the system for browsing the webpage, in the process that a user browses the webpage by using the browser, the current browsing page is combined with the requirements of the current user, and entry information of encyclopedic entries corresponding to the current browsing page is automatically displayed at a browser client; for a user, when the user encounters strange or unknown words and other key words of a currently browsed webpage, the entry information of the strange or unknown words can be directly obtained without opening a search box and inputting key words for retrieval; therefore, the embodiment of the application can accurately show the corresponding entry information in the polysemous words to the user, so that the information content provided for the user is enriched, and the information access efficiency of the browser is improved.
Referring to fig. 1, a flowchart of an embodiment of a method for presenting entry information in the present application is shown, which may specifically include:
102, a browser client receives entry information of encyclopedia entries corresponding to the current browsing page; when the entry information corresponding to the encyclopedic entry is more than one item, the browser server analyzes and selects the entry information and returns the selected entry information;
and 103, displaying the entry information by the browser client.
For example, when a user opens a browser to read a piece of news, the browser client of the embodiment of the present application automatically presents, below the currently browsed page, entry information of some key words in the news title or the text, where the key words may specifically include the names of main people, place names, events, and the like involved in the news, and where the entry information may specifically include encyclopedias of the key words, and the like.
In the embodiment of the application, encyclopedia entries are used for representing some key words appearing in a current browsing page, and the key words are probably unknown or unknown to a user, namely, the user has a high possibility of having a need of clarifying the meaning of the detailed information of the key words, so the encyclopedia entries are used as the encyclopedia entries and corresponding entry information is automatically displayed.
Term information in the embodiments of the present application generally refers to all information capable of enhancing user understanding of encyclopedia terms, which may include explanatory information, and a typical example is encyclopedia definitions, wherein the encyclopedia definitions may be mainly derived from professional encyclopedia websites or website channels edited manually and having a certain authority universality, such as wiki, encyclopedia, fox entertainment star library, and the like. According to the method and the device, paraphrase information of the vocabulary corresponding to the encyclopedic entry in the current browsing page is automatically displayed, so that the user can find out the meaning of the vocabulary without searching.
In the specific implementation, when a user opens a browsing page, the browser client will take the browsing page as a current browsing page, send page information of the current browsing page to the browser server, receive entry information of encyclopedia entries corresponding to the current browsing page returned by the browser server, and display the entry information.
In this embodiment of the application, the page information of the current browsing page sent to the browser server mainly includes: URL (Uniform Resource Locator) and/or content of the currently browsed page (for example, title and text of news), etc., and the present application does not limit the specific page information of the currently browsed page sent to the browser server. The browser client can send the URL of the current browsing page to the browser server, the browser server triggers the URL to obtain the content of the current browsing page, and the content of the current browsing page can also be directly sent to be analyzed and counted by the browser server.
In a specific implementation, the entry information of the encyclopedia entry corresponding to the currently browsed page, which is returned by the browser server, is usually in an XML (Extensible Markup Language) format. In view of the differences between XML and HTML: XML is used to store data, the focus of which is the content of the data. Whereas HTML is designed to display data, and the focus of HTML is the appearance of the data, in an application example of the present application, the process represented by step 102 may specifically include: the browser client converts the entry information in the XML format into an HTML (Hypertext Markup Language) format, embeds the entry information in the HTML format into a plug-in, and loads and displays the entry information in the HTML format by using front-end scripting technologies such as JavaScript, ajax (Asynchronous JavaScript and XML), jquery and the like.
Regarding the presentation position of the entry information, it may be any position of the browser, for example, above, below, left, right, and the like. In order not to affect and interfere with the content of the currently browsed page browsed by the user, in a preferred embodiment of the present application, the presentation position may be below or to the right of the browser.
Regarding the presentation form of the entry information, it may be in any form, such as a pop window, a bubble, and so on. In a preferred embodiment of the present application, the implementation process of step 102 may be to display entry information of an encyclopedic entry corresponding to the currently browsed page in a pop-up window manner. The popup window can be positioned at any position of the browser; the pop-up window may provide a close button that may be closed upon user manipulation of the close button, or the pop-up window may be configured to have a life cycle (e.g., 50 seconds) and automatically exit after the life cycle is completed.
Of course, in addition to the presentation forms of the pop-up window and the bubble, the presentation forms of the present application may further include a floating toolbar, a menu bar, a toolbar, a status bar, and the like, and all UI (user interface) capable of presenting entry information entry for a user belong to the protection scope of the present application.
The entry information of encyclopedic entries is different in length, and the length is different, so that the content of the current browsing page browsed by the user is not influenced and interfered, and the browser client can adopt a certain strategy.
In a preferred embodiment of the present application, when the browser client displays the entry information, the browser client may predict the length of the entry information, and display the entry information by combining a prediction result.
An application example of estimating the length of the entry information is provided. The browser client can set an area upper limit threshold value aiming at the area of the display area, so that when the actual area occupied by the length of the entry information is less than or equal to the area upper limit threshold value, the display can be directly performed according to the actual area, and the displayed content is all the content of the entry information; when the actual area occupied by the length of the entry information is greater than the upper area threshold, only the entry information that can be borne by the upper area threshold may be displayed, for example, the entry information of the upper area threshold is intercepted from front to back, other entry information is discarded, and at this time, part of the content of the entry information is displayed. The upper threshold of the area may be set according to the area of the display region of the display form of the pop-up window and the bubble.
In some cases, there may be only one encyclopedia entry corresponding to the currently browsed page, and entry information of the encyclopedia entry may be directly displayed at this time.
In other cases, there may be more than one encyclopedia entry corresponding to the currently browsed page, and in order not to affect and interfere with the content of the currently browsed page browsed by the user, in a preferred embodiment of the present application, the encyclopedia entry is more than one; the method may further comprise: and the browser client displays the identification information of more than one encyclopedia entry corresponding to the current browsing page.
The identification information is mainly used for distinguishing different encyclopedia entries, namely, the user can know the information of which encyclopedia entry is wanted by seeing the displayed identification information. Assuming that the encyclopedia entry is a name of a person, the corresponding identification information may include the name of the person and a corresponding avatar, and assuming that the encyclopedia entry is a place name, the corresponding identification information may include the place name and a corresponding identification thumbnail, and so on. The present application does not impose any limitation on the identification information of a specific encyclopedic entry.
In a preferred embodiment of the present application, the method may further include:
and the browser client displays the entry information of the selected encyclopedic entry according to the received selection information of the user aiming at the displayed identification information of the encyclopedic entries.
Assuming that the user selects or clicks the selection information of the identification information of a certain encyclopedia entry through a mouse, a keyboard or a touch gesture, the user is considered to want to view the selected entry information of the encyclopedia entry, and then the selected entry information is displayed.
In other embodiments, the process of sending the page information of the current browsing page is not limited to the browser client, and the browser server may also execute the operation; similarly, the entities involved in steps 102 to 103 are not limited to the browser client, and the browser server may also perform corresponding operations to complete the logical process described in the present application, so that the corresponding entry information is displayed to the user.
Referring to fig. 2, a flowchart of embodiment 1 of a method for acquiring entry information according to the present application is shown, which may specifically include:
and step 204, the browser server side returns the selected entry information to the browser client side.
In the embodiment of the application, encyclopedia entries are used for representing some key words appearing in a current browsing page, and the key words are probably unknown or unknown to a user, namely, the user probably has a need for detailed information of the key words, so the encyclopedia entries are used as the encyclopedia entries and automatically display corresponding entry information.
In a preferred embodiment of the present application, the page information of the currently browsed page may specifically include content of the currently browsed page;
the step of analyzing the page information of the currently browsed page by the browser server to obtain the corresponding encyclopedia entry may further include:
in the substep A1, the browser server side performs word segmentation processing on the content of the current browsing page to obtain a corresponding entry;
a substep A2, the browser server side analyzes and matches the vocabulary entry of the content of the current browsing page with a word bank stored by the server side;
and a substep A3, taking the successfully matched corresponding entry as an encyclopedia entry by the browser server side.
In the embodiment of the present application, the thesaurus stored at the server side can be used for storing a series of entries. In practical applications, the entries may be organized according to professional encyclopedia websites or websites with certain authority universality through manual editing, such as wikipedia, encyclopedia, fox-search entertainment star library, electronic encyclopedia, and the like. And the word stock stored by the server side can be synchronously updated along with the change of the website channel.
In the preferred embodiment, if the vocabulary entry of the currently browsed page content hits the vocabulary base stored in the server, it can be inferred that the user has a need for the detailed information of the vocabulary entry, and then the vocabulary entry is used as an encyclopedic vocabulary entry and corresponding vocabulary entry information is obtained according to the encyclopedic vocabulary entry.
In a specific implementation, there may be a plurality of schemes for obtaining corresponding entry information according to the encyclopedic entry retrieval. For example, the encyclopedia entry can be directly input into an existing retrieval website such as Wikipedia, encyclopedia, Saifu entertainment star library, electronic big encyclopedia and the like for query; for another example, a new encyclopedic database may be obtained by capturing and sorting existing data of a search website such as wikipedia, encyclopedic, fox-search entertainment star library, electronic encyclopedic, and the like, and the new encyclopedic database stores encyclopedic entries and corresponding entry information, so that the encyclopedic entries are input into the new encyclopedic database for query, and corresponding entry information may also be obtained. In short, the present application does not limit the scheme for obtaining the corresponding entry information according to the encyclopedic entry search.
In another preferred embodiment of the present application, the step of analyzing the page information of the currently browsed page by the browser server to obtain the corresponding encyclopedia entry may further include:
counting the frequency of each entry in the content of the current browsing page;
the step of the browser server side taking the successfully matched corresponding entry as an encyclopedia entry may specifically be to take the successfully matched corresponding entry with the statistical frequency exceeding a predetermined frequency threshold as an encyclopedia entry.
In order to avoid interference of entry information of the encyclopedia entry to the user, the preferred embodiment increases a threshold for determining the encyclopedia entry, that is, when the entry of the content of the currently browsed page hits the word bank stored at the server side, and the frequency of the entry appearing in the content of the currently browsed page exceeds a predetermined frequency threshold, it is inferred that the user has a need for detailed information of the entry. The predetermined frequency threshold can be set by those skilled in the art according to actual needs, and the specific predetermined frequency threshold is not limited in the present application.
The entry information of encyclopedic entries is different in length, and the length is different, so that the content of the current browsing page browsed by the user is not influenced and interfered, and a certain strategy can be adopted by the browser server side.
For example, in an application example of the present application, the browser server may adjust entry information of an encyclopedic entry according to specific requirements, for example, when the entry information of the encyclopedic entry is short, no adjustment may be made, and when the entry information of the encyclopedic entry is long, main content is extracted from the entry information until the length of the entry information does not exceed a certain length threshold, and the like. The length threshold can be set by those skilled in the art according to actual needs, and the specific length threshold is not limited in the present application.
It should be noted that, in a specific implementation, the browser server may first encapsulate the entry information into an XML format. And then returned to the browser client.
The encyclopedia entry may correspond to more than one entry information in some cases due to ambiguous words, etc. For example, the encyclopedia term "Sunyue" may relate to "singer Sunyue" and may also relate to "Player Sunyue"; as another example, the encyclopedia term "apple" may relate to plants, companies, domestic movies, and foreign movies, among others.
Under the condition that the encyclopedic entries correspond to more than one entry information, if the browser server side directly returns more than one entry information to the browser client side, the browser client side can display all or part of the content of more than one entry information according to own strategies. Undoubtedly, the waste of the display area is caused by displaying all contents of more than one entry information, and the contents of the current browsing page browsed by the user are easily influenced and interfered; however, there is a risk that the partial content of the term information of more than one item cannot meet the user's needs, for example, when the user wants the term information of "singer is happy" while viewing the web page related to the literature report, the term information of "player is happy" is presented, and at this time, the present application becomes worthless to automatically present the term content.
Therefore, in order to automatically display the entry information desired by the user on the premise of avoiding waste of a display area and avoiding influence and interference on the content of the currently browsed page browsed by the user, in this embodiment, when the entry information corresponding to the encyclopedic entry is more than one item, one entry information is selected from the more than one entry information of the encyclopedic entry and displayed to the user according to the category of the currently browsed page and/or the category of the user information of the current user; therefore, the entry information which is closest to the user requirement is displayed in the smallest area as possible, and the information access efficiency of the browser can be further enhanced while the user experience is improved.
In a preferred embodiment of the present application, the step of analyzing the entry information by the browser server and selecting the entry information corresponding to the entry information may further include: and the browser server side selects one item of entry information from the entry information according to the category of the current browsing page and/or the category of the user information of the current user.
In a preferred embodiment of the present application, the method may further include:
and the browser server side analyzes the page information of the current browsing page to obtain the category of the corresponding current browsing page.
The application can provide the following technical scheme for acquiring the page category:
page type acquisition scheme 1,
The page category obtaining scheme 1 may specifically include: and the browser server side analyzes the uniform resource locator information of the current browsing page to obtain the category of the corresponding current browsing page.
In practice, each web site is typically provided with more than one channel category, such as news, sports, entertainment, finance, video, women, science, mobile, digital, automotive, travel, real estate, forums, blogs, games, micro-blogs, apparel, applications, and so on. The URLs of the channel categories generally have a certain regularity, that is, the URLs of the same frequency category have the same characteristics, while the URLs of different frequency categories have differences.
Therefore, the page category obtaining scheme 1 may analyze the uniform resource locator information of the current browsing page by using the URL rule of each large website channel category, and locate the category of the corresponding current browsing page to the fine granularity of the channel category.
Some URL rules for website channel categories are provided, for example, some URL rules for website entertainment channel categories include "yule", for examplehttp://yule.sohu.com/There are also web-site entertainment channel categories that include "ent (english abbreviation for entertainment)", for example,http:// ent.sina.com.cn/、http://ent.163.com/and so on. Thus, if "yule" or "ent" is included in the URL of the currently browsed page, it can be considered as being "yule" or "entThe page category of the previously viewed page is the "entertainment" category.
Of course, the above URL rules for the category of the website entertainment channels are only examples, and are not intended to be limiting.
A page type acquisition scheme 2,
The page category obtaining scheme 2 may specifically include: and the browser server side analyzes the breadcrumbs of the current browsing page and navigates to obtain the corresponding category of the current browsing page.
Breadcrumbs are a "history" application that aims to help users trace back ways, and thus are a linear navigation approach. Interface elements mainly used for expressing the content affiliation, namely, a mode of main classification > primary classification > secondary classification > tertiary classification > … … > final content page.
In an application example of the application, after a web crawler captures HTML content of a currently browsed page, the HTML content can be analyzed, and an off-site navigation bar can be positioned according to a template of each website or content containing more than one symbol '>', so that characters corresponding to '>' positions are obtained; because the in-station navigation bar generally describes the directory hierarchy of the current browsing page in the station, the page category of the current browsing page can be positioned through the keywords corresponding to the corresponding category. For example, when the in-station navigation bar is "internet easy > sports channel > chinese basketball > text", the currently browsed page may be determined to be in the "sports" category.
A page type acquisition scheme 3,
The page category obtaining scheme 3 may specifically include: and the browser server side analyzes the weight of the encyclopedia entry in the current browsing page in each established category respectively to obtain the total weight of each established category of the current browsing page, and takes the established category with the maximum total weight as the page category of the current browsing page.
In a specific implementation, a series of predetermined categories (the predetermined categories in the present application are mainly used to indicate the page categories to which the vocabulary entry belongs, and may actually be according to the channel categories of each large website) may be preset, and the weight of the encyclopedic vocabulary entry in each predetermined category in the currently browsed page may be obtained. A weight acquisition scheme is given here, although it is not intended as an application limitation of the present application.
The weight obtaining scheme adopts a machine learning method, a basic training set is preset, each entry sample and the weight of each set category are labeled manually, and a corresponding weight classifier is obtained according to the training set. In this way, the encyclopedia entries in the current browsing page can be respectively input into the weight classifier, and the weight of each encyclopedia entry in each established category can be output.
In practical applications, the weight typically ranges from 0 to 1, with the weight approaching 1, indicating that the encyclopedia entry is more likely to be in the given category, and vice versa indicating that the encyclopedia entry is less likely to be in the given category. For example, "NBA" is weighted very heavily in the sports category, and very heavily in the "military" category, etc. And summing the weights of all the established categories of all the encyclopedic entries respectively to obtain the total weight of all the established categories of the current browsing page, and selecting the established category with the maximum total weight as the page category of the current browsing page.
The above three page category obtaining schemes are described in detail, and it can be understood that a person skilled in the art may combine and use the above several schemes according to needs, or use any one of the schemes, or use other schemes to obtain the page category of the currently browsed page, which is not limited by the present invention.
In a preferred embodiment of the present application, the method may further include:
and the browser server side analyzes the use history information of the current user to obtain the corresponding user information category.
In this embodiment of the present application, preferably, the usage history information may specifically include: the browser of the current user accesses the history and/or inputs the history. The browser access history record can be obtained by browser log statistics, and the input history record can be obtained by input method client statistics.
The application can provide the following technical scheme for acquiring the user information category:
user information category acquisition scheme 1,
The user information category acquisition scheme 1 may specifically include: and the browser server side analyzes and acquires the page category of the corresponding page in the browser access history record of the current user, and takes the page category with the highest frequency as the user information category of the current user.
The user personalized demand is usually originated from the interest and love of the user, for example, a certain user has the interest of pursuing stars and is a movie and television fan, so that the user surfs the internet every day mainly for browsing the dynamic states of stars and movies at home and abroad; for another example, a user is a football and basketball fan, and the purpose of surfing the internet every day is mainly to browse the football and basketball trends at home and abroad; for another example, a user is a digital fan who surfs the web daily to browse pages that are mostly digital. Therefore, the user information category acquisition scheme 1 is used for counting the user information category of the current user according to the browser access history of the current user, and the user information category in the situation is equivalent to the interest and hobbies of the user.
In practice, the browser access history record usually records information such as user identification, page URL, access time, and the like, and in a specific implementation, the page category of the page in the browser access history record of the current user may be obtained by using one or more of the above three page category obtaining schemes, and then, the page category with the largest occurrence frequency (i.e., the highest frequency) is counted out and used as the user information category of the current user. Therefore, the page category with the highest frequency is selected, which indicates that the current user pays more attention to the page category, that is, the user information category under the condition is equivalent to the interests and hobbies of the user.
In a specific implementation, the browser access history of the current user to be counted may be limited according to a time point. For example, the browser access history of the current user to be counted may be: all history records from the opening of the browser interface to the current browsing page; as another example, the browser access history of the current user to be counted may be all the histories from N days (which may be set by those skilled in the art according to actual needs), and so on.
In a specific implementation, the browser access history of the current user to be counted can be limited according to the number. For example, the number of browser access histories of the current user to be counted is 10 or 100, and so on. Of course, the above time points and numbers may be used in combination, and the application is not limited thereto.
User information category acquisition scheme 2,
The user information category obtaining scheme 2 may specifically include: and analyzing and counting the input history of the current user by the browser server, namely analyzing and counting the input history in a browser interface, wherein the input history comprises the input history of a browser address bar, the input history of controls such as a search box, an input box and the like in a browser page and the like, so as to obtain the vocabulary category corresponding to the input history, and taking the vocabulary category with the highest frequency as the user information category of the current user.
At present, with the development of internet and information technology, the current era is just becoming an information era, and most office workers need to face a computer for a long time, and usually input vocational related words on the computer, for example, administrative assistants input office management related words, accounting related words, architectural designers input real estate and structure related words, programmers input computers and code related words, and the like.
The vocational related vocabulary can be counted under a specific formal application environment, for example, word, excel, autocad, powerpoint, protel, technical forum and the like.
Therefore, the input history of the user can reflect the professional information of the user to a certain extent, and further, the requirement of the user for information related to the professional information can be extended, for example, a programmer focuses on information related to a computer, and the like.
Besides vocabularies related to profession, the input history of the user can reflect the interest and hobby information of the user to a certain extent; the input history related to the hobbies and interests can be counted under some specific informal application environments, such as instant messaging programs like QQ and Feixin, and various entertainment forums like Skyline, water tree, cat puppet, and the like. If the user is interested in the constellation, the user can input corresponding words such as 'balance', 'golden ox' and the like under the informal application environments; if the user is interested in swimming, the user can input corresponding words such as 'breaststroke', 'treading on water' and the like under the informal application environments; if the user is interested in football, he or she will enter the corresponding vocabulary in these informal application environments, such as "ewing tours", "cherries", etc.
If the user has fixed professional habits and/or hobbies, words of a particular vocabulary category are often entered. Therefore, the user information category acquisition scheme 2 obtains the vocabulary categories of the input history record, and takes the vocabulary category with the highest frequency as the user information category of the current user. In a specific implementation, a series of vocabulary categories (categories to which vocabularies belong, which may be preset in practice according to professional vocabulary categories of an input method and/or channel categories of each large website) may be preset.
The above two user information category obtaining schemes are described in detail, it can be understood that a person skilled in the art may use the above several schemes in combination according to needs, or use any one of the schemes, or use other schemes to obtain the user information category of the current user, which is not limited by the present invention.
In summary, the category of the currently browsed page is mainly an identifier of the content browsed by the user, and is closely related to the content requirement of the user; the user information category of the current user can reflect the interests, hobbies and/or professional habits of the user to a certain extent, and is closely related to the personalized requirements of the user; therefore, in this embodiment, when the entry information corresponding to the encyclopedic entry is more than one item, one item of entry information is selected from the more than one item of entry information of the encyclopedic entry according to the category of the currently browsed page and/or the category of the user information of the current user; the selection can select one item of entry information which is most relevant to the current browsing page and/or the current user from more than one item of entry information of encyclopedic entries, so that the entry information which is most close to the user requirement can be displayed in an area as small as possible.
In a preferred embodiment of the present application, the method may further include:
the browser server side obtains entry information types of entry information of all the encyclopedia entries;
the step of selecting one item of entry information from the one or more items of entry information of the encyclopedic entries according to the category of the currently browsed page and/or the user information category of the current user may specifically be selecting one item of entry information whose entry information category is most matched with the category of the currently browsed page and/or the user information category of the current user from the one or more items of entry information of the encyclopedic entries when the entry information corresponding to the encyclopedic entries is one or more items.
Since the entry information of the encyclopedic entry is composed of characters, in practice, the entry information category of the entry information of the encyclopedic entry can be obtained by adopting the principle of the page category obtaining scheme 3, and the obtaining process specifically includes:
step B1, the browser server side carries out word segmentation processing on certain entry information of the encyclopedic entry to obtain a series of words;
and step B2, analyzing the weight of each word in the corresponding entry information category, adding the weights to obtain the total weight corresponding to each entry information category, and taking the entry information category corresponding to the maximum total weight as the entry information category of the entry information of the encyclopedic entry.
In a specific implementation, a series of entry information categories (categories to which entry information belongs, which can be preset according to channel categories of each large website in practice) can be preset.
In practice, the entry information category of each entry information item may be matched with the category of the currently browsed page and/or the user information category of the current user, and if the entry information category matching rate corresponding to a certain entry information item is the maximum, the entry information category may be considered to be the most matched with the category of the currently browsed page and/or the user information category of the current user, and the entry information under the entry information category is selected and returned to the browser client.
Referring to fig. 3, a flowchart of embodiment 2 of a method for acquiring entry information according to the present application is shown, which may specifically include:
301, analyzing page information of a currently browsed page by a browser server side to obtain corresponding encyclopedia entries;
304, when the entry information corresponding to the encyclopedic entry is more than one item, the browser server side selects one item of entry information from the more than one item of entry information of the encyclopedic entry according to the category of the current browsing page and/or the category of the user information of the current user;
and step 305, the browser server side returns the selected entry information to the browser client side.
In the above method embodiments for obtaining entry information, descriptions of the embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
Also, as is readily contemplated by those skilled in the art: any combination of the above-mentioned method for obtaining entry information, examples 1-2, is feasible, so that any combination of the above-mentioned method for obtaining entry information, examples 1-2, is an embodiment of the present invention, but due to space limitations, the present specification is not described in detail herein.
In order to make the application better, the following provides an embodiment of an application method for acquiring and presenting entry information.
Application method example 1,
The application scenario of the application method embodiment 1 is that a user browses a web page related to "sun happy": http:// sports.163.com/12/0317/16/7SQFA0VM00052UUC.html, and the title of the current browsing page is "Sunyue reiterates a duplicate called forest book luxury in the state of finding by competition generation"; the application method embodiment 1 may specifically include:
step R1, the browser client side sends the page information of the current browsing page to the browser server side;
step R2, the browser server side analyzes the content of the current browsing page, namely, the word segmentation and word frequency statistics are carried out, the word segmentation and word frequency statistics are matched with the word stock stored by the server side, and the word stock stored by the server side is hit after the Sun Yue appears for a plurality of times in the current browsing page, so that the Sun Yue is used as an encyclopedic entry;
r3, the browser server side retrieves corresponding entry information according to the encyclopedia entry; the entry information is encyclopedia paraphrase http,// baike, baidu, com/view/6886.htm, and can be seen that the encyclopedia entry is an ambiguous word and corresponds to various encyclopedia paraphrases such as Chinese popular female singers, Chinese basketball professional athletes, Chinese painting system instructors in the Luxun Art academy and the like;
step R4, according to the breadcrumbs navigation information (net easy > sports channel > Chinese basketball > text) of the current browsing page, identifying the current browsing page as a 'sports' category;
r5, for the corresponding multiple encyclopedia definitions, obtaining the total weight corresponding to each entry information category according to the weight of all words after word segmentation in the corresponding entry information category, and determining the entry information category corresponding to the maximum total weight as the corresponding entry information category;
such as a Chinese fashion female singer. The encyclopedia definition of the China popular music altar … …' in the middle of the nineties is determined as the category of entertainment by calculating that the weight value of the encyclopedia definition in the category of entertainment is highest because the weight values of the terms such as singers, music and the like in the category of term information of entertainment are very high; such as "Chinese professional basketball players. The national team: china. Number of Chinese basket: 9. - ", because the terms such as" basketball "," player ", etc. have high weight values in the term information category" sports ", the encyclopedia definition has the highest weight value in the" sports "category, and is determined as the" sports "category; such as the Chinese painting of the Luxun Art college, namely the lecturer. In the encyclopedia paraphrase of Chinese character painting and Hua-island painting, because the terms of art, character painting and the like have high weight values in the category of art, the highest weight value of the paraphrase in the category of term information, namely art, is calculated, and the paraphrase is determined to be the category of art.
Step R6, the browser server side selects an encyclopedia paraphrase from a plurality of encyclopedia paraphrases of the encyclopedia entry according to the category of the current browsing page;
determining that the entry information category 'sports' corresponding to the current browsing page category 'sports' and 'Chinese professional basketball player' is most matched, and selecting the 'Chinese professional basketball player' item as an encyclopedia paraphrase to be sent.
And step R7, the browser server side packages the selected encyclopedia paraphrases into an XML format and returns the XML format to the browser client side.
An example of an encyclopedia definition in packed XML format is given here:
and step R8, the browser client receives the encyclopedia paraphrases packaged into the XML format, converts the encyclopedia paraphrases in the XML format into the HTML format, embeds the encyclopedia paraphrases in the HTML format into the plug-in, and loads and displays the encyclopedia paraphrases in the HTML format by the plug-in by utilizing the front-end script technology.
In summary, the application method embodiment 1 automatically presents the encyclopedia that is most relevant to the category of the currently browsed page of the user, so that the entry information that is closest to the user's needs can be presented in as small an area as possible.
Application method example 2,
The application scenario of the application method embodiment 2 is that a user is assumed to be a food destiny, has browsed a series of webpages in food aspects, and then has clicked a page for introducing a bacon chicken roll; the application method embodiment 2 may specifically include:
step S1, the browser client sends the page information of the current browsing page to the browser server;
step S2, the browser server side analyzes the content of the current browsing page, namely, the content is divided into words and word frequency statistics, the words and the word frequency statistics are matched with a word bank stored in the server side, and the word bank stored in the server side is hit after the fact that the 'bacon' appears in the current browsing page for multiple times, so that the 'bacon' is used as an encyclopedic entry;
s3, the browser server side retrieves corresponding entry information according to the encyclopedia entry, wherein the entry information is encyclopedia paraphrase http:// baike.baidu.com/view/1102.htm, and as can be seen, the entry information is a polysemous word and corresponds to multiple encyclopedia paraphrases such as British philosophers, Ireland painters, bacon meat products and the like;
step S4, analyzing the access history of the current user in the browser, namely the previous 100 access webpages, and determining that the user information category of the current user belongs to the category of 'food' by acquiring the page category of each webpage by using one or more of the three page category acquisition schemes, wherein the page category with the most occurrence times, namely the page category with the highest frequency, is the user information category of the current user;
r5, for the corresponding multiple encyclopedia definitions, obtaining the total weight corresponding to each entry information category according to the weight of all words after word segmentation in the corresponding entry information category, and determining the entry information category corresponding to the maximum total weight as the corresponding entry information category;
such as "british philosophers. The encyclopedia definition of the most important writer … …' in the period of Renhai of Francis bacon and English literature revival is calculated to have the highest weight value in the category of history because the weight values of the terms such as philosophy, the English literature revival and the like in the category of term information history are high, and the encyclopedia definition is determined to be the category of history; such as "Ireland painters. An encyclopedia explanation of ' Dublin, Ireland, because the weights of terms such as ' painter ' and ' art museum ' in the term information category ' art ' are very high, the encyclopedia explanation is calculated to have the highest weight in the ' art ' category, and is determined to be the ' art ' category; such as "bacon meat products. The idea is that the encyclopedia definition of smoked rib meat is originally defined as 'meat product', 'rib meat' and the like, and the weight value of the entry in the entry information category 'food' is very high, the encyclopedia definition is calculated to have the highest weight in the 'food' category, and the encyclopedia definition is determined as the 'food' category.
Step R6, the browser server side selects an encyclopedia paraphrase from a plurality of encyclopedia paraphrases of the encyclopedia entry according to the user information category of the current user;
determining that the item information category 'food' corresponding to the 'bacon meat product' is most matched with the item information category 'food' corresponding to the user information category 'food' of the current user, and selecting the 'bacon meat product' item as an encyclopedia explanation to be sent.
And step R7, the browser server side packages the selected encyclopedia paraphrases into an XML format and returns the XML format to the browser client side.
An example of an encyclopedia definition in packed XML format is given here:
and step R8, the browser client receives the encyclopedia paraphrases packaged into the XML format, converts the encyclopedia paraphrases in the XML format into the HTML format, embeds the encyclopedia paraphrases in the HTML format into the plug-in, and loads and displays the encyclopedia paraphrases in the HTML format by the plug-in by utilizing the front-end script technology.
In summary, the application method embodiment 2 automatically displays the encyclopedia most relevant to the user information category, so that the entry information closest to the user requirement can be displayed in the smallest possible area, and the corresponding entry information in the polysemous words can be accurately displayed to the user.
In other embodiments, the page information of the current browsing page is analyzed to obtain the corresponding encyclopedia entry, which is not limited to the browser server side, and the browser client side can also execute the operation; similarly, the entities involved in steps 302-305 are not limited to the browser server, and the browser client may also perform corresponding operations to complete the logical process described herein, so that the corresponding entry information is displayed to the user.
Referring to fig. 4, a block diagram of an embodiment of an apparatus for presenting entry information according to the present application is shown, which may specifically include:
a sending module 401, configured to send page information of a currently browsed page to a browser server;
a receiving module 402, configured to receive entry information of an encyclopedia entry corresponding to the currently browsed page; when the entry information corresponding to the encyclopedic entry is more than one item, the browser server analyzes and selects the entry information and returns the selected entry information; and
a displaying module 403, configured to display the entry information.
In a preferred embodiment of the present application, the displaying module 403 may be specifically configured to display the entry information in a pop-up window manner.
In another preferred embodiment of the present application, the displaying module 403 may be specifically configured to predict the length of the entry information when the entry information is displayed, and display the entry information by combining a prediction result.
For the embodiment of the apparatus for presenting entry information, since it is basically similar to the embodiment of the method for presenting entry information, the description is simple, and for relevant points, refer to the partial description of the embodiment of the method for presenting entry information.
Referring to fig. 5, a block diagram of an embodiment of an apparatus for acquiring entry information according to the present application is shown, which may specifically include:
an encyclopedia entry obtaining module 501, configured to analyze page information of a currently browsed page to obtain a corresponding encyclopedia entry;
an encyclopedic entry retrieval module 502, configured to obtain corresponding entry information according to the encyclopedic entry retrieval;
the information selection module 503 is configured to analyze each item of entry information and select one item of entry information when the entry information corresponding to the encyclopedic entry is more than one item; and
a returning module 504, configured to return the selected entry information to the browser client.
In a preferred embodiment of the present application, the information selecting module 503 may be specifically configured to select entry information from more than one entry information of the encyclopedia entries according to the category of the currently browsed page and/or the category of the user information of the current user.
In another preferred embodiment of the present application, the apparatus for acquiring entry information may further include:
the entry information category acquisition module is used for acquiring entry information categories of entry information of the encyclopedic entries;
the information selection module may be specifically configured to select, from the one or more items of entry information of the encyclopedic entry, one item of entry information whose entry information category is most matched with the category of the currently browsed page and/or the user information category of the current user.
In another preferred embodiment of the present application, the apparatus for acquiring entry information may further include:
and the page type acquisition module is used for analyzing the page information of the current browsing page to obtain the corresponding type of the current browsing page.
In a preferred embodiment of the present application, the apparatus for acquiring entry information may further include:
and the user information category acquisition module is used for analyzing the use history information of the current user to obtain the corresponding user information category.
In a preferred embodiment of the present application, the page category obtaining module may further include:
the first page type acquisition sub-module is used for analyzing the uniform resource locator information of the current browsing page to obtain the corresponding type of the current browsing page; and/or
The second page category acquisition sub-module is used for analyzing the breadcrumbs of the current browsed page and navigating to obtain the category of the corresponding current browsed page; and/or
And the third page category acquisition sub-module is used for analyzing the weight of the encyclopedic entry in the current browsing page in each established category to obtain the total weight of each established category of the current browsing page, and taking the established category with the maximum total weight as the page category of the current browsing page.
In this embodiment of the present application, preferably, the usage history information may specifically include: the browser accesses the history and/or inputs the history.
In a preferred embodiment of the present application, the user information category obtaining module may further include:
the first user information category acquisition submodule is used for analyzing and acquiring the page category of the page in the browser access history record of the current user and taking the page category with the highest frequency as the user information category of the current user; and/or
And the second user information category acquisition submodule is used for analyzing and counting the input history record of the current user to obtain the vocabulary category of the input history record, and taking the vocabulary category with the highest frequency as the user information category of the current user.
In a preferred embodiment of the present application, the page information of the currently browsed page may include content of the currently browsed page;
the encyclopedia entry obtaining module may further include:
the word segmentation sub-module is used for carrying out word segmentation on the content of the current browsing page to obtain a corresponding entry;
the matching submodule is used for analyzing and matching the vocabulary entries of the content of the current browsing page with a word bank stored at a server side;
and the determining submodule is used for taking the successfully matched corresponding entry as an encyclopedia entry.
In a preferred embodiment of the present application, the encyclopedia entry extraction module may further include:
the statistic submodule is used for counting the frequency of each entry in the content of the current browsing page;
the determining submodule may be specifically configured to use, as an encyclopedia entry, a corresponding entry that is successfully matched and has a statistical frequency exceeding a predetermined frequency threshold.
For the embodiment of the apparatus for acquiring the term information, since it is basically similar to the embodiment of the method for acquiring the term information, the description is simple, and the relevant points can be referred to the partial description of the embodiment of the method for acquiring the term information.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While the preferred embodiments of the present application have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all alterations and modifications as fall within the scope of the application.
The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
The method and the device for presenting and acquiring entry information provided by the application are introduced in detail, and a specific example is applied in the text to explain the principle and the implementation of the application, and the description of the above embodiment is only used for helping to understand the method and the core idea of the application; meanwhile, for a person skilled in the art, according to the idea of the present application, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present application.
Claims (12)
1. A method for obtaining entry information is applied to a browser server side, and comprises the following steps:
analyzing page information of a current browsing page sent by a browser client to obtain corresponding encyclopedia entries;
retrieving according to the encyclopedic entry to obtain corresponding entry information;
when the entry information corresponding to the encyclopedic entry is more than one item, analyzing the entry information and selecting the entry information corresponding to one item;
returning the selected entry information;
the analyzing each item of entry information and selecting corresponding one item of entry information further comprises: selecting one or more items of entry information of the encyclopedic entries, wherein the entry information category is the entry information which is most matched with the category of the current browsing page and/or the user information category of the current user;
the page information of the current browsing page comprises: content of a currently browsed page; the analyzing the page information of the current browsing page to obtain the corresponding encyclopedia entry comprises:
performing word segmentation processing on the content of the current browsing page to obtain corresponding entries;
counting the frequency of the entries appearing in the current browsing page
Analyzing and matching the entries with a word bank stored at a server side;
and taking the corresponding entry which is successfully matched and the statistical frequency exceeds a set frequency threshold value as an encyclopedic entry.
2. The method of claim 1, further comprising:
and analyzing the page information of the current browsing page to obtain the category of the corresponding current browsing page.
3. The method of claim 1, further comprising:
and analyzing the use history information of the current user to obtain the corresponding user information category.
4. The method of claim 2, wherein the step of analyzing the page information of the currently viewed page to obtain the category of the corresponding currently viewed page comprises:
analyzing the uniform resource locator information of the current browsing page to obtain the category of the corresponding current browsing page; and/or
Analyzing the breadcrumbs of the current browsed page, and navigating to obtain the corresponding category of the current browsed page; and/or
And analyzing the weight of the encyclopedic entry in the current browsing page in each set type respectively to obtain the total weight of each set type of the current browsing page, and taking the set type with the maximum total weight as the type of the current browsing page.
5. The method of claim 3, wherein the usage history information comprises: the browser of the current user accesses the history and/or inputs the history.
6. The method of claim 5, wherein the step of analyzing the usage history information of the current user to obtain a corresponding user information category comprises:
analyzing and acquiring the page category of the corresponding page in the browser access history record of the current user, and taking the page category with the highest frequency as the user information category of the current user; and/or
Analyzing and counting the input history of the current user to obtain the vocabulary category corresponding to the input history, and taking the vocabulary category with the highest frequency as the user information category of the current user.
7. A method for showing entry information is applied to a browser client, and comprises the following steps:
sending page information of a current browsing page to a browser server;
receiving entry information of encyclopedia entries corresponding to the current browsing page; when the entry information corresponding to the encyclopedic entry is more than one item, the entry information is returned after being analyzed and selected; the analysis selection comprises: selecting one or more items of entry information of the encyclopedic entries, wherein the entry information category is the entry information which is most matched with the category of the current browsing page and/or the user information category of the current user;
displaying the entry information;
the page information of the current browsing page comprises: content of a currently browsed page; the encyclopedia entry determining process comprises the following steps: performing word segmentation processing on the content of the current browsing page to obtain corresponding entries; counting the frequency of the entries appearing in the current browsing page; analyzing and matching the entries with a word bank stored at a server side; and taking the corresponding entry which is successfully matched and the statistical frequency exceeds a set frequency threshold value as an encyclopedic entry.
8. The method of claim 7, wherein the displaying of the entry information is performed in a pop-up window.
9. The method of claim 7, wherein the length of the entry information is estimated when the entry information is presented, and the entry information is presented in combination with the estimation result.
10. An apparatus for obtaining entry information, which is applied to a browser server side, includes:
the encyclopedia entry acquisition module is used for analyzing the page information of the current browsing page sent by the browser client to obtain corresponding encyclopedia entries;
the encyclopedic entry retrieval module is used for retrieving corresponding entry information according to the encyclopedic entries;
the information selection module is used for analyzing all entry information and selecting corresponding entry information when the entry information corresponding to the encyclopedic entry is more than one entry; and
the return module is used for returning the selected entry information;
the information selection module is specifically used for selecting more than one item of entry information of the encyclopedic entries, wherein the category of the entry information is the entry information which is most matched with the category of the current browsing page and/or the category of the user information of the current user;
the page information of the current browsing page comprises: content of a currently browsed page; the analyzing the page information of the current browsing page to obtain the corresponding encyclopedia entry comprises:
performing word segmentation processing on the content of the current browsing page to obtain corresponding entries;
counting the frequency of the entries appearing in the current browsing page
Analyzing and matching the entries with a word bank stored at a server side;
and taking the corresponding entry which is successfully matched and the statistical frequency exceeds a set frequency threshold value as an encyclopedic entry.
11. An apparatus for displaying entry information, applied to a browser client, includes:
the sending module is used for sending the page information of the current browsing page to the browser server side;
the receiving module is used for receiving entry information of encyclopedia entries corresponding to the current browsing page; when the entry information corresponding to the encyclopedic entry is more than one item, the entry information is returned after being analyzed and selected; the analysis selection comprises: selecting one or more items of entry information of the encyclopedic entries, wherein the entry information category is the entry information which is most matched with the category of the current browsing page and/or the user information category of the current user; and
the display module is used for displaying the entry information;
the page information of the current browsing page comprises: content of a currently browsed page; the encyclopedia entry determining process comprises the following steps: performing word segmentation processing on the content of the current browsing page to obtain corresponding entries; counting the frequency of the entries appearing in the current browsing page; analyzing and matching the entries with a word bank stored at a server side; and taking the corresponding entry which is successfully matched and the statistical frequency exceeds a set frequency threshold value as an encyclopedic entry.
12. The apparatus of claim 11, wherein the presentation module is specifically configured to present the entry information in a pop-window format.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210183870.8A CN103455524B (en) | 2012-06-05 | 2012-06-05 | Method and device for displaying and acquiring entry information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210183870.8A CN103455524B (en) | 2012-06-05 | 2012-06-05 | Method and device for displaying and acquiring entry information |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103455524A CN103455524A (en) | 2013-12-18 |
CN103455524B true CN103455524B (en) | 2021-06-22 |
Family
ID=49737902
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210183870.8A Active CN103455524B (en) | 2012-06-05 | 2012-06-05 | Method and device for displaying and acquiring entry information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103455524B (en) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103823868B (en) * | 2014-02-26 | 2017-05-03 | 中国科学院计算技术研究所 | Event recognition method and event relation extraction method oriented to on-line encyclopedia |
CN104951450A (en) | 2014-03-26 | 2015-09-30 | 国际商业机器公司 | Information processing method and system |
CN104102739B (en) * | 2014-07-28 | 2018-03-06 | 百度在线网络技术(北京)有限公司 | A kind of method and device for expanding entity storehouse |
CN104537080B (en) * | 2014-12-31 | 2018-08-07 | 北京畅游天下网络技术有限公司 | Information recommends method and system |
CN105095441A (en) * | 2015-07-23 | 2015-11-25 | 百度在线网络技术(北京)有限公司 | Information acquisition method and device |
CN106997363A (en) * | 2016-01-26 | 2017-08-01 | 华为技术有限公司 | A kind of data processing method and equipment |
CN106202041B (en) * | 2016-07-01 | 2019-07-09 | 北京奇虎科技有限公司 | A kind of method and apparatus of entity alignment problem in solution knowledge mapping |
CN106454426B (en) * | 2016-10-27 | 2019-04-30 | 四川长虹电器股份有限公司 | A kind of method of identification intelligent TV analog channel |
CN106802921A (en) * | 2016-12-19 | 2017-06-06 | 福建天泉教育科技有限公司 | Entry exhibiting method and represent system |
CN108427508B (en) * | 2017-02-15 | 2024-01-19 | 北京搜狗科技发展有限公司 | Input method and device, and method and device for establishing local area network word stock |
CN109271615B (en) * | 2017-07-13 | 2023-10-31 | 北京搜狗科技发展有限公司 | Entry processing method, apparatus and machine readable medium |
CN107679043A (en) * | 2017-09-22 | 2018-02-09 | 广州阿里巴巴文学信息技术有限公司 | Data processing method, device and terminal device |
CN107885888A (en) * | 2017-12-11 | 2018-04-06 | 北京百度网讯科技有限公司 | Information processing method and device, terminal device and computer-readable recording medium |
CN109002292B (en) * | 2018-06-11 | 2022-02-11 | 广州环通信息技术有限公司 | Method and system for realizing pop-up frame based on webpage pop-up layer |
CN110209814B (en) * | 2019-05-23 | 2021-02-02 | 西安交通大学 | Method for extracting knowledge topic from encyclopedic knowledge website by utilizing domain modeling |
CN111666018B (en) * | 2020-06-08 | 2022-05-20 | 上海连尚网络科技有限公司 | Reading content processing method, electronic device and medium |
CN113127641A (en) * | 2021-04-23 | 2021-07-16 | 北京字节跳动网络技术有限公司 | Encyclopedic entry display method, encyclopedic entry display device, encyclopedic entry display equipment, encyclopedic entry display medium and program product |
CN113918809A (en) * | 2021-09-29 | 2022-01-11 | 北京搜狗科技发展有限公司 | Entry page display method and device, storage medium and electronic equipment |
CN113961848A (en) * | 2021-11-09 | 2022-01-21 | 北京锐安科技有限公司 | Webpage element labeling processing method and device, electronic equipment and storage medium |
CN115062589B (en) * | 2022-06-08 | 2024-10-18 | 北京字跳网络技术有限公司 | Encyclopedia entry processing method, device, electronic device, medium and program product |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101216842A (en) * | 2008-01-07 | 2008-07-09 | 华为技术有限公司 | Method for acquiring page keywords and page information processing device |
CN102314456A (en) * | 2010-06-30 | 2012-01-11 | 百度在线网络技术(北京)有限公司 | Web page move search method and system |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100905029B1 (en) * | 2007-03-22 | 2009-06-30 | 주식회사 시공미디어 | Method of providing knowledge information to mobile communication means and system |
CN101827320A (en) * | 2010-02-04 | 2010-09-08 | 重庆索伦互联网信息服务有限公司 | 3G network-based method for transmitting encyclopedic data to mobile terminal |
CN101976246A (en) * | 2010-09-30 | 2011-02-16 | 互动在线(北京)科技有限公司 | Classification retrieval method for encyclopedia entries |
CN102129454A (en) * | 2011-03-08 | 2011-07-20 | 国网信息通信有限公司 | Method and system for processing encyclopaedia data based on cloud storage |
-
2012
- 2012-06-05 CN CN201210183870.8A patent/CN103455524B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101216842A (en) * | 2008-01-07 | 2008-07-09 | 华为技术有限公司 | Method for acquiring page keywords and page information processing device |
CN102314456A (en) * | 2010-06-30 | 2012-01-11 | 百度在线网络技术(北京)有限公司 | Web page move search method and system |
Also Published As
Publication number | Publication date |
---|---|
CN103455524A (en) | 2013-12-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103455524B (en) | Method and device for displaying and acquiring entry information | |
US10140368B2 (en) | Method and apparatus for generating a recommendation page | |
US10402479B2 (en) | Method, server, browser, and system for recommending text information | |
US9135354B2 (en) | Method and system for topical browser history | |
KR101667344B1 (en) | Method and system for providing search results | |
US8863000B2 (en) | Method and system for action suggestion using browser history | |
CN103324665B (en) | Hot spot information extraction method and device based on micro-blog | |
CN104111941B (en) | The method and apparatus that information is shown | |
CN107784092A (en) | A kind of method, server and computer-readable medium for recommending hot word | |
CN104598556A (en) | Search method and search device | |
CN104077388A (en) | Summary information extraction method and device based on search engine and search engine | |
CN104077415B (en) | Searching method and device | |
US10783192B1 (en) | System, method, and user interface for a search engine based on multi-document summarization | |
CN102708174A (en) | Method and device for displaying rich media information in browser | |
WO2014194689A1 (en) | Method, server, browser, and system for recommending text information | |
CN111061954B (en) | Search result sorting method and device and storage medium | |
EP3311300A1 (en) | System and method for generating an electronic page | |
CN104090757A (en) | Method and device for displaying rich media information in browser | |
CN104503988A (en) | Searching method and device | |
CN104881447A (en) | Searching method and device | |
CN103425767B (en) | A kind of determination method and system pointing out data | |
Gali et al. | Extracting representative image from web page | |
Hall et al. | Enabling the discovery of digital cultural heritage objects through wikipedia | |
West et al. | A data-driven sketch of Wikipedia editors | |
Kim et al. | TrendsSummary: a platform for retrieving and summarizing trendy multimedia contents |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |