[go: up one dir, main page]

CN106649366B - Keyword search result classification method and device - Google Patents

Keyword search result classification method and device Download PDF

Info

Publication number
CN106649366B
CN106649366B CN201510729732.9A CN201510729732A CN106649366B CN 106649366 B CN106649366 B CN 106649366B CN 201510729732 A CN201510729732 A CN 201510729732A CN 106649366 B CN106649366 B CN 106649366B
Authority
CN
China
Prior art keywords
domain name
target
target domain
group
names
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510729732.9A
Other languages
Chinese (zh)
Other versions
CN106649366A (en
Inventor
王名洋
祁文
吴丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201510729732.9A priority Critical patent/CN106649366B/en
Publication of CN106649366A publication Critical patent/CN106649366A/en
Application granted granted Critical
Publication of CN106649366B publication Critical patent/CN106649366B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a method and a device for classifying keyword search results. Wherein, the method comprises the following steps: acquiring a target domain name associated with a keyword to be searched; configuring the acquired target domain name and a preset identifier in a domain name library to obtain a domain name group and the number of the target domain names contained in the domain name group, wherein the preset identifier is an identifier for representing the domain name type of the target domain name; and displaying the number of the target domain names in the domain name group in the target webpage. The method and the device solve the technical problem that the search results cannot be automatically classified in the prior art.

Description

Keyword search result classification method and device
Technical Field
The application relates to the field of computers, in particular to a method and a device for classifying keyword search results.
Background
The display modes of the search results of any keyword in the search engine are various, and the current crawler technology can grab the search results and directly return the search results to the system. For the captured results, there is no effective data sorting method to analyze and sort the captured results, and all search result data obtained by the crawler are often judged and accumulated manually. And the cost is high, the speed is low and the error rate is high when all search result data obtained by the crawler are judged and accumulated manually.
For Social Media Optimization (SMO) service, it is very important to reasonably arrange the front content of any brand in a website type, and if the ranking proportion type arrangement of the existing keyword search results and the optimized results are checked, the search results of the keywords need to be automatically judged.
Aiming at the problems in the prior art, no effective solution is provided at present.
Disclosure of Invention
The embodiment of the application provides a method and a device for classifying keyword search results, which are used for at least solving the technical problem that the search results cannot be automatically classified in the prior art.
According to an aspect of an embodiment of the present application, there is provided a method for classifying keyword search results, the method including: acquiring a target domain name associated with a keyword to be searched; configuring the acquired target domain name and a preset identifier in a domain name library to obtain a domain name group and the number of the target domain names contained in the domain name group, wherein the preset identifier is an identifier for representing the domain name type of the target domain name; and displaying the number of the target domain names in the domain name group in a target webpage.
Further, the acquiring of the number of the target domain names associated with the keyword to be searched for a plurality of target domain names includes: crawling address information of a plurality of associated webpages of the keywords to be searched; and intercepting the address information of each associated webpage to obtain a plurality of target domain names, wherein each associated webpage is intercepted to obtain one target domain name.
Further, the number of the domain name group and the number of the preset identifiers are multiple, and configuring the obtained target domain name and the preset identifiers in a domain name library, where obtaining the number of the target domain names included in the domain name group and the domain name group includes: classifying the target domain names according to the preset identifications to obtain a plurality of domain name groups, wherein each domain name group corresponds to one preset identification, and the preset identifications corresponding to different domain name groups are different; and counting the number of the target domain names in each domain name group.
Further, classifying each target domain name according to the preset identifier to obtain a plurality of domain name groups includes: searching a preset identifier of a target domain name Ai in the domain name library, wherein i is 1 to n in sequence, and n is the number of the target domain names; searching an identifier which is the same as a preset identifier of the target domain name Ai in a target database, wherein the target database is used for storing a plurality of domain name groups and the number of the target domain names contained in the domain name groups, and the identifier for representing the domain name type is stored in the target database; and under the condition that the identifier which is the same as the preset identifier of the target domain name Ai is found in the target database, storing the target domain name Ai to the domain name group where the preset identifier of the target domain name Ai is located.
Further, before displaying the number of the target domain names in the domain name group in the target web page, the method further includes: sequencing the domain name groups according to the number of the target domain names in each domain name group to obtain a plurality of sequenced domain name groups, and displaying the number of the target domain names in the domain name groups in a target webpage comprises: and displaying the number group of the target domain names in the plurality of domain name groups after sorting in the target webpage.
According to another aspect of the embodiments of the present application, there is also provided an apparatus for classifying keyword search results, the apparatus including: an acquisition unit configured to acquire a target domain name associated with a keyword to be searched; a configuration unit, configured to configure the obtained target domain name with a preset identifier in a domain name library, so as to obtain a domain name group and the number of the target domain names included in the domain name group, where the preset identifier is an identifier for representing a domain name type of the target domain name; and the display unit is used for displaying the number of the target domain names in the domain name group in a target webpage.
Further, the number of the target domain names is multiple, and the obtaining unit includes: the crawling module is used for crawling address information of a plurality of associated webpages of the keywords to be searched; and the intercepting module is used for intercepting the address information of each associated webpage to obtain a plurality of target domain names, wherein each associated webpage is intercepted to obtain one target domain name.
Further, the number of the domain name groups and the number of the preset identifiers are both multiple, and the configuration unit includes: the classification module is used for classifying the target domain names according to the preset identifications to obtain a plurality of domain name groups, wherein each domain name group corresponds to one preset identification, and the preset identifications corresponding to different domain name groups are different; and the counting module is used for counting the number of the target domain names in each domain name group.
Further, the classification module includes: the first searching submodule is used for searching a preset identifier of a target domain name Ai in the domain name library, wherein i is 1 to n in sequence, and n is the number of the target domain names; a second searching sub-module, configured to search, in a target database, an identifier that is the same as a preset identifier of the target domain name Ai, where the target database is a database that is used to store a plurality of domain name groups and the number of the target domain names included in the plurality of domain name groups, and an identifier that represents a domain name type has been stored in the target database; and the storage sub-module is used for storing the domain name group from the target domain name Ai to the preset identifier of the target domain name Ai under the condition that the identifier which is the same as the preset identifier of the target domain name Ai is found in the target database.
Further, the apparatus further comprises: a sorting unit, configured to sort the domain name groups according to the number of the target domain names in each domain name group before the display unit displays the number of the target domain names in the domain name groups in a target web page, so as to obtain a plurality of domain name groups after sorting, where the display unit includes: and the display module is used for displaying the quantity group of the target domain names in the plurality of domain name groups after the sorting in the target webpage.
In the embodiment of the application, a target domain name associated with a keyword to be searched is obtained; configuring the acquired target domain name and a preset identifier in a domain name library to obtain a domain name group and the number of the target domain names contained in the domain name group, wherein the preset identifier is an identifier for representing the domain name type of the target domain name; and displaying the number of the target domain names in the domain name group in a target webpage. The method comprises the steps of automatically acquiring a domain name (namely a target domain name) associated with a keyword to be searched through a system, configuring the target domain name and a preset identification used for representing the domain name type of the target domain name in a domain name library, configuring to obtain a domain name group, and realizing automatic classification of the target domain name, wherein the target domain name with the same domain name type is stored in the same domain name group, so that the purposes of automatically acquiring the target domain name and configuring the target domain name are achieved, compared with the method for manually sorting and classifying searched search results in the prior art, the technical problem that the search results cannot be automatically sorted in the prior art is solved, and the technical effect of automatically sorting the search results is realized.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
FIG. 1 is a flow chart of a method for classifying keyword search results according to an embodiment of the present application; and
fig. 2 is a schematic diagram of a keyword search result classification apparatus according to an embodiment of the present application.
Detailed Description
In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
The noun explains:
domain name library: i.e. a database obtained by tagging known domain names, e.g. (the label of zhidao
SMO: social Media Optimization (SMO) is a whole set of methods to get public dissemination through social media, online organizations and community sites. The news of the website is published externally by using social media, and certain elements of the website are not optimized. Social media optimization is a recent form of network marketing.
In accordance with an embodiment of the present application, there is provided a method embodiment of a method for classifying keyword search results, it is noted that the steps illustrated in the flowchart of the drawings may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowchart, in some cases the steps illustrated or described may be performed in an order different than here.
Fig. 1 is a flowchart of a method for classifying keyword search results according to an embodiment of the present application, and as shown in fig. 1, the method includes the following steps S102 to S106:
step S102, obtaining a target domain name associated with the keyword to be searched.
Specifically, if any keyword "coca cola" to be searched is input in the search engine, domain name information related to "coca cola", such as domain name information of "www.baidu.com" and "www.coca-cola. Com is the target domain name, wherein, www.baidu.com and www.coca-cola.
Step S104, configuring the acquired target domain name and a preset identifier in a domain name library to obtain a domain name group and the number of the target domain names contained in the domain name group, wherein the preset identifier is an identifier for representing the domain name type of the target domain name.
Specifically, the preset identifier is used for representing the domain name type of the target domain name. If any acquired domain name is: "www.aaa.com", and the website is a category a website, the preset identifier can be represented by "00", and can also be represented by other identifiers; if any acquired domain name is 'www.abc.com', and the website is a B-type website, the preset identifier may be represented by '01', or may be represented by other identifiers; if any of the obtained domain names is "www.bbb.com" and the website is other types of websites, it can be represented by a different identifier than the identifiers of the a-type and B-type websites. The domain name library is a database obtained by inducing and sorting known domain names by a user and in a labeling mode, and each domain name is classified and stored in the database according to the type of the domain name.
And step S106, displaying the number of the target domain names in the domain name group in the target webpage.
Specifically, after configuring the target domain name and the preset identifier, a domain name group is obtained, where the domain name group includes one or more target domain names, and the target domain names included in each domain name group are domain names of the same type.
According to the method for classifying the keyword search results provided by the embodiment of the application, the domain name (namely the target domain name) associated with the keyword to be searched is automatically acquired through the system, the target domain name and the preset identification used for representing the domain name type of the target domain name in the domain name library are configured, and the domain name group is obtained through configuration, so that the automatic classification of the target domain name is realized, wherein the target domain name with the same domain name type is stored in the same domain name group, the purposes of automatically acquiring the target domain name and configuring the target domain name are achieved, compared with the prior art that the searched search results are manually sorted and classified, the technical problem that the search results cannot be automatically classified in the prior art is solved, and the technical effect of automatically sorting the search results is realized.
Optionally, the number of the target domain names is multiple, and acquiring the target domain name associated with the keyword to be searched includes the following steps S1021 to S1023:
step S1021, address information of a plurality of associated webpages of the keywords to be searched is crawled.
And S1023, intercepting the address information of each associated webpage to obtain a plurality of target domain names, wherein each associated webpage is intercepted to obtain one target domain name.
Specifically, when a keyword to be searched is input, the web crawler searches in a search engine to obtain search results (i.e., address information of a plurality of associated web pages, also called URLs), for example, the searched URLs of the associated web pages are www.abc.com/1234/a.html, www.bbb.com/5678/a.html, etc., and the obtained URLs are intercepted to obtain domain names of urlww.abc.com/1234/a.html and www.bbb.com/5678/a.html, which are respectively: www.abc.com and www.bbb.com, wherein the domain names www.abc.com and www.bbb.com obtained by interception are target domain names.
It should be noted that each associated web page is intercepted to obtain a target domain name. Suppose that the multiple associated web pages crawled by the web crawler are: www.abc.com/1234/a.html, www.abc.com/5678/a.html, and the target domain names obtained by intercepting the address information of the associated web pages are: www.abc.com, and www.abc.com, the number of target domain names counted is 2.
Optionally, the number of the domain name group and the number of the preset identifiers are multiple, and configuring the acquired target domain name and the preset identifiers in the domain name library to obtain the number of the target domain names included in the domain name group and the domain name group includes the following steps S1041 to S1043:
step S1041, classifying the multiple target domain names according to preset identifiers to obtain multiple domain name groups, where each domain name group corresponds to one preset identifier, and the preset identifiers corresponding to different domain name groups are different.
Step S1043, counting the number of target domain names in each domain name group.
Specifically, the intercepted target domain names are classified according to preset identifications in a domain name library to obtain a plurality of domain name groups, and the domain names of the target domain names in different domain name groups are different in type, that is, each domain name group corresponds to one preset identification. For example, the target domain names stored in the domain name group a are all domain names of "class a websites", the target domain names stored in the domain name group B are all domain names of "class B websites", and so on.
And classifying the plurality of target domain names to obtain a plurality of domain name groups, and then counting the number of the target domain names contained in each domain name group.
Optionally, the classifying each target domain name according to the preset identifier to obtain a plurality of domain name groups includes the following steps S1 to S5:
and step S1, searching the preset identification of the target domain name Ai in the domain name library, wherein i sequentially takes 1 to n, and n is the number of the target domain names.
Step S3, finding the identifier that is the same as the preset identifier of the target domain name Ai in a target database, where the target database is a database for storing a plurality of domain name groups and the number of target domain names included in the plurality of domain name groups, and the identifier that represents the domain name type has been stored in the target database.
Step S5, storing the domain name group from the target domain name Ai to the preset identifier of the target domain name Ai when the identifier identical to the preset identifier of the target domain name Ai is found in the target database.
Specifically, in this embodiment of the present application, the target database is configured to store a domain name group and the number of target domain names in the domain name group, where before the domain name group and the number of target domain names in the domain name group are stored, a plurality of identifiers are stored in the target database in advance, where the identifiers are also identifiers representing types of domain names, and the identifiers include identifiers that are the same as preset identifiers. Assuming that the preset identifier 1, the preset identifier 2 and the preset identifier 3 are pre-stored in the target database, the region corresponding to the preset identifier 1 is used for storing the target domain name identified as the preset identifier 1, the region corresponding to the preset identifier 2 is used for storing the target domain name identified as the preset identifier 2, and the region corresponding to the preset identifier 3 is used for storing the target domain name identified as the preset identifier 3. And classifying and storing the domain names known by the user in a domain name library according to preset identifications.
When classifying a plurality of target domain names, first, a preset identifier of a target domain name Ai is searched in a domain name library, for example, any target domain name Ai searched in the domain name library: www.abc.com, the preset identification is "00", where "00" indicates that the domain name stored in the preset identification area is the domain name of the class a website. After the preset identifier of the target domain name Ai is found, whether the preset identifier '00' is stored in advance is found in a target database, if the preset identifier '00' is found, the target domain name Ai is stored into the area where the preset identifier of the target domain name Ai is located, wherein under the condition that the classification of the target domain names A1 to An is completed, the target domain names contained in the area where each preset identifier is located form a domain name group.
It should be noted that the number of target domain names in each domain name group can be counted in the following two ways.
The first method is as follows: after the preset identifier of the target domain name a1 is obtained, the preset identifier is found in the target database, and the target domain name a1 is stored in the domain name group corresponding to the preset identifier, the parameters of the domain name group are cumulatively increased by 1 from the initial 0, when the next target domain name is stored in the domain name group, the parameters of the domain name group are increased by 1 on the basis of 1, and the increased value 2 is obtained, wherein each domain name group is correspondingly provided with one parameter. And after the target domain name An is stored in the corresponding domain name group, the parameter of each domain name group is the number of the target domain names in the domain name group.
The second method comprises the following steps: and after all the target domain names are respectively stored in the corresponding domain name groups, sequentially counting the number of the target domain names in each domain name group to obtain the number of the target domain names in each domain name group.
Optionally, before displaying the number of the target domain names in the domain name group in the target web page, the following step S7 is further included: and sequencing the plurality of domain name groups according to the number of the target domain names in each domain name group to obtain the sequenced domain name groups.
Displaying the number of the target domain names in the domain name group on the target web page includes step S9: and displaying the number group of the target domain names in the plurality of domain name groups after sorting in the target webpage.
Specifically, before the number of target domain names in the domain name group is displayed on the front-end web page (i.e., the target web page), the plurality of domain name groups may be further sorted from top to bottom according to the number of target domain names in the domain name group, for example, the number of target domain names in the domain name group corresponding to "class a website" is 10, the number of target domain names in the domain name group corresponding to "class B website" is 8, the number of target domain names in the domain name group corresponding to "class C website" is 6, and the like. And displaying the domain name group after the sorting in a front-end webpage.
According to the method for classifying the keyword search results, the obtained address information of the multiple associated webpages is intercepted by the web crawler to obtain the multiple target domain names, the multiple target domain names are processed and configured to obtain aggregated data, the aggregated data is displayed in the front-end webpage, and the content distribution condition of the search results is visually displayed to a user, so that the user can visually know the search result environment of the keywords to be searched. Meanwhile, the obtained target domain name is automatically configured, and the problems of high error rate and low efficiency caused by manual checking are solved.
The embodiment of the present application further provides a device for classifying a keyword search result, where the device is mainly used to perform the method for classifying a keyword search result provided in the foregoing content of the embodiment of the present application, and a specific description is given below to the device for classifying a keyword search result provided in the embodiment of the present application.
Fig. 2 is a schematic diagram of a keyword search result classification apparatus according to an embodiment of the present application, and as shown in fig. 2, the keyword search result classification apparatus mainly includes an obtaining unit 10, a configuration unit 20, and a display unit 30, where:
an obtaining unit 10, configured to obtain a target domain name associated with a keyword to be searched.
Specifically, if any keyword "coca cola" to be searched is input in the search engine, domain name information related to "coca cola", such as domain name information of "www.baidu.com" and "www.coca-cola. Com is the target domain name, wherein, www.baidu.com and www.coca-cola.
The configuration unit 20 is configured to configure the acquired target domain name and a preset identifier in the domain name library to obtain a domain name group and the number of the target domain names included in the domain name group, where the preset identifier is an identifier used for representing a domain name type of the target domain name.
Specifically, the preset identifier is used for representing the domain name type of the target domain name. If any acquired domain name is: "www.aaa.com", and the website is a category a website, the preset identifier can be represented by "00", and can also be represented by other identifiers; if any acquired domain name is 'www.abc.com', and the website is a B-type website, the preset identifier may be represented by '01', or may be represented by other identifiers; if any of the obtained domain names is "www.bbb.com" and the website is other types of websites, it can be represented by a different identifier than the identifiers of the a-type and B-type websites. The domain name library is a database obtained by inducing and sorting known domain names by a user and in a labeling mode, and each domain name is classified and stored in the database according to the type of the domain name.
And a display unit 30, configured to display the number of target domain names in the domain name group in the target web page.
Specifically, after configuring the target domain name and the preset identifier, a domain name group is obtained, where the domain name group includes one or more target domain names, and the target domain names included in each domain name group are domain names of the same type.
According to the classification device for the keyword search results, the domain name (namely the target domain name) associated with the keyword to be searched is automatically acquired through the system, the target domain name and the preset identification used for representing the domain name type of the target domain name in the domain name library are configured, and the domain name group is obtained through configuration, so that automatic classification of the target domain name is achieved, wherein the target domain name with the same domain name type is stored in the same domain name group, the purposes of automatically acquiring the target domain name and configuring the target domain name are achieved, compared with the prior art that search results searched are manually sorted and classified, the technical problem that the search results cannot be automatically sorted in the prior art is solved, and the technical effect of automatically sorting the search results is achieved.
Optionally, the number of the target domain names is multiple, and the obtaining unit includes: the crawling module is used for crawling address information of a plurality of associated webpages of the keywords to be searched; and the intercepting module is used for intercepting the address information of each associated webpage to obtain a plurality of target domain names, wherein each associated webpage is intercepted to obtain one target domain name.
Specifically, when a keyword to be searched is input, a crawler module searches in a search engine to obtain search results (namely, address information of a plurality of associated web pages, also called as URLs), for example, the searched URLs of the associated web pages are www.abc.com/1234/a.html, www.bbb.com/5678/a.html and the like, and an intercepting module intercepts the acquired urlww.abc.com/1234/a.html and www.bbb.com/5678/a.html to obtain domain names www.abc.com and www.bbb.com, wherein the intercepted domain names www.abc.com and www.bbb.com are target domain names.
It should be noted that each associated web page is intercepted to obtain a target domain name. Suppose that the multiple associated web pages crawled by the web crawler are: www.abc.com/1234/a.html, www.abc.com/5678/a.html, and the target domain names obtained by intercepting the address information of the associated web pages are: www.abc.com, and www.abc.com, the number of target domain names counted is 2.
Optionally, the number of the domain name groups and the number of the preset identifiers are multiple, and the configuration unit includes: the classification module is used for classifying the target domain names according to preset identifications to obtain a plurality of domain name groups, wherein each domain name group corresponds to one preset identification, and the preset identifications corresponding to different domain name groups are different; and the counting module is used for counting the number of the target domain names in each domain name group.
Specifically, the intercepted target domain name is classified by a classification module according to preset identifications in a domain name library to obtain a plurality of domain name groups, and the domain name types of the target domain name contained in different domain name groups are different, that is, each domain name group corresponds to one preset identification. For example, the target domain names stored in the domain name group a are all domain names of "class a websites", the target domain names stored in the domain name group B are all domain names of "class B websites", and so on.
After the plurality of target domain names are classified to obtain a plurality of domain name groups, the number of the target domain names contained in each domain name group is counted through a counting module.
Optionally, the classification module includes: the first searching submodule is used for searching a preset identifier of a target domain name Ai in a domain name library, wherein i is 1 to n in sequence, and n is the number of the target domain names; the second searching sub-module is used for searching for an identifier which is the same as the preset identifier of the target domain name Ai in a target database, wherein the target database is used for storing a plurality of domain name groups and the number of the target domain names contained in the domain name groups, and the identifier for representing the domain name type is stored in the target database; and the storage sub-module is used for storing the domain name group from the target domain name Ai to the preset identifier of the target domain name Ai under the condition that the identifier which is the same as the preset identifier of the target domain name Ai is found in the target database.
Specifically, when classifying a plurality of target domain names, first, a preset identifier of a target domain name Ai is searched in a domain name library through a first search sub-module, for example, any target domain name Ai found in the domain name library: www.abc.com, the preset identification is "00", where "00" indicates that the domain name stored in the preset identification area is the domain name of the class a website. After the preset identifier of the target domain name Ai is found, whether the preset identifier '00' is stored in advance is found in a target database through a second finding submodule, if the preset identifier '00' is found to be stored, the target domain name Ai is stored into a domain name group where the preset identifier of the target domain name Ai is located through a storage submodule, wherein under the condition that the target domain names A1 to An are equally classified, the target domain name contained in the area where each preset identifier is located forms a domain name group.
It should be noted that the number of target domain names in each domain name group can be counted in the following two ways.
The first method is as follows: after the preset identifier of the target domain name a1 is obtained, the preset identifier is found in the target database, and the target domain name a1 is stored in the domain name group corresponding to the preset identifier, the parameters of the domain name group are cumulatively increased by 1 from the initial 0, when the next target domain name is stored in the domain name group, the parameters of the domain name group are increased by 1 on the basis of 1, and the increased value 2 is obtained, wherein each domain name group is correspondingly provided with one parameter. And after the target domain name An is stored in the corresponding domain name group, the parameter of each domain name group is the number of the target domain names in the domain name group.
The second method comprises the following steps: and after all the target domain names are respectively stored in the corresponding domain name groups, sequentially counting the number of the target domain names in each domain name group to obtain the number of the target domain names in each domain name group.
Optionally, the apparatus for classifying a keyword search result provided by the present application further includes: the sorting unit is used for sorting the plurality of domain name groups according to the number of the target domain names in each domain name group before displaying the number of the target domain names in the domain name groups in the target webpage to obtain a plurality of domain name groups after sorting; the display unit includes: and the display module is used for displaying the number group of the target domain names in the plurality of domain name groups after the sorting in the target webpage.
Specifically, before the number of the target domain names in the domain name group is displayed on the front-end webpage (i.e., the target webpage), the plurality of domain name groups may be further sorted from top to bottom according to the number of the target domain names in the domain name group by the sorting unit, for example, the number of the target domain names in the domain name group corresponding to "a-type website" is 10, the number of the target domain names in the domain name group corresponding to "B-type website" is 8, the number of the target domain names in the domain name group corresponding to "C-type website" is 6, and the like. And displaying the domain name group after the sorting in a front-end webpage.
The device for classifying the keyword search results comprises a processor and a memory, wherein the acquisition unit, the configuration unit, the display unit and the like are stored in the memory as program units, and the processor executes the program units stored in the memory to realize corresponding functions.
The processor comprises a kernel, and the kernel calls the corresponding program unit from the memory. The kernel can be set to be one or more than one, the target domain name is automatically acquired and configured by adjusting the kernel parameters, and the technical problem that the search result cannot be automatically classified in the prior art is solved.
The memory may include volatile memory in a computer readable medium, Random Access Memory (RAM) and/or nonvolatile memory such as Read Only Memory (ROM) or flash memory (flash RAM), and the memory includes at least one memory chip.
The present application further provides a computer program product adapted to perform program code for initializing the following method steps when executed on a data processing device: acquiring a target domain name associated with a keyword to be searched; configuring the acquired target domain name and a preset identifier in a domain name library to obtain a domain name group and the number of the target domain names contained in the domain name group, wherein the preset identifier is an identifier for representing the domain name type of the target domain name; displaying the number of target domain names in the domain name group in the target webpage
The above-mentioned serial numbers of the embodiments of the present application are merely for description and do not represent the merits of the embodiments.
In the above embodiments of the present application, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units may be a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application may be substantially implemented or contributed to by the prior art, or all or part of the technical solution may be embodied in a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic or optical disk, and other various media capable of storing program codes.
The foregoing is only a preferred embodiment of the present application and it should be noted that those skilled in the art can make several improvements and modifications without departing from the principle of the present application, and these improvements and modifications should also be considered as the protection scope of the present application.

Claims (8)

1. A method for classifying keyword search results is characterized by comprising the following steps:
acquiring a target domain name associated with a keyword to be searched;
configuring the acquired target domain name and a preset identifier in a domain name library to obtain a domain name group and the number of the target domain names contained in the domain name group, wherein the preset identifier is an identifier for representing the domain name type of the target domain name; and
displaying the number of the target domain names in the domain name group in a target webpage;
before displaying the number of the target domain names in the domain name group in the target webpage, the method further comprises:
sequencing the domain name groups according to the number of the target domain names in each domain name group to obtain a plurality of sequenced domain name groups,
displaying the number of the target domain names in the domain name group in a target webpage comprises: and displaying the number group of the target domain names in the plurality of domain name groups after sorting in the target webpage.
2. The method according to claim 1, wherein the number of the target domain names is multiple, and acquiring the target domain name associated with the keyword to be searched comprises:
crawling address information of a plurality of associated webpages of the keywords to be searched; and
intercepting the address information of each associated webpage to obtain a plurality of target domain names, wherein each associated webpage is intercepted to obtain one target domain name.
3. The method according to claim 1, wherein the number of the domain name group and the number of the preset identifiers are both multiple, and the step of configuring the obtained target domain name with the preset identifiers in a domain name library to obtain the number of the target domain names included in the domain name group and the domain name group comprises:
classifying the target domain names according to the preset identifications to obtain a plurality of domain name groups, wherein each domain name group corresponds to one preset identification, and the preset identifications corresponding to different domain name groups are different; and
and counting the number of the target domain names in each domain name group.
4. The method of claim 3, wherein classifying each of the target domain names according to the preset identifier to obtain a plurality of domain name groups comprises:
searching a preset identifier of a target domain name Ai in the domain name library, wherein i is 1 to n in sequence, and n is the number of the target domain names;
searching an identifier which is the same as a preset identifier of the target domain name Ai in a target database, wherein the target database is used for storing a plurality of domain name groups and the number of the target domain names contained in the domain name groups, and the identifier for representing the domain name type is stored in the target database; and
and under the condition that the identifier which is the same as the preset identifier of the target domain name Ai is found in a target database, storing the target domain name Ai to the domain name group where the preset identifier of the target domain name Ai is located.
5. An apparatus for classifying a keyword search result, comprising:
an acquisition unit configured to acquire a target domain name associated with a keyword to be searched;
a configuration unit, configured to configure the obtained target domain name with a preset identifier in a domain name library, so as to obtain a domain name group and the number of the target domain names included in the domain name group, where the preset identifier is an identifier for representing a domain name type of the target domain name; and
the display unit is used for displaying the number of the target domain names in the domain name group in a target webpage;
the device further comprises:
a sorting unit, configured to sort the domain name groups according to the number of the target domain names in each domain name group before the display unit displays the number of the target domain names in the domain name groups in a target web page, so as to obtain a plurality of domain name groups after sorting,
the display unit includes: and the display module is used for displaying the quantity group of the target domain names in the plurality of domain name groups after the sorting in the target webpage.
6. The apparatus according to claim 5, wherein the number of the target domain names is plural, and the obtaining unit includes:
the crawling module is used for crawling address information of a plurality of associated webpages of the keywords to be searched; and
and the intercepting module is used for intercepting the address information of each associated webpage to obtain a plurality of target domain names, wherein each associated webpage is intercepted to obtain one target domain name.
7. The apparatus according to claim 5, wherein the number of the domain name group and the number of the preset identities are both multiple, and the configuration unit includes:
the classification module is used for classifying the target domain names according to the preset identifications to obtain a plurality of domain name groups, wherein each domain name group corresponds to one preset identification, and the preset identifications corresponding to different domain name groups are different; and
and the counting module is used for counting the number of the target domain names in each domain name group.
8. The apparatus of claim 7, wherein the classification module comprises:
the first searching submodule is used for searching a preset identifier of a target domain name Ai in the domain name library, wherein i is 1 to n in sequence, and n is the number of the target domain names;
a second searching sub-module, configured to search, in a target database, an identifier that is the same as a preset identifier of the target domain name Ai, where the target database is a database that is used to store a plurality of domain name groups and the number of the target domain names included in the plurality of domain name groups, and an identifier that represents a domain name type has been stored in the target database; and
and the storage sub-module is used for storing the domain name group from the target domain name Ai to the preset identifier of the target domain name Ai under the condition that the identifier which is the same as the preset identifier of the target domain name Ai is found in the target database.
CN201510729732.9A 2015-10-30 2015-10-30 Keyword search result classification method and device Active CN106649366B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510729732.9A CN106649366B (en) 2015-10-30 2015-10-30 Keyword search result classification method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510729732.9A CN106649366B (en) 2015-10-30 2015-10-30 Keyword search result classification method and device

Publications (2)

Publication Number Publication Date
CN106649366A CN106649366A (en) 2017-05-10
CN106649366B true CN106649366B (en) 2020-05-22

Family

ID=58809574

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510729732.9A Active CN106649366B (en) 2015-10-30 2015-10-30 Keyword search result classification method and device

Country Status (1)

Country Link
CN (1) CN106649366B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102682011A (en) * 2011-03-14 2012-09-19 腾讯科技(深圳)有限公司 Method, device and system for establishing domain description name information sheet and searching
CN103399912A (en) * 2013-07-30 2013-11-20 腾讯科技(深圳)有限公司 Fishing web page clustering method and device
CN104166730A (en) * 2014-08-28 2014-11-26 无锡天脉聚源传媒科技有限公司 Method and device for displaying search result

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030088553A1 (en) * 2002-11-23 2003-05-08 Emergency 24, Inc. Method for providing relevant search results based on an initial online search query
US20050097088A1 (en) * 2003-11-04 2005-05-05 Dominic Bennett Techniques for analyzing the performance of websites
CN102609539B (en) * 2012-02-16 2015-06-10 北京搜狗信息服务有限公司 Search method and search system
CN104216928A (en) * 2013-06-05 2014-12-17 腾讯科技(深圳)有限公司 Site information acquiring method and device
CN104965894A (en) * 2015-06-19 2015-10-07 成都国腾实业集团有限公司 Data analysis system for IDC hazardous information monitoring platform

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102682011A (en) * 2011-03-14 2012-09-19 腾讯科技(深圳)有限公司 Method, device and system for establishing domain description name information sheet and searching
CN103399912A (en) * 2013-07-30 2013-11-20 腾讯科技(深圳)有限公司 Fishing web page clustering method and device
CN104166730A (en) * 2014-08-28 2014-11-26 无锡天脉聚源传媒科技有限公司 Method and device for displaying search result

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
金路." 网站搜索引擎优化技术研究与应用".《中国优秀硕士学位论文全文数据库 信息科技辑》.2013,I138-742. *

Also Published As

Publication number Publication date
CN106649366A (en) 2017-05-10

Similar Documents

Publication Publication Date Title
JP5575902B2 (en) Information retrieval based on query semantic patterns
CN108304410B (en) Method and device for detecting abnormal access page and data analysis method
CN102855309B (en) A kind of information recommendation method based on user behavior association analysis and device
US10216848B2 (en) Method and system for recommending cloud websites based on terminal access statistics
CN104217031B (en) A kind of method and apparatus that user's classification is carried out according to server search daily record data
US20120198056A1 (en) Techniques for Analyzing Website Content
JP2016505993A (en) Method and apparatus for configuring search terms, delivering advertisements, and retrieving product information
CN104537065A (en) Search result pushing method and system
CN104462397A (en) Promotion information processing method and promotion information processing device
CN107888606B (en) Domain name credit assessment method and system
CN104484406B (en) The dispensing analysis method and device of recommending data
CN105760380A (en) Database query method, device and system
CN104699837B (en) Method, device and server for selecting illustrated pictures of web pages
CN107483381A (en) Method and device for monitoring associated accounts
CN104462396A (en) Method and device for handing character strings
US20110270691A1 (en) Method and system for providing url possible new advertising
US9305054B2 (en) System and method for extracting analogous queries
CN104408144A (en) Detection method and device for web search keyword
CN110955855B (en) Information interception method, device and terminal
CN106649385B (en) Data reordering method and device based on HBase database
US20090063464A1 (en) System and method for visualizing and relevance tuning search engine ranking functions
CN111177566B (en) Information processing method, device, electronic equipment and storage medium
CN107544994B (en) Associated data processing method and device
CN106844406B (en) Search method and search device
CN106649366B (en) Keyword search result classification method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant