[go: up one dir, main page]

CN101350154B - Method and device for sorting electronic map data - Google Patents

Method and device for sorting electronic map data Download PDF

Info

Publication number
CN101350154B
CN101350154B CN 200810222422 CN200810222422A CN101350154B CN 101350154 B CN101350154 B CN 101350154B CN 200810222422 CN200810222422 CN 200810222422 CN 200810222422 A CN200810222422 A CN 200810222422A CN 101350154 B CN101350154 B CN 101350154B
Authority
CN
China
Prior art keywords
electronic map
map data
importance
sorting
keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN 200810222422
Other languages
Chinese (zh)
Other versions
CN101350154A (en
Inventor
董正斌
佟子健
王云峰
王登
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sohu New Media Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sohu New Media Information Technology Co Ltd filed Critical Beijing Sohu New Media Information Technology Co Ltd
Priority to CN 200810222422 priority Critical patent/CN101350154B/en
Publication of CN101350154A publication Critical patent/CN101350154A/en
Application granted granted Critical
Publication of CN101350154B publication Critical patent/CN101350154B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a priority method of electronic map data and a device, which solves the problem that a traditional artificial priority method causes a poor priority effect, wastes manpower and has high cost. The method comprises that a key word of each electronic map data is extracted, web sets of search results, which correspond to each electronic map data are obtained through that the key word is used to search. According the corresponding wet sets of search results of each electronic map data, the importance of the electronic map data is calculated, and the electronic map data is sorted according to the importance. The invention uses the internet popularity of internet to depict the importance degree of POI data, because the depiction represents the recognition of vast netizen even broad masses, the priority effect is better and has excellent mass basis and rationality. Further, a machine is used to automatically score and sort, thereby the man labor is effectively saved, the efficiency is higher, and the cost is very low.

Description

一种电子地图数据的排序方法及装置Method and device for sorting electronic map data

技术领域 technical field

本发明涉及网络技术领域,特别是涉及一种电子地图数据的排序方法及装置。The invention relates to the field of network technology, in particular to a sorting method and device for electronic map data.

背景技术 Background technique

随着地理信息系统的发展与完善,电子地图的设计开发技术也日趋成熟。电子地图中,有一类数据称为兴趣点数据(即Point of Interest,POI数据),是指人们感兴趣的数据,如餐馆、公园、商场等建筑物的地理信息,或是一些街道的信息等等。通常,POI数据包括名称、类别、经度、纬度四个方面的信息,有时也包括其他一些信息,如地址,电话、邮编等等。POI数据是电子地图最重要的元素之一,也是人们使用电子地图时最为关注的信息。With the development and improvement of geographic information system, the design and development technology of electronic map is becoming more and more mature. In electronic maps, there is a type of data called point of interest data (Point of Interest, POI data), which refers to data that people are interested in, such as geographic information of restaurants, parks, shopping malls and other buildings, or information about some streets, etc. wait. Usually, POI data includes information on four aspects: name, category, longitude, and latitude, and sometimes other information, such as address, phone number, zip code, and so on. POI data is one of the most important elements of electronic maps, and it is also the information that people pay most attention to when using electronic maps.

一个电子地图通常包含很多的POI数据,这些POI数据涵盖了该地图范围内的绝大部分地理信息。但是,该电子地图中地理信息的重要程度有所不同,如“天安门广场”比“中关村广场”重要,“北京大学”比“北京大学附属中学”重要,这种地理信息重要性的不同导致POI数据的重要性存在差异。An electronic map usually contains a lot of POI data, and these POI data cover most of the geographical information within the range of the map. However, the importance of geographic information in the electronic map is different. For example, "Tiananmen Square" is more important than "Zhongguancun Square", and "Peking University" is more important than "Peking University High School". This difference in the importance of geographic information leads to POI The importance of data varies.

POI排序是指根据POI数据重要性的不同对POI数据进行的排序,POI数据的重要性体现在其所指代地理信息的重要性。POI排序可应用在搜索引擎的排序中,即根据POI数据的重要性对电子地图的查询结果进行排序展示。POI sorting refers to the sorting of POI data according to the importance of POI data. The importance of POI data is reflected in the importance of the geographic information it refers to. POI sorting can be applied in the sorting of search engines, that is, sorting and displaying the query results of electronic maps according to the importance of POI data.

目前,还没有比较成熟的POI排序方法。传统上,电子地图的开发商会请一些编辑或者普通民众,根据人们对POI数据的熟悉程度来对POI数据进行排序,这种根据熟悉程度进行排序的核心思想是:如果一个POI数据所指代的地理位置非常重要,则它一定为人们所熟悉。这一思想具有一定的合理性,由于电子地图乃至实际的地理信息的使用者是普通民众,因此被普通民众熟悉的地理信息应该具有较高的重要性。At present, there is no relatively mature POI sorting method. Traditionally, developers of electronic maps will invite some editors or ordinary people to sort POI data according to people's familiarity with POI data. The core idea of sorting according to familiarity is: if a POI data refers to Geographic location is very important, then it must be familiar to people. This idea is reasonable to a certain extent. Since the users of electronic maps and even actual geographic information are ordinary people, geographic information familiar to ordinary people should be of high importance.

但是,这种方法存在如下问题:However, this method has the following problems:

第一,虽然可以用熟悉程度来刻画POI数据的重要程度,但是如何计算熟悉程度是一个非常困难的问题。因此,上述人工排序的方法由于只有极小一部分人参与,无法代表广大用户,所以排序效果没有保障,排序效果差;而且,由于人数较少,所以错误率也比较高。First, although familiarity can be used to describe the importance of POI data, how to calculate familiarity is a very difficult problem. Therefore, the above-mentioned manual sorting method cannot represent the vast number of users because only a small number of people participate, so the sorting effect is not guaranteed, and the sorting effect is poor; and, because the number of people is small, the error rate is also relatively high.

第二,由于POI数据量极大,而且更新很快,所以采用人工进行排序非常耗费人力,而且成本非常昂贵。Second, due to the large amount of POI data and the rapid update, manual sorting is very labor-intensive and expensive.

因此,这种人工排序方法无法得到实际使用。Therefore, this manual sorting method cannot be practically used.

发明内容 Contents of the invention

本发明提供一种电子地图数据的排序方法及装置,以解决传统的人工排序方法造成排序效果差、耗费人力、成本太高的问题。The invention provides a sorting method and device for electronic map data to solve the problems of poor sorting effect, manpower consumption and high cost caused by the traditional manual sorting method.

为解决上述技术问题,根据本发明提供的具体实施例,本发明公开了以下技术方案:In order to solve the above technical problems, according to the specific embodiments provided by the invention, the invention discloses the following technical solutions:

一种电子地图数据的排序方法,包括:A method for sorting electronic map data, comprising:

提取出每个电子地图数据的关键词;Extract keywords of each electronic map data;

利用所述关键词搜索,获取对应每个电子地图数据的搜索结果网页集合;Using the keyword search to obtain a set of search result webpages corresponding to each electronic map data;

针对集合中每个搜索结果网页,分别计算用于表示网页重要程度的第一数值和用于表示网页与关键词匹配程度的第二数值,根据相应集合中所有搜索结果网页的第一数值和第二数值,计算该电子地图数据的重要度;For each search result webpage in the collection, calculate the first numerical value used to represent the importance of the webpage and the second numerical value used to represent the degree of matching between the webpage and the keyword, according to the first numerical value and the second numerical value of all search result webpages in the corresponding collection Two values, calculating the importance of the electronic map data;

按照所述重要度对所述电子地图数据进行排序;sorting the electronic map data according to the importance;

其中,所述根据相应集合中所有搜索结果网页的第一数值和第二数值,计算该电子地图数据的重要度,具体包括:将集合中每个搜索结果网页的第一数值和第二数值相乘,然后再将集合中所有搜索结果网页的相乘结果求和,得到该电子地图数据的重要度。Wherein, the calculating the importance of the electronic map data according to the first value and the second value of all the search result web pages in the corresponding set specifically includes: comparing the first value and the second value of each search result web page in the set and then sum the multiplication results of all the search result webpages in the collection to obtain the importance of the electronic map data.

优选的,所述第一数值通过计算网页级别得到。Preferably, the first value is obtained by calculating the web page rank.

优选的,所述计算该电子地图数据的重要度之后,还包括:根据电子地图数据所属类别所具有的不同权重,将该电子地图数据的重要度乘以该电子地图数据所属类别的权重值,得到调整后的结果数据,用于排序。Preferably, after the calculation of the importance of the electronic map data, it further includes: according to the different weights of the categories to which the electronic map data belongs, multiplying the importance of the electronic map data by the weight value of the category to which the electronic map data belongs, Get the adjusted result data for sorting.

其中,所述提取出每个电子地图数据的关键词,具体包括:提取出每个电子地图数据的名称作为关键词。Wherein, the extracting the keyword of each electronic map data specifically includes: extracting the name of each electronic map data as the keyword.

优选的,还包括:提取出每个电子地图数据的地址信息,与名称一同作为关键词。Preferably, the method also includes: extracting the address information of each electronic map data, and using the name together as a keyword.

优选的,所述提取出每个电子地图数据的关键词之前,还包括:对原始的电子地图数据进行预处理,所述预处理包括去除无关符号、字符编码转换、调整统一格式;将预处理结果用于关键词的提取;Preferably, before extracting the keywords of each electronic map data, it also includes: preprocessing the original electronic map data, the preprocessing includes removing irrelevant symbols, character code conversion, and adjusting the unified format; The results are used for keyword extraction;

优选的,按照所述重要度对所述电子地图数据进行排序之后,还包括:在电子地图检索中,根据用户输入的查询词返回相匹配的检索结果,将检索结果中排序靠前的电子地图数据优先显示。Preferably, after sorting the electronic map data according to the importance, it also includes: in the electronic map retrieval, returning matching search results according to the query words input by the user, and sorting the top ranked electronic maps in the search results Data is displayed first.

优选的,按照所述重要度对所述电子地图数据进行排序之后,还包括:在图层显示时,选取显示范围内排序靠前的电子地图数据进行显示。Preferably, after sorting the electronic map data according to the importance, the method further includes: when displaying the layers, selecting and displaying the top-ranked electronic map data within the display range.

优选的,按照所述重要度对所述电子地图数据进行排序之后,还包括:对排序靠前的电子地图数据进行优先更新。Preferably, after sorting the electronic map data according to the importance, the method further includes: prioritizing updating of the electronic map data ranked higher.

本发明还提供了一种电子地图数据的排序装置,包括:The present invention also provides a sorting device for electronic map data, comprising:

关键词提取单元,用于提取出每个电子地图数据的关键词;A keyword extraction unit is used to extract keywords of each electronic map data;

查询单元,用于利用所述关键词进行搜索,获取对应每个电子地图数据的搜索结果网页集合;a query unit, configured to use the keyword to search, and obtain a set of search result webpages corresponding to each electronic map data;

计算单元,包括:第一计算子单元,用于针对集合中每个搜索结果网页,分别计算用于表示网页重要程度的第一数值;第二计算子单元,用于针对集合中每个搜索结果网页,分别计算用于表示网页与关键词匹配程度的第二数值;综合计算子单元,用于根据每个电子地图数据相应集合中的所有搜索结果网页的第一数值和第二数值,计算该电子地图数据的重要度;The calculation unit includes: a first calculation subunit, for each search result webpage in the collection, respectively calculating a first numerical value representing the importance of the webpage; a second calculation subunit, for each search result webpage in the collection The webpage is used to calculate the second numerical value used to represent the degree of matching between the webpage and the keyword; the comprehensive calculation subunit is used to calculate the second numerical value according to the first numerical value and the second numerical value of all search result webpages in the corresponding set of each electronic map data. The importance of electronic map data;

排序单元,用于按照所述重要度对所述电子地图数据进行排序;a sorting unit, configured to sort the electronic map data according to the importance;

其中,所述综合计算子单元将集合中每个搜索结果网页的第一数值和第二数值相乘,然后再将集合中所有搜索结果网页的相乘结果求和,得到该电子地图数据的重要度。Wherein, the comprehensive calculation subunit multiplies the first value and the second value of each search result webpage in the collection, and then sums the multiplication results of all the search result webpages in the collection to obtain the important value of the electronic map data. Spend.

优选的,所述第一计算子单元通过计算网页级别得到第一数值。Preferably, the first calculation subunit obtains the first value by calculating the webpage level.

优选的,所述装置还包括:调整单元,用于根据电子地图数据所属类别所具有的不同权重,将该电子地图数据的重要度乘以该电子地图数据所属类别的权重值,得到调整后的结果数据,并输出到排序单元用于排序。Preferably, the device further includes: an adjustment unit, configured to multiply the importance of the electronic map data by the weight value of the category to which the electronic map data belongs according to the different weights of the category to which the electronic map data belongs, to obtain the adjusted The resulting data, and output to the sort unit for sorting.

其中,所述关键词提取单元将提取出的电子地图数据的名称作为关键词。Wherein, the keyword extracting unit uses the name of the extracted electronic map data as a keyword.

优选的,所述关键词提取单元还将提取出的电子地图数据的地址信息,与名称一同作为关键词。Preferably, the keyword extracting unit also uses the extracted address information of the electronic map data together with the name as keywords.

优选的,所述装置还包括:预处理单元,用于对原始的电子地图数据进行预处理,并将预处理结果输出到关键词提取单元;其中,所述预处理包括去除无关符号、字符编码转换、调整统一格式。Preferably, the device further includes: a preprocessing unit, configured to preprocess the original electronic map data, and output the preprocessing result to the keyword extraction unit; wherein, the preprocessing includes removing irrelevant symbols, character encoding Convert and adjust the unified format.

优选的,所述装置还包括:检索单元,用于在电子地图检索中,根据用户输入的查询词返回相匹配的检索结果,将检索结果中排序靠前的电子地图数据优先显示。Preferably, the device further includes: a retrieval unit, configured to return matching retrieval results according to the query words input by the user during the electronic map retrieval, and preferentially display the electronic map data ranked first in the retrieval results.

优选的,所述装置还包括:图层显示单元,用于在图层显示时,选取显示范围内排序靠前的电子地图数据进行显示。Preferably, the device further includes: a layer display unit, configured to select and display the top-ranked electronic map data within the display range when the layer is displayed.

优选的,所述装置还包括:数据更新单元,用于对排序靠前的电子地图数据进行优先更新。Preferably, the device further includes: a data update unit, configured to update the electronic map data ranked first.

本发明还提供了一种搜索引擎系统,所述系统包括上述任一装置实施例所述的装置。The present invention also provides a search engine system, which includes the device described in any one of the above device embodiments.

根据本发明提供的具体实施例,本发明具有以下技术效果:According to the specific embodiments provided by the invention, the invention has the following technical effects:

首先,本发明利用互联网技术对POI数据进行排序,使用互联网的网络知名度来刻画POI数据的重要程度,而网络知名度是根据关键词(是从POI数据中提取出)在搜索引擎中返回的结果网页进行计算得到。由于这种刻画代表了广大网民乃至广大群众的认识,因此,利用网络知名度来对POI数据进行排序,排序的效果比较好,具有很好的群众基础和合理性。而且,使用机器自动对POI数据进行打分和排序,极大地节省了人力,效率更高,成本非常低廉。First of all, the present invention uses Internet technology to sort POI data, and uses Internet network popularity to describe the importance of POI data, and network popularity is the result webpage returned in the search engine according to keywords (extracted from POI data) Get calculated. Since this kind of description represents the understanding of the majority of netizens and even the general public, the use of Internet popularity to sort POI data has a better sorting effect and has a good mass base and rationality. Moreover, the use of machines to automatically score and sort POI data greatly saves manpower, is more efficient, and costs are very low.

其次,在利用网络知名度刻画POI数据的重要程度时,本发明主要使用了网页的重要程度、网页与关键词的匹配程度这两个指标,而且每个指标也有不同的计算方法。Secondly, when using network popularity to describe the importance of POI data, the present invention mainly uses two indicators, the importance of web pages and the matching degree of web pages and keywords, and each index also has a different calculation method.

再次,本发明还充分考虑了POI数据的类别对POI重要程度的影响,利用POI数据的类别信息来对基本的网络知名度得分进行调整从而得到POI的最终得分,从而更加准确地刻画了POI数据的重要程度。Again, the present invention also fully considers the impact of the category of POI data on the importance of POI, and uses the category information of POI data to adjust the basic network popularity score to obtain the final score of POI, thereby more accurately describing the POI data. Importance.

附图说明 Description of drawings

图1是本发明实施例一所述一种电子地图数据的排序方法流程图;Fig. 1 is a flowchart of a method for sorting electronic map data according to Embodiment 1 of the present invention;

图2是本发明实施例二所述一种POI数据的排序方法流程示意图;2 is a schematic flow chart of a method for sorting POI data according to Embodiment 2 of the present invention;

图3是本发明实施例所述一种电子地图数据的排序装置结构图。Fig. 3 is a structural diagram of an electronic map data sorting device according to an embodiment of the present invention.

具体实施方式 Detailed ways

为使本发明的上述目的、特征和优点能够更加明显易懂,下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

实施例一:Embodiment one:

针对传统的POI人工排序方法,本发明实施例提供了一种利用互联网技术进行的排序方法。参照图1,是本发明实施例一所述一种电子地图数据的排序方法流程图。本实施例中,所述电子地图数据以POI数据为例进行说明,但所述电子地图数据包括但不限于POI数据。Aiming at the traditional POI manual sorting method, the embodiment of the present invention provides a sorting method using Internet technology. Referring to FIG. 1 , it is a flowchart of a method for sorting electronic map data according to Embodiment 1 of the present invention. In this embodiment, the electronic map data is described by taking POI data as an example, but the electronic map data includes but not limited to POI data.

S101,提取出每个POI数据的关键词;S101, extracting keywords of each POI data;

本实施例需要从每个POI数据中提取出一个关键词,用来在互联网的搜索引擎中进行查询。由于每个POI数据具有一些属性,包括名称、类别、坐标或其它属性信息,因此提取时可以从这些属性信息中提取出最能代表这个POI数据的词作为关键词。本实施例中,关键词的基本部分是POI的名称,因为名称是POI数据最重要的部分。In this embodiment, a keyword needs to be extracted from each POI data to be used for querying in an Internet search engine. Since each POI data has some attributes, including name, category, coordinates or other attribute information, the words that best represent the POI data can be extracted from these attribute information as keywords during extraction. In this embodiment, the basic part of the keyword is the name of the POI, because the name is the most important part of the POI data.

优选的,在提取POI数据的名称时,需要对名称进行一些处理,如去除名称中的分店、分公司等信息。因为如餐饮、公司这样的名称,里面经常存在分店、分公司的情况,而POI排序的目的是为了把总店、总公司排在靠前的位置,所以这时就可以把这种分店、分公司的字符去除。如“xx公司五道口分店”,就可以把“五道口分店”去除,只剩“xx公司”。Preferably, when extracting the name of the POI data, some processing needs to be performed on the name, such as removing information such as branches and branches in the name. Because there are often branches and branches in names such as catering and companies, and the purpose of POI sorting is to rank the head office and head office at the top, so this kind of branch and branch can be ranked at this time. Company characters removed. For example, "xx company Wudaokou branch", you can remove "Wudaokou branch", leaving only "xx company".

优选的,也可以加入其它一些信息作为名称的补充,如地址、区县等。因为有些名称太短,不具有实际意义,如公厕、停车场等词,这时候就可以把POI的地址加入进来,和名称一起作为关键词,这样处理的效果更好。Preferably, some other information can also be added as a supplement to the name, such as address, district and county, etc. Because some names are too short to have practical meaning, such as public toilet, parking lot, etc., at this time, you can add the address of the POI and use it as a keyword together with the name, so that the processing effect is better.

S102,利用所述关键词进行搜索,获取对应每个POI数据的搜索结果网页集合;S102, using the keyword to search, and obtaining a set of search result webpages corresponding to each POI data;

上述提取出的关键词,在搜索引擎中进行查询并取得返回的结果集合。The keywords extracted above are queried in a search engine and a returned result set is obtained.

S103,根据每个POI数据的相应搜索结果网页集合,计算该POI数据的重要度;S103. Calculate the importance of the POI data according to the corresponding search result web page set of each POI data;

本发明是利用互联网的网络知名度来刻画POI数据的重要程度,而POI的网络知名度是根据对应该POI的搜索结果网页集合计算。其中,所述网络知名度是指一个名称在网络中的知名程度。The present invention uses the network popularity of the Internet to describe the importance of POI data, and the network popularity of POI is calculated according to the set of search result webpages corresponding to the POI. Wherein, the network popularity refers to the degree of popularity of a name in the network.

针对每个POI数据,利用提取出的关键词进行查询能够得到多个搜索结果网页(即网页集合),而每一个网页具有两个指标:一个是网页的重要程度,另一个是网页与关键词的匹配程度。本实施例主要利用所述两个指标来衡量POI数据的网络知名度。For each POI data, multiple search result pages (that is, a collection of web pages) can be obtained by using the extracted keywords to query, and each web page has two indicators: one is the importance of the web page, and the other is the web page and keyword. degree of matching. In this embodiment, the above two indicators are mainly used to measure the network popularity of POI data.

由于每种指标都有不同的计算方法,本实施例只采用其中一种比较常用的方法。对于网页的重要程度,采用计算网页级别(PageRank)的方法。网页的PageRank是度量网页重要程度的一种指标,是根据网页之间的超链接来进行计算,源自于Google创始人提出的PageRank算法。当然,也可以用网页的流量来表示网页的重要程度。对于网页与关键词的匹配程度(MatchRank),通常采用的计算方法是:如果关键词在网页中完整出现,则匹配程度较高,如果关键词被切分后出现,则匹配程度较低。本发明包含但不限于以上计算方法。Since each indicator has a different calculation method, this embodiment only adopts one of the commonly used methods. For the importance of the webpage, a method of calculating the webpage rank (PageRank) is adopted. The PageRank of a webpage is an indicator to measure the importance of a webpage. It is calculated based on the hyperlinks between webpages and originated from the PageRank algorithm proposed by the founder of Google. Of course, the web page traffic can also be used to indicate the importance of the web page. For the matching degree (MatchRank) between a web page and a keyword, the calculation method usually adopted is: if the keyword appears completely in the web page, the matching degree is high; if the keyword appears after being segmented, the matching degree is low. The present invention includes but is not limited to the above calculation methods.

得到每个网页的PageRank和MatchRank后,将每个网页的PageRank和MatchRank相乘,然后再将对应同一个POI数据的所有网页的相乘结果相加,即得到一个POI数据的计算结果。本实施例中,采用对POI数据打分的方式,所以所述计算结果得到的是一个对该POI数据的网络知名度进行刻画的分值。After obtaining the PageRank and MatchRank of each webpage, multiply the PageRank and MatchRank of each webpage, and then add the multiplication results of all webpages corresponding to the same POI data to obtain a calculation result of POI data. In this embodiment, the method of scoring POI data is adopted, so the calculation result is a score describing the network popularity of the POI data.

需要说明的是,上述根据网页的PageRank和MatchRank采用相乘再相加的计算来获得一个POI分值的方法,仅作为本实施例的一种实现方式,本发明包括但不限于所述方法。It should be noted that the above method of obtaining a POI score by multiplying and adding the PageRank and MatchRank of the webpage is only an implementation of this embodiment, and the present invention includes but is not limited to the method.

S104,按照所述重要度对所述POI数据进行排序。S104, sort the POI data according to the importance.

得到每个POI数据的得分后,利用所述得分即可以对所有的POI数据进行排序。After obtaining the score of each POI data, all POI data can be sorted by using the score.

由上述处理流程可知,本发明使用互联网的网络知名度来刻画POI数据的重要程度,由于这种刻画代表了广大网民乃至广大群众的认识,因此利用网络知名度来对POI数据进行排序,排序的效果比较好,具有很好的群众基础和合理性。而且,使用机器自动对POI数据进行打分和排序,极大地节省了人力,效率更高,成本非常低廉。As can be seen from the above processing flow, the present invention uses the network popularity of the Internet to describe the importance of POI data. Since this description represents the understanding of the majority of netizens and even the general public, the network popularity is used to sort the POI data. The effect of sorting is compared. Well, it has a good mass base and rationality. Moreover, the use of machines to automatically score and sort POI data greatly saves manpower, is more efficient, and costs are very low.

实施例二:Embodiment two:

本发明实施例二提供了一种具体应用实例。Embodiment 2 of the present invention provides a specific application example.

参照图2,是本发明实施例二所述一种POI数据的排序方法流程示意图。Referring to FIG. 2 , it is a schematic flowchart of a method for sorting POI data according to Embodiment 2 of the present invention.

S201,对原始的POI数据进行预处理;S201, preprocessing the original POI data;

对原始POI数据进行清洗过滤,主要功能是使数据符合一定的输入标准。所述预处理主要包括去除无关符号、字符编码转换、调整统一格式三个部分。其中,The main function of cleaning and filtering the original POI data is to make the data meet certain input standards. The preprocessing mainly includes three parts: removal of irrelevant symbols, conversion of character codes, and adjustment of unified format. in,

1)去除无关符号:由于数据的来源或者其他问题,数据中可能存在一些无关符号,这些符号没有实际意义,如!、#等符号,还有乱码等,需要将这些无关符号去除,起到一个清洗过滤作用;1) Remove irrelevant symbols: Due to the source of the data or other problems, there may be some irrelevant symbols in the data, these symbols have no practical meaning, such as! , # and other symbols, as well as garbled characters, etc., these irrelevant symbols need to be removed to play a cleaning and filtering role;

2)字符编码转换:使字符的编码一致,可以有利于后面打分的公平。如半角转全角,繁体转简体等;2) Character encoding conversion: Make the encoding of characters consistent, which can be beneficial to the fairness of scoring later. Such as half-width to full-width, traditional to simplified, etc.;

3)调整格式:数据的输入格式应该统一,这样利于编程。3) Adjust the format: the input format of the data should be unified, which is convenient for programming.

S202,针对预处理后的POI数据,提取出每个POI数据的关键词;S202, extracting keywords of each POI data for the preprocessed POI data;

提取过程中,可以根据地名库和别名库识别出名称中包含的分店、分公司等信息,然后去除这些信息。例如“xx公司五道口分店”,如果“五道口”是地名库中的一个词,“分店”是特有词库中的词,这样就可以把“五道口分店”去除,只剩“xx公司”。During the extraction process, information such as branches and branches contained in the name can be identified according to the geographical name database and alias database, and then these information can be removed. For example " Wudaokou branch of xx company ", if " Wudaokou " is a word in the place-name database, " branch " is the word in the special lexicon, so just can remove " Wudaokou branch ", only remaining " xx company ".

S203,利用所述关键词进行搜索,获取对应每个POI数据的搜索结果网页集合;S203, use the keyword to search, and obtain a set of search result webpages corresponding to each POI data;

S204,针对每个POI数据,根据相对应的搜索结果网页集合计算得到用于表示该POI数据重要程度的基本分值;S204, for each POI data, calculate and obtain a basic score for indicating the importance of the POI data according to the corresponding search result webpage set;

本实施例中,根据网页的PageRank和MatchRank计算得到的分值作为POI数据的基本分值,这个基本分值是对该POI数据的网络知名度的刻画。In this embodiment, the score calculated according to the PageRank and MatchRank of the webpage is used as the basic score of the POI data, and this basic score is a description of the network popularity of the POI data.

S205,根据POI数据的类别信息调整所述基本分值;S205. Adjust the basic score according to the category information of the POI data;

由于POI数据具有很多类别,而不同类别的数据在网络上具有不同的性质。例如,餐饮类的POI数据要比政府机关类的POI数据在网络上更受到关注,但是政府机关类的POI数据要比餐饮类的POI数据更为重要,因为在实际生活中人们更关注政府机关类的POI数据。因此,为了平衡不同类别POI数据的得分,本实施例引入了类别权重,需要根据类别的权重来调整POI的基本得分,使得类别重要的POI得分提高,类别不重要的POI得分降低。类别的权重可以根据经验来设定,也可以使用一些训练数据来训练获得。调整过程是:用POI数据的基本得分乘以其所属类别的权重大小,这样就得到最终得分。Since POI data has many categories, and different categories of data have different properties on the network. For example, POI data of catering is more concerned than POI data of government agencies on the Internet, but POI data of government agencies is more important than POI data of catering, because people pay more attention to government agencies in real life Class POI data. Therefore, in order to balance the scores of different categories of POI data, this embodiment introduces category weights, and the basic scores of POIs need to be adjusted according to the category weights, so that the scores of POIs with important categories are increased, and the scores of POIs with unimportant categories are decreased. The weight of the category can be set according to experience, or can be obtained by training using some training data. The adjustment process is: the basic score of POI data is multiplied by the weight of the category to which it belongs, so as to obtain the final score.

例如,有两个POI数据,一个是北京大学第三医院,一个是郭林家常菜。由于餐饮类的名称在网页中出现比较多,所以郭林家常菜的基本得分为5分,而北京大学第三医院的得分为4分。但是根据人们的经验和习惯来说,医院会比餐饮类重要,所以医院类的类别权重较大,设为1.5,而餐饮的权重较低,设为0.8。这样最终两个POI的得分分别为:北京大学第三医院4×1.5=6,郭林家常菜5×0.8=4。从而北京大学第三医院比郭林家常菜的得分高,排序靠前,这就符合了人们的一般认识。For example, there are two POI data, one is Peking University Third Hospital, and the other is Guo Lin's home cooking. Since the names of catering categories appear more frequently on the webpage, the basic score of Guo Lin’s home cooking is 5 points, while the score of Peking University Third Hospital is 4 points. But according to people's experience and habits, hospitals are more important than catering, so the hospital category has a higher weight, which is set to 1.5, while the catering category has a lower weight, which is set to 0.8. In this way, the final scores of the two POIs are respectively: Peking University Third Hospital 4×1.5=6, and Guo Lin’s home cooking 5×0.8=4. Therefore, Peking University Third Hospital has a higher score than Guo Lin's home cooking, and the ranking is higher, which is in line with people's general understanding.

S206,按照所述调整后的最终分值对所述POI数据进行排序。S206. Sorting the POI data according to the adjusted final score.

对比实施例一和实施例二,实施例二增加了预处理过程和基本分值的调整过程。实施例二还充分考虑了POI数据的类别对POI重要程度的影响,利用POI数据的类别信息来对基本的网络知名度得分进行调整从而得到POI的最终得分,从而更加准确地刻画了POI数据的重要程度。Comparing Embodiment 1 and Embodiment 2, Embodiment 2 adds a preprocessing process and an adjustment process of basic scores. Embodiment 2 also fully considers the impact of POI data categories on the importance of POIs, and uses the category information of POI data to adjust the basic network popularity score to obtain the final POI score, thereby more accurately describing the importance of POI data. degree.

电子地图POI数据的排序具有很多实用价值,例如:The sorting of electronic map POI data has many practical values, such as:

1)查询检索方面:用户在电子地图查询时输入一个查询词,会返回很多检索结果,这些检索结果都与该查询词匹配,但这些结果中往往还有重要程度之分。如果对POI进行排序后,就可以在匹配的同时,把重要的POI显示在前面,不重要的放在后面,这样更方便用户使用。例如,查询“全聚德”,会出现全聚德的很多分店和一些附属公司或培训机构,它们都与这个查询词匹配,但是不能把一些附属公司和培训机构显示在前面,因为一般这些不太重要,而应该把重要的总店或者分店排在前面。再如:查询北京大学,会出现北京大学和它的附属机构,北京大学应该排在第一位,但它的众多附属机构应该有一个排序的前后之分。1) In terms of query and retrieval: when the user inputs a query word when searching the electronic map, many search results will be returned, and these search results all match the query word, but these results often have different degrees of importance. If the POIs are sorted, the important POIs can be displayed in the front while the matching is performed, and the unimportant ones can be placed in the back, which is more convenient for users to use. For example, if you query "Quanjude", there will be many branches of Quanjude and some affiliated companies or training institutions. They all match this query word, but some affiliated companies and training institutions cannot be displayed in the front, because generally these are not important, but Important head offices or branches should be listed first. Another example: querying Peking University, Peking University and its affiliated institutions will appear, Peking University should be ranked first, but its many affiliated institutions should have a sorting order.

2)图层显示方面:电子地图一般由很多图层组成,当用户在查看某个图层时,应该将该图层的POI显示出来供用户查看。但是用户在某个图层中关注点的周围也许有很多的POI,如果把这些POI全部显示出来,则整个页面会非常杂乱且臃肿,这就不利于用户查看。因此,需要按照重要程度来选取一部分POI进行显示,这样不但用户可以查看到自己需要的信息,而且整个显示效果比较好。2) In terms of layer display: an electronic map generally consists of many layers. When a user views a certain layer, the POI of the layer should be displayed for the user to view. However, there may be many POIs around the user's point of interest in a certain layer. If all these POIs are displayed, the entire page will be very messy and bloated, which is not conducive to users' viewing. Therefore, it is necessary to select a part of POIs for display according to the degree of importance, so that not only users can view the information they need, but also the overall display effect is better.

3)数据更新方面:由于POI更新速度较快,而且更新量较大,如果在精力有限的情况下可以只针对比较重要的数据先更新。3) Data update: Since the POI update speed is faster and the update volume is larger, if the energy is limited, only the more important data can be updated first.

针对上述方法实施例,本发明还提供了一种电子地图数据的排序装置实施例。参照图3,是本发明实施例所述一种电子地图数据的排序装置结构图。所述装置主要包括:With respect to the above method embodiments, the present invention also provides an embodiment of an electronic map data sorting device. Referring to FIG. 3 , it is a structural diagram of an electronic map data sorting device according to an embodiment of the present invention. Described device mainly comprises:

关键词提取单元U32,用于提取出每个电子地图数据的关键词;Keyword extraction unit U32, used to extract the keywords of each electronic map data;

查询单元U33,用于利用所述关键词进行搜索,获取对应每个电子地图数据的搜索结果网页集合;The query unit U33 is used to search by using the keywords to obtain a set of search result webpages corresponding to each electronic map data;

计算单元U34,用于根据每个电子地图数据的相应搜索结果网页集合,计算该电子地图数据的重要度;A calculation unit U34, configured to calculate the importance of the electronic map data according to the corresponding search result webpage set of each electronic map data;

排序单元U36,用于按照所述重要度对所述电子地图数据进行排序。A sorting unit U36, configured to sort the electronic map data according to the importance.

其中,所述计算单元U34具体包括:Wherein, the calculation unit U34 specifically includes:

第一计算子单元,用于针对集合中每个搜索结果网页,分别计算用于表示网页重要程度的第一数值;网页的重要程度可以由网页级别(PageRank)来表示,所以所述第一数值即指计算所得的PageRank;当然,也可以用网页的流量来表示;The first calculation subunit is used to calculate the first numerical value used to represent the importance of the webpage for each search result webpage in the collection; the importance of the webpage can be represented by the webpage level (PageRank), so the first numerical value It refers to the calculated PageRank; of course, it can also be expressed by the traffic of the webpage;

第二计算子单元,用于针对集合中每个搜索结果网页,分别计算用于表示网页与查询词匹配程度的第二数值;网页与查询词的匹配程度(MatchRank)可以由多种方法计算得到;The second calculation subunit is used to calculate the second numerical value used to represent the matching degree of the webpage and the query word for each search result webpage in the collection; the matching degree (MatchRank) of the webpage and the query word can be calculated by multiple methods ;

综合计算子单元,用于针对每个电子地图数据,根据相对应集合中的所有搜索结果网页的第一数值和第二数值,计算用于表示该电子地图数据重要程度的结果数据。一种计算方式是:所述综合计算子单元将集合中每个搜索结果网页的第一数值和第二数值相乘,然后再将集合中所有搜索结果网页的相乘结果求和,得到该电子地图数据的重要程度值。The comprehensive calculation subunit is used for calculating, for each electronic map data, the result data indicating the importance of the electronic map data according to the first value and the second value of all the search result web pages in the corresponding set. One calculation method is: the comprehensive calculation subunit multiplies the first value and the second value of each search result web page in the collection, and then sums the multiplication results of all search result web pages in the collection to obtain the electronic The importance value of the map data.

其中,所述关键词提取单元U32将提取出的电子地图数据的名称作为关键词;或者,将提取出的电子地图数据的地址信息,与名称一同作为关键词。优选的,在提取名称时去掉包含分店、分公司的信息。Wherein, the keyword extracting unit U32 uses the name of the extracted electronic map data as a keyword; or uses the extracted address information of the electronic map data together with the name as a keyword. Preferably, when the name is extracted, the information including branch stores and branch companies is removed.

优选的,在本发明的另一装置实施例中,所述装置还包括调整单元U35,用于根据电子地图数据所属类别所具有的不同权重,将该电子地图数据的重要度乘以该电子地图数据所属类别的权重值,得到调整后的结果数据,并输出到排序单元U36用于排序。Preferably, in another device embodiment of the present invention, the device further includes an adjustment unit U35, configured to multiply the importance of the electronic map data by the electronic map data according to the different weights of the categories to which the electronic map data belongs. The weight value of the category to which the data belongs is obtained from the adjusted result data, and is output to the sorting unit U36 for sorting.

优选的,在本发明的另一装置实施例中,所述装置还包括预处理单元U31,用于对原始的电子地图数据进行预处理,并将预处理结果输出到关键词提取单元U32;其中,所述预处理包括去除无关符号、进行字符编码转换、调整统一格式。Preferably, in another device embodiment of the present invention, the device further includes a preprocessing unit U31, configured to preprocess the original electronic map data, and output the preprocessing result to the keyword extraction unit U32; wherein , the preprocessing includes removing irrelevant symbols, performing character code conversion, and adjusting a unified format.

优选的,在本发明的另一装置实施例中,所述装置还包括检索单元U37,用于在电子地图检索中,根据用户输入的查询词返回相匹配的检索结果,将检索结果中排序靠前的电子地图数据优先显示。Preferably, in another device embodiment of the present invention, the device further includes a retrieval unit U37, configured to return matching retrieval results according to the query words input by the user during electronic map retrieval, and sort the retrieval results by The previous electronic map data is displayed first.

优选的,在本发明的另一装置实施例中,所述装置还包括图层显示单元U38,用于在图层显示时,选取显示范围内排序靠前的电子地图数据进行显示。Preferably, in another device embodiment of the present invention, the device further includes a layer display unit U38, configured to select and display the top-ranked electronic map data within the display range when the layer is displayed.

优选的,在本发明的另一装置实施例中,所述装置还包括数据更新单元U39,用于对排序靠前的电子地图数据进行优先更新。Preferably, in another device embodiment of the present invention, the device further includes a data update unit U39, configured to update the electronic map data ranked first.

图3所示装置中未详述的部分可以参见图1、图2所示方法的相关部分,为了篇幅考虑,在此不再详述。The parts not detailed in the device shown in FIG. 3 can refer to the relevant parts of the method shown in FIG. 1 and FIG. 2 , and will not be described in detail here for the sake of space.

此外,本发明还提供了一种搜索引擎系统,所述系统包括上述任一装置实施例所述的装置。所述搜索引擎系统在电子地图数据的搜索应用方面,能够提供更加优质的检索结果。In addition, the present invention also provides a search engine system, which includes the device described in any one of the above device embodiments. The search engine system can provide more high-quality search results in the search application of electronic map data.

以上对本发明所提供的一种电子地图数据的排序方法及装置,进行了详细介绍,本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处。综上所述,本说明书内容不应理解为对本发明的限制。The method and device for sorting electronic map data provided by the present invention have been introduced in detail above. In this paper, specific examples have been used to illustrate the principle and implementation of the present invention. The description of the above embodiments is only used to help understanding The method of the present invention and its core idea; at the same time, for those of ordinary skill in the art, according to the idea of the present invention, there will be changes in the specific implementation and application range. In summary, the contents of this specification should not be construed as limiting the present invention.

Claims (19)

1.一种电子地图数据的排序方法,其特征在于,包括:1. A method for sorting electronic map data, comprising: 提取出每个电子地图数据的关键词;Extract keywords of each electronic map data; 利用所述关键词进行搜索,获取对应每个电子地图数据的搜索结果网页集合;Searching by using the keywords to obtain a set of search result webpages corresponding to each electronic map data; 针对集合中每个搜索结果网页,分别计算用于表示网页重要程度的第一数值和用于表示网页与关键词匹配程度的第二数值,根据相应集合中所有搜索结果网页的第一数值和第二数值,计算该电子地图数据的重要度;For each search result webpage in the collection, calculate the first numerical value used to represent the importance of the webpage and the second numerical value used to represent the degree of matching between the webpage and the keyword, according to the first numerical value and the second numerical value of all search result webpages in the corresponding collection Two values, calculating the importance of the electronic map data; 按照所述重要度对所述电子地图数据进行排序;sorting the electronic map data according to the importance; 其中,所述根据相应集合中所有搜索结果网页的第一数值和第二数值,计算该电子地图数据的重要度,具体包括:Wherein, the calculation of the importance of the electronic map data according to the first value and the second value of all search result webpages in the corresponding collection specifically includes: 将集合中每个搜索结果网页的第一数值和第二数值相乘,然后再将集合中所有搜索结果网页的相乘结果求和,得到该电子地图数据的重要度。Multiply the first value and the second value of each search result webpage in the collection, and then sum the multiplication results of all the search result webpages in the collection to obtain the importance of the electronic map data. 2.根据权利要求1所述的方法,其特征在于:所述第一数值通过计算网页级别得到。2. The method according to claim 1, characterized in that: the first value is obtained by calculating the web page rank. 3.根据权利要求1所述的方法,其特征在于,所述计算该电子地图数据的重要度之后,还包括:3. The method according to claim 1, characterized in that, after calculating the importance of the electronic map data, further comprising: 根据电子地图数据所属类别所具有的不同权重,将该电子地图数据的重要度乘以该电子地图数据所属类别的权重值,得到调整后的结果数据,用于排序。According to the different weights of the category of the electronic map data, the importance of the electronic map data is multiplied by the weight value of the category of the electronic map data to obtain adjusted result data for sorting. 4.根据权利要求1所述的方法,其特征在于,所述提取出每个电子地图数据的关键词,具体包括:4. method according to claim 1, is characterized in that, described extracting the keyword of each electronic map data, specifically comprises: 提取出每个电子地图数据的名称作为关键词。The name of each electronic map data is extracted as a keyword. 5.根据权利要求4所述的方法,其特征在于,还包括:5. The method according to claim 4, further comprising: 提取出每个电子地图数据的地址信息,与名称一同作为关键词。Extract the address information of each electronic map data, and use it as a keyword together with the name. 6.根据权利要求1所述的方法,其特征在于,所述提取出每个电子地图数据的关键词之前,还包括:6. The method according to claim 1, wherein, before the keyword of each electronic map data is extracted, further comprising: 对原始的电子地图数据进行预处理,所述预处理包括去除无关符号、字符编码转换、调整统一格式;Preprocessing the original electronic map data, the preprocessing includes removing irrelevant symbols, character code conversion, and adjusting the unified format; 将预处理结果用于关键词的提取。The preprocessing results are used for keyword extraction. 7.根据权利要求1所述的方法,其特征在于,按照所述重要度对所述电子地图数据进行排序之后,还包括:7. The method according to claim 1, characterized in that, after sorting the electronic map data according to the importance, further comprising: 在电子地图检索中,根据用户输入的查询词返回相匹配的检索结果,将检索结果中排序靠前的电子地图数据优先显示。In electronic map retrieval, matching retrieval results are returned according to the query words input by the user, and the top-ranked electronic map data in the retrieval results are preferentially displayed. 8.根据权利要求1所述的方法,其特征在于,按照所述重要度对所述电子地图数据进行排序之后,还包括:8. The method according to claim 1, characterized in that, after sorting the electronic map data according to the degree of importance, further comprising: 在图层显示时,选取显示范围内排序靠前的电子地图数据进行显示。When the layer is displayed, select the top-ranked electronic map data within the display range for display. 9.根据权利要求1所述的方法,其特征在于,按照所述重要度对所述电子地图数据进行排序之后,还包括:9. The method according to claim 1, characterized in that, after sorting the electronic map data according to the degree of importance, further comprising: 对排序靠前的电子地图数据进行优先更新。Prioritize the update of the top-ranked electronic map data. 10.一种电子地图数据的排序装置,其特征在于,包括:10. A sorting device for electronic map data, comprising: 关键词提取单元,用于提取出每个电子地图数据的关键词;A keyword extraction unit is used to extract keywords of each electronic map data; 查询单元,用于利用所述关键词进行搜索,获取对应每个电子地图数据的搜索结果网页集合;a query unit, configured to use the keyword to search, and obtain a set of search result webpages corresponding to each electronic map data; 计算单元,包括:第一计算子单元,用于针对集合中每个搜索结果网页,分别计算用于表示网页重要程度的第一数值;第二计算子单元,用于针对集合中每个搜索结果网页,分别计算用于表示网页与关键词匹配程度的第二数值;综合计算子单元,用于根据每个电子地图数据相应集合中的所有搜索结果网页的第一数值和第二数值,计算该电子地图数据的重要度;The calculation unit includes: a first calculation subunit, for each search result webpage in the collection, respectively calculating a first numerical value representing the importance of the webpage; a second calculation subunit, for each search result webpage in the collection The webpage is used to calculate the second numerical value used to represent the degree of matching between the webpage and the keyword; the comprehensive calculation subunit is used to calculate the second numerical value according to the first numerical value and the second numerical value of all search result webpages in the corresponding set of each electronic map data. The importance of electronic map data; 排序单元,用于按照所述重要度对所述电子地图数据进行排序;a sorting unit, configured to sort the electronic map data according to the importance; 其中,所述综合计算子单元将集合中每个搜索结果网页的第一数值和第二数值相乘,然后再将集合中所有搜索结果网页的相乘结果求和,得到该电子地图数据的重要度。Wherein, the comprehensive calculation subunit multiplies the first value and the second value of each search result webpage in the collection, and then sums the multiplication results of all the search result webpages in the collection to obtain the important value of the electronic map data. Spend. 11.根据权利要求10所述的装置,其特征在于:所述第一计算子单元通过计算网页级别得到第一数值。11. The device according to claim 10, wherein the first calculation subunit obtains the first value by calculating the webpage level. 12.根据权利要求10所述的装置,其特征在于,所述装置还包括:12. The device according to claim 10, further comprising: 调整单元,用于根据电子地图数据所属类别所具有的不同权重,将该电子地图数据的重要度乘以该电子地图数据所属类别的权重值,得到调整后的结果数据,并输出到排序单元用于排序。The adjustment unit is used to multiply the importance of the electronic map data by the weight value of the category of the electronic map data according to the different weights of the categories of the electronic map data to obtain the adjusted result data, and output it to the sorting unit for use for sorting. 13.根据权利要求10所述的装置,其特征在于:所述关键词提取单元将提取出的电子地图数据的名称作为关键词。13. The device according to claim 10, wherein the keyword extracting unit uses the name of the extracted electronic map data as the keyword. 14.根据权利要求13所述的装置,其特征在于:所述关键词提取单元还将提取出的电子地图数据的地址信息,与名称一同作为关键词。14. The device according to claim 13, wherein the keyword extracting unit also uses the extracted address information of the electronic map data as keywords together with the name. 15.根据权利要求10所述的装置,其特征在于,所述装置还包括:15. The device according to claim 10, further comprising: 预处理单元,用于对原始的电子地图数据进行预处理,并将预处理结果输出到关键词提取单元;其中,所述预处理包括去除无关符号、字符编码转换、调整统一格式。A preprocessing unit is used to preprocess the original electronic map data, and output the preprocessing result to the keyword extraction unit; wherein, the preprocessing includes removing irrelevant symbols, converting character codes, and adjusting a unified format. 16.根据权利要求10所述的装置,其特征在于,所述装置还包括:16. The device according to claim 10, further comprising: 检索单元,用于在电子地图检索中,根据用户输入的查询词返回相匹配的检索结果,将检索结果中排序靠前的电子地图数据优先显示。The retrieval unit is used for returning matching retrieval results according to the query words input by the user during the electronic map retrieval, and preferentially displaying the top-ranked electronic map data in the retrieval results. 17.根据权利要求10所述的装置,其特征在于,所述装置还包括:17. The device according to claim 10, further comprising: 图层显示单元,用于在图层显示时,选取显示范围内排序靠前的电子地图数据进行显示。The layer display unit is used to select and display the top-ranked electronic map data within the display range when the layer is displayed. 18.根据权利要求10所述的装置,其特征在于,所述装置还包括:18. The device according to claim 10, further comprising: 数据更新单元,用于对排序靠前的电子地图数据进行优先更新。The data updating unit is used for updating the electronic map data ranked first. 19.一种搜索引擎系统,其特征在于,所述系统包括权利要求10至18任一权利要求所述的装置。19. A search engine system, characterized in that the system comprises the device according to any one of claims 10 to 18.
CN 200810222422 2008-09-16 2008-09-16 Method and device for sorting electronic map data Active CN101350154B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200810222422 CN101350154B (en) 2008-09-16 2008-09-16 Method and device for sorting electronic map data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200810222422 CN101350154B (en) 2008-09-16 2008-09-16 Method and device for sorting electronic map data

Publications (2)

Publication Number Publication Date
CN101350154A CN101350154A (en) 2009-01-21
CN101350154B true CN101350154B (en) 2013-01-30

Family

ID=40268929

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200810222422 Active CN101350154B (en) 2008-09-16 2008-09-16 Method and device for sorting electronic map data

Country Status (1)

Country Link
CN (1) CN101350154B (en)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102667759B (en) * 2009-12-14 2014-07-30 北京友迈在地科技有限公司 Method and system for displaying special symbols in priority order in electronic map
CN102541936A (en) * 2010-12-31 2012-07-04 高德软件有限公司 Method and device for acquiring popularity of POI (Point of Interest)
CN103185596A (en) * 2011-12-30 2013-07-03 上海博泰悦臻电子设备制造有限公司 Interest point searching method and interest point searching device
CN103577442B (en) * 2012-07-30 2019-02-05 腾讯科技(深圳)有限公司 A kind of map datum importance calculation method and device
CN102890725B (en) * 2012-11-02 2015-08-19 瑞庭网络技术(上海)有限公司 The result ordering method of search engine
WO2014145106A1 (en) * 2013-03-15 2014-09-18 Shimanovsky Boris Apparatus, systems, and methods for grouping data records
CN104123318B (en) * 2013-04-28 2019-01-15 百度在线网络技术(北京)有限公司 A kind of method and system of map denotation point of interest
CN103258057B (en) * 2013-06-03 2017-06-23 北京奇虎科技有限公司 The method and apparatus for showing point of interest POI in electronic map interface
CN103336807B (en) * 2013-06-25 2018-01-05 百度在线网络技术(北京)有限公司 A kind of method and system for showing point of interest
CN104281577B (en) * 2013-07-02 2018-11-16 威盛电子股份有限公司 Data file sorting method
CN104281576B (en) * 2013-07-02 2018-08-31 威盛电子股份有限公司 Method for displaying landmark data
CN104462143B (en) * 2013-09-24 2018-01-30 高德软件有限公司 Chain brand word dictionary, classifier dictionary method for building up and device
CN104899200A (en) * 2014-03-04 2015-09-09 高德软件有限公司 POI search feedback method and device
CN104317909B (en) * 2014-10-27 2018-09-28 百度在线网络技术(北京)有限公司 The method of calibration and device of interest point data
CN105786915A (en) * 2014-12-25 2016-07-20 高德软件有限公司 POI importance degree determination method and device
CN105069079B (en) * 2015-07-31 2022-04-29 北京奇虎科技有限公司 Method and device for screening POI (Point of interest) data
CN105222803A (en) * 2015-10-20 2016-01-06 北京百度网讯科技有限公司 Map POI display packing and terminal
CN105608112A (en) * 2015-12-10 2016-05-25 北京奇虎科技有限公司 Method and apparatus for measuring quality of map POI data
CN105550330B (en) * 2015-12-21 2020-09-11 北京奇虎科技有限公司 Method and system for sorting POI information of points of interest
CN107315750A (en) * 2016-04-26 2017-11-03 斑马网络技术有限公司 Electronic map figure layer display methods, device, terminal device and user interface system
CN107315748A (en) * 2016-04-26 2017-11-03 斑马网络技术有限公司 Electronic map indexing means, device, terminal device and user interface system
CN107798018B (en) * 2016-09-06 2020-04-10 高德软件有限公司 Method and device for setting display information of interest points
CN107918512B (en) * 2017-11-16 2020-08-04 携程旅游信息技术(上海)有限公司 Hotel information display method and device, electronic equipment and storage medium
CN108984640A (en) * 2018-06-22 2018-12-11 华北电力大学 A Geographical Information Acquisition Method Based on Web Data Mining
CN111026937B (en) 2019-11-13 2021-02-19 百度在线网络技术(北京)有限公司 Method, device and equipment for extracting POI name and computer storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN1825308A (en) * 2005-02-22 2006-08-30 台湾积体电路制造股份有限公司 Network search system and method
CN101000608A (en) * 2006-01-11 2007-07-18 吴风勇 Key word dynamic matching generating based on search engine technology

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1389811A (en) * 2002-02-06 2003-01-08 北京造极人工智能技术有限公司 Intelligent search method of search engine
CN1825308A (en) * 2005-02-22 2006-08-30 台湾积体电路制造股份有限公司 Network search system and method
CN101000608A (en) * 2006-01-11 2007-07-18 吴风勇 Key word dynamic matching generating based on search engine technology

Also Published As

Publication number Publication date
CN101350154A (en) 2009-01-21

Similar Documents

Publication Publication Date Title
CN101350154B (en) Method and device for sorting electronic map data
CN109145169B (en) Address matching method based on statistical word segmentation
CN100481077C (en) Visual method and device for strengthening search result guide
CN102591867B (en) Searching service method based on mobile device position
CN109657068B (en) Cultural relic knowledge graph generation and visualization method for intelligent museum
US20150356088A1 (en) Tile-based geocoder
WO2017076205A1 (en) Method and apparatus for obtaining reply prompt content for chat start sentence
CN108062375A (en) A kind of processing method, device, terminal and the storage medium of user's portrait
CN101299217B (en) Method, device and system for map information processing
CN102253972B (en) Maintenance method of geographical name database based on web crawler
CN100478960C (en) Method for locating unknown place name in network map service
JP2022532451A (en) How to disambiguate Chinese place name meanings based on encyclopedia knowledge base and word embedding
CN114780680B (en) Retrieval and completion method and system based on place name and address database
CN102541936A (en) Method and device for acquiring popularity of POI (Point of Interest)
CN105117494B (en) Spatial entities mapping method in fuzzy context
CN103823900A (en) Information point significance determining method and device
CN116340541A (en) Method for constructing knowledge graph system of Wenbo
Ahlers et al. Location-based Web search
CN103377224B (en) Identify the method and device of problem types, set up the method and device identifying model
CN105975477B (en) A Method of Automatically Constructing Place Name Dataset Based on Network
JP5302614B2 (en) Facility related information search database formation method and facility related information search system
CN103885947A (en) Mining method for searching demands, intelligent searching method and device thereof
CN108984640A (en) A Geographical Information Acquisition Method Based on Web Data Mining
CN111882224B (en) Method and device for classifying consumption scenarios
CN118656698A (en) A classification method and device for multiple historical and cultural resources

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: BEIJING SOHU NEW MEDIA INFORMATION TECHNOLOGY CO.,

Free format text: FORMER OWNER: SOGO SCIENCE-TECHNOLOGY DEVELOPMENT CO., LTD., BEIJING

Effective date: 20101020

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100084 ROOM 01, 9/F, SOHU.COM INTERNET PLAZA, BUILDING 9, YARD 1, ZHONGGUANCUN EAST ROAD, HAIDIAN DISTRICT, BEIJING TO: 100084 ROOM 802, 8/F, SOHU.COM INTERNET PLAZA, BUILDING 9, YARD 1, ZHONGGUANCUN EAST ROAD, HAIDIAN DISTRICT, BEIJING

TA01 Transfer of patent application right

Effective date of registration: 20101020

Address after: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 8, room, Room 802

Applicant after: Beijing Sohu New Media Information Technology Co., Ltd.

Address before: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 9, room, room 01

Applicant before: Sogo Science-Technology Development Co., Ltd., Beijing

C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: SOGO SCIENCE-TECHNOLOGY DEVELOPMENT CO., LTD., BEI

Free format text: FORMER OWNER: BEIJING SOHU NEW MEDIA INFORMATION TECHNOLOGY CO., LTD.

Effective date: 20130902

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20130902

Address after: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 9, room, room 01

Patentee after: Sogo Science-Technology Development Co., Ltd., Beijing

Address before: 100084 Beijing, Zhongguancun East Road, building 1, No. 9, Sohu cyber building, room 8, room, Room 802

Patentee before: Beijing Sohu New Media Information Technology Co., Ltd.