WO2014029314A1 - 信息聚合归类的显示方法及系统 - Google Patents
信息聚合归类的显示方法及系统 Download PDFInfo
- Publication number
- WO2014029314A1 WO2014029314A1 PCT/CN2013/081802 CN2013081802W WO2014029314A1 WO 2014029314 A1 WO2014029314 A1 WO 2014029314A1 CN 2013081802 W CN2013081802 W CN 2013081802W WO 2014029314 A1 WO2014029314 A1 WO 2014029314A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- aggregation
- category
- content
- belonging
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/248—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Definitions
- the present invention relates to a polymerization technique, and in particular, to a display method and system for information aggregation classification. Background technique
- the information used by users in interaction is usually displayed in the form of a single message. That is to say, the display of information is finally displayed by the attributes of a single piece of information, and a message is displayed when the user sends a message. In this way, the disorder and fragmentation of information display is caused.
- the amount of information is huge.
- the vast amount of information is vast and disorderly displayed on social networks and media, which is very unfavorable for information sharing and interaction, because it is difficult for users to directly retrieve their own concerns from a huge amount of information.
- Useful information but first through a large number of readings and non-stop refresh information, from the information exchange sharing platform to obtain the source data, and then through the user's own collection of source data.
- the problems existing in the prior art are: Since the display of information is finally displayed by the attributes of a single piece of information, the disorder and fragmentation of the display of a large amount of information is caused, which is not conducive to information sharing and interaction. Users are required to classify and integrate information, and user operations are highly complex. Summary of the invention
- the embodiment of the present invention provides a display method and system for information aggregation and classification, which realizes display of information aggregation and classification, facilitates information sharing and interaction, and reduces user operation complexity.
- An embodiment of the present invention provides a display method for information aggregation and classification, the method includes: acquiring information from an information interaction sharing platform, extracting a content keyword of the information; performing information aggregation and classification according to the content keyword, respectively Displayed according to its attribution class.
- An embodiment of the present invention provides a display system for information aggregation, the system includes: a key word extraction unit, an aggregation classification unit, and a display unit;
- the keyword extracting unit is configured to acquire information from an information interaction sharing platform, and extract a content keyword of the information
- the aggregation categorization unit is configured to perform information aggregation and classification according to the content keyword; and the display unit is configured to display information according to the attribution class thereof.
- the embodiment of the present invention obtains information from the information interaction sharing platform, extracts content key words of the information, performs information aggregation and classification according to the content keywords, and displays the information according to the attribution class thereof.
- the prior art does not classify the information, and displays the information in the form of a single piece of information.
- the embodiment of the present invention aggregates the information according to the content keyword, and finally displays the result after the aggregation and classification, and the aggregation is performed.
- the categorization display is an automated operation. After the user does not need to obtain the source data such as a piece of information, it can manually classify and integrate itself, thereby facilitating information sharing and interaction, and reducing the user's operation complexity.
- FIG. 1 is a flow chart of a method according to an embodiment of the present invention.
- FIG. 2 is a schematic structural diagram of a system according to an embodiment of the present invention. detailed description
- the information is obtained from the information interaction sharing platform, and the content keywords of the information are extracted; the information is aggregated according to the content keywords, and the information is displayed according to the belonging class.
- the display method of the information aggregation classification in the embodiment of the present invention includes the following steps:
- Step 101 Obtain information from the information interaction sharing platform, and extract content keywords of the information.
- the step 101 specifically includes: retrieving a plurality of information in the information interaction sharing platform, and using the content of the same information, the similarity or the frequency of occurrence, the specified position (such as the position where the quotation marks, parentheses, book name, etc. appear) as the content Key words.
- Step 102 Perform information aggregation and classification according to content keywords.
- the step 102 specifically includes: using the content keyword as the belonging class to which the corresponding information belongs, and aggregating the corresponding information in the same belonging class as a subset of the belonging class.
- Step 103 Display the information according to its belonging class.
- the step 103 specifically includes: aggregating the header according to the information of the belonging class, the information aggregation heat of the belonging class, and the information aggregation feedback of the belonging class, respectively performing three specific implementation manners, which are respectively described below.
- the candidate set includes: a specified wildcard, an identifier, a text, a letter, a character, a word within the specified punctuation mark (such as a quotation mark, a parenthesis, a matching rule of a combination of one or at least one of the first or last paragraph of the information;
- the retrieved content is compared with the content keyword corresponding to the attribution class of the information, and the repeated occurrence probability of the retrieved content and the content keyword is selected.
- the content is displayed as the title of the belonging class.
- the frequency superposition is separately performed, and the result of the frequency superposition is used as the information of the belonging class to be aggregated and displayed. For example, when the frequency of occurrence is the number of times of forwarding information, if the total number of times of forwarding a piece of information in the current belonging class is 10, the message is "forwarded 10 times" and displayed. For another example, if there are 10 related information in a belonging class, and each piece of information is forwarded 10 times, the total forwarding heat of this class is 100. The heat that will mark this belonging class is 100.
- the display specifically includes:
- the information feedback of all the information in each belonging class is retrieved, and the retrieved information feedback aggregation is classified into corresponding information and displayed.
- information feedback can be aggregated for each piece of information, and corresponding to this information, that is, the information set aggregated by the information feedback of one piece of information is A subset of this information.
- the information set aggregated by the feedback of the information can be further classified and refined, and will not be described here.
- the information feedback may be directed to a type of information, such as information feedback for each attribution class, in addition to one piece of information, and will not be described herein.
- the information aggregation and classification display system of the embodiment of the present invention includes: a keyword extraction unit, a aggregation classification unit, and a display unit; wherein the keyword extraction unit is used in the information interaction sharing platform Get information, extract the content keywords of the information; aggregate the classification unit And used for performing information aggregation and classification according to the content keyword; the display unit is configured to display information according to its belonging class.
- the keyword extracting unit is further configured to retrieve a plurality of pieces of information in the information interaction sharing platform, and extract the same, similar, or frequently occurring content among the plurality of pieces of information as content keywords.
- the aggregation and classification unit is further configured to use the content keyword as a class to which the corresponding information belongs, and aggregate the corresponding information in the same home class, as a child of the belonging class, where the display unit is further used for
- the information is aggregated according to the information of the class, the information aggregation heat of the class, and the information aggregation feedback of the class are displayed separately.
- the information exchange sharing platform is specifically described as a microblog platform, but the embodiment of the present invention is not limited to the microblog platform.
- the method flow based on the Weibo platform includes the following steps:
- Step 201 Obtain news data from the microblog platform, and extract content key words in the news data, and automatically aggregate and classify the news data according to the content keywords. And this category is constantly updated as new news data is continuously generated and updated.
- Step 202 After the automatic aggregation classification, similar news data is automatically aggregated into the belonging class of a news topic.
- step 202 After the step 202 is performed, the following optional steps 203a to 203c complete the method flow. among them,
- Step 203a Select a sentence from all the news data in each belonging class according to an algorithm as a title of the entire news topic for display.
- the algorithm for extracting the above title may be: extracting the first sentence in each microblog, or a special symbol, such as a book title number [[]
- the statement contained in as a candidate, can be used as a collection of titles.
- the keywords extracted in each statement in the calculation candidate set are similar to the cosine angle of the central node of the attribution class. Degree. The one with the highest similarity is the title of this belonging class.
- Step 203b Calculate the heat of each news data in the belonging class, and aggregate the heat of each news data as the heat of the news topic for display.
- the algorithm for calculating the heat for example, after the aggregation is classified, 30 microblogs in a belonging class A belong to the belonging class, and the number of retransmissions per microblog is 50.
- Step 203c Aggregate user comments of each news data in the belonging class as user comments of the news topic for display.
- each piece of news data has its own user comments.
- the user's comments can be aggregated at the same time, as the user's comments on the news topic are displayed, not just comments on one news. .
- Step 204 Each home class is sorted by the popularity of the category, instead of the heat of a news, outputting the sort result, and outputting the title of each news topic, the news data under the topic category, and all the user comments of the topic. , not a user comment for a news.
- the heat of related news from different sources of the same topic can be aggregated as the heat of a news topic, rather than the heat display order of a single news.
- the Economic Observer, The Daily Economic News, etc., and each piece of news data may present a different perspective on the same news topic.
- the user can only see the display of a single piece of news data, such as "The Daily Economic News", the news media's heat or time of a news report about "industrial gelatin”, and using the embodiment of the present invention,
- the display is sorted according to the category of the theme, that is, according to the title, heat and evaluation of the news topic, so that the "industrial gelatin” is still taken as an example, the news theme of "industrial gelatin” can be used to display All relevant news about "industrial gelatin” in the Bo platform is aggregated in a class "industrial gelatin".
- the class of this news topic is used as a way to participate in display sorting, which is more convenient for information interaction and sharing.
- the user since the information is classified, and there are various display sorting prompts of heat, title, and feedback, the user is allowed to obtain more valid data in the shortest time, because, by using the embodiment of the present invention, Pre-previous First, the information is displayed in the information interaction sharing platform, and the user can directly obtain the valid data instead of the unprocessed source data. Therefore, the user operation complexity is reduced, the access efficiency is improved, the number of interactions is reduced, and correspondingly, the economy is saved. The overhead of network resources and bandwidth.
- the integrated modules described in the embodiments of the present invention may also be stored in a computer readable storage medium if they are implemented in the form of software functional modules and sold or used as separate products. Based on such understanding, the technical solution of the embodiments of the present invention may be embodied in the form of a software product in essence or in the form of a software product.
- the computer software product is stored in a storage medium and includes a plurality of instructions.
- a computer device (which may be a personal computer, server, or network device, etc.) is implemented to perform all or part of the methods described in various embodiments of the present invention.
- the foregoing storage medium includes: a U disk, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, and the like, which can store program codes. .
- ROM read-only memory
- RAM random access memory
- magnetic disk or an optical disk and the like, which can store program codes.
- the embodiment of the present invention further provides a computer storage medium, wherein a computer program is stored, and the computer program is used to execute the information aggregation and classification display method of the embodiment of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
RU2015103949A RU2015103949A (ru) | 2012-08-22 | 2013-08-19 | Способ и система агрегирования, классификации и отображения информации |
KR1020157000716A KR20150018880A (ko) | 2012-08-22 | 2013-08-19 | 정보 취합 분류의 디스플레이 방법 및 시스템 |
US14/584,221 US20150120708A1 (en) | 2012-08-22 | 2014-12-29 | Information aggregation, classification and display method and system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210300750.1 | 2012-08-22 | ||
CN201210300750.1A CN103631791B (zh) | 2012-08-22 | 2012-08-22 | 信息聚合归类的显示方法及系统 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/584,221 Continuation US20150120708A1 (en) | 2012-08-22 | 2014-12-29 | Information aggregation, classification and display method and system |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014029314A1 true WO2014029314A1 (zh) | 2014-02-27 |
Family
ID=50149439
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2013/081802 WO2014029314A1 (zh) | 2012-08-22 | 2013-08-19 | 信息聚合归类的显示方法及系统 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20150120708A1 (zh) |
KR (1) | KR20150018880A (zh) |
CN (1) | CN103631791B (zh) |
RU (1) | RU2015103949A (zh) |
WO (1) | WO2014029314A1 (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140310363A1 (en) * | 2013-04-10 | 2014-10-16 | Passur Aerospace, Inc. | System and Method for Collaborative Decision Making at an Airport |
CN104980476B (zh) * | 2014-04-14 | 2019-06-07 | 金蝶软件(中国)有限公司 | 活动流的分拣推送方法及装置 |
CN105100370A (zh) * | 2014-04-24 | 2015-11-25 | 阿尔派株式会社 | 显示装置以及显示方法 |
CN104504024B (zh) * | 2014-12-11 | 2018-09-07 | 中国科学院计算技术研究所 | 基于微博内容的关键词挖掘方法及系统 |
CN105630929B (zh) * | 2015-12-22 | 2019-08-30 | 北京奇虎科技有限公司 | 基于评论确定新闻推荐权重的方法及装置 |
CN106777324A (zh) * | 2017-01-09 | 2017-05-31 | 北京奇虎科技有限公司 | 社交应用平台资源的聚类显示方法、装置和移动终端 |
CN109062945B (zh) * | 2018-06-21 | 2021-07-09 | 北京三快在线科技有限公司 | 一种社交网络的信息推荐方法、装置及系统 |
CN109446323A (zh) * | 2018-10-16 | 2019-03-08 | 北京小米智能科技有限公司 | 信息聚合方法、装置和设备 |
CN111209390B (zh) * | 2020-01-06 | 2023-09-05 | 新方正控股发展有限责任公司 | 新闻展示方法和系统、计算机可读存储介质 |
CN118245650B (zh) * | 2024-05-21 | 2024-10-25 | 天津中新智冠信息技术有限公司 | 非结构化系统中内容聚合方法、装置、设备及存储介质 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1773492A (zh) * | 2004-11-09 | 2006-05-17 | 国际商业机器公司 | 组织多个文档的方法以及显示多个文档的设备 |
CN101246501A (zh) * | 2008-03-27 | 2008-08-20 | 腾讯科技(深圳)有限公司 | 一种聚合相同主题网络文档的方法及系统 |
CN101408885A (zh) * | 2007-10-05 | 2009-04-15 | 富士通株式会社 | 利用统计分布对主题进行建模 |
US20100312726A1 (en) * | 2009-06-09 | 2010-12-09 | Microsoft Corporation | Feature vector clustering |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7814089B1 (en) * | 2003-12-17 | 2010-10-12 | Topix Llc | System and method for presenting categorized content on a site using programmatic and manual selection of content items |
US8271495B1 (en) * | 2003-12-17 | 2012-09-18 | Topix Llc | System and method for automating categorization and aggregation of content from network sites |
AU2005258080A1 (en) * | 2004-06-18 | 2006-01-05 | Pictothink Corporation | Network content organization tool |
CN1983255A (zh) * | 2006-05-17 | 2007-06-20 | 唐红春 | 一种互联网搜索方法 |
KR20090033728A (ko) * | 2007-10-01 | 2009-04-06 | 삼성전자주식회사 | 컨텐트 요약 정보 제공 방법 및 그 장치 |
CN101446959A (zh) * | 2008-12-30 | 2009-06-03 | 深圳市迅雷网络技术有限公司 | 一种基于互联网的新闻推荐方法和系统 |
CN101917456B (zh) * | 2010-07-06 | 2012-10-03 | 杭州热点信息技术有限公司 | 一种内容聚合无线发布系统 |
CN102236719A (zh) * | 2011-07-25 | 2011-11-09 | 西交利物浦大学 | 基于网页分类的网页搜索引擎及快速查找方法 |
US20130041901A1 (en) * | 2011-08-12 | 2013-02-14 | Rawllin International Inc. | News feed by filter |
CN102279894B (zh) * | 2011-09-19 | 2013-01-09 | 嘉兴亿言堂信息科技有限公司 | 基于语义的查找、集成和提供评论信息的方法及搜索系统 |
-
2012
- 2012-08-22 CN CN201210300750.1A patent/CN103631791B/zh active Active
-
2013
- 2013-08-19 KR KR1020157000716A patent/KR20150018880A/ko not_active Ceased
- 2013-08-19 RU RU2015103949A patent/RU2015103949A/ru not_active Application Discontinuation
- 2013-08-19 WO PCT/CN2013/081802 patent/WO2014029314A1/zh active Application Filing
-
2014
- 2014-12-29 US US14/584,221 patent/US20150120708A1/en not_active Abandoned
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1773492A (zh) * | 2004-11-09 | 2006-05-17 | 国际商业机器公司 | 组织多个文档的方法以及显示多个文档的设备 |
CN101408885A (zh) * | 2007-10-05 | 2009-04-15 | 富士通株式会社 | 利用统计分布对主题进行建模 |
CN101246501A (zh) * | 2008-03-27 | 2008-08-20 | 腾讯科技(深圳)有限公司 | 一种聚合相同主题网络文档的方法及系统 |
US20100312726A1 (en) * | 2009-06-09 | 2010-12-09 | Microsoft Corporation | Feature vector clustering |
Also Published As
Publication number | Publication date |
---|---|
RU2015103949A (ru) | 2016-10-10 |
KR20150018880A (ko) | 2015-02-24 |
CN103631791A (zh) | 2014-03-12 |
CN103631791B (zh) | 2017-04-12 |
US20150120708A1 (en) | 2015-04-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2014029314A1 (zh) | 信息聚合归类的显示方法及系统 | |
CN106980692B (zh) | 一种基于微博特定事件的影响力计算方法 | |
US9672283B2 (en) | Structured and social data aggregator | |
US10387559B1 (en) | Template-based identification of user interest | |
Zhang et al. | Automatic detection of rumor on social network | |
Vuong et al. | On ranking controversies in wikipedia: models and evaluation | |
CN103745000B (zh) | 一种中文微博客的热点话题检测方法 | |
US8380697B2 (en) | Search and retrieval methods and systems of short messages utilizing messaging context and keyword frequency | |
US20130085745A1 (en) | Semantic-based approach for identifying topics in a corpus of text-based items | |
US10825110B2 (en) | Entity page recommendation based on post content | |
WO2013026325A1 (zh) | 一种人物搜索方法、装置及存储介质 | |
CN104572757B (zh) | 微博群体处理方法及装置 | |
WO2013037223A1 (zh) | 网络微博名人信息的推荐处理方法和装置 | |
CN101217515A (zh) | 基于问题分类推送问题的系统及方法 | |
CN107291886A (zh) | 一种基于增量聚类算法的微博话题检测方法及系统 | |
CN108509437A (zh) | 一种ElasticSearch查询加速方法 | |
CN107451208A (zh) | 一种数据搜索方法与装置 | |
CN103577405A (zh) | 基于兴趣分析的微博博主社区分类方法 | |
KR101559719B1 (ko) | 효과적인 마케팅을 도출하는 자동학습 시스템 및 방법 | |
CN101477527A (zh) | 一种检索多媒体资源的方法及装置 | |
CN104951478A (zh) | 信息处理方法和信息处理装置 | |
CN104317796A (zh) | 一种基于搜索的多用户交互方法、服务器,以及系统 | |
US20180101615A1 (en) | Systems, methods and techniques for customizable domain-based searching | |
CN105786834A (zh) | 一种社交类网页结构化摘要的生成方法和系统 | |
CN111882224A (zh) | 对消费场景进行分类的方法和装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13830430 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 20157000716 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2015103949 Country of ref document: RU Kind code of ref document: A |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205N DATED 29/04/2015) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13830430 Country of ref document: EP Kind code of ref document: A1 |