[go: up one dir, main page]

CN114491231A - A method for semi-structured retention of browser access information - Google Patents

A method for semi-structured retention of browser access information Download PDF

Info

Publication number
CN114491231A
CN114491231A CN202111569817.7A CN202111569817A CN114491231A CN 114491231 A CN114491231 A CN 114491231A CN 202111569817 A CN202111569817 A CN 202111569817A CN 114491231 A CN114491231 A CN 114491231A
Authority
CN
China
Prior art keywords
information
sharing
module
unit
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111569817.7A
Other languages
Chinese (zh)
Other versions
CN114491231B (en
Inventor
陆璧隆
孙雪峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Ririfeng Information Technology Co ltd
Original Assignee
Shanghai Ririfeng Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Ririfeng Information Technology Co ltd filed Critical Shanghai Ririfeng Information Technology Co ltd
Priority to CN202111569817.7A priority Critical patent/CN114491231B/en
Publication of CN114491231A publication Critical patent/CN114491231A/en
Application granted granted Critical
Publication of CN114491231B publication Critical patent/CN114491231B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

本发明公开了一种半结构化留存浏览器访问信息的方法,其中,该方法采用的系统包括信息复制单元、信息处理单元、信息发送单元、信息接收单元、信息存储单元、信息查询单元,所述信息查询单元与信息处理单元电连接,所述信息存储单元和信息复制单元均与信息处理单元电连接,所述信息接收单元和信息发送单元均与信息处理单元电连接,所述信息复制单元用于将浏览器插件程序浏览器的浏览信息,所述信息处理单元用于根据http协议处理数据,所述信息存储单元用于将网页上的信息储存在本地,所述信息查询单元用于用户再次访问信息时在本地进行检索,本发明让用户更方便的查找、利用用户自己曾经通过浏览器访问过的部分数据。

Figure 202111569817

The invention discloses a method for semi-structured retention of browser access information, wherein the system adopted in the method comprises an information copying unit, an information processing unit, an information sending unit, an information receiving unit, an information storage unit, and an information query unit. The information query unit is electrically connected to the information processing unit, the information storage unit and the information copying unit are both electrically connected to the information processing unit, the information receiving unit and the information sending unit are both electrically connected to the information processing unit, and the information copying unit is electrically connected to the information processing unit. Used for browsing information of the browser plug-in program browser, the information processing unit is used to process data according to the http protocol, the information storage unit is used to store the information on the web page locally, and the information query unit is used for the user When the information is accessed again, the retrieval is carried out locally, and the present invention makes it easier for the user to find and utilize part of the data that the user himself has accessed through the browser.

Figure 202111569817

Description

一种半结构化留存浏览器访问信息的方法A method for semi-structured retention of browser access information

技术领域technical field

本发明属于浏览器技术领域,具体涉及一种半结构化留存浏览器访问信息的方法。The invention belongs to the technical field of browsers, and in particular relates to a method for semi-structured retention of browser access information.

背景技术Background technique

随着时代的发展,通过互联网(浏览器)获取信息变得越来越方便,但是怎么快速的找到自己曾经访问过的信息并进行数据分析成了一个难题。现在用户大部分都是通过手动复制信息到电子表格,或者收藏夹收藏网页。With the development of the times, it has become more and more convenient to obtain information through the Internet (browser), but how to quickly find the information that one has accessed and conduct data analysis has become a difficult problem. Most of the time users now do this by manually copying information into spreadsheets, or bookmarking web pages.

然而,本申请发明人在实现本申请技术方案的过程中,发现:However, in the process of realizing the technical solution of the present application, the inventor of the present application found that:

第一,很多时候,由于很多网页是动态的,就导致再次打开这个网页和当初打开的时候网页的信息不同,用户很难获取曾经看过的信息。First, in many cases, because many web pages are dynamic, the information of the web page is different when the web page is opened again, and it is difficult for users to obtain the information they have seen before.

第二,浏览器收藏夹查找这个行为随着收藏网页变多,查找也会变得困难。Second, the behavior of browser favorites search will become more difficult as the number of favorite web pages increases.

第三,这两种行为都不便于用户利用一些统计学的软件去对信息分析及处理Third, these two behaviors are inconvenient for users to use some statistical software to analyze and process information

第四,浏览器关闭后,信息即丢失,再次访问信息必须重新发起网络请求,反复的访问同一个网页,本质也是对社会资源的一种浪费。(至少同一个信息占了多次网络资源是对网络资源的浪费);Fourth, after the browser is closed, the information is lost. To access the information again, you must re-initiate a network request, and repeatedly visit the same web page, which is essentially a waste of social resources. (At least the same information occupies multiple network resources is a waste of network resources);

在信息共享时将信息的发送数量与使用者的共享程度有关,有效避免了信息泄漏给对共享程度较低的人,鼓励人参与共享。When information is shared, the number of information sent is related to the degree of sharing of users, which effectively avoids information leakage to people with a low degree of sharing, and encourages people to participate in sharing.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于针对现有的集材装置一种半结构化留存浏览器访问信息的方法,以解决上述背景技术中提出的问题。The purpose of the present invention is to provide a semi-structured method for retaining browser access information for existing skidding devices, so as to solve the problems raised in the above background art.

为了解决上述技术问题,本发明提供如下技术方案:一种半结构化留存浏览器访问信息的方法,该方法采用的系统包括信息复制单元、信息处理单元、信息发送单元、信息接收单元、信息存储单元、信息查询单元,所述信息查询单元与信息处理单元电连接,所述信息存储单元和信息复制单元均与信息处理单元电连接,所述信息接收单元和信息发送单元均与信息处理单元电连接;In order to solve the above-mentioned technical problems, the present invention provides the following technical solutions: a method for semi-structured retention of browser access information, the system adopted in the method includes an information copying unit, an information processing unit, an information sending unit, an information receiving unit, and an information storage unit. unit and information query unit, the information query unit is electrically connected to the information processing unit, the information storage unit and the information replication unit are both electrically connected to the information processing unit, the information receiving unit and the information sending unit are both electrically connected to the information processing unit connect;

所述信息复制单元用于将浏览器插件程序浏览器的浏览信息,所述信息处理单元用于根据http协议处理数据,所述信息存储单元用于将网页上的信息储存在本地,所述信息查询单元用于用户再次访问信息时在本地进行检索,所述信息接收单元用于接收处理过的信息,所述信息发送单元用于发送处理过的信息。The information copying unit is used to copy the browsing information of the browser plug-in program browser, the information processing unit is used to process data according to the http protocol, the information storage unit is used to store the information on the web page locally, the information The query unit is used for local retrieval when the user accesses the information again, the information receiving unit is used for receiving processed information, and the information sending unit is used for sending the processed information.

本发明进一步说明,所述信息处理单元包括信息体处理单元和访问处理单元,所述信息体处理单元用于数据处理成,域名,请求路径,查询参数,COOKIE,请求信息,响应信息复合的信息体,所述访问处理单元用于将信息处理使其方便查询。The present invention further illustrates that the information processing unit includes an information body processing unit and an access processing unit, and the information body processing unit is used for data processing into, domain name, request path, query parameter, COOKIE, request information, and response information composite information The access processing unit is used to process the information to facilitate query.

本发明进一步说明,包括以下具体步骤:The present invention is further described, including following concrete steps:

S1、在用户利用浏览器上网时,浏览器内置插件中的利用信息复制单元复制信息;S1. When the user uses the browser to surf the Internet, the information copying unit in the built-in plug-in of the browser copies the information;

S2、信息体处理单元根据HTTP协议处理数据,把数据处理成,域名,请求路径,查询参数,COOKIE,请求信息,响应信息合称信息体I;S2, the information body processing unit processes the data according to the HTTP protocol, and processes the data into, domain name, request path, query parameter, COOKIE, request information, and response information collectively referred to as information body I;

S3、利用信息发送单元信息体I发送到接收单元接收信息;S3, utilize the information sending unit information body 1 to send to the receiving unit to receive the information;

S4、利用访问处理单元将信息进一步处理后利用信息存储单元将信息体I存到用户本地计算机;S4, utilize the information storage unit to store the information body I to the user's local computer after the information is further processed by the access processing unit;

S4、下次访问时,用户之后就可以通过信息查询单元查询获取信息体I。S4. During the next visit, the user can then query and obtain the information body I through the information query unit.

本发明进一步说明,还包括信息有效性评级模块,所述信息有效性评级模块包括评级选择模块、评级统计模块、评级差值记录模块、评级排序模块,所述评级统计模块与评级差值记录模块、评级排序模块电连接;The present invention further illustrates that it also includes an information validity rating module, the information validity rating module includes a rating selection module, a rating statistics module, a rating difference recording module, and a rating sorting module, the rating statistics module and the rating difference recording module . The rating and sorting module is electrically connected;

所述评级选择模块用于让使用者选择信息有效性评级,所述评级统计模块用于统计所有使用者的信息有效性评级,所述评级差值记录模块用于记录使用者每次信息有效性评级的差值,所述评级排序模块用于根据信息有效性评级的高低对使用者进行排序。The rating selection module is used to allow users to select an information validity rating, the rating statistics module is used to count the information validity ratings of all users, and the rating difference recording module is used to record the information validity of each user. The difference between the ratings, the rating sorting module is used to sort the users according to the level of the information validity rating.

本发明进一步说明,所述信息查询单元包括信息发出模块、信息显示模块、信息有效性计算模块、信息排序模块,所述信息发出模块与信息显示模块电连接,所述信息有效性计算模块与信息排序模块、评级差值记录模块电连接;The present invention further describes that the information query unit includes an information sending module, an information display module, an information validity calculation module, and an information sorting module, the information sending module is electrically connected with the information display module, and the information validity calculation module is connected with the information The sorting module and the rating difference recording module are electrically connected;

所述信息发出模块用于发送使用记录的有关信息给使用者,所述信息显示模块用于显示使用记录的有关信息,所述信息有效性计算模块用于根据使用者每次信息有效性评级的差值判断信息的,所述信息排序模块用于根据信息的有效性对信息进行排序。The information sending module is used to send the relevant information of the usage record to the user, the information display module is used to display the relevant information of the usage record, and the information validity calculation module is used to calculate the validity of each information according to the user's rating. In the case of difference judgment information, the information sorting module is used for sorting the information according to the validity of the information.

本发明进一步说明,所述信息有效性评级模块的工作过程为:The present invention further illustrates that the working process of the information validity rating module is:

Sa、最初所有使用者对被发起共享的历史记录的信息的掌握程度都相同,第一轮共享,使用者凭个人意愿共享,通过共享选择模块给共享程度,系统通过共享统计模块统计所有使用者的共享程度,再通过共享排序模块根据共享程度高低对使用者进行排序,共享高的使用者可以获得更多与历史记录有关的信息,信息通过信息发出模块提供给使用者,使用者可通过信息显示模块查看;Sa. Initially, all users have the same degree of mastery of the information of the history records that are initiated to be shared. In the first round of sharing, users share according to their personal wishes, and the sharing degree is given by the sharing selection module, and the system counts all users through the sharing statistics module. Users with high sharing degree can be sorted according to the sharing degree through the sharing sorting module. Users with high sharing can obtain more information related to historical records. The information is provided to users through the information sending module. Display module view;

Sb、第二轮共享,使用者根据上一轮共享后系统提供的历史记录的相关信息进行共享程度调整,系统通过共享差值记录模块下记录使用者共享的差值,再通过信息有效性计算模块,根据使用者共享的差值来判断什么信息更吸引使用者,并通过信息排序模块将这些信息根据有效性进行排序,系统对使用者的第二轮共享排序,优先将有效性更高的信息提供给使用者;Sb, the second round of sharing, the user adjusts the sharing degree according to the relevant information of the historical records provided by the system after the previous round of sharing, the system records the difference shared by the user through the shared difference recording module, and then calculates the validity of the information. module, which judges which information is more attractive to users according to the difference value shared by users, and sorts the information according to its effectiveness through the information sorting module. information provided to users;

Sc、系统通过数据获取模块收集使用者两次共享的数据,加以平均得到历史记录的合理共享程度;Sc. The system collects the data shared by the user twice through the data acquisition module, and averagely obtains the reasonable sharing degree of the historical records;

Sd、第三轮共享,使用者根据第二轮共享后提供的信息,进行最后一次共享;Sd, the third round of sharing, the user conducts the last sharing according to the information provided after the second round of sharing;

Sd、第三轮共享最高的使用者发起共享成功。Sd, the user with the highest sharing in the third round initiates the sharing successfully.

本发明进一步说明,上述步骤Sa中,信息分享的规则为:The present invention further illustrates, in above-mentioned step Sa, the rule of information sharing is:

系统对共享更高的使用者会提供更多信息,对共享低的使用者会提供少量信息,具体的方法为;The system will provide more information to users with higher sharing, and a small amount of information to users with low sharing. The specific methods are as follows;

某电子历史记录有m条信息,有n个使用者参与发起共享,第一轮共享时每个使用者的共享程度为Ai,第二轮共享时每个使用者的共享为Bi,其中i∈[1,n],则第一轮使用者的平均共享为

Figure BDA0003423242770000041
There are m pieces of information in an electronic history record, and n users participate in initiating sharing. The sharing degree of each user in the first round of sharing is A i , and the sharing of each user in the second round of sharing is B i , where i∈[1,n], then the average sharing of users in the first round is
Figure BDA0003423242770000041

使用者第一轮共享与他第一轮共享后所获得信息的关系为The relationship between the user's first round of sharing and the information obtained after his first round of sharing is:

Figure BDA0003423242770000042
Figure BDA0003423242770000042

其中q为使用者在第一轮共享后所获得的信息数,μ为调节系数,q条信息将从总的m条信息中随机抽取。Among them, q is the number of information obtained by the user after the first round of sharing, μ is the adjustment coefficient, and q pieces of information will be randomly selected from the total m pieces of information.

本发明进一步说明,上述步骤S2中,信息的有效性计算的具体方法为:The present invention further illustrates that in the above step S2, the specific method for calculating the validity of the information is:

每条信息的初始有效性为100%,每个使用者在第二轮共享前收到了qi条信息,并根据信息给出了新的共享程度Bi,则每个使用者对他所收到的信息的总的有效性的评价为

Figure BDA0003423242770000043
The initial validity of each piece of information is 100%. Each user receives q i pieces of information before the second round of sharing, and gives a new sharing degree B i according to the information. The overall validity of the information is evaluated as
Figure BDA0003423242770000043

信息随机提供给使用者,一条信息被提供给使用者的次数不定,一条信息的有效性表现为对所有收到此条信息的使用者的总有效性评价取平均值,一条信息的有效性为Information is provided to users randomly, and the number of times a piece of information is provided to users is not fixed. The validity of a piece of information is expressed as the average of the total validity evaluations of all users who have received this piece of information. The validity of a piece of information is

Figure BDA0003423242770000051
Figure BDA0003423242770000051

其中W为信息的有效性,

Figure BDA0003423242770000052
为对收到此条信息的所有使用者的总有效性评价的总和,k为收到此条信息的所有使用者的人数。where W is the validity of the information,
Figure BDA0003423242770000052
is the sum of the total validity evaluations of all users who have received this piece of information, and k is the number of all users who have received this piece of information.

与现有技术相比,本发明所达到的有益效果是:本发明,通过程序接口M把数据留存在本地,有以下优点:Compared with the prior art, the beneficial effects achieved by the present invention are: the present invention, through the program interface M, retains the data locally, and has the following advantages:

第一、可以忽略掉大部分的图片音频请求,过滤掉了很多无用的信息,First, most of the picture and audio requests can be ignored, and a lot of useless information can be filtered out.

第二、使得用户直接可以在本地获取到曾经访问过的信息,无需再次通过网络请求数据节约了网络资源;Second, users can directly obtain the information they have accessed locally, without the need to request data through the network again, saving network resources;

第三、程序半结构化的存储比起HTML页面更方便信息统计处理(即数据挖掘),可以为用户(信息使用者)节省时间。Third, the semi-structured storage of programs is more convenient for statistical processing of information (ie data mining) than HTML pages, which can save time for users (information users).

附图说明Description of drawings

附图用来提供对本发明的进一步理解,并且构成说明书的一部分,与本发明的实施例一起用于解释本发明,并不构成对本发明的限制。在附图中:The accompanying drawings are used to provide a further understanding of the present invention, and constitute a part of the specification, and are used to explain the present invention together with the embodiments of the present invention, and do not constitute a limitation to the present invention. In the attached image:

图1是本发明的整体结构示意图;Fig. 1 is the overall structure schematic diagram of the present invention;

具体实施方式Detailed ways

以下结合较佳实施例及其附图对本发明技术方案作进一步非限制性的详细说明。显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solution of the present invention is further described in non-limiting detail below in conjunction with the preferred embodiments and the accompanying drawings. Obviously, the described embodiments are only some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

请参阅图1,本发明提供技术方案:一种半结构化留存浏览器访问信息的方法,该方法采用的系统包括信息复制单元、信息处理单元、信息发送单元、信息接收单元、信息存储单元、信息查询单元,信息查询单元与信息处理单元电连接,信息存储单元和信息复制单元均与信息处理单元电连接,信息接收单元和信息发送单元均与信息处理单元电连接;Referring to FIG. 1, the present invention provides a technical solution: a method for semi-structured retention of browser access information, the system used in the method includes an information copying unit, an information processing unit, an information sending unit, an information receiving unit, an information storage unit, an information query unit, the information query unit is electrically connected to the information processing unit, the information storage unit and the information duplication unit are both electrically connected to the information processing unit, and the information receiving unit and the information sending unit are both electrically connected to the information processing unit;

信息复制单元用于将浏览器插件程序浏览器的浏览信息,信息处理单元用于根据http协议处理数据,信息存储单元用于将网页上的信息储存在本地,信息查询单元用于用户再次访问信息时在本地进行检索,信息接收单元用于接收处理过的信息,信息发送单元用于发送处理过的信息;The information copying unit is used for browsing the browser plug-in program browser information, the information processing unit is used to process data according to the http protocol, the information storage unit is used to store the information on the web page locally, and the information query unit is used for the user to access the information again. When the retrieval is performed locally, the information receiving unit is used to receive the processed information, and the information sending unit is used to send the processed information;

信息处理单元包括信息体处理单元和访问处理单元,信息体处理单元用于数据处理成,域名,请求路径,查询参数,COOKIE,请求信息,响应信息复合的信息体,访问处理单元用于将信息处理使其方便查询;The information processing unit includes an information body processing unit and an access processing unit. The information body processing unit is used for data processing into, domain name, request path, query parameter, COOKIE, request information, and response information composite information body, and the access processing unit is used to convert the information. processing to make it easy to query;

包括以下具体步骤:It includes the following specific steps:

S1、在用户利用浏览器上网时,浏览器内置插件中的利用信息复制单元复制信息;S1. When the user uses the browser to surf the Internet, the information copying unit in the built-in plug-in of the browser copies the information;

S2、信息体处理单元根据HTTP协议处理数据,把数据处理成,域名,请求路径,查询参数,COOKIE,请求信息,响应信息合称信息体I;S2, the information body processing unit processes the data according to the HTTP protocol, and processes the data into, domain name, request path, query parameter, COOKIE, request information, and response information collectively referred to as information body I;

S3、利用信息发送单元信息体I发送到接收单元接收信息;S3, utilize the information sending unit information body 1 to send to the receiving unit to receive the information;

S4、利用访问处理单元将信息进一步处理后利用信息存储单元将信息体I存到用户本地计算机;S4, utilize the information storage unit to store the information body I to the user's local computer after the information is further processed by the access processing unit;

S4、下次访问时,用户之后就可以通过信息查询单元查询获取信息体I;S4, during the next visit, the user can query and obtain the information body I through the information query unit afterwards;

还包括信息有效性评级模块,信息有效性评级模块包括评级选择模块、评级统计模块、评级差值记录模块、评级排序模块,评级统计模块与评级差值记录模块、评级排序模块电连接;It also includes an information validity rating module, which includes a rating selection module, a rating statistics module, a rating difference recording module, and a rating sorting module, and the rating statistics module is electrically connected with the rating difference recording module and the rating sorting module;

评级选择模块用于让使用者选择信息有效性评级,评级统计模块用于统计所有使用者的信息有效性评级,评级差值记录模块用于记录使用者每次信息有效性评级的差值,评级排序模块用于根据信息有效性评级的高低对使用者进行排序;The rating selection module is used to allow users to select the information validity rating, the rating statistics module is used to count the information validity ratings of all users, and the rating difference recording module is used to record the difference of each user's information validity rating. The sorting module is used to sort users according to the information validity rating;

信息查询单元包括信息发出模块、信息显示模块、信息有效性计算模块、信息排序模块,信息发出模块与信息显示模块电连接,信息有效性计算模块与信息排序模块、评级差值记录模块电连接;The information query unit includes an information sending module, an information display module, an information validity calculation module, and an information sorting module, the information sending module is electrically connected with the information display module, and the information validity calculating module is electrically connected with the information sorting module and the rating difference recording module;

信息发出模块用于发送使用记录的有关信息给使用者,信息显示模块用于显示使用记录的有关信息,信息有效性计算模块用于根据使用者每次信息有效性评级的差值判断信息的,信息排序模块用于根据信息的有效性对信息进行排序;The information sending module is used to send the relevant information of the usage record to the user, the information display module is used to display the relevant information of the usage record, and the information validity calculation module is used to judge the information according to the difference between the user's information validity rating each time, The information sorting module is used to sort the information according to the validity of the information;

信息有效性评级模块的工作过程为:The working process of the information validity rating module is as follows:

Sa、最初所有使用者对被发起共享的历史记录的信息的掌握程度都相同,第一轮共享,使用者凭个人意愿共享,通过共享选择模块给共享程度,系统通过共享统计模块统计所有使用者的共享程度,再通过共享排序模块根据共享程度高低对使用者进行排序,共享高的使用者可以获得更多与历史记录有关的信息,信息通过信息发出模块提供给使用者,使用者可通过信息显示模块查看;Sa. Initially, all users have the same degree of mastery of the information of the history records that are initiated to be shared. In the first round of sharing, users share according to their personal wishes, and the sharing degree is given by the sharing selection module, and the system counts all users through the sharing statistics module. Users with high sharing degree can be sorted according to the sharing degree through the sharing sorting module. Users with high sharing can obtain more information related to historical records. The information is provided to users through the information sending module. Display module view;

Sb、第二轮共享,使用者根据上一轮共享后系统提供的历史记录的相关信息进行共享程度调整,系统通过共享差值记录模块下记录使用者共享的差值,再通过信息有效性计算模块,根据使用者共享的差值来判断什么信息更吸引使用者,并通过信息排序模块将这些信息根据有效性进行排序,系统对使用者的第二轮共享排序,优先将有效性更高的信息提供给使用者;Sb, the second round of sharing, the user adjusts the sharing degree according to the relevant information of the historical records provided by the system after the previous round of sharing, the system records the difference shared by the user through the shared difference recording module, and then calculates the validity of the information. module, which judges which information is more attractive to users according to the difference value shared by users, and sorts the information according to its effectiveness through the information sorting module. information provided to users;

Sc、系统通过数据获取模块收集使用者两次共享的数据,加以平均得到历史记录的合理共享程度;Sc. The system collects the data shared by the user twice through the data acquisition module, and averagely obtains the reasonable sharing degree of the historical records;

Sd、第三轮共享,使用者根据第二轮共享后提供的信息,进行最后一次共享;Sd, the third round of sharing, the user conducts the last sharing according to the information provided after the second round of sharing;

Sd、第三轮共享最高的使用者发起共享成功;Sd, the user with the highest sharing in the third round initiates the sharing successfully;

上述步骤Sa中,信息分享的规则为:In the above step Sa, the rules for information sharing are:

系统对共享更高的使用者会提供更多信息,对共享低的使用者会提供少量信息,具体的方法为;The system will provide more information to users with higher sharing, and a small amount of information to users with low sharing. The specific methods are as follows;

某电子历史记录有m条信息,有n个使用者参与发起共享,第一轮共享时每个使用者的共享程度为Ai,第二轮共享时每个使用者的共享为Bi,其中i∈[1,n],则第一轮使用者的平均共享为

Figure BDA0003423242770000081
There are m pieces of information in an electronic history record, and n users participate in initiating sharing. The sharing degree of each user in the first round of sharing is A i , and the sharing of each user in the second round of sharing is B i , where i∈[1,n], then the average sharing of users in the first round is
Figure BDA0003423242770000081

使用者第一轮共享与他第一轮共享后所获得信息的关系为The relationship between the user's first round of sharing and the information obtained after his first round of sharing is:

Figure BDA0003423242770000082
Figure BDA0003423242770000082

其中q为使用者在第一轮共享后所获得的信息数,μ为调节系数,q条信息将从总的m条信息中随机抽取;Among them, q is the number of information obtained by the user after the first round of sharing, μ is the adjustment coefficient, and q pieces of information will be randomly selected from the total m pieces of information;

上述步骤S2中,信息的有效性计算的具体方法为:In the above step S2, the specific method for calculating the validity of the information is:

每条信息的初始有效性为100%,每个使用者在第二轮共享前收到了qi条信息,并根据信息给出了新的共享程度Bi,则每个使用者对他所收到的信息的总的有效性的评价为

Figure BDA0003423242770000083
The initial validity of each piece of information is 100%. Each user receives q i pieces of information before the second round of sharing, and gives a new sharing degree B i according to the information. The overall validity of the information is evaluated as
Figure BDA0003423242770000083

信息随机提供给使用者,一条信息被提供给使用者的次数不定,一条信息的有效性表现为对所有收到此条信息的使用者的总有效性评价取平均值,一条信息的有效性为Information is provided to users randomly, and the number of times a piece of information is provided to users is not fixed. The validity of a piece of information is expressed as the average of the total validity evaluations of all users who have received this piece of information. The validity of a piece of information is

Figure BDA0003423242770000084
Figure BDA0003423242770000084

其中W为信息的有效性,

Figure BDA0003423242770000091
为对收到此条信息的所有使用者的总有效性评价的总和,k为收到此条信息的所有使用者的人数。where W is the validity of the information,
Figure BDA0003423242770000091
is the sum of the total validity evaluations of all users who have received this piece of information, and k is the number of all users who have received this piece of information.

实施例:步骤1、假设用户访问国家统计局网站Example: Step 1. Suppose the user visits the website of the National Bureau of Statistics

https://data.stats.gov.cn/easyquery.htm?cn=A01,我们很容易发现结构化的信息的在https://data.stats.gov.cn/easyquery.htm? cn=A01, we can easily find the structure of the information in

https://data.stats.gov.cn/easyquery.htm?m=QueryData&dbcode=hgyd&rowcode=zb&colcode=sj&wds=%5B%5D&dfwds=%5B%5D&k1=1636513644385&h=1https://data.stats.gov.cn/easyquery.htm? m=QueryData&dbcode=hgyd&rowcode=zb&colcode=sj&wds=%5B%5D&dfwds=%5B%5D&k1=1636513644385&h=1

步骤2、信息被处理成域名:data.stats.gov.cn,路径:/easyquery.htm,查询参数Step 2. The information is processed into domain name: data.stats.gov.cn, path: /easyquery.htm, query parameters

:{m:QueryData,dbcode:hgyd,rowcode:zb,colcode:sj,wds:%5B%5D,dfwds:%5B%5D,k1:1636513644385,h:1},COOKIE:{(太长省略)},请求信息:{(太长省略)},响应信息:{(太长省略)}:{m:QueryData,dbcode:hgyd,rowcode:zb,colcode:sj,wds:%5B%5D,dfwds:%5B%5D,k1:1636513644385,h:1},COOKIE:{(too long omitted)} , request information: {(too long to omit)}, response information: {(too long to omit)}

步骤3、程序2信息处理单元处理数据,程序2信息存储单元把信息存到用户本地Step 3, the program 2 information processing unit processes the data, and the program 2 information storage unit stores the information in the user's local

步骤4、用户想再次访问该信息时,用户可以通过程序2信息查询单元在本地检索,域名=data.stats.gov.cn和查询参数.dbcode=hgyd就可以获取到相应信息I,无需再次向data.stats.gov.cn所在服务器申请数据,而且查询到是JSON结构的数据相对更方便分析。Step 4. When the user wants to access the information again, the user can retrieve the information locally through the information query unit of program 2. The domain name=data.stats.gov.cn and the query parameter .dbcode=hgyd can obtain the corresponding information I, and there is no need to go to the information again. The server where data.stats.gov.cn is located applies for data, and the data in JSON structure is relatively easier to analyze.

在本发明的描述中,需要理解的是,术语“上”、“下”、“前”、“后”、“左”、“右”、等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。In the description of the present invention, it should be understood that the orientation or positional relationship indicated by the terms "upper", "lower", "front", "rear", "left", "right", etc. is based on the drawings shown in the drawings. The orientation or positional relationship is only for the convenience of describing the present invention, rather than indicating or implying that the referred device or element must have a specific orientation, be constructed and operated in a specific orientation, and therefore should not be construed as a limitation of the present invention.

最后需要指出的是:以上实施例仅用以说明本发明的技术方案,而非对其限制。尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be pointed out that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that it is still possible to modify the technical solutions described in the foregoing embodiments, or perform equivalent replacements to some of the technical features therein, and these modifications Or alternatively, the essence of the corresponding technical solutions does not deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (8)

1. A method of semi-structured retention of browser access information, characterized by: the system adopted by the method comprises an information copying unit, an information processing unit, an information sending unit, an information receiving unit, an information storage unit and an information inquiry unit, wherein the information inquiry unit is electrically connected with the information processing unit, the information storage unit and the information copying unit are both electrically connected with the information processing unit, and the information receiving unit and the information sending unit are both electrically connected with the information processing unit;
the information copying unit is used for copying browsing information of a browser plug-in program browser, the information processing unit is used for processing data according to an http protocol, the information storage unit is used for storing information on a webpage locally, the information query unit is used for retrieving the information locally when a user accesses the information again, the information receiving unit is used for receiving the processed information, and the information sending unit is used for sending the processed information.
2. A method of semi-structured persistence browser access information according to claim 1, further comprising: the information processing unit comprises an information body processing unit and an access processing unit, the information body processing unit is used for processing data into an information body with a domain name, a request path, a query parameter, COOKIE, request information and response information composition, and the access processing unit is used for processing the information to facilitate query.
3. A method of semi-structured persistence browser access information according to claim 2, further comprising: the method comprises the following specific steps:
s1, when the user surfs the internet by using the browser, the information is copied by using the information copying unit in the plug-in built in the browser;
s2, the information body processing unit processes data according to HTTP protocol, and processes the data into domain name, request path, inquiry parameter, COOKIE, request information, response information and name information body I;
s3, sending the information to the receiving unit by using the information sending unit information body I to receive the information;
s4, using the access processing unit to further process the information and using the information storage unit to store the information body I in the local computer of the user;
and S4, when accessing next time, the user can inquire and acquire the information body I through the information inquiry unit.
4. A method of semi-structured persistence browser access information according to claim 3, wherein: the system also comprises an information effectiveness rating module, wherein the information effectiveness rating module comprises a rating selection module, a rating statistics module, a rating difference value recording module and a rating sequencing module, and the rating statistics module is electrically connected with the rating difference value recording module and the rating sequencing module;
the rating selection module is used for enabling users to select information effectiveness ratings, the rating statistics module is used for counting the information effectiveness ratings of all users, the rating difference value recording module is used for recording the difference value of each information effectiveness rating of the users, and the rating sorting module is used for sorting the users according to the information effectiveness ratings.
5. A method of semi-structured persistence browser access information according to claim 4, wherein: the information inquiry unit comprises an information sending module, an information display module, an information effectiveness calculation module and an information sorting module, wherein the information sending module is electrically connected with the information display module, and the information effectiveness calculation module is electrically connected with the information sorting module and the rating difference value recording module;
the information sending module is used for sending the relevant information of the use records to a user, the information display module is used for displaying the relevant information of the use records, the information effectiveness calculation module is used for judging the information according to the difference value of the information effectiveness rating of each time of the user, and the information sorting module is used for sorting the information according to the effectiveness of the information.
6. A method of semi-structured persistence browser access information according to claim 5, further comprising: the working process of the information effectiveness rating module is as follows:
sa, all users have the same mastery degree on the information of the initiated shared history record at first, the users share the information in the first round, the users share the information according to personal wishes, the sharing degree is given by a sharing selection module, the system counts the sharing degree of all users by a sharing counting module, the users are ranked by a sharing ranking module according to the sharing degree, the users with high sharing degree can obtain more information related to the history record, the information is provided for the users by an information sending module, and the users can check the information by an information display module;
sb, sharing in the second round, adjusting the sharing degree of a user according to related information of a history record provided by the system after the previous round of sharing, recording a difference value shared by the user by the system through a sharing difference value recording module, judging what information attracts the user according to the difference value shared by the user through an information effectiveness calculating module, sequencing the information according to effectiveness through an information sequencing module, sequencing the second round of sharing of the user through the system, and preferentially providing the information with higher effectiveness for the user;
the Sc and system collects the data shared twice by the user through a data acquisition module, and the data are averaged to obtain the reasonable sharing degree of the historical records;
sd, third round sharing, the user carries out the last sharing according to the information provided after the second round sharing;
sd, the third round of users with highest sharing initiate sharing success.
7. A method of semi-structured retention of browser access information according to claim 6, further comprising: in the step Sa, the rule of information sharing is as follows:
the system can provide more information for users with higher sharing and provide a small amount of information for users with low sharing, and the specific method is as follows;
some electronic history record has m pieces of information, n users participate in initiating sharing, and the sharing degree of each user is A in the first round of sharingiThe share of each user in the second round of sharing is BiWhere i ∈ [1, n ]]The average share of the first round of users is
Figure FDA0003423242760000031
The relation between the first round of sharing of the user and the information obtained after the first round of sharing is
Figure FDA0003423242760000032
Where q is the number of messages obtained by the user after the first round of sharing, μ is the adjustment factor, and q messages will be randomly extracted from the total of m messages.
8. A method of semi-structured persistence browser access information according to claim 7, further comprising: in step S2, the specific method for calculating the validity of the information includes:
the initial validity of each message is 100%, and each user received q before the second round of sharingiInformation and gives a new sharing degree B according to the informationiThen each user evaluates the overall validity of the information he receives as
Figure FDA0003423242760000041
The information is randomly provided for the user, the number of times a piece of information is provided for the user is variable, the effectiveness of a piece of information is represented by averaging the total effectiveness evaluation of all users who receive the piece of information, and the effectiveness of a piece of information is represented by
Figure FDA0003423242760000042
Where W is the validity of the information,
Figure FDA0003423242760000043
k is the total of all users who received the piece of information and the total validity rating, and k is the number of all users who received the piece of information.
CN202111569817.7A 2021-12-21 2021-12-21 A semi-structured method for storing browser access information Active CN114491231B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111569817.7A CN114491231B (en) 2021-12-21 2021-12-21 A semi-structured method for storing browser access information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111569817.7A CN114491231B (en) 2021-12-21 2021-12-21 A semi-structured method for storing browser access information

Publications (2)

Publication Number Publication Date
CN114491231A true CN114491231A (en) 2022-05-13
CN114491231B CN114491231B (en) 2024-11-26

Family

ID=81494402

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111569817.7A Active CN114491231B (en) 2021-12-21 2021-12-21 A semi-structured method for storing browser access information

Country Status (1)

Country Link
CN (1) CN114491231B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080126484A1 (en) * 2006-06-30 2008-05-29 Meebo, Inc. Method and system for determining and sharing a user's web presence
JP2009294768A (en) * 2008-06-03 2009-12-17 Toshiba Corp Information sharing device and information sharing program
US7725526B1 (en) * 2000-06-23 2010-05-25 International Business Machines Corporation System and method for web based sharing of search engine queries
CN102647481A (en) * 2012-03-31 2012-08-22 奇智软件(北京)有限公司 A device and method for accessing a preset network address
CN103365860A (en) * 2012-03-28 2013-10-23 腾讯科技(深圳)有限公司 Method, device and terminal for processing web pages
CN103761310A (en) * 2014-01-23 2014-04-30 贝壳网际(北京)安全技术有限公司 Method and device for processing webpage access records and browser
CN103916402A (en) * 2014-04-18 2014-07-09 北京奇虎科技有限公司 Method and device for protecting files downloaded by browser
US20160019298A1 (en) * 2014-07-15 2016-01-21 Microsoft Corporation Prioritizing media based on social data and user behavior
CN112597116A (en) * 2020-12-23 2021-04-02 中国电子信息产业集团有限公司第六研究所 Document sharing intelligent management system under autonomous controllable platform

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7725526B1 (en) * 2000-06-23 2010-05-25 International Business Machines Corporation System and method for web based sharing of search engine queries
US20080126484A1 (en) * 2006-06-30 2008-05-29 Meebo, Inc. Method and system for determining and sharing a user's web presence
JP2009294768A (en) * 2008-06-03 2009-12-17 Toshiba Corp Information sharing device and information sharing program
CN103365860A (en) * 2012-03-28 2013-10-23 腾讯科技(深圳)有限公司 Method, device and terminal for processing web pages
CN102647481A (en) * 2012-03-31 2012-08-22 奇智软件(北京)有限公司 A device and method for accessing a preset network address
CN103761310A (en) * 2014-01-23 2014-04-30 贝壳网际(北京)安全技术有限公司 Method and device for processing webpage access records and browser
CN103916402A (en) * 2014-04-18 2014-07-09 北京奇虎科技有限公司 Method and device for protecting files downloaded by browser
US20160019298A1 (en) * 2014-07-15 2016-01-21 Microsoft Corporation Prioritizing media based on social data and user behavior
CN112597116A (en) * 2020-12-23 2021-04-02 中国电子信息产业集团有限公司第六研究所 Document sharing intelligent management system under autonomous controllable platform

Also Published As

Publication number Publication date
CN114491231B (en) 2024-11-26

Similar Documents

Publication Publication Date Title
US8359313B2 (en) Extensible custom variables for tracking user traffic
CN102946319B (en) Networks congestion control information analysis system and analytical method thereof
CN102663048B (en) Method and device for providing search result
CN108363815B (en) Webpage pre-reading method and device and intelligent terminal equipment
CN102831199B (en) Method and device for establishing interest model
CN102932206B (en) The method and system of monitoring website access information
US20170013072A1 (en) Webpage pre-reading method, apparatus and smart terminal device
CN104216921B (en) A kind of addition reminding method, apparatus and system for realizing quick links in browser
CN102932207B (en) The method of monitoring website access information and server
CN110062025A (en) Method, apparatus, server and the storage medium of data acquisition
CN103970753B (en) The method for pushing and device of association knowledge
KR20120022893A (en) Generating improved document classification data using historical search results
CN103853822A (en) Method and device for pushing news information in browser
CN108804566A (en) A kind of mass small documents read method based on Hadoop
JP2011154467A (en) Retrieval result ranking method and system
Huntington et al. Website usage metrics: A re-assessment of session data
CN103780625A (en) Method and device for discovering interest of users
CN101694656A (en) Search request method, search method, device and system
CN103530390A (en) Webpage crawling method and device
CN107239542A (en) A kind of data statistical approach, device, server and storage medium
CN107436940A (en) The method of web front-end Dynamic Display data based on user profile behavioural analysis
CN102982079B (en) Personalized website navigation method and apparatus
KR100273775B1 (en) Method and apparatus for information service
CN108319622A (en) A kind of media content recommendations method and device
CN113449212B (en) Quality evaluation and optimization methods, devices and equipment for search results

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant