[go: up one dir, main page]

CN103413054A - Internet addiction detection device and method based on user-computer interactive events - Google Patents

Internet addiction detection device and method based on user-computer interactive events Download PDF

Info

Publication number
CN103413054A
CN103413054A CN2013103686051A CN201310368605A CN103413054A CN 103413054 A CN103413054 A CN 103413054A CN 2013103686051 A CN2013103686051 A CN 2013103686051A CN 201310368605 A CN201310368605 A CN 201310368605A CN 103413054 A CN103413054 A CN 103413054A
Authority
CN
China
Prior art keywords
addiction
frequent
complex
events
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013103686051A
Other languages
Chinese (zh)
Other versions
CN103413054B (en
Inventor
于亚新
王国仁
杨宝栓
闫珂
张旭
邹显鹏
朱歆华
许虎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northeastern University China
Original Assignee
Northeastern University China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northeastern University China filed Critical Northeastern University China
Priority to CN201310368605.1A priority Critical patent/CN103413054B/en
Publication of CN103413054A publication Critical patent/CN103413054A/en
Application granted granted Critical
Publication of CN103413054B publication Critical patent/CN103413054B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides an Internet addiction detection device and method based on user-computer interactive events and belongs to the field of data mining. According to the Internet addiction detection device and method based on the user-computer interactive events, quantifiable human-computer interactive operational data are collected through Internet surfing tools which are used by a user frequently, the data are used for computing and analyzing Internet surfing behaviors of the user, accordingly whether the user has an Internet addiction can be detected, and the Internet surfing tools can be controlled effectively. According to the Internet addiction detection device and method based on the user-computer interactive events, the accuracy of Internet addiction detection can reach more than 85%, errors of existing detection methods can be effectively avoided, and the accuracy of the detection can be improved; the detection costs can be reduced, a user can carry out detection at any time, high application value to primary and secondary school students can be obtained, Internet addiction behaviors can be effectively prevented and controlled, and damage caused by Internet addiction can be reduced.

Description

基于用户计算机交互事件的网瘾检测装置及方法Internet addiction detection device and method based on user computer interaction events

技术领域technical field

本发明属于数据挖掘领域,具体涉及一种基于用户计算机交互事件的网瘾检测装置及方法。The invention belongs to the field of data mining, and in particular relates to an Internet addiction detection device and method based on user computer interaction events.

背景技术Background technique

网络已成为现代人们生活的重要组成部分,尤其是诸如Facebook、Twitter、新浪微博等社交网的兴起,越来越多的人加入到了上网的队伍中。毫无疑问,网络在给人类社会带来高效便捷服务的同时,也给人类社会带来负面和消极影响。部分网络用户由于无节制地过度沉迷于网络,造成其生理、心里、情感、道德等多方面受损,干扰了正常的工作、学习和生活,甚至诱发犯罪。The Internet has become an important part of modern people's lives, especially with the rise of social networks such as Facebook, Twitter, and Sina Weibo, more and more people have joined the ranks of surfing the Internet. There is no doubt that while the network brings efficient and convenient services to human society, it also brings negative and negative impacts to human society. Due to unrestrained and excessive indulging in the Internet, some Internet users have suffered physical, psychological, emotional, and moral damages, interfered with normal work, study, and life, and even induced crimes.

通过查阅有关网瘾的相关资料,目前的研究工作和成果大都基于心理学、社会学、医学等领域展开,上述现有的网瘾检测方法繁琐、成本高、误差性大,而从计算机采集数据进行检测属于空白,事实上,用户与网络进行交互的计算机操作可以被详细记录下来,并用于分析,这对判断网瘾甚至是发现潜在网瘾趋势具有重要价值,但目前却被忽略。By consulting relevant information on Internet addiction, most of the current research work and achievements are based on psychology, sociology, medicine and other fields. Detection is blank. In fact, the computer operations that users interact with the Internet can be recorded in detail and used for analysis. This is of great value in judging Internet addiction and even discovering potential Internet addiction trends, but it is currently ignored.

发明内容Contents of the invention

针对现有技术的缺点,本发明提出一种基于用户计算机交互事件的网瘾检测装置及方法,以达到降低检测成本、提高检测准确度、简化检测方法的目的。Aiming at the shortcomings of the prior art, the present invention proposes a device and method for detecting Internet addiction based on user computer interaction events, so as to reduce detection cost, improve detection accuracy, and simplify detection methods.

一种基于用户计算机交互事件的网瘾检测装置,包括简单事件采集模块、复杂事件生成模块、时间泛化及归一化处理模块、频繁行为产生子获取模块、样本库构建模块、数据处理与检测模块和预警干预模块,其中,An Internet addiction detection device based on user computer interaction events, including a simple event collection module, a complex event generation module, a time generalization and normalization processing module, a frequent behavior generation sub-acquisition module, a sample library construction module, data processing and detection module and early warning intervention module, in which,

简单事件采集模块:用于采集用户与计算机交互变量作为简单事件,包括计算机CPU使用率、内存使用率、单位时间点击鼠标左键次数、单位时间点击鼠标右键次数、单位时间键盘敲击次数、单位时间鼠标移动的像素个数、网络流量和运行进程,并将上述输入变量发送至复杂事件生成模块;Simple event collection module: used to collect user-computer interaction variables as simple events, including computer CPU usage rate, memory usage rate, number of clicks of the left mouse button per unit time, number of times of right mouse clicks per unit time, number of keyboard strokes per unit time, unit The number of pixels moved by the time mouse, network traffic and running process, and send the above input variables to the complex event generation module;

复杂事件生成模块:用于将每个简单事件分别与各自阈值进行比较,若大于等于阈值范围,则保留该简单事件,再根据所保留的全部简单事件组合情况生成复杂事件;否则继续采集简单事件;所述的复杂事件包括离线看视频、在线看视频、即时战略游戏、桌游类游戏、浏览网页、在线聊天和下载操作;Complex event generation module: used to compare each simple event with its respective threshold, if it is greater than or equal to the threshold range, then keep the simple event, and then generate complex events according to the combination of all the retained simple events; otherwise, continue to collect simple events ; The complex events include watching videos offline, watching videos online, real-time strategy games, board games, browsing web pages, online chatting and downloading operations;

时间泛化及归一化处理模块:用于将每个用户某天所做的复杂事件进行整理,并将复杂事件发生时间进行泛化处理,即将一天划分为若干个时间段;将持续时间进行归一化处理,即将复杂事件的持续时间进行圆整操作;并将处理后的复杂事件发送至频繁行为产生子获取模块;Time generalization and normalization processing module: used to sort out the complex events that each user does on a certain day, and generalize the occurrence time of complex events, that is, to divide a day into several time periods; Normalization processing, that is, rounding the duration of complex events; and sending the processed complex events to the frequent behavior generation sub-acquisition module;

频繁行为产生子获取模块:当建立样本库时,用于获取多个网瘾用户之间所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁网瘾复杂事件,再获得由上述频繁网瘾复杂事件所构成的集合用于描述用户行为特征分布的产生子,并发送至样本库构建模块;用于获取多个非网瘾用户之间所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁非网瘾复杂事件,再获得由上述频繁非网瘾复杂事件所构成集合的用于描述用户行为特征分布的产生子,并发送至样本库构建模块;Frequent behavior generation sub-acquisition module: when establishing a sample library, it is used to obtain the same complex events between multiple Internet addicted users, collect the generation times of the above-mentioned same complex events, and select out according to the number threshold set by the user The complex events greater than the threshold are regarded as frequent Internet addiction complex events, and then a set composed of the above-mentioned frequent Internet addiction complex events is obtained to describe the generator of the distribution of user behavior characteristics, and sent to the sample library construction module; used to obtain multiple non- For the same complex events between Internet addicted users, collect the occurrence times of the above-mentioned same complex events, and select the complex events greater than the threshold as frequent non-Internet addiction complex events according to the number threshold set by the user, and then obtain the above frequent non-Internet addiction complex events. A generator for describing the distribution of user behavior characteristics, which is a collection of non-Internet addiction complex events, and sent to the sample library construction module;

当检测用户网瘾状况时,用于获取待测用户一段时间之内所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁复杂事件,再获得由上述频繁复杂事件所构成集合的用于描述行为特征分布的产生子,并发送至数据处理与检测模块;When detecting the user's Internet addiction status, it is used to obtain the same complex events performed by the user to be tested within a period of time, collect the number of occurrences of the above-mentioned same complex events, and select complex events greater than the threshold according to the number of times threshold set by the user As a frequent complex event, obtain a generator for describing the distribution of behavioral characteristics composed of the above frequent complex events, and send it to the data processing and detection module;

样本库构建模块:用于采用基于产生子的样本构建算法对频繁网瘾复杂事件产生子进行计算得分,若得分超过阈值,则保存为网瘾类,否则删除,并将频繁非网瘾复杂事件产生子保存保存为非网瘾类;或采用基于显露模式的样本构建算法对频繁复杂事件产生子进行计算,获得显露模式的产生子并保存;Sample library construction module: used to calculate the score of frequent Internet addiction complex event generators using the generator-based sample construction algorithm. If the score exceeds the threshold, it will be saved as an Internet addiction category, otherwise it will be deleted, and the frequent non-Internet addiction complex event The generators are saved as non-Internet addiction categories; or the generators of frequent and complex events are calculated using the sample construction algorithm based on the exposure patterns, and the generators of the exposure patterns are obtained and saved;

数据处理与检测模块:用于将待检测频繁行为产生子与网瘾样本库进行比较,若待检测频繁复杂事件产生子包含在样本库中,则判定其为网瘾,否则是非网瘾,并输出判断结果;或将待检测频繁行为产生子与样本库进行比较,采用基于显露模式的网瘾检测算法分别计算其网瘾得分与非网瘾得分,根据得分高低判断其所属类别;Data processing and detection module: used to compare the frequent behavior generators to be detected with the Internet addiction sample library, if the frequent complex event generators to be detected are included in the sample library, it is determined to be Internet addiction, otherwise it is not Internet addiction, and Output the judgment result; or compare the frequent behavior generators to be detected with the sample library, use the Internet addiction detection algorithm based on the exposure mode to calculate their Internet addiction scores and non-Internet addiction scores, and judge their category according to the score;

预警干预模块:用于当检测到用户计算机交互行为是网瘾行为时,发出预警提示,并通过定时器限定该用户可继续使用计算机的时间范围,1~2小时;若超过时限范围,则关闭计算机。Early warning intervention module: when it is detected that the user's computer interaction behavior is an Internet addiction behavior, an early warning prompt is issued, and the time range for the user to continue using the computer is limited by a timer, 1 to 2 hours; if it exceeds the time limit, it will be closed computer.

采用基于用户计算机交互事件的网瘾检测装置进行检测的方法,包括以下步骤:The method for detecting an Internet addiction detection device based on user computer interaction events comprises the following steps:

步骤1、采集多个网瘾用户和多个非网瘾用户与计算机之间的交互变量作为简单事件,包括计算机CPU使用率、内存使用率、单位时间点击鼠标左键次数、单位时间点击鼠标右键次数、单位时间键盘敲击次数、单位时间鼠标移动的像素个数、网络流量和运行进程;Step 1. Collect the interaction variables between multiple Internet addicted users and multiple non-Internet addicted users and the computer as simple events, including computer CPU usage, memory usage, the number of times the left mouse button is clicked per unit time, and the right mouse button clicked per unit time Number of times, number of keyboard strokes per unit time, number of pixels moved by the mouse per unit time, network traffic and running process;

步骤2、将每个简单事件分别与各自阈值进行比较,若大于等于阈值范围,则保留该简单事件,再根据所保留的全部简单事件组合情况生成复杂事件;否则继续采集简单事件;所述的复杂事件包括离线看视频、在线看视频、即时战略游戏、桌游类游戏、浏览网页、在线聊天和下载操作;Step 2. Compare each simple event with its respective threshold. If it is greater than or equal to the threshold range, then keep the simple event, and then generate a complex event according to the combination of all the retained simple events; otherwise, continue to collect simple events; Complex events include watching videos offline, watching videos online, real-time strategy games, board games, browsing the web, chatting online, and downloading operations;

生成复杂事件的过程如下:The process of generating a complex event is as follows:

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,以及上网流量超过或等于阈值范围35~45MB,则生成一个在线看视频的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the Internet traffic exceeds or equals to the threshold range of 35 to 45MB, a complex event of watching videos online is generated. ;

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,以及运行进程为多媒体播放器,则生成一个离线看视频的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the running process is a multimedia player, a complex event of watching videos offline is generated;

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间点击鼠标右键次数超过或等于阈值范围10~20次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,单位时间键盘敲击次数超过或等于阈值范围80~120次,以及运行进程为及时战略游戏,则生成一个玩即时战略游戏的复杂事件;If the CPU usage exceeds or is equal to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the number of clicks of the left mouse button per unit time exceeds or equals to the threshold range 25 to 35 times, click the mouse per unit time The number of right clicks exceeds or equals the threshold range of 10 to 20 times, the number of pixels moved by the mouse per unit time exceeds or equals the threshold range of 1550 to 1650, the number of keyboard strokes per unit time exceeds or equals the threshold range of 80 to 120 times, and the running process is For real-time strategy games, generate a complex event for playing real-time strategy games;

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,以及运行进程为桌游类游戏,则生成一个玩桌游类游戏的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the number of clicks of the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, the mouse moves per unit time. If the number of pixels exceeds or is equal to the threshold range of 1550 to 1650, and the running process is a board game, a complex event of playing a board game is generated;

若CPU使用率超过或等于阈值范围45%~55%,单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,以及运行进程为桌游类游戏,则生成另一个玩桌游类游戏的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the number of clicks of the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, and the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, And if the running process is a board game, another complex event of playing a board game is generated;

若单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,以及运行进程为浏览器,则生成一个浏览网页的复杂事件;If the number of clicks on the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, and the running process is a browser, a complex page browsing process is generated. event;

若单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,上网流量超过或等于阈值范围35~45MB,以及运行进程为浏览器,则生成另一个浏览网页的复杂事件;If the number of clicks on the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, the Internet traffic exceeds or equals to the threshold range of 35 to 45MB, and the running process For the browser, another complex event for browsing the web page is generated;

若单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,上网流量超过或等于阈值范围35~45MB,以及运行进程为浏览器,则生成又一个浏览网页的复杂事件;If the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, the Internet traffic exceeds or equals to the threshold range of 35 to 45MB, and the running process is a browser, another complex event of web browsing is generated;

若单位时间键盘敲击次数超过或等于阈值范围80~120次,以及运行进程为即时聊天软件,则生成一个网上聊天的复杂事件;If the number of keystrokes per unit time exceeds or is equal to the threshold range of 80 to 120 times, and the running process is instant chat software, a complex event of online chat will be generated;

若单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,单位时间键盘敲击次数超过或等于阈值范围80~120次,以及运行进程为即时聊天软件,则生成另一个网上聊天的复杂事件;If the number of pixels moved by the mouse per unit time exceeds or equals the threshold range of 1550 to 1650, the number of keystrokes per unit time exceeds or equals the threshold range of 80 to 120 times, and the running process is instant chat software, another online chat will be generated complex events;

若上网流量超过或等于阈值范围35~45MB,则生成一个下载资料的复杂事件;If the Internet traffic exceeds or is equal to the threshold range of 35-45MB, a complex event of downloading data will be generated;

步骤3、将复杂事件发生时间进行泛化处理,即将一天划分为6:00~11:00、11:00~14:00、14:00~18:00、18:00~23:00、23:00~6:00五个时间段,并将持续时间进行归一化处理,即将复杂事件的持续时间以10分钟为单位进行圆整操作;Step 3. Generalize the occurrence time of complex events, that is, divide a day into 6:00~11:00, 11:00~14:00, 14:00~18:00, 18:00~23:00, 23 :00~6:00 five time periods, and the duration is normalized, that is, the duration of complex events is rounded up in units of 10 minutes;

步骤4、获取多个网瘾用户之间所进行的相同复杂事件和多个非网瘾用户之间所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁网瘾复杂事件和频繁非网瘾复杂事件,再获得由上述频繁网瘾复杂事件所构成集合的用于描述用户行为特征分布的多个产生子,获得由上述频繁非网瘾复杂事件所构成集合的用于描述用户行为特征分布的多个产生子,并发送至样本库构建模块;Step 4. Acquire the same complex events between multiple Internet addicted users and the same complex events between multiple non-Internet addicted users, collect the number of occurrences of the above-mentioned same complex events, and set the threshold according to the number of times set by the user , select complex events greater than the threshold as frequent Internet addiction complex events and frequent non-Internet addiction complex events, and then obtain multiple generators for describing the distribution of user behavior characteristics composed of the above-mentioned frequent Internet addiction complex events, and obtain by Multiple generators used to describe the distribution of user behavior characteristics formed by the above-mentioned frequent non-internet addiction complex events are sent to the sample library construction module;

步骤5、采用基于产生子的样本构建算法对频繁网瘾复杂事件的产生子进行计算得分,若得分超过阈值,则保存为网瘾类,否则删除,并将频繁非网瘾复杂事件的产生子保存为非网瘾类;或采用基于显露模式的样本采集算法对频繁复杂事件产生子进行计算,获得显露模式的产生子并保存;Step 5. Use the generator-based sample construction algorithm to calculate the score for the generators of frequent Internet addiction complex events. If the score exceeds the threshold, save it as an Internet addiction category, otherwise delete it, and save the generators of frequent non-Internet addiction complex events. Save it as a non-Internet addiction category; or use the sample collection algorithm based on the exposure pattern to calculate the generators of frequent and complex events, obtain the generators of the exposure pattern and save them;

步骤6、采集待测用户一段时间之内与计算机之间的交互变量,并重复步骤2到步骤3;Step 6, collect the interaction variables between the user to be tested and the computer within a period of time, and repeat steps 2 to 3;

步骤7、获取待测用户一段时间之内所进行的相同复杂事件,并采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁复杂事件,再获得由上述频繁复杂事件所构成集合的用于描述行为特征分布的产生子,并发送至数据处理与检测模块;Step 7. Obtain the same complex events performed by the user to be tested within a period of time, and collect the occurrence times of the above-mentioned same complex events, and select complex events greater than the threshold as frequent complex events according to the number threshold set by the user, and then Obtain a generator for describing the distribution of behavioral characteristics composed of the above-mentioned frequent and complex events, and send it to the data processing and detection module;

步骤8、将待检测频繁行为产生子与网瘾样本库进行比较,若待检测频繁复杂事件产生子包含在样本库中,则判定其为网瘾,否则是非网瘾,并输出判断结果;或将待检测频繁行为产生子与样本库进行比较,采用基于显露模式的网瘾检测算法分别计算其网瘾得分与非网瘾得分,判断两者得分高低,待检测频繁复杂事件属于得分高一类别;Step 8, comparing the frequent behavior generator to be detected with the Internet addiction sample library, if the frequent complex event generator to be detected is included in the sample library, then determine it is Internet addiction, otherwise it is not Internet addiction, and output the judgment result; or Compare the frequent behavior generators to be detected with the sample library, and use the Internet addiction detection algorithm based on the exposure model to calculate their Internet addiction scores and non-Internet addiction scores, and judge the scores of the two. The frequent and complex events to be detected belong to the category with a higher score ;

步骤9、若检测到用户计算机交互行为是网瘾行为,则采用预警干预模块发出预警提示,并通过定时器限定该用户可继续使用计算机的时间范围,1~2小时;若超过时限范围,则关闭计算机。Step 9. If it is detected that the user's computer interaction behavior is an Internet addiction behavior, the early warning intervention module is used to issue an early warning prompt, and the time range within which the user can continue to use the computer is limited by a timer, 1 to 2 hours; if the time limit is exceeded, then Shut down the computer.

步骤2所述的简单事件的各自阈值:计算机CPU使用率取值范围45%~55%、内存使用率取值范围55%~65%、单位时间点击鼠标左键次数取值范围25~35次、单位时间点击鼠标右键次数取值范围10~20次,单位时间键盘敲击次数取值范围80~120、单位时间鼠标移动的像素个数取值范围1550~1650、网络流量取值范围35~45MB。The respective thresholds of the simple events described in step 2: the computer CPU usage ranges from 45% to 55%, the memory usage ranges from 55% to 65%, and the number of clicks on the left mouse button per unit time ranges from 25 to 35 times , The value range of the number of clicks on the right mouse button per unit time is 10-20 times, the value range of the number of keyboard strokes per unit time is 80-120, the value range of the number of pixels moved by the mouse per unit time is 1550-1650, and the value range of network traffic is 35- 45MB.

步骤5所述的采用基于产生子的样本构建算法对频繁网瘾复杂事件产生子进行计算得分,具体为:In Step 5, the generator-based sample construction algorithm is used to calculate the score of frequent Internet addiction complex event generators, specifically:

步骤5-1、根据用户需求确定复杂事件类型的权重系数,保证即时战略游戏的权重系数>在线看视频的权重系数>桌游类游戏的权重系数>浏览网页的权重系数>离线看视频的权重系数>在线聊天的权重系数>下载的权重系数,并且权重系数总数为1;Step 5-1. Determine the weight coefficient of complex event types according to user needs, and ensure that the weight coefficient of real-time strategy games > the weight coefficient of watching videos online > the weight coefficient of board games > the weight coefficient of browsing webpages > the weight of watching videos offline Coefficient > weight coefficient of online chat > weight coefficient of download, and the total weight coefficient is 1;

步骤5-2、根据用户需求确定复杂事件时间泛化段的权重系数,保证黎明的权重系数>晚上的权重系数>中午的权重系数>下午的权重系数>上午的权重系数,并且权重系数总数为1;Step 5-2. Determine the weight coefficient of the time generalization segment of complex events according to user needs, and ensure that the weight coefficient of dawn > the weight coefficient of evening > the weight coefficient of noon > the weight coefficient of afternoon > the weight coefficient of morning, and the total number of weight coefficients is 1;

步骤5-3、将产生子中的每个复杂事件类型权重系数、复杂事件时间泛化段权重系数与持续时间相乘求得评分,并将每个复杂事件评分求和获取该产生子的评分;Step 5-3: Multiply the weight coefficient of each complex event type in the generator, the weight coefficient of the complex event time generalization section and the duration to obtain the score, and sum the scores of each complex event to obtain the score of the generator ;

步骤5-4、若得分超过阈值,阈值取值范围为1~6,则保存,否则删除。Step 5-4. If the score exceeds the threshold, and the threshold ranges from 1 to 6, save it, otherwise delete it.

步骤5所述的采用基于显露模式的样本构建算法对频繁复杂事件产生子进行计算,获得显露模式的产生子并分类保存,具体为:In step 5, the sample construction algorithm based on the exposure pattern is used to calculate the generators of frequent complex events, and the generators of the exposure patterns are obtained and stored in categories, specifically:

步骤5-a、获取属于频繁网瘾复杂事件内而不属于频繁非网瘾复杂事件内的产生子;Step 5-a, obtaining the generators that belong to the frequent Internet addiction complex event but not the frequent non-Internet addiction complex event;

步骤5-b、获取不属于频繁网瘾复杂事件内而属于频繁非网瘾复杂事件内的产生子;Step 5-b. Obtain the generators that do not belong to the frequent Internet addiction complex event but belong to the frequent non-Internet addiction complex event;

步骤5-c、获取即属于频繁网瘾复杂事件内又属于频繁非网瘾复杂事件内的共同产生子,并确定其在频繁网瘾复杂事件中的支持度,即出现率,确定在频繁非网瘾复杂事件中的支持度;Step 5-c, obtain the co-producers that belong to both frequent Internet addiction complex events and frequent non-Internet addiction complex events, and determine their support degree in the frequent Internet addiction complex events, that is, the occurrence rate. The degree of support in complex events of Internet addiction;

步骤5-d、计算在频繁网瘾复杂事件中该共同产生子的增长率,即将其在频繁网瘾复杂事件中的支持度除以在频繁非网瘾复杂事件中的支持度;再计算在频繁非网瘾复杂事件中该共同产生子的增长率,即将其在频繁非网瘾复杂事件中的支持度除以在频繁网瘾复杂事件中的支持度;Step 5-d, calculate the growth rate of the co-producer in the frequent Internet addiction complex event, that is, divide its support degree in the frequent Internet addiction complex event by the support degree in the frequent non-Internet addiction complex event; The growth rate of the co-producer in the frequent non-Internet addiction complex event is to divide its support degree in the frequent non-Internet addiction complex event by the support degree in the frequent Internet addiction complex event;

步骤5-e、比较上述两个增长率的大小,保留增长率大于1的产生子,删除增长率小于1的产生子;Step 5-e, compare the size of the above two growth rates, retain the generator whose growth rate is greater than 1, and delete the generator whose growth rate is less than 1;

步骤5-f、将频繁网瘾复杂事件中的产生子取并集,将频繁非网瘾复杂事件中的产生子取并集,即获得显露模式的产生子并保存。Step 5-f: Union the generators of the frequent Internet addiction complex events, and union the generators of the frequent non-Internet addiction complex events, that is, obtain and save the generators of the exposure pattern.

步骤8所述的采用基于显露模式的网瘾检测算法分别计算其网瘾得分与非网瘾得分,得分score(C,Di)公式如下:The Internet addiction detection algorithm based on the exposure mode described in step 8 is used to calculate its Internet addiction score and non-Internet addiction score respectively, and the score score (C, D i ) formula is as follows:

ScoreScore (( CC ,, DD. ii || jj )) == ΣΣ Xx ∈∈ RSRS GrGr (( Xx ,, DD. jj || ii ,, DD. ii || jj )) GrGr (( Xx ,, DD. jj || ii ,, DD. ii || jj )) ++ MaxMax GrGr RSRS ×× SupSup cc (( Xx ))

++ ΣΣ YY ∈∈ WSWS 11 GrGr (( YY ,, DD. ii || jj ,, DD. jj || ii )) ++ MinMin GrGr WSWS ×× SupSup cc (( YY )) ,, (( ii ≠≠ jj )) -- -- -- (( 22 ))

其中:Di|j表示Di样本库或Dj样本库,Dj|i表示Dj样本库或Di样本库,i和j是样本库类标签号,X表示Di的显露模式的产生子,Y表示Dj的显露模式的产生子;RS是目标类的正向样本集合,即起积极作用的集合,WS是目标类的反向样本集合,即起消极作用的集合;Gr(X,Dj|i,Di|j)表示X从一个类到另一个类的增长率;MaxGrRS表示目标类集合中的最大增长率阈值,RS表示目标类的正向样本集合,MinGrWS表示非目标类集合中的最小增长率阈值,WS表示目标类的反向样本集合,C为待分类复杂事件,Supc(X)表示X在C中的支持度,Supc(Y)表示Y在C中的支持度。Among them: D i|j represents D i sample library or D j sample library, D j|i represents D j sample library or D i sample library, i and j are the class label numbers of sample library, X represents the exposure mode of D i Generator, Y represents the generator of the exposure pattern of D j ; RS is the positive sample set of the target class, that is, the set that plays an active role; WS is the reverse sample set of the target class, that is, the set that plays a negative role; Gr( X,D j|i ,D i|j ) represents the growth rate of X from one class to another class; MaxGr RS represents the maximum growth rate threshold in the target class set, RS represents the positive sample set of the target class, MinGr WS Represents the minimum growth rate threshold in the non-target class set, WS represents the reverse sample set of the target class, C is the complex event to be classified, Sup c (X) represents the support of X in C, Sup c (Y) represents Y Support in C.

本发明优点:Advantages of the present invention:

本发明一种基于用户计算机交互事件的网瘾检测装置及方法,通过人们常用的上网工具(如台式计算机和笔记本电脑等),采集可量化的人机交互操作数据,并利用这些数据计算分析用户上网行为,从而检测出用户是否罹患网瘾,并对该上网工具进行有效控制;本专利检测网瘾的正确率可高达85%以上,有效避免了现有检测方法的失误,提高检测的准确度;本发明还可降低检测成本,用户可随时进行检测,对于中小学生应用价值高,有效预防并控制网瘾行为,减少网瘾伤害。A device and method for detecting Internet addiction based on user computer interaction events in the present invention collects quantifiable human-computer interaction operation data through commonly used Internet tools (such as desktop computers and notebook computers), and uses these data to calculate and analyze user Internet behavior, so as to detect whether the user suffers from Internet addiction, and effectively control the Internet tool; the correct rate of detecting Internet addiction in this patent can be as high as 85%, which effectively avoids the mistakes of existing detection methods and improves the accuracy of detection The invention can also reduce the detection cost, and the user can carry out the detection at any time. It has high application value for primary and middle school students, effectively prevents and controls Internet addiction behavior, and reduces Internet addiction damage.

附图说明Description of drawings

图1为本发明一种实施例的装置结构图;Fig. 1 is a device structure diagram of an embodiment of the present invention;

图2为本发明一种实施例的网瘾检测装置进行检测的方法流程图;Fig. 2 is the flow chart of the method for detection by the Internet addiction detection device of an embodiment of the present invention;

图3为本发明一种实施例的等价类和产生子示意图;Fig. 3 is a schematic diagram of an equivalence class and a generator of an embodiment of the present invention;

图4为本发明一种实施例的效率测试结果图,其中,图(a)为运行时间坐标图,图(b)为内存空间坐标图;Fig. 4 is an efficiency test result graph of an embodiment of the present invention, wherein, graph (a) is a running time coordinate graph, and graph (b) is a memory space coordinate graph;

图5为本发明一种实施例的有效性测试结果图,其中,图(a)为正确率坐标图,图(b)为漏诊率坐标图,图(c)为误诊率坐标图。Fig. 5 is a diagram of the effectiveness test results of an embodiment of the present invention, wherein, diagram (a) is a coordinate diagram of correct rate, diagram (b) is a coordinate diagram of missed diagnosis rate, and diagram (c) is a coordinate diagram of misdiagnosis rate.

具体实施方式Detailed ways

一种基于用户计算机交互事件的网瘾检测装置,如图1所示,包括简单事件采集模块、复杂事件生成模块、时间泛化及归一化处理模块、频繁行为产生子获取模块、样本库构建模块、数据处理与检测模块和预警干预模块,其中,An Internet addiction detection device based on user computer interaction events, as shown in Figure 1, includes a simple event collection module, a complex event generation module, a time generalization and normalization processing module, a frequent behavior generation sub-acquisition module, and a sample library construction module, data processing and detection module and early warning intervention module, among which,

简单事件采集模块:用于采集用户与计算机交互变量作为简单事件,包括计算机CPU使用率、内存使用率、单位时间点击鼠标左键次数、单位时间点击鼠标右键次数、单位时间键盘敲击次数、单位时间鼠标移动的像素个数、网络流量和运行进程,并将上述输入变量发送至复杂事件生成模块;Simple event collection module: used to collect user-computer interaction variables as simple events, including computer CPU usage rate, memory usage rate, number of clicks of the left mouse button per unit time, number of times of right mouse clicks per unit time, number of keyboard strokes per unit time, unit The number of pixels moved by the time mouse, network traffic and running process, and send the above input variables to the complex event generation module;

其中,in,

CPU为计算机核心硬件部件,当用户运行不同应用程序时CPU使用率实时变化,相应地,用户的不同计算机操作行为,使其CPU使用率对应着不同的数值范围,同时,该指标还可用于进一步推断用户复杂事件的类型。在本发明实施例中,通过计算机中的任务管理器监测当前CPU使用率。The CPU is the core hardware component of the computer. When the user runs different applications, the CPU usage rate changes in real time. Correspondingly, the user's different computer operation behaviors make the CPU usage rate correspond to different value ranges. At the same time, this indicator can also be used to further Infers the type of user complex events. In the embodiment of the present invention, the current CPU usage is monitored through the task manager in the computer.

内存使用率的监测与CPU使用率的监测作用相似,都可用于判断用户操作计算机的行为,同样地,该指标也要被用来推理用户行为的复杂事件类型。在本发明实施例中,通过计算机中的任务管理器监测当前内存使用率。The monitoring of memory usage is similar to the monitoring of CPU usage, both of which can be used to judge the behavior of users operating computers. Similarly, this indicator should also be used to infer complex event types of user behavior. In the embodiment of the present invention, the current memory usage is monitored through the task manager in the computer.

单位时间内点击鼠标左键次数,不同计算机操作行为在该指标上的数据差异非常明显,例如,当用户进行较大规模的游戏操作时,单位时间内鼠标点击次数相当多,据统计,可达每分钟百次。相比之下,当进行观看影片、视频等操作时只有每分钟几次,因此该指标可以明显区分用户高级语义行为。在本发明实施例中,通过实时监听鼠标点击事件来获取单位时间内鼠标点击的次数。The number of clicks of the left mouse button per unit time. The data difference of different computer operation behaviors on this indicator is very obvious. For example, when the user performs a large-scale game operation, the number of mouse clicks per unit time is quite large. Hundreds of times per minute. In contrast, when watching movies, videos, etc., there are only a few times per minute, so this indicator can clearly distinguish the user's high-level semantic behavior. In the embodiment of the present invention, the number of mouse clicks per unit time is obtained by monitoring mouse click events in real time.

单位时间内键盘敲击次数的记录与鼠标点击次数的记录基本一致,对于大量需要键盘操作的行为可以显著表示出来,该指标亦是推理具有差异化复杂事件的重要指标之一。在本发明实施例中,通过实时监听键盘敲击事件来获取单位时间内键盘敲击的次数。The record of the number of keyboard strokes per unit time is basically the same as the record of the number of mouse clicks, and it can be significantly expressed for a large number of behaviors that require keyboard operations. This indicator is also one of the important indicators for reasoning about complex events with differences. In the embodiment of the present invention, the number of keyboard strokes per unit time is obtained by monitoring keyboard stroke events in real time.

计算机连接网络之后的网络流量(包括上行流量和下行流量),可以有效区分上网行为与本机行为。当用户频繁使用网络时,上行或者下行流量数据远远高于本机行为,这对判断用户是否进行上网行为具有重要意义。在本发明实施例中,通过调用命令提示符中以太网的统计信息,来获取计算机的网络流量。The network traffic (including uplink traffic and downlink traffic) after the computer is connected to the network can effectively distinguish the online behavior from the local machine behavior. When a user frequently uses the network, the uplink or downlink traffic data is much higher than the local behavior, which is of great significance for judging whether the user is going online. In the embodiment of the present invention, the network flow of the computer is acquired by invoking the statistical information of the Ethernet in the command prompt.

计算机中正在运行的所有程序都会在计算机任务管理器中显示出来,但很多进程与网瘾无关,比如计算机操作系统进程等,可以不用监测。而与网瘾有关的一些典型进程,比如,浏览器进程(IE浏览器进程、Google浏览器进程、搜狗浏览器进程等)、游戏进程(帝国时代游戏、三国杀游戏等)、线下看视频软件(Windows多媒体播放器),即时通讯软件(QQ、Skype、MSN等)都需要重点监测。同时,不同网瘾用户其运行的程序也可能不同,例如,对于游戏网瘾用户,其可能运行更多的游戏进程,所以监测此项可以更准确判定用户高级复杂行为事件。All the programs running in the computer will be displayed in the computer task manager, but many processes have nothing to do with Internet addiction, such as computer operating system processes, etc., which do not need to be monitored. And some typical processes related to Internet addiction, such as browser process (IE browser process, Google browser process, Sogou browser process, etc.), game process (Age of Empires game, Three Kingdoms game, etc.), offline video watching software (Windows multimedia player), instant messaging software (QQ, Skype, MSN, etc.) need to focus on monitoring. At the same time, different Internet addiction users may run different programs. For example, for game Internet addiction users, they may run more game processes, so monitoring this item can more accurately determine the user's advanced and complex behavior events.

复杂事件生成模块:用于将每个简单事件分别与各自阈值进行比较,若大于等于阈值范围,则保留该简单事件,再根据所保留的全部简单事件组合情况生成复杂事件;否则继续采集简单事件;所述的复杂事件包括离线看视频、在线看视频、即时战略游戏、桌游类游戏、浏览网页、在线聊天和下载操作;Complex event generation module: used to compare each simple event with its respective threshold, if it is greater than or equal to the threshold range, then keep the simple event, and then generate complex events according to the combination of all the retained simple events; otherwise, continue to collect simple events ; The complex events include watching videos offline, watching videos online, real-time strategy games, board games, browsing web pages, online chatting and downloading operations;

时间泛化及归一化处理模块:用于将每个用户某天所做的复杂事件进行整理,并将复杂事件发生时间进行泛化处理,即将一天划分为若干个时间段;将持续时间进行归一化处理,即将复杂事件的持续时间进行圆整操作;并将处理后的复杂事件发送至频繁行为产生子获取模块;Time generalization and normalization processing module: used to sort out the complex events that each user does on a certain day, and generalize the occurrence time of complex events, that is, to divide a day into several time periods; Normalization processing, that is, rounding the duration of complex events; and sending the processed complex events to the frequent behavior generation sub-acquisition module;

频繁行为产生子获取模块:当建立样本库时,用于获取多个网瘾用户之间所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁网瘾复杂事件,再获得由上述频繁网瘾复杂事件所构成的集合用于描述用户行为特征分布的产生子,并发送至样本库构建模块;用于获取多个非网瘾用户之间所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁非网瘾复杂事件,再获得由上述频繁非网瘾复杂事件所构成集合的用于描述用户行为特征分布的产生子,并发送至样本库构建模块;Frequent behavior generation sub-acquisition module: when establishing a sample library, it is used to obtain the same complex events between multiple Internet addicted users, collect the generation times of the above-mentioned same complex events, and select out according to the number threshold set by the user The complex events greater than the threshold are regarded as frequent Internet addiction complex events, and then a set composed of the above-mentioned frequent Internet addiction complex events is obtained to describe the generator of the distribution of user behavior characteristics, and sent to the sample library construction module; used to obtain multiple non- For the same complex events between Internet addicted users, collect the occurrence times of the above-mentioned same complex events, and select the complex events greater than the threshold as frequent non-Internet addiction complex events according to the number threshold set by the user, and then obtain the above frequent A generator for describing the distribution of user behavior characteristics, which is a collection of non-Internet addiction complex events, and sent to the sample library construction module;

当检测用户网瘾状况时,用于获取待测用户一段时间之内所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁复杂事件,再获得由上述频繁复杂事件所构成集合的用于描述行为特征分布的产生子,并发送至数据处理与检测模块;When detecting the user's Internet addiction status, it is used to obtain the same complex events performed by the user to be tested within a period of time, collect the number of occurrences of the above-mentioned same complex events, and select complex events greater than the threshold according to the number of times threshold set by the user As a frequent complex event, obtain a generator for describing the distribution of behavioral characteristics composed of the above frequent complex events, and send it to the data processing and detection module;

样本库构建模块:用于采用基于产生子的样本构建算法对频繁网瘾复杂事件产生子进行计算得分,若得分超过阈值,则保存为网瘾类,否则删除,或采用EPBSBA算法(基于显露模式的样本构建算法)对频繁复杂事件产生子进行计算,获得显露模式的产生子并分类保存;Sample library construction module: it is used to calculate the score of frequent Internet addiction complex event generators using the generator-based sample construction algorithm. If the score exceeds the threshold, it will be saved as Internet addiction, otherwise it will be deleted, or the EPBSBA algorithm (based on the exposure mode The sample construction algorithm) calculates the generators of frequent and complex events, obtains the generators of the revealed patterns and saves them by category;

数据处理与检测模块:用于将待检测频繁行为产生子与网瘾样本库进行比较,若待检测频繁复杂事件产生子包含在样本库中,则判定其为网瘾,否则是非网瘾,并输出判断结果;或将待检测频繁行为产生子与样本库进行比较,采用EPBPDA算法(基于显露模式的网瘾检测算法)分别计算其网瘾得分与非网瘾得分,根据得分高低判断其所属类别;Data processing and detection module: used to compare the frequent behavior generators to be detected with the Internet addiction sample library, if the frequent complex event generators to be detected are included in the sample library, it is determined to be Internet addiction, otherwise it is not Internet addiction, and Output judgment results; or compare frequent behavior generators to be detected with the sample library, use EPBPDA algorithm (Internet addiction detection algorithm based on exposure mode) to calculate their Internet addiction scores and non-Internet addiction scores, and judge their category according to the score ;

预警干预模块:用于当检测到用户计算机交互行为是网瘾行为时,该装置会发出预警提示,告诫用户已具有网瘾症状,并通过定时器限定该用户可继续使用计算机的时间范围,一般是1-2小时;一旦超过该时限范围,就会进一步启动软件装置关闭计算机,以阻止该用户使用计算机,从而实现对网瘾的有效干预。Early warning intervention module: when it is detected that the user's computer interaction behavior is an Internet addiction behavior, the device will issue an early warning prompt to warn the user that he has symptoms of Internet addiction, and limit the time range for the user to continue using the computer through a timer. It is 1-2 hours; once the time limit is exceeded, the software device will be further started to shut down the computer, so as to prevent the user from using the computer, so as to effectively intervene in Internet addiction.

采用基于用户计算机交互事件的网瘾检测装置进行检测的方法,流程图如图2所示,包括以下步骤:The method for detecting an Internet addiction detection device based on user computer interaction events, as shown in Figure 2, comprises the following steps:

步骤1、采集多个网瘾用户和多个非网瘾用户与计算机之间的交互变量作为简单事件,如表1所示,包括计算机CPU使用率、内存使用率、单位时间左点击鼠标次数、单位时间右点击鼠标次数、单位时间键盘敲击次数、单位时间鼠标移动的像素个数、网络流量和运行进程;Step 1. Collect the interaction variables between multiple Internet addict users and multiple non-Internet addict users and the computer as simple events, as shown in Table 1, including computer CPU usage, memory usage, left mouse click times per unit time, The number of right mouse clicks per unit time, the number of keyboard strokes per unit time, the number of pixels moved by the mouse per unit time, network traffic and running processes;

表1Table 1

Figure BDA0000368918850000101
Figure BDA0000368918850000101

步骤2、将每个简单事件分别与各自阈值进行比较,若大于等于阈值范围,则保留该简单事件,再根据所保留的全部简单事件组合情况生成复杂事件;否则继续采集简单事件;所述的复杂事件包括离线看视频、在线看视频、即时战略游戏、桌游类游戏、浏览网页、在线聊天和下载操作;Step 2. Compare each simple event with its respective threshold. If it is greater than or equal to the threshold range, then keep the simple event, and then generate a complex event according to the combination of all the retained simple events; otherwise, continue to collect simple events; Complex events include watching videos offline, watching videos online, real-time strategy games, board games, browsing the web, chatting online, and downloading operations;

复杂事件的类型如表2所示,The types of complex events are shown in Table 2,

表2Table 2

Figure BDA0000368918850000102
Figure BDA0000368918850000102

Figure BDA0000368918850000111
Figure BDA0000368918850000111

生成复杂事件的过程如下:The process of generating a complex event is as follows:

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,以及上网流量超过或等于阈值范围35~45MB,则生成一个在线看视频的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the Internet traffic exceeds or equals to the threshold range of 35 to 45MB, a complex event of watching videos online is generated. ;

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,以及运行进程为多媒体播放器,则生成一个离线看视频的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the running process is a multimedia player, a complex event of watching videos offline is generated;

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间点击鼠标右键次数超过或等于阈值范围10~20次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,单位时间键盘敲击次数超过或等于阈值范围80~120次,以及运行进程为及时战略游戏,则生成一个玩即时战略游戏的复杂事件;If the CPU usage exceeds or is equal to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the number of clicks of the left mouse button per unit time exceeds or equals to the threshold range 25 to 35 times, click the mouse per unit time The number of right clicks exceeds or equals the threshold range of 10 to 20 times, the number of pixels moved by the mouse per unit time exceeds or equals the threshold range of 1550 to 1650, the number of keyboard strokes per unit time exceeds or equals the threshold range of 80 to 120 times, and the running process is For real-time strategy games, generate a complex event for playing real-time strategy games;

若CPU使用率超过或等于阈值范围45%~55%,内存占用率超过或等于阈值范围55%~65%,单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,以及运行进程为桌游类游戏,则生成一个玩桌游类游戏的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the memory usage exceeds or equals to the threshold range of 55% to 65%, and the number of clicks of the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, the mouse moves per unit time. If the number of pixels exceeds or is equal to the threshold range of 1550 to 1650, and the running process is a board game, a complex event of playing a board game is generated;

若CPU使用率超过或等于阈值范围45%~55%,单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,以及运行进程为桌游类游戏,则生成另一个玩桌游类游戏的复杂事件;If the CPU usage exceeds or equals to the threshold range of 45% to 55%, the number of clicks of the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, and the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, And if the running process is a board game, another complex event of playing a board game is generated;

若单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,以及运行进程为浏览器,则生成一个浏览网页的复杂事件;If the number of clicks on the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, and the running process is a browser, a complex page browsing process is generated. event;

若单位时间点击鼠标左键次数超过或等于阈值范围25~35次,单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,上网流量超过或等于阈值范围35~45MB,以及运行进程为浏览器,则生成另一个浏览网页的复杂事件;If the number of clicks on the left mouse button per unit time exceeds or equals to the threshold range of 25 to 35 times, the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, the Internet traffic exceeds or equals to the threshold range of 35 to 45MB, and the running process For the browser, another complex event for browsing the web page is generated;

若单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,上网流量超过或等于阈值范围35~45MB,以及运行进程为浏览器,则生成又一个浏览网页的复杂事件;If the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550 to 1650, the Internet traffic exceeds or equals to the threshold range of 35 to 45MB, and the running process is a browser, another complex event of web browsing is generated;

若单位时间键盘敲击次数超过或等于阈值范围80~120次,以及运行进程为即时聊天软件,则生成一个网上聊天的复杂事件;If the number of keystrokes per unit time exceeds or is equal to the threshold range of 80 to 120 times, and the running process is instant chat software, a complex event of online chat will be generated;

若单位时间鼠标移动的像素个数超过或等于阈值范围1550~1650个,单位时间键盘敲击次数超过或等于阈值范围80~120次,以及运行进程为即时聊天软件,则生成另一个网上聊天的复杂事件;If the number of pixels moved by the mouse per unit time exceeds or equals the threshold range of 1550 to 1650, the number of keystrokes per unit time exceeds or equals the threshold range of 80 to 120 times, and the running process is instant chat software, another online chat will be generated complex events;

若上网流量超过或等于阈值范围35~45MB,则生成一个下载资料的复杂事件;If the Internet traffic exceeds or is equal to the threshold range of 35-45MB, a complex event of downloading data will be generated;

例如,当检测到以下简单事件:CPU使用率大于50%,内存使用率大于60%,每分内鼠标点击次数100次/分钟(根据鼠标左击、右击次数以及鼠标移动的像素数综合计算后得出),单位时间内键盘敲击次数大于90次/分钟,监测到正在运行的应用程序为三国杀游戏sanguosha.exe,网络流量大于40MB/分钟,则生成一个玩即时战略游戏的复杂事件。For example, when the following simple events are detected: the CPU usage is greater than 50%, the memory usage is greater than 60%, and the number of mouse clicks per minute is 100 times/minute (comprehensively calculated based on the number of left and right mouse clicks and the number of pixels moved by the mouse) obtained later), the number of keystrokes per unit time is greater than 90 times/minute, and the running application is detected to be the game sanguosha.exe, and the network traffic is greater than 40MB/minute, a complex event of playing a real-time strategy game is generated.

步骤3、将复杂事件发生时间进行泛化处理,即将一天划分为6:00~11:00、11:00~14:00、14:00~18:00、18:00~23:00、23:00~6:00五个时间段,并将持续时间进行归一化处理,即将复杂事件的持续时间以10分钟为单位进行圆整操作;Step 3. Generalize the occurrence time of complex events, that is, divide a day into 6:00~11:00, 11:00~14:00, 14:00~18:00, 18:00~23:00, 23 :00~6:00 five time periods, and the duration is normalized, that is, the duration of complex events is rounded up in units of 10 minutes;

在复杂事件生成过程中,由于事件发生时刻的多样性,会产生大量不同的时间点,导致过多复杂事件的产生,即复杂事件碎片化问题。事实上,很多发生时间不同的事件在语义上并无本质区别,比如,“上午8点上网47分钟”和“上午9点上网52分钟”,都是上午上网,开始时间差不多,持续时间也相近,因此从语义合理性角度,可以将这两个复杂事件看作是一个事件,这就是复杂事件的时间泛化处理操作。表3列举出了泛化处理后的发生时间范围。In the process of complex event generation, due to the diversity of event occurrence times, a large number of different time points will be generated, resulting in the generation of too many complex events, that is, the problem of fragmentation of complex events. In fact, there is no essential difference in the semantics of many events that occur at different times. For example, "47 minutes online at 8 am" and "52 minutes online at 9 am" are both online in the morning, with similar start times and similar durations , so from the perspective of semantic rationality, these two complex events can be regarded as one event, which is the time generalization processing operation of complex events. Table 3 lists the occurrence time range after generalization processing.

Figure BDA0000368918850000121
Figure BDA0000368918850000121

Figure BDA0000368918850000131
Figure BDA0000368918850000131

从表3中可以看出,将一天泛化成5个层次段,分别是:早晨时段:早上6点到上午11点;中午时段:上午11点到下午2点;下午时段:下午2点到晚上6点;晚上时段:晚上6点到夜里11点;黎明时段:夜里11点到第二天凌晨6点。采用该泛化策略可以有效减少时间碎片化,基于以上方法,只要事件发生时刻位于其中某个泛化层次,就将其归于为某个时间段。As can be seen from Table 3, a day is generalized into five levels, namely: morning period: 6:00 am to 11:00 am; noon period: 11:00 am to 2:00 pm; afternoon period: 2:00 pm to evening 6 o'clock; evening time: 6:00 pm to 11:00 pm; dawn time: 11:00 pm to 6:00 am the next day. Using this generalization strategy can effectively reduce time fragmentation. Based on the above method, as long as the event occurs at a certain generalization level, it will be attributed to a certain time period.

此外,考虑到持续时间同样具有多样性问题,即47分钟和52分钟实际上相差并不明显,因此当发生时刻相同时,可以将它们看作同一个事件,这就需要对持续时间进行归一化操作,即通过圆整(Round)技术并根据四舍五入原则将持续时间规范化为整数形式,此时,47分钟和52分钟就都圆整为50分钟,方便了复杂事件的进一步处理。In addition, considering that the duration also has the problem of diversity, that is, the difference between 47 minutes and 52 minutes is not obvious, so when the time of occurrence is the same, they can be regarded as the same event, which requires normalization of the duration The rounding operation is to normalize the duration into an integer form according to the rounding principle. At this time, both 47 minutes and 52 minutes are rounded to 50 minutes, which facilitates the further processing of complex events.

本发明实施例中,一分钟内监测到用户计算机CPU使用率大于60%,内存占用率大于50%,正在运行的程序是Windows多媒体播放器程序,则该用户正在离线看视频。如果在中午12:00~12:46这46分钟的时间内连续推测出用户离线看视频,那么利用复杂事件泛化和归一化处理方法,便产生了一个复杂事件,即中午离线看视频50分钟,记做Bc50。In the embodiment of the present invention, it is detected that the CPU usage rate of the user's computer is greater than 60%, the memory usage rate is greater than 50%, and the running program is a Windows multimedia player program, then the user is watching videos offline. If it is inferred that the user watched the video offline continuously during the 46 minutes from 12:00 to 12:46 noon, then a complex event will be generated by using the complex event generalization and normalization processing method, that is, watching the video offline at noon 50 Minutes, recorded as Bc50.

步骤4、获取多个网瘾用户之间所进行的相同复杂事件和多个非网瘾用户之间所进行的相同复杂事件,采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁网瘾复杂事件和频繁非网瘾复杂事件,再获得由上述频繁网瘾复杂事件所构成集合的用于描述用户行为特征分布的多个产生子,获得由上述频繁非网瘾复杂事件所构成集合的用于描述用户行为特征分布的多个产生子,并发送至样本库构建模块;Step 4. Acquire the same complex events between multiple Internet addicted users and the same complex events between multiple non-Internet addicted users, collect the number of occurrences of the above-mentioned same complex events, and set the threshold according to the number of times set by the user , select complex events greater than the threshold as frequent Internet addiction complex events and frequent non-Internet addiction complex events, and then obtain multiple generators for describing the distribution of user behavior characteristics composed of the above-mentioned frequent Internet addiction complex events, and obtain by Multiple generators used to describe the distribution of user behavior characteristics formed by the above-mentioned frequent non-internet addiction complex events are sent to the sample library construction module;

本发明实施例中,以采集网瘾用户为例,采集10名有网瘾的用户的某天计算机交互行为简单事件数据后,通过复杂事件推理得出行为序列如下:In the embodiment of the present invention, taking the collection of Internet addicted users as an example, after collecting the simple event data of computer interaction behaviors of 10 Internet addicted users on a certain day, the behavior sequence is obtained through complex event reasoning as follows:

网瘾用户1:{Bc60,Ce80,Ae70,Ed80,Be100,Ab100,Eb30};Internet addiction user 1: {Bc60, Ce80, Ae70, Ed80, Be100, Ab100, Eb30};

网瘾用户2:{Bc60,Ce80,Be100,Bb40,Ga20,Eb30,Ac20};Internet addiction user 2: {Bc60, Ce80, Be100, Bb40, Ga20, Eb30, Ac20};

网瘾用户3:{Bc60,Ce80,Ae70,Ed80,Da30,Bb40};Internet addiction user 3: {Bc60, Ce80, Ae70, Ed80, Da30, Bb40};

网瘾用户4:{Ae70,Bc60,Ed80,Be100,Ab100};Internet addiction user 4: {Ae70, Bc60, Ed80, Be100, Ab100};

网瘾用户5:{Ae70,Bc60,Ed80,Be100,Ab100,Ga20,Eb30};Internet addiction user 5: {Ae70, Bc60, Ed80, Be100, Ab100, Ga20, Eb30};

网瘾用户6:{Ae70,Ed80,Da30,Bb40,Fa20};Internet addiction user 6: {Ae70, Ed80, Da30, Bb40, Fa20};

网瘾用户7:{Be100,Ab100,Bc60,Ce80,Da30};Internet addiction user 7: {Be100, Ab100, Bc60, Ce80, Da30};

网瘾用户8:{Be100,Ab100,Ga20,Ad60};Internet addiction user 8: {Be100, Ab100, Ga20, Ad60};

网瘾用户9:{Ab100,Bc60,Bb40,Fa20,Ad60};Internet addiction user 9: {Ab100, Bc60, Bb40, Fa20, Ad60};

网瘾用户10:{Be100,Ed80,Da30,Eb30,Ac20}。Internet addiction user 10: {Be100, Ed80, Da30, Eb30, Ac20}.

利用频繁行为产生子获取算法求出上述10名有网瘾的用户某天计算机交互行为序列对应的产生子(Generator)如下:Use the frequent behavior generator acquisition algorithm to find the generator (Generator) corresponding to the computer interaction behavior sequence of the above 10 Internet addicted users on a certain day as follows:

产生子G1:<Ab100,Be100>Generate child G1: <Ab100,Be100>

产生子G2:<Ab100,Bc60>Producer G2: <Ab100,Bc60>

产生子G3:<Ed80,Ae70>Generate child G3: <Ed80,Ae70>

产生子G4:<Ce80,Bc60>Produces child G4: <Ce80,Bc60>

产生子G5:<Ed80,Ab100>Generate child G5: <Ed80, Ab100>

产生子G6:<Ae70,Be100>Generate child G6: <Ae70,Be100>

产生子G7:<Ae70,Bc60>Generate child G7: <Ae70,Bc60>

产生子G8:<Ae70,Ab100>Produced child G8: <Ae70, Ab100>

产生子G9:<Ce80,Be100>Produce child G9: <Ce80,Be100>

产生子G10:<Be100>Generate child G10: <Be100>

产生子G11:<Bc60>Generate child G11: <Bc60>

产生子G12:<Ab100>Produced child G12: <Ab100>

产生子G13:<Ed80>Generate child G13: <Ed80>

产生子G14:<Ae70>Spawn child G14: <Ae70>

产生子G15:<Ce80>Generate child G15: <Ce80>

获取产生子方法如下:Get the sub method as follows:

设定输入为D和MinSup,其中,D代表规范化的复杂事件数据库,MinSup是要获取的频繁行为产生子的支持度阈值(用户根据需要自行设定),输出结果为FG(即频繁行为产生子集合)。Set the input as D and MinSup, where D represents the standardized complex event database, MinSup is the support threshold of the frequent behavior generator to be obtained (set by the user according to the needs), and the output result is FG (that is, the frequent behavior generator gather).

频繁行为产生子的获取步骤如下:The steps to obtain frequent behavior generators are as follows:

1)扫描复杂事件数据库D中的每一项,生成项集集合F,同时记录每一项在数据库中出现的次数(即支持度),并按支持度降序排列项集集合F。1) Scan each item in the complex event database D to generate an item set F, record the number of occurrences of each item in the database (that is, support), and arrange the item set F in descending order of support.

2)对项集集合F构建FP-Tree树。2) Construct an FP-Tree tree for the itemset set F.

3)从具有最小支持度的项开始访问(即按照项集集合F中的项集的逆序来访问),扫描复杂事件数据库并建立其条件数据库。3) Start accessing from the item with the minimum support (that is, access according to the reverse order of the itemsets in the itemset set F), scan the complex event database and establish its condition database.

4)选取条件数据库中的每一项及其幂集,根据产生子定义来判断其是否是产生子。如果是,再进一步判断是否有支持度与这个产生子的支持度相同的真子集,如果存在,则删除此产生子,否则保留此产生子并存入频繁行为产生子集合FG,最终输出FG。4) Select each item in the conditional database and its power set, and judge whether it is a generator according to the definition of the generator. If yes, further judge whether there is a proper subset with the same support as the generator, if it exists, delete the generator, otherwise keep the generator and store it in the frequent behavior generation subset FG, and finally output FG.

针对于产生子的说明如下:The instructions for generating children are as follows:

在相同复杂事件序列中出现且出现次数相同的复杂事件的所有组合,称为频繁复杂事件等价类。产生子则是该等价类的最简单表示。如果产生子行为发生,就意味着在其所属等价类中的所有伴随行为都会发生,因此只用产生子来表示其对应的等价类即可。All combinations of complex events that appear in the same sequence of complex events with the same number of occurrences are called frequent complex event equivalence classes. A producer is the simplest representation of this equivalence class. If the behavior of creating a child occurs, it means that all accompanying behaviors in the equivalence class to which it belongs will occur, so it is only necessary to use the child to represent its corresponding equivalence class.

表4Table 4

Figure BDA0000368918850000151
Figure BDA0000368918850000151

表5table 5

Figure BDA0000368918850000152
Figure BDA0000368918850000152

假设a和c分别表示9点上网打游戏60分钟,11点上网购物40分钟。等价类{a,ac}中,a为ac的子行为,因此可以说只要a发生,ac就一定发生,那么ac就可用a表示,于是等价类就只用a表示即可。Assume that a and c represent 60 minutes of online gaming at 9:00 and 40 minutes of online shopping at 11:00, respectively. In the equivalence class {a, ac}, a is a sub-behavior of ac, so it can be said that as long as a occurs, ac must occur, then ac can be represented by a, so the equivalence class can only be represented by a.

表5中的所有等价类如图3所示。在图3中,黑色框线部分展示出了频繁程度为2的等价类,该等价类包含的行为集合为{ab,ae,abc,abe,ace,abce},它所对应的产生子为{ab,ae},它表示只要ab和ae这两联合行为发生就可以显著代表集合中所有行为发生,后续处理只要针对此产生子对应行为即可,而不必处理等价类中的所有行为。All equivalence classes in Table 5 are shown in Figure 3. In Figure 3, the black box line shows the equivalence class with a frequency of 2. The behavior set contained in this equivalence class is {ab, ae, abc, abe, ace, abce}, and its corresponding generator It is {ab, ae}, which means that as long as the two joint behaviors of ab and ae occur, it can significantly represent all the behaviors in the set. Subsequent processing only needs to generate sub-corresponding behaviors for this, instead of dealing with all behaviors in the equivalence class .

步骤5、采用基于产生子的样本构建算法对频繁网瘾复杂事件产生子进行计算得分,若得分超过阈值,则分类保存,否则删除,并将频繁非网瘾复杂事件产生子进行保存;或采用EPBSBA算法对频繁复杂事件产生子进行计算,获得显露模式的产生子并分类保存;Step 5. Use the generator-based sample construction algorithm to calculate the score of frequent Internet addiction complex event generators. If the score exceeds the threshold, save it by classification, otherwise delete it, and save the frequent non-Internet addiction complex event generators; or use The EPBSBA algorithm calculates the generators of frequent and complex events, and obtains the generators of the revealed patterns and saves them by category;

采用基于产生子的样本构建算法对频繁网瘾复杂事件产生子进行计算得分,具体为:The generator-based sample construction algorithm is used to calculate the score of frequent Internet addiction complex event generators, specifically:

步骤5-1、根据用户需求确定复杂事件类型的权重系数,保证即时战略游戏的权重系数>在线看视频的权重系数>桌游类游戏的权重系数>浏览网页的权重系数>离线看视频的权重系数>在线聊天的权重系数>下载的权重系数,并且权重系数总数为1;Step 5-1. Determine the weight coefficient of complex event types according to user needs, and ensure that the weight coefficient of real-time strategy games > the weight coefficient of watching videos online > the weight coefficient of board games > the weight coefficient of browsing webpages > the weight of watching videos offline Coefficient > weight coefficient of online chat > weight coefficient of download, and the total weight coefficient is 1;

本发明实施例中,复杂事件类型权重系数如表6所示:In the embodiment of the present invention, the complex event type weight coefficients are shown in Table 6:

表6Table 6

Figure BDA0000368918850000161
Figure BDA0000368918850000161

步骤5-2、根据用户需求确定复杂事件时间泛化段的权重系数,保证黎明的权重系数>晚上的权重系数>中午的权重系数>下午的权重系数>上午的权重系数,并且权重系数总数为1;Step 5-2. Determine the weight coefficient of the time generalization segment of complex events according to user needs, and ensure that the weight coefficient of dawn > the weight coefficient of evening > the weight coefficient of noon > the weight coefficient of afternoon > the weight coefficient of morning, and the total number of weight coefficients is 1;

本发明实施例中,复杂事件时间泛化段权重系数如表7所示:In the embodiment of the present invention, the weight coefficients of the complex event time generalization section are shown in Table 7:

表7Table 7

Figure BDA0000368918850000171
Figure BDA0000368918850000171

步骤5-3、将产生子中的每个复杂事件类型权重系数、复杂事件时间泛化段权重系数与持续时间相乘求得评分,并将每个复杂事件评分求和获取该产生子的评分;Step 5-3: Multiply the weight coefficient of each complex event type in the generator, the weight coefficient of the complex event time generalization section and the duration to obtain the score, and sum the scores of each complex event to obtain the score of the generator ;

一个用户的频繁计算机交互行为可由多个频繁行为产生子表示,但这些产生子并不都具有网瘾特征,因此需要进一步筛选出网瘾行为产生子,以便分类网瘾行为。A user's frequent computer interaction behavior can be represented by multiple frequent behavior generators, but not all of these generators have the characteristics of Internet addiction, so it is necessary to further screen out the Internet addiction behavior generators in order to classify Internet addiction behavior.

该算法采用打分方式来选择网瘾行为产生子,其打分公式如公式(1)所示,其中,参数

Figure BDA0000368918850000172
分别代表第i个行为产生子集合中第k个复杂事件的行为类型、发生时间和持续时间,这与前面的复杂事件含义相同,并由用户给出这些参数对应的权重系数;The algorithm uses scoring method to select Internet addiction behavior generators, and its scoring formula is shown in formula (1), where the parameter
Figure BDA0000368918850000172
respectively represent the behavior type, occurrence time and duration of the kth complex event in the i-th behavior generation subset, which have the same meaning as the previous complex event, and the weight coefficients corresponding to these parameters are given by the user;

ScoreScore (( GG ii )) == &Sigma;&Sigma; kk == 11 nno weightweight (( Typetype kk ii )) &times;&times; weightweight (( tt kk ii )) &times;&times; durdur kk ii -- -- -- (( 11 ))

其中,Gi为第i个产生子,n为Gi所包含的复杂事件数,

Figure BDA0000368918850000174
为Gi中第k个复杂事件的事件类型权重系数;
Figure BDA0000368918850000175
为Gi中第k个复杂事件的事件时间泛化段权重系数;
Figure BDA0000368918850000176
为Gi中第k个复杂事件的持续时间。Among them, G i is the i-th generator, n is the number of complex events contained in G i ,
Figure BDA0000368918850000174
is the event type weight coefficient of the kth complex event in G i ;
Figure BDA0000368918850000175
is the event time generalization segment weight coefficient of the kth complex event in G i ;
Figure BDA0000368918850000176
is the duration of the kth complex event in G i .

步骤5-4、若得分超过阈值4,则保存,否则删除。Step 5-4, if the score exceeds the threshold 4, save it, otherwise delete it.

EP是显露模式(Emerging Pattern)的缩写,包括JEP(跳跃显露模式)和eEP(基本显露模式),显露模式EP是一种新的对比挖掘模式,是从一个数据集到另外一个数据集支持度发生显著变化的项集,其能够捕获目标类与非目标类之间的差异化特征,因此基于EP可以建立分类效果良好的分类器。EP is the abbreviation of Emerging Pattern, including JEP (Jumping Exposure Pattern) and eEP (Essential Exposure Pattern). Itemsets with significant changes can capture the differential features between target classes and non-target classes, so a classifier with good classification effect can be established based on EP.

采用EPBSBA算法对频繁复杂事件产生子进行计算,获得显露模式的产生子并分类保存;该算法分别筛选出网瘾类和非网瘾类的JEP(跳跃显露模式)和eEP(基本显露模式),其中前者通过差集求得,后者通过并集求得。然后,根据预先给定的EP阈值(假定阈值为1)筛选出EP并计算相应增长率。接下来,再看筛选集合里是否有互相包含的子集,如果有,需要进一步执行删除操作;如果没有就结束。The EPBSBA algorithm is used to calculate the generators of frequent and complex events, and the generators of exposure patterns are obtained and stored in categories; the algorithm screens out JEP (Jumping Exposure Pattern) and eEP (Essential Exposure Pattern) of Internet addiction and non-Internet addiction respectively, Among them, the former is obtained through the difference set, and the latter is obtained through the union set. Then, filter out EPs according to a predetermined EP threshold (assuming the threshold is 1) and calculate the corresponding growth rate. Next, check whether there are subsets that contain each other in the screening set. If so, further delete operations are required; if not, end.

根据最小支持度阈值MinSup(用户自定),调用频繁行为产生子算法,对网瘾和非网瘾两类数据库(即D1和D2)分别求出产生子集合FG1和FG2According to the minimum support threshold MinSup (defined by the user), the sub-algorithm for generating frequent behaviors is invoked, and the sub-sets FG 1 and FG 2 are calculated for the two types of databases (namely D 1 and D 2 ) for Internet addiction and non-Internet addiction respectively.

本发明实施例中,假定MinSup=3/10,筛选出9个网瘾行为产生子G3、G4、G7、G10、G11、G12、G13、G14和G15,并构成初始的网瘾类Generator库,如表8中网瘾类标签下的Generator所示。由于非网瘾类Generator的产生与网瘾类Generator的产生方法相同,差别只在于要通过采集正常用户的计算机交互数据获取,此处不再赘述。假设非网瘾类的Generator如表8非网瘾类标签下所示,此类库为EPBPDA算法所用,GBPDA算法不使用该类库。Generator后面括号中的数字代表支持度,这仅对EPBPDA算法有用,GBPDA算法并不使用该指标。In the embodiment of the present invention, assuming that MinSup=3/10, 9 Internet addiction behavior generators G 3 , G 4 , G 7 , G 10 , G 11 , G 12 , G 13 , G 14 and G 15 are screened out, and Constitute the initial Internet addiction Generator library, as shown in the Generator under the Internet addiction label in Table 8. Since the non-Internet addiction Generator is generated in the same way as the Internet Addiction Generator, the only difference is that it is obtained by collecting computer interaction data of normal users, so I won’t go into details here. Assume that the non-Internet addiction Generator is shown in Table 8 under the Non-Internet Addiction label, this type of library is used by the EPBPDA algorithm, and the GBPDA algorithm does not use this type of library. The number in parentheses after the Generator represents the support, which is only useful for the EPBPDA algorithm, and the GBPDA algorithm does not use this indicator.

表8Table 8

Figure BDA0000368918850000181
Figure BDA0000368918850000181

JEP的求取:Obtaining JEP:

JEP是增长率为∞的EP。在求出产网瘾类和非网瘾类的频繁行为产生子FG1和FG2后,通过集合的差集运算可以得出两类数据库之间的JEP,即对于网瘾数据D1来说,FG1-FG2的结果集合表示数据库D1对D2的JEP,因为FG1-FG2差集表示在FG1中存在但在FG2不存在的产生子,因此,FG1-FG2差集中的元素在D1中无论具有多大支持度,其在D2中的支持度一定为零,因为D2不包含此元素。进一步,根据增长率定义,则差集FG1-FG2中的元素从D2到D1的增长率为∞,故差集FG1-FG2中的元素一定为D1对D2的JEP,记作JEPD1/D2。同理,FG1-FG2的差集则表示D2对D1的JEP,记作JEPD2/D1A JEP is an EP with a growth rate of ∞. After obtaining the frequent behavior generators FG 1 and FG 2 of Internet addiction and non-Internet addiction, the JEP between the two types of databases can be obtained through the set difference operation, that is, for the Internet addiction data D 1 , The result set of FG 1 -FG 2 represents the JEP of database D 1 to D 2 , because the difference set of FG 1 -FG 2 represents the generator that exists in FG 1 but does not exist in FG 2 , therefore, the difference of FG 1 -FG 2 No matter how much support an element in the set has in D 1 , its support in D 2 must be zero, because D 2 does not contain this element. Further, according to the definition of the growth rate, the growth rate of the elements in the difference set FG 1 -FG 2 from D 2 to D 1 is ∞, so the elements in the difference set FG 1 -FG 2 must be the JEP of D 1 to D 2 , denoted as JEP D1/D2 . Similarly, the difference set of FG 1 -FG 2 represents the JEP of D 2 to D 1 , denoted as JEP D2/D1 .

eEP的求取:Obtaining eEP:

eEP是大于增长率阈值1的EP。首先,对频繁行为产生子集合FG1和FG2求交运算。然后,对交集的每一项判断其增长率是否大于或者等于给定的增长率阈值MinGr,如果满足条件,则存入候选集中。接下来,判断候选集中是否有互相包含的子集,如果有,则取其最小子集并删除其超集,如果没有就不对候选集进行操作。于是,最终候选集中所包含的元素就是eEP。An eEP is an EP greater than the growth rate threshold 1. First, the intersection operation of subsets FG 1 and FG 2 is generated for frequent behaviors. Then, for each item in the intersection, it is judged whether its growth rate is greater than or equal to the given growth rate threshold MinGr, and if the condition is met, it is stored in the candidate set. Next, judge whether there are subsets that contain each other in the candidate set. If so, take the smallest subset and delete its superset. If not, do not operate on the candidate set. Thus, the elements contained in the final candidate set are eEPs.

EP的求取。分别求出网瘾D1和非网瘾数据库D2的eEP与JEP的并集,即D1的EP是eEP与JEPD1/D2的并,D2的EP是eEP与JEPD2/D1的并。The acquisition of EP. Calculate the union of eEP and JEP of Internet addiction D 1 and non-internet addiction database D 2 respectively, that is, the EP of D 1 is the union of eEP and JEP D1/D2 , and the EP of D 2 is the union of eEP and JEP D2/D1 .

具体步骤如下:Specific steps are as follows:

步骤5-a、获取属于频繁网瘾复杂事件而不属于频繁非网瘾复杂事件的产生子;Step 5-a, obtaining the generators that belong to frequent Internet addiction complex events but not frequent non-Internet addiction complex events;

网瘾类的JEPPIU为:The JEP PIU for internet addiction is:

D1-D2={<Be100>,<Ab100>,<Bc60,Ce80>}D1-D2={<Be100>,<Ab100>,<Bc60,Ce80>}

步骤5-b、获取不属于频繁网瘾复杂事件而属于频繁非网瘾复杂事件的产生子;Step 5-b. Obtain the generators that do not belong to frequent Internet addiction complex events but belong to frequent non-Internet addiction complex events;

非网瘾类的JEPNPIU为:The JEP NPIU for non-Internet addiction is:

D2-D1={<Ga40>,<Fe40>,<Ae70,Fe40>}D 2 -D 1 ={<Ga40>,<Fe40>,<Ae70,Fe40>}

步骤5-c、获取即属于频繁网瘾复杂事件又属于频繁非网瘾复杂事件的共同产生子,并确定其在频繁网瘾复杂事件中的支持度,即出现率,确定在频繁非网瘾复杂事件中的支持度;Step 5-c, obtain the co-producers that belong to both frequent Internet addiction complex events and frequent non-Internet addiction complex events, and determine their support in the frequent Internet addiction complex events, that is, the occurrence rate, and determine the frequent non-Internet addiction complex events support in complex events;

eEPPIU(网瘾类eEP)=eEPNPIU(非网瘾类eEP)为:eEP PIU (Internet Addiction eEP) = eEP NPIU (Non-Internet Addiction eEP) is:

D1∩D2={<Ae70>,<Bc60>,<Ce80>,<Ed80>,<Ae70,Ed80>,<Ae70,Bc60>}D1∩D2={<Ae70>,<Bc60>,<Ce80>,<Ed80>,<Ae70,Ed80>,<Ae70,Bc60>}

各频繁复杂事件在频繁网瘾复杂事件中的支持度和在频繁非网瘾复杂事件中的支持度,如表8中所示(括号内的百分数)。The support degree of each frequent complex event in frequent Internet addiction complex events and in frequent non-Internet addiction complex events is shown in Table 8 (percentage in brackets).

步骤5-d、计算在频繁网瘾复杂事件中该共同产生子的增长率,即将其在频繁网瘾复杂事件中的支持度除以在频繁非网瘾复杂事件中的支持度;再计算在频繁非网瘾复杂事件中该共同产生子的增长率,即将其在频繁非网瘾复杂事件中的支持度除以在频繁网瘾复杂事件中的支持度;Step 5-d, calculate the growth rate of the co-producer in the frequent Internet addiction complex event, that is, divide its support degree in the frequent Internet addiction complex event by the support degree in the frequent non-Internet addiction complex event; The growth rate of the co-producer in the frequent non-Internet addiction complex event is to divide its support degree in the frequent non-Internet addiction complex event by the support degree in the frequent Internet addiction complex event;

以<Bc60>[7/6.5]为例,由于<Bc60>在网瘾类中的支持度为70%,在非网瘾类中的支持度为65%,因此其增长率为7/6.5,其他EP的增长率也按此可分别求得,此处不再赘述。Taking <Bc60>[7/6.5] as an example, since <Bc60> has a support rate of 70% in the Internet addiction category and 65% in the non-Internet addiction category, its growth rate is 7/6.5, The growth rates of other EPs can also be obtained separately according to this, and will not be repeated here.

步骤5-e、比较上述两个增长率的大小,保留增长率大于1的产生子,删除增长率小于1的产生子;Step 5-e, compare the size of the above two growth rates, retain the generator whose growth rate is greater than 1, and delete the generator whose growth rate is less than 1;

根据预先给定的EPPIU阈值(假定阈值为1)筛选出网瘾类(eEPPIU)并计算相应增长率,筛选结果为{<Bc60>[7/6.5],<Ae70,Ed80>[1.5]>},其中方括号中的数字表示增长率。According to the pre-specified EP PIU threshold (assuming the threshold is 1), the Internet addiction category (eEP PIU ) is screened out and the corresponding growth rate is calculated. The screening result is {<Bc60>[7/6.5],<Ae70,Ed80>[1.5] >}, where the numbers in square brackets indicate the growth rate.

再看筛选集合里是否有互相包含的子集,如果有,则删除;如果没有就结束。由于此集合没有相互包含的子集,于是结束。Then check whether there are subsets that contain each other in the screening set, and if so, delete it; if not, end it. Ends because this set has no mutually inclusive subsets.

因此,网瘾类最终的eEPPIU={<Bc60>[7/6.5],<Ae70,Ed80>[1.5]>},其相应增长率的最小阈值和最大阈值分别为7/6.5和1.5。Therefore, the final eEP PIU of Internet addiction = {<Bc60>[7/6.5],<Ae70,Ed80>[1.5]>}, and the minimum and maximum thresholds of the corresponding growth rates are 7/6.5 and 1.5, respectively.

非网瘾类同理如下:Non-Internet addicts are similarly as follows:

根据预先给定的EPNPIU阈值(假定阈值为1)筛选非网瘾类的eEPNPIU,得出{<Ae70>,<Ce80>,<Ed80>,<Ae70,Bc60>}。According to the pre-specified EP NPIU threshold (the threshold is assumed to be 1), the eEP NPIU of non-Internet addiction category is screened to obtain {<Ae70>,<Ce80>,<Ed80>,<Ae70,Bc60>}.

由于筛选集合中的<Ae70>为<Ae70,Bc60>的子集,需删除<Ae70,Bc60>,于是Since <Ae70> in the screening set is a subset of <Ae70, Bc60>, <Ae70, Bc60> needs to be deleted, so

eEPNPIU={<Ae70>[90%/50%=1.6],<Ce80>[70%/40%=1.75],<Ed80>[70%/55%=7/5.5]}eEP NPIU ={<Ae70>[90%/50%=1.6],<Ce80>[70%/40%=1.75],<Ed80>[70%/55%=7/5.5]}

因此,非网瘾类中增长率的最小阈值和最大阈值分别为7/5.5和1.75。Therefore, the minimum threshold and maximum threshold of growth rate in the non-Internet addiction category are 7/5.5 and 1.75, respectively.

步骤5-f、将频繁网瘾复杂事件中的产生子取并集,将频繁非网瘾复杂事件中的产生子取并集,即获得显露模式的产生子并分类保存。Step 5-f, combining the generators of the frequent Internet addiction complex events and the union of the generators of the frequent non-Internet addiction complex events, that is, obtaining the generators of the exposure pattern and storing them in categories.

步骤6、采集待测用户一段时间之内与计算机之间的交互变量,并重复步骤2到步骤3;Step 6, collect the interaction variables between the user to be tested and the computer within a period of time, and repeat steps 2 to 3;

采集到的待检测用户一周的上网行为序列如下:The collected online behavior sequence of the user to be detected for a week is as follows:

周一:{Be100,Ab100,Bc60,Ae70,Db90,Fe40,Da50};Monday: {Be100, Ab100, Bc60, Ae70, Db90, Fe40, Da50};

周二:{Ed80,Ab100,Ee30,Fd30};Tuesday: {Ed80, Ab100, Ee30, Fd30};

周三:{Be100,Ed80,Bc60,Ae70,Db90,Fe40}Wednesday: {Be100, Ed80, Bc60, Ae70, Db90, Fe40}

周四:{Cc30,Da40};Thursday: {Cc30,Da40};

周五:{Be100,Ce20};Friday: {Be100, Ce20};

周六:{Fa90,Ad20};Saturday: {Fa90,Ad20};

周日:{Be100,Ab100,Ed80,Bc60,Ae70,Db90,Fe40,Ge60,Aa70};Sunday: {Be100, Ab100, Ed80, Bc60, Ae70, Db90, Fe40, Ge60, Aa70};

步骤7、获取待测用户一段时间之内所进行的相同复杂事件,并采集上述相同复杂事件的产生次数,并根据用户所设置的次数阈值,选出大于阈值的复杂事件作为频繁复杂事件,再获得由上述频繁复杂事件所构成集合的用于描述行为特征分布的产生子,并发送至数据处理与检测模块;Step 7. Obtain the same complex events performed by the user to be tested within a period of time, and collect the occurrence times of the above-mentioned same complex events, and select complex events greater than the threshold as frequent complex events according to the number threshold set by the user, and then Obtain a generator for describing the distribution of behavioral characteristics composed of the above-mentioned frequent and complex events, and send it to the data processing and detection module;

步骤8、将待检测频繁行为产生子与网瘾样本库进行比较,若待检测频繁复杂事件产生子包含在样本库中,则判定其为网瘾,否则是非网瘾,并输出判断结果;或将待检测频繁行为产生子与样本库进行比较,采用EPBPDA算法分别计算其网瘾得分与非网瘾得分,判断两者得分高低,待检测频繁复杂事件属于得分高一类别;Step 8, comparing the frequent behavior generator to be detected with the Internet addiction sample library, if the frequent complex event generator to be detected is included in the sample library, then determine it is Internet addiction, otherwise it is not Internet addiction, and output the judgment result; or Compare the frequent behavior generators to be detected with the sample library, and use the EPBPDA algorithm to calculate their Internet addiction scores and non-Internet addiction scores respectively, and judge the scores of the two. The frequent and complex events to be detected belong to the category with a higher score;

本专利实施例中提出了两种网瘾模式检测算法,即基于产生子的PIU检测算法(Generator-Based PIU Detecting Algorithm-GBPDA)和基于EP的PIU检测算法(EP-Based PIU Detecting Algorithm-EPBPDA)。In the embodiment of this patent, two Internet addiction pattern detection algorithms are proposed, namely, Generator-Based PIU Detecting Algorithm-GBPDA and EP-Based PIU Detecting Algorithm-EPBPDA. .

GBPDA算法:GBPDA algorithm:

对待检测的行为事件进行扫描,如果该行为事件包含数据库中的任何一个产生子Generator,就判断待检测行为是有网瘾行为。考虑到持续时间的值越大,待检测的行为是网瘾行为的可能性就越大,因此,对于待检测行为序列中的复杂事件CEm(typem,tm,durm)以及Generator中的复杂事件CEn(typen,tn,durn)来说,如果typem=typen,tm=tn,且durm>=durn,那么仍然认为CEm与CEn是同一复杂事件的。The behavior event to be detected is scanned, and if the behavior event includes any sub-Generator in the database, it is judged that the behavior to be detected is an Internet addiction behavior. Considering that the larger the value of the duration, the more likely the behavior to be detected is an Internet addiction behavior, therefore, for the complex event CE m (type m ,t m ,dur m ) in the behavior sequence to be detected and Generator For the complex event CE n (type n ,t n ,dur n ), if type m =type n , t m =t n , and dur m >=dur n , then CE m and CE n are still considered to be the same complex event.

本发明实施例中,扫描待检测用户一周上网行为序列的频繁项集,这里取支持度阈值为3/7,那么频繁项集为{Be100,Ab100,Bc60,Ae70,Db90,Fe40}。发现其至少包含网瘾类中的一个产生子,即<Be100>,根据GBPDA算法网瘾判定原则,即只要包含网瘾行为模式库中的一个产生子,就判断其具有网瘾行为,因此该待检测行为最终判定其属于网瘾行为。In the embodiment of the present invention, scan the frequent itemsets of the online behavior sequence of the user to be detected for a week, where the support threshold is 3/7, then the frequent itemsets are {Be100, Ab100, Bc60, Ae70, Db90, Fe40}. It is found that it contains at least one generator in the Internet addiction class, namely <Be100>, according to the GBPDA algorithm Internet addiction determination principle, that is, as long as it contains a generator in the Internet addiction behavior pattern library, it is judged to have Internet addiction behavior, so the The behavior to be tested is finally determined to be an Internet addiction behavior.

EPBPDA算法:EPBPDA algorithm:

在获得JEP和eEP后,分别计算网瘾类和非网瘾类的分值,然后进行比较,选择分值较大作为最后判断结果。After obtaining JEP and eEP, the scores of Internet addiction and non-Internet addiction are calculated respectively, and then compared, and the higher score is selected as the final judgment result.

得分Score(C,Di|j)公式如下:The formula of Score(C,D i|j ) is as follows:

ScoreScore (( CC ,, DD. ii || jj )) == &Sigma;&Sigma; Xx &Element;&Element; RSRS GrGr (( Xx ,, DD. jj || ii ,, DD. ii || jj )) GrGr (( Xx ,, DD. jj || ii ,, DD. ii || jj )) ++ MaxMax GrGr RSRS &times;&times; SupSup cc (( Xx ))

++ &Sigma;&Sigma; YY &Element;&Element; WSWS 11 GrGr (( YY ,, DD. ii || jj ,, DD. jj || ii )) ++ MinMin GrGr WSWS &times;&times; SupSup cc (( YY )) ,, (( ii &NotEqual;&NotEqual; jj )) -- -- -- (( 22 ))

其中:i和j是样本库类标签号,Di、Dj均是样本库,为统一表达形式,用下标i|j或j|i加以区别。注意,在本专利中只有2类样本库,即网瘾类样本库和非网瘾类样本库,因此为更好区分二者,用D1表示网瘾类库,用D2表示非网瘾库。X表示D1的显露模式的产生子,Y表示D2的显露模式的产生子;RS是目标类的正向样本集合,即起积极作用的集合,WS是目标类的反向样本集合,即起消极作用的集合;Gr(X,Dj|i,Di|j)表示X从一个类到另一个类的增长率,这由分割符“|”的前后因子所在顺序决定。比如,Gr(X,D2,D1)表示X从D2到D1的增长率,Gr(X,D1,D2)表示X从D1到D2的增长率;MaxGrRS表示目标类集合中的最大增长率阈值,MinGrWS表示非目标类集合中的最小增长率阈值,C为待分类复杂事件,Supc(X)表示X在C中的支持度,Supc(Y)表示Y在C中的支持度。Among them: i and j are the class label numbers of the sample library, D i and D j are both sample libraries, which are unified expressions, and are distinguished by the subscript i|j or j|i. Note that there are only 2 types of sample libraries in this patent, i.e. Internet addiction sample library and non-Internet addiction sample library, so in order to better distinguish between the two, D 1 is used to represent the Internet addiction library, and D 2 is used to represent the non-Internet addiction sample library library. X represents the generator of the exposure pattern of D 1 , and Y represents the generator of the exposure pattern of D 2 ; RS is the positive sample set of the target class, that is, the set that plays an active role, and WS is the reverse sample set of the target class, namely A set that plays a negative role; Gr(X,D j|i ,D i|j ) represents the growth rate of X from one class to another class, which is determined by the order of the factors before and after the separator "|". For example, Gr(X,D 2 ,D 1 ) indicates the growth rate of X from D 2 to D 1 , Gr(X,D 1 ,D 2 ) indicates the growth rate of X from D 1 to D 2 ; MaxGr RS indicates the target The maximum growth rate threshold in the class set, MinGr WS represents the minimum growth rate threshold in the non-target class set, C is the complex event to be classified, Sup c (X) represents the support of X in C, Sup c (Y) represents The support of Y in C.

如果将待检测样本C看作是RS,即目标类,则利用公式(2)中的第一项计算得分;如果将C看作是WS,即非目标类,则利用公式(2)中的第二项计算得分。其中,公式(2)中的第一项 &Sigma; X &Element; RS Gr ( X , D j | i , D i | j ) Gr ( X , D j | i , D i | j ) + Max Gr RS &times; Sup c ( X ) , 不仅给出了eEP与JEP对打分的贡献,参数MaxGr还特别强调了JEP的贡献。与此同时,非目标类的EP对于样本C属于目标类也具有贡献。对于C来说,当评断C是否属于D1类时,D2类中的EP即Y,也能给予一定的贡献。当C中包含Y时,可以判断出C属于D1类的概率为1/(Gr(X,D2,D1)+1),当Gr(Y,D2,D1)比较大时,Y的贡献值很小可以忽略不计,但是当Gr(Y,D2,D1)在满足最小阈值的基础上且较小时,则对于判断C属于D1会体现出较大贡献,故需要考虑此因素带来的影响,同时考虑普通EP与JEP的区别,公式(2)中的第二项

Figure BDA0000368918850000224
表示了Y的贡献值。因此,公式(2)是综合考虑的结果。If the sample C to be detected is regarded as RS, that is, the target class, the first item in formula (2) is used to calculate the score; if C is regarded as WS, that is, the non-target class, then the score is calculated using The second item calculates the score. Among them, the first term in formula (2) &Sigma; x &Element; RS Gr ( x , D. j | i , D. i | j ) Gr ( x , D. j | i , D. i | j ) + Max Gr RS &times; Sup c ( x ) , Not only the contribution of eEP and JEP to scoring is given, but the parameter MaxGr also emphasizes the contribution of JEP. At the same time, the EP of the non-target class also contributes to the fact that sample C belongs to the target class. For C, when judging whether C belongs to D 1 category, the EP in D 2 category, namely Y, can also make a certain contribution. When C contains Y, it can be judged that the probability that C belongs to D 1 class is 1/(Gr(X,D 2 ,D 1 )+1), when Gr(Y,D 2 ,D 1 ) is relatively large, The contribution of Y is very small and negligible, but when Gr(Y,D 2 ,D 1 ) meets the minimum threshold and is small, it will show a large contribution to the judgment that C belongs to D 1 , so it needs to be considered The impact of this factor, while considering the difference between ordinary EP and JEP, the second item in formula (2)
Figure BDA0000368918850000224
Indicates the contribution value of Y. Therefore, formula (2) is the result of comprehensive consideration.

本发明实施例中,从该待检测用户一周的上网行为序列中获取Generator及其对应的支持度,即In the embodiment of the present invention, the Generator and its corresponding support degree are obtained from the online behavior sequence of the user to be detected for a week, that is,

GC={<Be100>(50%),<Ab100>(60%),<Bc60>(45%),<Ae70>(30%),<Ae70,Fe40>(50%)}。G C ={<Be100>(50%),<Ab100>(60%),<Bc60>(45%),<Ae70>(30%),<Ae70,Fe40>(50%)}.

待检测类C的JEP获取如下:The JEP of class C to be tested is obtained as follows:

JEPPIU-C={<Be100>(50%),<Ab100>(60%)}JEP PIU-C ={<Be100>(50%),<Ab100>(60%)}

JEPNPIU-C={<Ae70,Fe40>(50%)}JEP NPIU-C ={<Ae70,Fe40>(50%)}

由于EP阈值=1,因此待检测类C的产生子中网瘾类和非网瘾类的eEP集合分别为:Since the EP threshold = 1, the eEP sets of the Internet addiction class and the non-Internet addiction class in the generation of the class C to be detected are respectively:

eEPPIU-C={<Bc60>(45%)}eEP PIU-C ={<Bc60>(45%)}

eEPNPIU-C={<Ae70>(30%)}eEP NPIU-C ={<Ae70>(30%)}

因此,待检测类的网瘾类和非网瘾类EP集和分别为:Therefore, the sum of the Internet addiction and non-Internet addiction EP sets of the class to be detected is respectively:

EPPIU-C={<Be100>(50%),<Ab100>(60%),<Bc60>(45%)}EP PIU-C ={<Be100>(50%),<Ab100>(60%),<Bc60>(45%)}

EPNPIU-C={<Ae70,Fe40>(50%),<Ae70>(30%)}EP NPIU-C ={<Ae70,Fe40>(50%),<Ae70>(30%)}

前面已求得网瘾类EP的MaxGrRS=MaxGrD1=1.5,MinGrWS=MinGrD2=7/6.5;非网瘾类EP的MaxGrRS=MarGrD2=1.75,MinGrWS=MinGrD1=7/5.5。MaxGr RS = MaxGr D1 = 1.5, MinGr WS = MinGr D2 = 7/6.5 for Internet addiction EP; MaxGr RS = MarGr D2 = 1.75 for non-Internet addiction EP, MinGr WS = MinGr D1 = 7/5.5 .

下面,分别计算待检测用户行为的各个产生子在网瘾类D1和非网瘾类D2的得分。Next, the scores of each generator of the user behavior to be detected in the Internet addiction category D 1 and the non-Internet addiction category D 2 are calculated respectively.

1)因为<Be100>(50%)和<Ab100>(60%)是网瘾类D1的跳跃显露模式,对于网瘾类D1来说是X,即属于正向集和RS,对于非网瘾类D2来说是Y,即属于反向集合WS,因此有:1) Because <Be100> (50%) and <Ab100> (60%) are the jumping exposure patterns of Internet addiction class D 1 , for Internet addiction class D 1 , it is X, that is, it belongs to the positive set and RS, and for non- Internet addiction class D 2 is Y, that is, it belongs to the reverse set WS, so there are:

Gr(<Be100>,D2,D1)=∞,Supc(<Be100>)=50%Gr(<Be100>,D 2 ,D 1 )=∞,Sup c (<Be100>)=50%

ScoreScore (( << Bebe 100100 >> ,, DD. 11 )) == limlim &infin;&infin; &infin;&infin; ++ MaxMax GrGr DD. 11 &times;&times; 0.50.5 == limlim &infin;&infin; &infin;&infin; ++ 1.51.5 &times;&times; 0.50.5 == 11 &times;&times; 0.50.5

ScoreScore (( << Bebe 100100 >> ,, DD. 22 )) == limlim 11 &infin;&infin; ++ MinMin GrGr DD. 22 &times;&times; 0.50.5 == limlim 11 &infin;&infin; ++ 77 // 6.56.5 &times;&times; 0.50.5 == 00 &times;&times; 0.50.5

Gr(<Ab100>,D2,D1)=∞,Supc(<Ad100>)=60%Gr(<Ab100>,D 2 ,D 1 )=∞,Sup c (<Ad100>)=60%

ScoreScore (( << AbAb 100100 >> ,, DD. 11 )) == limlim &infin;&infin; &infin;&infin; ++ MaxMax GrGr DD. 11 &times;&times; 0.60.6 == limlim &infin;&infin; &infin;&infin; ++ 11 .. 55 == 11 &times;&times; 0.60.6

ScoreScore (( << AbAb 100100 >> ,, DD. 22 )) == limlim 11 &infin;&infin; ++ MinMin GrGr DD. 22 &times;&times; 0.60.6 == limlim 11 &infin;&infin; ++ 77 // 6.56.5 &times;&times; 0.60.6 == 00 &times;&times; 0.60.6

2)因为<Bc60>(45%)是网瘾类D1的产生子,对于网瘾类D1来说是X,即属于正向集和RS,对于非网瘾类D2来说是Y,即属于反向集合WS,因此有:2) Because <Bc60> (45%) is the child of Internet addiction class D1, it is X for Internet addiction class D1, that is, it belongs to the forward set and RS, and it is Y for non-Internet addiction class D2, that is, it belongs to The reverse set WS, therefore has:

Gr(<Bc60>,D2,D1)=70%/65%=7/6.5,Supc(<Bc60>)=45%Gr(<Bc60>,D 2 ,D 1 )=70%/65%=7/6.5,Sup c (<Bc60>)=45%

ScoreScore (( << BcBc 6060 >> ,, DD. 11 )) == GrGr (( << BcBc 6060 ,, DD. 22 ,, DD. 11 )) GrGr (( << BcBc 6060 ,, DD. 22 ,, DD. 11 >> )) ++ MaxMax GrGr DD. 11 &times;&times; SupSup cc (( << BcBc 6060 >> ))

== 77 // 6.56.5 77 // 6.56.5 ++ 1.51.5 &times;&times; 0.450.45

ScoreScore (( << BcBc 6060 >> ,, DD. 22 )) == 11 GrGr (( << BcBc 6060 ,, DD. 22 ,, DD. 11 >> )) ++ MinMin GrGr DD. 22 &times;&times; SupSup cc (( << BcBc 6060 >> ))

== 11 77 // 6.56.5 ++ 77 // 6.56.5 &times;&times; 0.450.45

3)因为<Ae70>(30%)是非网瘾类D2的产生子,对于非网瘾类D2来说是X,即属于正向集和RS,对于网瘾类D1来说是Y,即属于反向集合WS,因此有:3) Because <Ae70> (30%) is the child of non-Internet addiction class D 2 , it is X for non-Internet addiction class D 2 , that is, it belongs to positive set and RS, and it is Y for Internet addiction class D 1 , which belong to the reverse set WS, so there are:

Gr(<Ae70>,D1,D2)=90%/50%=1.6,Supc(<Ae70>)=30%;Gr(<Ae70>,D 1 ,D 2 )=90%/50%=1.6, Sup c (<Ae70>)=30%;

ScoreScore (( << AeAe 7070 >> ,, DD. 11 )) == 11 GrGr (( << AeAe 7070 ,, DD. 11 ,, DD. 22 >> )) ++ MinMin GrGr DD. 11 &times;&times; 0.30.3 == 11 1.61.6 ++ 77 // 5.55.5 &times;&times; 0.30.3

ScoreScore (( << AeAe 7070 >> ,, DD. 22 )) == GrGr (( << BcBc 6060 ,, DD. 11 ,, DD. 22 >> )) GrGr (( << AeAe 7070 ,, DD. 11 ,, DD. 22 >> )) ++ MaxMax GrGr DD. 22 &times;&times; 0.30.3 == 1.61.6 1.61.6 ++ 11 .. 7575 &times;&times; 0.30.3

4)因为<Ae70,Fe40>(50%)是非网瘾类D2的跳跃显露模式,对于非网瘾类D2来说是X,即属于正向集和RS,对于网瘾类D1来说是Y,即属于反向集合WS,因此有:4) Because <Ae70, Fe40> (50%) is the jumping exposure pattern of non-Internet addiction D 2 , it is X for non-Internet addiction D 2 , that is, it belongs to the positive set and RS, and for Internet addiction D 1 is Said to be Y, which belongs to the reverse set WS, so there are:

Gr(<Ae70,Fe40>,D1,D2)=∞,Supc(<Ae70,Fe40>)=50%;Gr(<Ae70,Fe40>,D 1 ,D 2 )=∞,Sup c (<Ae70,Fe40>)=50%;

ScoreScore (( << AeAe 7070 ,, FeFe 4040 >> ,, DD. 11 )) == limlim 11 &infin;&infin; ++ MinMin GrGr DD. 11 &times;&times; 0.50.5 == limlim 11 &infin;&infin; ++ 77 // 5.55.5 &times;&times; 0.50.5 == 00 &times;&times; 0.50.5

ScoreScore (( << AeAe 7070 ,, FeFe 4040 >> ,, DD. 22 )) == limlim &infin;&infin; &infin;&infin; ++ MarMar. GrGr DD. 22 &times;&times; 0.50.5 == limlim &infin;&infin; &infin;&infin; ++ 11 .. 7575 &times;&times; 0.50.5 == 11 &times;&times; 0.50.5

将上述计算结果带入公式(2),求得待检测行为的网瘾类和非网瘾类得分如下:Putting the above calculation results into formula (2), the scores of Internet addiction and non-Internet addiction of the behavior to be tested are obtained as follows:

ScoreScore PIUPIU == 11 &times;&times; 5050 %% ++ 11 &times;&times; 6060 %% ++ 77 // 6.56.5 77 // 6.56.5 ++ 1.51.5 &times;&times; 0.450.45 ++ 11 1.61.6 ++ 77 // 5.55.5 &times;&times; 0.30.3 ++ 00 &times;&times; 0.50.5 == 1.391.39

ScoreScore NPIUNPIU == 00 &times;&times; 0.50.5 ++ 00 &times;&times; 0.60.6 ++ 11 77 // 6.56.5 ++ 77 // 6.56.5 &times;&times; 0.450.45 ++ 1.61.6 1.61.6 ++ 11 .. 7575 &times;&times; 0.30.3 ++ 11 &times;&times; 0.50.5 == 0.850.85

由于SCOREPIU=1.39大于SCORENPIU=0.85,因此,最终判定该待检测行为属于网瘾类。Since SCORE PIU =1.39 is greater than SCORE NPIU =0.85, it is finally determined that the behavior to be detected belongs to the category of Internet addiction.

步骤9、若检测到用户计算机交互行为是网瘾行为,则采用预警干预模块发出预警提示,并通过定时器限定该用户可继续使用计算机1小时;若超过时限范围,则关闭计算机。Step 9. If it is detected that the user's computer interaction behavior is an Internet addiction behavior, an early warning intervention module is used to issue an early warning prompt, and a timer is used to limit the user to continue using the computer for 1 hour; if the time limit is exceeded, the computer is turned off.

一旦超过该时限范围,就会进一步启动软件装置关闭计算机,本发明实施例中,调用Windows系统的API接口,即Shutdown函数,实现对计算机的关闭,从而实现对网瘾的有效干预。Once the time limit is exceeded, the software device will be further started to shut down the computer. In the embodiment of the present invention, the API interface of the Windows system, namely the Shutdown function, is called to shut down the computer, thereby effectively intervening in Internet addiction.

本发明实施例中,通过实验测试提出的网瘾检测方法的应用效果。In the embodiment of the present invention, the application effect of the proposed method for detecting Internet addiction is tested through experiments.

(1)对网瘾检测算法效率的评价。(1) Evaluation of the efficiency of Internet addiction detection algorithm.

利用提出的两种网瘾检测算法GBPDA算法和EPBPDA算法,主要从时间复杂度和空间复杂度进行评价。从图4中图(a)中可以看出,随着数据规模(即以一天产生的计算机交互行为数据为检测单位)的增长,从采集四天数据到十天数据,两种算法的运行时间均程递增趋势,这时因为数据越多算法的处理时间也越多。此外,EPBPDA算法的运行时间要略高于GBPDA算法,原因在于EPBPDA算法要比GBPDA算法需要更多处理过程,因此导致更多运行时间。从图4中图(b)中的内存空间来看,两种算法所需内存上限都不大于16MB字节内存空间。同时,随着数据集规模增大,内存空间占用亦随之逐渐增多。此外,EPBPDA算法要比GBPDA算法耗费更多内存空间,原因是EP挖掘要在Generator挖掘基础上进行,因此需要占用相对较多的内存资源。Using the proposed two Internet addiction detection algorithms GBPDA algorithm and EPBPDA algorithm, mainly evaluate from the time complexity and space complexity. From Figure (a) in Figure 4, it can be seen that with the increase of the data scale (that is, the computer interactive behavior data generated in one day as the detection unit), the running time of the two algorithms from collecting four days of data to ten days of data The average increase trend, at this time, because the more data, the more processing time the algorithm takes. In addition, the running time of the EPBPDA algorithm is slightly higher than that of the GBPDA algorithm because the EPBPDA algorithm requires more processing than the GBPDA algorithm, thus resulting in more running time. Judging from the memory space in (b) in Figure 4, the upper limit of memory required by the two algorithms is not greater than 16MB of memory space. At the same time, as the size of the data set increases, the memory space occupation will gradually increase. In addition, the EPBPDA algorithm consumes more memory space than the GBPDA algorithm, because EP mining is based on Generator mining, so it needs to occupy relatively more memory resources.

(2)对网瘾检测算法效果的评价。(2) Evaluation of the effect of Internet addiction detection algorithm.

从图5中图(a)中可以看出,首先,两种算法都取得了较高的正确率,说明了这两种网瘾检测算法的有效性,且随着数据规模增大,两种算法的准确率都有一定程度的提高,说明数据越多检测效果越好。其次,EPBPDA算法的正确率要高于GBPDA算法,原因在于,EPBPDA算法是在两类数据集上对比得出,即网瘾类和非网瘾类,更加客观,对于网瘾评判也更加公平,而GBPDA算法,只通过与网瘾类Generator对比来体现,即只评判用户行为是否与网瘾类接近,且评判阈值的确定也具有很强的主观因素性。It can be seen from Figure (a) in Figure 5 that, firstly, both algorithms have achieved a high accuracy rate, which shows the effectiveness of the two Internet addiction detection algorithms, and as the data scale increases, the two algorithms The accuracy of the algorithm has been improved to a certain extent, indicating that the more data, the better the detection effect. Secondly, the correct rate of the EPBPDA algorithm is higher than that of the GBPDA algorithm. The reason is that the EPBPDA algorithm is obtained by comparing two types of data sets, that is, Internet addiction and non-Internet addiction. It is more objective and fairer for the evaluation of Internet addiction. The GBPDA algorithm is only reflected by comparing with the Internet addiction generator, that is, it only judges whether the user behavior is close to the Internet addiction category, and the determination of the judgment threshold is also highly subjective.

分别从漏检率和误检率两个角度,对两种算法进行了评价。漏检率是将网瘾判断成了非网瘾,误检率则是将非网瘾判断成了网瘾,显然这两个指标都会影响网瘾检测效果,因此可以将它们统一为错误率。首先,从图5中图(b)和图(c)中可以看出,错误率都处于一个较低水平,说明分类效果良好。其次,当数据规模较小时,错误率较高,这也正好从另外一方面验证了准确率所反应的规律;而当数据规模增大时,错误率在一定程度有所下降,这再次证明进行网瘾行为检测需要尽可能多地采集用户行为数据。接下来,EPBPDA算法的错误率略低于GBPDA算法,说明EPBPDA算法分类效果更好,这源于其所构建的分类器不但形式简洁且保留了区分性较强的特征属性,因而保证了分类效果。The two algorithms are evaluated from two angles of missed detection rate and false detection rate respectively. The missed detection rate is to judge Internet addiction as non-Internet addiction, and the false detection rate is to judge non-Internet addiction as Internet addiction. Obviously, these two indicators will affect the detection effect of Internet addiction, so they can be unified as error rate. First of all, it can be seen from the graphs (b) and (c) in Figure 5 that the error rate is at a low level, indicating that the classification effect is good. Secondly, when the data size is small, the error rate is high, which just verifies the law reflected by the accuracy rate from another aspect; and when the data size increases, the error rate decreases to a certain extent, which proves once again that the Internet addiction behavior detection needs to collect as much user behavior data as possible. Next, the error rate of the EPBPDA algorithm is slightly lower than that of the GBPDA algorithm, indicating that the classification effect of the EPBPDA algorithm is better. .

Claims (6)

1. A net addiction detection device based on user computer interaction events is characterized in that: comprises a simple event acquisition module, a complex event generation module, a time generalization and normalization processing module, a frequent behavior generator acquisition module, a sample library construction module, a data processing and detection module and an early warning intervention module, wherein,
simple event collection module: the system comprises a complex event generating module, a user interaction module and a user interaction module, wherein the simple event generating module is used for acquiring interaction variables of the user and a computer, and the interaction variables comprise the CPU utilization rate, the memory utilization rate, the left mouse button clicking times in unit time, the right mouse button clicking times in unit time, the keyboard clicking times in unit time, the pixel moving times of the mouse in unit time, network flow and an operation process of the computer, and transmitting the input variables to the complex event generating module;
a complex event generation module: the system comprises a plurality of simple events, a plurality of complex events and a plurality of storage units, wherein the simple events are used for comparing each simple event with respective threshold, if the simple events are larger than or equal to the threshold range, the simple events are reserved, and then the complex events are generated according to the combination conditions of all the reserved simple events; otherwise, continuing to collect simple events; the complex events comprise offline video watching, online video watching, instant strategy games, table game games, webpage browsing, online chatting and downloading operations;
a time generalization and normalization processing module: the system is used for sorting the complex events made by each user on a certain day and generalizing the occurrence time of the complex events, namely dividing one day into a plurality of time periods; normalizing the duration, namely rounding the duration of the complex event; sending the processed complex event to a frequent behavior generation sub-acquisition module;
the frequent behavior generation sub-acquisition module: when a sample library is established, the method is used for acquiring the same complex events carried out among a plurality of network addiction users, collecting the generation times of the same complex events, selecting the complex events larger than a threshold value as frequent network addiction complex events according to a time threshold value set by the user, acquiring a generator for describing the behavior characteristic distribution of the user by a set formed by the frequent network addiction complex events, and sending the generator to a sample library construction module; the system is used for acquiring the same complex events carried out among a plurality of non-network-addiction users, collecting the generation times of the same complex events, selecting the complex events larger than a threshold value as frequent non-network-addiction complex events according to a time threshold value set by the users, then acquiring a generator which is a set formed by the frequent non-network-addiction complex events and used for describing the behavior characteristic distribution of the users, and sending the generator to a sample library construction module;
when the internet addiction condition of a user is detected, the method is used for acquiring the same complex event performed by the user to be detected within a period of time, acquiring the generation times of the same complex event, selecting the complex event larger than a threshold value as a frequent complex event according to a time threshold value set by the user, then acquiring a generator for describing behavior feature distribution of a set formed by the frequent complex event, and sending the generator to a data processing and detecting module;
a sample library construction module: the method is used for calculating scores of the frequent net addiction complex event generators by adopting a generator-based sample construction algorithm, if the scores exceed a threshold value, the frequent net addiction complex event generators are stored as net addiction classes, otherwise, the frequent net addiction complex event generators are deleted, and the frequent non-net addiction complex event generators are stored as non-net addiction classes; or calculating the frequent complex event producers by adopting a sample construction algorithm based on the exposure mode to obtain and store the producers of the exposure mode;
the data processing and detecting module: the network addiction judging module is used for comparing the to-be-detected frequent behavior generator with the network addiction sample library, judging the to-be-detected frequent complex event generator as network addiction if the to-be-detected frequent complex event generator is contained in the sample library, otherwise judging the to-be-detected frequent complex event generator as non-network addiction, and outputting a judgment result; or comparing the to-be-detected frequent behavior generator with a sample library, respectively calculating the net addiction score and the non-net addiction score by adopting a net addiction detection algorithm based on an exposure mode, and judging the category of the to-be-detected frequent behavior generator according to the score;
an early warning intervention module: the computer interaction behavior detection device is used for sending out an early warning prompt when the fact that the computer interaction behavior of the user is the internet addiction behavior is detected, and limiting the time range of the user for continuously using the computer through the timer for 1-2 hours; if the time limit range is exceeded, the computer is shut down.
2. The method for detecting the internet addiction based on the user computer interaction events, which is characterized by comprising the following steps of: the method comprises the following steps:
step 1, collecting interaction variables between a plurality of network addict users and a plurality of non-network addict users and a computer as simple events, wherein the interaction variables comprise the utilization rate of a computer CPU (Central processing Unit), the utilization rate of a memory, the times of clicking a left mouse button in unit time, the times of clicking a right mouse button in unit time, the times of clicking a keyboard in unit time, the number of pixels moved by the mouse in unit time, network flow and an operation process;
step 2, comparing each simple event with respective threshold value, if the simple event is larger than or equal to the threshold value range, reserving the simple event, and generating a complex event according to the combination condition of all reserved simple events; otherwise, continuing to collect simple events; the complex events comprise offline video watching, online video watching, instant strategy games, table game games, webpage browsing, online chatting and downloading operations;
the process of generating complex events is as follows:
if the CPU utilization rate exceeds or is equal to the threshold range of 45% -55%, the memory occupancy rate exceeds or is equal to the threshold range of 55% -65%, and the internet traffic exceeds or is equal to the threshold range of 35-45 MB, generating a complex event of watching videos online;
if the CPU utilization rate exceeds or is equal to 45% -55% of the threshold range, the memory occupancy rate exceeds or is equal to 55% -65% of the threshold range and the running process is a multimedia player, generating a complex event for watching videos offline;
if the CPU utilization rate exceeds or is equal to 45-55% of the threshold range, the memory occupancy rate exceeds or is equal to 55-65% of the threshold range, the left mouse button clicking frequency exceeds or is equal to 25-35 times of the threshold range in unit time, the right mouse button clicking frequency exceeds or is equal to 10-20 times of the threshold range in unit time, the number of pixels moved by the mouse exceeds or is equal to 1550-1650 times of the threshold range in unit time, the keyboard knocking frequency exceeds or is equal to 80-120 times of the threshold range in unit time, and the running process is a timely strategy game, a complex event for playing the instant strategy game is generated;
if the CPU utilization rate exceeds or is equal to 45-55% of the threshold range, the memory occupancy rate exceeds or is equal to 55-65% of the threshold range, the number of times of clicking the left mouse button per unit time exceeds or is equal to 25-35 times of the threshold range, the number of pixels moved by the mouse per unit time exceeds or is equal to 1550-1650 pixels, and the running process is a table game, a complex event for playing the table game is generated;
if the CPU utilization rate exceeds or equals to 45% -55% of the threshold range, the number of times of clicking the left button of the mouse per unit time exceeds or equals to 25-35 times of the threshold range, the number of pixels moved by the mouse per unit time exceeds or equals to 1550-1650 pixels of the threshold range, and the running process is a table game, generating another complex event for playing the table game;
if the number of times of clicking the left mouse button per unit time exceeds or equals to the threshold range of 25-35 times, the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range of 1550-1650 pixels, and the running process is a browser, generating a complex event for browsing a webpage;
if the number of times of clicking the left mouse button per unit time exceeds or equals to the threshold range by 25-35 times, the number of pixels moved by the mouse per unit time exceeds or equals to the threshold range by 1550-1650, the internet traffic exceeds or equals to the threshold range by 35-45 MB, and the running process is a browser, generating another complex event for browsing the webpage;
if the number of pixels moved by the mouse in unit time exceeds or is equal to 1550-1650 pixels in the threshold range, the internet traffic exceeds or is equal to 35-45 MB in the threshold range, and the running process is a browser, generating another complex event for browsing the webpage;
if the number of times of keyboard knocking exceeds or is equal to the threshold range of 80-120 times in unit time and the running process is instant chat software, generating a complex event of online chat;
if the number of pixels moved by the mouse in unit time exceeds or is equal to 1550-1650 pixels in the threshold range, the number of times of keyboard knocking in unit time exceeds or is equal to 80-120 times in the threshold range, and the running process is instant chat software, generating another complex event of online chatting;
if the internet traffic exceeds or equals to the threshold range of 35-45 MB, generating a complex event for downloading data;
step 3, generalizing the occurrence time of the complex event, namely dividing one day into five time periods of 6: 00-11: 00, 11: 00-14: 00, 14: 00-18: 00, 18: 00-23: 00 and 23: 00-6: 00, and normalizing the duration, namely rounding the duration of the complex event by taking 10 minutes as a unit;
step 4, acquiring the same complex events carried out among a plurality of network addiction users and the same complex events carried out among a plurality of non-network addiction users, collecting the generation times of the same complex events, selecting the complex events larger than a threshold value as frequent network addiction complex events and frequent non-network addiction complex events according to a time threshold value set by the user, then acquiring a plurality of generators for describing user behavior characteristic distribution in a set formed by the frequent network addiction complex events, acquiring a plurality of generators for describing user behavior characteristic distribution in a set formed by the frequent non-network addiction complex events, and sending the generators to a sample library construction module;
step 5, calculating scores of the generators of the frequent Internet addiction complex events by adopting a generator-based sample construction algorithm, if the scores exceed a threshold value, saving the generators as Internet addicts, otherwise, deleting the generators, and saving the generators of the frequent non-Internet addiction complex events as non-Internet addicts; or calculating the frequent complex event producers by adopting a sample acquisition algorithm based on the exposure mode to obtain and store the producers of the exposure mode;
step 6, collecting interaction variables between the user to be tested and the computer within a period of time, and repeating the step 2 to the step 3;
step 7, acquiring the same complex events performed by a user to be detected within a period of time, collecting the generation times of the same complex events, selecting the complex events larger than a threshold value as frequent complex events according to a time threshold value set by the user, acquiring a generator for describing behavior feature distribution of a set formed by the frequent complex events, and sending the generator to a data processing and detecting module;
step 8, comparing the frequent behavior generator to be detected with a network addiction sample library, if the frequent complex event generator to be detected is contained in the sample library, judging that the frequent behavior generator is network addiction, otherwise, judging that the frequent behavior generator is non-network addiction, and outputting a judgment result; or comparing the to-be-detected frequent behavior generator with a sample library, respectively calculating the net addiction score and the non-net addiction score by adopting a net addiction detection algorithm based on an exposure mode, and judging the scores of the net addiction score and the non-net addiction score, wherein the to-be-detected frequent complex events belong to the category with one higher score;
step 9, if the computer interaction behavior of the user is detected to be the internet addiction behavior, an early warning intervention module is adopted to send out an early warning prompt, and the time range within which the user can continue to use the computer is limited by a timer for 1-2 hours; if the time limit range is exceeded, the computer is shut down.
3. The method for detecting internet addiction detection device based on user computer interaction events as claimed in claim 2, wherein: respective thresholds for the simple events described in step 2: the utilization rate of a computer CPU ranges from 45% to 55%, the utilization rate of a memory ranges from 55% to 65%, the left button clicking frequency of a mouse in unit time ranges from 25 to 35 times, the right button clicking frequency of the mouse in unit time ranges from 10 to 20 times, the keyboard clicking frequency in unit time ranges from 80 to 120, the pixel moving frequency of the mouse in unit time ranges from 1550 to 1650, and the network flow ranges from 35 to 45 MB.
4. The method for detecting internet addiction detection device based on user computer interaction events as claimed in claim 2, wherein: step 5, calculating scores of the frequent net addiction complex event generators by adopting a generator-based sample construction algorithm, which specifically comprises the following steps:
step 5-1, determining a weight coefficient of a complex event type according to user requirements, ensuring that the weight coefficient of the instant strategy game is more than the weight coefficient of online watching video, more than the weight coefficient of a table game, more than the weight coefficient of browsing a webpage, more than the weight coefficient of offline watching video, more than the weight coefficient of online chatting, more than the downloaded weight coefficient, and the total weight of the weight coefficients is 1;
step 5-2, determining a weight coefficient of a time generalization section of the complex event according to user requirements, ensuring that the dawn weight coefficient is greater than the evening weight coefficient is greater than the noon weight coefficient is greater than the afternoon weight coefficient is greater than the morning weight coefficient, and the total number of the weight coefficients is 1;
step 5-3, multiplying each complex event type weight coefficient and the complex event time generalization section weight coefficient in the generator by the duration to obtain a score, and summing the scores of each complex event to obtain the score of the generator;
and 5-4, if the score exceeds a threshold value, keeping the threshold value range from 1 to 6, and otherwise, deleting the score.
5. The method for detecting internet addiction detection device based on user computer interaction events as claimed in claim 2, wherein: step 5, calculating the frequent complex event generators by adopting a sample construction algorithm based on the exposure mode, obtaining the generators of the exposure mode and storing the generators in a classified manner, specifically:
step 5-a, obtaining producers belonging to frequent net addiction complex events but not to frequent non-net addiction complex events;
step 5-b, acquiring producers which do not belong to the frequent Internet addiction complex events but belong to the frequent non-Internet addiction complex events;
step 5-c, obtaining a common producer belonging to both frequent and frequent non-network addiction complex events, determining the support degree, namely the occurrence rate, of the producer in the frequent and frequent network addiction complex events, and determining the support degree in the frequent and non-network addiction complex events;
step 5-d, calculating the growth rate of the joint producers in the frequent net addiction complex events, namely dividing the support degree of the joint producers in the frequent net addiction complex events by the support degree of the joint producers in the frequent non-net addiction complex events; calculating the growth rate of the joint product in the frequent non-network addiction complex events, namely dividing the support degree of the joint product in the frequent non-network addiction complex events by the support degree in the frequent network addiction complex events;
step 5-e, comparing the two growth rates, reserving the producers with the growth rates larger than 1, and deleting the producers with the growth rates smaller than 1;
and 5-f, merging the generation sub-sets in the frequent network addiction complex events, and merging the generation sub-sets in the frequent non-network addiction complex events, namely acquiring and storing the generation sub-sets in the exposure mode.
6. The method for detecting internet addiction detection device based on user computer interaction events as claimed in claim 2, wherein: extraction as described in step 8Respectively calculating net addiction score and non-net addiction score with net addiction detection algorithm based on exposure mode, and scoring score (C, D)i) The formula is as follows:
Score ( C , D i | j ) = &Sigma; X &Element; RS Gr ( X , D j | i , D i | j ) Gr ( X , D j | i , D i | j ) + Max Gr RS &times; Sup c ( X )
+ &Sigma; Y &Element; WS 1 Gr ( Y , D i | j , D j | i ) + Min Gr WS &times; Sup c ( Y ) , ( i &NotEqual; j ) - - - ( 2 )
wherein: di|jRepresents DiSample library or DjSample library, Dj|iRepresents DjSample library or DiSample library, i and j are sample library class tag numbers, X denotes DiY represents DjThe generator of the exposure mode of (1); RS is the forward sample set, i.e. the actively acting set, of the target class, WS is the reverse sample set, i.e. the negatively acting set, of the target class; gr (X, D)j|i,Di|j) Represents the growth rate of X from one class to another; MaxGrRSRepresenting a maximum growth rate threshold in the set of target classes, RS representing a target classSet of forward samples, MinGrWSRepresenting the minimum growth rate threshold value in the non-target class set, WS representing the reverse sample set of the target class, C representing the complex event to be classified, Supc(X) represents the support of X in C, Supc(Y) represents the degree of support of Y in C.
CN201310368605.1A 2013-08-20 2013-08-20 Network addiction checkout gear and method based on subscriber computer alternative events Active CN103413054B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310368605.1A CN103413054B (en) 2013-08-20 2013-08-20 Network addiction checkout gear and method based on subscriber computer alternative events

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310368605.1A CN103413054B (en) 2013-08-20 2013-08-20 Network addiction checkout gear and method based on subscriber computer alternative events

Publications (2)

Publication Number Publication Date
CN103413054A true CN103413054A (en) 2013-11-27
CN103413054B CN103413054B (en) 2016-05-11

Family

ID=49606065

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310368605.1A Active CN103413054B (en) 2013-08-20 2013-08-20 Network addiction checkout gear and method based on subscriber computer alternative events

Country Status (1)

Country Link
CN (1) CN103413054B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103955614A (en) * 2014-04-29 2014-07-30 北京盛世光明软件股份有限公司 Method and system for predicting psychological crisis
CN104714449A (en) * 2015-03-09 2015-06-17 湖南工学院 Method and device for obtaining operation data for man-machine interaction task
CN104778591A (en) * 2015-04-01 2015-07-15 北京三快在线科技有限公司 Extracting and identifying methods of feature information of abnormal behavior and devices
WO2016095516A1 (en) * 2014-12-15 2016-06-23 华为技术有限公司 Complex event processing method, apparatus and system
TWI576763B (en) * 2015-05-29 2017-04-01 yu-xuan Lin A method of assessing the extent to which a user is using a portable mobile device
CN107707421A (en) * 2017-08-16 2018-02-16 深信服科技股份有限公司 User's online recognition methods, device and storage medium
CN108848109A (en) * 2018-08-06 2018-11-20 山东亚圣家丁传统文化传承有限公司 A kind of online monitoring method and system
CN109862392A (en) * 2019-03-20 2019-06-07 济南大学 Identification method, system, device and medium of Internet game video traffic

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1858759A (en) * 2006-03-10 2006-11-08 华为技术有限公司 Method and system for limiting time of network gaming user
CN1959689A (en) * 2005-12-29 2007-05-09 康佳集团股份有限公司 Method for preventing indulging in games of handset
CN101034421A (en) * 2006-03-07 2007-09-12 上海新致软件有限公司 Control method for preventing indulging network game and device thereof
US20110250576A1 (en) * 2010-03-16 2011-10-13 Reid Kevin Hester System and method for recovering form addictions

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1959689A (en) * 2005-12-29 2007-05-09 康佳集团股份有限公司 Method for preventing indulging in games of handset
CN101034421A (en) * 2006-03-07 2007-09-12 上海新致软件有限公司 Control method for preventing indulging network game and device thereof
CN1858759A (en) * 2006-03-10 2006-11-08 华为技术有限公司 Method and system for limiting time of network gaming user
US20110250576A1 (en) * 2010-03-16 2011-10-13 Reid Kevin Hester System and method for recovering form addictions

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LI,J ET AL: "Mining statistically important equivalence classes and delta discriminative emerging patterns", 《PROCEEDINGS OF THE 13TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING》, 31 December 2007 (2007-12-31), pages 430 - 439 *
YAXIN YU ET AL: "Mining Emerging Patterns of PIU from Computer-Mediated Interaction Events", 《9TH INTERNATIONAL WORKSHOP,ADMI 2013,SAINT PAUL,MN,USA,MAY 6-7,2013》, 6 May 2013 (2013-05-06), pages 66 - 78 *
李卫民: "EP算法在电信客户细分中的应用", 《产经》, no. 1, 31 December 2010 (2010-12-31), pages 142 - 143 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103955614A (en) * 2014-04-29 2014-07-30 北京盛世光明软件股份有限公司 Method and system for predicting psychological crisis
US10915822B2 (en) 2014-12-15 2021-02-09 Huawei Technologies Co., Ltd. Complex event processing method, apparatus, and system
WO2016095516A1 (en) * 2014-12-15 2016-06-23 华为技术有限公司 Complex event processing method, apparatus and system
CN105786451A (en) * 2014-12-15 2016-07-20 华为技术有限公司 Method, device and system for processing complicated event
CN104714449A (en) * 2015-03-09 2015-06-17 湖南工学院 Method and device for obtaining operation data for man-machine interaction task
CN104714449B (en) * 2015-03-09 2018-02-27 湖南工学院 The method and apparatus for obtaining the operation data for man-machine interaction task
CN104778591A (en) * 2015-04-01 2015-07-15 北京三快在线科技有限公司 Extracting and identifying methods of feature information of abnormal behavior and devices
CN104778591B (en) * 2015-04-01 2018-05-22 北京三快在线科技有限公司 A kind of extraction, recognition methods and the device of the characteristic information of abnormal behaviour
TWI576763B (en) * 2015-05-29 2017-04-01 yu-xuan Lin A method of assessing the extent to which a user is using a portable mobile device
CN107707421A (en) * 2017-08-16 2018-02-16 深信服科技股份有限公司 User's online recognition methods, device and storage medium
CN108848109A (en) * 2018-08-06 2018-11-20 山东亚圣家丁传统文化传承有限公司 A kind of online monitoring method and system
CN109862392A (en) * 2019-03-20 2019-06-07 济南大学 Identification method, system, device and medium of Internet game video traffic
CN109862392B (en) * 2019-03-20 2021-04-13 济南大学 Identification method, system, device and medium of Internet game video traffic

Also Published As

Publication number Publication date
CN103413054B (en) 2016-05-11

Similar Documents

Publication Publication Date Title
CN103413054B (en) Network addiction checkout gear and method based on subscriber computer alternative events
Gama et al. A survey on concept drift adaptation
CN106650780B (en) Data processing method and device, classifier training method and system
CN104216954B (en) The prediction meanss and Forecasting Methodology of accident topic state
Le Merrer et al. Setting the record straighter on shadow banning
CN103176982B (en) The method and system that a kind of e-book is recommended
CN113139134B (en) Method and device for predicting popularity of user-generated content in social network
Tabibian et al. Distilling information reliability and source trustworthiness from digital traces
CN111754241B (en) User behavior perception method, device, equipment and medium
CN110297207A (en) Method for diagnosing faults, system and the electronic device of intelligent electric meter
CN109933720A (en) A Dynamic Recommendation Method Based on Adaptive Evolution of User Interests
CN112069039A (en) Monitoring and predicting alarm method and device for artificial intelligence development platform and storage medium
Li et al. WeSeer: Visual analysis for better information cascade prediction of WeChat articles
US11665185B2 (en) Method and apparatus to detect scripted network traffic
Zhang et al. Social network information propagation model based on individual behavior
CN111340540B (en) Advertisement recommendation model monitoring method, advertisement recommendation method and advertisement recommendation model monitoring device
US20170004401A1 (en) Artificial intuition
CN118233851B (en) 5G message push task management system, method, equipment and medium
US12169526B2 (en) Generating and presenting a text-based graph object
US8930362B2 (en) System and method for streak discovery and prediction
Zhang AI-driven Statistical Modeling for Social Network Analysis
CN118708444B (en) A server operation and maintenance management method and system for self-recommended operation
Costa et al. Mining User Activity Data in Social Media Services
Yuan et al. Behavioral Homophily in Social Media via Inverse Reinforcement Learning: A Reddit Case Study
Balti et al. Occupancy detection for enhanced energy disaggregation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant