[go: up one dir, main page]

CN108287989A - A kind of man-machine recognition methods of sliding identifying code based on track - Google Patents

A kind of man-machine recognition methods of sliding identifying code based on track Download PDF

Info

Publication number
CN108287989A
CN108287989A CN201810050045.8A CN201810050045A CN108287989A CN 108287989 A CN108287989 A CN 108287989A CN 201810050045 A CN201810050045 A CN 201810050045A CN 108287989 A CN108287989 A CN 108287989A
Authority
CN
China
Prior art keywords
feature
trajectory
track
machine recognition
extract
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810050045.8A
Other languages
Chinese (zh)
Other versions
CN108287989B (en
Inventor
张敏
陈媛
阳小龙
朱翔宇
孙奇福
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongzi Highway Maintenance And Inspection Technology Co Ltd
CHECC Data Co Ltd
Original Assignee
University of Science and Technology Beijing USTB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology Beijing USTB filed Critical University of Science and Technology Beijing USTB
Priority to CN201810050045.8A priority Critical patent/CN108287989B/en
Publication of CN108287989A publication Critical patent/CN108287989A/en
Application granted granted Critical
Publication of CN108287989B publication Critical patent/CN108287989B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication
    • G06F21/36User authentication by graphic or iconic representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)

Abstract

The man-machine recognition methods of sliding identifying code based on track that the invention discloses a kind of, includes the following steps:Acquire user trajectory data;Multidimensional characteristic system is built according to track data;Track differentiation is carried out to multidimensional characteristic system according to designed man-machine identification model.In the present invention; by combining two kinds of phenomenons of mankind track to carry out the design of multidimensional characteristic system; it is accustomed to the sliding identifying code of feature description user; and then user's operation and machine imitation are distinguished; advantage can be occupied in the confrontation with the black production tool of attacker, plays preferable confrontation protective effect.

Description

一种基于轨迹的滑动验证码人机识别方法A Trajectory-Based Sliding Verification Code Human-Machine Recognition Method

技术领域technical field

本发明涉及生物认证技术领域,尤其涉及一种基于轨迹的滑动验证码人机识别方法。The invention relates to the technical field of biometric authentication, in particular to a trajectory-based human-machine identification method for sliding verification codes.

背景技术Background technique

滑动验证码作为一种生物认证技术,能够满足当前网络环境对身份认证安全性需求,已经广泛应用于多种人机验证产品中,此验证方法不仅便于用户的理解记忆,同时极大增加了暴力破解难度。与此同时,也受到了攻击者们的重点关注,攻击者们开发出能够模仿人类行为的黑产工具开始对滑动验证码验证过程中的鼠标轨迹进行挑战。As a biometric authentication technology, the sliding verification code can meet the security requirements of the current network environment for identity authentication. It has been widely used in a variety of man-machine verification products. This verification method is not only easy for users to understand and remember, but also greatly increases the violence. Crack difficulty. At the same time, it has also attracted the attention of attackers. Attackers have developed black production tools that can imitate human behavior and began to challenge the mouse trajectory during the verification process of sliding verification codes.

攻击者通过黑产工具产生类人轨迹批量操作以绕过检测,并在对抗过程中不断升级其伪造数据以持续绕过同样升级的检测技术。现有检测技术主要针对机器进行识别,针对不断更新的机器行为进行对抗的方式具有滞后性,检测更新往往在黑产工具造成一定损失之后。因此,在双方都不断升级的技术对抗中,如何在与攻击者的黑产工具的对抗中占据优势,就显得尤为重要。The attackers generate human-like trajectory batch operations through black production tools to bypass detection, and continuously upgrade their forged data during the confrontation process to continue to bypass the same upgraded detection technology. Existing detection technologies are mainly aimed at machine identification, and the way of fighting against constantly updated machine behavior has a lag, and detection updates are often after certain losses are caused by black production tools. Therefore, in the escalating technological confrontation between both sides, how to gain an advantage in the confrontation with the attacker's black production tools is particularly important.

发明内容Contents of the invention

本发明的目的在于:提供一种基于轨迹的滑动验证码人机识别方法,旨在构建多维有效特征体系对滑动验证码的触发者进行识别,确保验证行为所保护的网络环境的安全。The purpose of the present invention is to provide a trajectory-based man-machine recognition method for sliding verification codes, aiming to build a multi-dimensional effective feature system to identify the triggerer of sliding verification codes and ensure the security of the network environment protected by the verification behavior.

本发明采用的技术方案如下:The technical scheme that the present invention adopts is as follows:

一种基于轨迹的滑动验证码人机识别方法,包括以下步骤:A trajectory-based human-machine recognition method for sliding verification codes, comprising the following steps:

S1:采集用户轨迹数据;S1: collect user trajectory data;

S2:根据轨迹数据构建多维特征体系;S2: Construct a multi-dimensional feature system based on trajectory data;

S3:根据设计好的人机识别模型对多维特征体系进行轨迹区分。S3: According to the designed human-machine recognition model, the trajectory of the multi-dimensional feature system is distinguished.

进一步的,所述多维特征体系包括X特征、Y特征、T特征。Further, the multi-dimensional feature system includes X features, Y features, and T features.

进一步的,所述X特征提取具体步骤如下:Further, the specific steps of the X feature extraction are as follows:

S201:提取X特征类,对轨迹横向坐标x进行归一化处理;S201: Extracting the X feature class, and normalizing the horizontal coordinate x of the trajectory;

S202:将轨迹横向坐标分为前半段和后半段;S202: Divide the horizontal coordinates of the trajectory into the first half and the second half;

S203:分别提取轨迹前半段x front、后半段x rear、前半段领位差x front diff、后半段领位差x rear diff、停止段final stop多个X特征群;S203: Extract multiple X feature groups of the first half of the trajectory x front, the second half x rear, the first half of the leading position difference x front diff, the second half of the leading position difference x rear diff, and the stop segment final stop;

S204:提取每个X特征群中的特征,包括最大值、峰值、中值、方差、最小值、极差。S204: Extract features in each X feature group, including maximum value, peak value, median value, variance, minimum value, and extreme difference.

进一步的,所述Y特征提取具体步骤如下:Further, the specific steps of the Y feature extraction are as follows:

S211:提取Y特征类,对轨迹纵向坐标y进行归一化处理;S211: Extract the Y feature class, and perform normalization processing on the longitudinal coordinate y of the trajectory;

S212:分别提取轨迹全段y、折半y half、全段邻位差y diff、全段邻位差的邻位差y diff diff多个Y特征群;S212: Extract multiple Y feature groups of the entire trajectory y, the half y half, the entire adjacent difference y diff, and the adjacent difference y diff diff of the entire trajectory;

S213:提取每个Y特征群中的特征,包括方差、平均值、极差、和值。S213: Extract features in each Y feature group, including variance, average, range, and value.

进一步的,所述T特征提取具体步骤如下:Further, the specific steps of the T feature extraction are as follows:

S221:提取T特征类,对时间特征t进行归一化处理;S221: extract T feature class, and perform normalization processing on time feature t;

S222:提取T-X特征群,利用归一化后的横向坐标x减去归一化后的时间特征t;S222: Extract the T-X feature group, and subtract the normalized time feature t from the normalized horizontal coordinate x;

S223:提取T-X特征群中的特征,包括最大值、峰值、中值、方差、最小值、极差。S223: Extract features in the T-X feature group, including maximum value, peak value, median value, variance, minimum value, and extreme difference.

进一步的,所述步骤S3人机识别模型设计具体步骤如下:Further, the specific steps of the design of the human-machine recognition model in step S3 are as follows:

S301:将多维特征体系中的特征输入到多个训练模型中进行算法训练;S301: Input the features in the multi-dimensional feature system into multiple training models for algorithm training;

S302:将特征算法训练输出进行线性加权。S302: Linearly weight the training output of the feature algorithm.

进一步的,所述训练模型包括:CatBoost模型、XGBoost模型、RandomForest模型、LogisticRegression模型。Further, the training model includes: CatBoost model, XGBoost model, RandomForest model, LogisticRegression model.

综上所述,由于采用了上述技术方案,本发明的有益效果是:In summary, owing to adopting above-mentioned technical scheme, the beneficial effect of the present invention is:

1、本发明中,通过结合人类轨迹的两种现象进行多维特征体系的设计,用特征描述用户的滑动验证习惯,进而将用户操作与机器模仿进行区别开来,在与攻击者的黑产工具的对抗中能占据优势,起到较好的对抗保护作用。1. In the present invention, the multi-dimensional feature system is designed by combining the two phenomena of human trajectories, using features to describe the user's sliding verification habits, and then distinguishing user operations from machine imitation. It can take advantage in the confrontation and play a better role in protection against confrontation.

2、本发明中,通过采用横向特征x为主,描述“人”在进行滑动验证码时的行为习惯,并用纵向特征y描述“机器”的特点,用时间特征t作为补充描述“人”与“机器”的区别,能够更加准确的将用户操作与机器模仿进行区别开来,提高轨迹区分的正确率。2. In the present invention, by using the horizontal feature x as the main method, the behavior habits of "people" when performing sliding verification codes are described, and the vertical feature y is used to describe the characteristics of "machines", and the time feature t is used as a supplement to describe the relationship between "people" and The difference between "machines" can more accurately distinguish user operations from machine imitation, and improve the accuracy of trajectory distinction.

3、本发明中,实际验证效果在200万条轨迹记录的测试集上,准确率、召回率的调和F值达到88.56,远高于以描述“机器”为主的方案的效果87.89。3. In the present invention, the actual verification effect is on a test set of 2 million track records, and the harmonic F value of accuracy and recall reaches 88.56, which is much higher than the effect of 87.89 for the scheme mainly describing "machines".

附图说明Description of drawings

图1为本发明多维特征体系特征关系图;Fig. 1 is a feature relationship diagram of the multi-dimensional feature system of the present invention;

图2为本发明多维特征体系概念图;Fig. 2 is a conceptual diagram of the multi-dimensional feature system of the present invention;

图3为本发明人机识别模型关系图。Fig. 3 is a relational diagram of the human-machine recognition model of the present invention.

具体实施方式Detailed ways

为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

实施例1Example 1

一种基于轨迹的滑动验证码人机识别方法,包括以下步骤:A trajectory-based human-machine recognition method for sliding verification codes, comprising the following steps:

S1:采集用户轨迹数据;S1: collect user trajectory data;

采集用户轨迹数据(x,y,t),包括轨迹触发过程中,不同时间点t的横向坐标x,纵向坐标y,具体来说,就是获得用户在进行滑动验证码码触发过程的轨迹记录,为滑动验证码码多维特征体系的构建提供数据支持。Collect user trajectory data (x, y, t), including the horizontal coordinate x and vertical coordinate y at different time points t during the trajectory triggering process. Specifically, it is to obtain the trajectory record of the user during the sliding verification code trigger process, Provide data support for the construction of the multi-dimensional feature system of the sliding verification code code.

S2:根据轨迹数据构建多维特征体系;S2: Construct a multi-dimensional feature system based on trajectory data;

基于两种模式的发现,其一,人类轨迹的末端折回现象;其二,人类轨迹的远急近缓现象;进行多维特征提取,进而构建多维特征体系,便于后期人机识别模型的算法训练,用特征描述“人”,而非用特征描述‘机器’,寻二者之异。Based on the discovery of two modes, one is the end-retracement phenomenon of human trajectories; the other is the phenomenon of long-distance and short-distance human trajectories; multi-dimensional feature extraction is carried out, and then a multi-dimensional feature system is constructed to facilitate the algorithm training of the human-machine recognition model in the later stage. Use characteristics to describe "people" instead of using characteristics to describe "machines", and find the difference between the two.

S3:根据设计好的人机识别模型对多维特征体系进行轨迹区分;S3: According to the designed man-machine recognition model, the trajectory of the multi-dimensional feature system is distinguished;

将多维特征体系特征输入设计好的人机识别模型进行人机识别模型优化,将多维特征体特征输入人机识别模型进行特征学习,得到不同的概率值,通过线性加权的方式让人机识别模型输出的概率更能接近轨迹的真实类型。Input the multi-dimensional feature system features into the designed man-machine recognition model to optimize the man-machine recognition model, input the multi-dimensional feature body features into the man-machine recognition model for feature learning, obtain different probability values, and make the machine-machine recognition model through linear weighting The output probabilities are closer to the true types of trajectories.

本发明中,通过对人类轨迹进行分析,基于两种模式的发现,模式一:人类轨迹的折回现象,模式二:人类轨迹的远急近缓现象,进行多维度特征体系的构建,进而对人机识别模型进行设计,如图1所示,通过将用户轨迹对应的训练集进行多维特征体系构建,进而对人机识别模型进行训练,并将预测轨迹对应预测集进行相同的多维特征体系构建并输入训练好的人机识别模型中进行模型优化和轨迹类别区分训练,并将训练输出进行线性加权,得到区分轨迹类别的概率值。In the present invention, through the analysis of the human trajectory, based on the discovery of two modes, the first mode: the turning back phenomenon of the human trajectory, and the second mode: the rapid and slow phenomenon of the human trajectory, the construction of a multi-dimensional feature system is carried out, and then the human As shown in Figure 1, the human-machine recognition model is trained by constructing a multi-dimensional feature system for the training set corresponding to the user trajectory, and the same multi-dimensional feature system is constructed for the prediction set corresponding to the predicted trajectory. Input the trained man-machine recognition model for model optimization and trajectory classification training, and linearly weight the training output to obtain the probability value of distinguishing trajectory categories.

实施例2Example 2

在实施例1的基础上,所述多维特征体系包括X特征、Y特征、T特征。On the basis of Example 1, the multi-dimensional feature system includes X features, Y features, and T features.

本发明采用横向特征x为主,描述“人”在进行滑动验证时的行为习惯,并用纵向特征y描述“机器”的特点,用时间特征T作为补充描述“人”与“机器”的区别,如图2所示。The present invention mainly uses the horizontal feature x to describe the behavior habits of "people" when performing sliding verification, and uses the longitudinal feature y to describe the characteristics of "machines", and uses the time feature T as a supplement to describe the difference between "human" and "machine". as shown in picture 2.

进一步的,所述X特征提取具体步骤如下:Further, the specific steps of the X feature extraction are as follows:

S201:提取X特征类,对轨迹横向坐标x进行归一化处理;S201: Extracting the X feature class, and normalizing the horizontal coordinate x of the trajectory;

S202:结合人类轨迹模式二的“远急近缓”现象,将将轨迹分为前半段和后半段;S202: Combining the phenomenon of "far, fast, near slow" in human trajectory mode 2, the trajectory will be divided into the first half and the second half;

具体来说,“远急近缓”表明在滑动验证码的过程中,在离目标点较远时速度较快,在离目标点较近时速度较慢。因此,横向x特征群构造上将轨迹分为前、后半段分别提取。Specifically, "far, fast, near, slow" indicates that in the process of sliding the verification code, the speed is faster when the distance from the target point is farther, and the speed is slower when the distance is closer to the target point. Therefore, in the structure of the horizontal x feature group, the trajectory is divided into the front half and the second half, which are extracted separately.

S203:分别提取轨迹前半段x front、后半段x rear、前半段领位差x front diff、后半段领位差x rear diff、停止段final stop多个X特征群;S203: Extract multiple X feature groups of the first half of the trajectory x front, the second half x rear, the first half of the leading position difference x front diff, the second half of the leading position difference x rear diff, and the stop segment final stop;

结合人类轨迹模式一的“末端折回”现象,构造提取停止段final stop特征群。Combined with the phenomenon of "terminal retracement" in human trajectory mode 1, the final stop feature group of the extraction stop segment is constructed.

具体来说,从轨迹数据(x,y,t)中提取出整个轨迹的横向坐标数据组成横向序列{x1,x2,...,xt,...,xn},取轨迹序列中的前半段{x1,x2,...,xn/2}组成x_front,取轨迹序列中的后半段{xn/2,xn/2+1,...,xn}组成x_rear,取轨迹序列中的前半段的邻位差{x2-x1,x3-x2,...}组成x_front_diff,取轨迹序列中的后半段的邻位差{...,xn-1-xn-2,xn-xn-1}组成x_rear_diff,结合人类轨迹模式一,取轨迹序列中的最后五分之一组成停止段final_stop。Specifically, the horizontal coordinate data of the entire trajectory is extracted from the trajectory data (x,y,t) to form a horizontal sequence {x 1 ,x 2 ,...,x t ,...,x n }, and the trajectory The first half of the sequence {x 1 ,x 2 ,...,x n/2 } forms x_front, and the second half of the trajectory sequence {x n/2 ,x n/2+1 ,...,x n } form x_rear, take the adjacent position difference {x 2 -x 1 ,x 3 -x 2 ,...} in the first half of the trajectory sequence to form x_front_diff, take the adjacent position difference {. .., x n-1 -x n-2 , x n -x n-1 } form x_rear_diff, combined with human trajectory mode 1, take the last fifth of the trajectory sequence to form the stop segment final_stop.

S204:提取每个X特征群中的特征,包括最大值、峰值、中值、方差、最小值、极差。S204: Extract features in each X feature group, including maximum value, peak value, median value, variance, minimum value, and extreme difference.

本发明中,从多维度设计特征体系的横向特征,能够更好的为模型提供输入。In the present invention, the lateral features of the feature system are designed from multiple dimensions, which can better provide input for the model.

进一步的,所述Y特征获取具体步骤如下:Further, the specific steps for obtaining the Y feature are as follows:

S211:提取Y特征类,对轨迹纵向坐标y进行归一化处理;S211: Extract the Y feature class, and perform normalization processing on the longitudinal coordinate y of the trajectory;

具体来说,从轨迹数据(x,y,t)中提取出整个轨迹的纵向坐标数据组成横向序列{y1,y2,...,yt,...,yn},对序列进行归一化处理。Specifically, the longitudinal coordinate data of the entire trajectory is extracted from the trajectory data (x,y,t) to form a horizontal sequence {y 1 ,y 2 ,...,y t ,...,y n }, for the sequence Perform normalization.

S212:分别提取轨迹全段y、折半y half、全段邻位差y diff、全段邻位差的邻位差y diff diff多个Y特征群;S212: Extract multiple Y feature groups of the entire trajectory y, the half y half, the entire adjacent difference y diff, and the adjacent difference y diff diff of the entire trajectory;

具体来说,对归一化后的纵向序列{y1,y2,...,yt,...,yn},取轨迹序列中的全段{y1,y2,...,yt,...,yn}组成y,取轨迹序列中的全段{y1,y2,...,yt,...,yn}分别减去0.5后组成y_half,取轨迹序列中的全段的邻位差{y2-y1,y3-y2,...}组成y_diff,取轨迹序列中的全段的邻位差的邻位差{(y3-y2)-(y2-y1),...}组成y_diffdiff。Specifically, for the normalized longitudinal sequence {y 1 ,y 2 ,...,y t ,...,y n }, take the entire segment {y 1 ,y 2 ,.. .,y t ,...,y n } form y, take the entire segment {y 1 ,y 2 ,...,y t ,...,y n } in the trajectory sequence and subtract 0.5 to form y_half , take the adjacent position difference {y 2 -y 1 ,y 3 -y 2 ,...} of the entire segment in the trajectory sequence to form y_diff, and take the adjacent position difference of the entire segment in the trajectory sequence {(y 3 -y 2 )-(y 2 -y 1 ),...} form y_diffdiff.

S213:提取每个Y特征群中的特征,包括方差、平均值、极差、和值。S213: Extract features in each Y feature group, including variance, average, range, and value.

当横向特征X为在某条轨迹的描述性较弱时,对“机器”具有较好描述性的纵向特征y可以在模型判别中起到辅助作用。When the horizontal feature X is weakly descriptive on a certain trajectory, the vertical feature y that is more descriptive of the "machine" can play an auxiliary role in model discrimination.

进一步的,所述T特征获取具体步骤如下:Further, the specific steps of obtaining the T feature are as follows:

S221:提取T特征类,对时间特征t进行归一化处理;S221: extract T feature class, and perform normalization processing on time feature t;

具体来说,独立的时间序列只是采样标志,不具有良好的可解释性,但当与横向x序列结合后则具有较好的表述意义。Specifically, the independent time series is just a sampling mark, which does not have good interpretability, but it has a better expressive meaning when combined with the horizontal x-series.

利用归一化的X减T在另一层面上表示轨迹产生过程中的速度情况,提供给模型更好的特征输入。The normalized X minus T is used to represent the velocity during trajectory generation on another level, providing better feature input to the model.

S222:提取T-X特征群,利用归一化后的横向坐标x减去归一化后的时间特征t;S222: Extract the T-X feature group, and subtract the normalized time feature t from the normalized horizontal coordinate x;

具体来说,从轨迹数据(x,y,t)中提取出整个轨迹的横向坐标数据组成横向序列{x1,x2,...,xt,...,xn}和时间序列{t1,t2,...,tn},分别进行归一化,然后将横向序列{x1,x2,...,xt,...,xn}减去时间序列{t1,t2,...,tn}。Specifically, the horizontal coordinate data of the entire trajectory is extracted from the trajectory data (x,y,t) to form a horizontal sequence {x 1 ,x 2 ,...,x t ,...,x n } and a time series {t 1 ,t 2 ,...,t n }, normalize respectively, and then subtract the time series from the horizontal series {x 1 ,x 2 ,...,x t ,...,x n } {t 1 ,t 2 ,...,t n }.

S223:提取T-X特征群中的特征,包括最大值、峰值、中值、方差、最小值、极差。S223: Extract features in the T-X feature group, including maximum value, peak value, median value, variance, minimum value, and extreme difference.

综上,本实施例中提供的特征群、特征列表如表1所示:In summary, the feature groups and feature lists provided in this embodiment are shown in Table 1:

实施例3Example 3

在实施例1的基础上,所述步骤S3人机识别模型设计具体步骤如下:On the basis of embodiment 1, the specific steps of the design of the human-machine recognition model of the step S3 are as follows:

S301:分别将多维特征体系中的特征输入到多个训练模型中进行算法训练;S301: Input the features in the multi-dimensional feature system into multiple training models for algorithm training;

S302:将特征算法训练输出进行线性加权;S302: Perform linear weighting on the training output of the feature algorithm;

进一步的,所述训练模型包括:CatBoost模型、XGBoost模型、RandomForest模型、LogisticRegression模型。Further, the training model includes: CatBoost model, XGBoost model, RandomForest model, LogisticRegression model.

如图3所示,具体来说,将CatBoost模型、XGBoost模型、RandomForest模型、LogisticRegression模型的训练输出的概率值进行线性加权,得到由四个基础模型线性加权后的人机识别模型。As shown in Figure 3, specifically, the probability values of the training outputs of the CatBoost model, XGBoost model, RandomForest model, and LogisticRegression model are linearly weighted to obtain a human-machine recognition model linearly weighted by the four basic models.

以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. within range.

Claims (7)

1.一种基于轨迹的滑动验证码人机识别方法,其特征在于,包括以下步骤:1. a track-based sliding verification code man-machine recognition method, is characterized in that, comprises the following steps: S1:采集用户轨迹数据;S1: collect user trajectory data; S2:根据轨迹数据构建多维特征体系;S2: Construct a multi-dimensional feature system based on trajectory data; S3:根据设计好的人机识别模型对多维特征体系进行轨迹区分。S3: According to the designed human-machine recognition model, the trajectory of the multi-dimensional feature system is distinguished. 2.根据权利要求1所述一种基于轨迹的滑动验证码人机识别方法,其特征在于,所述多维特征体系包括X特征、Y特征、T特征。2. A track-based human-machine recognition method for sliding verification codes according to claim 1, wherein the multi-dimensional feature system includes X features, Y features, and T features. 3.根据权利要求2所述一种基于轨迹的滑动验证码人机识别方法,其特征在于,所述X特征提取具体步骤如下:3. according to claim 2 a kind of sliding verification code man-machine recognition method based on track, it is characterized in that, described X feature extracts specific steps as follows: S201:提取X特征类,对轨迹横向坐标x进行归一化处理;S201: Extracting the X feature class, and normalizing the horizontal coordinate x of the trajectory; S202:将轨迹横向坐标分为前半段和后半段;S202: Divide the horizontal coordinates of the trajectory into the first half and the second half; S203:分别提取轨迹前半段x front、后半段x rear、前半段领位差x front diff、后半段领位差x rear diff、停止段final stop多个X特征群;S203: Extract multiple X feature groups of the first half of the trajectory x front, the second half x rear, the first half of the leading position difference x front diff, the second half of the leading position difference x rear diff, and the stop segment final stop; S204:提取每个X特征群中的特征,包括最大值、峰值、中值、方差、最小值、极差。S204: Extract features in each X feature group, including maximum value, peak value, median value, variance, minimum value, and extreme difference. 4.根据权利要求2所述一种基于轨迹的滑动验证码人机识别方法,其特征在于,所述Y特征提取具体步骤如下:4. according to claim 2, a kind of sliding verification code man-machine recognition method based on track, it is characterized in that, described Y feature extraction specific steps are as follows: S211:提取Y特征类,对轨迹纵向坐标y进行归一化处理;S211: Extract the Y feature class, and perform normalization processing on the longitudinal coordinate y of the trajectory; S212:分别提取轨迹全段y、折半y half、全段邻位差y diff、全段邻位差的邻位差ydiff diff多个Y特征群;S212: Extract multiple Y feature groups of the entire trajectory y, the half-half y half, the entire adjacent difference y diff, and the adjacent difference ydiff diff of the entire adjacent difference; S213:提取每个Y特征群中的特征,包括方差、平均值、极差、和值。S213: Extract features in each Y feature group, including variance, average, range, and value. 5.根据权利要求2所述一种基于轨迹的滑动验证码人机识别方法,其特征在于,所述T特征提取具体步骤如下:5. a kind of trajectory-based man-machine recognition method for sliding verification codes according to claim 2, is characterized in that, the specific steps of said T feature extraction are as follows: S221:提取T特征类,对时间特征t进行归一化处理;S221: extract T feature class, and perform normalization processing on time feature t; S222:提取T-X特征群,利用归一化后的横向坐标x减去归一化后的时间特征t;S222: Extract the T-X feature group, and subtract the normalized time feature t from the normalized horizontal coordinate x; S223:提取T-X特征群中的特征,包括最大值、峰值、中值、方差、最小值、极差。S223: Extract features in the T-X feature group, including maximum value, peak value, median value, variance, minimum value, and extreme difference. 6.根据权利要求1所述一种基于轨迹的滑动验证码人机识别方法,其特征在于,所述步骤S3人机识别模型设计具体步骤如下:6. according to claim 1, a kind of sliding verification code human-machine recognition method based on track, it is characterized in that, described step S3 human-machine recognition model design specific steps are as follows: S301:将多维特征体系中的特征输入到多个训练模型中进行算法训练;S301: Input the features in the multi-dimensional feature system into multiple training models for algorithm training; S302:将特征算法训练输出进行线性加权。S302: Linearly weight the training output of the feature algorithm. 7.根据权利要求6所述一种基于轨迹的滑动验证码人机识别方法,其特征在于,所述训练模型包括:CatBoost模型、XGBoost模型、RandomForest模型、LogisticRegression模型。7. A track-based human-machine recognition method for sliding verification codes according to claim 6, wherein the training models include: CatBoost models, XGBoost models, RandomForest models, and LogisticRegression models.
CN201810050045.8A 2018-01-18 2018-01-18 A Trajectory-based Sliding Verification Code Human-Machine Recognition Method Active CN108287989B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810050045.8A CN108287989B (en) 2018-01-18 2018-01-18 A Trajectory-based Sliding Verification Code Human-Machine Recognition Method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810050045.8A CN108287989B (en) 2018-01-18 2018-01-18 A Trajectory-based Sliding Verification Code Human-Machine Recognition Method

Publications (2)

Publication Number Publication Date
CN108287989A true CN108287989A (en) 2018-07-17
CN108287989B CN108287989B (en) 2021-05-28

Family

ID=62835600

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810050045.8A Active CN108287989B (en) 2018-01-18 2018-01-18 A Trajectory-based Sliding Verification Code Human-Machine Recognition Method

Country Status (1)

Country Link
CN (1) CN108287989B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109033793A (en) * 2018-07-11 2018-12-18 北京京东金融科技控股有限公司 Slide the man-machine recognition methods of identifying code and device, electronic equipment and readable medium
CN109086594A (en) * 2018-09-21 2018-12-25 郑州云海信息技术有限公司 A kind of method, apparatus and system for realizing the verifying of sliding-type identifying code
CN109241709A (en) * 2018-08-03 2019-01-18 平安科技(深圳)有限公司 User behavior recognition method and device based on the verifying of sliding block identifying code
CN109522692A (en) * 2018-11-19 2019-03-26 第四范式(北京)技术有限公司 Webpage machine behavioral value method and system
CN109543393A (en) * 2018-10-23 2019-03-29 武汉极意网络科技有限公司 Behavior verification method and device
CN109600361A (en) * 2018-11-26 2019-04-09 武汉极意网络科技有限公司 Identifying code anti-attack method and device based on hash algorithm
CN109726543A (en) * 2018-12-30 2019-05-07 北京城市网邻信息技术有限公司 A kind of login method of application program, device, terminal device and storage medium
CN110046647A (en) * 2019-03-08 2019-07-23 同盾控股有限公司 A kind of identifying code machine Activity recognition method and device
CN110807183A (en) * 2019-10-12 2020-02-18 广州多益网络股份有限公司 Sliding verification code man-machine behavior identification method of multi-dimensional feature system
CN110889103A (en) * 2018-09-07 2020-03-17 北京京东金融科技控股有限公司 Method and system for slider verification and model training method thereof
CN111310156A (en) * 2019-11-28 2020-06-19 苏宁金融科技(南京)有限公司 Automatic identification method and system for slider verification code
CN111666968A (en) * 2020-04-21 2020-09-15 北京奇艺世纪科技有限公司 Man-machine recognition method and device, electronic equipment and computer readable storage medium
CN114117390A (en) * 2021-11-12 2022-03-01 东方财富信息股份有限公司 Machine learning and interactive man-machine distinguishing verification generation method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105975823A (en) * 2016-05-05 2016-09-28 百度在线网络技术(北京)有限公司 Verification method and apparatus used for distinguishing man and machine
US9483292B2 (en) * 2010-11-29 2016-11-01 Biocatch Ltd. Method, device, and system of differentiating between virtual machine and non-virtualized device
CN106155298A (en) * 2015-04-21 2016-11-23 阿里巴巴集团控股有限公司 Man-machine recognition methods and device, the acquisition method of behavior characteristics data and device
CN106815514A (en) * 2015-11-27 2017-06-09 阿里巴巴集团控股有限公司 A kind of man-machine recognition methods and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9483292B2 (en) * 2010-11-29 2016-11-01 Biocatch Ltd. Method, device, and system of differentiating between virtual machine and non-virtualized device
CN106155298A (en) * 2015-04-21 2016-11-23 阿里巴巴集团控股有限公司 Man-machine recognition methods and device, the acquisition method of behavior characteristics data and device
CN106815514A (en) * 2015-11-27 2017-06-09 阿里巴巴集团控股有限公司 A kind of man-machine recognition methods and device
CN105975823A (en) * 2016-05-05 2016-09-28 百度在线网络技术(北京)有限公司 Verification method and apparatus used for distinguishing man and machine

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109033793A (en) * 2018-07-11 2018-12-18 北京京东金融科技控股有限公司 Slide the man-machine recognition methods of identifying code and device, electronic equipment and readable medium
CN109241709A (en) * 2018-08-03 2019-01-18 平安科技(深圳)有限公司 User behavior recognition method and device based on the verifying of sliding block identifying code
CN109241709B (en) * 2018-08-03 2023-04-07 平安科技(深圳)有限公司 User behavior identification method and device based on slider verification code verification
WO2020024412A1 (en) * 2018-08-03 2020-02-06 平安科技(深圳)有限公司 User behavior recognition method and apparatus based on sliding block verification code verification
CN110889103B (en) * 2018-09-07 2024-04-05 京东科技控股股份有限公司 Method and system for verifying sliding block and model training method thereof
CN110889103A (en) * 2018-09-07 2020-03-17 北京京东金融科技控股有限公司 Method and system for slider verification and model training method thereof
CN109086594A (en) * 2018-09-21 2018-12-25 郑州云海信息技术有限公司 A kind of method, apparatus and system for realizing the verifying of sliding-type identifying code
CN109543393A (en) * 2018-10-23 2019-03-29 武汉极意网络科技有限公司 Behavior verification method and device
CN109522692B (en) * 2018-11-19 2019-10-18 第四范式(北京)技术有限公司 Webpage machine behavioral value method and system
CN109522692A (en) * 2018-11-19 2019-03-26 第四范式(北京)技术有限公司 Webpage machine behavioral value method and system
CN109600361B (en) * 2018-11-26 2021-05-04 武汉极意网络科技有限公司 Hash algorithm-based verification code anti-attack method and device, electronic equipment and non-transitory computer readable storage medium
CN109600361A (en) * 2018-11-26 2019-04-09 武汉极意网络科技有限公司 Identifying code anti-attack method and device based on hash algorithm
CN109726543A (en) * 2018-12-30 2019-05-07 北京城市网邻信息技术有限公司 A kind of login method of application program, device, terminal device and storage medium
CN110046647A (en) * 2019-03-08 2019-07-23 同盾控股有限公司 A kind of identifying code machine Activity recognition method and device
CN110807183A (en) * 2019-10-12 2020-02-18 广州多益网络股份有限公司 Sliding verification code man-machine behavior identification method of multi-dimensional feature system
CN111310156A (en) * 2019-11-28 2020-06-19 苏宁金融科技(南京)有限公司 Automatic identification method and system for slider verification code
CN111310156B (en) * 2019-11-28 2022-08-19 苏宁金融科技(南京)有限公司 Automatic identification method and system for slider verification code
CN111666968A (en) * 2020-04-21 2020-09-15 北京奇艺世纪科技有限公司 Man-machine recognition method and device, electronic equipment and computer readable storage medium
CN114117390A (en) * 2021-11-12 2022-03-01 东方财富信息股份有限公司 Machine learning and interactive man-machine distinguishing verification generation method

Also Published As

Publication number Publication date
CN108287989B (en) 2021-05-28

Similar Documents

Publication Publication Date Title
CN108287989A (en) A kind of man-machine recognition methods of sliding identifying code based on track
CN109977416B (en) Multi-level natural language anti-spam text method and system
Nguyen et al. Dictionary-guided scene text recognition
CN110414219B (en) Injection attack detection method based on gated cycle unit and attention mechanism
CN105897714B (en) Botnet detection method based on DNS traffic characteristics
CN107944559B (en) Method and system for automatically identifying entity relationship
CN104408153B (en) A kind of short text Hash learning method based on more granularity topic models
CN106098059A (en) customizable voice awakening method and system
CN105069434B (en) A kind of human action Activity recognition method in video
Ge et al. Semantic-guided reinforced region embedding for generalized zero-shot learning
CN113590810B (en) Abstract generation model training method, abstract generation device and electronic equipment
CN112489689B (en) Cross-database voice emotion recognition method and device based on multi-scale difference countermeasure
CN104967587A (en) Method for identifying malicious account numbers, and apparatus thereof
CN104915399A (en) Recommended data processing method based on news headline and recommended data processing method system based on news headline
CN110457707B (en) Method and device for extracting real word keywords, electronic equipment and readable storage medium
CN105069647A (en) Improved method for extracting evaluation object in Chinese commodity review
US11281714B2 (en) Image retrieval
CN117573868A (en) A multi-dimensional duplication checking method based on neural network
Samsudin et al. Mining opinion in online messages
CN115730312A (en) Deep hash-based family malware detection method
CN110223699B (en) Speaker identity confirmation method, device and storage medium
CN107562720A (en) A kind of alarm data matching process of information network security of power system linkage defense
CN118823547A (en) Pulmonary nodule detection method based on progressive multi-scale network model
CN105975643B (en) A kind of realtime graphic search method based on text index
CN113761920B (en) A word processing method and device based on dual-task model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20211228

Address after: 908, block a, floor 8, No. 116, Zizhuyuan Road, Haidian District, Beijing 100089

Patentee after: ZHONGZI DATA CO.,LTD.

Patentee after: Zhongzi highway maintenance and Inspection Technology Co., Ltd

Address before: No. 30 Xueyuan Road, Haidian District, Beijing 100083

Patentee before: University OF SCIENCE AND TECHNOLOGY BEIJING

TR01 Transfer of patent right