[go: up one dir, main page]

CN103856476A - Method and device for identifying network robot - Google Patents

Method and device for identifying network robot Download PDF

Info

Publication number
CN103856476A
CN103856476A CN201210551708.7A CN201210551708A CN103856476A CN 103856476 A CN103856476 A CN 103856476A CN 201210551708 A CN201210551708 A CN 201210551708A CN 103856476 A CN103856476 A CN 103856476A
Authority
CN
China
Prior art keywords
label
network robot
sightless
server end
client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210551708.7A
Other languages
Chinese (zh)
Inventor
刘丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Oak Pacific Interactive Technology Development Co Ltd
Original Assignee
Beijing Oak Pacific Interactive Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Oak Pacific Interactive Technology Development Co Ltd filed Critical Beijing Oak Pacific Interactive Technology Development Co Ltd
Priority to CN201210551708.7A priority Critical patent/CN103856476A/en
Publication of CN103856476A publication Critical patent/CN103856476A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The embodiment of the invention relates to a method and device for identifying a network robot. Particularly, the method comprises the steps of generating an invisible label on a server terminal and judging whether a client terminal is a network robot or not based on whether the invisible label is triggered or not. In addition, the embodiment of the invention further provides a device corresponding to the method. The method and device can identify the network robot at high accuracy rate.

Description

For the method and apparatus of recognition network robot
Technical field
The embodiments of the present invention relate generally to computer realm, and relate more specifically to a kind of method and apparatus for recognition network robot.
Background technology
Network robot refers generally to automatically perform the software program of preplanned mission on the networks such as internet.Many network robots are used to realize malicious intent and/or take too much bandwidth resources, computational resource, and it is very important therefore identifying these network robots and limiting its behavior.Again because many network robots imitate the behavior of human user to a certain extent, so recognition network robot difficulty often.
For example, exist help online game player obtain the robot of improper income, in social networks or forum automatically the robot of releasing advertisements, automatically put up the link of pointing to certain website with improve this website search engine rank robot, depend on the website that user evaluates (such as, comment class website, restaurant, e-commerce website etc.) above automatically issue robot that user evaluates, help robot that user rushes to purchase etc.
In prior art, the means of a kind of recognition network robot are identifying codes, for example, show a stringly may and require client to input shown character string through the character string of deformation process, thereby the client that can not correctly identify this character string are identified as robot.But, exist by the method that computword is identified, the mode such as manpower batch process cracks identifying code.
Summary of the invention
In order to address the above problem, the invention provides a kind of method and apparatus for recognition network robot, utilize described method and apparatus, can be with higher accuracy rate recognition network robot.
According to an aspect of the present invention, provide a kind of method for recognition network robot, it comprises the following steps: generate sightless label at server end; And whether be triggered to judge according to described sightless label whether client is network robot.
According to a further aspect in the invention, provide a kind of equipment for recognition network robot, it comprises: generating apparatus, is configured at server end and generates sightless label; And judgment means, be configured for according to described sightless label whether be triggered to judge whether client is network robot.
The exemplary solution that exemplary embodiment of the invention provides at least can be brought following significant technique effect: by label being carried out to complicated hiding, ensure that normal users cannot trigger this label, thereby can identify the network robot that obtains this label by code analysis with higher accuracy rate.
Brief description of the drawings
Read detailed description below by reference to accompanying drawing, above-mentioned and other objects of embodiment of the present invention, feature and advantage will become easy to understand.In the accompanying drawings, show some execution modes of the present invention in exemplary and nonrestrictive mode, wherein:
Fig. 1 shows according to the flow chart of the method 100 for recognition network robot of an embodiment of the invention;
Fig. 2 shows according to the schematic block diagram of the equipment 200 for recognition network robot of an embodiment of the invention; And
Fig. 3 shows the schematic block diagram that is suitable for the computer system for putting into practice embodiment of the present invention 300.
Embodiment
Below with reference to some illustrative embodiments, principle of the present invention and spirit are described.Should be appreciated that providing these execution modes is only used to make those skilled in the art can understand better and then realize the present invention, and not limit the scope of the invention by any way.
Below with reference to Fig. 1, it shows according to the flow chart of the method 100 for recognition network robot of an embodiment of the invention.
In step S101, generate sightless label at server end.By some characteristic of browser, can be to one or more label (for example, this label can be HTML (HTML) label) (for example carry out layout, position, size, color and transparency to a certain picture in the page arrange), thereby label can be hidden to (for example, this label is transparent).But this label is again really to exist.It should be noted that above-mentioned hiding just for user; And for network robot, because network robot generally adopts label to resolve to the parsing of the page, therefore, network robot is in the time carrying out data analysis and data crawl for the page, and it can be that hiding label also captures together for normal users.
In step S102, whether be triggered to judge according to described sightless label whether client is network robot.After client logs on server, no matter be human user or network robot, in order to proceed subsequent operation, conventionally can capture the label on the page.But the label that the label that normal users can capture and network robot can capture is distinct.Therefore, by judge whether network robot of client according to such difference, can be with higher accuracy rate recognition network robot.
In one embodiment, generate sightless label at server end and can be set to transparent realization by script or the described label of CSS (CSS).Label in the page (for example has various attributes conventionally, position, size, color and transparency), by script or CSS, (for example, treating that hiding label is set to transparent) can be set to the attribute of label, thereby it is invisible to realize label in the page.
In one embodiment, generate at server end that sightless label can also cover described label to realize by script or CSS.For example, in the page, can generate a transparent picture by script or CSS, and for example, arrange to be covered by the attribute (, position) to this transparent picture and treat on hiding label, thereby it is invisible to realize label.
In one embodiment, generating sightless label at server end can carry out at random.In this way, the label of having avoided network robot to predict being hidden and and then carry out corresponding behavior, if the label prediction not being hidden captures etc.In addition, generate randomly sightless label at server end, include but not limited to: the hiding label for the treatment of that generates randomly arbitrary number at server end; And generate randomly the hiding label for the treatment of of optional position at server end.
In one embodiment, generating sightless label encryption at server end carries out.For example, can carry out dynamically changing etc. to the CSS of the page by predefined rule or by the javascript encrypting.Because the CSS of the page is static state setting, that is, CSS has pre-set the layout of the page, and for example, which label is hidden and is somewhere hidden.And by the way, after the CSS of the page has been carried out to dynamic change, thereby increased network robot and resolve the CSS of the page and judge the cost of the label that is hidden in this page.
In one embodiment, whether be triggered to judge according to described sightless label whether client is that network robot comprises: if client operates described sightless label, judge that described client is network robot.For normal users, conventionally only can operate the label that on the page, it is seen.But network robot is due to its characteristic that page is resolved, it can grab the label that normal users be can't see, and the label that this network robot may be can't see these normal users in the operation of carrying out subsequently operates.Therefore,, by judging that the label whether client be can't see normal users operates, can judge whether network robot of this client.
Should be used for describing the embodiments of the present invention by the friend recommendation of Renren Network below, it will be appreciated by those skilled in the art that following example is only for the embodiments of the present invention are described but not for limiting the protection range of the each execution mode of the present invention.The friend recommendation application of Renren Network,,, in the time that new user has carried out registration in Renren Network, the data that server can provide by this new user (for example, this user's essential information, school information, job information and personal information etc.), to its commending friends.In the time that server carries out friend recommendation application, for example, server may generate 6 good friends for recommending to this new user.But, when to this new user's commending friends, for example, the 6th good friend can be hidden, that is, if this new user is normal users, it can only see 5 good friends (, not comprising the 6th good friend who is hidden) that server is recommended; And if this new user is the webserver, (it can grab all 6 good friends, comprise the 6th good friend who is hidden), thereby in the operation of carrying out subsequently (for example,, to the be hidden the 6th operation that good friend carries out), can judge this new user is network robot.
Due to come by method according to the embodiment of the present invention recognition network robot likely trigger network robot detect that server has used embodiments of the present invention, so preferably, after identifying network robot by the method for embodiments of the present invention, by other means or in other processes network robot is forbidden.
Below with reference to Fig. 2, it shows according to the schematic block diagram of the equipment 200 for recognition network robot of an embodiment of the invention.Equipment 200 comprises: generating apparatus 201, is configured at server end and generates sightless label; And judgment means 202, be configured for according to described sightless label whether be triggered to judge whether client is network robot.
In one embodiment, generate sightless label at server end and be set to transparent realization by script or the described label of CSS.
In one embodiment, generating sightless label at server end covers described label to realize by script or CSS.
In one embodiment, generating sightless label at server end carries out at random.
In one embodiment, generating sightless label encryption at server end carries out.
In one embodiment, described judgment means can further be configured to: if client operates described sightless label, judge that described client is network robot.
For clarity, the sub-device that also not shown each device comprises in Fig. 2.But, should be appreciated that the device of recording in equipment 200 is corresponding with the step in the method 100 of describing with reference to figure 1 respectively.Thus, the operation of above describing for the method 100 of Fig. 1 and feature are equally applicable to equipment 200 and the device wherein comprising and sub-device, do not repeat them here.
Should be appreciated that equipment 200 can utilize variety of way to realize.For example, in some embodiments, equipment 200 can utilize software and/or firmware module to realize.In addition, equipment 200 also can utilize hardware module to realize.For example, equipment 200 can be implemented as integrated circuit (IC) chip or application-specific integrated circuit (ASIC) (ASIC).Equipment 200 also can be implemented as SOC (system on a chip) (SOC).In addition, equipment 200 also can utilize the combination of hardware module and software and/or firmware module to realize.Other modes known or exploitation in the future are also feasible now, and scope of the present invention is unrestricted in this regard.
As shown in Figure 3, computer system 300 can comprise: CPU (CPU) 301, RAM (random access memory) 302, ROM (read-only memory) 303, system bus 304, hard disk controller 305, keyboard controller 306, serial interface controller 307, parallel interface controller 308, display controller 309, hard disk 310, keyboard 311, serial external equipment 312, parallel external equipment 313 and display 314.In these equipment, there are CPU 301, RAM 302, ROM 303, hard disk controller 305, keyboard controller 306, serialization controller 307, parallel controller 308 and a display controller 309 with system bus 304 is coupled.Hard disk 310 is coupled with hard disk controller 305, keyboard 311 is coupled with keyboard controller 306, serial external equipment 312 is coupled with serial interface controller 307, and parallel external equipment 313 is coupled with parallel interface controller 308, and display 314 is coupled with display controller 309.Should be appreciated that the structured flowchart described in Fig. 3 illustrates just to the object of example, instead of limitation of the scope of the invention.In some cases, can increase or reduce as the case may be some equipment.
It should be noted that embodiments of the present invention can realize by the combination of hardware, software or software and hardware.Hardware components can utilize special logic to realize; Software section can be stored in memory, and by suitable instruction execution system, for example microprocessor or special designs hardware are carried out.Those having ordinary skill in the art will appreciate that above-mentioned equipment and method can and/or be included in processor control routine with computer executable instructions realizes, for example, at the mounting medium such as disk, CD or DVD-ROM, provide such code on such as the programmable memory of read-only memory (firmware) or the data medium such as optics or electronic signal carrier.Equipment of the present invention and module thereof can be by such as very lagre scale integrated circuit (VLSIC) or gate array, realize such as the semiconductor of logic chip, transistor etc. or such as the hardware circuit of the programmable hardware device of field programmable gate array, programmable logic device etc., also can use the software of being carried out by various types of processors to realize, also can be realized by for example firmware of the combination of above-mentioned hardware circuit and software.
Although it should be noted that some devices or the sub-device of having mentioned equipment in above-detailed, this division is only not enforceable.In fact, according to the embodiment of the present invention, the feature of above-described two or more devices and function can be specialized in a device.Otherwise, the feature of an above-described device and function can Further Division for to be specialized by multiple devices.
In addition, although described in the accompanying drawings the operation of the inventive method with particular order,, this not requires or hint must be carried out these operations according to this particular order, or the operation shown in must carrying out all could realize the result of expecting.On the contrary, the step of describing in flow chart can change execution sequence.Additionally or alternatively, can omit some step, multiple steps be merged into a step and carry out, and/or a step is decomposed into multiple steps carries out.
Although described the present invention with reference to some embodiments, should be appreciated that, the present invention is not limited to disclosed embodiment.The present invention is intended to contain interior included various amendments and the equivalent arrangements of spirit and scope of claims.The scope of claims meets the most wide in range explanation, thereby comprises all such amendments and equivalent structure and function.

Claims (12)

1. for a method for recognition network robot, it comprises the following steps:
Generate sightless label at server end; And
Whether be triggered to judge according to described sightless label whether client is network robot.
2. method according to claim 1, wherein generates sightless label at server end and is set to transparent realization by script or the described label of CSS.
3. method according to claim 1, wherein generates sightless label at server end and by script or CSS, described label is covered to realize.
4. according to the method described in any one in claims 1 to 3, wherein generate sightless label at server end and carry out at random.
5. according to the method described in any one in claims 1 to 3, wherein generate sightless label encryption at server end and carry out.
6. method according to claim 1, wherein whether be triggered to judge according to described sightless label whether client is that network robot comprises: if client operates described sightless label, judge that described client is network robot.
7. for an equipment for recognition network robot, it comprises:
Generating apparatus, is configured at server end and generates sightless label; And
Judgment means, is configured for according to described sightless label whether be triggered to judge whether client is network robot.
8. equipment according to claim 7, wherein generates sightless label at server end and is set to transparent realization by script or the described label of CSS.
9. equipment according to claim 7, wherein generates sightless label at server end and by script or CSS, described label is covered to realize.
10. according to the equipment described in any one in claim 7 to 9, wherein generate sightless label at server end and carry out at random.
11. according to the equipment described in any one in claim 7 to 9, wherein generates sightless label encryption at server end and carry out.
12. equipment according to claim 7, wherein whether be triggered to judge according to described sightless label whether client is that network robot comprises: if client operates described sightless label, judge that described client is network robot.
CN201210551708.7A 2012-11-29 2012-11-29 Method and device for identifying network robot Pending CN103856476A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210551708.7A CN103856476A (en) 2012-11-29 2012-11-29 Method and device for identifying network robot

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210551708.7A CN103856476A (en) 2012-11-29 2012-11-29 Method and device for identifying network robot

Publications (1)

Publication Number Publication Date
CN103856476A true CN103856476A (en) 2014-06-11

Family

ID=50863694

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210551708.7A Pending CN103856476A (en) 2012-11-29 2012-11-29 Method and device for identifying network robot

Country Status (1)

Country Link
CN (1) CN103856476A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104601601A (en) * 2015-02-25 2015-05-06 小米科技有限责任公司 Web crawler detecting method and device
CN105930727A (en) * 2016-04-25 2016-09-07 无锡中科富农物联科技有限公司 Web-based crawler identification algorithm
CN107437236A (en) * 2016-05-27 2017-12-05 北京京东尚科信息技术有限公司 A kind of picture blur processing method and processing device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101339666A (en) * 2008-08-12 2009-01-07 腾讯科技(深圳)有限公司 Picture identifying code creation method and device
WO2012073233A1 (en) * 2010-11-29 2012-06-07 Biocatch Ltd. Method and device for confirming computer end-user identity

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101339666A (en) * 2008-08-12 2009-01-07 腾讯科技(深圳)有限公司 Picture identifying code creation method and device
WO2012073233A1 (en) * 2010-11-29 2012-06-07 Biocatch Ltd. Method and device for confirming computer end-user identity

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
DAMIEN KATZ: "negative captcha", 《HTTP://DAMIENKATZ.NET/2007/01/NEGATIVE_CAPTCHA.HTML》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104601601A (en) * 2015-02-25 2015-05-06 小米科技有限责任公司 Web crawler detecting method and device
CN104601601B (en) * 2015-02-25 2018-09-04 小米科技有限责任公司 The detection method and device of web crawlers
CN105930727A (en) * 2016-04-25 2016-09-07 无锡中科富农物联科技有限公司 Web-based crawler identification algorithm
CN105930727B (en) * 2016-04-25 2018-11-09 无锡中科富农物联科技有限公司 Reptile recognition methods based on Web
CN107437236A (en) * 2016-05-27 2017-12-05 北京京东尚科信息技术有限公司 A kind of picture blur processing method and processing device
CN107437236B (en) * 2016-05-27 2021-08-13 北京京东尚科信息技术有限公司 Picture fuzzy processing method and device

Similar Documents

Publication Publication Date Title
US10956572B2 (en) Domain-specific hardwired web browser machine
CN105027121B (en) The five application page of the machine application is indexed
US9405910B2 (en) Automatic library detection
US20210203690A1 (en) Phishing detection using certificates associated with uniform resource locators
CN108449316B (en) Anti-crawler method, server and client
JP6746007B2 (en) Image generating method and image generating apparatus
US8949243B1 (en) Systems and methods for determining a rating for an item from user reviews
US11010687B2 (en) Detecting abusive language using character N-gram features
WO2014101783A1 (en) Method and server for performing cloud detection for malicious information
CN107957872A (en) A kind of full web site source code acquisition methods and illegal website detection method, system
CN104428765A (en) Reducing redirects
CN107908959A (en) Site information detection method, device, electronic equipment and storage medium
CN111612635B (en) Method and device for determining financial security risk of user based on relationship graph, and electronic equipment
CN109711160B (en) Application program detection method and device and neural network system
CN104980404B (en) Method and system for protecting account information security
CN111753302B (en) Method, device, computer readable medium and electronic equipment for detecting code loopholes
CN103064866A (en) Method and equipment for confirming attention degree of content in Internet
CN103198091B (en) The processing method of a kind of online data based on user behavior request and equipment
CN111198967A (en) User grouping method and device based on relational graph and electronic equipment
CN107766106A (en) The method and apparatus for generating configuration file
CN103634366A (en) Method and device for identifying network robot
CN103856476A (en) Method and device for identifying network robot
CN103336693B (en) The creation method of refer chain, device and security detection equipment
CN107133029A (en) Personalization component implementation method and device
CN103581321B (en) A kind of creation method of refer chains, device and safety detection method and client

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140611