[go: up one dir, main page]

CN103064987B - A kind of wash sale information identifying method - Google Patents

A kind of wash sale information identifying method Download PDF

Info

Publication number
CN103064987B
CN103064987B CN201310037691.8A CN201310037691A CN103064987B CN 103064987 B CN103064987 B CN 103064987B CN 201310037691 A CN201310037691 A CN 201310037691A CN 103064987 B CN103064987 B CN 103064987B
Authority
CN
China
Prior art keywords
information
user
data
feature
wash sale
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310037691.8A
Other languages
Chinese (zh)
Other versions
CN103064987A (en
Inventor
王永康
张爱华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing 58 Information Technology Co Ltd
Original Assignee
Beijing 58 Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing 58 Information Technology Co Ltd filed Critical Beijing 58 Information Technology Co Ltd
Priority to CN201310037691.8A priority Critical patent/CN103064987B/en
Publication of CN103064987A publication Critical patent/CN103064987A/en
Application granted granted Critical
Publication of CN103064987B publication Critical patent/CN103064987B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of wash sale information identifying method, including: step S101, obtain information characteristics, information content and/or pictorial information that user releases news;Step S201, information characteristics, information content and/or the pictorial information released news according to user, user is given out information and carries out wash sale information identification.The present invention can greatly reduce the false amount of Transaction Information, improves the verity of Transaction Information, increases Consumer's Experience, can greatly reduce human cost simultaneously.

Description

A kind of wash sale information identifying method
Technical field
The present invention relates to Internet technical field, particularly relate to a kind of wash sale information identifying method.
Background technology
Along with the development of the Internet, online information becomes increasingly to spread unchecked, more and more hard to tell whether it is true or false.For ecommerce Or the website of the type such as classification information, if it is possible to provide the user safety, real merchandise news, have become as one important And basic content, the most how to identify that user releases news true and false has had become as the key guaranteeing information security, this Also it is the problem that all suffers from of a lot of website.
Identifying in wash sale information, current method mainly by artificial examination & verification, more additional technological means, Such as determine the IP(Internet Protocol of blacklist, the agreement of interconnection between network) address, determine the information of issue in Perhaps form is illegal, price range is illegal etc. will determine the illegal information deletion of information completely.
The shortcoming of Existing policies is: manual examination and verification consume very much manpower, the technological means of auxiliary can only delete least a portion of void False Transaction Information, the most substantial amounts of wash sale information is escaped, and can delete 100% information being defined as falseness, but to having The information that 85% may be false is helpless, because all can not judge that information is false degree.
Summary of the invention
The technical problem to be solved in the present invention is to provide a kind of wash sale information identifying method and puts, in order to solve existing skill Art carries out the problem that in wash sale information identification, manpower consumption is big, wash sale information discrimination is low.
For solving above-mentioned technical problem, on the one hand, the present invention provides a kind of wash sale information identifying method, including:
Step S101, obtains information characteristics, information content and/or pictorial information that user releases news;
Step S201, information characteristics, information content and/or the pictorial information released news according to user, user is issued Message carries out wash sale information identification.
Further, before obtaining the information characteristics that user releases news, comprise the following steps:
Step S1011, the master data that before acquisition, user gives out information;
Step S1012, the master data given out information according to user before obtaining, extract training data, determine positive and negative Sample;
Step S1013, aligns the data in negative sample and carries out Feature Conversion, obtain the data of setting data form;
Step S1014, according to the data of setting data form, sets up regression model.
Further, step S1013 specifically includes:
The feature of the every data in positive negative sample is defined as numeric type or enumeration type two class;
The dimension values of numeric type is constant, and the number of this numeric type data is disposed in the position in numeric type data is in sample Value;
The dimension values of enumeration type first calculates its md5 value, then by md5 value to W delivery, obtains delivery result;In the sample The numerical value being in delivery result position is put 1.
Further, step S1014 specifically includes:
The data of setting data form step S1013 obtained are converted into sparse matrix;
Sparse matrix (the x of generation is inputted in model training program1,x2,x3,x4,x5..., xp), p is setting data The data volume of the data of form;Obtain each parameter (β corresponding to record012345..., βp);
Setting up regression model, regression model is:Wherein g (x)=β01x12x2+…+ βpxp
Further, after setting up regression model, when receiving user and releasing news, then step S101 particularly as follows:
Step S1015, obtains the master data that user gives out information;Including extract the basic feature that gives out information of user and Obtain unit's feature;Basic feature and first feature are together as the master data excavated.
Further, after obtaining the master data that gives out information of user, step S201 specifically includes following steps:
Step S2011, the master data giving out information acquisition user carries out Feature Conversion, obtains the accessible number of model According to form;
Step S2012, data step S2011 obtained are converted into the form of sparse matrix, are carried out by regression model Spoofing identification;Wherein, P > M, then Y=1, represent that user releases news as true sale information;Otherwise, P≤M, then Y=0, table Showing that user releases news as wash sale information, M is threshold value set in advance.
Further, before obtaining the information content that user releases news, comprise the following steps:
Step S1021, information content that before acquisition, user gives out information also is audited, and by examination & verification and will not pass through The information of examination & verification is divided into two classes, as the sample data of classification;
Step S1022, carries out participle to the information content in sample;
Step S1023, by calculating, extraction feature word;
Step S1024, calculates the eigenvalue of each Feature Words in every document of every apoplexy due to endogenous wind;
Step S1025, according to the eigenvalue of each word obtained in every document of every apoplexy due to endogenous wind, is identified mould by training Type.
Further, step S1023 specifically includes:
CHI value is sought in each word;Evolution inspection formula is:Wherein, A: classify at this Under comprise the number of documents of this word;B: do not comprise the number of documents of this word under this classification;C: do not wrap under this is classified Number of documents containing this word;D: not under this classification, and do not comprise the number of documents of this word;N: represent article sum;T: Represent the word currently seeking CHI value;The classification of c: presentation class;x2: represent open inspection CHI value;
Then P maximum value of CHI value in all words is taken as Feature Words;
Step S1024 specifically includes:
The deformation algorithm using TFIDF algorithm or TFIDF calculates eigenvalue, and wherein the way of TFIDF is to calculate every apoplexy due to endogenous wind The number of times of each Feature Words in every document, and comprise the number of files of this word, by the value of TFIDF as eigenvalue;Its In, every document is converted into: category IDs t feature sequence number the form of t eigenvalue;TFIDF formula is: TFIDF=TF × IDF, Wherein, TF is the frequency that certain Feature Words occurs in this document, and IDF is anti-document frequency, and i.e. total document tree is divided by comprising The number of files of this word.
Further, after obtaining the information content that gives out information of user, step S201 specifically includes following steps:
Step S2021, the information content giving out information user carries out participle;
Step S2022, by calculating, extraction feature word;
Step S2023, the eigenvalue of each word in the information content that calculating user gives out information;
Step S2024, according to the identification model obtained, the information content giving out information user carries out wash sale information Identify.
Further, the pictorial information released news according to user, user is given out information and carries out wash sale information identification, Specifically include following steps:
Step S2031, query history picture library, it is judged that whether photo current occurs in picture library, if there is, then enter One step judges that content of posting is the most identical, and position is the most identical, if all different, then judges that the user comprising this picture sends out Cloth information is wash sale information;Otherwise, then judging that the user comprising this picture releases news is true sale information;
Or, it is judged that whether there is watermark on picture, if it has, the watermark then determined whether on picture is the most legal, as The most illegal, then judging that the user comprising this picture releases news is wash sale information;Otherwise, then judge to comprise this picture It is true sale information that user releases news.
The present invention has the beneficial effect that:
The present invention can greatly reduce the false amount of Transaction Information, improves the verity of Transaction Information, increases user's body Test, human cost can be greatly reduced simultaneously.
Accompanying drawing explanation
Fig. 1 is the flow chart of a kind of wash sale information identifying method in the embodiment of the present invention.
Detailed description of the invention
Below in conjunction with accompanying drawing and embodiment, the present invention is further elaborated.Should be appreciated that described herein Specific embodiment only in order to explain the present invention, do not limit the present invention.
As it is shown in figure 1, the present embodiments relate to a kind of wash sale information identifying method, including:
Step S101, obtains information characteristics, information content and/or pictorial information that user releases news;
Step S201, information characteristics, information content and/or the pictorial information released news according to user, user is issued Message carries out wash sale information identification.
In step S101, being specifically related to three kinds of situations, the first is that the information characteristics released news for user carries out void False Transaction Information identification, namely carries out wash sale information identification based on user characteristics and behavior;The second is for user The information content released news carries out wash sale information identification, namely carries out wash sale information based on model content of text Identify;The third is the wash sale information identification carried out for pictorial information.
First, describe the information characteristics released news based on user and carry out wash sale information identification, send out obtaining user Before the information characteristics of cloth information, comprise the following steps:
Step S1011, the master data that before acquisition, user gives out information.In this step, by splicing data, analyze and use Posting daily record in family, extracts the basic feature that user gives out information;Wherein, basic feature refers to directly to send out from user before Cloth message extracts the data of acquisition, such as, the identity (USER ID) of user, the IP that posts, cookieid, telephone number, Temporal information (including week, month, date), duration of posting, pageview, refreshing amount, city of posting, the features such as classification of posting. Then, according to the basic feature of user, obtain unit's feature;Wherein, unit's feature refers on the basis of the basic feature of user, logical Cross statistics or calculated data;Such as with IP post number, city number of posting with IP, with user post number, post with user The unit such as city number, post with cookie number, city number of posting with cookie feature.Basic feature and first feature are together as excavation Master data.Such as, produce such record R1(123123,192.168.11.11, a DFOKIEBNGIDH1232, 18311067654 ...).
Step S1012, the master data given out information according to user before obtaining, extract training data.In this step, Based on the result of step S1011, verified by manual examination and verification and be defined as true or false data, as positive and negative sample This, truthful data is positive sample, and false data is negative sample;Such as, R1 is labeled as positive sample or negative sample.Manual examination and verification Process, can know that according to some information artificially judges altogether, it is also possible to carries out checking by means such as phones and confirms.
Step S1013, aligns the data in negative sample and carries out Feature Conversion, obtain the data of setting data form.This step In Zhou, the feature of the every data in positive negative sample being defined as numeric type or enumeration type two class, wherein, numeric type refers to data Inherently numerical value;Enumeration type refers to that data itself are not numerical value, and enumeration type carries out mapping according to original dimension and value and obtains. The data that such as USER ID, the IP that posts etc. are enumeration type, duration of posting, the data that city number is numeric type of posting with user.Number The dimension values of value type is constant;Such as, certain characteristic is 20, and position in the sample is at the 10th, then on the 10th position Put 20.The dimension values of enumeration type the most first calculates its md5(Message Digest Algorithm MD5, Message Digest 5 Five editions) value, then by md5 value to W(such as W=300000) delivery, it may be assumed that by md5 value divided by 300000, obtain remainder;So The value of enumeration type will fall between 1-300000.Such as there are two features: (telephone number, number of posting with phone), corresponding Value is (18211078765,100), posts what number was numeric type with phone, and telephone number is enumeration type, so sending out with phone The position of note number invariant position in the sample, after telephone number calculates md5 value, to 300000 deliverys, such as, obtains 180834, Now this record produce vector be (0,100,0 ..., 1), wherein, put 1 in the sample on the position of the 180834th, table Showing that there is numerical value this position, numerical value is 1.
Step S1014, according to the data of setting data form, sets up regression model.Requirement to regression model is to return The codomain of result between [0,1], or in the range of can being mapped to this by calculating, below as a example by logistic regression. Obtain in step S1013 is the vector of a rule, such as (0,0,0,0,0,0,0,0,0,12,32,43 ... 1,0, 0......1,0,0......), because these vectors there may be 300000 dimensions, represent that data volume can quite expend internal memory, institute Be converted into the form of sparse matrix with the vector by a rule, such as, if upper one is Article 1, then abscissa is 1, accordingly The form of sparse matrix be: 110(is equivalent to vertical coordinate) 12,11132,11243 etc..After each the most so converts, In model training program program, input is the sparse matrix produced above, and output is the parameter that each record is corresponding.Can be simple Be interpreted as if a record is (x1,x2,x3,x4,x5..., xp), p is the data volume of the data of setting data form;Logical Cross model training program to solve, produce (β012345..., βp) etc. corresponding parameter.Now set up recurrence Model, regression model is represented by:Wherein g (x)=β01x12x2+…+βpxp
After setting up regression model, when receiving user again and releasing news, then step S101 particularly as follows:
Step S1015, obtains the master data that user gives out information;Including extract the basic feature that gives out information of user and Obtain unit's feature;Basic feature and first feature are together as the master data excavated.Particular content is identical with step S1011, this Step is not described in detail.
After obtaining the master data that gives out information of user, step S201 specifically includes following steps:
Step S2011, the master data giving out information acquisition user carries out Feature Conversion, obtains setting data form Data.This step is identical with step S1013 method, no longer describes in detail.
Step S2012, the data of setting data form step S2011 obtained are converted into the form of sparse matrix, logical Cross regression model and carry out spoofing identification.In this step, after obtaining sparse matrix, give out information correspondence according to the user obtained (x1,x2,x3,x4,x5..., xp), it is possible to obtain g (x), thus can in the hope of the result of P (Y=1 | x), i.e. Y=1's Probability;Wherein, P > M, then Y=1, represent that user releases news as true sale information;Otherwise, P≤M, then Y=0, represent that user sends out Cloth information is wash sale information;M is threshold value set in advance.
Secondly, describe the information content released news based on user and carry out wash sale information identification, send out obtaining user Before the information content of cloth information, comprise the following steps:
Step S1021, the information content that before acquisition, user gives out information, and foregoing (is manually examined by examination & verification Core or automatically examination & verification), using by examination & verification with not by the Transaction Information model of examination & verification as two classes, as the sample number classified According to;The algorithm that can pass through expert's manual tag and part accuracy rate high (higher than arranging threshold value) automatically extracts positive negative sample instruction Practice collection;
Step S1022, carries out participle to the information content in sample, can be by the method optimizing participle of Custom Dictionaries Effect.Concrete segmenting method can use existing segmenting method, such as ICT segmenting method or other segmenting method.
Step S1023, extraction feature word.In this step, filter out and step S1022 participle stops word, rare words, common Word, then with CHI(evolution check) etc. method choose the Feature Words big with class degree of association.Concrete choosing method is: to each word All seek CHI value, then take 1000 maximum values of CHI value in all words as Feature Words.Evolution inspection formula is:Wherein, A: comprise the number of documents of this word under this is classified;B: do not comprise under this classification The number of documents of this word;C: do not comprise the number of documents of this word under this is classified;D: not under this classification, and do not comprise The number of documents of this word;N: represent article sum;T: represent the word currently seeking CHI value;The classification of c: presentation class;x2: table Show open inspection CHI value.
Step S1024, carries out vectorization, obtains the eigenvalue of each Feature Words in every document of every apoplexy due to endogenous wind.This step Use TFIDF algorithm, calculate the number of times of each Feature Words in every document of every apoplexy due to endogenous wind, and comprise the number of files of this word, By the value of TFIDF as eigenvalue.Every document is converted into: category IDs t feature sequence number the form of t eigenvalue.TFIDF is public Formula is: TFIDF=TF × IDF, and wherein, TF is the frequency that certain Feature Words occurs in this document, and IDF is anti-document frequency, The most total document tree is divided by the number of files comprising this word.
Step S1025, according to the eigenvalue of each word obtained in every document of every apoplexy due to endogenous wind, is identified mould by training Type.In this step, use SVM(support vector machine support vector machine), decision tree, the mode such as Bayes's classification Features described above value is trained, every document has been converted into the form of vector by step S1024, use classification (Waikato Environment for Knowledge Analysis, Waikato intellectual analysis environment) these vectors are trained by program, Different sorting techniques, such as SVM, decision tree, Bayes's classification etc. can be selected, produce one and identify model.SVM, decision-making Tree, Bayes's classification are the training method of existing maturation, and this step is not described in detail.
After being identified model, when receiving user again and releasing news, then step S101 particularly as follows:
Step S1026, obtains the information content that user gives out information, such as, as a example by user posts, then obtain model Particular content.
After obtaining the information content that gives out information of user, step S201 specifically includes following steps:
Step S2021, the information content giving out information user carries out participle.
Step S2022, extraction feature word.This step is identical with step S1023 method, therefore, is not described in detail.
Step S2023, carries out vectorization, the eigenvalue of each word in the information content that acquisition user gives out information.This Step is identical with step S1024 method, therefore, is not described in detail.
Step S2024, according to the identification model obtained, the information content giving out information user carries out wash sale information Identify.In this step, the identification model obtained by modes such as SVM, decision tree, Bayes's classifications is existing maturity model, its Recognition methods is also existing mature technology, and therefore this step is not described in detail.
Finally, describe and carry out wash sale information identification based on pictorial information, obtaining the picture letter that user releases news After breath, the pictorial information released news according to user, user is given out information and carries out wash sale information identification (step S201) comprise the following steps:
Step S2031, query graph valut, it is judged that whether photo current occurs in picture library, if there is, the most further Judge that content of posting is the most identical, and position is the most identical, if all different, then judge that the user comprising this picture issues letter Breath is wash sale information;Otherwise, then judging that the user comprising this picture releases news is true sale information;Or, it is judged that Whether there is watermark on picture, if it has, the watermark then determined whether on picture is the most legal, if illegal, then judge bag It is wash sale information that user containing this picture releases news;Otherwise, then it is true for judging that the user comprising this picture releases news Real Transaction Information.
It addition, above-mentioned three kinds of strategies can also be combined, combine and judge, such as, two kinds of situation combinations, or Three kinds of situation combinations;It is wash sale letter when above-mentioned three kinds of situations having any one or two kinds of situations judge that user releases news Breath, then judging that user releases news is wash sale information.
As can be seen from the above-described embodiment, the present invention can greatly reduce the false amount of Transaction Information, improves transaction letter The verity of breath, increases Consumer's Experience, can greatly reduce human cost simultaneously.
Although being example purpose, having been disclosed for the preferred embodiments of the present invention, those skilled in the art will be recognized by Various improvement, to increase and replace also be possible, and therefore, the scope of the present invention should be not limited to above-described embodiment.

Claims (8)

1. a wash sale information identifying method, it is characterised in that including:
Step S101, obtains the information characteristics that releases news of user or obtains described information characteristics, information content and picture Information;
Step S201, the information characteristics released news according to user or described information characteristics, information content and pictorial information, User is given out information and carries out wash sale information identification;
Obtain the information characteristics that releases news of user or obtain described information characteristics, information content and pictorial information it Before, comprise the following steps:
Step S1011, the master data that before acquisition, user gives out information;
Step S1012, the master data given out information according to user before obtaining, extract training data, determine positive negative sample;
Step S1013, aligns the data in negative sample and carries out Feature Conversion, obtain the data of setting data form;
Step S1014, according to the data of setting data form, sets up regression model;
Step S1013 specifically includes:
The feature of the every data in positive negative sample is defined as numeric type or enumeration type two class;
The dimension values of numeric type is constant, and the numerical value of this numeric type data is disposed in the position in numeric type data is in sample;
The dimension values of enumeration type the most first calculates its md5 value, then by md5 value to W delivery, obtains delivery result;In the sample will The numerical value being in delivery result position puts 1.
2. wash sale information identifying method as claimed in claim 1, it is characterised in that step S1014 specifically includes:
Data step S1013 obtained are converted into sparse matrix;
Sparse matrix (the x of generation is inputted in model training program1,x2,x3,x4,x5..., xp), p is setting data form The data volume of data;Obtain each parameter (β corresponding to record012345..., βp);
Setting up regression model, regression model is:Wherein g (x)=β01x12x2+... +βpxp
P (Y=1 | x) is the probability of Y=1, and Y=1 represents that user releases news as true sale information.
3. wash sale information identifying method as claimed in claim 2, it is characterised in that after setting up regression model, when Receive user when releasing news, then step S101 obtains information characteristics that user releases news particularly as follows:
Step S1015, obtains the master data that user gives out information;Including extracting basic feature and the acquisition that user gives out information Unit's feature;Basic feature and first feature are together as the master data excavated.
4. wash sale information identifying method as claimed in claim 3, it is characterised in that obtaining the base that user gives out information After notebook data, step S201 specifically includes following steps:
Step S2011, the master data giving out information acquisition user carries out Feature Conversion, obtains the number of setting data form According to;
Step S2012, the data of setting data form step S2011 obtained are converted into the form of sparse matrix, by returning Model is returned to carry out spoofing identification;Wherein, P (Y=1 | x) > M, then Y=1, represent user to release news as true sale and believe Breath;Otherwise, P (Y=1 | x)≤M, then Y=0, represent that user releases news as wash sale information;M is threshold value set in advance.
5. the wash sale information identifying method as described in claim 1 or 4, it is characterised in that release news obtaining user Information content before, comprise the following steps:
Step S1021, information content that before acquisition, user gives out information also audits, will by examination & verification with not by auditing Information be divided into two classes, as classification sample data;
Step S1022, carries out participle to the information content in sample;
Step S1023, by calculating, extraction feature word;
Step S1024, calculates the eigenvalue of each Feature Words in every document of every apoplexy due to endogenous wind;
Step S1025, according to the eigenvalue of each Feature Words obtained in every document of every apoplexy due to endogenous wind, is identified mould by training Type.
6. wash sale information identifying method as claimed in claim 5, it is characterised in that step S1023 specifically includes:
Each word is asked evolution check CHI value;Evolution inspection formula is:
Wherein, A: comprise the number of documents of this word under this is classified;B: do not comprise the number of files of this word under this classification Amount;C: do not comprise the number of documents of this word under this is classified;D: not under this classification, and do not comprise the number of files of this word Amount;N: represent article sum;T: represent the word currently seeking CHI value;The classification of c: presentation class;x2: represent evolution inspection CHI Value;
Then the P that in all words, CHI value is maximum is taken individual as Feature Words;
Step S1024 specifically includes:
Use TFIDF algorithm, calculate the number of times of each Feature Words in every document of every apoplexy due to endogenous wind, and comprise the document of this word Number, by the value of TFIDF as eigenvalue;Wherein, every document is converted into: category IDs t feature sequence number the lattice of t eigenvalue Formula;TFIDF formula is: TFIDF=TF × IDF, and wherein, TF is the frequency that certain Feature Words occurs in this document, and IDF is Anti-document frequency, i.e. total number of files is divided by the number of files comprising this word.
7. wash sale information identifying method as claimed in claim 6, it is characterised in that obtaining the letter that user gives out information After breath content, step S201 specifically includes following steps:
Step S2021, the information content giving out information user carries out participle;
Step S2022, by calculating, extraction feature word;
Step S2023, the eigenvalue of each Feature Words in the information content that calculating user gives out information;
Step S2024, according to the identification model obtained, the information content giving out information user carries out wash sale information knowledge Not.
8. the wash sale information identifying method as described in claim 1,4 or 7, it is characterised in that release news according to user Pictorial information, user is given out information and carries out wash sale information identification, specifically include following steps:
Step S2031, query graph valut, it is judged that whether photo current occurs in picture library, if there is, then determine whether Content of posting is the most identical, and position is the most identical, if all different, then judging that the user comprising this picture releases news is Wash sale information;Otherwise, then judging that the user comprising this picture releases news is true sale information;
Or, it is judged that whether there is watermark on picture, if it has, the watermark then determined whether on picture is the most legal, if not Legal, then judging that the user comprising this picture releases news is wash sale information;Otherwise, then the user comprising this picture is judged Releasing news is true sale information.
CN201310037691.8A 2013-01-31 2013-01-31 A kind of wash sale information identifying method Active CN103064987B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310037691.8A CN103064987B (en) 2013-01-31 2013-01-31 A kind of wash sale information identifying method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310037691.8A CN103064987B (en) 2013-01-31 2013-01-31 A kind of wash sale information identifying method

Publications (2)

Publication Number Publication Date
CN103064987A CN103064987A (en) 2013-04-24
CN103064987B true CN103064987B (en) 2016-09-21

Family

ID=48107617

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310037691.8A Active CN103064987B (en) 2013-01-31 2013-01-31 A kind of wash sale information identifying method

Country Status (1)

Country Link
CN (1) CN103064987B (en)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103279868B (en) * 2013-05-22 2016-08-17 兰亭集势有限公司 A method and device for automatically identifying fraudulent orders
CN103530400A (en) * 2013-10-23 2014-01-22 合山市科学技术情报研究所 Information recognition device
CN104731803A (en) * 2013-12-21 2015-06-24 长沙微观信息科技有限公司 Technology for preventing false network information
CN104616183B (en) * 2015-01-27 2017-11-07 杨芳 Network trading identification system, client and discrimination method
CN104657503A (en) * 2015-03-13 2015-05-27 浪潮集团有限公司 Method for preprocessing abnormal values of e-business sales amounts based on statistical discrimination process
CN106157045B (en) * 2015-03-26 2021-07-23 创新先进技术有限公司 Method, device and server for identifying false transactions based on logistics data
CN106504137A (en) * 2015-09-03 2017-03-15 中山市八喜电脑网络有限公司 Network False Housing Information Reporting System
CN106611321B (en) * 2015-10-22 2020-09-25 百度在线网络技术(北京)有限公司 False mobile phone number identification method and device
CN107103171B (en) * 2016-02-19 2020-09-25 阿里巴巴集团控股有限公司 Modeling method and device of machine learning model
CN107133833A (en) * 2016-02-26 2017-09-05 阿里巴巴集团控股有限公司 abnormal transaction identification method and device
CN107545505B (en) * 2016-06-24 2020-09-29 深圳壹账通智能科技有限公司 Method and system for identifying insurance financing product information
CN106920109A (en) * 2017-02-21 2017-07-04 福建师范大学福清分校 Recognition methods, system and e-commerce system for ecommerce wash sale
CN106952190A (en) * 2017-03-22 2017-07-14 国信优易数据有限公司 False source of houses typing Activity recognition and early warning system
CN108734327A (en) * 2017-04-20 2018-11-02 腾讯科技(深圳)有限公司 A kind of data processing method, device and server
CN108053214B (en) * 2017-12-12 2021-11-23 创新先进技术有限公司 False transaction identification method and device
CN108600113B (en) * 2018-04-12 2022-05-31 北京五八信息技术有限公司 Preliminary auditing method and device for data to be issued and storage medium
CN109284614A (en) * 2018-08-10 2019-01-29 五八有限公司 Information Authentication method, apparatus, computer equipment and computer readable storage medium
CN109685527B (en) * 2018-12-14 2024-03-29 拉扎斯网络科技(上海)有限公司 Method, device, system and computer storage medium for detecting merchant false transaction
CN111914645A (en) * 2020-06-30 2020-11-10 五八有限公司 Method and device for identifying false information, electronic equipment and storage medium
CN116720864A (en) * 2023-06-26 2023-09-08 北京智思迪科技有限公司 Online transaction system and method with false transaction monitoring function

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102339445A (en) * 2010-07-23 2012-02-01 阿里巴巴集团控股有限公司 Method and system for evaluating credibility of network trade user

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7337962B2 (en) * 2004-12-30 2008-03-04 International Business Machines Corporation Method to detect false purchases with a consumer service device

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102339445A (en) * 2010-07-23 2012-02-01 阿里巴巴集团控股有限公司 Method and system for evaluating credibility of network trade user

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
基于内容检索的垃圾邮件过滤器研究与实现;王园;《中国优秀硕士学位论文全文数据库信息科技辑》;20120115(第01期);第4~21页 *
基于语义的广告图像垃圾邮件过滤技术研究;郭学敏;《中国优秀硕士学位论文全文数据库信息科技辑》;20100815(第08期);第9页 *
蒋杰.基于分类技术的电子支付平台作弊账户的识别模型研究.《中国优秀硕士学位论文全文数据库信息科技辑》.2012,(第07期),第14~22页. *

Also Published As

Publication number Publication date
CN103064987A (en) 2013-04-24

Similar Documents

Publication Publication Date Title
CN103064987B (en) A kind of wash sale information identifying method
CN103150374B (en) Method and system for identifying abnormal microblog users
CN107291780B (en) User comment information display method and device
CN106471490B (en) Classification-based group communication
CN102541937B (en) Webpage information detection method and system
US20160048587A1 (en) System and method for real-time dynamic measurement of best-estimate quality levels while reviewing classified or enriched data
CN106709370B (en) A kind of long word recognition method and system based on content of text
CN104966031A (en) Method for identifying permission-irrelevant private data in Android application program
CN105956031A (en) Text classification method and apparatus
CN111553318A (en) Sensitive information extraction method, referee document processing method and device and electronic equipment
CN107517463A (en) Method and device for identifying a telephone number
CN106445963A (en) Advertisement index keyword automatic generation method and apparatus for APP platform
CN110362601A (en) Mapping method, device, equipment and the storage medium of metadata standard
CN110929110B (en) Electronic document detection method, device, equipment and storage medium
CN110609908A (en) Case serial-parallel method and device
CN102880631A (en) Chinese author identification method based on double-layer classification model, and device for realizing Chinese author identification method
CN110619212A (en) Character string-based malicious software identification method, system and related device
CN111782917B (en) Method and device for visual analysis of financial punishment data
JP5622969B1 (en) Document analysis system, document analysis method, and document analysis program
CN105159879A (en) Automatic determination method for network individual or group values
CN111178718B (en) Fair competition review method, server, system and storage medium
WO2024087754A1 (en) Multi-dimensional comprehensive text identification method
CN113034057B (en) A risk identification method, device and equipment
CN108268560A (en) A kind of file classification method and device
CN114638299B (en) Intelligent identification method, device and equipment for zombie license and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant