CN108288161A - The method and system of prediction result are provided based on machine learning - Google Patents
The method and system of prediction result are provided based on machine learning Download PDFInfo
- Publication number
- CN108288161A CN108288161A CN201710016236.8A CN201710016236A CN108288161A CN 108288161 A CN108288161 A CN 108288161A CN 201710016236 A CN201710016236 A CN 201710016236A CN 108288161 A CN108288161 A CN 108288161A
- Authority
- CN
- China
- Prior art keywords
- sample
- history
- predicted
- result
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q20/00—Payment architectures, schemes or protocols
- G06Q20/38—Payment protocols; Details thereof
- G06Q20/40—Authorisation, e.g. identification of payer or payee, verification of customer or shop credentials; Review and approval of payers, e.g. check credit lines or negative lists
- G06Q20/401—Transaction verification
- G06Q20/4016—Transaction verification involving fraud or risk level assessment in transaction processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Strategic Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Accounting & Taxation (AREA)
- General Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Software Systems (AREA)
- Development Economics (AREA)
- Marketing (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Medical Informatics (AREA)
- Game Theory and Decision Science (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Entrepreneurship & Innovation (AREA)
- Mathematical Physics (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computer Security & Cryptography (AREA)
- Artificial Intelligence (AREA)
- Finance (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
There is provided it is a kind of providing the method and system of prediction result based on machine learning, the method includes:(A) attribute information of sample to be predicted and the attribute information of the history sample occurred before sample to be predicted are obtained;(B) result information of the history sample about forecasting problem is obtained, wherein for the history sample for not having the legitimate reading about forecasting problem among the history sample, using the prediction result of history sample as the result information of history sample;(C) result information of the attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample and the history sample generates the forecast sample of sample to be predicted;(D) using the prediction model trained based on machine learning techniques, the prediction result of sample to be predicted is provided for the forecast sample of sample to be predicted.According to described method and system, it can suitably be directed to sample to be predicted with reference to both history sample and sample to be predicted and provide prediction result.
Description
Technical field
All things considered of the present invention is related to artificial intelligence field, is provided based on machine learning more specifically to one kind
Method and system of the sample to be predicted about the prediction result of forecasting problem.
Background technology
In practice, in order to provide prediction result of the sample to be predicted about forecasting problem based on machine learning techniques,
Other than needing the attribute information of sample itself to be predicted, usually also need to obtain relevant historical information, that is, to be predicted
The attribute information of the history sample occurred before sample.For example, if it is desired to predicting that user's is current using machine learning model
Whether transaction is fraudulent trading, it is also necessary to the case where obtaining the historical trading of user, and according to current transaction and historical trading
It compares to provide prediction result.
However, the attribute information of history sample is in use, it is easy to appear various problems in many cases,.For example,
In the example of above-mentioned fraudulent trading, in reality it occur frequently that the phenomenon that continuous fraudulent trading (by taking credit card steals brush as an example, if
The first stroke is stolen brush and is not found, then a lot of robber's brushes can occur in succession), at this point, being mixed into abnormal sample in history sample, cause
Relativity between history sample and current sample can not be effectively reflected the comparison between normal sample and abnormal sample
Relationship so that model can not effectively work.For example, it is assumed that machine learning model is for predicting credit card fraud transaction, model
Positive sample to correspond to current sample be abnormal sample, and it is normal sample that the negative sample of model, which corresponds to current sample,.Into one
Step ground, it is assumed that the place of swiping the card in the swipe the card place and current transaction of a upper transaction involved in the sample characteristics of model, then
For as positive sample twice in succession steal brush among second steal brush for, model be difficult by the sample learning to how
Accurate Prediction steals brush.Particularly, if certain credit card user people is in China, and his credit card has brushed two in the U.S. by continuous robber
It is secondary, then stealing brush for second and stealing brush for the first time equally all as the positive sample of model.However, corresponding for stealing brush with second
Positive sample for, upper brush card place is the U.S., and the place of current brush card is also in the U.S., in this sample
Under, model can be easy to tend to think above-mentioned second in sample that the U.S. swipes the card (that is, the in the U.S. swipes the card twice in succession
It is secondary to swipe the card) it is easy to happen fraud, and this is not inconsistent with common sense, it can be seen that, the model trained in this manner is difficult
Effectively to predict fraudulent trading.
In view of the above-mentioned problems, can consider that qualified history sample is used only, for example, above-mentioned about fraudulent trading
In example, historical fraudulent trading is weeded out using only historical arm's length dealing.However, executing prediction at that time
(for example, in scene of online Prediction) leads to not sieve from Recent Activity due to the stateful transaction still non-availability of Recent Activity
Select arm's length dealing;On the other hand, Recent Activity be the key that again can help to judge currently to merchandise whether be fraudulent trading because
Therefore element, the effect that model can be seriously affected if directly neglecting the unknown all Recent Activities of stateful transaction are used only
The scheme of qualified history sample feasibility in reality is poor, it is difficult to obtain effective prediction result.
In conclusion when in face of the particular problem of machine learning, need to be improved from modelling angle, to have
The computing resource (for example, the limitation of hardware resource in terms of the capacity of processing data and speed) and/or data resource (example of limit
Such as, lack enough training samples for training machine learning model) under effectively solve the problems, such as that history sample state is unknown,
And then ensure the prediction effect of machine learning model.
Invention content
Exemplary embodiment of the present invention is intended to overcome the existing prediction scheme based on machine learning model to be difficult to effectively
The defect learnt from history sample.
Exemplary embodiment according to the present invention provides one kind based on machine learning to provide sample to be predicted about prediction
The method of the prediction result of problem, including:(A) it obtains the attribute information of sample to be predicted and occurs before sample to be predicted
History sample attribute information;(B) result information of the history sample about forecasting problem is obtained, wherein for described
The history sample for not having the legitimate reading about forecasting problem among history sample, using the prediction result of history sample as going through
The result information of history sample;(C) attribute information of the sample to be predicted based on acquisition, the history sample attribute information and
The result information of the history sample generates the forecast sample of sample to be predicted;And (D) is utilized and is based on machine learning techniques
The prediction model trained provides the prediction result of sample to be predicted for the forecast sample of sample to be predicted.
Optionally, in the method, in step (B), the confidence level of the result information of history sample is also obtained, and
And in step (C), the attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample, the history
The confidence level of the result information of the result information of sample and the history sample generates the forecast sample of sample to be predicted.
Optionally, in the method, in step (B), the prediction result of the history sample is by the prediction model
Or previous model corresponding with the last round of iteration of the prediction model provides.
Optionally, in the method, in step (B), for the history sample with legitimate reading, by history sample
Result information of the legitimate reading as history sample.
Optionally, in the method, there is the prediction model following training process to be pressed in the training process
According to the mode consistent with forecast sample, the legitimate reading or prediction result for the history sample that training sample is based on are as described in
The result information of history sample, wherein the prediction result of the history sample that the training sample is based on is by currently training
Prediction model provides.
Optionally, in the method, in the training process, the prediction model is iterated for training sample
Training so that the prediction result for the history sample that training sample is based on is constantly updated with iteration.
Optionally, the method provides prediction result of the sample to be predicted about forecasting problem online.
Optionally, in the method, the pre- test sample of sample to be predicted is generated at least one of in the following manner
This feature:(C1) it is sieved according to the confidence level of the result information of the history sample and the result information of the history sample
It selects at least part history sample, and the attribute information based at least part history sample filtered out and waits for pre-
The attribute information of test sample example generates the feature of the forecast sample of sample to be predicted;(C2) believed according to the result of the history sample
The confidence level of the result information of breath and the history sample is weighted the respective attributes information of the history sample, and base
The attribute information of history sample after weighting and the attribute information of sample to be predicted generate the pre- test sample of sample to be predicted
This feature;And (C3) is based respectively on the attribute information of sample to be predicted, the attribute information of the history sample, the history
The confidence level of the result information of the result information of sample and the history sample generates the forecast sample of sample to be predicted
Feature.
Optionally, in the method, sample to be predicted corresponds to current transaction, and history sample corresponds to merchandises currently
The previous transaction of the predetermined quantity occurred before and/or the previous transaction occurred in the predetermined amount of time before current transaction,
Forecasting problem is whether relationship trading is fraudulent trading.
Optionally, in the method, in step (B), for the history sample without legitimate reading, based on prediction
The algorithm of model obtains the confidence level of the prediction result of history sample independently of the algorithm of prediction model, as history sample
Result information confidence level;For the history sample with legitimate reading, the confidence level of the legitimate reading of history sample is set
It is set to the preset value for indicating high confidence level, the confidence level of the result information as history sample.
Optionally, the method further includes:(E) legitimate reading of the sample to be predicted about forecasting problem is received, wherein institute
Legitimate reading is stated to be used to train the prediction model together with corresponding sample to be predicted.
Another exemplary according to the present invention is implemented, and provides one kind based on machine learning to provide sample to be predicted about pre-
The system of the prediction result of survey problem, including:Attribute information acquisition device, attribute information for obtaining sample to be predicted and
The attribute information of the history sample occurred before sample to be predicted;Result information acquisition device, for obtaining the history sample
Result information of the example about forecasting problem, wherein for the true knot not having among the history sample about forecasting problem
The history sample of fruit, result information acquisition device is using the prediction result of history sample as the result information of history sample;Sample
Generating means for the attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample and described are gone through
The result information of history sample generates the forecast sample of sample to be predicted;And prediction result provides device, is based on for utilizing
The prediction model that machine learning techniques train provides the prediction knot of sample to be predicted for the forecast sample of sample to be predicted
Fruit.
Optionally, in the system, result information acquisition device also obtains the confidence level of the result information of history sample,
Also, the attribute information of to be predicted sample of the sample generating means based on acquisition, described is gone through the attribute information of the history sample
The confidence level of the result information of the result information of history sample and the history sample generates the forecast sample of sample to be predicted.
Optionally, in the system, the prediction result of the history sample obtained by result information acquisition device by
The prediction model or previous model corresponding with the last round of iteration of the prediction model provide.
Optionally, in the system, for the history sample with legitimate reading, result information acquisition device is by history
Result information of the legitimate reading of sample as history sample.
Optionally, in the system, there is the prediction model following training process to be pressed in the training process
According to the mode consistent with forecast sample, the legitimate reading or prediction result for the history sample that training sample is based on are as described in
The result information of history sample, wherein the prediction result of the history sample that the training sample is based on is by currently training
Prediction model provides.
Optionally, in the system, in the training process, the prediction model is iterated for training sample
Training so that the prediction result for the history sample that training sample is based on is constantly updated with iteration.
Optionally, in the system, the system provides prediction result of the sample to be predicted about forecasting problem online.
Optionally, in the system, sample generating means are at least one of in the following manner to be predicted to generate
The feature of the forecast sample of sample:According to setting for the result information of the result information of the history sample and the history sample
Reliability filters out at least part history sample, and based on the attribute information of at least part history sample filtered out
And the attribute information of sample to be predicted generates the feature of the forecast sample of sample to be predicted;According to the knot of the history sample
The confidence level of the result information of fruit information and the history sample is weighted the respective attributes information of the history sample,
And the pre- of sample to be predicted is generated based on the attribute information of the attribute information of the history sample after weighting and sample to be predicted
The feature of test sample sheet;And it is based respectively on the attribute information of sample to be predicted, the attribute information of the history sample, the history
The confidence level of the result information of the result information of sample and the history sample generates the forecast sample of sample to be predicted
Feature.
Optionally, in the system, sample to be predicted corresponds to current transaction, and history sample corresponds to merchandises currently
The previous transaction of the predetermined quantity occurred before and/or the previous transaction occurred in the predetermined amount of time before current transaction,
Forecasting problem is whether relationship trading is fraudulent trading.
Optionally, in the system, for the history sample without legitimate reading, result information acquisition device is based on
The algorithm of prediction model obtains the confidence level of the prediction result of history sample independently of the algorithm of prediction model, as history
The confidence level of the result information of sample;For the history sample with legitimate reading, result information acquisition device is by history sample
The confidence level of legitimate reading be set as indicating the preset value of high confidence level, the confidence of the result information as history sample
Degree.
Optionally, the system also includes:Feedback device, for receiving true knot of the sample to be predicted about forecasting problem
Fruit, wherein the legitimate reading be used to train the prediction model together with corresponding sample to be predicted.
Exemplary embodiment according to the present invention provides one kind based on machine learning to provide sample to be predicted about prediction
The computer-readable medium of the prediction result of problem, wherein record is useful for executing following on the computer-readable medium
The computer program of step:(A) the history sample for obtaining the attribute information of sample to be predicted and occurring before sample to be predicted
The attribute information of example;(B) result information of the history sample about forecasting problem is obtained, wherein be directed to the history sample
Among do not have about forecasting problem legitimate reading history sample, using the prediction result of history sample as history sample
Result information;(C) attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample and the history
The result information of sample generates the forecast sample of sample to be predicted;And (D) based on machine learning techniques using being trained
Prediction model provides the prediction result of sample to be predicted for the forecast sample of sample to be predicted.
Optionally, in the computer-readable medium, in step (B), the result information of history sample is also obtained
Confidence level, also, in step (C), the attribute information of the sample to be predicted based on acquisition, the history sample attribute letter
Breath, the result information of the history sample and the history sample the confidence level of result information generate sample to be predicted
Forecast sample.
Optionally, in the computer-readable medium, in step (B), the prediction result of the history sample is by institute
It states prediction model or previous model corresponding with the last round of iteration of the prediction model provides.
Optionally, in the computer-readable medium, in step (B), for the history sample with legitimate reading,
Using the legitimate reading of history sample as the result information of history sample.
Optionally, in the computer-readable medium, the prediction model has following training process, in the training
In the process, according to the mode consistent with forecast sample, the legitimate reading or prediction knot of the history sample that training sample is based on
Result information of the fruit as the history sample, wherein the prediction result for the history sample that the training sample is based on is by working as
Before the prediction model that trains provide.
Optionally, in the computer-readable medium, in the training process, the prediction model is directed to training sample
Originally it is iterated training so that the prediction result for the history sample that training sample is based on is constantly updated with iteration.
Optionally, in the computer-readable medium, the computer program is performed to provide sample to be predicted online
Prediction result of the example about forecasting problem.
Optionally, in the computer-readable medium, sample to be predicted is generated at least one of in the following manner
The feature of the forecast sample of example:(C1) according to the result information of the result information of the history sample and the history sample
Confidence level is believed to filter out at least part history sample based on the attribute of at least part history sample filtered out
The attribute information of breath and sample to be predicted generates the feature of the forecast sample of sample to be predicted;(C2) according to the history sample
The confidence level of the result information of example and the result information of the history sample to the respective attributes information of the history sample into
Row weighting, and sample to be predicted is generated based on the attribute information of the attribute information of the history sample after weighting and sample to be predicted
The feature of the forecast sample of example;And (C3) is based respectively on the attribute letter of the attribute information of sample to be predicted, the history sample
Breath, the result information of the history sample and the history sample the confidence level of result information generate sample to be predicted
The feature of forecast sample.
Optionally, in the computer-readable medium, sample to be predicted corresponds to current transaction, and history sample corresponds to
The previous transaction of the predetermined quantity occurred before current transaction and/or occur in the predetermined amount of time before current transaction
Previously transaction, forecasting problem were whether relationship trading is fraudulent trading.
Optionally, in the computer-readable medium, in step (B), for the history sample without legitimate reading
Example, algorithm based on prediction model or obtains the confidence level of the prediction result of history sample independently of the algorithm of prediction model,
The confidence level of result information as history sample;For the history sample with legitimate reading, by the true knot of history sample
The confidence level of fruit is set as indicating the preset value of high confidence level, the confidence level of the result information as history sample.
Optionally, in the computer-readable medium, the computer program also executes following steps:(E) it receives and waits for
Predict legitimate reading of the sample about forecasting problem, wherein the legitimate reading be used to instruct together with corresponding sample to be predicted
Practice the prediction model.
Another exemplary according to the present invention is implemented, and provides one kind based on machine learning to provide sample to be predicted about pre-
It is executable to be stored with computer in storage unit for the computing device of the prediction result of survey problem, including storage unit and processor
Instruction set executes following step when set of computer-executable instructions conjunction is executed by the processor:(A) it obtains and waits for
Predict the attribute information of sample and the attribute information of the history sample occurred before sample to be predicted;(B) it is gone through described in obtaining
Result information of the history sample about forecasting problem, wherein do not have about the true of forecasting problem among the history sample
The history sample of real result, using the prediction result of history sample as the result information of history sample;(C) based on acquisition wait for it is pre-
The result information of the attribute information of test sample example, the attribute information of the history sample and the history sample is to be predicted to generate
The forecast sample of sample;And (D) utilizes the prediction model trained based on machine learning techniques, for the pre- of sample to be predicted
Test sample provided the prediction result of sample to be predicted originally.
Optionally, in the computing device, the confidence level of the result information of history sample is also obtained, also, in step
(C) in, the attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample, the history sample knot
The confidence level of the result information of fruit information and the history sample generates the forecast sample of sample to be predicted.
Optionally, in the computing device, in step (B), the prediction result of the history sample is by the prediction
Model or previous model corresponding with the last round of iteration of the prediction model provide.
Optionally, in the computing device, in step (B), for the history sample with legitimate reading, by history
Result information of the legitimate reading of sample as history sample.
Optionally, in the computing device, the prediction model has following training process, in the training process
In, according to the mode consistent with forecast sample, the legitimate reading or prediction result of the history sample that training sample is based on are made
For the result information of the history sample, wherein the prediction result for the history sample that the training sample is based on is by currently instructing
The prediction model practised provides.
Optionally, in the computing device, in the training process, the prediction model is carried out for training sample
Repetitive exercise so that the prediction result for the history sample that training sample is based on is constantly updated with iteration.
Optionally, in the computing device, the computing device provides sample to be predicted about forecasting problem online
Prediction result.
Optionally, in the computing device, the pre- of sample to be predicted is generated at least one of in the following manner
The feature of test sample sheet:(C1) according to the confidence level of the result information of the history sample and the result information of the history sample
Filter out at least part history sample, and the attribute information based at least part history sample filtered out and
The attribute information of sample to be predicted generates the feature of the forecast sample of sample to be predicted;(C2) according to the knot of the history sample
The confidence level of the result information of fruit information and the history sample is weighted the respective attributes information of the history sample,
And the pre- of sample to be predicted is generated based on the attribute information of the attribute information of the history sample after weighting and sample to be predicted
The feature of test sample sheet;And (C3) is based respectively on the attribute information, described of the attribute information of sample to be predicted, the history sample
The confidence level of the result information of the result information of history sample and the history sample generates the pre- test sample of sample to be predicted
This feature.
Optionally, in the computing device, sample to be predicted corresponds to current transaction, and history sample corresponds to current
The previous transaction for the predetermined quantity that transaction occurs before and/or the previous friendship occurred in the predetermined amount of time before current transaction
Easily, forecasting problem is whether relationship trading is fraudulent trading.
Optionally, it in the computing device, in step (B), for the history sample without legitimate reading, is based on
The algorithm of prediction model obtains the confidence level of the prediction result of history sample independently of the algorithm of prediction model, as history
The confidence level of the result information of sample;For the history sample with legitimate reading, by the confidence of the legitimate reading of history sample
Degree is set as indicating the preset value of high confidence level, the confidence level of the result information as history sample.
Optionally, computing device further includes:(E) legitimate reading of the sample to be predicted about forecasting problem is received, wherein institute
Legitimate reading is stated to be used to train the prediction model together with corresponding sample to be predicted.
The method and system of prediction result are provided based on machine learning according to an exemplary embodiment of the present invention, can be had
Effect ground obtains result information of the history sample about forecasting problem, and the result information of history sample is fused to sample to be predicted
Sample characteristics in, provide prediction to being suitably directed to sample to be predicted with reference to both history sample and sample to be predicted
As a result.
Description of the drawings
From the detailed description below in conjunction with the accompanying drawings to the embodiment of the present invention, these and or other aspects of the invention and
Advantage will become clearer and be easier to understand, wherein:
Fig. 1 shows according to an exemplary embodiment of the present invention to provide the frame of the system of prediction result based on machine learning
Figure;
Fig. 2 shows the systems for providing prediction result based on machine learning according to another exemplary embodiment of the present invention
Block diagram;
Fig. 3 shows according to an exemplary embodiment of the present invention to provide the flow of the method for prediction result based on machine learning
Figure;
Fig. 4 shows the flow chart of the method for trained prediction model according to an exemplary embodiment of the present invention;
Fig. 5 shows the method for providing prediction result based on machine learning according to another exemplary embodiment of the present invention
Flow chart;And
Fig. 6 shows the flow chart of the method for the training prediction model according to another exemplary embodiment of the present invention.
Specific implementation mode
In order to make those skilled in the art more fully understand the present invention, with reference to the accompanying drawings and detailed description to this hair
Bright exemplary embodiment is described in further detail.
In an exemplary embodiment of the present invention, it is directed to sample to be predicted in the following manner and executes prediction:In addition to waiting for
Except the attribute information for predicting sample itself, the attribute information of the history sample occurred before sample to be predicted is also additionally obtained
And result information (for example, result information is obtained by predicting means) of the history sample about forecasting problem, by above-mentioned category
Property information and result information are fused in the forecast sample of sample to be predicted so that machine learning model can be effectively with reference to going through
Both history sample and sample to be predicted provide corresponding prediction result.
Machine learning is the inevitable outcome that artificial intelligence study develops to certain phase, is dedicated to the hand by calculating
Section, improves the performance of system itself using experience.In computer systems, " experience " exists usually in the form of " data ", leads to
Machine learning algorithm is crossed, " model " can be generated from data, that is to say, that empirical data is supplied to machine learning algorithm, just
Model can be generated based on these empirical datas, when in face of new sample, model can provide corresponding judgement, that is, prediction result.
It should be noted that exemplary embodiment of the present invention is to specific machine learning algorithm and without any restrictions.In addition, should also note
Meaning, during training or application machine learning model, also using statistic algorithm, business rule and/or expertise etc.,
To further increase the accuracy of prediction result.
Fig. 1 shows according to an exemplary embodiment of the present invention to provide the frame of the system of prediction result based on machine learning
Figure.Particularly, the forecasting system propose it is a kind of based on the information of sample to be predicted and its history sample come be directed to wait for it is pre-
Test sample example execute prediction system for handling, in the system for handling, history sample about forecasting problem result information also by
It is fused to the forecast sample of sample to be predicted, wherein the result information of certain history samples can be obtained by the means of prediction.
System shown in FIG. 1 can be realized all by computer program with software mode, also can be realized by special hardware device,
It can also be realized by way of software and hardware combining.Correspondingly, each device for forming system shown in FIG. 1 can be only relied on
Computer program realizes the virtual module of corresponding function, can also be realized by hardware configuration the function general or
Dedicated devices can also be that operation has the processor etc. of corresponding computer program.Using the system, history sample can be combined
Result information utilize the attribute information of history sample, to effectively with reference to both history sample and sample to be predicted come compared with
The prediction result of current sample to be predicted is adequately provided.
As described above, in an exemplary embodiment of the present invention, it, can when being predicted for current sample to be predicted
It is judged in conjunction with both history sample and current sample, it particularly, can be under the guidance of the result information of history sample
To consider the correlation attribute information of the history sample, enabling under limited hardware environment and data resource, utilize machine
Device learning ways effectively hold the rule that history sample is embodied, and then provide accurate prediction result.
As shown in Figure 1, attribute information acquisition device 100 is for obtaining the attribute information of sample to be predicted and to be predicted
The attribute information of the history sample occurred before sample.
Exemplary embodiment according to the present invention will utilize machine learning techniques, in conjunction with sample to be predicted (for example, current
Sample) provide the prediction result of sample to be predicted with the relevant informations of one or more history samples.Sample mentioned here
Refer to can (that is, about forecasting problem) is estimated in some respect object and/or affairs, correspondingly, sample may include but
Be not limited to can be directed to whether cheat the transaction (for example, the transaction of credit card trade, deposit card, e-payment etc.) estimated,
It can be directed to whether user execute displaying content that specific behavior estimated (for example, the advertisement that can click of user, user are commercially available
The commodity etc. bought), the object (for example, physical signs etc.) or the like estimated of numberical range can be directed to.As showing
Example, when sample indicates transaction sample (for example, buying behaviors such as the transaction of credit card trade, deposit card, e-payment), attribute letter
Breath acquisition device 100 can obtain and merchandise every time related attribute information, for example, handing over incident position, the amount of money, trade company, quotient
The information such as product.For example, in the case where whether it is fraudulent trading that forecasting problem is relationship trading, sample to be predicted can correspond to work as
Preceding transaction, history sample can correspond to the previous transaction of the predetermined quantity occurred before current transaction and/or merchandise currently
The previous transaction occurred in predetermined amount of time before.As described above, history sample refers to occurring before sample to be predicted
Sample, as an example, history sample may include the one or more samples tightly occurred before sample to be predicted, particularly,
Attribute information acquisition device 100 can calculating forward the day of trade from current transaction to be predicted, obtain the predetermined quantity occurred recently
Historical trading and/or the historical trading occurred within nearest one section of predetermined time attribute information.In addition, history sample also may be used
To be the one or more samples occurred before current sample to be predicted chosen according to other rules.
As an example, attribute information acquisition device 100 can obtain the data record of sample to be predicted and at least one history
The data record of sample, these data records may include the category of each attribute about corresponding sample to be predicted or history sample
Property information.
For example, above-mentioned data can be the data for prestoring or generating, can also be the data received from outside.These
Data can relate to the identity information of object, for example, about information such as the identity of personnel, educational background, occupation, assets, contact methods.Or
Person, these data can also refer to the relevant information of affairs, for example, about the turnover of deal contract, both parties, subject matter,
The information such as loco.The content of the above attribute information is only as the example for explaining, in fact, being directed to sample to be predicted
The specific sample (for example, transaction, state, network behavior etc.) of example and history sample, can obtain the attribute information of corresponding contents.
That is the attribute information mentioned in exemplary embodiment of the present invention can relate to any sample (for example, any object or thing
Business) performance in terms of certain or property, and be not limited to limit individual, object, tissue, unit, mechanism, project, event etc.
Fixed or description.In fact, any can be by predicting that the data of attribute information of relevant issues can be applied using it as foundation
In exemplary embodiment of the present invention.
In fact, attribute information acquisition device 100 can obtain structuring or the unstructured properties data of separate sources, example
Such as, text data or numeric data etc..According to the specific sample of required prediction, attribute data may include deriving from various departments
Data, for example, from business entity data, from bank and other financial mechanism data, derive from metadata provider
Data, from internet (for example, social network sites) data, from mobile operator data, from APP run
The data of quotient, the data from express company, data from credit institution etc..These data can pass through input unit
It is input to attribute information acquisition device 100, or is automatically generated according to existing data by attribute information acquisition device 100,
Or it can (for example, storage medium (for example, data warehouse) on network) obtains from network by attribute information acquisition device 100
, it is obtained from external data source in addition, the intermediate data switch of such as server can help to attribute information acquisition device 100
Take corresponding data.Here, the data of acquisition can be by data conversions such as text analysis models in attribute information acquisition device 100
Module is converted to the format being easily processed.It should be noted that attribute information acquisition device 100 can be configured as by software, hardware and/or
The modules of firmware composition, these moulds certain module in the block or whole modules can be integrated into one or common cooperation with complete
At specific function.
Result information acquisition device 200 is for obtaining result information of the history sample about forecasting problem, wherein needle
To not having the history sample of the legitimate reading about forecasting problem, result information acquisition device 200 among the history sample
Using the prediction result of history sample as the result information of history sample.
Here, for each history sample, result information acquisition device 200 can further obtain the history sample about
The result information of forecasting problem.Wherein, result information is used to indicate the corresponding conclusion that history sample is directed to forecasting problem, the conclusion
Identical or relevant content is may indicate that with the prediction result of sample to be predicted.For example, being related to whether transaction is to take advantage of in forecasting problem
In the case of swindleness transaction, result information may be used to indicate whether each historical trading is fraudulent trading, and correspondingly, result information obtains
Take device 200 that any mode appropriate can be taken to obtain the result information of each historical trading.As an example, result information obtains
Take device 200 that can infer the result information of history sample according to the statistical result of a large amount of samples, for example, by significantly larger probability
The result that can occur as history sample result information (for example, since arm's length dealing is apparent relative to the probability of fraudulent trading
Therefore the result information of history sample can be set in advance as corresponding to arm's length dealing by higher);As another example, as a result believe
Ceasing acquisition device 200 can be using the legitimate reading of reflecting history sample actual conditions as the result information of corresponding history sample;Make
For another example, result information acquisition device 200 can be according to other judgment criterions such as Expert Rules, the attribute based on history sample
The result information of history sample is correspondingly arranged in information.It should be noted that above-mentioned example is not intended to limit the exemplary implementation of the present invention
Any mode appropriate can be used to obtain the result information of history sample in the range of example, those skilled in the art.
Since when executing prediction for sample to be predicted, the legitimate reading for the history sample being based on may still can not
, therefore, exemplary embodiment according to the present invention can be by such history for the history sample without legitimate reading
The prediction result of sample is as its result information.For example, in the example that credit card steals brush, in order to predict that currently this brush card is
No to steal brush, attribute information acquisition device 100 can obtain the current attribute information merchandised and previously merchandised, however, previously handing over
Whether it is easily to steal brush to be most likely in still undetermined state (for example, user, which is also unaware that, has occurred robber's brush or user also
Have not enough time to feedback steal brush generation), for this purpose, result information acquisition device 200 can will about this part previously transaction whether be
The prediction result of brush is stolen as corresponding result information.As another example, recommendation items are directed to (for example, recommending in prediction user
Commodity) click probability when, attribute information acquisition device 100 can obtain the case where first each for the previous period recommendation items are clicked,
However, whether some samples can be marked and be clicked in these history samples for obtaining, but still due to nearest data
It is untreated the reasons such as to finish, there can be the history sample that at least part not confirm click condition still, for this partial history sample
Example, similarly, result information acquisition device 200 is using the prediction result of these history samples as their result information.
As can be seen that in an exemplary embodiment of the present invention, other than obtaining the attribute information of history sample, going back volume
Particularly for the still uncertain history sample of result, corresponding prediction result is made for the outer result information for obtaining history sample
For the result information of this partial history sample.By the above-mentioned means, can be under the guidance of the result information of relevant historical sample
The attribute information or its statistical data for effectively applying each history sample, contribute to study/prediction of hoisting machine learning model
Effect.
As described above, other than the history sample without legitimate reading, result information acquisition device 200 can be according to appointing
What mode appropriate obtains the result information of other history samples.As an example, for the history sample with legitimate reading,
Result information acquisition device 200 can be using the legitimate reading of history sample as the result information of history sample.Particularly, according to
Exemplary embodiment of the present invention, in order to which the history sample for executing prediction for sample to be predicted and referring to can be greatly classified into two
Kind, it is another one is the history sample (for example, the history sample occurred in the period earlier) for being labelled with legitimate reading
Kind is the history sample (for example, the history sample occurred recently) that there is no method to obtain legitimate reading, is gone through accordingly for the first
History sample, result information acquisition device 200 can obtain their legitimate reading information as a result, and as second of history sample
Example, result information acquisition device 200 can obtain their prediction result information as a result.In this way, arbitrary history
Sample can be used in executing prediction for sample to be predicted, this is particularly useful for the still unlabelled sample such as recent sample
The prediction scene to play an important role.
Sample generating means 300 are for the attribute information of the sample to be predicted based on acquisition, the attribute of the history sample
The result information of information and the history sample generates the forecast sample of sample to be predicted.Here, forecast sample is machine
The basis that learning model is used to be predicted for sample to be predicted, that is to say, that correspond to sample to be predicted, sample generates dress
300 forecast samples for being made of generation multiple features are set, the feature can describe the spy of sample to be predicted from different perspectives
Property (characteristic for including the history sample of sample to be predicted), for example, the feature can be the attribute information of each sample itself,
Can also be the information field by these attribute informations obtain after characteristic processing or statistical disposition, it is particularly, described
Feature may include the result information of history sample itself, may also comprise the result information of history sample and the knot of correlation attribute information
Item is closed, for example, the end value for being counted and being generated to attribute information based on result information.
Exemplary embodiment according to the present invention, the feature of forecast sample will reflect the finger of the result information of history sample
Lead effect, it should be appreciated that the concrete mode that result information participates in the Feature Engineering of forecast sample is unrestricted, and following example is only used for
Play the role of explanation:
For example, sample generating means 300 can filter out at least part history sample according to the result information of history sample
Example, and the attribute information next life of the attribute information based at least part history sample filtered out and sample to be predicted
At the feature of the forecast sample of sample to be predicted.In this case, it can only consider that result meets the history sample of specified conditions,
To improve the effect of model pre-estimating.For example, in the example of transaction swindling, can be possible for just according to result information to choose
The historical trading often merchandised, the relativity between being merchandised by this partial history and currently being merchandised can effectively judge current
Whether transaction is fraudulent trading.
In another example sample generating means 300 can be according to the result information of history sample to the corresponding category of the history sample
Property information is weighted, and is generated based on the attribute information of the attribute information of the history sample after weighting and sample to be predicted
The feature of the forecast sample of sample to be predicted.In this case, it is contemplated that the result information of each history sample effectively obtains
The overall condition of history sample is taken, and is had together with the sample to be predicted attribute information of itself based on the overall condition of history sample
Effect ground executes prediction.
In another example sample generating means 300 can be based respectively on the attribute letter of the attribute information of sample to be predicted, history sample
Breath, the history sample result information generate the feature of the forecast sample of sample to be predicted.In this case, history sample
The result information of example extends the dimension of feature space so that model can accordingly learn to more structurally sound knowledge, Jin Erti
High prediction effect.
Above example is only not intended in any way to limit exemplary embodiment of the present invention as explaining, in fact, sample generates
Device 300 can be directed to attribute information, the attribute information of relevant historical sample and the knot of the history sample of sample to be predicted
Fruit information executes Feature Engineering to generate each feature of forecast sample, and here, those skilled in the art can be suitble to according to any
Feature Engineering mode design the feature of forecast sample, such as, it is contemplated that service logic, the machine learning model of forecasting problem
The various factors such as algorithm characteristic execute Feature Engineering, for example, can be to the attribute information of sample to be predicted or history sample
Value carries out the processing of the various general characteristics engineerings such as combination, discretization, extraction part field value, rounding or statistics.It answers
Note that exemplary embodiment of the present invention is not only restricted to any specific Feature Engineering scheme.
Furthermore, it is to be understood that exemplary embodiment according to the present invention, the feature of forecast sample can not only be based on sample to be predicted
It is related can be also additionally based on other for attribute information, the attribute information of history sample and the result information of history sample of example
Information.For example, result information acquisition device 200 can also obtain the confidence level of the result information of history sample, and correspondingly, sample life
At device 300 can be based on the sample to be predicted of acquisition attribute information, the history sample attribute information, the history sample
Result information and the confidence level of result information of the history sample generate the forecast sample of sample to be predicted.
Here, the confidence level of result information is alternatively referred to as reliability, confidence level, confidence coefficient.Make to population parameter
When going out to estimate, due to the randomness of sample, conclusion is always uncertain, therefore, interval estimation is indicated using confidence level
Assurance degree, here, the span of confidence interval is the positive function of confidence level, that is, the assurance degree required is bigger, certainly will obtain
As soon as wider confidence interval, this correspondingly reduces the order of accuarcy of estimation.
Here, those skilled in the art can obtain the confidence of the result information of history sample in any suitable fashion
Degree, for example, for the history sample without legitimate reading, result information acquisition device 200 can be based on the algorithm of prediction model
Or the confidence level of the prediction result of history sample is obtained independently of the algorithm of prediction model, result information as history sample
Confidence level;In addition, for the history sample with legitimate reading, result information acquisition device 200 can be by the true of history sample
The confidence level of real result is set as indicating the preset value of high confidence level, the confidence level of the result information as history sample.
Correspondingly, sample generating means 300 can be using the confidence level of the result information of history sample as individual pre- test sample
Eigen to extend the feature space of forecast sample, alternatively, sample generating means 300 can also by confidence level according to result information
Similar mode is applied to Feature Engineering;Alternatively, sample generating means 300 can also be answered according to the mode different from result information
Use confidence level.It should be noted that exemplary embodiment of the present invention has been not limited to form the feature of forecast sample and answer confidence level
Concrete mode for Feature Engineering.
As an example, sample generating means 300 can generate sample to be predicted at least one of in the following manner
The feature of forecast sample:
First way:Sample generating means 300 can be according to the history sample result information and the history sample
The confidence level of the result information of example is gone through to filter out at least part history sample based on the described at least part filtered out
The attribute information of the attribute information of history sample and sample to be predicted generates the feature of the forecast sample of sample to be predicted.At this
In the case of kind, it can only consider that result meets the history sample of specified conditions, to improve the effect of model pre-estimating.For example, handing over
In the example easily cheated, the historical trading for being possible for arm's length dealing can be chosen according to result information and its confidence level, passed through
The relativity that this partial history is merchandised between current transaction can effectively judge whether currently merchandise is fraudulent trading.
The second way:According to the confidence of the result information of the history sample and the result information of the history sample
Degree is weighted the respective attributes information of the history sample, and the attribute information based on the history sample after weighting and waits for
The attribute information of sample is predicted to generate the feature of the forecast sample of sample to be predicted.In this case, it is contemplated that each history
The result information of sample and its confidence level effectively obtain the overall condition of history sample, and the whole feelings based on history sample
Condition efficiently performs prediction together with the sample to be predicted attribute information of itself.
The third mode:It is based respectively on the attribute information of sample to be predicted, the attribute information of the history sample, described goes through
The confidence level of the result information of the result information of history sample and the history sample generates the forecast sample of sample to be predicted
Feature.In this case, the result information of history sample and confidence level extend the dimension of feature space so that model energy
More structurally sound knowledge is arrived in enough corresponding study, and then improves prediction effect.
Prediction result is provided device 400 and is used for using the prediction model trained based on machine learning techniques, pre- for waiting for
The forecast sample of test sample example provides the prediction result of sample to be predicted.
That is, prediction result, which provides device 400, can be used for obtaining the prediction model by being trained based on machine learning
The prediction result provided for sample to be predicted.Here, for the forecast sample of each sample to be predicted, prediction model can
The prediction result of the sample to be predicted about forecasting problem is provided.Correspondingly, prediction result provides device 400 and can obtain by predicting
The prediction result that model provides.Here, prediction result provides device 400 and can control prediction model to be directed to sample execution to be predicted
Predict and thus to obtain the prediction result of prediction model, alternatively, prediction result device 400 is provided can be from positioned at external prediction mould
Type receives corresponding prediction result.Prediction result, which provides device 400, to be supplied to user by the prediction result of acquisition, alternatively, in advance
The prediction result of acquisition can be supplied to storage device, decision making device or other devices with into traveling by surveying result and providing device 400
The processing of one step.
Here, prediction model is the machine learning model trained based on training dataset, here, training dataset packet
Include a large amount of training samples, the feature of each training sample constitutes consistent with forecast sample and includes current sample about forecasting problem
Legitimate reading (that is, label).Citing is got on very well, it is assumed that the targeted forecasting problem of prediction model is whether current transaction is fraud
Transaction can then utilize true sale whether a large amount of labeled fraud to form the training sample of prediction model, wherein very
The fraudulent trading that occurred in fact can correspond to the positive sample of prediction model, and the non-fraudulent trading really occurred can correspond to it is pre-
Survey the negative sample of model, as an example, the label of positive sample can value be 1, the label of negative sample can value be 0.About training
The characteristic of sample needs to build according to consistent mode is corresponded to completely with forecast sample, particularly, each trains sample
Originally current sample is can correspond to, correspondingly, attribute information that can be based on current sample, the history sample occurred before current sample
The attribute information of example, the history sample about forecasting problem result information (or further also based on the history sample
The confidence level of result information) generate each feature of training sample so that the feature of training sample and the feature of forecast sample
It is upper completely the same (that is, uniform in the selection mode of history sample and the various aspects such as concrete mode of Feature Engineering constituting
It causes).
As described above, in order to generate the feature of training sample, the result information for obtaining corresponding historical sample is needed, as
The acquisition modes of preferred embodiment, the result information involved by training sample can correspond to the result information involved by forecast sample
Acquisition modes.That is, can be tied by prediction result, legitimate reading, statistics according to consistent mode corresponding with forecast sample
Fruit etc. is arranged in correspondence with the result information of history sample.
As an example it is assumed that executed for sample to be predicted estimate scene (for online Prediction user clicking rate or
The scenes such as fraudulent trading) in, in order to generate the forecast sample of prediction model, not only need to obtain sample to be predicted and its history sample
The attribute information of example, it is also necessary to obtain the result information of each history sample.However, for various reasons, it is likely that executing
The legitimate reading information of all history samples can not be also obtained when estimating (for example, also having little time to collect or confirm when online Prediction
The legitimate reading of all history samples), that is to say, that among the history sample that forecast sample is based on, some history
Labeled legitimate reading, also at least part history sample do not have legitimate reading still to sample.Therefore, labeled true
As a result history sample can be by legitimate reading information as a result, and the result information of other history samples may be from this
The prediction result of a little history samples, as an example, can be the prediction result provided by prediction model.
In that case it is preferable that the training process and prediction process of prediction model can be consistently designed, in particular,
Result information involved in training sample and forecast sample can be built according to consistent mode, so that training sample is on the scene
Forecast sample is more approached on scape.For this purpose, although when building training sample, current sample and its history sample have had very
It is real as a result, still, it is right in training sample for the history sample for using prediction result information as a result in forecast sample
The history sample answered equally will be used as its result information using prediction result.For example, in stealing the example of brush about credit card,
Assuming that forecast sample be based on be it is to be predicted transaction and nearest three months occur historical trading, wherein this month occur
Historical trading does not have legitimate reading, needs to be used as result information using prediction result, and the historical trading of the other two moon
Legitimate reading information as a result can be used.Correspondingly, when forming training sample, for the of that month history occurred of currently merchandising
Transaction, also will be used as result information using prediction result, and legitimate reading conduct can be used in the historical trading of the other two moon
Result information.
In the examples described above, the prediction result of history sample can be provided by prediction model, and here, prediction model can be used
The mode of iteration is trained, the model currently trained obtained from continuous iteration, sustainable renewal training sample
The prediction result of involved history sample.Correspondingly, it when the prediction model completed in application training is estimated, can be used
Prediction model generates the prediction result of the history sample involved by forecast sample;Here, alternately, can also be used
One takes turns model caused by iteration to provide the prediction result of the history sample involved by forecast sample.
That is, the prediction knot for the history sample that the forecast sample obtained by result information acquisition device 200 is based on
Fruit can be provided by prediction model, alternatively, also can be by previous model corresponding with the last round of iteration of the prediction model
It provides.
It, can be according to the side consistent with forecast sample in the training process of the prediction model accordingly, as example
Formula, the legitimate reading of the history sample that training sample is based on or prediction result as the result information of the history sample,
Wherein, the prediction result for the history sample that the training sample is based on is provided by the prediction model currently trained.Into one
Step ground, in the training process, the prediction model is iterated training for training sample so that training sample is based on
The prediction result of history sample constantly updated with iteration.
Prediction result, which provides device 400, can be supplied to the forecast sample of sample to be predicted the prediction mould as above trained
Type, to obtain corresponding prediction result.As an example, system shown in FIG. 1 can provide sample to be predicted about prediction online
The prediction result of problem.Correspondingly, attribute information acquisition device 100, result information acquisition device 200, sample generating means 300
There is provided device 400 with prediction result can online processing data.However, it should be understood that exemplary embodiment of the present invention is not limited
In, whole system or in which certain devices can also be operated under off-line state.
As an example, the training process of above-mentioned prediction model also may be incorporated into prediction system according to an exemplary embodiment of the present invention
In system.
Fig. 2 shows the systems for providing prediction result based on machine learning according to another exemplary embodiment of the present invention
Block diagram.With reference to Fig. 2, the system comprises attribute information acquisition device 100, result information acquisition device 200, sample generating means
300, prediction result provides device 400 and model training apparatus 500.
As can be seen that compared to Figure 1, Fig. 2 further comprises model training apparatus 500, in addition to this, attribute information obtains dress
Setting 100, result information acquisition device 200 and sample generating means 300 also will additionally execute the behaviour trained about prediction model
Make.
Since attribute information acquisition device 100, result information acquisition device 200, sample being described in detail referring to Fig.1
Generating means 300 and prediction result are provided device 400 and are being executed operation when estimating using the complete prediction model of training, here will
No longer aforesaid operations are repeated, only describe processing related with the training stage of prediction model below.
In order to complete the training of prediction model, a large amount of training samples need to be built based on true sample, wherein as showing
Example, each training sample may indicate that a current legitimate reading of the sample about forecasting problem.The characteristic of training sample can
The result information of attribute information, the history sample based on current sample and its history sample (or further considers the knot
The confidence level of fruit information), and the label of training sample may indicate that the legitimate reading of current sample.
Particularly, attribute information acquisition device 100 can obtain the attribute information of current sample corresponding with training sample
And the attribute information of the history sample occurred before current sample, here, attribute information acquisition device 100 also obtains currently
Legitimate reading (that is, flag attribute) of the sample about forecasting problem.
Result information acquisition device 200 can obtain the history sample occurred before current sample about forecasting problem
Result information, here, it is preferred that, result information acquisition device 200 can be corresponding according to the constituted mode of forecast sample
Ground obtains the legitimate reading of above-mentioned history sample or prediction result is used as the result informations of these history samples.In addition to this,
Alternately, result information acquisition device 200 can also accordingly obtain each result according to the acquisition modes of result information
The confidence level of information.
It is the legitimate reading of attribute information, current sample that sample generating means 300 can be based on the current sample of acquisition, current
The result information of the attribute information of the history sample of sample and the history sample (or is based further on the confidence of result information
Degree) generate the training sample of current sample.It should be understood that sample generating means 300 can be passed through according to the characteristic Design of forecast sample
It is handled by same Feature Engineering to generate the feature of training sample, in addition, sample generating means 300 can be by the true of current sample
Label of the real result as training sample.
Model training apparatus 500 can be based on the training sample generated by sample generating means 300, according to scheduled machine
Learning algorithm trains prediction model.Here, it should be noted that exemplary embodiment of the present invention does not limit the specific calculation of prediction model
Method.Particularly, the mode of iteration can be used to train prediction model in model training apparatus 500, during this, is used as going through
The prediction result (or together with its confidence level) of the result information of history sample is thus continually updated so that the characteristic quilt of training sample
It constantly updates.For this purpose, in the model training of each round iteration, result information acquisition device 200 can be utilized and currently be trained
The model model of last round of iteration (that is, corresponding to) obtain the prediction result of relevant historical sample, using as the history
The result information of sample.For first round iteration, result information acquisition device 200 can obtain relevant historical otherwise
The prediction result of sample, using the initial results information as the history sample.For example, result information acquisition device 200 can base
The prediction result of relevant historical sample is set in the result statistical information of a large amount of samples, alternatively, result information acquisition device 200
The prediction result of relevant historical sample can be inferred to according to predetermined artificial rule, alternatively, result information acquisition device 200 can be random
The prediction result of relevant historical sample is set.For such prediction result, result information acquisition device 200 will can accordingly be set
Reliability uses as default or calculates corresponding confidence level in a predetermined manner.In addition, result information acquisition device 200
It can give up in first round iteration with the history sample of prediction result information as a result in training sample, and only with true
As a result the history sample of information as a result.After first round iteration, you can needed using the prediction model trained to be directed to
The history sample of prediction result is wanted to be predicted.As described above, model training apparatus 500 can constantly iteration prediction model, directly
To meeting the corresponding condition of convergence, the prediction model so trained can learn to the history sample of various results under how needle
Current sample is predicted.Training details about prediction model is said before can refer to about what prediction model itself carried out
Bright, details are not described herein.
Correspondingly, the prediction model that training is completed can be supplied to prediction result to provide device by model training apparatus 500
400 so that prediction result, which provides device 400, to provide the pre- of sample to be predicted using the prediction model to be directed to forecast sample
Survey result.
It as an example, in system shown in fig. 1 or fig. 2, may also include feedback device (not shown), waited for for reception pre-
Legitimate reading of the test sample example about forecasting problem, wherein the legitimate reading be used to train together with corresponding sample to be predicted
The prediction model.Particularly, in system such as shown in FIG. 1, sample to be predicted that feedback device can be received it
Legitimate reading about forecasting problem is stored, and the legitimate reading of storage is supplied to the external trainer of prediction model
Device, with re -training and/or update prediction model.Alternatively, in system such as shown in Fig. 2, feedback device can be by its institute
Receive sample to be predicted be supplied to attribute information acquisition device 100 about the legitimate reading of forecasting problem, using as with wait for it is pre-
The authentic signature of the corresponding training sample of test sample example.
It describes according to an exemplary embodiment of the present invention to provide prediction result based on machine learning hereinafter with reference to Fig. 3
Method flow chart.Here, as an example, the method can provide prediction knot of the sample to be predicted about forecasting problem online
Fruit, correspondingly, at least part step need to execute online.
Method shown in Fig. 3 can be as shown in Figure 1 forecasting system execute, also can be completely by computer program with software
Mode is realized or is realized by being stored with the computer-readable medium of the computer program.In addition, can also be matched by specific
The computing device set executes method shown in Fig. 3.Since the processing for describing correlation technique step referring to Fig.1 above is thin
Section, repeats, it should be appreciated that according to an exemplary embodiment of the present invention pre- for that will not refer again to Fig. 3 below the content of this part
Survey method can equally cover described all processing details referring to Fig.1.
With reference to Fig. 3, in step S100, obtains the attribute information of sample to be predicted and occur before sample to be predicted
The attribute information of history sample.In order to make it easy to understand, stealing the specific example of brush to be described, so below with reference to credit card
And, it should be appreciated that exemplary embodiment of the present invention is not limited to any specific forecasting problem or related sample.Specifically
Come, it is assumed that current this brush card transaction of sample instruction to be predicted, forecasting problem instruction are about current this brush card transaction
No is the fraudulent trading for such as stealing brush, and history sample refers to the transaction of swiping the card occurred before current this brush card transaction.
As an example, credit card can be obtained online when the secondary generated attribute information currently merchandised of swiping the card, for example, swiping the card
The amount of money, place of swiping the card, commodity of swiping the card, merchant identification of swiping the card etc.;In addition, can also obtain caused by the credit card swipes the card in the past
The attribute information of historical trading, for example, can obtain nearest trimestral transaction attribute information or nearest N (N is whole more than 1
Number) transaction attribute information.
Next, in step s 200, obtaining result information of the history sample about forecasting problem, wherein for described
The history sample for not having the legitimate reading about forecasting problem among history sample, using the prediction result of history sample as going through
The result information of history sample.
Citing is got on very well, in this step, need to obtain about each historical trading whether be fraudulent trading result information,
In, for the historical trading of fraudulent trading can not be confirmed whether it is in execution prediction fashion (that is, without true fraud result
Historical trading), using the prediction result of the historical trading as its result information.Here, in all relevant historical tradings
In the case of not having true fraud result, the result information that prediction result is used as historical trading can be used uniformly.As
Example, prediction result here can be made by whether for predicting currently to merchandise, to be prediction model of fraudulent trading itself provide
It, also can be corresponding to the last round of iteration by prediction model in the case where the prediction model is iterated training for optional mode
Previous model provides the prediction result of historical trading.
However, in other cases, in all relevant historical tradings, it is also possible to which there are some with legitimate reading
Historical trading merchandises for this partial history, the other information other than prediction result can be used and be used as result information, example
Such as, conclusion will can really be cheated as corresponding result information.It should be noted that the result information of this partial history transaction is in addition to can
Be historical trading legitimate reading except, can also be based on statistical probability obtained from great amount of samples, equally can also be
The fraud possibility come out using model pre-estimating.
Next, in step S300, the attribute information of the sample to be predicted based on acquisition, the history sample attribute
The result information of information and the history sample generates the forecast sample of sample to be predicted.
Citing is got on very well, and can correspond to each current transaction to generate corresponding forecast sample, feature is directed not only to currently
The attribute information of transaction is directed to the attribute information of relevant historical transaction, in particular, when generating the feature of forecast sample, also
Further combined with the fraud result information of each historical trading.
Here, the processing mode of any Feature Engineering appropriate can be used to generate the feature of forecast sample, as an example,
The feature may include at least one among following item:
(1) at least one attributive character currently merchandised, for example, transaction amount, loco, tradable commodity, trader
Family mark etc.;
(2) at least one attributive character of historical trading, for example, transaction amount, loco, tradable commodity, trader
Family mark etc.;Here, can also be the statistics of attributes feature of historical trading, for example, average value/maximum value of transaction amount/most
Small value etc.;
(3) result information of historical trading, for example, fraud result of the value between [0,1], wherein true fraud knot
By can correspond to 1, true non-fraud conclusion can correspond to 0, and model prediction result or sample statistics result can be between 0 and 1
Probability value;Here, the feature can also be the result information statistical nature of historical trading;
(4) attributive character filtered out based on result information, for example, fraud end value is less than the historical trading of predetermined threshold
Transaction amount;
(5) the statistics of attributes feature based on result information, for example, transaction amount is to cheat the value of result information as weight
Weighted value.
It should be noted that the above project is only as an example, not a limit, for example, features described above producing method may also be combined with use,
For example, the weighted feature after screening can further be obtained, for example, the historical trading for being less than predetermined threshold to fraud end value carries out
It is handled using accordingly to cheat the value of result information as the weighting of weight to obtain corresponding feature.
With continued reference to Fig. 3, in step S400, the available prediction model trained based on machine learning techniques, for
The forecast sample of sample to be predicted provides the prediction result of sample to be predicted.
As described above, as an example, prediction model here is for predicting whether currently merchandise is fraudulent trading.It will prediction
Sample is supplied to the prediction model, can get the prediction result for the probability that the current transaction of instruction is fraudulent trading.
The flow of the method for trained prediction model according to an exemplary embodiment of the present invention is described hereinafter with reference to Fig. 4
Figure, here, a part of device in the system that method shown in Fig. 4 can be as shown in Figure 2 execute, and can also be instructed by individual model
Practice device to execute, alternatively, the method can also be realized by computer program with software mode or by storing completely
The computer-readable medium of computer program is stated to realize.In addition, also Fig. 4 institutes can be executed by the computing device of specific configuration
The method shown.Similarly, due to describing the processing details of correlation technique step with reference to Fig. 2 above, for this part
Fig. 4 will not be referred again to below content to repeat, it should be appreciated that model training method according to an exemplary embodiment of the present invention equally may be used
Cover all processing details with reference to described in Fig. 2.
With reference to Fig. 4, in step S1000, obtains the attribute information of current sample and occur before current sample
The attribute information of history sample.Here, current sample refers to the sample corresponding to current training sample.
Equally, for predicting the training process of prediction model of credit card robber's brush, in this step, some brush can be obtained
The attribute information and the history that occurs before the transaction for blocking transaction are swiped the card the attribute information of transaction, for example, transaction amount, friendship
Easy place, tradable commodity, transaction merchant identification etc..
In step S1100, authentic signature of the current sample about forecasting problem is obtained.
As an example, in this step, can obtain current transaction actually whether be fraudulent trading label as a result, its
In, the label result of fraudulent trading can correspond to value 1, and the label result of non-fraudulent trading can correspond to value 0.
In step S2000, result information of the history sample about forecasting problem is obtained, here, as an example, can
According to the mode consistent with forecast sample, the legitimate reading or prediction result for the history sample that training sample is based on are as institute
State the result information of history sample, wherein the prediction result for the history sample that the training sample is based on can be by currently training
The prediction model that goes out provides.
As an example, in this step, in training sample with the history in forecast sample with true fraud result
Its true label can be used as its result information in transaction corresponding historical trading in sequential;And for training sample
In with the historical trading for not having true fraud result in forecast sample in sequential corresponding historical trading, its prediction can be used
As a result rather than its true fraud result is as its result information, prediction result here by the prediction model that currently trains Lai
It provides.
In step S3000, attribute information based on the current sample and history sample that are obtained in step S1000, in step
The label for the current sample that rapid S1100 is obtained generates current sample in the result information of the step S2000 history samples obtained
Training sample.
Here, it should be noted that the feature of training sample and the feature of forecast sample are consistent, only in training sample also
Additionally include the label information of current sample.
Next, in step S4000, using specific machine learning algorithm, prediction is trained based on training sample
Model, here, exemplary embodiment according to the present invention can iteratively train prediction model, and correspondingly, step S4000 can be right
Ying Yuyi takes turns iteration.
After a wheel training is completed, it is pre- can to judge whether the prediction model currently trained meets in step S5000
The fixed condition of convergence proceeds to step S6000 to export prediction model if meeting the condition of convergence.
If not meeting the condition of convergence, return to step S2000, using the prediction model currently trained come again
The result information for obtaining history sample correspondingly updates each training sample in step S3000, is then held in step S4000
The model training of row next round.
Exemplary embodiment according to the present invention, in forecast sample/training sample of prediction model, in addition to combining history
It, can also be further combined with the confidence level of result information, to be further ensured that the effect of model except the result information of sample.
It describes to provide prediction based on machine learning according to another exemplary embodiment of the present invention hereinafter with reference to Fig. 5
As a result the flow chart of method.As can be seen that the method for method shown in fig. 5 as shown in figure 3 is similar, the scene of the two application and
The main body of execution can be same or like, and the step S100 in Fig. 5 can be identical as the step S100 in Fig. 3, the step in Fig. 5
S400 can be similar with the step S400 in Fig. 3, will no longer repeat here repetition or similar content.
Only emphasis describes the method for Fig. 5 and the distinct technology contents of method of Fig. 3 below.Particularly, in step
In S210, other than obtaining history sample about the result information of forecasting problem, the result information of history sample is also obtained
Confidence level.Here, as an example, for the history sample without legitimate reading, algorithm or independence that can be based on prediction model
The confidence level that the prediction result of history sample is obtained in the algorithm of prediction model, the confidence of the result information as history sample
Degree;For the history sample with legitimate reading, set the confidence level of the legitimate reading of history sample to indicate high confidence water
Flat preset value, the confidence level of the result information as history sample.
Correspondingly, in step S310, the attribute information of the sample to be predicted based on acquisition, the history sample attribute
The confidence level of the result information of information, the result information of the history sample and the history sample generates sample to be predicted
Forecast sample.As an example, can according to result information similar mode, the confidence level of result information is also applied to together
The Feature Engineering of forecast sample.
Here, again for predicting whether current transaction is fraudulent trading, in step S310, it is any appropriate to can be used
The processing mode of Feature Engineering generate the feature of forecast sample, as an example, the feature may include among following item
At least one of:
(1) at least one attributive character currently merchandised, for example, transaction amount, loco, tradable commodity, trader
Family mark etc.;
(2) at least one attributive character of historical trading, for example, transaction amount, loco, tradable commodity, trader
Family mark etc.;Here, can also be the statistics of attributes feature of historical trading, for example, average value/maximum value of transaction amount/most
Small value etc.;
(3) result information and confidence level of historical trading, for example, the result information can be value between [0,1]
Fraud result, wherein true fraud conclusion can correspond to 1, and true non-fraud conclusion can correspond to 0, model prediction result or
Sample statistics result can be the probability value between 0 and 1;In addition, confidence level can weigh above-mentioned fraud result reliability
Confidence value, true conclusion or the true non-fraud conclusion of cheating can have highest confidence value, in advance as legitimate reading
Surveying the confidence level of result or statistical result can calculate according to corresponding method and be obtained;Here, the result of historical trading can also be used
The statistical nature of information and/or confidence level;
(4) attributive character filtered out based on result information and/or confidence information, for example, fraud end value is less than
The transaction amount of the historical trading of predetermined threshold, confidence level are tied higher than the transaction amount of the historical trading of predetermined threshold or fraud
Fruit value is less than the transaction amount of predetermined threshold and confidence level higher than the historical trading of predetermined threshold;
(5) the statistics of attributes feature based on result information and/or confidence information, for example, transaction amount is to cheat result
The value of information be the weighted value of weight, transaction amount using the value of confidence information as the weighted value of weight, transaction amount with
Cheat the weighted value for being combined into weight of the value of result information and the value of confidence information.
It should be noted that the above project is only as an example, not a limit, for example, features described above producing method may also be combined with use.
Correspondingly, Fig. 6 shows the flow chart of the method for the training prediction model according to another exemplary embodiment of the present invention.
As can be seen that method method as shown in fig. 4 shown in fig. 6 is similar, the scene of the two application and the main body of execution can be identical
Or it is similar, step S1000, S1100, S4000, S5000 and S6000 in Fig. 6 can or phases identical as the corresponding steps in Fig. 4
Seemingly, no longer repetition or similar content will be repeated here.
Only emphasis describes the method for Fig. 6 and the distinct technology contents of method of Fig. 4 below.Particularly, in step
In S2100, other than obtaining history sample about the result information of forecasting problem, the result information of history sample is also obtained
Confidence level.Correspondingly, in step S3100, the attribute information of the current sample based on acquisition, the history sample attribute letter
Breath, the result information of the history sample and the history sample the confidence level of result information generate the instruction of current sample
Practice sample.It should be noted that the calculation of confidence level is consistent with the content described with reference to Fig. 5, and the Feature Engineering of training sample
Can be consistent with the content with reference to Fig. 5 descriptions, it will no longer repeat here.
Above by reference to Fig. 1 to Fig. 6 describe it is according to an exemplary embodiment of the present invention based on machine learning come provide wait for it is pre-
Test sample example is about the method and system of the prediction result of forecasting problem and corresponding model training method and system.It should be understood that
The above method can be realized by the program being recorded in computer-readable media, correspondingly, exemplary implementation according to the present invention
Example, it is possible to provide a kind of to provide computer-readable Jie of prediction result of the sample to be predicted about forecasting problem based on machine learning
Matter, record is useful for executing the computer program of following methods step on the computer-readable medium:(A) it obtains to be predicted
The attribute information of the attribute information of sample and the history sample occurred before sample to be predicted;(B) the history sample is obtained
Result information of the example about forecasting problem, wherein for the true knot not having among the history sample about forecasting problem
The history sample of fruit, using the prediction result of history sample as the result information of history sample;(C) sample to be predicted based on acquisition
The result information of the attribute information of example, the attribute information of the history sample and the history sample generates sample to be predicted
Forecast sample;(D) using the prediction model that is trained based on machine learning techniques, for sample to be predicted forecast sample come
The prediction result of sample to be predicted is provided.
Computer program in above computer readable medium can be in client, host, agent apparatus, server etc.
Run in the environment disposed in computer equipment, it should be noted that the computer program can be additionally used in execute in addition to above-mentioned steps with
Outer additional step or executed when executing above-mentioned steps more specifically handles, these additional steps and is further processed
Content is described referring to figs. 1 to Fig. 6, here in order to avoid repetition will be repeated no longer.
Correspondingly, the above-mentioned system for providing prediction result of the sample to be predicted about forecasting problem based on machine learning
The operation of computer program can be completely dependent on to realize corresponding function, that is, the function structure of each device and computer program
In each step it is corresponding so that whole system is called by special software package (for example, libraries lib), corresponding to realize
Function.
On the other hand, Fig. 1 and each device shown in Fig. 2 and unshowned relevant apparatus can also be by hardware, soft
Part, firmware, middleware, microcode or its arbitrary combination are realized.When being realized with software, firmware, middleware or microcode, use
It can be stored in the computer-readable medium of such as storage medium, make in the program code or code segment for executing corresponding operating
Corresponding operation can be executed by reading and running corresponding program code or code segment by obtaining processor.
Here, exemplary embodiment of the present invention is also implemented as computing device, which includes storage unit
And processor, set of computer-executable instructions conjunction is stored in storage unit, when the set of computer-executable instructions is closed by institute
When stating processor and executing, executes and above-mentioned provide the side of prediction result of the sample to be predicted about forecasting problem based on machine learning
Method.
Particularly, the computing device can be deployed in server or client, can also be deployed in distributed network
On node apparatus in network environment.In addition, the computing device can be PC computers, board device, personal digital assistant, intelligence
Can mobile phone, web applications or other be able to carry out the device of above-metioned instruction set.
Here, the computing device is not necessarily single computing device, can also be it is any can be alone or in combination
Execute the device of above-metioned instruction (or instruction set) or the aggregate of circuit.Computing device can also be integrated control system or system
A part for manager, or can be configured as with Local or Remote (for example, via wireless transmission) with the portable of interface inter-link
Formula electronic device.
In the computing device, processor may include central processing unit (CPU), graphics processor (GPU), may be programmed and patrol
Collect device, dedicated processor systems, microcontroller or microprocessor.As an example, not a limit, processor may also include simulation
Processor, digital processing unit, microprocessor, multi-core processor, processor array, network processing unit etc..
It is above-mentioned about providing sample to be predicted about institute in the method for the prediction result of forecasting problem based on machine learning
Certain operations of description can realize that certain operations can be realized by hardware mode by software mode, in addition, can also pass through
The mode of software and hardware combining realizes these operations.
Processor can run the instruction being stored in one of storage unit or code, wherein the storage unit can be with
Store data.Instruction and data can be also sent and received via Network Interface Unit and by network, wherein the network connects
Any of transport protocol can be used in mouth device.
Storage unit can be integral to the processor and be integrated, for example, RAM or flash memory are arranged in integrated circuit microprocessor etc.
Within.In addition, storage unit may include independent device, such as, external dish driving, storage array or any Database Systems can
Other storage devices used.Storage unit and processor can be coupled operationally, or can for example by the ports I/O,
Network connection etc. communicates so that processor can read the file being stored in storage unit.
In addition, the computing device may also include video display (such as, liquid crystal display) and user's interactive interface is (all
Such as, keyboard, mouse, touch input device etc.).The all components of computing device can be connected to each other via bus and/or network.
It is above-mentioned about being provided involved by method of the sample to be predicted about the prediction result of forecasting problem based on machine learning
And operation can be described as it is various interconnection or coupling functional blocks or function diagram.However, these functional blocks or function diagram
Single logic device can be equably integrated into or operated according to non-exact boundary.
Particularly, as described above, according to an exemplary embodiment of the present invention provide sample to be predicted based on machine learning
Example may include storage unit and processor about the computing device of the prediction result of forecasting problem, and calculating is stored in storage unit
Machine executable instruction set executes following step when set of computer-executable instructions conjunction is executed by the processor:
(A) attribute information of sample to be predicted and the attribute information of the history sample occurred before sample to be predicted are obtained;(B) it obtains
Take result information of the history sample about forecasting problem, wherein do not have about prediction among the history sample
The history sample of the legitimate reading of problem, using the prediction result of history sample as the result information of history sample;(C) it is based on obtaining
The result information next life of the attribute information of the sample to be predicted taken, the attribute information of the history sample and the history sample
At the forecast sample of sample to be predicted;(D) using the prediction model trained based on machine learning techniques, for sample to be predicted
Forecast sample the prediction result of sample to be predicted is provided.
It should be noted that have been combined above Fig. 1 to Fig. 6 describe it is according to an exemplary embodiment of the present invention be based on machine learning
Details is managed everywhere in method of the sample to be predicted about the prediction result of forecasting problem to provide, will not be described in great detail calculating dress here
Set processing details when executing each step.
Each exemplary embodiment of the present invention is described above, it should be appreciated that foregoing description is merely exemplary, not
Exhaustive, and present invention is also not necessarily limited to disclosed each exemplary embodiments.Without departing from scope and spirit of the present invention
Sample under, many modifications and changes will be apparent from for those skilled in the art.Therefore, originally
The protection domain of invention should be subject to the scope of the claims.
Claims (10)
1. a kind of method providing prediction result of the sample to be predicted about forecasting problem based on machine learning, including:
(A) attribute information of sample to be predicted and the attribute information of the history sample occurred before sample to be predicted are obtained;
(B) result information of the history sample about forecasting problem is obtained, wherein for not having among the history sample
The history sample of legitimate reading about forecasting problem, using the prediction result of history sample as the result information of history sample;
(C) attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample and the history sample
Result information generate the forecast sample of sample to be predicted;And
(D) it using the prediction model trained based on machine learning techniques, is waited for for the forecast sample of sample to be predicted to provide
Predict the prediction result of sample.
2. the method for claim 1, wherein in step (B), the confidence of the result information of history sample is also obtained
Degree, also,
In step (C), the attribute information of the sample to be predicted based on acquisition, the history sample attribute information, described go through
The confidence level of the result information of the result information of history sample and the history sample generates the forecast sample of sample to be predicted.
3. method as claimed in claim 2, wherein in step (B), the prediction result of the history sample is by the prediction
Model or previous model corresponding with the last round of iteration of the prediction model provide.
4. method as claimed in claim 3, wherein in step (B), for the history sample with legitimate reading, will go through
Result information of the legitimate reading of history sample as history sample.
5. method as claimed in claim 4, wherein the prediction model has following training process, in the training process
In, according to the mode consistent with forecast sample, the legitimate reading or prediction result of the history sample that training sample is based on are made
For the result information of the history sample, wherein the prediction result for the history sample that the training sample is based on is by currently instructing
The prediction model practised provides.
6. method as claimed in claim 5, wherein in the training process, the prediction model for training sample into
Row iteration is trained so that the prediction result for the history sample that training sample is based on is constantly updated with iteration.
7. method as claimed in claim 2, wherein generate the pre- of sample to be predicted at least one of in the following manner
The feature of test sample sheet:
(C1) it is filtered out according to the confidence level of the result information of the history sample and the result information of the history sample
At least part history sample, and the attribute information based at least part history sample filtered out and sample to be predicted
The attribute information of example generates the feature of the forecast sample of sample to be predicted;
(C2) it is gone through to described according to the confidence level of the result information of the history sample and the result information of the history sample
The respective attributes information of history sample is weighted, and the attribute information based on the history sample after weighting and sample to be predicted
Attribute information generates the feature of the forecast sample of sample to be predicted;And
(C3) it is based respectively on the knot of the attribute information of sample to be predicted, the attribute information of the history sample, the history sample
The confidence level of the result information of fruit information and the history sample generates the feature of the forecast sample of sample to be predicted.
8. a kind of system providing prediction result of the sample to be predicted about forecasting problem based on machine learning, including:
Attribute information acquisition device, attribute information for obtaining sample to be predicted and what is occurred before sample to be predicted go through
The attribute information of history sample;
Result information acquisition device, for obtaining result information of the history sample about forecasting problem, wherein for described
Do not have the history sample of the legitimate reading about forecasting problem among history sample, result information acquisition device is by history sample
Result information of the prediction result as history sample;
Sample generating means, for the attribute information of the sample to be predicted based on acquisition, the history sample attribute information with
And the result information of the history sample generates the forecast sample of sample to be predicted;And
Prediction result provides device, for utilizing the prediction model trained based on machine learning techniques, for sample to be predicted
Forecast sample the prediction result of sample to be predicted is provided.
9. a kind of providing the computer-readable medium of prediction result of the sample to be predicted about forecasting problem based on machine learning,
Wherein, record is useful for executing the computer program of following steps on the computer-readable medium:
(A) attribute information of sample to be predicted and the attribute information of the history sample occurred before sample to be predicted are obtained;
(B) result information of the history sample about forecasting problem is obtained, wherein for not having among the history sample
The history sample of legitimate reading about forecasting problem, using the prediction result of history sample as the result information of history sample;
(C) attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample and the history sample
Result information generate the forecast sample of sample to be predicted;And
(D) it using the prediction model trained based on machine learning techniques, is waited for for the forecast sample of sample to be predicted to provide
Predict the prediction result of sample.
10. a kind of providing the computing device of prediction result of the sample to be predicted about forecasting problem based on machine learning, including
Storage unit and processor are stored with set of computer-executable instructions conjunction in storage unit, when the computer executable instructions
When set is executed by the processor, following step is executed:
(A) attribute information of sample to be predicted and the attribute information of the history sample occurred before sample to be predicted are obtained;
(B) result information of the history sample about forecasting problem is obtained, wherein for not having among the history sample
The history sample of legitimate reading about forecasting problem, using the prediction result of history sample as the result information of history sample;
(C) attribute information of the sample to be predicted based on acquisition, the attribute information of the history sample and the history sample
Result information generate the forecast sample of sample to be predicted;And
(D) it using the prediction model trained based on machine learning techniques, is waited for for the forecast sample of sample to be predicted to provide
Predict the prediction result of sample.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710016236.8A CN108288161A (en) | 2017-01-10 | 2017-01-10 | The method and system of prediction result are provided based on machine learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710016236.8A CN108288161A (en) | 2017-01-10 | 2017-01-10 | The method and system of prediction result are provided based on machine learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108288161A true CN108288161A (en) | 2018-07-17 |
Family
ID=62819696
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710016236.8A Pending CN108288161A (en) | 2017-01-10 | 2017-01-10 | The method and system of prediction result are provided based on machine learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108288161A (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109242165A (en) * | 2018-08-24 | 2019-01-18 | 蜜小蜂智慧(北京)科技有限公司 | A kind of model training and prediction technique and device based on model training |
CN109242496A (en) * | 2018-09-18 | 2019-01-18 | 北京三快在线科技有限公司 | Prediction technique, device and the computer-readable medium of the means of payment |
CN109363789A (en) * | 2018-10-19 | 2019-02-22 | 上海交通大学 | Method and data acquisition system for predicting root canal length |
CN110264337A (en) * | 2019-05-28 | 2019-09-20 | 阿里巴巴集团控股有限公司 | A kind of building of prediction model and bank card belonging country prediction technique and device |
CN111127084A (en) * | 2019-12-11 | 2020-05-08 | 广州优策科技有限公司 | Transaction amount prediction method and device, readable storage medium and electronic equipment |
CN111597301A (en) * | 2020-04-24 | 2020-08-28 | 北京百度网讯科技有限公司 | Text prediction method and device and electronic equipment |
CN112785063A (en) * | 2021-01-26 | 2021-05-11 | 上海瀚银信息技术有限公司 | Transaction amount prediction system based on transaction amount prediction model |
US20220138753A1 (en) * | 2020-10-30 | 2022-05-05 | Raise Marketplace, Llc | Interactive swarming |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101820384A (en) * | 2010-02-05 | 2010-09-01 | 浪潮(北京)电子信息产业有限公司 | Method and device for dynamically distributing cluster services |
US20140006090A1 (en) * | 2004-06-21 | 2014-01-02 | Subhash C. Agrawal | Method of and apparatus for forecasting cash demand and load schedules for money dispensers |
CN103745279A (en) * | 2014-01-24 | 2014-04-23 | 广东工业大学 | Method and device for monitoring energy consumption abnormity |
CN104063747A (en) * | 2014-06-26 | 2014-09-24 | 上海交通大学 | Performance abnormality prediction method in distributed system and system |
CN104636912A (en) * | 2015-02-13 | 2015-05-20 | 银联智惠信息服务(上海)有限公司 | Identification method and device for withdrawal of credit cards |
CN104899405A (en) * | 2014-03-04 | 2015-09-09 | 携程计算机技术(上海)有限公司 | Data prediction method and system and alarming method and system |
WO2016060684A1 (en) * | 2014-10-17 | 2016-04-21 | Landmark Graphics Corporation | Casing wear prediction using integrated physics-driven and data-driven models |
CN106126751A (en) * | 2016-08-18 | 2016-11-16 | 苏州大学 | A kind of sorting technique with time availability and device |
-
2017
- 2017-01-10 CN CN201710016236.8A patent/CN108288161A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140006090A1 (en) * | 2004-06-21 | 2014-01-02 | Subhash C. Agrawal | Method of and apparatus for forecasting cash demand and load schedules for money dispensers |
CN101820384A (en) * | 2010-02-05 | 2010-09-01 | 浪潮(北京)电子信息产业有限公司 | Method and device for dynamically distributing cluster services |
CN103745279A (en) * | 2014-01-24 | 2014-04-23 | 广东工业大学 | Method and device for monitoring energy consumption abnormity |
CN104899405A (en) * | 2014-03-04 | 2015-09-09 | 携程计算机技术(上海)有限公司 | Data prediction method and system and alarming method and system |
CN104063747A (en) * | 2014-06-26 | 2014-09-24 | 上海交通大学 | Performance abnormality prediction method in distributed system and system |
WO2016060684A1 (en) * | 2014-10-17 | 2016-04-21 | Landmark Graphics Corporation | Casing wear prediction using integrated physics-driven and data-driven models |
CN104636912A (en) * | 2015-02-13 | 2015-05-20 | 银联智惠信息服务(上海)有限公司 | Identification method and device for withdrawal of credit cards |
CN106126751A (en) * | 2016-08-18 | 2016-11-16 | 苏州大学 | A kind of sorting technique with time availability and device |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109242165A (en) * | 2018-08-24 | 2019-01-18 | 蜜小蜂智慧(北京)科技有限公司 | A kind of model training and prediction technique and device based on model training |
CN109242496A (en) * | 2018-09-18 | 2019-01-18 | 北京三快在线科技有限公司 | Prediction technique, device and the computer-readable medium of the means of payment |
CN109363789A (en) * | 2018-10-19 | 2019-02-22 | 上海交通大学 | Method and data acquisition system for predicting root canal length |
CN110264337A (en) * | 2019-05-28 | 2019-09-20 | 阿里巴巴集团控股有限公司 | A kind of building of prediction model and bank card belonging country prediction technique and device |
CN111127084A (en) * | 2019-12-11 | 2020-05-08 | 广州优策科技有限公司 | Transaction amount prediction method and device, readable storage medium and electronic equipment |
CN111127084B (en) * | 2019-12-11 | 2024-06-11 | 广州优策科技有限公司 | Transaction amount prediction method and device, readable storage medium and electronic equipment |
CN111597301A (en) * | 2020-04-24 | 2020-08-28 | 北京百度网讯科技有限公司 | Text prediction method and device and electronic equipment |
US20220138753A1 (en) * | 2020-10-30 | 2022-05-05 | Raise Marketplace, Llc | Interactive swarming |
CN112785063A (en) * | 2021-01-26 | 2021-05-11 | 上海瀚银信息技术有限公司 | Transaction amount prediction system based on transaction amount prediction model |
CN112785063B (en) * | 2021-01-26 | 2023-07-04 | 上海瀚银信息技术有限公司 | Transaction amount prediction system based on transaction amount prediction model |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108288161A (en) | The method and system of prediction result are provided based on machine learning | |
Shankar et al. | How technology is changing retail | |
US10949825B1 (en) | Adaptive merchant classification | |
Bonabeau | Agent-based modeling: Methods and techniques for simulating human systems | |
Bell et al. | Data-driven agent-based exploration of customer behavior | |
Klepac | Developing churn models using data mining techniques and social network analysis | |
US20210264448A1 (en) | Privacy preserving ai derived simulated world | |
CN109711860A (en) | Prediction technique and device, storage medium, the computer equipment of user behavior | |
Manzoor et al. | A review on machine learning methods for customer churn prediction and recommendations for business practitioners | |
CN106127531A (en) | The method and system of differentiation price are performed based on machine learning | |
CN106920109A (en) | Recognition methods, system and e-commerce system for ecommerce wash sale | |
EP4343666A1 (en) | System, method and apparatus for optimization of financing programs | |
CN117495492A (en) | Method for carrying out information reinforcement learning recommendation based on consumption upgrading recommendation model | |
Adam | Improving complex sale cycles and performance by using machine learning and predictive analytics to understand the customer journey | |
Scarpi et al. | “With great power comes great responsibility”: exploring the role of corporate digital responsibility (CDR) for artificial intelligence responsibility in retail service automation (AIRRSA) | |
CN110782339A (en) | Default probability prediction method, system and readable storage medium | |
Sajawal et al. | A Predictive Analysis of Retail Sales Fore-casting using Machine Learning Techniques | |
JP2019215862A (en) | Action analysis system using action history data and program therefor | |
CN109961161A (en) | Commodity management method, system, electronic device, and computer-readable medium | |
Raj et al. | The Significance of Big Data for the Base of the Pyramid Segment | |
Chopra et al. | E-CRM–A new paradigm for managing customers | |
Thorn et al. | Internal technology valuation: real world issues | |
US11983743B2 (en) | Training an artificial intelligence engine for generating models to provide targeted actions | |
US20210326910A1 (en) | System and method for optimizing an observation campaign in response to observed real-world data | |
CN114445186A (en) | Page generating method, apparatus, electronic device and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |