Embodiment
In order that the object, technical solutions and advantages of the present invention are clearer, below in conjunction with accompanying drawing the present invention is made into
One step it is described in detail, it is clear that described embodiment is only embodiment of the invention a part of, rather than whole implementation
Example.Based on the embodiment in the present invention, what those of ordinary skill in the art were obtained under the premise of creative work is not made
All other embodiment, belongs to the scope of protection of the invention.
Shown in Figure 1, the embodiment of the present invention provides a kind of method flow schematic diagram for determining targeted customer, specifically real
Existing method includes:
Step S101, obtains the history use information of each terminal, and the history use information includes user's using terminal
Communication information and user change planes information.
Step S102, according to default behavioural characteristic index, using clustering algorithm by the communication of user's using terminal
Information is clustered, and obtains the corresponding history use information subclass of each behavioural characteristic index.
Step S103, the communication information of user's using terminal in the history use information subclass, it is determined that often
The value of the behavioural characteristic index of individual history use information subclass.
Step S104, the user in the history use information subclass changes planes information, it is determined that each history is used
The corresponding attribute of changing planes of subclass.
Step S105, according to the default behavioural characteristic index, the behavioural characteristic of the history use information subclass
The value of index and each history set up decision-tree model using the corresponding attribute of changing planes of subclass.
Step S106, using the history use information of the decision-tree model analysing terminal, is used from the use of the terminal
The targeted customer for the demand of changing planes is determined in family.
In step S101, the information of historic user is obtained from operator's platform, then finding these historic users makes
The history use information of terminal, the history use information of terminal is primarily referred to as user's telephone expenses bill information monthly and user
The terminal type used, e.g. so-called terminal type, 4G cell phone or 3G mobile, mobile phone brand, model, the color used
Deng the information such as sex, age, the color preference of user can be also obtained from the log-on message of user in addition.
Because the data volume that history use information is included is huge, and without certain regularity, analysis gets up sufficiently complex,
Therefore the embodiment of the present invention is classified original history use information using default behavioural characteristic index, wherein default
Behavioural characteristic index can be the call behavior of user, short message behavior, rule of being in, evolution rule, the moon telephone expenses amount of money, on
Net is monthly using indexs such as flows.Because what the corresponding data of different indexs reflected such user uses machine feature, knot
Closing different indexs can determine that the basic of such user uses machine feature.For example, the currently used 3G mobile of user's Mr. Wang, 12
The month telephone expenses amount of money is 120 yuan, wherein monthly monthly rent is 58 yuan, 30 yuan of cost of the phone call, moon flow monthly payment is 15 yuan, and short message is taken
With 17 yuan.It can substantially determine that the moon telephone expenses amount of money of the user is very high, monthly rent is higher, consuming capacity is strong, there is certain purchase
Power.
So-called attribute of changing planes refers to the feature of changing planes of such customer group, because can be utilized when setting up decision tree useful
The feature of changing planes at family.For example according to general experience, it is known that the user of frequent hand-off machine, when the mobile phone of new type meets the needs of the market
During sale, this kind of user can be main purchasing group, such as during annual iPhone issue, before using the user of iPhone
It may proceed to buy the iPhone of new model;Either some cellphone subscriber's surfing flow every month is larger, it was demonstrated that the user
Commonly using surfing Internet with cell phone, if the user it is currently used be 3G mobile, then in view of the faster advantage of 4G cell phone network speed, that
The type user, which is substantially all, can change the mobile phone of 4G types into.In view of these historical experiences, the history in historic user
Use information reflection behavioural characteristic index the characteristics of, be each user stamp whether the label of hand-off machine, that is, category of changing planes
Property, thus decision-tree model can be set up based on historic user.
Certainly, when obtaining the history use information of each terminal, in addition it is also necessary to using history of the method sampled to each terminal
Default value or exceptional value in use information are filtered, the history use information of each terminal after being handled.Because history
Default value either exceptional value in use information can not accurately give expression to the behavioural characteristic using user, use such as some moon
The bill at family is 0, and the value is likely due to operation system failure and caused, therefore can not be placed on the set of history use information
It is middle to be clustered.This have the effect that the accuracy that can effectively improve the targeted customer finally determined.
Further, for each behavioural characteristic index, by the communication information of user's using terminal with three-dimensional array
Mode expression;Three dimensions cluster is carried out to the data in the three-dimensional array according to k-d algorithms, k cluster is obtained, its
In, one history use information subclass of each cluster correspondence, k is integer.
Specifically, the embodiment of the present invention carries out the cluster of different information in history use information, k- using k-means algorithms
The method of means algorithms cluster probably includes following several steps:
Step 1: history use information is stored as into a data set D, cluster number k and k initial clustering are then given
Center;
Step 2: calculating each data to the distance of cluster centre, and each object is assigned to most according to distance minimum
Close cluster;
Step 3: recalculating the average of each cluster and determining new cluster centre;Calculation error sum-of-squares criterion letter
Number J;
Step 4: k cluster set of output.
It is one group and personality phase by default behavioural characteristic target setting for example, studied and sampled according to personality theory
The corresponding history use information subclass of each desired value, is quantified as three-dimensional array K (x, y, z), such as Fig. 2 institutes by the index of pass
Show, realized by setting up K-d trees to each user's index progress data tissue, realize the matching of data similar features.Feature
Matching is substantially one and similarity retrieval is carried out between high n dimensional vector n by space length function, and forms cluster set, its
Flow be illustrated in fig. 3 shown below including:
Step 201, study and sample according to personality theory, be that history use information determines behavioural characteristic index;
Step 202, the history use information subclass on each behavioural characteristic index is then obtained;
Step 203, history use information subclass is quantified as Multidimensional numerical K (x, y, z), history use information subclass
In each data be recorded in the space cube of Multidimensional numerical;
Step 204, Multidimensional numerical is clustered, obtains the close quarters in history use information subclass.
So, the corresponding history use information subclass of different personality reference indexs can just be quantified as three-dimensional respectively
Array, X characters reference index then corresponds to X three-dimensional array.
Further, each three-dimensional data is subjected to a point heap sort again, the index number of client is stored by building k-d tree
According to, then realize by the neighbor searching algorithm of k-d tree the cluster client partitions of data.For example combine seven adult's lattice models reason
By, customer action characteristic index, verified by sample investigation, it is determined that 7 class close quarterses, correspond to 7 class personality, what is be divided into is close
Cluster is as shown in Figure 4.
Such as, for the corresponding history use information subclass of call behavior this behavioural characteristic index, using above
Middle three-dimensional array quantifies, then x can represent the duration of call, y and represent talk times, z and represents opening of conversing every time in K (x, y, z)
Begin the moment, then be possible to include toll message user by the 7 intensive groups obtained after the communicating data cluster in K (x, y, z)
Group's information such as group, local connection customer group, international call customer group.
After above-mentioned history use information is finely divided, it is possible to according to the history use information subset after subdivision
Close corresponding value, behavioural characteristic index, attribute of changing planes and set up decision-tree model, for example, choose certain telecommunications company client in 2006
Payment data, choose 10000 samples, after data cleansing, there remains 8725 samples, and the call to these clients is remembered
Record, short message behavior, billing amount are analyzed, and are subdivided into following 5 customers using K-means algorithms, are shown in Table one:
Corresponding behavioural characteristic index in table one (possesses 4G cell phone, frequency of changing planes, the moon telephone expenses amount of money, online bag
Month flow, demand of changing planes etc.) and the corresponding behavioural characteristic index of each group value, the demand of changing planes sets up decision-tree model, such as
Shown in Fig. 5., wherein because customers I possesses 4G cell phone in table one, therefore this kind of user does not change the demand of 4G cell phone, remaining
Customers then further segment, and customers II frequency of changing planes is very high, so have the demand for changing 4G cell phone, the like, it is complete
Into the foundation of decision-tree model, as the user that newly arrived, two hands every year on average are found in the history use information of the user
Machine, then can learn that the user is changed planes demand, and target is defined as by the user according to the analysis result of decision-tree model
User.
After targeted customer is determined, it is desirable to have specific aim is that targeted customer determines the end message to be marketed, it is determined that
The method of end message is specially:According to frequent item set data mining algorithm, set up before changing planes with machine feature and after changing planes
The corresponding relation of end message;According to the history use information of the targeted customer, before determining the targeted customer using changing planes
Use machine feature;End message after determining that the targeted customer is corresponding according to the corresponding relation and changing planes, so as to described
Targeted customer pushes the corresponding end message of the targeted customer.
Wherein, the Frequent Itemsets Mining Algorithm can be realized using FP-growth algorithms or Apriori algorithm.FP-
Growth algorithms are a kind of new Frequent Itemsets Mining Algorithms proposed in 2000, and it must be produced departing from Apriori algorithm
The traditional approach of candidate, establishes the thought for not producing candidate based on FP-tree structures, opens correlation rule
New approaches, belong to prior art, do not repeat herein.
Determining the method for end message can be further described through by following example, for example, according to the history before changing planes
Use information, is first divided into 8 colonies, respectively more than 4000 yuan users of terminal price by the user before changing planes by terminal price section
Group, 3000-4000 members customer group, 2000-3000 members customer group, 1000-2000 members customer group, 500-1000 members customer group, 500
The following customer group of member, price unknown subscriber group, then price category after analyzing attribute before each customer group is changed planes, behavioural characteristic and changing planes
Between corresponding relation.The price for the terminal that targeted customer may buy is determined according to corresponding relation.Wherein corresponding relation
Determination mode is:For renewal user, the end after obtaining the history use information before renewal user changes planes and changing planes
Client information;According to the history use information before renewal user changes planes, it is determined that the use machine before the changing planes of renewal user is special
Levy;The corresponding relation for the end message with machine feature and after changing planes set up before changing planes.
For example, user A change planes before with the mobile phone price used in machine feature 1000-2000 member between, belong to 1000-
2000 yuan of customer groups, the mobile phone price after changing planes is still between 1000-2000, it is seen that this kind of user possesses such machine
Feature, therefore mobile phone of the price between 1000-2000 is promoted and gives the user A.
Similarly, operating system preference of the user to the terminal that uses, screen size can also be calculated in the same way
The corresponding relations such as preference, network formats preference, that is to say, that be equal to each user stamp brand of knowing clearly, price, operating system,
The adaptation label of screen size, network formats etc..So, terminal sale personnel can be according to existing end message on sale, from end
The different dimensions of attribute are held to meet desired client to screen.Realize that terminal on sale and the accurate of client are fitted according to actual conditions
Match somebody with somebody, improve marketing responsiveness.
Certain above-mentioned targeted customer for being to determine the demand of changing planes, if being found according to the behavioural characteristic of the customer group after cluster
There is no long-distance telephone expenses more in the bill of certain customers, then preferential set meal long-distance to such Customer design, stimulate that client beats more
Long-distance call.If it was found that user's use information subclass of some customer group reflect short message expense it is higher, then can be to short message
Take high lead referral superposition and use short message monthly package.
It can be seen that, the embodiment of the present invention can effectively find out the potential of terminal marketing on the basis of each customers are segmented
Client, and then ensure the result that terminal is recommended.I.e. the embodiment of the present invention is thin using client's character analysis based on k-means algorithms
Different client groups, set up client's personality label, and combine the potential customers that terminal marketing is identified based on decision tree C5.0 algorithms.
After the potential customers for identifying terminal marketing, the terminal adaptation degree of target customer is further determined, and then realizes that terminal is pushed away
Recommend, find out the use before being changed planes with machine feature and client after client changes planes in the embodiment of the present invention using Apriori association algorithms
Incidence relation between machine feature, the terminal type of Accurate Prediction adaptation.
Based on identical technical concept, the embodiment of the present invention also provides a kind of device for determining targeted customer, and the device can
Perform above method embodiment.Device provided in an embodiment of the present invention as shown in fig. 6, including:Acquiring unit 401, cluster cell
402nd, value unit 403 is determined, template(-let) 404 is determined, sets up model unit 405, determines target subscriber units 406, wherein:
Acquiring unit 401, the history use information for obtaining each terminal, the history use information is used including user
The communication information of terminal and user change planes information;
Cluster cell 402, for according to default behavioural characteristic index, using clustering algorithm by user's using terminal
Communication information clustered, obtain the corresponding history use information subclass of each behavioural characteristic index;
Value unit 403 is determined, the communication for user's using terminal in the history use information subclass
Information, it is determined that the value of the behavioural characteristic index of each history use information subclass;
Template(-let) 404 is determined, is changed planes information for the user in the history use information subclass, it is determined that often
Individual history uses the corresponding attribute of changing planes of subclass;
Model unit 405 is set up, for according to the default behavioural characteristic index, the history use information subclass
Behavioural characteristic index value and each history set up decision-tree model using the corresponding attribute of changing planes of subclass;
Target subscriber units 406 are determined, for the history use information using the decision-tree model analysing terminal, from institute
The targeted customer for the demand of changing planes is determined in the use user for stating terminal.
Wherein, the acquiring unit 401 is additionally operable to:Using the device of sampling to lacking in the history use information of each terminal
Province's value or exceptional value are filtered, the history use information of each terminal after being handled.
Further, the cluster cell 402 specifically for:For each behavioural characteristic index, the user is used
The communication information of terminal representing in the way of three-dimensional array;
Three dimensions cluster is carried out to the data in the three-dimensional array according to k-d algorithms, k cluster is obtained, wherein, often
One history use information subclass of individual cluster correspondence, k is integer.
Further, in addition to:End message unit 407 is determined, for according to frequent item set data mining algorithm, setting up
The corresponding relation of the end message with machine feature and after changing planes before changing planes;According to the history use information of the targeted customer,
Determine that the targeted customer uses machine feature using before changing planes;Determine that the targeted customer is corresponding according to the corresponding relation to change
End message after machine, so as to push the corresponding end message of the targeted customer to the targeted customer.
Wherein, it is described determination end message unit specifically for:For renewal user, renewal user changes described in acquisition
History use information before machine and the end message after changing planes;According to the history use information before renewal user changes planes,
It is determined that using machine feature before the changing planes of renewal user;Set up change planes before with machine feature with change planes after end message it is corresponding
Relation.
On the one hand the embodiment of the present invention obtains the history use information of each terminal, will be gone through using default behavioural characteristic index
History use information is clustered into the history use information subclass related to behavioural characteristic index;On the other hand, each history is utilized
The behavioural characteristic index of use information subclass, the value of behavioural characteristic index, attribute of changing planes set up decision-tree model, so sharp
The history use information of other users is analyzed with the decision-tree model built up, it is possible to draw the attribute of changing planes of the user, find
Really there is the targeted customer for the demand of changing planes.It can be seen that, because make use of the history use information of historic user to set up decision-tree model,
The model can be with Reusability, and the result accuracy rate that analysis is obtained is high, instead of existing artificial experience and distinguishes so that target
Client determines that result is more accurate, also beneficial to the formulation of various marketing strategies.
The present invention is the flow with reference to method according to embodiments of the present invention, equipment (system) and computer program product
Figure and/or block diagram are described.It should be understood that can be by every first-class in computer program instructions implementation process figure and/or block diagram
Journey and/or the flow in square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided
The processor of all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce
A raw machine so that produced by the instruction of computer or the computing device of other programmable data processing devices for real
The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.
These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which is produced, to be included referring to
Make the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one square frame of block diagram or
The function of being specified in multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that in meter
Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, thus in computer or
The instruction performed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one
The step of function of being specified in individual square frame or multiple square frames.
, but those skilled in the art once know basic creation although preferred embodiments of the present invention have been described
Property concept, then can make other change and modification to these embodiments.So, appended claims are intended to be construed to include excellent
Select embodiment and fall into having altered and changing for the scope of the invention.
Obviously, those skilled in the art can carry out the essence of various changes and modification without departing from the present invention to the present invention
God and scope.So, if these modifications and variations of the present invention belong to the scope of the claims in the present invention and its equivalent technologies
Within, then the present invention is also intended to comprising including these changes and modification.