[go: up one dir, main page]

CN109063052B - Personalized recommendation method and device based on time entropy - Google Patents

Personalized recommendation method and device based on time entropy Download PDF

Info

Publication number
CN109063052B
CN109063052B CN201810794746.2A CN201810794746A CN109063052B CN 109063052 B CN109063052 B CN 109063052B CN 201810794746 A CN201810794746 A CN 201810794746A CN 109063052 B CN109063052 B CN 109063052B
Authority
CN
China
Prior art keywords
user
item
current user
cluster
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201810794746.2A
Other languages
Chinese (zh)
Other versions
CN109063052A (en
Inventor
唐恒亮
薛菲
刘涛
杨玺
董晨刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Wuzi University
Original Assignee
Beijing Wuzi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Wuzi University filed Critical Beijing Wuzi University
Priority to CN201810794746.2A priority Critical patent/CN109063052B/en
Publication of CN109063052A publication Critical patent/CN109063052A/en
Application granted granted Critical
Publication of CN109063052B publication Critical patent/CN109063052B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明实施例提供一种基于时间熵的个性化推荐方法及装置,所述方法包括:对已有的用户和项目评分信息进行聚类,获得聚类的类簇中心和每个类簇内包含的用户和项目信息;若判定当前用户不属于新加入系统用户,则根据所述当前用户的相关属性特征信息及用户对项目的不同评分记录计算所述当前用户对项目的时间熵;根据计算出的所述当前用户对项目的时间熵,判定所述当前用户的用户类型;根据所述当前用户的用户类型在聚类的所述类簇中心进行模式匹配,从而为所述当前用户进行多样性项目信息推荐。本发明实施例可以提高信息推荐准确度和推荐多样性。

Figure 201810794746

Embodiments of the present invention provide a method and device for personalized recommendation based on time entropy. The method includes: clustering existing user and item rating information, and obtaining the cluster center of the cluster and the information contained in each cluster. User and project information; if it is determined that the current user does not belong to the newly added system user, then the time entropy of the current user to the project is calculated according to the relevant attribute feature information of the current user and the user's different scoring records for the project; According to the time entropy of the current user to the project, the user type of the current user is determined; according to the user type of the current user, pattern matching is performed in the cluster center of the cluster, so as to perform diversity for the current user. Project information is recommended. The embodiments of the present invention can improve the information recommendation accuracy and recommendation diversity.

Figure 201810794746

Description

Personalized recommendation method and device based on time entropy
Technical Field
The invention relates to the technical field of internet intelligent information recommendation, in particular to a personalized recommendation method and device based on time entropy.
Background
With the rise of internet technology and the rapid development of information technology, the internet generates a large amount of data information. From a piece of chart Information of "data is generated in one minute through the internet" published by Excelcom company, we can know that Facebook generates 701,389 account login together, Netflix has 69,444 videos watched for a long time, Snapchat shares 527,760 photos, 51,000 apps on App Store are downloaded, Linkedin creates 120 new accounts, Twitter issues 347,222 new tweets, Instagram issues 28,194 new photos, Google generates 240 ten thousand new search requests, so that the internet goes from the age of lacking original Information to Information overload (Information overload), and the user is further difficult to find out interested Information quickly and accurately from a massive Information base.
In the face of the problem of information overload, common users often cannot adapt. In order to better meet the information requirements of users, scientists provide a recommendation system technology, which combines technologies in multiple fields of machine learning, data mining, user behavior, human-computer interaction and the like and applies a large-scale parallel data processing framework to further quickly and accurately provide personalized information service for each user. Collaborative filtering is an effective technique capable of generating personalized recommendations, and is widely applied to various recommendation systems, and the basic task is to match users according to similar preferences so as to recommend items that the users may like. Collaborative filtering algorithms can be generally classified into memory-based and model-based. The collaborative filtering based on the memory can be divided into a user-based filtering and a project-based filtering. The former is to calculate the similarity between users to obtain the nearest neighbor similar to the interest preference of the target user, and then to predict and recommend the target user based on the nearest neighbor.
However, when recommending items for a user, a traditional collaborative filtering recommendation algorithm usually only selects a user-based or item-based recommendation method to recommend a target user, and this recommendation method only selects the rating information of the user on the items, but ignores that the interest of the user may change along with the change of time due to other influences such as emotion, friends, fashion trend and the like, that is, the interest of the user may only focus on one or a few items within a certain period, i.e., the interest may migrate. Thus, a single type of recommendation cannot meet the utility needs of other diverse users.
Disclosure of Invention
The embodiment of the invention provides a personalized recommendation method and device based on time entropy, which aim to improve information recommendation accuracy and recommendation diversity.
In one aspect, an embodiment of the present invention provides a time entropy-based personalized recommendation method, where the method includes:
clustering the existing user and project scoring information to obtain a cluster center of the cluster and user and project information contained in each cluster;
if the current user is judged not to belong to the newly added system user, calculating the time entropy of the project of the current user according to the related attribute feature information of the current user and different scoring records of the project of the user;
judging the user type of the current user according to the calculated time entropy of the current user to the item;
and performing pattern matching in the cluster center of the cluster according to the user type of the current user, so as to recommend the diversity project information for the current user.
In another aspect, an embodiment of the present invention provides a time entropy-based personalized recommendation apparatus, where the apparatus includes:
the clustering unit is used for clustering the existing user and project scoring information to obtain the cluster center of the clusters and the user and project information contained in each cluster;
the computing unit is used for computing the time entropy of the current user to the project according to the related attribute feature information of the current user and different grading records of the user to the project if the current user is judged not to belong to the newly added system user;
the judging unit is used for judging the user type of the current user according to the calculated time entropy of the current user to the item;
and the matching unit is used for performing pattern matching in the cluster center of the cluster according to the user type of the current user so as to recommend the diversity item information for the current user.
The technical scheme has the following beneficial effects: the user is recommended by using the personalized recommendation method based on the time entropy, on one hand, the user and the project information are clustered, the popularity of the project is calculated, the recommendation efficiency and accuracy can be improved, and the cold start problem of the user is solved. On the other hand, the multi-interest of the user can be effectively utilized by calculating the time entropy of the user on the item, so that the diversity of the recommendation method is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flow chart of a personalized recommendation method based on temporal entropy according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of a personalized recommendation device based on temporal entropy according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of another personalized recommendation device based on temporal entropy according to an embodiment of the present invention;
FIG. 4 is an overall flowchart of a personalized recommendation method based on temporal entropy according to an embodiment of the present invention;
FIG. 5 is a comparison graph of recommendation efficiency of the personalized recommendation method based on time entropy and other recommendation algorithms in the application example of the present invention;
FIG. 6 is a diagram illustrating diversity comparison of personalized recommendation methods based on temporal entropy according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, a flow chart of a personalized recommendation method based on temporal entropy according to an embodiment of the present invention is shown, where the method includes:
101. clustering the existing user and project scoring information to obtain a cluster center of the cluster and user and project information contained in each cluster;
102. if the current user is judged not to belong to the newly added system user, calculating the time entropy of the project of the current user according to the related attribute feature information of the current user and different scoring records of the project of the user;
103. judging the user type of the current user according to the calculated time entropy of the current user to the item;
104. and performing pattern matching in the cluster center of the cluster according to the user type of the current user, so as to recommend the diversity project information for the current user.
Preferably, the method further comprises:
if the current user is judged to be a newly added system user, recommending N items of information ranked at the top to the current user by adopting the following item popularity calculation mode:
Figure BDA0001735725110000031
wherein ipopFor item popularity, UiRepresents the user set of the evaluated item I in the recommendation system, wherein I is the number of all items in the recommendation system.
Preferably, according to the user interest preference, the user type of the current user is determined to be one of the following four types: recently like and past like, recently like but past dislike, recently dislike but past like, recently dislike and past dislike.
Preferably, the performing pattern matching in the cluster center of the cluster according to the user type of the current user so as to perform diversity item information recommendation for the current user includes:
performing pattern matching in the following two interest preference patterns of the cluster center of the cluster according to the user type of the current user: a regular interest preference mode and a non-regular interest preference mode; wherein the regular interest preference pattern is liked recently and liked in the past; the irregular interest preference pattern comprises: recently liked but not liked in the past, recently disliked but liked in the past, recently disliked and not liked in the past.
Preferably, the time entropy of the project of the current user is calculated according to the related attribute feature information of the current user and different scoring records of the project of the user by using the following time entropy formula:
Figure BDA0001735725110000041
where H is the time entropy, n is the number of time intervals, scoreiRating of item i for user, miRepresents the sum of the scores of all users for item i in a time interval, M refers to all MiThe sum of (a) and (b).
Corresponding to the above method embodiment, as shown in fig. 2, a schematic structural diagram of a personalized recommendation device based on temporal entropy according to an embodiment of the present invention is shown, where the device includes:
the clustering unit 21 is used for clustering the existing user and project scoring information to obtain the cluster center of the clusters and the user and project information contained in each cluster;
the calculating unit 22 is configured to calculate a time entropy of the current user for the project according to the relevant attribute feature information of the current user and different scoring records of the user for the project if it is determined that the current user does not belong to the newly added system user;
the judging unit 23 is configured to judge a user type of the current user according to the calculated time entropy of the current user on the item;
and the matching unit 24 is configured to perform pattern matching in the cluster center of the cluster according to the user type of the current user, so as to perform diversity item information recommendation for the current user.
Preferably, as shown in fig. 3, which is a schematic structural diagram of another personalized recommendation device based on temporal entropy according to an embodiment of the present invention, the device includes: clustering unit 21, calculating unit 22, judging unit 23, matching unit 24, the device further includes:
a recommending unit 25, configured to recommend, if it is determined that the current user is a newly added system user, the N item information ranked before to the current user in the following item popularity calculation manner:
Figure BDA0001735725110000051
wherein ipopFor item popularity, UiRepresents the user set of the evaluated item I in the recommendation system, wherein I is the number of all items in the recommendation system.
Preferably, the determining unit 23 is specifically configured to determine, according to the user interest preference, that the user type of the current user is one of the following four types: recently like and past like, recently like but past dislike, recently dislike but past like, recently dislike and past dislike.
Preferably, the matching unit 24 is specifically configured to perform pattern matching in the following two interest preference patterns of the cluster center of the cluster according to the user type of the current user: a regular interest preference mode and a non-regular interest preference mode; wherein the regular interest preference pattern is liked recently and liked in the past; the irregular interest preference pattern comprises: recently liked but not liked in the past, recently disliked but liked in the past, recently disliked and not liked in the past.
Preferably, the calculating unit 22 is specifically configured to calculate, according to the relevant attribute feature information of the current user and different scoring records of the user on the item, a time entropy of the current user on the item by using the following time entropy formula:
Figure BDA0001735725110000052
where H is the time entropy, n is the number of time intervals, scoreiRating of item i for user, miRepresents the sum of the scores of all users for item i in a time interval, M refers to all MiThe sum of (a) and (b).
The technical scheme of the embodiment of the invention has the following beneficial effects: the user is recommended by using the personalized recommendation method based on the time entropy, on one hand, the user and the project information are clustered, the popularity of the project is calculated, the recommendation efficiency and accuracy can be improved, and the cold start problem of the user is solved. On the other hand, the multi-interest of the user can be effectively utilized by calculating the time entropy of the user on the item, so that the diversity of the recommendation method is improved.
The above embodiment of the present invention provides a Personalized recommendation method (PTCF) based on temporal entropy for recommending a user. The method comprises the steps of firstly clustering existing user and project rating information in a recommendation system to obtain corresponding clustering centers and different cluster information; then judging whether the target user belongs to a newly added system user, if not, calculating the time entropy of the user to the project according to the related attribute characteristic information of the user and different grading records of the user to the project, judging which type of user the user belongs to, and further performing mode matching on the target user in the recommendation system, so as to perform diversity recommendation for the user by adopting a project diversity calculation formula; otherwise, recommending the N items ranked at the top to the target user by adopting an item popularity calculation mode.
The main recommendation mechanism of the above personalized recommendation method PTCF based on temporal entropy is as follows: in the selection of the recommendation algorithm, not only the collaborative filtering algorithm based on clustering is selected for recommendation, but also the interest of the user is considered to change along with the change of various factors, so that the diversity recommendation is performed on the user. The recommendation method not only solves the problem of recommendation efficiency of a recommendation system for recommending by calculating all scoring information originally, but also utilizes diversified information of the user scoring the items, thereby effectively improving the recommendation accuracy and recommendation diversity of the recommendation method.
The main content of the embodiment of the invention is the research and application of a time entropy-based personalized recommendation method PTCF, and the method mainly comprises the steps of clustering original users and item rating information, calculating the time entropy of different item ratings of target users, classifying and matching modes of the target users, and correspondingly recommending whether the target users are newly added to a system user. The technical scheme adopted by the method is as follows: 1) clustering user and project scoring information by using an RLPSO _ KM clustering algorithm so as to obtain a clustered cluster center and corresponding cluster information; 2) by introducing the definition of the time entropy, the time entropy of different items of target users who have added into the system is calculated, and then classification and pattern matching are carried out on the target users, so that effectiveness recommendation is carried out on the target users; 3) and recommending the target user newly added into the system by introducing a mode of calculating the popularity of the project.
The technical scheme adopted by the embodiment of the invention is a personalized recommendation method based on time entropy, and the method comprises the following implementation steps:
(1) and clustering the user and item scoring information. Firstly, processing the scoring information of a project by a user; then, clustering the processed information by adopting an improved Kmeans clustering algorithm RLPSO _ KM, wherein the clustering comprises a series of parameter settings such as initial clustering centers, initial clustering numbers, clustering iteration times and the like; and finally, outputting the cluster center of the clusters and the user and item information contained in each cluster.
(2) Personalized recommendation based on temporal entropy. The interest preferences of different users tend to be different, with some users always enjoying the same type of movie, and the opposite for others, with their favorite types of movies possibly changing with their mood, friends and fashion trends.
Here, we classify the interest preference of the user into like (Likes) and dislike (Dislikes) according to the rating information of the item by the user. Likewise, if measured in time, it is divided into the most Recent (Recent) and Past (Past) times, so we simply divide the user's interest preferences into four categories. The first type is recentLikes; PastLikes, the interest preferences of such users are regular, meaning that such users generally only like one type of movie and like one movie type for a longer duration, typically without changing for a longer period of time. The second user type is recentLikes; PastDislieks, the third type is RecentrtDislieks; PastLikes. For these users, their interest preferences change over time. The last is recentDislikes; and PastDislikes, the item information evaluated by the users shows a diversified trend, and meanwhile, the items show randomness and irregularity.
By classifying users, we classify the types of users into two modes, namely a first mode (first class of users) and a second mode (second, third and fourth classes of users). The value and time information implied by the project means to which mode the user belongs. For example, if a user frequently visits the same type of movie, we consider him to be a user following the first mode. This model is simple and the same type of movie exhibits a tendency to be evenly distributed. Conversely, if the user belongs to the second mode, there is no regularity to follow. Based on the inspiration, a novel pattern mining method based on time entropy is provided, the method can be used for measuring the time distribution of the user historical scoring record, and a calculation formula for defining the time entropy is as follows:
Figure BDA0001735725110000071
in formula (1), n is the number of time intervals, and the user's score for item i is scorei,miRepresents the sum of the scores of all users for item i in a time interval, M refers to all MiThe sum of (a) and (b).
Usually, an item contains a plurality of tags, and the item can be described simply by the tags. For a movie, these tags may represent the type of movie. For a user, he may comment on the same tag at different times, i.e. a tag may belong to several scoring cycles. Temporal entropy measures the degree of misordering of the timestamp, with a higher value indicating that the user prefers this tag.
As shown in table 1, the table contains scoring information for 8 items by 4 users. u. ofi(i ═ 1, 2, 3, 4) is a user information set, ik(k ═ 1, 2, …, 8) represents a set of item information. The scoring value of the user on the item is between 1 and 5, and the number of times of the user accessing the item and the scoring information of the label to which the item belongs are listed in the table.
TABLE 1 user-project Scoring information
Table 1 User-item rating
Figure BDA0001735725110000072
As shown in table 2, there are 5 time intervals in the table. For user u1And 3 time distribution records are included. For user u2It also contains 3 time distribution records.
TABLE 2 user vs tag1Scoring information of
Table 2 Users rating for tag1
Figure BDA0001735725110000073
Figure BDA0001735725110000081
But for user u1And user u2Which user prefers tag1Is there a The following are their temporal entropies:
Figure BDA0001735725110000082
Figure BDA0001735725110000083
due to the fact that
Figure BDA0001735725110000084
So user u1User u2Prefer tag1. This is in accordance with user u1User u2For tag1The fact that the score was higher agreed.
Diversity recommendation attracts a lot of attention by coordinating similarity and difference of items in the item list to improve recommendation efficiency. One common method of measuring diversity is to maximize the sum of the different items as much as possible. Thus, for a given project, we set its diversity formula to be defined as follows:
Figure BDA0001735725110000085
Figure BDA0001735725110000086
Figure BDA0001735725110000087
in the formula (2), fr (i) denotes a tag1Frequency of occurrence in the set, z (i) is tag1The sum of the similarity of all the labels in the candidate set, α, is the adjustment coefficient, where we choose this value to be 2.
In the formula (4), the term ruiAnd rujRespectively represent the scores of the user u for the item i and the item j,
Figure BDA0001735725110000088
the average rating of item j for all users who rated item j,
Figure BDA0001735725110000089
average rating of item i for all users who rated item i, UijFor a set of users that score both item i and item j, sim (i, j) has a value between [ -1,1]In the meantime.
(3) And calculating the popularity of the project. For the newly added target users in the system, the N items which are most popular with the users are recommended to the target users. We define the popularity formula for items in the recommendation system as follows:
Figure BDA00017357251100000810
in the formula (5), UiRepresents the set of users in the recommendation system who have evaluated the item I, I being all items in the recommendation system.
In conclusion, the scoring information of the user to the project in the collaborative filtering algorithm is clustered, the time entropy of the user to the project is calculated, the mode of the user is calculated, diversity recommendation is further performed on the user, and the previous N projects are recommended to the user who is not added to the system in the system. In this way, on one hand, the recommendation efficiency of the recommendation system is improved by calculating the neighborhood of the user in the class cluster; on the other hand, the interest of the user is greatly reflected by calculating the time entropy of the user on the item, the singleness of the traditional recommendation system is made up, and the recommendation system has better diversity.
In order to make the objects, technical solutions and features of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings. The overall flow chart of the personalized recommendation method based on the temporal entropy is shown in fig. 4.
The individual steps are illustrated below:
(1) the user and the scoring information are clustered based on the RLPSO _ KM clustering algorithm, so that the recommendation efficiency of the recommendation system is improved.
(2) A personalized recommendation method based on the time entropy is provided, and a time entropy calculation mode is introduced, so that the diversified interests of users are greatly reflected.
(3) A mode of calculating the popularity of the project is provided for recommending the newly added system user, and the problem of cold start of the project in the recommending system is solved.
The experimental environment was as follows:
the application example of the invention verifies the actual effect of the personalized recommendation method based on the time entropy through experiments, the experimental environment is a win7 (64-bit) host, an 8G memory and a 1T hard disk, the adopted data sets are MovieLens (10M) and data captured from Douban.com, the accuracy of recommendation is verified by taking the average value of 10 experiments of the algorithm as the final experiment result, and meanwhile, the diversity of the personalized recommendation method based on the time entropy and other diversity recommendation methods is compared.
First, the application example of the present invention clusters the rating information of users and items. Vectorizing or characterizing the scoring information to enable the scoring information to meet the format of clustering input; clustering the processed information by adopting an improved clustering algorithm RLPSO _ KM, wherein the clustering comprises a series of parameter settings such as initial clustering center, initial clustering number selection, clustering iteration times and the like; and finally, outputting the center of each cluster clustered by the clustering algorithm and the user and item information contained in each cluster.
And then, establishing a personalized recommendation method based on the time entropy. And calculating the time entropy of the user to the project by introducing a time entropy calculation mode, further classifying and pattern matching the user, and finally performing diversity recommendation for the user. The recommendation efficiency of this recommendation algorithm is plotted against the other algorithms as shown in fig. 5 and 6.
Finally, project popularity is calculated. Aiming at new users who do not join the system, the cold start problem of the users in the system is solved by calculating the popularity of the items and selecting N items with the highest popularity to recommend the users.
From the comparison between the method proposed by the present invention and other algorithms in fig. 5, it can be clearly seen that the PTCF proposed by the present invention is more dominant in accuracy rate compared to PMF, BPMF, SVD and McoC algorithms, regardless of whether the selected experimental dataset is a movilens or a Douban dataset, and the accuracy rate is respectively increased by 1.04% and 1.07% on the movilens and Douban datasets compared to the McoC algorithm when N is 10. Meanwhile, no matter whether the selected Top-N value is any one of 10,20 and 50, the MAP value of the method provided by the invention is obviously higher than that of other algorithms, which also illustrates the advantage of the PTCF method provided by the invention in the recommendation efficiency.
From fig. 6, it can be seen that by comparing the number of different recommended item lists, the PTCF method proposed by the present invention remains substantially consistent in diversity and does not change significantly, regardless of the selection of the experimental data set or when the number of recommended items is 10,20, and 50. Meanwhile, compared with the CUTA algorithm, when N is 10, the diversity of the personalized recommendation method provided by the invention is improved by 4 times under the MovieLens data set, and is improved by 1.05% compared with the CUTATime algorithm.
By combining the experiments, the application example of the invention recommends the user by using the personalized recommendation method based on the time entropy, on one hand, the user and the project information are clustered and the popularity of the project is calculated, so that the recommendation efficiency and accuracy can be improved, and the cold start problem of the user can be solved. On the other hand, the multi-interest of the user can be effectively utilized by calculating the time entropy of the user on the item, so that the diversity of the recommendation method is improved.
It should be understood that the specific order or hierarchy of steps in the processes disclosed is an example of exemplary approaches. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the processes may be rearranged without departing from the scope of the present disclosure. The accompanying method claims present elements of the various steps in a sample order, and are not intended to be limited to the specific order or hierarchy presented.
In the foregoing detailed description, various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments of the subject matter require more features than are expressly recited in each claim. Rather, as the following claims reflect, invention lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby expressly incorporated into the detailed description, with each claim standing on its own as a separate preferred embodiment of the invention.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. To those skilled in the art; various modifications to these embodiments will be readily apparent, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the disclosure. Thus, the present disclosure is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
What has been described above includes examples of one or more embodiments. It is, of course, not possible to describe every conceivable combination of components or methodologies for purposes of describing the aforementioned embodiments, but one of ordinary skill in the art may recognize that many further combinations and permutations of various embodiments are possible. Accordingly, the embodiments described herein are intended to embrace all such alterations, modifications and variations that fall within the scope of the appended claims. Furthermore, to the extent that the term "includes" is used in either the detailed description or the claims, such term is intended to be inclusive in a manner similar to the term "comprising" as "comprising" is interpreted when employed as a transitional word in a claim. Furthermore, any use of the term "or" in the specification of the claims is intended to mean a "non-exclusive or".
Those of skill in the art will further appreciate that the various illustrative logical blocks, units, and steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate the interchangeability of hardware and software, various illustrative components, elements, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design requirements of the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present embodiments.
The various illustrative logical blocks, or elements, described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a digital signal processor, an Application Specific Integrated Circuit (ASIC), a field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a digital signal processor and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a digital signal processor core, or any other similar configuration.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may be stored in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. For example, a storage medium may be coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC, which may be located in a user terminal. In the alternative, the processor and the storage medium may reside in different components in a user terminal.
In one or more exemplary designs, the functions described above in connection with the embodiments of the invention may be implemented in hardware, software, firmware, or any combination of the three. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media that facilitate transfer of a computer program from one place to another. Storage media may be any available media that can be accessed by a general purpose or special purpose computer. For example, such computer-readable media can include, but is not limited to, RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store program code in the form of instructions or data structures and which can be read by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Additionally, any connection is properly termed a computer-readable medium, and, thus, is included if the software is transmitted from a website, server, or other remote source via a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wirelessly, e.g., infrared, radio, and microwave. Such discs (disk) and disks (disc) include compact disks, laser disks, optical disks, DVDs, floppy disks and blu-ray disks where disks usually reproduce data magnetically, while disks usually reproduce data optically with lasers. Combinations of the above may also be included in the computer-readable medium.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are merely exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (4)

1.一种基于时间熵的个性化推荐方法,其特征在于,所述方法包括:1. a personalized recommendation method based on time entropy, is characterized in that, described method comprises: 对已有的用户和项目评分信息进行聚类,获得聚类的类簇中心和每个类簇内包含的用户和项目信息;Cluster the existing user and item rating information to obtain the cluster center of the cluster and the user and item information contained in each cluster; 若判定当前用户不属于新加入系统用户,则根据所述当前用户的相关属性特征信息及用户对项目的不同评分记录计算所述当前用户对项目的时间熵;If it is determined that the current user does not belong to the newly added system user, the time entropy of the current user to the item is calculated according to the relevant attribute feature information of the current user and the different scoring records of the user to the item; 根据计算出的所述当前用户对项目的时间熵,判定所述当前用户的用户类型;Determine the user type of the current user according to the calculated time entropy of the current user to the item; 根据所述当前用户的用户类型在聚类的所述类簇中心进行模式匹配,从而为所述当前用户进行多样性项目信息推荐;According to the user type of the current user, pattern matching is performed in the cluster center of the cluster, so as to perform diversity item information recommendation for the current user; 其中,所述根据所述当前用户的相关属性特征信息及用户对项目的不同评分记录利用如下时间熵公式计算所述当前用户对项目的时间熵:Wherein, the time entropy of the current user to the item is calculated by the following time entropy formula according to the relevant attribute feature information of the current user and the different scoring records of the user to the item:
Figure FDA0003158909060000011
Figure FDA0003158909060000011
其中,H为时间熵,n是时间间隔数,scorei为用户对项目i的评分,mi代表一个时间间隔内所有用户对项目i的评分总和,M指所有mi的总和;Among them, H is the time entropy, n is the number of time intervals, score i is the user's score for item i, m i represents the sum of the scores of all users on item i in a time interval, and M refers to the sum of all m i ; 根据用户兴趣偏好,判定所述当前用户的用户类型为如下四种类型之一:最近喜欢且过去也喜欢、最近喜欢但过去不喜欢、最近不喜欢但过去喜欢、最近不喜欢且过去也不喜欢;According to the user's interest preference, it is determined that the user type of the current user is one of the following four types: recently liked and liked in the past, recently liked but disliked in the past, disliked recently but liked in the past, disliked recently and also disliked in the past ; 所述根据所述当前用户的用户类型在聚类的所述类簇中心进行模式匹配,从而为所述当前用户进行多样性项目信息推荐,包括:The performing pattern matching at the cluster center of the cluster according to the user type of the current user, so as to recommend diversity item information for the current user, including: 根据所述当前用户的用户类型在聚类的所述类簇中心的如下两种兴趣偏好模式中进行模式匹配:规律性兴趣偏好模式和非规律性兴趣偏好模式;其中,所述规律性兴趣偏好模式为最近喜欢且过去也喜欢;所述非规律性兴趣偏好模式包括:最近喜欢但过去不喜欢、最近不喜欢但过去喜欢、最近不喜欢且过去也不喜欢;According to the user type of the current user, pattern matching is performed in the following two interest preference patterns in the cluster center of the cluster: a regular interest preference pattern and an irregular interest preference pattern; wherein the regular interest preference The pattern is liked recently and liked in the past; the irregular interest preference pattern includes: liked recently but disliked in the past, disliked recently but liked in the past, disliked recently and disliked in the past; 其中,所述根据所述当前用户的用户类型在聚类的所述类簇中心进行模式匹配,从而为所述当前用户进行多样性项目信息推荐,具体为:当模式匹配为规律性兴趣偏好模式时,计算用户所属聚类,簇类中运用基于项目的协同过滤算法进行推荐;否则,采用项目的多样性公式为用户进行推荐;Wherein, performing pattern matching in the cluster center of the cluster according to the user type of the current user, so as to perform diversity item information recommendation for the current user, specifically: when the pattern matching is a regular interest preference pattern When the user belongs to the cluster, the item-based collaborative filtering algorithm is used for recommendation in the cluster; otherwise, the item's diversity formula is used to recommend the user; 其中,所述多样性公式为:Wherein, the diversity formula is:
Figure FDA0003158909060000021
Figure FDA0003158909060000021
Figure FDA0003158909060000022
Figure FDA0003158909060000022
Figure FDA0003158909060000023
Figure FDA0003158909060000023
其中,fr(i)指标签在集合中出现的频率,z(i)是标签候选集合中所有标签的相似度之和,α为调节系数,项rui和ruj分别代表用户u对项目i和项目j的评分,
Figure FDA0003158909060000024
为所有评价过项目j的用户对项目j的平均评分,
Figure FDA0003158909060000025
为所有评价过项目i的用户对项目i的平均评分,Uij为同时对项目i与项目j评分的用户集合,sim(i,j)的值介于[-1,1]之间。
Among them, fr(i) refers to the frequency of the tag appearing in the set, z(i) is the sum of the similarity of all tags in the tag candidate set, α is the adjustment coefficient, and the items r ui and r uj represent user u’s pairing of item i respectively and the rating of item j,
Figure FDA0003158909060000024
is the average rating of item j for all users who have evaluated item j,
Figure FDA0003158909060000025
is the average rating of item i by all users who have evaluated item i, U ij is the set of users who rated item i and item j at the same time, and the value of sim(i, j) is between [-1, 1].
2.如权利要求1所述基于时间熵的个性化推荐方法,其特征在于,所述方法还包括:2. The personalized recommendation method based on time entropy according to claim 1, wherein the method further comprises: 若判定当前用户为新加入系统用户,采用如下项目流行度计算方式将排名在前的N个项目信息推荐给所述当前用户:If it is determined that the current user is a new user joining the system, the following item popularity calculation method is used to recommend the information of the top N items to the current user:
Figure FDA0003158909060000026
Figure FDA0003158909060000026
其中,ipop为项目流行度,Ui代表推荐系统中评价过项目i的用户集合,I为推荐系统中的所有项目数。Among them, i pop is the popularity of the item, U i represents the set of users who have evaluated item i in the recommender system, and I is the number of all items in the recommender system.
3.一种基于时间熵的个性化推荐装置,其特征在于,所述装置包括:3. A personalized recommendation device based on time entropy, wherein the device comprises: 聚类单元,用于对已有的用户和项目评分信息进行聚类,获得聚类的类簇中心和每个类簇内包含的用户和项目信息;The clustering unit is used to cluster the existing user and item rating information, and obtain the cluster center of the cluster and the user and item information contained in each cluster; 计算单元,用于若判定当前用户不属于新加入系统用户,则根据所述当前用户的相关属性特征信息及用户对项目的不同评分记录计算所述当前用户对项目的时间熵;A computing unit, configured to calculate the time entropy of the current user to the project according to the relevant attribute feature information of the current user and the different scoring records of the user to the project if it is determined that the current user does not belong to the newly added system user; 判断单元,用于根据计算出的所述当前用户对项目的时间熵,判定所述当前用户的用户类型;a judgment unit, configured to judge the user type of the current user according to the calculated time entropy of the current user to the item; 匹配单元,用于根据所述当前用户的用户类型在聚类的所述类簇中心进行模式匹配,从而为所述当前用户进行多样性项目信息推荐;a matching unit, configured to perform pattern matching at the cluster center of the cluster according to the user type of the current user, thereby recommending diversity item information for the current user; 其中,所述计算单元,具体用于根据所述当前用户的相关属性特征信息及用户对项目的不同评分记录利用如下时间熵公式计算所述当前用户对项目的时间熵:Wherein, the calculation unit is specifically configured to calculate the time entropy of the current user to the item by using the following time entropy formula according to the relevant attribute feature information of the current user and the different scoring records of the user to the item:
Figure FDA0003158909060000031
Figure FDA0003158909060000031
其中,H为时间熵,n是时间间隔数,scorei为用户对项目i的评分,mi代表一个时间间隔内所有用户对项目i的评分总和,M指所有mi的总和;Among them, H is the time entropy, n is the number of time intervals, score i is the user's score for item i, m i represents the sum of the scores of all users on item i in a time interval, and M refers to the sum of all m i ; 所述判断单元,具体用于根据用户兴趣偏好,判定所述当前用户的用户类型为如下四种类型之一:最近喜欢且过去也喜欢、最近喜欢但过去不喜欢、最近不喜欢但过去喜欢、最近不喜欢且过去也不喜欢;The determining unit is specifically configured to determine, according to the user's interest preference, that the user type of the current user is one of the following four types: recently liked and liked in the past, liked recently but disliked in the past, disliked recently but liked in the past, I didn't like it recently and I didn't like it in the past; 所述匹配单元,具体用于根据所述当前用户的用户类型在聚类的所述类簇中心的如下两种兴趣偏好模式中进行模式匹配:规律性兴趣偏好模式和非规律性兴趣偏好模式;其中,所述规律性兴趣偏好模式为最近喜欢且过去也喜欢;所述非规律性兴趣偏好模式包括:最近喜欢但过去不喜欢、最近不喜欢但过去喜欢、最近不喜欢且过去也不喜欢;The matching unit is specifically configured to perform pattern matching in the following two interest preference modes in the cluster center of the cluster according to the user type of the current user: a regular interest preference mode and an irregular interest preference mode; Wherein, the regular interest preference pattern is like recently and also in the past; the irregular interest preference pattern includes: like recently but disliked in the past, disliked recently but liked in the past, disliked recently and disliked in the past; 其中,所述匹配单元,具体用于:当模式匹配为规律性兴趣偏好模式时,计算用户所属聚类,簇类中运用基于项目的协同过滤算法进行推荐;否则,采用项目的多样性公式为用户进行推荐;Wherein, the matching unit is specifically used to: when the pattern matching is a regular interest preference pattern, calculate the cluster to which the user belongs, and use the item-based collaborative filtering algorithm for recommendation in the cluster; otherwise, use the item diversity formula as: users recommend; 其中,所述多样性公式为:Wherein, the diversity formula is:
Figure FDA0003158909060000032
Figure FDA0003158909060000032
Figure FDA0003158909060000033
Figure FDA0003158909060000033
Figure FDA0003158909060000034
Figure FDA0003158909060000034
其中,fr(i)指标签在集合中出现的频率,z(i)是标签候选集合中所有标签的相似度之和,α为调节系数,项rui和ruj分别代表用户u对项目i和项目j的评分,
Figure FDA0003158909060000035
为所有评价过项目j的用户对项目j的平均评分,
Figure FDA0003158909060000036
为所有评价过项目i的用户对项目i的平均评分,Uij为同时对项目i与项目j评分的用户集合,sim(i,j)的值介于[-1,1]之间。
Among them, fr(i) refers to the frequency of the tag appearing in the set, z(i) is the sum of the similarity of all tags in the tag candidate set, α is the adjustment coefficient, and the items r ui and r uj represent user u’s pairing of item i respectively and the rating of item j,
Figure FDA0003158909060000035
is the average rating of item j for all users who have evaluated item j,
Figure FDA0003158909060000036
is the average rating of item i by all users who have evaluated item i, U ij is the set of users who rated item i and item j at the same time, and the value of sim(i, j) is between [-1, 1].
4.如权利要求3所述基于时间熵的个性化推荐装置,其特征在于,所述装置还包括:4. The personalized recommendation device based on time entropy according to claim 3, wherein the device further comprises: 推荐单元,用于若判定当前用户为新加入系统用户,采用如下项目流行度计算方式将排名在前的N个项目信息推荐给所述当前用户:The recommending unit is used to recommend the top N item information to the current user if it is determined that the current user is a new user joining the system, using the following item popularity calculation method:
Figure FDA0003158909060000041
Figure FDA0003158909060000041
其中,ipop为项目流行度,Ui代表推荐系统中评价过项目i的用户集合,I为推荐系统中的所有项目数。Among them, i pop is the popularity of the item, U i represents the set of users who have evaluated item i in the recommender system, and I is the number of all items in the recommender system.
CN201810794746.2A 2018-07-19 2018-07-19 Personalized recommendation method and device based on time entropy Expired - Fee Related CN109063052B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810794746.2A CN109063052B (en) 2018-07-19 2018-07-19 Personalized recommendation method and device based on time entropy

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810794746.2A CN109063052B (en) 2018-07-19 2018-07-19 Personalized recommendation method and device based on time entropy

Publications (2)

Publication Number Publication Date
CN109063052A CN109063052A (en) 2018-12-21
CN109063052B true CN109063052B (en) 2022-01-25

Family

ID=64817411

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810794746.2A Expired - Fee Related CN109063052B (en) 2018-07-19 2018-07-19 Personalized recommendation method and device based on time entropy

Country Status (1)

Country Link
CN (1) CN109063052B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110070134A (en) * 2019-04-25 2019-07-30 厦门快商通信息咨询有限公司 A kind of recommended method and device based on user interest perception
CN111191707B (en) * 2019-12-25 2023-06-06 浙江工商大学 A method for constructing LFM training samples with time decay factor
CN114548501B (en) * 2022-01-14 2024-06-18 北京全路通信信号研究设计院集团有限公司 Balancing checking method, system and equipment
CN114581179A (en) * 2022-03-02 2022-06-03 中国工商银行股份有限公司 Product recommendation method, device, equipment and medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101329683A (en) * 2008-07-25 2008-12-24 华为技术有限公司 Recommendation system and method
KR101030653B1 (en) * 2009-01-22 2011-04-20 성균관대학교산학협력단 User-based Collaborative Filtering Recommendation System Using Information Entropy to Correct Similarity
CN102135999A (en) * 2011-03-25 2011-07-27 南京财经大学 User credibility and item nearest neighbor combination Internet recommendation method
CN103186539B (en) * 2011-12-27 2016-07-27 阿里巴巴集团控股有限公司 A kind of method and system determining user group, information inquiry and recommendation
CN106649658B (en) * 2016-12-13 2020-06-16 重庆邮电大学 Recommendation system and method for indiscriminate treatment of user roles and data sparse
CN107391713B (en) * 2017-07-29 2020-04-28 内蒙古工业大学 Method and system for solving cold start problem in collaborative filtering recommendation technology

Also Published As

Publication number Publication date
CN109063052A (en) 2018-12-21

Similar Documents

Publication Publication Date Title
Loni et al. Bayesian personalized ranking with multi-channel user feedback
US10572565B2 (en) User behavior models based on source domain
CN109063052B (en) Personalized recommendation method and device based on time entropy
CN107833117B (en) Bayesian personalized sorting recommendation method considering tag information
US9967628B2 (en) Rating videos based on parental feedback
CN104008138B (en) A kind of music based on social networks recommends method
US9875245B2 (en) Content item recommendations based on content attribute sequence
CN104317835B (en) The new user of video terminal recommends method
CN110532479A (en) A kind of information recommendation method, device and equipment
KR101098871B1 (en) APPARATUS AND METHOD FOR MEASURING CONTENTS SIMILARITY BASED ON FEEDBACK INFORMATION OF RANKED USER and Computer Readable Recording Medium Storing Program thereof
US9607096B2 (en) System and method for content access control
US20110320441A1 (en) Adjusting search results based on user social profiles
Hu et al. Style in the long tail: Discovering unique interests with latent variable models in large scale social e-commerce
Aiolli A Preliminary Study on a Recommender System for the Million Songs Dataset Challenge.
KR102108683B1 (en) Method for providing recommendation contents including non-interest contents
US10097664B2 (en) Recommending media items based on purchase history
US20140288999A1 (en) Social character recognition (scr) system
CN115066906A (en) Method and system for recommending based on user-provided criteria
CN108876536A (en) Collaborative filtering recommending method based on arest neighbors information
CN110083764A (en) A kind of collaborative filtering cold start-up way to solve the problem
Monechi et al. Significance and popularity in music production
CN112528149A (en) Intelligent recommendation method fusing knowledge graph and Bayesian network
CN106202151A (en) One is used for improving the multifarious method of personalized recommendation system
CN108389113B (en) A collaborative filtering recommendation method and system
CN111241381A (en) Information recommendation method, apparatus, electronic device, and computer-readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220125