CN112084427A - Interest point recommendation method based on graph neural network - Google Patents
Interest point recommendation method based on graph neural network Download PDFInfo
- Publication number
- CN112084427A CN112084427A CN202010965903.9A CN202010965903A CN112084427A CN 112084427 A CN112084427 A CN 112084427A CN 202010965903 A CN202010965903 A CN 202010965903A CN 112084427 A CN112084427 A CN 112084427A
- Authority
- CN
- China
- Prior art keywords
- interest
- user
- graph
- vector
- point
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Business, Economics & Management (AREA)
- Biomedical Technology (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Probability & Statistics with Applications (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明公开了一种基于图神经网络的兴趣点推荐方法,步骤为:构建用户‑兴趣点交互图和用户社交图,图神经网络学习图结构信息并在用户的嵌入向量中整合了协作信息和社交信息;采用k‑means算法对兴趣点按地理位置进行聚类,将聚类结果嵌入到向量中,连接通过在用户‑兴趣点交互图中得到的嵌入向量,输入到一个神经网络中得到兴趣点嵌入向量;构建一个神经网络模型,模拟机器学习中的矩阵分解方法,将用户和兴趣点的嵌入向量输入到神经网络模型中根据用户的历史评分进行评分预测。本发明在用户的向量表示中嵌入协作信息和社交网络中的信息,在兴趣点的向量表示中嵌入协作信息以及兴趣点的位置信息,将用户和兴趣点的向量表示输入到神经网络中进行推荐。
The invention discloses a method for recommending interest points based on a graph neural network. The steps are as follows: constructing a user-interest point interaction graph and a user social graph, the graph neural network learns graph structure information, and integrates collaboration information and a user's embedding vector into the user's embedding vector. Social information; use the k-means algorithm to cluster the points of interest by geographic location, embed the clustering results into a vector, connect the embedded vector obtained in the user-point of interest interaction graph, and input it into a neural network to get the interest Point embedding vector; build a neural network model to simulate the matrix factorization method in machine learning, and input the embedding vectors of users and interest points into the neural network model to predict the score according to the user's historical scores. The present invention embeds the collaboration information and the information in the social network in the vector representation of the user, embeds the collaboration information and the location information of the interest point in the vector representation of the interest point, and inputs the vector representation of the user and the interest point into the neural network for recommendation .
Description
技术领域technical field
本发明属于神经网络和推荐系统的技术领域,尤其涉及一种基于图神 经网络的兴趣点推荐方法。The invention belongs to the technical field of neural networks and recommendation systems, and in particular relates to a method for recommending points of interest based on graph neural networks.
背景技术Background technique
近年来,随着移动互联网技术和智能设备的飞速发展,用户可以方便 地获取个人的实时位置信息,同时基于位置的社交网络(LBSNs)和兴趣点 推荐技术(POIrecommendation)也受到了广泛关注。协同过滤是推荐系统 中广泛使用的方法,其基本思想是,通过分析用户的兴趣偏好找到与某个 用户偏好相似的其他用户,然后综合这些相似用户对兴趣点的评价,预测 该用户对兴趣点的偏好程度。学习用户和兴趣点的向量表示是协同过滤推 荐的研究热点。随着词嵌入技术的发展,很多方法将描述用户和兴趣点的 特征信息进行嵌入作为用户和兴趣点的向量表示,捕获了用户的偏好以及 兴趣点的特征,但是该方法没有捕获用户-兴趣点交互图中的协作信息, 产生的嵌入向量不足以捕获协同过滤影响。图神经网络被提出学习图数据 中的向量表示,可以整合图中的节点信息、边信息以及拓扑结构,它的主 要思想是使用神经网络从邻居节点中迭代的转换和聚合特征信息,与此同 时,在转换和聚合之后节点信息也可以在图上传播。图神经网络由于能够 整合节点信息和拓扑结构,所以在学习向量表示上取得了成功。例如,使 用图神经网络在用户-项目交互图中学习用户和项目的嵌入向量。In recent years, with the rapid development of mobile Internet technology and smart devices, users can easily obtain real-time personal location information. At the same time, location-based social networks (LBSNs) and point-of-interest recommendation technology (POI recommendation) have also received extensive attention. Collaborative filtering is a widely used method in recommender systems. Its basic idea is to find other users who are similar to a user's preferences by analyzing the user's interest preferences, and then synthesize the evaluation of these similar users' points of interest to predict the user's interest points. degree of preference. Learning vector representations of users and points of interest is a research hotspot in collaborative filtering recommendation. With the development of word embedding technology, many methods embed feature information describing users and POIs as vector representations of users and POIs, which capture the user's preferences and the features of POIs, but this method does not capture user-POI Collaboration information in the interaction graph, the resulting embedding vector is not enough to capture the impact of collaborative filtering. The graph neural network is proposed to learn the vector representation in the graph data, which can integrate the node information, edge information and topology structure in the graph. Its main idea is to use the neural network to iteratively transform and aggregate the feature information from the neighbor nodes. , node information can also be propagated on the graph after transformation and aggregation. Graph neural networks have been successful in learning vector representations due to their ability to integrate node information and topology. For example, using a graph neural network to learn user and item embedding vectors in a user-item interaction graph.
位置信息是兴趣点的基本属性,是兴趣点推荐算法必须考虑的关键因 素。例如,用户可能往往选择物理距离据自己相近的兴趣点。社交关系在 兴趣点推荐中吸引了很多关注,用户可以通过他的朋友、同事等获得或传 播信息,这意味着用户潜在的社交关系在用户过滤信息时起着一定作用。 Wang等人通过研究发现,两个用户之间签到行为的相似性与他们在社交 网络中的相关性有很强的联系。Gowalla和Foursquare数据集报告了超过5%、20%和40%的社交、邻居和地理位置朋友的相似度大于0.2,证明了 用户朋友对用户的决策行为存在一定影响。现有模型很多都是只考虑了用 户社交网络中的朋友,忽略了用户的决策行为可能受用户的朋友的朋友的 影响,用户之间的信息传递不仅限于用户的朋友,用户之间的信息传递存 在信息扩散现象,只考虑用户社交网络中的朋友,不足以捕获社交网络中 传递的信息带来的影响。Location information is the basic attribute of POIs and a key factor that must be considered in POI recommendation algorithms. For example, users may tend to select points of interest that are physically close to themselves. Social relations have attracted a lot of attention in POI recommendation, and users can obtain or spread information through his friends, colleagues, etc., which means that users' potential social relations play a certain role when users filter information. Through research, Wang et al. found that the similarity of check-in behavior between two users has a strong relationship with their relevance in social networks. The Gowalla and Foursquare datasets report that more than 5%, 20%, and 40% of social, neighbor, and geographic friends have a similarity greater than 0.2, proving that users' friends have a certain influence on users' decision-making behavior. Many existing models only consider the friends in the user's social network, ignoring that the user's decision-making behavior may be affected by the user's friends' friends, and the information transfer between users is not limited to the user's friends. There is a phenomenon of information diffusion, and only considering the friends in the user's social network is not enough to capture the impact of the information transmitted in the social network.
发明内容SUMMARY OF THE INVENTION
针对现有技术的不足,本发明所解决的技术问题在于提供一种基于图 神经网络的兴趣点推荐方法,构造成社交图和用户-兴趣点交互图,在用 户的向量表示中嵌入了协作信息和社交网络中的信息,同时在兴趣点的向 量表示中嵌入了协作信息以及兴趣点的位置信息,进而将用户和兴趣点的 向量表示输入到神经网络中进行推荐。In view of the deficiencies of the prior art, the technical problem solved by the present invention is to provide a method for recommending points of interest based on a graph neural network, which is constructed into a social graph and a user-point of interest interaction graph, and the collaboration information is embedded in the vector representation of the user. At the same time, the collaboration information and the location information of the interest point are embedded in the vector representation of the interest point, and then the vector representation of the user and the interest point is input into the neural network for recommendation.
为了解决上述技术问题,本发明通过以下技术方案来实现:In order to solve the above-mentioned technical problems, the present invention realizes through the following technical solutions:
本发明提供一种基于图神经网络的兴趣点推荐方法,包括以下步骤:The present invention provides a method for recommending points of interest based on a graph neural network, comprising the following steps:
步骤S1:构建用户-兴趣点交互图和用户社交图,图神经网络学习图 结构信息并在用户的嵌入向量中整合了协作信息和社交信息;Step S1: construct a user-interest point interaction graph and a user social graph, and the graph neural network learns the graph structure information and integrates collaboration information and social information in the user's embedding vector;
步骤S2:采用k-means算法对兴趣点按地理位置进行聚类,将聚类 结果嵌入到向量中,连接通过在用户-兴趣点交互图中得到的嵌入向量, 并输入到一个神经网络中得到兴趣点嵌入向量;Step S2: Use the k-means algorithm to cluster the points of interest by geographic location, embed the clustering results into a vector, connect the embedded vectors obtained in the user-point-of-interest interaction graph, and input it into a neural network to obtain Interest point embedding vector;
步骤S3:构建一个神经网络模型,模拟机器学习中的矩阵分解方法, 将用户和兴趣点的嵌入向量输入到神经网络模型中根据用户的历史评分进 行评分预测。Step S3: Construct a neural network model, simulate the matrix decomposition method in machine learning, and input the embedded vectors of users and points of interest into the neural network model to perform score prediction according to the user's historical scores.
可选的,在步骤S1中,使用交互聚合器在用户-兴趣点交互图上学习 并产生兴趣点空间的潜在向量,社交聚合器在社交图上学习产生社交空间 的潜在向量,将兴趣点空间的潜在向量和社交空间的潜在向量结合得到用 户的嵌入向量。Optionally, in step S1, an interaction aggregator is used to learn on the user-POI interaction graph and generate a latent vector of the POI space, and the social aggregator learns to generate a latent vector of the social space on the social graph, and the POI space The latent vector of , and the latent vector of the social space are combined to obtain the embedding vector of the user.
进一步的,所述步骤S2中,基于k-means的兴趣点聚类先随机选取k 个兴趣点作为初始的聚类中心,然后计算每个兴趣点与各个聚类中心之间 的距离,把每个兴趣点分配给距离它最近的聚类中心。Further, in the step S2, the interest point clustering based on k-means first randomly selects k interest points as the initial cluster centers, then calculates the distance between each interest point and each cluster center, and calculates the distance between each interest point and each cluster center. Each interest point is assigned to the cluster center closest to it.
可选的,所述步骤S2中,基于k-means算法的兴趣点聚类的处理过 程如下:Optionally, in the step S2, the processing process of the interest point clustering based on the k-means algorithm is as follows:
1)从数据集中随机选择k个兴趣点作为初始聚类中心;1) Randomly select k interest points from the dataset as initial cluster centers;
2)计算剩余兴趣点到聚类中心的欧式距离ρ,将最接近的兴趣点放 入类别中,形成新的类,ρ的计算方法为:2) Calculate the Euclidean distance ρ from the remaining interest points to the cluster center, and put the closest interest points into the category to form a new category. The calculation method of ρ is:
其中vi=(lati,loni)和vj=(latj,lonj)为兴趣点数据集V={v1,v2,......, vn}中的两个兴趣点;where v i = (lat i , lon i ) and v j = (lat j , lon j ) are two of the interest point datasets V = {v 1 , v 2 ,..., v n } Points of Interest;
3)取当前类的所有兴趣点的均值,作为中心点,更新距离类的中心 最近的兴趣点;3) Take the mean of all interest points of the current class as the center point, and update the interest point closest to the center of the class;
4)直到目标函数收敛或聚类中心不变,否则将转移到步骤2);4) Until the objective function converges or the cluster center remains unchanged, otherwise it will transfer to step 2);
5)输出聚类结果。5) Output the clustering results.
由上,本发明的基于图神经网络的兴趣点推荐方法具有如下有益效果:From the above, the interest point recommendation method based on the graph neural network of the present invention has the following beneficial effects:
1.提出了一个新的基于图神经网络的兴趣点推荐方法,该模型能够学 习用户-兴趣点交互图和社交图上的信息。1. A new method for POI recommendation based on graph neural network is proposed, which can learn information on user-POI interaction graphs and social graphs.
2.在用户-兴趣点交互图中同时考虑了用户的评价和交互,更好的捕获 了协作信息。2. The user's evaluation and interaction are taken into account in the user-point-of-interest interaction graph, which better captures collaboration information.
3.考虑了用户的社交关系中的各阶朋友以及用户之间的亲密度。3. All levels of friends in the user's social relationship and the intimacy between users are considered.
4.在Yelp数据集上进行了算法的效果与性能实验评价,验证了本发 明方法的有效性。4. The experimental evaluation of the effect and performance of the algorithm is carried out on the Yelp data set, which verifies the effectiveness of the method of the present invention.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的 技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和 其他目的、特征和优点能够更明显易懂,以下结合优选实施例,并配合附 图,详细说明如下。The above description is only an overview of the technical solutions of the present invention, in order to be able to understand the technical means of the present invention more clearly, it can be implemented according to the content of the description, and in order to make the above and other purposes, features and advantages of the present invention more obvious and easy to understand , the following detailed description is given in conjunction with the preferred embodiments and in conjunction with the accompanying drawings.
附图说明Description of drawings
为了更清楚地说明本发明实施例的技术方案,下面将对实施例的附图 作简单地介绍。In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the accompanying drawings of the embodiments will be briefly introduced below.
图1为本发明的基于图神经网络的兴趣点推荐方法的模型图;1 is a model diagram of a method for recommending points of interest based on a graph neural network of the present invention;
图2为用户u1三阶朋友的嵌入方式;Fig. 2 is the embedding mode of the third - order friend of user u1;
图3为本发明的总体框架图;Fig. 3 is the overall frame diagram of the present invention;
图4为MAE的变化情况图;Fig. 4 is the change situation diagram of MAE;
图5为RMSE的变化情况图。Figure 5 is a graph of the variation of RMSE.
具体实施方式Detailed ways
下面结合附图详细说明本发明的具体实施方式,其作为本说明书的一 部分,通过实施例来说明本发明的原理,本发明的其他方面、特征及其优 点通过该详细说明将会变得一目了然。在所参照的附图中,不同的图中相 同或相似的部件使用相同的附图标号来表示。The specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. As a part of this specification, the principles of the present invention will be described through examples. Other aspects, features and advantages of the present invention will become clear at a glance through the detailed description. In the figures to which reference is made, the same or similar parts are denoted by the same reference numerals in different figures.
基于图嵌入的兴趣点推荐包括四个部分:用户嵌入向量建模,兴趣点 嵌入向量建模,评分预测,模型训练。本发明分别构建用户-兴趣点交互 图和社交图,然后使用交互聚合器在用户-兴趣点交互图上学习并产生兴 趣点空间的潜在向量,社交聚合器在社交图上学习产生社交空间的潜在向 量,最后将兴趣点空间的潜在向量和社交空间的潜在向量结合得到用户的 嵌入向量;兴趣点向量建模部分,考虑了兴趣点存在于交互数据中的协作信息和位置信息,将位置信息使用k-means算法进行聚类,使每个聚类中 的兴趣点具有相同的位置属性标签,然后将其嵌入到向量中得到位置空间 的潜在向量,用户空间的潜在向量的学习与用户嵌入向量中兴趣点空间的 潜在向量的学习相同,最后集成位置空间的潜在向量和用户空间的潜在向 量得到兴趣点的嵌入向量;评分预测部分是用户和兴趣点建模部分的集成, 将其输入到神经网络模型中进行评分预测;模型训练部分指定一个目标函数进行优化,学习模型的参数。Graph embedding-based POI recommendation includes four parts: user embedding vector modeling, POI embedding vector modeling, rating prediction, and model training. The present invention constructs the user-interest point interaction graph and the social graph respectively, and then uses the interaction aggregator to learn on the user-interest point interaction graph and generate the latent vector of the interest point space, and the social aggregator learns to generate the potential vector of the social space on the social graph. Finally, the latent vector of the interest point space and the latent vector of the social space are combined to obtain the user's embedding vector; in the modeling part of the interest point vector, the collaboration information and location information of the interest point existing in the interaction data are considered, and the location information is used The k-means algorithm performs clustering, so that the interest points in each cluster have the same location attribute label, and then embed them into the vector to obtain the latent vector of the location space, and the learning of the latent vector of the user space is the same as the user embedding vector. The learning of the latent vector of the interest point space is the same. Finally, the latent vector of the position space and the latent vector of the user space are integrated to obtain the embedded vector of the interest point; the score prediction part is the integration of the user and the interest point modeling part, which is input to the neural network. The score prediction is performed in the model; the model training part specifies an objective function for optimization and learns the parameters of the model.
本发明的相关工作主要分为基于社交信息的兴趣点推荐,基于地理信 息的兴趣点推荐和基于图嵌入的推荐。The related work of the present invention is mainly divided into POI recommendation based on social information, POI recommendation based on geographic information and recommendation based on graph embedding.
基于社交信息的兴趣点推荐,兴趣点本身并没有社会属性,但访问兴 趣点的用户具有社会属性,于是通过访问兴趣点的用户之间的社会联系, 兴趣点便被赋予了社会属性。用户的行为与朋友的行为具有一定的相似性, 用户可能会访问朋友推荐的兴趣点,因此可以利用社会关系来提高兴趣点 推荐的性能。Li等人提出了用户的决策不仅受社交网络中朋友的影响,也 受地理位置上的朋友的影响以及邻居朋友的影响(地理位置上的朋友即在 同一地点上签到过的朋友,邻居朋友即在地理位置上距离目标用户近的用 户),然后结合线性聚合和随机游走进行推荐。现有技术中将用户的相似 性整合到基于用户的协同过滤算法中,有的则将用户的相似性作为正则化 项或潜在因素模型的权重。For POI recommendation based on social information, POI itself does not have social attributes, but users who visit POIs have social attributes, so POIs are given social attributes through the social connection between users who visit POIs. The user's behavior has a certain similarity with the behavior of friends, and users may visit the points of interest recommended by friends, so social relations can be used to improve the performance of point of interest recommendation. Li et al. proposed that users' decisions are not only affected by friends in social networks, but also by geographical friends and neighbor friends (geographical friends are friends who have checked in at the same place, and neighbor friends are users who are geographically close to the target user), and then combine linear aggregation and random walk for recommendation. In the prior art, the similarity of users is integrated into the user-based collaborative filtering algorithm, and some of them use the similarity of users as the weight of the regularization term or latent factor model.
基于位置信息的兴趣点推荐,位置属性是兴趣点天然拥有的属性,一 直是兴趣点推荐算法必须考虑的基本因素。有的利用用户和兴趣点之间的 地理相关性、社会相关性和类别相关性进行兴趣点推荐。现有技术将卷积 神经网络集成到概率矩阵分解中,对地理影响、社会关系以及评论信息进 行建模。Wang等人联合用户个人兴趣和目标区域人群偏好,利用兴趣点 的共现模式及内容描述,通过空间金字塔(spatial pyramid)索引结构,平 滑人群偏好,以减轻用户外出旅行时的由于缺乏用户在旅行地点的行为数 据所造成的数据稀疏性问题,从而得到更加准确的推荐效果。Xing等人 提出用户签到呈现出地理聚类现象,使用用户对兴趣点的相邻几个兴趣点 的偏好来代替用户对兴趣点的偏好然后结合矩阵分解进行推荐。For POI recommendation based on location information, the location attribute is a natural attribute of POI, and it has always been a basic factor that must be considered in POI recommendation algorithms. Some use the geographic correlation, social correlation and category correlation between users and POIs to recommend POIs. Existing techniques integrate convolutional neural networks into probabilistic matrix factorization to model geographic influence, social relations, and review information. Wang et al. combined users' personal interests and target area population preferences, used the co-occurrence pattern and content description of points of interest, and smoothed the crowd preferences through the spatial pyramid index structure, so as to alleviate the user's travel problems due to the lack of user travel. The data sparsity problem caused by the behavior data of the location, so as to obtain a more accurate recommendation effect. Xing et al. proposed that user check-in presents a phenomenon of geographic clustering, and the user's preference for several points of interest adjacent to the point of interest is used to replace the user's preference for points of interest, and then combined with matrix decomposition for recommendation.
基于图神经网络的推荐,GC-MC描述用户和项目的关系为二部图, 两个多连接的图卷机层用来聚合用户和项目的特征信息,由于GCN在学 习高质量用户和项目时的有效性,GC-MC取得了很好的推荐效果。但是 为了区分每个节点,GC-MC使用one-hot编码作为节点的向量输入,使输 入的向量维数与节点的总数成正比,因此无法扩展到大型图上。不同于 GC-MC,STAR-GCN模型学习用户和项目的低维嵌入向量输入到网络中, 为了预测冷启动节点的嵌入向量,STAR-GCN屏蔽了部分用户和项目的嵌 入向量,通过图编码-解码器重构被屏蔽的嵌入向量,有效缓解了冷启动 问题。有的提出了将用户-项目交互数据构建成二部图,使用图神经网络 学习用户和项目的嵌入向量,在用户和项目的嵌入向量中以嵌入传播的方 式显式的注入了协作信息。现有中考虑了图中节点的多阶邻居节点,但只 考虑了节点的一阶邻居节点。Based on the recommendation of graph neural network, GC-MC describes the relationship between users and items as a bipartite graph. Two multi-connected graph convolution layers are used to aggregate the feature information of users and items. Since GCN learns high-quality users and items The effectiveness of GC-MC has achieved good recommendation results. However, in order to distinguish each node, GC-MC uses one-hot encoding as the vector input of the node, so that the input vector dimension is proportional to the total number of nodes, so it cannot scale to large graphs. Different from GC-MC, the STAR-GCN model learns low-dimensional embedding vectors of users and items to input into the network. In order to predict the embedding vectors of cold-start nodes, STAR-GCN shields some embedding vectors of users and items, and encodes them by graph- The decoder reconstructs the masked embedding vector, which effectively alleviates the cold-start problem. Some propose to construct the user-item interaction data into a bipartite graph, use a graph neural network to learn the embedding vectors of users and items, and explicitly inject collaboration information in the embedding vectors of users and items in the form of embedding propagation. The existing multi-order neighbor nodes of the nodes in the graph are considered, but only the first-order neighbor nodes of the nodes are considered.
如何结合用户-兴趣点交互图和社交图上的信息是用户嵌入向量建模 的关键,因此本发明使用交互聚合器和社交聚合器两种聚合器分别在用户 -兴趣点交互图和社交图上学习兴趣点空间的用户潜在向量和社交空间的 用户潜在向量,然后集成这两种向量得到用户的嵌入向量。How to combine the information on the user-POI interaction graph and the social graph is the key to modeling the user embedding vector, so the present invention uses two aggregators, the interaction aggregator and the social aggregator, on the user-POI interaction graph and the social graph, respectively. It learns the user latent vector of the interest point space and the user latent vector of the social space, and then integrates these two vectors to obtain the user's embedding vector.
用户-兴趣点交互图中不仅包括用户和兴趣点的交互也包括用户对兴 趣点的评价。对于学习兴趣点空间的用户潜在向量,本发明提出了一种方 法共同捕获用户-兴趣点交互图中的交互和评价,考虑了用户交互过的兴 趣点和用户对兴趣点的评价,更好的聚合了用户-兴趣点交互图中的协作 信息。为了在数学上表示这种聚合,本发明使用以下函数:The user-POI interaction graph includes not only the interaction between the user and the POI but also the user's evaluation of the POI. For learning the user latent vector of the POI space, the present invention proposes a method to jointly capture the interaction and evaluation in the user-POI interaction graph, considering the POIs that the user has interacted with and the user's evaluation of the POIs, better The collaboration information in the user-POI interaction graph is aggregated. To mathematically represent this aggregation, the present invention uses the following function:
其中C(i)为用户ui交互过的兴趣点的集合(在用户-兴趣点交互图中 ui的邻居节点),Xia表示ui和兴趣点va之间的评价感知的交互向量, Aggreitems为兴趣点聚合函数。此外σ为非线性激活函数,W和b为神经网 络的权重和偏置。接下来,详细介绍评价感知的交互向量Xia和聚合函数 Aggreitems。where C(i) is the set of interest points that user ui has interacted with (the neighbor nodes of ui in the user-interest point interaction graph), and X ia represents the evaluation-aware interaction vector between ui and interest point va , Aggre items is the point of interest aggregation function. In addition, σ is the nonlinear activation function, and W and b are the weights and biases of the neural network. Next, the evaluation-aware interaction vector X ia and the aggregation function Aggre items are introduced in detail.
用户与兴趣点进行交互时可以表达自己对于兴趣点的评价(评论或评 分),评价信息表达了用户对于兴趣点的偏好,可以更好的捕获用户-兴趣 点交互图中的协作信息。为了建模评价信息,对于每种评价信息r,采用 评价嵌入向量表示每种评价信息r为稠密向量。例如,在一个五级分制系 统中,对于每一个评分,er表示嵌入向量。对于用户ui和兴趣点va的交互 和评价信息r,本发明通过多层感知器建模评价感知的交互向量Xia为兴趣 点初始嵌入向量qa和评价嵌入向量er的结合,多层感知器将qa和er的连 接向量作为输入。多层感知器的输出为ui和va之间的评价感知的交互向 量Xia,表达式如下:When users interact with POIs, they can express their own evaluations (reviews or ratings) for POIs, and the evaluation information expresses the user's preference for POIs, which can better capture the collaboration information in the user-POI interaction graph. In order to model the evaluation information, for each evaluation information r, the evaluation embedding vector is used to represent each evaluation information r as a dense vector. For example, in a five-point scoring system, for each rating, er represents the embedding vector . For the interaction and evaluation information r between the user ui and the interest point va, the present invention uses the multi-layer perceptron to model and evaluate the perceived interaction vector X ia as the combination of the interest point initial embedding vector q a and the evaluation embedding vector er. The layer perceptron takes as input the concatenated vector of q a and er . The output of the multilayer perceptron is the evaluation-perceptual interaction vector X ia between ui and v a , which is expressed as:
其中为两个向量的连接操作,gv代表多层感知器。in is the concatenation operation of two vectors, g v represents the multilayer perceptron.
对于Aggreitems,一个常用的聚合函数为均值操作,即取中 向量的元素级上的平均值。基于均值的聚合器是局部谱卷积的线性近似, 如下所示:For Aggre items , a commonly used aggregation function is the mean operation, that is, taking The element-wise mean of the vector in . The mean-based aggregator is a linear approximation of the local spectral convolution as follows:
αi固定为是对于所有兴趣点向量基于均值的聚合操作,其假设所有 的交互对用户ui的嵌入向量的贡献都是相同的。然而,每个交互对于用户 嵌入向量的贡献都可能有很大的不同。因此,通过给每个交互分配一个权 重,允许每个交互对用户的嵌入向量做出不同的贡献。为了缓解基于均值 的聚合器的局限,受注意机制的启发,调整αi,分配个性化的权重给每个 (va,ui):α i is fixed as is a mean-based aggregation operation for all interest point vectors, which assumes that all interactions contribute the same to the embedding vector of user ui . However, each interaction's contribution to the user's embedding vector can vary widely. Therefore, by assigning a weight to each interaction, each interaction is allowed to contribute differently to the user's embedding vector. To alleviate the limitations of mean-based aggregators, inspired by the attention mechanism, adjust α i , assigning individualized weights to each ( va , ui ):
αia代表ui交互过的兴趣点va的注意力权重,即对ui兴趣点空间上的潜 在向量的贡献。本发明使用两层的神经网络对αia进行参数化,称之为注意 网络。注意网络的输入为Xia和目标用户ui的初始嵌入向量pi,注意网络 定义如下:α ia represents the attention weight of the interest point v a that ui has interacted with, that is, the contribution to the latent vector on the space of interest points of u i . The present invention uses a two-layer neural network to parameterize α ia , which is called an attention network. Note that the input to the network is X ia and the initial embedding vector p i of the target user ui , and the network is defined as follows:
最后的注意力权重是通过使用Softmax函数将上述注意得分归一化来 获得,该函数可以解释为交互对用户ui兴趣点空间上的潜在向量的贡献:The final attention weights are obtained by normalizing the above attention scores using the Softmax function, which can be interpreted as the contribution of the interaction to the latent vector on the space of user u i interest points:
由于社交相关性原理,用户的偏好与社交网络中的朋友的偏好相似或 者受其影响。同时,社交图中用户与其朋友的亲密度进一步影响用户的决 策行为。因此,本发明引入注意力机制来建模用户与其朋友的亲密度,然 后对其信息进行聚合。此部分分为两部分进行说明:一阶聚合以及高阶聚 合。一阶聚合考虑了用户社交图中的邻居节点,用户可能不仅受社交网络 中朋友的影响,朋友的朋友可能也会影响着用户的决策行为,因此在高阶聚合中考虑了用户在社交图中的朋友的朋友,本发明称之为高阶朋友。Due to the principle of social relevance, a user's preferences are similar to or influenced by those of friends in a social network. At the same time, the intimacy between users and their friends in the social graph further affects the user's decision-making behavior. Therefore, the present invention introduces an attention mechanism to model the intimacy of users with their friends, and then aggregate their information. This section is described in two parts: first-order aggregation and higher-order aggregation. The first-order aggregation considers the neighbor nodes in the user's social graph. The user may not only be influenced by friends in the social network, but friends of friends may also affect the user's decision-making behavior. Therefore, the user's social graph is considered in the high-order aggregation. The friends of the friends are called high-level friends in the present invention.
为了从社交网络中表示用户的潜在向量,本发明引入了社交空间的潜 在向量,聚合了社交图中的邻居用户节点,用户ui的社交空间的潜在向量 表达式如下:In order to represent the latent vector of the user from the social network, the present invention introduces the latent vector of the social space, which aggregates the neighbor user nodes in the social graph. The latent vector expression of the social space of the user ui is as follows:
N(i)为ui的邻居节点,Aggreneigbhors为用户邻居节点的聚合函数。 Aggreneigbhors常用的聚合函数为取中向量的元素级上的均值。如 下所述:N( i ) is the neighbor node of ui, and Aggre neigbhors is the aggregation function of the user's neighbor node. Aggregate functions commonly used by Aggre neigbhors are The element-wise mean of the vector in . as described below:
其中βi固定为是对所有邻居节点的基于均值的聚合器。他假设所 有的邻居节点对于ui的贡献是相同的。然而在社交网络中用户和朋友之间 的亲密度不同,他们之间的偏好可能也会有所不同,更加亲密的朋友可能 会分享更多的信息,具有更相似的偏好。因此本发明引进具有两层的神经 网络的注意力机制来建模用户之间的亲密度,通过给用户的每个朋友分配 一个亲密度,允许每个朋友对用户的嵌入向量做出不同的贡献,用户与朋 友之间越亲密,对用户的嵌入向量贡献越大,如下所述:where β i is fixed as is a mean-based aggregator over all neighbor nodes. He assumes that all neighbor nodes contribute the same to ui . However, in a social network, the intimacy between users and friends is different, and their preferences may also be different, and more intimate friends may share more information and have more similar preferences. Therefore, the present invention introduces an attention mechanism with a two-layer neural network to model the intimacy between users, by assigning an intimacy to each friend of the user, allowing each friend to make different contributions to the user's embedding vector , the closer the user and friend are, the greater the contribution to the user's embedding vector, as follows:
βio为用户uo和用户ui之间的亲密度。β io is the intimacy between user u o and user ui .
为了考虑用户的高阶朋友给用户决策带来的影响,本发明聚合了来自 高阶朋友的信息。在l-1步,用户ui的社交空间的潜在向量迭代表示为:In order to consider the influence of the user's high-level friends on the user's decision-making, the present invention aggregates the information from the high-level friends. At step l-1, the latent vector iteratively expresses the social space of user ui as:
如图2所示,用户u1嵌入了3阶朋友的信息,从图中可以看出用户 u6对用户u1的嵌入向量产生了影响。As shown in Figure 2, user u 1 embeds the information of 3rd-order friends, and it can be seen from the figure that user u 6 has an influence on the embedding vector of user u 1 .
为了更好的学习用户的嵌入向量,需要考虑用户社交空间的潜在向量 和兴趣点空间的潜在向量,用户-兴趣点交互图和社交图提供了不同方面 的信息。将和输入到MLP中得到用户嵌入向量,定义如下:In order to better learn the user's embedding vector, it is necessary to consider the latent vector of the user's social space and the latent vector of the interest point space. The user-interest point interaction graph and the social graph provide different aspects of information. Will and Input into the MLP to get the user embedding vector, which is defined as follows:
c2=σ(W2·c1+b2) (15)c 2 =σ(W 2 ·c 1 +b 2 ) (15)
............
hi=σ(W1·cl-1+b1) (16)h i =σ(W 1 ·c l-1 +b 1 ) (16)
其中l为隐藏层的索引。where l is the index of the hidden layer.
兴趣点嵌入向量建模考虑了两部分信息:用户-兴趣点交互图中的协 作信息以及兴趣点的位置信息。The POI embedding vector modeling considers two parts of information: the collaboration information in the user-POI interaction graph and the location information of the POIs.
对于每个兴趣点vj,从与vj交互过的用户集合B(j)中聚合信息。即使 对于同一个兴趣点,每个用户也可能在用户-兴趣点交互期间表达不同的 评价,这些来自不同用户的评价可以捕捉同一个兴趣点的特征,有助于对 兴趣点的嵌入向量进行建模。对于具有r的用户ut对兴趣点vj的交互,本 发明引入了评价感知的交互向量fjt,通过将用户的初始嵌入向量pt和评价 信息的嵌入向量er输入到多层感知器gu中得到,gu融合了用户对兴趣点 的交互信息和评价信息,如下所述:For each point of interest v j , aggregate information from the set B(j) of users who have interacted with v j . Even for the same POI, each user may express different evaluations during the user-POI interaction. These evaluations from different users can capture the features of the same POI and help to build the embedding vector of POIs. mold. For the interaction of the user ut with r to the interest point vj, the present invention introduces the interaction vector f jt of evaluation perception, by inputting the initial embedding vector p t of the user and the embedding vector er of evaluation information into the multilayer perceptron obtained from g u , g u fuses the user’s interaction information and evaluation information on points of interest, as follows:
对于兴趣点vj,为了学习用户空间的嵌入向量本发明提出聚合B(j) 中用户的评价感知的交互向量。用户的聚合函数标记为Aggreusers,用来聚 合中用户的评价感知的交互向量:For the interest point v j , in order to learn the user-space embedding vector The present invention proposes to aggregate the user's evaluation-aware interaction vector in B(j). The user's aggregation function is marked as Aggre users and is used to aggregate The user's rating-aware interaction vector in :
此外,引入了两层神经注意网络的注意力机制来区分每个用户的重要 程度权重μjt,fjt和qj作为输入:In addition, an attention mechanism of a two-layer neural attention network is introduced to distinguish the importance weights μ jt , f jt and q j of each user as input:
μjt可以从用户-兴趣点交互图中捕获不同用户的不同影响。μ jt can capture the different influences of different users from the user-POI interaction graph.
兴趣点的位置信息建模首先将兴趣点的位置信息使用k-means算法进 行聚类,每个兴趣点可以得到相应的聚类标签,然后将兴趣点的聚类标签 嵌入到向量中,得到位置空间的潜在向量。The location information modeling of interest points firstly uses the k-means algorithm to cluster the location information of interest points, each interest point can get the corresponding cluster label, and then embeds the cluster label of the interest point into the vector to get the location latent vector of space.
基于k-means的兴趣点聚类是先随机选取k个兴趣点作为初始的聚类 中心。然后计算每个兴趣点与各个聚类中心之间的距离,把每个兴趣点分 配给距离它最近的聚类中心。聚类中心以及分配给它们的兴趣点就代表一 个聚类,每个聚类的聚类中心根据聚类中现有的兴趣点被重新计算。这个 过程将不断重复直到满足某个终止条件。终止条件可以是没有(或最小数 目)对象被重新分配给不同的聚类,没有(或最小数目)聚类中心再发生 变化,误差平方和局部最小。The interest point clustering based on k-means is to randomly select k interest points as the initial clustering center. Then calculate the distance between each interest point and each cluster center, and assign each interest point to its nearest cluster center. The cluster centers and the points of interest assigned to them represent a cluster, and the center of each cluster is recalculated based on the existing points of interest in the cluster. This process will repeat until a certain termination condition is met. Termination conditions can be that no (or a minimum number) of objects are reassigned to different clusters, no (or a minimum number) of cluster centers change again, and the sum of squared errors is locally minimized.
基于k-means算法的兴趣点聚类的处理过程如下:The processing process of interest point clustering based on k-means algorithm is as follows:
i)从数据集中随机选择k个兴趣点作为初始聚类中心。i) Randomly select k interest points from the dataset as initial cluster centers.
ii)计算剩余兴趣点到聚类中心的欧式距离ρ,将最接近的兴趣点放 入类别中,形成新的类,ρ的计算方法为:ii) Calculate the Euclidean distance ρ from the remaining interest points to the cluster center, and put the closest interest points into the category to form a new category. The calculation method of ρ is:
其中vi=(lati,loni)和vj=(latj,lonj)为兴趣点数据集V={v1,v2,......, vn}中的两个兴趣点。where v i = (lat i , lon i ) and v j = (lat j , lon j ) are two of the interest point datasets V = {v 1 , v 2 ,..., v n } Points of Interest.
iii)取当前类的所有兴趣点的均值,作为中心点,如图2,更新距离 类的中心最近的兴趣点。iii) Take the mean of all interest points of the current class as the center point, as shown in Figure 2, and update the interest point closest to the center of the class.
iv)直到目标函数收敛或聚类中心不变,否则将转移到ii)。iv) Until the objective function converges or the cluster centers do not change, otherwise move to ii).
v)输出聚类结果。v) Output the clustering results.
然后将聚类结果嵌入到向量中。例如,聚类结果为10类,则 k∈{1,2,3,...,10},引入了ak作为聚类结果的潜在向量。The clustering results are then embedded into a vector. For example, if the clustering result is 10 classes, then k∈{1,2,3,...,10}, introducing a k as the latent vector of the clustering result.
为了更好的学习兴趣点的嵌入向量,本发明集成兴趣点位置空间的潜 在向量ak和用户空间的潜在向量然后将其输入到多层感知器中得到兴 趣点嵌入向量。如下所述:In order to better learn the embedding vector of the interest point, the present invention integrates the latent vector a k of the position space of the interest point and the latent vector of the user space It is then fed into a multilayer perceptron to get the interest point embedding vector. as described below:
d2=σ(W2·c1+b2)d 2 =σ(W 2 ·c 1 +b 2 )
......
zj=σ(W·cl-1+b2)z j =σ(W·c l-1 +b 2 )
本发明将设计推荐任务来学习模型参数。推荐任务包括兴趣点排名和 评分预测等多种推荐任务,本发明将提出的基于图神经网络的兴趣点推荐 模型应用到评分预测的推荐任务中,利用用户和兴趣点的嵌入向量(hi和 zj),首先先将他们连接起来,然后将其输入到MLP进行评分预测,如下 所示:The present invention will design recommendation tasks to learn model parameters. Recommendation tasks include a variety of recommendation tasks such as POI ranking and scoring prediction. The present invention applies the proposed graph neural network-based POI recommendation model to the recommendation task of scoring prediction, using the embedding vectors of users and POIs ( hi and z j ), first concatenate them and then feed them to MLP for rating prediction as follows:
g2=σ(W2·g1+b2)g 2 =σ(W 2 ·g 1 +b 2 )
......
gl-1=σ(Wl·gl-1+bl)g l-1 =σ(W l ·g l-1 +b l )
r′ij=WT·gl-1 r′ ij = W T ·g l-1
其中l为隐藏层的索引,r′ij为预测的评分。where l is the index of the hidden layer and r'ij is the predicted score.
为了学习模型的参数,需指定一个目标函数进行优化,由于本发明在 这项工作中的主要任务是评分预测,所以采用一个常用的目标函数:In order to learn the parameters of the model, an objective function needs to be specified for optimization. Since the main task of the present invention in this work is scoring prediction, a commonly used objective function is adopted:
本发明使用Yelp数据集进行测试,Yelp是全球最大的点评网站之一, 它允许用户对商家进行评论或评分。本发明的实验数据截取经度在-112.0 和-111.9之间、纬度在33.3和33.45之间的数据作为实验数据,为了保证 数据质量,过滤掉用户评分次数少于5次、兴趣点被评分次数少于5次的 用户和兴趣点。社交图中的顶点代表用户,边代表用户和用户之间的朋友 关系。用户-兴趣点交互图中的顶点由用户和兴趣点组成,边代表用户和 兴趣点之间存在评分行为。最终数据包括3870名用户、2301个兴趣点以 及57756条评分记录,社交图中包含78577条边。The present invention is tested using the Yelp data set, which is one of the largest review websites in the world, which allows users to review or rate businesses. The experimental data of the present invention intercepts data whose longitude is between -112.0 and -111.9 and latitude between 33.3 and 33.45 as experimental data. In order to ensure data quality, the number of user ratings is less than 5, and the number of points of interest is less than rated. on 5 users and points of interest. The vertices in the social graph represent users, and the edges represent the friend relationship between users and users. The vertices in the user-POI interaction graph are composed of users and POIs, and the edges represent the existence of scoring behavior between users and POIs. The final data includes 3870 users, 2301 points of interest, and 57756 rating records, and the social graph contains 78577 edges.
将基于图神经网络的兴趣点推荐方法(简称为Graph-POIR)分别与五 个模型进行对比,现对本发明提出的模型以及五个模型介绍如下:The method for recommending points of interest based on a graph neural network (referred to as Graph-POIR) is compared with five models respectively, and now the model proposed by the present invention and the five models are introduced as follows:
PMF:概率矩阵分解仅使用用户-项目评级矩阵,并采用高斯分布对 用户和项目的潜在因素进行建模。PMF: Probabilistic Matrix Factorization uses only the user-item rating matrix and uses a Gaussian distribution to model latent factors for users and items.
GraphRec:是近年来新提出的图神经网络推荐模型。该模型考虑了社 交图中用户和用户之间的一阶朋友关系以及用户-兴趣点交互图中的评分 和交互信息。GraphRec: It is a newly proposed graph neural network recommendation model in recent years. The model takes into account first-order friend relationships between users and users in the social graph and ratings and interaction information in the user-POI interaction graph.
NeuMF:该推荐模型是利用多层神经网络实现的矩阵分解,是近年来 新提出的经典推荐模型。该模型使用了多个隐藏层来捕获它们的非线性特 征交互,能够更有效的捕获用户与项目之间的隐式/潜在关联关系。NeuMF: This recommendation model is a matrix decomposition implemented by a multi-layer neural network, and is a new classical recommendation model proposed in recent years. The model uses multiple hidden layers to capture their nonlinear feature interactions, which can more effectively capture the implicit/latent associations between users and items.
GCMC:模型采用GCN编码器得到用户、项目的向量表示,但仅考 虑了用户-项目图中的一阶邻居节点。GCMC: The model uses the GCN encoder to obtain the vector representation of users and items, but only considers the first-order neighbor nodes in the user-item graph.
NGCF:将用户-项目交互数据构建成二部图结构,通过嵌入传播层, 协同信息以高阶连接的形式进行显式编码以获得用户、项目的向量表示。NGCF: The user-item interaction data is constructed into a bipartite graph structure. By embedding the propagation layer, the collaborative information is explicitly encoded in the form of high-order connections to obtain the vector representation of users and items.
Graph-POIR:本发明提出的模型,在用户嵌入向量、兴趣点嵌入向量 中考虑了用户-兴趣点交互图中的交互信息以及评分信息。兴趣点嵌入向 量中还嵌入了兴趣点的位置信息,用户嵌入向量中考虑了用户的高阶朋友 以及亲密度。Graph-POIR: The model proposed by the present invention considers the interaction information and rating information in the user-interest point interaction graph in the user embedding vector and the interest point embedding vector. The location information of the POI is also embedded in the POI embedding vector, and the user's high-level friends and intimacy are considered in the user embedding vector.
基于图神经网络的兴趣点推荐方法使用Python语言基于Pytorch框架 实现,电脑配置为CPU i7-8700K3.7GHz,操作系统为Ubuntu 18.04.1。在 实验中,学习率设置为0.001,采用RMSprop作为优化器。参数初始化采 用高斯分布(均值和标准差分别为0和0.01)初始化模型的参数。随机选取 80%、60%的数据作为训练集,其余20%、40%的数据作为测试集,考虑 了不同比例下的训练集和测试集对本发明方法推荐结果的影响。The recommendation method of points of interest based on graph neural network is implemented using Python language based on Pytorch framework. The computer configuration is CPU i7-8700K 3.7GHz, and the operating system is Ubuntu 18.04.1. In the experiments, the learning rate is set to 0.001 and RMSprop is adopted as the optimizer. Parameter initialization uses a Gaussian distribution (mean and standard deviation of 0 and 0.01, respectively) to initialize the parameters of the model. 80% and 60% of the data are randomly selected as the training set, and the remaining 20% and 40% of the data are used as the test set, and the influence of the training set and the test set in different proportions on the recommended results of the method of the present invention is considered.
在k-means聚类算法中,分别实现了位置因素的聚类个数k为10、20、 30、40、50、60、70、80、90、100。嵌入层的大小分别设置为16、32、 64、128、256进行试验,测试结果表明,嵌入层大小为64时推荐性能最 佳。多层感知器中隐藏层的激活函数使用Relu函数。In the k-means clustering algorithm, the clustering number k of the location factor is 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, respectively. The size of the embedding layer is set to 16, 32, 64, 128, and 256 for experiments, and the test results show that the recommended performance is the best when the size of the embedding layer is 64. The activation function of the hidden layer in the multilayer perceptron uses the Relu function.
为了评估本发明提出的推荐方法预测的准确率,实验采用平均绝对 误差(MAE)和均方根误差(RMSE)作为评价指标。In order to evaluate the prediction accuracy of the recommended method proposed by the present invention, mean absolute error (MAE) and root mean square error (RMSE) are used as evaluation indicators in the experiment.
其中ri′为预测值,ri为真实值。where ri ' is the predicted value, and ri is the actual value.
表1、2描述了当k=40且考虑了用户的二阶朋友时NGCF、GCMC、 NeuMF、GraphRec、PMF和Graph-POIR在数据集Yelp上RMSE和MAE 的情况。表1中80%的数据作为训练集,剩余20%的数据为测试集,表2 中60%的数据作为训练集,剩余的40%的数据为测试集。实验结果表明,Graph-POIR减小了评分预测误差,显著提高了推荐性能。如表1,与 GraphRec相比,Graph-POIR的RMSE和MAE分别减少了0.59%和1.38%, 原因是Graph-POIR不仅考虑了用户之间的亲密度和用户的高阶朋友,并 且将兴趣点的位置信息嵌入到兴趣点的嵌入向量中,而GraphRec没有考 虑用户的高阶朋友对用户行为产生的影响。NGCF、GCMC和NeuMF由 于没有考虑用户的社交信息,因此推荐效果不如GraphRec,从而也体现 了考虑本发明建模用户社交信息的方法能够提高推荐性能。与PMF和 NeuMF相比,GraphRec、NGCF、GCMC、Graph-POIR使用图嵌入技术, 体现了图嵌入技术在兴趣点推荐中的有效性。PMF和NeuMF只使用了用 户和兴趣点之间的评分,NeuMF的推荐性能优于PMF,说明神经网络在 推荐中能够提高推荐性能。Tables 1 and 2 describe the RMSE and MAE of NGCF, GCMC, NeuMF, GraphRec, PMF and Graph-POIR on the dataset Yelp when k=40 and the user's second-order friends are considered. 80% of the data in Table 1 is used as the training set, the remaining 20% of the data is the test set, 60% of the data in Table 2 is used as the training set, and the remaining 40% of the data is the test set. Experimental results show that Graph-POIR reduces the rating prediction error and significantly improves the recommendation performance. As shown in Table 1, compared with GraphRec, the RMSE and MAE of Graph-POIR are reduced by 0.59% and 1.38%, respectively, the reason is that Graph-POIR not only considers the intimacy between users and users’ high-order friends, but also considers points of interest. The location information is embedded into the embedding vector of interest points, while GraphRec does not consider the influence of the user's higher-order friends on the user's behavior. Because NGCF, GCMC and NeuMF do not consider the user's social information, the recommendation effect is not as good as GraphRec, which also reflects that the method of considering the user's social information in the present invention can improve the recommendation performance. Compared with PMF and NeuMF, GraphRec, NGCF, GCMC, Graph-POIR use graph embedding technology, which reflects the effectiveness of graph embedding technology in POI recommendation. PMF and NeuMF only use the scores between users and points of interest, and the recommendation performance of NeuMF is better than that of PMF, indicating that neural networks can improve the recommendation performance in recommendation.
为了进一步测试本发明提出的Graph-POIR在k-means算法中不同参 数k下的性能,图4和图5分别给出了Graph-POIR的RMSE以及MAE 的变化情况。通过图4和图5可以看出,当k=70时,Graph-POIR的 MAE取得了最小值,当k=40时,Graph-POIR的RMSE取得了最小值。 随着k值的变化,MAE和RMSE的变化曲线没有规律,造成这种现象的 原因是k-means算法的初始聚类中心对聚类结果有较大的影响,而k- means算法的初始聚类中心是随机选择的,在日后工作中,将考虑对k- means算法中初始节点的选择进行改善。In order to further test the performance of the Graph-POIR proposed by the present invention under different parameters k in the k-means algorithm, Fig. 4 and Fig. 5 respectively show the changes of the RMSE and MAE of the Graph-POIR. It can be seen from Figure 4 and Figure 5 that when k=70, the MAE of Graph-POIR achieves the minimum value, and when k=40, the RMSE of Graph-POIR achieves the minimum value. With the change of k value, the change curves of MAE and RMSE are irregular. The reason for this phenomenon is that the initial clustering center of k-means algorithm has a great influence on the clustering results, while the initial clustering center of k-means algorithm has a great influence on the clustering results. The class center is randomly selected, and in future work, we will consider improving the selection of initial nodes in the k-means algorithm.
表3中给出了Graph-POIR在用户嵌入向量时考虑了高阶朋友对算法 的影响。Graph-POIR-1代表Graph-POIR在用户嵌入向量中只考虑了一阶 朋友,Graph-POIR-2代表Graph-POIR在用户嵌入向量中考虑了用户的二 阶朋友。实验结果如表3所示,Graph-POIR-2的RMSE和MAE均高于 Graph-POIR-1,体现了在用户嵌入向量建模时考虑用户多阶朋友的必要性。Table 3 shows that Graph-POIR considers the impact of higher-order friends on the algorithm when users embed vectors. Graph-POIR-1 represents that Graph-POIR only considers first-order friends in the user embedding vector, and Graph-POIR-2 represents that Graph-POIR considers the user's second-order friends in the user embedding vector. The experimental results are shown in Table 3. The RMSE and MAE of Graph-POIR-2 are higher than those of Graph-POIR-1, which reflects the necessity of considering users' multi-order friends when modeling user embedding vectors.
表3 RMSE和MAE的变化情况Table 3 Changes in RMSE and MAE
本发明提出了一种基于图神经网络的兴趣点推荐方法,首先构建用户 -兴趣点交互图和用户社交图,图神经网络学习图结构信息在用户的嵌入 向量中整合了协作信息和社交信息,并且考虑了社交图中的高阶朋友以及 朋友之间的亲密度;同时,采用k-means算法对兴趣点按地理位置进行聚 类,将聚类结果嵌入到向量中,连接通过在用户-兴趣点交互图中得到的 嵌入向量,将其输入到神经网络中得到兴趣点嵌入向量;最后,构建一个 神经网络模型,模拟机器学习中的矩阵分解方法,根据用户的评分行为进 行评分预测。实验结果表明,与现有兴趣点推荐模型相比,本发明提出的 推荐模型达到了更好的推荐效果。在接下来的工作中,将考虑改善k- means算法中初始节点的选择。The invention proposes a method for recommending points of interest based on a graph neural network. First, a user-interest point interaction graph and a user social graph are constructed. The graph neural network learns the graph structure information and integrates collaboration information and social information in the user's embedding vector. In addition, the high-level friends in the social graph and the intimacy between friends are considered; at the same time, the k-means algorithm is used to cluster the points of interest by geographic location, and the clustering results are embedded in the vector, and the connection is made through the user-interest. The embedded vector obtained in the point interaction graph is input into the neural network to obtain the embedded vector of interest points; finally, a neural network model is constructed to simulate the matrix decomposition method in machine learning, and the scoring prediction is performed according to the user's scoring behavior. The experimental results show that, compared with the existing point-of-interest recommendation model, the recommendation model proposed by the present invention achieves a better recommendation effect. In the following work, we will consider improving the selection of initial nodes in the k-means algorithm.
以上所述是本发明的优选实施方式而已,当然不能以此来限定本发明 之权利范围,应当指出,对于本技术领域的普通技术人员来说,在不脱离 本发明原理的前提下,还可以做出若干改进和变动,这些改进和变动也视 为本发明的保护范围。The above descriptions are only the preferred embodiments of the present invention, of course, it cannot limit the scope of rights of the present invention. It should be pointed out that for those skilled in the art, without departing from the principles of the present invention, Several improvements and changes are made, and these improvements and changes are also regarded as the protection scope of the present invention.
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010965903.9A CN112084427A (en) | 2020-09-15 | 2020-09-15 | Interest point recommendation method based on graph neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010965903.9A CN112084427A (en) | 2020-09-15 | 2020-09-15 | Interest point recommendation method based on graph neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112084427A true CN112084427A (en) | 2020-12-15 |
Family
ID=73737126
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010965903.9A Pending CN112084427A (en) | 2020-09-15 | 2020-09-15 | Interest point recommendation method based on graph neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112084427A (en) |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112435079A (en) * | 2020-12-16 | 2021-03-02 | 合肥工业大学 | Advertisement recommendation method oriented to pure social platform |
CN112580789A (en) * | 2021-02-22 | 2021-03-30 | 支付宝(杭州)信息技术有限公司 | Training graph coding network, and method and device for predicting interaction event |
CN112651487A (en) * | 2020-12-21 | 2021-04-13 | 广东交通职业技术学院 | Data recommendation method, system and medium based on graph collapse convolution neural network |
CN112837095A (en) * | 2021-02-01 | 2021-05-25 | 支付宝(杭州)信息技术有限公司 | Object distribution method and system |
CN112948668A (en) * | 2021-02-04 | 2021-06-11 | 深圳大学 | Information recommendation method, electronic device and storage medium |
CN113158088A (en) * | 2021-04-16 | 2021-07-23 | 桂林电子科技大学 | Position recommendation method based on graph neural network |
CN113344177A (en) * | 2021-05-10 | 2021-09-03 | 电子科技大学 | Depth recommendation method based on graph attention |
CN113407817A (en) * | 2021-01-25 | 2021-09-17 | 北京工业大学 | Attention mechanism-based graph nerve collaborative filtering method |
CN113468227A (en) * | 2021-06-25 | 2021-10-01 | 北京达佳互联信息技术有限公司 | Information recommendation method, system, device and storage medium based on graph neural network |
CN113568410A (en) * | 2021-07-29 | 2021-10-29 | 西安交通大学 | A kind of heterogeneous agent trajectory prediction method, system, equipment and medium |
CN113742597A (en) * | 2021-09-18 | 2021-12-03 | 辽宁工程技术大学 | Interest point recommendation method based on LBSN (location based service) and multi-graph fusion |
CN114036406A (en) * | 2021-11-04 | 2022-02-11 | 南京大学 | Recommendation method and system based on graph contrast learning and social network enhancement |
CN114154080A (en) * | 2021-12-07 | 2022-03-08 | 西安邮电大学 | A dynamic social recommendation method based on graph neural network |
CN114169418A (en) * | 2021-11-30 | 2022-03-11 | 北京百度网讯科技有限公司 | Label recommendation model training method and device, and label obtaining method and device |
CN114201682A (en) * | 2021-12-14 | 2022-03-18 | 云南大学 | Graph neural network recommendation method and system fusing social relationship and semantic relationship |
CN114266353A (en) * | 2021-12-23 | 2022-04-01 | 北京邮电大学 | A Design Method of Collaborative Filtering Model Based on Graph Neural Network |
CN114282120A (en) * | 2021-12-06 | 2022-04-05 | 中电万维信息技术有限责任公司 | Graph embedding interest point recommendation algorithm incorporating multi-dimensional relationships |
CN114722283A (en) * | 2022-04-11 | 2022-07-08 | 西北工业大学 | A combined recommendation algorithm of points of interest based on location social network |
CN115270007A (en) * | 2022-08-17 | 2022-11-01 | 烟台大学 | A POI recommendation method and system based on hybrid graph neural network |
CN115270005A (en) * | 2022-09-30 | 2022-11-01 | 腾讯科技(深圳)有限公司 | Information recommendation method, device, equipment and storage medium |
CN117591751A (en) * | 2024-01-19 | 2024-02-23 | 国网湖北省电力有限公司信息通信公司 | Interest point recommendation method and system based on graph embedding and contextual loyalty fusion |
CN118193853A (en) * | 2024-05-16 | 2024-06-14 | 烟台大学 | A method, system, device and storage medium for recommending points of interest for random groups |
CN119226612A (en) * | 2024-09-13 | 2024-12-31 | 成都锦城学院 | A social network user recommendation method and system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108875007A (en) * | 2018-06-15 | 2018-11-23 | 腾讯科技(深圳)有限公司 | The determination method and apparatus of point of interest, storage medium, electronic device |
CN110334245A (en) * | 2019-05-20 | 2019-10-15 | 山东大学 | A short video recommendation method and device based on time-series attribute graph neural network |
US20200160126A1 (en) * | 2018-11-15 | 2020-05-21 | Mobileye Vision Technologies Ltd. | Fast cnn classification of multi-frame semantic signals |
CN111241306A (en) * | 2020-01-21 | 2020-06-05 | 浙江大学 | A Path Planning Method Based on Knowledge Graph and Pointer Network |
-
2020
- 2020-09-15 CN CN202010965903.9A patent/CN112084427A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108875007A (en) * | 2018-06-15 | 2018-11-23 | 腾讯科技(深圳)有限公司 | The determination method and apparatus of point of interest, storage medium, electronic device |
US20200160126A1 (en) * | 2018-11-15 | 2020-05-21 | Mobileye Vision Technologies Ltd. | Fast cnn classification of multi-frame semantic signals |
CN110334245A (en) * | 2019-05-20 | 2019-10-15 | 山东大学 | A short video recommendation method and device based on time-series attribute graph neural network |
CN111241306A (en) * | 2020-01-21 | 2020-06-05 | 浙江大学 | A Path Planning Method Based on Knowledge Graph and Pointer Network |
Non-Patent Citations (2)
Title |
---|
WENQI FAN等: "Graph Neural Networks for Social Recommendation", 《IN PROCEEDINGS OF THE 2019 WORLD WIDE WEB CONFERENCE (WWW ’19)》, pages 2 - 3 * |
孟祥福等: "用户−兴趣点耦合关系的兴趣点推荐方法", 《智能系统学报》, pages 2 - 3 * |
Cited By (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112435079A (en) * | 2020-12-16 | 2021-03-02 | 合肥工业大学 | Advertisement recommendation method oriented to pure social platform |
CN112435079B (en) * | 2020-12-16 | 2022-09-16 | 合肥工业大学 | Advertisement recommendation method oriented to pure social platform |
CN112651487B (en) * | 2020-12-21 | 2021-07-27 | 广东交通职业技术学院 | Data recommendation method, system and medium based on graph collapse convolution neural network |
CN112651487A (en) * | 2020-12-21 | 2021-04-13 | 广东交通职业技术学院 | Data recommendation method, system and medium based on graph collapse convolution neural network |
CN113407817A (en) * | 2021-01-25 | 2021-09-17 | 北京工业大学 | Attention mechanism-based graph nerve collaborative filtering method |
CN112837095A (en) * | 2021-02-01 | 2021-05-25 | 支付宝(杭州)信息技术有限公司 | Object distribution method and system |
CN112948668A (en) * | 2021-02-04 | 2021-06-11 | 深圳大学 | Information recommendation method, electronic device and storage medium |
CN112580789A (en) * | 2021-02-22 | 2021-03-30 | 支付宝(杭州)信息技术有限公司 | Training graph coding network, and method and device for predicting interaction event |
CN113158088A (en) * | 2021-04-16 | 2021-07-23 | 桂林电子科技大学 | Position recommendation method based on graph neural network |
CN113344177A (en) * | 2021-05-10 | 2021-09-03 | 电子科技大学 | Depth recommendation method based on graph attention |
CN113344177B (en) * | 2021-05-10 | 2022-10-14 | 电子科技大学 | Deep Recommendation Method Based on Graph Attention |
CN113468227A (en) * | 2021-06-25 | 2021-10-01 | 北京达佳互联信息技术有限公司 | Information recommendation method, system, device and storage medium based on graph neural network |
CN113468227B (en) * | 2021-06-25 | 2024-05-24 | 北京达佳互联信息技术有限公司 | Information recommendation method, system, equipment and storage medium based on graph neural network |
CN113568410A (en) * | 2021-07-29 | 2021-10-29 | 西安交通大学 | A kind of heterogeneous agent trajectory prediction method, system, equipment and medium |
CN113742597A (en) * | 2021-09-18 | 2021-12-03 | 辽宁工程技术大学 | Interest point recommendation method based on LBSN (location based service) and multi-graph fusion |
CN114036406A (en) * | 2021-11-04 | 2022-02-11 | 南京大学 | Recommendation method and system based on graph contrast learning and social network enhancement |
CN114169418A (en) * | 2021-11-30 | 2022-03-11 | 北京百度网讯科技有限公司 | Label recommendation model training method and device, and label obtaining method and device |
CN114282120A (en) * | 2021-12-06 | 2022-04-05 | 中电万维信息技术有限责任公司 | Graph embedding interest point recommendation algorithm incorporating multi-dimensional relationships |
CN114154080A (en) * | 2021-12-07 | 2022-03-08 | 西安邮电大学 | A dynamic social recommendation method based on graph neural network |
CN114201682A (en) * | 2021-12-14 | 2022-03-18 | 云南大学 | Graph neural network recommendation method and system fusing social relationship and semantic relationship |
CN114266353A (en) * | 2021-12-23 | 2022-04-01 | 北京邮电大学 | A Design Method of Collaborative Filtering Model Based on Graph Neural Network |
CN114722283A (en) * | 2022-04-11 | 2022-07-08 | 西北工业大学 | A combined recommendation algorithm of points of interest based on location social network |
CN115270007B (en) * | 2022-08-17 | 2023-09-22 | 烟台大学 | A POI recommendation method and system based on hybrid graph neural network |
CN115270007A (en) * | 2022-08-17 | 2022-11-01 | 烟台大学 | A POI recommendation method and system based on hybrid graph neural network |
CN115270005B (en) * | 2022-09-30 | 2022-12-23 | 腾讯科技(深圳)有限公司 | Information recommendation method, device, equipment and storage medium |
CN115270005A (en) * | 2022-09-30 | 2022-11-01 | 腾讯科技(深圳)有限公司 | Information recommendation method, device, equipment and storage medium |
CN117591751A (en) * | 2024-01-19 | 2024-02-23 | 国网湖北省电力有限公司信息通信公司 | Interest point recommendation method and system based on graph embedding and contextual loyalty fusion |
CN117591751B (en) * | 2024-01-19 | 2024-04-26 | 国网湖北省电力有限公司信息通信公司 | Picture embedding-based interest point recommendation method and system based on upper-lower Wen Zhongcheng-degree fusion |
CN118193853A (en) * | 2024-05-16 | 2024-06-14 | 烟台大学 | A method, system, device and storage medium for recommending points of interest for random groups |
CN119226612A (en) * | 2024-09-13 | 2024-12-31 | 成都锦城学院 | A social network user recommendation method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112084427A (en) | Interest point recommendation method based on graph neural network | |
Guo et al. | Combining geographical and social influences with deep learning for personalized point-of-interest recommendation | |
CN111127142B (en) | An item recommendation method based on generalized neural attention | |
Dai et al. | Spatio-temporal representation learning with social tie for personalized poi recommendation | |
CN109948066B (en) | Interest point recommendation method based on heterogeneous information network | |
CN118377979B (en) | Interest point recommendation method and medium based on time sequence knowledge graph and electronic equipment | |
CN113590976A (en) | Recommendation method of space self-adaptive graph convolution network | |
Zhou et al. | Deepmove: Learning place representations through large scale movement data | |
CN106776928B (en) | A location recommendation method based on an in-memory computing framework and integrating social time and space data | |
CN111931052A (en) | Context-aware recommendation method and system based on feature interaction graph neural network | |
CN109062962A (en) | A kind of gating cycle neural network point of interest recommended method merging Weather information | |
CN114491247A (en) | Recommendation method based on knowledge graph and long-term and short-term interests of user | |
CN114385930B (en) | A method and system for recommending points of interest | |
CN114510646A (en) | Neural network collaborative filtering recommendation method based on federal learning | |
Fang et al. | A top-k POI recommendation approach based on LBSN and multi-graph fusion | |
CN111680228A (en) | Matrix factorization POI recommendation method based on geographic location fusion of social influence and category popularity | |
Chen et al. | A temporal recommendation mechanism based on signed network of user interest changes | |
Wang et al. | Collective geographical embedding for geolocating social network users | |
Xu et al. | SSSER: Spatiotemporal sequential and social embedding rank for successive point-of-interest recommendation | |
He et al. | Next point-of-interest recommendation via a category-aware Listwise Bayesian Personalized Ranking | |
Liu et al. | VGMF: visual contents and geographical influence enhanced point‐of‐interest recommendation in location‐based social network | |
Lin | Implementation of personalized scenic spot recommendation algorithm based on generalized regression neural network for 5G smart tourism system | |
Zhang et al. | Session‐Based Graph Attention POI Recommendation Network | |
Zhao et al. | A Hierarchical Attention Recommender System Based on Cross‐Domain Social Networks | |
Chen et al. | Enabling foundation models: A distributed collaboration framework based on graph federated learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20201215 |