[go: up one dir, main page]

CN110232152A - Content recommendation method, device, server and storage medium - Google Patents

Content recommendation method, device, server and storage medium Download PDF

Info

Publication number
CN110232152A
CN110232152A CN201910447811.9A CN201910447811A CN110232152A CN 110232152 A CN110232152 A CN 110232152A CN 201910447811 A CN201910447811 A CN 201910447811A CN 110232152 A CN110232152 A CN 110232152A
Authority
CN
China
Prior art keywords
vector
content
characteristic information
user
recommended
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910447811.9A
Other languages
Chinese (zh)
Other versions
CN110232152B (en
Inventor
林锡雄
贺杰
张文郁
张磊
张珩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201910447811.9A priority Critical patent/CN110232152B/en
Publication of CN110232152A publication Critical patent/CN110232152A/en
Application granted granted Critical
Publication of CN110232152B publication Critical patent/CN110232152B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2135Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on approximation criteria, e.g. principal component analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the present application discloses a kind of content recommendation method, device, server and storage medium;The user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of the available target user of the embodiment of the present application, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship include the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended;Feature space mapping is carried out to user's characteristic information, obtains the corresponding user's characteristic information vector of user's characteristic information;Calculate the vector similarity in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector;Object content characteristic information vector is determined from content characteristic information vector based on vector similarity;The corresponding object content of object content characteristic information vector is chosen based on content DUAL PROBLEMS OF VECTOR MAPPING set of relationship;Recommend object content to target user.The embodiment of the present application can promote the accuracy of commending contents by obtaining content to be recommended similar with user's characteristic information as object content.

Description

Content recommendation method, device, server and storage medium
Technical field
This application involves Internet technical fields, and in particular to a kind of content recommendation method, device, server and storage Medium.
Background technique
In the information age, Infomiation Production and the consumption encouragement rapid development of information industry and information technology.Currently, it interconnects Net has become important information source, however, the swift and violent growth of the huge scale of internet and information resources brings letter The problem of breath overload, although that is, current information is resourceful, people are difficult effectively to obtain useful information.
It would generally recommend its interested content to user at present, but obtain the accuracy of user interest content at present It is low.
Summary of the invention
The embodiment of the present application provides a kind of content recommendation method, device, server and storage medium, can promote content The accuracy of recommendation.
The embodiment of the present application provides a kind of content recommendation method, comprising:
Obtain the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user, the content DUAL PROBLEMS OF VECTOR MAPPING Set of relationship includes the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended;
Feature space mapping is carried out to the user's characteristic information, obtains the corresponding user characteristics of the user's characteristic information Information vector;
It calculates in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector Vector similarity;
Object content characteristic information vector is determined from content characteristic information vector based on the vector similarity;
The corresponding object content of object content characteristic information vector is chosen based on the content DUAL PROBLEMS OF VECTOR MAPPING set of relationship;
Recommend the object content to target user.
In some embodiments, the user's characteristic information includes user characteristics content and user characteristics vector, to described User's characteristic information carries out feature space mapping, obtains the corresponding user's characteristic information vector of the user's characteristic information, comprising:
By the user characteristics content map into default vector space, user characteristics content vector is obtained;
The user characteristics vector and user characteristics content vector are subjected to Vector Fusion, obtain the user's characteristic information Corresponding user's characteristic information vector.
In some embodiments, the user characteristics content map is obtained in user characteristics into default vector space Hold vector, comprising:
Word segmentation processing is carried out to the user characteristics content, obtains paragraph sequence, the paragraph sequence includes at least a kind of Paragraph;
The frequency of occurrences of the every kind of paragraph in the paragraph sequence is counted, the corresponding paragraph frequency of every kind of paragraph is obtained;
The paragraph that paragraph frequency meets preset threshold is chosen from the paragraph sequence, obtains target paragraph;
The corresponding user's characteristic information vector of the user's characteristic information is constructed according to the target paragraph.
In some embodiments, the user's characteristic information vector includes user's characteristic information high dimension vector, to the use Family characteristic information carries out feature space mapping, obtains the corresponding user's characteristic information vector of the user's characteristic information, comprising:
Dimension-reduction treatment is carried out to the user's characteristic information high dimension vector, obtains user's characteristic information low-dimensional vector;
The content characteristic information vector includes content characteristic information low-dimensional vector, calculates the user's characteristic information vector With the vector similarity in content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector, comprising:
Calculate content characteristic information low-dimensional vector in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Between vector similarity.
In some embodiments, dimension-reduction treatment is carried out to the user's characteristic information high dimension vector, obtains user characteristics letter Cease low-dimensional vector, comprising:
Using in Matching Model connection weight and biasing the user's characteristic information high dimension vector be weighted ask And processing, user's characteristic information high dimension vector after being handled, the Matching Model is by being labelled with the instruction of true vector similarity Practice sample training to form;
Dimensionality reduction is carried out to user's characteristic information high dimension vector after the processing using the default dimensionality reduction function in Matching Model Processing, obtains user's characteristic information low-dimensional vector.
In some embodiments, using in Matching Model connection weight and biasing to the user's characteristic information higher-dimension Vector is weighted summation process, user's characteristic information high dimension vector after being handled, comprising:
It is special to the user according to connection weight and biasing when the connection weight in the Matching Model is nonnegative number Reference breath high dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled;
When the connection weight in the Matching Model is negative, according to the absolute value of connection weight and biasing to described User's characteristic information high dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled.
In some embodiments, using in Matching Model connection weight and biasing to the user's characteristic information higher-dimension Vector is weighted summation process, after being handled before user's characteristic information high dimension vector, further includes:
Training sample is obtained, the training sample includes user's training sample, content training sample, the training sample mark The true vector similarity between user's training sample and content training sample is infused;
Initial matching model is trained using user's training sample, content training sample, obtains user's training Predicted vector similarity between sample and content training sample;
The initial matching model is restrained according to the true vector similarity and predicted vector similarity, is obtained Matching Model after training.
In some embodiments, the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user are obtained, is wrapped It includes:
Obtain the user's characteristic information of target user and the content characteristic information of content to be recommended, the content characteristic Information includes the feature of content to be recommended, content feature vector;
The feature of the content to be recommended is mapped in default vector space, is obtained in the feature of content to be recommended Hold vector;
The content feature vector and the feature vector of content to be recommended are subjected to Vector Fusion, obtained in be recommended The content characteristic information vector of appearance;
According to the mapping relations building between the content to be recommended and the content characteristic information vector of content to be recommended Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
In some embodiments, the content characteristic information vector includes content characteristic information high dimension vector, will be described interior The feature vector for holding feature vector and content to be recommended carries out Vector Fusion, obtains the content characteristic information of content to be recommended After vector, further includes:
Dimension-reduction treatment is carried out to the content characteristic information high dimension vector, obtains content characteristic information low-dimensional vector;
According to the mapping relations building between the content to be recommended and the content characteristic information vector of content to be recommended Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship, comprising:
According to the mapping relations between the content to be recommended and the content characteristic information low-dimensional vector of content to be recommended Content construction DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
In some embodiments, it is special to calculate content in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Levy the vector similarity between information vector, comprising:
Calculate the vector length of the user's characteristic information vector sum content characteristic information vector;
Calculate the inner product of vectors between the user's characteristic information vector sum content characteristic information vector;
The COS distance of the content characteristic information vector is calculated based on the inner product of vectors and vector length;
The COS distance is normalized, user's characteristic information vector and content characteristic information vector are obtained Vector similarity.
The embodiment of the present application also provides a kind of content recommendation device, comprising:
Acquiring unit, it is described for obtaining the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship includes that the mapping between content to be recommended and the content characteristic information vector of content to be recommended is closed System;
Map unit obtains the user's characteristic information for carrying out feature space mapping to the user's characteristic information Corresponding user's characteristic information vector;
Computing unit, for calculating content characteristic in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Vector similarity between information vector;
Determination unit, for determining object content feature letter from content characteristic information vector based on the vector similarity Cease vector;
Selection unit, it is corresponding for choosing object content characteristic information vector based on the content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Object content;
Recommendation unit, for recommending the content of the object content to target user.
In some embodiments, the user's characteristic information includes user characteristics content and user characteristics vector, described to reflect Penetrating unit includes:
Subelement is mapped, for into default vector space, obtaining in user characteristics the user characteristics content map Hold vector;
Subelement is merged, for the user characteristics vector and user characteristics content vector to be carried out Vector Fusion, is obtained The corresponding user's characteristic information vector of the user's characteristic information.
In some embodiments, the mapping subelement is specifically used for:
Word segmentation processing is carried out to the user characteristics content, obtains paragraph sequence, the paragraph sequence includes at least a kind of Paragraph;
The frequency of occurrences of the every kind of paragraph in the paragraph sequence is counted, the corresponding paragraph frequency of every kind of paragraph is obtained;
The paragraph that paragraph frequency meets preset threshold is chosen from the paragraph sequence, obtains target paragraph;
The corresponding user's characteristic information vector of the user's characteristic information is constructed according to the target paragraph.
The embodiment of the present application also provides a kind of storage medium, and the storage medium is stored with a plurality of instruction, and described instruction is suitable It is loaded in processor, to execute the step in any content recommendation method provided by the embodiment of the present application.
The embodiment of the present application also provides a kind of server, including memory is stored with a plurality of instruction;The processor is from institute It states and loads instruction in memory, to execute the step in any content recommendation method provided by the embodiment of the present application.
The user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of the available target user of the embodiment of the present application, Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship includes that the mapping between content to be recommended and the content characteristic information vector of content to be recommended is closed System;Feature space mapping is carried out to user's characteristic information, obtains the corresponding user's characteristic information vector of user's characteristic information;It calculates Vector similarity in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector;It is based on Vector similarity determines object content characteristic information vector from content characteristic information vector;Based on content DUAL PROBLEMS OF VECTOR MAPPING set of relations It closes and chooses the corresponding object content of object content characteristic information vector;Recommend object content to target user.In this application, may be used Using chosen by the user's characteristic information vector of target user content to be recommended similar with target user's characteristic information as Object content, to promote the accuracy of commending contents.
Detailed description of the invention
In order to more clearly explain the technical solutions in the embodiments of the present application, make required in being described below to embodiment Attached drawing is briefly described, it should be apparent that, the drawings in the following description are only some examples of the present application, for For those skilled in the art, without creative efforts, it can also be obtained according to these attached drawings other attached Figure.
Fig. 1 a is the schematic diagram of a scenario of content recommendation method provided by the embodiments of the present application;
Fig. 1 b is the first flow diagram of content recommendation method provided by the embodiments of the present application;
Fig. 1 c is the basic structural schematic diagram of the DSSM model of content recommendation method provided by the embodiments of the present application;
Fig. 1 d is the relationship signal between the included angle cosine value of content recommendation method provided by the embodiments of the present application and vector Figure;
Fig. 2 a is the system structure diagram of content recommendation method provided by the embodiments of the present application;
Fig. 2 b is second of flow diagram of content recommendation method provided by the embodiments of the present application;
Fig. 2 c is the model structure schematic diagram of the DSSM of content recommendation method provided by the embodiments of the present application;
Fig. 2 d is the network architecture schematic diagram of the DSSM of content recommendation method provided by the embodiments of the present application;
Fig. 3 a is the first structural schematic diagram of content recommendation device provided by the embodiments of the present application;
Fig. 3 b is second of structural schematic diagram of content recommendation device provided by the embodiments of the present application;
Fig. 4 is the structural schematic diagram of server provided by the embodiments of the present application.
Specific embodiment
Below in conjunction with the attached drawing in the embodiment of the present application, technical solutions in the embodiments of the present application carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of embodiments of the present application, instead of all the embodiments.It is based on Embodiment in the application, those skilled in the art's every other implementation obtained without creative efforts Example, shall fall in the protection scope of this application.
The embodiment of the present application provides a kind of content recommendation method, device, server and storage medium.
Wherein, which specifically can integrate in the electronic device, which can be terminal, service The equipment such as device.Wherein, terminal can be mobile phone, tablet computer, smart bluetooth equipment, laptop or PC Equipment such as (Personal Computer, PC).Server can be single server, be also possible to be made of multiple servers Server cluster.In one embodiment, which can also be integrated in multiple electronic equipments, for example, interior Holding recommendation apparatus can also be integrated in multiple servers, and present context recommended method is realized by multiple servers.
With reference to Fig. 1 a, which can be server, the user's characteristic information of the available user terminal of the server, And content DUAL PROBLEMS OF VECTOR MAPPING set of relationship, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship include the content of content to be recommended Yu content to be recommended Mapping relations between characteristic information vector;Feature space mapping is carried out to user's characteristic information, obtains user's characteristic information pair The user's characteristic information vector answered;Calculate content characteristic information in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Vector similarity between vector;Based on vector similarity from content characteristic information vector determine object content characteristic information to Amount;The corresponding object content of object content characteristic information vector is chosen based on content DUAL PROBLEMS OF VECTOR MAPPING set of relationship;To target user Recommend object content.
It is described in detail separately below.It should be noted that the serial number of following embodiment is not as preferably suitable to embodiment The restriction of sequence.
In the present embodiment, a kind of content recommendation method is provided, which can be by server or terminal To execute.The embodiment of the present application will be introduced for being executed by server, specifically, can be held by the processor of server Row.As shown in Figure 1 b, the detailed process of the content recommendation method can be such that
101, the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user, content DUAL PROBLEMS OF VECTOR MAPPING are obtained Set of relationship includes the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended.
Wherein, target user (Target customer), which refers to, provides the content of service or product using for it.For example, Target user can refer to currently in the upper user for registering, logging in and/or obtain service of goods of application.For example, target user can To be the user for being currently accessing application, currently log in the user, etc. of the application.
Wherein, user's characteristic information (Customer Information) is that target user carries personal letter in a network The data of breath.For example, classifying according to information type, user's characteristic information may include user description information, user behavior Information and subscriber association information.For example, user's characteristic information may include user's gender, historical viewings record, frequent contact, Etc..
For example, user description information can be the static data for reflecting user's essential attribute, such as gender, the surname of user Name, home address, income, personal label etc..User description information may come from the upload data of user itself, can also be with Upload data from other users, or carry out the collected user basic information of automatic network, etc..
For example, user behavior information can be the dynamic data for reflecting user behavior, preference, demand, for example, user purchases Buy record, spending amount, the historical viewings record etc. of service or product.Behavioral data can come from storage user behavior information Database, can be from the real-time record that local memory summarizes.
For example, subscriber association information refers to, reflection relevant to user behavior and influences the factors such as user behavior and psychology Related data.For example, user satisfaction, consumer loyalty degree, user's lifetime value etc..
Wherein, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship, which refers to, contains the set of at least one mapping relations, the mapping relations Reciprocal correspondence, the relationship of projection of (Relational Mapping) between content to be recommended and content characteristic information vector.
In some embodiments, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship can be saved with various data structures, for example, according to number According to the difference of structure type, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship can be with chained list, table, hash table (hash), array, tree (tree) etc. forms save.
For example, reference table 1, table 1 provide a kind of content DUAL PROBLEMS OF VECTOR MAPPING set of relationship saved in a table format:
Number Content to be recommended Content characteristic information vector
0x00 Content A Vector a
0x01 Content B Vector b
0x02 Content C Vector c
Table 1
The content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of above-mentioned table 1 contains 3 mapping relations set, and respectively number 0x00's reflects Penetrate the mapping relations [content of relationship [content A --- vector a], the mapping relations of 0x01 [content B --- vector b] and 0x02 C --- vector c].
Wherein, content to be recommended refers to that waiting is personalized the things or virtual things of recommendation.For example, content to be recommended can To refer to the virtual item data that can be recommended in database, and the related data of virtual things.For example, according to type point Class, content to be recommended may include event to be recommended (item) news such as to be recommended, hot list and article to be recommended (Object) commodity such as to be recommended, service, video, music.
Wherein, content characteristic information vector refers to the vector of characteristic information entrained by content to be recommended.For example, content characteristic Information vector can refer to it is being extracted from the information content entrained by content to be recommended, can express content characteristic to be recommended to Amount.
For example, content characteristic information vector can be in text content if content to be recommended carries content of text The term vector (Word embedding) of keyword, alternatively, if content to be recommended carries two dimensional image content, it is to be recommended interior The feature vector of appearance can be the textural characteristics, etc. that the two dimensional image is saved with vector format.
In some embodiments, the user's characteristic information for obtaining target user may include step in detail below:
A. user's request of target user is obtained;
B. inquiry user's characteristic information is requested according to user.
Wherein, user's request includes the identity information of request type and user, and request type is used to show the class of the request Type, subscriber identity information are the identity (identity, ID) for issuing the user of request, for example, subscriber identity information can be use The title at family, the number of user, network address of user etc..
For example, the format of user's request can be shown in reference table 2:
Request type Customs Assigned Number User's name
0x01 20150412092236 hello_world_007
Table 2
Obtain user request mode have it is a variety of, for example, directly read from local memory storage user request, connect Receive user's request, etc. that user terminal is sent.For example, the Video Applications can take from trend when user logs in Video Applications Business device sends user's request.
Pass through the identity information of user in inquiry user's request, the user's characteristic information of the available user.Wherein, it looks into The method of inquiry have it is a variety of, for example, sending the identity information of the user to network database servers by network, then obtain The user's characteristic information that the database server returns;For another example, the subscriber identity information pair is directly read in local memory The user's characteristic information answered.
In some embodiments, content characteristic information includes the feature of content to be recommended, content feature vector, In, feature is the content characteristic information saved in the content characteristic information of content to be recommended with scalar likeness in form, and content is special Levying vector is the content characteristic information saved in the form of vectors;When the content characteristic for existing simultaneously vector form and non-vector form When information, for obtaining step content DUAL PROBLEMS OF VECTOR MAPPING set of relationship, 101 can specifically include following sub-step:
(1) user's characteristic information of target user and the content characteristic information of content to be recommended are obtained.
Wherein, content characteristic information is to characteristic information entrained by sharing contents.For example, classify according to information type, Content characteristic information may include description information, the related information etc. to sharing contents.For example, content characteristic information may include Word content to sharing contents, the association to sharing contents share record to sharing contents, to the history of sharing contents, etc. Deng.
For example, when sharing contents are video content, the content characteristic information to sharing video frequency content may include this To the video-audio data of sharing video frequency content, video type, video tab, video name, video profile, viewing record, click volume, Uplink time, etc..
(2) feature is mapped in default vector space, obtains feature vector.
Wherein, default vector space refers to the linear space (vector space) of default dimension, for example, default vector is empty Between can for default dimension, can accommodate and support in the space of the movement or transformation that wherein occur.
For example, default vector space RnFeature is mapped to default vector by the real vector space for being n for a dimension Mode in space, which can be, is mapped to feature wherein by way of Linear Mapping.
For example, feature is mapped to by deep neural network (Deep Neural Network, DNN) preset to In quantity space, feature vector is obtained;Again for example, by feature operator (for example, local binary operator (Local Binary Pattern, LBP), Laplce (Laplace) operator, etc.) feature is mapped in default vector space, obtain spy Levy content vector, etc..
(3) the feature vector of content to be recommended and content feature vector are subjected to Vector Fusion, obtained in be recommended The content characteristic information vector of appearance.
Wherein, Vector Fusion refers to that multiple vectors are converted into a vector;According to the classification of amalgamation mode, Vector Fusion can be with Including vector splicing, addition of vectors, multiplication of vectors, etc..
For example, can using proper orthogonal decomposition by the feature vector of content to be recommended and content feature vector carry out to Amount splicing, to obtain the content characteristic information vector of content to be recommended.
In some embodiments, content characteristic information vector includes content characteristic information high dimension vector, in order to obtain content The feature vector of content feature vector and content to be recommended is carried out Vector Fusion by characteristic information low-dimensional vector, obtain to After the content characteristic information vector of recommendation, specifically can with the following steps are included:
Dimension-reduction treatment is carried out to content characteristic information high dimension vector, obtains content characteristic information low-dimensional vector.
Wherein, dimension-reduction treatment refers to using certain mapping method, by the Mapping of data points in former higher dimensional space to low dimensional Space in.The method of dimension-reduction treatment have it is a variety of, for example, according to the classification of type of dimension reduction method, dimension reduction method may include Explicit dimension reduction method, implicit dimension reduction method, linear dimension reduction method, Method of Nonlinear Dimensionality Reduction, etc..
And by dimension-reduction treatment, it may desire to error caused by the redundancy in content characteristic information high dimension vector, Retain the essential structure feature inside content characteristic information high dimension vector simultaneously, to improve the precision of commending contents.
There are many ways to dimension-reduction treatment, for example, the method for dimension-reduction treatment may include Principal Component Analysis Algorithm (Principal Component Analysis, PCA), linear dimension-reduction algorithm (Linear Discriminant Analysis, LDA), it is locally linear embedding into (Locally linear embedding, LLE), laplacian eigenmaps (Laplacian Eigenmaps), etc..
(4) it is constructed according to the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
For example, in some embodiments, can according to the content characteristic information of content to be recommended and content to be recommended to Mapping relations between amount carry out content construction DUAL PROBLEMS OF VECTOR MAPPING set of relationship with various data structures, for example, according to data structure class The difference of type can carry out content construction DUAL PROBLEMS OF VECTOR MAPPING set of relations in the form of chained list, table, hash table (hash), array, tree etc. It closes.
In some embodiments, content characteristic information vector includes content characteristic information high dimension vector, according to be recommended interior Mapping relations content construction DUAL PROBLEMS OF VECTOR MAPPING set of relationship between appearance and the content characteristic information vector of content to be recommended is specific It may comprise steps of:
According to the mapping relations building between content to be recommended and the content characteristic information low-dimensional vector of content to be recommended Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
102, feature space mapping is carried out to user's characteristic information, obtains the corresponding user's characteristic information of user's characteristic information Vector.
Wherein, the mode for carrying out feature space mapping to user's characteristic information can refer to above-mentioned steps 101 (2) for feature Content map obtains the mode of feature vector into default vector space.
Specifically, in some embodiments, user's characteristic information includes user characteristics content and user characteristics vector, step 102 can specifically include following sub-step:
(1) by user characteristics content map into default vector space, user characteristics content vector is obtained.
It, specifically can be with for example, obtain user characteristics content vector by user characteristics content map into default vector space The following steps are included:
A. word segmentation processing is carried out to user characteristics content, obtains paragraph sequence, paragraph sequence includes at least a kind of paragraph.
Wherein, participle, which refers to, is divided into multiple character string subsequences according to certain specification for continuous character string sequence Process.The mode of participle have it is a variety of, for example, the segmenting method based on string matching, segmenting method and base based on understanding In the segmenting method, etc. of statistics.
For example, user characteristics content is " i have a coin ", it is based on character string dictionary, it can will be in the user characteristics Hold participle and obtains paragraph sequence [i] [have] [a] [coin].
B. the frequency of occurrences of the every kind of paragraph in paragraph sequence is counted, the corresponding paragraph frequency of every kind of paragraph is obtained.
In some embodiments, in order to solve to obtain, user characteristics content vector excessively coefficient, parameter space are excessive asks Topic, can count every kind of paragraph in paragraph sequence by N-gram (a kind of language model (Language Model, LM)) The frequency of occurrences, obtain the corresponding paragraph frequency of every kind of paragraph.
Wherein, N-gram may include 1-Gram (linear model), Bi-Gram (binary model), Tri-Gram (ternary mould Type), etc..
For N-Gram based in hidden Markov it is assumed that in i.e. one section of text, the probability that passage occurs is equal to should The probability product that each word occurs in section text.The probability of each word can be by being calculated in corpus.
Such as, it is assumed that passage T is by c1, c2, c3...ciComposition, it is as follows to be formulated 1-Gram language model:
Bi-Gram model is as follows:
Tri-Gram model is as follows:
Wherein, P (T) is the probability that passage occurs, and P (ci) is the probability that i-th of word occurs, and wherein p (X | Y) refers to The number that Y occurs in number/corpus that text X and Y occurs simultaneously in corpus.
Therefore first user characteristics content carries out word segmentation processing, obtains paragraph sequence, paragraph sequence includes at least a kind of paragraph, is adopted With the above-mentioned N-Gram formula, that is, frequency of occurrences of the statistics available every kind of paragraph in paragraph sequence, the corresponding paragraph of every kind of paragraph is obtained Frequency.
For example, after segmenting to this section of text, 4 paragraphs are obtained for passage [I have a coin], point Not Wei [I] [have] [a] [coin], using 1-Gram model calculate P (I, have, a, coin) known to:
P (I, have, a, coin)=P (I) P (have | I) P (a | have) P (coin | a)
C. the paragraph that paragraph frequency meets preset threshold is chosen from paragraph sequence, obtains target paragraph.
In order to further decrease the information redundancy in vector, paragraph frequency can be chosen from paragraph sequence and meet default threshold The paragraph of value, to filter the low paragraph of the frequency of occurrences.
For example, in some embodiments, the paragraph that paragraph frequency is greater than preset threshold being chosen from paragraph sequence, is obtained To target paragraph.
Wherein, which can be set as desired by technical staff.
D. the corresponding user's characteristic information vector of user's characteristic information is constructed according to target paragraph.
Finally, a frequency meter can be constructed according to the frequency of target paragraph, it is one-dimensional as eigenmatrix, then by the matrix Change, the corresponding user's characteristic information vector of user's characteristic information can be obtained.
(2) user characteristics vector and user characteristics content vector are subjected to Vector Fusion, it is corresponding obtains user's characteristic information User's characteristic information vector.
Wherein, the concrete mode of Vector Fusion can refer to step 101 (2) for content characteristic information vector and feature Vector carries out Vector Fusion, obtains the mode of the content characteristic information vector of content to be recommended, this will not be repeated here.
In some embodiments, user's characteristic information vector includes user's characteristic information high dimension vector, and step 102 specifically may be used To include following sub-step:
Dimension-reduction treatment is carried out to user's characteristic information high dimension vector, obtains user's characteristic information low-dimensional vector.
Wherein, the method for above-mentioned dimension-reduction treatment specifically includes:
(1) connection weight and biasing in Matching Model is used to be weighted summation to user's characteristic information high dimension vector Processing, user's characteristic information high dimension vector after being handled, Matching Model is by being labelled with the training sample of true vector similarity Training forms.
Wherein, which can be semantic model (the Deep Structured Semantic based on depth network Model, DSSM), which can be by the language of user's characteristic information and content characteristic information MAP a to identical dimensional Adopted space finally calculates the cosine between the two vectors to obtain user's characteristic information vector sum content characteristic information vector (cosine) distance obtains vector similarity.
Specifically, DSSM model may include common DSSM model, can also include the DSSM model based on convolution (Convolutional DSSM, C-DSSM), circulation DSSM model (Recurrent DSSM, R-DSSM), multiple information sources DSSM Model (Multi-View DSSM, MV-DSSM), the multi-modal scale model of deep layer (Deep Multimodal Similarity Model, DMSM), etc..
Wherein, the basic structure of DSSM model can refer to Fig. 1 c, and as illustrated in figure 1 c, DSSM model may include matching Layer, expression layer and input layer.
Wherein, input layer can in expression layer, then expression layer for inputting user's characteristic information A and content characteristic information B User's characteristic information A is projected as user's characteristic information vector a, by content characteristic information B be projected as content characteristic information to Measure b, then, can be calculated in matching layer the cosine between user's characteristic information vector a and content characteristic information vector b away from From, will the COS distance normalization after obtain vector similarity.
For example, in some embodiments, using the connection weight in Matching Model and biasing to user's characteristic information height Dimensional vector is weighted summation process, after being handled before user's characteristic information high dimension vector, can also specifically include following Step:
A. training sample is obtained, training sample includes user's training sample, content training sample, and training sample is labelled with use True vector similarity between family training sample and content training sample;
B. initial matching model is trained using user's training sample, content training sample, obtains user's training sample Predicted vector similarity between sheet and content training sample;
C. initial matching model is restrained according to true vector similarity and predicted vector similarity, after being trained Matching Model.
Obtain training sample mode have it is a variety of, for example, obtaining trained sample from web data server by network Originally, training sample and the acquisition training sample, etc. in local memory are directly read.The training sample can be by art personnel Mark the true vector similarity between user's training sample and content training sample.
In some embodiments, in order to guarantee the vector similarity acquired closer to 1, user's characteristic information and to be recommended interior The content characteristic information of appearance is more similar, using the connection weight in Matching Model and biases to user's characteristic information high dimension vector It is weighted summation process, user's characteristic information high dimension vector may include specific steps below after being handled:
A. when the connection weight in Matching Model is nonnegative number, according to connection weight and biasing to user's characteristic information High dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled;
B. when the connection weight in Matching Model is negative, according to the absolute value of connection weight and biasing to user spy Reference breath high dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled.
In some embodiments, the absolute value to the non-negative limitation of connection weight in addition to obtaining connection weight, can also lead to Numerical intervals where crossing judgement connection weight come to the connection weight assignment.
For example, connection weight is assigned a value of 15 when connection weight numerical value is in [- 20,10] section.
Specific assignment mode can be arranged by technical staff.
(2) user's characteristic information high dimension vector after processing is carried out at dimensionality reduction using the default dimensionality reduction function in Matching Model Reason, obtains user's characteristic information low-dimensional vector.
Wherein, presetting dimensionality reduction function is the dimensionality reduction function in Matching Model in presentation layer, for example, when Matching Model is DSSM When model, which can be principal component analysis (Principal Component Analysis, PCA) function.
For example, setting n-dimensional vector w as a change in coordinate axis direction (referred to as map vector) of target subspace, data are maximized Variance after mapping, has:
Wherein m is the number of data instance, and value can be positive integer, xiIt is the vector expression of data instance i, i's Value can be positive integer,It is the average vector of all data instances.W is defined to be column vector comprising all map vectors Matrix, by linear algebraic transformation, available following optimization object function:
The wherein mark number (trace) of tr representing matrix, A is data covariance matrix.
Known to ground, optimal W is by the corresponding feature vector of the maximum characteristic value of k before data covariance matrix as column vector It constitutes, this feature vector forms one group of orthogonal basis and retains vector information.
The output of PCA is Y=W`X, wherein is reduced to k dimension by the original dimension of X.
103, it calculates in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector Vector similarity.
Wherein, vector similarity (Similarity Measurement) can be by calculating the distance between vector (distance) it acquires.
For example, Euclidean distance (Euclidean Distance), manhatton distance (Manhattan can be passed through Distance), Chebyshev's distance (Chebyshev Distance), standardization Euclidean distance (Standardized Euclidean distance), included angle cosine (Cosine) etc. calculate vector similarity.
In some embodiments, content characteristic information vector includes content characteristic information low-dimensional vector, and step 103 specifically may be used With the following steps are included:
It calculates in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information low-dimensional vector Vector similarity.
For example, by normalization included angle cosine value as vector similarity:
Wherein, with reference to the relation schematic diagram between Fig. 1 d included angle cosine value provided and vector, a, b are two vectors, cos θ is included angle cosine value (being also COS distance, cosine similarity), and i and n are positive integer, and the calculation of vector similarity is The included angle cosine value is normalized by softmax function, obtains the numerical value between [0,1], wherein vector similarity is such as Under:
104, object content characteristic information vector is determined from content characteristic information vector based on vector similarity.
For example, in some embodiments, determining that object content is special from content characteristic information vector based on vector similarity The mode for levying information vector is sorted from large to small to vector similarity, determines preceding 100 in content characteristic information vector It is a to be used as object content characteristic information vector.
For example, in some embodiments, determining that object content is special from content characteristic information vector based on vector similarity Sign information vector mode be obtain all the elements characteristic information vector in vector similarity greater than 0.8 content characteristic information to Amount, as object content characteristic information vector.
For example, in some embodiments, determining that object content is special from content characteristic information vector based on vector similarity The mode for levying information vector is sorted from large to small to vector similarity, determines preceding 100 in content characteristic information vector A middle vector similarity is greater than 0.8 content characteristic information vector, as object content characteristic information vector.
105, the corresponding object content of object content characteristic information vector is chosen based on content DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
For example, choosing should according to the mapping relations of object content characteristic information vector in content DUAL PROBLEMS OF VECTOR MAPPING set of relationship The corresponding object content of object content characteristic information vector.
106, recommend the content of object content to target user.
For example, the object content to be transmitted directly to the user terminal of target user.
For example, in some embodiments, can also further be segmented, be filtered to the object content, then by its name Claim the user terminal for being sent to target user.
For example, in some embodiments, the title of the object content can be searched in database server by network, The other information of the object content is obtained, the object content and other information are sent jointly to the user terminal of target user.
For example, the mesh can be searched in database server by network when object content is target video content The title for marking video content, obtains the other information such as click volume, comment, temperature, the brief introduction of the target video content, by the target Content and other information send jointly to the user terminal of target user.
Commending contents scheme provided by the embodiments of the present application can be applied in various commending contents scenes, such as.With view For frequency is launched, server can log in the user of acquisition target user when video playing is applied in target user by user terminal Characteristic information (for example, user's gender, history video tour record, age of user, user tag, etc.), and read local Video content DUAL PROBLEMS OF VECTOR MAPPING set of relationship in memory.Wherein, video content DUAL PROBLEMS OF VECTOR MAPPING set of relationship includes video to be recommended With the mapping relations between video content features information vector to be recommended.
Then, server can carry out feature space mapping to user's characteristic information, and it is corresponding to obtain user's characteristic information User's characteristic information vector, and calculate video to be recommended in user's characteristic information vector and video content DUAL PROBLEMS OF VECTOR MAPPING set of relationship The vector similarity of content characteristic information vector.
Determine target video content characteristic letter from video content features information vector to be recommended based on vector similarity again Vector is ceased, and the corresponding recommendation view of target video content characteristic information vector is chosen based on video content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Frequently, video playing application homepage finally is logged in the user terminal of target user launch the recommendation video.
For example, the characteristic information of user is [women, 23 years old, historical viewings makeups video, user tag: lipstick tries color], By carrying out feature space mapping to the user's characteristic information, user's characteristic information vector is obtained, and according to the user characteristics Information vector retrieves makeups video content features information vector similar with its from local memory, then to the view of the user terminal Frequency, which is played, launches the makeups video retrieved using homepage.
Using scheme the embodiment of the present application provided by the embodiments of the present application by obtain feature on user's characteristic information phase As content to be recommended as object content, the accuracy of commending contents can be promoted.
From the foregoing, it will be observed that the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING of the available target user of the embodiment of the present application Set of relationship, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship include between content to be recommended and the content characteristic information vector of content to be recommended Mapping relations;Feature space mapping is carried out to user's characteristic information, obtains the corresponding user's characteristic information of user's characteristic information Vector;Calculate the vector phase between user's characteristic information vector and content characteristic information vector in content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Like degree;Object content characteristic information vector is determined from content characteristic information vector based on vector similarity;Based on content vector Mapping relations set chooses the corresponding object content of object content characteristic information vector;Recommend object content to target user.By This, the embodiment of the present application, can be with by obtaining in feature content to be recommended similar with user's characteristic information as object content Promote the accuracy of commending contents.
The method according to described in above-described embodiment, will now be described in further detail below.
In the present embodiment, it will be integrated in content recommendation device in offline service device and line server, and carry out video For commending contents, the method for the embodiment of the present invention is described in detail.Offline service device and online in one embodiment Server can also be realized by a server.
Fig. 2 a is a kind of system structure diagram of commending contents provided in this embodiment, as shown in Figure 2 a, the present embodiment Content recommendation device can be integrated in respectively offline service device from different information are handled in line server.
Wherein, it can store the attribute information of video in the database of offline service device.Wherein, the attribute information of video can User, video content types, etc. are watched with the address information, title, label, the history that include video.
In offline service device, training data is can be used to train initial Matching Model in content recommendation device, and fixed Matching Model after training is sent to line server by the phase.
Then, it is mentioned by Matching Model after training to carry out feature to the attribute information that can store video in database It takes, obtains the feature vector of video to be recommended, and the feature vector of video to be recommended is periodically sent to line server.
In line server, content recommendation device can periodically obtain after the training of update Matching Model and to be recommended Content characteristic information vector, and Matching Model mentions after the training updated by these to carry out feature to user's characteristic information It takes, obtains user's characteristic information vector, then retrieval and user's characteristic information in all content characteristic information vectors to be recommended The corresponding content to be recommended of vector most like first 100 content characteristic information vectors to be recommended, as object content.
As shown in Figure 2 b, a kind of content recommendation method can be executed, detailed process by offline service device and line server It is as follows:
201, offline service device training initial matching model, Matching Model after being trained.
In offline service device, the available training data of content recommendation device, the training data includes user's training sample Originally, content training sample, the training data are also labelled with the true vector phase between user's training sample and content training sample Like degree.
Initial matching model can be trained according to these training data offline service devices, and mould will be matched after training Type is periodically uploaded to line server.
In the present embodiment, content recommendation device can obtain training number by network from network database servers According to.
Content recommendation device can read initial matching model in local memory, which is depth characteristic Matching Model (Deep Semantic Matching Model, DSSM), the model structure schematic diagram of the model can be with reference to figure 2c, as shown in Figure 2 c, DSSM include three network layers, respectively input layer, expression layer and matching layer.
The training data that input layer can will acquire imports DSSM.
Expression layer can will extract the high dimensional feature vector of training data, and respectively by these high dimensional feature vectors by more Layer perceptron (Multi-Layer Perception, MLP) Lai Jinhang dimensionality reduction, obtains low-dimensional feature vector.
The cosine value that matching layer can carry out the low-dimensional vector of data between vector two-by-two calculates, and the cosine value acquired is led to It crosses softmax function to be normalized, obtains predicted vector similarity.
Finally, according to the true vector similarity and predicted vector similarity marked in training data to initial matching model It is restrained, Matching Model after being trained.
Matching Model not only can predict the vector similarity between two word contents in matching layer after the training, but also can be with In the corresponding low-dimensional feature vector of expression layer output character content.
Specifically, with reference to the network architecture schematic diagram of Fig. 2 d DSSM provided a kind of, as shown in Figure 2 d, DSSM every time can be with One user's training sample of input layer input and multiple content training samples are converted to the high dimensional feature vector of 500k dimension, Obtain the user training high dimensional feature vector Q and content training high dimensional feature vector D of one-hot encoding (one-hot) form1、 D2...Dn, specific transform mode can be with reference to step 102, and this will not be repeated here.
Since vector dimension is excessively high, calculation amount is excessive, thereby increases and it is possible to it will lead to OOV (Out of Vocabulary) problem, therefore Word Hash (Word Hashing) operation is also needed to be implemented in expression layer, to reduce the dimension of high dimensional feature vector, generates 30k dimension The feature vector of degree.
Then, the feature vector of 30k is generated into the low-dimensional feature vector of 128 dimensions by MLP.
Finally, the content in matching layer by user's training low-dimensional feature vector Q of 128 dimensions respectively with 128 dimensions trains low-dimensional Feature vector D1、D2...DnThe calculating of vector cosine similarity is carried out, D is obtained1Cosine similarity R (Q, D1)、D2Cosine it is similar Spend R (Q, D2)...DnCosine similarity R (Q, Dn)。
Also, these cosine similarities are normalized by softmax function, are converted it into big less than 1 In 0 probability value, D is obtained1Vector similarity P (Q | D1)、D2Vector similarity P (Q | D2)...DnVector similarity P (Q |Dn), vector similarity is closer to 1, then user's training sample is more similar to content training sample.
Word hashing operation can refer to user characteristics content map in step 102 into vector space, obtain user spy The step of levying content vector, this will not be repeated here.
Wherein, the specific structure of multilayer perceptron can carry out user's characteristic information high dimension vector with reference in step 102 Dimension-reduction treatment, the step of obtaining user's characteristic information low-dimensional vector, this will not be repeated here.
Finally, minimizing loss function using Maximum-likelihood estimation in training stage the application:
Residual error can the backpropagation in the deep neural network of expression layer, eventually by stochastic gradient descent (Stochastic Gradient Descent, SGD) restrains model, obtains the parameter { Wi, bi } of each network layer.
202, offline service device obtains the content characteristic information of content to be recommended, and using Matching Model after training by content Feature information processing is constructed at content characteristic information low-dimensional vector, and according to the content characteristic information low-dimensional vector of content to be recommended Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
In the present embodiment, offline service device can also be obtained by network from network database servers in be recommended The content characteristic information of appearance, and believed according to Matching Model after training in step 201 come the content characteristic to these contents to be recommended Breath processing, generates content low-dimensional language feature vector.
Wherein, content information to be recommended includes the information such as number, title, address, the label of content to be recommended.
Then, the corresponding content low-dimensional language feature vector of these contents to be recommended is constructed into content DUAL PROBLEMS OF VECTOR MAPPING set of relations It closes, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship format shown in reference table 3, which may include multiple Content to be recommended and the corresponding content low-dimensional language feature vector of each content to be recommended, url (Uniform Resource Locator) address and title.
Content to be recommended Title Content low-dimensional language feature vector The address url
Content A to be recommended Landscape painting a A [x, x, x] http://xxx
Content B to be recommended Landscape video b B [x, x, y] http://xxy
Content C to be recommended Travel video c C [x, y, y] http://xyy
Table 3
203, line server obtains user's characteristic information.
In the present embodiment, the available recommendation request from user terminal of line server, which includes user Identity information, address information etc..
Inquire the identity information of user in network database servers by network, available user's characteristic information, The user's characteristic information includes the information such as gender, age, the historical viewings record of user.
204, line server carries out feature space mapping to user's characteristic information using model after training, obtains user spy Reference ceases corresponding user's characteristic information high dimension vector, and carries out dimension-reduction treatment to user's characteristic information high dimension vector, is used Family characteristic information low-dimensional vector.
In line server, content recommendation device matches after can obtaining the training that offline service device is sent by network Model, and feature space mapping is carried out to user's characteristic information using the input layer of model after training and presentation layer, obtain table The user's characteristic information high dimension vector exported up to layer.
Wherein, for the word content in user's characteristic information, the presentation layer of Matching Model can be executed following after training Word hashing step, to be translated into vector, and solve the problems, such as data scale, improve robustness:
A. slicing treatment is carried out to user characteristics content, obtains paragraph sequence;
B. the frequency of occurrences for counting every kind of paragraph in paragraph sequence, obtains the corresponding paragraph frequency of every kind of paragraph;
C. the paragraph that paragraph frequency meets preset threshold is chosen from paragraph sequence, obtains target paragraph;
D. the corresponding feature vector of word content is constructed according to target paragraph.
The one-hot encoding vector of higher-dimension can be converted to how hot code (multi-hot) by the present embodiment as a result, for example, by literary Word content using Tri-gram be sliced to obtain the word content for paragraph sequence, so that it is corresponding special to compress word content Levy the space of vector.
For example, word content #Query#, can be sliced and generate a paragraph sequence, it include 5 in the paragraph sequence to language Section, respectively #Qu, Que, #uer, #ery, ry#.Then, these paragraphs are counted, every primary paragraph occurred and exist Its corresponding position+1, eventually forms the coding of a multi-hot.
In the present embodiment, these obtained feature vectors can also be stitched together, then, directly by user characteristics Feature vector in information is spliced after these feature vectors, and the corresponding user's characteristic information higher-dimension of user's characteristic information is obtained Vector.
Specific steps refer to step 102, and this will not be repeated here.
Again by the MLP network in presentation layer, user's characteristic information low-dimensional vector is obtained.With reference to Fig. 2 d, MLP mono- shares three Layer, WiIndicate i-th layer of weight matrix, biIndicate i-th layer of biasing (bias), y is the vector that presentation layer finally exports, can be with It respectively indicates are as follows:
l1=W1x
li=f (Wili-1+bi), i=2,3...N-1
Y=f (WNlN-1+bN)
Wherein, f is activation primitive, and in the present embodiment, f is tanh function:
205, it is special to calculate content in user's characteristic information low-dimensional vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship for line server Reference ceases the vector similarity between low-dimensional vector.
In line server, content recommendation device can obtain the content vector that offline service device is sent by network and reflect Set of relationship is penetrated, and using the matching layer of model after training come to user's characteristic information low-dimensional vector and content DUAL PROBLEMS OF VECTOR MAPPING All the elements characteristic information low-dimensional vector carries out length computation in set of relationship, inner product calculates, and obtains user's characteristic information low-dimensional The vector length of vector, the vector length of content characteristic information low-dimensional vector, user's characteristic information low-dimensional vector sum content characteristic Inner product of vectors between information low-dimensional vector.
Then, the vector length of vector length, content characteristic information low-dimensional vector based on user's characteristic information low-dimensional vector Inner product of vectors between degree, user's characteristic information low-dimensional vector sum content characteristic information low-dimensional vector, to calculate content characteristic letter Cease the COS distance of vector.
Finally, converting the COS distance to by softmax function the vector similarity of Probability Forms expression.
For example, the vector length of user's characteristic information low-dimensional vector Q is | Q |, the vector of content characteristic information low-dimensional vector D Length is | D |, the inner product of vectors between user's characteristic information low-dimensional vector Q and content characteristic information low-dimensional vector D is QD, COS distance are as follows:
Its vector similarity are as follows:
Wherein, the smoothing factor that wherein r is softmax, D are the positive sample under user's characteristic information low-dimensional vector, and D` is used Negative sample (take random negative sampling) under the characteristic information low-dimensional vector of family, all samples under D user's characteristic information low-dimensional vector This.
206, line server determines object content characteristic information based on vector similarity from content characteristic information vector Vector, and the corresponding object content of object content characteristic information vector is chosen, recommend object content to target user.
Finally, being ranked up the vector similarity being calculated is descending, chooses vector similarity and be greater than 0.9 and arrange Preceding 100 vector similarities of name, it is corresponding to be recommended in content DUAL PROBLEMS OF VECTOR MAPPING set of relationship to retrieve these vector similarities Title, the address url of content etc., as object content and the attribute of object content, then, by these object contents and mesh The attribute of mark content is sent to target user.
It in this application, can be by a portion target after executing fine sort, filtering sensitive words to these object contents The attribute of content and object content is sent to user terminal.
From the foregoing, it will be observed that the embodiment of the present application can train initial matching model, Matching Model after being trained is obtained wait push away Recommend the content characteristic information of content, and using Matching Model after training by content characteristic information processing at content characteristic information low-dimensional Vector, and according to the content characteristic information low-dimensional vector content construction DUAL PROBLEMS OF VECTOR MAPPING set of relationship of content to be recommended, obtain user Characteristic information carries out feature space mapping to user's characteristic information, and it is high to obtain the corresponding user's characteristic information of user's characteristic information Dimensional vector, and dimension-reduction treatment is carried out to user's characteristic information high dimension vector, user's characteristic information low-dimensional vector is obtained, user is calculated Vector similarity in characteristic information low-dimensional vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information low-dimensional vector, Object content characteristic information low-dimensional vector is determined from content characteristic information low-dimensional vector based on vector similarity, and chooses target The corresponding object content of content characteristic information low-dimensional vector recommends the object content to target user.
Due to the program be continuously updated training after Matching Model carry out and content characteristic information low-dimensional vector, can be improved The acquisition range of object content and the richness of object content, to meet the recommended requirements of different type user;Meanwhile passing through Calculate the vector similarity between low-dimensional feature vector, it is possible to reduce the calculation amount of similarity calculation, and line server is not necessarily to It is vector format by content transformation to be recommended, therefore calculation amount needed for can reducing retrieval in retrieval is to improve retrieval rate; Also, in the present embodiment, user's characteristic information vector considers more user's characteristic informations, thus, it is possible in being promoted Hold the accuracy recommended.
In order to better implement above method, the embodiment of the present application also provides a kind of content recommendation device, the commending contents Device specifically can integrate in the electronic device, which can be the equipment such as terminal, server, PC.For example, In the present embodiment, the method for the embodiment of the present invention will be carried out detailed so that content recommendation device is integrated in the server as an example Explanation.
For example, as shown in Figure 3a, which may include acquiring unit 301, map unit 302, calculates list Member 303, determination unit 304, selection unit 305 and recommendation unit 306 are as follows:
(1) acquiring unit 301:
Acquiring unit 301, for obtaining the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user, Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship includes the mapping relations between content to be recommended and content characteristic information vector.
In some embodiments, acquiring unit 301 may include obtaining subelement, content map subelement, content mergence Subelement and building subelement, as follows:
Subelement is obtained, for obtaining the user's characteristic information of target user and the content characteristic letter of content to be recommended Breath, content characteristic information includes feature, content characteristic information vector;
Content map subelement obtains feature vector for feature to be mapped in default vector space;
Content mergence subelement is obtained for content characteristic information vector and feature vector to be carried out Vector Fusion The content characteristic information vector of content to be recommended;
Subelement is constructed, for according between content to be recommended and the content characteristic information vector of content to be recommended Mapping relations content construction DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
In some embodiments, content characteristic information vector includes content characteristic information high dimension vector, and content mergence is single Member can be also used for:
Dimension-reduction treatment is carried out to content characteristic information high dimension vector, obtains content characteristic information low-dimensional vector;
According to the mapping relations content construction between content to be recommended and the content characteristic information vector of content to be recommended DUAL PROBLEMS OF VECTOR MAPPING set of relationship, comprising:
According to the mapping relations building between content to be recommended and the content characteristic information low-dimensional vector of content to be recommended Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
(2) map unit 302:
It is corresponding to obtain user's characteristic information for carrying out feature space mapping to user's characteristic information for map unit 302 User's characteristic information vector.
In some embodiments, user's characteristic information vector includes user's characteristic information high dimension vector, and map unit 302 can To be used for:
Dimension-reduction treatment is carried out to user's characteristic information high dimension vector, obtains user's characteristic information low-dimensional vector.
In some embodiments, user's characteristic information vector includes user's characteristic information high dimension vector, and map unit 302 can It is as follows to include weighting subelement and dimensionality reduction subelement:
(1) subelement is weighted, for using the connection weight in Matching Model and biasing to user's characteristic information higher-dimension Vector is weighted summation process, and user's characteristic information high dimension vector after being handled, Matching Model is by being labelled with true vector The training sample training of similarity forms.
In some embodiments, weighting subelement can be also used for:
Training sample is obtained, training sample includes user's training sample, content training sample, and training sample is labelled with user True vector similarity between training sample and content training sample;
Initial matching model is trained using user's training sample, content training sample, obtains user's training sample Predicted vector similarity between content training sample;
Initial matching model is restrained according to true vector similarity and predicted vector similarity, after being trained Matching Model.
(2) dimensionality reduction subelement, for high to user's characteristic information after processing using the default dimensionality reduction function in Matching Model Dimensional vector carries out dimension-reduction treatment, obtains user's characteristic information low-dimensional vector.
In some embodiments, dimensionality reduction subelement specifically can be used for:
A. when the connection weight in Matching Model is nonnegative number, according to connection weight and biasing to user's characteristic information High dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled;
B. when the connection weight in Matching Model is negative, according to the absolute value of connection weight and biasing to user spy Reference breath high dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled.
In some embodiments, with reference to Fig. 3 b, map unit 302 may include mapping subelement 3021 and fusion subelement 3022, as follows:
(1) subelement 3021 is mapped:
Subelement 3021 is mapped, for into default vector space, obtaining in user characteristics user characteristics content map Hold vector.
In some embodiments, mapping subelement 3021 can be specifically used for:
Word segmentation processing is carried out to user characteristics content, obtains paragraph sequence, paragraph sequence includes at least a kind of paragraph;
The frequency of occurrences of the every kind of paragraph in paragraph sequence is counted, the corresponding paragraph frequency of every kind of paragraph is obtained;
The paragraph that paragraph frequency meets preset threshold is chosen from paragraph sequence, obtains target paragraph;
The corresponding user's characteristic information vector of user's characteristic information is constructed according to target paragraph.
(2) subelement 3022 is merged:
Subelement 3022 is merged, for user characteristics vector and user characteristics content vector to be carried out Vector Fusion, is obtained The corresponding user's characteristic information vector of user's characteristic information.
(3) computing unit 303:
Computing unit 303, for calculating content characteristic in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Vector similarity between information vector.
In some embodiments, computing unit 303 may include length subelement, inner product subelement, apart from subelement and Similarity subelement:
Length subelement, for calculating the vector length of user's characteristic information vector sum content characteristic information vector;
Inner product subelement, for calculating the inner product of vectors between user's characteristic information vector sum content characteristic information vector;
Apart from subelement, for calculating the COS distance of content characteristic information vector based on inner product of vectors and vector length;
Similarity subelement obtains user's characteristic information vector and content for COS distance to be normalized Vector similarity between characteristic information vector.
In some embodiments, content characteristic information vector includes content characteristic information low-dimensional vector, and computing unit 303 has Body can be used for:
It calculates in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information low-dimensional vector Vector similarity.
(4) determination unit 304:
Determination unit 304, for determining object content feature letter from content characteristic information vector based on vector similarity Cease vector.
(5) selection unit 305:
Selection unit 305, it is corresponding for choosing object content characteristic information vector based on content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Object content.
(6) recommendation unit 306:
Recommendation unit 306, for recommending the content of object content to target user.
When it is implemented, above each unit can be used as independent entity to realize, any combination can also be carried out, is made It is realized for same or several entities, the specific implementation of above each unit can be found in the embodiment of the method for front, herein not It repeats again.
From the foregoing, it will be observed that the content recommendation device of the present embodiment is obtained the user's characteristic information of target user by acquiring unit, And content DUAL PROBLEMS OF VECTOR MAPPING set of relationship, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship include the content of content to be recommended Yu content to be recommended Mapping relations between characteristic information vector;Feature space mapping is carried out to user's characteristic information by map unit, obtains user The corresponding user's characteristic information vector of characteristic information;User's characteristic information vector is calculated by computing unit and content DUAL PROBLEMS OF VECTOR MAPPING is closed Vector similarity in assembly conjunction between content characteristic information vector;Vector similarity is based on by determination unit to believe from content characteristic It ceases and determines object content characteristic information vector in vector;It is single based on content DUAL PROBLEMS OF VECTOR MAPPING set of relationship selection object content by choosing The corresponding object content of characteristic information vector;Recommend the content of object content from recommendation unit to target user.
Since the program can be chosen and target user's characteristic information by the user's characteristic information vector of target user Similar content to be recommended is as object content, and the present embodiment is so as to promoting the accuracies of commending contents.
The embodiment of the present application also provides a kind of server, which can be mobile phone, tablet computer, miniature handle box Son, unmanned plane or image capture device etc..As shown in figure 4, it illustrates servers involved in the embodiment of the present application Structural schematic diagram, specifically:
The server may include one or processor 401, one or more meters of more than one processing core The components such as memory 402, power supply 403, input module 404 and the communication module 405 of calculation machine readable storage medium storing program for executing.This field skill Art personnel are appreciated that server architecture shown in Fig. 4 does not constitute the restriction to server, may include more than illustrating Or less component, perhaps combine certain components or different component layouts.Wherein:
Processor 401 is the control centre of the server, utilizes each of various interfaces and the entire server of connection Part by running or execute the software program and/or module that are stored in memory 402, and calls and is stored in memory Data in 402, the various functions and processing data of execute server, to carry out integral monitoring to server.In some realities It applies in example, processor 401 may include one or more processing cores;In some embodiments, processor 401 can integrate at Manage device and modem processor, wherein the main processing operation system of application processor, user interface and application program etc. are adjusted Demodulation processor processed mainly handles wireless communication.It is understood that above-mentioned modem processor can not also integrate everywhere It manages in device 401.
Memory 402 can be used for storing software program and module, and processor 401 is stored in memory 402 by operation Software program and module, thereby executing various function application and data processing.Memory 402 can mainly include storage journey Sequence area and storage data area, wherein storing program area can the (ratio of application program needed for storage program area, at least one function Such as sound-playing function, image player function) etc.;Storage data area, which can be stored, uses created data according to server Deng.In addition, memory 402 may include high-speed random access memory, it can also include nonvolatile memory, for example, at least One disk memory, flush memory device or other volatile solid-state parts.Correspondingly, memory 402 can also include Memory Controller, to provide access of the processor 401 to memory 402.
Server further includes the power supply 403 powered to all parts, and in some embodiments, power supply 403 can pass through electricity Management system and processor 401 are logically contiguous, to realize management charging, electric discharge and power consumption by power-supply management system The functions such as management.Power supply 403 can also include one or more direct current or AC power source, recharging system, power supply event Hinder the random components such as detection circuit, power adapter or inverter, power supply status indicator.
The server may also include input module 404, which can be used for receiving the number or character letter of input Breath, and generation keyboard related with user setting and function control, mouse, operating stick, optics or trackball signal are defeated Enter.
The server may also include communication module 405, and in some embodiments, communication module 405 may include wireless son Module, server can carry out short range wireless transmission by the wireless submodule of the communication module 405, to provide wirelessly Broadband internet access.For example, the communication module 405 can be used for that user is helped to send and receive e-mail, browse webpage and access Streaming video etc..
Although being not shown, server can also be including display unit etc., and details are not described herein.Specifically in the present embodiment, Processor 401 in server can according to following instruction, by the process of one or more application program is corresponding can It executes file to be loaded into memory 402, and runs the application program being stored in memory 402 by processor 401, thus Realize various functions, as follows:
Obtain the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user, content DUAL PROBLEMS OF VECTOR MAPPING relationship Set includes the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended;
To user's characteristic information carry out feature space mapping, obtain the corresponding user's characteristic information of user's characteristic information to Amount;
Calculate in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector to Measure similarity;
Object content characteristic information vector is determined from content characteristic information vector based on vector similarity;
The corresponding object content of object content characteristic information vector is chosen based on content DUAL PROBLEMS OF VECTOR MAPPING set of relationship;
Recommend object content to target user.
The specific implementation of above each operation can be found in the embodiment of front, and details are not described herein.
From the foregoing, it will be observed that the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING relationship of the available target user of the present embodiment Set, content DUAL PROBLEMS OF VECTOR MAPPING set of relationship includes reflecting between content to be recommended and the content characteristic information vector of content to be recommended Penetrate relationship;Feature space mapping is carried out to user's characteristic information, obtains the corresponding user's characteristic information vector of user's characteristic information; Calculate the vector similarity in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector; Object content characteristic information vector is determined from content characteristic information vector based on vector similarity;It is closed based on content DUAL PROBLEMS OF VECTOR MAPPING Assembly, which is closed, chooses the corresponding object content of object content characteristic information vector;Recommend object content to target user.As a result, at this In application, it can be chosen by the user's characteristic information vector of target user similar with target user's characteristic information to be recommended Content is as object content, to promote the accuracy of commending contents.
It will appreciated by the skilled person that all or part of the steps in the various methods of above-described embodiment can be with It is completed by instructing, or relevant hardware is controlled to complete by instruction, which can store in computer-readable storage In medium, and is loaded and executed by processor.
For this purpose, the embodiment of the present application provides a kind of storage medium, wherein being stored with a plurality of instruction, which can be processed Device is loaded, to execute the step in any content recommendation method provided by the embodiment of the present application.For example, the instruction can To execute following steps:
Obtain the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user, content DUAL PROBLEMS OF VECTOR MAPPING relationship Set includes the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended;
To user's characteristic information carry out feature space mapping, obtain the corresponding user's characteristic information of user's characteristic information to Amount;
Calculate in user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector to Measure similarity;
Object content characteristic information vector is determined from content characteristic information vector based on vector similarity;
The corresponding object content of object content characteristic information vector is chosen based on content DUAL PROBLEMS OF VECTOR MAPPING set of relationship;
Recommend object content to target user.
Wherein, which may include: read-only memory (ROM, Read Only Memory), random access memory Body (RAM, Random Access Memory), disk or CD etc..
By the instruction stored in the storage medium, any content provided by the embodiment of the present application can be executed and pushed away The step in method is recommended, it is thereby achieved that achieved by any content recommendation method provided by the embodiment of the present application Beneficial effect is detailed in the embodiment of front, and details are not described herein.
Above to a kind of content recommendation method, device, server and storage medium provided by the embodiment of the present application into It has gone and has been discussed in detail, specific examples are used herein to illustrate the principle and implementation manner of the present application, the above implementation The explanation of example is merely used to help understand the present processes and its core concept;Meanwhile for those skilled in the art, according to According to the thought of the application, there will be changes in the specific implementation manner and application range, and to sum up, the content of the present specification is not answered It is interpreted as the limitation to the application.

Claims (16)

1. a kind of content recommendation method characterized by comprising
Obtain the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user, the content DUAL PROBLEMS OF VECTOR MAPPING relationship Set includes the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended;
Feature space mapping is carried out to the user's characteristic information, obtains the corresponding user's characteristic information of the user's characteristic information Vector;
Calculate in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector to Measure similarity;
Object content characteristic information vector is determined from content characteristic information vector based on the vector similarity;
The corresponding object content of object content characteristic information vector is chosen based on the content DUAL PROBLEMS OF VECTOR MAPPING set of relationship;
Recommend the object content to target user.
2. content recommendation method as described in claim 1, which is characterized in that the user's characteristic information includes in user characteristics Hold and it is corresponding to obtain the user's characteristic information to user's characteristic information progress feature space mapping for user characteristics vector User's characteristic information vector, comprising:
By the user characteristics content map into default vector space, user characteristics content vector is obtained;
The user characteristics vector and user characteristics content vector are subjected to Vector Fusion, it is corresponding to obtain the user's characteristic information User's characteristic information vector.
3. content recommendation method as claimed in claim 2, which is characterized in that by the user characteristics content map to preset to In quantity space, user characteristics content vector is obtained, comprising:
Word segmentation processing is carried out to the user characteristics content, obtains paragraph sequence, the paragraph sequence includes at least a kind of paragraph;
The frequency of occurrences of the every kind of paragraph in the paragraph sequence is counted, the corresponding paragraph frequency of every kind of paragraph is obtained;
The paragraph that paragraph frequency meets preset threshold is chosen from the paragraph sequence, obtains target paragraph;
The corresponding user's characteristic information vector of the user's characteristic information is constructed according to the target paragraph.
4. content recommendation method as described in claim 1, which is characterized in that the content characteristic information vector includes content spy Reference ceases low-dimensional vector, calculate in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship content characteristic information to Vector similarity between amount, comprising:
It calculates in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information low-dimensional vector Vector similarity.
5. content recommendation method as described in claim 1, which is characterized in that the user's characteristic information vector includes user spy Reference ceases high dimension vector, carries out feature space mapping to the user's characteristic information, it is corresponding to obtain the user's characteristic information User's characteristic information vector, comprising:
Dimension-reduction treatment is carried out to the user's characteristic information high dimension vector, obtains user's characteristic information low-dimensional vector.
6. content recommendation method as claimed in claim 5, which is characterized in that carried out to the user's characteristic information high dimension vector Dimension-reduction treatment obtains user's characteristic information low-dimensional vector, comprising:
Using in Matching Model connection weight and biasing the user's characteristic information high dimension vector is weighted at summation Reason, user's characteristic information high dimension vector after being handled, the Matching Model is by being labelled with the training sample of true vector similarity This is trained;
Dimension-reduction treatment is carried out to user's characteristic information high dimension vector after the processing using the default dimensionality reduction function in Matching Model, Obtain user's characteristic information low-dimensional vector.
7. content recommendation method as claimed in claim 6, which is characterized in that using the connection weight in Matching Model and partially It sets and summation process is weighted to the user's characteristic information high dimension vector, user's characteristic information high dimension vector after being handled, Include:
When the connection weight in the Matching Model is nonnegative number, the user characteristics are believed according to connection weight and biasing Breath high dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled;
When the connection weight in the Matching Model is negative, according to the absolute value of connection weight and biasing to the user Characteristic information high dimension vector is weighted summation process, user's characteristic information high dimension vector after being handled.
8. content recommendation method as claimed in claim 6, which is characterized in that using the connection weight in Matching Model and partially Set and summation process be weighted to the user's characteristic information high dimension vector, after being handled user's characteristic information high dimension vector it Before, further includes:
Training sample is obtained, the training sample includes user's training sample and content training sample, the training sample mark True vector similarity between user's training sample and content training sample;
Initial matching model is trained using user's training sample, content training sample, obtains user's training sample Predicted vector similarity between content training sample;
The initial matching model is restrained according to the true vector similarity and predicted vector similarity, is trained Matching Model afterwards.
9. content recommendation method as described in claim 1, which is characterized in that the user's characteristic information of target user is obtained, with And content DUAL PROBLEMS OF VECTOR MAPPING set of relationship, comprising:
Obtain the user's characteristic information of target user and the content characteristic information of content to be recommended, the content characteristic information Feature, content feature vector including content to be recommended;
The feature of the content to be recommended is mapped in default vector space, obtain the feature of content to be recommended to Amount;
The content feature vector and the feature vector of content to be recommended are subjected to Vector Fusion, obtain content to be recommended Content characteristic information vector;
According to the mapping relations content construction between the content to be recommended and the content characteristic information vector of content to be recommended DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
10. content recommendation method as claimed in claim 9, which is characterized in that the content characteristic information vector includes content The content feature vector and the feature vector of content to be recommended are carried out Vector Fusion, obtained by characteristic information high dimension vector To after the content characteristic information vector of content to be recommended, further includes:
Dimension-reduction treatment is carried out to the content characteristic information high dimension vector, obtains content characteristic information low-dimensional vector;
According to the mapping relations content construction between the content to be recommended and the content characteristic information vector of content to be recommended DUAL PROBLEMS OF VECTOR MAPPING set of relationship, comprising:
According to the mapping relations building between the content to be recommended and the content characteristic information low-dimensional vector of content to be recommended Content DUAL PROBLEMS OF VECTOR MAPPING set of relationship.
11. such as the described in any item content recommendation methods of claim 1~10, which is characterized in that calculate the user characteristics letter Cease the vector similarity in vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship between content characteristic information vector, comprising:
Calculate the vector length of the user's characteristic information vector sum content characteristic information vector;
Calculate the inner product of vectors between the user's characteristic information vector sum content characteristic information vector;
The COS distance of the content characteristic information vector is calculated based on the inner product of vectors and vector length;
The COS distance is normalized, the vector of user's characteristic information vector Yu content characteristic information vector is obtained Similarity.
12. a kind of content recommendation device, feature vector are, comprising:
Acquiring unit, for obtaining the user's characteristic information and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship of target user, the content DUAL PROBLEMS OF VECTOR MAPPING set of relationship includes the mapping relations between content to be recommended and the content characteristic information vector of content to be recommended;
It is corresponding to obtain the user's characteristic information for carrying out feature space mapping to the user's characteristic information for map unit User's characteristic information vector;
Computing unit, for calculating content characteristic information in the user's characteristic information vector and content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Vector similarity between vector;
Determination unit, for based on the vector similarity from content characteristic information vector determination object content characteristic information to Amount;
Selection unit, for choosing the corresponding mesh of object content characteristic information vector based on the content DUAL PROBLEMS OF VECTOR MAPPING set of relationship Mark content;
Recommendation unit, for recommending the content of the object content to target user.
13. content recommendation device as claimed in claim 12, which is characterized in that the user's characteristic information includes user characteristics Content and user characteristics vector, the map unit include:
Map subelement, for by the user characteristics content map into default vector space, obtain user characteristics content to Amount;
Subelement is merged, for the user characteristics vector and user characteristics content vector to be carried out Vector Fusion, is obtained described The corresponding user's characteristic information vector of user's characteristic information.
14. content recommendation device as claimed in claim 13, which is characterized in that the mapping subelement is specifically used for:
Word segmentation processing is carried out to the user characteristics content, obtains paragraph sequence, the paragraph sequence includes at least a kind of paragraph;
The frequency of occurrences of the every kind of paragraph in the paragraph sequence is counted, the corresponding paragraph frequency of every kind of paragraph is obtained;
The paragraph that paragraph frequency meets preset threshold is chosen from the paragraph sequence, obtains target paragraph;
The corresponding user's characteristic information vector of the user's characteristic information is constructed according to the target paragraph.
15. a kind of server, which is characterized in that including processor and memory, the memory is stored with a plurality of instruction;It is described Processor loads instruction from the memory, to execute as in the described in any item content recommendation methods of claim 1~10 The step of.
16. a kind of storage medium, feature vector are, the storage medium is stored with a plurality of instruction, and described instruction is suitable for place Reason device is loaded, and requires the step in 1~10 described in any item content recommendation methods with perform claim.
CN201910447811.9A 2019-05-27 2019-05-27 Content recommendation method, device, server and storage medium Active CN110232152B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910447811.9A CN110232152B (en) 2019-05-27 2019-05-27 Content recommendation method, device, server and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910447811.9A CN110232152B (en) 2019-05-27 2019-05-27 Content recommendation method, device, server and storage medium

Publications (2)

Publication Number Publication Date
CN110232152A true CN110232152A (en) 2019-09-13
CN110232152B CN110232152B (en) 2021-03-23

Family

ID=67858149

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910447811.9A Active CN110232152B (en) 2019-05-27 2019-05-27 Content recommendation method, device, server and storage medium

Country Status (1)

Country Link
CN (1) CN110232152B (en)

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717539A (en) * 2019-10-08 2020-01-21 腾讯科技(深圳)有限公司 Artificial intelligence-based dimensionality reduction model training method, retrieval method and device
CN111010595A (en) * 2019-12-25 2020-04-14 广州欢聊网络科技有限公司 New program recommendation method and device
CN111339240A (en) * 2020-02-10 2020-06-26 北京达佳互联信息技术有限公司 Object recommendation method and device, computing equipment and storage medium
CN111368034A (en) * 2020-03-02 2020-07-03 复旦大学 Two-way semantic feature matching method and supply content recommendation device
CN111488526A (en) * 2020-04-14 2020-08-04 北京声智科技有限公司 Recommendation method and device
CN111553763A (en) * 2020-04-26 2020-08-18 上海风秩科技有限公司 Article recommendation method and device, electronic equipment and readable storage medium
CN111597446A (en) * 2020-05-13 2020-08-28 腾讯科技(深圳)有限公司 Content pushing method and device based on artificial intelligence, server and storage medium
CN111681085A (en) * 2020-06-10 2020-09-18 创新奇智(成都)科技有限公司 Commodity pushing method and device, server and readable storage medium
CN111708950A (en) * 2020-06-22 2020-09-25 腾讯科技(深圳)有限公司 Content recommendation method and device and electronic equipment
CN111815403A (en) * 2020-06-19 2020-10-23 北京石油化工学院 Product recommendation method, device and terminal equipment
CN111932445A (en) * 2020-07-27 2020-11-13 广州市百果园信息技术有限公司 Compression method for style migration network and style migration method, device and system
CN111966886A (en) * 2019-05-20 2020-11-20 北京京东尚科信息技术有限公司 Object recommendation method, object recommendation device, electronic equipment and storage medium
CN112070586A (en) * 2020-09-09 2020-12-11 腾讯科技(深圳)有限公司 Article recommendation method and device based on semantic recognition, computer equipment and medium
CN112287225A (en) * 2020-10-29 2021-01-29 北京奇艺世纪科技有限公司 Object recommendation method and device
CN112487300A (en) * 2020-12-18 2021-03-12 上海众源网络有限公司 Video recommendation method and device, electronic equipment and storage medium
CN112579889A (en) * 2020-12-07 2021-03-30 北京百度网讯科技有限公司 Article recommendation method and device, electronic equipment and storage medium
CN112749296A (en) * 2019-10-31 2021-05-04 北京达佳互联信息技术有限公司 Video recommendation method and device, server and storage medium
CN112818251A (en) * 2021-04-13 2021-05-18 腾讯科技(深圳)有限公司 Video recommendation method and device, electronic equipment and storage medium
CN112819492A (en) * 2019-11-15 2021-05-18 北京达佳互联信息技术有限公司 An advertisement recommendation method, device and electronic device
CN112989022A (en) * 2021-03-16 2021-06-18 中国平安人寿保险股份有限公司 Intelligent virtual text selection method and device and computer equipment
CN113449739A (en) * 2020-03-26 2021-09-28 阿里巴巴集团控股有限公司 Data processing method, device and system
CN113469773A (en) * 2020-06-05 2021-10-01 海信集团有限公司 Intelligent terminal and object recommendation method
CN113674008A (en) * 2020-05-14 2021-11-19 北京达佳互联信息技术有限公司 Targeted label recommendation method, device, server and storage medium
CN113722583A (en) * 2021-07-31 2021-11-30 华为技术有限公司 Recommendation method, recommendation model training method and related products
CN113947417A (en) * 2020-07-15 2022-01-18 上海哔哩哔哩科技有限公司 Training method and device of age identification model and age identification method and device
CN114049172A (en) * 2021-11-08 2022-02-15 土巴兔集团股份有限公司 Item recommendation sorting method and device, computer equipment and storage medium
CN114065014A (en) * 2020-07-31 2022-02-18 北京达佳互联信息技术有限公司 Information matching method, device, equipment and storage medium
CN114693011A (en) * 2020-12-25 2022-07-01 航天信息股份有限公司 A policy matching method, apparatus, device and medium
CN115033719A (en) * 2021-03-05 2022-09-09 北京奇虎科技有限公司 Multimedia information recommendation method, equipment, storage medium and device
CN115331414A (en) * 2022-10-13 2022-11-11 江苏濠玥电子科技有限公司 Temperature early warning method for zirconium oxide ceramic nozzle workpiece
CN116049529A (en) * 2021-10-27 2023-05-02 北京字节跳动网络技术有限公司 Content recommendation method, device, medium and electronic equipment
CN116075843A (en) * 2020-10-13 2023-05-05 谷歌有限责任公司 Methods, systems and media for identifying relevant content
US20230281956A1 (en) * 2020-07-16 2023-09-07 Beijing Bytedance Network Technology Co., Ltd. Method for generating objective function, apparatus, electronic device and computer readable medium
CN116932533A (en) * 2022-04-11 2023-10-24 北京沃东天骏信息技术有限公司 Data processing method, device, electronic equipment and computer readable medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150074027A1 (en) * 2013-09-06 2015-03-12 Microsoft Corporation Deep Structured Semantic Model Produced Using Click-Through Data
CN104933143A (en) * 2015-06-18 2015-09-23 北京京东尚科信息技术有限公司 Method and device for acquiring recommended object
US20170032035A1 (en) * 2015-07-28 2017-02-02 Microsoft Technology Licensing, Llc Representation Learning Using Multi-Task Deep Neural Networks
CN108090807A (en) * 2017-12-13 2018-05-29 北京小度信息科技有限公司 Information recommendation method and device
CN108848152A (en) * 2018-06-05 2018-11-20 腾讯科技(深圳)有限公司 A kind of method and server of object recommendation
CN109062914A (en) * 2017-06-12 2018-12-21 东软集团股份有限公司 User's recommended method and device, storage medium and server
CN109344314A (en) * 2018-08-20 2019-02-15 腾讯科技(深圳)有限公司 A kind of data processing method, device and server
CN109508417A (en) * 2018-11-12 2019-03-22 北京三快在线科技有限公司 Method, apparatus, electronic equipment and the readable storage medium storing program for executing of recommended

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150074027A1 (en) * 2013-09-06 2015-03-12 Microsoft Corporation Deep Structured Semantic Model Produced Using Click-Through Data
CN104933143A (en) * 2015-06-18 2015-09-23 北京京东尚科信息技术有限公司 Method and device for acquiring recommended object
US20170032035A1 (en) * 2015-07-28 2017-02-02 Microsoft Technology Licensing, Llc Representation Learning Using Multi-Task Deep Neural Networks
CN109062914A (en) * 2017-06-12 2018-12-21 东软集团股份有限公司 User's recommended method and device, storage medium and server
CN108090807A (en) * 2017-12-13 2018-05-29 北京小度信息科技有限公司 Information recommendation method and device
CN108848152A (en) * 2018-06-05 2018-11-20 腾讯科技(深圳)有限公司 A kind of method and server of object recommendation
CN109344314A (en) * 2018-08-20 2019-02-15 腾讯科技(深圳)有限公司 A kind of data processing method, device and server
CN109508417A (en) * 2018-11-12 2019-03-22 北京三快在线科技有限公司 Method, apparatus, electronic equipment and the readable storage medium storing program for executing of recommended

Cited By (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111966886A (en) * 2019-05-20 2020-11-20 北京京东尚科信息技术有限公司 Object recommendation method, object recommendation device, electronic equipment and storage medium
CN110717539B (en) * 2019-10-08 2025-01-07 腾讯科技(深圳)有限公司 Dimensionality reduction model training method, retrieval method and device based on artificial intelligence
CN110717539A (en) * 2019-10-08 2020-01-21 腾讯科技(深圳)有限公司 Artificial intelligence-based dimensionality reduction model training method, retrieval method and device
CN112749296A (en) * 2019-10-31 2021-05-04 北京达佳互联信息技术有限公司 Video recommendation method and device, server and storage medium
CN112749296B (en) * 2019-10-31 2024-01-26 北京达佳互联信息技术有限公司 Video recommendation method, device, server and storage medium
CN112819492A (en) * 2019-11-15 2021-05-18 北京达佳互联信息技术有限公司 An advertisement recommendation method, device and electronic device
CN111010595A (en) * 2019-12-25 2020-04-14 广州欢聊网络科技有限公司 New program recommendation method and device
CN111339240A (en) * 2020-02-10 2020-06-26 北京达佳互联信息技术有限公司 Object recommendation method and device, computing equipment and storage medium
CN111339240B (en) * 2020-02-10 2024-04-26 北京达佳互联信息技术有限公司 Object recommendation method, device, computing equipment and storage medium
CN111368034A (en) * 2020-03-02 2020-07-03 复旦大学 Two-way semantic feature matching method and supply content recommendation device
CN113449739A (en) * 2020-03-26 2021-09-28 阿里巴巴集团控股有限公司 Data processing method, device and system
CN111488526B (en) * 2020-04-14 2024-04-05 北京声智科技有限公司 Recommendation method and device
CN111488526A (en) * 2020-04-14 2020-08-04 北京声智科技有限公司 Recommendation method and device
CN111553763B (en) * 2020-04-26 2023-03-24 上海风秩科技有限公司 Article recommendation method and device, electronic equipment and readable storage medium
CN111553763A (en) * 2020-04-26 2020-08-18 上海风秩科技有限公司 Article recommendation method and device, electronic equipment and readable storage medium
CN111597446B (en) * 2020-05-13 2022-09-09 腾讯科技(深圳)有限公司 Content pushing method and device based on artificial intelligence, server and storage medium
CN111597446A (en) * 2020-05-13 2020-08-28 腾讯科技(深圳)有限公司 Content pushing method and device based on artificial intelligence, server and storage medium
CN113674008A (en) * 2020-05-14 2021-11-19 北京达佳互联信息技术有限公司 Targeted label recommendation method, device, server and storage medium
CN113469773A (en) * 2020-06-05 2021-10-01 海信集团有限公司 Intelligent terminal and object recommendation method
CN111681085A (en) * 2020-06-10 2020-09-18 创新奇智(成都)科技有限公司 Commodity pushing method and device, server and readable storage medium
CN111815403A (en) * 2020-06-19 2020-10-23 北京石油化工学院 Product recommendation method, device and terminal equipment
CN111815403B (en) * 2020-06-19 2024-05-10 北京石油化工学院 Commodity recommendation method and device and terminal equipment
CN111708950A (en) * 2020-06-22 2020-09-25 腾讯科技(深圳)有限公司 Content recommendation method and device and electronic equipment
CN111708950B (en) * 2020-06-22 2023-08-29 腾讯科技(深圳)有限公司 Content recommendation method and device and electronic equipment
CN113947417A (en) * 2020-07-15 2022-01-18 上海哔哩哔哩科技有限公司 Training method and device of age identification model and age identification method and device
US12437502B2 (en) * 2020-07-16 2025-10-07 Beijing Bytedance Network Technology Co., Ltd. Method for generating objective function, apparatus, electronic device and computer readable medium
US20230281956A1 (en) * 2020-07-16 2023-09-07 Beijing Bytedance Network Technology Co., Ltd. Method for generating objective function, apparatus, electronic device and computer readable medium
CN111932445B (en) * 2020-07-27 2024-07-16 广州市百果园信息技术有限公司 Compression method, device and system for style migration network
CN111932445A (en) * 2020-07-27 2020-11-13 广州市百果园信息技术有限公司 Compression method for style migration network and style migration method, device and system
CN114065014B (en) * 2020-07-31 2024-08-13 北京达佳互联信息技术有限公司 Information matching method, device, equipment and storage medium
CN114065014A (en) * 2020-07-31 2022-02-18 北京达佳互联信息技术有限公司 Information matching method, device, equipment and storage medium
CN112070586B (en) * 2020-09-09 2023-11-28 腾讯科技(深圳)有限公司 Item recommendation method and device based on semantic recognition, computer equipment and medium
CN112070586A (en) * 2020-09-09 2020-12-11 腾讯科技(深圳)有限公司 Article recommendation method and device based on semantic recognition, computer equipment and medium
CN116075843A (en) * 2020-10-13 2023-05-05 谷歌有限责任公司 Methods, systems and media for identifying relevant content
CN112287225B (en) * 2020-10-29 2023-09-08 北京奇艺世纪科技有限公司 Object recommendation method and device
CN112287225A (en) * 2020-10-29 2021-01-29 北京奇艺世纪科技有限公司 Object recommendation method and device
CN112579889A (en) * 2020-12-07 2021-03-30 北京百度网讯科技有限公司 Article recommendation method and device, electronic equipment and storage medium
CN112487300B (en) * 2020-12-18 2024-06-07 上海众源网络有限公司 Video recommendation method and device, electronic equipment and storage medium
CN112487300A (en) * 2020-12-18 2021-03-12 上海众源网络有限公司 Video recommendation method and device, electronic equipment and storage medium
CN114693011B (en) * 2020-12-25 2025-08-15 航天信息股份有限公司 Policy matching method, device, equipment and medium
CN114693011A (en) * 2020-12-25 2022-07-01 航天信息股份有限公司 A policy matching method, apparatus, device and medium
CN115033719A (en) * 2021-03-05 2022-09-09 北京奇虎科技有限公司 Multimedia information recommendation method, equipment, storage medium and device
CN112989022A (en) * 2021-03-16 2021-06-18 中国平安人寿保险股份有限公司 Intelligent virtual text selection method and device and computer equipment
CN112818251A (en) * 2021-04-13 2021-05-18 腾讯科技(深圳)有限公司 Video recommendation method and device, electronic equipment and storage medium
CN113722583A (en) * 2021-07-31 2021-11-30 华为技术有限公司 Recommendation method, recommendation model training method and related products
CN116049529A (en) * 2021-10-27 2023-05-02 北京字节跳动网络技术有限公司 Content recommendation method, device, medium and electronic equipment
CN114049172A (en) * 2021-11-08 2022-02-15 土巴兔集团股份有限公司 Item recommendation sorting method and device, computer equipment and storage medium
CN114049172B (en) * 2021-11-08 2025-10-10 土巴兔集团股份有限公司 Item recommendation sorting method, device, computer equipment and storage medium
CN116932533A (en) * 2022-04-11 2023-10-24 北京沃东天骏信息技术有限公司 Data processing method, device, electronic equipment and computer readable medium
CN115331414A (en) * 2022-10-13 2022-11-11 江苏濠玥电子科技有限公司 Temperature early warning method for zirconium oxide ceramic nozzle workpiece

Also Published As

Publication number Publication date
CN110232152B (en) 2021-03-23

Similar Documents

Publication Publication Date Title
CN110232152A (en) Content recommendation method, device, server and storage medium
CN111444428B (en) Information recommendation method and device based on artificial intelligence, electronic equipment and storage medium
Chen et al. Research on personalized recommendation hybrid algorithm for interactive experience equipment
CN107832338B (en) Method and system for recognizing core product words
CN110532479A (en) A kind of information recommendation method, device and equipment
WO2020211566A1 (en) Method and device for making recommendation to user, computing apparatus, and storage medium
WO2023108993A1 (en) Product recommendation method, apparatus and device based on deep clustering algorithm, and medium
CN109359244A (en) Personalized information recommendation method and device
CN117609479B (en) Model processing method, device, equipment, medium and product
CN116975615A (en) Task prediction method and device based on video multi-modal information
CN114186013A (en) Entity recognition model hot update method and its device, equipment, medium and product
JP7773677B2 (en) Content recommendation method, apparatus, device, and program
CN113806588A (en) Method and device for searching video
CN119760165B (en) Multi-mode image retrieval method, system, device, medium and program product
CN113392312A (en) Information processing method and system and electronic equipment
CN117951386A (en) Keyword recommendation method and device, electronic equipment and readable storage medium
Lee et al. Efficient image retrieval using advanced SURF and DCD on mobile platform
CN116975346A (en) Method, apparatus, device, medium and program product for generating tag map data
CN115600017A (en) Feature coding model training method and device and media object recommendation method and device
Ma et al. Content Feature Extraction-based Hybrid Recommendation for Mobile Application Services.
CN111291829A (en) Automatic determination method and system for selected pictures
CN114329004A (en) Digital fingerprint generation method, digital fingerprint generation device, data push method, data push device and storage medium
Su et al. Hybrid recommender system based on deep learning model
Hao et al. CFDIL: a context-aware feature deep interaction learning for app recommendation: Q. Hao et al.
CN119939032A (en) A content recommendation method based on cross-device behavior pattern analysis and dynamic profiling

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant