Detailed Description
The present application is described in further detail below with reference to the attached figures.
In a typical configuration of the present application, the terminal, the device serving the network, and the trusted party each include one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer readable media does not include non-transitory computer readable media (transient media), such as modulated data signals and carrier waves.
FIG. 1 shows a flowchart of a method for constructing a user representation, according to one embodiment of the present application, including step S11, step S12, and step S13.
Specifically, in step S11, device 1 acquires application usage record information of the application used by the user on the user device; in step S12, the device 1 determines one or more pieces of application label information corresponding to the user according to the application usage record information; in step S13, the device 1 constructs user representation information of the user based on the one or more application label information.
Here, the device 1 includes, but is not limited to, a user device, a network device, or a device formed by integrating a user device and a network device through a network. The user equipment includes, but is not limited to, any mobile electronic product, such as a smart phone, a tablet computer, and the like, which can perform human-computer interaction with a user through a touch panel, and the mobile electronic product may employ any operating system, such as an android operating system, an iOS operating system, and the like. The network device includes an electronic device capable of automatically performing numerical calculation and information processing according to preset or stored instructions, and the hardware includes but is not limited to a microprocessor, an Application Specific Integrated Circuit (ASIC), a programmable gate array (FPGA), a Digital Signal Processor (DSP), an embedded device, and the like. The network device comprises but is not limited to a computer, a network host, a single network server, a plurality of network server sets or a cloud formed by a plurality of servers; here, the Cloud is composed of a large number of computers or web servers based on Cloud Computing (Cloud Computing), which is a kind of distributed Computing, one virtual supercomputer consisting of a collection of loosely coupled computers. Including, but not limited to, the internet, a wide area network, a metropolitan area network, a local area network, a VPN network, a wireless Ad Hoc network (Ad Hoc network), etc. Preferably, the device 1 may also be a script program running on the user device, the network device, or a device formed by integrating the user device and the network device, the touch terminal, or the network device and the touch terminal through a network. Of course, those skilled in the art will appreciate that the above-described apparatus 1 is merely exemplary, and that other existing or future existing apparatus 1, as may be suitable for use in the present application, are also intended to be encompassed within the scope of the present application and are hereby incorporated by reference.
In step S11, the device 1 acquires application use record information of the application used by the user on the user device.
For example, the application usage record information may include the name or identification information of the application, and information of the number of times of use, the length of time of use per time, the usage consumption flow, and the like, in a period of time (one day, one week, one half month, one month, and the like). Here, the data collection may be performed through an APP installed on the user equipment to acquire the application usage record information, or the application usage record information may be acquired from a third party.
In step S12, the device 1 determines one or more pieces of application label information corresponding to the user according to the application usage record information.
In the mobile user, each user basically uses one user device independently, each user device has a unique identification (such as an imei number or a mac address), and the application use record information can reflect the real characteristics of the user relatively truly. And labeling the users with different characteristics according to the application use record information, and determining one or more pieces of application label information corresponding to the users.
Preferably, in step S12, the device 1 queries application library information according to the application usage record information, and determines one or more pieces of application tag information corresponding to the user.
Here, the application library information includes information such as category, description, price, download amount, etc. of an application, which can be obtained from a network, such as an application market (Google Play, iOS app store, etc.) can provide.
Preferably, the application tag information includes at least any one of: applying installation tag information; applying active tag information; installing tag information based on the application of the application theme; application activity tag information based on application topics.
For example, the application installation tag information is determined based on applications installed by a user, and the application active tag information is determined based on the applications installed by the user and the usage frequency (the number of usage times in a time period, the time length of each usage, the usage consumption flow, and the like) of each application. The application installation tag information comprises application installation tag information based on an application theme, and the application activity tag information comprises application activity tag information based on an application theme. The application theme is used to compare specific information that embodies the application in its entirety.
Preferably, the application installation tag information includes at least any one of: application installation tag information based on category, application installation tag information based on price; the application active tag information comprises at least any one of: category-based application activity label information, price-based application activity label information.
For example, according to the application types installed by the user, the number of the application installations of different types is counted, and one or more types of application installation label information corresponding to the user is determined according to the information, and the weight of each application is the same in the process. Suppose that user a has m mobile applications installed on the mobile phone, and belongs to k different categories: category 1, category 2 … category k, the corresponding number of applications being: c1, c2 … ck; then the category-based application installation tag information for user a may be: [ "class 1": c1, "class 2": c2, …, "class k": ck ]; alternatively, the number of applications may also use a normalized value, for example, the weight of the category 1 may be C1/C, where C is C1+ C2+ … + ck, which is the total number of installed applications.
For another example, according to price information of an application installed by a user, one or more pieces of price-based application installation tag information corresponding to the user are determined, and then the installation situation of the user on the paid application is analyzed, so that the attitude and the economic condition of the user on the paid application can be reflected to a certain extent. The price-based application installation tag information for a user may be [ "pay": c1, "free": c2, "pay for use": 0 or 1], wherein, c1 installs the number of free applications for the user, c2 installs the number of paid applications for the user, and the value of the pay label is determined to be 1 or 0 according to whether the user applies the paid applications.
For another example, according to the category/price information of the application used by the user and the corresponding usage frequency (the number of usage times in a time period, the duration of each usage, the usage consumption flow rate, etc.), one or more category/price-based application active label information corresponding to the user is determined, in the process, the weight of each application used by the user is different, and the weight value is proportional to the usage frequency of the application. Assume that user B's category-based application activity tag information may be: [ "category 1": T1, "category 2": T2, …, "category k": tk ], where "category 1" is the category of the application used by the user, and T1 is the usage traffic of the application used by the user for a period of time, where the weight may also be a normalized value T1/T, where T is T1+ T2+ … + tk. Alternatively, the category-based application activity tag information of user B may also be: [ "category 1": n1, "category 2": n2, …, "category k": nk ], where "category 1" is a category corresponding to an application used by a user, and n1 is the number of times the application of the category is used for a period of time, or takes a normalized value. In the process, different time period selections can reflect different application use interests of the user in the time period selections. A time range such as three months, half a year, etc. may be selected to obtain the long term active category label of the user, a time range such as one month, half a month, etc. may be selected to obtain the medium term active category label of the user, and a time range such as one week, one day, two days, etc. may be selected to obtain the short term active category label of the user.
More preferably, the application tag information includes application installation tag information or application active tag information based on an application theme; in step S12, the device 1 obtains an application theme vector corresponding to the application on the user equipment according to the application usage record information; and determining the application installation tag information or the application active tag information based on the application theme corresponding to the user according to one or more theme key words in the application theme vector.
With the increase of the number of applications and the abundance of application functions, it is more and more difficult for developers to classify the applications, and the specific information of the applications cannot be comprehensively reflected to a certain extent by the category information. Aiming at the possibility that the application category division and the like are inaccurate, large in roughness and small in information amount and one application may have multiple categories, the application installation label information and/or the application active label information based on the application theme are provided, and the application theme is used for comprehensively embodying the specific information of the application. For example, an application topic vector corresponding to a social-class application may be: [ "communication": 0.1, "short message": 0.15, "speech": 0.2, "video": 0.15, "chat": 0.4], an application theme vector for a motion class application may be: [ "basketball": 0.1, "running": 0.2, "calories": 0.2, "weight loss": 0.1, "sport": 0.4], the keywords (such as correspondence, basketball, etc.) are the main labels constituting the application theme, and the weights (such as 0.1, 0.2, etc.) represent the frequency of occurrence of the corresponding keywords in the application theme.
In step S13, the device 1 constructs user representation information of the user based on the one or more application label information.
For example, the user profile information of the user may be constructed based on all application tag information, or may be constructed based on some (e.g., a number of application tag information with weights greater than corresponding thresholds) of the application tag information.
Preferably, in step S13, the device 1 constructs user representation information of the user based on the target application scene of the user representation and the one or more application tag information.
For example, the target application scenarios include, but are not limited to: and (4) inspecting the scenes related to the installation of the user APP and inspecting the scenes of the preference of the user APP.
Preferably, in step S13, the device 1 constructs user representation information of the user based on a target application scene of the user representation and the one or more application tag information, wherein the user representation information includes the application tag information matching the target application scene.
For example, if the target application scenario is related to a short-term behavior of a user, the application active tag information in a time range of one week, one day, two days, etc. may be selected; if the target application scene is related to the user middle-term behavior, the application active label information in the time range of one month, half month and the like can be selected; if the target application scenario is associated with a long-term behavior of a user, the application active tag information may be selected within a time range such as three months, half a year, and the like. That is, the selected application tag information should match the target scene.
Preferably, in step S13, the device 1 constructs user representation information of the user based on a target application scene of the user representation and the one or more application tag information, wherein the application tag information matching the target application scene in the user representation information is weighted higher than other application tag information in the user representation information.
For example, when the target application scenario is a scenario related to installation of an APP of an investigation user, a relatively large weight may be given to the application installation tag information; when the target application scenario is a scenario for considering the preference of the user APP, a relatively large weight may be given to the application active tag information.
Preferably, the method further comprises: the device 1 provides application information or presentation information matching the user representation information to the user device.
For example, application information or presentation information that matches the user profile information may include application recommendation information, news, merchandise advertisement information, etc. that may be of interest to the user.
Fig. 2 shows a flowchart of a method for determining application tag information of a user according to another embodiment of the present application, the method including step S25, step S26, and step S27.
Specifically, in step S25, the device 2 acquires application usage record information of the application used by the user on the user device; in step S26, the device 2 obtains an application theme vector corresponding to the application on the user equipment according to the application usage record information; in step S27, the device 2 determines the application tag information corresponding to the user according to one or more topic keywords in the application topic vector.
Here, the device 2 includes, but is not limited to, a user device, a network device, or a device formed by integrating a user device and a network device through a network. The user equipment includes, but is not limited to, any mobile electronic product, such as a smart phone, a tablet computer, and the like, which can perform human-computer interaction with a user through a touch panel, and the mobile electronic product may employ any operating system, such as an android operating system, an iOS operating system, and the like. The network device includes an electronic device capable of automatically performing numerical calculation and information processing according to preset or stored instructions, and the hardware includes but is not limited to a microprocessor, an Application Specific Integrated Circuit (ASIC), a programmable gate array (FPGA), a Digital Signal Processor (DSP), an embedded device, and the like. The network device comprises but is not limited to a computer, a network host, a single network server, a plurality of network server sets or a cloud formed by a plurality of servers; here, the Cloud is composed of a large number of computers or web servers based on Cloud Computing (Cloud Computing), which is a kind of distributed Computing, one virtual supercomputer consisting of a collection of loosely coupled computers. Including, but not limited to, the internet, a wide area network, a metropolitan area network, a local area network, a VPN network, a wireless Ad Hoc network (Ad Hoc network), etc. Preferably, the device 2 may also be a script program running on the user device, the network device, or a device formed by integrating the user device and the network device, the touch terminal, or the network device and the touch terminal through a network. Of course, those skilled in the art will appreciate that the above-described apparatus 2 is merely exemplary, and that other existing or future existing apparatus 2, as may be suitable for use in the present application, are also intended to be encompassed within the scope of the present application and are hereby incorporated by reference.
In step S25, the device 2 acquires application use record information of the application used by the user on the user device.
For example, the application usage record information may include the name or identification information of the application, and information of the number of times of use, the length of time of use per time, the usage consumption flow, and the like, in a period of time (one day, one week, one half month, one month, and the like). Here, the data collection may be performed through an APP installed on the user equipment to acquire the application usage record information, or the application usage record information may be acquired from a third party.
In step S26, the device 2 obtains an application theme vector corresponding to the application on the user equipment according to the application usage record information.
Here, the application on the user equipment (i.e. the application currently installed by the user) may be determined according to the application usage record information, and then the application theme vector corresponding to the application theme vector may be obtained. For example, an application topic vector corresponding to a social-class application may be: [ "communication": 0.1, "short message": 0.15, "speech": 0.2, "video": 0.15, "chat": 0.4], an application theme vector for a motion class application may be: [ "basketball": 0.1, "running": 0.2, "calories": 0.2, "weight loss": 0.1, "sport": 0.4], the keywords (such as correspondence, basketball, etc.) are the main labels constituting the application theme, and the weights (such as 0.1, 0.2, etc.) represent the frequency of occurrence of the corresponding keywords in the application theme.
Preferably, in step S26, the device 2 obtains application description information corresponding to the application on the user equipment according to the application usage record information; and generating an application theme vector of the corresponding application according to the application description information, wherein the application theme vector comprises one or more theme keywords.
For example, the application description information may be obtained from a network, such as may be provided by an application market (Google Play, iOS app store, etc.). The application description information contains more information, including more keywords and function descriptions, relative to the category of the application.
More preferably, the generating an application theme vector of a corresponding application according to the application description information includes: performing word segmentation processing on the application description information to obtain a plurality of topic keywords; and performing a clustering algorithm on the plurality of topic keywords, and determining an application topic vector corresponding to application, wherein the application topic vector comprises one or more topic keywords.
For example, the application description information is processed, a sentence is segmented, and stop words without information, punctuation marks, website information, email addresses and the like are removed; and setting the number of topic models, wherein each topic model corresponds to one application topic vector, and clustering the topic models by adopting an LDA (Latent Dirichlet Allocation, an unsupervised machine learning technology and can be used for identifying Latent topic information in a large-scale document set or a corpus) topic model algorithm to obtain model keywords and weight.
Of course, those skilled in the art should understand that the LDA topic model algorithm described above is merely an example, and other existing or hereafter-presented algorithms, such as LSI (Latent Semantic Indexing) algorithms, as applicable to the present application, are also included within the scope of the present application and are hereby incorporated by reference.
In step S27, the device 2 determines the application tag information corresponding to the user according to one or more topic keywords in the application topic vector.
For example, the application tag information corresponding to the user may be determined according to all topic keywords in the application topic vector, or the application tag information corresponding to the user may be determined according to a plurality of preferred (for example, higher-weight) topic keywords in the application topic vector.
Preferably, in step S27, the device 2 determines, according to the application usage record information and one or more topic keywords in the application topic vector, application installation tag information and/or application active tag information corresponding to the user.
Here, the application tag information includes application installation tag information and/or application active tag information. The application installation tag information is determined based on applications installed by a user, and the application active tag information is determined based on the applications installed by the user and the use frequency (the use times in a time period, the use time length of each time, the use consumption flow and the like) of each application.
For example, the process of determining the application installation tag information may include: obtaining the topic key words of each application and the corresponding application topic vectors, selecting a threshold value k, wherein each topic only takes the k key words with the maximum weight, and one topic can be simplified as [ w [, n [ ]1:v1,w1:v1…wk:vk]Wherein w is1…kIs a keyword, v1…kIs the weight of the keyword. Tagging a user according to the keywords of the application, for example, if m applications are installed in one user a, the corresponding tags are m × k tags, and the weight of each tag is the keyword weight [ w [ ]11:v11,…w1k:v1k,w21:v21,…w2k:v2k,…wm1:vm1,…wmk:vmk]Wherein w isijFor the jth keyword, v, of the ith applicationijIs the corresponding weight; when the same keyword appears, the keywords can be combined, and the weights of the two keywords are added to be used as a new weight.
For another example, the process of determining the application active tag information may include: and adding labels to the user and calculating corresponding label weights according to the applied topic keywords and the weights based on the use flow or use times of the user application. Each topic only takes the k keywords with the maximum weight, and if one user B uses m applications in a time period, the corresponding labels have m multiplied by k, and the weight of each label is the weight [ w ] of the keywords11:v11×t1,…w1k:v1k×tk,w21:v21×t2,…w2k:v2k×t2,…wm1:vm1×tm,…wmk:vmk×tm]Wherein w isijFor the jth keyword, v, of the ith applicationijIs the corresponding weight, tiUsing the flow or number of times for the first application; when the same keywords appear, the keywords can be combined, and the weights of the two keywords are multiplied by the corresponding flow and then added to form a new weight.
FIG. 3 shows an apparatus 1 for composing a user representation according to an embodiment of the present application, said apparatus 1 comprising first means 11, second means 12 and third means 13.
Specifically, the first device 11 obtains application usage record information of an application used by a user on user equipment; the second device 12 determines one or more pieces of application tag information corresponding to the user according to the application use record information; the third means 13 constructs user representation information for the user based on the one or more application tag information.
Here, the device 1 includes, but is not limited to, a user device, a network device, or a device formed by integrating a user device and a network device through a network. The user equipment includes, but is not limited to, any mobile electronic product, such as a smart phone, a tablet computer, and the like, which can perform human-computer interaction with a user through a touch panel, and the mobile electronic product may employ any operating system, such as an android operating system, an iOS operating system, and the like. The network device includes an electronic device capable of automatically performing numerical calculation and information processing according to preset or stored instructions, and the hardware includes but is not limited to a microprocessor, an Application Specific Integrated Circuit (ASIC), a programmable gate array (FPGA), a Digital Signal Processor (DSP), an embedded device, and the like. The network device comprises but is not limited to a computer, a network host, a single network server, a plurality of network server sets or a cloud formed by a plurality of servers; here, the Cloud is composed of a large number of computers or web servers based on Cloud Computing (Cloud Computing), which is a kind of distributed Computing, one virtual supercomputer consisting of a collection of loosely coupled computers. Including, but not limited to, the internet, a wide area network, a metropolitan area network, a local area network, a VPN network, a wireless Ad Hoc network (Ad Hoc network), etc. Preferably, the device 1 may also be a script program running on the user device, the network device, or a device formed by integrating the user device and the network device, the touch terminal, or the network device and the touch terminal through a network. Of course, those skilled in the art will appreciate that the above-described apparatus 1 is merely exemplary, and that other existing or future existing apparatus 1, as may be suitable for use in the present application, are also intended to be encompassed within the scope of the present application and are hereby incorporated by reference.
The first device 11 obtains application usage record information of an application used by a user on a user equipment.
For example, the application usage record information may include the name or identification information of the application, and information of the number of times of use, the length of time of use per time, the usage consumption flow, and the like, in a period of time (one day, one week, one half month, one month, and the like). Here, the data collection may be performed through an APP installed on the user equipment to acquire the application usage record information, or the application usage record information may be acquired from a third party.
And the second device 12 determines one or more pieces of application tag information corresponding to the user according to the application use record information.
In the mobile user, each user basically uses one user device independently, each user device has a unique identification (such as an imei number or a mac address), and the application use record information can reflect the real characteristics of the user relatively truly. And labeling the users with different characteristics according to the application use record information, and determining one or more pieces of application label information corresponding to the users.
Preferably, the second device 12 queries application library information according to the application usage record information, and determines one or more pieces of application tag information corresponding to the user.
Here, the application library information includes information such as category, description, price, download amount, etc. of an application, which can be obtained from a network, such as an application market (Google Play, iOS app store, etc.) can provide.
Preferably, the application tag information includes at least any one of: applying installation tag information; applying active tag information; installing tag information based on the application of the application theme; application activity tag information based on application topics.
For example, the application installation tag information is determined based on applications installed by a user, and the application active tag information is determined based on the applications installed by the user and the usage frequency (the number of usage times in a time period, the time length of each usage, the usage consumption flow, and the like) of each application. The application installation tag information comprises application installation tag information based on an application theme, and the application activity tag information comprises application activity tag information based on an application theme. The application theme is used to compare specific information that embodies the application in its entirety.
Preferably, the application installation tag information includes at least any one of: application installation tag information based on category, application installation tag information based on price; the application active tag information comprises at least any one of: category-based application activity label information, price-based application activity label information.
For example, according to the application types installed by the user, the number of the application installations of different types is counted, and one or more types of application installation label information corresponding to the user is determined according to the information, and the weight of each application is the same in the process. Suppose that user a has m mobile applications installed on the mobile phone, and belongs to k different categories: category 1, category 2 … category k, the corresponding number of applications being: c1, c2 … ck; then the category-based application installation tag information for user a may be: [ "class 1": c1, "class 2": c2, …, "class k": ck ]; alternatively, the number of applications may also use a normalized value, for example, the weight of the category 1 may be C1/C, where C is C1+ C2+ … + ck, which is the total number of installed applications.
For another example, according to price information of an application installed by a user, one or more pieces of price-based application installation tag information corresponding to the user are determined, and then the installation situation of the user on the paid application is analyzed, so that the attitude and the economic condition of the user on the paid application can be reflected to a certain extent. The price-based application installation tag information for a user may be [ "pay": c1, "free": c2, "pay for use": 0 or 1], wherein, c1 installs the number of free applications for the user, c2 installs the number of paid applications for the user, and the value of the pay label is determined to be 1 or 0 according to whether the user applies the paid applications.
For another example, according to the category/price information of the application used by the user and the corresponding usage frequency (the number of usage times in a time period, the duration of each usage, the usage consumption flow rate, etc.), one or more category/price-based application active label information corresponding to the user is determined, in the process, the weight of each application used by the user is different, and the weight value is proportional to the usage frequency of the application. Assume that user B's category-based application activity tag information may be: [ "category 1": T1, "category 2": T2, …, "category k": tk ], where "category 1" is the category of the application used by the user, and T1 is the usage traffic of the application used by the user for a period of time, where the weight may also be a normalized value T1/T, where T is T1+ T2+ … + tk. Alternatively, the category-based application activity tag information of user B may also be: [ "category 1": n1, "category 2": n2, …, "category k": nk ], where "category 1" is a category corresponding to an application used by a user, and n1 is the number of times the application of the category is used for a period of time, or takes a normalized value. In the process, different time period selections can reflect different application use interests of the user in the time period selections. A time range such as three months, half a year, etc. may be selected to obtain the long term active category label of the user, a time range such as one month, half a month, etc. may be selected to obtain the medium term active category label of the user, and a time range such as one week, one day, two days, etc. may be selected to obtain the short term active category label of the user.
More preferably, the application tag information includes application installation tag information or application active tag information based on an application theme; the second device 12 obtains an application theme vector corresponding to the application on the user equipment according to the application usage record information; and determining the application installation tag information or the application active tag information based on the application theme corresponding to the user according to one or more theme key words in the application theme vector.
With the increase of the number of applications and the abundance of application functions, it is more and more difficult for developers to classify the applications, and the specific information of the applications cannot be comprehensively reflected to a certain extent by the category information. Aiming at the possibility that the application category division and the like are inaccurate, large in roughness and small in information amount and one application may have multiple categories, the application installation label information and/or the application active label information based on the application theme are provided, and the application theme is used for comprehensively embodying the specific information of the application. For example, an application topic vector corresponding to a social-class application may be: [ "communication": 0.1, "short message": 0.15, "speech": 0.2, "video": 0.15, "chat": 0.4], an application theme vector for a motion class application may be: [ "basketball": 0.1, "running": 0.2, "calories": 0.2, "weight loss": 0.1, "sport": 0.4], the keywords (such as correspondence, basketball, etc.) are the main labels constituting the application theme, and the weights (such as 0.1, 0.2, etc.) represent the frequency of occurrence of the corresponding keywords in the application theme.
The third means 13 constructs user representation information for the user based on the one or more application tag information.
For example, the user profile information of the user may be constructed based on all application tag information, or may be constructed based on some (e.g., a number of application tag information with weights greater than corresponding thresholds) of the application tag information.
Preferably, the third means 13 constructs user representation information of the user based on a target application scene of the user representation and the one or more application tag information.
For example, the target application scenarios include, but are not limited to: and (4) inspecting the scenes related to the installation of the user APP and inspecting the scenes of the preference of the user APP.
Preferably, the third means 13 constructs user representation information of the user based on a target application scene of the user representation and the one or more application tag information, wherein the user representation information comprises the application tag information matching the target application scene.
For example, if the target application scenario is related to a short-term behavior of a user, the application active tag information in a time range of one week, one day, two days, etc. may be selected; if the target application scene is related to the user middle-term behavior, the application active label information in the time range of one month, half month and the like can be selected; if the target application scenario is associated with a long-term behavior of a user, the application active tag information may be selected within a time range such as three months, half a year, and the like. That is, the selected application tag information should match the target scene.
Preferably, the third means 13 constructs user representation information of the user based on a target application scene of the user representation and the one or more application tag information, wherein the application tag information matching the target application scene in the user representation information is weighted higher than other application tag information in the user representation information.
For example, when the target application scenario is a scenario related to installation of an APP of an investigation user, a relatively large weight may be given to the application installation tag information; when the target application scenario is a scenario for considering the preference of the user APP, a relatively large weight may be given to the application active tag information.
Preferably, the apparatus 1 further comprises fourth means (not shown in the figures); the fourth device provides the application information or the presentation information matched with the user portrait information to the user equipment.
For example, application information or presentation information that matches the user profile information may include application recommendation information, news, merchandise advertisement information, etc. that may be of interest to the user.
Fig. 4 shows an apparatus 2 for determining application tag information of a user according to another embodiment of the present application, said apparatus 2 comprising fifth means 25, sixth means 26 and seventh means 27.
Specifically, the fifth device 25 obtains application usage record information of the application used by the user on the user equipment; the sixth device 26 obtains an application theme vector corresponding to the application on the user equipment according to the application usage record information; the seventh device 27 determines the application tag information corresponding to the user according to one or more topic keywords in the application topic vector.
Here, the device 2 includes, but is not limited to, a user device, a network device, or a device formed by integrating a user device and a network device through a network. The user equipment includes, but is not limited to, any mobile electronic product, such as a smart phone, a tablet computer, and the like, which can perform human-computer interaction with a user through a touch panel, and the mobile electronic product may employ any operating system, such as an android operating system, an iOS operating system, and the like. The network device includes an electronic device capable of automatically performing numerical calculation and information processing according to preset or stored instructions, and the hardware includes but is not limited to a microprocessor, an Application Specific Integrated Circuit (ASIC), a programmable gate array (FPGA), a Digital Signal Processor (DSP), an embedded device, and the like. The network device comprises but is not limited to a computer, a network host, a single network server, a plurality of network server sets or a cloud formed by a plurality of servers; here, the Cloud is composed of a large number of computers or web servers based on Cloud Computing (Cloud Computing), which is a kind of distributed Computing, one virtual supercomputer consisting of a collection of loosely coupled computers. Including, but not limited to, the internet, a wide area network, a metropolitan area network, a local area network, a VPN network, a wireless Ad Hoc network (Ad Hoc network), etc. Preferably, the device 2 may also be a script program running on the user device, the network device, or a device formed by integrating the user device and the network device, the touch terminal, or the network device and the touch terminal through a network. Of course, those skilled in the art will appreciate that the above-described apparatus 2 is merely exemplary, and that other existing or future existing apparatus 2, as may be suitable for use in the present application, are also intended to be encompassed within the scope of the present application and are hereby incorporated by reference.
The fifth means 25 obtains application usage record information of the application used by the user on the user equipment.
For example, the application usage record information may include the name or identification information of the application, and information of the number of times of use, the length of time of use per time, the usage consumption flow, and the like, in a period of time (one day, one week, one half month, one month, and the like). Here, the data collection may be performed through an APP installed on the user equipment to acquire the application usage record information, or the application usage record information may be acquired from a third party.
The sixth means 26 obtains an application theme vector corresponding to the application on the user equipment according to the application usage record information.
Here, the application on the user equipment (i.e. the application currently installed by the user) may be determined according to the application usage record information, and then the application theme vector corresponding to the application theme vector may be obtained. For example, an application topic vector corresponding to a social-class application may be: [ "communication": 0.1, "short message": 0.15, "speech": 0.2, "video": 0.15, "chat": 0.4], an application theme vector for a motion class application may be: [ "basketball": 0.1, "running": 0.2, "calories": 0.2, "weight loss": 0.1, "sport": 0.4], the keywords (such as correspondence, basketball, etc.) are the main labels constituting the application theme, and the weights (such as 0.1, 0.2, etc.) represent the frequency of occurrence of the corresponding keywords in the application theme.
Preferably, the sixth device 26 obtains the application description information corresponding to the application on the user equipment according to the application usage record information; and generating an application theme vector of the corresponding application according to the application description information, wherein the application theme vector comprises one or more theme keywords.
For example, the application description information may be obtained from a network, such as may be provided by an application market (Google Play, iOS app store, etc.). The application description information contains more information, including more keywords and function descriptions, relative to the category of the application.
More preferably, the generating an application theme vector of a corresponding application according to the application description information includes: performing word segmentation processing on the application description information to obtain a plurality of topic keywords; and performing a clustering algorithm on the plurality of topic keywords, and determining an application topic vector corresponding to application, wherein the application topic vector comprises one or more topic keywords.
For example, the application description information is processed, a sentence is segmented, and stop words without information, punctuation marks, website information, email addresses and the like are removed; and setting the number of topic models, wherein each topic model corresponds to one application topic vector, and clustering the topic models by adopting an LDA (Latent Dirichlet Allocation, an unsupervised machine learning technology and can be used for identifying Latent topic information in a large-scale document set or a corpus) topic model algorithm to obtain model keywords and weight.
Of course, those skilled in the art should understand that the LDA topic model algorithm described above is merely an example, and other existing or hereafter-presented algorithms, such as LSI (Latent Semantic Indexing) algorithms, as applicable to the present application, are also included within the scope of the present application and are hereby incorporated by reference.
The seventh device 27 determines the application tag information corresponding to the user according to one or more topic keywords in the application topic vector.
For example, the application tag information corresponding to the user may be determined according to all topic keywords in the application topic vector, or the application tag information corresponding to the user may be determined according to a plurality of preferred (for example, higher-weight) topic keywords in the application topic vector.
Preferably, the seventh device 27 determines the application installation tag information and/or the application active tag information corresponding to the user according to the application usage record information and one or more topic keywords in the application topic vector.
Here, the application tag information includes application installation tag information and/or application active tag information. The application installation tag information is determined based on applications installed by a user, and the application active tag information is determined based on the applications installed by the user and the use frequency (the use times in a time period, the use time length of each time, the use consumption flow and the like) of each application.
For example, the process of determining the application installation tag information may include: obtaining the topic key words of each application and the corresponding application topic vectors, selecting a threshold value k, wherein each topic only takes the k key words with the maximum weight, and one topic can be simplified as [ w [, n [ ]1:v1,w1:v1…wk:vk]Wherein w is1…kIs a keyword, v1…kIs the weight of the keyword. Tagging a user according to the keywords of the application, for example, if m applications are installed in one user a, the corresponding tags are m × k tags, and the weight of each tag is the keyword weight [ w [ ]11:v11,…w1k:v1k,w21:v21,…w2k:v2k,…wm1:vm1,…wmk:vmk]Wherein w isijFor the jth keyword, v, of the ith applicationijIs the corresponding weight; when the same keyword appears, the keywords can be combined, and the weights of the two keywords are added to be used as a new weight.
For another example, the process of determining the application active tag information may include: and adding labels to the user and calculating corresponding label weights according to the applied topic keywords and the weights based on the use flow or use times of the user application. Each topic only takes the k keywords with the maximum weight, and if one user B uses m applications in a time period, the corresponding labels have m multiplied by k, and the weight of each label is the weight [ w ] of the keywords11:v11×t1,…w1k:v1k×tk,w21:v21×t2,…w2k:v2k×t2,…wm1:vm1×tm,…wmk:vmk×tm]Wherein w isijFor the jth keyword, v, of the ith applicationijIs the corresponding weight, tiUsing the flow or number of times for the first application; when the same keyword appears, the keywords can be combined, and the weights of the two keywordsMultiplied by the corresponding flow rate and added as a new weight.
According to yet another aspect of the present application, there is provided a computer-readable storage medium comprising instructions that, when executed, cause a system to:
acquiring application use record information of an application used by a user on user equipment;
determining one or more pieces of application label information corresponding to the user according to the application use record information;
constructing user representation information for the user based on the one or more application tag information.
According to yet another aspect of the present application, there is provided a computer-readable storage medium comprising instructions that, when executed, cause a system to:
acquiring application use record information of an application used by a user on user equipment;
acquiring an application theme vector corresponding to the application on the user equipment according to the application use record information;
and determining the application label information corresponding to the user according to one or more topic keywords in the application topic vector.
According to yet another aspect of the application, there is provided an apparatus for constructing a user representation, wherein the apparatus comprises:
a processor; and
a memory arranged to store computer executable instructions that, when executed, cause the processor to:
acquiring application use record information of an application used by a user on user equipment;
determining one or more pieces of application label information corresponding to the user according to the application use record information;
constructing user representation information for the user based on the one or more application tag information.
According to still another aspect of the present application, there is provided an apparatus for determining application tag information of a user, wherein the apparatus includes:
a processor; and
a memory arranged to store computer executable instructions that, when executed, cause the processor to:
acquiring application use record information of an application used by a user on user equipment;
acquiring an application theme vector corresponding to the application on the user equipment according to the application use record information;
and determining the application label information corresponding to the user according to one or more topic keywords in the application topic vector.
Compared with the prior art, the method and the device have the advantages that application use record information of the application used by the user on the user equipment is obtained, one or more pieces of application label information corresponding to the user are determined according to the application use record information, and then the user portrait information of the user is constructed based on the one or more pieces of application label information; the application uses the user information such as record information for user's web browsing record, social network relation, news advertisement click record, has static stability, the data volume is little, the information volume is big advantage, therefore, the user portrait information that this application founds can more accurately define and discern the user. Further, the application label information comprises application installation label information and/or application active label information, and the user is labeled from different dimensions, so that more accurate user portrait information is constructed. Further, the application installation label information and/or the application activity label information based on the application theme are provided, richer differentiation label information is obtained, and the applications can be better classified.
It should be noted that the present application may be implemented in software and/or a combination of software and hardware, for example, implemented using Application Specific Integrated Circuits (ASICs), general purpose computers or any other similar hardware devices. In one embodiment, the software programs of the present application may be executed by a processor to implement the steps or functions described above. Likewise, the software programs (including associated data structures) of the present application may be stored in a computer readable recording medium, such as RAM memory, magnetic or optical drive or diskette and the like. Additionally, some of the steps or functions of the present application may be implemented in hardware, for example, as circuitry that cooperates with the processor to perform various steps or functions.
In addition, some of the present application may be implemented as a computer program product, such as computer program instructions, which when executed by a computer, may invoke or provide methods and/or techniques in accordance with the present application through the operation of the computer. Program instructions which invoke the methods of the present application may be stored on a fixed or removable recording medium and/or transmitted via a data stream on a broadcast or other signal-bearing medium and/or stored within a working memory of a computer device operating in accordance with the program instructions. An embodiment according to the present application comprises an apparatus comprising a memory for storing computer program instructions and a processor for executing the program instructions, wherein the computer program instructions, when executed by the processor, trigger the apparatus to perform a method and/or a solution according to the aforementioned embodiments of the present application.
It will be evident to those skilled in the art that the present application is not limited to the details of the foregoing illustrative embodiments, and that the present application may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the application being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference sign in a claim should not be construed as limiting the claim concerned. Furthermore, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. A plurality of units or means recited in the apparatus claims may also be implemented by one unit or means in software or hardware. The terms first, second, etc. are used to denote names, but not any particular order.