CN114791877A

CN114791877A - Recommendation algorithm testing method and advertisement recommendation system

Info

Publication number: CN114791877A
Application number: CN202210116486.XA
Authority: CN
Inventors: 李绍磊; 徐星
Original assignee: Alibaba China Co Ltd
Current assignee: Hangzhou Alibaba Overseas Internet Industry Co ltd
Priority date: 2022-02-07
Filing date: 2022-02-07
Publication date: 2022-07-26

Abstract

The present application provides a method for testing a recommendation algorithm and an advertising recommendation system. The method for testing a recommendation algorithm includes: constructing at least two algorithm testing experiments that are orthogonal to each other based on the recommendation algorithm to be tested, wherein, Each algorithm testing experiment includes at least two recommendation algorithms that need to be compared with each other; in each algorithm testing experiment, the target users in the experimental population are equally distributed to each recommendation algorithm in the algorithm testing experiment in a random manner, Wherein, each algorithm test experiment is based on the same experimental population; according to the recommendation algorithm corresponding to the target user, the recommended content to be pushed to each of the target users is generated; the target users in the experimental population are collected for the recommended content Feedback operation data to evaluate the algorithm quality of each recommendation algorithm.

Description

Recommendation algorithm testing method and advertisement recommendation system

Technical Field

The application relates to the field of data processing, in particular to a recommendation algorithm testing method and an advertisement recommendation system, and also relates to a recommendation algorithm testing device, electronic equipment and a computer storage medium.

Background

With the development of internet technology, the amount of information carried on the network is more and more abundant, and related information recommendation is gradually becoming a trend.

In the course of a particular application, the information recommended to the user is typically calculated by some recommendation algorithm. In order to obtain a suitable recommendation algorithm to send recommendation information to a user, the prior art generally adopts an AB test experiment method to test the recommendation algorithm, and evaluates the quality of the recommendation algorithm according to the test result. Specifically, the user is randomly divided into a group A and a group B, and two recommendation algorithms which are mutually contrasted are respectively used for distinguishing the advantages and disadvantages of the recommendation algorithms through comparison of test experiments, so that the more excellent recommendation algorithm is selected.

At present, various recommendation strategies and recommendation algorithms generated along with the recommendation strategies are greatly emerged, and the recommendation algorithms need to be tested and tested; in addition, each recommendation algorithm may need to adjust some key parameters, and each adjustment often needs to be tested; under the condition that the test experiment demand is so vigorous, the test through a special test platform becomes a conventional choice, however, how to quickly realize the test of various recommended algorithms is difficult to realize under the current technical conditions.

Disclosure of Invention

The application provides a method for testing a recommendation algorithm and an advertisement recommendation system, which are used for solving the technical problems in the background art.

The method for testing the recommendation algorithm provided by the application comprises the following steps:

constructing at least two algorithm test experiments which are in an orthogonal relation with each other based on the recommendation algorithms to be tested, wherein each algorithm test experiment comprises at least two recommendation algorithms which need to be compared with each other;

in each algorithm test experiment, target users participating in an experiment group are evenly distributed to each recommendation algorithm of the algorithm test experiment in a random mode, wherein each algorithm test experiment is based on the same experiment group;

generating recommended contents pushed to each target user according to a recommendation algorithm corresponding to the target user;

and collecting data of feedback operation of each target user aiming at the recommended content in the experimental population, and evaluating the algorithm quality of each recommendation algorithm.

Optionally, in the step of averagely allocating the target users in the experiment group to each recommendation algorithm of the algorithm test experiment, algorithm labels are allocated to the target users according to the recommendation algorithm to which the target users are allocated;

the step of generating the recommended content pushed to each target user according to the recommendation algorithm corresponding to the target user includes:

obtaining an algorithm label of the target user;

classifying the target user into a target group of a corresponding recommendation algorithm according to the algorithm label of the target user; for a target user, more than one recommended algorithm target population which can belong to different algorithm test experiments;

and each advertisement recommendation algorithm pushes the recommended content in the target crowd belonging to the recommendation algorithm.

Optionally, the method further includes:

obtaining a promotion budget set for recommended content, and averagely distributing the promotion budget to each recommendation algorithm of each algorithm test experiment, wherein each algorithm test experiment corresponds to the total promotion budget;

and correspondingly deducting the promotion cost of the recommendation algorithm corresponding to the target user corresponding to the feedback operation of the target user on the recommendation content.

Optionally, the method includes:

obtaining a preset popularization plan;

selecting the recommendation algorithm to be tested according to the promotion plan;

constructing a corresponding algorithm test experiment according to the selected to-be-tested recommendation algorithm corresponding to the promotion plan and the constraint conditions provided by the promotion plan; the algorithm test experiment can be one or more corresponding to the promotion plan.

Optionally, the collecting feedback operations of the target users in the experimental population for the recommended content includes obtaining a recommendation algorithm corresponding to the target users.

Optionally, each algorithm test experiment includes a reference recommendation algorithm.

Optionally, the generating, according to the recommendation algorithm corresponding to the target user, the recommendation content pushed to each target user includes:

determining a preset time period of an algorithm test experiment;

randomly distributing each target user in the participating experiment crowd to each time point of the preset time period;

and when the test time reaches the time point corresponding to the target user, generating recommendation content pushed to the target user according to a recommendation algorithm corresponding to the target user.

The present application also provides an advertisement recommendation system, comprising:

the advertisement main module is used for constructing at least two algorithm test experiments which are in an orthogonal relation with each other based on recommendation algorithms to be tested, wherein each algorithm test experiment comprises at least two recommendation algorithms which need to be compared with each other;

the flow distribution module is used for distributing target users participating in the experimental population to each recommended algorithm of the algorithm testing experiment in each algorithm testing experiment in an average manner, wherein each algorithm testing experiment is based on the same experimental population;

and the advertisement recommending module is used for generating recommended advertisements pushed to the target users according to the recommending algorithm corresponding to the target users.

The application also provides a testing device for the recommendation algorithm, which comprises:

the system comprises an experiment construction unit, a test unit and a test unit, wherein the experiment construction unit is used for constructing at least two algorithm test experiments which are in an orthogonal relation with each other based on recommendation algorithms to be tested, and each algorithm test experiment comprises at least two recommendation algorithms which need to be compared with each other;

the system comprises a user distribution unit, a recommendation unit and a recommendation unit, wherein the user distribution unit is used for distributing target users participating in an experimental group to recommended algorithms of the algorithm testing experiment in each algorithm testing experiment in an average manner, and each algorithm testing experiment is based on the same experimental group;

the content recommendation unit is used for generating recommendation content pushed to each target user according to a recommendation algorithm corresponding to the target user;

and the algorithm evaluation unit is used for collecting data of feedback operation of each target user aiming at the recommended content in the experimental population and evaluating the algorithm quality of each recommended algorithm.

This application provides an electronic equipment simultaneously, includes:

a processor;

a memory for storing a program of the method, which program, when read and executed by the processor, performs the steps of: constructing at least two algorithm test experiments which are in an orthogonal relation with each other based on the recommendation algorithms to be tested, wherein each algorithm test experiment comprises at least two recommendation algorithms which need to be compared with each other; in each algorithm test experiment, target users participating in the experiment population are evenly distributed to recommended algorithms of the algorithm test experiment, wherein each algorithm test experiment is based on the same experiment population; generating recommended contents pushed to each target user according to a recommendation algorithm corresponding to the target user; and collecting data of feedback operation of each target user aiming at the recommended content in the experimental population, and evaluating the algorithm quality of each recommendation algorithm.

The present application also provides a computer storage medium storing a computer program that, when executed, performs the steps of:

constructing at least two algorithm test experiments which are in an orthogonal relation with each other based on the recommendation algorithms to be tested, wherein each algorithm test experiment comprises at least two recommendation algorithms which need to be compared with each other; in each algorithm test experiment, target users participating in the experiment population are evenly distributed to recommended algorithms of the algorithm test experiment, wherein each algorithm test experiment is based on the same experiment population; generating recommended contents pushed to each target user according to a recommendation algorithm corresponding to the target user; and collecting data of feedback operation of each target user aiming at the recommended content in the experimental population, and evaluating the algorithm quality of each recommendation algorithm.

Compared with the prior art, the method has the following advantages:

according to the method for testing the recommendation algorithm, a plurality of algorithm testing experiments which are orthogonal to each other are constructed to compare different recommendation algorithms, and the plurality of different recommendation algorithms can be tested in the same time period, so that the testing experiments have concurrency, and the testing efficiency is guaranteed; in addition, the groups of people participating in the experiment are respectively and evenly distributed to each recommendation algorithm of the algorithm test experiment in a random mode in different algorithm test experiments, so that the experimental results of the different algorithm test experiments cannot be influenced by some correlation among experimental objects in the different algorithm test experiments, and the accuracy of each algorithm test experiment is ensured.

Drawings

FIG. 1 is a schematic structural diagram of an advertisement recommendation system according to an embodiment of the present application scenario;

FIG. 2 is a diagram illustrating interaction between a target user and an advertisement recommendation system according to an embodiment of the present application scenario;

FIG. 3 is a flowchart of a method for testing a recommendation algorithm provided in a first embodiment of the present application;

fig. 4 is a structural diagram of a testing apparatus for a recommendation algorithm according to a third embodiment of the present application;

fig. 5 is a schematic structural diagram of an electronic device according to a third embodiment of the present application.

Detailed Description

In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application. This application is capable of embodiments in many different forms than those described herein, and it will be appreciated by those skilled in the art that similar language may be used without departing from the spirit and scope of the application, and therefore the application is not limited to the specific embodiments disclosed below.

The embodiment of the application provides a method for testing a recommendation algorithm and an advertisement recommendation system, which will be described in detail in the following embodiments one by one.

In order to facilitate understanding of the recommendation algorithm testing method provided by the present application, the present application first introduces the advertisement recommendation system in combination with a specific use scenario. It should be noted that although the specific usage scenario of the present application focuses on advertisement recommendation, the technical solution can also be applied to recommendation of various other contents. The following describes embodiments of the present application mainly in terms of advertisement recommendation.

Referring to fig. 1, fig. 1 is a schematic structural diagram of an advertisement recommendation system according to an embodiment of the present application scenario.

Fig. 1 includes: advertisement main module 101, flow distribution module 102, advertisement recommendation module 103.

The advertisement main module 101 is configured to construct at least two algorithm test experiments that are orthogonal to each other based on recommendation algorithms to be tested, where each algorithm test experiment includes at least two recommendation algorithms to be compared with each other.

Specifically, the algorithm test experiment specifically refers to an AB test experiment set for the recommended algorithm. The recommendation algorithm to be tested comprises: the method comprises a recommendation algorithm 1, a recommendation algorithm 2, a recommendation algorithm 3 and a recommendation algorithm 4, wherein the algorithm test experiment 1 and the algorithm test experiment 2 can be constructed according to the algorithms, the algorithm test experiment 1 comprises an A1 group and a B1 group, the A1 group uses the recommendation algorithm 1, the B1 group uses the recommendation algorithm 2, the recommendation algorithm 1 and the recommendation algorithm 2 are associated algorithms which can be compared with each other, in general, one algorithm can be a reference algorithm, and the other algorithm is an algorithm which actually needs to be tested; similarly, the algorithm test experiment 2 includes a group a2 and a group B2, wherein the group a2 uses the recommendation algorithm 3, and the group B2 uses the recommendation algorithm 4; the recommendation algorithm 3 and the recommendation algorithm 4 are correlation algorithms that can be compared with each other, and in general, one of the algorithms may be a reference algorithm and the other algorithm may be an algorithm that actually needs to be tested. The reference algorithm may be an algorithm whose effect has been known as a reference for comparison of the algorithms to be tested, and in the present embodiment, it is assumed that the B1 group and the B2 group are reference algorithms of the algorithm test experiment 1 and the algorithm test experiment 2, respectively.

Further, the advertisement main module further comprises: budget allocation module 1011.

The budget allocation module 1011 is configured to obtain a promotion budget set for a recommended advertisement, and evenly allocate the promotion budget to each recommendation algorithm of each algorithm test experiment, where each algorithm test experiment corresponds to a full amount of the promotion budget.

For example, assuming that the promotion budget that an advertiser intends to promote for this advertisement is 1000 yuan, the a1 and B1 groups in experiment 1 were tested by the algorithm to be able to allocate 500 yuan (50%) of budget, respectively. Also, the a1 and B1 groups in simultaneous algorithmic test experiment 2 each also allocated a budget of 500 dollars (50%). Budgets in the two groups of algorithm test experiments are calculated independently, and the budgets are consumed for promoting 1000 yuan.

The flow distribution module 102 is configured to, in each algorithm test experiment, evenly distribute target users participating in the experiment population to each recommended algorithm of the algorithm test experiment, where each algorithm test experiment is based on the same experiment population.

For example, assuming that the experimental population involved in the experiment totals 1000 persons, for the algorithm test experiment 1, these 1000 persons are randomly assigned to the a1 group and the B1 group of the algorithm test experiment 1; likewise, for algorithmic test experiment 2, these 1000 persons were randomly assigned to groups a2 and B2 of algorithmic test experiment 2.

Further, the target user 1 in the experimental population carries the algorithm label corresponding to the algorithm assigned thereto, for example, if the target user 1 is assigned to the group B1 in the algorithm test experiment 1, and is assigned to the group a2 in the algorithm test experiment 2, the algorithm label of the target user 1 is B1a 2. The algorithm label has the effect that once each target user is extracted after the algorithm label is marked on each target user, the recommendation algorithm adopted by the target user can be known through the algorithm label. For example, for the target user 1, recommendation algorithm 2 corresponding to B1 and recommendation algorithm 3 corresponding to a2 are adopted; because the recommendation algorithm 2 and the recommendation algorithm 3 are algorithms in two algorithm experiments, and are orthogonal to each other, i.e. they do not affect each other, the target user 1 can recommend advertisements by using two recommendation algorithms at the same time. For example, the advertisement recommended by both recommendation algorithms may be provided to the target user 1, or recommendation values may be calculated according to the same weight for the recommendation degrees of both recommendation algorithms, and recommended to the user according to the recommendation values.

And the advertisement recommending module 103 is configured to generate recommended advertisements pushed to the target users according to the recommending algorithm corresponding to the target users.

Further, the advertisement recommendation module 103 is specifically configured to determine, according to the algorithm tag corresponding to the target user, a recommendation algorithm corresponding to the algorithm tag; and recommending the advertisement to the target user according to the recommendation algorithm.

For example, for directly recommended advertisements, the recommendation algorithm may determine advertisements that meet the user preferences according to the preference information of the target user, and for example, for advertisements recommended based on search terms, the recommendation algorithm may recommend advertisements to the target user according to keywords output by the target user in a search engine.

Further, the advertisement recommendation module 103 includes: an algorithm access layer, a recommended content recall layer, a recommended content sequencing layer and the like.

The algorithm access layer is used for importing a corresponding recommendation algorithm in the advertisement recommendation module according to the algorithm label of the target user; the recommended content recalling layer is used for obtaining recommended content to be recommended according to the recommendation algorithm; and the recommended content sequencing layer is used for sequencing the recommended content.

In an embodiment of the present application scenario, in order to verify the quality of the recommendation algorithm, the advertisement recommendation system further includes: and the algorithm evaluation module 104 is used for collecting feedback operation data of each target user in the experimental population aiming at the recommended content and evaluating the algorithm quality of each recommendation algorithm.

Specifically, the evaluation of the algorithm quality of each recommendation algorithm is specifically obtained according to the residual budget of the test algorithm corresponding to each recommendation algorithm test experiment in the budget allocation module 1011.

In a specific and applicable process, if the target user 1 with the algorithm tag B1a2 obtains the advertisement recommended by the recommendation algorithm corresponding to the algorithm tag B1a2 and the target user 1 browses or clicks the recommended advertisement, the budget amount corresponding to the B1 group of primary advertisement of algorithm test experiment 1 and the budget amount corresponding to the a2 group of primary advertisement of algorithm test experiment 2 may be deducted respectively.

Further, in order to facilitate understanding of the interaction relationship between the target user and the advertisement recommendation system in the scenario described in the foregoing scenario embodiment, an interaction process between the target user and the advertisement recommendation system is described below with reference to fig. 2.

Referring to fig. 2, fig. 2 is a diagram of an interaction relationship between a target user and an advertisement recommendation system according to an embodiment of the present application scenario.

As shown in FIG. 2, the target user performs steps S201 to S205 in the process of interacting with the advertisement recommendation system, which performs steps S206 to S208.

First, before the interactive flow starts, a test experiment in the advertisement recommendation system is first created by the following steps.

Step S206, according to a promotion plan preset by an advertiser, setting a promotion budget and a recommendation algorithm to be tested required by the current advertisement promotion;

step S207, constructing at least two algorithm test experiments which are in an orthogonal relation with each other based on the recommendation algorithms to be tested, wherein each algorithm test experiment comprises at least two recommendation algorithms which need to be compared with each other;

step S208, based on the obtained promotion budget set for the recommended content, averagely distributing the promotion budget to each recommendation algorithm of each algorithm test experiment in a random mode, wherein each algorithm test experiment corresponds to the total promotion budget.

After the advertisement recommendation system is established, the terminal equipment of the target user and the advertisement recommendation system sequentially execute the following steps:

step S201, the target user sends first request information for obtaining an algorithm label corresponding to the target user to an advertisement recommendation system through terminal equipment;

step S202, the advertisement recommendation system returns an algorithm label corresponding to the target user to the terminal equipment of the target user in response to the first request information;

step S203, the target user sends second request information of advertisement recommendation carrying the algorithm label to the advertisement recommendation system through terminal equipment;

step S204, the advertisement recommending system analyzes the second request information to obtain the algorithm label, and sends a corresponding recommended advertisement to the terminal equipment of the target user according to the algorithm label;

step S205, the terminal device of the target user receives and displays the recommended advertisement, and sends fee deduction request information carrying the algorithm label to the advertisement recommendation system after the target user selects to click the recommended advertisement.

And S206, the advertisement recommendation system analyzes the fee deduction request information to obtain the algorithm labels, and determines the testing algorithms to be deducted in each algorithm testing experiment according to the algorithm labels to finish fee deduction operation.

At the end of the experiment, the staff judges the quality of the testing algorithms corresponding to different groups by comparing the advertisement budgets remaining in the different groups of the experiment through algorithm testing, for example, for algorithm testing experiment 1, if the remaining advertisement budget of group a1 is lower than the remaining advertisement budget of group B1, it indicates that the algorithm quality of algorithm 1 corresponding to group a1 is higher than that of algorithm 2 corresponding to group B1; for Algorithm test experiment 2, if the remaining advertising budget for group A2 is higher than the remaining advertising budget for group B2, it indicates that the algorithm quality of Algorithm 3 for group A2 is lower than that of Algorithm 4 for group B2. In addition, through budget allocation, after the budget of a certain party is deducted, the test can be quitted, and the test is finished; thus, the conditions for the end test were obtained.

It should be noted that the above description of the scenario embodiment of the present application is not used to limit the application scenario of the recommended algorithm test method provided by the present application, for example, in other scenario embodiments, the construction of the algorithm test experiment is not limited to that two algorithm test experiments are provided by the present application, and for the case that more groups of test algorithms need to be tested simultaneously, the number of the algorithm test experiments may be determined according to the number of the recommended algorithms; for another example, the method for testing a recommendation algorithm provided by the present application may also be used to test a recommendation algorithm in a search engine, and provide corresponding content recommendation for a user while testing the algorithm, which is not limited in the present application.

A first embodiment of the present application provides a method for testing a recommendation algorithm, please refer to fig. 3, and fig. 3 is a flowchart of the method for testing the recommendation algorithm provided in the first embodiment of the present application. The method for testing the recommendation algorithm includes the following steps S301 to S304. The execution subject of this embodiment is an advertisement recommendation system.

Step S301, constructing at least two algorithm test experiments which are in an orthogonal relation with each other based on the recommendation algorithms to be tested, wherein each algorithm test experiment comprises at least two recommendation algorithms which need to be compared with each other.

The test algorithm to be recommended may be understood as an algorithm for recommending content for a user in a computer system, and in a specific application configuration, the form of the recommendation algorithm may be various, for example: the recommendation algorithm may be a program deployed in a computer system for recommending content to a target user, or may be a neural network or the like that is obtained through machine learning and is capable of outputting recommended content according to a search keyword input by the target user in a search engine or a browsing record of the target user. The present application is not limited thereto.

In an implementation scenario of the present application, the algorithm test experiment is specifically used for recommending advertisements to a user and sorting recommended advertisements; of course, algorithmic testing experiments may also be used to make recommendations for other content.

In the first embodiment of the present application, the algorithm test experiments include multiple sets of algorithm test experiments performed simultaneously, each algorithm test experiment has the same test period, specifically, the test period may be in units of days/weeks, and the present application is not limited thereto. Furthermore, the different algorithm test experiments are orthogonal to each other, that is, the algorithms tested by the different algorithm test experiments do not interfere with each other, that is, one algorithm test experiment affects the test effect of the other algorithm test experiment. When constructing an algorithm test experiment of an orthogonal relation, the principle of the tested algorithm needs to be deeply understood, and the test with mutual influence of results in the simultaneously-performed algorithm test experiment is avoided, so that the distortion of the test result is avoided.

In the first embodiment of the present application, the algorithm test specifically refers to an AB test set for the recommended algorithm. The AB test experiment is to make two (A/B) or a plurality of (A/B/n) different versions for Web (World Wide Web) or APP (Application, mobile phone software) interfaces or processes, make the access groups with the same (or similar) composition randomly access the versions in the same time dimension, then collect the user experience data and business data of the access users of each version, and finally obtain the best version for formal adoption through analysis and evaluation.

For example, the algorithmic test experiment may include: an algorithm test experiment 1, an algorithm test experiment 2 and an algorithm test experiment 3.

The algorithm test experiment 1 comprises the following steps: group a1 and group B1, where group a1 was used for test algorithm 1 and group a2 was used for test algorithm 2;

algorithm test experiment 2 included: group a2 and group B2, where group a2 was used for test algorithm 3 and group a2 was used for test algorithm 4;

algorithm test experiment 3 included: group A3 and group B3, with group A3 being used for test algorithm 5 and group A3 being used for test algorithm 6.

In the construction of practical application, the construction of the algorithm test experiment is related to a specific application scenario, for example: in an optional embodiment of the present application, the algorithm test experiment is mainly used for selecting a suitable recommendation algorithm to recommend advertisements to advertisers.

The advertiser refers to a legal person, an economic organization or an individual who designs, makes or issues advertisements for a promoter or provides services, or entrusts others, and is an initiator of an advertisement campaign, and the method comprises the following steps: in merchants selling or advertising their products and services.

Specifically, if an advanced test needs to be performed for promotion of a certain advertiser so as to select the most appropriate algorithm for the advertiser, the above algorithm test experiment designed for the advertiser can be obtained by the following method:

firstly, obtaining a preset popularization plan;

secondly, selecting a candidate recommendation algorithm according to the promotion plan;

and finally, constructing corresponding algorithm test experiments according to the selected candidate recommendation algorithms corresponding to the promotion plans and the constraint conditions provided by the promotion plans, wherein each promotion plan corresponds to one or more algorithm test experiments.

Wherein the preset promotion plan may include: user group information for publicizing products in the advertisement, advertisement putting strategy information, advertisement publicity form information, advertisement total budget information and other information closely related to advertisement publicity. Because the content of the preset promotion plan is not the key point of the protection requested by the application, the content of the promotion plan is not limited in the embodiment of the application.

Further, the candidate recommendation algorithm is selected according to the promotion plan, and the candidate recommendation algorithm can be obtained according to a candidate algorithm selection model with promotion plan information as input and recommendation algorithm information as output. In a specific and applied process, the candidate algorithm selection model may be obtained by Machine Learning (ML) training, and specifically, the Learning sample of the Machine Learning may be obtained by using specific content of the promotion plan and a suitable recommendation algorithm corresponding to the promotion plan. Machine learning (a multi-field cross subject, which relates to multi-subjects such as probability theory, statistics, approximation theory, convex analysis, algorithm complexity theory and the like) is specially used for researching how a computer simulates or realizes the learning behavior of human beings so as to obtain new knowledge or skills and reorganize the existing knowledge structure to continuously improve the performance of the existing knowledge structure. Machine learning generally includes techniques such as artificial neural networks, belief networks, reinforcement learning, transfer learning, inductive learning, and the like. And feature machine learning belongs to a branch of Artificial Intelligence (AI) technology.

It should be noted that the above manner of obtaining the candidate recommendation algorithm is only an optional implementation manner of the present application, and in other implementation manners of the present application, the candidate recommendation algorithm may also be obtained in other manners, for example, a suitable candidate recommendation algorithm may be selected according to the characteristics of the promotion plan. The present application is not limited thereto.

In addition, for algorithmic testing in the context of recommending advertisements, the promotion plan typically includes the promotion budget for the advertisement, i.e., the advertiser's investment in its advertising campaign. In a specific application process, if an advertisement is recommended to a user and the user browses the advertisement by clicking the advertisement or the like, the advertisement is successfully advertised once, and advertisement promotion cost required by the advertisement once needs to be deducted from the total promotion budget. Further, for the specific application of the popularization budget in the first embodiment of the present application, reference may be made to the description of step S203, which is not described herein again.

Further, after an algorithm test experiment with a recommended algorithm as a main body is constructed, corresponding constraint conditions need to be added to the algorithm test experiment, and in the embodiment of the present application, the constraint conditions include: experimental population, popularization budget and algorithm testing time.

The above constraints are described below in connection with a specific testing procedure for the algorithm.

In the first embodiment of the present application, the method in step S302 is adopted to allocate target users in the experimental population, that is, the number of testing people for different testing algorithms in the algorithm experiment is limited by means of step S302.

Step S302, in each algorithm test experiment, target users participating in the experiment crowd are evenly distributed to each recommended algorithm of the algorithm test experiment, wherein each algorithm test experiment is based on the same experiment crowd;

for example, the algorithm test experiment 1 includes: group a1 and group B1, where group a1 was used for test algorithm 1 and group B1 was used for test algorithm 2; test experiment 2 included: group a2 and group B2, where group a2 was used for test algorithm 3 and group a2 was used for test algorithm 4.

Then, for the algorithmic test experiment 1, if the experimental population totals 100, then the 100 people will be randomly divided into 2 groups, assigned to groups a1 and B1, respectively. For algorithmic test experiment 2, the 100 persons were re-randomized and assigned to the a2 and B2 groups, respectively. (that is, the algorithm test experiment 1 and the algorithm test experiment 2 are two parallel experiments, wherein the algorithm test experiment 1 is divided into two groups to test the algorithm 1 and the algorithm 2 by using half of the total number of the experimental population, respectively, and the algorithm test experiment 2 is also divided into two groups to test the algorithm 3 and the algorithm 4 by using half of the total number of the experimental population, respectively).

For the promotion budget: in embodiments of the present application, the promotion plan typically includes a promotion budget for the advertisement, i.e., the advertiser's placement of its ad campaigns. For example, for a scenario of advertisement recommendation, if an advertisement is recommended to a user and the user browses the advertisement by clicking the advertisement, it means that the advertisement is successfully advertised once, and the advertisement promotion cost required for one advertisement needs to be deducted from the total promotion budget.

In the specific application process, the method similar to the above step S202 is adopted to limit the advertisement budgets corresponding to different test algorithms in different algorithm test experiments in the algorithm test experiments.

Specifically, the allocation of the promotion budget includes the following steps S1 and S2.

Step S1, obtaining a promotion budget set for recommended content, and averagely distributing the promotion budget to each recommendation algorithm of each algorithm test experiment, wherein each algorithm test experiment corresponds to the total promotion budget;

for example, the algorithm test experiment 1 includes: group a1 and group B1, where group a1 was used for test algorithm 1 and group a2 was used for test algorithm 2; algorithm test experiment 2 included: group a2 and group B2, with group a2 being used for test algorithm 3 and group a2 being used for test algorithm 4.

Then, if the promotion budget is 1000 yuan, the total budget allocated to the algorithm test experiment 1 is 1000 yuan, and the total budget allocated to the algorithm test experiment 2 is also 1000 yuan;

the popularization budget allocated to the A1 group experiment of the algorithm test experiment 1 is 500 yuan, and the popularization budget allocated to the B1 group experiment is also 500 yuan;

the popularization budget allocated to the a2 group of experiments of the algorithm test experiment 2 is 500 yuan, and the popularization budget allocated to the B2 group of experiments is also 500 yuan (that is, the algorithm test experiment 1 and the algorithm test experiment 2 are two experiments performed in parallel, where the algorithm test experiment 1 is used for testing the algorithm 1 and the algorithm 2, and the algorithm test experiment 2 is used for testing the algorithm 3 and the algorithm 4).

And step S2, corresponding to the feedback operation of the target user on the recommended content, carrying out corresponding deduction operation in the promotion cost of the recommendation algorithm corresponding to the target user.

Specifically, for the related description of the step S2, reference may be made to the following description of the step S203, and details are not repeated here.

Further, in order to accurately determine the algorithm corresponding to the target user, the step S202 further includes allocating an algorithm label to the target user according to the recommended algorithm allocated to the target user.

Specifically, the algorithm label of the target user is mainly used for representing an algorithm adopted when providing recommended content for the target user. For example, still taking the above algorithm test experiment 1 and algorithm test experiment 2 as an example, assuming that, in the process of performing target user allocation for the algorithm test experiment 1, the target user 1 is allocated to the test algorithm 2 in the algorithm test experiment 1 (i.e., allocated to the group B1 in the algorithm test experiment 1), and, in the process of performing target user allocation for the non-algorithm test experiment 2, the target user 1 is allocated to the test algorithm 3 in the algorithm test experiment 2 (i.e., allocated to the group a2 in the algorithm test experiment 2), then the algorithm label corresponding to the target user 1 is B1a 2.

Step S303, generating recommendation contents pushed to each target user according to the recommendation algorithm corresponding to the target user.

Specifically, the above step S303 includes the following steps S3 to S5.

Step S3, obtaining the algorithm label of the target user;

step S4, classifying the target user into a target crowd of a corresponding recommendation algorithm according to the algorithm label of the target user; for a target user, the target user can belong to more than one recommended algorithm target crowd of different algorithm test experiments;

and step S5, each advertisement recommendation algorithm pushes the recommendation content in the target crowd belonging to the recommendation algorithm.

For example, assuming that the algorithm tag of the target user 1 is B1a2, the target user 1 should be classified into group B1 in algorithm test experiment 1 and group a2 in algorithm test experiment 2.

In an alternative embodiment of the present application, the step S203 further includes assigning each target user in the experiment population in a time dimension, so as to make the algorithmic test experiment closer to the real situation.

Specifically, the step S303 further includes the following steps S6 to S8.

Step S6, determining a preset time period of an algorithm test experiment;

the time period of the algorithm test experiment, that is, the test period preset for each algorithm test experiment, specifically, the preset time period may be in units of days/weeks, which is not limited in the present application.

Step S7, randomly distributing each target user in the participating experimental population to each time point of the preset time period;

for example, assuming that the experimental population includes 100 target users and the preset time period is 1 day, the 100 target users may be randomly allocated to each time point of the preset time period.

And step S8, when the test time reaches the time point corresponding to the target user, generating the recommended content pushed to the target user according to the recommendation algorithm corresponding to the target user.

Step S304, collecting data of feedback operation of each target user aiming at the recommended content in the experimental population, and evaluating the algorithm quality of each recommendation algorithm.

In an alternative embodiment of the present application, the feedback operation includes an effective operation and an ineffective operation, and still taking the above algorithm test experiment 1 and algorithm test experiment 2 as an example, it is assumed that the algorithm label B1a2 of the target user 1 is provided, and the recommended content recommended to the target user 1 according to the algorithm label includes advertisement 1 and advertisement 2. If the target user clicks any one of the advertisement 1 and the advertisement 2, the clicking operation is an effective operation, and if the target user selects to close the advertisement or perform other operations without clicking the advertisement 1 or the advertisement 2, the operation is an ineffective operation.

It should be noted that, the introduction of the feedback operation is limited to a scene of recommending an advertisement, and for other scenes, the definition of the effective operation and the ineffective operation of the feedback operation may be set according to actual situations, for example, if the recommendation algorithm is specifically used for recommending content related to a keyword to a target user according to the keyword input by the target user in a search engine; if the user selects to click the recommended content, after the user enters an introduction interface of the recommended content, and when the stay time length of the introduction interface is greater than or equal to a preset time threshold value, the click operation of the target user is considered as effective operation; and if the target user selects to click the recommended content and enters an introduction interface of the recommended content, the staying time length of the target user on the introduction interface is smaller than a preset time threshold, or the target user does not click the recommended content, the operation of the target user is considered as invalid operation.

Further, corresponding to the effective operation in the feedback operation of the target user on the recommended content, corresponding deduction operation is performed in the promotion cost of the recommendation algorithm corresponding to the target user.

Specifically, the following still explains the operation of deducting the promotion cost in the advertisement scene by taking the above algorithm test experiment 1 and algorithm test experiment 2 as examples.

Algorithm test experiment 1 included: group a1 and group B1, where group a1 was used for test algorithm 1 and group a2 was used for test algorithm 2; algorithm test experiment 2 included: group a2 and group B2, where group a2 was used for test algorithm 3 and group a2 was used for test algorithm 4, while group a1 and group B1 of test algorithm 1, and group a2 and group B2 of test algorithm were each assigned a budget of 500 dollars, the total number of experimental population being 500 people, where group a1 of test algorithm 1 was randomly assigned 250 people and group B1 was randomly assigned 250 people, respectively, and group a2 and group B2 of test algorithm 2 were randomly assigned 250 people, respectively.

Assuming that the algorithm label of the target user 1 in the experimental population is B1a2, the test algorithms corresponding to the algorithm label B1a2 are test algorithm 2 and test algorithm 3.

Further, the advertisement 1 and the advertisement 2 are recommended to the target user 1 according to the test algorithm 2 and the test algorithm 3, and the recommended advertisements are displayed in the terminal device of the target user, at this time, if the target user 1 performs a click operation on the advertisement 1 and the advertisement 2, it indicates that 1 successful advertisement recommendation is performed to the target user 1 according to the test algorithm 2 and the test algorithm 3.

Further, after a successful advertisement recommendation, the advertisement budgets in group B1 of algorithm test experiment 1 and group a2 of algorithm test experiment 2 are deducted according to algorithm label B1a 2. For example, if the cost of successful one-time consumption of the advertisement recommendation is 1 yuan, the budgets of 1 yuan of the B1 group and the A2 group are deducted respectively.

For each algorithm test experiment, the number of people and budget allocated to the recommendation algorithms which are compared with each other are the same, so that the different recommendation algorithms are compared in the same way, and different recommendation algorithm qualities can be determined only by the residual budget amounts of the different recommendation algorithms after the algorithm test experiment is finished.

For example, after the algorithm test experiment is finished, if 100-element budgets remain in the group a1 and 50-element budgets remain in the group B1 corresponding to the algorithm test experiment 1, it may be determined that the recommendation quality of the recommendation algorithm 2 in the algorithm test experiment 1 is higher than that of the recommendation algorithm 1; similarly, if the algorithm tests the budget of the remaining 25 elements of the group a2 and the budget of the remaining 35 elements of the group B2 corresponding to experiment 2, it can be determined that the recommendation quality of the recommendation algorithm 3 in the algorithm test experiment 2 is higher than that of the recommendation algorithm 4.

Based on the description of the first embodiment of the present application, it can be seen that the method for testing a recommendation algorithm provided by the present application compares different recommendation algorithms by constructing a plurality of algorithm test experiments in an orthogonal relationship with each other, so as to ensure the high efficiency of testing a plurality of different recommendation algorithms; in addition, the groups of people participating in the experiment are averagely distributed to the recommended algorithms of the algorithm test experiment, so that the problem that the test algorithms which are mutually compared in the algorithm test experiment compete for experimental objects is solved, and the test accuracy is greatly improved.

A third embodiment of the present application provides a testing apparatus for a recommendation algorithm, please refer to fig. 4, where fig. 4 is a structural diagram of the testing apparatus for the recommendation algorithm provided in the second embodiment of the present application.

The device for testing the recommendation algorithm comprises:

the experiment construction unit 401 is configured to construct at least two algorithm test experiments in an orthogonal relationship with each other based on the recommended algorithms to be tested, where each algorithm test experiment includes at least two recommended algorithms to be compared with each other;

the user allocation unit 402 is configured to, in each algorithm test experiment, evenly allocate target users participating in the experiment population to respective recommended algorithms of the algorithm test experiment in a random manner, where each algorithm test experiment is based on the same experiment population;

a content recommending unit 403, configured to generate recommended content pushed to each target user according to a recommendation algorithm corresponding to the target user;

and an algorithm evaluation unit 404, configured to collect data of feedback operations of the target users in the experimental population for the recommended content, and evaluate the algorithm quality of each recommendation algorithm.

Optionally, in the step of averagely allocating the target users participating in the experiment population to each recommendation algorithm of the algorithm test experiment, algorithm labels are allocated to the target users according to the recommendation algorithm allocated to the target users;

obtaining an algorithm label of the target user;

Optionally, the apparatus is further configured to:

obtaining a preset promotion plan;

selecting a candidate recommendation algorithm according to the promotion plan;

constructing a corresponding algorithm test experiment according to the selected candidate recommendation algorithm corresponding to the promotion plan and the constraint conditions provided by the promotion plan; the algorithm test experiment can be one or more corresponding to the promotion plan.

determining a preset time period of an algorithm test experiment;

randomly distributing each target user in the participating experiment population to each time point of the preset time period;

and when the test time reaches the time point corresponding to the target user, generating recommendation content pushed to the target user according to the recommendation algorithm corresponding to the target user.

Referring to fig. 5, fig. 5 is a schematic structural diagram of an electronic device according to a third embodiment of the present application.

The electronic device includes: a processor 501;

the memory 502 is used for storing a program of the method, which is read and executed by the processor 501 to perform the method according to the first embodiment of the present application.

Another embodiment of the present application provides a computer storage medium storing a computer program that, when executed, performs the method of the first embodiment of the present application.

It should be noted that, for detailed descriptions of devices and computer storage media provided in the embodiments of the present application, reference may be made to the relevant description of the first embodiment provided in the present application, and details are not described here again.

Although the present application has been described with reference to the preferred embodiments, it is not intended to limit the present application, and those skilled in the art can make variations and modifications without departing from the spirit and scope of the present application, therefore, the scope of the present application should be determined by the claims that follow.

In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.

The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.

1. Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer readable media does not include non-transitory computer readable media (transient media), such as modulated data signals and carrier waves.

2. It will be apparent to those skilled in the art that embodiments of the present application may be provided as a system or an electronic device. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

Claims

1. a test method of a recommendation algorithm, is characterized in that, comprises:

Based on the recommendation algorithm to be tested, construct at least two algorithm test experiments that are orthogonal to each other, wherein each algorithm test experiment includes at least two recommended algorithms to be compared with each other;

In each algorithm testing experiment, the target users in the experimental population are equally distributed to each recommendation algorithm in the algorithm testing experiment in a random manner, wherein each algorithm testing experiment is based on the same experimental population;

generating recommended content to be pushed to each of the target users according to the recommendation algorithm corresponding to the target user;

Collect the data of the feedback operations of each target user in the experimental population on the recommended content, and evaluate the algorithm quality of each recommendation algorithm.

2. The method according to claim 1, wherein the target users in the experimental crowd are equally distributed to the steps in each recommendation algorithm of the algorithm test experiment, according to the recommended algorithm allocated by the target user. , assigning an algorithm label to the target user;

The step of generating recommended content to be pushed to each of the target users according to the recommendation algorithm corresponding to the target user includes:

obtain the algorithm tag of the target user;

According to the algorithm label of the target user, the target user is classified into the target group of the corresponding recommendation algorithm; for a target user, it can be classified into the target group of more than one recommendation algorithm in different algorithm test experiments;

Each advertisement recommendation algorithm pushes the recommended content among the target groups belonging to the recommendation algorithm.

3. The method according to claim 1, wherein the method further comprises:

obtaining the promotion budget set for the recommended content, and evenly distributing the promotion budget to each recommendation algorithm in each of the algorithm test experiments, wherein each algorithm test experiment corresponds to the full amount of the promotion budget;

Corresponding to the feedback operation of the target user on the recommended content, a corresponding deduction operation is performed in the promotion fee of the recommendation algorithm corresponding to the target user.

4. The method of claim 1, comprising:

Obtain a preset promotion plan;

According to the promotion plan, select the recommendation algorithm to be tested;

According to the selected recommendation algorithm to be tested corresponding to the promotion plan, and the constraints provided by the promotion plan, construct a corresponding algorithm test experiment; corresponding to the promotion plan, the algorithm test experiment may be a or more.

5 . The method according to claim 1 , wherein the collecting feedback operations of each target user in the experimental population on the recommended content comprises acquiring a recommendation algorithm corresponding to the target user. 6 .

6 . The method according to claim 1 , wherein each algorithm testing experiment includes a benchmark recommendation algorithm. 7 .

7. The method according to claim 1, wherein the generating the recommended content to be pushed to each of the target users according to the recommendation algorithm corresponding to the target user, comprises:

Determine the preset time period for the algorithm testing experiment;

Randomly assigning each target user in the experimental population to the source of each time point in the preset time period;

When the test time reaches the time point corresponding to the target user, the recommended content to be pushed to the target user is generated according to the recommendation algorithm corresponding to the target user.

8. An advertisement recommendation system, comprising:

The advertiser module is used to construct at least two algorithm test experiments that are orthogonal to each other based on the recommendation algorithm to be tested, wherein each algorithm test experiment includes at least two recommendation algorithms that need to be compared with each other;

The traffic distribution module is used for evenly distributing the target users in the experimental population to each recommendation algorithm in the algorithm testing experiment in each algorithm testing experiment, wherein each algorithm testing experiment is based on the same experimental population;

An advertisement recommendation module, configured to generate recommended advertisements pushed to each of the target users according to the recommendation algorithm corresponding to the target users.

9. A test device for a recommended algorithm, comprising:

An experiment construction unit, used for constructing at least two algorithm test experiments that are orthogonal to each other based on the recommendation algorithm to be tested, wherein each algorithm test experiment includes at least two recommended algorithms that need to be compared with each other;

A user allocation unit, configured to equally distribute target users in the experimental population to each recommendation algorithm in the algorithm testing experiment in each algorithm testing experiment, wherein each algorithm testing experiment is based on the same experimental population;

a content recommendation unit, configured to generate recommended content to be pushed to each of the target users according to a recommendation algorithm corresponding to the target user;

The algorithm evaluation unit is configured to collect data of feedback operations of each target user in the experimental population on the recommended content, and evaluate the algorithm quality of each recommendation algorithm.

10. An electronic device, comprising:

processor;

The memory is used to store the program of the method, and the program executes the following steps when the program is read and executed by the processor: based on the recommended algorithm to be tested, construct at least two algorithm test experiments that are orthogonal to each other, wherein each The algorithm test experiment includes at least two recommendation algorithms that need to be compared with each other; in each algorithm test experiment, the target users in the experimental population are equally distributed to each recommendation algorithm in the algorithm test experiment, wherein each algorithm The test experiment is based on the same experimental crowd; according to the recommendation algorithm corresponding to the target user, the recommended content pushed to each of the target users is generated; the data of the feedback operation of each target user in the experimental crowd for the recommended content is collected, The algorithm quality of each recommendation algorithm is evaluated.

11. A computer storage medium, wherein the computer storage medium stores a computer program, and when the program is executed, the following steps are performed:

Based on the recommendation algorithm to be tested, construct at least two algorithm test experiments that are orthogonal to each other, wherein each algorithm test experiment includes at least two recommended algorithms to be compared with each other; in each algorithm test experiment, The target users in the experimental population are equally distributed to each recommendation algorithm of the algorithm testing experiment, wherein each algorithm testing experiment is based on the same experimental population; Recommended content pushed by target users; collect data on feedback operations of each target user in the experimental population on the recommended content, and evaluate the algorithm quality of each recommendation algorithm.