CN108256881A - A kind of traffic filtering method and apparatus - Google Patents
A kind of traffic filtering method and apparatus Download PDFInfo
- Publication number
- CN108256881A CN108256881A CN201611239365.5A CN201611239365A CN108256881A CN 108256881 A CN108256881 A CN 108256881A CN 201611239365 A CN201611239365 A CN 201611239365A CN 108256881 A CN108256881 A CN 108256881A
- Authority
- CN
- China
- Prior art keywords
- traffic
- flow
- value data
- calculation model
- threshold
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000001914 filtration Methods 0.000 title claims abstract description 92
- 238000000034 method Methods 0.000 title claims abstract description 53
- 238000004364 calculation method Methods 0.000 claims abstract description 102
- 238000012549 training Methods 0.000 claims abstract description 63
- 238000005070 sampling Methods 0.000 claims description 57
- 230000001419 dependent effect Effects 0.000 claims description 16
- 230000005540 biological transmission Effects 0.000 claims 1
- 239000002699 waste material Substances 0.000 abstract description 11
- 230000009286 beneficial effect Effects 0.000 abstract description 2
- 230000007246 mechanism Effects 0.000 abstract description 2
- 238000012544 monitoring process Methods 0.000 description 22
- 230000000694 effects Effects 0.000 description 19
- 230000008569 process Effects 0.000 description 17
- 235000014510 cooky Nutrition 0.000 description 11
- 238000010586 diagram Methods 0.000 description 11
- 230000004044 response Effects 0.000 description 11
- 238000013507 mapping Methods 0.000 description 9
- 238000012937 correction Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 5
- 230000009471 action Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000001186 cumulative effect Effects 0.000 description 4
- 235000019800 disodium phosphate Nutrition 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000012790 confirmation Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000003542 behavioural effect Effects 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 230000035479 physiological effects, processes and functions Effects 0.000 description 2
- 230000002411 adverse Effects 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 238000013524 data verification Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000007667 floating Methods 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000004801 process automation Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0277—Online advertisement
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- Strategic Management (AREA)
- Finance (AREA)
- Game Theory and Decision Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Economics (AREA)
- Marketing (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明公开了一种流量过滤方法和装置,涉及计算机技术领域。所述方法,包括:接收广告交易服务器发送的各流量;从每个流量中提取流量特征;利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;判断第一价值数据是否小于第一阈值;当第一价值数据小于第一阈值,则将流量进行过滤。解决了现有的流量分发机制导致DSP的系统资源浪费,降低DSP的性价比等问题,取得了在将流量发送至各DSP前对流量过滤,减少DSP系统资源浪费,提高性价比的有益效果。
The invention discloses a flow filtering method and device, and relates to the technical field of computers. The method includes: receiving each traffic sent by the advertising exchange server; extracting traffic characteristics from each traffic; using a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation model according to each category The target value data of the traffic and the characteristics of each traffic in the historical traffic records and the corresponding actual value data and consumption income data are obtained through training; each traffic in the historical traffic records is the traffic that has received the notification of successful bidding from the advertising exchange server; the judgment of the first Whether the value data is less than the first threshold; when the first value data is less than the first threshold, the traffic is filtered. It solves the problems of waste of DSP system resources caused by the existing flow distribution mechanism and reduces the cost performance of DSP, and achieves the beneficial effects of filtering the traffic before sending the traffic to each DSP, reducing the waste of DSP system resources, and improving the cost performance.
Description
技术领域technical field
本发明涉及计算机技术领域,具体涉及一种流量过滤方法和装置。The invention relates to the field of computer technology, in particular to a flow filtering method and device.
背景技术Background technique
RTB(RealTime Bidding,实时竞价),是一种利用第三方技术在数以百万计的网站上针对每一个用户展示行为进行评估以及出价的竞价技术。与大量购买投放频次不同,实时竞价规避了无效的受众到达,针对有意义的用户进行购买。它的核心是DSP(Demand SitePlatform,需求方平台),RTB对于媒体来说,可以带来更多的广告销量、实现销售过程自动化及减低各项费用的支出。而且随着智能手机的快速普及和移动网络环境的日渐成熟,移动DSP市场将被进一步发掘,程序化购买将成为数字营销时代的大趋势。RTB (Real Time Bidding, real-time bidding) is a bidding technology that uses third-party technology to evaluate and bid for each user's display behavior on millions of websites. Different from the frequency of mass purchases, real-time bidding avoids invalid audience arrival and makes purchases for meaningful users. Its core is DSP (Demand Site Platform, demand-side platform). For the media, RTB can bring more advertising sales, realize sales process automation and reduce various expenses. Moreover, with the rapid popularization of smart phones and the maturity of the mobile network environment, the mobile DSP market will be further explored, and programmatic buying will become a major trend in the digital marketing era.
在互联网广告市场中,一个DSP可以实时监听到大量的流量,但是不是所有流量都能产生足够的收益,一般而言,一个DSP监听到的流量中只有10%-20%能产生足够的收益。而对于其中最终收益不好的流量,现有的DSP依然会执行全部的计算流程,从而会导致浪费DSP的系统资源,降低DSP的性价比。In the Internet advertising market, a DSP can monitor a large amount of traffic in real time, but not all traffic can generate enough revenue. Generally speaking, only 10%-20% of the traffic monitored by a DSP can generate sufficient revenue. However, for the flow of which the final income is not good, the existing DSP will still execute all the calculation processes, which will lead to a waste of DSP system resources and reduce the cost performance of the DSP.
发明内容Contents of the invention
鉴于上述问题,提出了本发明以便提供一种克服上述问题或者至少部分地解决上述问题的一种流量过滤方法和相应的一种流量过滤装置。In view of the above problems, the present invention is proposed to provide a traffic filtering method and a corresponding traffic filtering device that overcome the above problems or at least partially solve the above problems.
依据本发明的一个方面,提供了一种流量过滤方法,包括:According to one aspect of the present invention, a traffic filtering method is provided, including:
接收广告交易服务器发送的各流量;Receive all traffic sent by the advertising transaction server;
从每个流量中提取流量特征;Extract flow features from each flow;
利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录 中的各流量为接收到广告交易服务器竞价成功通知的流量;Using the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation model is based on the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records and the corresponding actual value data and Obtained through consumption revenue data training; each traffic in the historical traffic records is the traffic that received the successful bidding notification from the advertising exchange server;
判断所述第一价值数据是否小于第一阈值;judging whether the first value data is smaller than a first threshold;
当所述第一价值数据小于第一阈值,则将所述流量进行过滤。When the first value data is less than a first threshold, the traffic is filtered.
可选地,所述将所述流量进行过滤的步骤,包括:Optionally, the step of filtering the traffic includes:
判断所述流量所属的类别;determine the category to which the traffic belongs;
按照指定比例对所述类别的流量进行过滤。Filter the traffic of the category according to the specified ratio.
可选地,在按照指定比例对所述类别的流量进行过滤的步骤之后,还包括:Optionally, after the step of filtering the traffic of the category according to a specified ratio, the method further includes:
将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量;Use the remaining traffic after filtering the traffic of the category according to the specified ratio as the sampling traffic;
每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录;Obtain the sampling records of each sampling traffic that receives the successful bidding notification from the advertising exchange server at a specified time period;
从所述采样记录中提取每个采样流量的流量特征;extracting flow characteristics of each sampled flow from the sampling records;
利用预设的收益计算模型计算对应所述流量特征的第一价值数据;Using a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics;
判断多个第一价值数据的均值是否小于第一阈值;judging whether the mean value of multiple first value data is less than a first threshold;
如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。If the mean value of the plurality of first value data is not less than the first threshold, stop filtering the traffic of the category.
可选地,所述接收广告交易服务器发送的各流量的步骤之前,还包括:Optionally, before the step of receiving each flow sent by the advertising exchange server, it also includes:
根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。According to the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records, as well as the corresponding actual value data and consumption revenue data, the revenue calculation model is trained.
可选地,所述根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型的步骤,包括:Optionally, the step of training the revenue calculation model according to the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records and the corresponding actual value data and consumption revenue data includes:
针对每个流量,以所述流量的流量特征为自变量参数,以所述流量的消费收益数据与实际价值数据的差值,与所述流量所属类别的目标价值数据的比值作为因变量参数;For each flow, the flow characteristic of the flow is used as an independent variable parameter, and the difference between the consumption income data of the flow and the actual value data, and the ratio of the target value data of the category to which the flow belongs are used as the dependent variable parameter;
将各自变量参数和因变量参数构建该流量的训练特征向量;Construct the training feature vector of the traffic with the respective variable parameters and dependent variable parameters;
利用机器训练模型对所述训练特征向量进行训练,获得收益计算模型。A machine training model is used to train the training feature vector to obtain a revenue calculation model.
可选地,所述利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据的步骤,包括:Optionally, the step of using a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics includes:
将所述流量的流量特征代入所述收益计算模型,计算得到表示消费收益与预期实际价值的差值与所述流量所属目标价值之间的比值的第一价值数据。Substituting the flow characteristics of the flow into the income calculation model to calculate the first value data representing the ratio between the difference between the consumption income and the expected actual value and the target value of the flow.
根据本发明的另一方面,提供了一种流量过滤装置,包括:According to another aspect of the present invention, a flow filtering device is provided, comprising:
流量接收模块,用于接收广告交易服务器发送的各流量;The flow receiving module is used to receive each flow sent by the advertising exchange server;
第一流量特征获取模块,用于从每个流量中提取流量特征;A first traffic feature acquisition module, configured to extract traffic features from each traffic;
第一价值数据获取模块,用于利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;The first value data acquisition module is used to calculate the first value data corresponding to the traffic characteristics by using a preset revenue calculation model; the revenue calculation model is based on the target value data of each category of traffic and each traffic in the historical traffic record The characteristics and the corresponding actual value data and consumption revenue data are obtained through training; each flow in the historical flow record is the flow that received the successful bidding notification from the advertising exchange server;
第一判断模块,用于判断所述第一价值数据是否小于第一阈值;A first judging module, configured to judge whether the first value data is smaller than a first threshold;
过滤模块,用于当所述第一价值数据小于第一阈值,则将所述流量进行过滤。A filtering module, configured to filter the traffic when the first value data is less than a first threshold.
可选地,所述过滤模块,包括:Optionally, the filter module includes:
流量类别判断子模块,用于判断所述流量所属的类别;A traffic category judging submodule, configured to judge the category to which the traffic belongs;
过滤子模块,用于按照指定比例对所述类别的流量进行过滤。The filtering sub-module is configured to filter the traffic of the category according to a specified ratio.
可选地,在所述过滤模块之后,还包括:Optionally, after the filtering module, it also includes:
采样流量获取模块,用于将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量;A sampling traffic acquisition module, configured to use the remaining traffic after filtering the traffic of the category according to a specified ratio as the sampling traffic;
采样记录获取模块,用于每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录;The sampling record acquisition module is used to obtain the sampling records of each sampling flow received from the advertising exchange server for bidding success notification every specified time period;
第二流量特征获取模块,用于从所述采样记录中提取每个采样流量的流量特征;A second traffic feature acquisition module, configured to extract the traffic feature of each sampled flow from the sampling record;
采样第一价值数据获取模块,用于利用预设的收益计算模型计算对应所述流量特征的第一价值数据;Sampling the first value data acquisition module, configured to use a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics;
第二判断模块,用于判断多个第一价值数据的均值是否小于第一阈值;The second judging module is used to judge whether the mean value of multiple first value data is smaller than the first threshold;
过滤停止模块,用于如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。A filtering stop module, configured to stop filtering the traffic of the category if the average value of the plurality of first value data is not less than the first threshold.
可选地,所述流量接收模块之前,还包括:Optionally, before the flow receiving module, it also includes:
收益计算模型训练模块,用于根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。The revenue calculation model training module is used to train the revenue calculation model according to the target value data of each category of traffic, the characteristics of each traffic in the historical traffic records, and the corresponding actual value data and consumption revenue data.
可选地,所述收益计算模型训练模块,包括:Optionally, the income calculation model training module includes:
参数确认子模块,用于针对每个流量,以所述流量的流量特征为自变量参数,以所述流量的消费收益数据与实际价值数据的差值,与所述流量所属类别的目标价值数据的比值作为因变量参数;The parameter confirmation sub-module is used for each flow, taking the flow characteristic of the flow as an independent variable parameter, taking the difference between the consumption revenue data and the actual value data of the flow, and the target value data of the category to which the flow belongs The ratio of is used as the dependent variable parameter;
训练特征向量构建子模块,用于将各自变量参数和因变量参数构建该流量的训练特征向量;The training feature vector construction sub-module is used to construct the training feature vector of the traffic with the respective variable parameters and dependent variable parameters;
收益计算模型训练子模块,用于利用机器训练模型对所述训练特征向量进行训练,获得收益计算模型。The income calculation model training sub-module is used to use the machine training model to train the training feature vector to obtain the income calculation model.
可选地,第一价值数据获取模块,包括:Optionally, the first value data acquisition module includes:
第一价值数据计算子模块,用于将所述流量的流量特征代入所述收益计算模型,计算得到表示消费收益与预期实际价值的差值与所述流量所属目标价值之间的比值的第一价值数据。The first value data calculation sub-module is used to substitute the flow characteristics of the flow into the income calculation model, and calculate the first value representing the ratio between the difference between the consumption income and the expected actual value and the target value of the flow. value data.
根据本发明的一种流量过滤方法和装置,可以接收广告交易服务器发送的各流量;从每个流量中提取流量特征;利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;判断所述第一价值数据是否小于第一阈值;当所述第一价值数据小于第一阈值,则将所述流量进行过滤。由此解决了现有的广告交易服务器将全部的流量发送至各个DSP,导致DSP系统资源的浪费,降低了DSP的性价比的问题。取得了使广告交易服务器在将流量发送至各DSP 前对流量进行过滤,从而减少DSP系统资源浪费,提高性价比的有益效果。According to a flow filtering method and device of the present invention, each flow sent by the advertising exchange server can be received; flow characteristics are extracted from each flow; and the first value data corresponding to the flow characteristics is calculated by using a preset revenue calculation model ; The revenue calculation model is obtained according to the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records and the corresponding actual value data and consumption revenue data training; each traffic in the historical traffic records is received by the advertising transaction server Traffic notified of successful bidding; judging whether the first value data is less than a first threshold; when the first value data is less than the first threshold, filtering the traffic. This solves the problem that the existing advertising transaction server sends all the traffic to each DSP, resulting in waste of DSP system resources and reducing the cost performance of the DSP. The beneficial effects of enabling the advertising transaction server to filter the traffic before sending it to each DSP are achieved, thereby reducing waste of DSP system resources and improving cost performance.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solution of the present invention. In order to better understand the technical means of the present invention, it can be implemented according to the contents of the description, and in order to make the above and other purposes, features and advantages of the present invention more obvious and understandable , the specific embodiments of the present invention are enumerated below.
附图说明Description of drawings
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiment. The drawings are only for the purpose of illustrating a preferred embodiment and are not to be considered as limiting the invention. Also throughout the drawings, the same reference numerals are used to designate the same parts. In the attached picture:
图1示出了根据本发明一个实施例的一种流量过滤方法的步骤流程图;FIG. 1 shows a flowchart of steps of a traffic filtering method according to an embodiment of the present invention;
图1A示出了根据本发明一个实施例的一种RTB过程的示意图;FIG. 1A shows a schematic diagram of an RTB process according to an embodiment of the present invention;
图1B示出了根据本发明一个实施例的一种流量过滤流程图;FIG. 1B shows a flowchart of traffic filtering according to an embodiment of the present invention;
图1C示出了根据本发明一个实施例的一种同一页面的相似广告位的示意图;Fig. 1C shows a schematic diagram of a similar advertising space on the same page according to an embodiment of the present invention;
图1D示出了根据本发明一个实施例的某网站中不同点击监听率的媒体监听量的累积分布示意图;FIG. 1D shows a schematic diagram of the cumulative distribution of media monitoring volumes with different click monitoring rates in a website according to an embodiment of the present invention;
图1E示出了根据本发明一个实施例的一种根据点击监听率计算流量过滤率的函数图形示意图;FIG. 1E shows a schematic diagram of a function graph for calculating the traffic filtering rate according to the click-to-listen rate according to an embodiment of the present invention;
图1F示出了根据本发明一个实施例的一种不同点击监听率下每万次监听带来的毛利润示意图;Fig. 1F shows a schematic diagram of the gross profit brought by every 10,000 listens under different click-to-listen rates according to an embodiment of the present invention;
图1G示出了根据本发明一个实施例的一种不同点击监听率下媒体净利润在整体净利润的累积分布示意图;FIG. 1G shows a schematic diagram of the cumulative distribution of media net profit in the overall net profit under different click-to-listen rates according to an embodiment of the present invention;
图2示出了根据本发明一个实施例的一种流量过滤方法的步骤流程图;FIG. 2 shows a flowchart of steps of a traffic filtering method according to an embodiment of the present invention;
图3示出了根据本发明一个实施例的一种流量过滤装置的结构示意图;以及Fig. 3 shows a schematic structural diagram of a flow filtering device according to an embodiment of the present invention; and
图4示出了根据本发明一个实施例的一种流量过滤装置的结构示意图。Fig. 4 shows a schematic structural diagram of a flow filtering device according to an embodiment of the present invention.
具体实施方式Detailed ways
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.
实施例一Embodiment one
详细介绍本发明实施例提供的一种流量过滤方法。A traffic filtering method provided by an embodiment of the present invention is introduced in detail.
参照图1,示出了本发明实施例中一种流量过滤方法的步骤流程图。Referring to FIG. 1 , it shows a flowchart of steps of a traffic filtering method in an embodiment of the present invention.
步骤101,接收广告交易服务器发送的各流量。Step 101, receiving each flow sent by the advertising exchange server.
RTB是一种利用第三方技术在数以百万计的网站上针对每一个用户展示行为进行评估以及出价的竞价技术。如图1A为RTB过程的示意图,RTB过程涉及广告交易(AdExchange,ADX)平台、DSP和供应方平台(Sell-Side Platform,SSP)。它的核心是DSP平台,DSP平台如同展示型广告竞价投放的指挥部:第一步通过其数据追踪能力,来发送带有用户展示信息的请求;第二步DSP平台依据预估算法计算用户展示价值,并把竞价返回给竞价响应引擎;第三步,指令下达至广告交易系统,广告代码加载至各大媒体,最终传递给最精准的用户。其中的广告交易服务器是互联网广告交易平台,像股票交易平台一样,广告交易服务器联系的是广告交易的买方和卖方,也就是广告主方和广告位拥有方。需求方平台允许广告客户和广告机构等广告主方更方便地访问,以及更有效地购买广告库存,因为该平台汇集了各种广告交易平台的库存。有了这一平台,就不需要再出现另一个繁琐的购买步骤——购买请求。供应方平台能够让出版商等广告位拥有方也介入广告交易,从而使它们的库存广告可用。通过这一平台,出版商希望他们的库存广告可以获得最高的有效每千次展示费用,而不必以低价销售出去。RTB is a bidding technology that uses third-party technology to evaluate and bid on each user's display behavior on millions of websites. FIG. 1A is a schematic diagram of an RTB process. The RTB process involves an advertising exchange (AdExchange, ADX) platform, a DSP, and a supply-side platform (Sell-Side Platform, SSP). Its core is the DSP platform. The DSP platform is like the headquarters of the display advertisement bidding: the first step is to send a request with user display information through its data tracking capability; the second step is to calculate the user display based on the estimation algorithm. value, and return the bid to the bid response engine; in the third step, the order is issued to the advertising trading system, and the advertising code is loaded to the major media, and finally delivered to the most accurate users. The advertising exchange server is an Internet advertising exchange platform. Like a stock exchange platform, the advertisement exchange server contacts the buyer and seller of the advertisement exchange, that is, the advertiser and the owner of the advertising space. A demand-side platform allows advertisers, such as advertisers and agencies, to more easily access and more efficiently purchase ad inventory because the platform aggregates inventory from various ad exchanges. With this platform, there is no need for another cumbersome purchase step - a purchase request. Supply-side platforms enable ad slot owners such as publishers to also intervene in the ad transaction, making their inventory available. Through this platform, publishers hope to get the highest eCPM for their inventory without having to sell it at a low price.
其中,DSP平台允许广告客户和广告机构更方便地访问,以及更有效地购买广告库存,因为该平台汇集了各种广告交易平台的库存。有了这一平台,就不需要再出现另一个繁琐的购买步骤——购买请求。SSP能够让发布商也介入广告交易,从而使它们的库存广告位可用。通过这一平台,发布商希望他们的库存广告位可以获得最高的有效每千次展示费用,而不必以低价销售 出去。一个开放的、能够将发布商和广告商联系在一起的在线广告市场(类似于股票交易所)。交易平台里的广告位存货并不一定都是溢价库存,只要发布商想要提供的,都可以在里面找到。Among them, the DSP platform allows advertisers and advertising agencies to access more easily and purchase advertising inventory more efficiently, because the platform brings together the inventory of various advertising exchanges. With this platform, there is no need for another cumbersome purchase step - a purchase request. SSPs enable publishers to also intervene in the ad exchange, making their inventory available. Through this platform, publishers hope to get the highest eCPM for their inventory without having to sell it at a low price. An open online advertising marketplace (similar to a stock exchange) that connects publishers and advertisers. The inventory of advertising space in the trading platform is not necessarily all premium inventory, as long as the publisher wants to provide it, it can be found there.
下面举例说明RTB的具体过程:The following example illustrates the specific process of RTB:
步骤1,用户使用浏览器访问媒体网站,如网站A。网站A将本网站的内容,比如新闻,呈现给用户。与此同时,媒体网站也会在一些广告位上展示广告给用户。尽管媒体网站的主要工作是展现高品质的内容给用户,但也需要通过广告来赚钱,以使得提供内容的业务能够得以延续。他们可以使用自己的销售队伍销售广告库存。然而,对于媒体网站来说,RTB是越来越重要的销售广告库存的渠道。这里我们假设网站A决定将右上角广告位通过RTB方式进行售卖。Step 1, the user uses a browser to access a media website, such as website A. Website A presents the content of this website, such as news, to users. At the same time, media websites will also display advertisements to users on some advertising slots. Although the main job of media websites is to present high-quality content to users, they also need to make money through advertisements so that the business of providing content can continue. They can use their own sales force to sell advertising inventory. However, for media sites, RTB is an increasingly important channel for selling advertising inventory. Here we assume that website A decides to sell the advertising space in the upper right corner through RTB.
步骤2,网站A将信息传递给广告交易平台。传递的信息包括URL(UniformResource Locator,统一资源定位符),广告位置、用户Cookie ID(Cookie Identity,Cookie身份标识码)等。Step 2, website A transmits the information to the advertising trading platform. The transmitted information includes URL (UniformResource Locator, Uniform Resource Locator), advertisement location, user Cookie ID (Cookie Identity, Cookie identity code), etc.
步骤3,广告交易平台组织一次竞价,向多个DSP发送流量竞价响应。假定广告主1是DSP其中一个。In step 3, the advertising exchange platform organizes a bidding and sends traffic bidding responses to multiple DSPs. Assume advertiser 1 is one of the DSPs.
步骤4,当广告主1对应的DSP服务器接收到广告交易平台的流量竞价响应,将数据传递给竞价引擎。Step 4: When the DSP server corresponding to advertiser 1 receives the traffic bidding response from the advertising exchange platform, it transmits the data to the bidding engine.
步骤5,竞价引擎发送用户ID给用户与广告主信息数据库,看用户与广告主的需求是否匹配。Step 5: The bidding engine sends the user ID to the user and advertiser information database to see if the needs of the user and the advertiser match.
步骤6,竞价引擎接收到用户与广告客户的信息,并决定是否参与竞价及竞标价格。Step 6, the bidding engine receives the information of the user and the advertiser, and decides whether to participate in the bidding and the bidding price.
步骤7,竞价引擎生成一个出价响应,并把它传递给DSP服务器。竞价响应包括竞标价格及比如从哪个广告服务器获取广告创意等其它信息。Step 7, the bidding engine generates a bidding response and transmits it to the DSP server. The bid response includes the bid price and other information such as which ad server to get the ad creative from.
步骤8,广告主1对应的DSP服务器发送竞价响应给广告交易平台。Step 8: The DSP server corresponding to the advertiser 1 sends a bidding response to the advertising exchange platform.
步骤9,广告交易平台在接收到所有DSP服务器的响应或者截止期限到达后进行竞拍。例如,广告交易平台的截止期限是100ms,那么是指广告交易平台发送流量竞价响应与接收DSP出价响应的时间差需在截止期限内。 假定广告主1是本次广告竞价交易的赢家。In step 9, the advertising exchange platform conducts bidding after receiving responses from all DSP servers or when the deadline is reached. For example, the deadline of the advertising exchange platform is 100ms, which means that the time difference between the advertising exchange platform sending the traffic bidding response and receiving the DSP bidding response must be within the deadline. Assume that advertiser 1 is the winner of this advertising auction transaction.
步骤10,广告交易平台通知用户浏览器竞拍的赢家。浏览器发送广告曝光请求给广告创意所在的广告服务器。In step 10, the advertising exchange platform notifies the user browser of the winner of the auction. The browser sends an advertisement exposure request to the advertisement server where the advertisement creative is located.
步骤11,广告服务器发送广告创意给用户的浏览器。Step 11, the advertisement server sends the advertisement creative to the user's browser.
步骤12,用户看到网站上的广告。那么如果用户对该广告感兴趣,他会点击广告,从而进入广告主的登陆页。用户浏览广告主的网站,可能采取进一步的行动。例如,如果广告主是一家电子商务公司,用户可能在电子商务网站上进行一次购买活动。Step 12, the user sees an advertisement on the website. Then if the user is interested in the advertisement, he will click on the advertisement to enter the advertiser's landing page. Users browse the advertiser's website and may take further actions. For example, if the advertiser is an e-commerce company, the user may make a purchase on the e-commerce website.
在实时竞价过程中,广告交易服务器可以发送流量至需求方平台中的各广告主方,广告主方可以对接收到的各流量进行实时竞价。对于不同的流量,广告主方能够获取的收益是有区别的,而且可能存在部分流量,广告主方可能无法获得收益,那么对于此类流量,广告主可以无需参与竞价。因此在本发明实施例中,可以针对广告交易服务器发送的各流量进行过滤,以避免无效流量竞争或者是不必要的流量竞争。During the real-time bidding process, the advertising exchange server can send traffic to each advertiser in the demand-side platform, and the advertiser can conduct real-time bidding on the received traffic. For different types of traffic, the advertiser can obtain different revenues, and there may be some traffic, and the advertiser may not be able to obtain revenue, so for this type of traffic, the advertiser does not need to participate in the bidding. Therefore, in the embodiment of the present invention, it is possible to filter each traffic sent by the advertising exchange server, so as to avoid invalid traffic competition or unnecessary traffic competition.
那么首先需要在竞价之前接收广告交易服务器发送的各流量。在本发明实施例中,如果广告交易服务器发送的是流量,则接收广告交易服务器发送的流量;而如果广告交易服务器发送的是针对各流量的流量请求,则接收广告交易服务器发送的流量请求。对此本发明实施例不加以限定。其中的流量请求可以包括上述的流量竞价响应,对此本发明实施例不加以限定。Then firstly, it is necessary to receive the traffic sent by the advertising transaction server before bidding. In the embodiment of the present invention, if the advertisement transaction server sends traffic, the traffic sent by the advertisement transaction server is received; and if the advertisement transaction server sends a traffic request for each traffic, the traffic request sent by the advertisement transaction server is received. This embodiment of the present invention is not limited. The traffic request may include the aforementioned traffic bidding response, which is not limited in this embodiment of the present invention.
如图1B为加入流量过滤之后,流量从ADX中出发,最终回到ADX中的流程图。跟现有情况的最大不同是,在将流量发给后端模块以前,会提前决定是否有必要对这个流量进行竞价,如果不竞价则直接返回空的Response(响应),否则跟以前的竞价流程一样。Figure 1B is a flowchart of traffic starting from ADX and finally returning to ADX after traffic filtering is added. The biggest difference from the existing situation is that before the traffic is sent to the back-end module, it will be decided in advance whether it is necessary to bid on this traffic. If there is no bidding, an empty Response will be returned directly, otherwise it will be the same as the previous bidding process Same.
步骤102,从每个流量中提取流量特征。Step 102, extract traffic features from each traffic.
我们初步确定,一个流量质量的高低,在于其是否能带来点击,因此可以定义一个指标,点击监听率(Click ListenRate,CLR),CLR的计算如下:We preliminarily determined that the quality of a traffic depends on whether it can bring clicks, so we can define an indicator, Click Listen Rate (CLR), and the calculation of CLR is as follows:
点击监听率=点击数/监听数=点击数/展示数*展示数/竞价数*竞价数/监听数=点击率*成功率*竞价率Click monitoring rate = number of clicks/number of monitoring = number of clicks/number of impressions*number of impressions/number of bids*number of bids/number of monitoring=click rate*success rate*bidding rate
可以看到,虽然整个流程中,虽然影响点击监听率的因素很多,有竞价相关的,也有后期效果相关的。但归根到底,一个流量能给我们带来越多的点击,其质量就越好,这是最直观的概念,不用特别细节的去关注中间环节有哪些因素的影响。It can be seen that although in the whole process, there are many factors that affect the click monitoring rate, some are related to bidding, and some are related to post-effects. But in the final analysis, the more clicks a traffic can bring us, the better its quality is. This is the most intuitive concept, and there is no need to pay special attention to the factors affecting the intermediate links.
本来只需要点击监听率就足够了,但考虑到以下两种情景,我们也需要有对收益的考虑。Originally, only the click rate is enough, but considering the following two scenarios, we also need to consider the revenue.
如果一个广告位的流量上很多投放都是按CPM的,则其点击效果对于最终收益影响并不大,因此,如果其竞价效果很好的话,即使点击率低,也是赚钱的,这种情况下并不应该过滤掉。If a lot of traffic on an ad slot is based on CPM, its click effect has little impact on the final revenue. Therefore, if the bidding effect is good, even if the click-through rate is low, it is still profitable. In this case should not be filtered out.
如果一个广告位的流量上竞价效果很好,且刚好满足某个高CPC出价的广告主需求,即使其点击率低一点,也不应该过滤掉。If the bidding effect on the traffic of an ad slot is very good, and it just meets the needs of an advertiser with a high CPC bid, even if its click-through rate is a little low, it should not be filtered out.
流量收益的计算,分以下三部分:The calculation of traffic revenue is divided into the following three parts:
(1)内部成本(1) Internal cost
DSP每监听一个流量,不管是否竞价,都是有成本的,这个成本包括网络、机器资源、人力等,其跟外部因素关系不高,一般是固定并稳定的,可当作一个常量(用C表示),目前,运维部门给出的值是每千次监听2分钱。Every time a DSP monitors a flow, no matter whether it bids or not, there is a cost. This cost includes network, machine resources, manpower, etc. It has little to do with external factors. It is generally fixed and stable and can be regarded as a constant (use C Express), at present, the value given by the operation and maintenance department is 2 cents per thousand monitoring.
(2)外部成本(2) External costs
DSP对每个竞价成功的流量需要付费,这个费用是浮动的,跟媒体的低价、其他DSP的出价都相关,因此叫做外部成本。只有当竞价成功时,才有外部成交价(P),因此,一个流量的外部成本可表示为:BR*WR*PDSP needs to pay for each successful bidding traffic. This fee is floating and related to the low price of the media and the bids of other DSPs, so it is called external cost. Only when the bidding is successful, there is an external transaction price (P), therefore, the external cost of a flow can be expressed as: BR*WR*P
(3)内部收入(3) Internal income
假定DSP中所有的广告主都按CPC(Cost Per Click,以每点击一次计费)出价,则只有当竞价成功的创意产生点击之后,DSP才会对广告主扣费。因此,一个按CPC和CPM(CostPerMille,每千人成本)出价的监听流量的内部收入分别表示为BR(Bidding rate,竞价率)*WR(成功率)*CTR(Click-through Rate,点击率)*CPC和BR*WR*CPM/1000。假设用FPS(Fee Per Show,按展示次数收费)表示平均每次展示的扣费,其对应到CPC和CPM出价的值为CTR*CPC和CPM/1000。Assuming that all advertisers in the DSP bid according to CPC (Cost Per Click), the DSP will deduct fees from the advertiser only after the successful bidding idea generates a click. Therefore, the internal revenue of monitoring traffic bid by CPC and CPM (CostPerMille, cost per thousand people) is expressed as BR (Bidding rate, bid rate) * WR (success rate) * CTR (Click-through Rate, click-through rate) *CPC and BR*WR*CPM/1000. Assuming that FPS (Fee Per Show, charged per impression) is used to represent the average deducted fee per impression, the values corresponding to CPC and CPM bids are CTR*CPC and CPM/1000.
综上,一个流量的监听收益可表示为:RPL=BR*WR*FRS-C-BR*WR*P。To sum up, the monitoring revenue of a flow can be expressed as: RPL=BR*WR*FRS-C-BR*WR*P.
前文描述的案例,都是以二级域名为对象,而实际上,同一个页面下,不同广告位的完全可能会有不同的效果,通常来讲,首屏的点击率会比非首屏的高,大尺寸广告位的点击率会比小尺寸的高。这就意味着,不能将同一域名下的所有广告位统一对待。最好的情形是,由ADX对不同广告位提供一个唯一的编号,DSP就能直接通过这个唯一编号来统计其相应的竞价效果(竞价率、竞价成功率)和后期效果(点击率、收益情况)。但实际上,并不是所有ADX都有提供广告位唯一标识,因此,需要DSP根据流量请求中的不同维度信息来组合映射到一个广告位。The cases described above are all based on the second-level domain name. In fact, under the same page, different advertising positions may have different effects. Generally speaking, the click-through rate of the first screen will be higher than that of the non-first screen. High, the click-through rate of large-sized ad slots will be higher than that of small-sized ones. This means that all ad slots under the same domain cannot be treated uniformly. The best situation is that ADX provides a unique number for different advertising positions, and DSP can directly use this unique number to count its corresponding bidding effect (bidding rate, bidding success rate) and post-effect (click rate, revenue situation) ). But in fact, not all ADXs provide unique identifiers for advertising slots. Therefore, DSPs are required to combine and map to an advertising slot based on different dimensional information in traffic requests.
具体维度的选取,需要考虑多个指标:需要有足够的区分度、各维度之间条件独立(至少要近似的)、足够通用,适用于所有ADX,等等。The selection of specific dimensions needs to consider multiple indicators: there needs to be sufficient discrimination, the conditions between each dimension are independent (at least approximate), sufficiently general, applicable to all ADX, and so on.
结合DSP Cube的数据验证,最终,从媒体的角度,选取了以下几个维度:二级域名、屏数、广告位尺寸。Combined with the data verification of DSP Cube, finally, from the perspective of the media, the following dimensions were selected: second-level domain name, number of screens, and size of advertising space.
比较巧合的是,这三个维度本来就是现在DSP计算第三方广告位ID的输入参数。另外,由于同一组维度值的广告位,在不同ADX中的效果可能是不一样的。下表为www.zhibo8.cc域名下Tanx和Baidu的数据,可以看到在成功率和点击率上相差还是比较大的,虽然域名的层次太广会有一定误差,但综合数据已有足够的参考意义。Coincidentally, these three dimensions are originally the input parameters for DSP to calculate the third-party advertising slot ID. In addition, due to the same set of dimension values, the effect may be different in different ADXs. The following table shows the data of Tanx and Baidu under the www.zhibo8.cc domain name. It can be seen that there is still a relatively large difference in the success rate and click-through rate. Although the domain name level is too wide, there will be some errors, but the comprehensive data is sufficient. D.
表格1.zhibo8中Tanx和Baidu的效果对比Table 1. Comparison of the effects of Tanx and Baidu in zhibo8
很明显这几个维度并不能做到一对一的映射,即多个广告位可能被映射到了同一个组合的维度值上,这就需要映射到同一维度值上的广告位具有类似的竞价效果和后期效果。Obviously, these dimensions cannot be mapped one-to-one, that is, multiple advertising slots may be mapped to the same combined dimension value, which requires that the advertising slots mapped to the same dimension value have similar bidding effects and after effects.
下图是同一页面上的三个广告,通过这3个维度的映射,会被当作同一个广告位处理。虽然用户具体点击哪一个广告位的概率更高,不同人有不同 人的习惯,甚至会涉及生理学、心理学等相关的行为分析,但至少表格2的数据验证,这三个广告位的后期效果相差不大,因此可映射到同一组合的维度值上。如图1C是同一页面上的三个广告,通过这3个维度的映射,会被当作同一个广告位处理。虽然用户具体点击哪一个广告位的概率更高,不同人有不同人的习惯,甚至会涉及生理学、心理学等相关的行为分析,但至少表格2的数据验证,这三个广告位的后期效果相差不大,因此可映射到同一组合的维度值上。The picture below shows three advertisements on the same page. Through the mapping of these three dimensions, they will be treated as the same advertisement slot. Although the user has a higher probability of clicking on which ad slot, different people have different habits, and even involves behavioral analysis related to physiology and psychology, but at least the data in Table 2 verify that the post-effects of these three ad slots The difference is not large, so they can be mapped to the same combination of dimension values. As shown in Figure 1C, there are three advertisements on the same page. Through the mapping of these three dimensions, they will be treated as the same advertisement space. Although the user has a higher probability of clicking on which ad slot, different people have different habits, and even involves behavioral analysis related to physiology and psychology, but at least the data in Table 2 verify that the post-effects of these three ad slots The difference is not large, so they can be mapped to the same combination of dimension values.
表格2.同一页面映射到同一维度值的三个广告位Table 2. Three ad slots mapped to the same dimension value on the same page
Bid Request(竞价请求)也即本申请中的流量请求中除了媒体的广告位信息,还有用户信息,可能包括第三方ID、性别、兴趣标签、设备信息等。这部分用户数据对创意的投放选择会产生很大的影响,从而也可能会影响到之后的竞价效果和后期效果。目前,第三方ID数据大太,不适合作为一个特征。性别又不是很精确,也不合适。设备信息相对符合我们的特征选取标 准,但由于目前对设备的分析还比较少,一期从简单的角度,并不会将其纳入特征组合中,但之后,随着Mobile的上线,预计会有设备的维度加到模型中。Bid Request (bidding request), that is, the traffic request in this application, in addition to media advertising space information, also includes user information, which may include third-party ID, gender, interest tags, device information, etc. This part of user data will have a great impact on the choice of creative delivery, which may also affect the subsequent bidding effect and post-effect. Currently, third-party ID data is too large to be suitable as a feature. The gender is again imprecise and inappropriate. The device information is relatively in line with our feature selection criteria. However, due to the fact that the analysis of devices is still relatively small, from a simple point of view, it will not be included in the feature combination in the first phase, but later, with the launch of Mobile, it is expected that there will be The dimensions of the device are added to the model.
除了上述可直接从Bid Request中抽取出来信息外,最重要的用户信息,其实是Cookie Mapping的效果,可以肯定的是,Cookie Mapping成功的流量的效果,肯定比CookieMapping失败的好。虽然要纳入这部分信息,意味着流量过滤模块需要放在cookie mapping查询之后,而Cookie Mapping本身是一个异步操作,会影响到效率,但这个工作是很值得做的,DSP可以容忍每个流量都走Cookie Mapping的过程,因为即使这样也至少不会比目前更差。In addition to the above information that can be directly extracted from the Bid Request, the most important user information is actually the effect of Cookie Mapping. It is certain that the traffic effect of successful Cookie Mapping is definitely better than that of Cookie Mapping failure. Although the inclusion of this part of information means that the traffic filtering module needs to be placed after the cookie mapping query, and the cookie mapping itself is an asynchronous operation, which will affect the efficiency, but this work is worth doing. DSP can tolerate every traffic Go through the process of Cookie Mapping, because even this is at least not worse than it is now.
因此,我们只使用是否Cookie Mapping成功作为用户的特征,之后,会考虑将Cookie的时间、标签数也作为特征。Therefore, we only use whether the Cookie Mapping is successful as a feature of the user. After that, we will also consider the time of the cookie and the number of tags as features.
各广告主方在实时竞价过程中,竞价金额的数值与流量有很大的关系。例如,如果当前的流量对应的用户经常访问购物网站,那么如果广告主也是购物类的广告,则可能会出比较高的竞价金额,而且该广告主可能获取的收益也比较高;反之,如果当前的流量对应的用户几乎不访问购物网站,那么该广告主出的竞价金额会比较低,而且如果竞价成功,该广告主可能获取的收益也很低。In the real-time bidding process of each advertiser, the value of the bidding amount has a great relationship with the traffic. For example, if the current traffic corresponds to users who often visit shopping websites, then if the advertiser is also a shopping advertisement, the bid amount may be relatively high, and the advertiser may obtain relatively high revenue; conversely, if the current The traffic corresponding to the user hardly visits the shopping website, so the advertiser will bid a relatively low amount, and if the bid is successful, the advertiser may obtain very low revenue.
因此,在本发明实施例中,需要获取每个流量的流量特征。即从每个流量中提取流量特征。其中,流量特征可以包括流量对应的时间、流量对应的地域、流量对应的媒体上下文、流量对应的广告位尺寸、流量对应的用户信息等等,其中流量对应的用户信息又可以包括用户属性、用户的兴趣维度、用户的浏览记录等等。其中,流量对应的时间是指该流量对应产生的时间,也即用户浏览该流量对应的广告位所在网页的时间。流量对应的地域是指用户浏览该流量对应的广告位所在网页时,用户所在的地域,可以包括用户浏览网页的IP(InternetProtocol,互联网协议)地址等。流量对应的媒体上下文可以包括流量对应的广告位所在的网页的上下文内容。用户属性可以包括用户的性别、年龄等等个人信息。用户的兴趣维度是指用户的兴趣点,可以包括兴趣爱好、社交圈等等。在本发明实施例中,可以在本步骤之前,或者 是本步骤之前的任一步骤中之前根据需求设定需提取的流量特征的具体内容,对此本发明实施例不加以限定。Therefore, in the embodiment of the present invention, it is necessary to acquire the traffic characteristics of each traffic. That is, flow features are extracted from each flow. Among them, the traffic characteristics may include the time corresponding to the traffic, the region corresponding to the traffic, the media context corresponding to the traffic, the size of the advertisement space corresponding to the traffic, the user information corresponding to the traffic, etc., and the user information corresponding to the traffic may include user attributes, user interest dimensions, user browsing history, and so on. Wherein, the time corresponding to the traffic refers to the corresponding generation time of the traffic, that is, the time when the user browses the webpage where the advertisement slot corresponding to the traffic is located. The region corresponding to the traffic refers to the region where the user is located when the user browses the webpage where the advertisement space corresponding to the traffic is located, and may include the IP (Internet Protocol, Internet Protocol) address of the user browsing the webpage. The media context corresponding to the traffic may include the context content of the webpage where the advertisement slot corresponding to the traffic is located. User attributes may include user's gender, age and other personal information. The user's interest dimension refers to the user's points of interest, which may include hobbies, social circles, and so on. In the embodiment of the present invention, the specific content of the traffic characteristics to be extracted can be set according to requirements before this step, or in any step before this step, which is not limited in this embodiment of the present invention.
步骤103,利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量。Step 103, using the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation model is based on the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records and the corresponding actual Value data and consumption revenue data are obtained through training; each flow in the historical flow record is the flow that received the successful bidding notification from the advertising exchange server.
在获取了各流量的流量特征之后,即可以利用预设的收益计算模块计算对应各流量的流量特征的第一价值数据。其中的第一价值数据是广告主认可的收益价值,而不是常规的实际收益。After the flow characteristics of each flow are acquired, the preset revenue calculation module may be used to calculate the first value data corresponding to the flow characteristics of each flow. The first value data is the revenue value recognized by the advertiser, rather than the conventional actual revenue.
例如,对于某一流量,某一广告主A对此类流量的目标价值数据为10元,但是广告主A最后以1元参与竞价并竞价成功,也即该流量对应的实际价值数据为1元,此时如果该流量对应的用户给该广告主带来2元的收入,那么其实际收益为1元。但是对于该流量而言,广告主认可的收益价值是10元。可以看出,在本发明实施例中,广告主认可的收益价值可以包括目标价值数据与实际价值数据的差值,以及实际收益。For example, for a certain amount of traffic, an advertiser A’s target value data for this type of traffic is 10 yuan, but the advertiser A finally participated in the bidding with 1 yuan and the bid was successful, that is, the actual value data corresponding to this traffic is 1 yuan , at this time, if the user corresponding to the traffic brings the advertiser an income of 2 yuan, then the actual income is 1 yuan. But for this traffic, the revenue value recognized by the advertiser is 10 yuan. It can be seen that, in the embodiment of the present invention, the revenue value approved by the advertiser may include the difference between the target value data and the actual value data, and the actual revenue.
而且,在本发明实施例中,为了方便比较,可以广告主认可的收益与目标价值数据的差值作为收益参数进行比较。那么此时收益参数为目标价值数据与实际价值数据的差值与实际收益之和为分子,目标价值数据为分母的比值,即:Moreover, in the embodiment of the present invention, for the convenience of comparison, the difference between the revenue approved by the advertiser and the target value data can be used as the revenue parameter for comparison. Then the income parameter at this time is the difference between the target value data and the actual value data and the sum of the actual income is the numerator, and the target value data is the ratio of the denominator, namely:
收益参数=(目标价值数据-实际价值数据+实际收益)/目标价值数据Earnings parameter = (target value data - actual value data + actual income) / target value data
对上式进行简化后,可以得到,收益参数=1+(实际价值数据-实际收益)/目标价值数据。其中,1为固定值,那么(实际价值数据-实际收益)/目标价值数据,与最终的收益成正比关系。在本发明实施例中,即可以实际价值数据与实际收益的差值与目标价值数据的比值作为第一价值数据。显然,第一价值数据的值越大,则表明对应的流量的收益越高。当然,具体的第一价值数据包含的内容可以在本步骤之前,或者是本步骤之前的任一步骤之前根据需求进行设定,对此本发明实施例不加以限定。After simplifying the above formula, it can be obtained that income parameter=1+(actual value data-actual income)/target value data. Among them, 1 is a fixed value, then (actual value data-actual income)/target value data is directly proportional to the final income. In the embodiment of the present invention, the ratio of the difference between the actual value data and the actual income to the target value data may be used as the first value data. Obviously, the larger the value of the first value data, the higher the income of the corresponding traffic. Of course, the specific content contained in the first value data can be set according to requirements before this step, or before any step before this step, which is not limited in this embodiment of the present invention.
其中,收益计算模型根据各类别的流量的目标价值数据和历史流量记录 中各流量的特征以及相应的实际价值数据和消费收益数据训练获得。而且,只有竞价成功的流量才可以带来收益,因此本发明实施例中的历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量。Among them, the revenue calculation model is obtained according to the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records, as well as the corresponding actual value data and consumption revenue data. Moreover, only traffic with successful bidding can bring revenue, so each traffic in the historical traffic records in the embodiment of the present invention is the traffic that has received a successful bidding notification from the advertising transaction server.
那么,则可以历史流量记录中各流量的特征作为输入,以相应流量的消费收益数据和实际价值数据的差值与目标价值数据的比值作为输出,训练收益计算模型。Then, the characteristics of each flow in the historical flow records can be used as input, and the ratio of the difference between the consumption income data and actual value data of the corresponding flow and the target value data can be used as the output to train the income calculation model.
本着越简单越好的原则,在本发明实施例中,可以使用Bayes作为一期的流量过滤模型。Based on the principle that the simpler the better, in the embodiment of the present invention, you can use Bayes as a one-period traffic filtering model.
模型输入:一期直接使用DSP Cube,后续可能会用到某段时间(比如一周)的所有DSP原始日志。Model input: DSP Cube is directly used in the first phase, and all DSP original logs of a certain period of time (such as a week) may be used later.
模型输出:带来点击的流量的先验概率和在各个特征维度上的独立概率分布;没带来点击的流量的先验概率和在各个特征维度上的独立概率分布。Model output: the prior probability of traffic that brings clicks and the independent probability distribution on each feature dimension; the prior probability of traffic that does not bring clicks and the independent probability distribution on each feature dimension.
当DSP收到一个流量时,要决定是否过滤掉,首先需要从流量中抽取出,即将流量表示成(x1,x2,…,xn),xi表示第i个维度的值。When the DSP receives a flow, to decide whether to filter it out, it first needs to extract it from the flow, that is, express the flow as (x1, x2, ..., xn), and xi represents the value of the i-th dimension.
接下来,结合训练出来的Bayes模型,根据下面的公式,预测点击监听率(CLR):Next, combined with the trained The Bayes model predicts the click-to-listen rate (CLR) according to the following formula:
这里,事件1(正例)表示一个监听带来一次点击,事件0(负例)表示一个监听没带来一次点击。P(1)表示一个监听能够带来一次点击的先验概率,P(0)=1-P(1)。p(1|x1,x2,…,xn)表示给定一个流量(x1,x2,…,xn),其最终带来点击的概率。假定x1表示二级域名的维度值,则p(x1|1)表示所有带来点击的流量中,属于x1域名下的比例,而p(x1|0)则表示所有没带来点击的流量中,属于x1域名下的比例。Here, event 1 (positive example) indicates that a listener brought a click, and event 0 (negative example) indicates that a listener did not bring a click. P(1) represents the prior probability that one monitoring can bring about one click, P(0)=1-P(1). p(1|x1,x2,...,xn) represents the probability that a traffic (x1, x2,...,xn) will eventually bring a click. Assuming that x1 represents the dimension value of the second-level domain name, then p(x1|1) represents the proportion of all traffic that brings clicks under the domain name x1, and p(x1|0) represents the proportion of all traffic that does not bring clicks , belonging to the proportion under the x1 domain name.
可以预料的是,通过模型预测得到的点击监听率会分布在[0,1]上,且大部分都小于0.1%。如果完全使用Bayes分类器,则几乎所有流量都会归类成负例,因此,我们并不直接用Bayes分类器。如果我们期望的是把根据CLR计算流量过滤的概率,则需要将其更紧凑地映射到[0,1]区间 上。It can be expected that the click-to-listen rates predicted by the model will be distributed on [0,1], and most of them are less than 0.1%. if fully used Bayes classifier, almost all traffic will be classified as negative examples, so we do not directly use Bayesian classifier. If what we expect is to calculate the probability of traffic filtering according to the CLR, we need to map it more compactly to the [0,1] interval.
步骤104,判断所述第一价值数据是否小于第一阈值。Step 104, judging whether the first value data is smaller than a first threshold.
由于CLR本身就是一个连续的值,只是其数据主要分布在[0,1]区间上的一个很小的子区间上,如图1D表示2014年1月11日某网站中所有二级域名上流量的点击监听率的累积分布情况。平均点击展示比为1.3/10000。可以看到,低于0.5/10000的流量占了约50%,低于1/10000的流量占了接近60%,低于6/10000的流量就占了超过80%,超过90%的都是在[0,1/1000]的区间内。Since CLR itself is a continuous value, its data is mainly distributed in a small sub-interval of [0,1], as shown in Figure 1D, which shows the traffic on all second-level domain names of a website on January 11, 2014 The cumulative distribution of click listen rates for . The average click-to-display ratio is 1.3/10000. It can be seen that the traffic below 0.5/10000 accounts for about 50%, the traffic below 1/10000 accounts for nearly 60%, the traffic below 6/10000 accounts for more than 80%, and more than 90% are In the interval [0,1/1000].
因此我们只需要设定一个阀值a,当CLR高于a的,我们全部保留,在[0,a]间上的值,则直接通过线性函数使其更紧凑的分布在[0,1]上。如图1E表示根据点击监听率计算流量过滤率的函数图形。可以看到,比a越小,被过滤的概率越大。Therefore, we only need to set a threshold a. When the CLR is higher than a, we keep all the values between [0, a], and directly use the linear function to make it more compactly distributed in [0,1] superior. FIG. 1E shows the function graph of calculating the traffic filtering rate according to the click monitoring rate. It can be seen that the smaller the ratio a, the greater the probability of being filtered.
这里a的选取,将直接关系到流量过滤的严格程度,越高越严格,越低越宽松。这时,就需要将流量收益情况作为a的选取依据。如图1F表示某网站中不同监听比下每万次监听带来的毛利润分布情况,竖轴单位是元,图中的横线是我们的内部成本线,即每万次监听0.2元,竖线是监听比为1/10000的分位线。可以看到,1/10000点击监听率下的流量中,我们基本是亏钱的,而高于1/10000的则基本是亏钱的。因此,这个1/10000的值,其实就可以作为选取阀值a的重要参考。The selection of a here will be directly related to the strictness of traffic filtering, the higher the value is, the stricter it is, and the lower it is, the looser it will be. At this time, it is necessary to use the traffic revenue as the basis for selecting a. Figure 1F shows the distribution of gross profit brought by every 10,000 monitoring times in a certain website under different monitoring ratios. The unit of the vertical axis is yuan. The line is the quantile line with a monitor ratio of 1/10000. It can be seen that in the traffic with a click monitoring rate of 1/10000, we basically lose money, and those with a rate higher than 1/10000 basically lose money. Therefore, this 1/10000 value can actually be used as an important reference for selecting the threshold a.
如图1G表示不同点击监听率下,媒体净利润在整体净利润的累积分布,可以看到,二级域名的点击监听率在1/10000以下流量(占总量约60%)都是亏钱的,这部分亏的钱,直到6/10000的时候才被填平。然后从6/10000开始,才正式开始变成赚钱状态。根据某网站这一天的数据,我们是可以将阀值a设置为1/10000。Figure 1G shows the cumulative distribution of media net profit in the overall net profit under different click monitoring rates. It can be seen that traffic with a click monitoring rate of second-level domain names below 1/10000 (accounting for about 60% of the total) is a loss Yes, this part of the lost money will not be filled until 6/10000. Then starting from 6/10000, it officially started to make money. According to the data of a certain website for this day, we can set the threshold a to 1/10000.
如果纯粹的根据是否赚钱作为临界点,有时是不一定准确的。比如溢价过高就可能导致DSP的整体利润变成负的,这时,其赚钱的临界点就会非常高,甚至会比平均的点击监听率高。因此,需要提供一个线上参数μ=[0,1],这样我们就可以实时的决定以多大的程度来作流量过滤了。If the critical point is purely based on whether to make money, sometimes it is not necessarily accurate. For example, if the premium is too high, the overall profit of DSP may become negative. At this time, the critical point of making money will be very high, even higher than the average click-to-monitor rate. Therefore, an online parameter μ=[0,1] needs to be provided, so that we can decide in real time how much to filter traffic.
如前述,在本发明实施例中,第一价值数据与认可收益成正比,那么第一价值数据越大,则表明广告主的收益越高,而如果第一价值数据低于第一阈值,则表明此时广告主的收益较低,或者是无收益,那么此时广告主即不需要竞价相应的流量。其中的第一阈值可以根据需求在本步骤之前,或者是本步骤之前的任一步骤之前进行设定,对此本发明实施例不加以限定。而且,对于不同广告主而言,第一阈值的取值可以不同,也可以相同,对此本发明实施例不加以限定。As mentioned above, in the embodiment of the present invention, the first value data is directly proportional to the recognition income, then the greater the first value data, the higher the advertiser’s income, and if the first value data is lower than the first threshold, then It indicates that the advertiser's income is low or has no income at this time, so the advertiser does not need to bid for the corresponding traffic at this time. The first threshold may be set before this step or any step before this step according to requirements, which is not limited in this embodiment of the present invention. Moreover, for different advertisers, the value of the first threshold may be different or the same, which is not limited in this embodiment of the present invention.
步骤105,当所述第一价值数据小于第一阈值,则将所述流量进行过滤。Step 105, when the first value data is less than a first threshold, filter the traffic.
那么如果第一价值数据小于第一阈值,则表明广告主可以不需要参与竞价相应流量,此时可以在将该流量过滤掉,不将其发送至对应的广告主;而如果第一价值数据不用小于第一阈值,则不会过滤掉该流量。但是如果对于同一流量,对应另一广告主的第一价值数据不小于该广告主的第一阈值,那么对于该广告主而言,不会过滤掉该流量。Then, if the first value data is less than the first threshold, it indicates that the advertiser does not need to participate in bidding for the corresponding traffic, and at this time, the traffic can be filtered out and not sent to the corresponding advertiser; and if the first value data is not used If it is smaller than the first threshold, the traffic will not be filtered out. However, if for the same traffic, the first value data corresponding to another advertiser is not less than the advertiser's first threshold, then the traffic will not be filtered out for the advertiser.
引入流量过滤后,一个流量到达DSP后的生命流程,主要分三种情景:After the introduction of traffic filtering, the life process of a traffic after it reaches the DSP is mainly divided into three scenarios:
其一,DSP收到ADX的流量并传给了后端,并且最终发起了竞价。First, the DSP receives ADX traffic and passes it to the backend, and finally initiates the bidding.
其二,DSP收到ADX的流量并传给了后端,但最终不参与竞价。Second, the DSP receives ADX traffic and passes it to the backend, but does not participate in the bidding in the end.
其三,流量直接被DSP过滤掉,不发往后端。Third, the traffic is directly filtered by the DSP and is not sent to the backend.
目前监听流量的统计,包含了全部的三种情况,这在计算点击监听率时,会有一个问题,如果第三种情景越多,点击监听率(CLR)就越小,而根据前文所述,CLR本身又是决定一个流量是否要过滤的因素,因此会迭代的使得第三种情景更多,最终的后果是某个流量在某一天偶然效果比较差后,会越来越差,这时即使其他因素有所好转依然有很大的概率被当作低质量流量。这严重影响了模型的健壮性和通用性。The current monitoring traffic statistics include all three situations. When calculating the click-to-listen rate, there will be a problem. If the third scenario is more, the click-to-listen rate (CLR) will be smaller. According to the above-mentioned , CLR itself is the factor that decides whether a traffic should be filtered or not, so it will iteratively make the third scenario more, and the final result is that a certain traffic will become worse and worse after the occasional effect is poor on a certain day. Even if other factors improve, there is still a high probability that it will be regarded as low-quality traffic. This seriously affects the robustness and generalizability of the model.
为了解决这个问题,需要定义一个新的概念,有效监听(Valid Listen),对于DSP而言,只有当一个流量走了后端计算模块之后,才算是一次有效的监听,不然只能当作一次伪监听。因此,在计算CLR时,应该只考虑情景一和情景二。这样,CLR就不会受到流量过滤结果的影响。为了统计有效监听,需要在DSP日志中加上一个额外的字段,用于标记一个流量是否是 有效监听,如果不是,又是因为什么原因被过滤的。同时,dsp的cube中也需要再加上这样一个维度,到时模型的输入直接就可以从cube中获取。In order to solve this problem, it is necessary to define a new concept, valid listen (Valid Listen). For DSP, only when a flow passes through the back-end computing module, it can be regarded as a valid listen, otherwise it can only be regarded as a false listen. monitor. Therefore, when calculating the CLR, only Scenario 1 and Scenario 2 should be considered. This way, the CLR will not be affected by the results of traffic filtering. In order to count effective monitoring, an additional field needs to be added to the DSP log to mark whether a traffic is valid monitoring, and if not, why it is filtered. At the same time, such a dimension needs to be added to the cube of the dsp, so that the input of the model can be obtained directly from the cube.
现有的DSP Cube并没有有效监听的概念,需要根据日志信息在Cube中增加一个维度,用于标记监听的流量是否有效。The existing DSP Cube does not have the concept of effective monitoring. It is necessary to add a dimension to the Cube based on the log information to mark whether the monitored traffic is valid.
在本发明实施例中,可以接收广告交易服务器发送的各流量;从每个流量中提取流量特征;利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;判断所述第一价值数据是否小于第一阈值;当所述第一价值数据小于第一阈值,则将所述流量进行过滤。由此取得了使广告交易服务器在将流量发送至各DSP前对流量进行过滤,从而减少DSP系统资源浪费,提高性价比的有益效果。In the embodiment of the present invention, it is possible to receive each traffic sent by the advertising exchange server; extract the traffic characteristics from each traffic; use the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation The model is trained based on the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records, as well as the corresponding actual value data and consumption revenue data; the traffic in the historical traffic records is the traffic that received the successful bidding notification from the advertising exchange server ; Judging whether the first value data is less than a first threshold; when the first value data is less than the first threshold, filter the traffic. In this way, the advertising trading server can filter the traffic before sending it to each DSP, thereby reducing the waste of DSP system resources and improving the cost performance.
实施例二Embodiment two
详细介绍本发明实施例提供的一种流量过滤方法。A traffic filtering method provided by an embodiment of the present invention is introduced in detail.
参照图2,示出了本发明实施例中一种流量过滤方法的步骤流程图。Referring to FIG. 2 , it shows a flowchart of steps of a traffic filtering method in an embodiment of the present invention.
步骤201,根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。Step 201, train the revenue calculation model according to the target value data of each category of traffic, the characteristics of each traffic in the historical traffic records, and the corresponding actual value data and consumption revenue data.
由于后续步骤中需要利用收益计算模型计算流量的第一价值数据,那么首先需要训练该收益计算模型。在本发明实施例中,具体的可以根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。而且,为了提高训练得到的收益计算模型的准确性,那么分别选择不同类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。具体的用以训练收益计算模型的流量可以根据需求选择,对此本发明实施例不加以限定。Since the revenue calculation model needs to be used to calculate the first value data of traffic in the subsequent steps, the revenue calculation model needs to be trained first. In the embodiment of the present invention, specifically, the revenue calculation model can be trained according to the target value data of each category of traffic, the characteristics of each traffic in the historical traffic records, and the corresponding actual value data and consumption revenue data. Moreover, in order to improve the accuracy of the revenue calculation model obtained through training, the target value data of different types of traffic, the characteristics of each traffic in the historical traffic records, and the corresponding actual value data and consumption revenue data are respectively selected to train the revenue calculation model. The specific traffic used to train the revenue calculation model can be selected according to requirements, which is not limited in this embodiment of the present invention.
需要说明的是,在训练收益计算模型时所利用的各流量的特征具体所包含的维度需要与在后续从接收到的广告交易服务器发送的流量中提取的流 量特征维度一致,或者是后续从接收到的广告交易服务器发送的流量中提取的流量特征维度包括在训练收益计算模型时所利用的各流量的特征。It should be noted that the specific dimensions of the characteristics of each traffic used in training the revenue calculation model need to be consistent with the traffic feature dimensions extracted from the traffic received from the advertising exchange server, or the subsequent The traffic feature dimension extracted from the traffic sent by the advertising exchange server includes the characteristics of each traffic used when training the revenue calculation model.
可选地,在本发明实施例中,所述步骤201进一步可以包括:Optionally, in the embodiment of the present invention, the step 201 may further include:
子步骤2011,针对每个流量,以所述流量的流量特征为自变量参数,以所述流量的消费收益数据与实际价值数据的差值,与所述流量所属类别的目标价值数据的比值作为因变量参数。Sub-step 2011, for each flow, take the flow characteristic of the flow as the independent variable parameter, and take the difference between the consumption revenue data and the actual value data of the flow and the target value data of the category to which the flow belongs as the ratio Dependent variable parameters.
在选定了用以训练收益计算模型的流量之后,则可以针对各流量,设置各流量的流量特征为自变量参数,也即收益计算模型的输入,以各流量的消费收益数据与实际价值数据的差值与相应流量所属类别的目标价值数据的比值作为因变量参数,也即收益计算模型的输出。After the traffic used to train the revenue calculation model is selected, the traffic characteristics of each traffic can be set as independent variable parameters for each traffic, that is, the input of the revenue calculation model, and the consumption revenue data and actual value data of each traffic can be used The ratio of the difference of the corresponding flow to the target value data of the category to which the corresponding flow belongs is used as the dependent variable parameter, that is, the output of the revenue calculation model.
子步骤2012,将各自变量参数和因变量参数构建该流量的训练特征向量。In sub-step 2012, the respective variable parameters and dependent variable parameters are used to construct the training feature vector of the flow.
为了方便快速自动完成对收益计算模型的训练过程,在本发明实施例中,还可以将各流量对应的自变量参数和因变量参数构建相应流量的训练特征向量。In order to conveniently and quickly complete the training process of the revenue calculation model, in the embodiment of the present invention, the independent variable parameters and dependent variable parameters corresponding to each flow may also be used to construct a training feature vector of the corresponding flow.
子步骤2013,利用机器训练模型对所述训练特征向量进行训练,获得收益计算模型。Sub-step 2013, using the machine training model to train the training feature vector to obtain a revenue calculation model.
在获取了训练特征向量之后,则可以利用训练特征向量训练选定的机器训练模型,得到收益计算模型。其中的机器训练模型可以为神经网络训练模型、模糊数学训练模型等等任何可用的机器学习模型,在本发明实施例中可以在本步骤之前,或者是本步骤之前的任一步骤之前根据需求选择机器训练模型的具体类型,例如机器训练模型中具体的参数等等,对此本发明实施例不加以限定。After the training feature vector is obtained, the selected machine training model can be trained by using the training feature vector to obtain the revenue calculation model. The machine training model wherein can be any available machine learning model such as neural network training model, fuzzy mathematics training model, can be before this step in the embodiment of the present invention, or before any step before this step according to requirement The specific type of the machine training model, such as specific parameters in the machine training model, etc., are not limited in this embodiment of the present invention.
步骤202,接收广告交易服务器发送的各流量。Step 202, receiving each flow sent by the advertising exchange server.
步骤203,从每个流量中提取流量特征。Step 203, extract traffic features from each traffic.
步骤204,将所述流量的流量特征代入所述收益计算模型,计算得到表示消费收益与预期实际价值的差值与所述流量所属目标价值之间的比值的第一价值数据。Step 204: Substituting the flow characteristics of the flow into the income calculation model to calculate the first value data representing the ratio between the difference between the consumption income and the expected actual value and the target value of the flow.
步骤205,判断所述第一价值数据是否小于第一阈值。Step 205, judging whether the first value data is smaller than a first threshold.
步骤206,当所述第一价值数据小于第一阈值,则判断所述流量所属的类别。Step 206, when the first value data is less than a first threshold, determine the category to which the traffic belongs.
如前述,在本发明实施中,可以针对各个流量确定是否过滤该流量。例如,如果确认某一流量的第一价值数据小于第一阈值,则可以过滤掉该流量。那么此时判断的次数较多,因此在本发明实施例中可以首次对各流量进行分类,在确认某一流量的第一价值数据小于第一阈值之后,则可以推知同一类别的其他流量的第一价值数据可能都小于第一阈值。As mentioned above, in the implementation of the present invention, it may be determined whether to filter the traffic for each traffic. For example, if it is confirmed that the first value data of a certain flow is smaller than the first threshold, the flow may be filtered out. Then, the number of judgments at this time is more, so in the embodiment of the present invention, each traffic can be classified for the first time, and after confirming that the first value data of a certain traffic is less than the first threshold value, the first value data of other traffic of the same category can be deduced. A value of data may be less than the first threshold.
但是,在实际情况中,可能存在同一类别的两个流量,一个流量的第一价值数据小于第一阈值,但是另一个流量的第一价值数据不小于第一阈值。那么对于第一价值数据小于第一阈值的流量,如果将与该流量属于同一类别的全部流量全部过滤掉,那么用户可能一直无法获取此类别的流量,而且利用收益计算模型计算的第一价值数据并不一定全部准确,那么如果全部过滤掉的话,对于其中误判的流量反而会影响相应广告主的收益,因此为了避免上述不利影响,在本发明实施例中,可以设置相应广告主仍然可以收到部分第一价值数据小于第一阈值的流量,而且具体的过滤比例可以和流量所属的类别有关。However, in actual situations, there may be two flows of the same category, the first value data of one flow is less than the first threshold, but the first value data of the other flow is not less than the first threshold. Then, for traffic whose first value data is less than the first threshold, if all traffic belonging to the same category as this traffic is filtered out, users may not be able to obtain traffic of this category, and the first value data calculated by the revenue calculation model Not all of them are necessarily accurate, so if all of them are filtered out, the misjudged traffic will affect the revenue of the corresponding advertiser. Therefore, in order to avoid the above-mentioned adverse effects, in the embodiment of the present invention, it can be set that the corresponding advertiser can still receive Part of the traffic whose first value data is less than the first threshold is detected, and the specific filtering ratio may be related to the category to which the traffic belongs.
在本发明实施例中,可以按照各流量的全部或者是部分流量特征划分流量所属的类别,具体的流量特征与流量所属类别之间的对应关系可以在本步骤之前,或者是本步骤之前的任一步骤之前根据需求进行设定,对此本发明实施例不加以限定。例如,可以按照流量特征中的各流量对应的地域进行划分流量所属的类别,也可以按照流量特征中的各流量对应的广告位尺寸、流量对应的媒体上下文、流量对应的用户信息、流量对应的时间以及流量对应的地域的组合划分流量所属的类别,对此本发明实施例不加以限定。In the embodiment of the present invention, the category to which the traffic belongs can be divided according to all or part of the traffic characteristics of each traffic, and the correspondence between the specific traffic characteristics and the category to which the traffic belongs can be before this step, or any prior to this step. The first step is set according to requirements, which is not limited in this embodiment of the present invention. For example, the category of the traffic can be divided according to the region corresponding to each traffic in the traffic characteristics, or the advertising space size corresponding to each traffic in the traffic characteristics, the media context corresponding to the traffic, the user information corresponding to the traffic, and the The combination of the time and the region corresponding to the traffic divides the category to which the traffic belongs, which is not limited in this embodiment of the present invention.
而且,在本发明实施例中,也可以不按照流量特征划分流量所属的类别,而是按照预设的参数划分流量所属的类别,其中预设的参数可以根据需求在本步骤之前,或者是本步骤之前的任一步骤之前进行设定,对此本发明实施例不加以限定。例如,可以按照各流量的第一价值数据划分流量的类别,那 么可以设定各个类别对应的第一价值数据阈值,然后确定各流量的类别。Moreover, in the embodiment of the present invention, it is also possible not to classify the categories of traffic according to traffic characteristics, but to classify the categories of traffic according to preset parameters, wherein the preset parameters can be before this step according to requirements, or this step The setting is performed before any step before the step, which is not limited in this embodiment of the present invention. For example, the traffic category can be divided according to the first value data of each traffic, then the first value data threshold corresponding to each category can be set, and then the category of each traffic can be determined.
步骤207,按照指定比例对所述类别的流量进行过滤。Step 207, filtering the traffic of the category according to a specified ratio.
在确定了流量的类别之后,即可以按照该类别对应的指定比例对该流量进行过滤。其中,各类别对应的指定比例可以根据需求在本步骤之前,或者是本步骤之前的任一步骤之前根据需求进行设定,对此本发明实施例不加以限定。After the category of the traffic is determined, the traffic can be filtered according to the specified ratio corresponding to the category. Wherein, the designated ratio corresponding to each category may be set according to requirements before this step, or before any step before this step, which is not limited in this embodiment of the present invention.
例如,如果当前流量的第一价值数据小于第一阈值,而且当前流量的类别对应的指定比例为0.7,那么对于此类别的流量,可以在每10个流量中过滤掉7个。For example, if the first value data of the current traffic is less than the first threshold, and the specified ratio corresponding to the category of the current traffic is 0.7, then 7 out of every 10 traffics of this category may be filtered out.
另外,在本发明实施例中,具体过滤流量的顺序逻辑可以根据需求预先设置。例如,对于上述的从每10个同类别的流量中过滤掉掉7个,那么可以设置过滤掉10个同类别的流量中的前7个,或者是过滤掉后7个,或者是其他规则等等。In addition, in the embodiment of the present invention, the sequence logic of specific filtering traffic may be preset according to requirements. For example, to filter out 7 out of every 10 traffic of the same category mentioned above, you can set to filter out the first 7 out of 10 traffic of the same category, or filter out the last 7, or other rules, etc. Wait.
步骤208,将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量。Step 208: Use the remaining traffic after filtering the traffic of the category according to the specified ratio as the sampling traffic.
在本发明实施例中,为了进一步对收益计算模型进行采样纠偏,可以将按照指定比例过滤后剩余的流量作为采样流量。例如上述的每10个流量过滤掉7个后,另外的三个流量会被发送至相应的广告主,那么过滤后剩余的流量即为未过滤掉的3个流量。当然,在本发明实施例中,如果过滤掉的7个流量可以再次找回,那么此时也可以过滤掉的7个流量作为采样流量,对此本发明实施例不加以限定。In the embodiment of the present invention, in order to further perform sampling correction on the revenue calculation model, the remaining traffic after filtering according to a specified ratio may be used as the sampling traffic. For example, after filtering out 7 of the above-mentioned 10 traffic, the other three traffic will be sent to the corresponding advertiser, then the remaining traffic after filtering is the unfiltered 3 traffic. Certainly, in the embodiment of the present invention, if the 7 filtered flows can be retrieved again, then the 7 filtered flows can also be used as sampling flows at this time, which is not limited in this embodiment of the present invention.
步骤209,每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录。Step 209 , acquiring the sampling records of each sampling flow that receives the bidding success notification from the advertising exchange server every specified time period.
我们知道,即使一个再好的模型,都可能会有判断错误的时候,一个好的流量可能被当作不好的过滤掉,而一个不好的流量也可能被诊断成好的流量。比如一个流量本身其实是好的,其效果不好(展示比或点击率低),也许并不是因为流量差,而是一些其他我们暂时没统计到的因素导致。这就需要我们的模型提供一种纠错机制。We know that no matter how good the model is, there may be misjudgments. A good traffic may be filtered out as bad, and a bad traffic may be diagnosed as good traffic. For example, a traffic itself is actually good, but its effect is not good (low display ratio or click-through rate), maybe not because of poor traffic, but because of some other factors that we have not yet counted. This requires our model to provide an error correction mechanism.
考虑到这种现象是上线后才可能出现的,从简单、有效和及时的角度考虑,在流量过滤模型外,还会独立存在黑名单和白名单功能。当好流量因为模型的误判被过滤掉时,需要将其加入白名单中;相反,当差流量因为误判而没被过滤掉时,需要将其加入黑名单。当然,进入黑白名单都要越小越好,每一次加入或移出,都需要强烈的数据支持。且其只能作为一种临时解决方案,根本上还是需要由模型的优化上来解决。Considering that this phenomenon may only appear after going online, from the perspective of simplicity, effectiveness and timeliness, in addition to the traffic filtering model, there will also be independent blacklist and whitelist functions. When good traffic is filtered out due to misjudgment by the model, it needs to be added to the whitelist; on the contrary, when bad traffic is not filtered out due to misjudgment, it needs to be added to the blacklist. Of course, the smaller the blacklist and blacklist, the better. Every entry or exit requires strong data support. And it can only be used as a temporary solution, and it still needs to be solved by the optimization of the model.
DSP的以后的效果评估,会越来越倾向于两个数据:点击数、有效监听量。因此,想要判断流量过滤的好坏,最直接的依据是,在有效监听流量降低的情况下,点击数下降很少。同时,由于减少了对亏损媒体的竞价,平台的利润率也会提高。The future effect evaluation of DSP will be more and more inclined to two data: the number of clicks and the amount of effective monitoring. Therefore, the most direct basis for judging the quality of traffic filtering is that the number of clicks drops very little when the effective monitoring traffic decreases. At the same time, platform margins will improve due to fewer bids on loss-making media.
如果一个媒体的流量因为后期效果不好,被流量过滤以后,设置了指定媒体投放的solution就会受到影响,广告主可能会发现其在这个媒体上的各项数据变少。但这个情况是应该的,我们不能因为广告主的需求就导致我们自己亏钱。而且,一般指定媒体会同时指定多个媒体,少也只是效果不好的量而已。另外,这个差媒体的流量也不是完全被过滤掉,会留下一部分作为exploration(比如5%),如果这个广告主特别想要这个媒体的流量,只要其出价足够,则在后头创意选择的时候自然都能胜出,抢下这部分量已经足够其消耗了。If the traffic of a media is not good in the later stage, after being filtered by the traffic, the solution with the specified media delivery will be affected, and the advertiser may find that the various data on this media are reduced. But this situation should be, we cannot cause ourselves to lose money because of the needs of advertisers. Moreover, generally designated media will designate multiple media at the same time, and less is just an amount that is not effective. In addition, the traffic of this poor media is not completely filtered out, and a part of it will be left as exploration (for example, 5%). If the advertiser particularly wants the traffic of this media, as long as the bid is sufficient, then when the creative selection is made Naturally, they can all win, and grabbing this amount is enough for them to consume.
在确认了采样流量之后,可以对全部的采样流量进行采样纠偏,但是为了减少采样纠偏和正常过滤过程的反复切换,可以设定指定时间周期,在每隔指定时间周期,则获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录。After confirming the sampling traffic, you can perform sampling correction on all the sampling traffic, but in order to reduce the repeated switching between sampling correction and normal filtering process, you can set a specified time period. Sampling record of each sampled traffic notified by the server bid successfully.
步骤210,从所述采样记录中提取每个采样流量的流量特征。Step 210, extracting the flow characteristics of each sampled flow from the sampling records.
在本发明实施例中,为了再次计算各采样流量的第一价值数据,那么相应地还是需要再次提取每个采样流量的流量特征。具体的采集过程以及流量特征与前述的步骤102类似,在此不再赘述。In the embodiment of the present invention, in order to recalculate the first value data of each sampled flow, it is correspondingly necessary to extract the flow feature of each sampled flow again. The specific collection process and traffic characteristics are similar to the aforementioned step 102, and will not be repeated here.
步骤211,利用预设的收益计算模型计算对应所述流量特征的第一价值数据。Step 211, using a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics.
本步骤与前述的步骤103类似,在此本再赘述。This step is similar to the aforementioned step 103 and will be repeated here.
步骤212,判断多个第一价值数据的均值是否小于第一阈值。Step 212, judging whether the mean value of multiple first value data is smaller than a first threshold.
此时为了对过滤流量进行采样纠偏,如果仍然是针对各个单独的流量的第一价值数据进行判断,那么结果可能仍然不够准确。因此在本发明实施例中,为了提高采样纠偏的准确度,可以判断属于同一类别的多个流量的第一价值数据的均值是否小于第一阈值。具体的第一价值数据的个数可以根据需求进行设定,对此本发明实施例不加以限定。或者也可以当前采样记录中属于同一类别的各个采样流量的第一价值数据的均值是否小于第一阈值,对此本发明实施例也不加以限定。At this time, in order to sample and correct the filtered traffic, if the judgment is still made on the first value data of each individual traffic, the result may still not be accurate enough. Therefore, in the embodiment of the present invention, in order to improve the accuracy of sampling deviation correction, it may be determined whether the average value of the first value data of multiple flows belonging to the same category is smaller than the first threshold. The number of specific first value data can be set according to requirements, which is not limited in this embodiment of the present invention. Alternatively, whether the average value of the first value data of each sampled flow belonging to the same category in the current sampled record is smaller than the first threshold is not limited by this embodiment of the present invention.
步骤213,如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。Step 213, if the average value of the plurality of first value data is not less than the first threshold, stop filtering the traffic of the category.
如前述,如果某一类别的流量的第一价值数据小于第一阈值,则说明此时该流量给相应的广告主带来的收益较小,该广告主参与该流量竞价的可能性很小。那么在纠偏的过程中,如果上述同一类别的多个流量的第一价值数据的均值不小于第一阈值,则相应可以停止对该类别的流量的过滤,也即如果上述同一类别的多个第一价值数据的均值不小于第一阈值,则相应可以停止对该类别的流量的过滤。As mentioned above, if the first value data of a certain type of traffic is less than the first threshold, it means that the traffic brings relatively little revenue to the corresponding advertiser at this time, and the possibility of the advertiser participating in the traffic bidding is very low. Then in the process of deviation correction, if the average value of the first value data of the above-mentioned multiple flows of the same category is not less than the first threshold, then the filtering of the flow of this category can be stopped correspondingly, that is, if the above-mentioned multiple first value data of the same category If the average value of the value data is not less than the first threshold, then the filtering of traffic of this category may be stopped correspondingly.
例如,假设对于属于类别1的流量A,利用收益计算模型计算得到的流量A的第一价值数据为4,而对于当前广告主,设定的第一阈值为5,那么对于当前广告主而言,会过滤掉流量A,并且对类别1包含的各流量按照指定比例进行过滤。假设当前获取的采样记录中包含属于类别1的流量B和流量C,而且利用收益计算模型再次计算得到流量B的第一价值数据为5,流量C的第一价值数据为6,那么可以得到对应类别的三个第一价值数据的均值为5,其不小于第一阈值,那么此时可以停止继续过滤该类别的流量,而是将其发送至当前广告主。For example, assuming that for traffic A belonging to category 1, the first value data of traffic A calculated by using the revenue calculation model is 4, and for the current advertiser, the set first threshold value is 5, then for the current advertiser , traffic A will be filtered out, and all traffic contained in category 1 will be filtered according to the specified ratio. Assuming that the currently acquired sampling records include traffic B and traffic C belonging to category 1, and the first value data of traffic B is calculated again using the revenue calculation model, and the first value data of traffic C is 6, then the corresponding If the average value of the three first value data of the category is 5, which is not less than the first threshold, then at this time, it is possible to stop filtering the traffic of this category and send it to the current advertiser.
在本发明实施例中,可以接收广告交易服务器发送的各流量;从每个流量中提取流量特征;利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史 流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;判断所述第一价值数据是否小于第一阈值;当所述第一价值数据小于第一阈值,则将所述流量进行过滤。由此取得了使广告交易服务器在将流量发送至各DSP前对流量进行过滤,从而减少DSP系统资源的浪费,提高性价比的有益效果。In the embodiment of the present invention, it is possible to receive each traffic sent by the advertising exchange server; extract the traffic characteristics from each traffic; use the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation The model is trained based on the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records, as well as the corresponding actual value data and consumption revenue data; the traffic in the historical traffic records is the traffic that received the successful bidding notification from the advertising exchange server ; Judging whether the first value data is less than a first threshold; when the first value data is less than the first threshold, filter the traffic. In this way, the advertising transaction server can filter the traffic before sending it to each DSP, thereby reducing the waste of DSP system resources and improving the cost performance.
而且,在本发明实施例中,在流量过滤时,还可以根据流量的类别,按照指定比例对相应类别的流量进行过滤。从而可以进一步减少DSP系统资源的浪费。Moreover, in the embodiment of the present invention, when filtering the traffic, the traffic of the corresponding category may also be filtered according to a specified ratio according to the category of the traffic. Thus, the waste of DSP system resources can be further reduced.
另外,在本发明实施例中,在按照指定比例对相应类别的流量进行过滤的步骤之后,还可以将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量;每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录;从所述采样记录中提取每个采样流量的流量特征;利用预设的收益计算模型计算对应所述流量特征的第一价值数据;判断多个第一价值数据的均值是否小于第一阈值;如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。从而可以对过滤流量进行采样纠偏,避免流量的误过滤,保证各DSP可以接收到其中广告主所需竞价的流量。In addition, in the embodiment of the present invention, after the step of filtering the traffic of the corresponding category according to the specified ratio, the remaining traffic after filtering the traffic of the category according to the specified ratio can also be used as the sampling traffic; every specified time Periodically, obtain the sampling record of each sampled traffic that has received the bidding success notification from the advertising exchange server; extract the traffic characteristics of each sampled traffic from the sampling records; use the preset revenue calculation model to calculate the first value data; judging whether the average value of the multiple first value data is less than the first threshold; if the average value of the multiple first value data is not less than the first threshold, stop filtering the traffic of the category. Therefore, it is possible to sample and correct the filtered traffic, avoid false filtering of traffic, and ensure that each DSP can receive the traffic required by the advertiser.
需要说明的是,上述中与实施例一类似步骤可以参照实施例一,在此不再赘述。It should be noted that, for the steps similar to those in the first embodiment above, reference may be made to the first embodiment, which will not be repeated here.
对于方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明实施例并不受所描述的动作顺序的限制,因为依据本发明实施例,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作并不一定是本发明实施例所必须的。For the method embodiment, for the sake of simple description, it is expressed as a series of action combinations, but those skilled in the art should know that the embodiment of the present invention is not limited by the described action order, because according to the embodiment of the present invention , certain steps may be performed in other order or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification belong to preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present invention.
实施例三Embodiment three
详细介绍本发明实施例提供的一种流量过滤装置。A traffic filtering device provided by an embodiment of the present invention is introduced in detail.
参照图3,示出了本发明实施例中一种流量过滤装置的结构示意图。Referring to FIG. 3 , it shows a schematic structural diagram of a flow filtering device in an embodiment of the present invention.
流量接收模块301,用于接收广告交易服务器发送的各流量。The flow receiving module 301 is configured to receive each flow sent by the advertising transaction server.
第一流量特征获取模块302,用于从每个流量中提取流量特征。The first traffic feature acquisition module 302 is configured to extract traffic features from each traffic.
第一价值数据获取模块303,用于利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量。The first value data acquisition module 303 is configured to use a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; The characteristics of the traffic and the corresponding actual value data and consumption revenue data are obtained through training; each traffic in the historical traffic record is the traffic that received the successful bidding notification from the advertising exchange server.
第一判断模块304,用于判断所述第一价值数据是否小于第一阈值。The first judging module 304 is configured to judge whether the first value data is smaller than a first threshold.
过滤模块305,用于当所述第一价值数据小于第一阈值,则将所述流量进行过滤。A filtering module 305, configured to filter the traffic when the first value data is less than a first threshold.
在本发明实施例中,可以接收广告交易服务器发送的各流量;从每个流量中提取流量特征;利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;判断所述第一价值数据是否小于第一阈值;当所述第一价值数据小于第一阈值,则将所述流量进行过滤。由此取得了使广告交易服务器在将流量发送至各DSP前对流量进行过滤,从而减少DSP系统资源的浪费,提高性价比的有益效果。In the embodiment of the present invention, it is possible to receive each traffic sent by the advertising exchange server; extract the traffic characteristics from each traffic; use the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation The model is trained based on the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records, as well as the corresponding actual value data and consumption revenue data; the traffic in the historical traffic records is the traffic that received the successful bidding notification from the advertising exchange server ; Judging whether the first value data is less than a first threshold; when the first value data is less than the first threshold, filter the traffic. In this way, the advertising transaction server can filter the traffic before sending it to each DSP, thereby reducing the waste of DSP system resources and improving the cost performance.
实施例四Embodiment four
详细介绍本发明实施例提供的一种流量过滤装置。A traffic filtering device provided by an embodiment of the present invention is introduced in detail.
参照图4,示出了本发明实施例中一种流量过滤装置的结构示意图。Referring to FIG. 4 , it shows a schematic structural diagram of a flow filtering device in an embodiment of the present invention.
收益计算模型训练模块401,用于根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。The revenue calculation model training module 401 is used to train the revenue calculation model according to the target value data of each category of traffic, the characteristics of each traffic in the historical traffic records, and the corresponding actual value data and consumption revenue data.
可选地,在本发明实施例中,所述收益计算模型训练模块401,包括:Optionally, in the embodiment of the present invention, the income calculation model training module 401 includes:
参数确认子模块4011,用于针对每个流量,以所述流量的流量特征为自变量参数,以所述流量的消费收益数据与实际价值数据的差值,与所述流 量所属类别的目标价值数据的比值作为因变量参数。The parameter confirmation sub-module 4011 is used to, for each flow, take the flow characteristic of the flow as an independent variable parameter, take the difference between the consumption revenue data and the actual value data of the flow, and the target value of the category to which the flow belongs The ratio of the data is used as the dependent variable parameter.
训练特征向量构建子模4012,用于将各自变量参数和因变量参数构建该流量的训练特征向量。The training feature vector construction submodule 4012 is used to construct the training feature vector of the traffic by using the respective variable parameters and dependent variable parameters.
收益计算模型训练子模块4013,用于利用机器训练模型对所述训练特征向量进行训练,获得收益计算模型。The revenue calculation model training sub-module 4013 is configured to use the machine training model to train the training feature vector to obtain a revenue calculation model.
流量接收模块402,用于接收广告交易服务器发送的各流量。The flow receiving module 402 is configured to receive each flow sent by the advertising exchange server.
第一流量特征获取模块403,用于从每个流量中提取流量特征。The first traffic feature acquisition module 403 is configured to extract traffic features from each traffic.
第一价值数据获取模块404,用于利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量。The first value data acquisition module 404 is configured to use a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; The characteristics of the traffic and the corresponding actual value data and consumption revenue data are obtained through training; each traffic in the historical traffic record is the traffic that received the successful bidding notification from the advertising exchange server.
可选地,在本发明实施例中,所述第一价值数据获取模块404,包括:Optionally, in this embodiment of the present invention, the first value data acquisition module 404 includes:
第一价值数据计算子模块4041,用于将所述流量的流量特征代入所述收益计算模型,计算得到表示消费收益与预期实际价值的差值与所述流量所属目标价值之间的比值的第一价值数据。The first value data calculation sub-module 4041 is used to substitute the flow characteristics of the flow into the income calculation model, and calculate the first value representing the ratio between the difference between the consumption income and the expected actual value and the target value of the flow. A value data.
第一判断模块405,用于判断所述第一价值数据是否小于第一阈值。The first judging module 405 is configured to judge whether the first value data is smaller than a first threshold.
过滤模块406,用于当所述第一价值数据小于第一阈值,则将所述流量进行过滤。A filtering module 406, configured to filter the traffic when the first value data is less than a first threshold.
可选地,在本发明实施例中,所述过滤模块406,包括:Optionally, in the embodiment of the present invention, the filtering module 406 includes:
流量类别判断子模块4061,用于判断所述流量所属的类别。The traffic category judging sub-module 4061 is configured to judge the category to which the traffic belongs.
过滤子模块4062,用于按照指定比例对所述类别的流量进行过滤。The filtering submodule 4062 is configured to filter the traffic of the category according to a specified ratio.
采样流量获取模块407,用于将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量。The sampling traffic acquisition module 407 is configured to use the remaining traffic after filtering the traffic of the category according to a specified ratio as the sampling traffic.
采样记录获取模块408,用于每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录。The sampling record obtaining module 408 is configured to obtain the sampling records of each sampled flow that receives the bidding success notification from the advertising transaction server every specified time period.
第二流量特征获取模块409,用于从所述采样记录中提取每个采样流量的流量特征。The second flow feature acquisition module 409 is configured to extract the flow feature of each sampled flow from the sampling record.
采样第一价值数据获取模块410,用于利用预设的收益计算模型计算对应所述流量特征的第一价值数据。The sampling first value data acquisition module 410 is configured to calculate the first value data corresponding to the traffic characteristics by using a preset revenue calculation model.
第二判断模块411,用于判断多个第一价值数据的均值是否小于第一阈值。The second judging module 411 is configured to judge whether the average value of the plurality of first value data is smaller than the first threshold.
过滤停止模块412,用于如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。The filtering stop module 412 is configured to stop filtering the traffic of the category if the average value of the plurality of first value data is not less than the first threshold.
在本发明实施例中,可以接收广告交易服务器发送的各流量;从每个流量中提取流量特征;利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;判断所述第一价值数据是否小于第一阈值;当所述第一价值数据小于第一阈值,则将所述流量进行过滤。由此取得了使广告交易服务器在将流量发送至各DSP前对流量进行过滤,从而减少DSP系统资源的浪费,提高性价比的有益效果。In the embodiment of the present invention, it is possible to receive each traffic sent by the advertising exchange server; extract the traffic characteristics from each traffic; use the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation The model is trained based on the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records, as well as the corresponding actual value data and consumption revenue data; the traffic in the historical traffic records is the traffic that received the successful bidding notification from the advertising exchange server ; Judging whether the first value data is less than a first threshold; when the first value data is less than the first threshold, filter the traffic. In this way, the advertising exchange server can filter the traffic before sending it to each DSP, thereby reducing the waste of DSP system resources and improving the cost performance.
而且,在本发明实施例中,在流量过滤时,还可以根据流量的类别,按照指定比例对相应类别的流量进行过滤。从而可以进一步减少DSP系统资源的浪费。Moreover, in the embodiment of the present invention, when filtering the traffic, the traffic of the corresponding category may also be filtered according to a specified ratio according to the category of the traffic. Thus, the waste of DSP system resources can be further reduced.
另外,在本发明实施例中,在按照指定比例对相应类别的流量进行过滤的步骤之后,还可以将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量;每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录;从所述采样记录中提取每个采样流量的流量特征;利用预设的收益计算模型计算对应所述流量特征的第一价值数据;判断多个第一价值数据的均值是否小于第一阈值;如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。从而可以对过滤流量进行采样纠偏,避免流量的误过滤,保证各DSP可以接收到其中广告主所需竞价的流量。In addition, in the embodiment of the present invention, after the step of filtering the traffic of the corresponding category according to the specified ratio, the remaining traffic after filtering the traffic of the category according to the specified ratio can also be used as the sampling traffic; every specified time Periodically, obtain the sampling record of each sampled traffic that has received the bidding success notification from the advertising exchange server; extract the traffic characteristics of each sampled traffic from the sampling records; use the preset revenue calculation model to calculate the first value data; judging whether the average value of the multiple first value data is less than the first threshold; if the average value of the multiple first value data is not less than the first threshold, stop filtering the traffic of the category. Therefore, it is possible to sample and correct the filtered traffic, avoid false filtering of traffic, and ensure that each DSP can receive the traffic required by the advertiser.
对于装置实施例而言,由于其与方法实施例基本相似,所以描述的比较 简单,相关之处参见方法实施例的部分说明即可。As for the device embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and for related parts, please refer to the part of the description of the method embodiment.
在此提供的算法和显示不与任何特定计算机、虚拟系统或者其它设备固有相关。各种通用系统也可以与基于在此的示教一起使用。根据上面的描述,构造这类系统所要求的结构是显而易见的。此外,本发明也不针对任何特定编程语言。应当明白,可以利用各种编程语言实现在此描述的本发明的内容,并且上面对特定语言所做的描述是为了披露本发明的最佳实施方式。The algorithms and displays presented herein are not inherently related to any particular computer, virtual system, or other device. Various generic systems can also be used with the teachings based on this. The structure required to construct such a system is apparent from the above description. Furthermore, the present invention is not specific to any particular programming language. It should be understood that various programming languages can be used to implement the content of the present invention described herein, and the above description of specific languages is for disclosing the best mode of the present invention.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本发明的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。In the description provided herein, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure the understanding of this description.
类似地,应当理解,为了精简本公开并帮助理解各个发明方面中的一个或多个,在上面对本发明的示例性实施例的描述中,本发明的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释成反映如下意图:即所要求保护的本发明要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如下面的权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施方式的权利要求书由此明确地并入该具体实施方式,其中每个权利要求本身都作为本发明的单独实施例。Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, in order to streamline this disclosure and to facilitate an understanding of one or more of the various inventive aspects, various features of the invention are sometimes grouped together in a single embodiment, figure, or its description. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the Detailed Description are hereby expressly incorporated into this Detailed Description, with each claim standing on its own as a separate embodiment of this invention.
本领域那些技术人员可以理解,可以对实施例中的设备中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个设备中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。除了这样的特征和/或过程或者单元中的至少一些是相互排斥之外,可以采用任何组合对本说明书(包括伴随的权利要求、摘要和附图)中公开的所有特征以及如此公开的任何方法或者设备的所有过程或单元进行组合。除非另外明确陈述,本说明书(包括伴随的权利要求、摘要和附图)中公开的每个特征可以由提供相同、等同或相似目的的替代特征来代替。Those skilled in the art can understand that the modules in the device in the embodiment can be adaptively changed and arranged in one or more devices different from the embodiment. Modules or units or components in the embodiments may be combined into one module or unit or component, and furthermore may be divided into a plurality of sub-modules or sub-units or sub-assemblies. All features disclosed in this specification (including accompanying claims, abstract and drawings) and any method or method so disclosed may be used in any combination, except that at least some of such features and/or processes or units are mutually exclusive. All processes or units of equipment are combined. Each feature disclosed in this specification (including accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
此外,本领域的技术人员能够理解,尽管在此所述的一些实施例包括其它实施例中所包括的某些特征而不是其它特征,但是不同实施例的特征的组 合意味着处于本发明的范围之内并且形成不同的实施例。例如,在下面的权利要求书中,所要求保护的实施例的任意之一都可以以任意的组合方式来使用。Furthermore, those skilled in the art will understand that although some embodiments described herein include some features included in other embodiments but not others, combinations of features from different embodiments are meant to be within the scope of the invention. and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
本发明的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器(DSP)来实现根据本发明实施例的流量过滤设备中的一些或者全部部件的一些或者全部功能。本发明还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本发明的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。The various component embodiments of the present invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art should understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all functions of some or all components in the traffic filtering device according to the embodiment of the present invention. The present invention can also be implemented as an apparatus or an apparatus program (for example, a computer program and a computer program product) for performing a part or all of the methods described herein. Such a program for realizing the present invention may be stored on a computer-readable medium, or may be in the form of one or more signals. Such a signal may be downloaded from an Internet site, or provided on a carrier signal, or provided in any other form.
应该注意的是上述实施例对本发明进行说明而不是对本发明进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本发明可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The use of the words first, second, and third, etc. does not indicate any order. These words can be interpreted as names.
本申请公开了A1、一种流量过滤方法,包括:This application discloses A1, a traffic filtering method, comprising:
接收广告交易服务器发送的各流量;Receive all traffic sent by the advertising exchange server;
从每个流量中提取流量特征;Extract flow features from each flow;
利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;Using the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics; the revenue calculation model is based on the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records and the corresponding actual value data and Obtained through consumption revenue data training; each traffic in the historical traffic records is the traffic that received the successful bidding notification from the advertising exchange server;
判断所述第一价值数据是否小于第一阈值;judging whether the first value data is smaller than a first threshold;
当所述第一价值数据小于第一阈值,则将所述流量进行过滤。When the first value data is less than a first threshold, the traffic is filtered.
A2、如A1所述的方法,所述将所述流量进行过滤的步骤,包括:A2. The method as described in A1, the step of filtering the traffic includes:
判断所述流量所属的类别;determine the category to which the traffic belongs;
按照指定比例对所述类别的流量进行过滤。Filter the traffic of the category according to the specified ratio.
A3、如A1所述的方法,在按照指定比例对所述类别的流量进行过滤的步骤之后,还包括:A3. The method as described in A1, after the step of filtering the traffic of the category according to a specified ratio, further includes:
将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量;Use the remaining traffic after filtering the traffic of the category according to the specified ratio as the sampling traffic;
每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录;Obtain the sampling records of each sampling traffic that receives the successful bidding notification from the advertising exchange server at a specified time period;
从所述采样记录中提取每个采样流量的流量特征;extracting flow characteristics of each sampled flow from the sampling records;
利用预设的收益计算模型计算对应所述流量特征的第一价值数据;Using a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics;
判断多个第一价值数据的均值是否小于第一阈值;judging whether the mean value of multiple first value data is less than a first threshold;
如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。If the mean value of the plurality of first value data is not less than the first threshold, stop filtering the traffic of the category.
A4、如A1所述的方法,所述接收广告交易服务器发送的各流量的步骤之前,还包括:A4. The method as described in A1, before the step of receiving the traffic sent by the advertising exchange server, it also includes:
根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。According to the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records, as well as the corresponding actual value data and consumption revenue data, the revenue calculation model is trained.
A5、如A4所述的方法,所述根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型的步骤,包括:A5. The method as described in A4, the step of training the revenue calculation model according to the target value data of each category of traffic and the characteristics of each traffic in the historical traffic records and the corresponding actual value data and consumption revenue data, including:
针对每个流量,以所述流量的流量特征为自变量参数,以所述流量的消费收益数据与实际价值数据的差值,与所述流量所属类别的目标价值数据的比值作为因变量参数;For each flow, the flow characteristic of the flow is used as an independent variable parameter, and the difference between the consumption income data of the flow and the actual value data, and the ratio of the target value data of the category to which the flow belongs are used as the dependent variable parameter;
将各自变量参数和因变量参数构建该流量的训练特征向量;Construct the training feature vector of the traffic with the respective variable parameters and dependent variable parameters;
利用机器训练模型对所述训练特征向量进行训练,获得收益计算模型。A machine training model is used to train the training feature vector to obtain a revenue calculation model.
A6、如A5所述的方法,所述利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据的步骤,包括:A6. The method as described in A5, the step of using the preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics includes:
将所述流量的流量特征代入所述收益计算模型,计算得到表示消费收益与预期实际价值的差值与所述流量所属目标价值之间的比值的第一价值数据。Substituting the flow characteristics of the flow into the income calculation model to calculate the first value data representing the ratio between the difference between the consumption income and the expected actual value and the target value of the flow.
本申请还公开了B7、一种流量过滤装置,包括:The application also discloses B7, a flow filtering device, comprising:
流量接收模块,用于接收广告交易服务器发送的各流量;The flow receiving module is used to receive each flow sent by the advertising exchange server;
第一流量特征获取模块,用于从每个流量中提取流量特征;A first traffic feature acquisition module, configured to extract traffic features from each traffic;
第一价值数据获取模块,用于利用预设的收益计算模型计算对应所述流量特征对应的第一价值数据;所述收益计算模型根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练获得;历史流量记录中的各流量为接收到广告交易服务器竞价成功通知的流量;The first value data acquisition module is used to calculate the first value data corresponding to the traffic characteristics by using a preset revenue calculation model; the revenue calculation model is based on the target value data of each category of traffic and each traffic in the historical traffic record The characteristics and the corresponding actual value data and consumption revenue data are obtained through training; each flow in the historical flow record is the flow that received the successful bidding notification from the advertising exchange server;
第一判断模块,用于判断所述第一价值数据是否小于第一阈值;A first judging module, configured to judge whether the first value data is smaller than a first threshold;
过滤模块,用于当所述第一价值数据小于第一阈值,则将所述流量进行过滤。A filtering module, configured to filter the traffic when the first value data is less than a first threshold.
B8、如B7所述的装置,所述过滤模块,包括:B8, the device as described in B7, the filter module, comprising:
流量类别判断子模块,用于判断所述流量所属的类别;A traffic category judging submodule, configured to judge the category to which the traffic belongs;
过滤子模块,用于按照指定比例对所述类别的流量进行过滤。The filtering sub-module is configured to filter the traffic of the category according to a specified ratio.
B9、如B7所述的装置,在所述过滤模块之后,还包括:B9, the device as described in B7, after the filter module, also includes:
采样流量获取模块,用于将按照指定比例对所述类别的流量进行过滤后剩余的流量作为采样流量;A sampling traffic acquisition module, configured to use the remaining traffic after filtering the traffic of the category according to a specified ratio as the sampling traffic;
采样记录获取模块,用于每隔指定时间周期,获取接收到广告交易服务器竞价成功通知的各采样流量的采样记录;The sampling record acquisition module is used to obtain the sampling records of each sampling flow received from the advertising exchange server for bidding success notification every specified time period;
第二流量特征获取模块,用于从所述采样记录中提取每个采样流量的流量特征;A second traffic feature acquisition module, configured to extract the traffic feature of each sampled flow from the sampling record;
采样第一价值数据获取模块,用于利用预设的收益计算模型计算对应所述流量特征的第一价值数据;Sampling the first value data acquisition module, configured to use a preset revenue calculation model to calculate the first value data corresponding to the traffic characteristics;
第二判断模块,用于判断多个第一价值数据的均值是否小于第一阈值;The second judging module is used to judge whether the mean value of multiple first value data is smaller than the first threshold;
过滤停止模块,用于如果多个第一价值数据的均值不小于第一阈值,则停止对所述类别的流量的过滤。A filtering stop module, configured to stop filtering the traffic of the category if the average value of the plurality of first value data is not less than the first threshold.
B10、如B7所述的装置,所述流量接收模块之前,还包括:B10, the device as described in B7, before the flow receiving module, also includes:
收益计算模型训练模块,用于根据各类别的流量的目标价值数据和历史流量记录中各流量的特征及相应的实际价值数据和消费收益数据训练收益计算模型。The revenue calculation model training module is used to train the revenue calculation model according to the target value data of each category of traffic, the characteristics of each traffic in the historical traffic records, and the corresponding actual value data and consumption revenue data.
B11、如B10所述的装置,所述收益计算模型训练模块,包括:B11, the device as described in B10, the income calculation model training module, including:
参数确认子模块,用于针对每个流量,以所述流量的流量特征为自变量参数,以所述流量的消费收益数据与实际价值数据的差值,与所述流量所属类别的目标价值数据的比值作为因变量参数;The parameter confirmation sub-module is used for each flow, taking the flow characteristic of the flow as an independent variable parameter, taking the difference between the consumption income data of the flow and the actual value data, and the target value data of the category to which the flow belongs The ratio of is used as the dependent variable parameter;
训练特征向量构建子模块,用于将各自变量参数和因变量参数构建该流量的训练特征向量;The training feature vector construction sub-module is used to construct the training feature vector of the traffic with the respective variable parameters and dependent variable parameters;
收益计算模型训练子模块,用于利用机器训练模型对所述训练特征向量进行训练,获得收益计算模型。The income calculation model training sub-module is used to use the machine training model to train the training feature vector to obtain the income calculation model.
B12、如B11所述的装置,第一价值数据获取模块,包括:B12. The device as described in B11, the first value data acquisition module, including:
第一价值数据计算子模块,用于将所述流量的流量特征代入所述收益计算模型,计算得到表示消费收益与预期实际价值的差值与所述流量所属目标价值之间的比值的第一价值数据。The first value data calculation sub-module is used to substitute the flow characteristics of the flow into the income calculation model, and calculate the first value representing the ratio between the difference between the consumption income and the expected actual value and the target value of the flow. value data.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611239365.5A CN108256881A (en) | 2016-12-28 | 2016-12-28 | A kind of traffic filtering method and apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611239365.5A CN108256881A (en) | 2016-12-28 | 2016-12-28 | A kind of traffic filtering method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108256881A true CN108256881A (en) | 2018-07-06 |
Family
ID=62720303
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611239365.5A Pending CN108256881A (en) | 2016-12-28 | 2016-12-28 | A kind of traffic filtering method and apparatus |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108256881A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113592547A (en) * | 2021-07-29 | 2021-11-02 | 上海苍苔信息技术有限公司 | Channel delivery amount excitation method and system |
CN115623067A (en) * | 2022-10-08 | 2023-01-17 | 上海兴容信息技术有限公司 | A traffic transfer method, transfer device, device and storage medium |
CN117040777A (en) * | 2023-06-13 | 2023-11-10 | 五矿国际信托有限公司 | Method for detecting and filtering malicious request behaviors of data interface |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130066726A1 (en) * | 2011-09-09 | 2013-03-14 | Dennoo Inc. | Methods and systems for bidding and displaying advertisements utilizing various cost models |
US20140278981A1 (en) * | 2011-10-06 | 2014-09-18 | Gueorgui Mersov | Automated allocation of media via network |
CN105046532A (en) * | 2015-08-07 | 2015-11-11 | 北京品友互动信息技术有限公司 | Bidding method and device |
CN105205696A (en) * | 2015-09-11 | 2015-12-30 | 晶赞广告(上海)有限公司 | Internet advertisement real-time bidding multi-advertiser multi-factor flow distribution method |
CN105574741A (en) * | 2015-12-22 | 2016-05-11 | 北京奇虎科技有限公司 | Flow value assessment method, device, application method and system |
CN106097025A (en) * | 2016-08-24 | 2016-11-09 | 苏州新科兰德科技有限公司 | A kind of advertisement placement method based on DSP and device |
-
2016
- 2016-12-28 CN CN201611239365.5A patent/CN108256881A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130066726A1 (en) * | 2011-09-09 | 2013-03-14 | Dennoo Inc. | Methods and systems for bidding and displaying advertisements utilizing various cost models |
US20140278981A1 (en) * | 2011-10-06 | 2014-09-18 | Gueorgui Mersov | Automated allocation of media via network |
CN105046532A (en) * | 2015-08-07 | 2015-11-11 | 北京品友互动信息技术有限公司 | Bidding method and device |
CN105205696A (en) * | 2015-09-11 | 2015-12-30 | 晶赞广告(上海)有限公司 | Internet advertisement real-time bidding multi-advertiser multi-factor flow distribution method |
CN105574741A (en) * | 2015-12-22 | 2016-05-11 | 北京奇虎科技有限公司 | Flow value assessment method, device, application method and system |
CN106097025A (en) * | 2016-08-24 | 2016-11-09 | 苏州新科兰德科技有限公司 | A kind of advertisement placement method based on DSP and device |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113592547A (en) * | 2021-07-29 | 2021-11-02 | 上海苍苔信息技术有限公司 | Channel delivery amount excitation method and system |
CN115623067A (en) * | 2022-10-08 | 2023-01-17 | 上海兴容信息技术有限公司 | A traffic transfer method, transfer device, device and storage medium |
CN117040777A (en) * | 2023-06-13 | 2023-11-10 | 五矿国际信托有限公司 | Method for detecting and filtering malicious request behaviors of data interface |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11195209B2 (en) | Method, medium, and system for keyword bidding in a market cooperative | |
TWI603273B (en) | Method and device for placing information search | |
WO2020248508A1 (en) | Method, device and apparatus for evaluating advertising channel, and storage medium | |
US8719082B1 (en) | Automatic bid adjustments for electronic advertising | |
US10134053B2 (en) | User engagement-based contextually-dependent automated pricing for non-guaranteed delivery | |
US10546305B2 (en) | Pay-per-action system for selling advertisements | |
US11216850B2 (en) | Predictive platform for determining incremental lift | |
US20150339704A1 (en) | Valuing online content served to an online publisher | |
US20030216930A1 (en) | Cost-per-action search engine system, method and apparatus | |
US20100198695A1 (en) | Advertisement Slot Allocation | |
US20100228622A1 (en) | Messaging Interface for Advertisement Submission | |
US20150278877A1 (en) | User Engagement-Based Contextually-Dependent Automated Reserve Price for Non-Guaranteed Delivery Advertising Auction | |
WO2017128933A1 (en) | Processing method, device, and system for work | |
US20150178790A1 (en) | User Engagement-Based Dynamic Reserve Price for Non-Guaranteed Delivery Advertising Auction | |
CN101379522A (en) | Ad publisher performance and mitigation of click fraud | |
US20110166942A1 (en) | Contract auctions for sponsored search | |
CN102741873A (en) | Facilitating advertisement selection using advancement bids | |
WO2014120869A1 (en) | Content sequencing | |
US11869043B2 (en) | Adaptive bidding for networked advertising | |
US11966947B1 (en) | System and methods for using a revenue value index to score impressions for users for advertisement placement | |
US20130159094A1 (en) | Real time ad targeting | |
CN108256881A (en) | A kind of traffic filtering method and apparatus | |
US10650403B2 (en) | Distributing online ads by targeting online ad requests | |
US8589234B1 (en) | Companion ad auctions | |
US20110184802A1 (en) | Auction format selection using historical data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180706 |
|
RJ01 | Rejection of invention patent application after publication |