[go: up one dir, main page]

skip to main content
10.1145/3459637.3481958acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Unbiased Filtering of Accidental Clicks in Verizon Media Native Advertising

Published: 30 October 2021 Publication History

Abstract

Verizon Media (VZM) native advertising is one of VZM largest and fastest growing businesses, reaching a run-rate of several hundred million USDs in the past year. Driving the VZM native models that are used to predict event probabilities, such as click and conversion probabilities, is OFFSET - a feature enhanced collaborative-filtering based event-prediction algorithm. In this work we focus on the challenge of predicting click-through rates (CTR) when we are aware that some of the clicks have short dwell-time and are defined as accidental clicks. An accidental click implies little affinity between the user and the ad, so predicting that similar users will click on the ad is inaccurate. Therefore, it may be beneficial to remove clicks with dwell-time lower than a predefined threshold from the training set. However, we cannot ignore these positive events, as filtering these will cause the model to under predict. Previous approaches have tried to apply filtering and then adding corrective biases to the CTR predictions, but did not yield revenue lifts and therefore were not adopted. In this work, we present a new approach where the positive weight of the accidental clicks is distributed among all of the negative events (skips), based on their likelihood of causing accidental clicks, as predicted by an auxiliary model. These likelihoods are taken as the correct labels of the negative events, shifting our training from using only binary labels and adopting a binary cross-entropy loss function in our training process. After showing offline performance improvements, the modified model was tested online serving VZM native users, and provided 1.18% revenue lift over the production model which is agnostic to accidental clicks.

Supplementary Material

MP4 File (CIKM2.mp4)
In this work we focus on the challenge of predicting click-through rates when we are aware that some of the clicks have short dwell-time and are defined as accidental clicks. An accidental click implies little affinity between the user and the ad, so predicting that similar users will click on the ad is inaccurate. Therefore, it may be beneficial to remove clicks with dwell-time lower than a predefined threshold from the training set. However, we cannot ignore these positive events, as filtering these will cause the model to under predict. In this work, we present a new approach where the positive weight of the accidental clicks is distributed among all of the negative events (skips), based on their likelihood of causing accidental clicks, as predicted by an auxiliary model. These likelihoods are taken as the correct labels of the negative events, shifting our training from using only binary labels and adopting a binary cross-entropy loss function in our training process.

References

[1]
Michal Aharon, Natalie Aizenberg, Edward Bortnikov, Ronny Lempel, Roi Adadi, Tomer Benyamini, Liron Levin, Ran Roth, and Ohad Serfaty. Off-set: one-pass factorization of feature sets for online recommendation in persistent cold start settings. In Proc. of RecSys'2013, pages 375--378, 2013.
[2]
Michal Aharon, Amit Kagian, and Oren Somekh. Adaptive online hyper-parameters tuning for ad event-prediction models. In Proc. of WWW'2017, pages 672--679, 2017.
[3]
Michal Aharon, Yohay Kaplan, Rina Levy, Oren Somekh, Ayelet Blanc, Neetai Eshel, Avi Shahar, Assaf Singer, and Alex Zlotnik. Soft frequency capping for improved ad click prediction in yahoo gemini native. In Proc. of CIKM'2019, pages 2793--2801, 2019.
[4]
Michal Aharon, Oren Somekh, Avi Shahar, Assaf Singer, Baruch Trayvas, Hadas Vogel, and Dobri Dobrev. Carousel ads optimization in yahoo gemini native. In Proc. of KDD'2019, pages 1993--2001, 2019.
[5]
Natalie Aizenberg, Yehuda Koren, and Oren Somekh. Build your own music recommender by modeling internet radio streams. In Proc. of WWW'2012, pages 1--10, 2012.
[6]
Morelle Arian, Eliran Abutbul, Michal Aharon, Yair Koren, Oren Somekh, and Rotem Stram. Feature enhancement via user similarities networks for improved click prediction in yahoo gemini native. In Proc. of CIKM'2019, pages 2557--2565, 2019.
[7]
A. Banerjee, Xin Guo, and Hui Wang. On the optimality of conditional expectation as a bregman predictor. IEEE Transactions on Information Theory, 51(7):2664--2669, 2005.
[8]
Robert M Bell and Yehuda Koren. Lessons from the netflix prize challenge. ACM SIGKDD Explorations Newsletter, 9(2):75--79, 2007.
[9]
L.M. Bregman. The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming. USSR Computational Mathematics and Mathematical Physics, 7(3):200--217, 1967.
[10]
John Duchi, Elad Hazan, and Yoram Singer. Adaptive subgradient methods for online learning and stochastic optimization. Journal of machine learning research (JMLR), 12(7):2121--2159, 2011.
[11]
Xinran He, Junfeng Pan, Ou Jin, Tianbing Xu, Bo Liu, Tao Xu, Yanxin Shi, Antoine Atallah, Ralf Herbrich, Stuart Bowers, et al. Practical lessons from predicting clicks on ads at facebook. In Proc. of the 8th International Workshop on Data Mining for Online Advertising, pages 1--9, 2014.
[12]
Yuchin Juan, Damien Lefortier, and Olivier Chapelle. Field-aware factorization machines in a real-world online advertising system. In Proc. of WWW'2017, pages 680--688, 2017.
[13]
Yuchin Juan, Yong Zhuang, Wei-Sheng Chin, and Chih-Jen Lin. Field-aware factorization machines for ctr prediction. In Proc. of RecSys'2016, pages 43--50, 2016.
[14]
Yehuda Koren, Robert Bell, and Chris Volinsky. Matrix factorization techniques for recommender systems. Computer, 42(8):30--37, 2009.
[15]
Shie Mannor, Dori Peleg, and Reuven Rubinstein. The cross entropy method for classification. In Proc. of the 22nd international conference on Machine learning, pages 561--568, 2005.
[16]
H Brendan McMahan, Gary Holt, David Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, et al. Ad click prediction: a view from the trenches. In Proc. of KDD'2013, pages 1222--1230, 2013.
[17]
Junwei Pan, Jian Xu, Alfonso Lobos Ruiz, Wenliang Zhao, Shengjun Pan, Yu Sun, and Quan Lu. Field-weighted factorization machines for click-through rate prediction in display advertising. In Proc. of WWW'2018, pages 1349--1357, 2018.
[18]
Steffen Rendle. Factorization machines. In Proc. of IEEE International Conference on Data Mining (IDCM), pages 995--1000, 2010.
[19]
Natalia Silberstein, Oren Somekh, Yair Koren, Michal Aharon, Dror Porat, Avi Shahar, and Tingyi Wu. Ad close mitigation for improved user experience in native advertisements. In Proc. of WSDM'2020, pages 546--554, 2020.
[20]
Harald Steck. Evaluation of recommendations: rating-prediction and ranking. In Proc. of RecSys'2013, pages 213--220, 2013.
[21]
Gabriele Tolomei, Mounia Lalmas, Ayman Farahat, and Andrew Haines. You must have clicked on this ad by mistake! data-driven identification of accidental clicks on mobile ads with applications to advertiser cost discounting and click-through rate prediction. International Journal of Data Science and Analytics, 7(1):53--66, 2019.
[22]
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. Doubly robust joint learning for recommendation on data missing not at random. In Proc. of ICML'2019, pages 6638--6647, 2019.
[23]
Xing Yi, Liangjie Hong, Erheng Zhong, Nanthan Nan Liu, and Suju Rajan. Beyond clicks: dwell time for personalization. In Proc. of RecSys'2014, pages 113--120, 2014.
[24]
Bowen Yuan, Jui-Yang Hsia, Meng-Yuan Yang, Hong Zhu, Chih-Yao Chang, Zhenhua Dong, and Chih-Jen Lin. Improving ad click prediction by considering non-displayed events. In Proc. of CIKM'2019, pages 329--338, 2019.

Cited By

View all
  • (2023)Audience Prospecting for Dynamic-Product-Ads in Native Advertising2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386796(1571-1580)Online publication date: 15-Dec-2023
  • (2023)Understanding Search Behavior Bias in WikipediaAdvances in Bias and Fairness in Information Retrieval10.1007/978-3-031-37249-0_11(134-146)Online publication date: 15-Jul-2023

Index Terms

  1. Unbiased Filtering of Accidental Clicks in Verizon Media Native Advertising

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
      October 2021
      4966 pages
      ISBN:9781450384469
      DOI:10.1145/3459637
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 October 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. accidental clicks
      2. click prediction
      3. collaborative filtering
      4. native ads
      5. online advertising
      6. recommender systems

      Qualifiers

      • Research-article

      Conference

      CIKM '21
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)48
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 14 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Audience Prospecting for Dynamic-Product-Ads in Native Advertising2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386796(1571-1580)Online publication date: 15-Dec-2023
      • (2023)Understanding Search Behavior Bias in WikipediaAdvances in Bias and Fairness in Information Retrieval10.1007/978-3-031-37249-0_11(134-146)Online publication date: 15-Jul-2023

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media