"No one is harder on a talented person than the person themselves" - Linda Wilkinson ; "Trust your guts and don't follow the herd" ; "Validate direction not destination" ;

August 19, 2020

Research Paper Reads - MODELING USERS FOR ONLINE ADVERTISING

Paper #1 - MODELING USERS FOR ONLINE ADVERTISING

Key Notes
  • Contribution - a neural network model (app2vec) to vectorize mobile apps by studying how users employ these apps
Data Collected from Users
  • User activity data
  • User behaviors
  • Logging user activities
  • Contents consumed by users
  • Anonymous browser cookie syncing technique
Ad Platforms
  • Targeting audiences
  • User profiling
  • Ads based on their activity history across the web
Findings
  • Users watching polymorphic videos are likely to have similar interests
Insights
  • US mobile users download more than eight apps per month on average
  • 90% of the time spent on mobile devices was spent using apps
Online Ad Targeting
  • Data - users browsing, app usage,
  • and other activities on the Internet
  • Targeting - site/page context, placement size, user behavior and geolocation
User Targeting
Publishers, Advertisers, Ad-networks, Online users

Research Directions
  • Cross-device user tracking - Users access online content through multiple devices
  • Value of user profile - Different costs associated with them, Ad targeting on user profile
Observe User Online Advertising Profile and Ad Targeting
Do ads target user profiles in the field?
What are the ads shown to different users?
How do ads impact users profiles?

Data - The capability to gather display ads and video ads from across the web is central to our work
Profile-driven crawling - Enables each crawler instance to interact with the ad ecosystem as though it were a unique user with particular characteristics.
The Anatomy of Online Advertising
  • Advertisers - Advertiser reach out to potential customers. 
  • Publisher View - premium campaigns (specific advertisers, ad networks, ad exchanges)
Types of ads - Text Ads, Display Ads, Stream Ads, Video Ads
Video ads - Pre-roll, mid-roll, post-roll, Overllay-ads, Sponsored Videos

User Modeling on Mobile
  • app2vec to represent apps in a vector space without a priori knowledge of their semantics
  • app2vec to cluster apps based on app distances in their vector space
  • Computing app similarity is through the bag-of-words method using app meta information
Large Scale Look-alike Audience Modeling
  • A simple similarity-based look-alike system can use direct user-2-user similarity  to search for users that look like (or in other words, be similar to) seeds
  • Another type of look-alike audience systems for online advertising is built with Logistic Regression (LR)
  • User segments can be user characteristics such as user interest categories. 

Real-time Attention Based Look-alike Model for Recommender System
Key Notes
  • Real-time attention based look-alike model (RALM) for recommender systems
  • Deep neural networks (DNNs) and recurrent neural networks (RNNs) are more and more popular on recommendation task
  • "Matthew effect" - low quality and poor diversity of recommended contents.

RALM
  • RALM is a similarity based look-alike model, which consists of user representation learning and look-alike learning
  • Deep interest network for multifields user interests representation learning
  • Local representation of seeds should be processed online in real-time
  • k-means clustering to partition seeds into k clusters
  • Similarity based methods determine similarity between seeds and users based on distance measurement.

System Architecture
Offline Training
  • User Representation learning. The user representation model is developed based on deep learning network
  • Look-alike learning is based on attention model and clustering algorithm
Online asynchronous processing
  • User feedback monitor: The audience extension system updates the seeds of candidates through monitoring the click behaviors of all WeChat users in real-time
  • online serving - The lookalike model predicts the global embedding of seeds through global attention unit

Metrics
  • CTR (Click-through Rate): As audience increased, many new users sharing the same interests with seeds are reached. Therefore, CTR is expected not to decrease
  • Category & Diversity. One of our purposes is enriching user’s interest in our system, so we define a metric named diversity. It is represented by a number of content categories or tags a user has read in a day. With a more comprehensive user representation, more kinds of contents will be reached and category&tag diversity is expected to increase
More Reads
Comprehensive Audience Expansion based on End-to-End Neural Prediction

Keep Thinking!!!

No comments: