FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction
April 03, 2023 · Declared Dead · 🏛 AAAI Conference on Artificial Intelligence
"Paper promises code 'coming soon'"
Evidence collected by the PWNC Scanner
Authors: Kelong Mao, Jieming Zhu, Liangcai Su, Guohao Cai, Yuru Li, Zhenhua Dong
arXiv ID: 2304.00902
Category: cs.IR (Information Retrieval)
Citations: 125
Venue: AAAI Conference on Artificial Intelligence
Last Checked: 1 month ago
Abstract
Click-through rate (CTR) prediction is one of the fundamental tasks for online advertising and recommendation. While multi-layer perceptron (MLP) serves as a core component in many deep CTR prediction models, it has been widely recognized that applying a vanilla MLP network alone is inefficient in learning multiplicative feature interactions. As such, many two-stream interaction models (e.g., DeepFM and DCN) have been proposed by integrating an MLP network with another dedicated network for enhanced CTR prediction. As the MLP stream learns feature interactions implicitly, existing research focuses mainly on enhancing explicit feature interactions in the complementary stream. In contrast, our empirical study shows that a well-tuned two-stream MLP model that simply combines two MLPs can even achieve surprisingly good performance, which has never been reported before by existing work. Based on this observation, we further propose feature gating and interaction aggregation layers that can be easily plugged to make an enhanced two-stream MLP model, FinalMLP. In this way, it not only enables differentiated feature inputs but also effectively fuses stream-level interactions across two streams. Our evaluation results on four open benchmark datasets as well as an online A/B test in our industrial system show that FinalMLP achieves better performance than many sophisticated two-stream CTR models. Our source code will be available at MindSpore/models.
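The abstract names three pieces: two parallel MLP streams, a feature gating layer that gives each stream a differentiated input, and an aggregation layer that fuses the two stream outputs. The sketch below is one possible reading of that description, not the authors' implementation. The class name `TwoStreamMLP`, the `2 * sigmoid` gating form, the bilinear fusion term, and gating on the full input vector are all assumptions made for illustration; the abstract does not specify them.

```python
import math
import random


def init_linear(rng, n_in, n_out):
    """Small random weights (one row per output unit) and zero biases."""
    W = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return W, [0.0] * n_out


def linear(x, W, b):
    """Affine map W x + b."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def mlp(x, layers):
    """Apply each (W, b) layer followed by a ReLU."""
    for W, b in layers:
        x = [max(0.0, v) for v in linear(x, W, b)]
    return x


def gate(x, W, b):
    """Assumed gating form: scale each input by 2 * sigmoid(.), a weight in (0, 2)."""
    return [xi * 2.0 * sigmoid(g) for xi, g in zip(x, linear(x, W, b))]


class TwoStreamMLP:
    """Hypothetical two-stream MLP with per-stream gates and a bilinear fusion head."""

    def __init__(self, dim, hidden, seed=0):
        rng = random.Random(seed)
        # One gate and one two-layer MLP per stream.
        self.gates = [init_linear(rng, dim, dim) for _ in range(2)]
        self.streams = [
            [init_linear(rng, dim, hidden), init_linear(rng, hidden, hidden)]
            for _ in range(2)
        ]
        # Fusion parameters: one linear term per stream plus a bilinear term.
        self.w1 = [rng.uniform(-0.1, 0.1) for _ in range(hidden)]
        self.w2 = [rng.uniform(-0.1, 0.1) for _ in range(hidden)]
        self.W3, _ = init_linear(rng, hidden, hidden)
        self.bias = 0.0

    def predict(self, x):
        # Each stream sees its own gated copy of the same input embedding.
        o1, o2 = (
            mlp(gate(x, *g), layers) for g, layers in zip(self.gates, self.streams)
        )
        first_order = sum(a * b for a, b in zip(self.w1, o1)) + sum(
            a * b for a, b in zip(self.w2, o2)
        )
        # Bilinear term o1^T W3 o2 stands in for stream-level interaction fusion.
        w3_o2 = linear(o2, self.W3, [0.0] * len(o1))
        second_order = sum(a * b for a, b in zip(o1, w3_o2))
        return sigmoid(self.bias + first_order + second_order)


model = TwoStreamMLP(dim=8, hidden=4)
ctr = model.predict([0.5] * 8)  # untrained click probability in (0, 1)
```

The weights here are random and untrained, so the output only demonstrates the data flow. A plain two-stream baseline of the kind the abstract describes as "simply combines two MLPs" would correspond to dropping the gates and the bilinear term.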
📜 Similar Papers
In the same crypt — Information Retrieval
R.I.P.
👻
Ghosted
LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation
R.I.P.
👻
Ghosted
Graph Convolutional Neural Networks for Web-Scale Recommender Systems
🌅
Old Age
Neural Graph Collaborative Filtering
R.I.P.
👻
Ghosted
Self-Attentive Sequential Recommendation
R.I.P.
👻
Ghosted
DeepFM: A Factorization-Machine based Neural Network for CTR Prediction
Died the same way — ⏳ Coming Soon™
R.I.P.
⏳
Coming Soon™
Exploring Simple Siamese Representation Learning
R.I.P.
⏳
Coming Soon™
An Analysis of Scale Invariance in Object Detection - SNIP
R.I.P.
⏳
Coming Soon™
Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection
R.I.P.
⏳
Coming Soon™