To Predict or Not To Predict? Proportionally Masked Autoencoders for Tabular Data Imputation

December 26, 2024 · Declared Dead · 🏛 arXiv.org

Authors Jungkyu Kim, Kibok Lee, Taeyoung Park arXiv ID 2412.19152 Category cs.LG: Machine Learning Cross-listed cs.AI, stat.ML Citations 5 Venue arXiv.org Repository https://github.com/normal-kim/PMAE} Last Checked 2 months ago

Abstract

Masked autoencoders (MAEs) have recently demonstrated effectiveness in tabular data imputation. However, due to the inherent heterogeneity of tabular data, the uniform random masking strategy commonly used in MAEs can disrupt the distribution of missingness, leading to suboptimal performance. To address this, we propose a proportional masking strategy for MAEs. Specifically, we first compute the statistics of missingness based on the observed proportions in the dataset, and then generate masks that align with these statistics, ensuring that the distribution of missingness is preserved after masking. Furthermore, we argue that simple MLP-based token mixing offers competitive or often superior performance compared to attention mechanisms while being more computationally efficient, especially in the tabular domain with the inherent heterogeneity. Experimental results validate the effectiveness of the proposed proportional masking strategy across various missing data patterns in tabular datasets. Code is available at: \url{https://github.com/normal-kim/PMAE}.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 💻 Repository 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Machine Learning

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago

R.I.P. 👻 Ghosted

Semi-Supervised Classification with Graph Convolutional Networks

Thomas N. Kipf, Max Welling

cs.LG 🏛 ICLR 📚 33.5K cites 9 years ago

R.I.P. 👻 Ghosted

Proximal Policy Optimization Algorithms

John Schulman, Filip Wolski, ... (+3 more)

cs.LG 🏛 arXiv 📚 25.1K cites 8 years ago

R.I.P. 👻 Ghosted

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Colin Raffel, Noam Shazeer, ... (+7 more)

cs.LG 🏛 JMLR 📚 24.4K cites 6 years ago

Died the same way — 💀 404 Not Found

R.I.P. 💀 404 Not Found

Deep High-Resolution Representation Learning for Visual Recognition

Jingdong Wang, Ke Sun, ... (+10 more)

cs.CV 🏛 IEEE TPAMI 📚 4.4K cites 6 years ago

R.I.P. 💀 404 Not Found

HuggingFace's Transformers: State-of-the-art Natural Language Processing

Thomas Wolf, Lysandre Debut, ... (+20 more)

cs.CL 🏛 arXiv 📚 3.5K cites 6 years ago

R.I.P. 💀 404 Not Found

CCNet: Criss-Cross Attention for Semantic Segmentation

Zilong Huang, Xinggang Wang, ... (+5 more)

cs.CV 🏛 ICCV 📚 2.9K cites 7 years ago

R.I.P. 💀 404 Not Found

Unified Perceptual Parsing for Scene Understanding

Tete Xiao, Yingcheng Liu, ... (+3 more)

cs.CV 🏛 ECCV 📚 2.3K cites 7 years ago