Can Who-Edits-What Predict Edit Survival?
January 12, 2018 Β· Declared Dead Β· π Knowledge Discovery and Data Mining
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Ali Batuhan YardΔ±m, Victor Kristof, Lucas Maystre, Matthias Grossglauser
arXiv ID
1801.04159
Category
stat.AP
Cross-listed
cs.SI,
stat.ML
Citations
5
Venue
Knowledge Discovery and Data Mining
Last Checked
4 months ago
Abstract
As the number of contributors to online peer-production systems grows, it becomes increasingly important to predict whether the edits that users make will eventually be beneficial to the project. Existing solutions either rely on a user reputation system or consist of a highly specialized predictor that is tailored to a specific peer-production system. In this work, we explore a different point in the solution space that goes beyond user reputation but does not involve any content-based feature of the edits. We view each edit as a game between the editor and the component of the project. We posit that the probability that an edit is accepted is a function of the editor's skill, of the difficulty of editing the component and of a user-component interaction term. Our model is broadly applicable, as it only requires observing data about who makes an edit, what the edit affects and whether the edit survives or not. We apply our model on Wikipedia and the Linux kernel, two examples of large-scale peer-production systems, and we seek to understand whether it can effectively predict edit survival: in both cases, we provide a positive answer. Our approach significantly outperforms those based solely on user reputation and bridges the gap with specialized predictors that use content-based features. It is simple to implement, computationally inexpensive, and in addition it enables us to discover interesting structure in the data.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β stat.AP
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Sequence-to-point learning with neural networks for nonintrusive load monitoring
R.I.P.
π»
Ghosted
Predictive Business Process Monitoring with LSTM Neural Networks
R.I.P.
π»
Ghosted
Forecasting: theory and practice
R.I.P.
π»
Ghosted
Accurate estimation of influenza epidemics using Google search data via ARGO
R.I.P.
π»
Ghosted
Survey of resampling techniques for improving classification performance in unbalanced datasets
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted