Fun Facts: Automatic Trivia Fact Extraction from Wikipedia

December 12, 2016 · Declared Dead · 🏛 Web Search and Data Mining

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: David Tsurel, Dan Pelleg, Ido Guy, Dafna Shahaf
arXiv ID: 1612.03896
Category: cs.SI: Social & Info Networks
Cross-listed: cs.IR
Citations: 28
Venue: Web Search and Data Mining
Last Checked: 3 months ago

Abstract

A significant portion of web search queries directly refers to named entities. Search engines explore various ways to improve the user experience for such queries. We suggest augmenting search results with trivia facts about the searched entity. Trivia is widely played throughout the world, and was shown to increase users' engagement and retention. Most random facts are not suitable for the trivia section. There is skill (and art) to curating good trivia. In this paper, we formalize a notion of trivia-worthiness and propose an algorithm that automatically mines trivia facts from Wikipedia. We take advantage of Wikipedia's category structure, and rank an entity's categories by their trivia-quality. Our algorithm is capable of finding interesting facts, such as Obama's Grammy or Elvis' stint as a tank gunner. In user studies, our algorithm captures the intuitive notion of "good trivia" 45% higher than prior work. Search-page tests show a 22% decrease in bounce rates and a 12% increase in dwell time, proving our facts hold users' attention.
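The abstract describes ranking an entity's categories by trivia-quality but does not give the scoring formula. As a rough illustration only, the sketch below ranks categories with a toy score: a category scores high when its other members resemble each other but do not resemble the entity. The score, the Jaccard similarity over tag sets, the function names, and all data (including the placeholder entities and tags) are assumptions made for this example, not the paper's method.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two sets of tags."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def mean(values):
    """Arithmetic mean that returns 0.0 for an empty input."""
    values = list(values)
    return sum(values) / len(values) if values else 0.0


def rank_categories(entity, categories, features):
    """Rank the categories containing `entity`, most trivia-like first.

    Toy score (an assumption, not taken from the paper):
        cohesion * (1 - closeness)
    where cohesion is the mean pairwise similarity of the other members
    and closeness is the mean similarity of the entity to those members.
    """
    scores = {}
    for name, members in categories.items():
        others = [m for m in members if m != entity]
        # Skip categories the entity is not in, or with too few peers.
        if entity not in members or len(others) < 2:
            continue
        cohesion = mean(
            jaccard(features[a], features[b])
            for a, b in combinations(others, 2)
        )
        closeness = mean(jaccard(features[entity], features[m]) for m in others)
        scores[name] = cohesion * (1.0 - closeness)
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical toy data: tags and peer entities are invented placeholders.
features = {
    "Obama": {"politician", "lawyer", "author"},
    "politician_1": {"politician", "lawyer"},
    "politician_2": {"politician", "businessman"},
    "singer_1": {"singer", "songwriter"},
    "singer_2": {"singer", "songwriter", "actor"},
    "singer_3": {"singer", "songwriter"},
}
categories = {
    "US presidents": {"Obama", "politician_1", "politician_2"},
    "Grammy winners": {"Obama", "singer_1", "singer_2", "singer_3"},
}

ranking = rank_categories("Obama", categories, features)
print(ranking)
```

On this toy data the "Grammy winners" category ranks above "US presidents", because its other members are similar to one another and share no tags with the entity. A real system would need actual Wikipedia category membership and a much richer similarity measure.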

📜 Similar Papers

In the same crypt — Social & Info Networks

R.I.P. 👻 Ghosted

Inductive Representation Learning on Large Graphs

William L. Hamilton, Rex Ying, Jure Leskovec

cs.SI 🏛 NeurIPS 📚 18.5K cites 8 years ago

R.I.P. 👻 Ghosted

node2vec: Scalable Feature Learning for Networks

Aditya Grover, Jure Leskovec

cs.SI 🏛 KDD 📚 11.9K cites 9 years ago

R.I.P. 👻 Ghosted

Cooperative Game Theory Approaches for Network Partitioning

Konstantin Avrachenkov, Aleksei Kondratev, Vladimir Mazalov

cs.SI 🏛 J.AR 📚 4.4K cites 8 years ago

R.I.P. 👻 Ghosted

From Louvain to Leiden: guaranteeing well-connected communities

Vincent Traag, Ludo Waltman, Nees Jan van Eck

cs.SI 🏛 Sci. Rep. 📚 4.4K cites 7 years ago

R.I.P. 👻 Ghosted

Fake News Detection on Social Media: A Data Mining Perspective

Kai Shu, Amy Sliva, ... (+3 more)

cs.SI 🏛 SKDD 📚 3.1K cites 8 years ago

R.I.P. 👻 Ghosted

Heterogeneous Graph Attention Network

Xiao Wang, Houye Ji, ... (+5 more)

cs.SI 🏛 WWW 📚 3.0K cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 6 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago