Automatically Generating a Large, Culture-Specific Blocklist for China

June 04, 2018 Β· Declared Dead Β· πŸ› FOCI @ USENIX Security Symposium

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Austin Hounsel, Prateek Mittal, Nick Feamster arXiv ID 1806.03255 Category cs.CY: Computers & Society Cross-listed cs.NI Citations 9 Venue FOCI @ USENIX Security Symposium Last Checked 4 months ago
Abstract
Internet censorship measurements rely on lists of websites to be tested, or "block lists" that are curated by third parties. Unfortunately, many of these lists are not public, and those that are tend to focus on a small group of topics, leaving other types of sites and services untested. To increase and diversify the set of sites on existing block lists, we use natural language processing and search engines to automatically discover a much wider range of websites that are censored in China. Using these techniques, we create a list of 1125 websites outside the Alexa Top 1,000 that cover Chinese politics, minority human rights organizations, oppressed religions, and more. Importantly, $\textit{none of the sites we discover are present on the current largest block list}$. The list that we develop not only vastly expands the set of sites that current Internet measurement tools can test, but it also deepens our understanding of the nature of content that is censored in China. We have released both this new block list and the code for generating it.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Computers & Society

R.I.P. πŸ‘» Ghosted

Green AI

Roy Schwartz, Jesse Dodge, ... (+2 more)

cs.CY πŸ› arXiv πŸ“š 1.5K cites 6 years ago

Died the same way β€” πŸ‘» Ghosted