Structure and Content of the Visible Darknet

From 35C3 Wiki

Revision as of 17:10, 29 December 2018 by Jogli5er (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Description We analyzed the topology and the content found on the "darknet", the set of websites accessible via Tor. We created a darknet spider and crawled the darknet starting from a bootstrap list by recursively following links.
Website(s) https://github.com/decrypto-org/spider, https://arxiv.org/pdf/1811.01348.pdf, https://docs.google.com/presentation/d/1grQpTV6IV8f4kz7UNnrsKGbRdcpZ3tso7-t5dVs Ibw/edit?usp=sharing
Type Talk
Kids session No
Keyword(s) social, network, web
Tags Tor Network, Darknet, Content Analysis
Person organizing Jogli5er
Language en - English
en - English
Other sessions... ... further results

(Click here to refresh this page.)

Starts at 2018/12/29 15:30
Ends at 2018/12/29 16:00
Duration 30 minutes
Location Room:Seminar room 13

Using the crawler we explored a connected component of more than 34'000 hidden services, of which we found 10'000 to be online. Contrary to folklore belief, the visible darknet is surprisingly well-connected through hub websites such as wikis and forums. We performed a comprehensive categorization of the content using supervised machine learning. We observer that about half of the web content is related to apparently licit activities based on our classifier. A significant amount of content pertains to software repositories, blogs and activism-related websites. Among unlawful hidden services, most pertain to fraudulent websites, services selling counterfeit goods, and drug markets.
Authors: Georgia Avarikioti, Roman Brunner, Aggelos Kiayias, Roger Wattenhofer, Dionysis Zindros