Structure and Content of the Visible Darknet
From 35C3 Wiki
Description | We analyzed the topology and the content found on the "darknet", the set of websites accessible via Tor. We created a darknet spider and crawled the darknet starting from a bootstrap list by recursively following links. |
---|---|
Website(s) | https://github.com/decrypto-org/spider, https://arxiv.org/pdf/1811.01348.pdf, https://docs.google.com/presentation/d/1grQpTV6IV8f4kz7UNnrsKGbRdcpZ3tso7-t5dVs Ibw/edit?usp=sharing |
Type | Talk |
Kids session | No |
Keyword(s) | social, network, web |
Tags | Tor Network, Darknet, Content Analysis |
Person organizing | Jogli5er |
Language | en - English |
Other sessions...
|
(Click here to refresh this page.)
Starts at | 2018/12/29 15:30 |
---|---|
Ends at | 2018/12/29 16:00 |
Duration | 30 minutes |
Location | Room:Seminar room 13 |
Using the crawler we explored a connected component of more than 34'000 hidden services, of which we found 10'000 to be online. Contrary to folklore belief, the visible darknet is surprisingly well-connected through hub websites such as wikis and forums. We performed a comprehensive categorization of the content using supervised machine learning. We observer that about half of the web content is related to apparently licit activities based on our classifier. A significant amount of content pertains to software repositories, blogs and activism-related websites. Among unlawful hidden services, most pertain to fraudulent websites, services selling counterfeit goods, and drug markets.
Authors: Georgia Avarikioti, Roman Brunner, Aggelos Kiayias, Roger Wattenhofer, Dionysis Zindros