22C3 - 2.2

22nd Chaos Communication Congress
Private Investigations

Isabel Drost
Day 1
Room Saal 2
Start 19:00
Duration 01:00
ID 415
Event type Lecture
Track Science
Language English

Developing Intelligent Search Engines

The presentation will give a short overview of the architecture of search engines and of how machine learning can help improve them. In addition, some projects you can take part in will be briefly introduced.

Developers of search engines today face more than technical problems such as designing an efficient crawler or distributing search requests among servers. Search has become a problem of identifying reliable information in an adversarial environment. Since the web is used for purposes as diverse as trade, communication, and advertising, search engines need to be able to distinguish between different types of web pages. In this paper we describe some common properties of the WWW and social networks. We show one possibility of exploiting these properties for classifying web pages.