Lightning:BTMining - A dataset looking for a problem
From 34C3_Wiki
Description | I have a dataset of 68 years of parliamentary documents from the German parliament, and no idea what to do with it. But maybe you do! So, I am sharing the dataset with all of you so you don't have to collect it yourself. |
---|---|
Slides | https://more.velcommuta.de/34c3/Lightning-BTMining.pdf |
Website(s) | https://github.com/malexmave/pdok-mirror, https://more.velcommuta.de/34c3/bundestag/ |
Tags | politics, open data, dataset, data-mining |
Person organizing | User:malexmave |
Contact: | max@velcommuta.de |
Language | en - English |
Duration | 5 |
Desired session | Day 2 |
Desired timeframe | begin, middle, end |
I have a dataset of 68 years of parliamentary documents from the German parliament with excellent OCR, and no idea what to do with it. But maybe you do! So, I am sharing the dataset with all of you so you don't have to collect it yourself. Want to do sentiment analysis? Be my guest. Build a text generator for the different political parties? Sure. Just make sure you have around 100 GB of hard drive space and go nuts.
Video starts at https://www.youtube.com/watch?v=67rh6jB2UVQ#t=1h14m38s