Data Mining for Decision-Making
Every day we generate large amounts of data. Just by going online or using our cell phones, we leave informative traces, the so-called “digital breadcrumbs”, which can give away a lot of information about our individual actions, with obvious threats to privacy. However, when these individual data are gathered and analyzed collectively, they can reveal global behavior patterns. At the S&P group, we ask whether we can use some of these data not only to know more, but also to make better decisions, at both the governmental and citizen levels.
In this presentation I’ll give three examples of projects that the group is currently involved in and briefly describe some of the data-gathering and data-mining tools that we are developing, for both structured and unstructured data.
First, I will try to give some insights that might help answer the long-standing question of whether humans have a biological reproduction cycle. Do we, as a species, tend to reproduce at certain times of the year, as other mammals do? Second, I will argue that, beyond the inherent importance of this question, similar “breadcrumbs” can be used to detect the onset of epidemic diseases, such as the flu, and to help predict how many people will show up at the hospital on a given day. Finally, I’ll give a brief example of how these methods can be used to keep our politicians in check, by analyzing the participation and discourse of the elected members of Parliament.