Tyler Dukes // @mtdukes
Founder, Guardian Datablog
The Manchester Guardian, May 1821
No document of a similar nature has yet been laid before the public.
At all times, such information as it contains is valuable; because, without knowing the extent to which education, and particularly the education of the labouring classes, prevails, the best opinions which can be formed of the condition and future progress of society must be necessarily incorrect.
"We tend to think of data as immutable truth. But we forget that data and data-collection systems are created by people."
Meredith Broussard, The Atlantic
Broussard found the textbook data was so untrustworthy, it was pretty much unusable by public officials.
Investigative reporter, The News & Observer
Authorities in Kenya say preteen girls in rural areas drop out of school for reasons of tribal tradition: This is the age they help parents with housework and chores.
Young girls in rural Kenya are dropping out of school because of a lack of sanitation facilities.
After ruling out medical problems from records, reporter Irene Choge used Kenya Open Data water information to show schools with the worst academic records were also the ones with the worst physical infrastructure.
Civic hackers found that Kenya's Independent Electoral and Boundaries Commission published voter registration centers only in PDF format - and decided to build an app to help people find out how to vote.
With about 24 hours and $500, the Code4Kenya team turned the PDF data into an interactive spreadsheet. They then launched a site that helped people find where to vote based on where they live.
Reporters for the Oxpeckers Centre for Investigative Environmental Journalism found that South Africa fails to convict for crimes related to rhino poaching. In 2010, the conviction rate was as 2.6 percent.
The group reused a data platform from InfoAmazonia, and using data from the Police Ministry on arrests and prosecutions since 2010, mapped and analyzed that information.
Reporters identified - for the first time - 5,171 properties in the city of Winnipeg in Canada that could develop frozen pipes - 70 percent of those at risk.
After being told that Winnipeg officials would only release data one address at a time, CBC scraped the data for more than 190,000 properties and did the analysis on their own.
Using a loophole in the country's adoption rules, children traded in an underground market for were abused and neglected in a practice called "rehoming."
Reuters scraped and analyzed 5,029 posts over a five-year period on message boards used by parents seeking to re-home children, using it as a starting point for their stories.
A team of 10 journalists from six countries built a comprehensive database to track deaths of migrants in the Mediterranean sea amid an international discussion of how to prevent these tragedies.
No database of this information existed. So journalists built their own, combining data from government sources and carefully curated news sources to track 13,718 migrants.
In a massive project, a huge team of journalists probed the use of offshore tax havens to expose government officials, wealthy citizens and ponzi schemers to hide money for financial gain.
In November 2012, investigators at La Nacion Costa Rica anonymously received a storage device with millions of data points spread over 320 spreadsheets in multiple formats and no data dictionary. They analyzed the data and spent months rebuilding and collaborated across newsrooms to figure out what they contained.
Get this presentation: