CloudAve
Software in Business. The Business of Software.

Data Journalism at The Guardian

By Paul Miller on October 2, 2012

UK newspaper The Guardian has done some pioneering work in using data, and in engaging readers to explore that data and share their own insights. The paper’s Simon Rogers and Google’s Kathryn Hurley shared some of the lessons at the Strata conference.

Rough notes follow.

Not going to talk about big projects like the riots, Wikileaks, and MPs’ expenses… Going to talk about the day-to-day process of hacking around with data.

Open data journalism – more than just Google spreadsheets. Much more of a two-way process than simply writing and disseminating stories.

Numbers need context. Journalists need the skills to interpret, probe, and tell a data-backed story.

First rule of what we do – find the key data behind a story and make it public. The Guardian Datablog and Data Store are used to push out data relevant to the main news stories of the week.

Lots of data is available, but it’s locked up in a wide range of data sets. A lot of the Data team’s work involves pulling freely available data together in one place – making it comparable and useful.

Get past raw numbers, and show how they have changed over time. Measures and units and groupings change, so how do you actually compare like with like?
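To make the "like with like" point concrete, here is a minimal Python sketch of two common ways to put raw counts on a comparable footing: converting them to a rate per head of population, and rebasing a series to an index where a chosen year equals 100. This is not the Guardian's method, just one plausible approach. The function names and all of the figures are made up for illustration.

```python
def per_capita(count, population, per=100_000):
    """Convert a raw count into a rate per `per` people."""
    return count * per / population


def rebase(series, base_year):
    """Express each year's value as an index where base_year = 100."""
    base = series[base_year]
    return {year: value * 100 / base for year, value in series.items()}


# Made-up figures, for illustration only.
incidents = {2010: 500, 2011: 550, 2012: 600}
population = {2010: 100_000, 2011: 110_000, 2012: 125_000}

# Raw counts rise every year, but the rate per 100,000 people does not.
rates = {year: per_capita(incidents[year], population[year]) for year in incidents}

# Index the rates against 2010 to show change over time.
index = rebase(rates, 2010)

for year in sorted(index):
    print(year, incidents[year], round(rates[year], 1), round(index[year], 1))
```

In this invented example the raw count climbs each year, while the per-capita rate stays flat and then falls, which is exactly the sort of context a raw number hides.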

Don’t always just rely upon the algorithm… Need the knowledge and the question-asking capabilities to wonder whether or not the result is too good to be true. Often, it will be wrong.

Olympics… lots of data, but very little was open. IOC sold the data, and refused to allow it to be shared.

Kathryn Hurley at Google… spent the last week working directly with the Guardian team… Learned…

  • News drives the stories
  • Data journalism moves fast
  • Quick and easy tools reign supreme

What does this mean for other businesses?

  • Know what matters
  • Find the data to back it up (internally, from government, from public data sites, from data markets, etc.)
  • Clean the data (a lot! Sometimes just normalisation, sometimes more serious)
    • tools like Google Refine were recommended here
  • Sometimes the data you have isn’t enough – find more
  • Tell the story – visualisation matters, interactivity helps
  • Share the data to support your story – make it available for download, or offer an API
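As a rough illustration of the "clean" and "share" steps in the list above, the Python sketch below normalises a couple of messy values and writes the result out as CSV that could be offered for download. It uses only the standard library. The helper names (`clean_number`, `clean_name`, `to_csv`) and the sample rows are hypothetical, and real cleaning jobs are usually messier than this.

```python
import csv
import io


def clean_number(text):
    """Turn strings like ' £1,234.50 ' into a float; None if blank or unparseable."""
    stripped = text.strip().replace(",", "").lstrip("£$€")
    if not stripped:
        return None
    try:
        return float(stripped)
    except ValueError:
        return None


def clean_name(text):
    """Collapse runs of whitespace and standardise capitalisation."""
    return " ".join(text.split()).title()


def to_csv(rows, fieldnames):
    """Serialise a list of dicts as CSV text, ready to publish for download."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up rows, standing in for data keyed by hand or scraped from a report.
raw_rows = [
    {"region": "  north   east ", "spend": " £1,234.50 "},
    {"region": "LONDON", "spend": "n/a"},
]

cleaned = [
    {"region": clean_name(row["region"]), "spend": clean_number(row["spend"])}
    for row in raw_rows
]

csv_text = to_csv(cleaned, ["region", "spend"])
print(csv_text)
```

Values that cannot be parsed are kept as empty cells rather than guessed at, so readers of the published file can see where the source data was missing.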

Tools need to get easier to use and richer, to let data journalists (and others) get the results they need more quickly, and with less coding.

Published data needs to be more logically formatted… PDFs derived from printed documents are designed for human reading, not for machine processing.

Image of The Guardian’s offices by Flickr user Mark Hillary.



(Cross-posted @ The Cloud of Data)

Posted in Trends & Concepts | Tagged big data, data journalism, open data, oreilly, strata, strataconf, strataeu, The Guardian

