Tag Archives: Data journalism

New tool provides optional upload of iPhone location data

The Research and Development Group at the New York Times Company has released a tool to allow iPhone users to upload their location data. The information – which is anonymous – will then be available to groups who apply to access the data.

Explanations here and here on the openpaths.cc website state:

This data represents a unique opportunity to help solve some of the world’s toughest problems. We believe you should have the option of donating your data in an open, secure fashion, while maintaining control of your information and where it goes.

Research requests are received from any and all projects – public, private, commercial, academic, artistic, or governmental. Requests typically look at specific geographical areas or demographic information about their subjects, so research requests include these criteria. Based on this information, users receive monthly updates that list the projects where their data is a good fit, and are offered the opportunity to donate their data.

In return, we ask researchers to provide a small benefit to their data donors. This might be a custom visualization of a donor’s location information, access to the results of the research, or other related benefits.

When researchers revealed that iPhones had been recording location data, concerns were raised about privacy.

As explained in this article in the Guardian:

Security researchers discovered that Apple‘s iPhone keeps track of where you go – and saves every detail of it to a secret file on the device which is then copied to the owner’s computer when the two are synchronised.

The file contains the latitude and longitude of the phone’s recorded coordinates along with a timestamp, meaning that anyone who stole the phone or the computer could discover details about the owner’s movements using a simple program.

For some phones, there could be almost a year’s worth of data stored, as the recording of data seems to have started with Apple’s iOS 4 update to the phone’s operating system, released in June 2010.

Apple has now released a software update 4.3.3 to fix this. Anyone who wants to make their data available should hold off installing it.

Five tools to liven up local election reporting

If you are reporting on the referendum on the voting system, the Scottish, Welsh and Northern Irish assemblies or from one of the 305 town halls across England and Northern Ireland with local elections, how are you going to present the results?

As a text only story which reports how many seats have been lost or gained by each party? Or are you going to try visualising the results? Here are five free and easy to use tools to liven up the results.

1. Many Eyes

Many Eyes is a free data visualisation tool. If you have not tried your hand at any data journalism yet, today could be the day to start.

A. Create a Many Eyes account;

B. Create your spreadsheet using Excel, Open Office (free to download) or Google Docs (free and web based);

You could follow my example by putting ward names across the top, parties down the side and the number of each ward seats won by each party. You will need to include the total in the end column.

local elections example

C. Paste the data into Many Eyes, which will automatically read your pasted information;

D. Click ‘visualise’. In this example I selected the ‘bubble chart’ visualisation. Have a play with other visualisations too;

E. Copy the embed code and paste it into your story;

2. OpenHeatMap

OpenHeatMap is a way to visualise your results in a map. It is free and very easy to use. You start by creating a spreadsheet, uploading the data and you can then embed the map in your web page.

A. Go to OpenHeatMap (you don’t need a login);

B. Create a spreadsheet. The easiest was to do this is in Google Docs. You must name your columns so OpenHeatMap can understand it. Use ‘UK_council’ for the local council, ‘tab’ for the party and ‘value’ for the number of seats. In this example, the tab column indicates the party with the most seats; the value is the number of seats;

C. Click ‘share’ (to the right hand side of your Google Doc), ‘publish as a web page’ and copy the code;

D. Paste the code into OpenHeatMap and click to view the map. In this example you will see the parties as tabs along the top which you can toggle between. You can change the colour, zoom in to your county or region and alter the transparency so you can see place names;

E. Click ‘share’ and you can copy the embed code into your story.

3. Storify

Anyone can now join Storify (it used to be by invitation only). It allows you to tell a story using a combination of text, pictures, tweets, audio and video.

A. Sign up to Storify;

B. Create a story and start adding content. If you click on the Twitter icon and search (say for ‘local election Kent’) you can select appropriate tweets; if you click on the Flickr icon you can find photos (you could ask a photographer to upload some); you can also add YouTube videos and content from Facebook. When you find an item you want to include, you simply drag and drop it into your story;

C. The art of a good Storify story is to use your skills as a storyteller. The tweets and photos need to be part of a narrative. There are some fantastic examples of story ideas on Storify;

D. Click to publish;

E. Copy and paste the embed code into the story on your site.

4. AudioBoo

You can record audio (perhaps the results as they are announced or reaction interviews with councillors) and include it in your story.

The easiest way is to download the free smartphone app or you can upload your own audio via the website.

A. Create an AudioBoo account;

B. Download the Android or iPhone app;

C. Record your short interview. You may decide to include a photo too;

D. Login to the audioboo website and click ’embed’;

E. Paste the embed code into your story.

Listen!

5. Qik

Qik is a free and allows you to live stream video. Why not broadcast the results as they happen?

A. Create a Qik account;

B. Download the app (iPhone, Android, Blackberry – a full list of supported phones is here);

C. The video will be automatically posted live to your Qik profile but you’ll need to add the code to your website before you record (you can also live stream to your Facebook page, Twitter account and YouTube channel).

D. To do this go to ‘My Live Channel’ (under your name). Click on it to get your embed code for your live channel.

E. Paste your embed code in your website or blog, where you want the live player to be.

How did you get on with the five tools? Let us know so that we can see your election stories.

Ten things every journalist should know about data

Data visuals

Every journalist needs to know about data. It is not just the preserve of the investigative journalist but can – and should – be used by reporters writing for local papers, magazines, the consumer and trade press and for online publications.

Think about crime statistics, government spending, bin collections, hospital infections and missing kittens and tell me data journalism is not relevant to your title.

If you think you need to be a hacker as well as a hack then you are wrong. Although data journalism combines journalism, research, statistics and programming, you may dabble but you do not need to know much maths or code to get started. It can be as simple as copying and pasting data from an Excel spreadsheet.

You can find out more about getting started and trying your hand at complex data journalism at news:rewired – noise to signal, on 27 May. More details about the event are here and you can order tickets, which cost £156 including VAT, by clicking here.

Here are 10 reasons to give data a go.

1. Everybody loves a list. Did you click on this post as you wanted an easy-to-read list rather than an involved article?

2. Everybody loves a map. Try Quantum GIS (QGIS), a free, open source tool, or OpenHeatMap, a fantastic, east-to-use tool as long as your data is categorised by country, local authority, constituency, region or county.

3. Tools bring data to life. Applications such as ManyEyes and Yahoo Pipes mash data and turn complex numbers and datasets into easy to read visualisations that work well both online and in print. Try this how to guide to Yahoo Pipes to get you started. Here are 22 data visualisation tools from Computer World.

4. Data may need cleaning up. Try using clean up tools like Scraperwiki, which helps non-technical journalists copy a few lines of code to turn a document such as pdf into a number-friendly file like a csv, and Google Refine, which Paul Bradshaw has written some useful posts on over on the Online Journalism Blog.

5. Data of all sorts is increasingly available. The open data movement across the UK is resulting in an increase in the release of data. The possibilities are huge, says Paul Bradshaw on the Guardian’s Datablog. January 2010, saw the launch of data.gov.uk, a fantastic resource for searching for datasets.

6. Data journalism can answer questions. A good place to start in data journalism is to ask a question and answer it by gathering data. Numbers work well. One option is to submit a Freedom of Information request to ask for the numbers. It helps if you ask for a csv file.

7. You can use the crowd. Crowdsourcing by asking a question on Twitter or using a site like Help Me Investigate, an open source tool for people can use to collaborate to investigate questions in the public interest.

8. Data can be personal to every reader. DocumentCloud can highlight and annotate documents to help readers see what is important and learn a document’s back story.

9. “Data journalism is not always presenting the data as journalism. It’s also finding the journalism within the data,” Jay Rosen said in relation to this article on Poynter on how two journalists from the Las Vegas Sun spent two years looking at 2.9 million documents to find out what “what’s right, and wrong, about our local health care delivery system”. The result was that the journalists exposed thousands of preventable medical mistakes in Las Vegas hospitals. The Nevada legislature responded with six pieces of legislation.

10. “Data ethics is just as important as ethics in journalism, in fact they are one in the same,” according to this post on Open Data Wire. Consider the BBC’s FoI request which showed a 43 per cent rise in GPs signing prescriptions for antidepressants and the ethics of unquestioningly relating this to the recession. Ben Goldacre has highlighted the problems with seeing patterns in data.

This is a cross post originally published on the news:rewired website. You can get your tickets here.

A full agenda for news:rewired – noise to signal, is here. A list of more than 20 speakers is here.

Nieman: AP Interactive and a visual future for breaking news

Nieman Journalism Lab’s Justin Ellis has written an interesting post on the development of Associated Press’ interactive output, which has nearly doubled over the past two years.

Among other things, Ellis touches on on the work of the AP Interactive department covering breaking news stories with graphics:

The trick in being able to roll out these features so quickly (and likely another reason the department has increased its output) is the usage of templates, Nessa said. That basic form allows the artists, programmers, and others on staff to publish graphics quickly — and to continuously update them as more information comes in from reporters. That’s why when events like Japan’s earthquake and subsequent tsunami hit, you could find not only breaking reports from the AP, in text, but also incredible photography and interactive graphics that harnessed reporting from correspondents as well as accounts and images from on-the-ground witnesses.

See the full post at this link.

Interactives, graphics and visualisation are among a range of essential topics for modern journalists that will be covered at Journalism.co.uk’s upcoming news:rewired conference. See the full agenda at this link.

#ijf11: Lessons in data journalism from the New York Times

Follow this link or scroll to the bottom to start by hearing more from New York Times graphics editor Matthew Ericson on what kind of people make up his team and how they go about working on a story

The New York Times has one of the largest, most advanced graphics teams of any national newspaper in the world. Yesterday at the International Journalism Festival, NYT deputy graphics editor Matthew Ericson led an in-depth two-hour workshop on his team’s approach to visualising some of the data that flows through the paper’s stories every day.

He broke the team’s strategy down in to a few key objectives, the four main ones being:

Provide context

Describe processes

Reveal patterns

Explain the geography

Here is some of what Ericson told the audience and some of the examples he gave during the session, broken down under the different headers.

Provide context

Graphics should bring something new to the story, not just repeat the information in the lede.

Ericson emphasised a graphics team that simply illustrates what the reporter has already told the audience is not doing its job properly. “A graphic can bring together a variety of stories and provide context,” he said, citing his team’s work on the Fukushima nuclear crisis.

We would have reporters with information about the health risks, and some who were working on radiation levels, and then population, and we can bring these things together with graphics and show the context.

Describe processes

The Fukushima nuclear crisis has spurned a lot of graphics work at news organisations across thew world, and Ericson showed a few different examples of work on the situation to the #ijf11 audience. Another graphic demonstrated the process of a nuclear meltdown, and what exactly was happening at the Fukushima plant.

As we approach stories, we are not interested in a graphic showing how a standard nuclear reactor works, we want to show what is particular to a situation and what will help a reader understand this particular new story.

Like saying: “You’ve been reading about these fuel rods all over the news, this is what they actually look like and how they work”.

From nuclear meltdown to dancing. A very different graphic under the ‘desribe processes’ umbrella neatly demonstrated that graphics work is not just for mapping and data.

Disecting a Dance broke down a signature piece by US choreographer Merce Cunningham in order to explain his style.

The NYT dance critic narrated the video, over which simple outlines were overlaid at stages to demonstrate what he was saying. See the full video at this link.

Reveal patterns

This is perhaps the objective most associated with data visualisation, taking a dataset and revealing the patterns that may tell us a story: crime is going up here, population density down there, immigration changing over time, etc.

Ericson showed some of the NYT’s work on voting and immigration patterns, but more interesting was a “narrative graphic” that charted the geothermal changes in the bedrock under California created by attempts to exploit energy in hot areas of rock, which can cause earthquakes.

These so-called narrative graphics are take what we think of as visualisation close to what we have been seeing for a while in broadcast news bulletins.

Explain geography

The final main objective was to show the audience the geographical element of stories.

Examples for this section included mapping the flooding of New Orleans following hurricane Katrina, including showing what parts of the region were below sea level and overlaying population density, showing where levies had broken and showing what parts of the land were underwater.

Geography was also a feature of demonstrating the size and position of the oil slick in the Gulf following the BP Deepwater Horizon accident, and comparing it with previous major oil spills.

Some of the tools in use by the NYT team, with examples:


Google Fusion Tables
Tableau Public: Power Hitters
Google Charts from New York State Test Scores – The New York Times
HTML, CSS and Javascript: 2010 World Cup Rankings
jQuery: The Write Less, Do More, JavaScript Library
jQuery UI – Home
Protovis
Raphaël—JavaScript Library
The R Project for Statistical Computing
Processing.org

An important formula 

Data + story > data

It doesn’t take a skilled mathematician to work that one out. But don’t be fooled by it’s simplicity, it underpinned a key message to take away from the workshop. The message is equally simple: graphics and data teams have the skill to make sense of data for their audience, and throwing a ton of data online without adding analysis and extracting a story is not the right way to go about it.

More from Matthew Ericson on the NYT graphics team

I spoke to Ericson after the session about what kind of people make up his team (it includes cartographers!) and how they go about working on a story.

Here’s what he had to say:

Listen!

The BBC’s Peter Horrocks on data journalism

I spoke to Peter Horrocks, who is director of the BBC World Service and the BBC’s global online news operations after the session about his take on data journalism and whether the BBC Global News had ambitions in this direction.

Here’s what he had to say:

Listen!

See the full list of links for Ericson’s session on his blog.

See more from #ijf11 on the Journalism.co.uk Editor’s Blog.

#media140 – Impure visual data tool to tell the story

There has been a range of session formats at #media140, from in-depth keynote speeches and discussion roundtables, to more jam-packed workshops showcasing some of the latest tools in social technology.

Today I attended one of the latter, a session on visualising data by Spanish design house Bestiario.

While it was, in a way, a whirlwind tour of the company’s information processing platform Impure, delegates managed to get a great overview of what it can produce (on my part only with thanks to my translator!)

The focus of the session was not about the written story, but simply visualisation, telling the story with infographics using, in essence, a drag and drop technique.

Just today the Guardian published a visualisation by Bestiario looking at who the UK gives aid to and how it has changed.

For a more detailed explanation of how to use the tool you can visit the site itself, but in simple terms the platform enables journalists to create data visualisation projects.

Users can import data files (csv), convert into a table, pull out specific fields, create different data structures and also select from a range of visualisation formats, based on the data you’re working with.

The final visualisations are publicly published on Impure, and users can also embed the infographics on their own site.

At the moment the application is free to use, and the company says there will always be “an open version”, in order to build and maintain a community.

Al Jazeera launches Twitter dashboard to track uprisings

Al Jazeera has launched a Twitter dashboard of the Arab uprisings to show what is being tweeted about and where.

One section shows the daily total of tweets mentioning hashtags for Libya, Bahrain, Yemen and Egypt and the average number of tweets per minute. These are also shown in a graph.

Another graphic shows the hashtag distribution for each country getting the most attention in the Twittersphere. Hashtags for Libya include various spellings of Libya and Gaddafi, plus #feb17

A Twitter feed is also included.

 

OJB: Bella Hurrell on data journalism and the BBC News Specials Team

Online Journalism Blog’s Paul Bradshaw asked Bella Hurrell, specials editor with BBC News Online, how data journalism was affecting their work for a forthcoming article.

Read her full response on the OJB site at this link.

As data visualisation has come into the zeitgeist, and we have started using it more regularly in our story-telling, journalists and designers on the specials team have become much more proficient at using basic spreadsheet applications like Excel or Google Docs. We’ve boosted these and other skills through in house training or external summer schools and conferences.

Jonathan Stray: A computational journalism reading list

Journalist and computer scientist Jonathan Stray has posted an interesting breakdown of what he calls “computational journalism”, a kind of parent term for data journalism, visualisation, computational linguistics, communications technology, filtering, research and more.

I’d like to propose a working definition of computational journalism as the application of computer science to the problems of public information, knowledge, and belief, by practitioners who see their mission as outside of both commerce and government. This includes the journalistic mainstay of “reporting” — because information not published is information not known — but my definition is intentionally much broader than that.

Stray has put together a reading list under each sub-header (including our very own ‘How to: get to grips with data journalism‘).

Worth a read.

Full post on Jonathan Stray’s blog at this link.

OUseful: New public data Q&A site launches

Open University lecturer, self-proclaimed mashup artist and all-round bright spark Tony Hirst blogs about a new Q&A site designed to help people with open data questions.

GetTheData.org is in “startup/bootstrapping” phase at the moment but already has a fair bit of information up.

The idea behind the site is to field questions and answers relating to the practicalities of working with public open data: from discovering data sets, to combining data from different sources in appropriate ways, getting data into formats you can happily work with, or that will play nicely with visualisation or analysis tools you already have, and so on.

Full post on OUseful.info at this link.

h/t: Online Journalism Blog