Browse > Home /

#ijf13: Data journalism pointers and Excel starter tips

April 24th, 2013 | No Comments | Posted by in Data, Events
Image by Abron on Flickr. Some rights reserved

Image by Abron on Flickr. Some rights reserved

Data journalism is not a new phenomenon. Speaking at the International Journalism Festival in Perugia, Steve Doig from the Walter Cronkite school of journalism highlighted this by talking about the impact of the rise of the personal computer in the early 1980s and how this helped journalists track “patterns” in the data they were getting hold of.

Before this technology arrived, such reporting was “often based simply on anecdotes”, he said. Giving the example of covering “the problem of drunk driving”, journalists would have previously had to have referenced a “bad example of such an accident” before moving to discuss the “larger problem”, he explained.

The nice thing about data journalism is it lets you go beyond anecdotes to evidence.

His workshop ran through some of the key features of Excel to help journalists sort, filter, “transform” and “summarise” data.

Below is a summary of some of the key points he raised – the full tutorial is available online.

  • Sorting, filtering, transforming and summarising data with Excel

When it comes to the most common format of data, Doig said it “tends to be alphabetical”, which will not make it immediately clear to a journalist what the story, or stories, behind the data are.

So we want this to be “more journalistically interesting”, Doig said. As an example he demonstrated how journalists can sort numbers by highest or lowest.

When it comes to filtering data, he described some particularly large datasets as “forests”, and that journalists “only want to see the trees that we’re interested in”.

Using Excel journalists can hide data they are less interested in and effectively keep their work area tidy.

Journalists can also use Excel to “transform data using functions and formulas”. For example, he showed the delegates how to create new variables, such as working out a crime rate per 100,000 people when you already have statistics on population and crime. This then helps the journalist “make fair comparisons between places of different size”.

Finally, you can “collapse your data down by categories”. This can be achieved by using pivot tables, which enables the users to select certain variables and bring those together.

For example, if you wanted to look at the number of murders by region, but the data is also broken down into smaller geographic areas, you could build a pivot table, select the ‘region’ variable in ‘row labels’ and select the column stating the number of murders and put it in ‘values’. This would combine the number of murders per region.

  • Data stories are not only for economics or business journalism

Here is just a selection of the different types of data story subjects Doig highlighted:

- Budgets and taxes
- Crime patterns
- School test scores
- Auto accidents
- Demographic change
- Pet licences
- Air quality
- Sports statistics

  • A simple toolbox can get you far when you are starting out

Highlighting some of the key tools for working with datasets, Doig said Excel lets journalists do the majority of the work they would need to, supported by database software like Access, mapping tools like ArcMap, a text editor and social network analysis plug-ins such as NodeXL.

And when it comes to visualising the data he pointed to data journalism staple Google Fusion tables, as well as coding language such as Ruby, Django, perl, python.

  • Tap into industry resources

Doig recommended a number of outlets and online platforms offering industry expertise on data journalism:

- Data journalism handbook
- EJC
- NICAR
- Investigative reporters and editors
- SKUP
- Global Investigative Journalism Network

Tags: , , ,

Similar posts:

#Podcast: Getting started in data journalism

April 12th, 2013 | No Comments | Posted by in Data, Podcast
Image by Adikos on Flickr. Some rights reserved.

Image by Adikos on Flickr. Some rights reserved.

The ability to analyse and untangle datasets is a vital skill for journalists in the age of endless information, so this week’s podcast focuses on how to get started in data journalism.

Getting started in data journalism and getting further than the basics can seem like a mountain of programming tools and coding languages, but the experts we spoke to describe how to take the first steps.

  • Paul Bradshaw, online journalist, lecturer and blogger, Help Me Investigate.com
  • Marianne Bouchart, web producer and data journalism projects co-ordinator, Bloomberg News
  • Nicola Hughes, data journalist, Dataminer UK, 2011/12 Knight-Mozilla fellow at the Guardian, soon to join The Times

You can hear future podcasts by signing up to the Journalism.co.uk iTunes podcast feed.

 

Tags: , , , , , , ,

Similar posts:

#Tip: Watch these free data journalism tutorials

Image by Abron on Flickr. Some rights reserved

Image by Abron on Flickr. Some rights reserved

If you want to get started in data journalism, you might like to watch this series of videos.

There is a series of four free courses being run by kdmc at the Berkeley Graduate School of Journalism. And as they are online you don’t need to be in California to take part.

The final tutorial in the series takes place on Monday (you can register here), but you can also make the most of the seminar videos which are also at the above link.

There are tutorials on the following topics:

  • Spreadsheet basics
  • Introduction to data visualisation
  • Communicating with maps
  • Introduction to data mapping (which will run on Monday)

Journalism.co.uk offers data journalism training. We have one-day courses coming up in data visualisations, open data and an introduction to data journalism. See the full list at this link.

Tags: , , ,

Similar posts:

#Tip: Understand open data with ODI guide

March 11th, 2013 | No Comments | Posted by in Data, Top tips for journalists

For those keen to get started in data journalism, the Open Data Institute website’s guide to open data may prove useful in gaining a detailed understanding of what open data is, and the differences between big data, linked data and midata. The guide also outlines to organisations the benefits of opening up their data.

If you have a tip you would like to submit to us at Journalism.co.uk email us using this link.

 

Tags: , , ,

Similar posts:

#Podcast – Robot reporting: A look at the LA Times data desk

March 8th, 2013 | No Comments | Posted by in Data, Podcast
Image by davedehetre on Flickr. Some rights reserved

Image by davedehetre on Flickr. Some rights reserved

If you were working on a newsdesk in California when you got reports of an earthquake, would you go and copy and paste the details from the the US Geological Survey email that was auto-generated.

Would you write when and where it happened and how powerful it was? Would you grab a map and encourage sub-editors to publish quickly?

Well, the Los Angeles Times would have already beaten you to it. It would have auto-published a post, complete with auto-generated headline, a map, and a Ken Schwencke’s byline, the person who wrote the code that auto-writes a story using information from the US Geological Survey.

In this podcast we hear about this example of robot reporting from the LA Times data desk and others, plus look at how internal databases can assist journalists. We also hear about recent data projects from the LA Times.

Journalism.co.uk technology editor Sarah Marshall speaks to:

  • Ben Welsh, database producer, Los Angeles Times
  • Brian Boyer, news applications editor, NPR and project leader on the PANDA project

 

Tags: , , , , ,

Similar posts:

#Tip of the day for journalists: How to get data out of council budget reports

February 19th, 2013 | No Comments | Posted by in Data, Top tips for journalists
Financial-data

Image by Dave Dugdale on Flickr. Some rights reserved

If you are a reporter for a local newspaper, site or radio station you will no doubt be tasked with looking at council budgets reports.

If you have not tried getting data from PDFs to spreadsheets, this guide written by university lecturer Paul Bradshaw and published on Help Me Investigate is a must read.

Bradshaw includes links to lots of useful tools.

Tags: , , ,

Similar posts:

#Podcast: Examining data-driven health reporting

February 15th, 2013 | No Comments | Posted by in Data, Podcast
Image by a.drian on Flickr. Some rights reserved

Image by a.drian on Flickr. Some rights reserved

This podcast looks at how health data can be a source of stories.

We hear how journalists are using information obtained under the Freedom of Information Act, scientific reports and open data as sources.

Journalism.co.uk technology editor Sarah Marshall speaks to:

You can hear future podcasts by signing up to the Journalism.co.uk iTunes podcast feed.

Tags: , , , , , , , , , ,

Similar posts:

#Tip of the day for journalists: Try spreadsheet analysis tool QueryTree

February 6th, 2013 | No Comments | Posted by in Data, Top tips for journalists

Querytree

 

Lecturer and data journalist Paul Bradshaw has written a post highlighting what looks to be a really useful tool for journalists.

QueryTree is a drag-and-drop spreadsheet analysis tool.

In a post headlined Is this an Excel killer? QueryTree app lowers the bar on data journalism, Bradshaw explains why it is so promising.

Sometimes the most impressive tools solve a problem you never knew you had. In the case of QueryTree, a new data analysis tool, that problem is something most people never question: spreadsheets.

For all the shiny-shiny copy-and-paste-click-and-drag-ness in new journalism tools, most data digging comes back to at least some simple spreadsheet work, and that represents a significant hurdle for many journalists used to working with simpler tools.

It is worth reading the full post as Bradshaw explains how to use it.

There is with a free 30-day trial and then subscriptions start at £8 a month.

Tags: , , ,

Similar posts:

#Podcast: Three approaches to data journalism in Latin America

October 19th, 2012 | No Comments | Posted by in Data, Podcast

This podcast looks at the work of three data journalism teams in Latin America.

We hear about investigative data journalism and data scraping at La Nación in Costa Rica; we are introduced to an interactive created by the new data team at Estadão de São Paulo in Brazil; and we look at how La Nación in Argentina is dealing with data in a country without Freedom of Information.

In this podcast Journalism.co.uk speaks to:

  • Giannina Segnini, editor, investigative unit at Costa Rica’s La Nación
  • Jose Roberto de Toledo, a journalist at Estadão de São Paulo in Brazil
  • Angelica Peralta Ramos, multimedia development manager and data journalism project manager at Argentina’s La Nación

You can hear future podcasts by signing up to the Journalism.co.uk iTunes podcast feed.

Tags: , , , , , , , ,

Similar posts:

#PPAdigital: Paul Bradshaw’s five principles of data management

September 26th, 2012 | No Comments | Posted by in Data, Events

At today’s PPA Digital Publishing Conference, Paul Bradshaw, publisher of the Online Journalism Blog, visiting professor at City University, London, and course leader for the MA in Online Journalism at Birmingham City University, talked about data both in terms of data journalism and data analytics.

He set out five principles of data management.

1. Data is only as good as the person asking questions

Bradshaw said that whether the data is from analytics and used for commercial purposes, or whether it’s editorial data and you are doing an investigation, “the key thing is to have questions to ask” of the data.

That should drive everything, rather than you being led by the data.

2. Data can save time and money

Bradshaw is frequently told that data journalism is resource-intensive or a publishing company does not feel it has resources “to do data stuff”.

But he argues that data saves time, does not have to cost money or rely on having a team of developers.

He explained that people he has trained find they learn computer techniques to do things that they previously did manually.

They might scrape websites very neatly into a spreadsheet, they may pull data from an analytics package into spreadsheet, they might visualise that dynamically – and that all saves time.

You might prepare for a big event by having spreadsheets set up or feeds set up or triggers.

3. Data is about people

There can be a danger of becoming “bogged down in the data”, Bradshaw warned. “But really stories are told about people and to people.”

He advises taking “a step back from that data” to find “the people that it is telling a story about”.

He said that in the case of data journalism, that is about finding case studies; in the case of analytics you can use the data to create profiles or pictures of the people who are using your site.

4. Good data is social, sticky and useful

“If data is going to be useful it needs to have a point, people need to be able to do something with it,” Bradshaw said.

People may share it socially, he explained. And it becomes “sticky” if it allows people to spend time exploring it.

5. You can be driven by the data or driven by the story

“Sometimes you are getting data passively and you are looking for stories in it, sometimes you are seeking out data because of the story or lead or question you have,” Bradshaw explained. And that comes back to his first point. “It’s really important to have questions” rather than to be “passively driven by the data”.

And Bradshaw demonstrated how his principles make “a lot more sense” when you replace the word ‘data’ with ‘journalism’.

  • Journalism is only as good as the person asking questions
  • Journalism can save time and money
  • Journalism is about people
  • Good journalism is social, sticky and useful
  • You can be driven in journalism by the source or driven by the story

Listen below to hear audio of Paul Bradshaw setting out his five principles of data management:

Paul Bradshaw leads data journalism courses for Journalism.co.uk. The next course is on 5 December. There are details at this link.

 

Tags: , , ,

Similar posts:

© Mousetrap Media Ltd. Theme: modified version of Statement