Dataharvest 2025 - the European Investigative Journalism Conference: Full Schedule

10:00am CEST

Masterclass: Hack your way into a big dataset with R (Masterclass ticket needed)

Thursday May 22, 2025 10:00am - 12:00pm CEST

A separate ticket is required to attend this masterclass. If you would like to attend but haven't yet purchased a ticket, please contact us at info@dataharvest.eu

R is one of the most useful programming languages in data journalism. You may have heard of it, maybe even tried it a little and found the learning curve too steep. If so, this session is for you.

We are going to spend the day looking at European Environment Agency’s EPRTR (European Pollutant Release and Transfer Register) data – it’s a lot of data, and some of it is quite messy. It contains dozens, probably hundreds, of potential lines of investigation to be explored – and that’s what we’re going to do.

By the end of the day, you will know how to import data in an R environment, filter it, reshape it, and interrogate it. You will be able to make some basic graphs. Above all, you will be on the way to finding stories in the day’s chosen data, and be able to take your script away and use it again, or adapt it to other datasets. And, we hope, you will have the beginnings of a story idea.

We will assume that you are familiar with spreadsheets, but that you have no knowledge of R. You will not need to install anything – everything will be run on cloud instances of R.

If you’re already advanced with R, it is still worth coming along to use and share what you know, to support others, and to learn something new.

(If you already have a dataset you want to work with – bring that too!)

Speakers

Leopold Salzenstein

Data Coordinator, Arena for Journalism in Europe

Investigative data journalist

Jonathan Stoneman

Arena for Journalism in Europe

Former BBC journalist, turned datajournalist, trainer, consultant. Works with Arena as Masterclass Coordinator for the Climate Arena fellows.

Thursday May 22, 2025 10:00am - 12:00pm CEST
Z1.16

Data skills, Master class

1:00pm CEST

Masterclass: Hack your way into a big dataset with R (Masterclass ticket needed) LIMITED

Thursday May 22, 2025 1:00pm - 3:00pm CEST

Z1.16

A separate ticket is required to attend this masterclass. If you would like to attend but haven't yet purchased a ticket, please contact us at info@dataharvest.eu

R is one of the most useful programming languages in data journalism. You may have heard of it, maybe even tried it a little and found the learning curve too steep. If so, this session is for you.

We are going to spend the day looking at European Environment Agency’s EPRTR (European Pollutant Release and Transfer Register) data – it’s a lot of data, and some of it is quite messy. It contains dozens, probably hundreds, of potential lines of investigation to be explored – and that’s what we’re going to do.

By the end of the day, you will know how to import data in an R environment, filter it, reshape it, and interrogate it. You will be able to make some basic graphs. Above all, you will be on the way to finding stories in the day’s chosen data, and be able to take your script away and use it again, or adapt it to other datasets.

We will assume that you are familiar with spreadsheets, but that you have no knowledge of R. You will not need to install anything – everything will be run on cloud instances of R.

If you’re already advanced with R, it is still worth coming along to use and share what you know, to support others, and to learn something new.

(If you already have a dataset you want to work with – bring that too!)

Speakers

Leopold Salzenstein

Data Coordinator, Arena for Journalism in Europe

Investigative data journalist

Jonathan Stoneman

Arena for Journalism in Europe

Former BBC journalist, turned datajournalist, trainer, consultant. Works with Arena as Masterclass Coordinator for the Climate Arena fellows.

Thursday May 22, 2025 1:00pm - 3:00pm CEST
Z1.16

Data skills, Master class

3:30pm CEST

Masterclass: Hack your way into a big dataset with R (Masterclass ticket needed) LIMITED

Thursday May 22, 2025 3:30pm - 5:00pm CEST

Z1.16

A separate ticket is required to attend this masterclass. If you would like to attend but haven't yet purchased a ticket, please contact us at info@dataharvest.eu

R is one of the most useful programming languages in data journalism. You may have heard of it, maybe even tried it a little and found the learning curve too steep. If so, this session is for you.

We are going to spend the day looking at European Environment Agency’s EPRTR (European Pollutant Release and Transfer Register) data – it’s a lot of data, and some of it is quite messy. It contains dozens, probably hundreds, of potential lines of investigation to be explored – and that’s what we’re going to do.

By the end of the day, you will know how to import data in an R environment, filter it, reshape it, and interrogate it. You will be able to make some basic graphs. Above all, you will be on the way to finding stories in the day’s chosen data, and be able to take your script away and use it again, or adapt it to other datasets.

We will assume that you are familiar with spreadsheets, but that you have no knowledge of R. You will not need to install anything – everything will be run on cloud instances of R.

If you’re already advanced with R, it is still worth coming along to use and share what you know, to support others, and to learn something new.

(If you already have a dataset you want to work with – bring that too!)

Speakers

Jonathan Stoneman

Arena for Journalism in Europe

Former BBC journalist, turned datajournalist, trainer, consultant. Works with Arena as Masterclass Coordinator for the Climate Arena fellows.

Leopold Salzenstein

Data Coordinator, Arena for Journalism in Europe

Investigative data journalist

Thursday May 22, 2025 3:30pm - 5:00pm CEST
Z1.16

Data skills, Master class

1:15pm CEST

Beyond the pixels: the power of raster data in QGIS

Friday May 23, 2025 1:15pm - 2:30pm CEST

Z2.08

Manipulating and analyzing raster data can be intimidating, as it often appears more complex than vector data. However, raster data—such as satellite imagery or forest loss information—is essential for environmental and geographic storytelling. For example others, it enables journalists to assess vegetation health, visualize floods or droughts, and calculate deforested areas, even when true-colour satellite imagery is obscured by clouds.

In this hands-on session, participants will learn the key functions in QGIS needed to work with raster data. This includes loading raster layers, managing projections, setting band combinations (such as false color) for analysis, styling raster layers to enhance visibility, and performing raster calculations.

To attend this session, participants should have basic QGIS skills.

Before the session, please install QGIS on your laptops and make sure it is working properly. Download from: https://www.qgis.org/en/site/forusers/download.html

If you encounter any issues during installation, this guide may help: https://www.qgis.org/resources/installation-guide/

Speakers

Federico Acosta Rainis

Data Specialist, Pulitzer Center

Federico Acosta Rainis is a data specialist at the Pulitzer Center's Environment Investigations Unit. Previously an IT consultant, he transitioned into journalism a decade ago, working with La Nación in Argentina, where he has contributed to award-winning projects. Federico received... Read More →

Kuang Keng Kuek Ser

Data Editor, Pulitzer Center

Kuek Ser Kuang Keng is the data editor for the Pulitzer Center, where he supports investigative journalists of the Rainforest Investigations Network (RIN) and Ocean Reporting Network (ORN) to achieve their investigation goals. Based in Kuala Lumpur, Malaysia, Keng is a digital journalist... Read More →

Friday May 23, 2025 1:15pm - 2:30pm CEST
Z2.08

10:00am CEST

1:00pm CEST

3:30pm CEST

1:15pm CEST

1:15pm CEST

1:15pm CEST

3:00pm CEST

3:00pm CEST

3:00pm CEST

4:45pm CEST

4:45pm CEST

4:45pm CEST

9:30am CEST

9:30am CEST

9:30am CEST

11:15am CEST

11:15am CEST

11:15am CEST

1:45pm CEST

1:45pm CEST

1:45pm CEST

3:30pm CEST

3:30pm CEST

3:30pm CEST

5:15pm CEST

5:15pm CEST

5:15pm CEST

9:30am CEST

11:15am CEST

11:15am CEST