Managing Eurostat data with R
Athanassios Stavrakoudis
In this section we will examine Eurostat’s datasets. As a first example, we will compare government spending on research and development (R&D) across countries.
Load the required libraries:
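A minimal sketch of the setup, assuming the packages are the ones implied by the functions used later in this section (get_eurostat, the %>% pipe, ggplot, geom_label_repel and theme_economist):

```r
# Packages inferred from the functions used below; install any that are
# missing with install.packages() first.
library(eurostat)  # get_eurostat()
library(dplyr)     # %>%, filter(), mutate(), if_else()
library(ggplot2)   # ggplot(), geom_line()
library(ggrepel)   # geom_label_repel()
library(ggthemes)  # theme_economist()
```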
Here we are going to examine government spending on research and development. The Eurostat table tipsst10 holds the corresponding data.
We can read the dataset by calling the get_eurostat function and store the result in a variable (here named rnd):
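A minimal sketch of that call, assuming the default arguments of get_eurostat are adequate here (the original call may have passed further options):

```r
library(eurostat)

# Download the table by its Eurostat code; this needs an internet connection.
rnd <- get_eurostat("tipsst10")

# Quick look at the first rows
head(rnd)
```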
We can examine the column names of the table (data frame) rnd:
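One way to do this is sketched below. The plots further down rely on columns called unit, geo, time and values; the name of the time column in particular may differ between versions of the eurostat package, so it is worth checking the output in your own session.

```r
library(eurostat)

# Reuse rnd if it is already in the session, otherwise download it
if (!exists("rnd")) rnd <- get_eurostat("tipsst10")

names(rnd)  # column names
str(rnd)    # column types and a preview of the values
```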
Or we can examine the distinct values (the domain) of its columns:
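A minimal sketch using base R's unique, assuming the columns of interest are unit and geo (the two columns the filters below rely on); the name domains is only illustrative:

```r
library(eurostat)

# Reuse rnd if it is already in the session, otherwise download it
if (!exists("rnd")) rnd <- get_eurostat("tipsst10")

unique(rnd$unit)  # measurement units, e.g. PC_GDP
unique(rnd$geo)   # country codes, e.g. EL, PT

# Or collect the distinct values of several columns at once
domains <- lapply(rnd[c("unit", "geo")], unique)
domains
```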
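We can filter the dataset to restrict it to specific rows, for example spending as a percentage of GDP (unit PC_GDP) for Greece (EL) and Portugal (PT), and plot the result: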
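rnd %>%
  # keep only the % of GDP series for Greece and Portugal
  filter(unit == 'PC_GDP' & geo %in% c('EL', 'PT')) %>%
  ggplot(aes(x = time, y = values, colour = geo)) +
  # linewidth replaces the deprecated size argument for lines (ggplot2 >= 3.4.0)
  geom_line(linewidth = 1.2)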
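We can extend the comparison to more countries and, instead of a legend, label each line at its last observation:
rnd %>%
  filter(unit == 'PC_GDP' & geo %in% c('EL', 'PT', 'CZ', 'BE')) %>%
  # label only the most recent point of each series
  mutate(label = if_else(time == max(time), as.character(geo), NA_character_)) %>%
  ggplot(aes(x = time, y = values, colour = geo)) +
  geom_line(linewidth = 1.2) +
  geom_label_repel(aes(label = label), nudge_x = 1, na.rm = TRUE) +
  # hide the colour legend, since the lines are labelled directly
  scale_color_discrete(guide = "none") +
  theme_economist() +
  xlab("Time") + ylab("% GDP in RnD") +
  theme(text = element_text(size = 18))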