About event
Master the R/Tidyverse is a two-day workshop focused on boosting your data analysis and visualization expertise. We will use R (https://www.r-project.org/), a free environment for statistical computing, and in particular the Tidyverse collection of packages (ggplot2, dplyr, tidyr, etc.). We will cover all the skills needed in a typical research project: from reading your data into R and choosing a tidy data format, through data transformations, statistical modelling and visualizations, to communicating results in a reproducible report (see the figure below).
You definitely do not have to be a coder or a statistician to enjoy the course; you just need to be motivated to learn new things. We welcome both beginners and intermediate R users who want to deepen their Tidyverse knowledge. For CEITEC participants, the course is offered for free.
This two-day workshop should give you thorough training in the family of R packages called Tidyverse. You will master publication-ready visualizations with ggplot2, data transformation and summarization with dplyr and tidyr, working with dates, character strings, and untidy data formats, generating reproducible reports with R Markdown, reading data with the fast readr functions, and iterating over data with the purrr package.
We will proceed through a series of exercises. After the workshop, you should be fluent in R data wrangling, exploration and modelling, and be prepared for more advanced topics. Every participant gets a collection of RStudio cheat-sheets.
The course is heavily inspired by Garrett Grolemund’s course Master The Tidyverse. Due to limited space, the workshop will be strictly limited to 20 participants.
Schedule:
Wed 11 Dec
Morning Session (A35/211)
09:00 - 09:30 Introduction
09:30 - 11:30 Visualization (ggplot2)
Lunch (11:30-12:30)
Afternoon Session (A11/211)
12:30 - 15:30 Data manipulation 1 (readr, dplyr)
15:30 - 17:00 Reporting (rmarkdown)
Thu 12 Dec
Morning Session (A35/211)
09:00 - 11:30 Data manipulation 2 (dplyr, tidyr)
Lunch (11:30-12:30)
Afternoon Session (A11/228)
12:30 - 14:30 Data types (forcats, stringr, lubridate)
14:30 - 16:00 Tidy data and iterating through them (purrr)
16:00 - 17:00 Tidy modelling (broom, modelr)
Every participant must bring their own laptop. Participants are advised to install a recent version of R (https://cloud.r-project.org/), RStudio (https://rstudio.com/products/rstudio/download/) and Tidyverse (https://www.tidyverse.org/). We will have cloud instances ready for anyone with installation problems.
Petr Šimeček:
I work as a data scientist at the startup studio Central European Ai institute (CEAi) in Brno, and previously worked at Google (California, US). For many years before that I was a bioinformatician, both at the Institute of Molecular Genetics (Prague) and at The Jackson Laboratory (Maine, US), focused mostly on mouse genetic data. I am an R and Python enthusiast (I have worked with R for more than 15 years and authored several packages).
Karla Fejfarova:
I work as a biostatistician at the Czech Centre of Phenogenomics (ÚMG AV ČR, BIOCEV, Prague), which in practice means helping other scientists with computer-related tasks: anything from tweaking Excel sheets and making plots to doing data analyses and training neural networks. The only thing I haven't been asked to do is fix a printer paper jam. You can usually find me staring at my computer screen or running in the woods.