Date: 6 February 2020 and 22 May 2020
Instructor: Dr Alexandru Cernat
Fee: £195 (£140 for those from educational, government and charitable institutions).
CMI offers up to five subsidised places at a reduced rate of £60 per course day to research staff and students within Humanities at The University of Manchester. These places are awarded in order of application.
Please note: this is not guaranteed and is considered on a case by case basis. Please contact us for more information.
Cleaning data is one of the most important and time-consuming aspects of being a data analyst and researcher. Most courses typically teach statistical models or basic use of statistical software but few of these teach students how to efficiently clean real-world data.
This course will tackle this important topic. We will do this by introducing the tidyverse package in R. This is a large package that brings together some of the best tools for data cleaning and visualization in R. Inspired by the concept of “tidy data” the package enables users to import, merge, recode, restructure and plot data very efficiently. Half of the course will focus on data cleaning while the other half will focus on data visualization. The course will combine the use of lectures with hands-on practical sessions. In the practical part, we will be using real-world data to get the students used to the typical challenges they are expected to encounter when working with that. This will also help prepare them for working independently on their own data.
- To understand the concept of tidy data
- To learn how to efficiently connect multiple commands in R using the pipe operator
- To learn how to efficiently transform variables and prepare for analysis
- To learn how to work with factor variables To learn how to visualize data using R
- Filtering cases and selecting variables
- Working with factors
- Transforming variables
- Merging data
- Using the pipe operator
- Visualizing data in R
Basic knowledge of R and R-Studio
R for Data Science - Garrett Grolemund, Hadley Wickham - https://r4ds.had.co.nz/
About the instructor
- Dr Alexandru Cernat is a lecturer in Social Statistics at the University of Manchester. Previously he was Research Associate at the National Centre for Research Methods. He has been awarded a PhD in survey methodology from the University of Essex where he has investigated data quality in longitudinal studies. His research interests cover latent variable modelling, measurement error, missing data, survey methodology, methods for longitudinal data collection and analysis.