Getting Data from the Internet with Python
Dates: 6 and 7 July 2021 (afternoons only)
This course is taught over two afternoon sessions; participants will need to attend both sessions.
Instructor: Peter Smyth
This ‘Getting Data from the Internet with Python’ workshop is for any researcher who wants to collect data from the Internet and use the Python programming language to process and analyse it. No prior computational experience is required. This hands-on workshop provides an introduction to the Jupyter Python programming environment and demonstrates the use of the Python language for performing basic data collection using a variety of techniques from the Internet for analysis.
On completion of this workshop, the participants will be able to;
- Install Python and the Jupyter environment on their own machine
- Be familiar with the Jupyter Notebook environment
- Understand Python variables, data types, objects and simple programming constructs
- Understand how to get help and make use of 3rd party Python libraries like pandas
- Automate the downloading of files from the Internet
- Read csv based files into pandas dataframes
- Collect and extract JSON based data from APIs
- Perform basic Web page scraping
- Manipulate data within pandas dataframes
- Perform simple visualisations of data
After completion of this workshop, participants will be in a position to create their own Python environment and have confidence in using it to download data from the internet, in a variety of forms and using a variety of techniques; they will be able to use their own Python programs for a variety of simple data analysis tasks.
About the instructor
Peter Smyth is a Research Associate at the University of Manchester, based in the Cathie Marsh Institute. He has spent 35 years working in IT at various large and small commercial organisations before taking an MSc in Big Data Analytics at Sheffield Hallam University and moving into academia. In his previous roles he used any convenient programming environment to hand to solve problems. Now he teaches a variety of programming languages to help others to do the same.
He is a qualified Data and Software Carpentry instructor.