Chevron Left
Back to Getting and Cleaning Data

Getting and Cleaning Data, Johns Hopkins University

5,669 ratings
896 reviews

About this Course

Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data....

Top reviews


Oct 26, 2016

This course is really a challenging and compulsory for any one who wants to be a data scientist or working in any sort of data. It teaches you how to make very palatable data-set fro ma messy data.


Feb 02, 2016

Easy, mostly instructive Course. The Assignments and quizzes are quite good, and illustrates the lessons very well.\n\nSee the videos for general presentation, but use the energy on the excersizes.

Filter by:

857 Reviews

By Nimalka Weerasuriya

Feb 13, 2019

Useful general course on tidying data and learning to import into R from various sources. Doesn't get into sequencing data import, but looks at other common ones

By Hathairat Wittayapusagul

Feb 09, 2019

A very Important course for working with R and data science in general.

By David Searl

Feb 05, 2019

Better than R Programming. Still very hard, but worthwhile.

By Ehab H Abdelhamid

Feb 05, 2019

This course was too hard for me compared to the first two in the program. Not sure whether it is because of my limited background in the subject area, or because of the abrupt shift in level from course 2 to 3.

By Parker Oakes

Feb 05, 2019

Fantastic course!

By Anthony J Maddalone

Feb 01, 2019

There is a huge disconnect with the material and the HAR dataset exercise. I would suggest that there is some help with smaller exercises to help explain how to complete it. Yes, I know you're supposed to do research to help figure out problems, and I have. As a matter of fact, I have taken other courses on data wrangling to be able to figure out this problem. Merging two datasets makes this problem very confusing. Why can't you help guide students through a similar problem, instead of throwing to the fire?

By Felix Ruecker

Jan 31, 2019

Ups and Downs.

Some presented functions are deprecated according to official documentation...

By Aman Ullah

Jan 30, 2019

Just Amazing, one should take this course for betterment

By Jigar Panchal

Jan 29, 2019

this course is a hands of this specialization in R.

this hands use to play with a raw and tidy data in R.

By Kristopher Barrett

Jan 28, 2019

More challenging than the "R Programming" course. The instructions for the final project were a little vague, but I think maybe this was intentional to promote discussion. Definitely give yourself plenty of time to complete the final project if you take this course. The principles of a tidy data set might seem like common sense, but in practice it's more challenging than you might think. I highly recommend taking this course even if you think you know what a tidy data set is.