This course is part of the Mathematics for Machine Learning Specialization

4.0

(730 ratings)

18,997 already enrolled!

Offered By

Mathematics for Machine Learning Specialization

Imperial College London

About this Course

75,348

This intermediate-level course introduces the mathematical foundations to derive Principal Component Analysis (PCA), a fundamental dimensionality reduction technique. We'll cover some basic statistics of data sets, such as mean values and variances, we'll compute distances and angles between vectors using inner products and derive orthogonal projections of data onto lower-dimensional subspaces. Using all these tools, we'll then derive PCA as a method that minimizes the average squared reconstruction error between data points and their reconstruction.
At the end of this course, you'll be familiar with important mathematical concepts and you can implement PCA all by yourself. If you’re struggling, you'll find a set of jupyter notebooks that will allow you to explore properties of the techniques and walk you through what you need to do to get on track. If you are already an expert, this course may refresh some of your knowledge.
The lectures, examples and exercises require:
1. Some ability of abstract thinking
2. Good background in linear algebra (e.g., matrix and vector algebra, linear independence, basis)
3. Basic background in multivariate calculus (e.g., partial derivatives, basic optimization)
4. Basic knowledge in python programming and numpy
Disclaimer: This course is substantially more abstract and requires more programming than the other two courses of the specialization. However, this type of abstract thinking, algebraic manipulation and programming is necessary if you want to understand and develop machine learning algorithms.

Start instantly and learn at your own schedule.

Reset deadlines in accordance to your schedule.

Suggested: 4 weeks of study, 4-5 hours/week...

Subtitles: English

Python ProgrammingPrincipal Component Analysis (PCA)Projection MatrixMathematical Optimization

Start instantly and learn at your own schedule.

Reset deadlines in accordance to your schedule.

Suggested: 4 weeks of study, 4-5 hours/week...

Subtitles: English

Week

1Principal Component Analysis (PCA) is one of the most important dimensionality reduction algorithms in machine learning. In this course, we lay the mathematical foundations to derive and understand PCA from a geometric point of view. In this module, we learn how to summarize datasets (e.g., images) using basic statistics, such as the mean and the variance. We also look at properties of the mean and the variance when we shift or scale the original data set. We will provide mathematical intuition as well as the skills to derive the results. We will also implement our results in code (jupyter notebooks), which will allow us to practice our mathematical understand to compute averages of image data sets....

8 videos (Total 27 min), 6 readings, 4 quizzes

Welcome to module 141s

Mean of a dataset4m

Variance of one-dimensional datasets4m

Variance of higher-dimensional datasets5m

Effect on the mean4m

Effect on the (co)variance3m

See you next module!27s

About Imperial College & the team5m

How to be successful in this course5m

Grading policy5m

Additional readings & helpful references5m

Set up Jupyter notebook environment offline10m

Symmetric, positive definite matrices10m

Mean of datasets15m

Variance of 1D datasets15m

Covariance matrix of a two-dimensional dataset15m

Week

2Data can be interpreted as vectors. Vectors allow us to talk about geometric concepts, such as lengths, distances and angles to characterise similarity between vectors. This will become important later in the course when we discuss PCA. In this module, we will introduce and practice the concept of an inner product. Inner products allow us to talk about geometric concepts in vector spaces. More specifically, we will start with the dot product (which we may still know from school) as a special case of an inner product, and then move toward a more general concept of an inner product, which play an integral part in some areas of machine learning, such as kernel machines (this includes support vector machines and Gaussian processes). We have a lot of exercises in this module to practice and understand the concept of inner products....

8 videos (Total 36 min), 1 reading, 5 quizzes

Dot product4m

Inner product: definition5m

Inner product: length of vectors7m

Inner product: distances between vectors3m

Inner product: angles and orthogonality5m

Inner products of functions and random variables (optional)7m

Heading for the next module!35s

Basis vectors20m

Dot product10m

Properties of inner products20m

General inner products: lengths and distances20m

Angles between vectors using a non-standard inner product20m

Week

3In this module, we will look at orthogonal projections of vectors, which live in a high-dimensional vector space, onto lower-dimensional subspaces. This will play an important role in the next module when we derive PCA. We will start off with a geometric motivation of what an orthogonal projection is and work our way through the corresponding derivation. We will end up with a single equation that allows us to project any vector onto a lower-dimensional subspace. However, we will also understand how this equation came about. As in the other modules, we will have both pen-and-paper practice and a small programming example with a jupyter notebook....

6 videos (Total 25 min), 1 reading, 3 quizzes

Projection onto 1D subspaces7m

Example: projection onto 1D subspaces3m

Projections onto higher-dimensional subspaces8m

Example: projection onto a 2D subspace3m

This was module 3!32s

Full derivation of the projection20m

Projection onto a 1-dimensional subspace25m

Project 3D data onto a 2D subspace40m

Week

4We can think of dimensionality reduction as a way of compressing data with some loss, similar to jpg or mp3. Principal Component Analysis (PCA) is one of the most fundamental dimensionality reduction techniques that are used in machine learning. In this module, we use the results from the first three modules of this course and derive PCA from a geometric point of view. Within this course, this module is the most challenging one, and we will go through an explicit derivation of PCA plus some coding exercises that will make us a proficient user of PCA. ...

10 videos (Total 52 min), 5 readings, 2 quizzes

Problem setting and PCA objective7m

Finding the coordinates of the projected data5m

Reformulation of the objective10m

Finding the basis vectors that span the principal subspace7m

Steps of PCA4m

PCA in high dimensions5m

Other interpretations of PCA (optional)7m

Summary of this module42s

This was the course on PCA56s

Vector spaces20m

Orthogonal complements10m

Multivariate chain rule10m

Lagrange multipliers10m

Did you like the course? Let us know!10m

Chain rule practice20m

4.0

147 Reviewsstarted a new career after completing these courses

got a tangible career benefit from this course

got a pay increase or promotion

By JS•Jul 17th 2018

This is one hell of an inspiring course that demystified the difficult concepts and math behind PCA. Excellent instructors in imparting the these knowledge with easy-to-understand illustrations.

By JV•May 1st 2018

This course was definitely a bit more complex, not so much in assignments but in the core concepts handled, than the others in the specialisation. Overall, it was fun to do this course!

Imperial College London is a world top ten university with an international reputation for excellence in science, engineering, medicine and business. located in the heart of London. Imperial is a multidisciplinary space for education, research, translation and commercialisation, harnessing science and innovation to tackle global challenges.
Imperial students benefit from a world-leading, inclusive educational experience, rooted in the College’s world-leading research. Our online courses are designed to promote interactivity, learning and the development of core skills, through the use of cutting-edge digital technology....

For a lot of higher level courses in Machine Learning and Data Science, you find you need to freshen up on the basics in mathematics - stuff you may have studied before in school or university, but which was taught in another context, or not very intuitively, such that you struggle to relate it to how it’s used in Computer Science. This specialization aims to bridge that gap, getting you up to speed in the underlying mathematics, building an intuitive understanding, and relating it to Machine Learning and Data Science.
In the first course on Linear Algebra we look at what linear algebra is and how it relates to data. Then we look through what vectors and matrices are and how to work with them.
The second course, Multivariate Calculus, builds on this to look at how to optimize fitting functions to get good fits to data. It starts from introductory calculus and then uses the matrices and vectors from the first course to look at data fitting.
The third course, Dimensionality Reduction with Principal Component Analysis, uses the mathematics from the first two courses to compress high-dimensional data. This course is of intermediate difficulty and will require basic Python and numpy knowledge.
At the end of this specialization you will have gained the prerequisite mathematical knowledge to continue your journey and take more advanced courses in machine learning....

When will I have access to the lectures and assignments?

Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

What will I get if I subscribe to this Specialization?

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

What is the refund policy?

Is financial aid available?

More questions? Visit the Learner Help Center.

Coursera provides universal access to the world’s best education,
partnering with top universities and organizations to offer courses online.