About this Course
121,872 recent views

100% online

Start instantly and learn at your own schedule.

Flexible deadlines

Reset deadlines in accordance to your schedule.

Intermediate Level


Subtitles: French, Portuguese (Brazilian), German, English, Spanish, Japanese...

Skills you will gain

Google Cloud DataprocApplication Programming Interfaces (API)Machine Learning

100% online

Start instantly and learn at your own schedule.

Flexible deadlines

Reset deadlines in accordance to your schedule.

Intermediate Level


Subtitles: French, Portuguese (Brazilian), German, English, Spanish, Japanese...

Syllabus - What you will learn from this course

2 hours to complete

Module 1: Introduction to Cloud Dataproc

16 videos (Total 52 min), 2 quizzes
16 videos
Deriving value from unstructured data7m
Approaches to working with Big Data4m
MapReduce and Hadoop origins5m
On prem Hadoop has a lot of overhead1m
Cloud Dataproc versus Hadoop alternatives2m
Creating a Dataproc cluster4m
Dataproc customization3m
Dataproc and the CLI40s
Lab 1: Overview11s
Lab 1: Demo and Review7m
Custom Machine Types3m
Preemptible VMs3m
Wrap up41s
1 practice exercise
Module 1 Quiz4m
3 hours to complete

Module 2: Running Dataproc jobs

13 videos (Total 51 min), 2 readings, 3 quizzes
13 videos
Lab 2: Demo and Review11m
Separation of Storage and Compute6m
Evolution of data processing5m
The importance of networking in data processing3m
Separating storage and compute with Spark1m
Submitting Spark jobs3m
Overview of Spark concepts2m
Lab Overview45s
Lab 3: Demo and Review8m
Module Wrap Up18s
2 readings
Cloud Dataproc Storage Services10m
Cloud Dataproc Automation Features10m
1 practice exercise
Module 2 Quiz2m
3 hours to complete

Module 3: Leveraging GCP

10 videos (Total 37 min), 3 readings, 3 quizzes
10 videos
Lab 4: Demo and Review4m
Cluster customization4m
Installing software on a Dataproc7m
Lab 5: Overview17s
Lab 5: Demo and Review8m
Wrap up58s
3 readings
Using Cloud Functions for Data Processing10m
Cloud Dataproc to BigQuery Connectors10m
Cloud Dataproc Workflow Animation10m
1 practice exercise
Module 3 Quiz2m
1 hour to complete

Module 4: Analyzing Unstructured Data

7 videos (Total 24 min), 2 quizzes
7 videos
Natural Language Processing close-up2m
Lab 6: Overview1m
Lab 6: Demo and Review10m
Wrap up16s
1 practice exercise
Module 4 Quiz2m
384 ReviewsChevron Right


started a new career after completing these courses


got a tangible career benefit from this course


got a pay increase or promotion

Top reviews from Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform

By RWApr 23rd 2019

The course has introduced me to hadoop tools. I have learned how easy it is to setup a hadoop cluster using Dataproc. Will sure look for cases that have implemented hadoop and replicate on GCP.

By LYMar 1st 2019

This is very handy course compared with other cloud platform where a customized environment was provided without concerning setup it on my own. This is very thoughtful and I'm very appreciated.

About Google Cloud

We help millions of organizations empower their employees, serve their customers, and build what’s next for their businesses with innovative technology created in—and for—the cloud. Our products are engineered for security, reliability, and scalability, running the full stack from infrastructure to applications to devices and hardware. Our teams are dedicated to helping customers apply our technologies to create success....

About the Data Engineering, Big Data, and Machine Learning on GCP Specialization

This five-week, accelerated online specialization provides participants a hands-on introduction to designing and building data processing systems on Google Cloud Platform. Through a combination of presentations, demos, and hand-on labs, participants will learn how to design data processing systems, build end-to-end data pipelines, analyze data and carry out machine learning. The course covers structured, unstructured, and streaming data. This course teaches the following skills: • Design and build data processing systems on Google Cloud Platform • Leverage unstructured data using Spark and ML APIs on Cloud Dataproc • Process batch and streaming data by implementing autoscaling data pipelines on Cloud Dataflow • Derive business insights from extremely large datasets using Google BigQuery • Train, evaluate and predict using machine learning models using Tensorflow and Cloud ML • Enable instant insights from streaming data This class is intended for developers who are responsible for: • Extracting, Loading, Transforming, cleaning, and validating data • Designing pipelines and architectures for data processing • Creating and maintaining machine learning and statistical models • Querying datasets, visualizing query results and creating reports >>> By enrolling in this specialization you agree to the Qwiklabs Terms of Service as set out in the FAQ and located at: https://qwiklabs.com/terms_of_service <<<...
Data Engineering, Big Data, and Machine Learning on GCP

Frequently Asked Questions

  • Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.

  • If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.

  • Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.

  • If you complete the course successfully, your electronic Course Certificate will be added to your Accomplishments page - from there, you can print your Course Certificate or add it to your LinkedIn profile.

  • This course is one of a few offered on Coursera that are currently available only to learners who have paid or received financial aid, when available.

More questions? Visit the Learner Help Center.