Learner Reviews & Feedback for Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform by Google Cloud

2,977 ratings
360 reviews

About the Course

This 1-week, accelerated course builds upon previous courses in the Data Engineering on Google Cloud Platform specialization. Through a combination of video lectures, demonstrations, and hands-on labs, you'll learn how to create and manage computing clusters to run Hadoop, Spark, Pig and/or Hive jobs on Google Cloud Platform. You will also learn how to access various cloud storage options from your compute clusters and integrate Google’s machine learning capabilities into your analytics programs. In the hands-on labs, you will create and manage Dataproc clusters using the Web Console and the CLI, and use those clusters to run Spark and Pig jobs. You will then create iPython notebooks that integrate with BigQuery and storage and utilize Spark. Finally, you will integrate the machine learning APIs into your data analysis.

Prerequisites
• Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience)
• Some knowledge of Python...

Top reviews


Apr 23, 2019

The course introduced me to Hadoop tools. I learned how easy it is to set up a Hadoop cluster using Dataproc. I will be sure to look for use cases that have implemented Hadoop and replicate them on GCP.


Dec 29, 2017

Really enjoyed it. I would have liked to spend more time with the APIs and on integrating with real-time web downloads. There are a few bugs and misprints, but it wasn't too hard to find them.

1 - 25 of 371 Reviews for Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform

By Joe S

May 12, 2019

I think the quizzes should have had more questions; there were many more cool things you could have asked.

By Justin E

Jun 04, 2019

This course is far longer than any of the other courses in the Data Engineering specialisation. I felt that it rehashed a lot of the first course as well.

By haiyang l

Jun 22, 2019

Wish it went into more depth about RDD transformations, which were a little bit confusing... A lab about how Dataproc can be used as an extension of BigQuery without overlapping functionality would be nice, as would one on how to convert RDDs to Pandas DataFrames and vice versa.

By Henio T

Jun 20, 2019


By Divyangana P

Jun 20, 2019

Exceptionally well designed course for beginners.

By Anand S

Jun 18, 2019

It was a good course providing synthesized learning in an easy manner

By Akash A

Jun 16, 2019

Great Course to learn fundamentals of Dataproc and PySpark for ML

By Gustavo N R

Jun 14, 2019


By tinku

Jun 13, 2019


By Shail S

Jun 10, 2019

Would be nice if there were more examples of applications

By Satoru I

Jun 09, 2019

Take time.

By Felipe O C A

Jun 06, 2019

It's even more awesome.

By Harry C

Jun 06, 2019

very good course

By Hiran W

Jun 06, 2019


By Maud B

Jun 06, 2019

Overall great! The lessons were easy to understand and had all the information I wanted.

The labs were easy to follow with pretty clear instructions.

But it was annoying not to know whether and why I was missing points, since the grade is only updated after a long time. I also lost the connection to the Qwiklabs tab at some point, which stopped the grading.

By Widiarto A

May 30, 2019

Awesome course! It really helps, especially the procedure on how to initiate a Dataproc cluster and submit a job to that cluster

By Deep C

May 30, 2019

It is a very informative course. I learned how to use Dataproc and how to create and run custom clusters to run Hadoop jobs. Thanks to Google for providing such an easy-to-follow course.

By Simon H

May 28, 2019

I feel confident I could spin up custom clusters and perform ML workloads

By Ashwin S

May 28, 2019


By Leonardo L G

May 27, 2019

More labs and you should create challenges!


May 24, 2019

Very Informative

By Dhiraj P

May 23, 2019

Great way to start with Big data.

By Monish K

May 21, 2019

The overall learning experience was good

By Nithin K

May 21, 2019



May 19, 2019

I am confident that I can build an ML model and customize Dataproc