Chevron Left
Back to Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform

Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform, Google Cloud

2,301 ratings
291 reviews

About this Course

This 1-week, accelerated course builds upon previous courses in the Data Engineering on Google Cloud Platform specialization. Through a combination of video lectures, demonstrations, and hands-on labs, you'll learn how to create and manage computing clusters to run Hadoop, Spark, Pig and/or Hive jobs on Google Cloud Platform. You will also learn how to access various cloud storage options from their compute clusters and integrate Google’s machine learning capabilities into their analytics programs. In the hands-on labs, you will create and manage Dataproc Clusters using the Web Console and the CLI, and use cluster to run Spark and Pig jobs. You will then create iPython notebooks that integrate with BigQuery and storage and utilize Spark. Finally, you integrate the machine learning APIs into your data analysis. Pre-requisites • Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience) • Some knowledge of Python...

Top reviews


Dec 29, 2017

Really enjoyed it, woudl have liked to spend more time with the APIs and integrate with real time web downloads. There are a few bugs and misprints, but wasn't too hard to find them.


Aug 08, 2018

The course was really helpful to understand how to use google bigdata offering - dataproc for creating and managing Hadoop/hive/spark/pig and many more opensource bigdata products.

Filter by:

290 Reviews

By Leandro de Souza

Feb 15, 2019

easy to understand and simple to do the labs

By David Fachini

Feb 14, 2019

Spark, python and JSON is hard to understand. But the course is very interisting. Teacher is very good.

By Omair Karim

Feb 13, 2019

There should be more exercises for machine learning api

By Rao Madduri

Feb 13, 2019

Good learning of ML, dataproc and running jobs from dataproc. Separation of Compute and Storage is interesting.

By Sidharth Bolar

Feb 13, 2019

Although the course is exhaustive in terms of theory and hands on exercise

The exercises itself are not challenging enough and is a breeze to complete

The above is a major let down

By Kishore Kumar Pokuru

Feb 13, 2019

Good course with proper explanation. So we can habituate to GCP very well

By Subham Mitra

Feb 09, 2019

Lab session Instructions should be rechecked as I am getting problem accessing ip. Seems that there is some firewall issue.

By David Horgan

Feb 09, 2019

This is thoroughly practical course and enables you to run big data jobs using GCP on the actual data you have..

By AYANLOWO Babatunde Ayanlola Emmanuel

Feb 07, 2019


By Norma Nidia

Feb 07, 2019

Mucha informacion para poder comprender todos los temas