Hi, I'm Mauro Pazienza, Cloudera certified instructor at PUE
and I'm going to present the data scientist training.
It´s a four day course and is very awesome course.
We will cover topics, very important topics, like the ingestion of data,
our cluster, preparing data, machine learning algorithm.
We will learn the new tool Cloudera, Data Science Workbench.
It is a self-service infrastructure from Cloudera
for analyzing data,
printing data and
arranging the data.
The Cloudera Data Science Workbench provides
the infrastructure for running your scripting Python
or Scala or R or if you want in another language.
The most important thing is our course, is a four day course,
and during the course I will guide you step by step with Python scripts.
It's very important that our attendees
the knowledge about Linux operating system,
Python very basic infrastructure and R.
