Hortonworks is sponsoring a quick, hands-on introduction to key Apache projects. Come and listen to a short technical introduction and then get hands-on with your personal machine, ask questions, and leave with a working environment to continue your journey.

Apache Spark Crash Course

Introduction: This workshop will provide a hands-on introduction to Apache Spark and Apache Zeppelin in the cloud.

Format: A short introductory lecture about the Apache Spark components used in the lab, followed by a demo, lab time to work through the exercises and ask questions, and a Q&A session.

Objective: To provide a quick, hands-on introduction to Apache Spark. This lab will use the following Spark and Apache Hadoop components: Spark, Spark SQL, Apache Hadoop HDFS, Apache Hadoop YARN, Apache ORC, and Apache Zeppelin. You will learn how to move data into HDFS using Spark APIs, create Apache Hive tables, explore the data with Spark and Spark SQL, transform the data, and then issue some SQL queries.

Pre-requisites: Registrants must bring a laptop that can run the Hortonworks Data Cloud.

Check out our short video on Apache Spark Basics.

Data Science Crash Course

Introduction: This workshop will provide a hands-on introduction to basic Machine Learning techniques with Apache Spark ML in the cloud.

Format: A short introductory lecture on selected supervised and unsupervised Machine Learning techniques, followed by a demo, lab time to work through the exercises and ask questions, and a Q&A session.

Objective: To provide a quick, hands-on introduction to Machine Learning with Spark ML. In the lab, you will use the following components: Apache Zeppelin (a “Modern Data Science Toolbox”) and Apache Spark. You will learn how to analyze the data, structure the data, train Machine Learning models, and apply them to answer real-world questions.

Pre-requisites: Registrants must bring a laptop that can run the Hortonworks Data Cloud.

At this Crash Course, everyone will be assigned a cluster to try several workloads using Machine Learning, Spark and Zeppelin in the cloud.

Check out our short video on Basic Machine Learning Algorithms.

Apache NiFi Crash Course

Introduction: This workshop will provide a hands-on introduction to simple event data processing and data flow processing using a Sandbox on students’ personal machines.

Format: A short introductory lecture on Apache NiFi and the computing concepts used in the lab, followed by a demo, lab time to work through the exercises and ask questions, and a Q&A session.

Objective: To provide a quick, hands-on introduction to Apache NiFi. In the lab, you will install and use Apache NiFi to collect, conduct and curate data-in-motion and data-at-rest. You will learn how to connect to and consume streaming sensor data, filter and transform the data, and persist it to multiple data sources.

Pre-requisites: Registrants must bring a laptop with the latest VirtualBox installed. An image of the Hortonworks DataFlow (HDF) Sandbox will be provided.

GDPR Crash Course

Introduction: This workshop will provide an overview of GDPR provisions along with relevant use cases.

Format: A short introductory lecture on GDPR. We will then focus on consent, profiling, and the right to be forgotten (data erasure), and on how companies can establish processes for acquiring consent, automated data processing, and data discovery and classification using technologies such as Apache Atlas, Apache Ranger and Apache Hive.

Objective: To provide a quick, hands-on introduction to GDPR concepts. In the lab, you will practice the concepts using Apache Hadoop, Atlas, Ranger and Hive to process and classify data.

Pre-requisites: Registrants must bring a laptop with the latest VirtualBox installed. An image of the Hortonworks Data Platform (HDP) Sandbox will be provided.