Hortonworks is sponsoring a quick, hands-on introduction to key Apache projects. Come and listen to a short technical introduction and then get hands-on with your personal machine, ask questions, and leave with a working environment to continue your journey.

Apache Nifi Crash Course

Introduction: This workshop will provide a hands on introduction to simple event data processing and data flow processing using a Sandbox on students’ personal machines.

Format: A short introductory lecture to Apache NiFi and computing used in the lab followed by a demo, lab exercises and a Q&A session. The lecture will be followed by lab time to work through the lab exercises and ask questions.

Objective: To provide a quick and short hands-on introduction to Apache NiFi. In the lab, you will install and use Apache NiFi to collect, conduct and curate data-in-motion and data-at-rest with NiFi. You will learn how to connect and consume streaming sensor data, filter and transform the data and persist to multiple data sources.

Pre-requisites: Registrants must bring a laptop that has the latest VirtualBox installed and an image for Hortonworks DataFlow (HDF) Sandbox will be provided.


Speakers: Andy LoPresto, Timothy Spann

Location: Convention Hall I - D Crash Course

Apache Nifi Crash Course Video


Apache Nifi Crash Course Slides

Apache Spark Crash Course

Introduction: This workshop will provide a hands-on introduction to Apache Spark and Apache Zeppelin  in the cloud.

Format: A short introductory lecture on Apache Spark covering core modules (SQL, Streaming, MLlib, GraphX) followed by a demo, lab exercises and a Q&A session. The lecture will be followed by lab time to work through the lab exercises and ask questions.

Objective: To provide a quick and short hands-on introduction to Apache Spark. This lab will use the following Spark and Apache Hadoop components: Spark, Spark SQL, Apache Hadoop HDFS, Apache Hadoop YARN, Apache ORC, and Apache Ambari Zepellin. You will learn how to move data into HDFS using Spark APIs, create Apache Hive tables, explore the data with Spark and Spark SQL, transform the data and then issue some SQL queries.df

Lab pre-requisites: Registrants must bring a laptop with a Chrome or Firefox web browser installed (with proxies disabled). Alternatively, they may download and install an HDP Sandbox as long as they have at least 16GB of RAM available (Note that the sandbox is over 10GB in size so we recommend downloading it before the crash course).

Checkout our short video on Apache Spark Basics.


Speakers: Robert Hryniewicz

Location: Convention Hall I - D Crash Course

Apache Spark Crash Course Video


Apache Spark Crash Course Slides

Data Science Crash Course

Introduction: This workshop will provide a hands on introduction to basic Machine Learning techniques with Apache Spark ML using the cloud.

Format: A short introductory lecture on a select important supervised and unsupervised Machine Learning techniques followed by a demo, lab exercises and a Q&A session. The lecture will be followed by lab time to work through the lab exercises and ask questions.

Objective: To provide a quick and short hands-on introduction to Machine Learning with Spark ML. In the lab, you will use the following components: Apache Zeppelin (a “Modern Data Science Toolbox”) and Apache Spark. You will learn how to analyze the data, structure the data, train Machine Learning models and apply them to answer real-world questions.

Pre-requisites: Registrants must bring a laptop that can run the Hortonworks Data Cloud.

At this Crash Course everyone will have a cluster assigned to them to try several workloads using Machine Learning, Spark and Zeppelin on the cloud.

Checkout our short video on Basic Machine Learning Algorithms.


Speakers: Robert Hryniewicz

Location: Convention Hall I - D Crash Course

Data Science Crash Course Video


Data Science Crash Course Slides

GDPR Crash Course

Introduction: This workshop will  provide an overview of GDPR provisions along with relevant use cases.

Format: A short introductory lecture on GDPR.  Then we will focus on the topics of consent, profiling and right to be forgotten or data erasure and how companies can establish processes for acquiring consent, automated data processing, data discovery and classification using technologies such as Apache Atlas, Apache Ranger and Apache Hive.

 

Objective: To provide a quick and hands-on introduction to GDPR concepts.  In the lab you practice the concepts using Apache Hadoop, Atlas, Ranger and Hive to process and classify data.

Pre-requisites: Registrants must bring a laptop that has the latest VirtualBox installed and an image for Hortonworks Data Platform (HDP) Sandbox will be provided.


Speakers: Ali Bajwa, Srikanth Venkat

Location: Convention Hall I - D Crash Course

GDPR Crash Course Video


GDPR Crash Course Slides