Overview

DataWorks Summit: Ideas. Insights. Innovation.

Join us in Barcelona at the world’s premier big data event for everything data—DataWorks Summit. Come learn about the latest development in big data, AI, machine learning, and more while networking with industry peers and pioneers to learn how to apply open source technology to make data work and accelerate your digital transformation.

Agenda

Agenda at a Glance

MONDAY, MARCH 18
8:30 AM - 5:00 PM
Pre-event Training
TUESDAY, MARCH 19
8:30 AM - 5:00 PM
Pre-event Training
WEDNESDAY, MARCH 20
8:00 AM - 6:00 PM
Registration
9:00 AM - 10:30 AM
Opening Keynote
10:30 AM – 4:00 PM
Community Showcase
11:00 AM – 3:30 PM
Track Sessions and Crash Courses
4:00 PM – 5:30 PM
Birds of a Feather
5:30 PM – 7:00 PM
Sponsor Reception
THURSDAY, MARCH 21
8:0 AM - 6:00 PM
Registration
9:00 AM - 10:30 AM
Opening Keynote
10:30 AM – 4:00 PM
Community Showcase
11:00 AM – 5:30 PM
Track Sessions and Crash Courses
Tracks

Tracks

    • Artificial Intelligence and Data Science
    • Big Compute and Storage
    • Cloud, Big Data Architecture and Operations
    • Data Warehousing and Operational Data Stores
    • Governance and Security
    • IoT and Streaming Analytics
Artificial intelligence (AI) is transforming every industry. Data science and machine learning are opening new doors in process automation, predictive analytics, and decision optimization. This track offers sessions spanning the entire data science lifecycle: development, test, and production.

You’ll see examples of innovative analytics applications and systems for data visualization, statistics, machine learning, cognitive systems, and deep learning. We’ll show you how to use modern open source workbenches to develop, test, and evaluate advanced AI models before deploying them. You’ll hear from leading researchers, data scientists, analysts, and practitioners who are driving innovation in AI and data science.

Sample technologies: TensorFlow, Keras, Apache Spark, PyTorch, Apache MXNet, Theano, DL4J, R, scikit-learn, DSX, Apache Zeppelin
Apache Hadoop continues to drive data management innovation at a rapid pace. Hadoop 3.0 adds container management to YARN, an object store to HDFS, and more. This track presents these advances and describes projects in incubation and the industry initiatives driving innovation in and around the Hadoop platform.

You’ll learn about key projects like HDFS, YARN, and related technologies. You’ll interact with technical leads, committers, and experts who are driving the roadmaps, key features, and advanced technology research around what is coming next and the extended open source big compute and storage ecosystem.

Sample technologies: Apache Hadoop (YARN, HDFS, Ozone), Apache Kudu, Kubernetes, Apache BookKeeper
A hybrid, multi-cloud data architecture that optimizes information placement and processing between on-premises data centers and the cloud is critical to scale and flexibility. But it must also provide a global and integrated view of all your data with consistent operations, governance, and security.

This track provides the latest best practices on how to build modern data architectures. You’ll learn about key open source projects, including Apache Ambari, Cloudbreak, and related technologies and how they integrate with the latest cloud offerings to enable transformative changes. You’ll interact with technical leads, committers, and experts who are driving research, key features, and roadmaps in the extended open source big data architecture.

Sessions cover the full deployment lifecycle, how to set up and manage high-availability configurations, and how DevOps practices can help speed solutions into production. You’ll learn how to manage data across the edge, the data center, and the cloud. And you’ll hear cutting-edge best practices for large-scale deployments.

Sample technologies: Apache Ambari, Cloudbreak, DataPlane Service, AWS, Azure,GCP
Data engineers and architects use multiple engines to process data in the most appropriate way, from batch ETL, to interactive SQL, to low latency NoSQL. Sessions will cover the SQL engines and tools that help users to derive the most from their data on premises and in the cloud and enrich their enterprise data warehouse (EDW).

You’ll learn how NoSQL stores like Apache HBase are adding transactional capabilities that bring traditional operational data store (ODS) workloads to Hadoop and why data preparation is a key workload. You’ll meet Apache community rock stars and learn how these innovators are building the applications of the future.

Sample technologies: Apache Hive, Apache Tez, Apache ORC, Apache Druid, Apache HBase, Apache Phoenix
Your data lake contains a growing volume of diverse enterprise data, so a breach could be catastrophic. Privacy violations and regulatory infractions can damage your corporate image and long-term shareholder value. Government and industry regulations demand you properly secure and govern your data to assure compliance and mitigate risks. But as Hadoop and streaming applications emerge as a critical foundation of a modern data architecture, enterprises face new requirements for protection and governance.

In this track, you’ll learn about the key enterprise requirements for governance and security of the extended data plane. You’ll hear best practices, tips, tricks, and war stories on how to secure and govern your big data infrastructure.

Sample technologies: Apache Ranger, Apache Sentry, Apache Atlas, and Apache Knox
The rapid proliferation of sensors and connected devices is fueling an explosion of data. Streaming data allows algorithms to dynamically adapt to new patterns in data, which is critical in applications like fraud detection and stock price prediction. Deploying real-time machine learning models in data streams enables insights and interactions not previously possible.

In this track you’ll learn how to apply machine learning to capture perishable insights from streaming data sources and how to interface with devices at the “jagged edge.” Sessions present new strategies and best practices for real-time data ingestion and analysis. Presenters will show how to use these technologies to develop IoT solutions and how to combine historical with streaming data to build dynamic, real-time predictive systems for actionable insights.

Sample technologies: Apache Nifi, Apache Storm, Streams Messaging Manager, Streaming Analytics Manager, Apache Flink, Apache Spark Streaming, Apache Beam, Apache Pulsar and Apache Kafka
Packages & Passes

Packages & Passes

Conference Pass
Super Early Bird
Thru Oct 31, 2018
Early Bird
Nov 1 - Dec 31, 2018
Alumni
Thru March 18, 2019
Standard
Thru March 17, 2019
On-Site
Full Conference
Access to DataWorks Summit keynotes, breakouts, meals and events, including crash courses, community showcase, and the sponsor reception.

Pre-event training is not included.
10% off
€750
€450
€900
€975
Day Pass
Single day access to keynotes, breakouts, lunch and other DataWorks Summit events.

Pre-event training is not included.
N/A
N/A
N/A
€475
€475
 
Package
Full Conference*
Day Pass**
 
Super Early Bird
Thru Oct 31, 2018
10% off
N/A
Early Bird
Nov 1 - Dec 31, 2018
€750
N/A
Alumni
Thru March 18, 2019
€450
N/A
Standard
Thru March 17, 2019
€900
€475
On-Site
€975
€475
*Access to DataWorks Summit keynotes, breakouts, meals and events, including crash courses, community showcase, and the sponsor reception.

Pre-event training is not included.
**Single day access to keynotes, breakouts, lunch and other DataWorks Summit events.

Pre-event training is not included.
Venue & Travel Info

Venue & Travel Info

Location Icon
Centre de Convencions Internacional de Barcelona (CCIB)

Centre de Convencions Internacional de Barcelona, Plaça de Willy Brandt, Barcelona, Spain

+34 932 30 10 00

Visit Event Center Website

Centre de Convencions Internacional de Barcelona (CCIB)

Centre de Convencions Internacional de Barcelona, Plaça de Willy Brandt, Barcelona, Spain

View on Google Maps
Previous Sponsors

Last Years Sponsors