Join us in Barcelona at the world’s premier big data event! Don’t miss this chance to hear about the latest developments in AI, machine learning, IoT, cloud, and more in over 70 track sessions, crash courses, and birds-of-a-feather sessions. Plus, network with industry peers and pioneers and get answers to your top questions while learning how to apply open source technology to accelerate your digital transformation initiatives. Register today!
Sophie is a futurist whose research entails meeting weirdos and troublemakers in off-the-beaten-track labs, makerspaces, garages around the globe - Shenzhen, Seoul, Detroit, Mumbai. As part of her research, she consults for exec teams and boards of large companies on understanding the explosive new technologies defining the new economy. Sophie is also CEO of a data and AI company, 1715 Labs, that she's currently spinning out of the Astrophysics department at Oxford University with her academic co-founder. This follows a career building businesses for WIRED magazine, for Singularity University at the NASA Research Park in Silicon Valley, and prior to California, the interdisciplinary Oxford Martin School at Oxford University, where Sophie raised more than $120m of research investment.
Since March 2017, Peter Weckesser serves as Airbus Defence and Space´s Digital Transformation Officer. He joined Airbus from Siemens, where he has been the COO of Product Lifecycle Management, leading the Siemens IoT and Digital Enterprise business and activities. Prior to this, he held various executive-level positions at Siemens, being CEO of Industry Services and CEO of Value Service Business Unit, as well as Vice-President of “Human Machine Interface”.
Peter Weckesser holds a degree in Physics and a PhD degree in Computer Science, both from the University of Karlsruhe (Germany), although he also spent a year studying at the Armstrong State University in the USA as part of his education in Informatics.
Hilary is general manager, machine learning, at Cloudera. She was the founder and CEO of Fast Forward Labs, an applied machine learning research company that Cloudera acquired in 2017. She also serves as data scientist in residence at Accel Partners, a leading global venture capital firm. Previously, Hilary was chief scientist at bitly. She co-hosts DataGotham, a conference for New York's home-grown data community, and co-founded HackNY, a non-profit that helps engineering students find opportunities in New York's creative technical economy. She is on the board of the Anita Borg Institute and an advisor to several companies, including Sparkfun Electronics and Wonder. Hilary served on Mayor Bloomberg’s Technology Advisory Board and is a member of the Brooklyn hacker collective NYC Resistor.
Dr. Kerem Tomak brings more than 15 years of experience as a marketing scientist and executive. He comes from Sears Holdings where he was responsible for the digital marketing of the retail trade company as Chief Marketing and Analytics Officer. He studied mathematics, economics and information systems in Turkey and the USA. Dr. Tomak embarked on his professional career as an assistant professor at the University of Texas, Austin.
He has expertise in the areas of omnichannel and cross-device attribution, price and revenue optimization, assessing promotion effectiveness, yield optimization in digital marketing and real-time analytics. He has managed mid and large-size analytics and digital marketing teams in Fortune 500 companies and delivered large-scale analytics solutions for marketing and merchandising units. His out-of-the-box thinking and problem-solving skills led to 4 patent awards and numerous academic publications. He is also a sought-after speaker in Big Data and BI Platforms for Analytics.
Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to analyze the risk of financial portfolios. He worked in finance, then more than a decade ago moved to become the Chief Scientist of a tech company in Santa Monica, California doing everything data (analysis, modeling, data visualization, machine learning, data infrastructure etc). He is the founder/organizer of several meetups in the Los Angeles area (R, data science etc) and the data science community website datascience.la. He is the author of a well-known machine learning benchmark on github (1000+ stars), a frequent speaker at conferences (keynote/invited at KDD, R-finance, Crunch, eRum and contributed at useR!, PAW, EARL etc.), and he has developed and taught graduate data science and machine learning courses as a visiting professor at two universities (UCLA in California and CEU in Europe).
I am an employee of T-Mobile Austria (TMA), working as a data scientist for both commercial and network related use cases. I have profound experience of designing and implementing both analytical and machine learning algorithms in Apache Hadoop ecosystem.
My interest in data modeling started six years ago when I got the chance to work on experimental data during pursuit of my PhD degree. Unlike structural modeling where the true nature of the data generating process can be modeled in close form, majority of the processes in real world are too complex to be understood in their entirety. Consequently, I gained expertise in several discriminative models, such as dynamic stochastic models, time-series analysis and the state-space modeling.
From the beginning of my career as a data scientist at TMA, I have made adequate use of machine learning and applied research in market science and mobile network, which led to several data science projects with attribution to high business value. Since industry demands the end-to-end working solution but not just a prototype model, so I have mastered several programming languages and have served as the data engineer for most of my use-cases as well.
A brief list of my prowess and skills along with the projects and publications can be found on my linkedIn profile at: https://www.linkedin.com/in/wasifmasood/
Holden is a transgender Canadian open source developer advocate @ Google with a focus on Apache Spark, BEAM, and related "big data" tools. She is the co-author of Learning Spark, High Performance Spark, and another Spark book that's a bit more out of date. She is a commiter on and PMC on Apache Spark and committer on SystemML & Mahout projects. She was tricked into the world of big data while trying to improve search and recommendation systems and has long since forgotten her original goal.
Suneel is a Member of Apache Software Foundation and is a Committer and PMC on Apache Mahout, Apache OpenNLP, Apache Streams. He's presented in the past at Flink Forward, Hadoop Summit, Berlin Buzzwords, Machine Learning Conference, Big Data Tech Warsaw and Apache Big Data.
Product Management lead at Uber with a focus on Data Platforms and Infra. I manage Uber's Storage, Analytics, BI, and Machine Learning product lines.
Currently working on Banco Santander at Innovation & Architecture of Corporate Investment Banking. I've been working on Santander Group for the last 13 years. Previously I worked at IDG Communications Spain as IT Manager for 5 years. My main role was the development and improvement of Web strategy of the group. I also wrote more than 20 articles in the publications PC World, Computerworld and Macworld.
Computer Degree at Universidad Ponitifica de Salamanca.
Uwe Weber is working since almost 20 years in the IT environment and became a Big Data Engineer at Telefónica in 2014. He initially set up Telefónica’s Hadoop environment and infrastructure and supports business departments to utilize the “new world”.
Owen O'Malley is a co-founder and technical fellow at Hortonworks, a rapidly growing company (25 to 1,000 employees in 5 years), which develops the completely open source Hortonworks Data Platform (HDP). HDP includes Hadoop and the large ecosystem of big data tools that enterprises need for their data analytics. Owen has been working on Hadoop since the beginning of 2006 at Yahoo, was the first committer added to the project, and used Hadoop to set the Gray sort benchmark in 2008 and 2009. In the last 8 years, he has been the architect of MapReduce, Security, and now Hive. Recently he has been driving the development of the ORC file format and adding ACID transactions to Hive. Before working on Hadoop, he worked on Yahoo Search's WebMap project, which was the original motivation for Yahoo to work on Hadoop. Prior to Yahoo, he wandered between testing (UCI), static analysis (Reasoning), configuration management (Sun), and software model checking (NASA). He received his PhD in Software Engineering from University of California, Irvine.
Jay is a final year student at King’s College London studying Computer Science. She joined Hotels.com in the Big Data Platform team for her industrial placement year where she spent time working with Apache Hive, modularization techniques for SQL, and mutation testing tools.
Solution Architect with more than 15 years experience in DWH and BI and last years also Big Data environments.
Designed many data warehouses including a Customer Intelligence System, Marketing Data Warehouse, Enterprise Data Warehouse and Basel II data warehouse. Currently responsible for the architecture of Data Lake, Data Factory and Data Lab based on Cloudera and HortonWorks technology.
With more than fifteen years of experience in Java technologies, Monica is currently responsible for defining Big Data architectures for Engineering’s Data and Analytics Center of Excellence. She deals mainly with leading the Group in the development of projects and POCs, offering consulting services to clients and assisting the pre-sales phase by bringing in the contribution on Big Data technologies. She runs internal courses and for clients and also carries out activities to support the dissemination of Engineering’s expertise in this context, both nationally and internationally.
Christopher Crosbie has over fifteen years of experience developing and deploying data technology in enterprise environments. He is currently on the Cloud Partner Engineering team at Google where he serves a trusted advisor to software vendors that build Data, Analytics and ML solutions on the Google Cloud platform.
Previous to joining Google, Chris was a development manager at Amazon and before that he headed up the data science team at Memorial Sloan Kettering Cancer Center where he implemented the enterprise Hortonworks architecture and strategy. Chris started his career as a biostatistics application engineer at the NSABP, a not-for-profit clinical trials cooperative group supported by the National Cancer Institute. He holds an MPH in Biostatistics and an MS in Information Science.
Vladimir Kroz is an architect at Search group in WalmartLabs, where he is building next generation of e-commerce search for walmart.com. Vladimir works on large scale low latency search, big data and machine learning systems, and has acute passion in large scale computing and AI. Prior to Walmart he has led engineering teams at number of Fortune 500 international companies in e-commerce and telecom field. He also co-founded real-time data integration company Wisdomforce. Vladimir holds Master’s degree in Computer Information Systems and Electrical Engineering.
Dor has over a decade of experience developing big data products for security industries, financial markets and banking industries. His research on metric learning and cost-sensitive learning has earned him publications in NIPS, AISTATS and a monetary prize in Cha-Learn competitions. As a senior data scientist at ING Bank, he is involved with multiple projects modelling consumer and market behavior, optimizing business and IT processes and contributing to the data science way-of-working, rapid exploration and continuous delivery processes.
I am currently a Big Data Delivery Lead at Optum (UnitedHealth Group) and based in Dublin (Ireland). Me and my teams deal with projects in the PI (fraud, waste and abuse, claims processing) and the healthcare space. I worked previously at IBM Ireland, where I switched my career path from Test Automation to Analytics and Machine Learning.
I am passionate about coding, Big Data, AI/ML/DL, test automation, Open Source, DevOps and cooking (home made pizza is my speciality).
I share my tech thoughts through my blog (http://googlielmo.blogspot.ie/) and DZone (https://dzone.com/users/2532948/virtualramblas.html) where I am a Golden Member.
During 2018 I have presentend to several international conferences such as DataWorks Summit Berlin, Google I/O Extended, Predictive Analytics World for Industry 4.0 and many others.
My first book "Hands-on Deep Learning with Apache Spark" (https://tinyurl.com/y7d98s64) is going to be released in December 2018.
Yakir Buskilla is a Director of Big Data at the Nielsen Marketing Cloud. His fields of interest are Big Data solutions and large scale machine learning.
Yakir Buskilla is a Director of Big Data at the Nielsen Marketing Cloud. His fields of interest are Big Data solutions and large scale machine learning.
All cancellation requests must be submitted in writing to the DataWorks Summit Registration Team at email@example.com. If a cancellation is received on or before Friday, February 22, 2019 (5:00pm PDT), your registration fee will be refunded, less a €100 processing fee. After this date, cancellation requests will be processed without a refund.
Thank you, your newletter signup has been successfully submitted.
Centre de Convencions Internacional de Barcelona, Plaça de Willy Brandt, Barcelona, Spain
+34 932 30 10 00
Delivering cloud-native machine learning and analytics from the Edge to AI.
Cloudera and Hortonworks have joined forces, combining significant investments in differentiated, yet complementary, areas—real-time streaming at the Edge, enterprise-grade data warehousing, and a new platform to industrialize AI.
Unlock the power of any data, running in any cloud, from the Edge to AI, on a 100 percent open-source platform with an enterprise data cloud that’s multi-cloud and hybrid cloud, with the flexibility to perform machine learning and analytics with your data, your way, with no lock-in.Learn More
IBM is a globally integrated technology and consulting company headquartered in Armonk, New York. With operations in more than 170 countries, IBM attracts and retains some of the world’s most talented people to help solve technology problems and provide an edge for businesses, governments and non-profits. Innovation is at the core of IBM’s strategy. The company has reinvented itself through multiple technology eras and economic cycles, creating differentiating value for its clients. Today, as the IT industry is fundamentally changing at an unprecedented pace, IBM is much more than a “hardware, software, services” company. IBM is now emerging as a cognitive solutions and cloud platform company. Cognitive solutions powered by analytics and the cloud are the key to clients’ digital transformation. This transformation requires breakthroughs at every level of the enterprise IT foundation, from processors and computer design to storage, applications and analytics tools, networking and the integration layer. IBM solutions are built with open technologies and designed for mission-critical applications, offering a comprehensive platform for cognitive workloads.Learn More
Attunity, voted Hortonworks ISV Partner of the Year, provides modern data integration software with change data capture technology, that efficiently delivers data in real-time and with no manual coding. Attunity software, serving half of the Fortune 100, non-disruptively replicates data from production sources such as Oracle, mainframe and SAP across database/data warehouse, data lake, streaming and cloud architectures. Attunity also accelerates data lake pipelines by automating the creation, updates and provisioning of analytics-ready data.
WANdisco is shaping the future of data infrastructure with our ground-breaking LIVE DATA Platform, enabling companies to put all their data to work for the business – all the time, at any scale. We make data always available, accurate and protected across environments, supporting exponential data growth within the same budget.Learn More
Unravel is the APM (Application Performance Management) platform for big data. Unravel accelerates application performance, optimizes multi-tenant resource usage, and provides operations insights — all in a single location. Unravel supports Big Data systems such as Hadoop, Spark, Kafka, NoSQL for both on-premises and cloud environments. Customers include leading Big Data practitioners such as Sony, Box, Autodesk and YP.comLearn More
TechTarget offers data-driven marketing services to business-to-business technology vendors. It uses purchase intent data gleaned from the readership of its 140 + technology focused web sites to help tech vendors reach buyers actively researching relevant IT products and services.Learn More
The Big Data Beard Inc. is a creative media company authoring podcasts, blogs, and videos aimed at sharing knowledge about the innovations happening in Big Data, Machine Learning, Data Science, Artificial Intelligence, and the Internet of Things.Learn More