UP NEXT:
DataWorks Summit San Jose 2017
June 13 - 15 2017 • San Jose, USA

Apache Hadoop is a distributed system that enables distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. See hadoop.apache.org for more information on Apache Hadoop.