This 1 day course details the business value for, and provides a technical overview of, Apache Hadoop. It includes high-level information about concepts, architecture, operation, and uses of the Hortonworks Data Platform (HDP) and the Hadoop ecosystem. The course serves as an optional primer for those who plan to attend a hands-on, instructor-led course.
Data architects, data integration architects, managers, C-level executives, decision makers, technical infrastructure team, and Hadoop administrators or developers who want to understand the fundamentals of Big Data and the Hadoop ecosystem.
No previous Hadoop or programming knowledge is required. Students are encouraged to bring their wi-fi enabled laptop pre-loaded with the Hortonworks Sandbox should they want to duplicate demonstrations on their own machine.
Learn Data Science techniques and best practices leveraging the Hadoop ecosystem and tools in this 2 day course.
Architects, software developers, analysts and data scientists who need to apply data science and machine learning on Apache Hadoop.
Students must have experience with at least one programming such as Python, or scripting language, knowledge in statistics and/or mathematics, and a basic understanding of big data and Hadoop principles.
This 2 day course is designed for administrators who will be managing the Hortonworks Data Platform (HDP) 2.5 with Ambari. It Covers installation, configuration, and other typical cluster management tasks.
IT administrators and operators responsible for installing, configuring, and supporting an HDP 2.5 deployment in a Linux environment using Ambari.
No previous Hadoop knowledge is required, though will be useful. Attendees should be familiar with data center operations and Linux system administration. Students will need to bring their wi-fi enabled laptop pre-loaded with Chrome or Firefox browser in order to complete hands-on labs.
This 2 day course is designed for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Spark. The focus will be on utilizing the Spark API from Python or Scala.
Developers, Architects, and Admins who would like to learn more about developing data applications in Spark, how it will affect their environment, and ways to optimize application.
No previous Hadoop knowledge is required, though will be useful. Basic knowledge of Python or Scala is required. Previous exposure to SQL is helpful, but not required. Students will need to bring their wi-fi enabled laptop pre-loaded with Chrome or Firefox browser in order to complete hands-on labs.
This 2 day course is designed for ‘Data Stewards’ or ‘Data Flow Managers’ who are looking forward to automate the flow of data between systems.
Data Engineers, Integration Engineers and Architects who are looking forward to automate Data flow between systems.
Good to have some experience with Linux and basic understanding of DataFlow tools. Students will need to bring their wi-fi enabled laptop pre-loaded with Chrome or Firefox browser in order to complete hands-on labs.
Hortonworks is a leading innovator at creating, distributing and supporting enterprise‐ready open data platforms. Our mission is to manage the world’s data. We have a single‐minded focus on driving innovation in open source communities such as Apache Hadoop, NiFi, and Spark. Our open Connected Data Platforms power Modern Data Applications that deliver actionable intelligence from all data: data‐in‐motion and data‐at‐rest. Along with our 1600+ partners, we provide the expertise, training and services that allows our customers to unlock the transformational value of data across any line of business. We are Powering the Future of Data.Learn More
Yahoo is a guide focused on informing, connecting, and entertaining our users. By creating highly personalized experiences for our users, we keep people connected to what matters most to them, across devices and around the world. In turn, we create value for advertisers by connecting them with the audiences that build their businesses. Yahoo is headquartered in Sunnyvale, California, and has offices located throughout the Americas, Asia Pacific (APAC) and the Europe, Middle East and Africa (EMEA) regions.Learn More
Microsoft believes anyone should be able to get insights from Big Data. So, we bring the power of the cloud to Big Data making it easier than ever to work with all data types. With Microsoft data solutions, everyone can bring Big Data business insights to life through advanced analytics and stunning visualizations – all powered by our enterprise-grade, flexible, and open cloud.Learn More
Hewlett Packard Enterprise is an industry leading technology company that enables customers to go further, faster. With the industry’s most comprehensive portfolio, spanning the cloud to the data center to workplace applications, our technology and services help customers around the world make IT more efficient, more productive and more secure.Learn More
IBM is a globally integrated technology and consulting company headquartered in Armonk, New York. With operations in more than 170 countries, IBM attracts and retains some of the world’s most talented people to help solve technology problems and provide an edge for businesses, governments and non-profits. Innovation is at the core of IBM’s strategy. The company has reinvented itself through multiple technology eras and economic cycles, creating differentiating value for its clients. Today, as the IT industry is fundamentally changing at an unprecedented pace, IBM is much more than a “hardware, software, services” company. IBM is now emerging as a cognitive solutions and cloud platform company. Cognitive solutions powered by analytics and the cloud are the key to clients’ digital transformation. This transformation requires breakthroughs at every level of the enterprise IT foundation, from processors and computer design to storage, applications and analytics tools, networking and the integration layer. IBM solutions are built with open technologies and designed for mission-critical applications, offering a comprehensive platform for cognitive workloads.Learn More
Teradata empowers companies to achieve high-impact business outcomes through analytics. With a powerful combination of Industry expertise and leading hybrid cloud technologies for data warehousing and big data analytics, Teradata unleashes the potential of great companies. Partnering with top companies around the world, Teradata helps improve customer experience, mitigate risk, drive product innovation, achieve operational excellence, transform finance, and optimize assets. Teradata is recognized by media and industry analysts as a future-focused company for its technological excellence, sustainability, ethics, and business value.Learn More
Dell EMC, a part of Dell Inc., enables organizations to modernize, automate and transform their data center using industry-leading converged infrastructure, servers, storage and data protection technologies. This provides a trusted foundation for businesses to transform IT, through the creation of a hybrid cloud, and transform their business through the creation of cloud-native applications and big data solutions. Dell EMC services its customers – including 98 percent of the Fortune 500 – with the industry’s broadest, most innovative infrastructure portfolio from edge to core to cloud.Learn More
BMC is a global leader in innovative software solutions that enable businesses to transform into digital enterprises for the ultimate competitive advantage. Our Digital Enterprise Management solutions are designed to make digital business fast, seamless, and optimized from mainframe to mobile to cloud and beyond. BMC – Bring IT to Life BMC digital IT transforms 82% of the Fortune 500®.Learn More
Cloudera delivers the modern platform for machine learning and advanced analytics The world’s leading organizations trust Cloudera to help solve their most challenging business problems with Cloudera Enterprise, the fastest, easiest and most secure data platform built on Apache Hadoop and the latest open source technologies.Learn More
Attunity, voted Hortonworks ISV Partner of the Year, provides modern data integration software with change data capture technology, that efficiently delivers data in real-time and with no manual coding. Attunity software, serving half of the Fortune 100, non-disruptively replicates data from production sources such as Oracle, mainframe and SAP across database/data warehouse, data lake, streaming and cloud architectures. Attunity also accelerates data lake pipelines by automating the creation, updates and provisioning of analytics-ready data.
Pentaho, a Hitachi Group Company, is a leading data integration and business analytics company with an enterprise-class, open source-based platform for diverse big data deployments. Pentaho’s unified data integration and analytics platform is comprehensive, completely embeddable and delivers governed data to power any analytics in any environment. Pentaho has over 15,000 product deployments and over 1,500 commercial customers including ABN-AMRO Clearing, BT, Caterpillar Marine Asset Intelligence, EMC, Halliburton, and NASDAQ.Learn More
Unlock business potential from your Big Data faster and easier with SAP Vora. SAP Vora is an in-memory, distributed computing solution to run enriched, interactive analytics on both enterprise and Hadoop data, quickly and easily. SAP Vora is a complete, production-ready, fully integrated solution between the SAP HANA® platform and Hadoop environments – enabling high-performance, interactive bi-directional analytics across enterprise data in SAP HANA and data stored in Hadoop.Learn More
MapR-DB is a high-performance database for global data-intensive applications built into the MapR Converged Data Platform. MapR-DB is a global multi-model database that brings together operational applications, analytical applications, real-time streaming, and other workloads to enable next-generation data-intensive applications.Learn More
Datameer empowers organizations to embark on a data journey that answers a wide range of new, deeper business questions to increase business agility and responsiveness. Datameer’s modern BI platform offers agile analytics on an enterprise-grade infrastructure that can rapidly answer these questions and operationalize the results across the business.Learn More
Alation’s enterprise collaborative data platform empowers employees inside of data-driven enterprises to find, understand, and use the right data for better, faster business decisions. Alation combines the power of machine learning with human insight to automatically capture information about what the data describes, where the data comes from, who’s using it and how it’s used.Learn More
Kognitio is a pioneer in the development of scale-out, in-memory software for big data analytics. It provides an ultra-fast, high concurrency SQL layer allowing modern data visualization tools to maintain interactive performance. Kognitio is fully integrated with YARN on Hadoop or can be installed on standalone hardware infrastructure.Learn More
Talend is a leader in cloud and big data integration software that helps companies make data a strategic asset that provides the data agility required for companies to rapidly adopt the latest technology innovations and scale to meet the constantly evolving demands of modern business.Learn More
In 2007 two ex-Bank Of America colleagues – Partha Sen and Mike Upchurch – formed Fuzzy Logix. With a combined passion for solving problems with quantitative methods, data mining and pattern recognition, and a foresight of how businesses would increasingly collect information and need to achieve actionable insight from this data, they created a business that transformed data analytics. By performing the analytics directly where the data resides and eliminating the need to move it, in-database analytics was created.Learn More
SAS is the global leader in analytics solutions and services, and the largest privately-held software company in the world. Our innovative solutions – driven by a 26% reinvestment into R&D – help more than 83,000 customers around the globe make better decisions faster. Since 1976, SAS has provided businesses and government agencies with industry-leading solutions to help them transform their operations. Simply put, we help organizations turn large amounts of data into knowledge they can use.Learn More
NorCom is a full-chain supplier for Big Data Solutions. Highly qualified consultants and brilliant data scientists are developing big data and data management solutions according to the business strategy of our customers. In addition to our standard products, our software experts provide customized individual applicationsLearn More
Leading organizations worldwide count on NetApp for software, systems and services to manage and store data. We help customers capitalize on the value of their data in the hybrid cloud through our Data Fabric strategy, data management expertise, portfolio and ecosystem.Learn More
SynerScope has developed a patented (eco)system for combining and analyzing all kinds of data: numerical, text, video/voice, IoT, structured and unstructured, real time and historical. For large to massive data sets. The first one able to easily match several kinds of data and thus creating valuable decisive information.Learn More
Codecentric AG sees itself as a pioneer for agile software development and innovative technologies in Germany.
More than 400 employees at 15 European locations are developing the software solutions of the future. The business model combines the know-how of the best IT architects and software developers with the practical experience of numerous projects in areas such as continuous delivery, big data, performance solutions, agile and enterprise development.
We are in the midst of a digital revolution, one which is fundamentally altering the economic and social landscape. Digitally enhanced business models, digital consumers and digital employees require methods, technologies and thought processes which go beyond the traditional. In order to maintain their success, companies must take action.
As IT service providers, we give our customers the methods, technology and cultural awareness they need to succeed in the digital arena. In order to do this, we have established ourselves as a one-stop shop provider, offering an integrated portfolio of services to get your company fit for the digital future.Learn More
Over 25 years of research between Stanford and UC Berkeley led to Trifacta’s breakthrough user experience, workflow and architecture. With Trifacta, organizations can transition from raw data to actionable intelligence with greater speed and accuracy than ever before.Learn More
WANdisco is shaping the future of data infrastructure with our ground-breaking LIVE DATA Platform, enabling companies to put all their data to work for the business – all the time, at any scale. We make data always available, accurate and protected across environments, supporting exponential data growth within the same budget.Learn More
Cloudwick architects, engineers and manages hybrid, cloud-based and on-premise analytics and visualization solutions at scale for the Global 1000.Learn More