Tim Spann was a Senior Solutions Architect at AirisData working with Apache Spark and Machine Learning. Previously he was a Senior Software Engineer at SecurityScorecard (http://securityscorecard.com/), helping to build a reactive platform for monitoring real-time third-party vendor security risk in Java and Scala. Before that he was a Senior Field Engineer for Pivotal focusing on Cloud Foundry, HAWQ, and Big Data. He is an avid blogger and the Big Data Zone Leader for DZone (https://dzone.com/users/297029/bunkertor.html).
He runs the very successful Future of Data Princeton meetup, with over 830 members, at http://www.meetup.com/futureofdata-princeton/.
He is currently a Solutions Engineer at Hortonworks in the Princeton, New Jersey area.
You can find all the source code and material behind his talks on his GitHub and community blog.
Nagaraj Jay is a Systems Architect at Hortonworks. His main interests are Spark ML, AI, Flink, geospatial data, data replication, and data integration.
Alex is a skilled leader driving the evolution of eBay's analytical capabilities for its rapidly growing Marketplaces business, where over 100 million active users worldwide transact more than $3,500 of goods every second.
For the past 13 years at eBay, Alex has been helping the company leverage its unique end-to-end data set and drive continuous innovation to accelerate top-line growth. He directs the utilization of a significant investment in large-scale data management and processing resources and technologies while helping to evolve eBay's world-class analytics capabilities, and he enables business goals by partnering with internal customers and stakeholders to remove barriers to data-driven insights.
Carlo A. Curino received a Bachelor's degree in Computer Science from Politecnico di Milano. He participated in a joint project between the University of Illinois at Chicago (UIC) and Politecnico di Milano, obtaining a Master's degree in Computer Science from UIC and the Laurea Specialistica (cum laude) from Politecnico di Milano.
During his PhD at Politecnico di Milano, he spent almost two years as a visiting researcher at the University of California, Los Angeles (UCLA), working with Prof. Carlo Zaniolo (UCLA) and Prof. Alin Deutsch (UCSD). He then spent two years as a postdoctoral associate at MIT CSAIL, working with Prof. Samuel Madden and Prof. Hari Balakrishnan.
At MIT he was also the primary lecturer for the database course CS630, taught in collaboration with Mike Stonebraker. He spent a year as a Research Scientist at Yahoo! Research. Currently Carlo is a Principal Research Scientist in the Microsoft Cloud and Information Services Lab (CISL).
Carlo's recent research interests include large-scale distributed systems, performance tuning, and scheduling. In the past he has worked on mobile+cloud platforms, entity deduplication at scale, relational databases and cloud computing, workload management and performance analysis, schema evolution, and temporal databases.
Saurabh is a Solution Architect with strong expertise in the Hadoop ecosystem and rich field experience. He helps enterprises large and small solve their business problems strategically, functionally, and at scale by leveraging Big Data technologies. He brings hands-on experience building, coding, and directing successful information technology initiatives.
Saurabh has over 12 years of IT experience and has served in key positions such as Lead Big Data Solution Architect, Performance Architect, and Technology Architect in multiple large and complex enterprise programs.
He has extensive knowledge of Big Data/NoSQL technologies including Hadoop, YARN, Spark, HBase, Hive, Pig, Storm, Kafka, and NiFi, and has been working in this space for the last 6+ years.
Saurabh has architected and designed big data platforms and applications consisting of thousands of nodes, tens of petabytes of data, and complex ETL workflow requirements.
Saurabh has provided solutions for real-time analytics and data pipelines, leveraging co-located datasets at scale for analytics insight and pattern detection using a variety of data science algorithms. He has designed and implemented scalable data pipelines that produce results in minutes or hours across petabytes of data, while building and discovering new ways to co-locate, integrate, and leverage disparate datasets using key architecture patterns such as Lambda and HTAP for IoT applications.
Long-time Hadoop wrangler.
Venkatesh is a senior data scientist at PayPal, where he is building state-of-the-art tools for payment fraud detection. He has over 20 years of experience designing, developing, and leading teams to build scalable server-side software. In addition to being an expert in big data technologies, Venkatesh holds a Ph.D. in Computer Science with a specialization in Machine Learning and Natural Language Processing (NLP) and has worked on various problems in the areas of anti-spam, phishing detection, and face recognition.
Janet Li has over 15 years' experience in the IT industry in the areas of databases, analytics, and big data. Janet has managed internal and external teams of architects, database administrators, big data architects, and infrastructure providers for large IT projects. Most recently, Janet has been the lead of the Hortonworks Hadoop Data Lake for HP Inc. Janet has a bachelor's degree from Wuhan University, China and a Master's degree in Computer Science from the University of St. Thomas, MN, US. Janet is based in Austin, Texas, where she enjoys spending time with her family and hiking with her dog in the Texas Hill Country.
Pranay is an accomplished Hadoop architect and engineer with hands-on experience in Hadoop technologies, including installation, maintenance, and upgrades of HDP clusters and development using Pig, Hive, Spark, Solr, HBase, Flume, and Storm. Pranay has over 12 years of experience with multiple technologies, including server administration, Java technologies, .NET technologies, and mainframe applications.
As a principal consultant at T4G with over 20 years' experience, Darryl believes that 'better delivery is achieved through better design' on IT projects. His experience in the design and delivery of software solutions cuts across a variety of industries, including the retail, insurance, manufacturing, financial, and government sectors. Darryl specializes in application design with a focus on data science platforms within the Hadoop ecosystem and custom application development. In an architect role, he has produced technical design deliverables on many large enterprise projects and recently provided technical guidance on several Hadoop-related projects for national telecom and banking clients.
Kenneth Poon is a Director of Data Engineering at RBC, leading the design and implementation of enterprise data services. He has built several large-scale applications in the capital markets space and specializes in real-time streaming applications. Kenneth enjoys solving complex problems and writing performant code.
Toni LeTempt is a senior technical expert at Walmart. Toni has 18 years of IT experience, five of them working with large, secure enterprise Hadoop clusters.
Nishant is a Druid PMC member and a software engineer on the Business Intelligence team at Hortonworks. Prior to that, he was part of the Metamarkets backend team, responsible for analytics infrastructure, including real-time analytics in Druid. He holds a B.Tech in Computer Science from the National Institute of Technology, Kurukshetra, India.
Joe Olson is senior manager of big data analytics at United Airlines, focusing on running a big data warehouse and streaming data analytics.
Prior to United Airlines, Joe worked at several startup companies in Chicago, developing both big data and streaming data architectures built around emerging open source frameworks.
Ryan Blue works on open source projects, including Iceberg, Spark, Parquet, and Avro, at Netflix.
Shaoxuan Wang is a senior engineering manager at Alibaba, leading the development of Apache Flink SQL. He is a committer on the Apache Flink project. Before Alibaba, Shaoxuan worked on Facebook's TAO project, a data store for the social graph. Shaoxuan received his Ph.D. in Computer Engineering from the University of California, San Diego.
Born in the era of great music, Marc is best known for his ability to cause "Hello World" to segfault. He works on distributed systems with Apache Accumulo and Apache NiFi, but don't let his charm deceive you. At heart he is still just a de-referenced null pointer. In fact, this biography is better told via assembly language:
mov rbp, rsp                  ; set up the stack frame
mov QWORD PTR [rbp-8], rdi    ; store the incoming pointer argument
mov QWORD PTR [rbp-8], 0      ; ...then promptly overwrite it with NULL
mov rax, QWORD PTR [rbp-8]    ; load the (now null) pointer back
mov eax, DWORD PTR [rax]      ; dereference it: segfault, as promised
Timo Walther is a committer and PMC member of the Apache Flink project. He studied Computer Science at TU Berlin. Alongside his studies, he participated in the Database Systems and Information Management Group there and worked at IBM Germany. Timo works as a software engineer at data Artisans. In Flink, he is mainly working on the Table & SQL API.
Ian is a developer evangelist with an emphasis on machine learning and data science. He frequently posts code examples and tutorials on the MapR blog and on his personal blog at http://bigendiandata.com. He enjoys connecting with people at meetups and leads the Java User Group in Portland, Oregon.
Will Ochandarena is Senior Director of Product Management at MapR, responsible for cloud and IoT product strategy. He spends time with customers across several industries, including manufacturing, retail, and energy, helping them use MapR’s converged data fabric at the edge to solve new and interesting business problems. He also blogs at MapR.com (https://mapr.com/blog/author/will-ochandarena/).
Edwina Lu is a software engineer on LinkedIn’s Hadoop infrastructure development team, currently focused on supporting Spark on the company’s clusters. Previously, she worked at Oracle on database replication. Edwina holds a Master's degree in Computer Science from Stanford University.
Ye Zhou is a software engineer on LinkedIn’s Hadoop infrastructure development team, focusing mostly on Hadoop YARN and Spark-related projects. Ye holds a Master's degree in Computer Science from Carnegie Mellon University.
Satya N Ramachandran is the vice president of engineering at Neustar, a provider of real-time, cloud-based information services. He has more than 20 years of experience in distributed computing and large-scale analytics. Prior to Neustar, Satya led engineering at MarketShare and co-founded JovianDATA, a large-scale analytics platform built entirely on the cloud. He has held senior engineering roles in teams that built real-time and distributed analytics engines at Cognos, 3ParData, and Sybase.
Satya holds a Master’s in Computer Science from the Indian Institute of Science, with emphasis in databases and compilers. Satya holds several patents in distributed computing and has been a presenter at several conferences.
Gustavo Arocena is a Big Data Architect at the IBM Toronto Lab, with over 15 years of experience in database technology. Recently, Gustavo led the design and implementation of several components of the Big SQL engine, including the Hive-compatible IO layer, the INSERT statement, the integration with Apache Spark, and the high-performance ORC ingestion layer.
Gustavo has several publications and has presented at multiple conferences. He holds a Master's degree in Computer Science from the University of Toronto in the area of database language processing.
Sr. Technologist for American Water, working with HDP and HDF.
Conrad Fernandes is a longtime cybersecurity engineer and architect who has worked extensively with US defense agencies and the DoD since the early 2000s while at Booz Allen Hamilton. Conrad currently serves as a senior cybersecurity engineer at the Johns Hopkins Applied Physics Laboratory (APL), where he leads security and governance practices on emerging cloud technologies, including commercial and US GovCloud (e.g., Amazon Web Services) and Hadoop-based data science platforms from Cloudera and Hortonworks. Conrad recently presented strategies for "Incident Response and Spillage Handling in AWS" at Amazon's 2018 re:Invent conference. Additionally, Conrad has been researching and implementing security and audit logging and monitoring strategies on data science platforms at the Johns Hopkins Medical Institute (JHMI) that utilize various emerging security services found within the Hortonworks Data Platform (HDP). Conrad also enjoys sharing security best practices and lessons learned from these experiences with the larger cloud and big data community.
Inigo is a research software developer in the Systems Research Group at Microsoft Research, currently focusing on HDFS, specifically scaling it to 100K+ nodes and enabling it to harvest idle resources. He has been working on the Hadoop ecosystem since 2011 and is a committer. Prior to Microsoft, he worked at Rutgers University as a postdoctoral researcher.
Wei Yan is currently working at Uber on its data infrastructure team, with a focus on scaling HDFS. He is a Hadoop committer, and holds a PhD in computer science from Vanderbilt University.
Matt is presently the Director of Research and Development for Think Big Analytics, a Teradata company.
He directs software engineering for Teradata's best-in-class big data and analytics division, notably leading product vision, implementation, release, and ongoing development of the industry-leading open-source Kylo data lake management platform (http://kylo.io) on Hadoop and Spark, which generates tens of millions of dollars in services and support revenue. He previously held the position of applications architect and manager of IT application development for the National Ignition Facility at Lawrence Livermore National Laboratory, where he led a large team that undertook diverse engineering around petascale data science platforms, product data management, project controls, data warehousing, enterprise resource planning, laser operations, and laser shot setup and visualization.
Weiwei Yang is a staff software engineer at Alibaba, focusing on evolving data infrastructure to serve large-scale data processing. He has been working on the Hadoop ecosystem since 2010, is passionate about open source contributions, and is an active Hadoop committer. He works on various improvements to both HDFS and YARN to make them a better fit for internet-scale use cases. Prior to this, he worked at IBM for more than six years and was one of the founding members of the BigInsights product. He received his master's degree from Peking University and his bachelor's degree from Wuhan University, China.
Senior software development engineer in the Big Data products division of China Mobile (Suzhou) Software Technology Co. since 2015.
Responsible for the research and development of the CMH (China Mobile Hadoop) products.
Contributor to the Hadoop community.
An InfoSec generalist and CISSP. My more than a decade of work experience revolves around all aspects of security, mainly secure SDLC, source code analysis, vulnerability assessment, penetration testing for web applications, architecture review, incident response, ISMS compliance, and performing and facilitating third-party audits. I have managed multiple federal data center operations, OS and application hardening, Linux system administration, and solution deployment and integration for federal and various state governments.
I also have years of experience leading and managing a team to monitor, secure, and ensure round-the-clock availability for national critical infrastructure; solving brain-teasing, needle-in-a-haystack production issues (architecture, application, system, and network) and incorporating new requirements; and conducting vulnerability analysis and analyzing VA reports to suggest corrective and preventive actions (hotfixes, CVEs, design changes, hardening, patching, upgrades) to engineering and operations teams. I am a panelist for the Big Data Security Work Group.
I design solution architectures and capacity plans for highly available applications in cloud and data center environments.
Shivinder Singh is a Senior Big Data Enterprise Architect. His focus has been on building systems infrastructure for mission-critical business applications, and his work concentrates on maximizing the return on assets for the database portfolio. This involves developing a set of best practices in database strategy and infrastructure lifecycle management, covering the B2B, B2C, and B2E arenas. His work on strategy is centered on the major themes of consolidation and integration, with a focus on total cost of ownership, the critically important discipline of infrastructure maintenance, and the need for strong application design. With a sharp focus on innovation, his work has resulted in four patents currently being processed at the USPTO. Singh is a frequent speaker at various technology and executive conferences.
Kamil is a technology leader in the large-scale data warehousing and analytics space. He is CTO of Starburst, the enterprise Presto company. Prior to co-founding Starburst, Kamil was the Chief Architect at the Teradata Center for Hadoop in Boston, focusing on the open source SQL engine Presto. Previously, he was the co-founder and chief software architect of Hadapt, the first SQL-on-Hadoop company, acquired by Teradata in 2014.
Kamil began his journey with Hadoop and modern MPP SQL architectures about 10 years ago during a doctoral program at Yale University where he co-invented HadoopDB, the original foundation of Hadapt’s technology.
Kamil holds an M.S. in Computer Science from Wroclaw University of Technology, as well as an M.S. and an M.Phil. in Computer Science from Yale University.
Animesh Trivedi is a researcher at the IBM Research Lab in Zurich. His interests lie in anything and everything related to performance, spanning from multi-core CPUs to distributed environments. Previously, he has developed RDMA support for Java (jVerbs/DiSNI stack, SoCC'13), designed and implemented high-performance RPC over RDMA in Java (DaRPC, SoCC'14), performed detailed performance analysis of Spark workloads (HotCloud'16), and proposed and implemented FlashNet, a unified high-performance storage and networking stack in Linux (HotOS'13, Systor'17, which won the best paper award). All of his work is open sourced. Currently, he is investigating how modern high-performance network and storage devices can be leveraged in popular data processing frameworks such as Apache Spark, Hadoop, and Hive. He is one of the founding members of the Apache Crail (Incubating) project.
Yanwei (Wayne) Zhang is a senior data scientist at Uber Technologies Inc. He has a Master’s degree in statistics and a PhD in quantitative marketing. He has published several research papers in top journals in statistics and actuarial science. His interest is in large-scale machine learning, with a focus on applications in driving safety and insurance.
Anu Engineer was part of the original Windows Azure team and is the principal author of the VMware Certificate Authority, as well as an Apache Hadoop committer and PMC member. He works on HDFS and is one of the contributors to Ozone.
Dr. Vega has been a renewable energy consultant and researcher for the last 10 years. He currently leads a staff of engineers, analysts, and data scientists performing analysis and providing business insights and reporting on the energy market operations of CPS Energy in ERCOT. He is responsible for the analytical skills of the team, as well as the data and associated systems required to effectively provide solid analytics.
Before his current position he led R&D and technical performance in the areas of renewable energy forecasting, GIS LiDAR analytics, building load forecasting, and grid integration at The University of Texas at San Antonio (UTSA). Dr. Vega has led the development of three patents pending in the area of distributed energy forecasting and led the development of technologies for distributed IoT traffic monitoring and cyber abnormality detection and prediction for the electric utility industry.
He started and was responsible for the renewable energy consulting business in the US, Mexico, Brazil, and China for a global consulting company of 1,700+ employees. Dr. Vega helped grow the company’s renewable energy consulting annual revenues to about $6M in three years. He drives teamwork, effectively draws from the strengths of his team, and focuses on innovative ideas and great communication to provide solutions. Dr. Vega’s former clients include top-tier global owners, utilities, and manufacturers of renewable energy assets and operations. Dr. Vega is a registered Professional Engineer and holds an active NCEES record for licensure in any U.S. state.
Daniel Scarberry has been a volunteer pentester (hacker), DefCon goon, and open source security contributor for more than a decade. He spent a handful of years designing, building, and breaking large networks around the world for Juniper Networks, Arbor, Niksun, and JASK.ai. He has racked, stacked, configured, and crashed nearly every major network vendor's routing and switching platform. Daniel is now the CEO and founder of COGNETA.ai, a predictive cybersecurity defense company, spending his time unearthing the treasure trove of security information at the network level. Read more at cogneta.ai/blog and jask.ai/blog.
Jesús Camacho Rodríguez is a Member of Technical Staff at Hortonworks, and a PMC member of Apache Hive and Apache Calcite. His current work focuses on extending and improving query processing and optimization, ensuring that the increasingly complex workloads supported by Hive are executed quickly and efficiently. Prior to that, Jesús obtained his PhD in Computer Science from Paris-Sud University and Inria, working on large-scale Web data management. Jesús received his Computer Science and Engineering degree from the University of Almería, Spain.
Yanbo is a staff software engineer at Hortonworks. His main interests center around implementing effective machine learning and deep learning algorithms and models in areas such as recommendation systems and natural language processing. He is an Apache Spark PMC member and contributes to many other open source projects such as TensorFlow and Apache MXNet. He delivered the implementation of some core Spark MLlib algorithms. Prior to Hortonworks, he was a software engineer at Yahoo! and France Telecom, working on machine learning and distributed systems.
Mingjie Tang is an engineer at Hortonworks working on Spark SQL, Spark MLlib, and Spark Streaming. He has broad research interests in database management systems, similarity query processing, data indexing, big data computation, data mining, and machine learning. Mingjie completed his PhD in Computer Science at Purdue University.
Pavan Surapaneni has over 18 years of experience in the communications industry, specializing in implementations of business support and business intelligence applications. He has led several complex transformation initiatives, from building large billing systems to transforming legacy BI applications to use the right technologies to help the business meet its goals.
He has been the Lead Solutions Engineer for several implementations at Cox, a few of these being:
- Support for Dynamic Ad Insertion: building a data analysis pipeline using Hive and NiFi
- Insights into network performance: using Spark and HBase
Chris has over 10 years of experience building and supporting oil and gas data solutions. His experience spans data management, global deployments, enterprise architecture, and big data.
Ashutosh has been working on Hive for the last seven years. He works at Hortonworks, where his focus is on the compiler and optimizer.
Dr. Xiaoyi Lu is a Research Scientist in the Department of Computer Science and Engineering at the Ohio State University, USA. His current research interests include high-performance interconnects and protocols, Big Data, the Hadoop/Spark/Memcached ecosystem, parallel computing models (MPI/PGAS), virtualization, and cloud computing. He has published over 90 papers in international journals and conferences related to these research areas, and he has been actively involved in various professional activities in academic journals and conferences. Recently, Dr. Lu has been leading the research and development of RDMA-based accelerations for Apache Hadoop, Spark, HBase, and Memcached, as well as the OSU HiBD micro-benchmarks, which are publicly available from http://hibd.cse.ohio-state.edu. These libraries are currently being used by more than 275 organizations from 34 countries, and more than 24,900 downloads have taken place from the project site. He is a core member of the MVAPICH2 (High-Performance MPI over InfiniBand, Omni-Path, Ethernet/iWARP, and RoCE) project, where he leads the research and development of MVAPICH2-Virt (high-performance and scalable MPI for hypervisor- and container-based HPC clouds). He is a member of IEEE and ACM. More details about Dr. Lu are available at http://web.cse.ohio-state.edu/~luxi/.
Dr. Dhabaleswar K. (DK) Panda is a Professor and University Distinguished Scholar of Computer Science at the Ohio State University. He obtained his Ph.D. in computer engineering from the University of Southern California. His research interests include parallel computer architecture, high-performance computing, communication protocols, big data, deep learning, file systems, network-based computing, and Quality of Service. He has published over 400 papers in major journals and international conferences related to these research areas. Dr. Panda and his research group members have been doing extensive research on modern networking technologies including InfiniBand, Omni-Path, High-Speed Ethernet, and RDMA over Converged Enhanced Ethernet (RoCE). His research group is currently collaborating with national laboratories and leading InfiniBand and Ethernet/iWARP companies on designing various subsystems of next-generation high-end systems. The MVAPICH2 (High-Performance MPI over InfiniBand, iWARP, and RoCE) open-source software package, developed by his research group, is currently being used by more than 2,825 organizations worldwide (in 85 countries). This software has enabled several InfiniBand clusters (including the first one) to get into the latest TOP500 ranking. These software packages are also available with the OpenFabrics stack for network vendors (InfiniBand and iWARP), server vendors, and Linux distributors. The new RDMA-enabled Apache Hadoop and Memcached packages, consisting of acceleration for HDFS, MapReduce, RPC, and Memcached, are publicly available from http://hibd.cse.ohio-state.edu. Dr. Panda's research is supported by funding from the US National Science Foundation, the US Department of Energy, and several industry partners including Intel, Cisco, Sun, Mellanox, QLogic, NVIDIA, and NetApp. He is an IEEE Fellow and a member of ACM. More details about Dr. Panda, including a comprehensive CV and publications, are available at http://web.cse.ohio-state.edu/~panda.2/.
Peng Du is a senior software engineer and tech lead for the data science platform at Uber. He holds a PhD in computer science and an MA in applied mathematics, both from the University of California, San Diego.
I’m a technologist at heart and firmly believe in the power of technology to deliver experiences that enrich lives and delight millions globally. I have always found fulfilment in creating avenues that serve as force multipliers and enable others to achieve their full potential.
At Uber, I help drive product decisions to ensure our data storage and analytics capabilities are second to none, and can power Uber’s global ambitions.
Konstantinos Karanasos is a Senior Scientist at the Cloud and Information Services Lab (CISL) at Microsoft (based at the Silicon Valley office) and a PMC member of Apache Hadoop. His work at Microsoft has focused on resource management for the company's production analytics clusters and on query optimization for large-scale analytics. Within Apache Hadoop, Konstantinos has worked on adding support to YARN for opportunistic containers and for rich placement constraints. Prior to joining Microsoft, he was a postdoctoral researcher at IBM Almaden Research Center, where he was member of the Big Data analytics group, working on problems related to query optimization. Konstantinos obtained his PhD from Inria and the University Paris-Sud, France. In the context of his PhD, he worked in the areas of view-based query processing and semi-structured data management. He also holds a Diploma in Electrical and Computer Engineering from the National Technical University of Athens, Greece.
Wangda Tan is a Project Management Committee (PMC) member of Apache Hadoop and a Staff Software Engineer at Hortonworks. His major areas of work are Hadoop YARN GPU isolation and the resource scheduler, where he has contributed features such as node labeling, resource preemption, and container resizing. Before joining Hortonworks, he worked at Pivotal on integrating OpenMPI/GraphLab with Hadoop YARN. Before that, he worked at Alibaba Cloud Computing, helping create a large-scale machine learning, matrix, and statistics computation platform using MapReduce and MPI.
David is an Architect in the Data Science and Engineering team at GoPro and the creator of their Spark-Kafka streaming data ingestion pipeline. He has been developing scalable data processing pipelines and eCommerce systems for over 20 years in Silicon Valley. David's current big data interests include streaming data as fast as possible from devices to near real-time dashboards and switching his primary programming language to Scala from Java after nearly 20 years. He holds a B.Sc. in Computer Science from The Ohio State University.
Hao joined the Data Science and Engineering team at GoPro in 2016 and immediately started cranking out Java and Scala code for use in both the Spark Streaming and batch data pipelines. Hao continuously supports the data publishing needs of the device and software application development teams at GoPro and assists them in utilizing the most appropriate and efficient ways to stream, store, and access their data. He has a M.Sc. in Computer Science from Northeastern University.
A product manager in the data analytics industry for more than 10 years, he previously worked for Microsoft and Datadog and is now the product manager for Hive, Druid, HBase, and Phoenix at Hortonworks.
Namrata Ghadi is a Software Development Engineer (ML and Data Science) on Workday’s Syman team. She has been working on ML and data science based projects for 2+ years and as a software engineer for 6+ years. Namrata has an MS in CS from Carnegie Mellon University.
Adam Baker is a Senior Software Development Engineer (ML and Data Science) on Workday’s Machine Learning team. He worked on computational linguistics in graduate school, has worked in NLP and ML for one year at Workday, and has worked in software development for seven years. He has an MA in Linguistics from the University of Chicago and a B.S. in CS from Ohio State University.
Big Data software engineer at Intel. Received a Ph.D. from Southeast University (China).
Working on: Big Data storage.
Research interests: data privacy and PII-based user identification.
Joy is a distributed systems architect at Bloomberg with 18+ years of software design and development experience, 10+ years of Java/Scala development experience, 7+ years of experience in Big Data and Hadoop technologies, and 4+ years of Apache Spark experience, with a special interest in distributed/parallel computing; he currently works on Kubernetes, cloud, and Big Data technologies. Joy is an open-source contributor to Hadoop and Jupyter Notebook ecosystem products and technologies. He is also actively involved in various software architecture organizations. Joy is a frequent speaker at conferences, user groups, and code camps.
Gregory is an expert in programming language runtimes, distributed systems, and big data processing. During his time on the ETA team at Lyft, Gregory transformed data processing from a manual process that took weeks to run to a fully automated process that runs every 10 minutes. As a formative member of the Data Science Platform team, Gregory helped define and deliver the vision for expanded use of machine learning techniques across Lyft. Now as a member of the Streaming Platform team, Gregory is focused on the delivery of high quality data to analytics and machine learning applications at Lyft. Before Lyft, Gregory was the lead architect of Salesforce’s Apex ecosystem — including the definition of the Apex language, compiler, runtime, debugger and other tooling, governance, batch processing, and caching — that services billions of requests a month.
Principal Engineer with 12+ years of IT experience, including experience in the areas of cloud computing, advanced system automation and tools design, and system administration. Skilled in managing infrastructure and implementing technology to support large user groups, supporting users at corporate headquarters as well as multiple remote locations, and effectively managing high-end Hadoop clusters. He builds, manages, and supports Hadoop clusters with thousands of nodes and petabytes of data, running the Hadoop Distributed File System (HDFS) and the MapReduce framework. He set up an automation framework for the management of user access and data on clusters, automated the management of Yahoo clusters and the break-fixing of bad nodes, and delivered a completely automated solution for 40+ clusters/45K nodes spanning three colos.
Sailaja Polavarapu is an Apache committer and currently works at Hortonworks on the Enterprise Security team. Sailaja is mainly responsible for the user management module of the Apache Ranger product. Prior to this, Sailaja was at Citrix, responsible for the development of the device management service for the XenMobile product and LDAP authentication for the Access Gateway product. Sailaja holds an M.S. in Computer Engineering from San Jose State University, CA and a B.S. in Computer Engineering from Anna University, India.
Velmurugan Periasamy (Vel) is part of the Enterprise Security Engineering team at Hortonworks, contributing to Apache Ranger. He is an Apache Ranger committer and PMC member. He has many years of software industry experience developing and managing large-scale enterprise systems. He has delivered many technical talks at Hadoop Summit, JavaOne, OSCON, Jazoon, etc.
Carson Wang is a big data software engineer at Intel, where he focuses on developing and improving new big data technologies. He is an active open source contributor to the Apache Spark and Alluxio projects as well as a core developer and maintainer of HiBench, an open source big data microbenchmark suite. Previously, Carson worked for Microsoft on Windows Azure.
Yuanjian is a senior engineer and team leader of Baidu's internal Spark platform. He is also a Spark contributor. Prior to that, he worked on real-time computing and distributed tracing systems at Baidu.
With over twenty-five years of Information Technology (IT) industry experience, Avinash specializes in providing information management and business analytics solutions. His well-rounded experience spans the areas of advanced analytics, data strategy, information architecture, data modeling, data integration design, and development. Avinash is business focused, creative in problem solving, and committed to project quality and success. As a trusted advisor to his clients, Avinash has successfully delivered business intelligence solutions in a variety of industries, including financial services, banking, insurance, retail, and telecommunications. Avinash holds the Chartered Property Casualty Underwriter (CPCU) and Associate in Insurance Data Analytics (AIDA) professional designations in property-casualty insurance and risk management, administered by The Institutes, and an MBA in Finance and Information Systems from the Stern School of Business, NYU.
Uli has 18 years of hands-on experience as a consultant, architect, and manager in the data industry. He frequently speaks at conferences. Uli has architected and delivered data warehouses in Europe, North America, and South East Asia. He is a traveler between the worlds of traditional data warehousing and big data technologies.
Uli is a regular contributor to blogs and books, holds an Oracle ACE award, and chairs the Hadoop User Group Ireland. He is also a co-founder and VP of the Irish chapter of DAMA, a not-for-profit global data management organization, and has co-founded the Irish Oracle Big Data User Group.
Last but not least, Uli is the CEO of Sonra, the data liberation company. Sonra develops Flexter, a tool to automate the conversion of complex XML to a database, text, Spark, or Hadoop.
1993.9-1997.6: Bachelor of Science degree, Xiangtan University, China.
2000.9-2003.6: Master's degree in Engineering, Xiangtan University, China.
2008.9-2016.12: Ph.D. (Doctorate in Management), Central South University, China.
Expert in information processing at China Mobile.
WU Liang-gang, LI Shao-nian. Evaluation of the Factors of Telecom Production Based on Temporal Associative Classification with Positive and Negative Rules [J]. Journal of Systems & Management, 2013, 22(01): 31-38.
LI Shao-nian, WU Liang-gang. The Research of Telecom Product Lifecycle Prediction Based on Temporally Restricted Bayesian Networks [J]. Mathematics in Practice and Theory, 2015, 45(08): 63-73.
LI Shao-nian, WU Liang-gang. Effective Continuous Attributes Reduction Algorithm Based on Neighborhood Entropy-based Measurement [J]. Computer Engineering & Science, 2016, 38(02): 350-355.
LI Shao-nian, WU Liang-gang. Novel Approach for Multi-criteria Group Decision Making Based on Fuzzy Evidential Reasoning and Improved TOPSIS Method [J]. Operations Research and Management Science, 2017, 26(06): 16-23.
Duan Yunfeng is the chief designer of China Mobile's big data system, and he is a postdoctoral researcher in the information processing department of Peking University.
Dr. Duan Yunfeng has undertaken the design, construction and operation of China Mobile's data warehouse and big data center, and has accumulated 16 years of practical experience in the field of big data.
He has led the team from system construction through system operation and maintenance, and has established many big data applications.
He has designed more than 150 technical documents, totaling about 12,000,000 words, covering the data model, data interfaces, system architecture, quality control, business applications, system security, and other fields of the big data system.
He has published two books: "Big Data and Big Analysis" and "Big Data Internet Thinking".
China Mobile's big data system currently has 16,000 (x86) nodes and more than 500 PB of system capacity, including a data volume of 300 PB and active data for 800 million customers.
Eric Krenz is a data engineer on the Big Data Platform team at Target. As a student, he presented his research on the integration between OpenStack Swift and Hadoop at the Midwest Instructional Computing Symposium in 2015.
Constantin is a top Hortonworks Community Connection contributor, publishing multiple articles around stream analytics and geospatial analytics. He is currently an active member of two SME groups at Hortonworks: Stream Analytics and Geospatial Analytics.
Through his work at Hortonworks, leveraging his vast experience over the last two decades building large-scale data processing systems using a variety of database technologies, he works to build end-to-end solutions involving big data technologies. He holds a Ph.D. in numerical modeling and computer simulation in the field of petroleum engineering. He is also a certified data scientist, PMP and ScrumMaster.
Dr. James Hughes is a mathematician at Commonwealth Computer Research, Inc. in Charlottesville, Virginia. He is a core committer for GeoMesa which leverages Accumulo, HBase and other distributed database systems to provide distributed computation and query engines. He is a LocationTech committer for GeoMesa, SFCurve, and JTS. He serves on the LocationTech Project Management Committee and Steering Committee. Through work with LocationTech and OSGeo projects like GeoTools and GeoServer, he works to build end-to-end solutions for big spatio-temporal problems. He holds a PhD in algebraic topology from the University of Virginia.
I began programming in high school after begging my folks to buy me an Atari 400. I wrote a stock market simulation program using a joystick for input and presented it at the Atari users group in Peoria, Illinois. I went to a junior college to learn programming, got stuck in a COBOL punch-card class that I hated, and jumped into Bible school. I was OK at being a youth pastor and loved speaking in public, but got a bit bored, so I applied for and got a job at Caterpillar. Three years in, I solved Y2K problems and finally landed a Java job.
From there I wrote many corporate systems and was known for writing systems that never went down. I personally wrote the third-generation telematics system at Caterpillar on corporate hardware while begging to go to the cloud. Three years ago my requests became a reality when we started using the Azure cloud. We immediately started writing the fourth-generation telematics system at Caterpillar, capable of processing messages from 2 million assets. Currently we are up to 500 messages per second, and we are preparing for 10x that using technology that includes Storm, HBase, Phoenix, SQL Server, Event Hubs, and Spark.
Stuart loves storage (208 PB at Criteo) and is part of Criteo's Lake team that runs some small and two rather large Hadoop clusters. He also loves automation with Chef, because configuring more than 3,000 Hadoop nodes by hand is just too slow. Before discovering Hadoop he developed user interfaces and databases for biotech companies. Stuart has presented at ACM CHI 2000, Devoxx 2016, NABD 2016, Hadoop Summit Tokyo 2016, Apache Big Data Europe 2016, Big Data Tech Warsaw 2017, and Apache Big Data North America 2017.
Cliff Leung is currently a Distinguished Engineer at Futurewei, a subsidiary of Huawei Inc., where he leads a team developing advanced capabilities in enterprise-grade cloud middleware software for big data analytics. Cliff has more than 20 years of extensive R&D experience in database technology, SQL query optimization, data warehousing, mobile database systems and synchronization, database text search, database tooling and performance monitoring, and big data. Prior to joining Huawei, Cliff was a Distinguished Engineer at IBM, where he pioneered and developed query rewrite optimization technology in DB2 and the Query Tuner product portfolio. Cliff holds a Ph.D. in Computer Science from UCLA and has been granted more than 30 US patents, mostly in SQL query optimization and database systems. Cliff has also published several technical papers in major database conferences such as VLDB and SIGMOD, and was a frequent speaker at (IBM) user groups.
Andy LoPresto is a Sr. Member of Technical Staff at Hortonworks working on the Hortonworks DataFlow team. In this role he serves as both a Committer and Project Management Committee Member for Apache NiFi, an open source, robust, secure data routing and delivery system. Andy focuses on security concerns within NiFi including identity management, TLS negotiation, data protection, access control, encryption and hashing. Andy is also involved with the sub-project, Apache MiNiFi, which drives edge data collection, including secure command and control and immediate data provenance and governance. He has presented about NiFi at DataWorks Summit Sydney 2017, Hadoop Summit San Jose 2016, FOSDEM '17 in Brussels, and the OpenIoT Summit 2017.
I am a software engineer at Dremio and a committer on the Apache Arrow project. Previously, I was part of the database kernel team at Oracle, where I worked on the storage, indexing, and in-memory columnar query processing layers of the Oracle RDBMS. I hold an MS in software engineering from CMU and a BS in information systems from BITS Pilani, India. During my studies, I focused on distributed systems, databases, and software architecture.
My technical blog: https://loonytek.com/
My technical contributions on Quora: https://www.quora.com/profile/Siddharth-Teotia
Apart from my job as a Software Engineer, I love writing technical content and doing technical presentations about the work I do.
I'm a technical lead at Hortonworks, where I concentrate on adding support for operations with transactional semantics to Apache Hive. Prior to that I was a lead engineer on a federated SQL engine at Composite Software. Before that I held various engineering roles at BEA, Oracle, and others.
Mukul is currently working at Hortonworks and is an active contributor to the Apache Hadoop and Apache Ratis projects. He received his master's degree from Carnegie Mellon University and his bachelor's from Visvesvaraya Technological University. He has been working actively on filesystems for the last eight years and has worked extensively on the Ozone object store, flash-based filesystems, Shingled Magnetic Recording drives, data replication, and disaster recovery solutions.
Lokesh Jain is a software engineer at Hortonworks. He completed a B.E. (Hons.) in Computer Science and an M.Sc. (Hons.) in Mathematics at BITS Pilani. He is one of the early developers of the Apache Ratis project and also contributes to Apache Hadoop. He also worked on a GSoC project for the SageMath organization in 2017.
Cassandra has 20 years of experience in search and knowledge management. She has been a Lucene/Solr committer since 2013 and a member of the PMC since 2016. As Director of Engineering at Lucidworks, she manages Solr and open source development.
Marcelline Saunders is Director, Global Partner Enablement at Lucidworks and has over 25 years of experience in information management including enterprise search, knowledge management, and e-discovery technologies for both on-premise and cloud solutions. Marcelline recently held senior product management roles at Hitachi Data Systems where she was responsible for global search solutions and strategy. Marcelline currently manages technical partner relationships at Lucidworks, the leader in open source and commercial search solutions.
Chris Douglas has worked on Apache Hadoop since 2007, starting as a frequent contributor to the MapReduce data path. He is one of the original designers of YARN. As a member of the Cloud and Information Services Lab (CISL) at Microsoft, his research focuses on systems for large-scale analytics. His current work builds storage abstractions for big data workloads in cloud settings.
Yahoo Inc / Oath Inc (Feb 2017 to present)
Intern, Yahoo Inc (June 2016 to Sep 2016)
Cognizant Technology Solutions (July 2014 to May 2015)
University of California, San Diego (Sep 2015 to Dec 2016): Master's in Computer Science
Institute of Technology, Nirma University (Aug 2010 to June 2014): Bachelor's in Computer Science & Engineering
Data analyst -> DBA -> Big Data engineer -> Big Data architect.
Nine years of experience in data services and data warehouse/BI applications.
Certified Cassandra administrator and Oracle database administrator.
Kyle Cooper has over 20 years of experience at Cox Communications, ranging from technical installation, Systems Operations Center/Regional Operations Center management, alarm management/problem management, and network data analytics to network planning.
I am a Principal Software Engineer focusing on Data Science at Hortonworks. I am also the Vice President and an active committer for the Apache Metron project. In the past, I've worked as an architect and senior engineer at a healthcare informatics startup spun out of the Cleveland Clinic, as a developer at Oracle and as a Research Geophysicist in the Oil & Gas industry. Before that, I was a poor graduate student in Math at Texas A&M.
I primarily work with the Apache Hadoop software stack. I specialize in writing software and solving problems where there are scalability concerns due to either large amounts of traffic or large amounts of data. I have a particular passion for data science problems or anything vaguely mathematical. As a Principal Architect focused on data science, I spend time with a variety of clients, large and small, mentoring them and helping them use Hadoop to solve their problems.
Michael Miklavcic is a committer and PMC member for Apache Metron and has been involved with the project for the past two years. He is a software engineer and architect with over ten years of industry experience and worked as a Systems Architect with Hortonworks for three years prior to transitioning to the engineering team for Metron. He has given numerous talks on both the domestic and international stage, including Hadoop Summit San Jose, ApacheCon Big Data Europe, and multiple local Hadoop user groups. He is a code contributor to multiple Apache open source projects and has worked directly with clients to implement solutions using Hadoop. Michael has degrees in computer science and computer information systems from Baldwin Wallace in Cleveland, OH.
A core member of the R&D Engineering Group at Hortonworks, primarily working on HDP (Hortonworks Data Platform) and DPS (Data Plane Service).
An active contributor and committer on the Apache Hive project, with major contributions to Hive replication and ACID features. Also has rich experience in distributed systems and in-memory database technologies.
Barbara Eckman is a Principal Data Architect at Comcast. She leads data governance for an innovative, division-wide initiative comprising near-real-time ingesting, streaming, transforming, storing, and analyzing Big Data. Barbara is a recognized technical innovator in Big Data architecture and governance, as well as scientific data and model integration. Her experience includes technical leadership positions at a Human Genome Project Center, Merck, GlaxoSmithKline, and IBM. She served on the IBM Academy of Technology, an internal peer-elected organization akin to the National Academy of Sciences.
George Vetticaden is Vice President of Product Management within Emerging Products at Hortonworks. In this role, he is responsible for the strategic vision and concerted delivery across all the products within Emerging Products, including Hortonworks DataFlow (HDF), which includes NiFi, Storm, Kafka, Streaming Analytics Manager, and Schema Registry, as well as solutions built on top of the platform, including cybersecurity/Metron.
Over the last five years at Hortonworks, George has spent a lot of time in the field with enterprise customers, helping them build big data solutions on top of Hadoop. In his previous role at Hortonworks, George was the Director of Solutions Engineering, where he led a team of 15 senior Big Data solution architects helping large enterprise customers from use case inception, design, and architecture through implementation of use cases monetizing data with Hadoop. In addition, he is also a committer on the Apache Metron project. George graduated from Trinity University with a BA in Computer Science.
(LinkedIn Profile: https://www.linkedin.com/in/georgevetticaden)
Product Designer behind Metron Investigator, Streaming Analytics Manager, and the new Kafka product.
Roy is a field data scientist with Interset, working with customers to identify new data sources that can be incorporated into Interset's advanced security analytics software. He holds a PhD in Applied Mathematics from McGill University, and has been applying advanced statistical methods to analyze data for over 10 years: early on as a data miner, and more recently in the capacity of a data scientist.
Has been involved with the design and development of streaming analytics platforms at Hortonworks for the last two and a half years. Has also been contributing to Apache Storm (http://storm.apache.org/) and is currently a committer and PMC member of the project. Prior to Hortonworks, was involved in the development of various streaming and Big Data products at Informatica and multi-tenant distributed systems at Yahoo.
Kevin Brown is the Big Data Service Platform Engineer for Data & Analytics at ExxonMobil, with a team of data architects and engineers focused on a mission to embed world-class analytics to solve big data problems across the corporation. Kevin’s previous experience in software development and Linux administration played a critical role in helping pioneer a Big Data platform at ExxonMobil. Kevin holds an Information Technology degree from Brigham Young University.
Magnus Hyttsten is a Developer Advocate for TensorFlow @ Google. He works on developing the TensorFlow product, is a developer fanatic, and is an appreciated speaker on machine learning and mobile development at major industry events such as Google I/O, The AI Summit, AI Conference, ODSC, and MWC. Right now, he is focusing on reinforcement learning models, as well as making model inference effective on mobile.
I work as a Mobile Software Engineer at Uber on iOS and fraud-related problems. I'm focused on the intersection of mobile and machine learning; I combine both to build capabilities into Uber that create a great, safe, and fraud-free user experience. I hold a Bachelor of Science in Physics and a Bachelor of Arts in Economics from UC Davis.
Lenny Evans is a data scientist at Uber focused on the applications of unsupervised methods and deep learning to fraud prevention, specifically developing anomaly detection models to prevent account takeovers and computer vision models for verifying possession of credit cards.
An engineer, architect, and IT strategist who enjoys using technology as a strategic driver for business growth. I have been working with emerging technologies like Hadoop, NoSQL, big data, and in-memory analytics for the past 3 years, and before that I played various roles in the space of business intelligence and advanced analytics for 12 years.
I am responsible for the big data & advanced analytics practice at Land O'Lakes. This includes setting our analytics strategy and executing it with a team of data scientists, data engineers, and platform engineers. I work closely with leaders of functional/business units such as supply chain, R&D, sales, marketing, and finance to understand their strategic direction and key business challenges, and I translate those business needs into a technology strategy and execution plan with well-defined ROI. As part of this role, my responsibility also includes education and road shows within the organization to increase awareness among business leaders of the value of cloud, data, and analytic techniques such as machine learning, deep learning, and AI.
Montana leads the Shopper App Engineering team and builds tools to enable Instacart's personal shoppers to deliver groceries in an hour, including mobile apps on Android and iOS powered by a deep logistical web service. He has more than a decade of full stack software engineering experience, from internal tool creation in the heavily regulated health care industry to blue sky app development in the travel space. Previously, he held technical roles at Omniref, Tomo, RescueTime and Jobster.
Alan is a founder of Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan is PMC member on Apache Hive, Pig, and many other Apache projects. As part of the Apache Incubator PMC he has mentored many new Apache communities. Alan has a BS in Mathematics from Oregon State University and a MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig, a book from O’Reilly Press. Follow Alan on Twitter: @alanfgates.
I have worked as a software developer at Oath for over 4 years, and I'm currently a member of the audience data team. Our team builds data pipelines that process all user activity data across Oath. I have attended DataWorks Summit for the last several years and have presented talks at other conferences such as XLDB and Tech Pulse (Yahoo's internal conference).
I started working with Druid and its supporting tools in 2016. In 2017, I had the chance to build a SQL interface for Druid to allow more consumers to access it. Right now, I am still working closely with Druid and developing streaming data systems with it. Looking forward to making more progress!
I am a Ranger project committer and PMC member.
Apache Ranger Committer
John Highcock is a Systems Architect at Hortonworks. Prior to joining Hortonworks, he worked on big data projects at the US Department of Justice.
Trevor Grant is a PMC member of the Apache Mahout and Apache Streams projects. In his day job he is an Open Source Evangelist / AI Engineer at IBM. He strongly believes in / is interested in: (1) math at scale, (2) machine learning at speed (in the stream), (3) AI on the edge, and (4) the idea that anyone can read code from home, so conference talks should be fun and informative. He has various videos, blogs, rants, advanced degrees, and code, which can all be easily found online under his common handle "rawkintrevo".
Xiao Li is a software engineer at Databricks. His main interests are Spark SQL, data replication and data integration. Previously, he was an IBM Master Inventor and an expert on asynchronous database replication. He received his Ph.D. from the University of Florida in 2011. He is a Spark committer and a Spark PMC member.
Wenchen Fan is a Software Engineer at Databricks, working on Spark Core and Spark SQL. He focuses mainly on the open source community and has helped discuss and review many features and fixes in Spark. He is a Spark committer and a Spark PMC member.
Jeff Baker is the Director of Data Analytics Technology at Universal Parks and Resorts. In this role, Jeff is responsible for overseeing Universal’s technological transformation into a data-driven organization.
Prior to joining Universal, Jeff spent more than 20 years overseeing the evaluation, design, and implementation of IT strategy roadmaps, architecture, technologies, and execution plans as part of multi-million-dollar transformation initiatives at JPMorgan Chase, Nationwide Insurance, and Sprint. As a Director, Jeff has directed major technology and infrastructure deployments, which have led to substantial revenue gains and cost reductions.
Jeff earned a Master’s Degree in Data Science from Northwestern University and earned his Bachelor’s Degree in Behavioral and Social Sciences from the University of Maryland. He currently lives in Orlando, FL.
-Sr. Director, Product Management, Hadoop Core, Data Science and Data Management, Hortonworks, 2016-Present
-CEO/Co-Founder, Stealth Mode Startup, 2015-2016
-Head of Product Management & Technical Marketing, Skeyra (acquired by Western Digital), 2013-2015
-Director, Product Management, EMC, 2011-2013
-Staff Engineer->Sr. Product Manager, Brocade, 2001-2011
Avrilia is a Senior Scientist at Microsoft's Cloud and Information Services Lab, where her research is focused on scalable real-time stream processing systems. She is also an active contributor to Heron, collaborating with Twitter. Prior to her current role, she was a research scientist at IBM Research working on SQL-on-Hadoop systems. She holds a PhD in data management from the University of Wisconsin-Madison.
Ashvin Agrawal is a Senior Research Engineer at Microsoft, where he works on streaming systems and contributes to the Apache Heron and Dhalion projects. Ashvin has more than 15 years of software development experience and specializes in developing large-scale distributed systems. Previously, he worked at VMware, Yahoo, and Mojo Networks. Ashvin holds an M.Tech. in Computer Science from IIT Kanpur, India.
Sanjeev Mohan is a Research Analyst for the Data Management Strategies group within Gartner. He covers the end-to-end data pipeline, including ingestion, persistence, data transformation and advanced analytics. His research includes machine learning, IoT and data governance across the pipeline, as well as storage infrastructure and architectures.
Srikanth Venkat is currently responsible for the Security & Governance portfolio of products at Hortonworks, which includes Apache Knox, Apache Ranger, Apache Atlas, platform-wide security and the Hortonworks DataPlane Service. Prior to Hortonworks, Srikanth held multiple roles in the areas of cloud services, marketplaces, security, and business applications. His experience includes leadership across Product Management, Strategy and Operations, and Technical Architecture, spanning startups to global organizations including Telefonica, Salesforce.com, Cisco-WebEx, Proofpoint, Dataguise, Trilogy Software, and Hewlett-Packard. Srikanth holds a PhD in Engineering with a focus on Artificial Intelligence from the University of Pittsburgh, an MBA in General Management from Indiana University, and a Master's in Global Management from Thunderbird School of Global Management. Srikanth is a Data Sciences & Machine Learning hobbyist and enjoys tinkering with Big Data technologies.
Mohammad Kamrul Islam is currently working at Uber on its Data Infrastructure team as a Staff Software Engineer. Previously, he worked at LinkedIn for more than two years as a Staff Software Engineer in their Hadoop Development team. Before that, he worked at Yahoo! for nearly five years as an Oozie architect/technical lead. He has been intimately involved with the Apache Hadoop ecosystem since 2009. Mohammad has a Ph.D. in computer science with a specialization in parallel job scheduling from Ohio State University. He is a Project Management Committee (PMC) member of both Apache Oozie and Apache Tez and frequently contributes to Apache HDFS/YARN/MapReduce and Apache Hive.
Davor serves as chair of the Apache Beam Project Management Committee and is the CEO of Operiant, a company he founded that helps users get Big Data to production. He was previously a software engineer at Google, where he worked on Google Cloud Dataflow, the predecessor to Apache Beam, since its beginnings.
As General Manager for Insurance, Cindy Maike is responsible for global insurance strategy and customer engagement for Hortonworks. She works with customers and partners on leveraging analytics for current-day business growth and exploring the use of new data sources to drive innovation in the evolving world of insurance. She has over 25 years of finance, consulting and advisory services experience in the insurance industry, working with clients globally on business strategy that leverages analytics and technology to further drive business results.
Cindy has deep industry knowledge in both claims and underwriting with a focus on the use of analytics and data to enhance business outcomes. She has held positions with the IBM Watson Solution Group and Carrier Insurance, served as Director of Strategy at ACORD, and co-founded Strategy Meets Action Research and Advisory Services. Cindy is also a CPA.
She is passionate about solving business problems, believes in continuous process improvement, and strongly believes that today's next generation of business intelligence, in the form of advanced analytics, will revolutionize the insurance industry. Cindy frequently speaks at business events on the value of business analytics, cognitive computing and the evolution of insurance in a connected world.
I am a software engineer in Big Data.
After studying Applied Mathematics and Computer Science at Harvard, Jeremy built a distributed deep learning library on Apache Spark. Now he works on machine learning research at Google Brain.
I have over 17 years of experience in various Data Warehousing and Business Intelligence technologies. At PayPal, I am part of the Merchant team, building analytical solutions on user behavioral data, algorithms for efficient consumer targeting, and a Customer Identification Repository.
I am currently focused on Apache Spark, Scala, Hive, HBase and Machine Learning models running on the Hortonworks Hadoop platform. I am also one of the founding members at PayPal to use Druid and build analytical solutions on top of terabytes of data utilizing the existing Hadoop environment at PayPal.
My areas of interest are Spark, Hadoop, HBase, Druid, DruidSQL and Machine Learning models. I am also interested in bringing my BI expertise to help build analytical and reporting solutions on top of Big Data platforms.
Big Data Technologist for over 12 years.
Sriram is an architect in the IBM Analytics group tasked with delivering modern cloud-native offerings such as Data Science Experience (DSX) in private clouds, ensuring reliability and scalability using technologies like Docker & Kubernetes. His current focus is on integrating DSX with Hadoop clusters and enabling Machine/Deep Learning at scale. Prior to this he worked on the delivery and operations of data services, such as dashDB on IBM Bluemix. He has years of experience developing enterprise-worthy relational database, warehouse, ETL and tools offerings. Sriram started his career at Informix Software, where he worked on application server technology, web content management software and database tooling, as well as on the Red Brick Data Warehouse suite.
Issac works at LinkedIn in the data management team which is in charge of ingestion, lifecycle, and compliance of most HDFS data, as well as providing tools for the big data ecosystem in LinkedIn. He is a core developer and committer for Apache Gobblin, a distributed big data integration framework for batch and streaming systems. Previous work focused on analytics for video streaming.
Anthony is a senior software engineer working on the Data Management team at LinkedIn, where he works on LinkedIn’s data access layer, Dali, and has contributed to Apache Hive and Pig. He holds a B.S. in Computer Science from Yale University.
In my current role I work on various tools & technologies related to big data, monitoring, messaging, streaming analytics, machine learning, etc., and develop solutions that can be used by multiple teams inside the company. Throughout my career I have helped development teams create highly available and scalable solutions, and have solved problems related to payment fraud, healthcare & customer loyalty.
Owen O'Malley is a co-founder and technical fellow at Hortonworks, a rapidly growing company (25 to 1,000 employees in 5 years), which develops the completely open source Hortonworks Data Platform (HDP). HDP includes Hadoop and the large ecosystem of big data tools that enterprises need for their data analytics. Owen has been working on Hadoop since the beginning of 2006 at Yahoo, was the first committer added to the project, and used Hadoop to set the Gray sort benchmark in 2008 and 2009. In the last 8 years, he has been the architect of MapReduce, Security, and now Hive. Recently he has been driving the development of the ORC file format and adding ACID transactions to Hive. Before working on Hadoop, he worked on Yahoo Search's WebMap project, which was the original motivation for Yahoo to work on Hadoop. Prior to Yahoo, he wandered between testing (UCI), static analysis (Reasoning), configuration management (Sun), and software model checking (NASA). He received his PhD in Software Engineering from University of California, Irvine.
Apache Hadoop PMC member and committer working as a Staff Engineer on the YARN team at Hortonworks.
Julien Le Dem is the coauthor of Apache Parquet and the PMC chair of the project. He is also a committer and PMC Member on Apache Pig, Apache Arrow and a few others. Julien is a Principal Engineer at WeWork and was previously Architect at Dremio and tech lead for Twitter’s data processing tools, where he also obtained a two-character Twitter handle (@J_). Prior to Twitter, Julien was a principal engineer and tech lead working on content platforms at Yahoo, where he received his Hadoop initiation. His French accent makes his talks particularly attractive.
Aniket Mokashi is a tech lead on the engineering team that prototyped and built the Procella project at YouTube. Throughout his career, he has contributed to the development of large-scale data processing frameworks and platforms. Prior to Google, he worked on data platform teams at Twitter and Netflix. He is also a committer and PMC member on the Apache Parquet and Apache Pig projects. Aniket holds a Master's degree in Information Networking from Carnegie Mellon University.
Vinod Kumar Vavilapalli has been contributing to the Apache Hadoop project full-time since mid-2007. At the Apache Software Foundation, he is a long-term Hadoop contributor, Hadoop committer, member of the Apache Hadoop Project Management Committee, and a foundation member. He is a Director of Engineering at Hortonworks Inc and runs the Hadoop compute platform teams there. Before Hortonworks, he was at Yahoo!, working in the Grid team that made Hadoop what it is today, running at large scale, up to tens of thousands of nodes.
Vinod loves reading books of all kinds and is passionate about using computers to change the world for the better, bit by bit. He has a bachelor’s degree in computer science and engineering from the Indian Institute of Technology Roorkee. He can be reached at the Twitter handle @tshooter.
He is a PMC member of Apache Zeppelin and works at Line. In Apache Zeppelin, he focuses on stabilizing the project so that enterprises can use it in production. At Line, he is developing a new data pipeline for a new data platform, which includes all kinds of data produced by Line and its family of apps. He is keenly interested in developing and operating scalable, fault-tolerant applications that solve mission-critical problems.
Kat Petre is a technology rebel. A strong open source supporter, early adopter and fully dedicated to being in a state of constant learning, she is currently a Product Specialist at MapR Technologies. Previously, she served as a Solution Architect on Cloudera's professional services team, specializing in building secure yet usable, highly available distributed big data solutions to detect the scarce meaningful signals in data lakes of noise.
Always looking for company in tackling interesting problems with big data technologies, her ultimate goal is to create a simulation of the real world, fed by real-time data, and overclock it. Because the people who are crazy enough to think they can change the world are the ones who do.
Jesus Alvarez is a Technical Evangelist with a passion for Data Science, Security, and Crypto-Currency. Currently an advisory engineer, building integration tools to allow IBM DSX to integrate with open and closed source components.
A pioneer in the Hadoop ecosystem, building installers for "big data accelerators" in 2012 at IBM. Deep understanding of the importance of security, having spent 4 years in Healthcare IT/Software Design. Contributor to Apache Knox, Ambari, and an array of data science notebooks and tools.
A veteran in the realm of cryptocurrency, Jesus was pushing the limits of home electrical breakers in 2012-2014 running GPU and ASIC miners, and has been architecting high-frequency cryptocurrency arbitrage bots since the era before Mt. Gox and Cryptsy went dark.
Sven is a software engineer and data scientist on the IBM Data Science Experience dev team. He is passionate about the PyData ecosystem for data analysis and fascinated by the idea that data is a resource that can be mined with the right tools to improve the quality of our daily lives. As a software engineer he works on code generators that create data integration code for various cloud data sources. As a data scientist he works with clients to analyze their data and solve business problems. Through this double role he is constantly learning and using open source technologies such as Jupyter, Spark, Kafka, scikit-learn, statsmodels and Keras. Learning is not only a skill, but an attitude to life, and so is teaching. Sven shares his knowledge through talks at conferences and meetups.
Michael Ger has over 25 years of experience working in industry and Information Technology strategy roles. He has deep cross-industry knowledge in product development, manufacturing, supply chain and customer experience related business processes. As General Manager of Manufacturing and Automotive Industries at Hortonworks, Mike is responsible for driving the solution vision and go-to-market strategies within each industry segment and works with industry leaders to drive next-generation business insights through Big Data Analytics. Prior to joining Hortonworks, Mike worked at Oracle for over 20 years as their Automotive Industry lead, at A.T. Kearney as an Automotive Management Consultant and at General Motors (Saturn Division) as a Product Engineer.
Dr. Yu is the CTO of AsiaInfo Data, which provides big data and AI solutions to all three telecom carriers in China; its distributed big data platform processes over 7PB of data daily. Before joining AsiaInfo, Dr. Yu served as VP Engineering and Chief Architect for Mafengwo, the largest online travel community in China, with over 100 million mobile and online users. Previously, he was VP Engineering and Chief Architect at OpenX, responsible for the company's data strategy, mobile product line, and overall architecture, consisting of more than 6000 servers and 15PB of data distributed across 5 global data centers. Dr. Yu is also a serial entrepreneur, having co-founded two startups: Portaura, in social mobile big data, and Martsoft, an e-commerce search engine.
Early in his career, he spent a number of years at the HP (formerly DEC) Systems Research Center, one of the top research labs in the world, where he worked closely with numerous Turing Award recipients on browser technology, search engines, multimedia, and distributed file systems. Dr. Yu holds a PhD in Computer Science and Engineering from UNSW, and a BS in Computer Science and a BA in Mathematics from UT Austin. He has published papers and given keynote speeches at numerous major international conferences.
Data science expert with expertise in machine learning and big data systems. Leads innovation projects and R&D activities to promote data science best practice in many business verticals (Telco, Finance, Healthcare, etc.), pushing cutting-edge applications of AI and Data Science. Has published and presented research papers and posters at many top-tier conferences and journals, including ACM Computing Surveys, ACSAC, CEAS, EuroSec, FGCS, HiCoNS, HSCC, IEEE Systems Journal, MASHUPS, PST, SSS, TRUST, and WiVeC. Has served as a reviewer for many highly reputable international journals and conferences.
Ray is an architect and technical leader for Comcast's Enterprise Big Data ecosystem and collaborates to define strategic direction for the company's valuable data assets. He brings 25 years of experience in cable and telecommunications, in both startups and Fortune 50 companies, to deliver next-generation systems that build positive customer experiences and revenue-generating products for the company. He is an avid amateur wildlife and landscape photographer and enjoys traveling and spending time with his family in the beautiful state of Colorado.
Masato Asahara (Ph.D.) is currently leading the development of Spark-based machine learning and data analytics systems that fully automate predictive modeling. Masato received his Ph.D. degree from Keio University and has worked at NEC for 8 years as a researcher in the field of distributed computing systems and computing resource management technologies.
Yoshiki Takahashi is a master's student in computer science at the graduate school of the Tokyo Institute of Technology. His academic research proposal was accepted at SysML 2018, a venue that has attracted attention since its earlier era as a NIPS workshop. He worked on the development of a Spark-based machine learning platform for automatic predictive modeling during his internship at NEC Data Science Research Laboratories in 2017. He received his B.S. degree from the Tokyo Institute of Technology in 2017.
Gio is the Vice President of Data Technology at Canadian Tire Corporation. He brings with him 20 years of experience across different industries such as telecommunications, banking, and now retail. Currently, Gio oversees all of the data platforms and technology roadmaps within Canada's largest retailer and one of North America's most recognized brands.
Gio is an innovative leader who is constantly striving to enhance our customers’ end-to-end experience. While he enjoys complex challenges, he also has a strong passion for using a variety of methodologies to transform his deliverables.
Gio graduated from Delta College with a specialization in Network Engineering, as well as from the Banff Centre where he received a certificate of leadership experience.
Steve Loughran works on Hadoop at Hortonworks, currently on cloud storage integration, including improving integration with Amazon's S3 in Hadoop, Hive and Spark.
He's the author of Ant in Action, a member of the Apache Software Foundation, and a committer on the Hadoop core since 2009. Prior to joining Hortonworks in 2012, he was a Research Scientist at HP Laboratories.
He lives and works in Bristol, England.
Sanjay is founder and chief architect at Hortonworks, and an Apache Hadoop committer and member of the Apache Hadoop PMC. Prior to co-founding Hortonworks, Sanjay was the chief architect of core Hadoop at Yahoo and part of the team that created Hadoop. In Hadoop he has contributed to several areas, including HDFS, MapReduce schedulers, YARN's design, high availability, and compatibility.
He has also held senior engineering positions at Sun Microsystems and INRIA, where he developed software for distributed systems and grid/utility computing infrastructures.
Sanjay has a PhD in Computer Science from the University of Waterloo in Canada.
Robert Hryniewicz has spent over 10 years working on various projects related to Artificial Intelligence, Enterprise Software, IoT, Robotics, Blockchain and more. Currently, he’s a Data Scientist and Evangelist at Hortonworks. Previously, Robert was CTO at a Singularity Labs startup and a Sr. Architect at Cisco, NASA and others. He’s a frequent speaker at DataWorks / Hadoop Summits.
Technical architect building and operating big data platforms at Adobe. An engineer at heart and a data professional by practice, I like using technology to solve problems and deliver a great user experience.
Suneel is a member of the Apache Software Foundation and a committer and PMC member on Apache Mahout, Apache OpenNLP, and Apache Streams. He has presented in the past at Flink Forward, Hadoop Summit, Berlin Buzzwords, the Machine Learning Conference, Big Data Tech Warsaw and Apache Big Data.
SDE at Amazon AI
Previously Principal Engineer at GE Healthcare, specializing in Medical Imaging
Dinesh Chandrasekhar is a technology evangelist, a thought leader and a seasoned product marketer with over 23 years of industry experience. He has an impressive track record of taking new integration/mobile/IoT/Big Data products to market with a clear GTM strategy of pre- and post-launch activities. He has extensive experience working on enterprise software as well as SaaS products, delivering sophisticated solutions for customers with complex architectures. His areas of expertise include IoT, application/data integration, BPM, analytics, B2B, API management, microservices and mobility. He can articulate detailed use cases across multiple industry verticals like retail, manufacturing, utilities and healthcare. He is a prolific speaker, blogger and a weekend coder. He currently works at Hortonworks, managing their HDF product line. He is fascinated by new technology trends, including blockchain and deep learning.
Paige Bartley is a Senior Analyst in Ovum's Data and Enterprise Intelligence team specializing in all aspects of the data lifecycle including creation, cleansing, security, privacy, and productivity.
Working across the information management space, Paige researches how data use affects both large organizations and individuals alike. She provides insight and analysis into data ROI and successful organizational strategy.
Paige’s other areas of expertise include regulatory and legal matters, data quality, unstructured data and NLP, master data and records management, and neuroscience and cognitive science.
Prior to joining Ovum in 2016, she worked in research and marketing for ZL Technologies.
I have a great passion for technology. I am always keen to learn new skills, and I love how fast things are changing in this industry. I am a technical person, and I also like to connect with other passionate people in the technology sector. Knowledge sharing is very important to me, and I love mentoring colleagues. My latest challenge is to learn everything about Docker.
I am a real hands-on person and love to solve difficult and challenging problems. There is no greater joy than getting things done and having a well-working system. I am always ready to go the extra mile to deliver a finished and properly implemented project.
Zakeera is a Data Specialist who has worked at many of the leading financial institutions in South Africa. She has been involved in the full data lifecycle: designing and building enterprise physical and logical data warehouses, implementing master data management solutions, establishing enterprise data governance structures, and defining technical roadmaps to meet business strategies. Zakeera played a crucial role in the simplification of the data architecture at the Johannesburg Stock Exchange by combining the implementation of Master Data Management with Data Virtualisation. She has been invited to speak at the Open Group forum SA and also participated in a webinar on Data Virtualisation. She now leads the Big Data practice within Standard Bank.
Kristel is responsible for managing the delivery of business value from the Big Data stack. Kristel facilitates the prioritisation of business use cases for the Data Lake across all divisions in the Group.
Kristel is an agent for change in an ever-evolving technology landscape and has been instrumental in acceptance and approval of investment cases for open source technology.
Kristel supports the community of Data Science practitioners by facilitating regular Guilds.
Her experience spans the implementation of new technologies and platforms across the Enterprise in support of the Group strategy. Kristel is the Platform Owner for Big Data at Standard Bank.
Dr. Alex Xiaoyang Yang is the CTO and Chief Architect of IBM China Development Laboratory.
He has extensive experience with big data analytics in FSS, Transportation, and Telecom.
Software engineer at Google Seattle, working on Cloud Dataflow SDK, focusing on streaming SQL support for Apache Beam.
Krishna Potluri is a Big Data Architect at TMW Systems, a Trimble company. He is passionate about technology and has immense knowledge and experience designing and architecting Big Data solutions.
Donnie Wheat is the Senior Big Data Architect at TMW, with a devotion to business intelligence, data integration, and data science. Recently focusing on near-real-time analytics using Apache NiFi, Donnie has experience providing actionable business intelligence with the Hadoop platform, implementing data warehouses, and delivering transportation optimization.
Don Bosco Durai (Bosco) is a thought leader in enterprise security and a committer on open source projects like Apache Ranger, Apache Ambari, and Apache HAWQ. He has also contributed to the security of most of the Hadoop components. Bosco was a co-founder of XA Secure, which is the genesis of Apache Ranger. Bosco currently works at Privacera to automate discovery, control, and monitoring of large datasets in Big Data and Cloud environments.
Henry Sowell is Hortonworks' Technical Director for the Public Sector.
In this capacity, Mr. Sowell leads an engineering group responsible for the technical architecture and engineering of Big Data solutions supporting missions across the Intelligence Community, Department of Defense, Federal Civilian Government Agencies, and State, Local, and Higher Education institutions, helping improve speed to mission.
Prior to joining Hortonworks, Mr. Sowell used several technologies, including Apache Hadoop, to protect the nation in support of the FBI’s counterterrorism mission. In addition to supporting the counterterrorism mission, he leveraged these technologies to support cross-division law enforcement advancements with the FBI’s Cyber Division. Mr. Sowell enlisted in the United States Marine Corps in 2003. He served with distinction as a decorated combat veteran, having earned the Bronze Star with Valor for his actions in Iraq.
Stephen Wu is a senior program manager for big data at Microsoft.
Brent DeMar is a technical Senior Manager overseeing the development of next generation data ingestion pipelines for Discover Financial Services. Prior to this he managed ETL and real-time decisioning application development teams at Discover. He has also held architecture, operations, and senior engineering positions across a diverse set of technologies. Brent holds a BS in computer science from Purdue University.
Konstantin V. Shvachko is an expert in Big Data technologies, file systems, and storage solutions. He specializes in efficient data structures and algorithms for large-scale distributed storage systems. Konstantin is known as an open source software developer, author, inventor, and entrepreneur. He is a senior staff software engineer at LinkedIn.
Erik is a software engineer with a passion for all things distributed systems. He currently focuses on Big Data storage and analytics at LinkedIn, where his work centers on the scalability of HDFS, both internally and via contributions to open source. Erik is particularly excited by investigating research on new and interesting storage technologies, and is passionate about promoting female involvement and empowerment in the technology space.
Billie Rinaldi is a Principal Software Engineer I at Hortonworks, currently prototyping new features related to long-running services and containers in Apache Hadoop YARN. Prior to August 2012, Billie engaged in big data science and research at the National Security Agency, where she provided early leadership for Apache Accumulo. Billie is a member of the Apache Software Foundation and a committer for Apache Hadoop and a number of other Apache projects in the Hadoop ecosystem. She holds a Ph.D. in applied mathematics from Rensselaer Polytechnic Institute.
Shane Kumpf is a Software Engineer on the Apache Hadoop YARN R&D team at Hortonworks.
Ankit Singhal has been a committer and a member of the Apache Phoenix PMC (Project Management Committee) for more than 2 years. He has also been contributing to projects like HBase, Tephra, and Calcite. He specializes in designing and developing big data solutions for different lines of business. With over 7 years of Big Data experience, he has architected and created various analytics products and data warehouse solutions using technologies like Hadoop, Kafka, Hive, HBase, Phoenix, and Spark.
Engineering manager in the Big Data team at Microsoft.
Mr. Douty is a seasoned senior executive, with ample experience in roles ranging from startups to multinationals. Some of these roles are:
Director Telefonica Big Data Insight business
Director of Chinese JV: "The Smart Steps Digital Technology Company Limited"
Trading & Marketing Director
Created start-up B2B business
Plc board director
MD of health & beauty services business
General manager of retail trading business
Brand Director of major retail brands
Loyalty scheme development and management
UK & European marketing management
Devised and launched the Boots Advantage Card
While employed by some of the largest and most recognizable organizations in the health care, financial services, and digital publishing industries, Andy has led major strategic data integration and advanced analytics efforts of critical importance to those firms. Currently, Andy leads the strategic redesign of Health Care Service Corporation’s information architecture and data management activities, heavily leveraging big data technology and open source projects to improve the quality of members’ health care services within a rapidly changing industry.
Dr. Leon Li serves as Software Architect and designer of Northrop Grumman’s Hadoop based enterprise data analytics platform. He is an expert in Hadoop based enterprise system architectures, and advises Northrop Grumman executive leadership on analytics technologies. At Northrop Grumman, Leon previously served as Senior Software Engineer for a national cyber security information sharing program, led a university research effort in cryptography, and led systems engineering efforts on Cloud based big data systems for genomics research. Leon graduated with a PhD in Electrical Engineering from MIT.
Plamen J. Jeliazkov has been an HDFS contributor for about 6 years and is considered an expert by his peers. He specializes in HDFS, most notably the NameNode internals. He was part of the team that brought truncate functionality to Hadoop. He is currently a senior Hadoop engineer at PayPal. His excitement comes from shining and polishing HDFS clusters to work at their best.
From his Twitter:
"Programmer. Gamer. Nerd. UCSD alumni. I develop Hadoop and HBase. I like computer systems, video games, and crypto."
Matt Aslett is a Research Director for the Data Platforms and Analytics Channel at 451 Research. Matt has overall responsibility for the data platforms and analytics research coverage, which includes operational and analytic databases, Hadoop, grid/cache, stream processing, search-based data platforms, data integration, data quality, data management, analytics, machine learning and advanced analytics. Matt's own primary area of focus includes data management, reporting and analytics, and exploring how the various data platforms and analytics technology sectors are converging in the form of next-generation data platforms.
Matt is a regular speaker at client and industry events and has delivered keynotes and moderated panels at Strata + Hadoop World, Hadoop Summit, Percona Live MySQL Conference, GraphConnect, Data Leadership and NoSQL Roadshow. Matt has been named by AnalyticsWeek as being among the top 200 Thought Leaders in the field of Big Data and Analytics.
Larry is a Senior Development Manager and Architect on the Hortonworks security team. He is also a committer and PMC member for the Apache Knox and Apache Ranger projects, a committer for Apache Hadoop, and a contributor to security aspects of multiple Hadoop-related projects. He is a veteran in the enterprise middleware space with a specialization in platform management and security. Larry has extensive experience with Java EE application server technologies and has served on various expert groups for JSRs within the JCP for Java EE security. He has worked on various web services technologies and stacks, including SOAP and REST, with a focus on security.
Mark is currently in his fourth year on the Partner Engineering team at Hortonworks. He has over 29 years of experience working with data platforms, focusing on helping customers architect Advanced Analytics and Big Data platforms. Mark is currently focused on integrating Hortonworks products and solutions with strategic partners' capabilities, driving significant value for customers. Mark has a BS in Computer Science from North Carolina State University and also holds a Six Sigma Black Belt.
Viplava Madasu is a Big Data Systems Engineer at Hewlett Packard Enterprise, where he currently works on evaluating emerging big data technologies and creating reference architectures for HPE converged infrastructure platforms. Previously, he developed software in different groups at HPE, in the Application Server Middleware, Java HotSpot JVM, and SQL database engine areas. He holds a Master's degree in Computer Science from the Indian Institute of Technology, Kharagpur.