Highly efficient and results-oriented data scientist with strong quantitative skills, development experience and strong education background with a MSc (Imperial College London (World Rank within Top 10 QS)). Responsible self-starter with demonstrated experience in statistical programming language (R, Python, SAS) and programming language python for API’s. High ability holder on visualization with tools such as Tableau as well as good understanding of relational database such as SQL and oracle and non-relational database such as hbase, mongoDB and redis. Machine learning tools such as Hadoop, Spark, H2O, sparkling-water, pysparkling, SAS etc. as well as deep learning tools such as Keras, Tensorflow, Theano, MXnet, PyTorch. GPU cuda programming. Scaling data science. Expert in Predictive Modeling such as XGBoost, regression, Logit, Probit, GBM, RandomForest, Neural Network (generative model, GAN, VAE, RNN, CNN, word2vec etc.) , Naive Bays, K-nearest learn, PCA etc. (supervised learning, unsupervised learning, semi-supervised learning , reinforcement learning etc.) and also probabilistic modeling (PyMC3, Edward, Pyro) such as MCMC, HMC, NUTS, bayesian linear regression, variational models etc, Data mining skills such as parsing, nlp (natural language processing) and proficient in language modeling such as topic model, text clustering, word embedding, Word2Vec, Glove, text classification, RNN, Convolutional RNN etc. familiar with all the development environment such as Hadoop, Cloud (AWS, GCP, Azure) , GPU, Spark. etc.
Strong communication and relationship-building skills with diverse parties; fluent in English and Korean
Product Management lead at Uber with a focus on Data Platforms and Infra. I manage Uber's Storage, Analytics, BI, and Machine Learning product lines.
Tim Spann was a Senior Solutions Architect at AirisData working with Apache Spark and Machine Learning. Previously he was a Senior Software Engineer at SecurityScorecard ("http://securityscorecard.com/) helping to build a reactive platform for monitoring real-time 3rd party vendor security risk in Java and Scala. Before that he was a Senior Field Engineer for Pivotal focusing on CloudFoundry, HAWQ and Big Data. He is an avid blogger and the Big Data Zone Leader for Dzone (https://dzone.com/users/297029/bunkertor.html).
He runs the the very successful Future of Data Princeton meetup with over 1192 members at http://www.meetup.com/futureofdata-princeton/.
He is currently a Senior Solutions Engineer at Cloudera in the Princeton New Jersey area.
You can find all the source and material behind his talks at his Github and Community blog:
Dinesh Chandrasekhar is a technology evangelist, a thought leader and a seasoned product marketer with over 24+ years of industry experience. He has an impressive track record of taking new integration/mobile/IoT/Big Data products to market with a clear GTM strategy of pre-and-post launch activities. He has extensive experience working on enterprise software as well as SaaS products delivering sophisticated solutions for customers with complex architectures. His areas of expertise include IoT, Application/Data integration, BPM, Analytics, B2B, API management, Microservices and Mobility. He can articulate detailed use cases across multiple industry verticals like retail, manufacturing, utilities and healthcare. He is a prolific speaker, blogger and a weekend coder. He currently works at Cloudera, managing their Data-in-Motion product line. He is fascinated about new technology trends including blockchain and deep learning.
David is a Director of Solution Architecture at Streamlio, and also a contributor to the Apache NiFi, and Apache Pulsar projects. He was formerly the Practice Director at Hortonworks, where he was responsible for the development of best practices and solutions for the professional services team, with a focus on HDF-related technologies including Kafka, NiFi, and Storm. He is a co-author of “Practical Hive: A Guide to Hadoop’s Data Warehouse System”, and holds a B.S and Master’s Degree in Computer Science from Kent State University.
Akshitha Ramachandran is a junior at Harvard University pursuing a joint degree in both Computer Science and Statistics. She was a founding member and Lead Engineer at Harvard Student Agencies - DEV, a start-up focused on developing mobile and web applications for third party clients. She is both a senior developer and board member at ProMazo, a campus organization focused on partnering top students from leading universities with projects at leading companies. Additionally, Akshitha attends Hackathons and has been on the board of Harvard’s Women Engineers Code (WECode) conference. She is on the board of the Harvard College Consulting Group, where as an Associate Director of Engagement she is responsible for the acquisition, organization and execution of three client projects every semester. This past summer she spent time at Novetta expanding their Machine Learning practices, specifically in the Named Entity Resolution space. She has contributed to the company’s internal pipeline, designed a demo for them, and published some of her work (https://www.novetta.com/2018/09/named-entity-recognition-and-graph-visualization/ and https://www.novetta.com/2018/08/evaluating-solutions-for-named-entity-recognition/).
Don Bosco Durai (Bosco) is a thought leader in enterprise security and is a committer in open source projects like Apache Ranger, Apache Ambari, and Apache HAWQ. He has also contributed towards the security for most of the Hadoop components. Bosco was the co-founder of XA Secure, which is the genesis of Apache Ranger. Bosco is currently the co-founder of Privacera where he is tackling the data security challenges in modern data architecture, like Big Data and Cloud, where large data set constantly moves between different environments, which can result major security breaches or compliance violation if not managed properly. Privacera automates discovery of sensitive data, does transparent encryption/anonymization, manages access policies and monitors access.
Madhan Neethiraj is an Apache committer and PMC for Apache Atlas and Apache Ranger projects. He works at Hortonworks as Sr. Director of Engineering in Enterprise Security Team. His contributions include Apache Ranger features like audit framework, stack model, tag-based policies, masking and row-filter policies; and Apache Atlas features like V2 APIs, search enhancements. Prior to Hortonworks, Madhan was at Oracle in development of security access management suite, governance and real-time fraud detection/prevention products. Prior to Oracle, he was with Bharosa Inc. responsible for the development of real-time fraud detection solution for Financial Institutes, HealthCare and eCommerce.
Dave is an Enterprise Software Architect with over twenty-five years of technical leadership in the telecommunications, financial services, and healthcare domains. Dave’s diverse experience ranges from engineering event-based and rule processing systems at “PaaS” (Platform as a Service) scale to building an autonomous-agent workplace simulation engine. At Comcast, Dave is leading the end-to-end ingest, compute, and machine learning pipeline architectures for supporting Customer Experience Big Data applications.
Jeff is a software engineer and cloud architect. He is a committer and PMC on Apache OpenNLP. Jeff currently works on natural language processing pipeline projects and resides outside of Morgantown, WV.
At Partners & Co., Eric Wolok specializes in the sale of commercial real estate in Chicago. Partners & Co. uses open source tools such as emacs, sed, awk, Apache NiFi and Apache Spark to identify, track and facilitate unique investment opportunities for their clients.
Data science expert and software system architect with expertise in machine-learning and big-data systems. Rich experiences of leading innovation projects and R&D activities to promote data science best practice within large organizations. Deep domain knowledge on various vertical use cases (e.g., Finance, Telco, Healthcare). Currently working pushing the cutting-edge application of AI at the intersection of high-performance database and IoT, focusing on unleashing the value of spatial-temporal data. I am also a frequent speaker at various technology conferences, including: O’Reilly Strata AI Conference, NVidia GPU Technology Conference, Hadoop Summit, DataWorks Summit, Amazon AWS re:Invent, Global Big Data Conference, Global AI Conference, World IoT Expo, Intel Partner Summit, presenting keynote talks and sharing technology leadership thoughts.
Received my Ph.D. from the Department of Computer and Information Science (CIS), University of Pennsylvania, under the advisory of Professor Insup Lee (ACM Fellow, IEEE Fellow). Published and presented research paper and posters at many top-tier conferences and journals, including: ACM Computing Surveys, ACSAC, CEAS, EuroSec, FGCS, HiCoNS, HSCC, IEEE Systems Journal, MASHUPS, PST, SSS, TRUST, and WiVeC. Served as reviewers for many highly reputable international journals and conferences.
Michael is a Senior Solutions Engineer at Hortonworks on the Public Sector team. He has worked in the public sector space for 19 years in a broad variety of IT roles, with more than 13 years’ experience as a Solutions Architect. Michael has been focused on the Hadoop space for the last 4 years. His other “big data” passion is information retrieval using Solr and Elasticsearch.
Chris Bove is a Solutions Engineer at Hortonworks. He has designed and delivered big data solutions across multiple vendor distributions of Hadoop (HDP, CDH, and MapR). Chris has experience working with project teams for commercial and government customers, and deploying production systems on-premise and in the cloud.
Solutions Engineer with Hortonworks for almost 5 years serving the Federal sector with a heavy emphasis on DoD customers.
Sridhar is an Enterprise Architect delivering high impact IT solutions with cross functional executions. He comes with many years of applications programming in diverse industries including Retail, Healthcare, Manufacturing, Utilities and Telco. Stint includes building and managing operations for multi-tenant Hadoop clusters consisting over 500 nodes and growing, where he focuses on optimized and stable clusters, proactive maintenance and efficient operations.
I am currently working for Hortonworks as Senior Software Engineer focused on data management products. Actively contributing to the Hortonworks DataPlane Services platform and Hortonworks Data Lifecycle Manager. Prior to Hortonworks, I worked at Informatica in the Intelligent data warehouse and big data platform using Hadoop, Hive, and Teradata connectors. Prior to Informatica, I worked at Teradata in Data Movement products such as Teradata Parallel Transporter and Teradata connector for Hadoop.
Niru Anisetti is the product manager for Data Lifecycle Manager at Hortonworks. She is part of a passionate team building the next generation disaster recovery product to make millions of data managers’ lives easier. Before Hortonworks, she worked at IBM, Intuit and Yahoo among other companies to build products to not only generate revenues but to change lives of people for the better. She can be reached at email@example.com.
Have more then 15+ years of Java experiences and during theses years worked with allmost all the form of Java solutions from the low-latency multithread application to highly distributed enterprise application as developer, architect and trainer. Currently working with the Apache bigdata projects and created various type of containerized solution for the components of the Hadoop ecosystem.
Founder of the first Hungarian Java User group and regular speaker at meetup events and conferences.
Committer of Apache Hadoop and Apache Ratis project and working on the Apache Hadoop Ozone project and the dockerization of Apache Hadoop,
Experienced Software development professional with a strong exposure in various big data technologies. Skilled in Hadoop eco system components(HDFS, MapReduce, Pig, Hive, SQOOP), Cassandra, Spark, Core Java, Scala, Relational databases and Data warehousing, and also possess good skills in various SDLC methodologies.
Kamil is a technology leader in the large scale data warehousing and analytics space. He is CTO of Starburst, the enterprise Presto company. Prior to co-founding Starburst, Kamil was the Chief Architect at the Teradata Center for Hadoop in Boston, focusing on the open source SQL engine Presto. Previously, he was the co-founder and chief software architect of Hadapt, the first SQL-on-Hadoop company, acquired by Teradata in 2014.
Kamil began his journey with Hadoop and modern MPP SQL architectures about 10 years ago during a doctoral program at Yale University where he co-invented HadoopDB, the original foundation of Hadapt’s technology.
Kamil holds an M.S. in Computer Science from Wroclaw University of Technology and as well as M.S. and an M.Phil. in Computer Science from Yale University.
Dr. Dhabaleswar K. (DK) Panda is a Professor and University Distinguished Scholar of Computer Science at the Ohio State University. He obtained his Ph.D. in computer engineering from the University of Southern California. His research interests include parallel computer architecture, high-performance computing, communication protocols, big data, deep learning, files systems, network-based computing, and Quality of Service. He has published over 450 papers in major journals and international conferences related to these research areas. Dr. Panda and his research group members have been doing extensive research on modern networking technologies including InfiniBand, Omni-Path, High-Speed Ethernet and RDMA over Converged Enhanced Ethernet (RoCE). His research group is currently collaborating with National Laboratories and leading InfiniBand and Ethernet/iWARP companies on designing various subsystems of next-generation high-end systems. The MVAPICH2 (High-Performance MPI over InfiniBand, iWARP, and RoCE) open-source software package, developed by his research group, are currently being used by more than 2,925 organizations worldwide (in 86 countries). This software has enabled several InfiniBand clusters (including the 1st one) to get into the latest TOP500 ranking. These software packages are also available with the Open Fabrics stack for network vendors (InfiniBand and iWARP), server vendors and Linux distributors. The new RDMA-enabled Apache Hadoop and Memcached packages, consisting of acceleration for HDFS, MapReduce, RPC and Memcached, are publicly available from http://hibd.cse.ohio-state.edu. Dr. Panda's research is supported by funding from US National Science Foundation, US Department of Energy, and several industry including Intel, Cisco, SUN, Mellanox, QLogic, NVIDIA and NetApp. He is an IEEE Fellow and a member of ACM. More details about Dr. Panda, including a comprehensive CV and publications are available at http://web.cse.ohio-state.edu/~panda.2/.
Dr. Xiaoyi Lu is a Research Assistant Professor in the Department of Computer
Science and Engineering at the Ohio State University, USA. His current research
interests include high performance interconnects and protocols, Big Data,
Hadoop/Spark/Memcached Ecosystem, Parallel Computing Models (MPI/PGAS),
Virtualization, Cloud Computing, and Deep Learning. He has published over 100
papers in International journals and conferences related to these research
areas. He has been actively involved in various professional activities (PC
Co-Chair, PC Member, and Reviewer) in academic journals and conferences.
Recently, Dr. Lu is leading the research and development of RDMA-based
accelerations for Apache Hadoop, Spark, HBase, and Memcached, and OSU HiBD
micro-benchmarks, which are publicly available from
(http://hibd.cse.ohio-state.edu). These libraries are currently being used by
more than 290 organizations from 34 countries. More than 27,700 downloads of
these libraries have taken place from the project site. He is a core member of
the MVAPICH2 (High-Performance MPI over InfiniBand, Omni-Path, Ethernet/iWARP,
and RoCE) project and he is leading the research and development of
MVAPICH2-Virt (high-performance and scalable MPI for hypervisor and container
based HPC cloud). He is a member of IEEE and ACM. More details about Dr. Lu are
available at http://web.cse.ohio-state.edu/~lu.932/.
Carolyn Duby is a Solutions Engineer and Cyber Security SME at Hortonworks, where she helps customers harness the power of their data with Apache open source platforms. Previously, she was the architect for cybersecurity event correlation at SecureWorks. A subject-matter expert in cybersecurity and data science, Carolyn is an active leader in the community and frequent speaker at Future of Data meetups in Boston, MA, and Providence, RI, and at conferences such as Strata Data Conference, Dataworks Summit, Open Data Science Conference and Global Data Science Conference. Carolyn holds an ScB (magna cum laude) and ScM from Brown University, both in computer science. She is lifelong learner and recently completed the Johns Hopkins University Coursera Data Science Specialization.
Terry Padgett is an accomplished Hadoop Systems Architect, with over 8 years of hands-on installation, integration and development with Hadoop technologies. Terry also has extensive experience in the development and application of advanced information technologies, providing software project leadership, software architecture development and assisting the customer in the application of technologies to provide capabilities and solve pressing problems. A seasoned technical lead and software developer, Terry is experienced with multiple programming languages, among them Java and C, with application throughout the entire software development lifecycle.
I am currently a Engineering Manager at Uber where I am a member of the Hadoop Platform team working on large scale data ingestion and dispersal pipelines and libraries leveraging Apache Spark. I was also previously the tech lead on the metrics team at Uber Maps building data pipelines to produce metrics to help analyze the quality of our mapping data. Before joining Uber, I worked at Twitter as an original member of the Core Storage team building Manhattan, a key/value store powering Twitter's use cases. I love learning anything about storage and data platforms and distributed systems at scale.
Dr. Alex Xiaoyang Yang is the CTO and Chief Architect of IBM China Development Laboratory.
He has extensive experience with big data analytics in FSS, Transportation, and Telecom.
Software Engineer at Imply
Tijo is an accomplished Hadoop Expert, with over 6 years of hands-on development with Hadoop and Streaming technologies, and has over 15 years of Software Industry experience. Primarily worked with Hadoop developments related to scalability, performance, load balancing, failover, and fault tolerance improvements and solutions. Having 5-year experience in batch processing (HDFS, Yarn, Spark, and Hive) and over 3 years of experience in Apache NiFi , Ranger and Atlas. Have good exposure to handling Architecture and Design for bigdata involved solutions and POCs. Exposed to cluster operation and management systems for large scale Hadoop clusters
Barbara Eckman is a Principal Software Architect at Comcast. She leads data governance for an innovative, division-wide initiative comprising near-real-time ingesting, streaming, transforming, storing, and analyzing Big Data. Barbara is a recognized technical innovator in Big Data architecture and governance, as well as scientific data and model integration. Her experience includes technical leadership positions at a Human Genome Project Center, Merck, GlaxoSmithKline, and IBM. She served on the IBM Academy of Technology, an internal peer-elected organization akin to the National Academy of Sciences.
Accomplished product owner with multiple years of professional experience working with leading-edge technologies supporting a large landscape of business cases. Proven problem solving skills, best management practices & result-oriented decision making capabilities
I am a data enthusiast and currently working at Hortonworks as a Solutions Engineer in the United States SF bay area. I have immense fascination and equal amount of passion for anything related to data and cloud.
As a former retail and consumer goods executive and more recently as a business strategy consultant and solution provider, Brent has extensive experience working with a variety of retail and consumer goods companies to provide thought leadership and help them to align strategic business objectives with technology and analytic solutions to create a differentiated competitive advantage in the marketplace.
He has an extensive track record of imagining, designing and executing high impact business solutions, driving innovation and transformation for retail and consumer goods organizations. Brent is passionate about analytics, emerging technologies, consumer behavior, collaborative supply chains and retail transformation.
As General Manager of Retail and Consumer Goods Solutions at Hortonworks, Brent is responsible for driving the solution vision and go-to-market strategies with each segment. As industry leaders increasingly invest in Big Data Analytics to help drive transformation within their organizations,
Brent engages globally to share, discuss, provide keynote talks, and facilitated workshops to help define and create solutions to drive next-generation insights and positive business outcomes across the value chain.
Yanbo is a staff software engineer at Hortonworks. He is working on the intersection of system and algorithm for machine learning and deep learning. He is an Apache Spark PMC member and contributes to several open source projects such as TensorFlow, Keras and XGBoost. He delivered the implementation of some major Spark MLlib algorithms. Prior to Hortonworks, he was a software engineer at Yahoo! and France Telecom working on machine learning and distributed system.
Nitin Khandelwal is working at Qubole as a Staff Engineer. He has worked in a different arena of projects like adding encrypted communication for ephemeral clusters nodes running in the cloud, providing Hive as a multi-tenant service, Autoscaling, etc. He has been contributing significantly in optimizing Tez engine for ETL workloads by adding features like workload-aware autoscaling, fault-tolerance, effective use of spot nodes, etc.
Previously, Nitin was working with Microsoft on VPN Site-to-site gateway service which forms the backbone of Microsoft Azure Stack's network.
Nitin has completed his Masters in Computer Science from IIIT-Hyderabad. His main areas of focus there were distributed computing, databases and networks.
Shreya Bhatia is working in Qubole as a Member of Technical Staff. She works there on Hive Stack, and has been part of projects like providing Hive as a service on a cloud agnostic platform, building Metrics and alerting solution for HiveServer2 and stabilizing it under a highly concurrent load, performance analysis of MapReduce on Yarn in the Qubole Stack etc.
She completed here Masters in Computer Science from Stony Brook University, New York in 2016. Previously she was working in India with InfoEdge (Naukri.com) as part of Search Team and worked on building extraction systems like Resume/Email parser, Job Crawler etc.
Ian Brooks holds a Ph.D. in Computer Science from University of North Texas, and his dissertation focused on virtual teams, leadership, and predictive analytics. He is committed to improving his craft, and he has a great passion for science, data, and computing. Currently, Ian is a member of the Public Sector team at Hortonworks, and he recently relocated to Washington DC. When he isn't stressing over the details, Ian enjoys mountain biking, kettlebells, and beer making.
Pradeep is a Big Data Engineer at Hotels.com in London where he builds and manages cloud infrastructure and core services like Apiary. Pradeep has worked in the big data space for the last 7 years, building large scale platforms.
Elliot is a principal engineer at Hotels.com in London where he designs tooling and platforms in the big data space. Prior to this Elliot worked in Last.fm’s data team, developing services for managing large volumes of music metadata.
Kai Liu is a Senior Program Manager in AI and Research group of Microsoft. He has 8 years of experience in data driven engineering, big data platform and AI infrastructure for Office and Bing product families. He led his team to create a service health portal for SharePoint Online, inject a distributed log collection and storage system for Exchange Online, publish curated data sets, key business metrics, and enable sub-hour experimentations in Office 365.
Currently he is working on the next generation of Big Data and Deep Learning platform for Bing based on Open Source technologies.
Sanjeev Koranga is leading the PayPal’s instrumentation and analytics platform team. He is responsible for making PayPal’s behavioral analytics self-serve & designing and developing systems for turning data into meaningful insights.
Sunil Govindan is contributing to Apache Hadoop project since 2013 in various roles as Hadoop Contributor, Hadoop Committer and member Project Management Committee (PMC). He is working as Staff Software Engineer at Hortonworks in YARN team. He is majorly contributing in YARN Scheduling improvements such as Intra-Queue Resource preemption, Multiple Resource types support in YARN with Resource Profiles, Absolute Resource configuration support in Queues etc. He also drove efforts to improve YARN UI for better user experience with community. Before Hortonworks, he worked at Juniper on a custom resource scheduler. Prior to that, he was associated with Huawei and worked on Platform and Middleware distributed systems including Hadoop platform. He loves reading books, an ardent music lover and passionate about go-green efforts.
Weiwei is a Staff Engineer working at Hortonworks, a Apache Hadoop committer and PMC member. He has been working on Hadoop for over 8 years, and contributed to both HDFS and YARN. His work mainly includes storage features like Ozone metadata store, garbage collection, and scheduling features like YARN placement constraints, async scheduling and CSI adoption etc. Before Hortonworks, he worked in Alibaba’s data infrastructure team, with experiences of evolving big data platform at 10k+ nodes scale. Prior to that, he worked in IBM for several years as one of the startup member of Biginsights project.
Leader of HDFS/ZooKeeper project at Xiaomi, focus on distributed filesystem. 6 years experience on large scale distributed storage system
Owen O'Malley is a co-founder and technical fellow at Hortonworks, a rapidly growing company (25 to 1,000 employees in 5 years), which develops the completely open source Hortonworks Data Platform (HDP). HDP includes Hadoop and the large ecosystem of big data tools that enterprises need for their data analytics. Owen has been working on Hadoop since the beginning of 2006 at Yahoo, was the first committer added to the project, and used Hadoop to set the Gray sort benchmark in 2008 and 2009. In the last 8 years, he has been the architect of MapReduce, Security, and now Hive. Recently he has been driving the development of the ORC file format and adding ACID transactions to Hive. Before working on Hadoop, he worked on Yahoo Search's WebMap project, which was the original motivation for Yahoo to work on Hadoop. Prior to Yahoo, he wandered between testing (UCI), static analysis (Reasoning), configuration management (Sun), and software model checking (NASA). He received his PhD in Software Engineering from University of California, Irvine.
Srikanth Venkat is currently responsible for Security & Governance portfolio of products at Hortonworks which include Apache Knox, Apache Ranger, Apache Atlas, Platform wide security and Hortonworks DataPlane Service. Prior to Hortonworks, Srikanth has held multiple roles in areas of cloud services, marketplaces, security, and business applications. His experience includes leadership across Product Management, Strategy and Operations, and Technical Architecture with broad experience in startups to global organizations including Telefonica, Salesforce.com, Cisco-Webex, Proofpoint, Dataguise, Trilogy Software, and Hewlett-Packard. Srikanth holds a PhD in Engineering with a focus on Artificial Intelligence from University of Pittsburgh, and an MBA in General Management from Indiana University and a Masters in Global Management from Thunderbird School of Global Management. Srikanth is a Data Sciences & Machine Learning hobbyist and enjoys tinkering with Big Data technologies.
I am an Engineering Manager with 6+ Years of Data Engineering, Data Science & Analytics Experience across Retail, Finance & Marketing industries. In my current role at WalmartLabs, I lead the engineering team for the customer experience domain and we have been primarily working on a data lake initiative to bring all the various data assets at Walmart under one single platform. I have worked across various database systems and have been extensively working on Hadoop stack in my past 4 years at Walmart. As part of our data lake migration, I worked building end to end data platforms on Hadoop and also worked on evaluating multiple Query acceleration layers on top of Hadoop and as part of this we have evaluated Druid, LLAP, Spark, Kinetica etc to power our BI platforms.
My Speaking & Presentation Experience:
@NWA IISE conference: Topic: 'Data Cafe: Enabling Real Time Insights Through Visualization'
@Bentonville Data Science Meetup: ' Data Cafe: Ask Me Anything - Bot Framework using NLP'
My work at WalmartLabs was featured on Forbes as one of the "The Most Practical Big Data Use Cases Of 2016" https://www.forbes.com/sites/bernardmarr/2016/08/25/the-most-practical-big-data-use-cases-of-2016/#1a1206531625
This is Abhishek Gupta with around 5 years of professional experience in IT Industry, currently working in Walmart Labs as a Software Engineer 2. At Walmart, I am primarily working in the Data Lake Initiative of Walmart; mainly involved in building the data pipeline by gathering requirements from business, understanding the business rules, performing ETL and developing reports. So, I'm an active user of engineering tools and technologies such as Hadoop, Hive, Spark etc.
Prior to this, I had worked for more than 2 years in the area of data warehousing and business intelligence at AIG (American International Group). Apart from this, I have pursued my Master's in Management Information Systems with the specialization in Data Analytics from the University of Arizona.
Being a President at IET (Institution of Engineering & Technology) during my under graduation, I have given talks on various topics to engineering students. Apart from this, I have also organized and led career events during my Master's program at the University of Arizona. Moreover, I recently had the chance to be a host at the "Open Data Science Conference West 2018".
Senior Technologist at American Water working on HDP & HDF.
Experienced Technologist with a demonstrated history of working in the utility space.
Expert in Hadoop, Hive, Spark, and NiFi.
Notable expert clinical information systems specialist offering 25-plus years of strategic leadership. Successful architect of healthcare data warehouses, clinical and business intelligence tools, big data ecosystems, and a health information exchange.
Excel at leading the development of long-term systems strategy for major medical organizations and executing plans to select innovative technology, implement systems, and leverage and maximize system functionality to enhance the health care delivery process.
Evangelist for the use of clinical technology to drive daily operations, analysis, and decisioning. Reputation for building consensus among medical, nursing and research leadership, clinical departments, and IT.
Scope of expertise encompasses designing technology-enabled processes, leading the clinical model for transformation, coordinating clinical workflow, ensuring the use of standardized data elements in clinical systems to meet clinical and research data warehouse requirements, leading teams through development and launch, and directing training.
Charles Boicey MS, RN-BC is the chief innovation officer for Clearsense, an outcomes-driven healthcare technology company based in Jacksonville, FL. Previously, Charles was the enterprise analytics architect for Stony Brook Medicine, where he developed the analytics infrastructure to serve the clinical, operational, quality, and research needs of the organization. He was a founding member of the team that developed the Health and Human Services award-winning application NowTrending to assist in the early detection of disease outbreaks by utilizing social media feeds. Charles is a former president of the American Nursing Informatics Association.
Paul Boal is a nationally renowned speaker, educator and expert on healthcare information management and analytics solutions. Currently serving as the Vice President of Delivery at Amitech, Paul applies his 15+ years of experience with the development, promotion and implementation of enterprise strategies across a range of information management disciplines to lead the management of the firm’s delivery program, practice growth and maturity, and business development.
In addition to his position as an adjunct professor at both Washington University and St. Louis University, Mr. Boal is currently focused on helping healthcare companies adapt to an evolving industry via integrated big data and advanced analytics strategies and solutions, as well as leading the technical direction for Amitech’s IoT Population Health Management platform and Advanced Value-Based Contract Management solutions.
Mayank Kejriwal is a research scientist at the University of Southern California's Information Sciences Institute (ISI), and a research assistant professor in the Department of Industrial and Systems Engineering. He received his Ph.D. from the University of Texas at Austin. His dissertation involved Web-scale data linking, and in addition to being published as a book, was recently recognized with an international Best Dissertation award in his field. His research is highly applied and sits at the intersection of knowledge graphs, social networks, Web semantics, network science, data integration and AI for social good. He has contributed to systems used by both DARPA and by law enforcement, and has active collaborations across academia and industry. He is currently co-authoring a textbook on knowledge graphs (MIT Press, 2018), and has delivered tutorials and demonstrations at numerous conferences and venues, including top academic venues such as KDD, AAAI, and ISWC, and industrial venues . He is currently serving as general chair of the ACM K-CAP conference in 2019, and is co-editing a special issue on knowledge graphs in the Semantic Web Journal. He was awarded a Key Scientific Challenges award in 2018 by the Allen Institute for Artificial Intelligence, and was recently named a Forbes Under 30 Scholar. He has also been nominated as a 2019 Forbes 30 Under 30 in the Science category.
Sridhar is a technology leader and currently responsible for building a Finance data lake in Walmart. He is Sr Manager II Engineering, Global Data Analytics Platform, Walmart. Before working in the Data Analytics area, Sridhar led multiple HR implementations in Walmart. Previously, he worked in Deloitte Consulting, Hyderabad.
Sridhar has 15+ years of IT experience in Retail, Healthcare and Finance domains.
@NWA Arkansas IISE Chapter Conference on Data Cafe: Enabling Real-Time Insights Through Visualization
Pardeep is a Senior Solutions Architect at Cloudera. He has worked in the Big Data space for 9 years in a broad variety of roles.
Michael Ger has over 25 years of experience working in industry and Information Technology strategy roles. He has deep cross-industry knowledge in product development, manufacturing, supply chain and customer experience related business processes. As General Manager of Manufacturing and Automotive Industries at Hortonworks, Mike is responsible for driving the solution vision and go-to-market strategies within each industry segment and works with industry leaders to drive next-generation business insights through Big Data Analytics. Prior to joining Hortonworks, Mike worked at Oracle for over 20 years as their Automotive Industry lead, at A.T. Kearney as an Automotive Management Consultant and at General Motors (Saturn Division) as a Product Engineer.
As General Manager for Insurance, Cindy Maike is responsible for global insurance strategy and customer engagement for Hortonworks. She works with customers and partners leveraging analytics for current day business growth and exploring the use of new data sources to drive innovation in the evolving world of insurance. She has over 25 years of finance, consulting and advisory services experience in the insurance industry working with clients globally on their business strategy leveraging analytics and technology to further drive business results.
Cindy has deep industry knowledge in both claims and underwriting with a focus on the use of analytics and data to enhance business outcomes. She has held positions with the IBM Watson Solution Group, Carrier Insurance, Director of Strategy at ACORD, and was co-founder of Strategy Meets Action Research and Advisory Services. Cindy has also held and is a CPA.
She is passionate about solving business problems and eternally believes in process improvement and strongly believes that today's next generation of business intelligence in the form of advanced analytics will revolutionize the insurance industry. Cindy frequently speaks to business events on the value of business analytics, cognitive computing and the evolution of insurance in a connected world.
Sanjay is a telecom industry veteran with extensive experience in the strategy and execution of next generation data-centric industry solutions for enhancing customer experience, optimizing network operations and increasing revenue generation through digital transformation.
Sanjay currently leads the global communications & media business at Hortonworks helping communication service providers leverage Hadoop and NiFi to transform their data into a force of business growth and competitive differentiation and to drive data-centric solutions for the connected world & for Industrial IOT. Previously, he held executive roles, leading the global telecom industry business, solutions, and strategy at VMware, Pivotal, Progress Software, Savvion, and TMNG and has help drive business transformation, end-to-end architecture and new business initiatives at Bell Canada, Level3, AT&T Canada, Iowa Telecom, ETB, ATT/Ameritech, Wingcast, and other global service providers.
Lohit is part of Hadoop and Log Management team at Twitter. He has been concentrating on scaling Hadoop FileSystem, Hadoop Resource Manager, Log Ingestion and Processing pipelines at Twitter. Previously he has worked at few startups building scalable file systems and was also part of Hadoop team at Yahoo! when it was open sourced. He has Masters degree in Computer Science from Stony Brook University.
Vrushali Channapattan is an active Apache Hadoop Committer & PMC member who is currently working in the Hadoop team at Twitter focusing on ensuring that Hadoop can keep meeting the rapidly expanding storage and computation needs at Twitter. In past roles, she has also worked with Intuit, Yahoo!, Oracle, Persistent Systems and Tata Institute of Fundamental Research in India.
Love to learn. Learn to success.
Henry Sowell is Hortonworks Technical Director in the Public Sector.
In this capacity, Mr. Sowell leads an engineering group responsible for the technical architecture and engineering of Big Data solutions supporting missions across the Intelligence Community, Department of Defense, Federal Civilian Government Agencies, and State, Local, and Higher Education institutions, helping improve speed to mission.
Prior to joining Hortonworks, Mr. Sowell used several technologies, including Apache Hadoop, to protect the nation in support of the FBI’s counterterrorism mission. In addition to supporting the counterterrorism mission, he leveraged these technologies to support cross-division law enforcement advancements with the FBI’s Cyber Division. Mr. Sowell enlisted in the United States Marine Corps in 2003. He served with distinction as a decorated combat veteran, having earned the Bronze Star with Valor for his actions in Iraq.
Leo Garciga serves as the Joint Improvided-Threat Defeat Directorate (JD) Chief Technology Officer under the Defense Threat Reduction Agency (DTRA). In his role, he provides leadership and oversight of Mission Information Technology services and personnel that directly contribute to the implementation of the DTRA mission and its support to the warfighter, Department of Defense (DoD), Combatant Commanders, Coalition partners, the Intelligence & Interagency organizations.
Mr. Garciga is also DTRA JD senior information technology advisor, who discovers and rapidly implements new technology and innovation to counter threat networks, improvised threats and improvised explosive devices to support counter-terrorism and counter-insurgencies operations and to prevent battlefield surprise.
He advocates and spearheads efforts across DoD, the Intelligence Community, US Government Agencies, academia and industry to integrate a myriad of Research and Development work to rapidly introduce new information technology that provides immediate operational impacts for the warfighter and the nation. His efforts have resulted in continuous enhancements to Catapult, a rapid response data analytic platform, to improve situational awareness to thousands of users. He made JIDO (JD) an early adopter and leader in DoD of the implementation of Secure Dev Ops, which unified security, software development and operations to automate processes for innovation in information technology. He also is key contributor to DoD understanding of the potential of artificial intelligence and machine learning to future missions.
Mr. Garciga has a BA in Mechanical Engineering Technology, is a certified Information Technology professional. He has also served in a variety of roles in DoD, to include active duty service in the US Navy, the Combatant Commands, and the Intelligence Community.
Suresh Yadagotti Jayaram is the Senior IT Application Architect for Florida Blue, Florida’s Blue Cross and Blue Shield company, which is the largest health insurance provider in the state. His extensive experience includes software architecture and engineering leadership roles at multiple global firms including HP, PayPal, Tata, and Deloitte. Suresh is passionate about business intelligence and implementing business architecture to reflect strategies that support elite IT departments, regardless of industry. He holds a master’s degree in Innovations and Entrepreneurship from HEC, Paris.
Praveen Kanumarlapudi is a Lead Data Engineer with Aetna’s (a CVSHealth company) Global Security team. Prior to his time at Aetna, Praveen worked on big data solutions for Apple and Bank of America.