Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive Data Across Data Lakes

Understanding Your Crown Jewels: Finding, Organizing, and Profiling Sensitive Data Across Data Lakes

Thursday, April 19
2:50 PM - 3:30 PM

Emerging regulations such as GDPR and increasing incidence of data breaches such as those at Equifax are bringing a firm’s handling and processing of sensitive data such as personal data of its customers and employees into focus. Enterprises need to now be able to discover and manage sensitive data usage to answer compliance and regulatory reporting requirements and to prevent any reputational damage in the event of a data breach. In this talk, we will outline how using the foundation of open source technologies such as Apache Ranger, Apache Atlas and the recently announced Hortonworks DataPlane Service platform components data stewards, analysts, and data engineers can better understand their sensitive data assets across multiple data lakes at scale. We will demonstrate how enterprises can get a comprehensive 360-degree view of their sensitive data including where such data is located, who is accessing what data and how frequently, when was such data accessed, deleted, moved, how is the data protected, and where did this data come from. In addition we will show how such data can be discovered and profiled to understand their characteristics. We will also demonstrate organization and classification use cases for such sensitive data to facilitate their curation into collections for various business purposes and how such collections can be aggregated and summarized to provide a single view of sensitive data footprint in an enterprise from risk management and audit/compliance/forensics perspectives.

Presentation Video


Srikanth Venkat
Senior Director, Product Management
Srikanth Venkat is currently responsible for Security & Governance portfolio of products at Hortonworks which include Apache Knox, Apache Ranger, Apache Atlas, Platform wide security and Hortonworks DataPlane Service. Prior to Hortonworks, Srikanth has held multiple roles in areas of cloud services, marketplaces, security, and business applications. His experience includes leadership across Product Management, Strategy and Operations, and Technical Architecture with broad experience in startups to global organizations including Telefonica, Salesforce.com, Cisco-Webex, Proofpoint, Dataguise, Trilogy Software, and Hewlett-Packard. Srikanth holds a PhD in Engineering with a focus on Artificial Intelligence from University of Pittsburgh, and an MBA in General Management from Indiana University and a Masters in Global Management from Thunderbird School of Global Management. Srikanth is a Data Sciences & Machine Learning hobbyist and enjoys tinkering with Big Data technologies.
Vidyash OU
Software consultant, Big data systems, Java performance