Building trust in your data lake. A fintech case study on automated data discovery, control and monitoring data leveraging Apache Ranger and Apache Atlas

Thursday, April 19
2:00 PM - 2:40 PM
Room IV

This talk walks through lessons learned from the HDP implementation at G-Research, a leading FinTech company based in London.
The team at G-Research implemented the Hortonworks Data Platform (HDP) to build a data lake and
enable business teams to build analytics and machine learning tools. The team found it difficult
to accurately control and manage sensitive data, and business teams were not able to search
through the data because it was not classified.
G-Research implemented the Privacera auto-discovery solution to precisely discover and tag data
as it is ingested into the HDP environment. The tags are pushed to Apache Atlas and then to
Apache Ranger to enable tag-based policies. The G-Research team also built custom tools to push Spark lineage
information into Atlas. Finally, Privacera monitoring tools continuously analyze access audit information and
raise an alert if sensitive data is moved to folders that might not be protected.
As a result, the security team gained real visibility into the sensitive data, and business users can
search for and find data with the appropriate data classification in place.
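
The tag-based control and monitoring flow described in the abstract can be illustrated with a minimal Python sketch. This is not the Privacera, Apache Atlas, or Apache Ranger API, and it is not G-Research's implementation. Every name in it (`TAG_POLICIES`, `PROTECTED_PREFIXES`, `AuditEvent`, `is_allowed`, `find_unprotected_moves`) and every path is a hypothetical stand-in chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical tag-based policy table: tag -> groups allowed to read data with that tag.
TAG_POLICIES = {
    "PII": {"security", "compliance"},
    "PUBLIC": {"security", "compliance", "analytics"},
}

# Hypothetical folders assumed to be covered by access policies already.
PROTECTED_PREFIXES = ("/data/secure/",)


@dataclass(frozen=True)
class AuditEvent:
    """A simplified access audit record (assumed shape, for illustration only)."""
    user: str
    action: str
    source: str
    target: str


def is_allowed(group, tags):
    """Allow access only if the group is permitted for every tag on the resource.

    In this sketch an untagged resource is allowed and an unknown tag denies access.
    """
    return all(group in TAG_POLICIES.get(tag, set()) for tag in tags)


def find_unprotected_moves(events, tags_by_path, sensitive_tags=frozenset({"PII"})):
    """Return move events that take sensitively tagged data outside protected folders."""
    alerts = []
    for event in events:
        if event.action != "move":
            continue
        tags = set(tags_by_path.get(event.source, set()))
        is_sensitive = bool(tags & sensitive_tags)
        if is_sensitive and not event.target.startswith(PROTECTED_PREFIXES):
            alerts.append(event)
    return alerts
```

In this sketch, a move of a file tagged `PII` from `/data/secure/` to a scratch folder would be returned as an alert, while the same move of a file tagged `PUBLIC` would not.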

SPEAKERS

Balaji Ganesan
Co-Founder and CEO
Privacera
Balaji Ganesan is a co-founder and CEO of Privacera, a leading data security startup and Hortonworks partner. Balaji leads all functions at Privacera, steering the company in its vision to help enterprises manage the security and compliance risks that come with data. He previously led security and governance at Hortonworks, where he led the work to make Hadoop enterprise ready. Balaji joined Hortonworks through the acquisition of his previous startup, XA Secure. The XA Secure product was eventually open sourced and became Apache Ranger.
Alberto Romero
Big Data Architect
G-Research
Alberto has been the Big Data Architect at G-Research since the beginning of 2017. G-Research is a leading FinTech company that uses scientific techniques, big data and world-class technology to predict future movements in financial markets. Before G-Research, Alberto joined Hortonworks as a Professional Services Engineer in 2014 and helped kick-start the big data lakes at HSBC and Lloyds Banking Group in the UK.