Le Nguyen The Dat bio photo

Le Nguyen The Dat

Data Science and Engineering at Uber. Kaggle Master.

Email Twitter Facebook LinkedIn Github

About me

  • I am a Data Science enthusiast who enjoys building end-to-end data products (data infrastructure, recommendation systems, and predictive models)

  • I am also particularly interested in security and fraud prevention (from social engineering attack, consumer facing products, and online payment)

  • I mainly use: Python, SQL, Shell Script, Haskell, Git, Tableau, Docker, and Amazon Web Services

  • Feel free to drop me a message if you want to talk data or security :)


Open Source Projects

  1. Raptor An Online Retail Recommendation Engine developed in Haskell

  2. Minimal Data Science Source code for machine learning experiments and analyses used in my blog series “Minimal Data Science”

  3. AWS Redshift to RDS A tool to replicate tables from Amazon Redshift to (RDS) PostgreSQL databases

  4. Postgresql User Manager A simple command-line tool for managing User Privileges in PostgreSQL databases

  5. Open Source contributions: redsift, scikit-neuralnetwork, redash, kaggle-ensemble-guide, and a few more


  1. Software defined network based adaptive routing for data replication in Data Centers IEEE International Conference on Networks, ICON 2013, Singapore. December 11, 2013

  2. Virtual machine placement with two-path traffic routing for reduced congestion in data center networks Computer Communications 53. November 1, 2014