Big Data expert with solid experience in data mining, data warehousing, design and development of distributed, scalable and low latency systems for processing huge amount of data for Ads and Data, analytics platforms and enterprise applications.
Technical Skills: Hadoop (Java MapReduce, PIG, Oozie, HIVE, Sqoop, HBase), Druid, Spark, Storm, Kafka, NOSQL (MongoDB), IR Techniques, Elastic Search, Restful Webservices, J2EE, Java, C++, Perl, Python, PHP, JavaScript, MySQL and Oracle
Sr. Software Engineer @ a) Very low latency, near real time (stream data) and highly scalable data mining solution for big data analytics using Hadoop, PIG, Oozie, DRUID, REST, STORM, Kafka
b) Machine learning pipelines on HDFS with Oozie, PIG and SPARK.
c) Analytical Data Warehouse development for very huge Ads Data for targeting purpose.
d) Approximation algorithm implementation for set operations on very large set of users using Sketch algorithms providing very low latency unique user count in Ad Manager UI.
e) High throughput traffic protection solution for AdServers for Ad-Fraud detection.
f) High throughput classification system for inventory publishers for tagging and scoring.
g) Batch pipelines on grid for model generation for Traffic Fraud detection.
h) Data pipelines for scalable data aggregation from different RDBMS in different colos to HDFS using Sqoop.
i) Solution for processing and exchanging targeting data (Test and Control) with third parties for lift analysis.
j) Supply Forecasting data pipeline for supply prediction for premium impressions for yahoo premium properties.
h) Ad inventory historical trend analytic system using MongoDB, REST, Ember
i) Ad Search system for indexing ads metadata and searching with Elastic Search.
Keywords: Hadoop (Java MapReduce, PIG, Oozie, HIVE, Sqoop, HBase), Druid, Spark, Storm, Kafka, NOSQL (MongoDB), HBASE, IR Techniques, Elastic Search, Restful Webservices, Java, C++, Perl, Python, JavaScript, MySQL and Oracle From July 2010 to Present (5 years 6 months) Sr. Software Engineer @ a) Providing payment solutions to the Blackhawk partners for payment integration network for gift card transactions.
b) High throughput transaction engine/switch in Java, JPOS.
c) Providing Web service interface to Web based POS using Java REST services.
d) Partner integrations on Acquiring Switch
Keywords: Java, J2EE, JPOS, WebSevices, DB2, JBoss, Hibernate, Struts From May 2007 to July 2010 (3 years 3 months) Software Engineer @ Enterprise Application for Partner Management for Symantec that generated $800M as revenue.
Keywords: Java, J2EE, Oracle, Weblogic, JSP, Struts, Hibernate From January 2006 to February 2007 (1 year 2 months) Software Engineer @ Media product development "Ingest to broadcast workflow management".
Keywords: Java (JDK 1.5), J2EE, JSP, JUnit, Maven, Spring, JDO, SQL Server 2000. From April 2005 to December 2005 (9 months) Engineering Intern @ Worked on JFP (Java Financial Platform), an Enterprise Banking Application for Citibank to replace the legacy system for 27 countries.
Keyword: Java, J2EE, Weblogic, Oracle From January 2004 to March 2005 (1 year 3 months)
MS, Computer Science @ University of Southern California From 2003 to 2004 Bachelor of Science (BS), Computer Science @ Delhi University From 1997 to 2001 Sumit Gupta is skilled in: Java Enterprise Edition, Agile Methodologies, Distributed Systems, Web Services, Hibernate, Tomcat, Hadoop, Servlets, Perl, Struts, C++, Spring, Ant, Oracle, REST