Engineering leader with more than 9 years of work experience mostly in the field of search, analytics, server-side backend, e-commerce and at times with full stack, front end as well. Hands on with wide range of open source technologies viz. Hadoop, Spark, Kafka, Hive, HBase etc
Interested in working with me on building next-generation data processing and insight generation systems, solving some of the most interesting data engineering and data science problems at SoundHound? I am hiring talented software engineers for our Data Engineering team: http://www.soundhound.com/careers
Engineering Manager - Data Engineering and Data Science @ Engineering lead for all the data engineering and data science initiatives across the company (SoundHound, Hound and Houndify platform). Broadly, my team focuses on real time traffic analysis, reports and insight generation, personalization, billing and analytics for all Houndify developer platform users which powers houndify.com website, improve accuracy for incoming Hound queries, Ad targeting and campaigns etc.
We are hiring! From August 2015 to Present (5 months) San Francisco Bay AreaSenior Software Engineer @ Lead the design and development responsibilities in following areas:
Big Data Analytics:
- Built a new ETL platform to analyze and process live traffic data from Hound and SoundHound users, in near real time, using the best of the open-source big data technologies viz. Spark (and Streaming), Kafka, HBase, Avro, Parquet etc.
- Generated various KPI Metrics and using Tableau for Data visualization.
Hound - Unprecedented Speech Recognition + Natural Language Understanding app
- Built the Query Analyzer to analyze queries on hound, extract insights and improve accuracy
- Hound GIS Module to generate coordinates details, geohash estimation for all the road intersections to serve Hound's geo queries, for eg. "show me all restaurants near US 101 and Lawrence Expy" or "how far am from Lombard St and Hyde St".
- Backend processing and a webservice to power live hound traffic projection on globe From August 2014 to August 2015 (1 year 1 month) Senior Software Engineer @ Big Data Discovery & Analytics - https://cloud.oracle.com/bigdatadiscovery
Responsibilities: Design and Development
- built the big data enrichment product from scratch, with a goal to provide text enrichment capability on unstructured data sitting in HDFS. Data is enriched and some valuable insights are added before indexing kicks in. This improves the search and discovery experience on Endeca.
- multiple plugins viz. text cleanup, key phrase extractor, sentiment analysis, wikification, language detection, geocode extractor, named entity extractor etc. are created to enrich the input data, which runs through enrichment engine framework on Apache Spark.
Technologies: Spark, Hadoop Map Reduce, Pig, Hive, Oozie, Avro, Groovy
Build Tools: Gradle, Ant, Maven
Other Projects:
- Secure Enterprise Search (Server Technologies Division)
- Search Engine as a Service in OPC (Oracle Public Cloud) with Oracle's Endeca backend
- Agile based Development - REST APIs, XML, JSON
Additional Responsibilities: Scrum Master
- Manage Sprint using Agile tool: Jira
- Write user stories, sprint planning, retrospective
- Product roadmap From January 2011 to August 2014 (3 years 8 months) San Francisco Bay AreaSoftware Engineer II (Search) @ Search Backend, Server Side Development, Spider Management Framework, Analytics, end-to-end Search Performance Optimization, Indexing (Lucene/Hadoop), Cache Systems, Scaling Applications, Distributed Systems. From December 2008 to January 2011 (2 years 2 months) Research Assistant @ did my Masters (2007-2008)
•Worked as a Research Assistant on: Developing Next Generation Systems for Intensive Information sharing under high pressure situations.
•Co-Authored a peer reviewed chapter in the book "Cyber Security and Information Assurance" published by IGI publishers (Singh et al, 2008) From 2007 to 2008 (1 year) Software Engineer @ •Functioned as key member of team in design and development of middleware known as COS (common object server) based on IBM’s Service Provider Delivery Environment for Sprint Communications USA. Wrote Stateless EJBs and deployed on Websphere Application Server.
•Analyzed business requirements & reviewed high level design documents. From 2005 to 2007 (2 years)
Credit Course, Spring Semester '13, Global Entrepreneurial Marketing, A- @ Stanford University From 2013 to 2013 Masters, Management Information Systems, 3.9 @ State University of New York at Buffalo - School of Management From 2007 to 2008 B.Tech (Hons), Computer Science and Engineering @ National Institute of Technology Durgapur From 2001 to 2005 Pranav Singh is skilled in: Data Analytics, Apache Spark, Big Data, Hadoop, Java, NLP, Natural Language Understanding, Cloud Computing, REST, Web Services, J2EE Application Development, E-commerce, Oracle, Tableau, Enterprise Search
Websites:
http://www.pranavs.com