Software Engineer at Airtable
San Francisco, California
Software Engineer @ Cloudera
I joined Cloudera as approximately employee #300. This was an incredible experience at a growth-stage company. By the time I left, the company had about 3000 people.

HDFS (2012-2017)
HDFS is the most successful open-source distributed storage system. I joined as one of three engineers developing HDFS at Cloudera, and became the tech lead for a team of ten.

Some project highlights:
* Led development of erasure coding, a new method of storing data that reduces storage costs by more than 50%. This was a multi-year project that involved novel research ideas and coordinating 20+ developers across multiple companies and multiple timezones. This is the biggest and most complex effort we've ever shipped in HDFS.
* Release manager for Apache Hadoop 3.0, the biggest release of Hadoop ever. This was spurred by our desire to ship erasure coding, and required coordinating with multiple feature owners and downstream ecosystem dependencies across more than a year of alpha and beta releases. It also required substantial improvements to our release tools, process, and testing along the way. Hadoop 3.0 formed the basis for Cloudera 6, Cloudera's first new major release in five years.
* Became an Apache Hadoop committer, PMC member, and Apache Member due to my open-source contributions. Mentored multiple coworkers and community contributors into becoming committers and PMC members themselves.
* Presented at conferences including Strata Hadoop World, Hadoop Summit, ApacheCon, Apache Big Data, and USENIX ATC.

Altus (2017-2019)
Altus was Cloudera's Platform-as-a-Service offering, making it easy to run machine learning and data engineering workloads on the cloud.
* Led development of a cloud-native machine learning platform built on Kubernetes.
* Made improvements to Altus core services, observability, and build/test/deploy infrastructure.
From December 2012 to January 2019 (6 years 2 months) San Francisco, California

Graduate Student @ University of California, Berkeley
PhD track graduate student in distributed systems, advised by Professor Ion Stoica. Published at top-tier conferences about in-memory caching for distributed filesystems and providing SLOs for heterogeneous storage workloads.
From August 2010 to December 2012 (2 years 5 months) Berkeley, CA

Platform Intern @ Cloudera
From May 2012 to August 2012 (4 months) San Francisco

NREIP Intern @ SPAWAR
From June 2010 to August 2010 (3 months)

Undergraduate Researcher @ University of Virginia Department of Computer Science
From May 2009 to May 2010 (1 year 1 month)

Software Engineer @ Airtable
San Francisco, California, United States

Software Engineer @ Scale AI
Scale provides high-quality training data for AI applications. I was the first senior hire in engineering, joining during the Series B.

As a member of the Platform team, I was responsible for maintaining the core pieces of infrastructure and code abstractions that power the rest of the platform, as well as our observability, build/test/release, and data warehousing infrastructure.

Projects I worked on:
* Monitoring and observability. I rolled out structured logging within the company and maintained our ELK cluster. Set up alerts and dashboards for our production systems and defined per-team SLIs and SLOs. Led migration of our monitoring from New Relic to Datadog.
* Initiated use of Terraform for managing our cloud infrastructure. This allowed us to audit and version infrastructure changes and easily manage additional production and test environments.
* Developed an automated daily release process and canary environment for testing with production traffic. This greatly reduced the number of site incidents, improved developer efficiency, and sped up testing feedback loops.
* Led migration of our data warehouse from AWS Athena to Snowflake. This involved evaluating different data warehousing platforms via POC and coordinating the migration of our query workload, which meant educating and communicating with dozens of SQL users at the company.
From February 2019 to May 2020 (1 year 4 months) San Francisco
Cloudera
Software Engineer
December 2012 to January 2019
San Francisco, California
University of California, Berkeley
Graduate Student
August 2010 to December 2012
Berkeley, CA
Cloudera
Platform Intern
May 2012 to August 2012
San Francisco
SPAWAR
NREIP Intern
June 2010 to August 2010
University of Virginia Department of Computer Science
Undergraduate Researcher
May 2009 to May 2010
Airtable
Software Engineer
San Francisco, California, United States
Scale AI
Software Engineer
February 2019 to May 2020
San Francisco
What company does Andrew Wang work for?
Andrew Wang works for Airtable.
What is Andrew Wang's role at Airtable?
Andrew Wang is a Software Engineer.
What industry does Andrew Wang work in?
Andrew Wang works in the Computer Software industry.
Who are Andrew Wang's colleagues?
Andrew Wang's colleagues are Derek Dahmer, Chloe Ho, Andrew Liu, Michael Woo, Alexandr Wang, Charles Watts, Atra Kermani, Steven Salka, Nathan Hayflick, and Yuri Maruyama.