I learned UNIX from OpenBSD man pages between classes at university and haven't looked back since. Contracting in Calgary taught me to build the bits and pieces, moving to the Bay Area taught me to put them together. I've become more engineering oriented over time but ops is too much fun to give up completely. :D I've
I learned UNIX from OpenBSD man pages between classes at university and haven't looked back since. Contracting in Calgary taught me to build the bits and pieces, moving to the Bay Area taught me to put them together. I've become more engineering oriented over time but ops is too much fun to give up completely. :D I've seen failure modes for systems of all sizes, put out the fires, understood the problem, automated the fixes, figured out how to detect the problem and how to make sure the next version will work better.
Senior Site Reliability Engineer @ Promoted to Senior Site Reliability Engineer in the Q1 2015 review cycle.
● Traffic Engineering team, working on traffic management infrastructure/tooling/services.
● Release engineering for the primary Twitter web front end services.
● SRE for the next gen mobile web service.
● SRE for the event tracking front end service.
● SRE for an externally marketed analytics product.
● SRE for the primary API front end service.
● SRE for the Twitter front end load balancer application.
● Embedded SREs are responsible for capacity planning, configuring alerts and dashboards, service configuration, release engineering, reliability engineering, fire drills, and responding to incidents.
● Tool for report generation of from historical configuration values.
● Tool for automated capacity and performance testing.
● Tool for manual control of load balancing between instances of a service.
● Tool for integration testing of the front end load balancer application.
● Standalone tool to manipulate load to instances of a service.
● Service to bridge arbitrary data sources to internal monitoring/visualization tools.
● JVM services
● load balancing From January 2013 to Present (3 years) San Francisco Bay AreaOperations Engineer @ Site Reliability Operations
● monitoring the health of primary services
● triaging problems, fixing where possible and escalating where necessary
● managing traffic across multiple datacenters
● contributing improvements to tooling and automation, including MPS and FBAR
● load balancing
● cluster automation
1 - https://www.facebook.com/notes/facebook-engineering/under-the-hood-mysql-pool-scanner-mps/10151750529723920
2 - https://www.facebook.com/note.php?note_id=10150275248698920 From October 2011 to January 2013 (1 year 4 months) Developer/System Administrator @ Sysadmin and network admin for the Imaging Informatics lab at the University of Calgary. Maintained and improved the internal dicom library and worked with students on their projects.
● dicom image format, dcmtk
● MacOS/Objective-C/Cocoa development From March 2010 to September 2011 (1 year 7 months) System/Network Administrator @ Sysadmin and network admin contracting. Built some awesome things and pulled some heroic disaster recoveries.
● Monitoring (nagios) of internal and customer services.
● Configuring services and networks.
● Linux, OpenBSD
● DNS From March 2006 to December 2009 (3 years 10 months)
Looking for a different
Get an email address for anyone on LinkedIn with the ContactOut Chrome extension