Experience
March 2025 - Present
Senior DevOps Engineer
BuildTime
Sole DevOps and Infrastructure Engineer responsible for evolving BuildTime into a scalable, highly available SaaS platform serving signatory contractors' mission-critical timekeeping, payroll, and workforce management operations.
- Rearchitected entire infrastructure from single-VM deployment to highly available, multi-AZ SaaS platform - migrated from MongoDB 3 to replicated cluster, upgraded Node.js 16→18, implemented atomic deployments with PM2, and automated multi-environment deployments via GitHub Actions with Terraform-managed GCP load balancers and Cloudflare proxy
- Reduced critical clock-in/out latency from 15 seconds to sub-1-second by implementing full-stack observability (centralized logging, Node.js runtime metrics, distributed tracing, frontend RUM/session capture) and using CPU/event-loop profiling to eliminate performance bottlenecks, establishing SLOs for business-critical operations
- Established comprehensive CI/CD pipeline including automated API tests, ephemeral full-stack QA environments, unit testing, and continuous deployment to dev/production environments
- Developed production-parity local development environment with Docker Compose stack and custom CLI tooling enabling developers to manage local services, tunnel to remote databases, retrieve logs, and streamline development workflows
- Pioneered AI-driven engineering practices across the organization - leveraged AI to accelerate infrastructure modernization, standardized AI-assisted bug reporting workflows company-wide, and upleveled engineering team's AI capabilities to improve productivity and code quality
September 2023 - February 2025
Senior Site Reliability Engineer
Workday
Site Reliability Engineer focused on developer improvements, observability, and cloud infrastructure.
- Researched, proposed, and implemented Apache's DevLake tool to help teams better understand their development practices and identify areas for improvement
- Introduced and integrated security tools (git-secrets, SonarQube) into the development pipeline, enabling early identification and resolution of security vulnerabilities, thereby reducing risk and improving code quality
- Led the migration of containerized services from CentOS 7 to CentOS 9, enhancing system security, performance, and compatibility with modern applications
March 2022 - September 2023
Senior Site Reliability Engineer
Cypress.io
Software and Cloud Infrastructure Engineer focused on site reliability, observability, and cloud infrastructure for our SaaS product.
- Instrumented our services running in Heroku with Prometheus metrics and Grafana Agent 'sidecars' to enable better observability and alerting
- Deployed and managed AWS infrastructure(RDS, ECS, Redshift, etc), Github configuration and Grafana configuration with Terraform
- Improved our processes and tooling to speed up the overall release process and reduce developer toil
- Migrated multiple services from Heroku to ECS to reduce operation spend and improve observability
April 2015 - March 2022
Senior Site Reliability Engineer
Mailchimp | Intuit
Designed and implemented a variety of tooling to improve deliverability, monitoring, performance, and efficiency.
- Transitioned Mailchimp's manual deployment process which deployed large releases every 5 weeks to a continuous deployment and delivery pattern that deploys up to 150 times a day
- Improved developer and database engineer experience around database migrations and scheduled jobs
- Implemented observability and auto-remediation for our application's load-balancers(nginx) and http servers(apache)
- Added site-wide external monitoring for our most high-level endpoints across our entire infrastructure
- Established capacity and utilization metrics to better understand our infrastructure utilization is at any given time and better estimate need for additional infrastructure to support growth
- Planned and led a major initiative to upgrade hundreds of our critical servers to latest OS and hardware with 0 downtime
- Created logging pipeline to centralize our orphaned AWS logs into our existing ELK stack which streamlined the developer and support experience
- Led a team of core site reliability engineers to handle cross-cutting engineering projects to enable the organization to more easily identify and approach hurdles preventing service migration to GCP
June 2013 - April 2015
Systems Engineer
Tropo, Inc.
Responsible for production infrastructure in AWS EC2, configuration management of that infrastructure, as well as building out tooling used by support and engineering teams to easily handle customer inquiries.
- Developed tooling (Ruby + Sinatra) which enabled support and product engineering teams to easily administer customer accounts
- Managed application infrastructure in AWS EC2
- Automated configuration management processes with Chef cookbooks
June 2011 - June 2013
Systems Engineer
Hewlett-Packard
Responsible for automation of server configuration and build-out for HP internal IT projects.
- Built and configured physical and virtual Linux and HP-UX servers for HP internal IT projects
- Developed and managed build automation tools and configuration management tools
Education
2007 - 2011
Bachelor's Degree
The University of Alabama
Management Information Systems / Computer Science