Cloud Infrastructure Engineer
About the company
Braintrust is building the modern platform for evaluating and deploying AI systems. Our mission is to help enterprises build trust in their AI by making it easy to test, monitor, and improve models using real-world evaluation frameworks. We work with cutting-edge customers in finance, healthcare, and tech who are building production-grade AI systems.
About the role
We’re looking for a Cloud Infrastructure Engineer to help us build reliable, scalable infrastructure and give developers a world-class platform to ship code with speed and confidence. You’ll lead efforts across Terraform, Kubernetes, CI/CD, observability, and support, and play a key role in how we scale Braintrust both internally and for customers self-hosting our platform.
This is a high-impact role where you’ll contribute across our internal AWS environment and help customers deploy our stack in AWS, Azure, and GCP.
What you’ll do
Build and maintain Terraform modules for both internal infrastructure and customer deployments
Work directly with customers in Slack to support self-hosting and troubleshoot infrastructure issues. Build tools to make it easier for them to support themselves.
Own and improve our CI/CD pipeline: reduce build times, improve failure visibility, and enable safer, faster releases
Centralize and scale observability - including logs, metrics, dashboards, and alerts
Partner with engineering teams to build and evolve a secure, developer-friendly infrastructure platform
Support multi-cloud deployment patterns (AWS primarily, with Azure and GCP support for enterprise customers)
Implement tools and automation to improve deployment, rollback, and infrastructure reliability
Ideal candidate credentials
5+ years of experience in DevOps, SRE, or Infrastructure Engineering roles
Deep experience with Terraform and at least one major cloud provider (AWS strongly preferred)
Strong Kubernetes skills: deploying, debugging, and scaling real workloads
Proficient in scripting or programming (Python, Typescript, or Go)
Experience supporting production systems and responding to incidents
Comfortable working directly with customers in a support or deployment context
Bonus: experience with multi-cloud environments or self-hosted enterprise software
Benefits include
Medical, dental, and vision insurance
401k plan
Daily lunch, snacks, and beverages
Flexible time off
Competitive salary and equity
AI Stipend
Equal opportunity
Braintrust is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.