Job Title: Senior Software Operations Engineer
Location: Downtown San Francisco, CA (On-site)
RocketRide.ai is a new Series A startup based in downtown San Francisco, on a mission to make building agentic AI easy, accessible, and approachable for all—powered by open standards and driven by community. We do this by hosting our open‑source solution on an innovative, compute‑optimized cloud built for the AI pipelines that power AI-first applications, with advanced management, observability, and enterprise‑grade capabilities out of the box. We’re small, sharp, and intensely product‑driven: the people who build the platform are the ones talking to users, shaping the roadmap, and seeing the impact of their work in real time.
About the Role
We’re on the hunt for a hands‑on Senior Software Operations Engineer to help scale and shape the future of our agentic‑AI SaaS platform. This isn’t a back‑seat role: you’ll be designing, building, and tuning high‑performance systems from bare metal up, while leading a small, sharp team that keeps our services fast, reliable, and cost‑effective. You’ll thrive here if you love complex distributed systems, care deeply about performance per dollar, and want your decisions to directly influence how we grow the business, not in quarters, but in weeks.
What You’ll Do
- Build and run scalable SaaS infrastructure from the ground up, across bare metal, cloud, and our own compute-optimized environment for AI workloads.
- Recruit, lead, and mentor a high-performing team that keeps our platform humming 24x7 – and isn't afraid to tear things down and rebuild them better.
- Hunt down efficiency gains – whether in performance, reliability, or cost – and turn them into measurable wins that show up on both the dashboards and the P&L.
- Turn strategy into execution by providing data-backed insights that refine our business model, pricing, and capacity planning.
- Collaborate closely with engineering, finance, DevRel, and founders to ensure our systems support how real developers adopt and scale agentic AI.
- Champion strong Security Operations (SecOps) practices so we can move fast without breaking trust, compliance, or uptime.
- Stay hands-on – building, automating, analyzing, and continuously improving the systems that power our open-source-driven ecosystem.
What You Bring
- 8+ years of experience in software operations, SRE, or infrastructure leadership.
- Proven success building and running large-scale SaaS systems from bare metal to production in high-stakes environments.
- Deep expertise in cloud infrastructure (especially AWS) and distributed systems, with an instinct for failure modes and how to design around them.
- A track record of squeezing more performance and value out of every compute cycle – ideally including GPU-heavy or data-intensive workloads.
- Demonstrated use of AI tooling and AI-assisted coding to achieve 10x productivity gains in operations, automation, and debugging.