FIND INTERNSHIPS

Engineering Manager Ii - Observability

Posted on April 4, 2026 by Uber

  • Full Time

Engineering Manager Ii - Observability
About the Team

Observability at Uber has evolved far beyond traditional monitoring. We build a centralized, reliable, and intelligent platform spanning metrics, logging, tracing, and on-call experiences - empowering engineers to operate services confidently at massive scale.

Our team owns end-user-facing observability applications used by 4,000+ engineers globally, enabling them to detect, understand, and resolve reliability issues before they impact customers.

We are building a real-time platform for customer experience observability and analytics at scale. This platform enables engineers to:
  • Detect and respond to customer experience degradations in real time
  • Ensure safe code deployments and fast feature rollouts
  • Leverage actionable insights to continuously improve service quality
Uber runs 5,000+ microservices with hundreds of daily deployments, making observability a critical foundation for reliability across the company.

We are investing in the next generation of observability - bringing automation, intelligent insights, and deep integration into the development lifecycle to reduce operational overhead and improve system reliability.

The Role

We are looking for an Engineering Manager II to lead a team building Uber's next-generation observability experience.

In this role, you will combine technical leadership, execution, and people management to deliver scalable systems that improve reliability and developer productivity. You will own a key area of the observability platform, driving roadmap, architecture, and delivery in a highly cross-functional environment.

What You'll Do

Technical & Product Leadership
  • Lead the design and delivery of systems powering customer experience observability, including monitoring, alerting, and incident response.
  • Build platforms that enable engineers to detect issues early, respond effectively, and improve service quality over time.
  • Drive architecture and technical direction for large-scale distributed systems.
Strategy & Execution
  • Define and execute on a roadmap, aligning team priorities with broader organizational goals.
  • Translate complex reliability and product needs into clear technical plans and deliverables.
  • Drive execution through strong prioritization, delegation, and cross-team alignment.
Incident Detection & Reliability
  • Build and improve systems that reduce time-to-detection (TTD) and time-to-resolution (TTR).
  • Enable scalable alerting and incident workflows, including signal correlation, noise reduction, and actionable alerting.
  • Improve the overall effectiveness of on-call by reducing manual effort and improving signal quality.
Rollout Safety & Monitoring
  • Develop systems that support safe deployments and feature rollouts, using real-time monitoring and guardrails.
  • Enable teams to detect regressions quickly and make data-informed rollout decisions.
Data & Platform Foundations
  • Drive development of reliable, scalable data pipelines and platforms that power observability and analytics use cases.
  • Establish best practices for instrumentation, metrics, and data consistency across services.
People & Team Development
  • Build, mentor, and grow a high-performing team of engineers.
  • Foster a culture of ownership, technical excellence, and strong engineering fundamentals.
  • Empower engineers and tech leads to take end-to-end ownership of projects.
Cross-Functional Leadership
  • Collaborate closely with Engineering, Product, TPM, and partner teams to deliver end-to-end solutions.
  • Communicate priorities, trade-offs, and outcomes clearly to stakeholders and leadership.
Basic Qualifications
  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 10+ years of software engineering experience, including 4+ years managing teams.
  • Experience building and operating large-scale distributed systems in production.
  • Track record of delivering impactful technical solutions at scale.
Preferred Qualifications
  • Experience with observability, reliability engineering, or developer platforms.
  • Experience working with real-time systems, monitoring, or data platforms.
  • Strong ability to drive execution through delegation and cross-team alignment.
  • Experience defining and executing technical strategy across teams.
  • Excellent communication and stakeholder management skills.
Why Join Us
  • Build systems that directly impact reliability and customer experience at Uber scale
  • Solve complex distributed systems challenges in a high-impact domain
  • Lead a team shaping the future of observability and developer experience

Advertised until:
May 4, 2026


Are you Qualified for this Role?


Click Here to Tailor Your Resume to Match this Job


Share with Friends!

Similar Internships


No similar Intern Jobs at the Moment!