We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Senior Site Reliability Engineer - CTJ - Top Secret

Microsoft
United States, Nevada, Reno
6840 Sierra Center Parkway (Show on map)
Sep 05, 2025
OverviewAre you interested in working on cutting-edge cloud security products Would you like to be part of one of the world's most advanced cyber-security solutions and protect millions of computers from thousands of active attack attempts, every month Look no further than the Microsoft Defender engineering team. We are looking for a Senior Site Reliability Engineer who will be building and delivering cloud solutions to meet the scale that few companies in the industry are required to support. Leveraging state-of-the-art technologies, you will be instrumental in delivering holistic protection within highly sensitive and secure government environments. The Microsoft Defender team is responsible for delivering a constantly evolving set of services and solutions to meet the challenging landscape of our ever-evolving attackers. This is a team which provides on-call operational support and improvements to the operational posture of the Microsoft Defender products within US Government clouds. You will operate our production services, and work closely with other engineering teams to ensure services and systems are highly stable, meet performance SLAs, and meet the expectations of internal and external customers and users. TheMicrosoft Defender team is responsible for delivering a constantly evolving set of services and solutions to meet the challenging landscape of our ever-evolving attackers.
ResponsibilitiesEnsure 24x7 Service Reliability Act as a Designated Responsible Individual (DRI) in anon-call rotation, leading incident response and resolution to maintain uptime and performance for Microsoft's most critical services. Support and Automate Deployments Execute and improve manual operations and deployments for our products, while designing automation to scale and streamline those processes across environments. Build Scalable Systems Develop automation for monitoring, alerting, debugging, and deployment to reduce manual effort and accelerate safe, reliable delivery. Drive Compliance and Security Ensure systems meet Microsoft's standards for security, privacy, and accessibility, especially when onboarding new technologies. Lead Post-Incident Learning Conduct postmortems, share insights, and implement solutions that prevent recurrence-fostering a culture of learning and continuous improvement. Collaborate Across Teams Partner with engineering and product teams to align reliability goals with customer needs and deliver seamless user experiences. Stay Ahead Technically Continuously invest in your technical growth to improve system availability, observability, and performance at scale.
Applied = 0

(web-759df7d4f5-mz8pj)