Principal Software Engineer - Azure Storage Client Team
United States, Texas, Irving
7000 State Highway 161
Overview

Microsoft Azure Storage is a highly distributed, massively scalable, and ubiquitously accessible cloud storage system designed to scale out and serve the entire world.

The Azure Storage Client team is responsible for delivering the Storage Virtualization platform to Azure Virtual Machines (VMs). Our stack handles trillions of IOs every day from Azure VMs and runs in hundreds of data centers and 1000+ clusters. The team is enabling storage offload using custom-built hardware accelerators: Field Programmable Gate Arrays (FPGAs), Remote Direct Memory Access (RDMA) for transport, and Non-Volatile Memory Express (NVMe) as the VM's storage interface.

As a Principal Software Engineer, you are responsible for designing, implementing, and deploying software to enable hardware-based storage offload technologies. If you have the interest and the passion to work on new hardware and software solutions to enable performance and scalability at the highest levels, then Azure may have a place for you.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities

- Design and develop system software that manages storage services in Azure.
- Ensure that services and components are designed with appropriate telemetry and health signals to enable remote monitoring and debugging.
- Rapidly deploy cloud-scale solutions and monitor health through telemetry.
- Drive for quality in everything you do and continuously improve the quality of the systems you work on.
- Act as a Designated Responsible Individual (DRI) and guide other engineers by developing and following the playbook, working on call to monitor the system, product, or service for degradation, downtime, or interruptions, alerting stakeholders about status, and initiating actions to restore the system, product, or service for simple and complex problems when appropriate.
- Proactively seek new knowledge and adapt to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
- Embody our culture and values.