New
Senior Software Engineer
Microsoft | |
United States, Texas, Irving | |
7000 State Highway 161 (Show on map) | |
Oct 24, 2025 | |
|
OverviewThe Azure Kubernetes Service (AKS) team is responsible for running Kubernetes at global cloud scale. On AKS, millions of containers are started, healed, and routed to serve production traffic every day. The team delivers essential control-plane and data-plane capabilities, and the work directly impacts reliability, performance, and developer productivity for customers around the world.As a Senior Software Engineer on Azure Kubernetes Service, you will design, build, and operate cloud services that provision, upgrade, secure, and monitor Kubernetes clusters across global infrastructure. This role involves working across distributed systems, networking, storage, and platform automation to deliver resilient customer experiences. It offers opportunities to grow your expertise in large-scale systems, deepen your knowledge of Kubernetes and cloud engineering, and strengthen your skills in Site Reliability Engineering (SRE) practices. Flexible work arrangements are supported, including hybrid and partial remote options.This position is ideal for individuals interested in building scalable, secure, and reliable cloud-native solutions. You will collaborate with a diverse team to solve complex technical challenges and contribute to the evolution of Microsoft Azure's container orchestration capabilities. The work is impactful, fast-paced, and aligned with the needs of developers and enterprises worldwide.Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesCollaborate with product managers, architects, and partner teams to clarify scenarios and user requirements for AKS features and platform investments.Drive design for new or improved AKS components (e.g., cluster lifecycle, upgrades, networking/CNI, storage/CSI, policy, security, observability) including dependency mapping, design docs, and API contracts.Create, implement, optimize, and refactor production code and automation to improve reliability, performance, maintainability, and cost efficiency across control-plane and data-plane services.Leverage subject-matter expertise in Kubernetes and Azure to plan releases, break down work, and lead execution across a workgroup; provide technical mentorship and code reviews.Act as a Designated Responsible Individual (DRI): participate in on-call, follow runbooks/playbooks, monitor for degradation, triage incidents, communicate status, and drive mitigations/RCAs for complex issues.Proactively adopt new patterns and technologies to improve availability, reliability, efficiency, observability, and performance; champion consistency in telemetry, alerting, and operations at scale.Uphold security and compliance best practices (least privilege, secrets management, supply-chain security, vulnerability remediation) across services and CI/CD. | |
Oct 24, 2025