Microsoft logo

Senior Software Engineer - Microsoft

Redmond, United States

Microsoft Azure Artifical Intelligence High Performance Computing team is looking for systems engineers to enable customers in deploying, monitoring, profiling, and debugging their applications on hyperscale cloud infrastructure. Azure is enabling the largest supercomputing deployments to tackle complex computational problems in public cloud, evident from the various High Performance Computing (HPC) Stock Keeping Units (SKU) that have already made the mark on Top500, MLPerf and Graph500 rankings.

We are looking for a Senior Software Engineer who would also bring to the table establishing best practices drive architectural changes and influence roadmap of relevant software and hardware components. Your work will directly impact business goals of a wide range of users and facilitate the next wave of growth and innovation in Artifical Intelligence, and High Performance Computing (HPC) in the cloud in general.

At this supercomputing scale, we need specialized tools and techniques to maintain the reliability, runtime performance, health of the system and running jobs continuing to meet the Service Level Agreements of users. Your job would be to use the state-of-the-art tools and techniques, find operational gaps and instrument features to achieve the smooth operation of cloud-native supercomputers.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. 

We are looking for a Senior Software Engineer who is passionate about quality, wants the customer to succeed and get things done. You will join a phenomenal team of hardworking engineers and researchers with deep experience in high performance computing, machine learning, deep learning, middleware, and software engineering.

  • Design and code solutions that improve the management of remote systems. 
  • Leads by example within the team by producing extensible and maintainable. Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI). Applies metrics to drive the quality and stability of code, as well as appropriate coding patterns and best practices.
  • Provide new features for Microsoft Cloud internal infrastructure software. 
  • Keep infrastructure services running and deliver code updates on a regular cadence to improve performance and reliability. 
  • Maintains communication with key partners across the Microsoft ecosystem of engineers. Acts as a key contact for leadership to ensure alignment with partners' expectations. Considers partner teams across own organization and their end goals for products to drive and achieve desirable user experiences and fitting dynamic needs of partners/customers through product development.

Your mission will be to help ensure Azure platform is consistent on performance, can scale on-demand, and engineered to withstand the unparalleled computing demand from the customer workloads. You will help building a test-driven engineering culture to reduce regressions and bugs in production and will set a higher bar for infrastructure quality.

Required/Minimum Qualifications:

  • Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • 3+ years of experience in Software Design, Development, coding, and troubleshooting in Python or related.
  • 3+ years of experience in Developing and Running Artificial Intelligence/High Performance Computing (HPC) applications on clusters or related.

Preferred Qualifications: 

  • Expertise in PyTorch coding and troubleshooting
  • Expertise in Compute Unified Device Architecture (CUDA) C/C++ coding and troubleshooting
  • Experience in accelerating C++ or Python application in Nvidia or Advanced Micro Devices (AMD) Graphics Processing Unit (GPUs)
  • Experience in creating and maintaining Docker container life cycle
  • Expertise in C/C++ coding and troubleshooting
  • Expertise in running Message Passing Interface (MPI) jobs. Must understand Message Passing Interface (MPI) software lifecycle and runtime behaviours
  • Working experience in Bourne Again Shell (BASH) or other SHELL scripts

Background Check Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: 

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter. 

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $112,000 - $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 - $238,600 per year.

Learn more about base pay ranges and pay equity:

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.


Published on: 3/29/2023

About Microsoft

Microsoft is on a mission to empower every person and every organization on the planet to achieve more. Our culture is centered on embracing a growth mindset, a theme of inspiring excellence, and encouraging teams and leaders to bring their best each day. In doing so, we create life-changing innovations that impact billions of lives around the world. You can help us to achieve our mission.

Jobs in Redmond, United States

Jobs at Microsoft