Senior Software Production Engineer, Infrastructure Software for AI Job at SB Telecom America Corp., Sunnyvale, CA

aGlrVFNWTlBxbnd5cG90SDltM1ZISUlGVnc9PQ==
  • SB Telecom America Corp.
  • Sunnyvale, CA

Job Description

About Softbank : Softbank is making significant investments in infrastructure for AI. Softbank Corp. has recently established a new US center in Silicon Valley, focused on infrastructure software for AI and AI foundations for mobile networks. Our goals are to challenge the norms and create products making use of our SOTA infrastructure (like Nvidia GB200, MGX and DGX Grace & Hopper platforms) and cloud-native software. These products are geared towards centralized AI data centers as well as distributed AI Radio Access Network (AI RAN) data centers. We are looking for experienced practitioners who are inspired to bring innovation and build transformative products.

Minimum Qualifications:

  • Bachelor's degree in Computer Science, Electrical Engineering, or related field.
  • 7+ years in software, hardware, engineering, including platforms and distributed systems.
  • 2 years in lead roles, leading high-impact projects and teams.
  • Experience working in systems & systems SW, Cloud and Kubernetes.
  • Deep experience with production-testing and automation of Kubernetes deployments.

Preferred Qualifications:

  • Master's or PhD in a relevant field.
  • Expertise in building scalable test and automation infrastructure to productionize workloads.
  • Experience with GPU platforms (Nvidia DGX, H100, GB200) and high-performance computing environments.
  • Experience triaging customer bugs, prioritizing, and resolving issues in production.
  • Familiarity with AI developer frameworks, tools, and automation systems.

Role: Be a key member of the infrastructure team responsible for building foundational software on top of GPU systems supporting AI workloads (training, fine-tuning and serving). Own and develop the test-automation infrastructure for Kubernetes and GPU systems. Drive process innovation in end-end systems software testing and automation for productionization velocity. As a Senior Software Production Engineer responsible for the entire test-automation infrastructure, work with Staff Engineers, product management and program management to drive execution towards commercialization.

Responsibilities:

  • Develop and build test-automation infrastructure for Kubernetes on large-scale GPU clusters.
  • Build detailed test plans for different milestones and operationalize them in test-automation infrastructure.
  • Build and own automation of the end-end system, scale and stress testing.
  • Working together with SW leads and Technical Program Manager, qualify the releases for milestones.
  • Attract and help build downstream production engineering talent.
  • Role model and foster a culture of humility and innovation for product delivery.

Salary: The base salary for this position ranges from ($150,000-$250,000), with additional attractive biannual bonus, benefits and opportunity to work with a great team in downtown Sunnyvale, CA.

Job Tags

Similar Jobs

Goodcents

Goodcents Crew Member - 4920 W Thunderbird Road Suite 102 Job at Goodcents

 ...Cheap meals, free cookies and competitive wages! Even better, no late nights to cut into your social life! We're looking to hireCrew Membersat our Goodcents location4920 W Thunderbird Rd Ste 102in Glendale, AZ . You can earn up to $12.15 an hour plus tips! This opportunity... 

MatSu Health Foundation

Communications and Media Specialist Job at MatSu Health Foundation

 ...Communications and Media Specialist FLSA Classification: Non-Exempt Reports to: Director of Public Relations Hourly Range: $29.00 - $34...  ...th. JOB DESCRIPTION About the Foundation Mat-Su Health Foundation (MSHF) is the official business name of Valley... 

Cenetene Corporation

Senior Member Engagement & Communications Specialist Job at Cenetene Corporation

 ...everything for our 28 million members. Centene is transforming the health of our communities, one person at a time. As a diversified,...  ...**Position Purpose:** Oversee specific Member Engagement & Communications initiatives to improve member engagement and retention. Oversee... 

DDM Construction Corporation

CDL A Truck Driver Job at DDM Construction Corporation

 ...Superintendent, the CDL Driver is responsible for the operation of equipment including, but not limited to tandems, triaxles, end dumps and/or mixer trucks). ESSENTIAL ROLES AND JOB FUNCTIONS: 1. Performs pre-tip and post-trip inspections in accordance with Department of... 

VDart Inc

Data QA Tester Job at VDart Inc

 ...Title: Data QA Tester Location: NYC, NY (Hybrid) Type: Contract Job Duties: Perform end-to-end testing of web-based...  ....g., JIRA). Conduct UI testing to ensure a seamless user experience across multiple browsers and devices. Collaborate with...