Distributed Systems Engineer Job at Magic, San Francisco, CA

RTBOTFRBb3BPUlpFMklQS2JyWTQzbW5mdGc9PQ==
  • Magic
  • San Francisco, CA

Job Description

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role:

As a distributed systems engineer, you will build the data and coordination systems that enable ultra-long context inference and training on Magic’s GPU clusters. 

What you might work on: 

  • High-performance storage and caching systems to support long-context inference and training

  • Hacking on the internals of deep learning frameworks in the distributed setting

  • Automating fault detection and recovery systems to enable highly available training

  • Troubleshooting complex issues across GPUs, network, storage, OS, and cloud environments.

What we’re looking for: 

  • Deep knowledge of distributed systems design and public cloud platforms

  • Experience designing and operating highly available, high-throughput data systems

  • Experience with the internals of distributed DBMS, batch and stream processing systems, and/or distributed file systems

  • Exceptional problem-solving skills up and down the stack

Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.

Our culture:

  • Integrity. Words and actions should be aligned

  • Hands-on. At Magic, everyone is building 

  • Teamwork. We move as one team, not N individuals

  • Focus. Safely deploy AGI. Everything else is noise

  • Quality. Magic should feel like magic

Compensation, benefits and perks (US):

  • Annual salary range: $100K - $550K

  • Equity is a significant part of total compensation, in addition to salary

  • 401(k) plan with 6% salary matching

  • Generous health, dental and vision insurance for you and your dependents

  • Unlimited paid time off

  • Visa sponsorship and relocation stipend to bring you to SF, if possible

  • A small, fast-paced, highly focused team

Job Tags

Remote job, Relocation

Similar Jobs

Worldwide Flight Services

Passenger Services Agent Job at Worldwide Flight Services

 ...in your career and join Worldwide Flight Services. WFS employs over 22,200 of the finest...  ...capability that includes Aviation cargo, Airline passenger, and Aviation ramp services. Our team of...  ...us?Job SummaryAs a Passenger Services Agent at the STT Airport in St. Thomas, Virgin... 

Flutter International plc

Junior Back-end Developer Job at Flutter International plc

 ...Overview of the Role We are looking for a Senior Back-end Developer to join our team. The ideal candidate will work primarily on.NET...  ...our mission. What youll do: Develop server-side web services using .NET language (C#, .NET 8 or higher); Developing... 

MaineHealth

Real Estate Manager Job at MaineHealth

 ...MaineHealth Corporate Professional - Nonclinical Req #: 74254 Summary The Real Estate Manager role is responsible for the day-to-day tracking and management of the MaineHealth real estate portfolio in Maine and New Hampshire. This includes all database management... 

Hatch Global Search

Perm - Remote - Hospital Inpatient Coder OOJ - 35267 Job at Hatch Global Search

 ...Coder, you will be responsible for accurately coding inpatient medical records, ensuring compliance with coding guidelines, and...  ...healthcare professionals to maintain data integrity, all while working from home. Perm - Remote - Hospital Inpatient Coder Essential... 

Deloitte

Software Product Architect Job at Deloitte

 ...products. Your expertise will be pivotal in delivering solutions that delight customers and users, while also driving tangible value for Deloitte's business investments. You will leverage your extensive engineering and AI/ML craftsmanship and advanced proficiency across...