About
We're the San Francisco Compute Company, and we're building the first real-time trading platform for compute resources. Over the next decade, we anticipate thousands of startups and labs will be training and serving large models. These organizations need substantial computing power, and we're creating a platform where that compute can be traded efficiently. Our success will enable companies to scale to tens of thousands of accelerators for hours at a time without building their own infrastructure. This breakthrough will dramatically expand access to large model training, making the most transformative technology of our era available to a far wider range of organizations.
The Role
As a distributed systems software engineer, you’ll be working on our in-house resource orchestration system. This system coordinates state and access to hundreds (soon thousands) of GPU compute nodes in multi-tenant clusters spanning across multiple data centers. Some responsibilities of the role include:
Design of distributed system architectures that enable high availability fault tolerant state management
Deployment automation and performance optimization of virtual machines running on bare metal that utilize GPU passthrough
Design and deployment of multi-tier high performance network attached storage systems
About You
You have built fault tolerant distributed systems before that can manage hardware resources at scale
You enjoy creating self-correcting systems that contribute to hardware health and reliability
You have experience with Linux virtualization (Cloud Hypervisor, QEMU, libvirt, virtiofs, sr-iov, PCIe passthrough)
You appreciate and value good documentation
Some Nice to Haves
Experience with Rust (our VM orchestrator is written in Rust)
Experience with etcd
Experience with high performance storage systems (WEKA, VAST, Ceph, etc.)
Benefits
Unlimited office book budget
You can buy as many books for the office as you want. You’re encouraged to spend time during the workday reading!
Generous equity grant
Team members are offered a competitive salary along with equity in the company
Retirement matching
We match 401(k) plans up to 4%
Medical, dental & vision
We offer competitive medical, dental, vision insurance for employees and dependents and cover 100% of premiums
Time off
We offer unlimited paid time off as well as 10+ observed holidays
Parental leave
We offer biological, adoptive, and foster parents paid time off to spend quality time with family
Daily lunch
We cover lunch daily for employees
Visa Sponsorships
Yes, we sponsor visas and work permits
The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment.
We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethical origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law.
We welcome the opportunity to consider qualified applicants with prior arrest or conviction records. Our commitment to diversity includes hiring talented individuals regardless of their criminal history, in accordance with local, state, and federal laws, including San Francisco’s Fair Chance Ordinance and California’s ban-the-box laws.
If you require reasonable accommodation for any reason, please reach out to us at team@sfcompute.com.