On the experimentation platform team, we are building an industry-leading platform and tools that allow our developers and other teams to run and automatically analyze experiments at scale. As the Principal Engineer on the team, you will lead the evolution and scaling of the experimentation platform. With roughly 80 million daily users (and growing!) and hundreds of experiments every day, the platform already serves millions of queries per second (QPS).
We are looking for an experienced engineer who is passionate about designing and building scalable systems for both online serving and offline analysis, and about bringing the benefits of experimentation to our creators and users alike. Our team's product will act as the interface for defining experiments, calculating and tracking metrics, and finally visualizing experiment results in order to ship a successful variant to creators and users. Scale and ease of use are of the utmost importance as Roblox continues to develop its world-class experimentation data platform.
You Will:
- Help scale our online experiment serving flow by 10x.
- Design and simplify the experiment creation and analysis flows for the next generation of internal and external customers, leveraging AI when necessary.
- Design a scalable serving model that supports novel experimentation strategies (such as crossovers and multi-armed bandits) and our Data Science partners.
- Collaborate with Data Engineering and Data Platform teams to build our experimentation data processing workflows.
- Partner with our customers across Roblox and enable experiments that target audiences beyond our users, such as creators.
- Help build a reliable serving flow and a scalable data platform to support experimental analysis, gradual feature rollouts, and human-centric reporting.
You Have:
- 10+ years of backend engineering experience.
- Proven programming skills in a compiled language such as C#, Go, or C++, and knowledge of SQL.
- Experience designing and building backend platform systems that can handle over 1M QPS.
- Experience with big-data processing technologies such as Spark, Apache Druid, and Flink, and a data-driven focus on quality metrics and monitoring.
- Knowledge of REST, API design patterns, and scalable containerized systems and microservices.
- BS in Computer Science or a related field.
Nice to Have:
- Experience building experimentation platforms that serve millions of users, preferably in multi-sided marketplaces.
- Experience building and tuning Spark workflows at TB scale.
- Experience leveraging AI tooling to build simplified, human-centric experiences for internal customers.