About Luma AI
Luma's mission is to build multimodal AGI. Through our research on video, 3D, and now multimodal models at Luma, we believe that AI needs to be jointly trained over all signal modalities - text, video, audio, images - analogous to the human brain.
To advance our mission, we build and operate the full stack end-to-end, spanning foundation models, inference systems, and products. This integrated approach powers technologies like Ray3, which is seeing rapidly growing adoption among Fortune 500 companies across media, entertainment, and advertising. Backed by a recent $900M Series C and our partnership with Humain to build a 2 GW compute supercluster (Project Halo), our models and the Dream Machine platform are now enabling creatives worldwide to tell some of the most impactful stories of our time.
Where You Come In
This is a rare opportunity to work at the absolute frontier of creative AI, building the next generation of interactive voice agents. You will join a foundational team responsible for developing the core models that allow humans to converse with AI in real-time with unprecedented realism and expressiveness. Your work will bridge the gap between deep research and magical, shipped products that millions of users will interact with.
What You'll Do
This opportunity involves both the "science" and "engineering" parts of research.
This is a multi-stack opportunity where you will work on the intersection of modeling, data, systems, and evaluation.