Mistral AI

GPU programming Expert (San Francisco)

Mistral AI San Francisco, CA

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco.

The role will involve

  • Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity
  • Rethinking various part of the generative model architecture to make them more suitable for efficient inference-Integrating low-level efficient code in a high-level MLOps framework


The successful candidate will have

  • High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. High expertise on the distributed computation infrastructure of current generation GPU clusters
  • Overall understanding of the field of generative AI, knowledge or interest in fine-tuning and using language models for applications


About Mistral AI

We're a small team, composed of seasoned researchers and engineers in the AI field. We like to work hard and be at the edge of science. We are creative, low-ego, team-spirited, and have been passionate about AI for years. We hire people that foster in competitive environments, because they find them more fun to work in. We hire passionate women and men from all over the world.

Developers are using our API via la Plateforme to build incredible AI-first applications powered by our models that can understand and generate natural language text and code. We are multilingual at our core. More recently, we released le Chat, as a demonstrator of our models.
  • Seniority level

    Not Applicable
  • Employment type

    Full-time
  • Job function

    Other
  • Industries

    Technology, Information and Internet

Referrals increase your chances of interviewing at Mistral AI by 2x

See who you know

Get notified about new Programming jobs in San Francisco, CA.

Sign in to create job alert

Similar jobs

People also viewed

Similar Searches

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More