Nebius AI

Senior Technical Product Manager (ML/AI)

Amsterdam, Netherlands · United States

23h ago

Job Description

About Nebius

Nebius is a Nasdaq-listed tech company that aims to become one of the world’s leading AI infrastructure providers. Headquartered in Amsterdam, we have R&D and commercial hubs across the US, Europe and Israel. 

We build full-stack AI infrastructure to service the explosive growth of the global AI industry, including large-scale GPU clusters, cloud platforms, and tools and services for developers. Our 500 employees include around 400 highly skilled engineers with a proven track record of developing world-class hardware and software solutions across cloud and AI/ML.

We are rapidly growing our infrastructure network with an ambitious investment program to build out data centers and colocations in the US and Europe. A Reference Platform NVIDIA Cloud Partner, Nebius’ AI-native cloud platform provides high-end infrastructure and tools for training, fine-tuning and inference.  

Nebius is growing fast, and we’re always looking for the best talent to join our company. Along with highly competitive compensation and extensive opportunities for professional development, we offer a dynamic work environment where innovation, creativity and teamwork are highly prized and open up exciting new opportunities. As an equal opportunity employer, we are committed to fostering a diverse and inclusive workplace, where all applicants are given fair consideration and every team member is empowered to contribute to their fullest potential. 

The role

We are seeking a Senior Technical Product Manager, ML/AI Lifecycle Services, to join our team. In this role, you will oversee the planning and prioritization of services across the ML/AI lifecycle, including data preparation, training, fine-tuning, experiments, monitoring and inference. You will deliver products for leading AI companies that use thousands of GPUs within a single cluster of cutting-edge hardware. We also provide room for creativity, empowering you to take the initiative and build what you think is best.
 
 
Responsibilities:
  • Be a center of ML/AI expertise for both dev and business teams.
  • Own the backlog of 1–3 AI/ML products.
  • Define the technical requirements that your products place on the IaaS and PaaS teams.
  • Introduce and promote products to the market in collaboration with cross-functional teams.
  • Create materials and onboarding guides for the Solution Architect and Sales teams.
  • Act as an internal customer for the Marketplace and Solution Architect teams as they build E2E scenarios using our products.
 
 
Requirements:
  • We expect the candidate to be the best user of the product they manage, so technical expertise is mandatory.
  • Have solid experience as an ML Engineer/MLOps Engineer/AI Engineer in one or more of the following domains:
      • Distributed training across at least dozens of hosts using Slurm, Ray Cluster, MosaicML
      • Organizing ML infrastructure according to MLOps best practices with tools like MLflow, W&B, MosaicML, Kubeflow, Apache Airflow, ClearML, AzureML, SageMaker, VertexAI
      • Maintaining and optimizing a large inference cluster with KServe, vLLM, Triton, RunAI, Seldon
      • Using data preparation tools like Databricks and Apache Spark
      • Building a product on top of LLMs that leverages techniques such as RAG, fine-tuning, and function calling, with an understanding of continuous quality evaluation
  • Product management experience is not required, but willingness to learn is essential.
 
Ideal Candidate:
  • You have experience as an ML engineer, specializing in developing large generative AI models. You are now eager to shift your focus toward creating tools and instruments that enhance the efficiency of such teams.
  • You have worked as an MLOps, Solution Architect or DevOps engineer, providing infrastructure for ML teams and delving deeply into ML specifics. You are keen to share your expertise through product development and know how to build MLOps on top of serverless GPU services such as Modal, Cerebrium and Google Cloud Run.
  • You have a background as an ML engineer and transitioned to product management, with a proven track record of delivering complex products for tech customers.

What we offer 

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Hybrid working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

We’re growing and expanding our products every day. If you’re up to the challenge and are as excited about AI and ML as we are, join us!