Nebius AI
Solutions Architect - Cloud and MLOps solutions
🌎USA, Canada

Job Description

Remote

About Nebius

Launched in November 2023, the Nebius platform provides high-end infrastructure and tools for training, fine-tuning and inference. Based in Europe with a global footprint, we aspire to become the leading AI cloud for AI practitioners around the world.

Nebius is built around the talents of some 400 highly skilled engineers with a proven track record in developing sophisticated cloud and ML solutions and designing cutting-edge hardware. This allows all the layers of the Nebius cloud – from hardware to UI – to be built in-house, differentiating Nebius from the majority of specialized clouds. As a result, Nebius customers get a true hyperscaler-cloud experience tailored for AI practitioners.

As an NVIDIA preferred cloud service provider, Nebius offers the latest NVIDIA GPUs, including H100 and L40S, with H200 and Blackwell chips coming soon.

Nebius owns a data center in Finland, built from the ground up by the company’s R&D team. We are expanding our infrastructure and plan to add new colocation data centers in Europe and North America this year, and to build several greenfield data centers in the near future.

Our Finnish data center is home to ISEG, the most powerful commercially available supercomputer in Europe and the 19th most powerful globally (Top 500 list, June 2024). It also epitomizes our commitment to sustainability, with energy efficiency levels significantly above the global average and an innovative system that recovers waste heat to warm 2,000 residential buildings in the nearby town of Mäntsälä.

Nebius is headquartered in Amsterdam, Netherlands, with R&D and commercial hubs across North America, Europe and Israel.

The role

We are seeking a highly skilled and customer-focused professional to join our team as a Solutions Architect specializing in Cloud and MLOps. As a Solutions Architect, you will play a pivotal role in designing and implementing cutting-edge solutions for our clients, leveraging cloud technologies for ML/AI teams and becoming a trusted technical advisor for building their pipelines.

You’re welcome to work remotely from the US or Canada.

Your responsibilities will include: 

  • Act as a trusted advisor to our clients, providing technical expertise and guidance throughout the engagement. Conduct PoCs, workshops, presentations, and training sessions to educate clients on GPU cloud technologies and best practices.
  • Collaborate with clients to understand their business requirements and develop solution architectures that align with their needs: design and document Infrastructure as Code solutions, and produce documentation and technical how-tos in collaboration with support engineers and technical writers.
  • Help customers optimize pipeline performance and scalability to ensure efficient utilization of cloud resources and services powered by Nebius AI.
  • Act as a single point of expertise on customer scenarios for the product, technical support, and marketing teams.

We expect you to have: 

  • 5+ years of experience as a cloud solutions architect, system/network engineer, developer or a similar technical role with a focus on cloud computing
  • Strong hands-on experience with IaC and configuration management tools (preferably Terraform/Ansible) and Kubernetes, plus Python coding skills
  • Solid understanding of GPU computing practices for ML training and inference workloads, and of GPU software stack components, including drivers and libraries (e.g. CUDA, OpenCL)
  • Excellent communication skills
  • Customer-centric mindset
  • Fluent English

It will be an added bonus if you have: 

  • Hands-on experience with HPC/ML orchestration frameworks (e.g. Slurm, Kubeflow)
  • Hands-on experience with deep learning frameworks (e.g. TensorFlow, PyTorch)
  • Solid understanding of the cloud ML tools landscape from industry leaders (NVIDIA, AWS, Azure, Google)

Key Employee Benefits:

  • Health Insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
  • 401(k) Plan: Up to 4% company match with immediate vesting.
  • Parental Leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
  • Remote Work Reimbursement: Up to $85/month for mobile and internet.
  • Disability & Life Insurance: Company-paid short-term, long-term, and life insurance coverage.

Compensation

We offer competitive salaries, along with equity options based on your experience, skills, and location.

Join Nebius Today!

We’re growing and expanding our products every day. If you’re up to the challenge and are as excited about AI and ML as we are, join us!