Minimum qualifications:
- Bachelor's degree in Computer Science, Mathematics, a related technical field, or equivalent practical experience.
- 10 years of experience with cloud infrastructure.
- 5 years of experience in a technical role focused on AI infrastructure or related areas.
- Experience building and operationalizing machine learning models.
- Experience with GPU programming (CUDA, OpenCL) and optimization techniques.
Preferred qualifications:
- Experience with high-performance computing (HPC) environments and contributions to open-source projects related to AI or infrastructure.
- Experience training and fine-tuning large models (i.e., image, language, segmentation, recommendation, genomics) with accelerators.
- Experience with performance profiling tools (i.e., Tensorflow profiler, PyTorch profiler, Tensorboard).
- Experience designing/architecting large-scale infrastructure farms for specialist AI use cases.
- Experience with running MLPerf benchmarks, distributed training and optimizing performance versus costs.
- Excellent communication and presentation skills.
The Google Cloud Platform team helps customers transform and build what's next for their business — all with technology built in the cloud. Our products are developed for security, reliability and scalability, running the full stack from infrastructure to applications to devices and hardware.
In this role, you will be helping our customers, developers, small and large businesses, educational institutions and government agencies, see the benefits of our technology come to life. You will be understanding the needs of our customers and helping shape the future of businesses of all sizes use technology to connect with customers, employees and partners.
You will be the technical expert and trusted advisor for our customers, helping them design, deploy, and optimize AI solutions using cutting-edge hardware and software. Your focus will be on Graphics Processing Units (GPUs), accelerators (including Field-programmable Gate Array (FPGA) and Application-Specific Integrated Circuit (ASIC)), and Google Tensor Processing Units (TPUs). You will work closely with sales, product management, and engineering to ensure our customers achieve maximum value from their AI investments. You will be responsible for scaling and helping accelerate GCP AI Infrastructure business growth.
Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
- Serve as a trusted advisor to customers, helping them in understanding and incorporate AI accelerators into the overall cloud strategy by recommending migration paths, integration strategies, and application architecture that incorporate Google Cloud AI optimized infrastructure.
- Demonstrate how Google Cloud is differentiated, highlighting the power of accelerators by working with customers on proof-of-concepts, demonstrating features, optimizing model performance, profiling, and bench-marking.
- Build repeatable assets to enable other customers and internal teams.
- Influence Google Cloud strategy at the intersection of infrastructure and AI/ML by advocating for enterprise customer requirements.
- Lead business and Workload acceleration on AI Infrastructure products and solutions for GCP.