AI/ML Cloud Infrastructure – Remote (US/Canada) – Up to $320K OT
Our client is a cloud technology company driving the next generation of AI infrastructure. They empowerorganizations to build and scale AI/ML solutions without the need for massive in-house teams or heavy upfrontinfrastructure costs. Their global team of engineers and researchers works at the forefront of GPU cloudcomputing, supporting businesses across industries to solve complex, real-world problems.
The company operates with a flat structure, minimal bureaucracy, and a strong focus on speed, ownership,and technical excellence. Engineers work directly with customers to design scalable solutions and influenceproduct direction, creating real impact in how modern AI applications are built and deployed
They are seeking an experienced Cloud Solutions Architect with deep expertise in cloud infrastructure andMLOps. In this role, you will act as a trusted technical advisor for customers, designing and implementingscalable AI/ML solutions powered by modern GPU cloud environments.
You will help clients optimize theirpipelines, run proof-of-concepts, and provide guidance on best practices, while also collaborating internally toshape product and customer success strategies.This position is open to candidates working remotely from the US or Canada
• Act as a trusted advisor to customers, conducting workshops, presentations, and training on GPUcloud technologies
• Translate business requirements into scalable, cloud-native solution architectures
• Design and document infrastructure-as-code (IaC) deployments and technical guides in collaborationwith support engineers and technical writers
• Optimize customer ML pipelines for performance, scalability, and cost efficiency
• Serve as the key point of expertise on customer use cases for product, engineering, and marketingteams
• 5–10+ years of experience in cloud computing roles (solutions architect, systems engineer, developer,etc.)
• Strong hands-on experience with IaC (Terraform/Ansible), Kubernetes, and Python
• Solid understanding of GPU computing for ML training/inference and GPU software stacks (CUDA,OpenCL)
• Excellent communication and presentation skills
• Strong customer-facing and problem-solving mindset
Bonus Points For:
• Experience with HPC/ML orchestration frameworks (Slurm, Kubeflow)
• Hands-on experience with deep learning frameworks (TensorFlow, PyTorch)• Familiarity with leading cloud ML ecosystems (AWS, Azure, Google, NVIDIA)