Doghouse

Cloud Solution Architect (US/Canada)

  • Kubernetes
  • Cloud
  • Python

AI/ML Cloud Infrastructure – Remote (US/Canada) – Up to $320K OT

The company

Our client is a cloud technology company driving the next generation of AI infrastructure. They empowerorganizations to build and scale AI/ML solutions without the need for massive in-house teams or heavy upfrontinfrastructure costs. Their global team of engineers and researchers works at the forefront of GPU cloudcomputing, supporting businesses across industries to solve complex, real-world problems.
The company operates with a flat structure, minimal bureaucracy, and a strong focus on speed, ownership,and technical excellence. Engineers work directly with customers to design scalable solutions and influenceproduct direction, creating real impact in how modern AI applications are built and deployed

The role

They are seeking an experienced Cloud Solutions Architect with deep expertise in cloud infrastructure andMLOps. In this role, you will act as a trusted technical advisor for customers, designing and implementingscalable AI/ML solutions powered by modern GPU cloud environments.

You will help clients optimize th
eirpipelines, run proof-of-concepts, and provide guidance on best practices, while also collaborating internally toshape product and customer success strategies.This position is open to candidates working remotely from the US or Canada

Responsibilities

Act as a trusted advisor to customers, conducting workshops, presentations, and training on GPUcloud technologies
Translate business requirements into scalable, cloud-native solution architectures
Design and document infrastructure-as-code (IaC) deployments and technical guides in collaborationwith support engineers and technical writers
Optimize customer ML pipelines for performance, scalability, and cost efficiency
Serve as the key point of expertise on customer use cases for product, engineering, and marketingteams

Requirements

 • 510+ years of experience in cloud computing roles (solutions architect, systems engineer, developer,etc.)
Strong hands-on experience with IaC (Terraform/Ansible), Kubernetes, and Python
Solid understanding of GPU computing for ML training/inference and GPU software stacks (CUDA,OpenCL)
Excellent communication and presentation skills
Strong customer-facing and problem-solving mindset

Bonus Points For:
Experience with HPC/ML orchestration frameworks (Slurm, Kubeflow)
Hands-on experience with deep learning frameworks (TensorFlow, PyTorch) Familiarity with leading cloud ML ecosystems (AWS, Azure, Google, NVIDIA)

Benefits
• ompensation: $215K$320 OTE (Base+Bonus+RSUs) -> base usually up to 215K with someexceptions. Higher expectations can always be discussed and are considered
Benefits: 100% employer-paid medical, dental, and vision coverage for employees and families
Retirement: 401(k) with up to 4% employer match (immediate vesting)
• Leave: 20 weeks paid parental leave (primary caregivers), 12 weeks (secondary)