AI Infrastructure Reliability Engineer @ Coinbase

Hi, I'm Mohsin Khan

AI Infrastructure Reliability Engineer

I build resilient, scalable cloud platforms and reliable infrastructure for GenAI workloads — with Kubernetes, AWS and Terraform. Site Reliability Engineer based in Southfield, Michigan.

Mohsin Khan Shaik
AI Infra @ Coinbase
SRE · GenAI
Kubernetes · AWS · Terraform · GenAI Infrastructure · Site Reliability · Docker · CI/CD · Observability · Linux · Python
3+ Years in Infrastructure
About Me

Reliable infrastructure for cloud & GenAI

I'm an AI Infrastructure and Site Reliability Engineer with 3+ years of experience designing and operating large-scale distributed systems, cloud-native platforms and Kubernetes-based AI/ML infrastructure. At Coinbase I work on AI infrastructure reliability; previously, at CirrusLabs, I built GenAI and cloud-optimization infrastructure.

Day to day I work with Kubernetes, AWS and Terraform — automating infrastructure as code and designing systems that stay up under pressure. I hold a Master's in Data Analytics and a Bachelor's in Information Technology, and I'm the founder of Hamuzair.

  • Cloud & multi-region reliability
  • Infrastructure as Code
  • Kubernetes orchestration
  • Observability & automation
3+ Years Experience
6+ Cloud & Infra Tools
2 Degrees Earned
Skills & Experience

What I work with

My core infrastructure toolkit and the path that got me here.

Kubernetes: 92%
AWS Cloud: 90%
Terraform & IaC: 88%
GenAI Infrastructure: 85%
Site Reliability Engineering: 90%
Linux · Docker · CI/CD: 87%
Jan 2025 — Present

AI Infrastructure Reliability Engineer

Coinbase · Full-time

Feb 2022 — Dec 2023

AI Infrastructure Engineer — GenAI & Cloud Optimization

CirrusLabs · Full-time

Ongoing

Founder

Hamuzair

Jan 2024 — Aug 2025

Master's Degree, Data Analytics

Indiana Wesleyan University

Jun 2018 — Jan 2023

Bachelor's Degree, Information Technology

Shadan College of Engineering and Technology

What I Do

My Services

Cloud Infrastructure

Designing and operating scalable, secure cloud platforms on AWS — built for performance across multiple regions.

Site Reliability Engineering

Observability, incident response and automation that keep production systems fast, stable and highly available.

Kubernetes & Containers

Container orchestration, autoscaling and zero-downtime deployments with Kubernetes for resilient workloads.

GenAI Infrastructure

Reliable, optimized infrastructure for GenAI and ML workloads — from inference pipelines to cost optimization.

What I Build

Infrastructure Projects

Representative work across cloud reliability, GenAI infrastructure and automation.

Cloud & SRE

Enterprise AI Infra Monitoring & Automation

Cloud-native observability for Kubernetes with centralized logging and proactive alerting. Automated provisioning cut configuration effort by 45%, and auto-remediation reduced incident response time by 38%.

AWS · Kubernetes · Terraform · Grafana · Python
Cloud & SRE

Multi-Region Kubernetes Platform

Production-grade EKS clusters with autoscaling and zero-downtime rollouts; reduced provisioning time by 35% and improved deployment reliability.

Kubernetes · AWS EKS · Helm · Docker
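
For readers unfamiliar with Kubernetes autoscaling, here is a rough sketch of the scaling rule documented for the Horizontal Pod Autoscaler. The project summary does not say which autoscaler these clusters use, so treating it as the HPA is an assumption, and the function name and sample numbers are hypothetical.

```python
import math


def desired_replicas(current_replicas, current_metric, target_metric, tolerance=0.1):
    """Replica count per the documented HPA rule:
    ceil(current_replicas * current_metric / target_metric).

    No change is made when the ratio is within `tolerance` of 1.0
    (0.1 is the documented default tolerance).
    """
    ratio = current_metric / target_metric
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    return math.ceil(current_replicas * ratio)


# Hypothetical example: 4 pods averaging 90% CPU against a 60% target.
print(desired_replicas(4, 90, 60))
```

The real controller also applies min/max replica bounds and stabilization windows, which this sketch leaves out.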
GenAI & Data

GenAI-Powered Infrastructure Optimization

GenAI-driven system that detects idle resources and optimizes compute, improving efficiency by 25% with real-time CloudWatch monitoring and automated dashboards.

AWS · Python · Terraform · Lambda
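
The summary above does not describe how idle resources are detected. As an illustration only, one simple heuristic is to flag instances whose average CPU utilization stays under a threshold. The sketch below assumes utilization samples have already been fetched (for example from CloudWatch); the function name, the 5% threshold, the minimum sample count and the instance IDs are all hypothetical.

```python
from statistics import mean


def find_idle_instances(cpu_samples, threshold_pct=5.0, min_samples=3):
    """Return sorted IDs whose mean CPU utilization is below threshold_pct.

    cpu_samples maps an instance ID to a list of CPU utilization
    percentages. Instances with fewer than min_samples datapoints are
    skipped rather than judged on too little data.
    """
    idle = []
    for instance_id, samples in cpu_samples.items():
        if len(samples) < min_samples:
            continue
        if mean(samples) < threshold_pct:
            idle.append(instance_id)
    return sorted(idle)


# Hypothetical utilization data, in percent.
samples = {
    "i-web-1": [42.0, 38.5, 51.2],
    "i-batch-7": [1.2, 0.8, 2.1],
    "i-new-3": [0.5],  # too few samples to classify
}
print(find_idle_instances(samples))
```

A production system would typically look at more signals than CPU alone, such as network and disk activity, before acting on an instance.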
GenAI & Data

Model Deployment & MLOps Pipeline

Automated model-serving and deployment pipelines for AI/ML workloads, with rollout automation, monitoring and feature-store concepts.

Kubernetes · Python · Model Serving · MLOps
Automation & IaC

End-to-End CI/CD Pipeline

Secure, automated CI/CD that cut manual deployment effort by 45%, with DevSecOps compliance checks built directly into the pipeline.

GitHub Actions · Python · Bash · IaC
Automation & IaC

Terraform IaC Framework

Reusable Infrastructure-as-Code templates for provisioning multi-account AWS infrastructure, reducing provisioning time by 50%.

Terraform · CloudFormation · AWS
Kind Words

Testimonials

"It's great getting connected with you. The website you created helped a lot to boost up my profile."

Muzzamil Hussain
Electronics & Electrical Engineer

"We increased the functionality of our website dramatically while cutting costs. It's far easier to use and maintain — we couldn't be happier."

Musharraf Ali
Cloud Engineer

"Great experience. Helped build a professional, reliable setup and always happy to assist. Would highly recommend — 10/10."

Sheraz Nazeer
Data Engineer
Get In Touch

Let's work together

Have a project, role or idea in mind? Drop me a message.

Location
Southfield, Michigan, United States