DigitalOcean is looking for a Staff Forward Deployed Engineer to drive transformational AI adoption in the fast-growing AI/ML space. The role involves building tools, migration scripts, and AI starter kits that accelerate 'time-to-inference' in production at scale, and serving as a critical feedback loop that informs the product roadmap.
Requirements
- Significant experience in the AI/ML lifecycle, specifically hosting large language or multimodal models using inference engines like vLLM, SGLang, or Modular
- Expert proficiency in Kubernetes (K8s) and the design of distributed systems, including microservices, messaging systems, databases, and Infrastructure as Code
- Strong production coding skills in Python or Go with the ability to build high-quality tools, automation, and internal assets
- Proven ability to benchmark AI infrastructure and perform GPU utilization tuning to optimize customer ROI and workload performance
- Experience with distributed inference serving frameworks, GPU-level optimization, and interconnect technologies like NVLink, XGMI, or RoCE to maximize hardware efficiency
Benefits
- Competitive salary range
- Bonus potential
- Equity compensation
- Flexible time off policy
- Employee Assistance Program
- Local employee meetups
- Reimbursement for relevant conferences, training, and education
- Access to LinkedIn Learning's 10,000+ courses