We are seeking a highly skilled Senior Software Engineer specializing in Machine Learning, with a focus on Natural Language Processing (NLP), to join our dynamic team. The ideal candidate will play a pivotal role in understanding business objectives and leveraging machine learning models to meet these goals effectively.
Requirements
- Design, deploy, and maintain scalable and secure infrastructure supporting AI and ML workloads.
- Build and maintain AWS cloud environments for compute (EC2, ECS/EKS, Lambda), storage (S3, EFS, FSx), and networking (VPC, Transit Gateway, PrivateLink, Route 53, load balancers).
- Implement security best practices using IAM, KMS, Secrets Manager, GuardDuty, and Security Hub.
- Support and optimize AI/ML workloads across AWS services (SageMaker, Bedrock, Batch, Step Functions).
- Develop and maintain Infrastructure as Code (IaC) using Terraform, AWS CDK, and CloudFormation.
- Manage containerized workloads and orchestration platforms (Docker, EKS, Fargate), including GPU scheduling and scaling.
- Set up and maintain monitoring and observability frameworks using CloudWatch and OpenTelemetry.
- Build and manage CI/CD pipelines (CircleCI, GitHub Actions, GitLab CI) for infrastructure automation and ML/Gen AI deployments.
- Collaborate with ML and Generative AI teams to scale models, optimize performance, and design efficient prompt or inference pipelines.
- Develop runbooks and SOPs for AI service deployment, troubleshooting, and performance optimization.
- Ensure security, compliance, and data protection across AI datasets and environments.
Benefits
- Generous Paid Time Off
- 401k Matching
- Tuition Reimbursement
- Relocation Assistance