Responsibilities:
- Design and implement complex, secure cloud solutions aligned with enterprise and functional architecture; document technical decisions.
- Lead development of AWS-based applications using Lambda, API Gateway, and ECS; prototype scalable APIs focused on integration, security, reliability, and observability.
- Build and automate infrastructure as code (IaC) with CloudFormation, AWS CDK, Terraform, and Ansible for reliable, repeatable deployments.
- Design, implement, and manage CI/CD pipelines (Jenkins, GitHub Actions, CodePipeline) with automated testing, security scanning, and policy checks.
- Implement DevOps best practices using AWS CloudFormation, GitHub, and Docker; codify environments and promotion workflows.
- Deploy and operate containerized and serverless workloads (EKS/ECS/Fargate, Lambda/Step Functions, EventBridge); choose optimal compute strategies.
- Integrate AI tools (Amazon Q, Claude, Datadog AI, AWS AI/ML) to automate operations and improve monitoring, predictive scaling, and incident response.
- Develop and maintain AI-driven runbooks and workflows for anomaly detection, automated remediation, and continuous improvement.
- Monitor, troubleshoot, and optimize performance, reliability, and cost using CloudWatch, Datadog, Prometheus, Grafana, ELK, and AI observability platforms.
- Implement internal process improvements via automation, data-delivery optimization, and infrastructure redesign.
- Enforce security and compliance controls (IAM, KMS, Secrets Manager, security groups, encryption in transit and at rest) aligned with GxP, HIPAA, GDPR, and 21 CFR Part 11.
- Mentor engineers; collaborate with cross-disciplinary teams; champion the AWS Well-Architected Framework; guide incident and problem management and cost governance.