Chennai, Tamil Nadu
DevOps Specialist #1050466
Job Description:
- GKE Platform Management at Scale: Manage and optimize large-scale GKE environments in a multi-cloud and hybrid-cloud context, ensuring the platform is highly available, scalable, and secure.
- CI/CD Pipeline Development: Build and maintain CI/CD pipelines using tools like GitHub Actions to automate deployment workflows across the GKE platform.
- Ensure smooth integration and delivery of services throughout their lifecycle.
- Enterprise GKE Management: Leverage advanced GKE features such as ACM (Anthos Config Management) and KCC (Kubernetes Config Connector) to manage GKE clusters efficiently at enterprise scale.
- Workload Identity & Security: Implement workload identity and security best practices to ensure secure access and management of GKE workloads.
- Custom Operators & Controllers: Develop custom operators and controllers for GKE, automating the deployment and management of custom services to enhance the developer experience on the platform.
- Developer Experience Focus: Maintain a developer-first mindset to create an intuitive, reliable, and easy-to-use platform for developers.
- Collaborate with development teams to ensure seamless integration with the GKE platform.
- GKE Deployment Pipelines: Provide guidelines and best practices for GKE deployment pipelines, leveraging tools like Kustomize and Helm to manage and deploy GKE configurations effectively.
- Ensure pipelines are optimized for scalability, security, and repeatability.
- Zero Trust Model: Ensure GKE clusters operate effectively within a Zero Trust security model. Maintain a strong understanding of the principles of Zero Trust security, including identity and access management, network segmentation, and workload authentication.
- Ingress Patterns: Design and manage multi-cluster and multi-regional ingress patterns to ensure seamless traffic management and high availability across geographically distributed Kubernetes clusters.
- Deep Troubleshooting & Support: Provide deep troubleshooting knowledge and support to help developers pinpoint issues across the GKE platform, focusing on debugging complex Kubernetes issues, application failures, and performance bottlenecks.
- Utilize diagnostic tools and debugging techniques to resolve critical platform-related issues.
Skills Required:
- GitHub, DevOps, Terraform, Kubernetes, Google Cloud Platform
Skills Preferred:
- Python
Experience Required:
- 5+ years of hands-on experience with Google Cloud Platform (GCP), including designing, deploying, and managing cloud infrastructure and services.
- 5+ years of experience specifically with Google Kubernetes Engine (GKE), managing large-scale, production-grade clusters in enterprise environments.
Education Required:
- Bachelor's Degree
Additional Information:
- 8+ years of overall experience in cloud platform engineering, infrastructure management, and enterprise-scale operations.
- Experience with deploying, scaling, and maintaining GKE clusters in production environments.
- Hands-on experience with CI/CD practices and automation tools like GitHub Actions.
- Proven track record of building and managing GKE platforms in a fast-paced, dynamic environment.
- Experience developing custom Kubernetes operators and controllers for managing complex workloads.
- Deep Troubleshooting Knowledge: Strong ability to troubleshoot complex platform issues, with expertise in diagnosing problems across the entire GKE stack.