(352) FASTTEK | (352) 327-8835
FASTTEK GLOBALpowered by Fast Switch - Great Lakes
info@fasttek.com
(352) FASTTEK | (352) 327-8835
Chennai, Tamil Nadu
Platform Engineering Senior Engineer #1048578
Job Description:
  • Employees in this job function are focused on developing and maintaining reusable software components that serve the needs of product developers in the organization.
  • They are responsible for designing, implementing, integrating and maintaining the underlying infrastructure and software applications that support developer productivity and self-service.
 
Key Responsibilities:
  • Collaborate with enterprise architects, software architects, software engineering teams, etc. to design the platform infrastructure and tools encompassing servers, networks, storage, databases, cloud services, etc.
  • Implement and manage the infrastructure that supports the platform tools and ensuring that upgrades, security patches and other performance improvements are regularly performed.
  • Evaluate cloud providers, containerization solutions and other complex technologies to deeply understand the configurations available and create abstractions of common configurations that can be utilized easily by application teams for their workloads
  • Write and execute automated Infrastructure as Code scripts and utilize processes like CI/CD to streamline and automate how the platform infrastructure is provisioned, configured and managed to improve consistency, traceability and repeatability
  • Integrate performance and monitoring best practices including QoS and SLA metrics to scale platform applications and managed services automatically due to demand.
  • Incorporate security and disaster recovery best practices into the infrastructure applications by integrating access control, identity management, logging/monitoring, public/private network configurations, data encryption, storage backup and disaster recovery, etc.
  • Facilitate the integration of enterprise managed software configurations into deployment pipelines managed by application teams to ensure approved configurations and best practices around security, networking, logging & monitoring, performance & scale, etc. are applied
  • Advocating feedback with service providers and developers, to ensure the platform continues to grow and evolve to meet their needs
 
Skills Required:
  • Dynatrace, Linux, Python, Ansible, Ability to communicate and work with cross-functional teams and all levels of management , Storage Capacity Management, Azure, redhat, deployment, GitHub, DevOps, AWS, VMware, TERRAFORM, Communications, Kubernetes, Problem Solving, services, Cloud Infrastructure, CI/CD, GCP, Network Protocols & Standards, Scripting, Technical Troubleshoot, Tekton, Openshift
 
Experience Required:
  • Senior Engineer OSV Exp: Prac. In 2 coding lang. or adv. Prac. in 1 lang.; guides.
  • 10+ years in IT; 8+ years in development
 
Experience Preferred:
Preferred Qualifications:
  • Red Hat Certified Specialist in OpenShift Virtualization or other relevant certifications.
  • Experience with Infrastructure as Code (IaC) tools like Ansible, Terraform, or OpenShift GitOps.
  • Familiarity with software-defined networking (SDN) and software-defined storage (SDS) solutions.
  • Experience with public cloud providers (AWS, Azure, GCP) and hybrid cloud architectures.
  • Knowledge of CI/CD pipelines and DevOps methodologies.
 
Education Required:
  • Bachelor's Degree
 
Education Preferred:
  • Master's Degree
 
Additional Information :
Key Responsibilities: Capacity Management
  • Conduct capacity planning and forecasting for the OpenShift Virtualization platform, including compute, memory, storage, and network resources, to ensure scalability and prevent resource exhaustion.
  • Analyze resource utilization trends and make recommendations for infrastructure scaling, consolidation, or optimization.
  • Collaborate with application teams and stakeholders to understand future demand and project capacity needs.
  • Develop and maintain capacity models and reports to support strategic planning. OSV Automation & Efficiency
  • Develop automation solutions (scripts, playbooks) for repetitive OSV tasks, including configuration changes, VM management (like snapshot removal), auditing, remediation and integration with ticketing systems
  • Leverage automation to enable delivering operator updates and changes efficiently at scale
  • Implement Site Reliability Engineering (SRE) principles and practices to improve overall platform stability, performance, and operational efficiency
  • Role Based Access Control deployment and auditing
  • Namespace and Resource Quota management (CPU, Disk and Storage)