(352) FASTTEK | (352) 327-8835
FASTTEK GLOBALpowered by Fast Switch - Great Lakes
info@fasttek.com
(352) FASTTEK | (352) 327-8835
Chennai, Tamil Nadu
Platform Engineering Senior Engineer #1051185
Job Description:
  • We are the movers of the world and the makers of the future.
  • We get up every day, roll up our sleeves and build a better world -- together.
  • We're all a part of something bigger than ourselves.
  • Are you ready to change the way the world moves? We are the movers of the world and the makers of the future.
  • We get up every day, roll up our sleeves and build a better world -- together.
  • We're all a part of something bigger than ourselves. Are you ready to change the way the world moves? We are seeking a highly skilled and motivated HPC CAE Support Engineer to join our team.
  • You will be responsible for integrating, profiling, supporting and maintaining HPC CAE applications and user-facing tooling, ensuring optimal performance and reliability for our critical Supercomputing HPC Platform offering.
  • This role also has a large focus on CAE applications support, integration, and interacting with consumers of the platform.
  • If you are interested in engaging in every part of the HPC stack ranging from the servers to the product development engineer using them, this position could be a good fit for you.
 
Responsibilities
  • Install, integrate, optimize, and support CAE applications and workloads across in a HPC environment with advanced CPU, GPU and Interconnects technologies.
  • Support CLI tooling and API's that customers consume to streamline access to HPC infrastructure.
  • Troubleshoot and resolve complex technical issues related to Linux systems, networking, storage, and CAE HPC applications.
  • Develop and maintain documentation for software and procedures.
  • Collaborate with software engineers and researchers to ensure seamless integration of HPC resources and scaling of applications.
  • Stay up to date on the latest advancements in HPC and AI/ML technologies and best practices.
 
Qualifications
  • Bachelor's degree in computer science, Engineering, or a related field.
  • 3-5+ years of experience in CAE, Systems or Software engineering Strong familiarity with CAE or scientific computing Strong understanding of Linux operating systems, preferably in an HPC environment Proficiency programming in one or more languages, preferably python, go or bash scripting.
  • Familiarity with how to scale applications and the metrics collection, analysis, and visualization tools used to identify bottlenecks like Prometheus and Grafana.
  • Excellent problem-solving and troubleshooting skills.
  • The ability to define what problems needs to be solved.
  • Strong communication and collaboration skills.
  • Even better, you may have... Experience with containerization technologies like Docker or Kubernetes.
  • Experience with monitoring tools like Prometheus, Icinga, Zabbix, Nagios, or similar.
 
Skills Required:
  • Linux, Linux - Clusters, Technical Troubleshoot, Troubleshooting (Problem Solving)
 
Experience Required:
  • Experience in LinuxOS and proficient in administration of LinuxOS
  • Proficient in Shell Scripting (Primary) and Python (desirable)
  • Experience and familiarity in containerization technologies like Docker or Kubernetes.
 
Experience Preferred:
  • Experience and familiarity in monitoring tools e.g., Prometheus, Zabbix, Nagios, or similar.
  • Familiarity with CAE, scientific and computational apps
 
Education Required:
  • Bachelor's Degree
 
Additional Information:
  • Responsibilities Install, integrate, optimize, and support CAE applications and workloads across in a HPC environment with advanced CPU, GPU and Interconnects technologies. Support CLI tooling and API's that customers consume to streamline access to HPC infrastructure.
  • Troubleshoot and resolve complex technical issues related to Linux systems, networking, storage, and CAE HPC applications.
  • Develop and maintain documentation for software and procedures.
  • Collaborate with software engineers and researchers to ensure seamless integration of HPC resources and scaling of applications.
  • Stay up to date on the latest advancements in HPC and AI/ML technologies and best practices.