The role is responsible for supporting application deployment in cloud computing environment; systems engineering and application deployment in Container and Kubernetes environment, as well as optimize and enhance high performance computing business capabilities for R&D applications and workflows.
Responsibilities
- Installation, upgrading, and configuration of Linux-based high-performance compute nodes, as well as deploying and managing Docker orchestration, containerization, and Kubernetes environments
- Configuration of systems and networking services for hybrid computing platforms.
- Automate configuration scripting with Shell, Python and Perl.
- Support application environments in virtual computing environments, distributed computing, burst computing, and analytical platforms for scientific data.
- Aligning with enterprise-wide architectures.
- Ensure adherence to change management and regulatory procedures including governing the creation of operational procedures and system qualifications.
Candidate Requirements
- Proven experience as a DevOps Engineer or in a similar role.
- Strong hands-on experience with AWS (Amazon Web Services), including:
- Lambda, SQS, SNS, S3 Buckets
- EKS (Elastic Kubernetes Service)
- IAM (Identity and Access Management)
- DynamoDB
- Experience with MongoDB Atlas.
- Familiarity with other cloud platforms (Azure or GCP) is a plus.
- Proficiency with infrastructure as code tools like Terraform and Ansible.
- Experience with CI/CD tools such as ArgoCD, Jenkins, GitLab CI, or CircleCI.
- Familiarity with containerization tools (Docker, Kubernetes).
- Basic knowledge of Helm and HelmChart.
- Familiarity with networking, security, and monitoring best practices.
- Excellent scripting skills (Bash, Python, etc.).
- Strong problem-solving and troubleshooting skills.
- Experience with version control (Git)
Nice to Have:
- Optional for a few applications: experience with .NET, Node.js, and Angular
Required Education, Skills and Experience:
- Bachelor’s Degree or Master's Degree in Computer Science or equivalent.
- Good communication skills (spoken and written) in English
- Ability to work independently
- Great teamwork skills.
- Seeking a candidate heavy in Linux, HPC, Scheduler and system design.
- Experience with IT asset management and compliance using enterprise CMDBs and change control..
- Experience with Amazon Web Services (EC2, S3, IAM, …) and Infrastructure as a Service (IaaS).
- Experience in server installation and configuration.
- Experience with Shell and Python-based scripting.
- Experience with virtualization technologies.
- Experience with Python.
- Experience with CI/CD
Optional but Highly Appreciated
- Azure AD.
- Google Cloud Platform.
- Virtualization with VMWare ESXi.
- GitLab and GitLab Runner.
- Prometheus and Grafana.