Back to listings
DCV TechnologiesGöteborg

HPC Infrastructure Engineer – Gothenburg, SE or Cambridge, UK (Hybrid)

Project-Based

Description

Dear Consultant, We are seeking an experienced HPC Infrastructure Engineer to support the design, delivery, and operations of high-performance computing platforms for a multinational client. The role involves managing research computing environments, automating infrastructure deployments, and supporting scientific computing users. You'll work closely with DevOps and site reliability practices in a hybrid private cloud ecosystem. Send CV to (inquiries@consultant.dev) if you are interested.

Location: Gothenburg, Sweden

  • Cambridge, UK
  • Engagement: Freelance Contract Onsite Requirement Start Date: ASAP Project Type: Long-term engagement

Key ResponsibilitiesDesign, implement, and maintain secure HPC infrastructure using Infrastructure as Code tools (e.g. Terraform).Support and operate research computing services and custom scientific workloads.Apply Site Reliability Engineering principles to manage deployment, monitoring, and incident response.Troubleshoot and optimize HPC platforms and assist users in leveraging computational services effectively.


Must-Have Skills10+ years in large-scale computing (HPC, HTC, batch computing environments)Strong experience with Slurm, LSF, or Grid EngineDevOps background with Ansible, Salt, PuppetSolid Linux system administration (TCP/IP, filesystems, networking)Hands-on with Terraform, scripting in Bash, and PythonFamiliarity with OpenStack, private cloud, and virtualized environmentsExperience with parallel filesystems: Weka, GPFS, LustreAgile methodologies and end-to-end automation practices


Nice to HaveScientific/engineering degree or computational research experiencePublic cloud exposure (AWS, Azure, GCP)Container technology (Docker, Singularity, LXD, Kubernetes)Experience with Vault, Nomad, or Consul from HashiCorp stackHigh-speed networking (InfiniBand) knowledgeProgramming: Java, C++, Perl, Ruby, SQL (optional)

Skills

AWSC++SQLDevOpsTerraformAgilePerlcplusplusJavaDockerRubyVaultAzureLinuxGCPKubernetesPythonAnsibleBashcpp