Company

United States

Head of Infrastructure

Project-Based

Description

We are working with a new, fully remote market maker on prediction markets. It is funded by a tier 1 operator to build next-generation trading systems and draws on that operator's extensive industry IP and resources. The founding team are industry veterans, and they are building a lean, senior team focused on high-performance systems, pragmatic engineering, and rapid iteration.

They are looking for a Head of Infrastructure to own infrastructure end to end, freeing software engineers to ship features rapidly while ensuring reliability, performance, and security. This is the first dedicated infrastructure leadership hire and a force multiplier for the entire engineering team.

Responsibilities:

  • Design, build, and operate Azure cloud infrastructure supporting low-latency integrations with operator systems and external exchanges.
  • Run Kubernetes (cluster sizing, autoscaling, multi-environment deployments).
  • Stand up and optimize Kafka (event streaming), Redis (caching), and Postgres (OLTP).
  • Implement CI/CD, secrets management, environment isolation, and infrastructure-as-code (Terraform/Bicep).
  • Own observability (Prometheus, Grafana, OpenTelemetry), incident response, SLOs/SLIs, and on-call practices.
  • Design and manage secure cloud networking (VNets, peering, private endpoints, DNS, firewall/NSGs) and connectivity with the operator.
  • Drive security and compliance across identity, segmentation, secrets, and OS baselines in Azure.
  • Lead performance engineering for market data ingestion and order routing.
  • Partner with engineering on service boundaries, data contracts, and platform primitives.
  • Manage cost, capacity, reliability, and availability while scaling the infrastructure function.

Requirements:

  • 8+ years operating production infrastructure at scale.
  • Deep cloud expertise (Azure preferred; AWS/GCP backgrounds welcome).
  • Hands-on production experience with Kubernetes, Kafka, Redis, and Postgres.
  • Strong networking, security, identity (e.g., Microsoft Entra ID, formerly Azure AD), and infrastructure-as-code skills.
  • Proven ability to build reliable, observable platforms with strong CI/CD; experience with Prometheus, Grafana, OpenTelemetry, or similar.
  • Cloud networking fundamentals (VNet design, private endpoints, DNS, firewall rules).
  • Performance tuning for latency- and throughput-sensitive systems.
  • Strong collaboration skills translating product/trading needs into platform capabilities.
  • Active, hands-on use of AI tooling (infra-as-code generation, log analysis, automation, incident triage).

Nice to Have:

  • Experience in trading, sports betting, exchanges, financial markets, or other real-time systems.
  • Event-driven architecture or stream processing experience.
  • SRE leadership or reliability program experience.

Consultant

Jennifer Innes

CEO

Reference number: 1322957U

Company: Betting Jobs

Date posted: 2nd Mar, 2026

Skills

Kafka, Azure, Prometheus, AWS, Terraform, Security, CI/CD, PostgreSQL, Kubernetes, Grafana, GCP, Redis, Event-Driven