Head of Infrastructure
Description
We are working with a new, fully remote market maker on prediction markets, funded by a tier 1 operator to build next-generation trading systems, leveraging huge industry IP & resources from the tier 1 operator. The founding team includes true industry veterans with a vision for the future. They are building a lean, fully remote, senior team focused on high-performance systems, pragmatic engineering, and rapid iteration. They are looking for a Head of Infrastructure to own end-to-end infrastructure, freeing software engineers to ship features rapidly while ensuring reliability, performance, and security. This is the first dedicated infrastructure leadership hire and a force multiplier for the entire engineering team. Responsibilities:
- Design, build, and operate Azure cloud infrastructure supporting low-latency integrations with operator systems and external exchanges.
- Run Kubernetes (cluster sizing, autoscaling, multi-environment deployments).
- Stand up and optimize Kafka (event streaming), Redis (caching), and Postgres (OLTP).
- Implement CI/CD, secrets management, environment isolation, and infrastructure-as-code (Terraform/Bicep).
- Own observability (Prometheus, Grafana, OpenTelemetry), incident response, SLOs/SLIs, and on-call practices.
- Design and manage secure cloud networking (VNets, peering, private endpoints, DNS, firewall/NSGs) and connectivity with the operator.
- Drive security and compliance across identity, segmentation, secrets, and OS baselines in Azure.
- Lead performance engineering for market data ingestion and order routing.
- Partner with engineering on service boundaries, data contracts, and platform primitives.
- Manage cost, capacity, reliability, and availability while scaling the infrastructure function. Requirements:
- 8+ years operating production infrastructure at scale.
- Deep cloud expertise (Azure preferred; AWS/GCP backgrounds welcome).
- Hands-on production experience with Kubernetes, Kafka, Redis, and Postgres.
- Strong networking, security, identity (e.g., Azure AD), and infrastructure-as-code skills.
- Proven ability to build reliable, observable platforms with strong CI/CD; experience with Prometheus, Grafana, OpenTelemetry, or similar.
- Cloud networking fundamentals (VNet design, private endpoints, DNS, firewall rules).
- Performance tuning for latency- and throughput-sensitive systems.
- Strong collaboration skills translating product/trading needs into platform capabilities.
- Active, hands-on use of AI tooling (infra-as-code generation, log analysis, automation, incident triage). Nice to Have:
- Experience in trading, sports betting, exchanges, financial markets, or other real-time systems.
- Event-driven architecture or stream processing experience.
- SRE leadership or reliability program experience.
Consultant
Jennifer Innes
CEOXXX-XXX-XXXX) Reference number: 1322957 Profession:Company: Betting JobsDate posted: 2nd Mar, 2026
Skills
KafkaAzurePrometheusAWSTerraformSecurityCI/CDPostgreSQLKubernetesGrafanaGCPRedisEvent-Driven