Senior Platform Engineer
SaaS • Cloud Services • Consulting
DoiT International is a cloud services company that provides comprehensive solutions for managing and optimizing cloud infrastructure across multiple platforms such as AWS, Google Cloud, and Microsoft Azure. Their services include cloud cost management, workload intelligence, automation, and consulting. DoiT International helps businesses optimize their cloud environments, improve performance, and enhance security through a combination of advanced technology and expert consulting.
201 - 500 employees
Founded 2014
📋 Description
• Function as an individual contributor within the team: collaborate actively with peers through thorough code reviews, provide constructive support and mentorship, and contribute to a unified technical direction for the platform. This role also requires collaborating with individuals in feature teams, supporting them and helping them adopt newly developed platform features.
• Architect, Design, and Implement Infrastructure as Code (IaC): You will treat our infrastructure as a sophisticated software system and be responsible for its full lifecycle management using Terraform. This involves applying software engineering best practices, such as designing reusable code modules, implementing robust unit and integration testing strategies, and ensuring that our infrastructure is provisioned and managed in a predictable and repeatable manner.
• Deploy, Manage, and Optimize Kubernetes Clusters on GCP (GKE) and AWS (EKS): You will take ownership of the deployment, configuration, and ongoing maintenance of our Kubernetes clusters on GCP Google Kubernetes Engine (GKE) and AWS Elastic Kubernetes Service (EKS). This includes managing node groups, configuring network policies, implementing service meshes, handling cluster upgrades, and ensuring high availability and fault tolerance. You will also be responsible for monitoring cluster health, performance, and resource utilization, and for proactively addressing any issues that arise.
• Develop and Refine Internal Software Delivery Systems (CI/CD): You will design, implement, and maintain robust Continuous Integration/Continuous Deployment (CI/CD) software specifically tailored for our platform components. This involves integrating tools like Argo CD or Atlantis, applying advanced programming concepts to automate build and release processes, and ensuring seamless deployment of platform updates. You will also focus on optimizing pipeline performance and reducing deployment times.
• Diagnose, Troubleshoot, and Resolve Platform-Related Issues: You will be the primary point of contact for diagnosing and resolving platform-related issues, including performance bottlenecks, scalability challenges, and security vulnerabilities. This involves using advanced troubleshooting techniques, analyzing logs and metrics, and collaborating with development teams to identify and resolve root causes. You will also contribute to comprehensive incident response plans and post-mortem analyses.
• Drive Automation Initiatives to Streamline Operational Tasks and Enhance System Reliability: You will champion automation initiatives to eliminate manual operational tasks, reduce human error, and improve overall system reliability. This involves developing scripts, tools, and workflows to automate tasks such as infrastructure provisioning, configuration management, and monitoring. You will also proactively identify opportunities for automation and drive continuous improvement in our operational processes.
• Act as a Strategic Partner to Development Teams, Understanding and Addressing Their Infrastructure Needs: You will foster strong relationships with feature teams, treating them as your internal customers. You will actively engage with them to understand their infrastructure and developer experience requirements, provide expert guidance on platform capabilities, and ensure our platform effectively supports their development workflows. You will also translate developer needs into actionable platform