percision services GmbHRemote

2nd Level Service Operations Expert/Site Reliability Engineer (m/w/d) Devops / Kubernetes - Remote

Project-Based

Description

Für unseren Kunden im Energiesektor suchen wir ab Mai erfahrene Unterstützung als T2 Service Operations Expert  (m/w/d). Die Tätigkeit erfolgt remote und vereinzelt vor Ort oder Frankfurt nach Absprache. General DescriptionOperations within the program are responsible for the day-to-day operation tasks/activities of a hybrid data platform, such as a private cloud, as well as public cloud and towards a future KRITIS infrastructure. T2 Service Operations plays a vital and central role in maintaining the availability and performance. The responsibility lies in the management of incidents and service requests efficiently and effectively, often in critical situations. It involves technical coordination across the project with stakeholders such as customer success, platform delivery, software engineering and others. The focus is continuous improvement and ensuring high performance of the services. Management of both event-driven and planned operations activities, ensuring that incidents are swiftly resolved, problems are addressed and their root, and changes are implemented without causing unplanned downtime. Objective: Ensure Stability and Reliability of Hybrid Cloud OperationsTasks:• Monitoring and managing day-to-day operations of hybrid data platforms (private and public cloud, future KRITIS infrastructure).• Handling of incidents and service requests promptly and with high quality—especially in critical situations.• Acting as the link between Tier 1 (T1) Support and Tier 3 (T3) Operations. Objective: Manage and Resolve Operational IncidentsTasks• Identifying and managing major incidents, including leading Incident Response Teams (IRTs).• Coordination of root cause analysis and ensure sustainable problem resolution.• Expedite, coordination, and escalation of critical situations across different product lines and departments. Objective: Drive Continuous Improvement in Service OperationsTasks• Contributing to service monitoring enhancements, automation, and orchestration.• Promoting and delivering of continual service improvements through dedicated plans.• Reducing unplanned downtime by managing changes and implementing preventive measures. Objective: Maintain Service Knowledge and Onboarding ProceduresTasks:• Recording operational knowledge (in KDB) and maintaining up-to-date procedures.• Ensuring efficient onboarding and offboarding of clients to services. Profile Requirements• Experience in an operational role in vital environments with applications or systems designed based on state-of-art solutions (containerized and distributed). Ideally in a role of a T2 Service Operations Manager.• Experience with containerization and container management incl. the tools and methods operating containers.• Experience of ITSM frameworks, especially within following processes: incident management, service request management, change management, event management.• Experience with analysis methods (business analytics, metric analysis, KPI management, SLA management)• Experience with automation, orchestration, scheduling and monitoring.• Experience in large scale on-prem cloud projects and in coordination with different stakeholders• Experience in troubleshooting and problem-solving, with a focus on root cause analysis and sustainable solutions. Must-have language skills• fluent English in speech and writing (at least C1) Preferred experience• Exposure to ITSM tools inside the Enterprise IT context• Good understanding of IT infrastructure (network, SAN, virtualization, hyperscale)• Good understanding of Kubernetes Ort: RemoteDauer: Ende 2026 + Option, initial Start: Mai wir freuen uns auf ihre Bewerbung auf https://www.percision.de/projekt/9266 Sebastian LejaTeamleiter Recruiting percision services GmbH (adesso group)Agrippinawerft 26 (2.Etage)50678 Köln

Skills

DevOpsEvent-DrivenKubernetes

Want AI to find more roles like this?

Upload your CV once. Get matched to relevant assignments automatically.

Try personalized matching