Back to listings
CGIBangalore, KA

Monitoring & Tools L3 Administrator / Engineer (Dynatrace, Splunk, SolarWinds, Nagios)

Description

Monitoring & Tools L3 Administrator / Engineer (Dynatrace, Splunk, SolarWinds, Nagios) Role Overview The Tools L3 Administrator is responsible for advanced management, troubleshooting, and optimization of enterprise monitoring and observability platforms. This includes application performance monitoring (APM), infrastructure monitoring, log analytics, and alerting systems. The role requires deep expertise in tools such as Dynatrace, Splunk, SolarWinds, Nagios, and the ability to resolve complex issues independently while ensuring proactive monitoring and service reliability. Key Responsibilities

  • Provide L3 support for escalated incidents related to monitoring and observability tools.
  • Manage and maintain Dynatrace, Splunk, SolarWinds, Nagios platforms for enterprise environments.
  • Configure and optimize dashboards, alerts, and reports to ensure proactive monitoring.
  • Perform root cause analysis using log analytics and APM tools.
  • Implement performance tuning, capacity planning, and monitoring automation.
  • Integrate monitoring tools with ITSM platforms (ServiceNow, Remedy) for incident workflows.
  • Automate repetitive tasks using PowerShell, Python, or Ansible.
  • Collaborate with infrastructure, application, and security teams to ensure end‑to‑end visibility.
  • Lead critical incident investigations and document resolutions.
  • Maintain knowledge base and documentation for configurations, processes, and troubleshooting guides. Required Skills & Experience
  • 7–12 years of experience in enterprise monitoring and observability with strong L3 expertise.
  • Hands‑on experience with Dynatrace (APM), Splunk (log analytics), SolarWinds (network monitoring), Nagios (infrastructure monitoring).
  • Strong knowledge of monitoring architecture, integrations, and scaling strategies.
  • Expertise in alerting, dashboards, and reporting for proactive incident detection.
  • Experience with cloud monitoring (Azure Monitor, AWS CloudWatch, GCP Operations Suite).
  • Familiarity with DevOps pipelines and CI/CD monitoring integrations.
  • Proficiency in scripting and automation (Python, PowerShell, Ansible).
  • Solid understanding of networking, servers, and application performance metrics.
  • Ability to lead critical incident resolution and mentor junior administrators. Preferred Qualifications
  • Certifications: Splunk Certified Admin/Architect, Dynatrace Professional, SolarWinds Certified Professional, Nagios Certified Expert.
  • Experience with observability stacks (ELK/EFK, Prometheus, Grafana).
  • Exposure to AIOps platforms for predictive monitoring.
  • Knowledge of ITIL processes for incident, problem, and change management.

Skills:

Nagios, Splunk

Skills

GCPGrafanaSplunkCI/CDAWSPrometheusAnsibleAzurePowershellDevOpsElkPythonSecurity