- You are familiar with monitoring, incident management (and operational problem management).
- You are the first point of contact for symptom reports and incidents from your customers.
- The first level team prequalifies and filters the IT incidents in the ITSM tool (ServiceNow e.g.) and you are responsible for ensuring that the incidents are resolved and properly documented according to the defined procedures.
- You actively take part in building up your team’s knowledge and drive the evolution of our managed services.
- We work in a 3-shift system (early, late, night) and offer very attractive compensation.
What will be your responsibilities?
- Analyzing and solving incidents (user & monitoring) using tools like Grafana, Prometheus, Kibana, Dynatrace and AWS monitoring.
- Creating and analyzing problem tickets including analyzing the code, where possible (Java, Spring), SQL queries, views and functions.
- Optimizing the existing SQL queries.
- Working with various AWS resources.
- Changes on Kubernetes cluster (restarts, rollbacks, secrets, configurations).
- Adjusting and implementing monitors.
- Extending the existing documentation.
- Running CI/CD pipelines.