About the team:
We are a team that provides round-the-clock monitoring of services and infrastructure. Our main mission is to guarantee the stability and continuous operation of business processes by proactively identifying and resolving potential problems. We build effective monitoring and automation solutions that reduce risk and maximize the availability of critical systems.
Requirements:
- Flexibility with processes, an aptitude for fast self-study, and willingness to share knowledge with colleagues
- Understanding of the basic principles of network infrastructure and systems administration
- Basic experience working with Linux
- Ability to communicate and collaborate with other departments and team members
- Ability to quickly analyze data and draw conclusions to solve problems and optimize processes
- Willingness to take on-call shifts or work outside regular hours to resolve critical problems
Tasks:
- Monitoring active positions, servers, and communication channels using monitoring systems
- Reviewing service and server logs to identify potential problems and anomalies in the system
- Logging, classifying, escalating, and routing incidents of varying complexity
- Preparing analytical and monitoring dashboards for products
- Actively participating in reviews of existing processes to improve their efficiency and stability
- Sharing knowledge and helping resolve complex monitoring issues
- Maintaining the current monitoring setup and monitoring procedures
- Working closely with developers and engineers to resolve problems and optimize infrastructure
- Handling abnormal situations and restoring service for incidents of the first level of complexity
Technologies and tools:
- Elastic/OpenSearch (Kibana)
- Grafana
- SQL/Metabase
- Jira/Confluence
- Loki
- PagerDuty