E1
E1 is looking for a qualified Senior Site Reliability Engineer. #Remote #Ukraine
As a Site Reliability Engineer, you will play a key role in defining and achieving SLOs for users and internal teams.
This is the first dedicated Site Reliability Engineer position on the product, so you are expected to have experience building processes from scratch.
Main duties:
- Monitoring and alerting: you will lead initiatives to improve the customer's monitoring and alerting systems, ensuring they meet the requirements of the backend, DevOps, DBA and Security teams.
- SLA/SLO/SLI management: you will implement the basics of SLA management and monitor SLIs, working closely with the technical team to achieve performance goals.
- Incident response: you will define the responsibilities of support specialists, support troubleshooting efforts, manage communications during service interruptions, and conduct incident investigations.
- Reliability and security measures for a large information platform: you will develop a strategy for managing backups, chaos testing, and health monitoring of data and infrastructure.
Requirements:
- Significant experience as an SRE with an emphasis on site reliability.
- Proficiency in Docker and Kubernetes.
- In-depth knowledge of Linux, including a thorough understanding of core components and hands-on experience managing distributed Linux systems.
- Skills in using basic tools for telemetry, tracing, alerting and monitoring.
- Systems thinking and accountability.
Recruitment stages:
- Screening call,
- Technical interview with a short (20-30 min.) test,
- General interview.
Contacts:
Telegram (priority) @DenisOskorbin
Skype Denysjet
Mail [email protected] or [email protected]
We respect your desire to develop professionally and monetize your professional skills, so we will make every effort to ensure that your skills and experience receive a high market value.