We are looking for a Junior Site Reliability Engineer (SRE) to join a professional and experienced team supporting corporate clients of various business sizes worldwide. This role provides ample opportunities for hands-on experience, learning new technologies and professional growth (such as SRE or DevOps). Reliability, strong self-discipline and hands-on system administration experience are critical to our success.Brief roleWork model: Remote or in Kharkiv office.Language: English (minimum B2 l
We are looking for a Junior Site Reliability Engineer (SRE) to join a professional and experienced team supporting corporate clients of various business sizes worldwide. This role provides ample opportunities for hands-on experience, learning new technologies and professional growth (such as SRE or DevOps). Reliability, strong self-discipline and hands-on system administration experience are critical to our success.
Brief role
Work model: Remote or in Kharkiv office.
Language: English (minimum B2 level)
Schedule: Flexibility in working hours is required; sometimes there may be night shifts.
WHAT YOU WILL DO:
- Administration and support of Linux and Windows server environments (production and staging).
- Management of local (on-premise) and cloud servers, ensuring availability, high performance and cyber protection.
- Management of cloud users, accounts and subscriptions (access control, organization of resources, cost control).
- Monitoring of the state of systems, logs and alerts; response to incidents and degradation of services.
- Performance of patch management, updates and strengthening of OS security.
- Management of user accesses, rights and basic identification services.
- Execution and regular verification of backup and recovery procedures.
- Participation in monitoring, alerting and backup operations as part of professional SRE / SysOps teams.
- Support disaster recovery planning and testing.
- Collaboration with Engineering and Infrastructure teams on production readiness and resource status.
- Maintain clear operational documentation, runbooks and standards.
- Participate in incident analysis with root cause analysis and actionable remediation plans.
- Interaction with security teams in cases where compliance requirements or security impact operations.
WHAT WE EXPECT FROM YOU:
- Hands-on Linux administration experience (e.g. Ubuntu, CentOS, RHEL, Debian).
- Hands-on Windows Server administration experience (Active Directory, Group Policy, IIS is a plus).
- Experience in any technical support processes (professional or private).
- Understanding the basics of networks (DNS, TCP/IP, firewalls, routing databases).
- Strong troubleshooting skills and analytical thinking; calm and structured approach to work in stressful situations, time pressure.
- English level B2 or higher (written and oral).
- Responsibility, reliability and flexibility in the work schedule, including possible night shifts.
Would be a plus:
- Familiarity with DevOps and automation tools (eg Git, CI/CD pipelines, Ansible, Terraform).
- Experience supporting production environments with uptime and performance requirements.
- Experience managing on-premise and cloud infrastructure.
- Experience with containerization or orchestration (Docker, Kubernetes).
- Knowledge of cloud platforms such as AWS, Azure or GCP.
- Basic scripting skills (Bash, PowerShell or Python).
- Experience with monitoring and logging tools (Prometheus, Grafana, ELK, Zabbix etc.).
- Ability to write clear technical documentation and runbooks.