Skill Requirements
1. In-depth knowledge of site reliability engineering (SRE) principles and best practices.
2. Proficiency with system monitoring, incident management, and performance-tuning tools.
3. Strong understanding of cloud services, microservices architecture, and containerization technologies.
4. Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
5. Experience with scripting languages (e.g., Python, Bash) for automation and tool development.
6. Familiarity with Agile methodologies and DevOps practices for continuous integration and delivery.
7. Strong communication and leadership skills to lead a support team and collaborate with cross-functional teams.
8. Ability to work under pressure, prioritize tasks, and manage multiple projects simultaneously.
Certifications: Relevant certifications in site reliability engineering (SRE) or cloud services are a plus.