Senior Infrastructure Reliability Lead – Hybrid/On-Prem & Cloud |SRE|

Location
Contract Type
Full-time
Work from home
Partial work from home
Published
Reference
20-16-414619
Job description

Are you an experienced Platform Engineerin Lead within SRE?

🌟 This role focuses on maintaining and optimising the organisation’s most critical technology infrastructure, ensuring operational stability while swiftly resolving complex technical issues. The position blends advanced technical expertise with leadership, strategic planning, and stakeholder engagement to deliver continuous improvement and resilience.

🛠️ Core Responsibilities

🔧 Advanced Technical Support – Provide expert-level assistance to resolve complex incidents for assigned clients or client groups. Shape and enhance support models to boost service quality for customers and stakeholders

📊 Proactive Maintenance & Monitoring – Carry out preventative hardware/software maintenance and use monitoring metrics to identify, prevent, and resolve potential issues before they impact operations

📚 Knowledge Management – Maintain and expand a knowledge base with detailed case documentation, enabling self-service, faster resolutions, and effective information sharing across teams

🕵️ Root Cause Analysis – Investigate system logs, error messages, and user reports to identify underlying issues in hardware, software, and networks; apply targeted fixes, replacements, reconfigurations, or software reinstalls

⚙️ Automation & Efficiency – Implement process automations and fine-tune monitoring tools, thresholds, and alerts to increase stability, improve responsiveness, and reduce manual workload

🛡️ Risk Management – Identify, mitigate, and escalate potential service-impacting risks through formal processes, supporting business continuity and governance requirements

📈 Capacity & Resiliency Planning – Oversee capacity management and implement resiliency strategies aligned with operational and front-office needs

🤝 Strategic & Leadership Dimensions 📌 Strategic Planning & Policy Management – Contribute to defining strategy, shaping requirements, and recommending change initiatives. Oversee resources, budgets, and processes to align with business goals

👥 Team & Talent Development – Define roles, plan for future organisational needs, coach and mentor team members, and lead specialists to balance short- and long-term priorities

🧠 Subject Matter Expertise – Provide technical direction, lead multi-year projects, guide structured work assignments, and ensure collaboration with other specialist areas when needed. 📢 Stakeholder Engagement – Build strong relationships with internal and external partners, influencing decisions through data-driven insights and negotiation skills

📊 Analytical & Problem-Solving Excellence – Apply deep analytical thinking, research, and complex option comparisons to define challenges and craft innovative solutions

🔍 Governance & Control – Strengthen operational controls, assess risks, and ensure alignment with the organisation’s control and governance agenda

🌐 Cross-Functional Collaboration – Stay aligned with evolving business strategies by actively engaging with other functional areas to provide tailored, business-aligned support

Requirements

What you should already have:

According to the Czech labour law, you need to hold a valid work permit.

✅ Proven experience on a SRE (Site Reliability Engineer) lead role

☁️🌐 Strong technical experience in cloud/hybrid environments

💻 Key Technologies & Skills

🐧 Strong Linux/Unix on-premise expertise, with advanced scripting skills in Python and Bash/Shell

🗄️ Proficiency in database technologies including PostgreSQL, MS SQL, and Oracle

📡 Experience implementing observability & monitoring solutions (Elastic, Geneos ITRS, Grafana)

📦 Skilled in containerisation and virtualisation technologies

📜 Strong grasp of ITIL principles with a proactive, improvement-driven mindset

🏆 Previous leadership or high-level technical experience in a similar environment

Benefits
  • Annual bonus
  • 5 weeks of holiday/year
  • 60 sick days per year
  • pension contribution
  • courses
  • home office
  • modern offices
  • work-life balance
  • cafeteria points
  • meal vouchers
Other notes
For more related job opportunities visit https://www.grafton.cz/en/job-search