About Us
We're the world's leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we're proud to support the global economy.
We're unique too. We were established to find a better way for the global financial community to move value: a reliable, safe and secure approach that the community can trust, completely. We're always striving to be better and are constantly evolving in an ever-changing landscape, without undermining that trust. Five decades on, our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently, test exhaustively, then implement fast. In a connected and exciting era, our mission has never been more relevant. Swift now has a presence in 200+ countries and legal territories to serve a community of more than 12,000 banks and financial institutions.
About The Role
As a Senior Site Reliability Engineering Manager, you will lead a team responsible for the reliability, observability, and automation of Swift's monitoring platform, which powers infrastructure, network, and synthetic monitoring. You will ensure high availability for critical services while driving an automation-first culture. This role requires hands-on experience in troubleshooting complex systems, scaling distributed platforms, and mentoring a team to own operational excellence.
Key Responsibilities
- Team Building and Mentorship:
  - Recruit, retain, and grow engineers with expertise in monitoring, observability, and automation.
  - Mentor team members on incident response, root cause analysis, and production troubleshooting.
- Operational Leadership:
  - Own reliability, uptime, and performance of monitoring and observability platforms.
  - Lead incident management, major incident response, and post-incident reviews.
  - Drive automation to reduce manual operational work, including runbooks and self-healing systems.
- Collaboration and Alignment:
  - Partner with Product Owners, Engineering Leads, and cross-functional teams to align SRE priorities with business impact.
  - Promote transparency, visibility, and best practices across teams.
- Technical Leadership:
  - Guide system design, architecture, and operational best practices for monitoring and observability platforms.
  - Advocate for automation, observability, and reliability at scale.
- Continuous Improvement and Innovation:
  - Introduce new monitoring, observability, and automation tools.
  - Encourage knowledge sharing, learning, and innovation across teams.
What Will Make You Successful
Professional Skills
- Strong leadership, communication, and mentoring skills.
- Passion for troubleshooting and operational excellence.
- Hands-on experience with monitoring, metrics, logging, tracing, and alerting.
- Familiarity with Agile, DevOps, and SRE practices.
- Fluency in English.
Key Qualifications
- 8+ years in software engineering or operations for large-scale distributed systems.
- 5+ years managing technical teams, preferably SRE, platform, or production engineering.
- Expertise in monitoring platforms and observability tools (ELK, Grafana, OpenTelemetry, Splunk).
- Strong automation skills: infrastructure as code, CI/CD for ops, and scripting (Python, Go, Bash).
- Production troubleshooting experience across the software stack, networks, and infrastructure.
- Large-scale Linux, Kubernetes, or cloud-native operations experience.
- Proven ability to manage mission-critical services and drive reliability culture.
Additional Requirements
- Advocate for automation-first approaches to minimize operational toil.
- Strong sense of ownership and transparent communication style.
- Self-motivated, curious, and proactive in improving systems and processes.
About The Team
Our SRE team tackles high-scale, high-impact challenges in monitoring, observability, and reliability. We value troubleshooting, automation-first thinking, and operational excellence. Collaboration, learning, and innovation are core to our culture.
What We Offer
We put you in control of your career
We give you a competitive package
We help you perform at your best
We help you make a difference
We give you the freedom to be yourself
We are creating an environment of unique individuals, like you, with different perspectives on the financial industry and the world: a diverse and inclusive environment in which everyone's voice counts and where you can reach your full potential.
If you believe you require a reasonable accommodation to participate in the job application or interview process, please contact us to request accommodation.
Don't meet every single requirement? At Swift, we are dedicated to building a workplace where people can bring their full selves and ideas to the team, so if you are excited about this role, we encourage you to apply even if you do not meet every single qualification.