Senior Site Reliability Engineer


Company 

Caspian One

Location 

newcastle-upon-tyne

Employment Hours 

Full Time

Employment Type 

Permanent

Salary 

Job Requirements/Description
Seeking an exceptional talent to shape the core of a rapidly growing fintech that's revolutionizing the financial services landscape. If you are a passionate Senior Site Reliability Engineer (SRE) ready to take the helm and build a world-class SRE function from the ground up, harnessing the latest in cloud, monitoring, and DevOps technologies please apply! As the firm scales, they are looking for an SRE who’s not just exceptional in their field, but pioneering . You'll be given the autonomy to design and implement SLOs, optimize the system’s reliability, and safeguard our infrastructure against vulnerabilities, all while working at the cutting edge of finance tech. What You’ll Do: Build and lead: Establish a top-tier SRE function from scratch, creating the processes, tools, and culture needed to scale our operations globally. Drive reliability: Own and optimize Service Level Objectives (SLOs) , ensuring systems are always performing at their peak. Monitoring & Automation: Implement cutting-edge monitoring tools to create real-time system observability and automate repetitive tasks to ensure the platform is self-healing and always available. Security & Vulnerability Management: Collaborate with security teams to implement vulnerability management , ensuring the fintech infrastructure is robust and secure against emerging threats. Incident Management: Develop clear incident response processes and lead blameless post-mortems to continuously improve service reliability. Cloud Expertise: Utilize your deep knowledge of cloud platforms (AWS, GCP) and container orchestration to optimize infrastructure at scale. DevOps & CI/CD: Implement DevOps best practices and CI/CD pipelines that drive continuous delivery and robust system health. What You Bring: Proven experience building and scaling SRE functions, ideally in a high-growth or fintech environment. Deep expertise in monitoring tools (e.g., Prometheus, Grafana, Datadog), logging, and alerting systems. Extensive experience with cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker). Strong knowledge of Service Level Indicators (SLIs) and SLOs, including optimizing system performance and uptime. Excellent understanding of vulnerability management , security best practices, and incident response frameworks. Mastery of automation and infrastructure as code (Terraform, Ansible, etc.). Experience in driving DevOps culture, implementing CI/CD pipelines, and creating reliable systems from day one. An engineering background able to program in multiple languages and embed in core development teams A collaborative mindset, ready to work cross-functionally with developers, security experts, and product teams to drive results. Why Join? Shape the Future: As the first Senior SRE, you’ll be the architect of our reliability culture, setting standards that will define the future of fintech infrastructure. Innovative Environment: Work with the latest in cloud, DevOps, and automation technologies, in a forward-thinking fintech. Growth Opportunities: Scale with us as we expand globally, with a path to leadership as our team grows. Competitive Salary & Benefits: We offer a generous package, equity options, and flexible working arrangements. Flexible working: Fully remote on offer! Ready to Make an Impact? If you're an SRE expert who thrives in fast-paced, innovative environments and is excited by the opportunity to build from scratch, we’d love to hear from you. Apply today and help us drive the future of fintech!
Company 

Caspian One

Location 

newcastle-upon-tyne

Employment Hours 

Full Time

Employment Type 

Permanent

Salary 

An error has occurred. This application may no longer respond until reloaded. Reload 🗙