DevOps Platform Senior Specialist
- Sisal Albania Sh.p.k.
- Tirana - Rruga Dritan - 40h
- 8mo ago
- Full-Time
- Remote
About us:
Flutter, the world’s largest online sports betting and iGaming group listed on the London and New York stock exchanges, is home to the Southern Europe & Africa (SEA) region, featuring iconic brands like Sisal and PokerStars, a globally loved brand that embodies innovation and ambition, with significant growth potential in a dynamic market.
We are seeking an experienced DevOps Platform Senior Specialist to join our infrastructure team of DevOps Platform.
This role seeks an experienced engineer to manage Kubernetes clusters across OCI and Fury, supporting both on-prem and cloud environments using tools like Ansible, OCI-CLI, Helmfile, and Makefile. It also involves 24/7 on-call support for system availability.
What you'll do:
Kubernetes Cluster Operations
Design, deploy, and manage enterprise-grade Kubernetes clusters on OCI cloud infrastructure and Fury on-premise environments
Implement automated deployment processes using Ansible Playbooks for on-premise infrastructure and OCI-CLI, Helmfile, and Makefile for cloud-based automation
Optimize cluster configurations to ensure peak performance, security compliance, and resource efficiency
Infrastructure Monitoring and Observability
Architect and maintain comprehensive monitoring solutions using the kube-prometheus-stack (Prometheus, Grafana, Alertmanager)
Configure and manage centralized logging infrastructure using Fluent Bit and OpenSearch for advanced log analysis and correlation
Implement log forwarding to Splunk and integrate Alertmanager for proactive incident detection and notification
Security and Access Management
Design and implement Role-Based Access Control (RBAC) policies for development teams
Ensure secure cluster access protocols and maintain compliance with organizational security standards
Monitor and respond to security vulnerabilities and threats across the platform
System Troubleshooting and Performance Optimization
Conduct advanced diagnostics and resolution of complex issues across clusters, services, and networking infrastructure
Perform comprehensive root cause analysis for performance bottlenecks, system failures, and service disruptions
Drive continuous improvement initiatives to enhance cluster stability, availability, and operational resilience
Automation and Tool Management
Maintain and optimize infrastructure toolchain including Nginx Ingress Controller, Opensearch, Fluentbit, Fluentd, Dynatrace, Splunk, Logging Operator, Prometheus, Grafana, Alertmanager, KEDA, Cluster Autoscaler on OCI
Develop, maintain, and enhance infrastructure automation scripts using Bash, Ansible, Python and OCI-CLI
Implement Infrastructure as Code practices to ensure consistent and repeatable deployments
Incident Response and On-Call Support
Participate in structured 24/7 on-call rotation schedule for critical incident response
Provide immediate response to platform outages, security incidents, and system failures
Maintain incident documentation and participate in post-incident reviews and improvement processes.
What You'll Bring:
Min 2 years of experience in similar roles
Bachelor’s degree in Computer Science, Engineering, or related field
Oracle Cloud Infrastructure and CKA certifications (preferred)
Advanced Kubernetes expertise in architecture, deployment, and troubleshooting
Proficient with: OCI-CLI, Helm, Helmfile, Makefile, Kustomize, Ansible, Python
Skilled in observability tools: Prometheus, Grafana, Fluent Bit, OpenSearch, Splunk, Alertmanager
Strong Linux system administration, Bash scripting, and networking knowledge
Solid understanding of RBAC, security best practices, and log aggregation
Familiarity with Dynatrace (preferred)
Proven troubleshooting skills in complex containerized environments
Comfortable with 24/7 on-call support rotation
Team oriented mindset with a collaborative approach
Proactive and able to adapt in dynamic settings
Proficiency in English (Italian is a plus).
Why choose us:
Permanent contract.
Food Allowance.
Pension Fund.
Company Owned Devices (laptop and business mobile phone).
Flexible working hours and possibility of smart-working (according to the company's internal policy).
24 Extra Hours Paid Leave.
Preferential treatment on products offered by Intesa Sanpaolo bank.
Supplementary Private Health Insurance and consultation with the company doctor.
Flutter Sharesave Plan.
Choose us also for:
Psychological well-being: online meditation courses, medical online service, and counseling service thanks to the support of certified psychologists and coaches;
Continuous learning for soft and hard skills (es. Learn / Platform, Training);
Support for parents and children and new mothers’ contributions.
Much more about us:
Equal Opportunity
Flutter is an Equal Opportunity Employer. Diversity and Inclusion are fundamental values for us. We welcome any candidate without distinction of age, culture, religion, ethnicity, sexual orientation, gender identity and expression.
Location: Tirana, Albania