Centage is transforming how finance teams operate by providing modern, intuitive, and automated tools for budgeting, forecasting, and financial planning. As we embark on the next phase of scaling and platform transformation, we’re looking for an exceptional SRE to join our team and help future-proof our infrastructure.
Role Overview
As a Site Reliability Engineer at Centage, you'll partner closely with engineering to drive platform modernization, reliability, and data infrastructure evolution. This includes deep involvement in cloud, Kubernetes, and data workflows as well as leading initiatives around observability, security, and performance.
You’ll also play a critical role in supporting and re-architecting our legacy ETL pipelines (Jenkins + SSAS cubes) as we migrate to a modern MongoDB-backed approach .
What You’ll Gain
Must-Have :
Effective communicator, able to coordinate across support and product teams.
Nice-to-Have :
Centage is transforming how finance teams operate by providing modern, intuitive, and automated tools for budgeting, forecasting, and financial planning. As we embark on the next phase of scaling and platform transformation, we’re looking for an exceptional SRE to join our team and help future-proof our infrastructure.
Role Overview
As a Site Reliability Engineer at Centage, you'll partner closely with engineering to drive platform modernization, reliability, and data infrastructure evolution. This includes deep involvement in cloud, Kubernetes, and data workflows as well as leading initiatives around observability, security, and performance.
You’ll also play a critical role in supporting and re-architecting our legacy ETL pipelines (Jenkins + SSAS cubes) as we migrate to a modern MongoDB-backed approach .
What You’ll Gain
Design, implement, and maintain secure, scalable infrastructure in AWS, including cost and performance optimization., Manage, monitor, and scale Kubernetes (EKS) clusters; handle container lifecycle, Helm charts, and service mesh configuration., Lead the re-architecture of our legacy Jenkins-based ETL pipelines, currently relying on SSAS data cubes, toward a modern MongoDB-native approach., Ensuring incident response readiness and supporting root cause analysis in coordination with support and development teams., Optimize and scale MongoDB Atlas environments, managing replication, performance tuning, and availability., Drive observability through tools like Sumo Logic (preferred), Grafana, Prometheus, and Datadog., Implement and manage Infrastructure-as-Code solutions (Terraform, Pulumi) and contribute to CI / CD automation., Own operational metrics (SLAs / SLOs / SLIs) and continuously improve system reliability through automation and process refinement., Document best practices, build runbooks, and ensure system resilience through testing and chaos engineering where appropriate.] Requirements : Jenkins, AWS, Kubernetes, MongoDB, Bitbucket, Scripting language Additionally : Training budget, Private healthcare, Small teams.
Site Reliability Engineer • Kraków, Poland