This job offer is not available in your country.

Site Reliability Engineer (SRE)

DevsData LLCWarszawa, Masovian, Poland

21 days ago

Job description

technologies-expected :

Kubernetes
Terraform
Prometheus
Grafana
Loki
Alertmanager

technologies-optional :

Rust

about-project :

Are you a passionate Site Reliability Engineer? We’re hiring for a company specialized in distributed systems, content delivery, and video streaming at scale. This fast-growing tech company is transforming in-transit entertainment with an intelligent caching platform that enables airlines and cruise lines to deliver personalized, high-quality video content — even without internet access. Join a global team building the next-generation content delivery system for aircraft and maritime environments.

Salary : €7,900 – €9,000 / month

Location : 100% Remote

Type : Full-time

Contract type : B2B

responsibilities :

Design, deploy, and maintain Kubernetes-based infrastructure using Terraform and Infrastructure-as-Code principles

Lead software deployment efforts for two new international content delivery sites

Build and optimize observability systems (metrics, logging, alerting) to monitor service health and performance

Collaborate with engineering teams to develop and automate CI / CD pipelines (GitLab CI, Argo CD, etc.)

Operate and improve service mesh technology (e.g., Istio) to ensure secure, reliable service-to-service communication

Troubleshoot production systems with a focus on distributed services and networking (HTTP / S, DNS, QUIC)

Contribute to post-incident reviews, root cause analyses, and long-term stability initiatives

Participate in on-call rotations for incident response and site uptime

requirements-expected :

3+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles

Strong hands-on experience with :

Kubernetes (Helm, Operators, workload and networking management)

Terraform and other Infrastructure-as-Code tools

Containers and orchestration at scale

CI / CD systems (e.g., GitLab CI, Argo CD)

Observability tools like Prometheus, Grafana, Loki, Alertmanager

Familiarity with service meshes such as Istio or Linkerd

Deep understanding of networking protocols (TCP / IP, HTTPS, DNS, QUIC)

Experience with distributed systems principles (consistency, fault tolerance, horizontal scaling)

Ability to diagnose and resolve production issues effectively in high-availability systems

Bachelor’s degree in Computer Science or equivalent professional experience

benefits :

remote work opportunities

Create a job alert for this search

Site Reliability Engineer • Warszawa, Masovian, Poland