
Site Reliability Engineer II

AltaReturn • Krakow, Poland

30+ days ago

Job description

We are Allvue Systems, the leading provider of software solutions for the Private Capital and Credit markets. Whether a client wants an end-to-end technology suite, or independently focused modules, Allvue helps eliminate the boundaries between systems, information, and people.

We’re looking for ambitious, smart, and creative individuals to join our team and help our clients achieve their goals. Working at Allvue Systems means working with pioneers in the fintech industry. Our efforts are powered by innovative thinking and a desire to build adaptable financial software solutions that help our clients achieve even more. With our common goals of growth and innovation, whether you’re collaborating on a cutting-edge project or connecting over shared interests at an office happy hour, the passion is contagious.

We want all of our team members to be open, accessible, curious and always learning. As a team, we take initiative, own outcomes, and have passion for what we do. With these pillars at the center of what we do, we strive for continuous improvement, excellent partnership and exceptional results.

Come be a part of the team that’s revolutionizing the alternative investment industry. Define your own future with Allvue Systems!

Help develop and implement strategies for the monitoring and alerting of systems health, performance, and security.
Help develop and implement strategies for incident management, problem management, and change management.
Create and maintain automation tools and code for configuration management, deployment, and maintenance of cloud-based infrastructure.
Collaborate with development and operations teams to ensure that application and infrastructure changes are properly tested, deployed, and maintained.
Develop and maintain documentation of system configurations, processes, and procedures.
Champion a culture of continuous improvement by giving and gathering feedback.
Collaborate with Product and Engineering teams to ensure successful delivery and operation of diverse systems at scale.
Identify opportunities to improve current technology, both overall and in individual systems. Avoid creating technical debt, identify it quickly where it exists, and prioritize its remediation.
Understanding of DevOps methodologies and SRE best practices.
Understanding of DevOps practices, including CI/CD pipelines, configuration management, and Infrastructure as Code (IaC).
Experience with scripting or programming languages (PowerShell, Python, or similar) for automation and infrastructure management in AWS and Azure, as well as IaC tools such as Terraform and CloudFormation.
Understanding of networking, security, and identity and access management (IAM) in cloud environments.
Knowledge of cloud computing concepts, including expertise in operating cloud-based solutions using IaaS, PaaS, and SaaS models.
Experience with monitoring, observability and logging tools (Datadog, Splunk, Prometheus, Grafana, etc.).
Proficient in performing in-depth analysis, technical troubleshooting, and problem resolution.
Strong time management skills, with the ability to multitask and perform well under pressure. Ability to adapt to changing priorities and meet deadlines.
Experience working within geographically distributed organizations.
Professional written and interpersonal skills.
AWS or Azure certifications (AWS / Azure Solutions Architect, Developer, etc.) are a plus, but not required.
