Talent.com
Senior DL Performance Infrastructure and MLOps Engineer
Senior DL Performance Infrastructure and MLOps EngineerNVIDIA • Remote, Poland
Senior DL Performance Infrastructure and MLOps Engineer

Senior DL Performance Infrastructure and MLOps Engineer

NVIDIA • Remote, Poland
30+ days ago
Job description

We are now looking for a Senior DL Performance Infrastructure & MLOps Engineer.

NVIDIA is seeking engineers who love building world-class infrastructure, from automated command-line scripting to full-blown CI / CD systems running on some of the world's largest clusters, to support our work to accelerate training of deep neural networks like Stable Diffusion or ChatGPT via hardware and software innovations. If you have that itch whenever the mechanical aspects of code development, performance analysis, and data processing consume any more human time than necessary, we'd like to hear from you. If you are passionate about accelerating all existing workfloads in a diverse team while also envisioning next-gen opportunities to enable new forms of hardware / software analysis and development we haven't even thought of, this is the place for you.

What you'll be doing :

Improve all tooling and automation in use in the team, from simple data collection scripts to datacenter-scale ML CI / CD systems.

Understand and internalize workflows for GPU performance analysis and optimization so you can help us re-invent them.

Build Python-based machinery hooking into common Deep Learning software like PyTorch or JAX to support performance analysis work.

Ruthlessly discover and chase down workflow- and tool-related inefficiencies in the team's daily work, and dream up and implement ways to eliminate them.

What we need to see

MS degree in CS or adjacent fields or equivalent experience

3+ years of relevant work experience

Background in deep learning fundamentals and common deep learning software, especially PyTorch / JAX

Experience in GPU computing, i.e. fundamental understanding of heterogeneous multi-node accelerated computing systems

Background in analyzing and optimizing application performance

Familiarity with containerized CI / CD flows, e.g. gitlab + docker

Programming skills in C++, Python, and CUDA

Deep passion related to tools, scripts, and automation

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you! Come, join our DL Architecture team and help build the real-time, cost-effective AI computing platform driving our success in this exciting and quickly growing field.

Create a job alert for this search

Senior Infrastructure • Remote, Poland

Similar jobs
Senior Boomi Engineer

Senior Boomi Engineer

BlueSoft • Polska
We are looking for an experienced Boomi engineer, who is eager to work part-time in agile model of cooperation - i.Easter Standard Time area, depending on current needs. Advanced knowledge of Dell B...Show more
Last updated: 30+ days ago • Promoted
Senior Infrastructure Engineer, Cloud Platform

Senior Infrastructure Engineer, Cloud Platform

ClickUp • Polska, Polska
At ClickUp, we're not just building software.We're architecting the future of work In a world overwhelmed by work sprawl, we saw a better way. That's why we created the first truly converged AI work...Show more
Last updated: 1 day ago • Promoted
Lead IT Infrastructure-Integrator Engineer (OpenStack & Private Cloud) IRC279153

Lead IT Infrastructure-Integrator Engineer (OpenStack & Private Cloud) IRC279153

GlobalLogic • Polska
Ansible, DNS, Elastic Search, Grafana, HTTPS and TLS / SSL, Kafka, MQTT, Terraform.We are seeking a skilled and motivated IT Infrastructure Engineer with deep expertise in OpenStack to design, implem...Show more
Last updated: 30+ days ago • Promoted
MLOps Engineer

MLOps Engineer

Signify Technology • Polska
Poland, Wroclaw (Fully remote, must be based in Poland).A leading software company with bespoke products for the finance industry. Signify have partnered up with a European Fintech organisation spec...Show more
Last updated: 30+ days ago • Promoted
Senior DevOps & Infrastructure Engineer IRC277241

Senior DevOps & Infrastructure Engineer IRC277241

Hitachi Vantara Corporation • Polska
GlobalLogic is inviting an experienced Senior Data Platform Engineer to join our team and work on a strategic engagement with Cloudera, a global leader in enterprise data management and analytics.Y...Show more
Last updated: 30+ days ago • Promoted
Remote Senior Site Reliability Engineer — Incident Leader

Remote Senior Site Reliability Engineer — Incident Leader

Digitalhub Sac • Polska
Una empresa tecnológica en Polonia busca un ingeniero de confiabilidad de sitios senior.Este rol requiere una experiencia de más de 5 años con EKS, Kubernetes y containerización.El candidato debe p...Show more
Last updated: 3 days ago • Promoted
Expert DL Deployment Engineer IRC284135

Expert DL Deployment Engineer IRC284135

GlobalLogic • Polska
Our client is dedicated to making safe, intelligent mobility a reality.Headquartered in Sweden, they develop a complete, scalable software stack for ADAS and autonomous driving—from sensing to actu...Show more
Last updated: 24 days ago • Promoted
MLOps Engineer

MLOps Engineer

Hays • Polska
The client offers comprehensive IT services throughout Europe.Location : 100% remote or hybrid (Warsaw / Poznań / Lublin). PLN 135 / hour net + VAT (additional payment for on-call time per week / month).Type...Show more
Last updated: 30+ days ago • Promoted
DevOps Principal & Delivery Leader (Hands-on)

DevOps Principal & Delivery Leader (Hands-on)

Liatrio • Polska
A boutique consulting firm in Poland is seeking a Technical Principal to lead DevOps transformations.The ideal candidate will have strong experience in modern engineering practices and be passionat...Show more
Last updated: 30+ days ago • Promoted
MLOps Engineer

MLOps Engineer

TRG TECH RESEARCH GROUP Limited • PL
Quick Apply
At TRG we work in a global environment where every diverse personality and culture is included.We look for talented people worldwide who have passion for what they do and work together, shoulder to...Show more
Last updated: 30+ days ago
Remote MLOps Engineer : Scale Production ML Pipelines

Remote MLOps Engineer : Scale Production ML Pipelines

Hitachi Vantara Corporation • Polska
A leading digital engineering partner is looking for a Machine Learning Operations Engineer to deploy and monitor machine learning models effectively. You will bridge the gap between platform engine...Show more
Last updated: 30+ days ago • Promoted
Remote ML Systems Engineer : Scale LLMs & Infra

Remote ML Systems Engineer : Scale LLMs & Infra

RelationalAI • Polska
A leading tech company is looking for a Machine Learning Systems Engineer to enhance their machine learning infrastructure and contribute to open source projects. Candidates should have 3+ years of ...Show more
Last updated: 24 days ago • Promoted
Sr. / Staff - Infrastructure / Site Reliability Engineer (SRE)

Sr. / Staff - Infrastructure / Site Reliability Engineer (SRE)

Oscilar • Polska
Staff - Infrastructure / Site Reliability Engineer (SRE).Staff - Infrastructure / Site Reliability Engineer (SRE).Shape the future of trust in the age of AI. At Oscilar, we're building the most advanced...Show more
Last updated: 30+ days ago • Promoted
SRE / LLM Ops Engineer

SRE / LLM Ops Engineer

CluePoints • Polska
Accepting B2B & Contract of Employment applications).SRE Cloud Infrastructure engineer (SRE / Infra) is responsible for the development, the maintenance, the monitoring and the deployment automation ...Show more
Last updated: 1 day ago • Promoted
Senior Infrastructure Engineer

Senior Infrastructure Engineer

Recrucial • Polska
Recrucial is hiring a Senior Infrastructure Engineer for our client's large-scale Legal tech and fintech platform transformation initiative. This is a high-impact role at the core of a distributed s...Show more
Last updated: 30+ days ago • Promoted
Remote ML Engineer for Climate Vision & Impact

Remote ML Engineer for Climate Vision & Impact

FloVision Solutions • Polska
A dynamic technology firm is seeking a Machine Learning Engineer to design and develop deep learning features for applications. This remote position offers a chance to work on innovative machine-lea...Show more
Last updated: 24 days ago • Promoted
Cloud IaaS DevOps Engineer & SRE — Remote / Hybrid

Cloud IaaS DevOps Engineer & SRE — Remote / Hybrid

Gcore • Polska
A global software and infrastructure provider is seeking a DevOps Engineer with at least 5 years of experience, including 3 years with OpenStack. This role focuses on implementing infrastructure cha...Show more
Last updated: 7 days ago • Promoted
MLOps Engineer

MLOps Engineer

Intellias • Polska
MLOps / platform architecture or adjacent roles, with shipped AI systems.Proficient Python and strong software engineering principles. Deep experience with at least one major cloud (AWS / Azure / GCP) and...Show more
Last updated: 11 days ago • Promoted