PySpark Developer
Description
We are looking for a skilled Data Engineer with expertise in Python, PySpark, and Cloudera to join our team. The ideal candidate will be responsible for developing and optimizing big data pipelines while ensuring efficiency and scalability. Experience with Databricks is a plus. Additionally, familiarity with Git, GitHub, Jira, and Confluence is highly valued for effective collaboration and version control.
Key Responsibilities
- Design, develop, and maintain ETL pipelines using Python and PySpark (an illustrative sketch follows this list).
- Work with Cloudera Hadoop ecosystem to manage and process large-scale datasets.
- Ensure data integrity, performance, and reliability across distributed systems.
- Collaborate with data scientists, analysts, and business stakeholders to deliver data-driven solutions.
- Implement best practices for data governance, security, and performance tuning.
- Use Git and GitHub for version control and efficient code collaboration.
- Track and manage tasks using Jira, and document processes in Confluence.
- (Optional) Work with Databricks for cloud-based big data processing.
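To give a concrete sense of the day-to-day work, here is a minimal sketch of the kind of PySpark ETL job this role involves. It assumes a Hive-enabled cluster (as on Cloudera); the HDFS path, table name, and column names are hypothetical placeholders, not part of any actual codebase.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (
    SparkSession.builder
    .appName("example-etl")
    .enableHiveSupport()  # Hive metastore access, as on a Cloudera cluster
    .getOrCreate()
)

# Extract: read raw data from HDFS (path is a placeholder).
raw = spark.read.parquet("hdfs:///data/raw/events")

# Transform: drop malformed rows, deduplicate, and derive a partition column.
clean = (
    raw.dropna(subset=["event_id", "event_ts"])
       .withColumn("event_date", F.to_date("event_ts"))
       .dropDuplicates(["event_id"])
)

# Load: write back partitioned by date for efficient downstream queries.
(
    clean.write
         .mode("overwrite")
         .partitionBy("event_date")
         .saveAsTable("analytics.events_clean")
)
```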
Required Skills & Experience
- Strong programming skills in Python.
- Hands-on experience with PySpark for distributed data processing.
- Expertise in the Cloudera Hadoop ecosystem (HDFS, Hive, Impala).
- Experience with SQL and working with large datasets.
- Knowledge of Git and GitHub for source code management.
- Experience with Jira for task tracking and Confluence for documentation.
- Strong problem-solving and analytical skills.
Preferred Qualifications
- Basic knowledge of Databricks for cloud-based big data solutions.
- Experience with workflow orchestration tools (e.g., Airflow, Oozie).
- Understanding of cloud platforms (AWS, Azure, or GCP).
- Exposure to Kafka or other real-time streaming technologies (see the streaming sketch below).
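For candidates curious about the streaming side of the role, the sketch below shows a minimal PySpark Structured Streaming job consuming from Kafka. The broker address, topic name, and checkpoint path are assumptions for illustration only, and running it requires the spark-sql-kafka connector package on the classpath.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("example-stream").getOrCreate()

# Source: subscribe to a Kafka topic (broker and topic are placeholders).
events = (
    spark.readStream
         .format("kafka")
         .option("kafka.bootstrap.servers", "broker:9092")
         .option("subscribe", "events")
         .load()
)

# Kafka delivers key/value as binary; cast the value to a string payload.
payloads = events.select(F.col("value").cast("string").alias("payload"))

# Sink: a console sink for illustration; a real job would write to HDFS/Hive.
query = (
    payloads.writeStream
            .format("console")
            .option("checkpointLocation", "/tmp/checkpoints/events")
            .start()
)
query.awaitTermination()
```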