Talent.com
This job offer is not available in your country.
PySpark Developer

PySpark Developer

Axiom Software Solutions LimitedWarsaw, Masovian Voivodeship, PL
30+ days ago
Job type
  • Quick Apply
Job description

PySpark Developer

Description

We are looking for a skilled Data Engineer with expertise in Python, PySpark, and Cloudera to join our team. The ideal candidate will be responsible for developing and optimizing big data pipelines while ensuring efficiency and scalability. Experience with Databricks is a plus. Additionally, familiarity with Git, GitHub, Jira, and Confluence is highly valued for effective collaboration and version control.

Key Responsibilities

  • Design, develop, and maintain ETL pipelines using Python and PySpark.
  • Work with Cloudera Hadoop ecosystem to manage and process large-scale datasets.
  • Ensure data integrity, performance, and reliability across distributed systems.
  • Collaborate with data scientists, analysts, and business stakeholders to deliver data-driven solutions.
  • Implement best practices for data governance, security, and performance tuning.
  • Use Git and GitHub for version control and efficient code collaboration.
  • Track and manage tasks using Jira, and document processes in Confluence.
  • Optional) Work with Databricks for cloud-based big data processing.

Required Skills & Experience

  • Strong programming skills in Python.
  • Hands-on experience with PySpark for distributed data processing.
  • Expertise in Cloudera Hadoop ecosystem (HDFS, Hive, Impala).
  • Experience with SQL and working with large datasets.
  • Knowledge of Git and GitHub for source code management.
  • Experience with Jira for task tracking and Confluence for documentation.
  • Strong problem-solving and analytical skills.
  • Preferred Qualifications

  • Basic knowledge of Databricks for cloud-based big data solutions.
  • Experience with workflow orchestration tools (e.g., Airflow, Oozie).
  • Understanding of cloud platforms (AWS, Azure, or GCP).
  • Exposure to Kafka or other real-time streaming technologies.
  • Create a job alert for this search

    Developer • Warsaw, Masovian Voivodeship, PL

    Related jobs
    ETRM Data Scientist

    ETRM Data Scientist

    IT PerformanceWarszawa, Polska
    Poszukujemy kandydatów na stanowisko ETRM Data Scientist.Praca jest dedykowana dla rozpoznawalnej firmy konsultingowej IT. Tworzenie, wdrażanie i utrzymywanie skalowalnych modeli predykcyjnych (ARIM...Show moreLast updated: 30+ days ago
    Senior Data Engineer (Palantir Foundry Expert) | AI & Data

    Senior Data Engineer (Palantir Foundry Expert) | AI & Data

    Deloitte CEWarsaw, Poland
    Senior Data Engineer (Palantir Foundry Expert) | AI & Data.Senior Data Engineer (Palantir Foundry Expert) | AI & Data.Deloitte Digital, Strategy, Analytics and M&A. Consulting, Data & Analytics, Dig...Show moreLast updated: 5 days ago
    PySpark Developer, Rzeczpospolita Polska

    PySpark Developer, Rzeczpospolita Polska

    Axiom Software Solutionswarsaw, Rzeczpospolita Polska, Poland
    PySpark Developer PySpark Developer Description We are looking for a skilled Data Engineer with expertise in Python, PySpark, and Cloudera to join our team. The ideal candidate will be responsible ...Show moreLast updated: 30+ days ago
    Middle / Senior Data Engineer

    Middle / Senior Data Engineer

    BonapoliaWarszawa, Mazowieckie, PL
    Quick Apply
    For job seekers, BONAPOLIA offers a gateway to exciting career prospects and the chance to thrive in a fulfilling work environment. We believe that the right job can transform lives, and we are comm...Show moreLast updated: 30+ days ago
    Data Platform Engineer @ ITFS Sp. z o.o.

    Data Platform Engineer @ ITFS Sp. z o.o.

    ITFS Sp. z o.o.Warszawa, Poland
    Inżynier Platformy Danych będziesz wspierać projektowanie i tworzenie architektury platformy danych oraz jej kluczowych komponentów. Co najmniej podstawowe doświadczenie z.Doświadczenie w projektowa...Show moreLast updated: 8 days ago
    • New!
    Senior Data Engineer (Microsoft Fabric) @ Square One Resources

    Senior Data Engineer (Microsoft Fabric) @ Square One Resources

    Square One ResourcesWarsaw, Poland
    To idealna rola dla osoby, która szuka elastycznego, dorywczego zaangażowania z wykorzystaniem najnowszych technologii Microsoft. Minimum 8 lat doświadczenia w data engineeringu lub BI.Doświadczenie...Show moreLast updated: 9 hours ago
    Python Developer

    Python Developer

    1dea Kośnik sp.kWarszawa, Polska
    Dla jednego z naszych dużych klientów poszukujemy osoby do roli : .ASAP (akceptujemy kandydatury z max 1msc okresem wypowiedzenia). Stawka (ustalana indywidualnie) : .B2B (outsourcing z 1dea), full-time...Show moreLast updated: 30+ days ago
    Senior Data Engineer @ Harvey Nash Technology

    Senior Data Engineer @ Harvey Nash Technology

    Harvey Nash TechnologyWarszawa, Poland
    Design, build, and enhance data pipelines for streaming and batch processing Extend and support our AWS cloud data platform Develop features using Databricks pipelines, Unity Catalog, and Spark S...Show moreLast updated: 7 days ago
    Data Engineer with Pyspark @ Capgemini Polska Sp. z o.o.

    Data Engineer with Pyspark @ Capgemini Polska Sp. z o.o.

    Capgemini Polska Sp. z o.o.Warszawa, Poland
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 12 days ago
    • Promoted
    Data Engineer / Data Tech Lead

    Data Engineer / Data Tech Lead

    apreel Sp. z o.o.Warszawa, Masovian, Poland
    We are looking for skilled Data Tech Lead to join an enterprise-scale data platform project leveraging Azure cloud and modern big data technologies. You will work on building robust data pipelines, ...Show moreLast updated: 10 days ago
    Data Engineer

    Data Engineer

    GreenmindsWarszawa, Masovian Voivodeship, PL
    Dla jednego z naszych Klientów poszukujemy specjalistów na stanowisko.Forma współpracy : kontrakt B2B.Docelowo, zespół ma powiększyć się o architektów i większą liczbę deweloperów.Projekt międzynaro...Show moreLast updated: 28 days ago
    Data Analyst Expert

    Data Analyst Expert

    B2B.NET S.A.Warszawa, Masovian, Poland
    We are seeking a skilled and motivated Data Analyst to join our data-driven organization and support development teams in the IIS (Information Infrastructure Services) area.This role combines deep ...Show moreLast updated: 26 days ago
    Analityk Danych / Inżynier Danych / BI Developer

    Analityk Danych / Inżynier Danych / BI Developer

    Bank PekaoWarszawa
    Dołącz do nas!Buduj z nami bank przyszłości oparty na danych! Szukamy specjalistów, którzy chcą współtworzyć transformację Banku Pekao S. Będziemy wspólnie rozwijać nowoczesne platformy danych i nar...Show moreLast updated: 5 days ago
    Programista Machine Learning x2

    Programista Machine Learning x2

    b2bnetworkWarszawa, Polska
    Praca przy dużej platformie administracyjnej z zakresu ochrony zdrowia, dedykowanej sektorowi publicznemu.Projekt obejmuje rozwój i wdrażanie zaawansowanych modeli uczenia maszynowego oraz analizę ...Show moreLast updated: 30+ days ago
    • Promoted
    Data Scientist

    Data Scientist

    speedappWarszawa, Masovian, Poland
    Join a dynamic team in the business travel industry and contribute to the development of intelligent, data-driven tools that manage billions in global travel spend. Design and implement statistical ...Show moreLast updated: 14 days ago
    Data Product Owner (100% remote) @ Crestt

    Data Product Owner (100% remote) @ Crestt

    CresttWarszawa, Poland
    We are looking for a person to support a project at the intersection of business analysis, data architecture, and product management – with a strong foundation in the data environment, but without ...Show moreLast updated: 13 days ago
    Data Engineer with Databricsk and PySpark

    Data Engineer with Databricsk and PySpark

    Antal Sp. z o.o.Warszawa, Polska
    Data Engineer with Databricsk and PySpark - 100% remote .Your responsibilities will include : .Leading the migration of data and applications from Microsoft Fabric to Databricks, ensuring minimal dis...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer (PySpark + Palantir Foundry)

    Data Engineer (PySpark + Palantir Foundry)

    CRESTT sp. z o.o.Warszawa, Masovian, Poland
    Hi! We are looking for experienced Data Engineers to join a strategic project within the healthcare sector for one of the world’s leading life science companies. You will be working on data migratio...Show moreLast updated: 6 days ago