Senior Data Engineer (with Snowflake) @ Transition Technologies PSC Poland
- At least 5-7 years of experience in data engineering supporting AI / ML applications
- B.Sc., B.Eng. or higher degree, or equivalent, in Computer Science, Data Engineering or a related field
- Experience with Snowflake, Python, SQL and the native query languages of vector databases
- Experience with relational databases
- Hands‑on experience with AWS (OpenSearch, S3, Lambda) or Azure (AI Search, Blob Storage, Automation)
- Experience building scalable ETL / ELT workflows using dbt, Apache Airflow or similar
- Ability to design and integrate RESTful APIs for data exchange
- Understanding of encryption and role‑based access controls for data security & governance
- Familiarity with Git, CI / CD, containerization (Docker, Kubernetes) and IaC (Terraform, CloudFormation)
- Experience working with AI‑specific data needs such as embeddings, RAG and LLM fine‑tuning data preparation
Nice to have
- NoSQL and vector databases
- IoT data streaming with Kafka, Kinesis, PySpark, etc.
Responsibilities
- Design, build, and maintain scalable data pipelines (ETL / ELT) leveraging Snowflake and Airflow
- Implement optimized schemas, partitioning, and indexing strategies in Snowflake and relational databases
- Develop data processing workflows and automation scripts in Python and SQL; integrate with APIs and microservices
- Ensure scalability, performance, and resilience of pipelines; implement observability for jobs and data flows
- Partner with data scientists and ML engineers to deliver high‑quality datasets optimized for AI / ML workloads
- Prepare, transform, and manage datasets for embeddings, RAG workflows, and LLM fine‑tuning
Requirements
- Snowflake
- Python
- AWS
- ETL / ELT
- CI / CD
- Docker
- Kubernetes
- SQL
- NoSQL
- Vector Databases
- Kafka, Kinesis, PySpark
Benefits: Sport subscription, Training budget, Private healthcare, Flat structure, International projects, Free coffee, Playroom, Bike parking, Free snacks, Free beverages, In‑house trainings, Modern office, No dress code.