Israel
Tel Aviv-Yafo
Full-time
The Role
We’re looking for a Data Engineer to build and scale the data infrastructure that powers our machine learning research. You’ll own pipelines and databases that make large, complex datasets usable for training foundation models.
What You'll Do
Build and maintain scalable data pipelines for genomic and biological data
Design and manage the company’s core database, including defining and evolving the ERD
Develop and orchestrate ETL workflows (Airflow, Prefect, etc.) for ingestion, preprocessing, and validation
Optimize storage, retrieval, and distributed data processing
Work closely with ML engineers and researchers
What We’re Looking For
4+ years in data engineering or related fields
Proficiency in Python and SQL
Hands-on experience with cloud platforms (AWS, GCP)
Solid software engineering practices
Strong experience with distributed data processing (Spark, Ray, Beam, etc.)
Bonus: experience with MLOps workflows or bioinformatics/genomics