4Bell Technology

Staffing & Recruiting

Data Engineer(R-1544)

100,000.00-1,500,000.00/A

Any Degree

IT (Information Technology)

Contract

Remote

16-Jun-2026

SQL Spark Python Presto Deltalake Apache big data tool suites Docker Kubernetes Cloud ETL

Job Description

·  Experience in building and operating production big data platforms and pipelines

·  Strong experience with SQL, Spark, workflow orchestrators, distributed message bus, Python, Presto, Deltalake, apache big data tool suites, Docker, Kubernetes, MPP

·  Hands on with the design and implementation of cloud-based data solutions using platforms like Azure, AWS, or GCP, optimizing for scalability, cost-efficiency, and performance.

·  Implement and maintain data lakes and warehouses, lakehouses including data modeling, ETL processes, and data quality assurance to empower data-driven decision-making.

·  Develop real-time data pipelines using streaming technologies like Apache Kafka or AWS Event hub, enabling timely insights and actions from incoming data streams.

·  Manage and enhance distributed data systems (e.g., Hadoop, Spark) to efficiently process large-scale datasets, ensuring data availability and reliability.

·  Previous experience of working on health data and Azure cloud is a strong plus

·  Experience with Databricks or MS Fabric

·  Strong track record of designing and implementing scalable data models, schemas, ETL logic

·  Experience with data governance, master data management, data pseudonimization and anonymization, and data catalog solutions .

·  A strong interest in learning new things and team player ethics.  

·  Strong analytical skills and good understanding of data structures and algorithms.

·  Some exposure to Nextflo and or Nextflow Tower