4Bell Technology

Staffing & Recruiting

Data Engineer - PySpark (R-1659)

100,000.00-2,500,000.00/A

B.Tech/B.E

IT (Information Technology)

Full-time

Remote

19-Jul-2026

PySpark, RDD, Python, SQL, GCP, Databricks or EMR

Job Description

Required Skills & Qualifications

• 4–8 years of experience in data engineering or big data development

• Strong expertise in PySpark and core Spark concepts (RDDs, DataFrames, DAG, lazy evaluation, shuffling)

• Strong understanding of converting RDD-based implementations to DataFrame/Dataset APIs

• Deep understanding of Spark performance tuning techniques

• Solid programming skills in Python

• Experience working with distributed data processing systems

• Strong knowledge of SQL and data modeling concepts

• Familiarity with data storage formats such as Parquet, ORC, and Avro

Preferred Qualifications

• Experience with cloud platforms (preferably GCP)

• Hands-on experience with Spark on platforms like Databricks or EMR

• Exposure to workflow orchestration tools such as Airflow

• Knowledge of CI/CD pipelines and version control (Git)

• Understanding of streaming frameworks (Spark Streaming / Structured Streaming)