Job Description

We are seeking a highly skilled Data Engineer with strong expertise in PySpark  and the Cloudera Data Platform (CDP) . The ideal candidate will design, develop, and maintain scalable data pipelines while ensuring high data quality, performance, and availability across the organisation. 
This role requires hands-on experience in big data ecosystems, cloud-native technologies, and advanced data processing frameworks. You will collaborate with cross-functional teams to build reliable and high-performance data solutions that drive business insights. 
Key Responsibilities  1. Data Pipeline Development  Design, develop, and maintain scalable ETL/ELT pipelines using PySpark on CDP 
Ensure data integrity, reliability, and performance optimisation 
2. Data Ingestion  Develop ingestion frameworks to collect data from relational databases, APIs, streaming sources, and file syst...
            

Apply for This Position

Ready to take the next step? Click the button below to submit your application.

Submit Application

Data Engineer (PySpark)

Job Description

Key Responsibilities

1. Data Pipeline Development

2. Data Ingestion

Apply for This Position