Job Description
We are looking for a seasoned Senior Big Data Developer with strong Python and PySpark expertise to design, develop, and optimize large-scale data processing solutions in a Hadoop/Cloudera ecosystem. The ideal candidate brings deep hands-on experience across the full Big Data stack, strong data analysis and wrangling capabilities for high-volume datasets, and the ability to thrive in fast-paced, globally distributed team environments.
Develop and maintain data pipelines using Python, PySpark, Spark, and Kafka for high-volume batch and streaming workloads
Perform data analysis and data wrangling on large-scale datasets, ensuring accuracy, completeness, and quality
Design and optimize queries across Hive, Impala, and SQL for performance and scalability
Troubleshoot and resolve performance issues within the Cloudera Big Data ecosystem
Schedule and monitor jobs using Autosys; ...