Job Description
What you will do
- Design, develop, and maintain ETL pipelines to extract, transform, and load data across various data sources (cloud storage, databases, APIs);
- Use Apache Airflow for orchestrating workflows, scheduling tasks, and managing pipeline dependencies;
- Build and manage data pipelines on Azure and GCP clouds;
- Design and support the Data Lake;
- Write Python scripts for data cleansing, transformation, and enrichment using libraries such as Pandas and PySpark;
- Analyze logs and metrics from Airflow and cloud services to resolve pipeline failures or inefficiencies.
- Experience (2+ years) writing efficient and scalable Python code, especially for data manipulation and ETL tasks (using libraries like Pandas, PySpark, Dask, etc.);
- Knowledge of Apache Airflow for orchestrating ETL workflows, managing task dependencies, sche...