Job Description
Skills
AWS Glue, AWS data services, PySpark or Scala, Data lake architecture and Redshift Spectrum, Redshift performance tuning, VPN/Direct Connect or AWS DMS communication.
Responsibilities
- Design and build scalable ETL pipelines using AWS Glue (both visual and scripted jobs) to migrate data from IDP Netezza to Amazon Redshift.
- Connect and extract data securely from on‑premise or hosted Netezza sources using JDBC connectors and Glue crawlers.
- Implement data transformations, schema mappings, and data cleansing procedures using PySpark/Scala in AWS Glue.
- Set up AWS Glue crawlers and catalogue tables to organize metadata for downstream Redshift consumption.
- Optimize data load and performance using Redshift best practices (e.g., distribution styles, sort keys, COPY commands). Automate workflows with AWS Glue Workflows, Triggers, and Step Functions. Integrate with AWS S3, IAM, CloudWatch, and KMS for secure, ob...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application