Job Description
Key Responsibilities:
Design and deliver scalable real-time data and machine learning solutions by building robust ingestion and transformation frameworks across Hadoop ecosystems. Enable end-to-end ML model operationalization and performance optimization, while supporting multi-modal data processing and development of engineering tools and applications
- Design and develop highly scalable, Real time systems using Hadoop ecosystem components(Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink and Nifi)
- Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell ing for ingesting multi model data(image, audio, video, unstructured documents) with both batch and real-time.
- Develop full‑stack applications and internal engineering tools using Python, shell ing, and modern web frameworks (e.g., Flask, React).
- Collaborate closely with data scientists to operationalize machine learning models using C...
Apply for This Position
Ready to take the next step? Click the button below to submit your application.
Submit Application