← Back to opportunities
About the Role
:Design, develop, deploy, and operate scalable ETL and data pipelines using PySpark, Python, advanced SQL, and AWS data services Own data pipeline lifecycle from requirements, mapping, development, testing, deployment, monitoring, production support, release management, and future roadmap planning Build ingestion and transformation pipelines for flat files, relational databases, APIs, data warehouses, healthcare data sources, and enterprise data platforms Implement mapping automation, preferably using AI, along with LLM-assisted data cleaning, transformation, data quality checks, and RAG use cases Implement secure handling of PHI/PII data including encryption, access controls, auditability, retention, masking, de-identification, governance, and operational readiness Advanced expertise in PySpark, Python, advanced SQL, ETL best practices, data modeling, and large-scale data processing ...
Knowledge, Skills, and Abilities:
Ready to Join Through a Referral?
Apply now and get connected directly with the hiring team
Apply for this Position