Associate Principal - Data Engineering
Primary skills are Python PySpark Databricks SQL Git GitHub Bitbucket CICD Azure DevOps and
Secondary Skills are Azure Data Factory ADF Power BI
Location India MultiNational FMCG
DomainPortfolio Digital Customer Experience Applications Experience Engineering Mobile Digital Platforms
Employment Type Fulltime
Level Developer Senior Developer
Cloud Landscape Hyperscalerfirst strong enterprise investment in cloud platforms
Role Summary
We are looking for a Developer Senior Developer Data Engineer to build and operate scalable data pipelines and curated datasets that enable Digital and Customer Experience Applications across a global FMCG enterprise You will work handson with Python PySpark Databricks and SQL contribute to modern engineering practices using Gitbased workflows and CICD and support endtoend delivery through Azure DevOps
Key Responsibilities
Design develop and optimize ETLELT pipelines using Databricks PySpark to ingest transform and serve data for digitalcustomer experience use cases
Build and maintain highquality performant SQL transformations and dimensionalcurated data layers for analytics and downstream consumption
Implement best practices for Databricks engineering notebooks vs repos modular code clusterjob configuration performance tuning and cost awareness
Use Python for data processing utilities validations orchestration helpers and reusable libraries
Manage source control using Git with enterprise workflows across GitHub Bitbucket enforce PR reviews branching strategies and coding standards
Build and maintain CICD pipelines using Azure DevOps automating testing packaging and deployment of data codeartifacts across environments
Ensure data quality and reliability through validations reconciliation checks monitoring and operational runbooks
Collaborate with productapp teams analysts and platform teams to translate requirements into scalable data products aligned to customer experience journeys
For Senior Developer lead modulelevel design mentor engineers drive standards for CICD and code quality and own delivery for critical pipelines
Primary Skills MustHave
Strong handson experience with
o Python
o PySpark
o Databricks jobs workflows clusters repos
o SQL advanced querying optimization
Strong experience with Git and repository platforms GitHub andor Bitbucket
Handson experience implementing CICD for data engineering workloads
Strong working experience with Azure DevOps Boards Repos Pipelines buildrelease workflows
Secondary Skills GoodtoHave
Azure Data Factory ADF for orchestration ingestion and scheduling patterns
Power BI experience for consumptionlayer understanding datasets measures refresh patterns supporting analytics enablement
Job Segment:
Database, Engineer, SQL, Technology, Engineering