Experienced Azure and Databricks Data Architect with 9 years in developing large-scale data platforms and streaming pipelines. Delivered significant cost savings of β¬1.3M annually through optimized storage and governance strategies. Expertise in PySpark, ADF, and Power BI, alongside strong capabilities in GenAI and LLM integration. Known for providing efficient and scalable data solutions that meet global market demands.
Delivered AI-powered metadata automation using LLMs & Agentic AI.
Migrated 20+ workspaces from Hive Metastore Unity Catalog for improved governance. Administered 200+ Power BI workspaces, 1,000+ dashboards, and migrated to Fabric.
Data Architect
Accenture
02.2022 - 01.2023
Developed internal process optimizations to streamline automation of repetitive tasks, expediting data management
Engineered data models to meet intricate analysis demands.
Oversaw migration to Azure on-premises, implementing ADLS, Data Factory, Databricks, SQL DW, and Logic Apps. Developed PySpark pipelines for Teradata and Oracle while ensuring monitoring through Logic Apps.
Converted various legacy systems to updated technologies, lowering expenses and improving efficiency of computing tasks.
Constructed and communicated innovative business information solutions.
Oversaw offshore support team, ensuring adherence to SLA delivery and comprehensive stakeholder reporting.
Data Engineer
Cognizant
12.2020 - 02.2022
Enhanced data processing through implementation of efficient ETL pipelines and optimization of database design.
Migrated claims data from DB2 and NAS to Azure Data Lake Storage plus Delta Lake. Constructed PySpark pipelines, incorporating data validation and data quality checks. Implemented CI/CD processes using GitHub for production pipeline automation.
Partnered on ETL (Extract, Transform, Load) tasks, ensuring data integrity and confirming pipeline stability.
Elevated data quality through comprehensive cleaning, validation, and transformation processes.
Streamlined routine tasks through automation with Python scripts, enhancing team productivity and minimizing manual errors.
Senior Data Analyst
Capgemini
02.2017 - 10.2020
Reduced manual data entry errors by designing and deploying automated ETL processes to transform raw data into usable formats.
Contributed to overall organizational growth through continuous improvement of data analytics capabilities and processes.
Analyzed large amounts of data to identify trends and find patterns, signals and hidden stories within data.
Assessed large datasets, drew valid inferences and prepared insights in narrative or visual forms.
Created and automated data visualizations to present insights and tell compelling stories.