Summary
Overview
Work History
Education
Certification
Timeline
BusinessAnalyst
Gnanasambanthan R

Gnanasambanthan R

Summary

Experienced Senior Data Engineer with over 9 years of comprehensive expertise in designing, developing, and maintaining data pipelines and infrastructure.


Proficient in a wide array of technologies and Languages including SQL, Python, Apache Spark, DMS, Kafka, Kinesis, S3, Glue, Lake Formation, Athena, Step Functions, Event Bridge, and Airflow. Skilled in data warehousing solutions like Redshift, BigQuery, and Snowflake, as well as data visualization tools such as Tableau, Power BI, and QuickSight. Experienced in working with Informatica for ETL processes, ensuring data integrity and quality throughout the pipeline.


Collaborated with cross-functional teams including data scientists, business analysts, and stakeholders to understand data requirements and delivered solutions that met business needs. Worked closely with DevOps team to implement CI/CD pipelines and followed Agile development methodologies such as Scrum and Kanban to ensure streamlined deployment processes and agile development cycles.

Overview

10
10
years of professional experience
1
1
Certificate

Work History

Senior Data Engineer

Alphasoftz Solutions Pvt Ltd
10.2020 - Current

Tech Stack : Apache Kafka, Amazon Kinesis, SFTP, AWS RDS, Ec2, Lambda, S3, Glue, IAM, Lake Formation, Apache Airflow, Step Functions, Cloudwatch, Athena, Snowflake, Redshift, Quicksight, Tableau, PySpark, Python and SQL.

  • Designed scalable data solutions on AWS, utilizing services like S3, Redshift and Glue for storage, processing, cataloging, and ETL.
  • Developed and maintained ETL pipelines with Apache Spark for large-scale data processing, extracting, transforming, and loading data into warehouses or lakes.
  • Wrote Python/PySpark code for data processing, ETL workflows, and analysis tasks, including scripting, manipulation, and integration.
  • Defined data models and schemas for efficient querying and analysis, optimizing pipelines for performance, scalability, and cost-effectiveness.
  • Ensured data quality and governance standards, implementing validation checks, error handling, and data lineage tracking.
  • Set up monitoring and alerting systems to track pipeline health and performance, resolving issues promptly.
  • Implemented security best practices and compliance controls to protect sensitive data and ensure regulatory compliance.
  • Documented pipelines, diagrams, and specs for knowledge sharing, contributing to internal repositories and providing mentorship.

Data Engineer

Alphasoftz Solutions Pvt Ltd
06.2014 - 09.2020

Tech Stack: Informatica PowerCenter, SQL, Oracle, Linux, AWS S3, Glue, Athena

  • Utilized Informatica PowerCenter alongside Hadoop, Oracle, and Linux environments for designing, developing, and maintaining comprehensive data integration solutions.
  • Designed intricate mappings with transformations like lookup, update strategy, aggregator, router, stored procedure, expression, and joiner, proficiently managing slowly changing dimensions (SCD) for comprehensive history maintenance.
  • Implemented robust data quality rules and processes within Informatica, ensuring data accuracy, completeness, and consistency across diverse source systems and data warehouses.
  • Managed metadata effectively within Informatica repositories to document data lineage, transformations, and dependencies, facilitating enhanced understanding and maintenance of data integration processes.
  • Collaborated closely with cross-functional teams to deliver agile data integration solutions aligned with evolving business requirements, leveraging AWS services such as Amazon S3, AWS Glue, and Athena for cloud-based ETL solutions.

Education

Master of Computer Applications -

Bharath University
Chennai | India
05.2013

Certification

  • Professional Certificate in Data Engineering, IBM
  • ETL and Data Pipelines with Shell, Airflow and Kafka, IBM
  • Designing Data Lakes on AWS, Coursera
  • AWS Data Analysis and Visualization, Whizlabs
  • Data Analysis using PySpark, Great Learning
  • SQL Server Integration Services - SISS , LinkedIn
  • Mastering Informatica PowerCenter 9, Udemy
  • Agile Software Development: Scrum for Developers, LinkedIn

Timeline

Senior Data Engineer

Alphasoftz Solutions Pvt Ltd
10.2020 - Current

Data Engineer

Alphasoftz Solutions Pvt Ltd
06.2014 - 09.2020

Master of Computer Applications -

Bharath University
Gnanasambanthan R