Summary
Overview
Work History
Education
Skills
Work Preference
Websites
Timeline
Hi, I’m

Sourav Saha

Senior Data Platform & Quality Engineer
Sourav Saha

Summary

I build & own trust-worthy & performant systems, brings engineering excellence in fast-paced environment with decade of technical & strategic experiences of product development lifecycle & full stack quality assurance. I collaborate & drive cross functional teams, keep expectation in-line & make things visible to stakeholder. I have worked in Insurance, Banking, Real Estate sector. I love running open-source initiatives & contribute to open-source projects. And Super passionate about Algo & Quants trading.

Overview

10
years of professional experience

Work History

PropertyGuru Group
Singapore & India

Senior Data Quality Engineer
03.2021 - Current

Job overview

  • Manifest expertise spread across Data engineering, QA engineering, Product Development & Internal / External stakeholder handling including clients & vendors
  • Own Delivery & Quality aspect of Transaction Dataset for Singapore & Malaysia including the Airflow DAGs
  • Improved & Enriched Malaysian Address Quality using LLM (OpenAI GPT 4o mini)
  • Governed the QA Process & Frameworks within Data CoE
  • Accountable for Data APIs & Data pipelines deployment in production
  • Accountable for ensuring P0 Alerts for Production on Datadog
  • Created the reusable CI pipeline with GitHub Actions for PR validation & headless production deployment of internal tools
  • Architected & developed self-served Performance Evaluation platform to auto generated load config from non-functional requirements
  • This is developed on top of Taurus with JMeter and heavily used by both Dev & QA, resulting above 90% effort saving in comparison with JMeter only for performance test design & execution
  • Architected & Developed Data Quality Platform for Data Testing, Data Contract generation & validation
  • Runs within Database eliminating data export overhead & test execution cost
  • Developed Interactive UI to visualise reports
  • Adopted by Data Science & Data Engineering teams and implemented for all data projects
  • Reduced regular data issue by above 80% measured by issue reported
  • Architected & Developed API Evaluation Platform followed by Interactive UI to compare & evaluate the quality and consistency of responses from Data APIs & Machine Leaning APIs over time, achieving reliability on model & servicing in production environment
  • Own & Manage API Gateway for External client facing enterprise API suites
  • Improved TYK Gateway management by developing Interactive GUI for API and Token management & technical troubleshooting; achieved savings of 50K USD per year by eliminating paid-service
  • Used by Product Manager & Engineers across teams
  • Developed to pipeline for log transfer & client billing using FiveTran
  • Drive Scrum operations as Scrum Master with effective sprint goal setting & traceability with team OKR
  • And co-own the project delivery alongside with Program Manager & Product Managers
  • Promote Engineering Excellence, Educate teams about internal tools, best practices & better documentation
  • Part of team to develop config-based tool to generate ELT & Reverse ELT pipelines
  • Tools & Technology: Python, Streamlit, Plotly, Postman, JMeter, Taurus, BigQuery SQL, Elasticsearch FastAPI, Airflow, Docker, AWS, GCP, DBT, FiveTran, GitHub Actions, TYK, Nginx, Datadog, Sentry, Celery, PySpark, Looker, Tableau

Cognizant Technologies Solutions
OCBC Bank, Singapore

Automation Engineer
08.2020 - 03.2021

Job overview

  • Flaky Test identification & Test Failure Analysis with K-Means Clustering, resulting 2 hours per day human effort savings
  • Researched & Developed automation test framework for thick client application developed on Unity platform using Java & Alt-Unity resulting transition of manual to automation for the said application
  • Automated tests for Mobile & Web apps with Java, Selenium & Appium and managed daily execution of Regression Suite (200+ tests)
  • Enhanced the existing framework to report the test execution status and upload test artifacts directly to JIRA from test run, resulting 1-2 hours of effort savings everyday by the team of 6 pax
  • Tools & Technology: UiPath, Alt-Unity, UFT, BDD-Cucumber, Java, Appium, Selenium, Jenkins, Python, Rest Assured

Cognizant Technologies Solutions
Great Eastern Life, Singapore

Automation Engineer
02.2020 - 08.2020

Job overview

  • This was a Digital Transformation journey of a business team, where we identified the manual workflow insurance policy life cycle and developed Assistive Automation Suite with UFT for new business, customer service, claims transaction processing which reduces 80% manual effort measured by time spent
  • Tools & Technology: UFT, Excel, VBA

Cognizant Technologies Solutions
, India

Automation Lead | Product Specialist
05.2017 - 01.2020

Job overview

  • This role was an opportunity to consult many projects spread across different countries outside the full time assignment
  • My contribution leads approx
  • 250K (hard + soft) dollar savings over a period of 3 years
  • Developed scalable web scraping framework to scrape policy data from multiple application and build query engine accessible interactive UI and natural language; which reduces test data management effort by 90% and reduces execution cycle 40%
  • Developed solution for PDF Statement extraction and validation using Python & PDFBox
  • Enabled via REST API & web portal
  • Accelerated the testing process above 90% measured by end to end time spend
  • Developed automation framework for regression and in-sprint functional automation to be used by team of 40+ testers
  • Horizontally scalable system supporting test execution on multiple application with inter-dependency & synchronization along with dashboard for monitoring and test control
  • Tools & Technology: Java, Selenium, jSoup, Python, Flask, HTML, JS, PDFBox, SQL, MongoDB, Jenkins, Excel + VBA

Cognizant Technologies Solutions
, India

Test Engineer
08.2014 - 04.2017

Job overview

  • Developed Mainframe Data Scraping Tool followed by validation tool for a Policy Remediation Project
  • Resulting 80% effort saving against manual time
  • Client saved 3.5k USD (hard dollar) per weekly run, totalling 120K USD savings in over a period of 6 months
  • Maintained Automation test suite UFT (formerly QTP)
  • Responsible for automation test execution along with functional testing on web apps ( wmA, csA, nbA) & Mainframe app (Vantage)
  • Tools & Technology: UFT, Aviva VBA for Mainframe, Excel, VBA
  • Personal Projects & Open-Source Contributions
  • Developing Uptime Monitor (Personal Project | Open Source Code)
  • This is open source uptime monitor for APIs, Websites & Infrastructure which supports misc
  • Web protocol
  • Reporting & Alerting System under development
  • Development Tech Stack: Streamlit based UI, FastAPI Microservice, TimescaleDB
  • Infra & Deployment Stack: GitHub Actions, Oracle Cloud, Cloudflare, Google Domain, NewRelic, GCP OAuth
  • Website: https://watchtower.finanssure.com
  • Repo: https://github.com/sahasourav123/watchtower | CICD: https://github.com/sahasourav123/watchtower/actions
  • Developing Realtime Stock Screening Platform for Algo Trading (Proprietary Code | Public Access)
  • This is a Realtime Stock Screening & Alerting Platform that consumes & enrich numerous stock market data from different sources followed by that compute hundreds of trading indicator over 1000+ stocks in near real-time and send out multi-channel alerts
  • Additionally aggregate stock news by Web Scarping followed by news segregation and news sentiment analysis
  • Development Tech Stack: Streamlit based UI, Four (4) FastAPI Microservices, Postgres & TimescaleDB, Kafka + PySpark (stream processing), ETL using Dagster (Python) + DuckDB, Huggingface Models
  • Infra & Deployment Stack: GitHub Actions, Oracle Cloud, Cloudflare, Google Domain, NewRelic, GCP OAuth
  • Website: https://screener.finanssure.com
  • Developing Algo Trading Strategy Management Platform (Proprietary Code | Controlled Access)
  • This is a Trading Strategy Orchestration System which can handles hundreds of strategies and generated & executed trade decision in real-time using any Broker System of choice
  • This system uses Above Screening Platform for Data & Trade Signal
  • Development Tech Stack: Streamlit based UI, Three (3) Microservices (FastAPI), Postgres
  • Infra & Deployment Stack: GitHub Actions, Oracle Cloud, Cloudflare, Google Domain, NewRelic
  • Website: https://entropy.finanssure.com
  • Others Open Source Contributions
  • Improved and published fork of Kotak API Trading Python Client Library (Open Source) | [Commit History]
  • Selenium based Assistive Automation Toolset (Personal | Open Source | Not maintained) | [Repo Link]
  • Provided Window OS specific fix on Python PDFBox library (Community | Open Source) | [Commit History]

Education

Institute of Engineering & Management
Kolkata, India

Bachelor of Technology from Electronics & Communication Engineering
08.2010 - 07.2014

University Overview

Skills

Microservices Architecture

Data Engineering

Machine Learning

Python Development

Continuous Integration and Deployment

Quality Engineering Tools

Automation

Data Pipeline

Work Preference

Work Type

Full Time

Work Location

On-SiteHybrid

Important To Me

Work-life balanceCompany CultureFlexible work hours

Timeline

Senior Data Quality Engineer
PropertyGuru Group
03.2021 - Current
Automation Engineer
Cognizant Technologies Solutions
08.2020 - 03.2021
Automation Engineer
Cognizant Technologies Solutions
02.2020 - 08.2020
Automation Lead | Product Specialist
Cognizant Technologies Solutions
05.2017 - 01.2020
Test Engineer
Cognizant Technologies Solutions
08.2014 - 04.2017
Institute of Engineering & Management
Bachelor of Technology from Electronics & Communication Engineering
08.2010 - 07.2014
Sourav SahaSenior Data Platform & Quality Engineer