Santosh is a Lead Data Engineer with 9+ years of experience in data engineering, specializing in Azure and Python. He has a proven track record in optimizing data pipelines and deploying predictive models.
Optimized Databricks ETL flows, saving $2k/month.
Migrated legacy pipelines to Azure Cloud and Databricks, achieving 70% time savings and 80% cost savings.
Developed a standalone Python library for Data Quality Management that was widely adopted within the organization.
Designed and developed a proactive alerting automation system for data quality issues.
Awarded as Loss data defender and HERO OF THE MOMENT for SALPL propensity model development.
Optimized existing Databricks ETL flows, saving $2k/month in costs.
Migrated a legacy pipeline to Azure Cloud and Databricks, resulting in 70% time savings and 80% cost savings.
Developed a widely adopted standalone Python library for Data Quality Management within the organization.
Overview: Led the design and development of a Data Quality Framework (DQM) for comprehensive dataset reporting. Responsibilities: Designed and developed DQM for reporting datasets, developed proactive alerting automation for data quality issues, and optimized existing Databricks ETL flows, resulting in a $2k/month cost saving.
Key outcomes:
Optimized existing Databricks ETL flows, saving $2k/month.
Migrated legacy pipelines to Azure Cloud and Databricks, saving 70% time and 80% cost.
Overview: Focused on migrating existing data processing scripts and developing solutions for data quality and predictive modeling. Responsibilities: Migrated all SQL scripts to PySpark on Databricks, responsible for analytical model deployment, and created datasets for predictive model development.
Key outcomes:
Successfully migrated all SQL scripts to PySpark on Databricks.
Developed an NLP model for information extraction.
Overview: Involved in migration projects for reporting and integration services to Azure Cloud and Tableau. Responsibilities: Participated in a migration project for SSRS reports and SSIS packages to Azure cloud, developed and deployed reports in MS SQL Server environment using SSRS.
Key outcomes:
Successfully migrated SSRS reports to Tableau dashboards.
ETL Developer — full lifecycle of SSIS packages + SSRS report deployment + Azure Cloud migration.
Key outcomes:
Developed and deployed SSIS packages for ETL workflows, including complex transformations.
Designed and deployed SSRS reports with automated scheduling.
Created SQL UDFs and batch programs for efficient data handling.
Junior Software Engineer — database automation + data security + bulk data loading + reporting foundations with Stored Procedures + Triggers + SSIS + SSRS.
Key outcomes:
Implemented automation and data security using SQL Stored Procedures, Triggers, and Views.
Developed and deployed batch programs for bulk data loading.
Created and maintained SSIS packages and SSRS reports for various client needs.
Santosh
Azure Data Engineer