Deepali is a Data Engineer with 5.5+ years of proven experience optimizing data pipelines and managing cloud services.
Proven ownership in migrating and optimizing data pipelines to AWS Step Functions.
Demonstrated expertise in Redshift cluster management, including encryption and WLM settings.
Recognized with multiple awards for contributions and customer obsession.
Managed encryption across five critical Redshift clusters, minimizing business disruptions.
Migrated all pipelines to AWS Step Functions without errors, ensuring uninterrupted functionality.
Spearheaded the migration of S3 files to Glacier for enhanced cost efficiency and S3 management.
Overview: Currently focused on creating and optimizing data pipelines for efficient data processing.
Responsibilities:
Created and optimized data pipelines, contributing to efficient data processing.
Developed and tuned SQL queries to enhance performance in data processing.
Managed and maintained Redshift databases to ensure optimal performance and reliability.
Key outcomes:
Created and optimized data pipelines for efficient data processing.
Developed and tuned SQL queries for performance enhancement.
Managed and maintained Redshift databases to ensure optimal performance.
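SQL tuning of the kind described above usually pairs an index with an inspection of the query plan. A minimal sketch using SQLite as a stand-in for the warehouse (the table and column names are hypothetical, not from the actual Redshift workload):

```python
import sqlite3

# In-memory database standing in for a warehouse table (hypothetical schema).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(i, "EU" if i % 2 else "US", i * 1.5) for i in range(1000)],
)

# Before indexing: the planner must scan the whole table.
plan_before = conn.execute(
    "EXPLAIN QUERY PLAN SELECT SUM(amount) FROM orders WHERE region = 'EU'"
).fetchall()

# Add an index on the filter column, the core of most query-tuning passes.
conn.execute("CREATE INDEX idx_orders_region ON orders (region)")

# After indexing: the planner searches the index instead of scanning.
plan_after = conn.execute(
    "EXPLAIN QUERY PLAN SELECT SUM(amount) FROM orders WHERE region = 'EU'"
).fetchall()
```

On Redshift the equivalent inspection goes through `EXPLAIN` and sort/dist keys rather than B-tree indexes; the sketch only shows the tune-then-verify loop.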
Overview: Managed encryption across five critical Redshift clusters and orchestrated synchronization for production clusters.
Responsibilities:
Managed encryption across five critical Redshift clusters, collaborating to minimize business disruptions.
Orchestrated synchronization for two production clusters, ensuring seamless encryption and a smooth transition of WBR jobs.
Proposed and implemented the migration of all pipelines to Step Functions, ensuring uninterrupted functionality.
Key outcomes:
Managed encryption across five critical Redshift clusters, minimizing business disruptions.
Migrated all pipelines to Step Functions without errors, ensuring uninterrupted functionality.
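A pipeline migrated to Step Functions is expressed in Amazon States Language. A minimal sketch of a two-state definition with a retry policy (the state names, Lambda ARNs, and account ID below are placeholders, not from the actual migration):

```python
import json

# Minimal Amazon States Language definition (hypothetical states and ARNs):
# a transform task with retries, then a load task, mirroring a simple
# extract-transform-load pipeline moved onto Step Functions.
state_machine = {
    "Comment": "Hypothetical two-step pipeline",
    "StartAt": "Transform",
    "States": {
        "Transform": {
            "Type": "Task",
            "Resource": "arn:aws:lambda:us-east-1:123456789012:function:transform",
            "Retry": [{
                "ErrorEquals": ["States.TaskFailed"],
                "IntervalSeconds": 10,
                "MaxAttempts": 3,
                "BackoffRate": 2.0,
            }],
            "Next": "Load",
        },
        "Load": {
            "Type": "Task",
            "Resource": "arn:aws:lambda:us-east-1:123456789012:function:load",
            "End": True,
        },
    },
}

# Serialized definition, as it would be registered with the service.
definition = json.dumps(state_machine)
```

In practice the serialized definition is registered via the `boto3` Step Functions client's `create_state_machine` call; built-in retry policies like the one above are a common reason for moving pipelines off cron-style scheduling.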
Overview: Responsible for data extraction, transformation, and production job management within a Big Data environment.
Responsibilities:
Extracted and imported data from Salesforce to the DataLake using the Teradata Hadoop Connector, ensuring seamless integration.
Transformed data according to CDC logic, maintaining historical data using DataFrames in PySpark.
Automated the Salesforce Migration component using UNIX shell scripting for efficiency.
Key outcomes:
Automated Salesforce Migration component using UNIX shell scripting for efficiency.
Optimized HiveQL queries for ORC transactional tables, enhancing query performance and efficiency.
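The CDC logic described above closes out the current version of a changed record and appends the incoming version as the new open record. A plain-Python sketch of that merge pattern (the real job used PySpark DataFrames; the field names here are hypothetical):

```python
from datetime import date

def cdc_merge(history, changes, key="id", today=date(2024, 1, 1)):
    """Sketch of an SCD-style CDC merge: close changed records, append new versions.

    Hypothetical illustration in plain Python; the production job applied
    the same logic with PySpark DataFrames.
    """
    changed_keys = {row[key] for row in changes}
    merged = []
    for row in history:
        if row[key] in changed_keys and row["end_date"] is None:
            # Close the currently open version of a changed record.
            row = {**row, "end_date": today}
        merged.append(row)
    for row in changes:
        # Append the incoming version as the new open record.
        merged.append({**row, "end_date": None})
    return merged

history = [{"id": 1, "name": "Acme", "end_date": None}]
changes = [{"id": 1, "name": "Acme Corp"}]
result = cdc_merge(history, changes)
```

Keeping the closed rows rather than overwriting them is what preserves the historical view of each record.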
Overview: Implemented data ingestion pipelines from various sources into a DataLake using CDC logic.
Responsibilities:
Implemented data ingestion pipelines to extract data from various sources (SFDC, SAP, VISTAAR, EMPOWER, BRAZIL) and loaded it into the A3 DataLake using CDC logic.
Utilized Sqoop for importing and exporting data between relational database systems (SQL Server, Oracle) and HDFS.
Key outcomes:
Implemented data ingestion pipelines from various sources using CDC logic.
Optimized Spark code performance by implementing best practices.