Shubham Gagrani  ·  Senior AWS Spark Data Engineer  ·  5+ yrs

Mid-Level
5+ years experienceremote
Available within 48 hrs

Proof of scale

235 registered users
10x increase in pipeline speeds
235 registered users10x increase in pipeline speeds

About Shubham

Shubham Gagrani is a Data Engineer with 5. 5+ years of hands-on experience in building and maintaining data pipelines using Python, Apache Spark, and AWS services.

5+ years of commercial experience in

Skills(26)

PythonAWSApache SparkTerraformDockerHugging FaceFlaskBootstrapMySQLreplicate.aiAWS GluePySparkAirflowRedshiftAWS LambdaQuicksightAWS S3PandasSparkApifyRDSStep FunctionsSQLSeleniumKubernetesKafka

Why hire Shubham?

Production deploy authorityLed construction of AI models

Designed and maintained diverse data pipelines handling large amounts of data.

Increased pipeline speeds by up to 10x and reduced errors by 20 percent.

Successfully set up and managed AWS services, implementing CI/CD pipelines.

Increased pipeline speeds by up to 10x.

Reduced errors by 20 percent in data extraction and storage processes.

Attracted 235 registered users to a SaaS web application.

Project highlights(5)

Fine-Tuned Stable Diffusion ModelAI Developer

Overview: Fine-tuned a Stable Diffusion model to generate artwork in the traditional Japanese Ukiyo-e style. Responsibilities: Developed an AI Model by fine-tuning a Stable Diffusion model on Japanese Ukiyo-e style images.

PythonHugging Face

Key outcomes:

  • Optimized training using 33 instance images and 1,000 regularization images.

SaaS Sound Generation ApplicationDeveloper

Overview: A web application to generate sound samples from text prompts, deployed on Render. Responsibilities: Developed and deployed a SaaS application using Flask, Bootstrap, MySQL, and replicate.ai for AI model inference.

PythonFlaskBootstrapMySQLreplicate.ai

Key outcomes:

  • Successfully attracted and managed 235 registered users.

  • Ensured reliable performance and security through secure payment processing.

E-commerce Data PipelineData Engineer

Overview: Zid is a cutting-edge e-commerce platform designed for SaaS applications. Responsibilities: Developed data pipelines using the medallion architecture for analytics. Maintained MySQL databases, automated tasks, and monitored AWS Glue jobs.

PythonMySQLAWS GlueTerraformDockerPySparkAirflowRedshiftAWS LambdaQuicksight

Key outcomes:

  • Ensured smooth operation of MySQL databases through maintenance and performance optimization.

  • Implemented automation strategies to streamline repetitive tasks.

Media Data AggregatorData Engineer / ML Engineer

Overview: Velocity Media Data Aggregator is an innovative project for extracting, integrating, and presenting public information about places in South Africa. Responsibilities: Developed and maintained data extraction scripts using Apify and stored data in AWS S3.

PythonAWS S3AWS LambdaPandasMySQLSparkPySparkApifyRDSStep Functions

Key outcomes:

  • Developed and executed comprehensive testing plans to ensure data accuracy.

  • Successfully set up and managed AWS services, implementing CI/CD pipelines.

Real Estate Ads AggregatorData Engineer

Overview: KI Immo is a real estate ads aggregator website that collects and normalizes ads from diverse sources. Responsibilities: Extracted data from 15+ diverse sources using SQL connections, Selenium, Airflow, Spark jobs, Kafka, and APIs.

PythonSQLSeleniumAirflowSparkPandasDockerKubernetesKafka

Key outcomes:

  • Increased pipeline speeds by up to 10x by transitioning to a different data extraction framework.

  • Reduced errors by 20 percent by applying a data quality check mechanism.

Industry experience

AI / ML Platform

5 projects
  • E-commerce Data PipelineData EngineerPython · MySQL · AWS Glue · Terraform +6
  • Media Data AggregatorData Engineer / ML EngineerPython · AWS S3 · AWS Lambda · Pandas +6
  • Real Estate Ads AggregatorData EngineerPython · SQL · Selenium · Airflow +5
  • SaaS Sound Generation ApplicationDeveloperPython · Flask · Bootstrap · MySQL +1
  • Fine-Tuned Stable Diffusion ModelAI DeveloperPython · Hugging Face

Real Estate

1 project
  • Real Estate Ads AggregatorData EngineerPython · SQL · Selenium · Airflow +5

Ready to work with Shubham?

Schedule an interview and onboard within 48 hours. No long hiring cycles.

At a Glance

Experience5+ years
Work moderemote
Starting from₹1.4 L/mo
Direct hirePossible
Start within48 hours
From₹1.4 L/ month

Single contract. No agency markup confusion.

Typically responds within 4 business hours.

5-day replacement guarantee
48-hour onboarding, single invoice
Direct chat — no recruiter middleman
Seniority signals
Owns production deploysGreenfield architectSystem owner
VerifiedVetted by Witarist
Technical skills assessed & verified
Background & identity checked
English communication verified
Ready to onboard in 48 hours

Not sure if this is the right fit?

Tell us your requirements and we'll match you with the best candidates.

Shubham Gagrani

Python Developer