Sachin is a Data Engineer with 8+ years of expertise in designing and implementing cutting-edge data pipelines that drive actionable insights. He has a strong analytical mindset and a proven track record in leveraging cloud technologies for data management.
Designed and implemented cutting-edge data pipelines driving actionable insights for clients.
Spearheaded projects that increased data processing efficiency and ensured data accuracy and security.
Proficiently utilized AWS Glue and Google Dataflow for robust ETL pipeline creation and management.
Leveraged Apache Airflow to automate and schedule complex data workflows across diverse systems.
Successfully integrated Big Data technologies like Spark and Hadoop for large-scale data processing and analysis.
Built Project-AI — AWS Glue data pipeline + schema/jobs/trigger config + Datacatalog as Data Engineer
Built Health channel — GCP services automation + data workflows + accessibility + decision-making as Data Engineer
Built MCE — multi-source data integration with Apache Airflow + Snowflake + PySpark + SFTP + AWS S3 as Data Engineer
Built Merchant Leanscale — web scraping Amazon + Ali Express + Walmart + Asos.com + MongoDB storage as Data Engineer
Overview: Involved in data scraping from websites like Amazon, Ali Express, Walmart, and Asos.com. Responsibilities: Created scripts for scraping data from major e-commerce websites and storing the data inside MongoDB. Used Python for XML, JSON processing, data exchange, and business logic implementation. Developed data-driven applications using Python for order automation.
Key outcomes:
Created scripts for scraping data from major e-commerce websites.
Overview: Implemented a robust data management and automation solution using Google Cloud Platform (GCP) services. Responsibilities: Developed, deployed, and maintained data pipelines using Google Dataflow for ETL. Implemented efficient data ingestion and integration processes from various sources into Google BigQuery.
Key outcomes:
Streamlined data workflows and improved data accessibility and decision-making.
Overview: Streamlined data operations, enhanced data reliability, and empowered data-driven decision-making. Responsibilities: Designed, developed, and maintained data pipelines using Apache Airflow to automate and schedule data workflows.
Key outcomes:
Optimized SQL queries and data transfer processes within Snowflake.
Merchant (Leanscale) — Amazon + Ali Express + Walmart + Asos.com scraping + MongoDB.
Key outcomes:
Created scripts for scraping data from major e-commerce websites (Amazon, Ali Express, Walmart, Asos.com)
Developed data-driven applications using Python for data scraping and order automation
Deployed scripts on AWS server with MongoDB database
Sachin
Data Engineer