Devesh is a Data Engineer with 3+ years of experience in building and maintaining complex data pipelines. He specializes in Python, SQL, and AWS technologies, focusing on data processing and analytics.
Designed and maintained complex data pipelines for large-scale data processing
Automated data workflows for banking operations, enhancing efficiency
Implemented real-time event data processing platforms for user behavior tracking
Developed a full-featured E-Commerce website with secure payment integration
Successfully designed and maintained complex data pipelines for large-scale data processing using Spark
Engineered Spark ETL scripts to clean, transform data, and calculate scores for financial analytics, supporting data-driven decision-making
Automated data workflows for banking operations and integrated diverse data sources into centralized data lakes
Overview: This project focused on enhancing Loan Origination System (LOS) and Business Rules Engine (BRE) processes for financial products. Responsibilities: Designed and implemented Spark ETL scripts to clean, transform data, and calculate scores based on various financial parameters. Ingested data from diverse sources into a centralized Data Lake using AWS S3, EMR, Glue, and Athena.
Key outcomes:
Enhanced LOS and BRE processes through advanced data analytics
Supported data-driven decision-making for various Lines of Business
Overview: This project focused on building an open-source data collection platform that generates complete, accurate, and well-structured event data across all platforms and channels. Responsibilities: Collected raw Snowplow events data via Stream Collector, sent to Apache Kafka Sink, and transformed raw data into TSV format using Stream Enricher.
Key outcomes:
Implemented a robust open-source data collection platform
Enabled continuous data ingestion and transformation using Spark and Kafka
Overview: Developed a full-featured E-Commerce website allowing customers to buy electronic gadgets, physical goods, and clothes. Responsibilities: Developed the website using the Django Framework for the backend, implemented features for customers to manage products and process online payments.
Key outcomes:
Developed a complete E-Commerce platform with core shopping functionalities
Implemented secure payment gateway integration
User Behavior Tracking — gathering + analyzing website event data via Snowplow Analytics with HDFS storage. PySpark + Spark-Streaming + Snowplow + Apache Kafka + Hadoop.
Key outcomes:
Enabled comprehensive user behavior tracking on a website.
Implemented real-time data processing for event data.
Ensured structured storage of partitioned event data for efficient querying.
Utilized Snowplow Analytics for accurate and complete data collection.
E-Commerce Website — full-featured site with cart + wish lists + secure online payment. Django + HTML + CSS + JavaScript + PostgreSQL.
Key outcomes:
Developed a complete E-Commerce platform with core shopping functionalities.
Implemented secure payment gateway integration.
Created a custom admin panel for efficient management of website operations.
Enabled sales and customer reporting for business insights.
Devesh
AWS Data Engineer