Nitesh is a Big Data Developer with 6+ years of experience in designing, implementing, and supporting big data applications. He has extensive expertise in Apache Spark, AWS, and Python, contributing to various projects across financial services and manufacturing domains.
Designed and implemented diverse big data applications using Apache Spark, Hadoop, and AWS.
Achieved performance optimization and data extraction across various projects.
Successfully processed delta data and enabled customer report extraction from S3.
Designed, implemented, and supported diverse big data applications using Apache Spark, Hadoop, and AWS.
Achieved performance optimization, data extraction, cleaning, and reporting across various projects.
Overview: This project is part of Mastercard Inc., focusing on payment transaction processing and related payment services. Responsibilities: Understood upstream source nature and aligned with business cases. Monitored multiple batches and executed multiple scripts on various servers. Performed initial data load using `Initial_Dataload.sql`, checking table structure and sample data. Filtered data from several files and created Standard Operating Procedures (SOPs) for automation.
Key outcomes:
Enabled efficient data filtering and initial data loading for critical payment processing systems.
Created SOPs to streamline and automate operational tasks.
Overview: This project supports UniPol, a global leading manufacturing company specializing in investment casting technology for automotive and aerospace industries, by analyzing data for its data warehouse. Responsibilities: Understood upstream source nature and business cases, connecting to a PostgreSQL instance via Workbench. Performed initial data load using `Initial_Dataload.sql`, verifying table structure and sample data. Copied all dependencies on EMR and created HBase tables using Phoenix script. Wrote Spark jobs to process data and imported reference data from RDBMS to HBase.
Key outcomes:
Successfully implemented data loading and processing pipelines for manufacturing data.
Integrated HBase and Spark for efficient data handling and transformation.
Overview: This project involved developing and architecting Hadoop applications for a UK branch client, focusing on data processing and analysis. Responsibilities: Interacted with onsite teams and customers to clarify and formulate exact requirements for Hadoop applications. Imported structured data from RDBMS to HDFS using Sqoop and performed necessary transformations. Processed data into HDFS by developing solutions and analyzed data using Spark and Hive to produce summary results into Hadoop.
Key outcomes:
Successfully designed and implemented efficient data ingestion and processing workflows using Hadoop ecosystem components.
Improved data retrieval performance by designing optimized Hive and HBase schemas with partitioning and bucketing.
Key outcomes:
Ensured quality assurance standards were consistently achieved through thorough testing and defect remediation.
Contributed to creating customer reports and processing delta data, enhancing data accessibility and utility.
Nitesh
Big Data Engineer