Syed is a Data Engineer with 5+ years of experience in designing and building scalable data pipelines and warehouses. He has a proven track record of leveraging technologies such as AWS, Azure, and GCP to drive business impact.
Built and automated scalable data pipelines resulting in a 7% CTR increase for email campaigns.
Led data processing migration from SAS to Python, achieving an 80% runtime reduction.
Designed and implemented data migration pipelines using Apache Airflow and Snowflake.
Achieved a 7% CTR increase for email campaigns.
Achieved an 80% runtime reduction for data extraction, transformation, and preparation.
Overview: This project focused on building and automating scalable data pipelines for customer data to enable data-driven email campaign targeting. Responsibilities: Built and automated scalable data pipelines using PySpark and Databricks for customer data, contributing to a 7% click-through rate (CTR) increase. Owned and managed scalable data pipelines utilizing Azure Data Factory. Developed and automated a data reporting framework by creating a central repository of modular SQL models with dbt. Maintained and scaled the data architecture by onboarding new data sources and adapting dbt models on Azure Data Bricks. Mentored team members on data management, fostering a culture of data literacy and best practices.
Key outcomes:
Achieved a 7% click-through rate (CTR) increase for email campaigns.
Streamlined data access and insights for leadership.
Overview: This project involved designing and implementing a data migration pipeline to Snowflake, optimizing data storage and retrieval for analysis. Responsibilities: Designed and implemented a Data Migration Pipeline using Apache Airflow to migrate customer data to Snowflake. Ensured data quality and consistency for further analysis, with dbt models providing well-documented and maintainable transformations.
Key outcomes:
Ensured data quality and consistency for customer data.
Optimized data storage and retrieval for efficient data exploration.
Overview: This project focused on supporting data pipelines for machine learning model development and processed data per business requirements for various clients. Responsibilities: Designed Data Warehouse Schemas in Snowflake to optimize data storage and retrieval. Collaborated on data governance, ensuring adherence to data quality and policy standards.
Key outcomes:
Ensured data quality and consistency for customer data.
Identified key SKUs impacting store traffic to inform strategies.
Syed
Data Engineer