Thirunagari Gopi Chaithanya is a Senior Data Engineer with 6+ years of experience specializing in Big Data and cloud technologies. He has a proven track record in designing and implementing complex data pipelines and architectures.
Designed and implemented ETL Warehouse data modeling using Informatica, Cassandra, and Power BI.
Engineered and deployed end-to-end ML applications within Azure MLops Cloud Environment.
Optimized Hive QL/Pig scripts using execution engines like Tez and Spark, improving query performance.
Developed robust real-time data streaming and analytics solutions using Kafka, Storm, HBase, and Spark.
Implemented CI/CD pipelines with Jenkins and Terraform for automated deployments.
Multi-cloud (AWS + Azure) data engineering
Overview: This project involved data processing and transformation for a financial advisory client, focusing on Big Data solutions. Responsibilities: Developed multiple Kafka & Kafka Connectors for producing and consuming data based on software requirements. Configured Spark Streaming Jobs in Airflow for automated script deployment to process ongoing data from Kafka and store it in HDFS. Designed and implemented jobs in Azure Databricks, Azure Data Storage, Azure Synapse ETL, Azure Cosmos DB, EventHub, Azure Data Catalog, Azure Functions, Azure Purview, and MDM.
Key outcomes:
Configured Spark Streaming jobs in Airflow for automated script deployment, resulting in easier job scheduling.
Designed data migration pipelines to Azure cloud using Azure SQL.
Overview: This project focused on Big Data batch and real-time processing solutions within a financial services context. Responsibilities: Integrated Spring Boot with Elastic Search and Kibana to display application status and log records. Wrote and tuned queries using PySpark, PostgreSQL & Airflow Application for data analysis.
Key outcomes:
Implemented Spring Boot integration with Elastic Search and Kibana to provide real-time application status.
Designed and developed authorization policies for Big Data Hadoop resources using Ranger API.
Overview: This project focused on real-time data streaming and analytics for a financial services group, involving Microservices deployment in Azure. Responsibilities: Designed and developed real-time data streaming and analytics using Kafka, Storm, and HBase, Spark.
Key outcomes:
Designed and developed real-time data streaming and analytics solutions using Kafka, Storm, HBase, and Spark.
Deployed microservices using SpringBoot, MongoDB, and Kafka in an Azure Production Cluster.
Overview: This project involved a customer credit card system and an Online Investment Service (OIS). Responsibilities: Worked on the complete life cycle of software development, including new requirement gathering, redesigning, and implementing business functionalities.
Key outcomes:
Successfully implemented new business functionalities for a customer credit card system.
Managed database objects and enhancements in Oracle 11g database.
Thirunagari Gopi Chaithanya
Big Data engineer with Talend