Built real-time event-driven data pipeline with SQS and Lambda, filtered data using EventBridge and uploaded data to S3.
Developed real-time streaming pipeline to capture data changes in DynamoDB using Kinesis Data Streams, Kinesis Firehose, Glue, Lambda, S3 and EventBridge. Used Athena for data analysis.
Designed a data ingestion and transformation pipeline on transactional data. Crawled data using Glue and developed ETL Glue job to enrich/validate and upsert to Redshift.
Extracted sales data from S3, transformed with Glue Spark, stored in MySQL RDS; secured via IAM, VPC endpoints.
Modified ETL pipelines in Informatica, and tested workflows to load data into two new columns.
Led migration of 300+ equity finance, Bloomberg, fixed income datasets from on-prem SQL systems to Hadoop cluster for the PRIME Data science platform. Performed data validation checks in HIVE and Impala.
Performed development, testing and deployment of shell linkage id column addition to contracts table which helped connect borrows with loans strategically to look at exposure.
Enabled BCI Regulatory Operations Team identify daily changes to security prioritization by importing two new columns to stock loan table, complying with SEC customer protection rule.
Created detailed runbooks that reduced production risks by 25% and performed UAT validations.
Guided 2 teams in migrating 10 applications to an automated CI/CD pipeline, enhancing deployment speed by 40%. Created POC and presented to team leads.
Developed Python script to report changes between tables' metadata and helped forecast Identity column max usage.
Reduced operation time by 50% by automating REST API stress testing in Locust using Python.
Performed pivot, VLOOKUPS, nested conditions in Excel for data validation. Aligned with release plans and made on-time deliveries.
Forward Thinking Technology SolutionsJune 2022 - July 2022
Product/BI Intern
Created 5 reports for internal teams and clients by conducting stakeholder meetings to understand reporting requirements and KPIs.
Analyzed insights from 500+ JIRA tickets by creating Power BI dashboards and established resolution time metrics, completing a migration process in under 3 months.
Developed interactive dashboards in Power BI and tracked adoption rate and conversion rates to drive sales.
Continental Automotive AG2017 - 2021
Data Analyst | Bengaluru
Implemented 10 process improvements that increased code coverage of customer reports by 300%.
Developed Excel reports, Tableau dashboards and SQL queries to improve timing analysis by 400ms for 20+ OEMs.
Created and tested automation scripts using Python, effectively saving monthly 25 hours of manual effort.
Optimized SQL queries that improved efficiency of data extraction tool.
EDUCATION
Pennsylvania State University
Master of Science in Data Analytics
Amrita School of Engineering
Bachelor of Technology in Electronics and Communication Engineering
ADDITIONAL INFORMATION
Skills: SQL, Informatica, AWS, Excel, Tableau, Power BI, Python, Data Engineering
Certificates: AWS Solution Architect Associate, AWS Cloud Practitioner, Microsoft Power BI