S. Vinay Kumar

Senior Data Engineer | Cloud Data Solutions Expert

Building scalable data pipelines and cloud solutions that turn data into business insights, backed by 10+ years of experience in AWS, big data, and machine learning integration.

Cloud Specialist

Extensive experience with AWS, Azure, and GCP cloud platforms for data solutions.

Data Pipeline Expert

Designed and optimized ETL/ELT processes for high-volume data processing.

ML Integration

Integrated machine learning models into production data pipelines for actionable insights.

About Me

Senior Data Engineer with a decade of experience building robust data solutions

Professional Summary

I'm a Senior Data Engineer specializing in building and optimizing end-to-end data pipelines, ETL processes, and large-scale data solutions on cloud platforms.

With extensive experience in AWS services (Glue, Lambda, Redshift, SageMaker, EMR) and big data technologies (Spark, Hadoop, Kafka), I've helped organizations transform their data infrastructure for better decision-making.

My expertise includes designing complex data models, implementing real-time data streaming solutions, and integrating machine learning models into production pipelines.

I hold a Master's degree in Information Systems with a focus on Business Analytics from Marist College and a Bachelor's degree in Computer Science Engineering.

Key Skills

Cloud Platforms (AWS, Azure, GCP): 95%
Data Engineering: 98%
Big Data Technologies: 90%
Machine Learning: 85%

Work Experience

My professional journey through leading organizations

Visa Inc

Senior Data Engineer

March 2023 - Present

• Leveraged AWS Kinesis for real-time data streaming, simplifying data ingestion processes

• Built predictive analytics models using AWS SageMaker, improving decision-making accuracy

• Developed and optimized ETL pipelines using AWS Glue, improving data integration efficiency

Change Healthcare

Senior Data Engineer

November 2021 - February 2023

• Designed ETL processes in AWS Glue to migrate historical product purchase data from on-premises systems to Amazon Redshift

• Automated data analytics pipelines using Apache Airflow, optimizing Tableau dashboards

• Implemented data security strategy using AWS IAM policies, KMS encryption, and S3 bucket policies

The Home Depot

Senior Data Engineer

April 2019 - October 2021

• Designed and implemented ETL/ELT pipelines using AWS Glue, Lambda, and Step Functions

• Automated cloud infrastructure setup with Terraform, reducing deployment time

• Implemented data governance practices including data quality checks and lineage tracking

M&T Bank

AWS Data Engineer

June 2017 - March 2019

• Managed cloud-based infrastructure on AWS (EC2, S3, Lambda) adhering to data governance policies

• Applied machine learning techniques to analyze large datasets for strategic decision-making

• Designed and implemented scalable data pipelines using Hadoop and HDFS

Technical Skills

My expertise across various technologies and platforms

Cloud Platforms

AWS Glue, AWS Lambda, Amazon Redshift, AWS SageMaker, AWS EMR, AWS Step Functions, AWS Kinesis, AWS Athena, Azure Data Factory, Azure Synapse, Google Cloud Dataflow, Google BigQuery

Data Tools

Apache Spark, Apache Kafka, Apache Airflow, Apache Flink, Hadoop, HDFS, Hive, Presto, Snowflake, DBT, Talend

Programming

Python, SQL, Java, Scala, JavaScript, PL/SQL, Bash, R

ML & Databases

TensorFlow, Scikit-Learn, Pandas, NumPy, MySQL, PostgreSQL, MongoDB, SQL Server, Teradata

Featured Projects

Some of my notable data engineering implementations

Real-time Payment Analytics

AWS Kinesis, AWS Glue, Redshift, Lambda

Implemented a real-time data streaming solution for payment transaction analytics at Visa, reducing insight latency from hours to seconds.

Healthcare Data Migration

AWS DMS, EMR, S3, Airflow

Migrated multi-terabyte healthcare data from on-premises MySQL to Amazon RDS with zero downtime, improving query performance by 300%.

ML Fraud Detection Pipeline

SageMaker, TensorFlow, Lambda, Step Functions

Built an end-to-end ML pipeline for fraud detection with automated model retraining and monitoring, reducing false positives by 40%.

Get In Touch

Feel free to reach out for collaborations or opportunities

Email

vinaykumarsurabhi190@gmail.com

Phone

+1 (469) 712-7243

Location

Austin, TX, USA