
Data Engineer II  ·  Bangalore

Sumit Chaurasia

Data engineer who never says no.
I find the solution, not the workaround.
Built on Databricks, PySpark, and stubbornness.

5+ Years Experience
100TB+ Data Pipelines
90% Performance Gains
200+ Merchants Onboarded
3 Companies
70% Onboarding Time Cut
40% Glue Performance Boost
01 — Core Stack

Tools that move data at scale.

Five years across three companies — supply chain, financial planning, and alternative data. Every tool below has been production-tested against real volume, real deadlines, and real consequences.

PySpark · 5+ years
Databricks · 3+ years
SQL / T-SQL · 5+ years
Python · 5+ years
Cloud & Infra
AWS Glue · S3 · Lambda · Azure ADF · ADLS
Data Stack
Delta Lake · Unity Catalog · dbt · Airflow · SSIS
Architecture
Medallion · Lakehouse · ETL/ELT · Star Schema
Other
PostgreSQL · Bash · CircleCI · DABs · MCP · MLflow
02 — Work History

Where the work happened.

From enterprise supply chain at a unicorn, through a Big Four consulting engagement, to alternative data powering hedge-fund products. Each role pushed the stack harder than the last.

01 / 03
YipitData
Jul 2025 – Present
Data Engineer II
  • Architected a merchant-configurable ETL framework on Databricks, standardizing pipelines for 200+ retail merchants.
  • Reduced new-merchant onboarding time by 70% through an object-oriented framework and MCP-driven automation.
  • Building a unified Delta Lakehouse on Unity Catalog powering hedge-fund analytical products.
  • Improved data quality through OCR-aware reconciliation and LLM-based product enrichment.
02 / 03
Deloitte USI
Nov 2024 – Jul 2025
Data Consultant — Supply Chain Network Optimization
  • Improved AWS Glue batch processing performance by 40% through Spark tuning and partitioning.
  • Built automated test frameworks covering end-to-end data flows for a global automotive client.
  • Bridged business stakeholders and technical teams, driving PostgreSQL data model improvements.
03 / 03
o9 Solutions
Dec 2020 – Nov 2024
Technical Consultant → Senior Technical Consultant
  • Reduced data processing time by 90% through query tuning, partitioning, and parallelization.
  • Designed star and snowflake schema models for enterprise data warehousing.
  • Architected ETL/ELT pipelines using ADF, Databricks, Spark, and SSIS.
  • Built pipelines processing millions of records daily; cut SQL execution time by 40%.
03 — Based in

HSR Layout, Bangalore.

Open to remote, hybrid, and onsite opportunities. Available for relocation discussions.

04 — Let's Talk

Open to interesting problems.

If you have a hard data problem worth solving, I want to hear about it.