mostlyhemish

Hemish Veeraboina

0-1 Engineer / Startup Platform Builder / Full Stack Developer

I partner with early-stage founders to architect and launch production-ready data platforms, backend services, and full-stack apps powered by big data pipelines.

If you're building a startup and need the first version shipped with real users in mind, I'm the engineer who makes it happen—with full-stack delivery and big data infrastructure baked in.

Experience

Roles where I shipped scalable data platforms, backend services, and analytics tooling.

National Internet Observatory, Northeastern University logo

National Internet Observatory, Northeastern University

Data Engineer / Backend Engineer

Oct 2024 — Present

Engineered and optimized a Prefect + Dask ETL pipeline migrating 5–10 million MongoDB network records per minute into PostgreSQL. Containerized the stack with Docker and shipped automated GitLab CI + Helm deployments on Kubernetes to process 5+ billion packets to date. Dockerized Superset, Grafana, and Prometheus for real-time observability across databases and warehouses. Built a Django-authenticated visualization app with FastAPI + Polars services and JWT-secured HS256 endpoints that helped secure $1M in funding. Designing an open-source Dask loader targeting 25 million records per minute, boosting throughput 2–3× over Dask-Mongo.

Adobe logo

Adobe

Python Data Engineer

Aug 2024 — Oct 2024

Piloted Project AJAX to pull and transform 5,000+ Adobe Experience Manager pages in under 10 minutes through a Solr-powered ETL pipeline. Automated incremental loads, historical archiving, multi-format exports, and on-demand filtering to deliver training-ready datasets for Acrobat, Photoshop, Lightroom, Firefly, and VEGA AI. Delivered curated corpora for downstream AI/ML teams within 2–10 minutes using Python multithreading, BeautifulSoup, and Grammarly APIs.

C

Cloud Data Works

Data Engineer Intern

Oct 2023 — Dec 2023

Engineered a real-time Azure Databricks analytics platform that ingested streaming transaction logs, API feeds, and batch files with PySpark. Enhanced complex ETL pipelines in PostgreSQL using advanced SQL and CTEs to handle million-row datasets for fraud detection. Delivered comprehensive Power BI dashboards that improved operational decision-making for a major financial client.

Deloitte Touche Tohmatsu Limited logo

Deloitte Touche Tohmatsu Limited

Solution Delivery Associate

Jan 2020 — Aug 2022

Architected AWS Step Functions + Lambda ETL workflows to automate ingestion from Security Hub, GuardDuty, Inspector, and other telemetry sources. Integrated alert pipelines with Power BI while aligning classifications to PCI-DSS, AWS FSCP, and CIS standards to cut incident response time by 70%. Partnered with New York Life Infrastructure to raise ServiceNow incidents, deliver Terraform templates, and run Hadoop/Spark jobs for large-scale processing.

Certifications

Professional credentials that keep my architectures production ready.

Publications

Writing on distributed systems, analytics, and lessons from production pipelines.

Contact

Hemish Veeraboina

Let's Build Together

I partner with early-stage founders to architect and ship production-ready data platforms, backend services, and full-stack applications. If you're building something impactful, I'd love to hear from you.