SDE II @ AWS · Bellevue, WA

Ravi Kiran
Vadlamani

Software engineer shipping high-availability, performance, and correctness improvements to Tier-1 real-time systems at AWS. 4+ years across distributed systems, production ML serving, and high-throughput backends.

70%
P99 latency reduction
83%
Query time reduction
8x
Pipeline speedup
4+
Years building scale

01 — Career

Experience

Software Development Engineer II

Amazon Web Services (AWS)

Mar 2025 — Present Bellevue, WA
Java 17
Spring
DynamoDB
AWS CDK
CloudWatch
SNS/SQS
  • Improved P99 latency of a real-time routing hot path by ~70% via parallelized execution redesign for the largest-scale customer tenants; validated with load testing exceeding peak production traffic.
  • Delivered an upstream data-shaping optimization that reduced search-index query time by ~83% with no correctness regressions.
  • Authored HLD/LLD for a distributed-locking redesign activating all hosts in a cluster (vs. single leader), reducing single-host-failure blast radius by ~75%.
  • Built end-to-end canary infrastructure (CDK + integration test framework) with automated deployment gating on composite alarms.
  • Built internal GenAI tooling for operational reviews and on-call gameday — recognized with team-level AI innovation award.

Software Engineer II

Apexon

Feb 2023 — Mar 2025 Santa Clara, CA
Java/Spring
Python/Flask
Snowflake
OpenShift
Control-M
  • Optimized a Data Quality profiler via Python multithreading and SQL tuning — runtime reduced from 4 hours to ~30 minutes (~8x).
  • Built Flask APIs to validate confidential documents against standardized schemas; owned production ML inference deployment & ops.
  • Built the backend framework for generative AI use cases on dbt, coordinating LLM calls, SQL transformations, and pipelines with retry/backoff.

Software Engineer Intern

Amazon Lab 126

Jun 2022 — Aug 2022 Sunnyvale, CA
Python
Java
PyTorch
DynamoDB
Spark
EMR
  • Built a Java service (Guice/Spring) exposing APIs for dynamic test sets — turnaround reduced from 2 weeks to <2 hours.
  • Built a Python orchestration layer provisioning EMR clusters, running Spark jobs, and tearing them down as compute backend.
  • Improved Alexa model perceptibility on low-confidence utterances via loss-function changes with the science team.

Senior Engineer → Assistant Manager

Indian Oil Corporation

Aug 2016 — Aug 2021 Paradeep, India
Spring Boot
FastAPI
Node.js
Spark
AWS
  • Led a 10-person cross-functional team operating a large-scale LPG terminal; built a gas-leak prediction & interlock shutdown system and a vision-based personnel-tracking system for hazardous zones.
  • Designed a distributed web application handling 100,000 concurrent API requests and 1M database records.

02 — Toolkit

Skills & Technologies

Languages

JavaPythonJavaScript/TSC/C++

Backend & Frameworks

Spring (Boot, Security, Data)Hibernate/JPAFastAPIFlaskDjangoNode.js/ExpressGraphQLKafka

Frontend

ReactAngularReduxHTML/CSS/SASS

Cloud & DevOps

AWS (DynamoDB, Lambda, CDK, CloudWatch)DockerKubernetesCI/CD

Distributed Systems

Consensus & leader electionSharding & replicationCachingEvent-driven architecturesLoad testingObservability

ML Infra

TensorRT-LMTriton Inference ServerPyTorchSpark / EMR

03 — Foundations

Education

Dec 2022

Carnegie Mellon University

M.S. Electrical & Computer Engineering

Concentration: AI/ML Systems · Pittsburgh, PA

May 2015

NIT Tiruchirappalli

B.Tech. Instrumentation & Control Engineering

Tiruchirappalli, India

Let's build

Ready to ship something meaningful?

Always interested in conversations about distributed systems, infrastructure, and ML platforms.