Skip to content
View pranavthombare's full-sized avatar
🦧
Focusing
🦧
Focusing

Organizations

@LegendROM-N @Oneplus-msm8998-pie @bchi-coursework

Block or report pranavthombare

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pranavthombare/README.md

πŸ‘‹ Hey, I'm Pranav Thombare

πŸš€ Machine Learning Engineer | Systems Builder | LLM Infra

I build high-performance AI systems that actually run in production.

From optimizing LLM inference pipelines to working deep in operating systems, I enjoy solving problems across the entire stack β€” from kernels to large language models.


🧠 What I Do

  • ⚑ Optimize LLM/VLM systems for latency, throughput, and scale
  • πŸ— Build end-to-end AI pipelines (RAG, agents, training systems)
  • πŸ” Design intelligent systems that replace manual workflows
  • 🧩 Work across systems: OS β†’ backend β†’ ML β†’ infra

πŸš€ Current Focus

  • 🧠 LLM inference optimization (TensorRT-LLM, vLLM, SGLang)
  • πŸ”— Retrieval-Augmented Generation (RAG) systems
  • πŸ€– Agentic workflows using LangGraph
  • βš™οΈ Distributed systems & Kubernetes-based deployments

πŸ›  Tech Stack

Languages
Python C++ Rust

ML / AI
PyTorch TensorRT-LLM Triton vLLM SGLang

Systems & Infra
Kubernetes Docker AWS GCP

Other
AOSP Linux Kernel RAG Pipelines LoRA Quantization


πŸ— Notable Work

  • ⚑ Improved LLM performance by 60%+ using speculative decoding & KV cache optimization
  • πŸ“„ Built a VLM-based document parsing system (98%+ accuracy)
  • πŸ€– Developed autonomous agents for processing real-world business workflows
  • πŸ§ͺ Built RAG pipelines to generate automated integration tests from codebases
  • πŸ“± Former Android OS engineer working on kernel, SELinux & device security

🀝 Open to Collaborate

  • 🐧 Linux Kernel / systems programming
  • πŸ€– LLM / ML infrastructure
  • πŸ›  Developer tools & infra-heavy projects

πŸ“« Reach Me

  • πŸ“§ Email: check profile
  • 🌐 LinkedIn
  • πŸ’» GitHub
  • πŸ“± Telegram / Instagram / Unsplash: @pranavthombare

⚑ Fun Facts

  • πŸ₯‹ I can use nunchucks
  • πŸ”οΈ Trekked to Everest Base Camp

🧭 Philosophy

Build things that are not just impressive β€” but useful, scalable, and real.

Pinned Loading

  1. Pontoon Pontoon Public

    C++

  2. cameraX cameraX Public

    A demo working case and an implementation of AndroidX library: CameraX. This will go through the new features of the library: preview, analysis and image capture.

    Kotlin

  3. cowin_appointment_tracker cowin_appointment_tracker Public

    Python 1

  4. llm-napkin llm-napkin Public

    TypeScript 2

  5. device_oneplus_msm8998-common device_oneplus_msm8998-common Public

    C++ 1

  6. android_kernel_oneplus_msm8994 android_kernel_oneplus_msm8994 Public

    Forked from LegendROM-N/android_kernel_oneplus_msm8994

    C