Skip to content
View VinnuReddy18's full-sized avatar

Highlights

  • Pro

Block or report VinnuReddy18

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
VinnuReddy18/README.md

Vinay Kumar Reddy

Software Engineer • AI Engineer • GenAI Systems Builder


Bengaluru, India • [email protected]


About

I am a software engineer focused on building real GenAI systems, not demos.

My work revolves around designing LLM pipelines, AI agents, and backend-heavy architectures that are reliable, scalable, and production-ready. I enjoy working close to the system layer where prompts, embeddings, retrieval, APIs, and infra meet.

I like shipping fast, then refactoring hard.


What I Work On

  • LLM agents and workflow orchestration engines
  • RAG systems including chunking, embeddings, retrieval, routing
  • Prompt optimization and reliability engineering
  • Backend-first full stack AI products

Experience

Pocket

Software Engineering Intern
Remote | Oct 2025 to Nov 2025

  • Rewrote and optimized the entire prompt flow system to significantly improve LLM accuracy and consistency
  • Built a customizable summary agent with control over depth, tone, and structure
  • Designed an auto summarization pipeline that intelligently routes transcripts without manual user selection

Handa Uncle

AI Engineering and Web Development Intern
Remote | Jun 2025 to Oct 2025

  • Built the complete backend from scratch with REST APIs, authentication, and RAG logic
  • Implemented chunking, embeddings, and contextual retrieval pipelines
  • Integrated Zerodha Kite MCP to enable conversational access to live portfolio data
  • Used Firebase Realtime DB for fast data storage and retrieval
  • Collaborated closely with mobile app developers for seamless backend integration

Project Dark Horse

Software Engineering Intern
Bangalore | Oct 2024 to Dec 2024


Saphaare Labs Pvt Ltd

Software Development Engineer Intern
Remote | Sep 2024 to Mar 2025

  • Built a full stack AI SaaS platform integrating GPT, Claude, and Flux
  • Developed secure authentication using Firebase
  • Used Cloudflare Workers and R2 for scalable infra and storage
  • Implemented AI driven blog generation workflows and history tracking
  • Product link

Featured Project

AurAgent

AI Workflow Automation Platform

Tech stack
Next.js 14 • TypeScript • Python • FastAPI • PostgreSQL • ReactFlow • GPT 4

  • LLM powered platform that converts natural language instructions into executable workflows
  • Designed a DAG based async execution engine supporting more than 15 node types
  • Built a visual workflow editor with drag and drop nodes, real time validation, and smooth animations
  • Integrated GPT 4 via OpenRouter with streaming responses and schema validation
  • Implemented secure credential vault, multimodal file processing, and more than 20 REST APIs

This project focuses on infrastructure level GenAI engineering, not surface level tooling.


Tech Stack

Languages

Java • JavaScript • TypeScript • Python

Frameworks

Node.js • Express.js • React.js • Next.js • Spring Boot

AI and LLM Systems

RAG • Embeddings • Prompt Engineering
LLM Workflow Optimization • Text Summarization
Chat Server Architecture

Databases and Infra

MySQL • MongoDB • PostgreSQL
Firebase Realtime DB • Cloudflare R2

Tools

GitHub • Docker • Firebase • Cloudflare
VS Code • Postman • Android Studio


GitHub Activity


Philosophy

I care about building systems that work under real constraints.

Clean backend architecture. Reliable LLM behavior. Products that scale beyond demos.

If that resonates, we will get along.


Engineering AI, not just calling APIs.
Open to internships, collaborations, and high impact work.

Pinned Loading

  1. cloudflare-rag cloudflare-rag Public

    Forked from RafalWilinski/cloudflare-rag

    Fullstack "Chat with your PDFs" RAG (Retrieval Augmented Generation) app built fully on Cloudflare

    TypeScript

  2. llamatutor llamatutor Public

    Forked from Nutlope/llamatutor

    An AI personal tutor built with Llama 3.1

    TypeScript

  3. talent-matchmaking talent-matchmaking Public

    TypeScript

  4. textract-ocr textract-ocr Public

    HTML