I build production AI systems, understand the math that makes them work, and ship the ones that move revenue.
AI/ML engineer and researcher with a sharp focus on production ML systems, generative AI pipelines, and healthcare analytics. I care about building things that actually work in the real world — not just in notebooks.
Currently pursuing my M.S. in AI & Business Analytics at USF, with hands-on research at the Muma College of Business and the Centre for Transportation Research. Previously an AI Research Intern at ISRO.
I build at the intersection of multi-agent AI, computer vision, NLP, and data engineering — wherever hard problems meet real stakes.


Production-ready systems with real data, real metrics, real impact.
Agentic RAG system for structured extraction from pharmaceutical documents — LangGraph orchestration, ChromaDB vector store, RAGAS evaluation pipeline, FastAPI + Docker deployment.
Whisper + spaCy pipeline converting clinician audio into structured SOAP notes. 70% reduction in manual structuring time. Dockerized for HIPAA-compliant local deployment.
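Stripped of Whisper and the spaCy models, the structuring step reduces to mapping transcript sentences into SOAP sections. A rule-based sketch, with keyword cues that are purely illustrative (the real pipeline would use trained matchers):

```python
import re

# Illustrative keyword cues per SOAP section; a production pipeline
# would use spaCy entity/phrase matchers tuned on clinician language.
SECTION_CUES = {
    "subjective": ("reports", "complains", "feels", "denies"),
    "objective": ("bp", "temp", "exam", "heart rate"),
    "assessment": ("likely", "consistent with", "diagnosis"),
    "plan": ("prescribe", "follow up", "order", "refer"),
}

def to_soap(transcript: str) -> dict:
    """Assign each sentence to the first SOAP section whose cue it mentions."""
    note = {section: [] for section in SECTION_CUES}
    for sentence in re.split(r"(?<=[.!?])\s+", transcript.strip()):
        low = sentence.lower()
        for section, cues in SECTION_CUES.items():
            if any(cue in low for cue in cues):
                note[section].append(sentence)
                break
    return note

note = to_soap(
    "Patient reports chest tightness on exertion. "
    "BP 138/86, heart rate 92. "
    "Likely stable angina. "
    "Order ECG and follow up in two weeks."
)
```

Everything stays local, which is what makes the Dockerized deployment workable under HIPAA constraints.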
Short opinions, held with conviction. Subject to change when the data says so.
If I can't measure the system, I can't improve it. Every LLM pipeline starts with a golden set + RAGAS-style eval loop before I touch the graph. Faithfulness, context precision, answer relevance — numbers first, vibes second.
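A stripped-down version of that loop looks like this, with naive token-overlap stand-ins for the RAGAS metrics (real faithfulness and relevance scoring uses an LLM judge; the pipeline and golden set here are hypothetical):

```python
def overlap(a: str, b: str) -> float:
    """Fraction of a's tokens that also appear in b: a crude proxy metric."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta) if ta else 0.0

def evaluate(golden_set, pipeline):
    """Run a pipeline over a golden set and report averaged proxy metrics."""
    scores = {"faithfulness": 0.0, "answer_relevance": 0.0}
    for example in golden_set:
        answer, context = pipeline(example["question"])
        # Is the answer grounded in the retrieved context?
        scores["faithfulness"] += overlap(answer, context)
        # Does the answer actually address the question?
        scores["answer_relevance"] += overlap(answer, example["question"])
    n = len(golden_set)
    return {k: round(v / n, 3) for k, v in scores.items()}

# Hypothetical pipeline that copies from its context, so it should
# score perfectly on faithfulness.
def echo_pipeline(question):
    context = "aspirin dosage is 100 mg daily"
    return "aspirin dosage is 100 mg daily", context

golden = [{"question": "what is the aspirin dosage"}]
report = evaluate(golden, echo_pipeline)
```

The point is the shape, not the metric: the eval harness exists before the graph does, so every change to the pipeline gets a number.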
Fine-tuning is a last resort. 90% of "we need a custom model" is really "we need better retrieval + prompting." I reach for RAG, tool use, and structured output first — cheaper, safer, and swap-in-a-better-model friendly.
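In practice, "structured output first" means validating the model's JSON against a schema before anything downstream touches it. A minimal sketch, assuming hypothetical field names; returning errors instead of raising keeps the failure mode explicit:

```python
import json

# Illustrative schema: field name -> expected Python type.
REQUIRED_FIELDS = {"drug_name": str, "dose_mg": float}

def parse_structured(raw: str):
    """Parse and validate model output; return (record, error) rather than raising."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return None, f"invalid JSON: {exc}"
    for field, ftype in REQUIRED_FIELDS.items():
        if field not in data:
            return None, f"missing field: {field}"
        if not isinstance(data[field], ftype):
            return None, f"wrong type for {field}"
    return data, None

record, err = parse_structured('{"drug_name": "aspirin", "dose_mg": 100.0}')
bad, bad_err = parse_structured('{"drug_name": "aspirin"}')
```

A validation failure becomes a retry-with-feedback or a fallback, not a silent corruption three stages later. Only when that loop is exhausted does fine-tuning enter the conversation.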
A working demo on one small surface beats a roadmap slide, every time. Every project I own has a Docker image, a FastAPI endpoint, and one metric I can defend — not just a notebook that runs on my laptop.
"Multi-agent" is glamorous. But every agent loop eventually becomes a deterministic graph with guardrails. I design for that reality — explicit edges, retry budgets, failure modes — instead of hoping the LLM figures it out.
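That "explicit edges, retry budgets, failure modes" shape fits in a few lines of plain Python. A toy graph runner; the node names and the flaky step are illustrative, not from any real project:

```python
def run_graph(nodes, edges, start, state, max_retries=2):
    """Walk an explicit node graph. Each node returns (state, outcome);
    edges map (node, outcome) -> next node, and "end" halts. A node that
    returns "error" is retried up to max_retries, then takes its
    explicit "fail" edge instead of looping forever."""
    current = start
    while current != "end":
        retries = 0
        while True:
            state, outcome = nodes[current](state)
            if outcome != "error" or retries >= max_retries:
                break
            retries += 1
        if outcome == "error":
            outcome = "fail"  # retry budget exhausted: take the failure edge
        current = edges[(current, outcome)]
    return state

# Illustrative nodes: 'extract' fails once, then succeeds on retry.
attempts = {"n": 0}
def extract(state):
    attempts["n"] += 1
    if attempts["n"] < 2:
        return state, "error"
    return {**state, "extracted": True}, "ok"

def validate(state):
    return {**state, "valid": True}, "ok"

nodes = {"extract": extract, "validate": validate}
edges = {("extract", "ok"): "validate",
         ("extract", "fail"): "end",
         ("validate", "ok"): "end"}
final = run_graph(nodes, edges, "extract", {})
```

Every transition is enumerable, so "what happens when the extractor fails twice?" has an answer you can read off the edge table instead of discovering in production.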
The model is the easy part. Making sure the 150K-row entity-resolution job actually matches the right companies, or that 100 hours of traffic video get labeled without drift: that's where the work hides. Own the pipeline end-to-end.
Kubernetes for a weekend project is a red flag. I pick the smallest stack that survives the next 6 months, then spend the saved energy on the part that's actually interesting — the research question, the eval, the user experience.
A year of commits across my open-source work, research pipelines, and side projects.
Training keeps me sharp. The gym teaches progressive overload, a principle I apply just as rigorously to engineering. Consistency over intensity, every single day.
Gardening grounds me. Watching something you planted grow over time — patience, care, long-term thinking. The same mindset I bring to building ML systems.
I build AI, understand how it works, and apply it where it moves the business. If that sounds useful — let's talk.