A minimal, production‑style Retrieval Augmented Generation (RAG) application. Users can upload a document (PDF, TXT, Markdown) and ask questions about its content through a chat interface.
The project focuses on correctness, clarity, and real‑world engineering practices rather than UI polish or experimental optimizations.
- Document upload (PDF, TXT, Markdown)
- Text chunking and embedding
- Vector storage using PostgreSQL + pgvector
- Semantic retrieval of relevant chunks
- Question answering using Gemini (free tier)
- Simple chat‑style frontend
- Python
- FastAPI
- SQLAlchemy (async)
- PostgreSQL + pgvector
- asyncpg
- Gemini (google‑genai SDK)
- React (Vite)
- Fetch API
- Plain CSS
rag-app/
├── backend/
│ ├── app/
│ │ ├── api/ # Route definitions
│ │ ├── core/ # DB and config
│ │ ├── models/ # ORM models
│ │ ├── services/ # Chunking, embeddings, retrieval, LLM
│ │ └── main.py # FastAPI app entry
│ └── requirements.txt
│
├── frontend/
│ ├── src/
│ │ ├── api/ # Backend API calls
│ │ ├── components/ # UI components
│ │ └── styles.css
│ └── package.json
│
└── README.md
- Python 3.10+
- Node.js 18+
- PostgreSQL 14+
- pgvector extension enabled
- Gemini API key (free tier)
```bash
cd backend
python -m venv venv
source venv/bin/activate   # Windows: venv\Scripts\activate
pip install -r requirements.txt
```

Create a `.env` file or export variables:

```
DATABASE_URL=postgresql+asyncpg://user:password@localhost:5432/rag_db
GEMINI_API_KEY=your_api_key_here
```

Ensure pgvector is enabled:

```sql
CREATE EXTENSION IF NOT EXISTS vector;
```

Start the backend:

```bash
uvicorn app.main:app --reload
```

Backend runs at: http://localhost:8000
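As a minimal sketch, the backend might load these variables at startup with a small helper like the one below (the `get_settings` function is hypothetical, not part of the repository; only the variable names match the `.env` keys above):

```python
import os

def get_settings() -> dict:
    """Read required configuration from the environment, failing fast if missing."""
    settings = {}
    for key in ("DATABASE_URL", "GEMINI_API_KEY"):
        value = os.environ.get(key)
        if value is None:
            raise RuntimeError(f"Missing required environment variable: {key}")
        settings[key] = value
    return settings
```

Failing fast at startup surfaces misconfiguration immediately instead of at the first database call.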
```bash
cd frontend
npm install
npm run dev
```

Frontend runs at: http://localhost:5173
- Open the frontend in the browser
- Upload a PDF, TXT, or Markdown file
- Ask questions related to the document
- The system retrieves relevant content and generates an answer
- File uploaded via frontend
- Backend extracts text (PDF parser or raw text)
- Text is split into chunks
- Each chunk is embedded using Gemini embeddings
- Embeddings are stored in PostgreSQL (pgvector)
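The "simple chunking" step could be as small as a fixed-size splitter with overlap, along these lines (the chunk size and overlap values here are illustrative defaults, not the project's actual settings):

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks, overlapping neighbors slightly."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must be larger than overlap")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Advance by less than chunk_size so adjacent chunks share context.
        start += chunk_size - overlap
    return chunks
```

The overlap keeps a sentence that straddles a chunk boundary retrievable from either side, which is why fixed-size chunking stays "predictable and easy to reason about."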
- User asks a question
- Question is embedded
- Vector similarity search retrieves relevant chunks
- Retrieved text is sent as context to the LLM
- LLM generates a grounded answer
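Conceptually, the vector similarity search ranks stored chunk embeddings by closeness to the question embedding. pgvector does this inside PostgreSQL; the pure-Python sketch below only illustrates the idea (function names are hypothetical):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 1.0 means identical direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def top_k(query_vec: list[float], chunk_vecs: list[list[float]], k: int = 3) -> list[int]:
    """Return indices of the k chunk vectors most similar to the query."""
    ranked = sorted(range(len(chunk_vecs)),
                    key=lambda i: cosine_similarity(query_vec, chunk_vecs[i]),
                    reverse=True)
    return ranked[:k]
```

In production the same ranking is expressed as an `ORDER BY embedding <=> :query` clause in SQL, so the database, not the application, does the heavy lifting.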
- Async SQLAlchemy + asyncpg: avoids blocking I/O and aligns with FastAPI’s async model
- pgvector: simple, production‑ready vector storage without extra infrastructure
- Simple chunking: predictable behavior, easy to reason about
- No chat history storage: keeps scope aligned with assessment requirements
- Minimal frontend: focuses on usability and clarity, not visual polish
The optional “Accuracy and Performance Considerations” section from the assessment was intentionally not implemented.
- CORS is enabled for local development
- No paid APIs or proprietary services are used
This project is provided for assessment and educational purposes.