Built and maintained by the Dev Department of MicroClub, the computer science club at USTHB (University of Science and Technology Houari Boumediene, Algiers).
mAIcro is an open-source AI service designed to centralize organizational knowledge and answer questions via RAG (Retrieval-Augmented Generation). It features a stateless architecture optimized for cloud deployment, automatic Discord integration, and production-ready performance.
- Features
- Quick Start
- Configuration
- Discord Bot Setup
- Architecture
- API Reference
- Deployment
- Use Cases
- Future Extensions
- Contributing
- RAG-Powered Q&A: Google Gemini with hybrid search (semantic vector + keyword BM25) and Reciprocal Rank Fusion
- Real-Time Discord Sync: Gateway WebSocket listener handles message CREATE, EDIT, and DELETE instantly
- Temporal Query Intelligence: Understands "what happened today?" and "what was the last message?"
- Question Normalization: Rewrites slang ("whats", "wanna", "gonna"), augments time-aware queries
- Startup Audit: Reconciles offline Discord edits and deletes on every restart
- Stateless Architecture: All cursors and vectors live in Qdrant Cloud; no local database
- Rate Limit Resilience: Exponential backoff with jitter plus optional secondary LLM fallback
- Production-Ready: Multi-stage Docker, health checks, graceful reconnection
There are two easy ways to run mAIcro locally. The recommended method does not require cloning the repository; it pulls and runs the published GHCR image.
- Run the published GHCR image (recommended)
Pull & run the image directly with Docker:

```bash
docker pull ghcr.io/microclub-usthb/maicro:latest
docker run --env-file .env -p 8000:8000 ghcr.io/microclub-usthb/maicro:latest
```

- Clone and run from source (for development or when you want to build locally)

```bash
git clone https://github.com/MicroClub-USTHB/mAIcro.git
cd mAIcro
cp .env.example .env
# Uncomment the `build: .` line in docker-compose.yml, then:
docker compose build
docker compose up -d
```

Fill in .env (see the Configuration section below). The API is available at http://localhost:8000, with interactive docs at http://localhost:8000/api/v1/docs.
```bash
# Sync Discord message history to Qdrant
curl -X POST http://localhost:8000/api/v1/ingest/discord

# Ask a question
curl -X POST http://localhost:8000/api/v1/ask \
  -H "Content-Type: application/json" \
  -d '{"question":"When is the next event?"}'
```

All settings are environment variables loaded from .env via pydantic-settings.
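To illustrate the settings behavior described above, here is a stdlib-only sketch that mimics how the documented environment variables might be loaded and parsed. mAIcro itself uses pydantic-settings; the class and field names here are illustrative, not the project's actual code.

```python
# Illustrative sketch only: mAIcro loads these via pydantic-settings,
# but the same behavior can be shown with the standard library.
import os
from dataclasses import dataclass, field


@dataclass
class Settings:
    gemini_api_key: str = field(
        default_factory=lambda: os.environ.get("GEMINI_API_KEY", "")
    )
    qdrant_url: str = field(
        default_factory=lambda: os.environ.get("QDRANT_URL", "")
    )
    hybrid_search_alpha: float = field(
        default_factory=lambda: float(os.environ.get("HYBRID_SEARCH_ALPHA", "0.7"))
    )
    hybrid_search_rrf_k: int = field(
        default_factory=lambda: int(os.environ.get("HYBRID_SEARCH_RRF_K", "60"))
    )
    discord_channel_ids_raw: str = field(
        default_factory=lambda: os.environ.get("DISCORD_CHANNEL_IDS", "")
    )

    @property
    def discord_channel_ids(self) -> list[str]:
        """Split the comma-separated channel list, ignoring blanks."""
        return [c.strip() for c in self.discord_channel_ids_raw.split(",") if c.strip()]
```

The full list of recognized variables follows in the tables below.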
| Variable | Description |
|---|---|
| `GEMINI_API_KEY` | Google Gemini API key (used for LLM + embeddings) |
| `QDRANT_URL` | Qdrant Cloud instance URL (e.g. https://xxxxx.cloud.qdrant.io) |
| `QDRANT_API_KEY` | Qdrant Cloud API key |
| `DISCORD_BOT_TOKEN` | Discord bot token (from the Developer Portal) |
| `DISCORD_CHANNEL_IDS` | Comma-separated Discord channel IDs to watch |
| Variable | Default | Description |
|---|---|---|
| `ORG_NAME` | `MicroClub` | Organization name embedded in the AI system prompt |
| `ORG_DESCRIPTION` | `A generic organization using mAIcro` | Organization description |
| `GOOGLE_MODEL_NAME` | `gemini-2.5-flash` | Gemini model used for answering |
| `SECONDARY_GEMINI_API_KEY` | (none) | Fallback Gemini key when the primary is rate-limited |
| `LLM_FALLBACK_ENABLED` | `false` | Set to `true` to enable automatic fallback to the secondary key |
| `COLLECTION_NAME` | `microclub_knowledge` | Name of the Qdrant collection |
| `HYBRID_SEARCH_ALPHA` | `0.7` | Blend weight between vector and keyword search (1.0 = vector only) |
| `HYBRID_SEARCH_RRF_K` | `60` | RRF constant used in result fusion |
- Go to the Discord Developer Portal and create an application
- Navigate to Bot → enable Message Content Intent under Privileged Gateway Intents
- Under OAuth2 → URL Generator, select the `bot` scope and the permissions `Read Messages/View Channels` + `Read Message History`
- Use the generated URL to invite the bot to your server
- Copy the bot token (from the Bot page) into `DISCORD_BOT_TOKEN`
- Enable Developer Mode in Discord (User Settings → Advanced → Developer Mode)
- Right-click the channels you want to watch → Copy Channel ID → paste into `DISCORD_CHANNEL_IDS` (comma-separated)
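For context, the Message Content Intent from the steps above corresponds to one bit in the Gateway identify payload's `intents` field. The constant values below come from Discord's Gateway documentation; the payload shape is simplified and the token is a placeholder.

```python
# Discord Gateway intent bits (values per Discord's Gateway docs).
GUILDS = 1 << 0
GUILD_MESSAGES = 1 << 9
MESSAGE_CONTENT = 1 << 15  # privileged: must be enabled in the Developer Portal

# A listener that watches channel messages needs at least these intents.
intents = GUILDS | GUILD_MESSAGES | MESSAGE_CONTENT

# Simplified identify payload sent over the Gateway WebSocket (opcode 2).
identify = {
    "op": 2,
    "d": {
        "token": "YOUR_DISCORD_BOT_TOKEN",  # placeholder
        "intents": intents,
        "properties": {"os": "linux", "browser": "maicro", "device": "maicro"},
    },
}
```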
```mermaid
flowchart TB
    USER([User]) --> API[FastAPI Service<br/>/api/v1]

    subgraph STARTUP["Startup Phase (Lifespan)"]
        AUDIT[Startup Audit<br/>run_startup_audit]
        HIST[Historical Ingestion<br/>ingest_from_discord]
        LISTEN[Discord Listener<br/>run_discord_listener]
    end

    subgraph RUNTIME["Runtime Services"]
        QA[QA Service<br/>ask_question]
        SEARCH[Hybrid Search<br/>Vector + BM25 + RRF]
        INGEST[Ingestion Pipeline<br/>ingest_documents]
    end

    subgraph EXTERNAL["External Systems"]
        QDRANT[(Qdrant Cloud)]
        GEMINI[[Google Gemini]]
        DISCORD[Discord<br/>REST + Gateway]
        EMBED[Embedding Service]
    end

    API --> QA
    API --> HIST
    QA --> SEARCH
    SEARCH --> QDRANT
    QA --> GEMINI
    LISTEN --> INGEST
    INGEST --> EMBED
    INGEST --> QDRANT
    AUDIT --> QDRANT
    AUDIT --> DISCORD
    HIST --> DISCORD
    QDRANT <--> DISCORD
```
```mermaid
sequenceDiagram
    participant U as User
    participant API as FastAPI
    participant QA as qa_service
    participant HS as Hybrid Search
    participant QD as Qdrant
    participant LLM as Gemini

    U->>API: POST /ask {question}
    API->>QA: ask_question(question)
    rect rgb(30, 50, 60)
        Note over QA: Special query detection
        alt "today" or "last message"
            QA->>QD: scroll(order_by timestamp DESC)
            QD-->>QA: filtered messages
            QA->>LLM: answer from messages
        else Normal query
            QA->>QA: normalize (whats→what is)
            QA->>HS: hybrid_search(query, k=6)
            HS->>QD: vector similarity
            HS->>QD: BM25 keyword match
            HS->>QD: Reciprocal Rank Fusion
            QD-->>HS: ranked documents
            HS-->>QA: top-k documents
            QA->>QA: format context (≤6000 chars)
            QA->>LLM: invoke(prompt + context)
        end
    end
    LLM-->>QA: answer
    QA-->>API: answer
    API-->>U: {question, answer}
```
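The normalization step shown in the diagram can be sketched as a simple rewrite pass. The mapping below is drawn from the examples mentioned in this README ("whats", "wanna", "gonna"); the function name and full rewrite table are illustrative, not mAIcro's actual code.

```python
import re

# Illustrative slang rewrites; mAIcro's real table may be larger.
SLANG = {
    "whats": "what is",
    "wanna": "want to",
    "gonna": "going to",
}

def normalize_question(question: str) -> str:
    """Rewrite known slang tokens before retrieval."""
    def repl(match: re.Match) -> str:
        return SLANG[match.group(0).lower()]
    pattern = r"\b(" + "|".join(SLANG) + r")\b"
    return re.sub(pattern, repl, question, flags=re.IGNORECASE)
```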
On startup, the app runs three tasks in parallel:
- **Audit** (`run_startup_audit`): Fetches the last 200 messages before the cursor from each channel, compares them against Qdrant, deletes any points that no longer exist in Discord (offline deletes), and updates content that was edited offline.
- **Historical Ingest** (`ingest_from_discord`): Fetches all messages after each channel's cursor via the Discord REST API, converts them to LangChain Documents, generates embeddings, and upserts them into Qdrant.
- **Real-Time Listener** (`run_discord_listener`): Connects to the Discord Gateway via WebSocket and listens for `MESSAGE_CREATE`, `MESSAGE_DELETE`, and `MESSAGE_UPDATE` events, ingesting, deleting, or updating Qdrant points accordingly. Automatically reconnects with exponential backoff on disconnect.
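The reconnect behavior of the listener (exponential backoff, plus the jitter mentioned in the features list) can be sketched as follows; the function name, constants, and loop shape are illustrative assumptions, not the project's actual code.

```python
import random

def backoff_delays(base: float = 1.0, cap: float = 60.0, attempts: int = 6) -> list[float]:
    """Exponential backoff with full jitter: delay_n ~ U(0, min(cap, base * 2**n))."""
    return [random.uniform(0.0, min(cap, base * 2 ** n)) for n in range(attempts)]

# Hypothetical usage in a reconnect loop:
# for delay in backoff_delays():
#     try:
#         connect_to_gateway()   # placeholder for the real connect call
#         break
#     except ConnectionError:
#         time.sleep(delay)
```

The jitter spreads reconnect attempts out in time, so many clients disconnected at once do not all retry simultaneously.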
| Method | Path | Description |
|---|---|---|
| `GET` | `/api/v1/health` | Service health check |
| `POST` | `/api/v1/ask` | Answer a question via RAG |
| `POST` | `/api/v1/ingest/discord` | Trigger Discord historical ingestion |
```bash
# Health check
curl http://localhost:8000/api/v1/health
# {"status":"ok","service":"mAIcro","version":"0.1.0","org":"MicroClub","llm_provider":"google"}

# Ask a question
curl -X POST http://localhost:8000/api/v1/ask \
  -H "Content-Type: application/json" \
  -d '{"question":"What are the rules for joining a workshop?"}'
# {"question":"What are the rules for joining a workshop?","answer":"..."}

# Trigger ingestion
curl -X POST http://localhost:8000/api/v1/ingest/discord
# {"status":"ok","documents_ingested":42,"details":{"channels":{"123456789":42},"errors":{}}}
```

```bash
docker compose -f docker-compose.dev.yml up -d
```

Includes mAIcro and a local Qdrant instance for testing without cloud credentials.
The published GHCR image is public; anyone can pull and run it with no authentication needed.
```bash
docker compose up -d
```

This pulls ghcr.io/microclub-usthb/maicro:latest by default. To use a specific tagged image instead:

```bash
MAICRO_IMAGE=ghcr.io/microclub-usthb/maicro:sha-<hash> docker compose up -d
```

Or pull & run directly with Docker:

```bash
docker pull ghcr.io/microclub-usthb/maicro:latest
docker run --env-file .env -p 8000:8000 ghcr.io/microclub-usthb/maicro:latest
```

- Student Clubs: Event info, team opportunities, FAQs, and workshop announcements
- Online Communities: Consolidated announcements and member documentation
- Companies: Internal policies, documentation, and knowledge bases
- NGOs: Instant access to mission-critical organizational information
- Developer Communities: Technical Q&A grounded in shared resources and past discussions
- Agentic AI: Automate workflows, summarize announcements, notify members about events
- Multi-Platform Integration: Notion, Slack, Google Docs, and other knowledge platforms
- Web Dashboard: Visual knowledge browser and query analytics
- Third-Party API: Public API for external tool integrations
Contributions are welcome. Please read CONTRIBUTING.md before submitting PRs.
MIT License. © 2026 Micro Club.
