GRAG

Guarded Retrieval Augmented Generation (GRAG). This repository contains a presentation given at DSS2023 and a demo that illustrates the idea of GRAG.

Abstract

Problem: Large Language Models (LLMs) are at the forefront of natural language processing thanks to their ability to understand and generate human-like text. However, they occasionally produce irrelevant or misleading outputs, and they have an inherent knowledge cutoff, so they may lack the most recent information. Both limitations call for solutions that go beyond fine-tuning alone.

Methodology: Retrieval-Augmented Generation (RAG) and its extension, Guarded Retrieval Augmented Generation (GRAG), were explored as potential solutions. RAG adds an intermediate step that fetches relevant facts from external knowledge bases, grounding LLM outputs in verifiable data. Building on this, GRAG incorporates guardrails - specific controls that guide the model's outputs, such as avoiding politically charged topics or adhering to a set dialogue path. The core of GRAG is to define example queries (utterances) and embed them in a semantic vector space. This allows rapid decisions based on the semantic proximity of a user's query to the predefined utterances.
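
The utterance-matching idea described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code from GRAG.ipynb: the route labels, the example utterances, the `route` function, and the `0.5` threshold are all hypothetical, and the bag-of-words `embed` function is a toy stand-in for a real sentence-embedding model.

```python
import math
import re
from collections import Counter

# Hypothetical guardrail routes: each label maps to example utterances.
ROUTES = {
    "politics": [
        "who should I vote for in the election",
        "what do you think about the president",
    ],
    "faq": [
        "how do I reset my password",
        "what are your opening hours",
    ],
}


def embed(text):
    """Toy bag-of-words vector (assumption: a real system would call an embedding model)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Embed the predefined utterances once, up front.
INDEX = [
    (label, embed(utterance))
    for label, utterances in ROUTES.items()
    for utterance in utterances
]


def route(query, threshold=0.5):
    """Return the label of the closest utterance, or None if nothing is close enough."""
    q = embed(query)
    label, score = max(
        ((lbl, cosine(q, vec)) for lbl, vec in INDEX),
        key=lambda pair: pair[1],
    )
    return label if score >= threshold else None
```

In this sketch, a query routed to `"politics"` could be declined without calling the LLM, a query routed to `"faq"` could continue into retrieval, and `None` would mean no guardrail matched. The decision costs one vector comparison per predefined utterance, which is what makes it fast.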

Conclusions: GRAG offers a faster and more efficient alternative to basic RAG. It mitigates inaccurate or unrelated LLM outputs and yields more precise, targeted results.

Relevance to practitioners and business: As businesses integrate LLMs, GRAG offers a means of ensuring accuracy and relevance in AI outputs across diverse business needs, from customer support to knowledge management.

Installation

This project uses Conda for environment and package management. However, you are free to use your preferred method for installing packages if you are more comfortable with a different approach.

Using Conda

If you choose to use Conda, here are the steps to set up the necessary environment for this project:

1. Ensure that Conda is installed on your system. Conda is an open-source package and environment management system that runs on Windows, macOS, and Linux. If you do not have Conda installed, you can download it from Miniconda (a minimal installer for Conda) or Anaconda (which includes Conda and some additional tools).
2. Open your terminal (or Anaconda Prompt if you are on Windows) and navigate to the directory where the environment.yml file is located.
3. Create a new Conda environment and install all the required packages using the provided environment.yml file by running:
```
conda env create -f environment.yml
```
4. Once the installation is complete, activate the new environment with:
```
conda activate grag
```
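
To confirm from Python (for example, in the first notebook cell) that the expected environment is active, a small check along these lines may help. It is a sketch that assumes the environment was activated by name with `conda activate`, which sets the `CONDA_DEFAULT_ENV` variable; the helper names `active_conda_env` and `check_env` are hypothetical.

```python
import os


def active_conda_env(environ=os.environ):
    """Return the name of the active Conda environment, or None if none is set."""
    return environ.get("CONDA_DEFAULT_ENV")


def check_env(expected="grag", environ=os.environ):
    """True when the active Conda environment matches the expected name."""
    return active_conda_env(environ) == expected


if __name__ == "__main__":
    if check_env():
        print("Conda environment 'grag' is active.")
    else:
        print(f"Active environment: {active_conda_env()!r}; expected 'grag'.")
```

If you installed the packages without Conda, this check does not apply and can be skipped.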

For more information on managing Conda environments, please refer to the official Conda documentation.

Running the Jupyter Notebook

After setting up the environment, you can run the Jupyter notebook GRAG.ipynb to execute the code:

1. Ensure that the Conda environment you created (or your custom environment) is activated.
2. Start Jupyter Notebook by running:
```
jupyter notebook
```
3. In the Jupyter Notebook interface, navigate to the GRAG.ipynb file and open it.
4. Run the cells in the notebook as needed to execute the code.
