Question Answering Application

Requirement

Given a combination of dataset (PDFs, Web pages, Docs), build a system which can answer the question in a generative way with reference to the section of the doc/page.

Question Answering Application

This project implements a question-answering system using Langchain, Chroma, and Ollama. The application takes user queries and searches a document database to provide relevant answers along with the sources.

Installation

Navigate to nlp rag folder.

Create a virtual environment and activate it:

python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`

Install dependencies:
```
pip install -r requirements.txt
```

Architecture

Usage

Populate the Database with PDF docs

To populate the database with documents, use the populate_database.py script. This script will load documents from the data directory, split them into chunks, and store them in a Chroma database.

python populate_database.py --reset  # Use --reset to clear the existing database

Populate the Database with web crawl

To populate the database with web pages, use the populate_database.py script. This script will load documents from the data directory, split them into chunks, and store them in a Chroma database.

python populate_database.py --reset --source 'WEB' --url <url> --limit <limit>
limit is optional.
E.g : python populate_database.py --reset --source 'WEB' --url 'https://www2.gov.bc.ca/gov/content/governments/organizational-structure/ministries-organizations/ministries/children-and-family-development' --limit 100

Query the Database

You can query the database using the query_data.py script. Pass your query as an argument to get an answer based on the context from the documents.

python query_data.py "Your query here"

Streamlit Application

You can also interact with the question-answering system using a Streamlit web interface. Run the app.py script to start the web application.

streamlit run app.py

Components

query_data.py

This script handles querying the database and returning responses based on the context.

get_embedding_function.py

This script provides the embedding function used to embed documents and queries.

populate_database.py

This script handles loading documents, splitting them into chunks, and populating the Chroma database.

app.py

This script implements a Streamlit web application for querying the database interactively.

Running the Application

Populate the Database:

python populate_database.py --reset

Query the Database:

python query_data.py "Your query here"

Run the Streamlit Application:

streamlit run app.py

Contact

Please send an email to [email protected] for a demo or to get know more about this prototype.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
nlp rag		nlp rag
nlp_rag		nlp_rag
notebooks		notebooks
streamlit		streamlit
.DS_Store		.DS_Store
.gitignore		.gitignore
Chromodb_langchain.ipynb		Chromodb_langchain.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Requirement

Question Answering Application

Installation

Architecture

Usage

Populate the Database with PDF docs

Populate the Database with web crawl

Query the Database

Streamlit Application

Components

query_data.py

get_embedding_function.py

populate_database.py

app.py

Running the Application

Populate the Database:

Query the Database:

Run the Streamlit Application:

Contact

About

Releases

Packages

Languages

AOT-Technologies/rag-chatbot

Folders and files

Latest commit

History

Repository files navigation

Requirement

Question Answering Application

Installation

Architecture

Usage

Populate the Database with PDF docs

Populate the Database with web crawl

Query the Database

Streamlit Application

Components

query_data.py

get_embedding_function.py

populate_database.py

app.py

Running the Application

Populate the Database:

Query the Database:

Run the Streamlit Application:

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages