Zettel LLM

Zettel LLM leverage LLMs and vector embeddings to automatically assign labels to notes within a zettelkasten system. For a more detailed guide, check out the Simple Manual.

🛠️ Setup

Clone the Repository

git clone https://github.com/przadka/zettel-llm

Navigate to the Project Directory
```
cd zettel-llm
```
Activate the Virtual Environment (Make sure you have virtualenvwrapper installed)
```
workon zettel-llm
```
Install Dependencies
```
pip install -r requirements.txt
```
Prepare Your Documents:
- Place your documents in the documents directory.
- Currently, the system assumes documents are in CSV format.

💾 Data Setup

Ensure you have the following CSV files in the documents/ directory:

notions.csv: Contains the notions that the system will embed.
train_data.csv: Contains the quotes and associated metadata.

For details on the expected structure of these CSV files, please refer to the Simple Manual.

🚀 Usage

1. Initialize ChromaDB

Run the initialize_chroma_db.py script. This will ensure the presence of the 'zettelkasten' collection in the specified database and begin the querying process.

python initialize_chroma_db.py

⚠️ Note: If you need to re-initialize the Chroma database, you can delete the existing one by executing the following command:

$ rm -rf chroma.db

Initializing a new database incurs a small cost. Always ensure you're aware of any associated expenses before performing this action.

2. Query the Database

Execute the main.py script to search for specific texts within the 'zettelkasten' collection.

python main.py

🌍 Environment Variables

Before using the scripts, configure the necessary environment variables:

OPENAI_API_KEY: Your OpenAI API key, required for the embedding function.

You can either export this variable directly in your shell or use an .env file.

📜 License

This project is open-source and licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.gitignore		.gitignore
README.md		README.md
SIMPLE_MANUAL.md		SIMPLE_MANUAL.md
assign-notions-big-context.py		assign-notions-big-context.py
assign-notions.py		assign-notions.py
assign-titles.py		assign-titles.py
create_jsonl_dataset.py		create_jsonl_dataset.py
data_split.py		data_split.py
evaluation.py		evaluation.py
extract_notions.py		extract_notions.py
guess_zettel_language.py		guess_zettel_language.py
initialize_chroma_db.py		initialize_chroma_db.py
main.py		main.py
prep-baseline-data.py		prep-baseline-data.py
requirements.txt		requirements.txt
title_fine_tuning.py		title_fine_tuning.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Zettel LLM

🛠️ Setup

💾 Data Setup

🚀 Usage

1. Initialize ChromaDB

2. Query the Database

🌍 Environment Variables

📜 License

About

Releases

Packages

Languages

przadka/zettel-llm

Folders and files

Latest commit

History

Repository files navigation

Zettel LLM

🛠️ Setup

💾 Data Setup

🚀 Usage

1. Initialize ChromaDB

2. Query the Database

🌍 Environment Variables

📜 License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages