feat: chat and messaging with openai 4o llm implementation #27

Nyumat · 2024-11-07T10:00:16Z

This PR adds new tables and extensions to the database, updates dependencies, and implements new chat-related features in the backend and frontend. lfg. 🗣️

Database Schema Updates:

Added new tables chats, messages, document_chunks, and embeddings along with necessary foreign key constraints in prisma/migrations/20241106184235_chats_messages/migration.sql.
Updated prisma/schema.prisma to include new models and extensions for vector.

Dependencies Updates:

Updated package.json to add new dependencies like @ai-sdk/openai, @langchain/core, pgvector, and others.

Configuration Changes:

Updated docker-compose.yml to use ankane/pgvector:latest image and added a new volume for initialization scripts.
Updated next.config.ts to include api.dicebear.com in the images.domains list.

Backend Features:

Implemented a new API route src/app/api/chat/route.ts for handling chat messages, including user authentication, message processing, and interaction with OpenAI's 4o model.

Note

This is just the first step of the implementation. We still need to integrate the Pinecone vector DB work from #19 into the system.

Once integrated, the system will take a user query, compare its embedding against all course material documents, and return the single most relevant document (K=1) to the LLM, which will stream its response to the Next.js client.
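A minimal in-memory sketch of the retrieval step described in the note, assuming cosine similarity as the comparison metric. The `EmbeddedDoc` shape and the `cosineSimilarity` and `topK` helpers are hypothetical names used only for illustration; in the planned setup the vector search would be delegated to Pinecone rather than done by hand like this.

```typescript
// Hypothetical shape for an embedded course document; not taken from the PR.
interface EmbeddedDoc {
  id: string;
  text: string;
  embedding: number[];
}

// Cosine similarity between two equal-length vectors.
// Returns 0 when either vector has zero magnitude.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Embedding dimensions do not match");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((x, i) => {
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  });
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the k documents most similar to the query embedding.
// k defaults to 1, matching the K=1 plan in the note.
function topK(query: number[], docs: EmbeddedDoc[], k = 1): EmbeddedDoc[] {
  return docs
    .map((doc) => ({ doc, score: cosineSimilarity(query, doc.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((entry) => entry.doc);
}
```

The text of the returned document would then be passed to the LLM as context alongside the user's message. Scanning every document like this is linear in the corpus size, which is one reason to hand the search to a vector database.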

Frontend Features:

Added a new ChatArea component in src/app/chat/(render)/chat-area.tsx to handle chat UI, message input, and image attachments.

Nyumat · 2024-11-07T10:19:49Z

Going to secure the code real quick. We need protected routes to avoid exposing resources to non-OSU students.
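One possible way to gate access, sketched under the assumption that students are identified by the domain of their sign-in email. The `ALLOWED_EMAIL_DOMAINS` list, the `example.edu` placeholder, and the `isAllowedEmail` helper are all hypothetical; the thread does not say how the protected routes actually decide who is a student.

```typescript
// Hypothetical allow-list; the real domain(s) are not stated in this thread.
const ALLOWED_EMAIL_DOMAINS: string[] = ["example.edu"];

// Returns true only when the email's domain exactly matches an allowed domain.
// An exact match (rather than endsWith) rejects look-alikes such as
// "user@example.edu.attacker.com".
function isAllowedEmail(
  email: string | null | undefined,
  allowed: string[] = ALLOWED_EMAIL_DOMAINS,
): boolean {
  if (!email) return false;
  const at = email.lastIndexOf("@");
  // Require a non-empty local part and a non-empty domain.
  if (at <= 0 || at === email.length - 1) return false;
  const domain = email.slice(at + 1).toLowerCase();
  return allowed.some((d) => d.toLowerCase() === domain);
}
```

A protected layout or route handler could call a check like this on the session's email and redirect or return a 401 when it fails.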

cshafizadeh · 2024-11-08T02:10:37Z

@Nyumat out of curiosity, are we storing the document chunks and the embeddings in a SQL database as well as in pinecone?

Nyumat · 2024-11-08T05:21:44Z

Good question. I was initially thinking that if we stored our chunks and embeddings as vectors alongside our core data model, we'd have tighter integration for more complex use cases, like those outlined in the planning doc from week 2:

Some of the stuff in the doc:

upvote downvote ai responses

ai generated quizzes based on course content

top notes from rubric, compare assignments to the rubric

But after thinking about the work you did, along with just how complex it'd be to hack Prisma into supporting vectors, I think it's best to center our RAG approach around Pinecone.

I'll update the data model to go back to what it was before. Thanks for asking that question and prompting me to take a second look.

cshafizadeh · 2024-11-08T07:08:32Z

Yeah, no problem. I'm not against storing the documents in a second DB. It would let us easily upload the documents to Pinecone if they are already broken into chunks, and it would serve as a backup in case something happened to the vector DB or we wanted to switch platforms. I'll try to take a look over the weekend at the updated UI you built and look into integrating the embeddings into sending messages. I looked at the UI from the last push and it looks great!

Nyumat self-assigned this Nov 7, 2024

Nyumat linked an issue Nov 7, 2024 that may be closed by this pull request

Design the Application Data Model #1

Closed

Nyumat added 15 commits November 7, 2024 02:03

feat: chat list, welcome msg, page

242637f

db: pgvector, 1536 embeddings, and chat models

2eaa930

cfg: add dicebear image domain

1120ac3

css: osu color and svg uri

ff0ea2c

ui: not found page

6fbac8f

assets: add logo

b50097e

api: send chat - POST

c8120e1

security: protected routing - chat layout

3393e29

db: create and delete chats

cbfb2eb

ui: chat area with attachment support

fa2a304

ui: initial chat flow and dialog

d76f805

ui: framer-motion on files and upload

245dbef

auth: use colored google icon

4ee95f6

perf: use next-image pkg for logo

13223c9

ui: better PDF previews and deletion ux

751634d

Nyumat force-pushed the nyumat/full-stack-chats branch from bdabd97 to 751634d Compare November 7, 2024 10:03

security: add protection against unauthorized access vectors

1fb4b86

Nyumat added 7 commits November 8, 2024 05:34

db: add file visibility remove chunk/embedding

b9e82b2

deps: add seed script and tsx

0014c49

perf: lazy url previews

2f7df48

feat: file visibility options & move upload to dialog

0dd6aad

perf: prop drilling -> useContext

51622c3

refactor: make PDF upload dialog reusable

fd2488a

ui: style upload trigger in stats

73101fa

refactor: upload links automatically open dialog

b6d0075

cshafizadeh approved these changes Nov 8, 2024

Nyumat merged commit d6badff into main Nov 9, 2024
1 check passed

Nyumat deleted the nyumat/full-stack-chats branch November 9, 2024 03:29
