Welcome to the Chat-PDF website project! This platform allows users to interact with their PDF documents through a chat interface. The project leverages various technologies to provide a seamless and efficient user experience.
To set up the project, follow these steps:
-
Copy the provided
.env.example
file to.env
. -
Generate API keys for the following services:
- Clerk
- PineconeDB
- GeminiAPI
- NeonDB
- AWS
-
Set up your Next.js project by running the following commands:
# Install dependencies
npm install
npm run dev
Open http://localhost:3000 with your browser to see the result.
PDFs are securely stored on AWS S3, ensuring seamless access and reliability.
Metadata extracted from the documents is intelligently organized using Pinecone DB. This enables efficient retrieval through vector search, making the search process faster and more accurate.
Vector search is a method based on representing documents as vectors in a high-dimensional space. Similar documents are close to each other in this space, enabling efficient and accurate search, even with a large number of documents.
The robust backend relies on NeonDB, managed via Drizzle ORM. This ensures seamless handling of user and chat data.
The power of conversation and text embedding is provided by the Gemini-AI API. A custom prompt is used to make conversations more natural, dynamic, and accurate.