Application of Q&A and LLM for Information Extraction🤖

This project aims to build a robust system for extracting and analyzing information using advanced techniques such as Natural Language Processing (NLP), sentiment analysis, and numerical data comparison. It includes tools to preprocess text, compare sentence similarities, extract contextual keywords, and analyze sentiment for comprehensive information extraction.

💡Features

Text Preprocessing:
- Tokenization
- Lemmatization
- Stopword removal
Sentence Similarity:
- Matrix generation for sentence similarity between two texts.
Keyword Extraction:
- Identifies major attributes in sentences for context comparison.
Numerical Data Analysis:
- Extracts and compares numerical values between sentences.
Sentiment Analysis:
- Analyzes and compares the sentiment of sentences.
Comprehensive Evaluation:
- Generates a similarity matrix and a final decision matrix to evaluate text.

🔧Technologies Used

Python Libraries:
- spacy: For NLP and text preprocessing.
- scikit-learn: For calculating cosine similarity.
- nltk: For sentiment analysis.
- numpy: For matrix operations.
- pandas: For data handling.

🛠️ Setup and Installation

Clone the repository:

git clone https://github.com/akarshi19/Application-of-Q-A-and-LLM-for-Information-Extraction.git
cd Application-of-Q-A-and-LLM-for-Information-Extraction

Create a virtual environment and activate it:

python -m venv env
source env/bin/activate  # On Windows: env\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Download the required spacy model:
```
python -m spacy download en_core_web_sm
```
Ensure NLTK is fully downloaded:
```
import nltk
nltk.download('all')
```

⚙️How It Works

Preprocess Text:
- Tokenizes, lemmatizes, and removes stopwords to prepare the text for analysis.
Compare Sentences:
- Computes sentence similarities using TF-IDF vectorization and cosine similarity.
Extract Context:
- Identifies common keywords to determine the context of similar sentences.
Numerical and Sentiment Analysis:
- Extracts numerical values for comparison.
- Uses sentiment analysis to evaluate and compare sentence sentiment.
Decision Matrix:
- Generates a final matrix that consolidates the results of the analysis.

🖥️Usage

Run the provided script by defining two input texts (e.g., historical or comparative data). Below is a snippet to execute the main function:

from main import main

text1 = '''Your first text input.'''
text2 = '''Your second text input.'''

main(text1, text2)

📝Examples

Example 1: Comparing two education-related texts:

Input:

Text 1: Modi increased the education budget by 20%...
Text 2: Gandhi introduced a new Education budget...

Output:

Final_Matrix:-
[           Text1      Text2      Output     ]
[budget     20.0       10.0       1          ]
[school     25.0       100.0      1          ]
...
Entity in Text 1 is better than Entity in Text 2

Example 2: Comparing personal traits:

Input:

Text 1: Raman is a good boy...
Text 2: Pathak is a bad boy...

Output:

Entity in Text 1 is equal to Entity in Text 2

📣License

This project is licensed under the MIT License. See the LICENSE file for details.

Feel free to contribute to this project or raise issues for improvement! Reach out if you have any questions or suggestions.

Happy Coding!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Code.py		Code.py
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Application of Q&A and LLM for Information Extraction🤖

📚Table of Contents

💡Features

🔧Technologies Used

🛠️ Setup and Installation

⚙️How It Works

🖥️Usage

📝Examples

📣License

About

Releases

Packages

Languages

akarshi19/Application-of-Q-A-and-LLM-for-Information-Extraction

Folders and files

Latest commit

History

Repository files navigation

Application of Q&A and LLM for Information Extraction🤖

📚Table of Contents

💡Features

🔧Technologies Used

🛠️ Setup and Installation

⚙️How It Works

🖥️Usage

📝Examples

📣License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages