VoiceGPT

AI Assistant with Speech Interaction

A simple script to talk with GPT using OpenAI and Azure Speech Services

This GitHub repository contains a Python program that implements an AI assistant capable of interactive conversations with users using both text and speech. The assistant is powered by OpenAI's GPT-3.5 model for natural language processing and Azure Cognitive Services for speech recognition and text-to-speech synthesis.

Features

Engaging and helpful AI assistant with natural language understanding.
Recognizes speech input from the user and converts it into text.
Generates contextually relevant and engaging responses using GPT-3.5. (or any model you decide)
Converts AI responses into spoken output using text-to-speech synthesis using Azure.
Keeps the conversation history limited to the last 8 exchanges. This can be changed.
Funny and entertaining responses to create a positive user experience.

Prerequisites

To run the AI assistant, you will need the following:

Python :-)
Azure (for speech recognition and generation) and OpenAI (for chat completions) accounts
Environment variables set up for the necessary API keys:
- OPENAI_API_KEY: API key for OpenAI GPT-3.5 API.
- COG_SERVICE_KEY: API key for Azure Cognitive Services.
- COG_SERVICE_REGION: Region for Azure Cognitive Services.

Setup

Clone this repository to your local machine.
Install the required Python packages by running pip install -r requirements.txt.
Create a .env file in the root directory of the project and set the environment variables mentioned in the prerequisites.

Usage

Ensure your microphone is working correctly.
Run the program by executing talk_gpt.py.
Speak when prompted, and the assistant will respond to your input.

Configuration

You can modify the conversation history limit by changing the -8 value in the get_response_from_gpt(prompt) function within main.py.

Notes

The speech recognition and text-to-speech synthesis depend on Azure Cognitive Services. Please ensure you have an active subscription and the appropriate keys and region set in the environment variables.

Acknowledgments

This program uses OpenAI's GPT-3.5 model to generate conversational responses. Visit OpenAI's website to learn more about their APIs and services.
The speech recognition and text-to-speech synthesis are powered by Azure Cognitive Services. Check out Microsoft's Azure Cognitive Services for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.vscode		.vscode
img		img
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
talk_gpt.py		talk_gpt.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoiceGPT

AI Assistant with Speech Interaction

A simple script to talk with GPT using OpenAI and Azure Speech Services

Features

Prerequisites

Setup

Usage

Configuration

Notes

Acknowledgments

About

Releases

Packages

Languages

huolter/VoiceGPT

Folders and files

Latest commit

History

Repository files navigation

VoiceGPT

AI Assistant with Speech Interaction

A simple script to talk with GPT using OpenAI and Azure Speech Services

Features

Prerequisites

Setup

Usage

Configuration

Notes

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages