---
output: github_document
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# gptstudio <a href="https://michelnivard.github.io/gptstudio/"><img src="man/figures/logo.png" align="right" height="150"/></a>
<!-- badges: start -->
[![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)
[![CRAN status](https://www.r-pkg.org/badges/version/gptstudio)](https://CRAN.R-project.org/package=gptstudio)
[![Codecov test coverage](https://codecov.io/gh/MichelNivard/gptstudio/branch/main/graph/badge.svg)](https://app.codecov.io/gh/MichelNivard/gptstudio?branch=main)
[![R-CMD-check](https://github.com/MichelNivard/gptstudio/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/MichelNivard/gptstudio/actions/workflows/R-CMD-check.yaml)
[![CRAN RStudio mirror downloads](https://cranlogs.r-pkg.org/badges/gptstudio)](https://www.r-pkg.org/pkg/gptstudio)
[![CRAN RStudio mirror total downloads](https://cranlogs.r-pkg.org/badges/grand-total/gptstudio)](https://www.r-pkg.org/pkg/gptstudio)
<!-- badges: end -->
The goal of gptstudio is to help R programmers incorporate large language models (LLMs) into their project workflows. These models appear to be a step change in how we use text for knowledge work, but you should carefully consider the ethical implications of using them.
For further addins tailored to R developers, see the sister package [gpttools](https://jameshwade.github.io/gpttools/).
## Getting Started: Installation & AI Service Setup
Install the released version of gptstudio from CRAN:
```{r}
#| eval: false
install.packages("gptstudio")
```
To get a recent bug fix or a feature that has not yet been released, install the development version from GitHub:
```{r}
#| eval: false
# install.packages("pak")
pak::pak("MichelNivard/gptstudio")
```
### Available AI Services and Models
| AI Service | Models | Documentation | Setup |
| ---- | ---- | ---- | ---- |
| [OpenAI](https://platform.openai.com) | gpt-4-turbo, gpt-4, gpt-3.5-turbo ([latest models](https://platform.openai.com/docs/models)) | [OpenAI API Docs](https://platform.openai.com/docs/api-reference) | [OpenAI Setup](https://michelnivard.github.io/gptstudio/articles/openai.html) |
| [HuggingFace](https://huggingface.co/) | various | [HF Inference API Docs](https://huggingface.co/docs/hub/models-inference) | [HF Setup](https://michelnivard.github.io/gptstudio/articles/huggingface.html) |
| [Anthropic](https://docs.anthropic.com/claude/docs/guide-to-anthropics-prompt-engineering-resources) | claude-3.5, claude-2.1, claude-instant-1.2 | [Anthropic API Docs](https://docs.anthropic.com/claude/reference/getting-started-with-the-api) | [Anthropic Setup](https://michelnivard.github.io/gptstudio/articles/anthropic.html) |
| [Ollama](https://ollama.com/) | mistral, llama2, mixtral, phi ([latest models](https://ollama.com/library)) | [Ollama API Docs](https://github.com/ollama/ollama/blob/main/docs/api.md) | [Ollama Setup](https://michelnivard.github.io/gptstudio/articles/ollama.html) |
| [Perplexity](https://www.perplexity.ai) | pplx-7b-chat, pplx-70b-chat, pplx-7b-online, pplx-70b-online, llama-2-70b-chat, codellama-34b-instruct, mistral-7b-instruct, and mixtral-8x7b-instruct | [Perplexity API Docs](https://docs.perplexity.ai/reference/post_chat_completions) | [Perplexity Setup](https://michelnivard.github.io/gptstudio/articles/perplexity.html) |
| [Google AI Studio](https://ai.google.dev/tutorials/ai-studio_quickstart) | Gemini and PaLM (legacy) | [Google AI Studio Docs](https://ai.google.dev/docs) | [Google AI Studio Setup](https://michelnivard.github.io/gptstudio/articles/google.html) |
| [Azure OpenAI](https://learn.microsoft.com/en-us/azure/ai-services/openai/overview) | gpt-4, gpt-3.5-turbo ([latest models](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models#gpt-4-and-gpt-4-turbo-preview)) | [Azure OpenAI API Docs](https://learn.microsoft.com/en-us/azure/ai-services/openai/quickstart?tabs=command-line,python&pivots=rest-api) | [Azure OpenAI Setup](https://michelnivard.github.io/gptstudio/articles/azure.html) |
| [Cohere](https://cohere.com/) | command, command-light, command-nightly, command-light-nightly | [Cohere API Docs](https://docs.cohere.com/) | [Cohere Setup](https://michelnivard.github.io/gptstudio/articles/cohere.html) |
### Default AI Service: OpenAI
To get started, first set up access to an AI service. gptstudio works with several AI service providers, so you can choose the one that fits your needs. The default configuration uses OpenAI. To use it:
1. Create an OpenAI account. [Sign up here](https://openai.com/api/).
2. [Create an OpenAI API key](https://platform.openai.com/account/api-keys) to use with the package.
3. Set up the API key in RStudio. See the section below on configuring the API key.
#### Configuring OpenAI API Key
To interact with the OpenAI API, you need a valid `OPENAI_API_KEY` environment variable.
You can set this variable persistently by adding it to your .Renviron file. This ensures the variable persists across all R sessions, which matters because the Shiny app runs in the background.
The following command opens the .Renviron file for editing:
```{r}
#| eval: false
# Opens the user-level .Renviron; pass scope = "project" for a project-specific file
usethis::edit_r_environ()
```
To make the key load in every new session, add the following line to .Renviron, replacing `<APIKEY>` with your actual API key, then restart R:
``` bash
OPENAI_API_KEY="<APIKEY>"
```
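After restarting R, it can be useful to confirm that the variable is actually visible to the session. The following is a minimal sketch using only base R; `has_api_key()` is a hypothetical helper written for this example and is not part of gptstudio.

```{r}
#| eval: false
# Hypothetical helper (not part of gptstudio):
# TRUE if the environment variable is set and non-empty.
has_api_key <- function(var = "OPENAI_API_KEY") {
  nzchar(Sys.getenv(var))
}

if (!has_api_key()) {
  message("OPENAI_API_KEY is not set; add it to .Renviron and restart R.")
}
```

The check only tells you that a value is present, not that the key is valid for the API.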
**Caution:** If your project is under version control (for example, Git with GitHub or GitLab), add .Renviron to your .gitignore file so that your API key is not exposed!
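One way to do this from R is sketched below with base functions only. `add_to_gitignore()` is a hypothetical helper for illustration, not a gptstudio function; it appends the entry only when it is not already listed. If you already use usethis, `usethis::use_git_ignore(".Renviron")` serves the same purpose.

```{r}
#| eval: false
# Hypothetical helper (not part of gptstudio):
# append `entry` to a .gitignore file unless it is already listed.
add_to_gitignore <- function(entry = ".Renviron", path = ".gitignore") {
  existing <- if (file.exists(path)) readLines(path, warn = FALSE) else character()
  if (!entry %in% trimws(existing)) {
    writeLines(c(existing, entry), path)
  }
  invisible(path)
}

# Run from the project root:
# add_to_gitignore()
```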
**Important Note:** The OpenAI API will not function unless valid payment details are entered in your OpenAI account. This restriction is imposed by OpenAI and is unrelated to this package.
### Alternative AI Service Providers
While OpenAI is the default and currently considered one of the most robust options, `gptstudio` is also compatible with other AI service providers. These include [Anthropic](https://michelnivard.github.io/gptstudio/articles/anthropic.html), [HuggingFace](https://michelnivard.github.io/gptstudio/articles/huggingface.html), [Google AI Studio](https://michelnivard.github.io/gptstudio/articles/google.html), [Azure OpenAI](https://michelnivard.github.io/gptstudio/articles/azure.html), [Perplexity](https://michelnivard.github.io/gptstudio/articles/perplexity.html), and [Cohere](https://michelnivard.github.io/gptstudio/articles/cohere.html). You can select any of these providers based on your preference or specific requirements. You can also run local models with [Ollama](https://michelnivard.github.io/gptstudio/articles/ollama.html). This requires more setup, but has the benefit that your data is not shared with any third party.
To use an alternative provider, obtain the relevant API key or access credentials from that provider and configure them in the same way as the OpenAI key, following the provider's setup article linked above.
## Privacy Notice for gptstudio
This privacy notice applies to the gptstudio R package, which uses popular language models such as gpt-4-turbo and claude-3.5. By using this package, you agree to adhere to the privacy terms and conditions set by the API service you select.
### Data Sharing with AI Services
When using this R package, any text or code you highlight/select with your cursor, or the prompt you enter within the built-in applications, will be sent to the selected AI service provider (e.g., OpenAI, Anthropic, HuggingFace, Google AI Studio, Azure OpenAI) as part of an API request. This data sharing is governed by the privacy notice, rules, and exceptions that you agreed to with the respective service provider when creating an account.
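To make concrete what "sent as part of an API request" means, here is a sketch of a request body in the shape accepted by OpenAI's chat completions endpoint. It is illustrative only: the variable names and the way the prompt and the selection are combined are assumptions made for this example, and gptstudio's actual request construction may differ.

```{r}
#| eval: false
# Illustrative only: this is not gptstudio's internal code.
selected_text <- "x <- c(1, 2, NA); mean(x)"  # stands in for highlighted code
user_prompt <- "Explain why this returns NA."

# Everything placed in `messages` leaves your machine when the request is sent.
request_body <- list(
  model = "gpt-4-turbo",
  messages = list(
    list(
      role = "user",
      content = paste(user_prompt, selected_text, sep = "\n\n")
    )
  )
)

str(request_body)
```

The point of the sketch is that the highlighted text travels verbatim inside the message content, so anything in the selection is visible to the provider.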
### Security and Data Usage by AI Service Providers
We cannot guarantee the security of the data you send via the API to any AI service provider, nor can we provide details on how each service processes or uses your data. However, these providers often state that they use prompts and results to enhance their AI models, as outlined in their terms of use. Be sure to review the terms of use of the respective AI service provider directly.
### Limiting Data Sharing
gptstudio is designed to share only the text or code that you specifically highlight/select or include in a prompt through its built-in applications. No other elements of your R environment are shared unless you explicitly turn on a feature that shares them. It is your responsibility to ensure that you do not accidentally share sensitive data with any AI service provider.
**IMPORTANT: To maintain the privacy of your data, do not highlight, include in a prompt, or otherwise upload any sensitive data, code, or text that should remain confidential.**
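As one possible safeguard, you could mask obvious secrets in a snippet before you highlight it or paste it into a prompt. The sketch below uses base R only; `redact_sensitive()` is a hypothetical helper, not a gptstudio feature, and its two patterns (strings that look like `sk-` style keys, and email addresses) are assumptions that will not catch every kind of sensitive data.

```{r}
#| eval: false
# Hypothetical helper (not part of gptstudio): mask a few obvious secrets.
# The patterns are illustrative assumptions and are far from exhaustive.
redact_sensitive <- function(text) {
  # Strings that look like "sk-..." style API keys
  text <- gsub("sk-[A-Za-z0-9_-]{8,}", "<REDACTED_KEY>", text)
  # Email addresses
  text <- gsub(
    "[[:alnum:]._%+-]+@[[:alnum:].-]+\\.[[:alpha:]]{2,}",
    "<REDACTED_EMAIL>",
    text
  )
  text
}

redact_sensitive("key sk-abcdefgh12345 from a@b.org")
#> [1] "key <REDACTED_KEY> from <REDACTED_EMAIL>"
```

Pattern-based masking is a convenience, not a guarantee; reviewing the selection yourself remains the reliable check.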
## Code of Conduct
Please note that the gptstudio project is released with a [Contributor Code of Conduct](https://github.com/MichelNivard/gptstudio/blob/main/.github/CODE_OF_CONDUCT.md). By contributing to this project, you agree to abide by its terms.