Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,2 @@ | ||
inst/doc | ||
.Rproj.user |
Original file line number | Diff line number | Diff line change |
---|---|---|
 |  | @@ -8,8 +8,8 @@ Authors@R: c(person("Nicolas", "Proellochs", email="[email protected] |
8 | 8 | person("Stefan", "Feuerriegel", email="[email protected]", |
9 | 9 | role=c("aut"))) |
10 | 10 | Maintainer: Nicolas Proellochs <[email protected]> |
11 |  | - Description: Performs model-free reinforcement learning in R. This implementation allows to learn |
12 |  | - an optimal policy based on sample sequences consisting of states, actions and rewards. In |
 | 11 | + Description: Performs model-free reinforcement learning in R. This implementation enables the learning |
 | 12 | + of an optimal policy based on sample sequences consisting of states, actions and rewards. In |
13 | 13 | addition, it supplies multiple predefined reinforcement learning algorithms, such as experience |
14 | 14 | replay. |
15 | 15 | License: MIT + file LICENSE |
Original file line number | Diff line number | Diff line change |
---|---|---|
 |  | @@ -1,14 +1,14 @@ |
1 |  | - #' Game states of 100,000 randomly sampled Tic Tac Toe games. |
 | 1 | + #' Game states of 100,000 randomly sampled Tic-Tac-Toe games. |
2 | 2 | #' |
3 |  | - #' A dataset containing 406,541 games states of Tic Tac Toe. |
 | 3 | + #' A dataset containing 406,541 game states of Tic-Tac-Toe. |
4 | 4 | #' The player who succeeds in placing three of their marks in a horizontal, vertical, or diagonal row wins the game. |
5 |  | - #' All states are observed from the perspective of player X who is also assumed to have played first. |
 | 5 | + #' All states are observed from the perspective of player X, who is also assumed to have played first. |
6 | 6 | #' |
7 | 7 | #' @format A data frame with 406,541 rows and 4 variables: |
8 | 8 | #' \describe{ |
9 | 9 | #' \item{State}{The current game state, i.e. the state of the 3x3 grid.} |
10 | 10 | #' \item{Action}{The move of player X in the current game state.} |
11 |  | - #' \item{NextState}{The next observed state after action selection of player X and B.} |
 | 11 | + #' \item{NextState}{The next observed state after action selection of players X and B.} |
12 | 12 | #' \item{Reward}{Indicates terminal and non-terminal game states. Reward is +1 for 'win', 0 for 'draw', and -1 for 'loss'.} |
13 | 13 | #' } |
14 | 14 | "tictactoe" |
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.