Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updated to-pdf script to explicitly use APA citation style #211

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions bin/to-pdf
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,9 @@ run_pandoc() {
--metadata title='Foundations of Reinforcement Learning with Applications in Finance' \
--metadata author='Ashwin Rao, Tikhon Jelvis' \
--metadata reference-section-title='Bibliography' \
--metadata biblatex \
--metadata biblio-style='apa' \
--metadata biblatexoptions='backend=biber' \
--filter pandoc-crossref \
--pdf-engine xelatex \
--template ../templates/latex.template \
Expand Down
2 changes: 1 addition & 1 deletion book/chapter0/chapter0.md
Original file line number Diff line number Diff line change
Expand Up @@ -61,7 +61,7 @@ This jargon overload is due to the confluence of terms from Control Theory (emer

It is also important to recognize that Reinforcement Learning is considered to be a branch of Machine Learning. While there is no crisp definition for *Machine Learning* (ML), ML generally refers to the broad set of techniques to infer mathematical models/functions by acquiring ("learning") knowledge of patterns and properties in the presented data. In this regard, Reinforcement Learning does fit this definition. However, unlike the other branches of ML (Supervised Learning and Unsupervised Learning), Reinforcement Learning is a lot more ambitious - it not only learns the patterns and properties of the presented data, it also learns about the appropriate behaviors to be exercised (appropriate decisions to be made) so as to drive towards the optimization objective. It is sometimes said that Supervised Learning and Unsupervised learning are about "minimization" (i.e., they minimize the fitting error of a model to the presented data), while Reinforcement Learning is about "maximization" (i.e., RL identifies the suitable decisions to be made to maximize a well-defined objective). Figure \ref{fig:ml_branches} depicts the in-vogue classification of Machine Learning.

![Branches of Machine Learning \label{fig:ml_branches}](./chapter0/BranchesofML.jpg "Branches of Machine Learning")
![Branches of Machine Learning \label{fig:ml_branches}](./chapter0/BranchesOfML.jpg "Branches of Machine Learning")

More importantly, the class of problems RL aims to solve can be described with a simple yet powerful mathematical framework known as *Markov Decision Processes* (abbreviated as MDPs). We have an entire chapter dedicated to deep coverage of MDPs, but we provide a quick high-level introduction to MDPs in the next section.

Expand Down
7 changes: 5 additions & 2 deletions default.nix
Original file line number Diff line number Diff line change
Expand Up @@ -7,10 +7,13 @@ let
tex-packages = {
inherit (pkgs.texlive)
scheme-medium
biber
biblatex
biblatex-apa
footmisc
noto
titling
xpatch
noto;
xpatch;
};

python-packages = ps: with ps;
Expand Down
2 changes: 1 addition & 1 deletion templates/latex.template
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@ $endif$
% The `babel` package, among other things, lets you determine what
% language you are using in a given stretch of text, so that typesetting
% will go well. Here we specify that mostly, we are speaking English:
\usepackage[english]{babel}
\usepackage[american]{babel}

\providecommand{\tightlist}{%
\setlength{\itemsep}{0pt}\setlength{\parskip}{0pt}}
Expand Down