-
Notifications
You must be signed in to change notification settings - Fork 2
/
index.Rmd
76 lines (60 loc) · 2.97 KB
/
index.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
---
title: "Data Analysis and Visualization in R (IN2339)"
author: |
| Chair of Computational Molecular Medicine
| Technical University of Munich
date: "`r Sys.Date()`"
output:
html_document:
df_print: paged
documentclass: krantz
bibliography:
- book.bib
- packages.bib
biblio-style: apalike
link-citations: yes
colorlinks: yes
lot: no
lof: no
graphics: yes
urlcolor: blue
geometry: left=1.5in, right=1.5in, top=1.25in, bottom=1.25in
description: This book introduces concepts and skills that can help you tackle
real-world data analysis challenges. It covers concepts from probability, statistical
inference, linear regression and machine learning and helps you develop skills such
as R programming, data wrangling with data.table, data visualization with ggplot2, file
organization with UNIX/Linux shell, version control with GitHub, and reproducible
document preparation with R markdown.
site: bookdown::bookdown_site
header-includes: \usepackage{float} \floatplacement{figure}{H}
always_allow_html: yes
subtitle: A practical introduction to Data Science
---
```{r include=FALSE}
# automatically create a bib database for R packages
knitr::write_bib(c(
.packages(), 'bookdown', 'knitr', 'rmarkdown'), 'packages.bib')
```
# Preface {-}
This is the lecture script of the module Data Analysis and Visualization in R (IN2339).
This work is licensed under the [Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)](https://creativecommons.org/licenses/by-nc-sa/4.0)
# Acknowledgments {-}
This script has been first put together in the winter semester 2020/2021 by Felix Brechtmann, Alexander Karollus, Daniela Klaproth-Andrade, Pedro Silva, and Julien Gagneur with help from Xueqi Cao, Laura Martens, Ines Scheller, Vangelis Theodorakis, and Vicente Yépez.
We leveraged work from colleagues who helped creating lecture slides since 2017: Žiga Avsec, Ines Assum, Daniel Bader, Jun Cheng, Bašak Eraslan, Mathias Heinig, Jan Krumsieck, Christian Mertes, and Georg Stricker.
# Prerequisites {-}
Basics in probabilities are required. Chapters 13-15 ("Introduction to Statistics with R", "Probability" and "Random variables") of the Book "Introduction to Data Science" https://rafalab.github.io/dsbook/ make a good refresher. Make sure all concepts are familiar to you. Check your knowledge by trying the exercises.
# Datasets {.unnumbered}
Datasets used in this script are available to download as a compressed file [here](http://shorturl.at/otwxO).
# Feedback {-}
For improvement suggestions, reporting errors and typos, please use the online document [here](https://docs.google.com/document/d/1tdWmtkDaNUnQLkG_cNfJ2KLHwnaldtQwZh6pUQh8Mpc/edit?usp=sharing).
<!-- Latex commands -->
\newcommand{\bs}{\boldsymbol}
\newcommand{\m}{\mathbf}
\newcommand{\w}{\m w}
\newcommand{\x}{\m x}
\newcommand{\y}{\m y}
\newcommand{\X}{\m X}
\newcommand{\I}{\m I}
\newcommand{\E}{\operatorname{E}}
\newcommand{\Var}{\operatorname{Var}}
\newcommand{\Cov}{\operatorname{Cov}}