Skip to content

Files

Latest commit

Jan 23, 2019
70c1f4c · Jan 23, 2019

History

History
69 lines (55 loc) · 2.65 KB

index.md

File metadata and controls

69 lines (55 loc) · 2.65 KB

Introduction to Deep Learning

STAT 157, UC Berkeley, Spring, 2019

Practical information

Instructors Alex Smola and Mu Li
TAs Ryan Theisen
Lectures Every Tuesday/Thursday 3:30pm - 5:00pm
Location LeConte 3
Instructor Office Hours Thursday, 1:00 - 3:00pm
Location 337 Evans Hall
Contact instructors and TAs berkeley-stat-157@googlegroups.com
Course discussion https://discuss.mxnet.io/c/courses
Grading Policy Homework 30%, Midterm 20%, Project 50%

News

  • 1/22: Updated course information, uploaded slides for today's lecture and homework 1.

Overview

This class provides a practical introduction to deep learning, including theoretical motivations and how to implement it in practice. As part of the course we will cover multilayer perceptrons, backpropagation, automatic differentiation, and stochastic gradient descent. Moreover, we introduce convolutional networks for image processing, starting from the simple LeNet to more recent architectures such as ResNet for highly accurate models. Secondly, we discuss sequence models and recurrent networks, such as LSTMs, GRU, and the attention mechanism. Throughout the course we emphasize efficient implementation, optimization and scalability, e.g. to multiple GPUs and to multiple machines. The goal of the course is to provide both a good understanding and good ability to build modern nonparametric estimators. The entire course is based on Jupyter notebooks to allow students to gain experience quickly. Supporting material can be found at https://d2l.ai.

Prerequisites

Programming in Python (CS 61a or CS/STAT C8 and CS 88), Linear Algebra (MATH 54, STAT 89A, or EE 16A), Probability (STAT 134, STAT 140, or EE 126), and Statistics (STAT 20, STAT 135, or CS/STAT C100) are highly desirable. Eqivalent knowledge is fine, and we will try to make the class as self-contained as possible. This is a class where you need to get your hands dirty with programming.

Course Format

The course consists of 2 units of 90 minutes, taught by the instructors, plus office hours by the instructors and TAs. Evaluation is based on a midterm exam (20%), homework (30%), and a research project (50%) which will be presented in lieu of an end-of-course exam. As part of the course you will be performing work that's similar to research leading up to a paper.


.. toctree::
   :maxdepth: 1
   :hidden:

   syllabus
   calendar
   homework
   project
   faq
   units/index