SBWL 1: Data Processing 1 (PI2.0)

Summer term 2019
Axel Polleres, Stefan Sobernig


Table of contents

Schedule
Organisational
Unit details
Jupyter Notebook
Supplemental Reading

Syllabus

Overall, students shall gain fundamental knowledge in this course for dealing with different data formats and for using methods and tools to integrate data from various sources.

Schedule



Unit  Date                          Room              Topic
1     Tue 05.03.2019 11:00 – 15:00  TC.1.01 OeNB      Course introduction
2     Tue 12.03.2019 10:00 – 14:00  TC.0.01 ERSTE     Data access
3     Tue 19.03.2019 10:00 – 14:00  TC.0.02 Red Bull  Data processing (basics)
4     Mon 25.03.2019 10:00 – 14:00  TC.2.03           Data processing (cont'd)
5     Tue 02.04.2019 10:00 – 14:00  D1.1.074          Data storage
6     Tue 09.04.2019 10:00 – 14:00  D1.1.078          Advanced topics (pandas, visualisation)
7     Tue 30.04.2019 11:00 – 16:00  EA.6.026          Project presentation

Organisational

Instructor(s)

Axel Polleres

axel.polleres@wu.ac.at

Stefan Sobernig

stefan.sobernig@wu.ac.at

Raphael Dachs (Tutor)

raphael.dachs@wu.ac.at

Grading

See the authoritative details at Learn@WU.


Course Material

Unit details

Unit 1: Course Overview & Introduction

Slides: This unit is also available in PDF format and as a single HTML page

Readings:

Notebook of Unit 1

Unit 1: Homework

Task:

Details: Assignment 1 on Learn@WU

Submission: Via Assignment 1 on Learn@WU, until Mon, March 18, 2019, 23:55.

Unit 2: Data access, formats, & encoding

Slides: This unit is also available in PDF format and as a single HTML page

Readings:

Notebook of Unit 2

Unit 3: Data cleaning and preparation (Basics)

Slides: This unit is also available in PDF format and as a single HTML page

Notebook of Unit 3

Homework

Details: Assignment 2 on Learn@WU

Submission: Via Assignment 2 on Learn@WU, until Tue, March 27th, 2019, 12:00.

Unit 4: Data cleaning and preparation (Cont'd)

Slides: This unit is also available in PDF format and as a single HTML page

Notebooks of Unit 4

Unit 5: Data storage & Persistence

Connecting to a database system and loading data into and from it (vs. storing to and loading from a file)

Slides: This unit is also available in PDF format and as a single HTML page

Readings:

Notebook of Unit 5
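
To illustrate the contrast named in the description of Unit 5 (database vs. file), the following is a minimal sketch and not the course's own notebook code. It assumes Python's built-in sqlite3 and csv modules; the table name, the function names, and the sample records are hypothetical and invented for illustration only.

```python
import csv
import io
import sqlite3

# Hypothetical toy records (label, value), invented purely for illustration.
RECORDS = [("alpha", 10), ("beta", 25), ("gamma", 40)]


def roundtrip_csv(records):
    """Store records in an in-memory CSV 'file' and load them back."""
    buffer = io.StringIO()
    csv.writer(buffer).writerows(records)
    buffer.seek(0)
    # CSV keeps everything as text, so the numbers are converted back by hand.
    return [(label, int(value)) for label, value in csv.reader(buffer)]


def roundtrip_sqlite(records, minimum):
    """Store records in an in-memory SQLite database and query them back."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE item (label TEXT, value INTEGER)")
        # Placeholders (?) let the database driver handle quoting and types.
        conn.executemany("INSERT INTO item VALUES (?, ?)", records)
        conn.commit()
        cursor = conn.execute(
            "SELECT label, value FROM item WHERE value >= ? ORDER BY label",
            (minimum,),
        )
        return cursor.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    print(roundtrip_csv(RECORDS))
    print(roundtrip_sqlite(RECORDS, 20))
```

In this sketch the file-based variant has to restore the data types itself, whereas the database keeps the declared column types and can filter and sort on the server side through a query.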

Unit 6: Advanced topics

Slides: This unit is also available in PDF format and as a single HTML page

Readings:

Notebooks of Unit 6

Jupyter Notebook

The theoretical part of the course is accompanied by practical code examples and hands-on exercises in Jupyter, an interactive Python environment.

Supplemental Reading

Coding