Location: Room 200, Deschutes Hall
Time: 10:00-11:20AM, Tu/Th
Prerequisites: Calculus.
Instructor: Matthew Sottile (matt@cs.uoregon.edu)
Office: 203 Deschutes Hall
Office Hours: 2:30-4:00pm, Mon/Wed.
Data is everywhere, and the volume of data is increasing rapidly as improvements in simulation and experimental science progress. The focus of this course is on how one sifts through this vast volume of data to gain some knowledge from it. Very often the information that one seeks, particularly from numerical data, requires a coupling of clever algorithms with mathematical techniques in order to tease out the needle from this numerical haystack. Very often the algorithms become simpler if one leverages the mathematical properties of the data in how the problem is approached. This course will look at a combination of numerical techniques and algorithms to solve interesting problems in data analysis. We will explore how computation drives the science of understanding data: hence, computational data science. This name is also chosen to reflect the connection of the topic to computational science. In computational science, we are concerned with the modeling and implementataion of scientific probles. In this course we will look at the computational science issues specifically related to analysis of data produced by computational scientists from their models and simulations.
Students will learn about the various forms that data can take on, including images, volumetric data, time series, and graphs. We will look at how one can infer meaningful structure and properties from these representations using algorithms and mathematics. We will explore how one defines relationships such as a quantitative measure of ``similarity'' between data elements. We will discuss questions such as:
The course work will be composed of a mixture of exercises to demonstrate an understanding of the material, one or more whole-class, hands on projects to get a feel for working with data, and a few basic programming exercises. Students will be exposed to tools such as Mathematica, Matlab, and GNU Octave as implementation vehicles for working with data. Prior knowledge of these tools is not required, as the necessary information on how to use them will be part of the course itself. The tools will be made available to the students in the CIS department lab and on departmental servers.
Rough outline of topics (subject to change):
Notes, news, and details can be found here.