Basic techniques of data harvesting and cleaning; association rules, classification, and clustering; analyzing, manipulating, and visualizing data using programming languages. Basic principles of probability and statistical modeling and inference for making meaning out of large datasets. Cross-listed with: STAT 087.


Open to Degree and CDE students; Cross-listed with STAT 087 A; Total combined enrollment: 40

This course is an introduction to the field of data science for students with no experience in computer coding or statistics. In this course, you will learn to:
• Analyze and manipulate data with the programming language R;
• Understand the basic principles of probability and statistical modeling, to enable you to
• Make meaning out of large datasets, in order to
• Make better-informed decisions in business, research, and life.
Credit is not offered to students who have previously taken Statistics courses at the 200 level or higher.

This is an in-person course, so you will be expected to attend class on campus unless you are ill. There is also a YYA section if you are a fully 'At Home' student. The instructor will provide materials online and/or during class. The software used in the course is available at no cost. It is highly recommended that you have a laptop for this course and bring it to class regularly. Group work will be done in class, so attendance is very important. No background in statistics or computer coding is required, but you should come with an open mind, a good attitude, and an interest in data science.


Homework: 25% of grade; Tests: 40% of grade; Midterm (take-home): 15% of grade; Project: 15% of grade; Participation: 5% of grade

