5Class Outline for Data Mining (CSCI 550) in Fall 2010.

Instructor: Dr. Rafal A. Angryk;

Prerequisites: Some statistics and some databases and a solid, “graduate-level” brain on the top of that.

Lectures: Monday, Wednesday, Friday 9:00-9:50AM at EPS 108

Course Web page: http://www.cs.montana.edu/courses/csci550/ (may change, as we progress toward unified course numbers system)

Your points/grades can be checked here

 

Please check the errata of the textbook draft before you try to identify potential errors in the textbook.

 

* All requested readings need to be done BEFORE the class. During each of the meetings, the students may be tested via "daily quizzes" on the readings which has been assigned for the particular day.

 

Abbreviation PR in the table below stands for "Graduate Course PRoject".

 

Date

Readings*

Lecture

Extras ...

30-Aug, Mon

 

Syllabus; Intro to CSCI550

(1) Read carefully My General Class Policies, (2) Check out popular DM Portal, (3) Read Ch.1 from your textbook, (4) Have a look at the IEEE formatting rules for your final paper, (5) Fill out Let’s meet form.

1-Sep, Wed

Ch.1.

Introduction to DM

(1) Check out WEKA and Matlab software and its DM capabilities, (2) Check out two industrial DM methodologies: CRISP-DM and SEMMA, (3) Read Ch.2 from your textbook,

3-Sep, Fri

-continued

6-Sep, Mon

Labor Day holiday

No Classes; Offices Closed

8-Sep, Wed

-continued

10-Sep, Fri

Ch.2.

Data Preprocessing (1)

(1) Check Out this website on Graphical Data Analysis, (2) 1st Homework - Paper Review. Review form is here, (3) No idea how to review a paper- Ask me and check out Parberry's Referee's Guide, (4) PR: Here is an EXCELLENT tutorial that could help you, when working on review and your final project. I highly recommend that you at least read Part 2 (it starts on slide 68)!

13-Sep, Mon

-continued

15-Sep, Wed

Ch.2 & 4.3.

Data Preprocessing (2)

I extended Data Preprocessing (1) slides (last page!), to answer one good question I got after the class. We’ll go through the changes during the next meeting.

17-Sep, Fri

-continued

Interested in Box-Cox Transformation? Check out these links: (1) What to do when data are non-normal, and (2) Power Transform Family Graphs

20-Sep, Mon

-continued

22-Sep, Wed

GAOI

1) Please, make sure to bring copies of the reviewed paper to class!!!

24-Sep, Fri

-continued

1) Please, make sure to bring copies of the reviewed paper to class!!!

27-Sep, Mon

Ch.5.

Frequent Patterns (1)

29-Sep, Wed

-continued

(1) 2nd Homework DataSet1 and DataSet3 are here.

1-Oct, Fri

-continued

4-Oct, Mon

-continued

6-Oct, Wed

Ch.5.4

Frequent Patterns (2)

8-Oct, Fri

-continued

11-Oct, Mon

Homework 2 Review

13-Oct, Wed

TEST 1

15-Oct, Fri

Ch.5.5

Frequent Patterns (3)

18-Oct, Mon

Let’s Talk about Projects

Here is an EXCELLENT tutorial that could help you, when working on your final project: 4-slides per page version (and link to the original is here). Read at least Part 2 (it starts on slide 68)!

20-Oct, Wed

Ch.7-7.3

Clustering (1)

22-Oct, Fri

- continued

25-Oct, Mon

Ch. 7.4

Clustering (2)

27-Oct, Wed

Ch. 7.5

Clustering (3)

29-Oct, Fri

Ch. 7.6-.7

Clustering (4)

1-Nov, Mon

- continued

3-Nov, Wed

Ch. 7.6-.8, .11

Clustering (5)

Project Proposals are due.

5-Nov, Fri

- continued

8-Nov, Mon

Clustering (6) - Evaluation

10-Nov, Wed

PRESENTATIONS OF PROJECTS (1)

PROJECTS

12-Nov, Fri

PRESENTATIONS OF PROJECTS (2)

PROJECTS

15-Nov, Mon

PRESENTATIONS OF PROJECTS (3)

PROJECTS

17-Nov, Wed

Ch.6-6.3

Classification (1)

19-Nov, Fri

-continued

Literature Review is due.

22-Nov, Mon

Ch.6.12-.15

Classification (2) - Evaluation

24-Nov, Wed

Thanksgiving Day holiday

No Classes; Offices Closed

26-Nov, Fri

Thanksgiving Day holiday

No Classes; Offices Closed

29-Nov, Mon

Classification (3)

1-Dec, Wed

TEST 2

3-Dec, Fri

Dimensionality Reduction

6- Dec, Mon

PRESENTATIONS OF PROJECTS (1)

Mark+Jeff E.+Devin, Richard+Mike, Scott

8- Dec, Wed

PRESENTATIONS OF PROJECTS (2)

Patrick+Shane, Dennis+Karthik, Swapan+Travis

10- Dec, Fri

PRESENTATIONS OF PROJECTS (3)

Jeff B. , Liessman, Michael RW,  Chandrima+Atanu

Papers on Projects are due at the beginning of the class

13-Dec, Mon

FINAL EXAM: 8:30-9:50am, in EPS 108