Data mining can be viewed as lossy data reduction and learning techniques that are designed to handle massive data sets containing large numbers of categorical and numeric attributes. This course covers topics in data mining and knowledge discovery structured and unstructured databases such as data integration, mining, and interpretation of patterns, rule-based learning, decision trees, association rule mining, and statistical analysis for discovery of patterns, evaluation and interpretation of the mined patterns using visualization techniques.
Data Mining (CS 525)
Programs\Type | Required | Core Elective | Area Elective |
MA-European Studies | |||
MA-European Studies-Non Thesis | |||
MA-Political Science | |||
MA-Political Science-Non Thes | |||
MA-Visual Arts&Vis. Com Des-NT | |||
MA-Visual Arts&Visual Com Des | |||
MS-Bio. Sci. & Bioeng. LFI | |||
MS-Bio. Sci. & Bioeng. LFI-ENG | |||
MS-Biological Sci&Bioeng. | * | ||
MS-Business Analytics | |||
MS-Computer Sci.&Eng. LFI | |||
MS-Computer Sci.&Eng. LFI-ENG | |||
MS-Computer Science and Eng. | * | ||
MS-Cyber Security(with thesis) | * | ||
MS-Data Science | |||
MS-Elec. Eng&Comp Sc.LFI-ENG | |||
MS-Electronics Eng&Comp Sc.LFI | |||
MS-Electronics Eng&Computer Sc | * | ||
MS-Electronics Eng. | * | ||
MS-Electronics Eng. LFI | |||
MS-Electronics Eng. LFI-ENG | |||
MS-Energy Techno.&Man. | * | ||
MS-Industrial Eng. LFI-ENG | |||
MS-Industrial Engineering | * | ||
MS-Industrial Engineering LFI | |||
MS-Manufacturing Eng-Non Thes | * | ||
MS-Manufacturing Engineering | * | ||
MS-Materials Sci & Engineering | * | ||
MS-Materials Sci. & Eng. LFI | |||
MS-Materials Sci.&Eng. LFI-ENG | |||
MS-Mathematics | |||
MS-Mechatronics | * | ||
MS-Mechatronics LFI | |||
MS-Mechatronics LFI-ENG | |||
MS-Physics | |||
MS-Physics-Non Thesis | * | ||
MS-Psychology | |||
MS-Psychology-Non Thesis | |||
PHD-Biological Sci&Bioeng. | * | ||
PHD-Comp. Sci and Eng.after UG | * | ||
PHD-Computer Science and Eng. | * | ||
PHD-Cyber Security | * | ||
PHD-Electronics Eng&ComputerSc | * | ||
PHD-Electronics Eng. | * | ||
PHD-Electronics Eng. after UG | * | ||
PHD-Experimental Psychology | |||
PHD-Industrial Engineering | * | ||
PHD-Management | |||
PHD-Manufacturing Eng after UG | * | ||
PHD-Manufacturing Engineering | * | ||
PHD-Materials Sci.&Engineering | * | ||
PHD-Mathematics | |||
PHD-Mechatronics | * | ||
PHD-Mechatronics after UG | * | ||
PHD-Physics | |||
PHD-Physics after UG | |||
PHD-Social Psychology | |||
PHDBIO after UG | * | ||
PHDCYSEC after UG | * | ||
PHDEECS after UG | * | ||
PHDEPSY after UG | |||
PHDIE after UG | * | ||
PHDMAN after UG | |||
PHDMAN after UG-Finance | |||
PHDMAN after UG-Man. and Org. | |||
PHDMAN after UG-Op.&Sup. Cha. | |||
PHDMAN-Finance Area | |||
PHDMAN-Man. and Org. Area | |||
PHDMAN-Op. & Supp. Chain Area | |||
PHDMAT after UG | * | ||
PHDMATH after UG | |||
PHDSPSY after UG |
CONTENT
OBJECTIVE
? to introduce the basic data mining process which includes feature selection, supervised learning, unsupervised learning, association rule mining and inference
? to discuss the operations research background of popular hard and fuzzy clustering algorithms
? to derive multiple linear regression (MLR) as a tool in supervised learning process
? to introduce the operations research background of logistic regression as a classification algorithm.
? to introduce the details of popular classifiers such as naive bayes, instance based and decision tree based classifiers.
? to discuss the details of association rule mining.
? to introduce various intelligent search algorithms such as simulated annealing, genetic algorithms, tabu search, GRASP and beam search and how they are used as part of wrappers in the context of feature selection process.
? to introduce multicriteria decision making and how some MCDM algorithms can be incorporated in the data mining process
? to experience WEKA as a software tool in data mining.
? to expose the students to real world applications through case studies.
? to develop students? team working skills, as well as self-confidence, in dealing with decision making problems.
LEARNING OUTCOME
list the basic components of a data mining process.
model a data mining problem and decide which techniques are suitable for the business objective of the user.
understand the optimization formulation of the various data mining problems and derive the techniques that can be used in those problems.
correctly apply the steps of various feature selection techniques, supervised and unsupervised learning algorithms and association rule mining algorithm.
implement feature selection, supervised and unsupervised learning algorithms, association rule mining techniques with a data mining tool (e.g., WEKA) in order to determine the relations that are hidden in the data.
Update Date:
ASSESSMENT METHODS and CRITERIA
Percentage (%) | |
Final | 15 |
Midterm | 15 |
Assignment | 20 |
Term-Paper | 50 |
RECOMENDED or REQUIRED READINGS
Readings |
Jiawei Han |
Course Web | SuCourse |