Discontinued in the 2010/2011 academic year

Data Analysis Methods

Type of study Doctoral
Language of instruction Czech
Code 456-0921/01
Abbreviation MAD
Course title Data Analysis Methods
Credits 0
Coordinating department Department of Computer Science
Course coordinator doc. RNDr. Jana Šarmanová, CSc.

Course outline

Lectures:
Defining the problem of multivariate data analysis.
Methods of data analysis: mathematical statistics and exploratory data analysis. Input data types from the formal and semantic points of view. Filtering, missing data, dichotomization, categorization.
Preprocessing, transformation. Normalization and standardization. Principal components.
Cluster analysis, non-hierarchical methods, hierarchical methods, presentation and interpretation of results.
Finding associations, automatic creation of hypotheses, presentation and interpretation of results.
Decision tree construction, presentation and interpretation.

Exercises:
Practising the methods from the lectures on examples of specific data.
Papers on new methods of data mining.
Reports on the results of an analysis.

Projects:
Analysis of specific data from the student's own experience or from a database.
Preprocessing, selection of appropriate methods.
Independent calculations, interpretation.
Presentation of results, documentation.

Computer Labs:
A data analysis system: operating its methods, presenting results, applications.

Required literature

Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Second Edition. Elsevier Inc., 2006, 770 p., ISBN 1-55860-901-3.

Recommended literature

Dunham, M.H.: Data Mining: Introductory and Advanced Topics. Pearson Education, Inc., 2003, 315 p.