Skip to main content
Skip header

Methods of Vector Data Analysis

* Exchange students do not have to consider this information when selecting suitable courses for an exchange stay.

Course Unit Code460-4126/01
Number of ECTS Credits Allocated4 ECTS credits
Type of Course Unit *Compulsory
Level of Course Unit *Second Cycle
Year of Study *First Year
Semester when the Course Unit is deliveredWinter Semester
Mode of DeliveryFace-to-face
Language of InstructionCzech
Prerequisites and Co-Requisites There are no prerequisites or co-requisites for this course unit
Name of Lecturer(s)Personal IDName
PLA06prof. Ing. Jan Platoš, Ph.D.
Summary
This course is focused on algorithms for data analysis and data visualization. The first part of the course is focused on explorative analysis and data clustering. The second part is focused on the data classification. he course describes a less complex linear methods to the more complex method based on the SVM. More advanced methods will be described in the last part of the course.
Learning Outcomes of the Course Unit
The goal of this course is to deepen and improve the knowledge about data analysis methods acquired in the previous courses. The main information delivered to the students is advanced algorithms for data classification, stream data processing, advanced data structures and machine learning techniques. The students will be able to use these methods, to interpret achieved results. Moreover, the student will be able to presents and visualize the results using proper methods.
Course Contents
Exploration data analysis
1. Frequent patterns, Rule based analysis.
2. Representative based clustering, Hierarchical Clustering.
3. Density based clustering, Cluster validation.
4. Self-organizing maps
5. Anomaly detection
Data Classification
6. Linear classification (Linear discriminant analysis, Naive Bayes, Logistics regression)
7. Decision Trees, Random Forests.
8. Support Vector Machine, Kernel based methods
9. Neural networks (Perceptron, Feed forward NN+Back propagation)
10. Regression methods
11. Advanced classification methods
12. Classification validation
Advanced methods
13. Stream dat analysis
14. Vektor data vizualization
Recommended or Required Reading
Required Reading:
Ian H. Witten, Eibe Frank, Mark A. Hall, Data Mining: Practical Machine Learning Tools and Techniques, Third Edition, Morgan Kaufmann, 2011, ISBN: 978-0123748560
Charu C. Aggarwal, Data Mining - The Textbook, Springer, 2015, ISBN: 978-3-319-14141-1.
Ian H. Witten, Eibe Frank, Mark A. Hall, Data Mining: Practical Machine Learning Tools and Techniques, Third Edition, Morgan Kaufmann, 2011, ISBN: 978-0123748560
Charu C. Aggarwal, Data Mining - The Textbook, Springer, 2015, ISBN: 978-3-319-14141-1.
Recommended Reading:
Mohammed J. Zaki, Wagner Meira, Jr., Data Mining and Analysis: Fundamental Concepts and Algorithms, Cambridge University
Press, May 2014. ISBN: 9780521766333.
Jure Leskovec, Anand Rajaraman, David Ullman, Mining of Massive Datasets, 2nd editions, Cambridge University Press,
Novemeber 2014, ISBN: 9781107077232, On-line http://infolab.stanford.edu/~ullman/mmds/book.pdf [2014-09-12]
Mohammed J. Zaki, Wagner Meira, Jr., Data Mining and Analysis: Fundamental Concepts and Algorithms, Cambridge University
Press, May 2014. ISBN: 9780521766333.
Jure Leskovec, Anand Rajaraman, David Ullman, Mining of Massive Datasets, 2nd editions, Cambridge University Press,
Novemeber 2014, ISBN: 9781107077232, On-line http://infolab.stanford.edu/~ullman/mmds/book.pdf [2014-09-12]
Planned learning activities and teaching methods
Lectures, Tutorials
Assesment methods and criteria
Task TitleTask TypeMaximum Number of Points
(Act. for Subtasks)
Minimum Number of Points for Task Passing
Graded creditGraded credit100 51