Image Analysis II

Type of study	Follow-up Master
Language of instruction	Czech
Code	460-4107/01
Abbreviation	ANO II
Course title	Image Analysis II
Credits	4
Coordinating department	Department of Computer Science
Course coordinator	Ing. Radovan Fusek, Ph.D.

Subject syllabus

Lectures:
* Basic concepts of object detection in images. Methods for face detection in images. Haar features (AdaBoost, Viola-Jones). Local Binary Patterns (LBP), Histograms of Oriented Gradients (HOG) and their use for object analysis. Methods for pedestrian detection in images. Keypoint detectors and descriptors (e.g. SIFT, SURF).
* Convolutional neural networks (basic principles, description of layers). Modern variants of convolutional neural networks (e.g. VGGNet, GoogLeNet, ResNet).
* Description of convolutional networks for object localization (e.g. R-CNN, Faster R-CNN, YOLO, SSD).
* Description of generative networks (e.g. DCGAN, Diffusion-GAN).
* Transformer networks (especially Vision Transformer – ViT) and their use in image analysis.
* Convolutional neural networks for image segmentation (encoder-decoder networks, U-Net).
* Human pose estimation using deep learning.
* Optical systems for self-driving vehicles, IR image processing, LIDAR image processing, depth image analysis, use of depth sensors (RealSense, Kinect).

Computer Labs:
* Development of a detector for a selected object of interest, implementation of the sliding window method, preparation of data for the training and testing phases of the detector.
* Detection based on Haar-type features, detection using local binary patterns, analysis of objects using gradients (HOG method).
* Experiments with convolutional neural networks, exploring the parameters of convolutional networks.
* Application of different types of convolutional networks (e.g. VGGNet, GoogLeNet, ResNet), comparison of detectors.
* Practical use of localization methods based on convolutional neural networks (e.g. R-CNN, Faster R-CNN, YOLO).
* Practical use of generative networks for data augmentation (DCGAN, Diffusion-GAN).
* Experiments with image segmentation using encoder-decoder networks (U-Net).
* Practical use of transformer networks for object analysis in images.
* Analysis of objects in IR and depth images (RealSense, Kinect).
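
The sliding window method named in the first lab topic can be illustrated with a minimal sketch. It is not taken from the course materials: the function name `sliding_windows`, the 64x128 window, and the stride of 32 pixels are assumptions chosen only for illustration. The sketch enumerates the candidate regions that a detector would classify one by one.

```python
from typing import Iterator, Tuple


def sliding_windows(
    image_w: int, image_h: int, win_w: int, win_h: int, stride: int
) -> Iterator[Tuple[int, int, int, int]]:
    """Yield (x, y, w, h) for every window that fits fully inside the image.

    Hypothetical helper for illustration; all parameter values are assumptions.
    """
    if stride <= 0:
        raise ValueError("stride must be positive")
    # Scan rows top to bottom and columns left to right.
    for y in range(0, image_h - win_h + 1, stride):
        for x in range(0, image_w - win_w + 1, stride):
            yield (x, y, win_w, win_h)


# Example: a 640x480 image scanned with a 64x128 window and a stride of 32 px.
windows = list(sliding_windows(640, 480, 64, 128, 32))
print(len(windows), windows[0], windows[-1])
```

In a complete detector, each window would be cropped from the image, described by features (for example Haar, LBP, or HOG), and passed to a classifier. The scan is usually repeated over several image scales so that objects of different sizes can be found.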

E-learning

Materials are available on the instructor's website:
https://mrl.cs.vsb.cz//people/fusek/ano2_course.html

Literature

1. Lakshmanan, V., et al.: Practical Machine Learning for Computer Vision: End-to-End Machine Learning for Images, O'Reilly Media, 2021
2. Zhang, A., Lipton, Z. C., Li, M., Smola, A. J.: Dive into Deep Learning, https://d2l.ai, 2020
3. Chollet, F.: Deep Learning with Python, Manning, 2017

Recommended literature

1. Burger, W., Burge, M. J.: Principles of Digital Image Processing: Fundamental Techniques, Springer, 2011
2. Brahmbhatt, S.: Practical OpenCV (Technology in Action), Apress, 2013
3. Bradski, G., Kaehler, A.: Learning OpenCV 3: Computer Vision in C++ with the OpenCV Library, O'Reilly Media, 2017
