Skip to main content
Skip header

Image Analysis II

Type of study Follow-up Master
Language of instruction English
Code 460-4107/02
Abbreviation AO2
Course title Image Analysis II
Credits 4
Coordinating department Department of Computer Science
Course coordinator Ing. Radovan Fusek, Ph.D.

Subject syllabus

Lectures:
- Basic concepts of object detection in images; Haar-like features (AdaBoost, Viola-Jones); Local Binary Patterns (LBP); Histograms of Oriented Gradients (HOG). Methods for pedestrian detection in images; methods for face detection in images. Keypoint detectors and descriptors (e.g. SIFT, SURF).
- Convolutional neural networks (basic principles, description of layers). Modern variants of convolutional neural networks (e.g. GoogLeNet, ResNet, EfficientNet).
- Description of convolutional networks for object localization (e.g. R-CNN, Faster R-CNN, YOLO, SSD).
- Description of generative networks (e.g. DCGAN, Diffusion-GAN).
- Transformer networks (especially Vision Transformer - ViT) and their use in image analysis.
- Convolutional neural networks for image segmentation (encoder-decoder networks, U-Net).
- Human pose estimation using deep learning.
- Optical systems in the area of self-driving vehicles, IR image processing, LiDAR data processing, depth image analysis, use of depth sensors (RealSense, Kinect).


Computer Labs:
- Development of the detector for the selected object of interest, implementation of the sliding window method, preparation of data for the training and testing phases of the detector.
- Object analysis using Haar-like features, Local Binary Patterns (LBP), and Histograms of Oriented Gradients (HOG).
- Experiments with convolutional neural networks, exploring the parameters of convolutional networks.
- Application of different types of convolutional networks (e.g. GoogLeNet, ResNet, EfficientNet), comparison of detectors.
- Practical use of localization methods based on convolutional neural networks (e.g. R-CNN, Faster R-CNN, YOLO).
- Practical use of generative networks for data augmentation (DCGAN, Diffusion-GAN).
- Experiments with image segmentation using encoder-decoder networks (U-Net).
- Practical use of transformer networks for object analysis in images.
- Analysis of objects in IR and depth images (RealSense, Kinect).

E-learning

Materials are available on the educator's website:
https://mrl.cs.vsb.cz//people/fusek/ano2_course.html

Literature

1. Ayyadevara, V. Kishore; Reddy, Yeshwanth. Modern Computer Vision with PyTorch: Explore deep learning concepts and implement over 50 real-world image applications. Packt Publishing, 2020. ISBN 978-1839213472 .
2. Lakshmanan, V.; Shlens, J.; Sukthankar, R. Practical Machine Learning for Computer Vision: End-to-End Machine Learning for Images. O’Reilly Media, 2021. ISBN 978-1098102364.
3. Zhang, A.; Lipton, Z. C.; Li, M.; Smola, A. J. Dive into Deep Learning. arXiv, 2021. Available at: https://doi.org/10.48550/arXiv.2106.11342.

Advised literature

1. Burger, W.; Burge, M. J. Principles of digital image processing: Fundamental techniques. Springer, 2011. ISBN 978-1848001909 .
2. Chollet, F. Deep learning with Python. Manning, 2017. ISBN 978-1617294433.
3. Howse, Joseph; Minichino, Joe. Learning OpenCV 4 Computer Vision with Python 3. 3rd ed. Birmingham: Packt Publishing, 2020. ISBN 978-1789531619.