Advanced methods for data manipulation

Type of study Follow-up Master, Bachelor
Language of instruction English
Code 9360-0193/02
Abbreviation PMZD
Course title Advanced methods for data manipulation
Credits 3
Coordinating department CNT - Nanotechnology Centre
Course coordinator Ing. Dominik Legut, Ph.D.

Subject syllabus

This subject prepares participants to process and manipulate large data files and to work with HPC supercomputers. Participants will learn to work with files containing millions of lines or columns, or files as large as several GiB.
1. Unix (Linux) commands for data manipulation at the command-line prompt
2. Handling and editing text data in Unix: vi, nano, Midnight Commander, etc.
3. Introduction to scripting in Bash, for and while loops, etc.
4. Introduction to Awk, manipulation of data
5. How to use simple mathematics on the command line
6. Awk, data I/O formats (formatted input and output)
7. Basics of Ed and Sed, replacing strings, more complex constructions
8. Advanced methods - Introduction to Perl
9. Perl II
10. Regular syntax I
11. Regular syntax II
12. Data transfer to and from HPC systems, display forwarding, use of a scheduler and batch jobs
13. - 14. Practical sessions