1. Introduction to data preparation and manipulation: Overview of data types and structures in R, techniques for cleaning data. Filtering, grouping, and arranging data. Concept of tidy data.
2. Data visualisation: Generating various types of plots based on data type.
3. Understanding measures of central tendency, variability, and shape. Detection of outliers.
4. The principles of hypothesis testing, the concept of confidence intervals, critical regions, and p-values.
5. One- and two-sample parametric tests
6. Chi-squared goodness-of-fit tests and tests of independence in contingency tables
7. Analysis of variance
8. Non-parametric tests
9. Correlation analysis and related visualization techniques
10. Simple and multivariate linear regression models
11. Regression diagnostic, model performance evaluation
12. Additional regression models selected according to the nature of the data being studied.
Available study materials: Supplementary materials in LMS
1. AGRESTI, Alan a KATERI, Maria. Foundations of statistics for data scientists: with R and Python. Texts in statistical science. Boca Raton: CRC Press, 2022. ISBN 978-0-367-74845-6.
2. CRAWLEY, Michael J. Statistics: an introduction using R. 2nd ed. Chichester: Wiley, 2015. ISBN 978-1-118-94109-6.
3. COHEN, Yosef a COHEN, Jeremiah Y. Statistics and data with R: an applied approach through examples. Chichester, U.K.: Wiley, 2008. ISBN 9780470758052. Dostupné také z:
http://onlinelibrary.wiley.com/book/10.1002/9780470721896.
4. DEVORE, Jay L. Probability and statistics for engineering and the sciences. Ninth edition. Australia: Cengage Learning, [2016]. ISBN 978-1305251809.
5. WICKHAM, Hadley a GROLEMUND, Garrett. R for data science: import, tidy, transform, visualize and model data. Sebastopol: O'Reilly Media, [2017]. ISBN 978-1-4919-1039-9.