• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site
Master 2024/2025

Probability Theory and Mathematical Statistics

Type: Elective course (Data Science)
Area of studies: Applied Mathematics and Informatics
When: 1 year, 1 module
Mode of studies: offline
Open to: students of all HSE University campuses
Instructors: Denis Fedyanin
Master’s programme: Data Science
Language: English
ECTS credits: 3

Course Syllabus

Abstract

The overwhelming majority of courses on data-mining and artificial intelligence implies a strong background in probability and statistics. The goal of this course is to provide those students who are not so keen on the subject matter with its fundamentals.
Learning Objectives

Learning Objectives

  • To introduce the theoretical foundations of Probability theory.
  • To introduce the theoretical foundations of Mathematical statistics.
  • To provide the students with practical skills of modelling real-world in the framework of probability and statistics.
Expected Learning Outcomes

Expected Learning Outcomes

  • Solve simple problems using linear and normal linear regression.
  • Understand basic statistical models.
  • Understand Empirical distribution function, method of moments, maximum likelihood principle.
  • Understand fundamental concepts chi-squared test.
  • Understand fundamental concepts of statistical hypothesis for the normal distribution.
  • Understand fundamental concepts of statistical hypothesis.
  • Understand fundamental concepts, advantages and limitations of Linear regression and Normal linear regression.
  • Understand fundamental concepts, advantages and limitations of non-parametric statistics.
  • Understand problems of statistics.
Course Contents

Course Contents

  • Problems of Statistics. Parameter estimators. Basic statistical models. Distributions and sample statistics. Estimators. Unbiased estimators. Unbiased estimators for expectation and variance. Consistency. Efficiency and mean-squared error. Kramer-Rao inequality.
  • Empirical distribution function. Empirical parameters. Method of moments. Maximum likelihood principle. Likelihood and loglikelihood.
  • Multivariate normal distribution. Student and Fisher distributions. Normal sample.
  • Testing statistical hypothesis. Null hypothesis. Test statistics. Type I and type II errors. Significant level. Critical region and critical values.
  • Statistical hypothesis for the normal distribution. The one-sample t-test. Comparing two samples. Tests for variance.
  • Chi-squared test. For known parameters. For unknown parameters.
  • Non-parametric statistics. Sign test. Two-sided sign test. Wilcoxon sign-rank test.
  • Tests for power-law distributions. Hill test. Ibragimov’s corrections for confident intervals. Data-collapse approach. Clauset-Shalizi-Newman test.
  • Linear regression. Normal linear regression.
Assessment Elements

Assessment Elements

  • non-blocking Seminar 1
  • non-blocking Seminar 2
  • non-blocking Seminar 3
  • non-blocking Seminar 4
  • non-blocking Seminar 5
  • non-blocking Seminar 6
  • non-blocking Seminar 7
  • non-blocking Seminar 8
  • non-blocking Exam
Interim Assessment

Interim Assessment

  • 2024/2025 1st module
    0.2 * Exam + 0.1 * Seminar 1 + 0.1 * Seminar 2 + 0.1 * Seminar 3 + 0.1 * Seminar 4 + 0.1 * Seminar 5 + 0.1 * Seminar 6 + 0.1 * Seminar 7 + 0.1 * Seminar 8
Bibliography

Bibliography

Recommended Core Bibliography

  • Dekking F. M. et al. A Modern Introduction to Probability and Statistics: Understanding why and how. – Springer Science & Business Media, 2005. – 488 pp.

Recommended Additional Bibliography

  • Wasserman, L. All of nonparametric statistics. – Springer Science & Business Media, 2006. – 270 pp.

Authors

  • Антропова Лариса Ивановна
  • GROMOV Vasilii ALEKSANDROVICH
  • Fedianin DENIS NIKOLAEVICH