• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site
2023/2024

Data Analysis: Advanced Level

Category 'Best Course for Career Development'
Category 'Best Course for Broadening Horizons and Diversity of Knowledge and Skills'
Category 'Best Course for New Knowledge and Skills'
Type: Mago-Lego
When: 1, 2 module
Open to: students of one campus
Instructors: Valeria A. Ivaniushina, Darya Konstantinovna Khodorenko, Ksenia Tenisheva
Language: English
ECTS credits: 6
Contact hours: 40

Course Syllabus

Abstract

The course is continuation of the Data Analysis course. This course prepares students to gather, describe, and analyze data using advanced statistical tools. Topics of the first module include advanced data manipulation, treatment of missing data, survival analysis and choice modeling. The second module is centered around issues of causality and causal inference. The course consists of lectures by the instructor, practical assignments for each class, and two individual projects. Grading is based on practical assignments, two projects and final exam
Learning Objectives

Learning Objectives

  • This course aims to provide an overview of advanced statistical techniques that arise in data analytic applications. In this class, you will learn and practice advanced data analytic techniques. One or more practical applications associated with each technique will also be discussed.
Expected Learning Outcomes

Expected Learning Outcomes

  • At the end of the course, the students should be able to: - identify properties of the data, detect potential problems - treat data problems: sample bias, missings, excessively small/large samples
  • - be able to conduct time-series analysis and analysis of choice
  • - Define causal effects using potential outcomes
  • - Implement causal inference methods (matching, instrumental variables, regression discontinuity, difference-in-difference, fixed effects) - Identify which causal assumptions are necessary for each type of statistical method
  • - Express assumptions with causal graphs
Course Contents

Course Contents

  • 1. Data problems: what can be wrong?
  • 2. Correction of sample bias.
  • 3. Missing data treatment.
  • 4. Survival analysis
  • 5. Choice modeling
  • 6. Basics of causal inference
  • 7. Causal Diagrams.
  • 8. Statistical instruments for causal inference.
Assessment Elements

Assessment Elements

  • non-blocking Project 1
    This project is assigned at the end of the first module. Students have to demonstrate their abilities to detect potential data problems and fix these problems. Two elements of grading are correct coding and correct interpretation.
  • non-blocking Exam
    Exam is conducted in a form of take-home project. Students have to apply a set of methods studied during the course to get an answer for the given research question. Students have 48 hours to individually prepare and submit the paper
  • non-blocking Project 2
    The project is assigned at the end of the second module. Students have to demonstrate their skills to implement a causal inference method and to rationalize their choice of the method.
  • non-blocking Practical Assignments
    For each topic, there will be a practical assignment. Students have to complete it either at class or as a homework. Each task wil be graded as 1 (done) or 0 (not done). Maximum grade for this part is 8, if all the tasks are completed.
Interim Assessment

Interim Assessment

  • 2023/2024 2nd module
    Project 1* 0.25 + Practical Assignments * 0.2 + Project 2 * 0.25 + Exam * 0.3
Bibliography

Bibliography

Recommended Core Bibliography

  • 9780205849574 - Barbara G. Tabachnick; Linda S. Fidell - Using Multivariate Statistics, 6th Edition - 2013 - Pearson - https://search.ebscohost.com/login.aspx?direct=true&db=nlebk&AN=1418064 - nlebk - 1418064
  • Bertail, P., Blanke, D., Cornillon, P.-A., & Matzner-Løber, E. (2019). Nonparametric Statistics : 3rd ISNPS, Avignon, France, June 2016. Cham, Switzerland: Springer. Retrieved from http://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsebk&AN=2044916
  • Crawley, M. J. (2014). Statistics : An Introduction Using R (Vol. Second edition). Chichester, West Sussex, UK: Wiley. Retrieved from http://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsebk&AN=846213

Recommended Additional Bibliography

  • Field, A. V. (DE-588)128714581, (DE-627)378310763, (DE-576)186310501, aut. (2012). Discovering statistics using R Andy Field, Jeremy Miles, Zoë Field.

Authors

  • PESTOVA ALENA SERGEEVNA
  • SORBALE ALEKSEY BORISOVICH
  • Ильина Мария Ивановна
  • IVANYUSHINA VALERIYA ALEKSANDROVNA
  • TSVETKOVA EKATERINA ANDREEVNA
  • KHODORENKO DARYA KONSTANTINOVNA