Бакалавриат
2022/2023
Дата-инжиниринг
Лучший по критерию «Полезность курса для расширения кругозора и разностороннего развития»
Лучший по критерию «Новизна полученных знаний»
Статус:
Курс по выбору (Прикладной анализ данных)
Направление:
01.03.02. Прикладная математика и информатика
Где читается:
Факультет компьютерных наук
Когда читается:
3-й курс, 3 модуль
Формат изучения:
без онлайн-курса
Охват аудитории:
для своего кампуса
Язык:
английский
Кредиты:
5
Контактные часы:
44
Course Syllabus
Abstract
The course "Data Engineering" is dedicated to the work of data engineers who provide the foundation of the analytical process - the delivery of data to the analyst's desk. In order for this to happen, someone must ensure that they are searched, downloaded according to the required schedule or request, checked, converted into a usable form and protected taking into account user roles. In the classroom, we will talk about the basics of the Data Engineer profession: how and where data is stored, what to do if the data is not suitable for use and how to reduce the cost of performing analytics. Students will have a lot of practice, where they will solve engineering problems from the life of IT departments of Russian business with their own hands.
Learning Objectives
- Get an idea about the features of data management tasks, their application in business and practical skills in working with data engineer tools.
Expected Learning Outcomes
- Get the basics of data management as a discipline: data structures and sources, data manipulation methods
- Be able to work with data engineer tools and modern databases such as PostreSQL or Clickhouse.
- Be aware of why data quality is important and how to keep it high.
- See what data marts are and how to create them.
- Identify situations where data should be handled with care (sensitive business and personal data) and what protection mechanisms are in place.
- Learn about the modern data processing stack and how it will change in the future.
Course Contents
- Data structures.
- Data manipulation
- Cleaning and validating data
- Industrial practice
- Data marts
- Protecting and masking data
Bibliography
Recommended Core Bibliography
- Basic concepts in data structures, Klein, S. T., 2016
- Large scale and big data : processing and management, , 2016
Recommended Additional Bibliography
- Artificial intelligence and big data for financial risk management : intelligent applications, , 2023
- Integrating deep learning algorithms to overcome challenges in big data analytics, , 2022