We use cookies in order to improve the quality and usability of the HSE website. More information about the use of cookies is available here, and the regulations on processing personal data can be found here. By continuing to use the site, you hereby confirm that you have been informed of the use of cookies by the HSE website and agree with our rules for processing personal data. You may disable cookies in your browser settings.

  • A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Development of an Automated Pipeline for Forecasting Time Series¶of the Number of Bookings

Student: Albert Baychorov

Supervisor: Alexey Masyutin

Faculty: Faculty of Computer Science

Educational Programme: Financial Technology and Data Analysis (Master)

Year of Graduation: 2024

Modern business needs to be introduced to the data-driven culture, which implies making decisions based on regularly-parsed data. In the case of the hotel business, the optimal solution is to develop an automated pipeline that stores, processes and analyses regularly data every day. For such a task the time series analysis tools (ARIMA, ARIMAX models), tools for operations with large number of features and decompositions (Prophet, Random Forest, LightGBM), as well as development of all backend part (PostgreSQL database and DBeaver DBMS) and its automation (Cron) were required. The demand sensitivity model (IRF) was also trained on the Price parameters of the average tariff and traffic volume on the company's website, and a cross- validation was carried out for different timeframes. As a result, LightGBM model was selected to apply to each hotel pair and room type, which together showed the best predictive accuracy on the cross-validation data. These models are automatically placed in the configured databases, updated daily, and have a view that is ready to be used by business units.

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses