• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site

Convolutional Models for Audio Source Separation in Streaming Setting

Student: Akhmatova Anna

Supervisor: Maxim Kaledin

Faculty: Faculty of Computer Science

Educational Programme: Applied Mathematics and Information Science (Bachelor)

Final Grade: 8

Year of Graduation: 2024

The research project was dedicated to the creation of a deep neural network for online target source separation. In order to solve this problem, we adapted one of the state-of-the-art convolutional neural networks for the offline task, SpexPlus, to the online mode. We developed several memory mechanisms for the architecture to improve its quality by expanding the captured context during sequential processing of audio chunks. The proposed methods were tested in experiments. We continued to work with the model with the most suitable memory mechanism, conducted a study on how much we can compress this architecture structurally (by changing its hyperparameters) and in case of applying standard techniques for model size reduction (pruning and quantization) to it. We investigated how network's compression affects its quality, weight and speed of inference. This paper describes the ideas from other studies that were used as a basis for creating memory mechanisms, the course and results of our research, the problems and features of models' training that we faced.

Full text (added May 20, 2024)

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.

Search all student theses