A Program for Depersonalizing Personal Data in Legal Documents by Extracting Named Entities Using Artificial Intelligence Methods

Student: Kirill Grafov

Supervisor: Dmitry Pantiukhin

Faculty: HSE Tikhonov Moscow Institute of Electronics and Mathematics (MIEM HSE)

Educational Programme: Cybersecurity (Master)

Year of Graduation: 2024

This thesis addresses the depersonalization of personal data in legal documents using artificial intelligence methods. The task is relevant because of the need to comply with personal data protection requirements and to ensure information confidentiality. The developed program is a tool that extracts named entities from text using machine learning algorithms; it is implemented in Python. In summary, the work is a comprehensive study aimed at developing software tools for anonymizing personal data in legal documents using artificial intelligence methods. The thesis comprises 69 pages, 16 figures, 12 tables, 21 sources, and 2 appendices.
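The summary does not say which model or library the program uses, so the following is only a minimal sketch of the masking step that follows named entity extraction. It assumes some recognizer has already produced character spans with labels. The function name `depersonalize`, the label names, and the sample sentence are all hypothetical and are not taken from the thesis.

```python
from typing import List, Tuple

# Assumed format: (start, end, label) character spans, as an NER model might return.
Entity = Tuple[int, int, str]


def depersonalize(text: str, entities: List[Entity]) -> str:
    """Replace each entity span with a bracketed label placeholder."""
    parts = []
    cursor = 0
    for start, end, label in sorted(entities):
        if start < cursor:
            continue  # skip spans that overlap an already masked span
        parts.append(text[cursor:start])  # keep the text before the entity
        parts.append(f"[{label}]")        # mask the entity itself
        cursor = end
    parts.append(text[cursor:])           # keep the remaining tail
    return "".join(parts)


def span_of(text: str, fragment: str, label: str) -> Entity:
    """Stand-in for a real recognizer: locate a known fragment in the text."""
    start = text.find(fragment)
    return (start, start + len(fragment), label)


# Hypothetical sample sentence and entities, for illustration only.
text = "The claimant Ivan Petrov resides in Moscow."
entities = [
    span_of(text, "Ivan Petrov", "PERSON"),
    span_of(text, "Moscow", "LOCATION"),
]
masked = depersonalize(text, entities)
print(masked)
```

Replacing spans in a single left-to-right pass keeps the original offsets valid, which would not be the case if each entity were substituted into the string one at a time.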

Student Theses at HSE must be completed in accordance with the University Rules and regulations specified by each educational programme.

Summaries of all theses must be published and made freely available on the HSE website.

The full text of a thesis can be published in open access on the HSE website only if the authoring student (copyright holder) agrees, or, if the thesis was written by a team of students, if all the co-authors (copyright holders) agree. After a thesis is published on the HSE website, it obtains the status of an online publication.

Student theses are objects of copyright and their use is subject to limitations in accordance with the Russian Federation’s law on intellectual property.

In the event that a thesis is quoted or otherwise used, reference to the author’s name and the source of quotation is required.
