
UDC 004.652.5

APPLICATION OF DATA CLEANING METHODS IN DATABASE REENGINEERING

A. I. Baranchikov, Doctor of Technical Sciences, full professor, RSREU, Ryazan, Russia;
orcid.org/0000-0003-4133-7489
I. I. Yakovlev, post-graduate student, RSREU, Ryazan, Russia;
orcid.org/0000-0002-3813-0455
I. A. Klyueva, programmer, RSREU, Ryazan, Russia;
orcid.org/0000-0002-0392-3228

The aim of the work is to apply data cleaning algorithms, originally designed for collecting data into data warehouses, to the problem of database reengineering. The main tasks are to choose a cleaning method, modify it for the new purpose, and apply it in reengineering. The relevance of the research lies in applying well-known data processing methods to new problems. The article classifies reengineering problems by the number of databases involved, analyzes the levels of the algorithm, and describes its main stages. A cleaning algorithm is selected and modified for database reengineering, and the prospects for its use are assessed. An example of the algorithm's operation is given. The result of the work is an algorithm that makes it possible to apply the data cleaning methods used in the organization and operation of data warehouses to database reengineering tasks.

Key words: data, database, data structure, data cleansing, reengineering, subject area, attribute, profiling, data mining.
