On the issue of increasing machine learning performance at the data sampling stage when solving classification problems

Authors

DOI:

https://doi.org/10.17308/sait/1995-5499/2022/4/146-155

Keywords:

data sampling, neural networks, point cloud, SDGs

Abstract

The purpose of this study is to determine a data storage method for machine learning tasks of neural networks and semantic segmentation of point clouds. The existing methods of working with large files are considered, experimental studies are carried out to determine the speed of the data reading operation. The experiment consisted in reproducing the process of accessing information from files for which the volume and structure of stored information with time measurement were described. For the research, the most common file formats used for storing information were taken *.csv, *.npy and *.h5. The result of the experiment was statistical information about the file reading time depending on the selected structure and the amount of information stored in it, as well as recommendations for choosing a storage method.

Author Biographies

  • Roman A. Dyachenko, Kuban State Technological University

    Doctor of Technical Sciences, Professor, Professor of the Department of Informatics and Computer Engineering of the Kuban State Technological University

  • Pavel A. Kosolapov, Kuban State Technological University

    Candidate of the Department of Computer Science and Computer Engineering of the Kuban State Technological University

  • Dmitry A. Gura, Kuban State Agrarian University named after I. T. Trubilin

    Candidate of Technical Sciences, Associate Professor, Associate Professor of the Department of Cadastre and Geo-Engineering of the Kuban State Technological University, Associate Professor of the Department of Geodesy of the Kuban State Agrarian University named after I. T. Trubilin

References

Downloads

Published

2022-12-26

Issue

Section

Intelligent Information Systems, Data Analysis and Machine Learning

How to Cite

On the issue of increasing machine learning performance at the data sampling stage when solving classification problems. (2022). Proceedings of Voronezh State University. Series: Systems Analysis and Information Technologies, 4, 146-155. https://doi.org/10.17308/sait/1995-5499/2022/4/146-155

Most read articles by the same author(s)