What is it about?

Scientists, researchers, engineers, etc. almost everyone who works with data crosses paths with Pandas at some point. It is so powerful library that allows for easy, rapid and efficient manipulation of data. It can convert data it represent into various file types. Among these file types, the determination of the one which records the same Pandas data with the smallest size on the disk is an important issue considering the abundance of today's data. In this study, the file types that can save Pandas data with minimum size has been experimentally investigated from various perspectives. In this respect, the CSV, HDF, JSON, Excel and Pickle file types are involved in the experiments. The sizes of these files were benchmarked under several conditions such as the completeness or lack of data and type of variables that are contained in data. In addition, it was also examined that how file sizes vary as data increases.

Featured Image

Read the Original

This page is a summary of: Recording Performances of Some File Types for Pandas Data, European Journal of Science and Technology, April 2022, European Journal of Science and Technology,
DOI: 10.31590/ejosat.1103499.
You can read the full text:

Read

Contributors

The following have contributed to this page