Common sleep data pipeline for combined data sets

doi:10.1371/journal.pone.0307202

Table 1.

Overview of the 21 datasets used in the pipeline.

More »

Expand

Fig 1.

Overview of the total pipeline.

The pipeline is separated in two: Data Standardization and Data Serving, with the common datastore connecting them. Lazy Loading refers to loading the preprocessed data from disk when it is needed during training.

More »

Expand

Table 2.

Analysis of Parquet, HDF5 and Pickle file formats, showing file size together with averages of peak memory and load time.

File sizes were measured once as the amount of space used on the system disk. Memory and load time experiments were repeated 100 times for each file format and the mean is reported in the table. Peak memory showed negligible variance. The files all contained the same data from CFS record ‘cfs-visit5–800002’.

More »

Expand

Table 3.

Description of actions in the template method pattern adaption for the data standardization procedure, shown in order of operation.

More »

Expand

Fig 2.

Class diagram depicting the inheritance hierarchy in CSDP.

The arrows point towards the parent class, which the child class inherits its functionality from. Abstract classes, abstract functions and abstract properties are shown with *.

More »