A Scalable Data Access Layer to Manage Structured Heterogeneous Biomedical Data | PLOS One

Advertisement

Browse Subject Areas

?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Fig 1 — Fig 1.

Example of slides acquired for digital pathology.
From left to right: a tissue sample extracted from a patient is divided into multiple specimens; these are fixed into containers, blocks of formalin or paraffin; in turn the blocks are sliced. The produced slides can have one specimen per container, multiple items from the same block or items from different parts in the same block. For Figure credits see the Section Acknowledgments.

More »

Fig 2 — Fig 2.

Number of common variants produced by different pipelines given identical input.
In the Figure, the underlined text indicates the versions of: the Burrows-Wheeler Aligner(BWA), the Broad Institute’s Genome Analysis Toolkit (GATK) Unified Genotyper(UG) and human genome reference(Hg). For Figure credits see the Section Acknowledgments.

More »

Fig 3 — Fig 3.

Mind map of a set of annotations produced by a variant calling procedure.
Dashed lines highlight recurrent blocks.

More »

Fig 4 — Fig 4.

The openEHR two-level separation.
Relationship between the Reference Model, on the left, and the Archetype Model, on the right.

More »

Fig 5 — Fig 5.

Example of an archetype expressed in ADL.
The archetype defines a generic sports racket.

More »

Fig 6 — Fig 6.

Example of an archetype query in AQL.

More »

Fig 7 — Fig 7.

Example of an archetype, composed of other archetypes, shown as a tree structure.
Different shapes/capital letters mean different archetypes. The text “0..*” shows the cardinality of the relation (i.e., from 0 to any number of the related archetypes).

More »

Fig 8 — Fig 8.

Some possible instances of the archetype illustrated in Fig 7.
The number after the capital letter denotes a particular instance of the given archetype.

More »

Fig 9 — Fig 9.

Tree view of an archetype data structure with multiple nested archetypes.

More »

Fig 10 — Fig 10.

AQL query for the data structure in Fig 9.

More »

Fig 11 — Fig 11.

PyEHR architecture: main modules, databases and their interactions.

More »

Fig 12 — Fig 12.

PyEHR architecture: Data Management System.

More »

Fig 13 — Fig 13.

Code snippet from the Driver Factory Class.
The actual driver factory class; the lines at the bottom refer to the drivers for the MongoDB and Elasticsearch databases.

More »

Fig 14 — Fig 14.

Main driver API tasks.

More »

Fig 15 — Fig 15.

PyEHR: Query flow.
The query evaluation process starts from nine o’ clock and proceeds clockwise.

More »

Fig 16 — Fig 16.

JSON view of a generic EHR structure.
The part on the left is the magnified view of the small region of the whole structure, shown on the top right.

More »

Fig 17 — Fig 17.

Graph representation of some EHR structures generated.
Archetypes are shown as circles in a tree structure.

More »

Fig 18 — Fig 18.

Structures relative depth distribution on the created EHRs.
On the right the archetype maximum depth vs the relative structure frequency. On the left the overall maximum depth vs the relative structure frequency.

More »

Fig 19 — Fig 19.

Structures “max width” distribution on the created EHRs.

More »

Fig 20 — Fig 20.

Type 3 query example with two levels of containment.
The number of CONTAINS clauses gives away the query level.

More »

Fig 21 — Fig 21.

Apache Hadoop MapReduce, CNR. Query time vs number of nodes for a type 1 count query at different levels.
Each point is the mean query time, while the bar shows the standard deviation of the measurements. At the scale shown the points and the bars are almost indistinguishable.

More »

Fig 22 — Fig 22.

Apache Hadoop MapReduce, CNR. “Spread” curves for a type 1 count query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements. At the scale shown the points and the bars are almost indistinguishable.

More »

Fig 23 — Fig 23.

PyEHR with MongoDB, CNR. Query time vs number of nodes for a type 1 count query at different levels.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 24 — Fig 24.

PyEHR with MongoDB, CNR. Query time vs number of nodes for different types of count query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 25 — Fig 25.

PyEHR with MongoDB, CNR. Query time spent in the index lookup, count and fetch phases while performing a type 3 query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 26 — Fig 26.

PyEHR with MongoDB, CNR. “Spread” curves for a type 2 count query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 27 — Fig 27.

PyEHR with Elasticsearch, CNR. Query time vs number of nodes for a type 1 count query at different levels.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 28 — Fig 28.

PyEHR with Elasticsearch, CNR. Query time vs number of nodes for different types of count query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 29 — Fig 29.

PyEHR with Elasticsearch, CNR. Query time spent in the index lookup, count and fetch phases in a type 3 query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 30 — Fig 30.

PyEHR with Elasticsearch, CNR. “Spread” curves for a type 2 count query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 31 — Fig 31.

Apache Hadoop MapReduce, CL. Query time for type 1 count query at different levels.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 32 — Fig 32.

PyEHR with MongoDB, CL. Query time for type 1 count query at different levels.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 33 — Fig 33.

PyEHR with MongoDB, CL. Query time for different type of count queries at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 34 — Fig 34.

PyEHR with MongoDB, CL. Query time spent in the index lookup, count and fetch phases in a type 3 query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 35 — Fig 35.

PyEHR with Elasticsearch, CL. Query time for type 1 count query at different levels.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 36 — Fig 36.

PyEHR with Elasticsearch, CL. Query time for different type of count queries at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 37 — Fig 37.

PyEHR with Elasticsearch, CL. Query time spent in the index lookup, count and fetch phases in a type 3 query at level 5.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 38 — Fig 38.

CNR. Results comparing performance for a type 1 query at level 3.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »

Fig 39 — Fig 39.

CL. Results comparing performance for a type 1 query at level 3.
Each point is the mean query time, while the bar shows the standard deviation of the measurements.

More »