Arioc: High-concurrency short-read alignment on multiple GPUs

doi:10.1371/journal.pcbi.1008383

Fig 2

Speed versus sensitivity for three GPU memory-layout techniques.

Speed (reads/second) is greater when the GPU peer-to-peer (P2P) memory interconnect is used, across a range of sensitivity (% concordant) settings. Speeds are highest when the H and J tables are partitioned across device RAM on all available GPUs and peer-to-peer memory accesses use the direct P2P interconnect. Speeds are lower with H in device RAM on each GPU and J in page-locked system RAM. Speeds are lowest when H and J both reside in page-locked system RAM. H table: 25 GB; J table: 52 GB. Data from SRR6020688 (S1 Data).
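
To make the three layouts concrete, the following is a minimal sketch of the memory footprint each one implies, using only the table sizes given in the caption. The function name, the layout labels, the even split across GPUs, and the GPU count of 4 in the usage lines are illustrative assumptions, not details taken from the figure or from Arioc itself.

```python
# Table sizes from the figure caption, in GB.
H_TABLE_GB = 25
J_TABLE_GB = 52


def memory_footprint(layout: str, n_gpus: int) -> dict:
    """Estimate per-GPU device RAM and page-locked host RAM for one layout.

    Layout labels are hypothetical names for the three techniques in the caption:
      "partitioned_p2p" : H and J partitioned across device RAM on all GPUs
      "h_device_j_host" : full copy of H on each GPU, J in page-locked system RAM
      "both_host"       : H and J both in page-locked system RAM
    Assumes the partitioned tables are split evenly across GPUs.
    """
    if n_gpus < 1:
        raise ValueError("n_gpus must be at least 1")

    if layout == "partitioned_p2p":
        device_gb = (H_TABLE_GB + J_TABLE_GB) / n_gpus
        host_gb = 0
    elif layout == "h_device_j_host":
        device_gb = H_TABLE_GB
        host_gb = J_TABLE_GB
    elif layout == "both_host":
        device_gb = 0
        host_gb = H_TABLE_GB + J_TABLE_GB
    else:
        raise ValueError(f"unknown layout: {layout}")

    return {"device_gb_per_gpu": device_gb, "host_pinned_gb": host_gb}


if __name__ == "__main__":
    # The GPU count here is an arbitrary example value, not from the caption.
    for name in ("partitioned_p2p", "h_device_j_host", "both_host"):
        print(name, memory_footprint(name, n_gpus=4))
```

The sketch only accounts for the H and J tables; buffers for reads, reference data, and alignment work are not modeled.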

doi: https://doi.org/10.1371/journal.pcbi.1008383.g002