Using topic modeling via non-negative matrix factorization to identify relationships between genetic variants and disease phenotypes: A case study of Lipoprotein(a) (LPA)

doi:10.1371/journal.pone.0212112

Fig 1.

Illustration of topic modeling on EHRs using NMF.

More »

Expand

Fig 2.

Word clouds for six topics.

The size of the words (phecode) in each cloud indicates the weights of the phenotypes on the topic. Phenotypes with larger-sized words have greater influence on the topic compared to phenotypes with smaller-sized words. For each word cloud, we listed the top 60 words.

More »