Fig 1.
The number of total submissions in r/cholesterol each year from 2017 to 2022.
Fig 2.
The 11 mutually exclusive topics identified by using STM, in order of proportion.
Each subfigure is labeled with the topic number and its corresponding proportion. The bar plot visually represents the top 10 words with the highest probability for each topic.
Fig 3.
Mean of the semantic coherence and exclusivity for models with UMLS concept recognition for k values ranging from 2 to 6 before and after concept decomposition.
Fig 4.
The 3 topics identified by using STM (a) before and (b) after concept decomposition.
Each subfigure is labeled with the topic number and its corresponding proportion. The bar plot visually represents the top 10 UMLS concepts with the highest probability for each topic.
Fig 5.
Topic-document probability histograms for the highest 20% values in topic 1, topic 2, and topic 3 applied UMLS concept recognition before and after concept decomposition.
Table 1.
EHR validation for anxiety/depression of statin-use patients using data from VUMC and All of Us.
Fig 6.
t-SNE visualizations of TF-IDF matrix (a) before and (b) after concept decomposition.