Pathogen genetics is already a mainstay of public health investigation and control efforts; now advances in technology make it possible to investigate the role of human genetic variation in the epidemiology of infectious diseases. To describe trends in this field, we analyzed articles that were published from 2001 through 2010 and indexed by the HuGE Navigator, a curated online database of PubMed abstracts in human genome epidemiology. We extracted the principal findings from all meta-analyses and genome-wide association studies (GWAS) with an infectious disease-related outcome. Finally, we compared the representation of diseases in HuGE Navigator with their contributions to morbidity worldwide. We identified 3,730 articles on infectious diseases, including 27 meta-analyses and 23 GWAS. The number published each year increased from 148 in 2001 to 543 in 2010 but remained a small fraction (about 7%) of all studies in human genome epidemiology. Most articles were by authors from developed countries, but the percentage by authors from resource-limited countries increased from 9% to 25% during the period studied. The most commonly studied diseases were HIV/AIDS, tuberculosis, hepatitis B infection, hepatitis C infection, sepsis, and malaria. As genomic research methods become more affordable and accessible, population-based research on infectious diseases will be able to examine the role of variation in human as well as pathogen genomes. This approach offers new opportunities for understanding infectious disease susceptibility, severity, treatment, control, and prevention.
Citation: Rowell JL, Dowling NF, Yu W, Yesupriya A, Zhang L, Gwinn M (2012) Trends in Population-Based Studies of Human Genetics in Infectious Diseases. PLoS ONE 7(2): e25431. doi:10.1371/journal.pone.0025431
Editor: Cameron Neylon, Science and Technology Facilities Council, United Kingdom
Received: March 25, 2011; Accepted: September 5, 2011; Published: February 7, 2012
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: Jessica L Rowell is a fellow funded by Oak Ridge Institute for Science and Education (ORISE). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: M. Gwinn is a paid consultant to CDC via McKing Consulting Corporation. This does not alter the authors′ adherence to all the PLoS ONE policies on sharing data and materials, as detailed online in the guide for authors. M. Gwinn is also a member of the PLoS ONE Advisory Board and the PLoS Currents: Evidence for Genomic Tests Moderator Board.
Continually evolving human and environmental circumstances—including economic development, increased global travel and commerce, and demographic and behavioral changes—have contributed to the emergence of new infectious diseases and the re-emergence of existing ones . Ever-increasing global connectedness makes control of infectious diseases a global priority. Pathogen genomics has become a leading tool for identifying pathogens, tracking their spread and guiding public health interventions , .
Rapid advances in molecular technologies and informatics now allow researchers to study human as well as pathogen genetic variation in epidemiologic studies of infectious diseases. During the last decade, population-based research on host genetic factors has extended far beyond the traditional focus of such research on human leukocyte antigens (HLAs) . Although the pace of human gene discovery has been brisk for common chronic diseases and conditions, it has been slower for infectious diseases, which accounted for only 23 of the 978 genome-wide association studies (GWAS) published through 2010. We compiled and analyzed a comprehensive database of published studies in human genome epidemiology (HuGE) of infectious diseases to present a quantitative summary of the field, including its current scope, focus, and trends.
To assemble the set of published studies of human genetic associations related to infectious diseases, we used a comprehensive genetic association publications database, the Human Genome Epidemiology (HuGE) Literature Finder (http://www.hugenavigator.net). The HuGE Literature Finder contains articles specifically related to human genome epidemiology, including meta-analyses and systematic reviews, and also allows for filtering of these articles on a number of criteria, including study type (e.g., observational study, meta-analysis), category (e.g., pharmacogenomics, gene-environment interaction), gene, disease, and country of first author. Since 2001, the database has been updated weekly from PubMed (http://www.ncbi.nlm.nih.gov/pubmed) by a combination of automated and human curation procedures . This process has been shown to be highly sensitive (98.5%) and highly specific (97.5%) for retrieval of genetic association articles from PubMed .
To identify infectious disease-related articles in the HuGE Literature Finder database, we developed two queries based on medical subject heading (MeSH) terms, which are assigned by PubMed curators (http://www.nlm.nih.gov/mesh). One query used four specific MeSH terms (“bacterial diseases OR mycoses OR virus diseases OR parasitic diseases”) and one used two general MeSH terms (“infectious OR infection”). We compared the performance of the two queries in a subset of articles consisting of those published in 2005 and 2006. Most of the articles identified by the two queries were related to infectious diseases but the results overlapped by only 68%; therefore, we used both queries for our search.
We classified the subset of articles published between 2006 and 2010 (n = 2,456) into five categories based on the relationship between infection and the studied outcome: “infection as primary outcome” (the association of one or more human genetic variants with a specific infectious disease); “infection as a predisposing factor” (genetic susceptibility to a chronic condition given exposure to an infection); “infection as a complication” (genetic susceptibility to infection given a pre-existing chronic condition or predisposing event such as surgery or trauma); “genotype prevalence” (population prevalence of genotypes known to be associated with infectious diseases); and “pharmacogenomics in treatment of infection.”
To check the validity of our outcome classification by category, we selected a random 10% sample of the articles published in 2006–2008 using the RAND() function in Excel version 2007. In addition to the author, two reviewers (M. Gwinn and A. Yesupriya) independently classified the articles in this sample and any disagreements were resolved by discussion among the three reviewers.
We estimated the sensitivity of our combined query for infectious diseases by reviewing the title and abstract of every tenth article excluded by the query for the period from 2006 through 2009. We multiplied by 10 to estimate the numbers of false negative articles (i.e., related to infectious disease but missed by the query) and true negative articles.
All subsequent data analyses were performed with use of SAS version 9. We used a heuristic based on the textbook Genetic Susceptibility to Infectious Diseases to classify genes into functional categories (see Table S1) . We used the HuGE Literature Finder filter tool to identify infectious disease-related meta-analyses and extracted the results. We used the National Human Genome Research Institute's Catalog of Published Genome-Wide Association Studies (NHGRI Catalog) to identify all infectious disease-related GWAS published from 2005 through 2010 (including those for diseases with only a suspected infectious origin, such as Kawasaki disease). We extracted GWAS data directly from the NHGRI Catalog, which includes only associations with a reported p level of 1×10−5 or lower in the initial GWAS and replication populations, reported either separately or combined . A list of references for all meta-analyses and GWAS are found in the supplementary references.
To assess the alignment of research priorities with public health burden, we examined the correlation of publication frequency (a measure of research output) with disease-specific morbidity for 1) the six most frequently studied infectious diseases; 2) the five most frequently studied health conditions; and 3) the five leading causes of morbidity worldwide. In this analysis, we also included key chronic conditions often associated with one or more of the six most frequently studied infectious diseases: liver cirrhosis, liver cancer (hepatitis B and C infections), and gastric cancer (H. pylori infections). The measure of morbidity we used was disability-adjusted life years (DALYs), as calculated for the World Health Organization's Global burden of disease: 2004 update  (data file available here: http://www.who.int/healthinfo/global_burden_disease/estimates_regional/en/index.html). We chose morbidity instead of mortality because human genetic factors have been studied most frequently in infectious diseases with a chronic course.
Our queries selected 3,730 articles related to human genetic epidemiology of infectious diseases indexed by the HuGE Navigator from 2001–2010 . The relationship of infection to studied outcomes is summarized in Table 1. Approximately half of the articles focused on infection as the primary outcome; another 20% studied genetic associations with infections as predisposing factors for chronic conditions, such as liver fibrosis or cancer. Approximately 14% of the articles selected by the query were not related to infectious diseases—most often, their abstracts included the keywords “infectious” or “infection” in background sentences describing previous research. Thus, the estimated specificity of our query was 86%.
Classifying a systematic sample of articles according to outcome category, two independent reviewers agreed on 85% (116/137). Through a resolution process, the other 21 articles were finally classified as “Not related [to infectious disease]” (9 articles), “Infection as primary outcome” (6), “Pharmacogenomics in treatment of infection” (2), “Infection as a predisposing factor” (2), or “Infection as a complication” (2).
A systematic 10% sample of articles published in 2006–2009 that were not selected by our query contained 2,722 articles, including 21 articles related to infectious diseases. Thus, we estimated that the query had an overall sensitivity of 89%. The missed (false negative) articles included 10 on periodontitis, 4 on rhinosinusitis, and 2 each on sepsis, leprosy, and tuberculosis. None of these articles included the keywords “infectious” or “infection” in their titles or abstracts. Four missed articles were not indexed with MeSH terms, including one written in a language other than English. The remaining 17 articles were indexed with MeSH terms that referred either to infectious organisms or to clinical conditions such as “sinusitis” that are not cross-referenced with infectious diseases in the MeSH thesaurus.
The annual number of publications related to infectious diseases more than tripled from 148 in 2001 to 543 in 2010; however, the percentage of all articles in the HuGE Literature Finder database that were related to infectious diseases remained nearly constant (range, 5.9–7.3%) (Figure 1). The most commonly studied infectious diseases were human immunodeficiency virus/acquired immune deficiency syndrome (HIV/AIDS, 688 articles), hepatitis C virus (HCV, 410), Helicobacter pylori (H. pylori) infection (399), tuberculosis (289), hepatitis B virus (HBV, 285), sepsis (254), and malaria (199). Overall, cytokine receptor genes were the most frequently studied category. Among individual genes, TNF and HLA-DRB1 were studied most often, followed by IL10, CCR5, IL1B, and HLA-B; each of these genes appeared in more than 200 gene-disease association studies (Tables S2 and S3).
Selected results from the 27 published meta-analyses are summarized in Tables S4, S5, and S6. Six meta-analyses included cohort studies; of these, three were related to HIV/AIDS and were published in 2001 and 2003 (Table S5). Sixteen meta-analyses included only case-control studies (Table S4). Most of these had a combined sample size of more than 2,000 case subjects and more than 2,000 control subjects. The reported odds ratios (ORs) ranged from 1.09 to 2.58 for harmful effects and from 0.90 to 0.12 for protective effects. Of the five meta-analyses related to pharmacogenomics, three included clinical trials (Table S6). Three pharmacogenomics meta-analyses (one on anti-tuberculosis drug-induced hepatotoxicity and two on H. pylori eradication) produced statistically significant results, with ORs from 1.73 to 4.28.
Results reported from the 23 infectious disease-related GWAS are summarized in Table S7. Eight of these studies focused on HIV infection or progression to AIDS; four on treatment of HCV infection and viral clearance; one on the role of host genetics in determining susceptibility to atherosclerosis among HIV-infected men on highly active antiretroviral therapy (HAART); and three on chronic diseases with possible infectious agent origins (Kawasaki disease, nasopharyngeal carcinoma, and IgA nephropathy). The other seven GWAS focused on leprosy susceptibility, severe malaria in children, chronic hepatitis B infection, hepatocellular carcinoma (two studies), tuberculosis susceptibility, and meningococcal disease susceptibility. The distribution of effect sizes for significant GWAS results is shown in Figure 2. Approximately one-third of the ORs were between 1.0 and 1.5, one-third were between 1.51 and 2.0, and one-third were greater than 2.0.
One outlying association with an OR of 27.1 (IL28B and HCV treatment; Tanaka 2009) is not shown.
The five countries with the most publications from 2001 through 2010 were the United States, China, Japan, United Kingdom, and Germany (Figure 3). Together, they accounted for nearly half of all articles for which the first author's country of residence was known. Five additional countries each produced more than 100 publications during this period: Brazil (158 articles), Italy (157), Spain (140), India (131), and South Korea (103). The most frequently studied diseases varied considerably by country. In China, 145 of 524 (28%) focused on HBV; in the United States, 186 of 576 (33%) focused on HIV/AIDs; and in Japan, 109 of 314 (35%) focused on H. pylori infection.
Although infectious diseases are leading causes of morbidity worldwide—accounting for 280 million disability-adjusted life years lost (DALYs) annually—they are relatively underrepresented in human genome epidemiologic research (Figure 4). Only HIV/AIDS, which alone accounted for almost 55 million DALYs in 2004, was represented by more than 500 publications during 2001–2010. In contrast, the most frequently studied diseases together accounted for fewer than 82 million disability-adjusted life years (DALYs) lost in 2004.
Solid black circles represent the six most frequently studied infectious diseases; open black circles represent three chronic diseases that are often part of the natural course of infection with HBV, HCV, or H. Pylori. Gray diamonds represent the four most frequently studied diseases overall. Gray squares represent the four diseases with the greatest morbidity worldwide; the fifth is HIV/AIDS, represented with a blue dot. ⊕
Since the 19th century, scientists and clinicians have sought explanations for the extensive variation in clinical phenotypes among individuals infected by the same agent. Evidence has been mounting since the 1930s that human genetics may play an important role in this variation , . Results from several twin studies support the hypothesis that genetic factors contribute to variations in individual susceptibility to premature death from infectious diseases, as well as to variations in vaccine response . Results of a 2008 study of mortality data for multiple generations of Utah families provided convincing evidence of a heritable predisposition to death from influenza .
Results of the Human Genome Project and attendant developments in molecular technology and informatics have enhanced the study of human genetic factors in infectious disease at the population level. The HuGE Navigator database, which has grown rapidly since collection of published studies began in 2001, comprised more than 50,000 articles by the end of 2010. We found that the number of articles related to infectious diseases has increased at roughly the same pace as the total number of articles and still accounts for just 7% of the total.
Research in other fields has laid a strong foundation for exploring the role of human genetics in infectious diseases; in particular, research on immune processes has suggested many candidate genes for further study. We found more than 300 genes whose association with infectious diseases had been studied more than once. The genes most commonly studied were those encoding tumour necrosis factors, cytokine receptors, HLA Class II molecules, and chemokine receptors and their ligands.
Our review of published meta-analyses found many significant genetic associations with infectious diseases; however, statistically significant heterogeneity was found in half the studies that reported testing for it. This heterogeneity could reflect the effects of combining studies that were conducted in populations with different genetic backgrounds (ancestry), or that used different methods for genotyping (selecting and measuring genetic markers) or phenotyping (diagnosing infection and defining clinical outcomes).
The GWAS approach—based on hypothesis-free, systematic genome scanning—has uncovered additional candidate genetic associations with infectious diseases. Some are biologically plausible, such as the association of IL28B with spontaneous viral clearance in HCV infection . Others have implicated previously unexplored regions, such as 1p13.3, 9q23, and 8q22.3 in association with AIDS progression .
One possible reason for the relative scarcity of infectious disease-related GWAS is the challenge of obtaining large enough study populations with homogeneous phenotypes . For example, the first infectious disease-related GWAS was conducted with 486 HIV-infected patients selected from a potentially eligible group of 30,000 . This study identified two HLA-associated polymorphisms associated with HIV-1 control; however, their replication in an independent cohort did not meet the GWAS Catalog's criterion for statistical significance (p<10−5) —probably because the second cohort included only 140 patients. Both associations have been replicated in subsequent GWAS and confirmed by meta-analysis, which increases effective sample size by pooling the results of multiple studies.
Several other approaches have been proposed to discover additional genetic associations relevant to infectious diseases; these include systematic examinations of the entire major histocompatibility (MHC) region and of the set of approximately 1,000 genes involved in innate immunity . Fellay et al. recently suggested an approach for identifying rare variants (not detectable by GWAS) by whole-genome sequencing of a small sample for gene discovery, followed by testing of any associated variants in a larger cohort . Public health surveillance systems offer a potential source of such cohorts .
Human genome epidemiologic research on infectious diseases is a global enterprise. We found that the first authors of articles published from 2001–2010 were from 104 countries. Together, the United States, China, Japan, the United Kingdom, and Germany accounted for half of all publications, with China taking the lead after 2007. Our data provide only a minimum estimate of global research output in this field because they are derived from PubMed, which consists mostly of articles written in English.
The most frequently studied infections tended to vary by country, perhaps reflecting these countries' public health priorities. For example, approximately one-third of the articles from the United States, where more than 30,000 new HIV cases have been diagnosed each year since 2005, were related to HIV/AIDS . Nearly one-third of the articles from China, where approximately 8% of people are chronic HBV carriers, focused on HBV infection . Almost half of the articles by Japanese authors focused on health problems related to H. pylori infection, which is a significant public health concern in a country where gastric cancer rates are among the highest in the world .
HIV/AIDS was the most frequently studied infection and also the largest contributor to global morbidity from infectious diseases (about 55 million DALYs in 2004) . Tuberculosis and malaria each accounted for about 30 million DALYs—nearly twice the number attributed to any of the diseases studied most often for genetic associations (breast cancer, diabetes, Alzheimer's disease, schizophrenia, or lung cancer). In contrast to these adult-onset conditions, infectious diseases affect people of all ages, which accounts in part for their high impact when measured in DALYs.
Although developing countries have the highest rates of morbidity and mortality from infectious diseases, most lack the capacity to conduct human genomics research . In our analysis, Brazil and India ranked among the top 10 countries in numbers of publications; however, 66 of the 104 countries with authors in our database accounted for 10 or fewer articles. Although a few developing countries have built impressive biotechnology infrastructures, most have not, nor have they benefitted from genomic research conducted elsewhere . This lack of participation in genomic research of infectious diseases by countries with high rates of infectious disease indicates a need for a collaborative global effort to support the participation of limited-resource countries in such research. Such collaboration is also important for ethical reasons, so that countries participating in research also share in the benefits .
Analysis of pathogen genomics has become a mainstay of public health approaches to surveillance, investigation, and control of infectious diseases. For example, analysis of pathogen restriction-fragment length polymorphism has been used since the 1980s to identify epidemic strains and describe transmission patterns, and genetic changes in influenza viruses are being closely monitored for the emergence of strains with pandemic potential . Researchers are currently investigating the use of additional genomic techniques to improve surveillance of food-borne pathogens ,  and enhance food safety, e.g., by determining safe thresholds of contaminants for vulnerable population sub-groups . Advancements in informatics have led to the development of crucial resources such as continuously updated online databases; one example is the National Center for Biotechnology Information's Entrez Genome database, which contains complete sequence data for more than 1,000 microbes (http://www.ncbi.nlm.nih.gov/sites/entrez).
Studying the role of human genetics in infectious diseases offers new opportunities to understand the etiology and pathology of these diseases by exploring in more depth the determinants of variation in susceptibility, clinical course, and mortality . The path from gene discovery to public health benefit may be more clear-cut for infectious diseases than for many other health conditions ; for example, studies of the role of human genetics in infectious diseases have created the new field of vaccinomics, which focuses on predicting vaccine response and avoiding vaccine-related adverse events . Research on both human and pathogen genomes has the potential to identify novel vaccine candidates more quickly than traditional methods of vaccine candidate identification . Better understanding of host-pathogen genome interactions has also encouraged research in innovative therapies to limit and decrease the clinical severity of infections .
In our review of human genetic epidemiologic studies since 2001, we found that HIV/AIDS was the most commonly studied infectious disease. The search for human genetic variants that influence HIV infection actually began in the early 1980s, not long after the human immunodeficiency virus was identified. In 2004, Stephen J. O'Brien of the U.S. National Cancer Institute, a pioneer in this research, wrote, “Although AIDS is not generally considered a genetic disease, the considerable heterogeneity in the epidemic is at least partially determined by variants in genes that moderate virus replication and immunity” . CCR5 delta32, discovered in 1996, was only the first of many variants found in epidemiologic cohorts to be associated with HIV infection and AIDS progression. Discovery that an intact CCR5 receptor is an important co-factor in HIV infection has led to targeted drug and vaccine development efforts , .We found that 7% of articles on genetic associations published from 2001 through 2010 focused on infectious diseases—a disproportionately small fraction, given their public health importance. As genomic research methods become more affordable and accessible, human genome epidemiology will help increase our understanding of people's susceptibility to infectious diseases; the likely severity of these diseases; and how best to prevent, control, and treat them.
List of genes by category. Gene categories based on Kaslow, et al., 2008.
Gene-disease associations by gene category, 2001–2010. Gene categories based on Kaslow et al., 2008.
Number of associations for genes studied at least 50 times, by gene and gene category, 2001–2010. Gene categories based on Kaslow, et al., 2008.
Meta-analyses of case-control studies related to infectious diseases, 2001–2010.
Meta-analyses of cohort studies related to infectious diseases, 2001–2010.
Meta-analyses of pharmacogenomics studies related to infectious diseases, 2001–2010.
Genome-wide association studies related to infectious diseases, 2005–2010.
The authors would like to thank Melinda Clyne, the curator of the HuGE Navigator Literature Finder, for her valuable advice on identifying infectious disease-related articles. We would also like to thank Muin Khoury for sharing his insights and vision. The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.
Conceived and designed the experiments: JLR NFD WY AY LZ MG. Performed the experiments: JLR. Analyzed the data: JLR WY AY. Wrote the paper: JLR NFD MG.
- 1. Fauci AS (2006) Emerging and re-emerging infectious diseases: influenza as a prototype of the host-pathogen balancing act. Cell 124: 665–670.
- 2. Haagmans BL, Andeweg AC, Osterhaus AD (2009) The application of genomics to emerging zoonotic viral diseases. PLoS Pathog 5: e1000557.
- 3. Seib KL, Dougan G, Rappuoli R (2009) The key role of genomics in modern vaccine and drug design for emerging infectious diseases. PLoS Genet 5: e1000612.
- 4. Honey K (2009) Tales from the gene pool: a genomic view of infectious disease. J Clin Invest 119: 2452–2454.
- 5. Yu Wei, Yesupriya Ajay, Wulf Anja, Qu Junfeng, Khoury MuinJ, et al. (2007) An open source infrastructure for managing knowledge and finding potential collaborators in a domain-specific subset of PubMed, with an example from human genome epidemiology. BMC Bioinformatics 8: 436.
- 6. Yu W, Clyne M, Dolan SM, Yesupriya A, Wulf A, et al. (2008) GAPscreener: an automatic tool for screening human genetic association literature in PubMed using the support vector machine technique. BMC Bioinformatics 9: 205.
- 7. Kaslow R, McNicholl J, Hill A, editors. (2008) Genetic susceptibility to infectious diseases. New York: Oxford University Press, Inc. 464 p.
- 8. Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, et al. (2009) Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A 106: 9362–9367.
- 9. Mathers C, Fat DM, Boerma JT, World Health Organization (2008) The global burden of disease: 2004 update. 1 ed. Geneva: World Health Organization. Available: http://www.who.int/healthinfo/global_burden_disease/2004_report_update/en/index.html. Accessed 2012 Jan 4.
- 10. Yu W, Gwinn M, Clyne M, Yesupriya A, Khoury MJ (2008) A navigator for human genome epidemiology. Nat Genet 40: 124–125.
- 11. Alcais A, Abel L, Casanova JL (2009) Human genetics of infectious diseases: between proof of principle and paradigm. J Clin Invest 119: 2506–2514.
- 12. Casanova JL, Abel L (2007) Human genetics of infectious diseases: a unified theory. EMBO J 26: 915–922.
- 13. Burgner D, Jamieson SE, Blackwell JM (2006) Genetic susceptibility to infectious diseases: big is beautiful, but will bigger be even better? Lancet Infect Dis 6: 653–663.
- 14. Albright FS, Orlando P, Pavia AT, Jackson GG, Cannon Albright LA (2008) Evidence for a heritable predisposition to death due to influenza. J Infect Dis 197: 18–24.
- 15. Afdhal NH, McHutchison JG, Zeuzem S, Mangia A, Pawlotsky JM, et al. (2011) Hepatitis C pharmacogenetics: State of the art in 2010. Hepatology 53: 336–345.
- 16. Le Clerc S, Limou S, Coulonges C, Carpentier W, Dina C, et al. (2009) Genomewide association study of a rapid progression cohort identifies new susceptibility alleles for AIDS (ANRS Genomewide Association Study 03). J Infect Dis 200: 1194–1201.
- 17. Bowcock AM (2010) Genome-wide association studies and infectious disease. Crit Rev Immunol 30: 305–309.
- 18. Fellay J, Shianna KV, Ge D, Colombo S, Ledergerber B, et al. (2007) A whole-genome association study of major determinants for host control of HIV-1. Science 317: 944–947.
- 19. de Bakker PI, Telenti A (2010) Infectious diseases not immune to genome-wide association. Nat Genet 42: 731–732.
- 20. Fellay J, Shianna KV, Telenti A, Goldstein DB (2010) Host genetics and HIV-1: the final phase? PLoS Pathog 6: e1001033.
- 21. Crawford Dana C (2008) Integrating Host Genomics with Surveillance for Invasive Bacterial Diseases. Emerging Infectious Diseases 14: 1138–1140.
- 22. Centers for Disease Control and Prevention (June 2010) HIV Surveillance Report, 2008. Volume 20. Available: http://www.cdc.gov/hiv/surveillance/resources/reports/2008report/. Accessed 2012 Jan 4.
- 23. WHO Regional Office for the Western Pacific (2006) Preventing Mother-to-Child Transmission of Hepatitis B: Operational Field Guidelines for Delivery of the Birth Dose of Hepatitis B Vaccine. Geneva: World Health Organization. Available: http://www.wpro.who.int/internet/resources.ashx/EPI/docs/HepB/HepBBirthDoseFieldGuidelines.pdf. Accessed 2012 Jan 4.
- 24. Inoue M, Tsugane S (2005) Epidemiology of gastric cancer in Japan. Postgrad Med J 81: 419–424.
- 25. Singer PA, Daar AS (2001) Harnessing genomics and biotechnology to improve global health equity. Science 294: 87–89.
- 26. Wonkam A, Kenfack MA, Muna WF, Ouwe-Missi-Oukem-Boyer O (2011) Ethics of human genetics studies in sub-saharan Africa: the case of Cameroon through a bibliometric analysis. Dev World Bioeth.
- 27. Dowling NF, Gwinn M, Mawle A (2009) Human genomics and preparedness for infectious threats. Genome Med 1: 119.
- 28. Foley SL, Lynne AM, Nayak R (2009) Molecular typing methodologies for microbial source tracking and epidemiological investigations of Gram-negative bacterial foodborne pathogens. Infect Genet Evol 9: 430–440.
- 29. Withee J, Dearfield KL (2007) Genomics-based food-borne pathogen testing and diagnostics: possibilities for the U.S. Department of Agriculture's Food Safety and Inspection Service. Environ Mol Mutagen 48: 363–368.
- 30. Davila S, Hibberd ML (2009) Genome-wide association studies are coming for human infectious diseases. Genome Med 1: 19.
- 31. Rinaudo CD, Telford JL, Rappuoli R, Seib KL (2009) Vaccinology in the genome era. J Clin Invest 119: 2515–2525.
- 32. Kellam P, Weiss RA (2006) Infectogenomics: insights from the host genome into infectious diseases. Cell 124: 695–697.
- 33. O'Brien SJ, Nelson GW (2004) Human genes that limit AIDS. Nat Genet 36: 565–574.
- 34. Lopalco Lucia (2010) CCR5: From Natural Resistance to a New Anti-HIV Strategy. Viruses 2: 26.