Impact factor (IF) is a commonly used surrogate for assessing the scientific quality of journals and articles. There is growing discontent in the medical community with the use of this quality assessment tool because of its many inherent limitations. To help address such concerns, Eigenfactor (ES) and Article Influence scores (AIS) have been devised to assess scientific impact of journals. The principal aim was to compare the temporal trends in IF, ES, and AIS on the rank order of leading medical journals over time.
The 2001 to 2008 IF, ES, AIS, and number of citable items (CI) of 35 leading medical journals were collected from the Institute for Scientific Information (ISI) and the http://www.eigenfactor.org databases. The journals were ranked based on the published 2008 ES, AIS, and IF scores. Temporal score trends and variations were analyzed.
In general, the AIS and IF values provided similar rank orders. Using ES values resulted in large changes in the rank orders, with higher rankings assigned to journals that publish a large volume of articles. Since 2001, the IF and AIS of most journals increased significantly; however, the ES increased in only 51% of the journals in the analysis. Conversely, 26% of journals experienced a downward trend in their ES, while the rest (23%) experienced no significant change. This discordance between temporal trends in IF and ES was largely driven by temporal changes in the number of CI published by the journals.
The rank order of medical journals changes depending on whether IF, AIS or ES is used. All of these metrics are sensitive to the number of citable items published by journals. Consumers should thus consider all of these metrics rather than just IF alone in assessing the influence and importance of medical journals in their respective disciplines.
Citation: Rizkallah J, Sin DD (2010) Integrative Approach to Quality Assessment of Medical Journals Using Impact Factor, Eigenfactor, and Article Influence Scores. PLoS ONE 5(4): e10204. doi:10.1371/journal.pone.0010204
Editor: Johan Bollen, Indiana University - Bloomington, United States of America
Received: August 12, 2009; Accepted: March 16, 2010; Published: April 15, 2010
Copyright: © 2010 Rizkallah, Sin. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Funder: Michael Smith Foundation for Health Research (a government organization). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The impact factor (IF), which is a score calculated each year by the Institute for Scientific Information (ISI), is widely considered one of the leading proxies for evaluating the quality, importance, and influence of medical journals in their respective disciplines (Science Citation Index, Journal Citation Report. Institute for Scientific Information, www.isinet.com). Medical editors frequently use the IF as a performance index of their journal and a means of ranking their journals relative to their peers. Some journals use the IF to “advertise” their quality and to entice potential authors into submitting high-quality papers to them. Promotion committees of academic institutions commonly use the IF to judge the quality of publications of applicants for promotion and tenure, and departmental chairs may use it in the hiring and assessment of new recruits. Increasingly, however, there is growing discontent with the IF as a tool for determining the “quality” and “prestige” of journals. One reason is that the distribution of citations is highly skewed, with fewer than 20% of the articles accounting for more than 50% of the total number of citations of journals and with many articles never receiving any citations. Moreover, IF only counts the number of citations without taking into account the source of the citations (i.e. citations from prestigious journals are worth no more than citations from lower-tier journals) and makes no allowance for the “citation culture” between journals and across disciplines. It is also now well recognized that a journal's IF can be increased by reducing the number of original research papers, by increasing the number of editorials (which are not counted in the denominator of IF) and review papers (which receive on average twice as many citations as original articles), and by encouraging self-citations.
Original research papers, however, are the main “engines” of generating new knowledge and, by decreasing their publication rate, journals may be impeding the dissemination of scientific knowledge and curtailing scientific discourse. Over time, this may increase the IF but paradoxically reduce the overall influence of these journals on the scientific community as fewer scientists and clinicians read the journal. To address these and other concerns with the IF, other instruments, including those that take into account the quality as well as the quantity of citations, have been proposed. This concept was first proposed by Pinski and Narin, who suggested that journals should be ranked according to their eigenvector centrality in a citation network. With the recent success of Google's ranking system for web pages, this concept has been modified to include algorithms based on a PageRank system. Although there are several different algorithms in use, the two that have gained the most attention in recent years are Scimago Journal Rank (SJR) (http://www.scimagojr.com/index.php) and Eigenfactor score (ES) (http://eigenfactor.org/), both of which use an iterative weighting system to calculate a summary index that reflects both the “quality” and the “quantity” of citations received by these journals based on a PageRank algorithm. Despite the differences in the way in which weight-based and non-weight-based methods are derived, studies have shown that in any given year, scores based on a PageRank algorithm correlate well with those based on traditional IF and produce a similar rank order of medical journals. However, it is not known whether the temporal trends in these scores produce similar or differential rank orders of these journals. Since ES is at least in part dependent on the number of citable items published by journals in any given year, it is possible for a journal to increase its IF by reducing its publication rate without changing its ES (and vice versa).
Thus, the primary aim of the present study was to determine the changes in IF and ES across the major general and sub-specialty medical journals over the past 8 years.
Selection of Journals
We decided a priori to evaluate the temporal trends in the impact factor (IF) and Eigenfactor Score (ES) in 35 general and subspecialty clinical journals between 2001 and 2008. We chose this timeframe to mitigate the influence of name changes of journals in the IF and ES calculations and to ensure comparability of data across the journals. To ensure reasonable representation of journals from each discipline, we chose the three most highly ranked journals per discipline as determined by the 2008 IF, except for respiratory medicine and endocrinology, in which four rather than three journals were selected. We did this to mitigate the potential effect of overlap of content and audience of journals in the “respiratory system” and “critical care medicine” categories (e.g. the American Journal of Respiratory and Critical Care Medicine is listed in both categories) and to ensure adequate representation of non-diabetic papers (and audience) in “endocrinology”, as the top two journals under this category were diabetes-focused (Diabetes and Diabetes Care). From the Thomson Reuters Journal Citation Reports (http://admin-apps.isiknowledge.com/JCR/JCR?PointOfEntry=Home&SID=3EIG4M34Amad6eKDPA) and the Eigenfactor.org (http://eigenfactor.org/) websites, two independent reviewers (JR, DS) abstracted data on the IF, ES, citable items and Article Influence Score (AIS) for these journals. The data were imported into an Excel spreadsheet and any disagreements were resolved by iteration and consensus.
The journals that were evaluated included Annals of Neurology (Ann Neurol), Annals of the Rheumatic Diseases (Ann Rheum Dis), Arthritis and Rheumatism (Art Rheum/Ar C Res), Brain, Circulation, Clinical Infectious Diseases (Clin Inf Dis), Diabetes, Diabetes Care, European Heart Journal (Eur Heart J), Gastroenterology, Gut, Hepatology, Intensive Care Medicine (Intens Care Med), Journal of the American Medical Association (JAMA), Journal of the American College of Cardiology (J Am Coll Cardiol), Journal of Bone and Mineral Research (J Bone Miner Res), Journal of Clinical Endocrinology and Metabolism (J Clin Endocr Metab), Journal of Clinical Oncology (J Clin Oncol), Journal of Infectious Diseases (J Infect Dis), Journal of the National Cancer Institute (J Natl Cancer I), Journal of Neuroscience (J Neurosci), Lancet, Lancet Infectious Diseases (Lancet Infect Dis), Lancet Oncology (Lancet Oncol), New England Journal of Medicine (N Engl J Med), Rheumatology, Allergy, American Journal of Respiratory and Critical Care Medicine (Am J Respir Crit Care Med), Clinical and Experimental Allergy (Clin Exp Allergy), Chest, Critical Care, Critical Care Medicine (Crit Care Med), European Respiratory Journal (Eur Resp J), Journal of Allergy and Clinical Immunology (J Allergy Clin Immunol), and Thorax. We did not include any non-clinical journals.
The IF is published by ISI each year for all indexed journals and is calculated based on a three-year period. It reflects the average number of times that papers are cited up to two years following publication. For example, the 2009 IF for a journal would be calculated by taking the number of times articles (original, reviews, proceedings or notes) published in 2007 and 2008 were cited in 2009 and dividing this number by the total number of articles, reviews, proceedings, guidelines or consensus statements that were published in this journal in 2007 and 2008. Editorials and letters to the editors are generally excluded from the denominator but can be counted in the numerator of the impact factor. In general, review articles, consensus statements and clinical guidelines are cited more frequently than original articles.
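The calculation described above can be illustrated with a minimal sketch. The function name and the citation and item counts are hypothetical and chosen only to show how a smaller denominator raises the IF when citations stay constant; they are not data from this study.

```python
def impact_factor(citations_to_prior_two_years: int,
                  citable_items_prior_two_years: int) -> float:
    """Citations received this year to items from the two prior years,
    divided by the citable items published in those two years."""
    if citable_items_prior_two_years <= 0:
        raise ValueError("the number of citable items must be positive")
    return citations_to_prior_two_years / citable_items_prior_two_years


# Hypothetical journal: 3000 citations in 2009 to items published in 2007-2008.
baseline = impact_factor(3000, 600)     # 600 citable items in the denominator
fewer_items = impact_factor(3000, 500)  # same citations, fewer citable items

print(baseline, fewer_items)
```

With the same number of citations, cutting the citable items from 600 to 500 raises the hypothetical IF from 5.0 to 6.0.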
Eigenfactor Score (ES)
For each of these journals, we retrieved data on the ES from http://www.eigenfactor.org. ES is calculated using a complex algorithm that takes into account not only the quantity of citations but also their “quality” by assigning weights to the source of the citations. The full details of the algorithm can be found at http://www.eigenfactor.org/methods.htm. In brief, the algorithm assigns quality scores to journals by creating a citation network in which journal articles are first randomly selected. The citation lists from these retrieved articles are then used by the network to select the next set of journals. The citation lists from this batch of journals are then used by the network to select the third set of journals. This process continues indefinitely, creating a hierarchical ranking of journals based on the frequency of citations. The network assumes that journals that are highly cited are of high quality, while those that are infrequently cited are of lower quality. Importantly, the ES has no denominator. Thus, journals that publish many articles have a higher ES than those that publish very few articles if the average quality of the published articles is similar between these journals.
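The iterative weighting idea can be sketched with a simplified PageRank-style power iteration. This is an illustration only and not the published Eigenfactor algorithm, whose details are given on the methods page cited above; the function name, the damping value of 0.85, the fixed iteration count, and the three-journal citation matrix are all assumptions made for the example.

```python
def influence_scores(citations, damping=0.85, iterations=100):
    """Simplified PageRank-style scores for a journal citation network.

    citations[i][j] is the number of citations from journal i to journal j.
    Self-citations are ignored. Returned scores sum to 1.
    """
    n = len(citations)
    scores = [1.0 / n] * n
    for _ in range(iterations):
        # Every journal receives a small baseline share on each round.
        new_scores = [(1.0 - damping) / n] * n
        for i in range(n):
            row = [0 if j == i else citations[i][j] for j in range(n)]
            total = sum(row)
            for j in range(n):
                # A journal that cites nobody spreads its weight evenly.
                share = row[j] / total if total > 0 else 1.0 / n
                new_scores[j] += damping * scores[i] * share
        scores = new_scores
    return scores


# Hypothetical network: journals 0 and 2 cite only journal 1,
# and journal 1 splits its citations between journals 0 and 2.
example = [
    [0, 10, 0],
    [5, 0, 5],
    [0, 10, 0],
]
scores = influence_scores(example)
print(scores)
```

In this toy network, journal 1 ends up with the highest score because it receives all outgoing citations from the other two journals. Because the score is a total rather than an average, nothing in it is divided by the number of articles a journal publishes.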
Article Influence Score (AIS)
Article Influence™ Score (AIS) is derived from ES and is conceptually similar to IF in that there is a numerator as well as a denominator (i.e. number of citable papers), except that it uses ES (rather than the total number of citations) as the numerator. Thus, unlike IF, where all citations are counted equally regardless of their source, in AIS each citation is multiplied by the “quality” of the citing journal, resulting in greater weight for citations that come from highly cited journals and less weight for those from poorly cited journals. To facilitate interpretation, the AIS is normalized so that the mean article in the Journal Citation Reports® has an AIS of 1.00.
The journals were ranked based on the published 2008 ES, AIS, and IF scores. We also retrieved the 2001 to 2008 ES, AIS, IF scores, and number of citable items (CI) in order to determine the temporal trends in these values. The statistical significance of the temporal trends was determined using a chi-square test for trend. A p-value of less than 0.05 was considered statistically significant. All analyses were conducted using SAS version 9.1 (Cary, N.C.).
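The relationship between ES and AIS can be sketched as follows. The formula used here (0.01 × ES divided by the journal's share of all articles) reflects our reading of the Eigenfactor methods description and should be treated as an assumption; the function name and the input values are hypothetical.

```python
def article_influence(eigenfactor_score: float,
                      journal_articles: int,
                      all_articles: int) -> float:
    """AIS sketch: the journal's fraction of total influence divided by
    its fraction of all articles (assumed form of the normalization)."""
    if journal_articles <= 0 or all_articles <= 0:
        raise ValueError("article counts must be positive")
    influence_fraction = 0.01 * eigenfactor_score  # ES is expressed as a percentage
    article_fraction = journal_articles / all_articles
    return influence_fraction / article_fraction


# Hypothetical journal holding 0.5% of total influence and 0.2% of all articles.
ais = article_influence(0.5, journal_articles=2_000, all_articles=1_000_000)
print(round(ais, 2))
```

Under this form, a journal whose share of influence equals its share of articles scores 1.00, which matches the normalization described above.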
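As a rough illustration of what a temporal trend means here, the direction of change in a score over 2001 to 2008 can be summarized by an ordinary least-squares slope. This is a simpler stand-in and not the chi-square test for trend used in the study; the function name and the yearly values are hypothetical.

```python
def ols_slope(years, values):
    """Least-squares slope of values regressed on years."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    return sxy / sxx


years = list(range(2001, 2009))
# Hypothetical IF series rising by 0.5 per year from 10.0.
hypothetical_if = [10.0 + 0.5 * (year - 2001) for year in years]

slope = ols_slope(years, hypothetical_if)
print(round(slope, 3))
```

A positive slope corresponds to an upward trend and a negative slope to a downward one; whether the trend is statistically significant requires a formal test such as the one named above.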
2008 AIS, IF, and ES
The 2008 ES, AIS, and IF values of selected medical journals are shown in Table 1. Of the evaluated journals, the overall leader was the New England Journal of Medicine irrespective of the metric used to measure quality. However, the rankings for the remaining journals changed depending on the score that was used. For instance, using the traditional IF score, the 2nd leading journal in 2008 was JAMA, followed by the Lancet, J Clin Oncol and J Natl Cancer I. In general, the AIS and IF values provided similar rank orders, with a few notable exceptions including J Neurosci, which was ranked 11th based on AIS and 20th based on IF, and J Allergy Clin Immunol, which was ranked 20th based on AIS and 14th based on IF.
Using ES values resulted in large changes in the rank order of the selected journals. While the N Engl J Med retained the top spot, J Neurosci took over 2nd spot on the list, followed by Circulation, Lancet and JAMA. In general, journals that published a lot of papers had higher ES values than journals that published small volumes of papers (figure 1). For instance, Lancet Oncol, which was ranked 8th on the AIS and 7th on the IF lists, was ranked only 30th on the ES list. On the other hand, J Clin Endocr Metab, which ranked 27th on both the AIS and IF lists, was ranked 8th on the ES list.
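The way rank order shifts with the choice of metric can be shown with a small sketch. The three journals and all of their values are hypothetical and are not taken from Table 1; they only mimic the pattern in which a high-volume journal moves up under ES.

```python
# Hypothetical journals: B publishes far more items than A or C.
journals = {
    "Journal A": {"IF": 30.0, "ES": 0.20, "items": 300},
    "Journal B": {"IF": 10.0, "ES": 0.45, "items": 1500},
    "Journal C": {"IF": 15.0, "ES": 0.05, "items": 150},
}


def rank_by(metric):
    """Journal names ordered from highest to lowest on the given metric."""
    return sorted(journals, key=lambda name: journals[name][metric], reverse=True)


print("By IF:", rank_by("IF"))
print("By ES:", rank_by("ES"))
```

In this toy data set the high-volume Journal B is last by IF but first by ES, while the low-volume Journal C drops from second to last.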
Figure 1. The area of each circle is proportional to the number of citable items published in 2008. The area enclosed by the dotted line is expanded in figure 1B. R² = 0.5721; p<0.0001.
Both the IF and CI correlated significantly with the 2008 ES values (p<0.0001 for both). The partial R² value for IF was 0.5721 and that for CI was 0.3678. Thus, collectively, they accounted for 94% of the variance in the 2008 ES values. As the IF values increased, so did the ES values (see figure 1A and 1B). In general, however, journals with a high number of citable items displayed higher ES values than those that had a small number of citable items.
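The 94% figure follows from adding the two partial values reported above, as the short check below shows. The variable names are ours; the two numbers are those given in the text.

```python
partial_if = 0.5721  # share of ES variance attributed to IF
partial_ci = 0.3678  # additional share attributed to citable items

combined = partial_if + partial_ci
print(f"{combined:.4f} ({combined:.0%})")
```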
Trends in IF, ES, AIS, and CI Between 2001 and 2008
Since 2001, the IF of 77% (27/35) of the journals included in this analysis increased significantly (Table 2). Only J Neurosci experienced a significant decline in IF. In the remaining journals, the IF did not change significantly over time. In contrast, only 51% (18/35) of the journals increased their ES values over the 8 years, while 26% (9/35) of the journals experienced a decline in their ES values (Table 3). The discordance between the temporal trends in IF and ES was largely driven by the temporal changes in the number of citable items published by each of the journals (see figure 2A). In 20% of the journals, the number of citable items increased and in another 20% the number of citable items decreased over time. In the remaining 60%, the number of citable items did not change significantly (Table 4; figure 2A). In general, as the number of citable items decreased, the IF of the journals increased, though this relationship did not reach statistical significance (p = 0.132), largely due to the extreme effect of the New England Journal of Medicine, whose IF score increased by 21 in the absence of any significant change in the number of citable items over the 8 years of the study. The removal of the New England Journal of Medicine from this analysis, however, led to a significant relationship between the temporal trends in CI and IF (figure 2B; p = 0.05). There were journals whose IF score and number of citable items both increased during this period of time (see Tables 2 and 4). These included the European Heart Journal, Brain, Rheumatology, and Critical Care.
On the other hand, journals such as Intensive Care Medicine, the Journal of the American College of Cardiology, Diabetes Care, the Journal of Infectious Diseases, Hepatology, Annals of the Rheumatic Diseases, Chest, Allergy, the European Respiratory Journal, Critical Care Medicine, Lancet Oncology, Clinical Infectious Diseases, the Journal of Clinical Oncology, Lancet Infectious Diseases, the Journal of Allergy and Clinical Immunology, and the New England Journal of Medicine increased their IF without significantly changing the number of citable items that were published per year. Conversely, a few journals such as the Lancet, Circulation, the American Journal of Respiratory and Critical Care Medicine, the Journal of the American Medical Association, Gut, and Thorax increased their IF but at the same time decreased the number of citable items published per year. Interestingly, some journals such as the Journal of Bone and Mineral Research and the Journal of the National Cancer Institute reduced the number of citable items without experiencing an increase in their IF. The temporal trends in AIS were similar to those of IF: 66% of the journals experienced an increase in AIS, while 6% experienced a decline (Table 5).
Figure 2. R² = 0.1957; p = 0.0099 for the relationship between changes in citable items and changes in Eigenfactor score and R² = 0.1216; p = 0.0505* for the relationship between changes in citable items and changes in the impact factor. *The New England Journal of Medicine was excluded from the regression analysis, as it was an extreme outlier.
There is no universally accepted metric for assessing the “quality” and “influence” of journals in the scientific community. In the Journal Citation Reports, ISI provides several attributes for assessing quality including total citations, IF, ES, and AIS. Of these, the most widely used metric is the IF. However, the major shortcoming of IF is that it is sensitive to the number of original research papers published per year. Because review papers and guidelines in general have a higher citation index than original papers, journals can increase their IF by publishing fewer original papers (and more review papers). Paradoxically, however, because original research is the primary engine for generating new scientific knowledge (or validating existing knowledge), by reducing the publication rate of research articles, journals' influence on the scientific discourse of their discipline may decrease. ES is an attempt to capture the “influence” of medical journals on the scientific discourse generated in their respective fields. The present study indicates that over the past 8 years most medical journals (77% of those evaluated in this study) have increased their IF. However, 26% of the journals have experienced a paradoxical reduction in their ES during this period of time, associated with a decrease in the number of citable items that were published per year in these journals. Interestingly and provocatively, many journals that fall into this category were those with a very high IF such as the Lancet, Circulation, the American Journal of Respiratory and Critical Care Medicine and the Journal of the American Medical Association. Notable exceptions were the New England Journal of Medicine and the Journal of Clinical Oncology, both of which experienced a dramatic increase in their IF without any significant change in their publication rate of citable items.
We also found that there were journals which increased their IF, ES, as well as the number of citable items published per year. These included the European Heart Journal and Critical Care. Other journals have increased or maintained their IF without decreasing the number of papers published per year or sacrificing their ES values over time. These data indicate that IF and ES in particular can produce dissimilar results and thus highlight the importance of using multiple metrics rather than just one in assessing the performance of journals and the impact and influence they have on their respective fields of study.
Our data are consistent with those of Chew et al, who showed that the IF of the seven top-ranked general medical journals rose considerably between 1994 and 2005 but the denominators (i.e. the number of citable items per year) either fell or remained constant. Our data are also consistent with those of Bollen et al, who showed that the concept of scientific impact is multi-dimensional and cannot be adequately captured by IF alone, and that both usage and citation based measures are needed to understand the scientific impact of journals. It should also be noted that although IF and AIS values are calculated differently, in our analysis they nonetheless produced a similar rank order of journals, suggesting that the weighting system of AIS does not significantly modify the performance status of the journals. The major discrepancies occurred only when the denominator of AIS was removed (yielding ES values), which highlights the importance of the quantity of publications in the determination of the scientific impact of journals.
There are important implications for these data. Firstly, it is essential that authors take into account not only the IF of journals when deciding where to send their paper but also the ES, as journals with high IF but low ES may have low readership and have little influence on their respective field, although in general papers that are highly accessed and viewed are cited more frequently than those that have limited access. Secondly, IF must be viewed in the context of other metrics such as ES and AIS, which take into account not only the quantity but also the quality of the citations. Thirdly, the rise of the journal IF over the past decade likely reflects the increase in the citation rate of papers published in these journals. However, it is possible that in some journals, the rise in their IF may in part reflect a reduction in the number of original articles published per year. The potential paradox is that by doing so these journals may be limiting their influence. Thus, as with individual researchers, journals should use IF in conjunction with other metrics such as ES in assessing the relevance and “impact” of their journals in their respective field.
There were limitations to this study. Firstly, ES was used as a surrogate for the “influence” of journals. However, this metric has never been fully validated for this outcome. In the same vein, IF has never been fully validated as a measure of “quality”, though it is widely used in this fashion. Secondly, there are other conventional metrics of journal quality such as immediacy index and citation half-life, as well as PageRank-based metrics such as the SCImago journal rank indicator, that were not considered in the present analysis. Thirdly, an important aspect of understanding the influence of journals is to determine the size and make-up of the readership, which was not done in the present study. Some, but not all, studies suggest that papers that are viewed more frequently receive higher citation rates than those that are accessed infrequently. Fourthly, we did not determine the reasons for the rise and fall of IF, ES and citable items in these journals. A previous study suggested that the temporal increases in IF for certain journals may reflect several factors including active recruitment of “high-impact” papers by journal editors, acceleration of the review and publication process, early on-line publication of accepted articles, media promotion of articles and journals, and the increase in the number of journals included in the ISI database. The reasons for the fall in the citable items for certain journals are also unclear. Some explanations include journals becoming more selective of the articles that they were accepting, and re-design of journals leading to fewer pages. Whatever the reason, by reducing the citable items, some journals may have (intentionally or unintentionally) increased their IF.
In summary, the present study indicates that IF and ES produce a similar rank order of medical journals; however, some important discordances occur. In general, journals that publish a lot of papers have higher ES values than would be expected for their IF. Conversely, journals that publish a small volume of papers have lower ES values than expected for their IF. Some journals have increased their IF and at the same time reduced the number of papers that they publish per year, which may have reduced their influence on the field. Medical journals should carefully balance the importance of IF and ES in their editorial decisions on the quality and quantity of articles published.
Conceived and designed the experiments: JR DDS. Performed the experiments: JR DDS. Analyzed the data: DDS. Contributed reagents/materials/analysis tools: JR DDS. Wrote the paper: JR DDS.
- 1. Garfield E (1996) How can impact factors be improved? BMJ 313: 411–413.
- 2. Horgan A (2002) BMJ's impact factor increases by 24%. BMJ 325: 8.
- 3. Parrillo JE (2005) Our Journal, Critical Care Medicine, in 2005: high impact factor, rapid manuscript review, growing submissions, and widespread distribution. Crit Care Med 33: 923–924.
- 4. Tobin MJ (2004) Thirty years of impact factor and the Journal. Am J Respir Crit Care Med 170: 351–352.
- 5. Wedzicha JA, Johnston SL, Mitchell DM (2005) Journal impact factors for 2004: another rise for Thorax. Thorax 60: 712.
- 6. Saha S, Saint S, Christakis DA (2003) Impact factor: a valid measure of journal quality? J Med Libr Assoc 91: 42–46.
- 7. Seglen PO (1997) Why the impact factor of journals should not be used for evaluating research. BMJ 314: 498–502.
- 8. Favaloro EJ (2008) Measuring the quality of journals and journal articles: the impact factor tells but a portion of the story. Semin Thromb Hemost 34: 7–25.
- 9. Weale AR, Bailey M, Lear PA (2004) The level of non-citation of articles within a journal as a measure of quality: a comparison to the impact factor. BMC Med Res Methodol 4: 14.
- 10. Callaham M, Wears RL, Weber E (2002) Journal prestige, publication bias, and other characteristics associated with citation of published studies in peer-reviewed journals. JAMA 287: 2847–2850.
- 11. Marashi SA (2005) On the identity of “citers”: are papers promptly recognized by other investigators? Med Hypotheses 65: 822.
- 12. Bergstrom CT, West JD, Wiseman MA (2008) The Eigenfactor metrics. J Neurosci 28: 11433–11434.
- 13. Bollen J, Rodriguez M, Van de Sompel H (2006) Journal status. Scientometrics 69: 669–687.
- 14. Dellavalle RP, Schilling LM, Rodriguez MA, Van de Sompel H, Bollen J (2007) Refining dermatology journal impact factors using PageRank. J Am Acad Dermatol 57: 116–119.
- 15. Pinski G, Narin F (1976) Citation influence for journal aggregates of scientific publications: Theory, with application to the literature of physics. Information Processing and Management 12: 297–312.
- 16. Davis PM (2008) Eigenfactor: does the principle of repeated improvement result in better estimates than raw citation counts? Journal of the American Society for Information Science and Technology 59: 2186–2188.
- 17. Fersht A (2009) The most influential journals: Impact Factor and Eigenfactor. Proc Natl Acad Sci U S A 106: 6883–6884.
- 18. Chen P, Xie H, Maslov S, Redner S (2007) Finding scientific gems with Google's PageRank algorithm. Journal of Informetrics 1: 8–15.
- 19. Hakkalamani S, Rawal A, Hennessy MS, Parkinson RW (2006) The impact factor of seven orthopaedic journals: factors influencing it. J Bone Joint Surg Br 88: 159–162.
- 20. Chew M, Villanueva EV, Van Der Weyden MB (2007) Life and times of the impact factor: retrospective analysis of trends for seven medical journals (1994-2005) and their Editors' views. J R Soc Med 100: 142–150.
- 21. Bollen J, Van de Sompel H, Hagberg A, Chute R (2009) A principal component analysis of 39 scientific impact measures. PLoS One 4: e6022.
- 22. Brody T, Harnad S, Carr L (2006) Earlier Web usage statistics as predictors of later citation impact. Journal of the American Society for Information Science and Technology 57: 1060–1072.
- 23. Eysenbach G (2006) Citation advantage of open access articles. PLoS Biol 4: e157.
- 24. Perneger TV (2004) Relation between online “hit counts” and subsequent citations: prospective study of research papers in the BMJ. BMJ 329: 546–547.
- 25. Falagas ME, Kouranos VD, Arencibia-Jorge R, Karageorgopoulos DE (2008) Comparison of SCImago journal rank indicator with journal impact factor. FASEB J 22: 2623–2628.
- 26. Davis PM, Lewenstein BV, Simon DH, Booth JG, Connolly MJ (2008) Open access publishing, article downloads, and citations: randomised controlled trial. BMJ 337: a568.