Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Linking of Primary Care Records to Census Data to Study the Association between Socioeconomic Status and Cancer Incidence in Southern Europe: A Nation-Wide Ecological Study

  • Maria Garcia-Gil ,

    Affiliations Research Unit, Family Medicine, Girona, Spain, and Jordi Gol Institute for Primary Care Research (IDIAP Jordi Gol), Catalunya, Spain, Translab Research Group, Department of Medical Sciences, School of Medicine, University of Girona, Catalunya, Spain

  • Josep-Maria Elorza,

    Affiliation Jordi Gol Institute for Primary Care Research (IDIAP Jordi Gol), Catalunya, Spain

  • Marta Banque,

    Affiliation Cancer Prevention Unit and Cancer Registry, Department of Epidemiology and Evaluation, Hospital del Mar, Barcelona, Catalunya, Spain

  • Marc Comas-Cufí,

    Affiliation Research Unit, Family Medicine, Girona, Spain, and Jordi Gol Institute for Primary Care Research (IDIAP Jordi Gol), Catalunya, Spain

  • Jordi Blanch,

    Affiliation Research Unit, Family Medicine, Girona, Spain, and Jordi Gol Institute for Primary Care Research (IDIAP Jordi Gol), Catalunya, Spain

  • Rafel Ramos,

    Affiliation Primary Care Services, Girona, Spain, and Catalan Institute of Health (ICS), Catalunya, Spain

  • Leonardo Méndez-Boo,

    Affiliation Primary Care Information System, Catalan Institute of Health (ICS), Catalunya, Spain

  • Eduardo Hermosilla,

    Affiliation Jordi Gol Institute for Primary Care Research (IDIAP Jordi Gol), Catalunya, Spain

  • Bonaventura Bolibar,

    Affiliation Jordi Gol Institute for Primary Care Research (IDIAP Jordi Gol), Catalunya, Spain

  • Daniel Prieto-Alhambra

    Affiliations Jordi Gol Institute for Primary Care Research (IDIAP Jordi Gol), Catalunya, Spain, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, United Kingdom

Linking of Primary Care Records to Census Data to Study the Association between Socioeconomic Status and Cancer Incidence in Southern Europe: A Nation-Wide Ecological Study

  • Maria Garcia-Gil, 
  • Josep-Maria Elorza, 
  • Marta Banque, 
  • Marc Comas-Cufí, 
  • Jordi Blanch, 
  • Rafel Ramos, 
  • Leonardo Méndez-Boo, 
  • Eduardo Hermosilla, 
  • Bonaventura Bolibar, 
  • Daniel Prieto-Alhambra



Area-based measures of economic deprivation are seldom applied to large medical records databases to establish population-scale associations between deprivation and disease.


To study the association between deprivation and incidence of common cancer types in a Southern European region.


Retrospective ecological study using the SIDIAP (Information System for the Development of Research in Primary Care) database of longitudinal electronic medical records for a representative population of Catalonia (Spain) and the MEDEA index based on urban socioeconomic indicators in the Spanish census. Study outcomes were incident cervical, breast, colorectal, prostate, and lung cancer in 2009–2012. The completeness of SIDIAP cancer recording was evaluated through linkage of a geographic data subset to a hospital cancer registry. Associations between MEDEA quintiles and cancer incidence was evaluated using zero-inflated Poisson regression adjusted for sex, age, smoking, alcoholism, obesity, hypertension, and diabetes.


SIDIAP sensitivity was 63% to 92% for the five cancers studied. There was direct association between deprivation and lung, colorectal, and cervical cancer: incidence rate ratios (IRR) 1.82 [1.64–2.01], IRR 1.60 [1.34–1.90], IRR 1.22 [1.07–1.38], respectively, comparing the most deprived to most affluent areas. In wealthy areas, prostate and breast cancers were more common: IRR 0.92 [0.80–1.00], IRR 0.91 [0.78–1.06]. Adjustment for confounders attenuated the association with lung cancer risk (fully adjusted IRR 1.16 [1.08–1.25]), reversed the direction of the association with colorectal cancer (IRR 0.90 [0.84–0.95]), and did not modify the associations with cervical (IRR 1.27 [1.11–1.45]), prostate (0.74 [0.69–0.80]), and breast (0.76 [0.71–0.81]) cancer.


Deprivation is associated differently with the occurrence of various cancer types. These results provide evidence that MEDEA is a useful, area-based deprivation index for analyses of the SIDIAP database. This information will be useful to improve screening programs, cancer prevention and management strategies, to reach patients more effectively, particularly in deprived urban areas.


Links between morbidity, all-cause mortality, cause-specific mortality, and economic deprivation are well documented [1][5]; this association also applies to cancer incidence [6][8] and related mortality, the latter being partially explained by differences in health care provision according to socioeconomic area [8][9]. Although the link between deprivation and health has been associated with health-related behaviors [10], area-based socioeconomic measurements have proved useful to monitor health inequalities [11] in both geographic-area [12] and individual level studies [13][14].

There is a scarcity of data and lack of experience in using area-based measures of deprivation together with large, routinely collected medical records databases to establish the association between deprivation and disease at a population scale. Furthermore, the population databases often lack socioeconomic data or the reliability of the available data is uncertain. Accordingly, the construction of deprivation measures that are readily available and representative of the whole population could overcome this gap in patient data [11], [14][15]. Information about socioeconomic status has fundamental implications for health service provision, resource allocation, and prevention efforts.

One way to measure the extent to which these measures would be useful to detect health inequalities from an analysis of electronic medical records is to link primary care records to census data for the area served. This allows us to ascertain the relationship between the existing deprivation indices and specific health conditions. The present study aimed to analyze the association between the MEDEA deprivation index, validated for Spain, and the population incidence of five cancers (lung, colorectal, breast, prostate, and cervical) in a Southern European region (Catalonia, Spain).


Ethics Statement

Approval for all observational research using SIDIAP data is obtained from a local ethics committee (Clinical Research Ethics Committee of the IDIAP Jordi Gol).


Retrospective ecological study.

Data source

The Information System for the Development of Research in Primary Care (SIDIAP) is a clinical database of anonymized patient records for nearly six million people (80% of the Catalan population and 10.2% of the total population of Spain) registered in 274 primary care practices throughout Catalonia, with a total of 3,414 general practitioners (GP) and is highly representative of both urban and rural areas [16][17]. The information contained in SIDIAP is collected by health professionals during routine visits over time, and then it provides a good source of longitudinal population-based data.

The information recorded includes demographic and lifestyle factors relevant to primary care settings (body mass index, smoking status, alcohol use, etc); clinical diagnoses, outcomes, and events (coded according to the International Classification of Diseases, 10th revision [ICD-10]); referrals to specialists and laboratory tests; and prescribed medications actually dispensed by community pharmacies. The quality of SIDIAP data has been previously documented, and the database has been widely used to study the epidemiology of a number of health outcomes [17][22].

Main exposure: socioeconomic status

The main exposure considered for this study was the ecological MEDEA index, calculated using five census-based socioeconomic indicators (percentages, by census tract): 1) unemployment rate, 2) manual workers, 3) temporary workers, 4) illiterate adults (or less than basic, mandated education), and 5) school drop-outs among the population less than 16 years old. The MEDEA index was calculated as the weighted sum of these indicators, using the weights that correspond to the calculated values in the original MEDEA Project. Further details on the construction of the MEDEA index were previously published elsewhere [23].

The SIDIAP data were linked to the 2001 census data after harmonization of the recorded patient address information with census residence data and allocation of each individual record to the correct census tract. SIDIAP records from rural areas were excluded, as MEDEA has only been validated for urban populations. Each of the census tracts included was assigned a MEDEA index score based on the census data for its population.

Study outcome and diagnostic validation

The study outcomes were incident cases of the five most common cancers: colorectal, breast, lung, prostate, and cervix in the period 2009–2012. Only females were included in the breast and cervical cancer analyses, and only men in the analysis of prostate cancer.

Cancers were ascertained using ICD-10 codes as recorded by primary care physicians: C50-D05 for breast, C53 for cervical, C61-D07.5 for prostate, C18-19-20-D01 for colorectal, C34-D02.2 for lung cancers. Sensitivity as a measure of the completeness of the set of codes used in SIDIAP was validated by linkage to a long-standing cancer registry at Barcelona’s Hospital del Mar [24] using a trusted third party. The 212,863 people living in the Hospital del Mar area of reference were identified and linked to the Cancer Registry for this purpose.

Confounders and potential explanatory variables

A set of covariates were defined a priori and multivariate regression models adjusted accordingly to explore either confounding or potential causal pathways. The prevalence of the following variables was characterized at baseline (1/1/2009) at the census tract level: age (5-year groups), female sex, obesity (body mass index of 30 kg/m2 or above), hypertension (ICD-10 codes I10–I15), and diabetes (ICD-10 codes E10–E14), and SIDIAP codes for current smoking (yes/no), and high-risk alcohol intake (yes/no). Smoking and high-risk intake are recorded in SIDIAP according to Catalan clinical practice guidelines [25], [26].

Statistical analysis

Variables were described as weighted means; weighted one-way ANOVA contrasts were performed to contrast statistical differences between MEDEA quintiles. Cumulative incidences and 95% confidence intervals (95%CI) for all studied cancers were also calculated. MEDEA scores were categorized by quintiles. The aggregated nature of the data resulted in a higher incidence of zeros than would expected if the data were Poisson distributed. Thus, the association between MEDEA quintile (the higher, the more deprived) and the incidence of the studied cancers in 2009–2012 was evaluated using age- and sex-adjusted, zero-inflated, Poisson regression modeling (IRR and 95%CI) [27].

We further adjusted for lifestyle factors (smoking, alcoholism, obesity) and finally fitted a multivariate model, adjusted for all these factors and further adjusted for common comorbidities (hypertension, diabetes) to study potential causal pathways. All the described analyses were repeated after stratification by sex (with the exception of prostate, cervical, and breast cancer). A clustered “sandwich” estimator was calculated to estimate the variance-covariance matrix, which allowed us to account for the correlation between individuals living in the same census tract. All analyses were performed with STATA v12 [28].


Baseline characteristics

The study population of 3,701,169 primary care patients living in 4,096 urban census tracts was analyzed for the 2009–2012 study period. Table 1 presents census-based socioeconomic characteristics of the study population that are used to calculate the MEDEA index. The proportion of manual workers, unemployment rates, temporary workers, people with limited education (or illiteracy) and of young people (<16 years of age) not in school ranged from 29.4% to 73.6%, 8.0% to 14.6%, 14.3% to 28.6%, 19.2% to 51.2%, and from 5.6% to 24.8%, respectively. Women accounted for 51% of the total population and overall mean age was 45.4 (18.3) years. Regarding sociodemographic and clinical characteristics (Table 2), the proportion of women and mean age decreased with deprivation but the most deprived areas had the highest prevalence of lifestyle risk factors (smoking, alcoholism and obesity) and comorbidities (hypertension and diabetes).

Table 1. Baseline socioeconomic characteristics of study population, by MEDEA quintile.

Table 2. Baseline socio demographic and clinical characteristics of study population, by socioeconomic status.

Outcome and data validation

Within the period 2009–2012, overall cumulative incidences (x 10,000 persons at risk) of lung, colorectal, prostate, breast, and cervical cancers were: 21.5 [CI 95%: 21.1–22.1], 32.9 [32.3–33.5], 57.8 [56.5–59.1], 68.3 [67.1–69.5], and 14.6 [14.1–15.2], respectively. For the subset of 212,863 people living in the Hospital del Mar area of reference, 2,284 cases of cancer were identified in both datasets (509 lung, 567 colorectal, 494 prostate, 679 breast, 35 cervical); 3,322 patients had a cancer diagnosis according to primary care records but were not included in the registry (339 lung, 1,418 colorectal, 729 prostate, 746 breast, 90 cervical); and 410 participants (144 lung, 79 colorectal, 111 prostate, 59 breast, 17 cervical) were recorded in the cancer registry but not in primary care records. The sensitivity of SIDIAP compared to the cancer registry (gold standard) was 84.7% [83.3–86.1] overall, 77.9% [74.6–80.9] for lung cancer, 87.7% [85.0–90.0] for colorectal cancer, 81.6% [78.3–84.5] for prostate cancer, 92.0% [89.8–93.7] for breast cancer, and 63.7% [53.7–78.4] for cervical cancer.

Crude associations

Comparing the most deprived to most affluent areas, crude zero-inflated Poisson regression models showed a direct association between deprivation and lung, colorectal, and cervical cancer: incidence rate ratios (IRR) 1.82 [1.64–2.01], IRR 1.60 [1.34–1.90], IRR 1.65 [1.07–2.54], respectively. In wealthy areas, prostate and breast cancers were more common: IRR 0.92 [0.80–1.00], IRR 0.91 [0.78–1.06]. These broad observations were refined by the adjusted analyses, as described below.

Age- and sex-adjusted associations

The observed associations between deprivation and the risk of lung and cervical cancers were partially attenuated after adjustment for age and sex IRR 1.28 [1.19–1.38] and for age only 1.23 [1.08–1.39], respectively (Tables 3 and 4). In colorectal cancers, the association was no longer significant after adjusting for differences in age and sex: IRR 1.03 [0.96–1.10] for fifth compared to first MEDEA quintiles (Table 3). The effect size of the associations between deprivation and risk of breast and prostate cancers was higher after adjusting for age: IRR 0.79 [0.74–0.84] and IRR 0.77 [0.72–0.83], respectively, for fifth compared to first MEDEA quintiles (Tables 4 and 5).

Table 3. Association between MEDEA quintile and incidence of lung and colorectal cancer.

Table 4. Association between MEDEA quintile and incidence of breast and cervical cancer.

Table 5. Association between MEDEA quintile and prostate cancer incidence.

Full adjustment

Full adjustment for lifestyle-related cancer risk factors (smoking, alcohol use, and obesity) and common comorbidities (hypertension and diabetes mellitus) further attenuated the effect size of the association between socioeconomic status and lung cancer risk (fully adjusted IRR 1.16 [1.08–1.25]), and even reversed the direction of the association between deprivation and colorectal cancer (fully adjusted IRR 0.90 [0.84–0.95]) (Table 3). By contrast, the observed associations between MEDEA deprivation scores and the remaining cancer risks were not modified after adjustment for these same factors: fully adjusted IRR 1.27 [1.11–1.45] for cervical (Table 4), 0.74 [0.69–0.80] for prostate (Table 5), and 0.76 [0.71–0.81] for breast cancer (Table 4).

Stratification by sex

Analyses of the relationship between socioeconomic status and lung and colorectal cancer after stratification by sex are summarized in Table 3. In summary, the effect of deprivation on risk of lung cancer was stronger in men (age-adjusted IRR 1.47 [1.35–1.59] for the most deprived compared to wealthiest areas), and the direction of the association was inverted for women (age-adjusted IRR 0.79 [0.66–0.93]). On the other hand, sex stratification did not alter the results for the association between MEDEA quintiles and colorectal cancer risk.


The present study showed that increasing levels of deprivation were associated with an increased incidence of lung cancer in men and of cervical cancer in the most deprived compared to the wealthiest population groups. Conversely, socioeconomic deprivation was associated with decreased incidence of prostate and breast cancers in our data.

Most of the observed association between deprivation and lung cancer is explained by differences in the age-sex composition of the study population and in the prevalence of common lifestyle-related cancer risk factors and comorbidities. Interestingly, adjustment for these same variables inverted the association between socioeconomic status and colorectal cancer: the risk of this type of cancer was 10% lower in the most deprived compared to the most wealthy areas in multivariate regression models. Adjustment for all these factors did not alter the observed associations between socioeconomic deprivation and the risks of breast, prostate, or cervical cancer.

SIDIAP provides good sensitivity for the study of the epidemiology of cancer, with highly sensitive diagnostic coding ranging from 63% (cervical) to 92% (breast cancer). Our results provide evidence that the MEDEA index is a useful area-based deprivation index for use in analysis of the SIDIAP database, although it has not been constructed for use in rural areas.

Moreover, the linkage of SIDIAP and census data proved essential to ascertain the excess risk of cancer related to social deprivation in primary care settings.

The cancer types selected for analysis have the highest incidence in Spain, and their association with risk factors closely linked to socioeconomic status (life-style factors, sexual behavior, access to private healthcare or screening uptake) is well known. Our results were consistent with previously published studies [29][30].

Consistent with our data, lung cancer risk increases in deprived areas [6], [30][32] and its incidence is highest in the most deprived men even after adjustment for smoking [33]. This excess risk might be explained by other risk factors such as exposure to occupational carcinogens or dietary factors [33], [34], not accounted for in our study design. Unexpectedly, we found an inverse association between risk of lung cancer and deprivation in women; although some studies have shown that women with greater economic deprivation have a decreased risk of dying from lung cancer [35], a number of previous studies reported that lung cancer risk increases in deprived areas regardless of sex [6], [30][31].

On the other hand, there is no clear association between deprivation and colorectal cancer [7], [33], [36][37], probably due to changes in socioeconomic patterns over recent decades. Our study found an inverse association between deprivation and prostate cancer incidence, in agreement with some other studies [38]; this can likely be explained not only by inequalities in screening (periodic prostate-specific antigen measurement or physical examination by a urologist) related to socioeconomic strata [30], [38] but also by inequalities in exposure to risk factors, such as diet [39].

Regarding cancers specific to women, cervical and breast cancer have opposite associations with socioeconomic status. Cervical cancer incidence has been consistently shown to be more common in deprived populations [6], [30], probably due to inadequate or insufficient screening as well as to possible differences in age of first sexual intercourse, number of sexual partners, or fertility rates in different social strata, leading to unequal exposure to human papilloma virus [30], [38], [40][41]. Conversely, women living in wealthy areas are more likely to be diagnosed with breast cancer [30], [38], [42]. This association could be explained by differences in breastfeeding habits, age of first pregnancy, and use of hormone replacement therapy [42].

Strengths and limitations

The access to internally validated, high-quality electronic medical records provided a large sample size, reflected real-life conditions, and warranted high external validity. Furthermore, SIDIAP data are highly representative of Catalonia, including both urban and rural areas [17]. Of the covariates analysed, cardiovascular risk factors in SIDIAP have been previously validated for research use [18]. In general, smoking and alcohol intake are under-recorded in SIDIAP, compared with the Catalan National Health Survey [43]. In our results, however, the gradient and the higher prevalence of these risk factors in the most deprived population are consistent with the Health Survey results and other analyses reported in the literature [43][44].

The Hospital del Mar Cancer Registry is not a population registry, although it is quite representative of the population covered. Therefore, we could only calculate the sensitivity of SIDIAP for the cancers studied. Nonetheless, the relationships observed between the MEDEA index and cancer incidence were robust and in accordance with the literature. The most recent MEDEA index available was constructed using 2001 census data; this was matched to 2009–2012 cancer incidence data, on the dual assumption that cancer has a long latency period and the socioeconomic character (i.e., deprivation status) of census tracts does not vary significantly within one census period [45]. As an ecological measure of deprivation, the MEDEA index is more likely to be prone to misclassification [46] than are individual measures. However, the resulting bias is towards the null hypothesis because the misclassification is probably not related to the outcome. In addition, economic heterogeneity within the MEDEA strata may result in a less apparent socioeconomic gradient in incidences [11]. Furthermore, it is known that the inequalities observed with ecological measures are the same as those obtained with individual-based measures [46][47].

MEDEA is also a multidimensional measure of deprivation, making it likely to be more strongly associated with differences in health outcomes than are individual measures of poverty [15]. Moreover, there is a lack of standardization of deprivation scores and outcomes may vary depending on which deprivation index is used; nonetheless, the magnitude and direction of most of the associations between health outcomes and deprivation are maintained [1]. In our study, the relationships observed between the MEDEA index and cancer incidence appear to be robust.

In conclusion, deprivation was associated differently with the occurrence of various cancer types. These results provide evidence that MEDEA has good predictive validity [48], [49] and can be applied as an area-based deprivation index for use with the SIDIAP database, in accordance with previous studies [1], [11], [15]. Furthermore, the MEDEA index can identify areas in need of interventions and of additional primary care resources. Part of the association observed is explained by common, modifiable, lifestyle-related cancer risk factors and comorbidities. Therefore, public health policies should focus on effective ways to reach patients with proven cancer prevention and management strategies such as smoking cessation, responsible use of alcohol, and healthy dietary behaviors, particularly in economically deprived urban areas.

Author Contributions

Conceived and designed the experiments: MGG DPA. Performed the experiments: MGG JME. Analyzed the data: MGG JME MB MC JB. Contributed reagents/materials/analysis tools: MGG JME MB MC JB. Contributed to the writing of the manuscript: MGG DPA. Interpretation of data: MGG DPA JME RR LMB EH BB. Revision: MGG DPA JME RR LMB EH BB MB. Final approval: MGG DPA JME RR LMB EH BB MB MC JB.


  1. 1. Carstairs V (1995) Deprivation indices: their interpretation and use in relation to health. J Epidemiol Community Health. 1995 Dec 49 Suppl 2S3–8.
  2. 2. Eachus J, Williams M, Chan P, Smith GD, Grainge M, et al. (1996) Deprivation and cause specific morbidity: evidence from the Somerset and Avon survey of health. BMJ Feb 312(7026): 287–92.
  3. 3. Barceló MA, Saez M, Saurina C (2009) Spatial variability in mortality inequalities, socioeconomic deprivation, and air pollution in small areas of the Barcelona Metropolitan Region, Spain. Sci Total Environ 15 407(21): 5501–23.
  4. 4. Lyratzopoulos G, Barbiere JM, Rachet B, Baum M, Thompson MR, et al. (2011) Changes over time in socioeconomic inequalities in breast and rectal cancer survival in England and Wales during a 32-year period (1973–2004): the potential role of health care. Ann Oncol 22(7): 1661–6.
  5. 5. Puigpinós-Riera R, Marí-Dell’Olmo M, Gotsens M, Borrell C, Serral G, et al. (2011) Cancer mortality inequalities in urban areas: a Bayesian small area analysis in Spanish cities. Int J Health Geogr 10: 6.
  6. 6. Shack L, Jordan C, Thomson CS, Mak V, Moller H (2008) Variation in incidence of breast, lung and cervical cancer and malignant melanoma of skin by socioeconomic group in England. BMC Cancer 8(1): 271.
  7. 7. Pollock AM, Vickers N (1997) Breast, lung and colorectal cancer incidence and survival in South Thames Region, 1987–1992: the effect of social deprivation. J Public Health Med 19(3): 288–94.
  8. 8. Faggiano F, Partanen T, Kogevinas M, Boffetta P (1997) Socioeconomic differences in cancer incidence and mortality. IARC Sci Publ (138): 65–176.
  9. 9. Lyratzopoulos G, Abel GA, Brown CH, Rous BA, Vernon SA, et al. (2013) Socio-demographic inequalities in stage of cancer diagnosis: evidence from patients with female breast, lung, colon, rectal, prostate, renal, bladder, melanoma, ovarian and endometrial cancer. Ann Oncol 24(3): 843–50.
  10. 10. Sánchez-Santos MT, Mesa-Frias M, Choi M, Nüesch E, Asunsolo-Del Barco A, et al. (2013) Area-Level Deprivation and Overall and Cause-Specific Mortality: 12 Years’ Observation on British Women and Systematic Review of Prospective Studies. PLoS One 8(9): e72656.
  11. 11. Krieger N (1992) Overcoming the absence of socioeconomic data in medical records: validation and application of a census-based methodology. Am J Public Health 82(5): 703–10.
  12. 12. Borrell C, Marí-Dell’olmo M, Serral G, Martínez-Beneito M, Gotsens M (2010) Inequalities in mortality in small areas of eleven Spanish cities (the multicenter MEDEA project). Health Place 16(4): 703–11.
  13. 13. Domínguez-Berjón F, Borrell C, Rodríguez-Sanz M, Pastor V (2006) The usefulness of area-based socioeconomic measures to monitor social inequalities in health in Southern Europe. Eur J Public Health 16(1): 54–61.
  14. 14. Vaucher P, Bischoff T, Diserens EA, Herzig L, Meystre-Agustoni G, et al. (2012) Detecting and measuring deprivation in primary care: development, reliability and validity of a self-reported questionnaire: the DiPCare-Q. BMJ Open 2(1): e000692.
  15. 15. Butler DC, Petterson S, Phillips RL, Bazemore AW (2013) Measures of Social Deprivation That Predict Health Care Access and Need within a Rational Area of Primary Care Service Delivery. Health Serv Res 48(2 Pt 1): 539–59.
  16. 16. Bolíbar B, Fina Avilés F, Morros R, Garcia-Gil M, Hermosilla E, et al. (2012) SIDIAP database: electronic clinical records in primary care as a source of information for epidemiologic research. Med Clin 138(14): 617–21.
  17. 17. Garcia-Gil M, Hermosilla E, Prieto-Alhambra D, Fina F, Rosell M, et al. (2011) Construction and validation of a scoring system for selection of high quality data in a Spanish population primary care database (SIDIAP). Inf Prim Care.19(3): 135–45.
  18. 18. Ramos R, Balló E, Marrugat J, Elosua R, Sala J, et al. (2012) Validity for use in research on vascular diseases of the SIDIAP (Information System for the Development of Research in Primary Care): the EMMA study. Rev Española Cardiol 65(1): 29–37.
  19. 19. Prieto-Alhambra D, Judge A, Javaid MK, Cooper C, Diez-Perez A, et al. (2014) Incidence and risk factors for clinically diagnosed knee, hip and hand osteoarthritis: influences of age, gender and osteoarthritis affecting other joints. Ann Rheum Di 73(9): 1659–64.
  20. 20. Vinagre I, Mata-Cases M, Hermosilla E, Morros R, Fina F, et al. (2012) Control of glycemia and cardiovascular risk factors in patients with type 2 diabetes in primary care in Catalonia (Spain). Diabetes Care 35(4): 774–9.
  21. 21. Simó R, Plana-Ripoll O, Puente D, Morros R, Mundet X, et al. (2013) Impact of glucose-lowering agents on the risk of cancer in type 2 diabetic patients. The Barcelona case-control study. PLoS One 8(11): e79968.
  22. 22. Violán C, Foguet-Boreu Q, Hermosilla-Pérez E, Valderas JM, Bolíbar B, et al. (2013) Comparison of the information provided by electronic health records data and a population health survey to estimate prevalence of selected health conditions and multimorbidity. BMC Public Health 13: 251.
  23. 23. Domínguez-Berjón MF, Borrell C, Cano-Serral G, Esnaola S, Nolasco A, et al. (2008) Constructing a deprivation index based on census data in large Spanish cities(the MEDEA project). Gac Sanit Jun 22(3): 179–87.
  24. 24. Prevenció i Registre del Càncer - Parc de Salut Mar. Available from:
  25. 25. Guies de Pràctica Clínica. Available from:
  26. 26. Guies de Pràctica Clínica. Available from:
  27. 27. Lambert D (2008) Zero-inflated Poisson Regression with an Application to Defects in Manufacturing. Technometrics 34(1).
  28. 28. StataCorp (2011) Stata Statistical Software: Release 12. College Station, TX: StataCorp LP.
  29. 29. Clèries R, Esteban L, Borràs J, Marcos-Gragera R, Freitas A, et al.. (2013) Time trends of cancer incidence and mortality in Catalonia during 1993–2007. Clin Transl Oncol. Available from:
  30. 30. Krieger N, Quesenberry C, Peng T, Horn-Ross P, Stewart S, et al. (1999) Social class, race/ethnicity, and incidence of breast, cervix, colon, lung, and prostate cancer among Asian, Black, Hispanic, and White residents of the San Francisco Bay Area, 1988–92 (United States). Cancer Causes Control 10(6): 525–37.
  31. 31. Sidorchuk A, Agardh EE, Aremu O, Hallqvist J, Allebeck P, et al. (2009) Socioeconomic differences in lung cancer incidence: a systematic review and meta-analysis. Cancer Causes Control 20(4): 459–71.
  32. 32. Bryere J, Dejardin O, Bouvier V, Colonna M, Guizard AV, et al. (2014) Socioeconomic environment and cancer incidence: a French population-based study in Normandy. BMC Cancer 14(1): 87.
  33. 33. Kuznetsov L, Maier W, Hunger M, Meyer M, Mielck A (2012) Regional deprivation in Bavaria, Germany: linking a new deprivation score with registry data for lung and colorectal cancer. Int J Public Health 57(5): 827–35.
  34. 34. Boffetta P, Kogevinas M, Westerholm P, Saracci R (1997) Exposure to occupational carcinogens and social class differences in cancer occurrence. IARC Sci Publ (138): 331–41.
  35. 35. Salcedo N, Saez M, Bragulat B, Saurina C (2012) Does the effect of gender modify the relationship between deprivation and mortality? BMC Public Health 12: 574.
  36. 36. Doubeni CA, Major JM, Laiyemo AO, Schootman M, Zauber AG, et al. (2012) Contribution of behavioral risk factors and obesity to socioeconomic differences in colorectal cancer incidence. J Natl Cancer 104(18): 1353–62.
  37. 37. Doubeni CA, Laiyemo AO, Major JM, Schootman M, Lian M, et al. (2012) Socioeconomic status and the risk of colorectal cancer: an analysis of more than a half million adults in the National Institutes of Health-AARP Diet and Health Study. Cancer 118(14): 3636–44.
  38. 38. Yin D, Morris C, Allen M, Cress R, Bates J, et al. (2010) Does socioeconomic disparity in cancer incidence vary across racial/ethnic groups? Cancer Causes Control 21(10): 1721–30.
  39. 39. Shafique K, Oliphant R, Morrison DS (2012) The impact of socio-economic circumstances on overall and grade-specific prostate cancer incidence: a population-based study. Br J Cancer 107(3): 575–82.
  40. 40. Santelli JS, Lowry R, Brener ND, Robin L (2000) The association of sexual behaviors with socioeconomic status, family structure, and race/ethnicity among US adolescents. Am J Public Health 90(10): 1582–8.
  41. 41. Damiani G, Federico B, Basso D, Ronconi A, Bianchi CBNA, et al. (2012) Socioeconomic disparities in the uptake of breast and cervical cancer screening in Italy: a cross sectional study. BMC Public Health 12: 99.
  42. 42. Brown SBF, Hole DJ, Cooke TG (2007) Breast cancer incidence trends in deprived and affluent Scottish women. Breast Cancer Res Treat 103(2): 233–8.
  43. 43. Enquesta de Salut de Catalunya. Available from:
  44. 44. Probst C, Roerecke M, Behrendt S, Rehm J (2014) Socioeconomic differences in alcohol-attributable mortality compared with all-cause mortality: a systematic review and meta-analysis. Int J Epidemiol 43(4): 1314–27.
  45. 45. Exeter D, Boyle P, Norman P (2011) Deprivation (im)mobility and cause-specific premature mortality in Scotland. Soc Sci Med 72: 389–97.
  46. 46. Schrijvers CT, Mackenbach JP (1994) Cancer patient survival by socioeconomic status in seven countries: a review for six common cancer sites [corrected]. J Epidemiol Community Health 48(5): 441–6.
  47. 47. Domínguez-Berjón MF, Borrell C, López R, Pastor V (2005) Mortality and socioeconomic deprivation in census tracts of an urban setting in southern Europe. J Urban Health 82(2): 225–36.
  48. 48. Pampalon R, Hamel D, Gamache P, Simpson P, Philibert MD (2014) Validation of deprivation index for public health: a complex exercise illustrated by the Quebec index. Chronic Diseases and Injuires in Canada 34: 12–22.
  49. 49. Gordon D (1995) Census deprivation indices: their weighting and validation. J Epidemiol Community Health (Suppl 2): S39–S44.