This article has Open Peer Review reports available.
Systematic identification of DNA variants associated with ultraviolet radiation using a novel Geographic-Wide Association Study (GeoWAS)
- Irving Hsu†1, 2, 3,
- Rong Chen†1, 2, 5,
- Aditya Ramesh1,
- Erik Corona1, 2, 4,
- Hyunseok Peter Kang1, 2, 4,
- David Ruau1, 2 and
- Atul J Butte1, 2Email author
© Hsu et al.; licensee BioMed Central Ltd. 2013
Received: 3 July 2012
Accepted: 4 June 2013
Published: 20 June 2013
Long-term environmental variables are widely understood to play important roles in DNA variation. Previously, clinical studies examining the impacts of these variables on the human genome were localized to a single country, and used preselected DNA variants. Furthermore, clinical studies or surveys are either not available or difficult to carry out for developing countries. A systematic approach utilizing bioinformatics to identify associations among environmental variables, genetic variation, and diseases across various geographical locations is needed but has been lacking.
Using a novel Geographic-Wide Association Study (GeoWAS) methodology, we identified Single Nucleotide Polymorphisms (SNPs) in the Human Genome Diversity Project (HGDP) with population allele frequencies associated with geographical ultraviolet radiation exposure, and then assessed the diseases known to be assigned with these SNPs.
2,857 radiation SNPs were identified from over 650,000 SNPs in 52 indigenous populations across the world. Using a quantitative disease-SNP database curated from 5,065 human genetic papers, we identified disease associations with those radiation SNPs. The correlation of the rs16891982 SNP in the SLC45A2 gene with melanoma was used as a case study for analysis of disease risk, and the results were consistent with the incidence and mortality rates of melanoma in published scientific literature. Finally, by analyzing the ontology of genes in which the radiation SNPs were significantly enriched, potential associations between SNPs and neurological disorders such as Alzheimer’s disease were hypothesized.
A systematic approach using GeoWAS has enabled us to identify DNA variation associated with ultraviolet radiation and their connections to diseases such as skin cancers. Our analyses have led to a better understating at the genetic level of why certain diseases are more predominant in specific geographical locations, due to the interactions between environmental variables such as ultraviolet radiation and the population types in those regions. The hypotheses proposed in GeoWAS can lead to future testing and interdisciplinary research.
Over the past decade, high-throughput technologies and advancements in computational biology have made it possible to determine genotypes and complete human DNA sequences at a continued reduction of cost, and these technologies have been used to determine numerous variants associated with various diseases and traits. Why and when variants leading to disease susceptibility entered the human genome remains unclear, but could be elucidated through the study of genetics in indigenous populations, more closely representing original human subpopulations across the world . How these disease variants might be -- or have been -- associated with environmental conditions might yield insights into why those disease variants are present in the genome.
To study early human migrations, in 2005, Li et al. analyzed 1064 individuals from indigenous populations at over 650,000 Single Nucleotide Polymorphisms (SNPs) in the Human Genome Diversity Project (HGDP) [2, 3]. The HGDP collection is currently the most comprehensive human DNA collection representing the world’s population distribution available to not-for-profit researchers. Unlike genome-wide association studies (GWAS) , which were performed mostly on populations of European ancestry, human genome databases from the HGDP have enabled biomedical researchers to study how the genetic risks of different diseases vary over a global range of ethnic populations . The availability of individual level genotypes across various countries and populations has led to an increased interest in understanding how human diseases are associated with environmental variables, since these variables are usually geographically dependent. Moreover, many diseases are understood to result from the complex interactions between both genetic and environmental factors.
One important environmental factor is ultraviolet (UV) radiation, which has been linked to a wide range of human diseases. Interactions between skin pigmentation genes and UV radiation have been previously studied, but only on a country-wide scale [5, 6]. Using SNPs pre-selected from genes of interest, these works investigated the role of pigmentation in melanoma predisposition within single countries, Spain and Australia, respectively. SNPs associated with UV radiation have not been identified across the world by using a systematic approach, but such a world-wide study carries obvious difficulties .
In another recent study , evidence was found for human adaptations to climate at the genome-wide level. This was accomplished by identifying the SNPs with allele frequencies that were the most strongly correlated with several different climate variables across the world. Among the phenotypes that were found to be associated with these SNPs, however, the number of diseases identified was limited. Only one disease – Systemic lupus erythematosus (SLE) – was reported to be significantly correlated with solar radiation.
In this paper, we investigate UV radiation and its effects at the genome-wide level across a wider range of human diseases in different countries. Human genomes from 52 native populations in the HGDP panel were mapped to 193 different countries. We then designed a bioinformatics-based Geographic-Wide Association Study (GeoWAS) to systematically identify DNA variants that were the most strongly correlated with UV exposure levels across the world, which we called radiation SNPs. To identify diseases that are strongly associated with radiation SNPs, we queried these SNPs against a curated disease-SNP database called VARIMED (Variants Informing Medicine) [8, 9]. With this developed methodology, we were able to investigate the effects and implications of UV radiation on human phenotypes and diseases at both the genomic and geographic levels. Lastly, the significance of GeoWAS and its promises are discussed from both biomedical and methodological perspectives.
We analyzed 650,000 DNA variants in the form of single nucleotide polymorphisms (SNPs) for individuals from each of the 52 native populations. Each SNP has a corresponding ancestral allele and a derived allele produced by mutation events over time. The derived allele may become more common in human loci due to random genetic drift or selective pressures from the environment. For a given locus, the allele frequency represents the fraction of the chromosomes that carry the specified allele. For every SNP, the ancestral allele frequency (AAF) in each country was correlated with the radiation levels measured for those countries. Both Pearson’s and Spearman’s correlation coefficients and p-values were calculated to assess the relative strength of the associations. Each SNP was then annotated with its corresponding chromosome number, function type, gene symbol, and NCBI Gene ID .
These initial correlations served as a means of filtering the SNPs that we targeted for further analysis. We identified SNPs that had ancestral allele frequencies significantly associated with UV radiation exposure by focusing only on the SNPs with a Pearson’s absolute rho of greater than 0.8 (r > 0.8), a strict cutoff value determined from our preliminary permutation studies. These SNPs were classified as radiation SNPs.
Validation using SNPs in Vitamin D synthesis pathway genes
To ascertain the effectiveness of our proposed methodology, we first examined biological processes known to be linked to radiation, and analyzed the degree to which these processes are supported by our approach. It is widely understood that variations in UV exposure influence the synthesis and metabolism of Vitamin D . Six different genes (CYP24A1, CYP27A1, CYP27B1, CYP2R1, DHCR7, GC), taken from Reactome , were identified to be involved in Vitamin D synthesis pathways. The SNPs within these genes were compared with background SNPs from other genes in our database, and the degree of association was quantified.
Analysis of diseases associated with radiation SNPs
After running our initial correlations, the radiation SNPs were queried against the VARIMED database to identify diseases and other phenotypes that the SNPs may be linked to. VARIMED is a quantitative human disease-SNP association database, curating from approximately 5500 different publications, as previously described .
Populations under positive selection for radiation SNPs
The Integrated Haplotype Score (iHS) is a statistic for identifying recent positive selection at a particular locus, and is dependent upon differences in linkage disequilibrium (LD) around a positively selected allele compared to the background allele [14, 15, 17]. In all 52 HGDP populations, each of the 650,000 SNPs has a specific iHS score representing the degree of positive selection for it. We developed an algorithm that targets the populations most prone to selection for radiation SNPs, for further analysis. For each population, we mapped all of the available SNPs to their corresponding iHS scores, and calculated a selection metric for the set of radiation SNPs by taking the mean of the iHS scores. To obtain a relative measure of the significance of selection for the radiation SNPs, we computed empirical p-values by comparing the metric for each population against a null distribution of such metrics. In each population, the null distribution was generated by using arbitrary SNPs that were randomly selected among all SNPs for which data are available. For example, if UV radiation has n number of associated SNPs, then n samples were taken from the entire list of SNPs for a population. In this case, our null hypothesis for each population would be that it has not experienced selection for radiation SNPs. Those populations in which the empirical p-value is below a scalar alpha (α < 0.01) are considered to be under significant positive selection for the set of SNPs currently in question. We chose to implement this procedure using the HGDP data set, since we may view selection trends across a gamut of populations.
Gene ontology terms of radiation SNPs
From the complete pool of 650,000 SNPs, those SNPs identified to be associated with UV radiation (radiation SNPs) were joined with their corresponding NCBI Entrez Gene IDs to create a gene list. From the list, we identified the genes in which the radiation SNPs were significantly enriched by joining the list with a table containing the Gene Ontology (GO) terms for these genes. We focused on the GO terms that had a p-value of under 0.05 after performing a Bonferroni correction. These genes and their potential connections to diseases were surveyed in published literature. Based on our findings, we formulated new hypotheses on how DNA variation in the genes of interest may be potentially related to UV radiation and diseases. These proposed mechanisms of disease formation can lead to future research and follow-up testing.
For each SNP measured in the Human Genome Diversity Project (HGDP), the ancestral allele frequency (AAF) in each of 193 countries was correlated with the radiation levels measured for those countries. After running these initial correlations, 2,857 of the 650,000 SNPs were identified to have AAFs significantly associated with UV radiation exposure, using a Pearson’s absolute rho of 0.8 as a strict cutoff value. These SNPs were designated as radiation SNPs, and each was annotated with its corresponding chromosome number, function type, gene symbol, and Entrez database Gene ID.
SNPs in Vitamin D synthesis pathway genes
To validate the efficacy of our methodology, we first evaluated genes involved in Vitamin D synthesis, known to be influenced by UV exposure. We curated six genes known to be involved in Vitamin D synthesis pathways, and conducted a Mann–Whitney U test to compare the Pearson p-values of the SNPs within these genes against those of background SNPs in other genes. In total, 155 different SNPs were contained in these six genes. We found that SNPs in genes for Vitamin D biosynthesis had a significant p-value in association with UV radiation (p = 3.0 × 10-4) than the set of background SNPs in other genes, indicating that the SNPs found using our methodology are relevant to pathways known to be associated with radiation.
Radiation SNPs are significantly enriched for association with human diseases
Different phenotypes and diseases associated with radiation SNPs, identified through VARIMED
SNP ID (rs)
1.10 × 10-7
5.80 × 10-12
7.44 × 10-11
1.10 × 10-7
HDL cholesterol levels
2.13 × 10-7
Transferrin receptor levels
6.80 × 10-14
1.00 × 10-11
8.51 × 10-10
Basal cell carcinoma*
1.60 × 10-12
1.48 × 10-12
3.89 × 10-11
2.00 × 10-8
8.30 × 10-15
1.70 × 10-9
5.02 × 10-8
Squamous cell carcinoma*
1.00 × 10-7
1.10 × 10-8
3.98 × 10-10
3.10 × 10-12
2.90 × 10-9
Amyloid beta-protein levels
1.90 × 10-12
6.20 × 10-14
HDL cholesterol levels
3.90 × 10-7
3.10 × 10-15
2.20 × 10-7
1.90 × 10-8
Bone mineral density
9.40 × 10-9
Of the 19 distinct phenotypes, 8 were associated with the rs16891982 SNP located in the SLC45A2 gene. As shown in Table 1, this SNP was strongly associated with pigmentation and with various forms of skin cancer, including melanoma, basal cell carcinoma, and squamous cell carcinoma. The rs16891982 SNP is associated with each of these cancer types with the risk allele being G, while the ancestral allele is G, and derived allele is C. The correspondence between the risk allele and the ancestral allele has important implications on the skin cancers we identified. This indicates that native populations with higher ancestral allele frequencies are more susceptible to the skin cancers. Among these cancers, melanoma has been found to be significantly associated with rs16891982 through separate clinical studies previously conducted in Spain and Australia [5, 6]. Although this is consistent with the results we obtained, such studies are confined to a single area and thus cannot be directly applied to make interregional comparisons.
To an extent, worldwide variations in ancestral allele frequency distributions for the rs16891982 SNP may be explained by the disparity in UV radiation levels across different countries, as previously illustrated in Figure 2. Geographic regions with higher UV exposure, such as Africa and South America, have countries with a lower proportion of the ancestral allele. The derived allele C is more predominant in these regions. Evolutionary adaptations to UV radiation may account for the lower AAFs in these parts of the world, and thus lead to a lower susceptibility to cancers such as melanoma. The European populations, on the other hand, receive significantly lesser radiation and have higher AAFs. The negative correlation between AAF and UV radiation for this SNP was shown earlier in Figure 3.
Native populations under strong positive selection for radiation SNPs
Average iHS score
2.1 × 10-6
1.5 × 10-6
1.4 × 10-5
3.7 × 10-4
6.5 × 10-4
1.9 × 10-2
3.9 × 10-2
4.6 × 10-2
Gene ontology terms of radiation SNPs
To gain further insight into the relationship of UV radiation SNPs with human genetic functions, we have associated the 2,857 radiation SNPs with a derived gene function database from the Gene Ontology Consortium. We identified the genes in which the radiation SNPs were significantly enriched, by joining the gene list described in the methodology section with a table containing the ontology terms for these genes.
Ontology of genes in which the radiation SNPs were significantly enriched
Gene ontology term
Glutamate receptor activity
1.7 × 10-5
Calcium ion binding
2.3 × 10-5
Overall, the radiation SNPs were robustly enriched in calcium ion binding genes throughout all analyses using Pearson’s rho cutoffs ranging from 0.70 to 0.85. This is consistent with published studies indicating that UV irradiation of lymphocytes induced calcium flux and tyrosine phosphorylation . It was previously shown that rapid calcium responses were directly induced in both dose- and wavelength-dependent manners. Genes involved in the activity of glutamate receptors were only enriched when a Pearson’s rho cutoff of 0.8 was used.
Discussion and conclusion
In this paper, we presented a systematic approach for both identifying DNA variants associated with UV radiation and for studying their connections to diseases such as skin cancers, via a novel geographic-wide association study. Our analyses have led to a better understating of why certain diseases are more predominant in particular geographical locations, due to the interactions between environmental variables such as UV radiation and the population genetics in those regions.
Taken collectively, our results have both biomedical and methodological significance. From a biomedical context, we were able to identify SNPs that had AAFs strongly correlated with UV exposure levels across the world. From the pool of radiation SNPs, we identified candidate SNPs that were enriched for association with different diseases and phenotypes, most of which had a direct link with UV radiation. We found an enrichment in genes associated with calcium ion binding and glutamate receptor activities. One potential explanation for these findings is that UV radiation causes an excess of glutamates to accumulate in the extracellular space, resulting in an influx of calcium ions into the cytosol and thus increasing the risk of brain disorders. The process in which high levels of calcium ions enter cells through glutamate receptors, ultimately causing apoptosis, is called excitotoxicity . Excitotoxicity is known to be associated with neurodegenerative diseases such as multiple sclerosis, Alzheimer’s disease, and Parkinson’s disease.
As a case study, we chose the rs16891982 SNP to analyze its association with melanoma, using its allele frequency distributions in the HGDP populations to explain why certain groups of people in the world are more susceptible to melanoma, and how natural selective pressures from environmental factors such as UV radiation may have played an important role. The findings were consistent with published data for the worldwide incidence and death rates for melanoma. We also devised an algorithm to calculate the degree of positive selection for radiation SNPs in native populations worldwide. From the same group of SNPs, we determined the ontology of genes in which the SNPs were significantly enriched, surveyed known relationships in literature, and can propose hypotheses to explain our observed findings at the genetic level.
From a methodological standpoint, we have developed a powerful approach that extends the current GWAS model (Genome-Wide Association Studies) to incorporate countries into the analysis. The algorithm designed in this study integrates worldwide SNP and UV data to simultaneously compare the risks of different diseases across the world. Using again the case study of the radiation SNP rs16891982 as an example, we were able to expand earlier approaches that only accounted for single countries [5, 6] by systematically identifying SNP-disease associations for different HGDP populations on a global scale. Unlike former studies that used pre-selected SNPs, the GeoWAS algorithm identifies all possible SNPs in a more efficient and objective manner.
The GeoWAS methodology presented in this paper can also be applied to other long-term environmental variables such as temperature, precipitation, and humidity. The algorithm can be similarly used to carry out analyses of the genomes of other species. Furthermore, the UV radiation SNPs identified in this work may potentially be used in aiding skin cancer diagnostics, through in vitro methods based on the expression of the SLC45A2 gene .
With our developed methodology, we were able to investigate the effects and implications of UV radiation on human phenotypes and diseases at both the genomic and geographic levels. The significance of GeoWAS and its promises are discussed from both biomedical and methodological perspectives. Ultimately, the GeoWAS approach can benefit epidemiologists, and physicians, and policy-makers in regions of the world where conventional clinical studies are either costly or unavailable. Future work will involve refining the current approach by performing the country-population mappings based on updated HGDP coordinate systems.
The authors would like to acknowledge Alex Skrenchuk and Gordon V Sinclair from Stanford University for computer support. I.H. was funded by the Stanford Institutes of Medicine Summer Research Program (SIMR). R.C., E.C., P.K., D.R., A.J.B. were funded by the Lucile Packard Foundation for Children’s Health, the Hewlett Packard Foundation, the National Institute of General Medical Sciences (R01 GM079719), the National Library of Medicine (R01 LM009719), and the Howard Hughes Medical Institute. E.C. was funded by a National Science Foundation Graduate Research Fellowship and the Armin and Linda Miller Fellowship Fund.
- Chen R, Corona E, Sikora M, Dudley JT, Morgan AA, Moreno-Estrada A, Nilsen GB, Ruau D, Lincoln SE, Bustamante CD, Butte AJ: Type 2 diabetes risk alleles demonstrate extreme directional differentiation among human populations, compared to other diseases. PLoS Genet. 2012, 8: e1002621-10.1371/journal.pgen.1002621.View ArticlePubMedPubMed CentralGoogle Scholar
- Cavalli-Sforza LL: The human genome diversity project: past, present and future. Nat Rev Genet. 2005, 4: 333-340.Google Scholar
- Li JZ, Absher DM, Tang H, Southwick AM, Casto AM, Ramachandran S, Cann HM, Barsh GS, Feldman M, Cavalli-Sforza LL, Myers RM: Worldwide human relationship inferred from genome-wide patterns of variation. Science. 2005, 319: 1100-1104.View ArticleGoogle Scholar
- Hindorff LA, MacArthur J, Wise A, Junkins HA, Hall PN, Klemm AK, Manolio TA: A Catalog of Published Genome-Wide Association Studies. 2009Google Scholar
- Ibarrola-Villava M, Fernandez LP, Alonso S, Boyano MD, Peña-Chilet M, Pita G, Aviles JA, Mayor M, Gomez-Fernandez C, Casado B, Martin-Gonzalez M, Izagirre N, De la Rua C, Asumendi A, Perez-Yarza G, Arroyo-Berdugo Y, Boldo E, Lozoya R, Torrijos-Aguilar A, Pitarch A, Pitarch G, Sanchez-Motilla JM, Valcuende-Cavero F, Tomas-Cabedo G, Perez-Pastor G, Diaz-Perez JL, Gardeazabal J, De Lizarduy M, Sanchez-Diez A, Valdes C: A Customized Pigmentation SNP Array Identifies a Novel SNP Associated with Melanoma Predisposition in the SLC45A2 Gene. PLoS One. 2011, 6 (4): e19271-10.1371/journal.pone.0019271.View ArticlePubMedPubMed CentralGoogle Scholar
- Duffy DL, Zhao ZZ, Sturm RA, Hayward NK, Martin NG, Montgomery GW: Multiple pigmentation gene polymorphisms account for a substantial proportion of risk of cutaneous malignant melanoma. J Invest Dermatol. 2010, 130: 520-528. 10.1038/jid.2009.258.View ArticlePubMedGoogle Scholar
- Hancock AM, Witonsky DB, Alkorta-Aranburu G, Beall CM, Gebremedhin A, Sukernik R, Utermann G, Pritchard JK, Coop G, Di Rienzo A: Adaptation to climate-mediated selective pressures in humans. PLoS Genet. 2011, 7 (4): e1001375-10.1371/journal.pgen.1001375.View ArticlePubMedPubMed CentralGoogle Scholar
- Chen R, Davydov EV, Sirota M, Butte AJ: Non-synonymous and synonymous coding SNPs show similar likelihood and effect size of human disease association. PLoS One. 2010, 5: e13574-10.1371/journal.pone.0013574.View ArticlePubMedPubMed CentralGoogle Scholar
- Ashley EA, Butte AJ, Wheeler MT, Chen R, Klein TE, Dewey FE, Dudley JT, Ormond KE, Pavlovic A, Morgan AA, Pushkarev D, Neff NF, Hudgins L, Gong L, Hodges LM, Berlin DS, Thorn CF, Sangkuhl K, Hebert JM, Woon M, Sagreiya H, Whaley R, Knowles JW, Chou MF, Thakuria JV, Rosenbaum AM, Zaranek AW, Church GM, Greely HT, Quake SR, Altman RB: Clinical assessment incorporating a personal genome. Lancet. 2010, 375: 1525-1535. 10.1016/S0140-6736(10)60452-7.View ArticlePubMedPubMed CentralGoogle Scholar
- WHO: Global Health Observatory Data Repository. Geneva, Switzerland: World Health Organization, [http://apps.who.int/ghodata/]
- Maglott D, Ostell J, Pruitt KD, Tatusova T: Entrez Gene: gene-centered information at NCBI. Nucl Acids Res. 2007, 35 (suppl 1): D26-D31.View ArticlePubMedGoogle Scholar
- Loomis WF: Skin-pigment regulation of Vitamin-D biosynthesis in man. Science. 1967, 157: 501-506. 10.1126/science.157.3788.501.View ArticlePubMedGoogle Scholar
- Reactome: a knowledge base of biological pathways and processes. Cold Spring Harbor, New York, USA: Cold Spring Harbor Laboratory, http://www.reactome.org/ReactomeGWT/entrypoint.html,
- Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL, Gibbs RA, Belmont JW, Boudreau A, Hardenbol P, Leal SM, Pasternak S, Wheeler DA, Willis TD, Yu F, Yang H, Zeng C, Gao Y, Hu H, Hu W, Li C, Lin W, Liu S, Pan H, Tang X, Wang J, Wang W, Yu J, Zhang B, Zhang Q, Zhao H: A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007, 449: 851-861. 10.1038/nature06258.View ArticlePubMedGoogle Scholar
- Voight BF, Kudaravalli S, Wen X, Pritchard JK: A map of recent positive selection in the human genome. PLoS Biol. 2006, 4 (3): e72-10.1371/journal.pbio.0040072.View ArticlePubMedPubMed CentralGoogle Scholar
- HGDP Population Ancestral Allele Frequency Distribution for rs16891982 SNP. Chicago, Illinois USA: Joseph Pickrell and University of Chicago, [http://hgdp.uchicago.edu/tmp1/Alfreqs/rs16891982.frqs.pdf]
- Pickrell JK, Coop G, Novembre J, Kudaravalli S, Li JZ, Absher D, Srinivasan BS, Barsh GS, Myers RM, Feldman MW, Pritchard JK: Signals of recent positive selection in a worldwide sample of human populations. Genome Res. 2009, 19 (5): 826-837. 10.1101/gr.087577.108.View ArticlePubMedPubMed CentralGoogle Scholar
- Melanoma of the skin, Incident and Death Rates by Race/Ethnicity and Sex, U.S. Atlanta, Georgia USA: Centers for Disease Control and Prevention, 1999-2007. http://www.cdc.gov/cancer/skin/statistics/race.htm,
- World Health Organization: IARC Cancerbase 2001, No. 5, Version 1.0. GLOBOCAN: Cancer Incidence, Mortality and Prevalence Worldwide. 2001, Lyon: IARCGoogle Scholar
- Crombie IK: Racial differences in melanoma incidence. Br J Cancer. 1979, 40: 185-193. 10.1038/bjc.1979.165.View ArticlePubMedPubMed CentralGoogle Scholar
- Schieven GL, Kirihara JM, Gilliland LK, Uckun FM, Ledbetter JA: Ultraviolet radiation rapidly induces tyrosine phosphorylation and calcium signaling in lymphocytes. Mol Biol Cell. 1993, 4: 523-530.View ArticlePubMedPubMed CentralGoogle Scholar
- Hynd MR, Scott HL, Dodd PR: Glutamate-mediated excitotoxicity and neurodegeneration in Alzheimer’s disease. Neurochem Int. 2004, 45 (5): 583-595. 10.1016/j.neuint.2004.03.007.View ArticlePubMedGoogle Scholar
- Soufir N: Vitro Method For Diagnosing Skin Cancer. 2011, Hopitaux De Paris, Paris, France: US Patent application number: 20110086805Google Scholar
- The pre-publication history for this paper can be accessed here:http://0-www.biomedcentral.com.brum.beds.ac.uk/1471-2350/14/62/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.