- Open Access
Parkinson's disease candidate gene prioritization based on expression profile of midbrain dopaminergic neurons
Journal of Biomedical Science volume 17, Article number: 66 (2010)
Parkinson's disease is the second most common neurodegenerative disorder. The pathological hallmark of the disease is degeneration of midbrain dopaminergic neurons. Genetic association studies have linked 13 human chromosomal loci to Parkinson's disease. Identification of gene(s), as part of the etiology of Parkinson's disease, within the large number of genes residing in these loci can be achieved through several approaches, including screening methods, and considering appropriate criteria. Since several of the indentified Parkinson's disease genes are expressed in substantia nigra pars compact of the midbrain, expression within the neurons of this area could be a suitable criterion to limit the number of candidates and identify PD genes.
In this work we have used the combination of findings from six rodent transcriptome analysis studies on the gene expression profile of midbrain dopaminergic neurons and the PARK loci in OMIM (Online Mendelian Inheritance in Man) database, to identify new candidate genes for Parkinson's disease.
Merging the two datasets, we identified 20 genes within PARK loci, 7 of which are located in an orphan Parkinson's disease locus and one, which had been identified as a disease gene. In addition to identifying a set of candidates for further genetic association studies, these results show that the criteria of expression in midbrain dopaminergic neurons may be used to narrow down the number of genes in PARK loci for such studies.
Selective degeneration of mesencephalic dopaminergic (mesDA) neurons of substantia nigra pars compacta (SNpc) is the pathological hallmark of Parkinson's disease (PD; OMIM #168600). Although the molecular mechanism behind demise of these neurons during the course of PD is still unknown, numerous studies have shown contribution of both genetic and environmental and factors, such as neurotoxins, to degeneration of this inherently vulnerable neuronal population , with less than 15% of all PD cases account for familial subtype.
Based on DNA linkage studies, 13 distinct human chromosomal locations, PARK loci, have been linked to the disease: PARK1 , PARK2 , PARK3 , PARK4 , PARK5 , PARK6 , PARK7 , PARK8 , PARK9 , PARK10 , PARK11 , PARK12, PARK13 . These loci expand variably, from 7 to 40 Mb, on different chromosomes each of which contains several hundreds of genes. There are 4 orphan PD loci with no associated genes so far and mutation/s in 8 genes, located in 9 out of 13 PARK loci have been linked to PD. Mutations in α-synuclein located on PARK1 and PARK 4, Parkin, Ubiquitin carboxy-terminal-hydrolase-L1(UCHL1), PTEN-induced-putative kinase (PINK1), DJ1, Leucine-rich repeat kinase 2 (LRRK2), ATPase type 13A2 (ATP13A2), HTRA2 genes in PARK2, 5, 6, 7, 8, 9 and 13 have been shown, respectively, to cause PD . Additionally, two other susceptibility genes, Nurr1 (NR4A2) and tau, which show no linkage to previously described PARK Loci, have been linked to families with Parkinson's disease . Each of the PARK loci contains a large number of genes and identification of disease genes requires proper criteria to narrow down the number of candidates. Since five out of seven PD genes (α-SYNUCLEIN, PARKIN, UCHL1, PINK1 and LRRK2) plus the two latter genes (NURR1 and TAU) are expressed in midbrain dopaminergic neurons, possibly linking the abnormality in their expression or structure to selective degeneration of SNpc neurons, expression of genes within this neuronal population seems to be a suitable criterion for narrowing down the number of genes to be further analyzed for identification of PD genes to be associated with orphan PD loci.
To date, six genome-wide screens have been performed to identify gene expression pattern of rodent dopaminergic neurons [16–21]. In a recent study, using the in situ hybridization database, Allen Brain Atlas (ABA) , and the results of the transcriptome analyses, we verified the expression of 362 genes within the dopaminergic neurons of the midbrain . In this study, using the criteria of specific expression, and the strategy described by Gherbassi et al. , we merged the rodent gene expression data from three of the six screens, with human linkage studies to narrow down the number of candidates for disease susceptibility genes.
Transcript collection and processing
The transcripts of the mouse and rat genes were obtained from six published libraries [16–21]. After elimination of redundancies and duplications, above background mRNA expression within ventral midbrain, with expression patterns, resembling that of tyrosine hydroxylase in VTA or SNpc, was verified in ABA , as previously described . We employed each nucleotide sequence for a nucleotide-nucleotide BLAST (blastn) (basic local alignment search tool) on the non-redundant database http://www.ncbi.nlm.nih.gov/BLAST/ and on the mouse genome http://www.ncbi.nlm.nih.gov/genome/seq/BlastGen/BlastGen.cgi?taxid=10090. Using the criteria of highest homology and lowest e-value, for this study, we employed only the unambiguous hits (transcripts), with homology on mouse genome.
Mapping gene locations into the PARK loci
The human analogs for the mouse genes were found, using the Gene, Protein or the BLAST search functions of the NCBI database. After determining the Genbank accession number of the genes, the cytogenetic location on the human genome was determined using the Map Viewer search tool http://www.ncbi.nlm.nih.gov/mapview/. The neighboring genes on the mouse and human genome were considered to verify the identity and the position of the gene in the human genome. The cytogenetic positions on the human genome were compared with previously described PARK loci http://www.pdgene.org. We aligned the human chromosome map view with the OMIM morbid/disease map http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM to identify the PD gene candidates.
Majority of the genes associated with familial Parkinson's disease are expressed within mesDA neurons, the population of neurons which is lost during the course of the disease. We, therefore, used the criterion of expression to prioritize the identification of genes, which may be the risk factors or responsible for the onset or progression of the disease. Recently, we identified the expression pattern of the genes within two major nuclei of the midbrain, substantia nigra pars compacta and ventral tegmental area, detectable by in situ hybridization. This was done by verifying the expression pattern of genes from six published libraries [16–21] in the Allen Brain Atlas in situ hybridization database  and comparing them to the expression pattern of tyrosine hydroxylase within the midbrain. This search confirmed the expression of 362 genes out of the published libraries within mesDA neurons . The results were confirmed and updated as of May 2010.
Cytogenetic locations and linkage to human genome
A recent study by Gherbassi et al. considered the genes in three of the six screens (Thuret et al., Stewart et al., and Barrett et al.), which were performed to identify the expression profile of mesDA neurons. In order to avoid redundancies, we excluded the genes, which were identified by more than one screen and only considered the genes identified by Greene et al., Grimm et al., and Chung et al. After confirming the expression of each gene within the mouse ventral midbrain and determining the homologous human gene name, symbol and accession number (Additional file 1: Table S1), we used the Map Viewer search tool http://www.ncbi.nlm.nih.gov/mapview/ on the NCBI web site to determine the cytogenetic locations of 199 genes. Then, the positions were compared with the positions of the PARK loci by aligning the human chromosome map view with the OMIM morbid map in http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM, using the OMIM identifiers 163890, 602544, 602404, 605543, 191342, 605909, 602533, 607060, 606693, 606852, 607688, 300557 and 610297 for PARK1-13. Of 199 genes, the cytogenetic locations of 20 overlapped with that of PARK loci, which were labeled PD candidate genes. One of these genes, UCHL1, is a known PD gene, and 7 were within Park12, which is an orphan PD locus (Table 1).
Identifying multi-factorial disease-related genes requires methods based on priori knowledge about the candidates. To prioritize the genes, several context-based approaches, ranging from phylogenetic profiling to biochemical data integration have been used. Any given method has its advantages and limitations and the ultimate test for validity of each method is functional relevance of the identified candidates to initiation or progression of the disease. In Parkinson's disease the functional validation has been strongly linked to the neuronal population that is afflicted during the course of the disease, the dopaminergic neurons of substantia nigra pars compacta in the midbrain.
Despite being the most prominent pathological feature of PD, the reasons underlying specific degeneration of these neurons are not fully understood. However, identification of disease genes has been crucial in understanding multiple cellular and molecular mechanisms, contributing to this process. The expression of several of the familial PD genes within nesDA neurons seems to be required for physiological functions and survival of this neuronal population. A number of studies have established the role of multiple PD genes, including PARKIN, PINK1 and DJ1 in degradation of unfolded proteins . Several other studies also have established their neuroprotective role within mesDA neurons against mitochondrial dysfunction in the animal models of the disease . PD genes also play vital functions within the dopaminergic synapses .
Given these data, it is highly likely that familial Parkinson's disease, caused by the loss of function mutations in the PD genes, is due to hindrance to functions of the wild-type forms and that the proper expression of the genes within mesDA neurons is essential to their long-term survival. Considering this hypothesis, genomic convergence, which combines gene expression with genomic linkage analysis, has been used to prioritize candidate susceptibility genes for PD [15, 27]. In this study, we used this approach to find candidates, among the genes that are expressed in mesDA neurons, shown by six rodent studies and verified by using the Allen Brain Atlas in situ hybridization database. A previous study, by Gherbassi et al. had merged the data from three of the six screens to Parkinson's disease linkage studies. Here we found that 20, in addition to 21 human genes, identified by Gherbassi et al., are located in multiple PARK loci (Table 1). The presence of UCHL1, a known PD gene, among the results of this study validates the genomic convergence approach as an efficient tool for prioritization and identification of candidate PD genes. Using the same approach, three other known PD genes (so far, 10% or 4 out of 41 genes, found by two studies, are known PD genes) were identified previously . Additional genetic and disease model studies are needed to determine whether any of the seven genes, which are within PARK12 can be considered a disease gene and their degree of functional relevance to survival and maintenance of mesDA neurons.
Online Mendelian Inheritance in Man
Basic Local Alignment Search Tool
in situ hybridization
Allen Brain Atlas.
Langston JW, Ballard PA: Parkinson's disease in a chemist working with 1-methyl-4-phenyl-1,2,5,6-tetrahydropyridine. N Engl J Med. 1983, 309 (5): 310-
Polymeropoulos MH, Lavedan C, Leroy E, Ide SE, Dehejia A, Dutra A, Pike B, Root H, Rubenstein J, Boyer R, Stenroos ES, Chandrasekharappa S, Athanassiadou A, Papapetropoulos T, Johnson WG, Lazzarini AM, Duvoisin RC, Di Iorio G, Golbe LI, Nussbaum RL: Mutation in the alpha-synuclein gene identified in families with Parkinson's disease. Science. 1997, 276 (5321): 2045-7. 10.1126/science.276.5321.2045.
Matsumine H, Saito M, Shimoda-Matsubayashi S, Tanaka H, Ishikawa A, Nakagawa-Hattori Y, Yokochi M, Kobayashi T, Igarashi S, Takano H, Sanpei K, Koike R, Mori H, Kondo T, Mizutani Y, Schäffer AA, Yamamura Y, Nakamura S, Kuzuhara S, Tsuji S, Mizuno Y: Localization of a gene for an autosomal recessive form of juvenile Parkinsonism to chromosome 6q25.2-27. Am J Hum Genet. 1997, 60 (3): 588-96.
Gasser T, Müller-Myhsok B, Wszolek ZK, Oehlmann R, Calne DB, Bonifati V, Bereznai B, Fabrizio E, Vieregge P, Horstmann RD: A susceptibility locus for Parkinson's disease maps to chromosome 2p13. Nat Genet. 1998, 18 (3): 262-5. 10.1038/ng0398-262.
Waters CH, Miller CA: Autosomal dominant Lewy body parkinsonism in a four-generation family. Ann Neurol. 1994, 35 (1): 59-64. 10.1002/ana.410350110.
Boyer R, Auburger G, Leube B, Ulm G, Mezey E, Harta G, Brownstein MJ, Jonnalagada S, Chernova T, Dehejia A, Lavedan C, Gasser T, Steinbach PJ, Wilkinson KD, Polymeropoulos MH: The ubiquitin pathway in Parkinson's disease. Nature. 1998, 395 (6701): 451-2. 10.1038/26652.
Valente EM, Bentivoglio AR, Dixon PH, Ferraris A, Ialongo T, Frontali M, Albanese A, Wood NW: Localization of a novel locus for autosomal recessive early-onset parkinsonism, PARK6, on human chromosome 1p35-p36. Am J Hum Genet. 2001, 68 (4): 895-900. 10.1086/319522.
van Duijn CM, Dekker MC, Bonifati V, Galjaard RJ, Houwing-Duistermaat JJ, Snijders PJ, Testers L, Breedveld GJ, Horstink M, Sandkuijl LA, van Swieten JC, Oostra BA, Heutink P: Park7, a novel locus for autosomal recessive early-onset parkinsonism, on chromosome 1p36. Am J Hum Genet. 2001, 69 (3): 629-34. 10.1086/322996.
Funayama M, Hasegawa K, Kowa H, Saito M, Tsuji S, Obata F: A new locus for Parkinson's disease (PARK8) maps to chromosome 12p11.2-q13.1. Ann Neurol. 2002, 51 (3): 296-301. 10.1002/ana.10113.
Hampshire DJ, Roberts E, Crow Y, Bond J, Mubaidin A, Wriekat AL, Al-Din A, Woods CG: Kufor-Rakeb syndrome, pallido-pyramidal degeneration with supranuclear upgaze paresis and dementia, maps to 1p36. J Med Genet. 2001, 38 (10): 680-2. 10.1136/jmg.38.10.680.
Hicks AA, Pétursson H, Jónsson T, Stefánsson H, Jóhannsdóttir HS, Sainz J, Frigge ML, Kong A, Gulcher JR, Stefánsson K, Sveinbjörnsdóttir S: A susceptibility gene for late-onset idiopathic Parkinson's disease. Ann Neurol. 2002, 52 (5): 549-55. 10.1002/ana.10324.
Pankratz N, Nichols WC, Uniacke SK, Halter C, Rudolph A, Shults C, Conneally PM, Foroud T, Parkinson Study Group: Genome screen to identify susceptibility genes for Parkinson disease in a sample without parkin mutations. Am J Hum Genet. 2002, 71 (1): 124-35. 10.1086/341282.
Wakabayashi K, Takahashi H: Pathology of familial Parkinson's disease. Brain Nerve. 2007, 59 (8): 851-64.
Gasser T: Update on the genetics of Parkinson's disease. Mov Disord. 2007, 22 (Suppl 17): S343-50. 10.1002/mds.21676.
Gherbassi D, Bhatt L, Thuret S, Simon HH: Merging mouse transcriptome analyses with Parkinson's disease linkage studies. DNA Res. 2007, 14 (2): 79-89. 10.1093/dnares/dsm007.
Thuret S, Bhatt L, O'Leary DD, Simon HH: Identification and developmental analysis of genes expressed by dopaminergic neurons of the substantia nigra pars compacta. Mol Cell Neurosci. 2004, 25 (3): 394-405. 10.1016/j.mcn.2003.11.004.
Stewart GJ, Savioz A, Davies RW: Sequence analysis of 497 mouse brain ESTs expressed in the substantia nigra. Genomics. 1997, 39 (2): 147-53. 10.1006/geno.1996.4503.
Barrett T, Xie T, Piao Y, Dillon-Carter O, Kargul GJ, Lim MK, Chrest FJ, Wersto R, Rowley DL, Juhaszova M, Zhou L, Vawter MP, Becker KG, Cheadle C, Wood WH, McCann UD, Freed WJ, Ko MS, Ricaurte GA, Donovan DM: A murine dopamine neuron-specific cDNA library and microarray/increased COX1 expression during methamphetamine neurotoxicity. Neurobiol Dis. 2001, 8 (5): 822-33. 10.1006/nbdi.2001.0423.
Greene JG, Dingledine R, Greenamyre JT: Gene expression profiling of rat midbrain dopamine neurons: implications for selective vulnerability in parkinsonism. Neurobiol Dis. 2005, 18 (1): 19-31. 10.1016/j.nbd.2004.10.003.
Grimm J, Mueller A, Hefti F, Rosenthal A: Molecular basis for catecholaminergic neuron diversity. Proc Natl Acad Sci USA. 2004, 101 (38): 13891-6. 10.1073/pnas.0405340101.
Chung CY, Seo H, Sonntag KC, Brooks A, Lin L, Isacson O: Cell type-specific gene expression of midbrain dopaminergic neurons reveals molecules involved in their vulnerability and protection. Hum Mol Genet. 2005, 14 (13): 1709-25. 10.1093/hmg/ddi178.
Lein ES, Hawrylycz MJ, Ao N, Ayres M, Bensinger A, Bernard A, Boe AF, Boguski MS, Brockway KS, Byrnes EJ, Chen L, Chen L, Chen TM, Chin MC, Chong J, Crook BE, Czaplinska A, Dang CN, Datta S, Dee NR, Desaki AL, Desta T, Diep E, Dolbeare TA, Donelan MJ, Dong HW, Dougherty JG, Duncan BJ, Ebbert AJ, Eichele G, Estin LK, Faber C, Facer BA: Genome-wide atlas of gene expression in the adult mouse brain. Nature. 2007, 445 (7124): 168-76. 10.1038/nature05453.
Alavian KN, Simon HH: Linkage of cDNA expression profiles of mesencephalic dopaminergic neurons to a genome-wide in situ hybridization database. Mol Neurodegener. 2009, 4 (1): 6-10.1186/1750-1326-4-6.
Tanaka M, Kim YM, Lee G, Junn E, Iwatsubo T, Mouradian MM: Aggresomes formed by alpha-synuclein and synphilin-1 are cytoprotective. J Biol Chem. 2004, 279 (6): 4625-31. 10.1074/jbc.M310994200.
Dodson MW, Guo M: Pink1, Parkin, DJ-1 and mitochondrial dysfunction in Parkinson's disease. Curr Opin Neurobiol. 2007, 17 (3): 331-7. 10.1016/j.conb.2007.04.010.
Chandra S, Fornai F, Kwon HB, Yazdani U, Atasoy D, Liu X, Hammer RE, Battaglia G, German DC, Castillo PE, Südhof TC: Double-knockout mice for alpha- and beta-synucleins: effect on synaptic functions. Proc Natl Acad Sci USA. 2004, 101 (41): 14966-71. 10.1073/pnas.0406283101.
Hauser MA, Li YJ, Takeuchi S, Walters R, Noureddine M, Maready M, Darden T, Hulette C, Martin E, Hauser E, Xu H, Schmechel D, Stenger JE, Dietrich F, Vance J: Genomic convergence: identifying candidate genes for Parkinson's disease by combining serial analysis of gene expression and genetic linkage. Hum Mol Genet. 2003, 12 (6): 671-7. 10.1093/hmg/12.6.671.
The authors declare that they have no competing interests.
This study was designed and supervised by KNA and was performed by SV, MR. The data was analyzed by KNA and HHS. DG and AM partook in design of the study and made critical comments. SV and KNA drafted the manuscript and all authors read and approved the final version.