Challenge #1: Allen Human Brain Atlas Integration Posted By: NITRC ADMIN - Apr 15, 2013Tool/Resource: HBM Hackathon For this challenge, you will try to identify the best relationship betweenimaging data and gene expression data in the Allen Human Brain Atlas. BackgroundThe goal of the Allen Human Brain Atlas is to create a comprehensive map oftranscript usage across the entire adult brain, with the emphasis on anatomically complete coverage in a small number of high-quality, clinically unremarkable brains profiled with DNA microarrays for quantitative gene-level transcriptome coverage. Further, structural brain imaging data were obtained from each individual to visualize gene expression data in its native 3D anatomical coordinate space, and to allow correlations between imaging and transcriptome modalities. Data and ToolsThe Atlas consists of microarray profiles of 3702 neuroanatomically precisesubdivisions sampled over six individuals (5 males, 1 female). Each geneexpression profile contains information for over 58,000 gene probes with93% of known genes represented by at least 2 probes. Sample locations weremapped back into the native brain MRI coordinates and subsequently to Montreal Neurological Institute (MNI) coordinate space. These data are freely accessible via the Allen Brain Atlas data portal (http://human.brain-map.org). They have also been mirrored using Amazon’s AWS S3 cloud storage here. Use the online tools to visualize the expression data as heatmaps or projected into 3D anatomical context. An integrated data service allows users to perform differential and correlative searches to discover genes of interest. For more detail, start with a guided overview or see the recent publication in Nature. DetailsThe data is divided into separate .zip files each of the 6 donors. The zip file contains:
Getting StartedWe’re looking for the best relationship between gene expression in the Atlas and imaging data set. You are not restricted to any particular imaging data set. This sample Matlab code that might be a good place to start. It takes one of the example data sets from the Statistical Parametric Mapping (SPM) package, samples the image at all of the microarray sample MNI coordinates, and correlates the sampled values to gene expression values. You can also look at this sample Python code. It uses numpy to correlate the gene expression values of all of the samples to each other for one donor. There is also some code for picking one probe per gene. It chooses the probe that is most correlated to the other probes for that gene. For inspiration, consider reading this study that integrates imaging and gene expression data: Mueller K, Sacher J, Arelin K, Holiga S, Kratzsch J, et al. Overweight and obesity are associated with neuronal injury in the human cerebellum and hippocampus in young adults: a combined MRI, serum marker and gene expression study. Transl Psychiatry 2: e200. doi:10.1038/tp.2012.121. Good luck! Link to Original Article |
Latest News |
|