TY - JOUR
T1 - A semantic cross-species derived data management application
AU - Keator, David B.
AU - Chen, Jinran
AU - Nichols, Nolan
AU - Fana, Fariba
AU - Stern, Hal
AU - Baram, Tallie Z.
AU - Small, Steven L.
N1 - Funding Information:
This work was supported by the National Institute of Mental Health of the National Institutes of Health under grant P50 MH 096889, “Fragmented early life environment and emotional/cognitive vulnerabilities” (Tallie Z. Baram, Center PI; Steven L. Small, Imaging Core PI, Hal Stern, BCDM Core PI). Their support is gratefully acknowledged.
Publisher Copyright:
© 2017 The Author(s).
PY - 2017/9/20
Y1 - 2017/9/20
N2 - Managing dynamic information in large multi-site, multi-species, and multi-discipline consortia is a challenging task for data management applications. Often in academic research studies the goals for informatics teams are to build applications that provide extract-transform-load (ETL) functionality to archive and catalog source data that has been collected by the research teams. In consortia that cross species and methodological or scientific domains, building interfaces which supply data in a usable fashion and make intuitive sense to scientists from dramatically different backgrounds increases the complexity for developers. Further, reusing source data from outside one’s scientific domain is fraught with ambiguities in understanding the data types, analysis methodologies, and how to combine the data with those from other research teams. We report on the design, implementation, and performance of a semantic data management application to support the NIMH funded Conte Center at the University of California, Irvine. The Center is testing a theory of the consequences of “fragmented” (unpredictable, high entropy) early-life experiences on adolescent cognitive and emotional outcomes in both humans and rodents. It employs cross-species neuroimaging, epigenomic, molecular, and neuroanatomical approaches in humans and rodents to assess the potential consequences of fragmented unpredictable experience on brain structure and circuitry. To address this multi-technology, multi-species approach, the system uses semantic web techniques based on the Neuroimaging Data Model (NIDM) to facilitate data ETL functionality. We find this approach enables a low-cost, easy to maintain, and semantically meaningful information management system, enabling the diverse research teams to access and use the data.
AB - Managing dynamic information in large multi-site, multi-species, and multi-discipline consortia is a challenging task for data management applications. Often in academic research studies the goals for informatics teams are to build applications that provide extract-transform-load (ETL) functionality to archive and catalog source data that has been collected by the research teams. In consortia that cross species and methodological or scientific domains, building interfaces which supply data in a usable fashion and make intuitive sense to scientists from dramatically different backgrounds increases the complexity for developers. Further, reusing source data from outside one’s scientific domain is fraught with ambiguities in understanding the data types, analysis methodologies, and how to combine the data with those from other research teams. We report on the design, implementation, and performance of a semantic data management application to support the NIMH funded Conte Center at the University of California, Irvine. The Center is testing a theory of the consequences of “fragmented” (unpredictable, high entropy) early-life experiences on adolescent cognitive and emotional outcomes in both humans and rodents. It employs cross-species neuroimaging, epigenomic, molecular, and neuroanatomical approaches in humans and rodents to assess the potential consequences of fragmented unpredictable experience on brain structure and circuitry. To address this multi-technology, multi-species approach, the system uses semantic web techniques based on the Neuroimaging Data Model (NIDM) to facilitate data ETL functionality. We find this approach enables a low-cost, easy to maintain, and semantically meaningful information management system, enabling the diverse research teams to access and use the data.
KW - Database
KW - MRI
KW - NIDM
KW - Neuroscience
KW - RDF
KW - Semantic web
UR - http://www.scopus.com/inward/record.url?scp=85030868184&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85030868184&partnerID=8YFLogxK
U2 - 10.5334/dsj-2017-045
DO - 10.5334/dsj-2017-045
M3 - Article
AN - SCOPUS:85030868184
SN - 1683-1470
VL - 16
JO - Data Science Journal
JF - Data Science Journal
M1 - 45
ER -