13
SchizConnect: Mediating neuroimaging databases on schizophrenia and related disorders for large-scale integration Lei Wang a,b, , Kathryn I. Alpert a , Vince D. Calhoun c,d,e,f,g , Derin J. Cobia a , David B. Keator h , Margaret D. King c , Alexandr Kogan a , Drew Landis c , Marcelo Tallis i , Matthew D. Turner m,o , Steven G. Potkin h,l , Jessica A. Turner c,n,o , Jose Luis Ambite i,j,k a Department of Psychiatry and Behavioral Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL, USA b Department of Radiology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA c The Mind Research Network, Albuquerque, NM, USA d University of New Mexico Health Sciences Center, Albuquerque, NM, USA e Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, NM, USA f Department of Psychiatry, University of New Mexico, Albuquerque, NM, USA g Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USA h Brain Imaging Center, University of California, Irvine, CA, USA i Information Sciences Institute, University of Southern California, Marina del Rey, CA, USA j Digital Government Research Center, University of Southern California, Los Angeles, CA, USA k Department of Computer Science, University of Southern California, Los Angeles, CA, USA l Department of Psychiatry & Human Behavior, University of California, Irvine, School of Medicine, Irvine, CA, USA m Department of Computer Science, Georgia State University, Atlanta, GA, USA n Department of Psychology, Georgia State University, Atlanta, GA, USA o Neuroscience Institute, Georgia State University, Atlanta, GA, USA abstract article info Article history: Accepted 23 June 2015 Available online xxxx Keywords: Data mediation and integration Neuroinformatics Mega analysis Schizophrenia databases SchizConnect (www.schizconnect.org) is built to address the issues of multiple data repositories in schizophrenia neuroimaging studies. It includes a level of mediationtranslating across data sourcesso that the user can place one query, e.g. for diffusion images from male individuals with schizophrenia, and nd out from across participating data sources how many datasets there are, as well as downloading the imaging and related data. The current version handles the Data Usage Agreements across different studies, as well as interpreting database-specic terminologies into a common framework. New data repositories can also be mediated to bring immediate access to existing datasets. Compared with centralized, upload data sharing models, SchizConnect is a unique, virtual database with a focus on schizophrenia and related disorders that can mediate live data as information is being updated at each data source. It is our hope that SchizConnect can facilitate testing new hypotheses through aggregated datasets, pro- moting discovery related to the mechanisms underlying schizophrenic dysfunction. © 2015 Elsevier Inc. All rights reserved. Introduction Schizophrenia is a complex psychiatric disease with heterogeneous clinical, behavioral, cognitive and genetic manifestations, and the litera- ture, especially on neuroimaging studies, has yet to achieve a state of consistency and reproducibility. As a result, multi-site consortia have been created to coordinate the collection of large datasets in order to address these issues (Turner, 2014). Consortia such as the Functional Biomedical Informatics Research Network (FBIRN) (Friedman et al., 2008; Helmer et al., 2011a; Keator et al., 2008), the Mind Clinical Imaging Consortium (MCIC) (Gollub et al., 2013), North American Prodrome Longitudinal Study Consortium (NAPLS) (Addington et al., 2012) and Bipolar-Schizophrenia Network for Intermediate Phenotypes (B-SNIP) (Keshavan et al., 2011; Thaker, 2008) have led to the improvement of our understanding of brain circuitry, brain function and genetic variabil- ity in schizophrenia (Allen et al., 2011; Arnold et al., 2015; Cannon et al., 2015; Castro et al., 2014; Chen et al., 2012; Chen et al., 2013; Ehrlich et al., 2014; Hass et al., 2015a, 2015b; Kim et al., 2009; Kim et al., 2010; Mathew et al., 2014; Potkin et al., 2009). These efforts were successful in part be- cause they were created with data sharing in mind, and they anticipated many of the difculties in combining data from multiple sites in their de- sign and schema. Yet, in building data repositories, many decisions were made that were specic to that particular repository or study, about what they would call different data types. In one study's data a structural MRI scan may be listed informatively as T1-weighted scan, or something NeuroImage xxx (2015) xxxxxx Corresponding author at: Northwestern University Feinberg School of Medicine, Department of Psychiatry and Behavioral Sciences, 710 N. Lake Shore Dr, Abbott Hall 1322, Chicago, IL 60614, USA. E-mail address: [email protected] (L. Wang). YNIMG-12362; No. of pages: 13; 4C: 2, 5, 7 http://dx.doi.org/10.1016/j.neuroimage.2015.06.065 1053-8119/© 2015 Elsevier Inc. All rights reserved. Contents lists available at ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Please cite this article as: Wang, L., et al., SchizConnect: Mediating neuroimaging databases on schizophrenia and related disorders for large-scale integration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2015.06.065

SchizConnect: Mediating neuroimaging databases on

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: SchizConnect: Mediating neuroimaging databases on

NeuroImage xxx (2015) xxx–xxx

YNIMG-12362; No. of pages: 13; 4C: 2, 5, 7

Contents lists available at ScienceDirect

NeuroImage

j ourna l homepage: www.e lsev ie r .com/ locate /yn img

SchizConnect: Mediating neuroimaging databases on schizophrenia andrelated disorders for large-scale integration

Lei Wang a,b,⁎, Kathryn I. Alpert a, Vince D. Calhoun c,d,e,f,g, Derin J. Cobia a, David B. Keator h, Margaret D. King c,Alexandr Kogan a, Drew Landis c, Marcelo Tallis i, Matthew D. Turner m,o, Steven G. Potkin h,l,Jessica A. Turner c,n,o, Jose Luis Ambite i,j,k

a Department of Psychiatry and Behavioral Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL, USAb Department of Radiology, Northwestern University Feinberg School of Medicine, Chicago, IL, USAc The Mind Research Network, Albuquerque, NM, USAd University of New Mexico Health Sciences Center, Albuquerque, NM, USAe Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, NM, USAf Department of Psychiatry, University of New Mexico, Albuquerque, NM, USAg Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USAh Brain Imaging Center, University of California, Irvine, CA, USAi Information Sciences Institute, University of Southern California, Marina del Rey, CA, USAj Digital Government Research Center, University of Southern California, Los Angeles, CA, USAk Department of Computer Science, University of Southern California, Los Angeles, CA, USAl Department of Psychiatry & Human Behavior, University of California, Irvine, School of Medicine, Irvine, CA, USAm Department of Computer Science, Georgia State University, Atlanta, GA, USAn Department of Psychology, Georgia State University, Atlanta, GA, USAo Neuroscience Institute, Georgia State University, Atlanta, GA, USA

⁎ Corresponding author at: Northwestern UniversityDepartment of Psychiatry and Behavioral Sciences, 7101322, Chicago, IL 60614, USA.

E-mail address: [email protected] (L. Wang

http://dx.doi.org/10.1016/j.neuroimage.2015.06.0651053-8119/© 2015 Elsevier Inc. All rights reserved.

Please cite this article as:Wang, L., et al., Schiintegration, NeuroImage (2015), http://dx.d

a b s t r a c t

a r t i c l e i n f o

Article history:Accepted 23 June 2015Available online xxxx

Keywords:Data mediation and integrationNeuroinformaticsMega analysisSchizophrenia databases

SchizConnect (www.schizconnect.org) is built to address the issues ofmultiple data repositories in schizophrenianeuroimaging studies. It includes a level of mediation—translating across data sources—so that the user can placeonequery, e.g. for diffusion images frommale individualswith schizophrenia, andfindout fromacross participatingdata sources howmanydatasets there are, aswell as downloading the imaging and relateddata. The current versionhandles the Data Usage Agreements across different studies, as well as interpreting database-specific terminologiesinto a common framework. New data repositories can also be mediated to bring immediate access to existingdatasets. Compared with centralized, upload data sharing models, SchizConnect is a unique, virtual database witha focus on schizophrenia and related disorders that can mediate live data as information is being updated at eachdata source. It is our hope that SchizConnect can facilitate testing newhypotheses through aggregateddatasets, pro-moting discovery related to the mechanisms underlying schizophrenic dysfunction.

© 2015 Elsevier Inc. All rights reserved.

Introduction

Schizophrenia is a complex psychiatric disease with heterogeneousclinical, behavioral, cognitive and genetic manifestations, and the litera-ture, especially on neuroimaging studies, has yet to achieve a state ofconsistency and reproducibility. As a result, multi-site consortia havebeen created to coordinate the collection of large datasets in order toaddress these issues (Turner, 2014). Consortia such as the FunctionalBiomedical Informatics Research Network (FBIRN) (Friedman et al.,2008; Helmer et al., 2011a; Keator et al., 2008), theMind Clinical Imaging

Feinberg School of Medicine,N. Lake Shore Dr, Abbott Hall

).

zConnect: Mediating neuroimoi.org/10.1016/j.neuroimage.2

Consortium (MCIC) (Gollub et al., 2013), North American ProdromeLongitudinal Study Consortium (NAPLS) (Addington et al., 2012) andBipolar-Schizophrenia Network for Intermediate Phenotypes (B-SNIP)(Keshavan et al., 2011; Thaker, 2008) have led to the improvement ofour understanding of brain circuitry, brain function and genetic variabil-ity in schizophrenia (Allen et al., 2011; Arnold et al., 2015; Cannon et al.,2015; Castro et al., 2014; Chen et al., 2012; Chen et al., 2013; Ehrlich et al.,2014;Hass et al., 2015a, 2015b; Kimet al., 2009; Kimet al., 2010;Mathewet al., 2014; Potkin et al., 2009). These efforts were successful in part be-cause they were created with data sharing in mind, and they anticipatedmany of the difficulties in combining data frommultiple sites in their de-sign and schema. Yet, in building data repositories, many decisions weremade thatwere specific to that particular repository or study, aboutwhatthey would call different data types. In one study's data a structural MRIscan may be listed informatively as “T1-weighted scan”, or something

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 2: SchizConnect: Mediating neuroimaging databases on

2 L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

as complex as “5MPRAGE-AVG” or just “scan1”. Combining data fromdatasets that do not share the same protocols and data structures is chal-lenging and it remains a barrier to aggregating mega-datasets. Whencombining data across different sources, individual investigators facethe difficult and costly process of understanding the different imaging,subject, assessment, meta-data content (definitions, formats, organiza-tions) and converting the data into standardized terms. A critical andunmet need exists to automate and virtualize this process so that appro-priate data can be retrieved and combined from different databases re-gardless of differences in the their structure and terminology.

In this paper, we describe the initial deployment of “SchizConnect”(publicly available at http://www.schizconnect.org/), a new resourceaimed at establishing mega-datasets (Wang et al., 2014). Building onthe success of previous consortia efforts, SchizConnect creates a re-source that supports structured querying and retrieval of neuroimagingand related data across these consortia repositories.

Methods

SchizConnect architecture

SchizConnect follows the classical virtual data integrationframework (Florescu et al., 1998; Halevy, 2001; Lenzerini, 2002;Ullman, 1997; Wiederhold, 1992), which saw its initial theoretical

SchizConnect M

Data Source Interface

Query Optimizer

Query Rewriter

HID Data SourceFBIRN–Federated data

XNAT DataNUSDAST in XNA

SchizConnect.oWeb Portal

Mediator Interface

Fig. 1. SchizConnect architecture&data flow. The SchizConnect architecture has the following3org web portal. After the user builds the query at the SchizConnect web portal, the query commSchizConnect domainmodel to translate the incoming query into source-specific schemas, andlanguages, and variables. The returns from the sources are handled by themediator engine, whicportal then interacts with the user for further processing, including signing of data use agreem

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

application on schizophrenia neuroimaging data within the FBIRNconsortium (Ashish et al., 2010). A distinct feature of this frame-work is that the data can reside at and be maintained at the originaldata sources in their original formats and schemas (i.e., their ownways to define the structure, content, and semantics of data).SchizConnect consists of the following three fundamental components(Fig. 1): 1) the data sources that provide the data, structured accordingto native schemas; 2) the mediation software (i.e., mediator) that recon-ciles the semantic differences in the data across the different sourcesthrough domain modeling and inter-schema mapping; and 3) theweb portal that provides a user-friendly interface for querying anddownloading data.

The typical data flow begins with a user query constructed by drag-ging and dropping terms in a graphical user interface (GUI) at theSchizConnect web portal. The graphical query is translated to SQL andpassed to the SchizConnect mediator. The mediator then relies on theSchizConnect domain model and inter-schema mappings to translatethe user query into source-specific terms, and then queries each datasource directly and concurrently, using the available source-specificquery mechanisms, languages and variables. The results from thesources are collated and joined together by the mediator and providedto the SchizConnect web portal as a unified result table containing themediated (harmonized) variables. The SchizConnect web portal theninteracts with the user for further processing, including signing of data

ediator @ USCLogical Schema

Mappings

sSource Cost

Statistics

COINS Data SourceMCIC & COBRE in COINS

SourceT Central

rg Web PortalData Warehouse

Data Source Interface

components: the federated data sources, the SchizConnectmediator, and the SchizConnect.and is passed to the SchizConnect mediator engine. Themediator engine then relies on thequeries each data source directly and in parallel, using source-specific query mechanisms,h provides the SchizConnectweb portalwith a unified results table. The SchizConnectwebents and data download.

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 3: SchizConnect: Mediating neuroimaging databases on

Table 1Sample characteristics on modeled terms.

Dx No knowndisorder

Schizophreniabroad

Schizophreniastrict

Schizoaffective Bipolardisorder

Sibling of schizophreniastrict

Sibling of no knowndisorder

Number of current subjectsa 481 117 333 38 10 44 66Gender (m/f) 295/186 87/30 242/91 22/16 5/5 21/23 16/50Age (years) (mean ± s.d.) 34.2 ± 12.9 34.4 ± 11.0 35.4 ± 12.9 39.5 ± 10.0 46.6 ± 14.4 21.6 ± 3.7 20.4 ± 3.5Age range (years) min–max 13–67 18–61 17–66 19–59 21–64 14–28 14–28

a 40 subjects from the FBIRN database contains no diagnosis information and are not listed here.

3L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

use agreements and data download. Below we describe each of thecomponents in detail.

The data sources

SchizConnect accesses the data available at each of the sources inreal time. It maintains a library of connectors (wrappers) to commonsource types, including relational databases, XML (Extensible MarkupLanguage) or JSON (JavaScript Object Notation) databases, SOAP(Simple Object Access Protocol) or REST (Representation State Transfer)web services, or flat files. For previously unseen source types, the archi-tecture is extensible to program new connectors. Each source runs theirown server platforms with their own database schemas, formats andapplication programming interfaces (APIs). SchizConnect imposes norequirements on data sources with regard to databasing or accessmethods, but is flexible to special needs (see results below for specialscase example).

SchizConnect mediator

The SchizConnect mediator is built on the BIRN mediator (Ashishet al., 2010; Helmer et al., 2011b). The core idea is to define a commondomain model/schema and inter-schema mapping rules that mapbetween the domain schema and the different source schemas. Thedomain model defines an integrated view of the shared data in theapplication domain (i.e., SchizConnect). This concept is closely relatedto that of an ontology, but it is more limited in scope, in the sense thatit only models the part of the application domain that is relevant tointegrating a set of sources (Ashish et al., 2010). It can be viewed as anincremental, data-driven, pragmatic approach to domain ontology de-velopment. Inter-schema mappings are logical formulas, or rules, thatrelate the data models at each of the sources with the domain model,reconciling their semantic differences in the description of the data.The SchizConnect domain model consists of terms that are pertinent to

Table 2SchizConnect data sources, including current and planned sample sizes.

Data source FBIRN

Project name FBIRN Phase IIDatabase(language)

HID(PostgreSQL)

URL http://fbirnbdr.nbirn.net:8080/BNumber of current subjects (with imaging data) 255

(251)Image type Structural MRI

Resting-state fMRITask fMRIdMRI

Number of current subjects with multimodal imagingb 208Number of New subjects (timeline) 400–FBIRN Phase 3

(2015)

a The native COINS architecture involves dynamic data packaging following the query, whichfrom the COINS executive committee, an API was built to enable us to extract domain-model ddatabase underlying the duplicated MCICShare/COBRE data is MySQL using the relational data

b Multimodal imaging is defined as having any of the combination of the following: T1, diffu

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

schizophrenia neuroimaging research (such as those related to imagingprotocol, disease severity or cognitive functioning). Definitions of theseterms are provided via downloadable documents available on theSchizConnect web portal. During development, we have aligned theSchizConnect domain model terms with the standards provided by theontology community: XCEDE (XML-Based Clinical Experiment DataExchange Schema, http://www.xcede.org/XCEDE.html) (Gadde et al.,2012), NeuroLEX (the Neuroscience Lexicon, http://neurolex.org/)(Larson and Martone, 2013), NIF (the Neuroscience Information Frame-work, http://neuinfo.org/) (Gardner et al., 2008) and INCF (the Interna-tional Neuroinformatics Coordinating Facility, http://incf.org/) (BjaalieandGrillner, 2007; De Schutter, 2009),wherever appropriate. The domainterms with links to existing ontology can be found on the SchizConnectwebsite documentations page. For example, the Positive and NegativeSymptom Scale (PANSS) has an entry on NeuroLex (birnlex_3032,http://neurolex.org/wiki/Category:Positive_and_Negative_Symptom_Scale). In cases where no accepted ontology has been defined, such as theUPSA (University of California Performance-based Skills Assessment), wework with the neuroimaging and neuroinformatics community to definethem. Although such effort is ongoing, since there doesn't yet exist a stan-dard process for ontological term contribution, we will work with datacontributors and the neuroinformatics community on a case-by-casebasis.

The mediator performs two core functions. First, it uses the inter-schema mappings to rewrite the user query, e.g., “T1 images from indi-viduals with schizophrenia older than 30 years old,” from the domainschema to the formats and languages specific to each source. It currentlyuses GAV (global-as-view)mappings (Ashish et al., 2010; Florescu et al.,1998; Halevy, 2001; Lenzerini, 2002; Ullman, 1997;Wiederhold, 1992),where each domain predicate is constructed as a query over the sourcepredicates. Second, the mediator constructs, optimizes, and executes adistributed query evaluation plan, based on the rewritten source-levelquery, whichprovides the answers to the user query. The source queriesare executed directly over the data sources in parallel through each

NU MRN

NUSDAST MCIC & COBREXNAT(XML)

COINS Data Exchangea

DR/ https://central.xnat.org/REST/projects/NUDataSharing http://coins.mrn.org/451(368)

423(410)

Structural MRI Structural MRIResting-state fMRITask fMRIdMRI

– 404130(2015)

200(2016)

does not allow for data to be immediately returned to the query engine. With permissionefined variables from the COINS databases and to duplicate them at the mediator site. Themodel.sion MRI, task-paradigm fMRI, and resting-state fMRI.

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 4: SchizConnect: Mediating neuroimaging databases on

Table 3Example queries and returns at SchizConnect.org.

Query terms Number of images & distinct subjects returned

Total fBIRN@ HID

COBRE@ COINS

MCICShare@ COINS

NUSDAST@ XNAT

T1 SCZ + CON 3178 1048 484 264 1382737 185 173 95 284

3T T1 20–60 Yr 1348 750 492 106 –345 112 180 53

T1 healthy controls 1846 655 243 264 684439 113 94 95 137

Sternberg SCZ 3 T b= 30 Yr 210 210 – – –18 18

Resting BP + CON 103 – 103 – –101 101

Sensory motor + T1,SCZ + CON, 30–50Yr (multimodal)

1310 1140 – 170 –119 83 36

All MRI 21,309 13,552 1596 4347 18141029 251 198 212 368

1 This link will be replaced by the SchizConnect web portal in the near future.2 On the COINS website, after user builds a query, data are packaged offline and a link is

provided for downloading. Therefore the user never sees the actual data until the data aredownloaded via the link.

4 L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

source connector (which wraps the source APIs). Results from sourcequeries are collated and joined together in the mediator, and served tothe SchizConnect web portal as a unified table organized using the do-main schema terms.

The query execution engine used by the mediator for source-levelquery evaluation is based on the Open Grid Services Architecture DataAccess and Integration (OGSA-DAI) and Distributed Query Processing(OGSA-DQP) services (http://www.ogsadai.org.uk/) (Antonioletti et al.,2005; Grant et al., 2008; Lynden et al., 2009; Lynden et al., 2008).OGSA-DAI is a streaming workflow engine in which data resources andweb services are wrapped through a library of connectors. OGSA-DQP isa query engine that optimizes and implements the query evaluationplan as anOGSA-DAIworkflow,which accesses themultiple data sources.

The SchizConnect domain query language is SQL, expressed onthe domain model terms. Each data source query is wrapped as anOGSA-DAI resource through its library of connectors to common datasources such as PostgreSQL, MySQL, or XML. For additional details onthe mediator architecture, see our previous work on the BIRN mediator(Ashish et al., 2010).

SchizConnect web portal

SchizConnect has been conceptualized as a one-stop resource forquerying and retrieving neuroimaging and related data from distributed,heterogeneous repositories. SchizConnect is therefore set up to facilitateuser registration and compliance with data use agreement (DUA) witheach data source. Tomaximize the usage and efficiency, while complyingwithNIHpolicies andbest practices, the SchizConnectwebportal is struc-tured to have two query modes. The first is a preliminary search, forexample, for the number of male subjects between the ages of 20 and60 with diffusion MRI scans. This mode is completely open to the public,allowing for anyone to browse and immediately obtain useful, current in-formation on a current summary count of available data without havingthe need to register. However, no subject-level data at this point aredownloadable. In the second mode, the user registers and signs in, andwith the same query is able to view subject-level data, sign DUAs anddownload the data. Only data forwhichDUAhas been signed are down-loadable. Signed DUAs are forwarded to each respective data source ad-ministrator for reviewing and filing. The administrators at SchizConnectas well as each data source will work with each user's institution fordata use and institutional review board (IRB) terms upon request.Regarding participant privacy, we require that it is the responsibilityof the data source to ensure that data are shared in accordance torules and regulations set forth by their IRB. The responsibility ofSchizConnect is to confirm with each data source that they agree to thisresponsibility.

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

Results

The initial deployment of SchizConnect makes available data on1129 subjects from its contributing sources. Sample demographicscan be found in Table 1 as well as on the SchizConnect web portal(http://schizconnect.org/#subject-stats). Of these subjects, 1029 havescan data, which include structural MRI (sMRI), resting-state functionalMRI (fMRI), task-paradigm fMRI, anddiffusionMRI (dMRI) scans, totaling21,309 volumes (Tables 2 and 3). Demographic, neuropsychologicalmeasures, and clinical assessments are either available or in the processof becoming available. As the data sources take in new subjects/scans,they can automatically become available for querying throughSchizConnect, depending on the source data repository's policies. Atthe writing of this manuscript, SchizConnect has 49 users and 37 down-loads, and that statistic is steadily growing (http://schizconnect.org/#user-stats).

SchizConnect data sources

Current data sources include the following schizophrenia-relateddatasets, which are all publicly available themselves and have beenextensively curated, documented, and subjected to quality assurance.Detailed explanations of data can be found in the accompanying specialpapers describing the data source repositories, as well as the followingpapers: (Cetin et al., 2014; Glover et al., 2012; Gollub et al., 2013;Wang et al., 2013).

• FBIRN Phase II dataset @ UCI (http://fbirnbdr.nbirn.net:8080/BDR/)1

(Glover et al., 2012; Potkin and Ford, 2009), containing cross-sectional multisite data from 251 subjects, each with two visits. (Seepaper on FBIRN data in this special issue.) Data include sMRI andfMRI scans collected on a variety of 1.5 T and 3 T scanners, includingSternberg Item Recognition Paradigm (SIRP) and Auditory Oddballparadigms, breath-hold and sensorimotor tasks. The underlying data-base is HID (Keator et al., 2009; Ozyurt et al., 2010) in PostgreSQLusing the relational data model. Access is provided via Java databaseconnectivity technology (JDBC), a database-access API for the Javaprogramming language. Results from the query are returned to theMediator as an SQL ResultSet.

• NUSDAST @ XNAT Central (https://central.xnat.org/REST/projects/NUDataSharing) (Wang et al., 2013), containing data from 368subjects, the majority with longitudinal data (~2 years apart). (Seepaper on NU data in this special issue.) Data include sMRI scanscollected on a single Siemens 1.5 T Vision scanner. The underlyingdatabase is XNAT (Marcus et al., 2007), using the XML data model.Access is provided via XNAT REST API. The mediator constructs aquery XML document conforming to the XNAT search web servicespecification. Results from the query are returned to the Mediator asan XML document.

• COBRE & MCICShare @ COINS Data Exchange (http://coins.mrn.org/)(Bockholt et al., 2010; Scott et al., 2011; Wood et al., 2014), con-taining data from 198 and 212 subjects from COBRE (Cetin et al.,2014) and MCICShare (Gollub et al., 2013) projects, respectively.(See paper on COINS data in this special issue.) Data for COBREinclude sMRI and rest-state fMRI scans collected on a single 3 Tscanner. Data for the multisite MCICShare include sMRI, rest-statefMRI and dMRI scans, collected on 1.5 T and 3 T scanners. COINSdata required special handling in order to satisfy SchizConnect'sneeds because the native COINS architecture involves dynamicdata packaging following the query, which does not allow fordata to be immediately returned to the query engine.2 With

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 5: SchizConnect: Mediating neuroimaging databases on

5L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

permission from the COINS executive committee, an API wasbuilt to enable SchizConnect to extract domain-model definedvariables (see below) from the COINS databases and to dupli-cate them at the Mediator site. The database underlying the du-plicated MCICShare/COBRE data is MySQL using the relationaldata model. Access is provided via JDBC to the MySQL server.Results from the query are returned to the Mediator as an SQLResultSet.

Subject

Min Age

Max Age

Sex

Diagnosis

Project

COINS MCICShare

COINS COBRE

fBIRN Phase II

NUSDAST

A B

Imaging Protocol

Structural

Perfusion

T1

T2

Functional Resting State

Field Mapping

Diffusion

Task Paradigm

MRI

CField

Strength

1.5 T3 T4 T

MakeSiemens

GEPhilips

ModelVisionTrio

Fig. 2. SchizConnectmediator (Virtual) domain Schema and Inter-schemaMappings. (A) Project, cfrom 3 data sources are included in the domain model. (B) Subject, containing demographic anddefined by “minimum age,” “maximum age,” “sex,” and “diagnosis.” Current diagnosis categorieswhich includes “schizophrenia strict” and “schizoaffective.”We note that the precise definition o“control” to label the healthy comparison subjects provided by each data source. Instead, the termscanner platforms and imaging protocols under which subject imaging data were collected. “Immapping.” Both “structural” and “functional” protocols are subdivided for further distinctions inclpsychological assessments. The domain model uses the MATRICS Consensus Cognitive Batteryfunction,” “learning,” “working memory,” “episodic memory,” “intelligence,” “language,” “mo(E) Clinical domain contains full demographics, assessments of symptoms and functional capacity,“SES (socioeconomic status),” and “Handedness.” Symptom measures include “PANSS (Positive“SANS (Scale for the Assessment of Negative Symptoms),” using the Unified Medical Language S“Mood,” “Suicide Ideation,”, and “Extrapyramidal Symptoms.” Medical information includes “M“Nicotine Addiction/Dependence.” (F) Examples of inter-schemamappings. For example, subjectwith a diagnostic codemapping table to harmonize thediagnoses; normalizeddiagnosis of Strict Sctables with subject and assessment data, then selecting the subjects according to a predefined alg

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

SchizConnect mediator

The SchizConnect mediator domain schema defines the followingconcepts: project, subject, imaging, cognitive, and clinical, described indetail below.

• Project contains the name and description of the 4 projects from3 datasources for which data are collected (Fig. 2A).

No Known Disorder

SchizoaffectiveSchizophrenia Strict

Mental Disorder

Psychotic Disorder

Bipolar Disorder

Schizophrenia Broad

Auditory OddballFinger TappingBreath Hold

Go NoGoMismatch Negativity

Sensory MotorSIRP

Working Memory

FLASHMPRAGE

ontaining description of the research project forwhich data are collected. Currently 4 projectsdiagnostic information for individual participants of the project. “Subject” is subsequentlyare: “no known disorder” (for healthy controls), “bipolar disorder,” “schizophrenia broad,”f “control”may depend on specific studies and study populations, we did not use the term“no known disorders” is used. (C) Imaging_Protocol (MRI), containing information on MRI

aging protocol” is subsequently defined by “structural,” “functional,” “perfusion,” and “fielduding “T1,” “T2,” “resting state,” and “task paradigm.” (D) Cognitive domain contains neuro-(Nuechterlein et al., 2008) terms whenever appropriate, which are “attention,” “executivetor,” “premorbid functioning,” “processing speed,” “social cognition,” and “visuospatial.”andmedical information. Demographic information includes “Race,” “Ethnicity,” “Education,”And Negative Symptoms Scales),” “SAPS (Scale for the Assessment of Positive Symptoms),”ystem (UMLS) terms whenever appropriate. Symptom measures also include “Depression,”edical History/Medication,” “SCID (Structured Clinical Interview for DSM Disorders),” anddata for the NUSDAST source is obtained by calling the XNAT search web service and joininghizophrenia for the subjects in theHID source is computedon thefly, byfirst joining threeHIDorithm on assessment values.

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 6: SchizConnect: Mediating neuroimaging databases on

D

Cognitive

Learning & Memory

Attention

Social Cognition

Motor

Executive Function

Premorbid Functioning

IQEmotional IQ

Verbal LearningVisual Learning

Verbal Working Memory Visual Working MemoryVerbal Episodic Memory Visual Episodic Memory

Language

Processing Speed

Visuospatial

Intelligence

Clinical

E

F

Symptoms -Extrapyramidal

Demographics

Symptoms -Psychopathology

Functional Capacity

Medical

RaceEthnicityEducation

SESHandedness

PANSSSAPSSANS

DepressionMood

Suicide Ideation

Medical HistorySCID

Nicotine

Functional Capacity

Quality of Life

Fig. 2 (continued).

6 L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

• Subject contains demographic and diagnostic information for individualparticipants, including “subject id”, “age”, “sex” and “diagnosis”(Fig. 2B). Current diagnosis categories are: “no known disorder” (for

Fig. 3. SchizConnectwebportal, query, and return. (A) SchizConnet.orgAt thewebportal, the statusitemaintenance orwhatever reason, since each data source is queried independent of the others,and therefore their lack of return. (B, C, D) Query and reviewwithout signing up. Even without sigquery of “T1-weighted 3 T scans of subjects between the ages 20 and 60.” The query is built fromadownloadable images from 345 distinct subjects from COBRE,MCICShare and FBIRN Phase II datasreturned from the query is now provided as an on-screen table. This table lists sortable variablesinformation to decide whether to further refine the query. The user has the ability to modify, savthe data use agreement page and then data downloading. In the casewheremultiple images are reMRI session, these images will be indicated with unique “Datauri's” (such as the first 4 entries seeDUAs to agreewith, and receive data only fromdata sourceswithwhich the DUAs have been agreestaged at SchizConnect.org host together with the mediated meta-data table for a specified limiteindividual segments, easily reconstructablewith the 7-zip utility (http://www.7-zip.org/). They conor NIFTI formats. Links to these files along with unpacking instructions are sent to the user via em

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

healthy controls), “bipolar disorder,” “schizophrenia broad,” whichincludes “schizophrenia strict” and “schizoaffective.”We note that theprecise definition of “control” may depend on specific studies and

s of the data sources is shown. If one ormore of the data sources are unavailable due to sourcedata sources that are operational can still be queried. The end user iswarned of the down sitesning up, any end user has the ability to query SchizConnect. Here an example is shown for adrag-and-drop user-friendly graphical user interface. This particular querywould return 1348ets. (E) Query and review after signing up. After signing in, detailed subject-level informationincluding provenance, age, sex, and scan parameters. The end user can review and use thise and retrieve queries. A data download button is also active at this point, which will lead toturned for the same subject, for example as the result ofmultiple acquisitionswithin the samen here) fields. (F, G, H) DUA and data download. The user has the option of choosing whichd. The imaging data resulting from the queries are first transferred out of the data sources andd time period for downloading. Imaging data files are compressed and packaged into 10 GBtain the original and/or preprocessed imaging data shared by each source, inDICOM,Analyze,ail and are available on the “MySchizConnect” page of the website.

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 7: SchizConnect: Mediating neuroimaging databases on

7L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

study populations, we did not use the term “control” to label thehealthy comparison subjects provided by each data source. Instead,the term “no known disorders” is used.

• Imaging_Protocol (MRI) contains information on MRI scannerplatforms and imaging protocols. “Imaging protocol” consists of“structural,” “functional,” “perfusion,” and “field mapping” categories.Both “structural” and “functional” protocols are subdivided for furtherdistinctions, e.g., “T1,” “T2,” “resting state,” and “task paradigm”(Fig. 2C).

Query

A

B

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

• Cognitive domain contains neuropsychological assessments. Thedomain model uses the MATRICS Consensus Cognitive Battery(Nuechterlein et al., 2008) terms whenever appropriate, which are“attention,” “executive function,” “learning,” “working memory,”“episodic memory,” “intelligence,” “language,” “motor,” “premorbidfunctioning,” “processing speed,” “social cognition,” and “visuospatial”(Fig. 2D).

• Clinical domain contains full demographics, assessments of symptomsand functional capacity, and medical information. Demographic

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 8: SchizConnect: Mediating neuroimaging databases on

QueryC

DReview

Fig. 3 (continued).

8 L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

information includes “Race,” “Ethnicity,” “Education,” “SES (socioeco-nomic status),” and “Handedness.” Symptom measures include“PANSS (Positive and Negative Symptoms Scales),” “SAPS (Scale forthe Assessment of Positive Symptoms),” “SANS (Scale for the Assess-ment of Negative Symptoms),”using theUnifiedMedical Language Sys-tem (UMLS) terms whenever appropriate. Symptom measures alsoinclude “Depression,” “Mood,” “Suicide Ideation,”, and “Extrapyramidal

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

Symptoms.” Medical information includes “Medical History/Medication,” “SCID (StructuredClinical Interview forDSMDisorders),”and “Nicotine Addiction/Dependence” (Fig. 2E).

The SchizConnect mediator relates the above domain modelconcepts to the terms in the source schemas using inter-schema

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 9: SchizConnect: Mediating neuroimaging databases on

E

FDUAs

Review

Fig. 3 (continued).

9L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

mappings (see Fig. 2F for sample mappings). For example, subjectdata for the NUSDAST source is obtained by calling the XNAT searchweb service and joining with a diagnostic code mapping table to har-monize the diagnoses; normalized diagnosis of Strict Schizophreniafor the subjects in the HID source is computed on the fly, by first join-ing three HID tables with subject and assessment data, then selectingthe subjects according to a predefined algorithm on assessmentvalues.

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

Currently, the SchizConnect model comprises 7 domain predicates,17 source predicates (from the 4 sources HID, NUSDAST, COBRE andMCICShare, a local database that contains value mapping information,and utility functional sources), and 18 inter-schema mappings (10 forHID, 4 for NUSDAST, and 4 for COBRE/MCICShare). It is worth notingthat since the inter-schema mappings are specified in a declarativerule language, themediator is easily extensible to incorporate additionalsources by writing the appropriate inter-schema mappings.

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 10: SchizConnect: Mediating neuroimaging databases on

Download

Download

G

H

Fig. 3 (continued).

10 L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

SchizConnect.org web portal and example queries

At the http://www.schizconnect.org/ web portal, the operatingstatuses of the data sources are displayed (Fig. 3A). Queries areperformed only on data sources that are operational. Data queries areconstructed using a drag-and-drop GUI, in which a series of logical“AND” and “OR” operators can be concatenated filtering on domainmodel terms.

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

An example query is shown in Fig. 3B, C, with summary return infor-mation presented Fig. 3D. Additional sample queries are listed inTable 3. Upon signing in, detailed subject-level information returnedfrom the query is presented to the user as an on-screen table (Fig. 3E).This table lists sortable domain model variables including provenance,age, sex, and scan parameters for each subject with downloadableimages. The user can review and use this information to decidewhetherto further refine the query. The user also has the ability to modify, save

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 11: SchizConnect: Mediating neuroimaging databases on

11L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

and retrieve queries. A data download button is available that leads tothe DUA page (Fig. 3F). The user must sign the SchizConnect DUA, andhas the option of choosing which participating source DUAs to agreewith, and receives data only from data sources with which the DUAshave been agreed.

For downloading, the imaging data resulting from the queries aretransferred out of the data sources and staged at the SchizConnect.org host, together with the returnedmeta-data table of the mediatedvariables. Transfer is performed via gridFTP, REST API, HTTP, orothers, depending on the specific data transfer protocol at each source.Imaging data files are compressed and packaged into 10-GB easilyreconstructable, individual 7-zip (http://www.7-zip.org/) segments,available for the user to downloadwithin a specified limited time period(currently 2 weeks). Depending on the size of the requested imagingdata, it can take up to a few days for all data to be staged for download.These data files contain the original and/or preprocessed imaging datashared by each source, in DICOM, Analyze, or NIFTI formats. Links tothese files along with unpacking instructions are sent to the user viaemail. The links are also available on the “My SchizConnect” page ofthewebportal (Fig. 3G, H). Documentation for descriptions of image for-mats and directory structures can be found on the SchizConnectwebsite(http://www.schizconnect.org/documentation). All cognitive and clinicaldata (see model description above) can be included in the downloadpackage along with the download imaging data. On the documentationpage, the user can finduseful information including: Tutorials on how touse the website, DUAs of all data sources, data descriptions, data dictio-naries, and peer-reviewed journal papers related to the data. Since notall neuroimaging data (such as scan sequence parameters) have beenmodeled into the SchizConnect domain hierarchy, it is important to pro-vide technical information on our website to facilitate research. In thedata description section, the NUSDAST description lists structural pa-rameters (e.g., 3D MPRAGE: TR = 9.7 ms, TE = 4 ms, flip = 10°,ACQ = 1, 256 × 256 matrix, 1 × l mm in-plane resolution, 128 slices,slice thickness=1.25mm, 5:36min scan time each) and the COBRE de-scription lists resting-state parameters (resting state scans consisting of149 volumes of T2*-weighted functional images, acquired using agradient-echo EPI sequence: TR = 2 s, TE = 29 ms, flip = 75°, slicethickness= 3.5mm, slice gap=1.05mm, field of view=240mm,ma-trix size=64× 64, voxel size=3.75mm×3.75mm×4.55mm). In thePublications section, we have provided a link to a peer-reviewed journalpublication that provide detailed technical descriptions on each datasource. For example, Gollub et al. (2013) describes “theMCIC collection:a shared repository of multi-modal, multi-site brain image data from aclinical investigation of schizophrenia.” Similar papers describing allthe other data sources currently participating in SchizConnect can befound here.

Discussions and future plans

These initial results demonstrate that SchizConnect allows combiningof neuroimaging data from different databases via mediation to formcompatible mega-datasets with accuracy and fidelity. In SchizConnect,data remains with the source rather than warehoused in a central repos-itory. Providers maintain control of their data and do not need to modifythem for sharing. The user query is done over a single, consistent, well-defined model that is translated to the schemas of the sources. This ap-proach unburdens the user and each source from having to make contactand helping to interpret the data each time, especiallywhen newdata arebeing continuously added to the data source repository. Theweb portal isuser-friendly and intuitive, appearing to the user as a single, virtual data-base with real-time query and download performance. As an on-goingproject, we are continuing to define additional domain model terms forneuropsychological and psychopathological variables, make available ad-ditional imagingmodalities and subjects, and identify andevaluate poten-tial new data sources. Work is also under way integrating with clinicalresearch databases such as Research Electronic Data Capture (REDCap)

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

(Harris et al., 2009) where clinical and cognitive assessment data arestored, further enhancing the functionality of SchizConnect by searchingacross imaging and non-imaging databases for the same subject.

SchizConnect shows considerable potential for overcoming currentbarriers for creating large-scale datasets to increase statistical power,accelerating the testing of new hypotheses and methods, and creatinga resource for developing advanced techniques to better integrate dis-parate data at low cost. As a proof-of-concept for combining datasets,we compared SAPS/SANS and neurocognition across the current datasources. For SAPS/SANS, we found that while fewer schizophreniasubjects in the NU data scored 2 or higher, the distributions were notstatistically different between sites. For neurocognitive measures, wecomputed z-scores on episodic memory, working memory, attentionand executive function domains using each site's own controls as refer-ence sets. No statistical difference between sites was observed (NUrange: −1.23 to −0.71, FBIRN: −1.28 to −0.65), indicating thatschizophrenia subjects across both consortia exhibit similar degrees ofpsychopathology and cognitive impairment. Nonetheless, subtle differ-ences in cohort make-up may still exist upon closer inspection, butusers will have the opportunity to explore the sources of potentialheterogeneity and use such differences to benefit in developing newhypotheses.

SchizConnect was aimed at combining across different types of datarepositories, and its current data sources are examples of classic types ofdata repository: The HID that houses the FBIRN data is a multisite studyrepository with prospectively collected data using pre-determined pro-tocols; The COINS Data Exchange that houses theMCICShare and COBREdata, and the XNAT Central that houses the NUSDAST data, are central-ized data repositories where data from unrelated studies are uploadedand stored, and the database platforms provide the user with queryabledata structures and interfaces. When owners of these datasets are will-ing to share subject-level data (i.e., scans, assessments), SchizConnect isdesigned to interact with such data repositories for mediation. Wheninvestigators are only willing to share measures, meta-analysis effortssuch as the ENIGMA (Gupta et al., 2014; Thompson et al., 2014; vanErp et al., in press; Wright et al., 2015) will be an excellent alternativefor data sharing.

During the initial implementation of SchizConnect using the currentdata sources as test beds, we have established a series of domainmodelsand their translations to the initial data sourcemodels. TheSchizConnectmediation approach simply translates user queries into the querylanguage of each data source (i.e., equivalent to the user performingthe same query at the individual data source's own web portal). TheSchizConnect data models can be extended easily to accommodatenew data structures of a new data source while placing no restrictionsonwhat the data sourcewants to share. Although the process of domainunderstanding and data modeling can be labor intensive, we have laidthe foundation for extending SchizConnect to other schizophrenianeuroimaging data sources, which will build on the existing schemas.A limitation of the current domainmodelwork is that theunderstandingand modeling of source data remains a manual step. In SchizConnect,this data mediation process starts with domain understanding byexperts on the specific data that is being integrated so that we have aclear knowledge on the data terms and their definition. The data media-tion experts can then integrate this new knowledge into existingSchizConnect models to enable the data to be accessedwithout requiringit to bemodified.We have provided a questionnaire on the schizconnect.org web portal that can initiate the process, http://schizconnect.org/questionnaires/1/responses/new. It is important as a next step for us todevelop approaches that learn from existing data to facilitate more auto-mated procedures for inclusion of new data sources (e.g., following ourprevious work (Knoblock et al., 2012)). Another limitation is query eval-uation in the virtual integration approach, is generally slower that queryevaluation on a warehouse-model database.

Finally, SchizConnect is the only data finder (data broker) of itskind for researchers interested in combining neuroimaging data on

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 12: SchizConnect: Mediating neuroimaging databases on

12 L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

schizophrenia and related disorders from disparate sources. Existingdata portals such as the COINS, FBIRN, and XNAT Central, data sourcesused in this project, contain schizophrenia data but serve as repositoriesof these types of data. When users intent to combine data across thesesources, harmonization of data terms and mediation of these termsremain a critical challenge andbarrier. SchizConnect is fulfilling this crit-ical need, and the data mediation framework that SchizConnect createdcan be readily extended to other clinical domains including BipolarDisorder. Recent developments on data harmonization have led to thecreation of the Research Domain Criteria Database (RDoCdb) at theNational Institute of Mental Health (NIMH) and its associated datarepository, the National Database for Autism Research (NDAR) (Hallet al., 2012). It should be noted that SchizConnect is a virtual databasethat can mediate data sources that make ongoing updates to their re-positories, and SchizConnect is focused on schizophrenia and relateddisorders. Administratively, the SchizConnect web portal is hosted atNorthwestern University, the SchizConnect mediator is hosted at theInformation Science Institute of the University of Southern California.For inclusion into SchizConnect, new data sources will be evaluatedon completeness, quality, and enthusiasm to collaborate. The focus ofthe initial expansion will be the quickest return for the research com-munity, so that larger, richer datasets that have been carefully collectedand checked for completeness and quality (e.g., already analyzed andpublished) would be of the highest priority. Data repositories basedon XNAT, or those are already in COINS can be more readily integrated,up to any new data models and new querying news. For valuabledatasets without a database platform, we can help develop an XNAT,HID, or COINS.

It is our hope that SchizConnect can facilitate the testing of new hy-potheses that are hitherto not possible, thus greatly promote discoveryrelated to themechanisms underlying schizophrenia.We aim to expandto repositories on psychosis and related disorders, thus facilitatinglarge-scale dimensional research. We hope that SchizConnect willbecome the prototype for the study of psychiatric disorders, serving asamodel for broader efforts for the integration and sharing of biomedicalinformation across the greater scientific community.

Acknowledgment

This work was supported in part by NIH grants 1U01 MH097435,1R01 MH084803, P50 MH071616, R01 MH056584, U24 RR025736-01,U24-RR021992, U24GM10420, P20 GM103472.

References

Addington, J., Cadenhead, K.S., Cornblatt, B.A., Mathalon, D.H., McGlashan, T.H., Perkins,D.O., Seidman, L.J., Tsuang, M.T., Walker, E.F., Woods, S.W., Addington, J.A., Cannon,T.D., 2012. North American Prodrome Longitudinal Study (NAPLS 2): overview andrecruitment. Schizophr. Res. 142, 77–82.

Allen, E.A., Erhardt, E.B., Damaraju, E., Gruner, W., Segall, J.M., Silva, R.F., Havlicek, M.,Rachakonda, S., Fries, J., Kalyanam, R., Michael, A.M., Caprihan, A., Turner, J.A.,Eichele, T., Adelsheim, S., Bryan, A.D., Bustillo, J., Clark, V.P., Feldstein Ewing, S.W.,Filbey, F., Ford, C.C., Hutchison, K., Jung, R.E., Kiehl, K.A., Kodituwakku, P., Komesu,Y.M., Mayer, A.R., Pearlson, G.D., Phillips, J.P., Sadek, J.R., Stevens, M., Teuscher, U.,Thoma, R.J., Calhoun, V.D., 2011. A baseline for the multivariate comparison ofresting-state networks. Front. Syst. Neurosci. 5, 2.

Antonioletti, M., Atkinson, M., Baxter, R., Borley, A., Chue Hong, N.P., Collins, B., Hardman,N., Hume, A.C., Knox, A., Jackson, M., Krause, A., Laws, S., Magowan, J., Paton, N.W.,Pearson, D., Sugden, T., Watson, P., Westhead, M., 2005. The design and implementa-tion of Grid database services in OGSA-DAI. Concurr. Comput. Pract. Experience 17,357–376.

Arnold, S.J., Ivleva, E.I., Gopal, T.A., Reddy, A.P., Jeon-Slaughter, H., Sacco, C.B., Francis, A.N.,Tandon, N., Bidesi, A.S., Witte, B., Poudyal, G., Pearlson, G.D., Sweeney, J.A., Clementz,B.A., Keshavan, M.S., Tamminga, C.A., 2015. Hippocampal volume is reduced inschizophrenia and schizoaffective disorder but not in psychotic bipolar I disorderdemonstrated by both manual tracing and automated parcellation (FreeSurfer).Schizophr. Bull. 41, 233–249.

Ashish, N., Ambite, J.L., Muslea, M., Turner, J.A., 2010. Neuroscience data integrationthrough mediation: an (F)BIRN case study. Front. Neuroinform. 4, 118.

Bjaalie, J.G., Grillner, S., 2007. Global neuroinformatics: the International NeuroinformaticsCoordinating Facility. J. Neurosci. 27, 3613–3615.

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

Bockholt, H.J., Scully, M., Courtney, W., Rachakonda, S., Scott, A., Caprihan, A., Fries, J.,Kalyanam, R., Segall, J.M., de la Garza, R., Lane, S., Calhoun, V.D., 2010. Mining themind research network: a novel framework for exploring large scale, heterogeneoustranslational neuroscience research data sources. Front. Neuroinform. 3, 36.

Cannon, T.D., Chung, Y., He, G., Sun, D., Jacobson, A., van Erp, T.G., McEwen, S., Addington,J., Bearden, C.E., Cadenhead, K., Cornblatt, B., Mathalon, D.H., McGlashan, T., Perkins,D., Jeffries, C., Seidman, L.J., Tsuang, M., Walker, E., Woods, S.W., Heinssen, R., NorthAmerican Prodrome Longitudinal Study, C., 2015. Progressive reduction in corticalthickness as psychosis develops: a multisite longitudinal neuroimaging study ofyouth at elevated clinical risk. Biol. Psychiatry 77, 147–157.

Castro, E., Gupta, C.N., Martinez-Ramon, M., Calhoun, V.D., Arbabshirani, M.R., Turner, J.,2014. Identification of patterns of gray matter abnormalities in schizophrenia usingsource-based morphometry and bagging. Conf. Proc. IEEE Eng. Med. Biol. Soc. 2014,1513–1516.

Cetin, M.S., Christensen, F., Abbott, C.C., Stephen, J.M., Mayer, A.R., Canive, J.M., Bustillo,J.R., Pearlson, G.D., Calhoun, V.D., 2014. Thalamus and posterior temporal lobe showgreater inter-network connectivity at rest and across sensory paradigms in schizo-phrenia. NeuroImage 97, 117–126.

Chen, J., Calhoun, V.D., Pearlson, G.D., Ehrlich, S., Turner, J.A., Ho, B.C., Wassink, T.H.,Michael, A.M., Liu, J., 2012. Multifaceted genomic risk for brain function in schizo-phrenia. NeuroImage 61, 866–875.

Chen, J.Y., Calhoun, V.D., Pearlson, G.D., Perrone-Bizzozero, N., Sui, J., Turner, J.A., Bustillo,J.R., Ehrlich, S., Sponheim, S.R., Canive, J.M., Ho, B.C., Liu, J.Y., 2013. Guided explorationof genomic risk for gray matter abnormalities in schizophrenia using parallel inde-pendent component analysis with reference. NeuroImage 83, 384–396.

De Schutter, E., 2009. The International Neuroinformatics Coordinating Facility: evaluatingthe first years. Neuroinformatics 7, 161–163.

Ehrlich, S., Geisler, D., Yendiki, A., Panneck, P., Roessner, V., Calhoun, V.D., Magnotta, V.A.,Gollub, R.L., White, T., 2014. Associations of white matter integrity and cortical thick-ness in patients with schizophrenia and healthy controls. Schizophr. Bull. 40, 665–674.

Florescu, D., Levy, A.Y., Mendelzon, A., 1998. Database techniques for the world-wideweb: a survey. SIGMOD Rec. 27, 59–74.

Friedman, L., Stern, H., Brown, G.G., Mathalon, D.H., Turner, J., Glover, G.H., Gollub, R.L.,Lauriello, J., Lim, K.O., Cannon, T., Greve, D.N., Bockholt, H.J., Belger, A., Mueller, B.,Doty, M.J., He, J., Wells, W., Smyth, P., Pieper, S., Kim, S., Kubicki, M., Vangel, M.,Potkin, S.G., 2008. Test–retest and between-site reliability in a multicenter fMRIstudy. Hum. Brain Mapp. 29, 958–972.

Gadde, S., Aucoin, N., Grethe, J.S., Keator, D.B., Marcus, D.S., Pieper, S., Fbirn, M.B.-C., 2012.XCEDE: an extensible schema for biomedical data. Neuroinformatics 10, 19–32.

Gardner, D., Akil, H., Ascoli, G.A., Bowden, D.M., Bug, W., Donohue, D.E., Goldberg, D.H.,Grafstein, B., Grethe, J.S., Gupta, A., Halavi, M., Kennedy, D.N., Marenco, L., Martone,M.E., Miller, P.L., Muller, H.M., Robert, A., Shepherd, G.M., Sternberg, P.W., VanEssen, D.C., Williams, R.W., 2008. The neuroscience information framework: a dataand knowledge environment for neuroscience. Neuroinformatics 6, 149–160.

Glover, G.H., Mueller, B.A., Turner, J.A., van Erp, T.G., Liu, T.T., Greve, D.N., Voyvodic, J.T.,Rasmussen, J., Brown, G.G., Keator, D.B., Calhoun, V.D., Lee, H.J., Ford, J.M., Mathalon,D.H., Diaz, M., O'Leary, D.S., Gadde, S., Preda, A., Lim, K.O., Wible, C.G., Stern, H.S.,Belger, A., McCarthy, G., Ozyurt, B., Potkin, S.G., 2012. Function biomedical informaticsresearch network recommendations for prospectivemulticenter functionalMRI studies.J. Magn. Reson. Imaging 36, 39–54.

Gollub, R.L., Shoemaker, J.M., King, M.D., White, T., Ehrlich, S., Sponheim, S.R., Clark, V.P.,Turner, J.A., Mueller, B.A., Magnotta, V., O'Leary, D., Ho, B.C., Brauns, S., Manoach,D.S., Seidman, L., Bustillo, J.R., Lauriello, J., Bockholt, J., Lim, K.O., Rosen, B.R., Schulz,S.C., Calhoun, V.D., Andreasen, N.C., 2013. The MCIC collection: a shared repository ofmulti-modal,multi-site brain imagedata froma clinical investigation of schizophrenia.Neuroinformatics 11, 367–388.

Grant, A., Antonioletti, M., Hume, A.C., Krause, A., Dobrzelecki, B., Jackson, M.J., Parsons,M., Atkinson, M.P., Theocharopoulos, E., 2008. OGSA-DAI: middleware for data inte-gration: selected applications. IEEE Fourth International Conference on eScience,2008. eScience '08, p. 343-343.

Gupta, C.N., Calhoun, V.D., Rachakonda, S., Chen, J., Patel, V., Liu, J., Segall, J., Franke, B.,Zwiers, M.P., Arias-Vasquez, A., Buitelaar, J., Fisher, S.E., Fernandez, G., van Erp, T.G.,Potkin, S., Ford, J., Mathalon, D., McEwen, S., Lee, H.J., Mueller, B.A., Greve, D.N.,Andreassen, O., Agartz, I., Gollub, R.L., Sponheim, S.R., Ehrlich, S., Wang, L., Pearlson,G., Glahn, D.C., Sprooten, E., Mayer, A.R., Stephen, J., Jung, R.E., Canive, J., Bustillo, J.,Turner, J.A., 2014. Patterns of gray matter abnormalities in schizophrenia based onan international mega-analysis. Schizophr. Bull.

Halevy, A.Y., 2001. Answering queries using views: a survey. VLDB J. 10, 270–294.Hall, D., Huerta, M.F., McAuliffe, M.J., Farber, G.K., 2012. Sharing heterogeneous data: the

national database for autism research. Neuroinformatics 10, 331–339.Harris, P.A., Taylor, R., Thielke, R., Payne, J., Gonzalez, N., Conde, J.G., 2009. Research electron-

ic data capture (REDCap)—a metadata-driven methodology and workflow process forproviding translational research informatics support. J. Biomed. Inform. 42, 377–381.

Hass, J., Walton, E., Kirsten, H., Turner, J., Wolthusen, R., Roessner, V., Sponheim, S.R., Holt,D., Gollub, R., Calhoun, V.D., Ehrlich, S., 2015a. Complexin2 modulates working mem-ory-related neural activity in patients with schizophrenia. Eur Arch Psychiatry ClinNeurosci 265, 137–145.

Hass, J., Walton, E., Wright, C., Beyer, A., Scholz, M., Turner, J., Liu, J., Smolka, M.N.,Roessner, V., Sponheim, S.R., Gollub, R.L., Calhoun, V.D., Ehrlich, S., 2015b. Associationsbetween DNA methylation and schizophrenia-related intermediate phenotypes - agene set enrichment analysis. ProgNeuropsychopharmacol Biol Psychiatry 59, 31–39.

Helmer, K.G., Ambite, J.L., Ames, J., Ananthakrishnan, R., Burns, G., Chervenak, A.L., Foster,I., Liming, L., Keator, D., Macciardi, F., Madduri, R., Navarro, J.P., Potkin, S., Rosen, B.,Ruffins, S., Schuler, R., Turner, J.A., Toga, A., Williams, C., Kesselman, C., BiomedicalInformatics Research, N., 2011a. Enabling collaborative research using the BiomedicalInformatics Research Network (BIRN). J Am Med Inform Assoc 18, 416–422.

aging databases on schizophrenia and related disorders for large-scale015.06.065

Page 13: SchizConnect: Mediating neuroimaging databases on

13L. Wang et al. / NeuroImage xxx (2015) xxx–xxx

Helmer, K.G., Ambite, J.L., Ames, J., Ananthakrishnan, R., Burns, G., Chervenak, A.L., Foster,I., Liming, L., Keator, D., Macciardi, F., Madduri, R., Navarro, J.P., Potkin, S., Rosen, B.,Ruffins, S., Schuler, R., Turner, J.A., Toga, A., Williams, C., Kesselman, C., BiomedicalInformatics Research, N., 2011b. Enabling collaborative research using the BiomedicalInformatics Research Network (BIRN). J. Am. Med. Inform. Assoc. 18, 416–422.

Keator, D.B., Grethe, J.S., Marcus, D., Ozyurt, B., Gadde, S., Murphy, S., Pieper, S., Greve, D.,Notestine, R., Bockholt, H.J., Papadopoulos, P., 2008. A national human neuroimagingcollaboratory enabled by the Biomedical Informatics Research Network (BIRN). IEEETrans. Inf. Technol. Biomed. 12, 162–172.

Keator, D.B., Wei, D., Gadde, S., Bockholt, J., Grethe, J.S., Marcus, D., Aucoin, N., Ozyurt, I.B.,2009. Derived data storage and exchange workflow for large-scale neuroimaginganalyses on the BIRN grid. Front. Neuroinform. 3, 30.

Keshavan, M.S., Morris, D.W., Sweeney, J.A., Pearlson, G., Thaker, G., Seidman, L.J., Eack,S.M., Tamminga, C., 2011. A dimensional approach to the psychosis spectrum be-tween bipolar disorder and schizophrenia: the Schizo-Bipolar Scale. Schizophr. Res.133, 250–254.

Kim, D.I., Manoach, D.S., Mathalon, D.H., Turner, J.A., Mannell, M., Brown, G.G., Ford, J.M.,Gollub, R.L., White, T., Wible, C., Belger, A., Bockholt, H.J., Clark, V.P., Lauriello, J.,O'Leary, D., Mueller, B.A., Lim, K.O., Andreasen, N., Potkin, S.G., Calhoun, V.D., 2009.Dysregulation of working memory and default-mode networks in schizophreniausing independent component analysis, an FBIRN and MCIC study. Hum. BrainMapp. 30, 3795–3811.

Kim, M.A., Tura, E., Potkin, S.G., Fallon, J.H., Manoach, D.S., Calhoun, V.D., Turner, J.A., 2010.Working memory circuitry in schizophrenia shows widespread cortical inefficiencyand compensation. Schizophr. Res. 117, 42–51.

Knoblock, C.A., Szekely, P., Jos, #233, Ambite, L., Goel, A., Gupta, S., Lerman, K., Muslea, M.,Taheriyan,M., Mallick, P., 2012. Semi-automaticallymapping structured sources into thesemanticweb. Proceedings of the 9th international conferenceonTheSemanticWeb: re-search and applications. Springer-Verlag, Heraklion, Crete, Greece, pp. 375–390.

Larson, S.D., Martone, M.E., 2013. NeuroLex.org: an online framework for neuroscienceknowledge. Front. Neuroinform. 7, 18.

Lenzerini, M., 2002. Data integration: a theoretical perspective. Proceedings of ACM Sym-posium on Principles of Database Systems, Madison, Wisconsin.

Lynden, S., Pahlevi, S.M., Kojima, I., 2008. Service-based data integration using OGSA-DQPand OGSA-WebDB. 2008 9th IEEE/ACM International Conference on Grid Computing,pp. 160–167.

Lynden, S., Mukherjee, A., Hume, A.C., Fernandes, A.A.A., Paton, N.W., Sakellariou, R.,Watson, P., 2009. The design and implementation of OGSA-DQP: a service-based dis-tributed query processor. Futur. Gener. Comput. Syst. 25, 224–236.

Marcus, D.S., Olsen, T.R., Ramaratnam, M., Buckner, R.L., 2007. The Extensible Neuroimag-ing Archive Toolkit: an informatics platform for managing, exploring, and sharingneuroimaging data. Neuroinformatics 5, 11–34.

Mathew, I., Gardin, T.M., Tandon, N., Eack, S., Francis, A.N., Seidman, L.J., Clementz, B.,Pearlson, G.D., Sweeney, J.A., Tamminga, C.A., Keshavan, M.S., 2014. Medial temporallobe structures and hippocampal subfields in psychotic disorders: findings from theBipolar-Schizophrenia Network on Intermediate Phenotypes (B-SNIP) study. JAMAPsychiatry 71, 769–777.

Nuechterlein, K.H., Green, M.F., Kern, R.S., Baade, L.E., Barch, D.M., Cohen, J.D., Essock, S.,Fenton, W.S., Frese III, F.J., Gold, J.M., Goldberg, T., Heaton, R.K., Keefe, R.S., Kraemer,H., Mesholam-Gately, R., Seidman, L.J., Stover, E., Weinberger, D.R., Young, A.S.,Zalcman, S., Marder, S.R., 2008. The MATRICS Consensus Cognitive Battery, part 1:test selection, reliability, and validity. Am. J. Psychiatry 165, 203–213.

Ozyurt, I.B., Keator, D.B., Wei, D., Fennema-Notestine, C., Pease, K.R., Bockholt, J., Grethe,J.S., 2010. Federated web-accessible clinical data management within an extensibleneuroimaging database. Neuroinformatics 8, 231–249.

Potkin, S.G., Ford, J.M., 2009. Widespread cortical dysfunction in schizophrenia: the FBIRNimaging consortium. Schizophr. Bull. 35, 15–18.

Potkin, S.G., Turner, J.A., Brown, G.G., McCarthy, G., Greve, D.N., Glover, G.H., Manoach,D.S., Belger, A., Diaz, M., Wible, C.G., Ford, J.M., Mathalon, D.H., Gollub, R., Lauriello, J.,O'Leary, D., van Erp, T.G., Toga, A.W., Preda, A., Lim, K.O., Fbirn, 2009.Workingmemoryand DLPFC inefficiency in schizophrenia: the FBIRN study. Schizophr. Bull. 35, 19–31.

Scott, A., Courtney, W., Wood, D., de la Garza, R., Lane, S., King, M., Wang, R., Roberts, J.,Turner, J.A., Calhoun, V.D., 2011. COINS: an innovative informatics and neuroimagingtool suite built for large heterogeneous datasets. Front. Neuroinform. 5, 33.

Thaker, G., 2008. Psychosis endophenotypes in schizophrenia and bipolar disorder.Schizophr. Bull. 34, 720–721.

Thompson, P.M., Stein, J.L., Medland, S.E., Hibar, D.P., Vasquez, A.A., Renteria, M.E., Toro, R.,Jahanshad, N., Schumann, G., Franke, B., Wright, M.J., Martin, N.G., Agartz, I., Alda, M.,Alhusaini, S., Almasy, L., Almeida, J., Alpert, K., Andreasen, N.C., Andreassen, O.A.,Apostolova, L.G., Appel, K., Armstrong, N.J., Aribisala, B., Bastin, M.E., Bauer, M.,Bearden, C.E., Bergmann, O., Binder, E.B., Blangero, J., Bockholt, H.J., Boen, E., Bois, C.,

Please cite this article as:Wang, L., et al., SchizConnect: Mediating neuroimintegration, NeuroImage (2015), http://dx.doi.org/10.1016/j.neuroimage.2

Boomsma, D.I., Booth, T., Bowman, I.J., Bralten, J., Brouwer, R.M., Brunner, H.G.,Brohawn, D.G., Buckner, R.L., Buitelaar, J., Bulayeva, K., Bustillo, J.R., Calhoun, V.D.,Cannon, D.M., Cantor, R.M., Carless, M.A., Caseras, X., Cavalleri, G.L., Chakravarty,M.M., Chang, K.D., Ching, C.R., Christoforou, A., Cichon, S., Clark, V.P., Conrod, P.,Coppola, G., Crespo-Facorro, B., Curran, J.E., Czisch, M., Deary, I.J., de Geus, E.J., denBraber, A., Delvecchio, G., Depondt, C., de Haan, L., de Zubicaray, G.I., Dima, D.,Dimitrova, R., Djurovic, S., Dong, H., Donohoe, G., Duggirala, R., Dyer, T.D., Ehrlich,S., Ekman, C.J., Elvsashagen, T., Emsell, L., Erk, S., Espeseth, T., Fagerness, J., Fears, S.,Fedko, I., Fernandez, G., Fisher, S.E., Foroud, T., Fox, P.T., Francks, C., Frangou, S.,Frey, E.M., Frodl, T., Frouin, V., Garavan, H., Giddaluru, S., Glahn, D.C., Godlewska, B.,Goldstein, R.Z., Gollub, R.L., Grabe, H.J., Grimm, O., Gruber, O., Guadalupe, T., Gur,R.E., Gur, R.C., Goring, H.H., Hagenaars, S., Hajek, T., Hall, G.B., Hall, J., Hardy, J.,Hartman, C.A., Hass, J., Hatton, S.N., Haukvik, U.K., Hegenscheid, K., Heinz, A., Hickie,I.B., Ho, B.C., Hoehn, D., Hoekstra, P.J., Hollinshead, M., Holmes, A.J., Homuth, G.,Hoogman, M., Hong, L.E., Hosten, N., Hottenga, J.J., Hulshoff Pol, H.E., Hwang, K.S.,Jack Jr., C.R., Jenkinson, M., Johnston, C., Jonsson, E.G., Kahn, R.S., Kasperaviciute, D.,Kelly, S., Kim, S., Kochunov, P., Koenders, L., Kramer, B., Kwok, J.B., Lagopoulos, J.,Laje, G., Landen, M., Landman, B.A., Lauriello, J., Lawrie, S.M., Lee, P.H., Le Hellard, S.,Lemaitre, H., Leonardo, C.D., Li, C.S., Liberg, B., Liewald, D.C., Liu, X., Lopez, L.M.,Loth, E., Lourdusamy, A., Luciano, M., Macciardi, F., Machielsen, M.W., Macqueen,G.M., Malt, U.F., Mandl, R., Manoach, D.S., Martinot, J.L., Matarin, M., Mather, K.A.,Mattheisen, M., Mattingsdal, M., Meyer-Lindenberg, A., McDonald, C., McIntosh,A.M., McMahon, F.J., McMahon, K.L., Meisenzahl, E., Melle, I., Milaneschi, Y.,Mohnke, S., Montgomery, G.W., Morris, D.W., Moses, E.K., Mueller, B.A., MunozManiega, S., Muhleisen, T.W., Muller-Myhsok, B., Mwangi, B., Nauck, M., Nho, K.,Nichols, T.E., Nilsson, L.G., Nugent, A.C., Nyberg, L., Olvera, R.L., Oosterlaan, J.,Ophoff, R.A., Pandolfo, M., Papalampropoulou-Tsiridou, M., Papmeyer, M., Paus, T.,Pausova, Z., Pearlson, G.D., Penninx, B.W., Peterson, C.P., Pfennig, A., Phillips, M.,Pike, G.B., Poline, J.B., Potkin, S.G., Putz, B., Ramasamy, A., Rasmussen, J., Rietschel,M., Rijpkema, M., Risacher, S.L., Roffman, J.L., Roiz-Santianez, R., Romanczuk-Seiferth, N., Rose, E.J., Royle, N.A., Rujescu, D., Ryten, M., Sachdev, P.S., Salami, A.,Satterthwaite, T.D., Savitz, J., Saykin, A.J., Scanlon, C., Schmaal, L., Schnack, H.G.,Schork, A.J., Schulz, S.C., Schur, R., Seidman, L., Shen, L., Shoemaker, J.M., Simmons,A., Sisodiya, S.M., Smith, C., Smoller, J.W., Soares, J.C., Sponheim, S.R., Sprooten, E.,Starr, J.M., Steen, V.M., Strakowski, S., Strike, L., Sussmann, J., Samann, P.G., Teumer,A., Toga, A.W., Tordesillas-Gutierrez, D., Trabzuni, D., Trost, S., Turner, J., den Heuvel,Van, Alzheimer's Disease Neuroimaging Initiative, E.C.I.C.S.Y.S.G., 2014. The ENIGMAConsortium: large-scale collaborative analyses of neuroimaging and genetic data.Brain Imaging Behav 8, 153–182.

Turner, J.A., 2014. The rise of large-scale imaging studies in psychiatry. GigaScience 3.van Erp, T.G., Hibar, D.P., Rasmussen, J.M., Glahn, D.C., Pearlson, G.D., Andreassen, O.A.,

Agartz, I., Westlye, L.T., Haukvik, U.K., Dale, A.M., Melle, I., Hartberg, C.B., Gruber, O.,Kraemer, B., Zilles, D., Donohoe, G., Kelly, S., McDonald, C., Morris, D.W., Cannon,D.M., Corvin, A., Machielsen, M.W., Koenders, L., de Haan, L., Veltman, D.J.,Satterthwaite, T.D., Wolf, D.H., Gur, R.C., Gur, R.E., Potkin, S.G., Mathalon, D.H.,Mueller, B.A., Preda, A., Macciardi, F., Ehrlich, S., Walton, E., Hass, J., Calhoun, V.D.,Bockholt, H.J., Sponheim, S.R., Shoemaker, J.M., van Haren, N.E., Pol, H.E., Ophoff,R.A., Kahn, R.S., Roiz-Santianez, R., Crespo-Facorro, B., Wang, L., Alpert, K.I., Jonsson,E.G., Dimitrova, R., Bois, C., Whalley, H.C., McIntosh, A.M., Lawrie, S.M., Hashimoto,R., Thompson, P.M., Turner, J.A., 2015. Subcortical brain volume abnormalities in2028 individuals with schizophrenia and 2540 healthy controls via the ENIGMA con-sortium. Mol Psychiatry (in press).

Ullman, J.D., 1997. Information integration using logical views. Proceedings of the SixthInternational Conference on Database Theory, Delphi, Greece, pp. 19–40.

Wang, L., Kogan, A., Cobia, D., Alpert, K., Kolasny, A., Miller, M.I., Marcus, D., 2013. North-western University schizophrenia data and software tool (NUSDAST). Front.Neuroinform. 7, 25.

Wang, L., Alpert, K.I., Calhoun, V., Keator, D., King, M., Kogan, A., Landis, D., Tallis, M.,Potkin, S.G., Turner, J.A., Ambite, J.L., 2014. SchizConnect: large-scale schizophrenianeuroimaging data integration and sharing. Annual Meeting of the American Collegeof Neuropsychopharmacology (ACNP), Phoenix, Arizona.

Wiederhold, G., 1992. Mediators in the architecture of future information systems. IEEEComput. 25, 38–49.

Wood, D., King, M., Landis, D., Courtney, W., Wang, R., Kelly, R., Turner, J.A., Calhoun, V.D.,2014. Harnessingmodern web application technology to create intuitive and efficientdata visualization and sharing tools. Front. Neuroinform. 8, 71.

Wright, C., Calhoun, V.D., Ehrlich, S., Wang, L., Turner, J.A., Bizzozero, N.I., 2015. Meta geneset enrichment analyses link miR-137-regulated pathways with schizophrenia risk.Front Genet 6, 147.

aging databases on schizophrenia and related disorders for large-scale015.06.065