Researcher hub: Data available for researchers

Page Content

MVP data available for research

Through centralized data collection, cleaning, and curation, MVP has a wealth of health records, self-reported surveys, and genetic data available for research, with the generation of other omics data underway. MVP researchers contribute to the curation of phenotypes, and all MVP phenotype definitions are stored in the Centralized Interactive Phenomics Resource (CIPHER), a publicly accessible phenotype knowledgebase. Note: CIPHER does not contain patient-level data. CIPHER stores algorithms (which are instructions or “recipes”) for using MVP data to define health conditions.

Veterans discussing brochure

Page Content

Applying for access to MVP data

  • MVP genomic and phenotypic data is available to VA researchers through VA-funded research projects and select non-VA federal funding.
  • While opportunities for accessing MVP data are evolving, access is currently limited to VA-affiliated researchers.
  • VA-affiliated researchers can submit proposals in response to RFAs from our ORD services: RFAs and Program Announcements (va.gov). VA researchers can also apply for select types of non-VA federal funding.

Exploring MVP data

The following resources allow researchers to explore MVP data while planning and developing research proposals. Some resources may only be accessible to VA users.

  • The VA MVP Data Explorer tool enables researchers to query data based on clinical and other data characteristics to build rough cohorts, estimate sample sizes, and perform power analyses.
    • This tool is accessible to VA users with a NT account to help explore MVP data while planning and developing research proposals.

Page Content

MVP analytics environments and tools

MVP provides the following centralized analytics environments and tools to support researchers in their studies. Researchers can also bring approved tools into the MVP analytics environment. The most commonly used analysis software and programming languages are available in the computational environments and updated regularly. New tools and software can be added upon request and approval.

Analytical environments

Genomic Information System for Integrative Science (GenISIS)

  • The Genomic Information System for Integrative Science (GenISIS) is a high-performance computing cluster (HPC) that approved MVP researchers access to analyze MVP genetic data.
  • In addition to 2,354 cores for analysis, it contains >6.3 PB of storage. Access from GenISIS to the VA enterprise cloud (VAEC) is currently being tested and will be available for MVP research in the future.
  • Access to this analytical environment is only available to VA system users.