Biotechnology Laboratory Science & Tools Pharmaceuticals

Alliance for Genomic Discovery expands to 312,000 genomes with new proteomics dataset

Laboratory handling biological samples for genomic research in Alliance for Genomic Discovery dataset studies
  • Regeneron Genetics Center joins the Alliance for Genomic Discovery

  • Dataset expands to more than 312,000 whole genomes

  • New 50,000-sample multiomic dataset will combine genomics and proteomics

  • Large-scale datasets support target discovery and drug development

The Alliance for Genomic Discovery has expanded its dataset to more than 312,000 whole genomes following the addition of Regeneron Genetics Center as a member. The initiative also plans to add paired proteomic data to support multiomic drug discovery research.

Regeneron Genetics Center joins Alliance for Genomic Discovery

The Alliance, established by Illumina and Nashville Biosciences, brings together pharmaceutical and biotechnology organisations to analyse large-scale genomic datasets linked to clinical information.

With the addition of Regeneron Genetics Center, the Alliance dataset now includes more than 312,000 whole genomes paired with longitudinal clinical data from electronic health records.

Large genomic dataset supports disease research

The dataset combines whole genome sequencing with detailed phenotypic information derived from clinical records. This approach allows researchers to identify disease cohorts and study the genetic basis of complex conditions.

Large population-scale datasets are increasingly used to support target discovery, drug development and clinical trial design.

Proteomics expansion introduces multiomic data

The Alliance also announced plans to generate a dataset of 50,000 whole genomes paired with proteomic data. The expansion will use Illumina Protein Prep technology to integrate genomic and protein-level information.

GSK is among the first pharmaceutical companies participating in the multiomic expansion, alongside other Alliance members.

Multiomic datasets support drug discovery

Integrating genomic and proteomic data can provide deeper insight into biological pathways and disease mechanisms. Researchers can use these datasets to investigate how genetic variants influence protein expression and disease risk.

The expanded dataset also includes individuals from diverse genetic backgrounds, enabling the study of population-specific genetic variants and their biological effects.

Further information about the initiative is available on the Illumina website.