LogoLogo
OT PlatformOT GeneticsCommunityBlog
  • Open Targets Platform
  • Getting started
  • Target
    • Tractability
    • Safety
    • Chemical probes & TEPs
    • Baseline expression
    • Molecular interactions
    • Core Gene Essentiality
    • Pharmacogenetics
  • Disease or Phenotype
    • Clinical signs and symptoms
  • πŸ†•Variant
  • πŸ†•Study
  • Drug
    • Clinical Precedence
    • Pharmacovigilance
    • Pharmacogenetics
  • πŸ†•Credible Set
  • Target–disease evidence
  • Target–disease associations
  • πŸ†•GWAS & functional genomics
    • Data sources
    • Fine-mapping
    • Colocalisation
    • Locus-to-Gene (L2G)
    • Gentropy
  • Bibliography
  • Web interface
    • Associations on the Fly
    • Target Prioritisation
    • Evidence pages
    • Entity profile pages
  • Data and code access
    • Download datasets
    • Google BigQuery
    • GraphQL API
    • πŸ†•Platform infrastructure
    • πŸ†•Data pipeline
  • πŸ†•FAQs
  • Release notes
  • Citation
  • Licence
    • Terms of use
  • Partner Preview Platform
Powered by GitBook
On this page
  • Overview
  • Study categories
  • 1. Genome-wide association studies (GWAS)
  • 2. molQTL studies
  • Study inclusion criteria
  • GWAS/molQTL credible sets

Was this helpful?

Export as PDF

Study

Overview

In the Open Targets Platform, a study is a qualifying genome-wide association study for binary or quantitative traits.

Studies might capture different types of associations, such as curated associations (top hits) from literature or post-processing GWAS summary statistics.

For each study meeting the inclusion criteria, harmonisation, fine-mapping, colocalisation, and Locus-to-Gene analysis are performed, resulting in a list of significant credible sets and their most likely associated genes. The resulting 95% credible sets can be visualised in all study pages. Within the same study, credible sets might result from different fine-mapping pipelines but still be presented in the same study with their respective provenance and confidence.

Studies without GWAS-significant associations

The study will still be presented in the Open Targets Platform if no significant associations are found.

Key study annotation

To provide a rich context to downstream interpretation of studies, and their associations, we capture a wide range annotations, whenever possible in a standardised way to enable interoperability.

For example:

  • The measured trait/phenotype is standardised using the Experimental Factor Ontology (EFO), as the background trait when applied

  • For molecular QTLs, the affected gene/protein is standardised to Ensembl gene identifiers

  • For molecular QTLs, the biological context is standardised to relevant ontology (e.g. tissue β€” UBERON or cell ontology β€” CL)

  • Depending on the granularity of the ingested study metadata, a rich cohort information is captured including sample sizes, ancestry composition and a list of named cohorts

  • Indicated if association data of a study was ingested as summary statistics, together with the corresponding quality control measurements performed on the summary statistics

  • Besides the Pubmed identifier, rich publication information is also captured including first author, publication year for easier data access

Study categories

Integrated studies can be split into two major groups that determine how their associations would be utilised in the target identification process.

1. Genome-wide association studies (GWAS)

A GWAS identifies associations between genetic variants and traits or phenotypes by analysing genome-wide variation data from a population. These studies cover binary or quantitative traits reported in the integrated sources.

Shared trait studies

GWAS studies sharing the same disease or phenotype (EFO) as the study are listed.

Sources: GWAS Catalog, FinnGen

2. molQTL studies

A molQTL study identifies genetic variation that can be significantly associated with changes at the molecular level. As such, associations of these studies provide invaluable help to put GWAS derived loci into context via colocalisation. The type of molecular trait measured defines the type of QTL:

  • eQTLs impact gene expression

  • pQTLs impact protein abundance

  • sQTLs have an effect on gene splicing

  • tuQTLs impact transcript usage

  • sceQTL impact gene expression at the single-cell level

All molQTL studies are annotated with the gene whose expression levels are regulated, referred as the affected gene.

On top of the effected gene, molQTL studies also capture the tissue/cell type in which the change in the molecular trait was measure providing further context for interpretation.

Each molQTL study captures the unique combination of the following annotations:

  • The publication authoring the study

  • The gene product measured in the study

  • The quantitative method (e.g. aptamer) used to measure the trait, when available

  • The cell type or tissue where the trait is measured, when available

  • The experimental conditions (e.g. interferon-stimulated macrophages)

Sources: eQTL Catalogue, UK Biobank Pharma Proteomics Project (UKB-PPP)

Study inclusion criteria

Studies must meet predefined criteria to ensure consistent representation, and increased reliability of associations for downstream analysis.

Each integrated study needs to meet the following criteria:

  • GWAS studies need to have a valid disease or phenotype identifier (EFO)

  • QTL studies must have an affected gene and tissue/cell-type valid identifier (biosample)

  • Valid study type (e.g gwas or a QTL type from a defined set)

  • Data licensed for commercial usage

  • When summary statistics are available, further criteria must be met:

    • Mean beta within expected range

    • Genomic control (GC) lambda value within the expected range

    • The PZ value within the expected range

    • Additional curation from Open Targets team ensuring study meet standards

GWAS/molQTL credible sets

The fine-mapping results for all 95% credible sets in the study are displayed with their most relevant metadata, as well as the top Locus-to-Gene assignment.

Because the same study might be processed through different fine-mapping pipelines, it's possible to observe credible sets obtained with different methods even within the same study. The credible set exclusion criteria describes the rules applied to avoid duplicated credible sets resulting from separate fine-mapping pipelines.

Source: Open Targets

PreviousVariantNextDrug

Last updated 2 months ago

Was this helpful?

πŸ†•