Target Prioritisation view
Learn about the new target prioritisation view released in 23.12
The new target prioritisation page can be accessed by clicking onto the Target prioritisation factors tab from the target-disease Associations on the Fly page when searching for targets associated with a disease or phenotype.
The view focuses on displaying target-specific properties in a disease agnostic way, which have been aggregated into four main sections - Precedence, Tractability, Doability, Safety - and individually scored by the Open Targets team.
A "traffic light" system has been designed to visually inform on target prioritisation, with the aim to facilitate target recommendations. Using a colour scale, green indicates potentially positive attributes and red indicates potentially negative attributes, providing information to help users assess the targets for further prioritisation or deprioritisation, respectively.
Precedence section
Target in clinic
Definition: Gene is targeted by available drugs in any clinical phase for any indication.
Source of Data: Platform Known Drugs widget (ChEMBL)
Scoring: Maximum clinical trial phase the target has been reported for, independently of the disease. Phases range from 0 to IV (corresponding to values of 0, 0.25, 0.5, 0.75 and 1 in the tool scores).
Tractability section
Membrane Protein
Definition: Target is annotated to be located in the cell membrane.
Source of Data: Platform Subcellular location widget [HPA (Human Protein Atlas) and UniProt]
Scoring:
1 = Protein target is located (at least) in the cell or plasma membrane.
0 = Protein target is not located in the cell membrane but some location information is accessible.
NA = No location information available.
Secreted protein
Definition: Target is secreted or predicted to be secreted.
Source of Data: Platform Subcellular location widget [HPA (Human Protein Atlas) and UniProt]
Scoring:
1 = Protein target is (at least) secreted or predicted to be secreted.
0 = Not secreted but with location information.
NA = No location information available.
Note: When contradictions between HPA (Human Protein Atlas) and UniProt exist (i.e. target is secreted according to HPA but in membrane according to UniProt), the information from HPA is taken.
Ligand binder
Definition: Target binds at least one High-Quality Ligand according to ChEMBL tractability bucket.
Source of Data: Platform tractability widget (Open Targets tractability)
Scoring:
1 = Target has a high-quality ligand reported.
0 = Target does not have high-quality ligand reported.
NA = No information available.
Small molecule binder
Definition: Target has been co-crystallised with a small molecule, reported in the Protein Data Bank.
Source of Data: Platform tractability widget (Open Targets tractability)
Scoring:
1 = Target has a small molecule reported.
0 = Target does not have a small molecule reported.
NA = No information available.
Predicted pockets
Definition: Target has a DrugEBIlity score equal or above 0.7, which is predictive of harbouring a high-quality pocket.
Source of Data: Platform tractability widget (Open Targets tractability)
Scoring:
1 = Target contains a high-quality predicted pocket.
0 = Target does not have a high-quality predicted pocket.
NA= No information available.
Doability section
Models, tools and/or reagents that allow target assessment in preclinical settings to enable exploration of a given target
Mouse ortholog identity
Definition: Mouse orthologs maximum identity percentage. A mouse harbouring an ortholog for the target of interest could be useful for in vivo assaying.
Source of Data: Platform comparative genomics widget (Ensembl Compara)
Scoring:
From 0 to 1 are linearly scored those targets with at least one ortholog in mice harbouring at least 80% with the target.
1 = There is at least one gene in mice that contains a sequence with a 100% of identity with the target.
0 = There are no genes in mice containing a sequence with at least 80% of identity with the target.
NA = No ortholog information.
Note: Here we consider mouse orthologs and display the "query percentage identity" (percentage of the human target sequence that matches to the mouse gene) when there is an 80% identity or more. In the cases of targets with more than one ortholog, we take the one with the maximum query % ID.
Chemical probes
Definition: Target has high quality chemical probes.
Chemical probes are small molecules acting as chemical modulators, binding reversibly to the target.
Source of Data: Platform Chemical probes widget (Probes & Drugs)
Scoring:
1 = Target has high-quality chemical probes.
0 = Target does not have high-quality chemical probes.
NA = No information available.
Safety section
Genetic constraint
Definition: Genest that are important for human physiology are seen to be depleted of deleterious variants. The Genome Aggregation Database (gnomAD) has developed a continuous measurement of intolerance to loss of function (LoF) variants per gene, based on observed/expected LoF variant analysis. As recommended by gnomAD and implemented in the Open Targets platform, the rank of genes regarding their loss-of-function observed/expected upper bound fraction (LOEUF) metric is used (LOEUF score).
Source of Data: Platform genetic constraint widget (gnomAD)
Scoring:
A score from -1 to 1 is given to genes depending on their LOEUF metric rank, being -1 the least tolerant to LoF variation and 1 the most tolerant.
Mouse models
Definition: The international database Mouse Genome Informatics contains information about reported phenotypes when a gene is knocked-out in this animal model. These phenotypes are categorised in multiple phenotype classes, using an organ/system classification. We retrieve this information (available in our platform) and score the phenotypes classes regarding their severity (from 0 to -1). After aggregating all phenotypes with their scores according to the phenotype class they belong to, we use the harmonic sum to build a continuous score, which is normalised from 0 to -1.
Source of Data: Platform mouse phenotypes widget (Mouse Phenotypes, feeded from MGI, a reference database for mice knockouts)
Scoring:
Below 0 to -1 = When the target has been knocked-out in mice there were multiple and severe phenotypes reported, with a score higher than the first quartile.
0 = Either the target has non-severe phenotypes reported or is in the first quartile of the normalised score.
NA = No information available.
Note: Below you can find how we scored the mouse phenotype classes (-1 being the "most severe" and 0 "non relevant" phenotypes
MP:0005370
liver/biliary system phenotype
-1
MP:0005385
cardiovascular system phenotype
-1
MP:0010768
mortality/aging
-1
MP:0003631
nervous system phenotype
-0.75
MP:0005388
respiratory system phenotype
-0.75
MP:0005367
renal/urinary system phenotype
-0.75
MP:0005376
homeostasis/metabolism phenotype
-0.75
MP:0005386
behavior/neurological phenotype
-0.75
MP:0005381
digestive/alimentary phenotype
-0.5
MP:0005379
endocrine/exocrine gland phenotype
-0.5
MP:0005382
craniofacial phenotype
-0.5
MP:0005377
hearing/vestibular/ear phenotype
-0.5
MP:0005384
cellular phenotype
-0.5
MP:0005380
embryo phenotype
-0.5
MP:0005394
taste/olfaction phenotype
-0.5
MP:0002006
neoplasm
-0.5
MP:0005375
adipose tissue phenotype
-0.5
MP:0005389
reproductive system phenotype
-0.5
MP:0005397
hematopoietic system phenotype
-0.5
MP:0005387
immune system phenotype
-0.5
MP:0005391
vision/eye phenotype
-0.5
MP:0005390
skeleton phenotype
-0.5
MP:0005369
muscle phenotype
-0.25
MP:0001186
pigmentation phenotype
-0.25
MP:0005378
growth/size/body region phenotype
-0.25
MP:0005371
limbs/digits/tail phenotype
-0.25
MP:0010771
integument phenotype
-0.25
MP:0002873
normal phenotype
0
Gene essentiality
Definition: The second generation map of cancer dependencies (Pacini et al., 2024) increased the number of cancer cell lines analysed (930 CRISPR-Cas9 genome wide knock-out screenings, targeting almost 18,000 genes), spanning to 27 cancer types and curated patient genomic data, to identify cancer-type-specific and pan-cancer gene dependencies integrated with multi-omic markers.
Candidate anti-cancer therapeutic targets were characterised using a prioritisation criteria based on:
- Fitness Score. Strength of the effect on cellular fitness upon target depletion.
- Presence of dependency marker.
- Evidence linking the dependency and marker.
After applying a priority score based on approved drug targets, authors nominated 370 targets for 27 cancer types; 302 were cancer-type specific, while 196 where pan-cancer. This list of genes is the one used to label a target for gene essentiality.
Source of Data: A comprehensive clinically informed map of dependencies in cancer cells and framework for target prioritization. Pacini et al., 2024, Cancer Cell 42, 301–316. Supplementary Table 6. Gene essentiality widget (Cancer DepMap).
Scoring:
-1 = Target reported as essential.
0 = Target not reported as essential
NA = No information available
Known safety events
Definition: Target is associated with curated adverse events.
Source of Data: Safety liability data from Platform safety widget (Open Targets Safety) and Open Targets downstream analysis of toxicity datasets from PharmGKB.
Scoring:
-1 = The target has at least one adverse event.
NA = No information available.
Cancer driver gene
Definition: Target is classified as an oncogene and/or tumour suppressor gene.
Source of Data: Platform Cancer Hallmarks widget (COSMIC)
Scoring:
We use the attribute information from the cancer hallmarks, in the target profile. Here, targets considered as "cancer driver genes" are flagged as tumour suppressor, oncogene, or both
-1 = Target is catalogued as driver gene (tumour suppressor, oncogene or both).
NA = No information available.
Paralogues
Definition: Paralogue maximum identity percentage.
Source of Data: Platform comparative genomics widget (Ensembl Compara)
Scoring:
Below 0 to -1 are linearly scored those targets with at least one paralogue in human harbouring at least 60% of identity with the target.
0 = Those targets with paralogues harbouring less than 60% of identity.
NA = No information available about paralogues for that target.
Tissue specificity
Definition: HPA assessment on tissue-specific target expression.
Source of Data: Platform baseline expression widget (ExpressionAtlas, HPA and GTEx). We used the assessment for every target from the RNA expression data from the public version of Human Protein Atlas (proteinAtlasTissue)
Scoring:
Tissue enriched >=4 fold higher mRNA in a given tissue compared to any other
1
Group enriched >=4 fold higher average mRNA in 2-5 tissue compared to any other
0.75
Tissue enhanced >=4 fold higher mRNA level in a given tissue compared to average of all other tissues
0.5
Low tissue specificity
-1
Not detected
NA
Tissue distribution
Definition: HPA assessment on any detectable baseline expression for the target across tissues.
Source of Data: Platform baseline expression widget (Expression Atlas, HPA and GTEx). We used the assessment for every target from the RNA expression data from the public version of Human Protein Atlas.
Scoring:
Detected in single detected in a single tissue
1
Detected in some detected in more than one but less than 1/3 of tissues
0.5
Detected in many detected in at least 1/3 but not all tissues
0
Detected in all
-1
Not detected
NA
Last updated