Data for assessing tractability with small molecule, antibody, and other clinical modalities
To support target prioritisation, the Open Targets Platform includes tractability data that identifies key details, including if there is a binding site suitable for small molecule binding, an accessible epitope for antibody based therapy, relevant data for using Proteolysis Targeting Chimeras (PROTACs), or a compound in clinical trials with a modality other than small molecule or antibody.
The tractability data can assist in target prioritisation by identifying potential drug targets suitable for discovery pipelines and therapeutic modalities that are most likely to succeed. It also supports further investigation of targets for which there are no ligands or experimental structures or those targets outside a "druggable" target family but with strong genetic associations.
Our target tractability is based on a modified version of Approaches to target tractability assessment – a practical perspective and The PROTACtable genome and has workflows that generate tractability assessments for small molecule (SM), antibody (AB), Proteolysis Targeting Chimeras (PR), and other clinical (OC) modalities.
The tractability assessments displayed on the Platform's target profile pages is the result of an open-source computational pipeline that performs in silico tractability assessments with small molecule, antibody, PROTAC, and other clinical modality workflows.
Data sources used in the pipeline include UniProt, HPA, PDBe, DrugEBIlity, ChEMBL, Pfam, InterPro, Complex Portal, DrugBank, Gene Ontology, and BioModels.
Assessments common to all modalities, ingested from ChEMBL, are:
- Approved Drug: the target has clinical precedence with Phase IV drugs;
- Advanced Clinical: the target has clinical precedence with Phase II or III drugs;
- Phase 1 Clinical: the target has clinical precedence with Phase I drugs.
We also include additional assessments specific to each modality.
- Structure with Ligand: Target has been co-crystallised with a small molecule (source: Protein Data Bank)
- High-Quality Ligand: Target with ligand(s) (PFI ≤ 7, SMART hits ≤ 2, scaffolds ≥ 2) (source: ChEMBL)
- UniProt loc high conf: High confidence that the subcellular location of the target is either plasma membrane, extracellular region/matrix, or secretion (source: Uniprot)
- GO CC high conf: High confidence that the subcellular location of the target is either plasma membrane, extracellular region/matrix, or secretion (source: Gene Ontology)
- UniProt loc med conf: Medium confidence that the subcellular location of the target is either plasma membrane, extracellular region/matrix, or secretion (source: Uniprot)
- UniProt SigP or TMHMM: Target has a predicted signal peptide or trans-membrane regions, and not destined to organelles (source: Uniprot SigP, TMHMM)
- GO CC med conf: Medium confidence that the subcellular location of the target is either plasma membrane, extracellular region/matrix, or secretion (source: Gene Ontology)
- Human Protein Atlas loc: High confidence that the target is located in the Plasma membrane (source: HPA)
- Literature: Target mentioned in a set of manually curated PROTAC-related publications (source: Europe PMC)
- UniProt Ubiquitination: Target tagged with the Uniprot keyword “Ubl conjugation [KW-0832]”, which indicates that the protein has a ubiquitination site, based on evidence from the literature (source: Uniprot)
- Database Ubiquitination: Target has reported ubiquitination sites in PhosphoSitePlus, mUbiSiDa (2013), or Kim et al. 2011
- Small Molecule Binder: Target has a reported small-molecule ligand in ChEMBL with a measured activity of at least 10 μM in a target-based assay (source: ChEMBL)
The data is available for download as part of the target core annotation from our data downloads page.
Alternatively, you can also download the input TSV file with the per-target assessments via FTP. To access this file, visit our FTP site and click on the release version (e.g. 21.04), followed by "input", followed by "annotation-files". You can then download the
tractability_bucketsTSV file. Descriptions of the columns found in the input file can be found on the pipeline README.md file.
Brown KK, Hann MM, Lakdawala AS, Santos R, Thomas PJ, Todd K. Approaches to target tractability assessment - a practical perspective. Medchemcomm. 2018 Feb 14;9(4):606-613. doi: 10.1039/c7md00633k. PMID: 30108951; PMCID: PMC6072525.
Schneider M, Radoux CJ, Hercules A, Ochoa D, Dunham I, Zalmas LP, Hessler G, Ruf S, Shanmugasundaram V, Hann MM, Thomas PJ, Queisser MA, Benowitz AB, Brown K, Leach AR. The PROTACtable genome. Nat Rev Drug Discov. 2021 Jul 20. doi: 10.1038/s41573-021-00245-x. PMID: 34285415.