Google BigQuery
To support more complex queries and advanced informatics workflows that use Google Cloud services, the Open Targets Platform data is also available as a Google Cloud public dataset via our Google BigQuery instance — open-targets-prod.
What is Google BigQuery?
Google BigQuery is a data warehouse that enables researchers to run super-fast, asynchronous SQL queries using Google's cloud infrastructure. After running your query, you can either export into various formats or copy into a Google Cloud bucket for further downstream analyses.
Open Targets Platform data is publicly accessible as a Google Cloud public dataset. Users only pay for the queries they perform on the data, and through this program, the first 1 TB per month is free.
BigQuery access points
Open Targets has uploaded all of our data to Google BigQuery. You can run queries via:
For more information on BiqQuery, please review the BigQuery documentation.
Example BigQuery SQL queries
Below is a sample query that uses our associationByOverallDirect
dataset to return a list of targets associated with psoriasis (EFO_0000676) and the overall association score.
Similarly, you can use our molecule
dataset and pass a list of drug trade names to find relevant information:
Tutorials and how-to guides
For more information on how to use BigQuery to access Platform data and example queries based on actual use cases and research questions, check out the Open Targets Community and our Google Cloud dataset homepage.
Last updated