Advancing Disease Treatment through Integrative Data Science

An approach to identify candidate compounds for treating Parkinson’s Disease using techniques rooted in network science combined with modern graph query languages to identify compounds (and genes) that are maximally associated with PD in the network

Through the work of the Integrative Data Science Lab (IDSL), we are pioneering new ways to rapidly improve disease treatment and drug discovery using integrative knowledge graphs and advanced machine learning approaches to profile and predict the biological effects of potential new drugs. We developed Chem2Bio2RDF, the first large scale linked public data graph for preclinical drug discovery; novel link prediction and data mining algorithms for finding hidden insights in large heterogeneous data graphs; and in our 2012 Drug Discovery Today paper laid out a strategy for using linked data and graph analytics to expand beyond the current single-target drug discovery model.

Current projects include researching knowledge graphs that encode computable networks for multi-mechanism complex diseases, including the PRIDE project (Parkinson’s Research through Integrative Data Experiments).

We are thankful to NIH NCATS, Indiana CTSI, the OpenPHACTS foundation, Eli Lilly, and Pfizer for funding of this work. Applications in this area are being commercialized in our company Data2Discovery Inc.

SELECT papers

These are a selection of our papers that we think are a good starting point, with links to the PDFs of the articles. For a full list of publications relating to knowledge graphs, see David’s Google Scholar page. Please contact David if you have trouble accessing any papers of interest.

RELATED BLOG POSTS

Featured

Screen Shot 2019-02-19 at 2.36.26 PM.png

Feb 19, 2019

How knowledge graphs will transform drug discovery

Feb 19, 2019

What if we could bring all the knowledge, data, insight, and prior decision-making of drug discovery together and use it to accelerate the discovery of new drugs? What if we could encode the millions of known relationships between potential new (or old) drugs, protein targets, genomics, biological processes, and disease mechanisms, and then use all this together to get new insights into disease and treatments?

Feb 19, 2019

Screen Shot 2018-05-04 at 1.28.51 PM.png

May 1, 2018

Transforming Pharmaceutical and Healthcare Companies into Data Companies

May 1, 2018

This is a five-minute flash talk on transforming pharmaceutical and healthcare companies into data companies.

May 1, 2018

Screen Shot 2018-05-04 at 1.35.21 PM.png

May 1, 2018

Big Data in Drug Discovery

May 1, 2018

This is a talk given at the Pervasive Technology Institute at Indiana University on Big Data in Drug Discovery

May 1, 2018