Knowledge Graphs in Drug Discovery

We are pioneering new ways to transform early-stage drug discovery using knowledge graphs and data-driven approaches. We developed Chem2Bio2RDF, the first large scale  linked public data graph for preclinical drug discovery; novel link prediction and data mining algorithms for finding hidden insights in large heterogeneous data graphs; and in our 2012 Drug Discovery Today paper laid out a strategy for using linked data and graph analytics to expand beyond the current single-target drug discovery model.  Current projects include researching knowledge graphs that encode computable networks for multi-mechanism complex diseases, integrating patient medical records with molecular data to help identify potential targeted therapies, and developing new ways to apply machine learning on top of  heterogeneous linked data graphs. We are thankful to NIH NCATS, Indiana CTSI, the OpenPHACTS foundation, Eli Lilly, and Pfizer for funding of this work. Applications in this area are being commercialized in our company Data2Discovery Inc.

SELECT papers

These are a selection of our papers that we think are a good starting point, with links to the PDFs of the articles. For a full list of publications relating to knowledge graphs, see David’s Google Scholar page. Please contact David if you have trouble accessing any papers of interest.