DrugStore - An Overview

DrugStore is a database of drugs - both small molecules and biologicals. It is quite small, but complex, since drugs and the way they work are surprisingly complex. The basic content of DrugStore is three domains -

Chemistry., the 2-D structure of the drug, and in the case of biologicals the sequence. We also take care of the case where the dosed drug is a prodrug (a case where the dosed ingredient is essentially inactive, and a metabolic transformation is required to generate the active form). Given this data, it is then trivial to calculate a large number of compound associated properties (MW, clogP, etc.).

Biology, the mechanism of action and the molecular target of the drug. This is surprisingly complicated, for many drugs there are complex modes of action, or cases where the target assignment is still tentative, here we have a confidence score for the target assignment. We are also careful to only include targets that are responsible for the clinical efficacy of the drug, since many drugs bind to serum albumin, but you wouldn’t really want to include that as a drug target. Where possible, we also include the binding domain for the drug within its target, alongside PFAM domain, etc.

Medicine, what the drug is used for, and its properties, for example included are things like black-box warnings, withdrawn status, etc. We include a number of ontologies (well to use the word ontology here is a bit of a stretch) to describe the therapeutic application of a drug. We also differentiate between genuine therapeutic agents, imaging agents, and non-therapeutics - this has the number of therapeutic drugs appear smaller than sometimes thought.


 Sprinkled across the database are properties like the dosage size, dosage form (oral, topical, parenteral, etc), pharmacokinetic properties, trade names, various approval dates, full package inserts, etc., etc. There are many things we have been asked for in the past, like quantitative side-effects, biomarker data, pharmacogenetic data, adventitious pharmacology, detailed toxicology data, sales revenues, but there is no simple way to consistently harvest these data, and other resources (unfortunately often proprietary) cover some of these areas incredibly well. We also store some data within our curation schema that we cannot (currently) distribute due to copyright restrictions, however, these additional data would be available to interested collaborators.

We are developing a plan to integrate the data within DrugStore within some other public databases that address drugs and drug action, and to eventually tightly integrate the most interesting features into CandiStore, treating launched drugs as Phase IV agents.

Since there are so few drugs launched, keeping DrugStore up to date is quite straightforward, and we plan to post data for new drug approvals on the ChEMBLog.