Method for predicting ADMET properties using Abbreviated Profile of Drugs (A-POD)
A novel numerical and graphical representation has been developed to better comprehend the physicochemical, biological and pharmacokinetic properties of drug-like compounds. Abbreviated profile of drug (A-POD) would help in considering and visualizing the various compound characteristics as one entity. The salient features of A-POD are: a unique way of representing the compound properties, computer friendly numerical string representation making comparison of any properties possible, graphical representation that gives a snapshot of properties and their relative changes, and its usefulness in qualitatively predicting the ADMET properties based on chemical properties alone. This simple yet powerful web-based tool is especially useful in comparing any two compounds at one time. A new therapeutic agent can be compared with the reference compound quickly and easily, by checking which properties are affected and finding out whether it possesses better drug-like properties. Keywords: Drug discovery, Pharmacokinetics, in-silico ADMET, Drug-like properties, ADMET prediction, Profile of drugs, A-POD
This divisional application claims the benefit of priority from U.S. patent provisional application No. 60/994,811, “Abbreviated Profile of Drugs (A-POD): A unique numerical and graphical representation for compound properties and its use in ADMET prediction”, filed on Sep. 22, 2007.
This divisional application is related to the non-provisional application Ser. No. 12/209,319, “A method for providing a numerical and graphical Abbreviated Profile of Drugs (A-POD)”, filed on Sep. 12, 2008.
BACKGROUND ARTBlake J F (2000). Chemoinformatics—predicting the physicochemical properties of ‘drug-like’ molecules. Curr Opi Biotech. 11: 104-107.
Ertl P et al. (2000). Fast calculation of molecular polar surface area as a sum of fragment based contributions and its application to the prediction of drug transport properties. Med Chem. 43: 3714-3717.
Ertl P et al. (2003). Web-based cheminformatics and molecular property prediction tools supporting drug design and development at Novartis. SAR and QSAR in Environmental Research. 14: 321-328.
Helson H E (1999). Structure diagram generation. Edited by Lipkowitz K B & Boyd D B. Rev Comput Chem. 313-398.
Hodgson J (2001). ADMET—turning chemicals into drugs. Nature Biotech. 19: 722-726.
Kubinyi K J (2002). Chemical similarity and biological activities. Baz Chem Soc. 13(6): 717-726.
Lipinski C A et al. (1997). Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 23: 3-25.
Lombardo F et al. (2003). In silico ADME prediction: Data, models, facts and myths. Mini Reviews in Medicinal Chemistry. 3: 861-875.
Pajouhesh H & Lenz G R (2005). Medicinal chemical properties of successful central nervous system drugs. NeuroRx. 2: 541-553.
Stenberg P et al. (2001). Experimental and computational screening models for the prediction of intestinal drug absorption. Med Chem. 44: 1927-1937.
Subramanian K (2005). truPK—Human pharmacokinetic models for quantitative ADME prediction. Expert Opin Drug Metab Toxicol. 1(3): 555-564.
Van de Waterbeemd H et al. (2001). Property-based design: optimization of drug absorption and pharmacokinetics. J Med Chem. 44: 1313-1333.
Weininger D (1988). SMILES, a chemical language and information system. Introduction to methodology and encoding rules. J Chem Inf Comput Sci. 28: 31-36.
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENTNot applicable.
REFERENCE TO A COMPUTER PROGRAM LISTINGThere are three computer programs that are involved for A-POD:
-
- 1. File name: Hiremath_A-POD_Program_Compute.txt This first program is executed from the computer in an internet browser. It accepts the text file containing the compound related information and then processes it to generate the A-POD for the compound which also predicts the pharmacokinetic properties.
- 2. File name: Hiremath_A-POD_Program_PredictADMET.txt This second program is run by the processor. It accepts the A-POD of a single compound computed from the first program and displays the results on the computer in an internet browser.
- 3. File name: Hiremath_A-POD_Program_ProfileDifference.txt This third program is run by the processor. It accepts the A-POD for the two compounds computed from the first program and displays the results on the computer in an internet browser.
In the early drug discovery process, one of the first steps is to find a reasonable and promising compound for a predefined target. This lead is then modified iteratively to optimize the biological activity. The compounds are synthesized and biologically evaluated for better activity. There are several physicochemical properties that need to the considered while designing or modifying compounds, so that these properties remain within the acceptable limits. Currently, these properties are computed and stored separately in a text file, spread sheet or database (Ertl et al., 2003). It is relatively easy for a chemist to comprehend several compounds along with their properties. However, this comprehension ability could be a challenge for a non-chemist. Hence, it is necessary to have a simple representation for the profile of the compounds, which need not be exact. Herein, a unique approach developed to aid in the actual design process of the compounds is described.
The Lipinski “rule of five”, which is widely cited, is useful in revealing the drug-like characteristics of compounds (Lipinski, et al., 1997). These are a set of four rules that the properties of a compound should satisfy to be drug-like. First, the molecular weight (MW) should be less than 500 daltons. The number of hydrogen-bond acceptors (HA) should be less than 10. The number of hydrogen-bond donors (HD) should be less than 5. Finally, the value of the octanol-water partition coefficient (LP) should be less than 5. These four rules have been mostly valid for several of the known drugs.
The polar surface (PS) area has been implicated as an important property for the drug-like compounds (Ertl, 2000; Blake, 2000). It is defined as the surface area (Å2) occupied by nitrogen and oxygen atoms and their attached polar hydrogens. It is critical in the compound-membrane interactions (Pajouhesh & Lenz, 2005). The optimal polar surface area is considered to be under 90-120 Å2.
In spite of the some observed exceptions to the Lipinski's rule, the property values of the vast majority (90%) of the orally active compounds are within their cut-off limits (Lipinski, et al., 1997). Hence, the Lipinski properties as well as the polar surface area have been considered here.
Pharmacokinetic properties such as absorption, distribution, metabolism, excretion and toxicity (ADMET) are important in order to determine the success of the compound for human therapeutic use. The ideal oral drug is one that is rapidly and completely absorbed from the alimentary canal, distributed specifically to its site of action in the body, metabolized in a way that does not instantly remove its activity, and eliminated in a suitable manner, without causing any harm (Hodgson, 2001). ADMET is a complex phenomenon. It is estimated that around half of all drugs in development fail to make it to the market because of poor pharmacokinetics (Hodgson, 2001).
The pharmacokinetic properties depend on the chemical properties of the molecule. Stenberg and coworkers have found polar surface area to be a primary determinant of fraction absorbed (Stenberg, et al., 2001). Passive absorption also depends on the solution properties of the compound such as lipophilicity (octanol-water partition coefficient). Van de Waterbeemd and coworkers have suggested low molecular weight of compounds for oral absorption (Van de Waterbeemd, 2001). The distribution of the compound in the human body depends on factors such as blood-brain barrier (BBB), permeability, volume of distribution and plasma protein binding. The octanol-water partition coefficient has been implicated in the BBB penetration and permeability prediction, and so is the polar surface area (Pajouhesh & Lenz, 2005). Metabolism reduces the circulating drug concentration and increases elimination. Metabolism also influences oral bioavailability and toxicology of drugs (Pajouhesh & Lenz, 2005). It has been described by Lombardo and co-workers that the excretion process that eliminates the compound from the human body depends on the molecular weight and octanol-water partition coefficient. Rapid renal clearance is associated with small and hydrophilic compounds. The metabolism of most drugs that takes place in the liver is associated with large and hydrophobic compounds (Lombardo, et al., 2003). Higher lipophilicity of compounds leads to increased metabolism and poor absorption, along with an increased probability of binding to undesired hydrophobic macromolecules, thereby increasing the potential for toxicity (Pajouhesh & Lenz, 2005). These insights (summarized in Table 1) need to be incorporated into the discovery process.
Over the years, several quantitative structure-activity/property relation (QSAR)/QSPR) approaches have been developed using large number of descriptors to predict the ADMET properties, with some success (Subramanian, 2005; Lombardo, et al., 2003). These in-silico prediction methods are mainly based on artificial neural networks; some of which are inaccurate, while others are time-consuming and are not practical. Therefore, a simple and quick method to predict ADMET properties from chemical structures is strongly needed for an effective drug design.
FIELD OF THE INVENTIONThe present invention relates to the computational drug discovery methods and systems for a unique, numerical and graphical, A-POD representation of any number of physicochemical, biological, and pharmacokinetic properties of chemical compounds. Unlike other prediction methods of pharmacokinetic properties which are based on modeling and analysis of other compounds, the present invention is related to the prediction of pharmacokinetic properties based on the given compound's physicochemical properties alone, directly into the novel A-POD representation.
BRIEF SUMMARY OF THE INVENTIONThe present invention, the abbreviated profile of drug, A-POD, is a unique representation for compound properties, which is extremely beneficial in several ways in the early design process of the compound against a pre-identified target. The computer friendly numerical string representation makes the comparison of any properties possible, where as the graphical representation gives a snapshot of properties and their relative changes. The importance of A-POD has also been demonstrated in prediction of ADMET, based on few chemical properties alone, directly into the A-POD representation. The invention provides methods and systems for not only a valuable “add-on” to standard property calculations but also a quick and qualitative predictor of ADMET that might aid in the day-to-day drug design efforts. The first-step has been taken towards unifying the physicochemical, biological, and pharmacokinetic properties in the A-POD representation.
Abbreviated profile of drugs (A-POD), pronounced as A-pod, has been developed to better comprehend several of the important properties in a simple way. The numerical A-POD (NA-POD), pronounced as N-A-pod, is the string of numbers representing several relevant properties of compounds. NA-POD is computer friendly representation of the compound properties, similar to SMILES (Simplified Molecular Input Line Entry System) which is a line notation for representing the structure of molecules and reactions (Weininger, 1988). The graphical A-POD (GA-POD), pronounced as G-A-pod, is a visual representation of the selected compound properties. These two representations will aid in comprehending and visualizing the compound properties as a single entity. A-POD' s usefulness is demonstrated in qualitative prediction of the ADMET properties based on chemical properties.
Selection of Compounds
The compounds used for demonstration and analysis were selected based on the availability of low and high biological activity for a pair of similar compounds. For Thrombin inhibitors, analog containing X=—CH2— with Ki=2000 nM was considered as a reference compound and the analog containing X=—CO— with Ki=500 nM was considered as a “new” compound (
Internet Tool
A web-based tool was created in the form of two Java applets. One applet was developed to display the physiochemical, biological, and pharmacokinetic properties for a given compound. And the other applet was developed to compare a “new” compound with a reference compound. Since, five dominant chemical properties (DCP) were used, the dependency relationships is referred to by the term “DCP5”.
Convention
The order of the chemical properties considered is WRGLS, pronounced as wregals, where each letter corresponds to molecular weight (W), H-bond receiver (R), H-bond giver (G), lipophilicity (L), and polar surface area (S). For brevity, two letter codes are also used such as MW, HA, HD, LP, PS, respectively. These are followed by biological activity (BA) and absorption, distribution, metabolism, excretion, toxicity (ADMET). The complete order is therefore given by “WRGLSBADMET”.
Polar Surface Area
Average polar surface area contributed by the oxygen, nitrogen, sulfur and phosphorous atom groups were computed based on the fragment contributions as described by Ertl and coworkers (Ertl, et al., 2000) (Table 2). Usually, oxygen and nitrogen atoms are considered in computing the polar surface area. Sulfur and phosphorous atoms could be considered in contributing partially to the polar surface area. The average conventional PSA(O, N) contribution is 16.45 Å2 and the average comprehensive PSA(O, N, S, P) is 19.81 Å2 (Table 2). Hence, comprehensive polar surface area is about 20% higher than the conventional polar surface area. The percentage of fraction absorbed with respect to the conventional polar surface area reaches a plateau around 125 Å2. Coincidently or naturally, in the spirit of the number five, the comprehensive polar surface area limit for the drug-like compounds could be considered as 150 Å2. The Lipinski “rule of five” must then be expanded to include the comprehensive polar surface area, thereby making it the fifth rule (Table 3).
Normalization
In order to represent the properties of compound in a meaningful way, it is necessary to normalize them (
Biological Activity
The biological activity considered to be in the micro-molar to nano-molar range was normalized. The lower values correspond to higher efficacy of compounds. In contrast, the increase in the A-POD value represents the increase in biological activity of the compound. The inverse relationship was converted into a direct relationship by considering the one-minus-activity values. In order to easily identify the activity ranges, two groups with 5 values each were implemented; the micro-molar concentrations corresponding to A-POD values from 0 to 4, and the nano-molar concentrations corresponding to A-POD values from 5 to 9.
ADMET Prediction
The individual A or D or M or E or T property value was computed by considering the favorable and unfavorable relationships of the chemical properties as described earlier (Table 1). These relationships can be modified and updated as well. For example, if the molecular weight has opposite effect on absorption, then 1-minus-normalized molecular weight was considered for absorption. And if a particular pharmacokinetic property has more than two chemical property dependencies, then taking into account their direct or inverse dependency, a simple average was computed (
Numerical A-POD
Considering several relevant properties as a string of numbers can represent the abbreviated profile of drug. For a hypothetical compound X after normalizing the compound property values, if molecular weight is 9, the number of hydrogen bond acceptors is 7, the number of hydrogen bond donors is 2, octanol-water partition coefficient is 4, and polar surface area is 6, the numerical A-POD may be written as A-POD(X:97246). From this A-POD, it is clear that in considering the design of new compounds based on this compound, there is less room for increasing the molecular weight and more room in terms of increasing the properties such as the hydrogen-bond donors, partition coefficient and polar surface area. It is obvious that the actual molecular property values are not important in considering the modifications for a particular lead compound. The biological activity value can also be treated in the similar way. If the normalized biological activity for the hypothetical compound is 8, then the abbreviated profile for drug is A-POD(X:972468). Clearly, this hypothetical compound has the near-high biological activity and the compound is drug-like.
Graphical A-POD
The graphical representation allows the non-chemist to have the chemist's memory of the compound. The physicochemical properties combined with the biological activity information for a compound capture useful drug information and make it easy to comprehend the profile of compound. Using these representations, even for a novice, it is extremely helpful to think of possible modifications to the compound and still be within the drug-like space. For thrombin inhibitor (
Comprehensive A-POD
Here, an attempt has also been made to qualitatively predict the individual ADMET properties, based on a simple idea that a few chemical properties predominantly contribute to A, D, M, E or T (Table 1) (Stenberg, et al., 2001; Van de Waterbeemd, 2001; Pajouhesh & Lenz, 2005; Lombardo, et al., 2003). Applying the individual influences of the chemical properties to obtain pharmacokinetic properties, for both the numerical as well as graphical A-POD representations, gives a comprehensive A-POD, CA-pod, pronounced as C-A-pod. This unique way of determining individual pharmacokinetic properties could be customized and experimented by the user. For Thrombin inhibitor, the “new” compound clearly shows that not only the experimentally determined biological activity has improved but also predicts the unchanged absorption and distribution; improved/favorable properties such as metabolism, excretion and toxicity (reduced with respect to the reference compound) (
Difference A-POD
Difference A-POD, DA-pod, pronounced as D-A-pod, is useful in comparing two compounds. Since, the individual A-POD parameters are normalized, two corresponding parameters can be easily subtracted. The sum of all the difference A-POD values gives an overall effect of the modification made with respect to a particular reference compound. If the biological activity has improved for the new compound then the magnitude and direction for each property that changed can be taken as indication of the drug design going in the right direction. For Thrombin inhibitor, the DA-pod representation clearly shows which chemical properties are affected and its consequences on the ADMET properties (
Limitation of A-POD
The limitation of A-POD representation is that compounds must possess the properties within their range for drug-like compounds. It is primarily designed to help make a “good” lead compound into a “better” lead compound with “small” chemical changes, rather than starting with a “bad” compound.
Comparison of A-POD
The SMILES and A-POD representations are both linear notations related to the molecules, the former is a linguistic construct of structure where as the later is a mathematical construct of properties. SMILES strings can be converted back into the structures of the molecules. However, the conversion is not unambiguous (Helson, 1999). A-POD representation by design is not exact, and hence the individual values cannot be used for property conversion. Nevertheless, both SMILES and A-POD are simple, compact and computer friendly.
Unification of Properties in A-POD
The structure and the complex interplay between the various chemical properties of a compound are responsible for the biological efficacy of the compound and its pharmacokinetic properties such as ADMET. The changes in the chemical structure are made so as to maximize the efficacy of the compound with favorable ADMET properties. Thus, the profile of the compound should be considered as a single entity. It is incorrect to consider the ADMET properties separate from the chemical properties. Some of the chemical and biological properties may or may not be sufficiently independent of the others. It would not be worthwhile if the spatial and chemical properties do not produce optimal biological properties. Hence, it is likely that the complex interplay of the spatial and chemical properties as a whole is responsible for the biological properties. The precise value of the various properties may not be important. An attempt to take the first-step has been made towards unifying the chemical and biological properties in the A-POD representation.
Claims
1. A computer-implemented method for predicting the pharmacokinetic properties of a compound based on its chemical properties and providing a comprehensive abbreviated profile of a drug compound in a numerical representation to aid in drug design, said method comprising steps of:
- (i) defining a set of chemical property parameters, wherein the set comprises molecular weight (W), hydrogen-bond acceptors (R), hydrogen-bond donors (G), octanol-water partition coefficient (L), and comprehensive polar surface area (S);
- (ii) defining a set of biological property parameters, wherein the set comprises biological activity (B);
- (iii) defining a set of pharmacokinetic property parameters, wherein the set comprises absorption (A), distribution (D), metabolism (M), excretion (E), and toxicity (T), and setting the pharmacokinetic or ADMET rules based on the favorable and unfavorable dependencies on chemical property parameters;
- (iv) obtaining, for one or more drug compounds, values associated with each of the property parameters and setting an upper and lower limit for each of said values;
- (v) determining, for one or more drug compounds, a set of normalized values for each property parameter by dividing the values associated with each of the property parameters by its upper limit and multiplying the resulting number by ten;
- (vi) determining, for one or more drug compounds, a set of abbreviated profile of drug compound (A-POD) values associated with each of the property parameters by rounding off the normalized values to the lowest integer, wherein the integers are between 0 and 9;
- (vii) determining, for one or more drug compounds, a set of abbreviated profile of drug compound (A-POD) values associated with each of the pharmacokinetic property parameters based on the chemical A-POD values using the ADMET rules and rounding off the values to the highest integer, wherein the integers are between 0 and 9;
- (viii) generating, for one or more drug compounds, a numerical representation of the comprehensive A-POD values associated with each of the property parameters defined as CA-POD (X:WRGLSBADMET), wherein X represents the name of the drug, wherein W corresponds to the A-POD value associated with molecular weight; R corresponds to the A-POD value associated with hydrogen-bond acceptors; G corresponds to the A-POD value associated with hydrogen-bond donors; L corresponds to the A-POD value associated with octanol-water partition coefficient; S corresponds to the A-POD value associated with comprehensive polar surface area; B corresponds to the A-POD value associated with biological activity; A corresponds to the A-POD value associated with absorption; D corresponds to the A-POD value associated with distribution; M corresponds to the A-POD value associated with metabolism; E corresponds to the A-POD value associated with excretion; and T corresponds to the A-POD value associated with toxicity;
- (ix) displaying, for one or more drug compounds, the numerical representation of said A-POD values on a two-dimensional graph, wherein the A-POD values are displayed on the y-axis and the property parameters are displayed on the x-axis;
- wherein steps (i)-(ix) are performed on a suitably programmed computer.
2. The method of claim 1 further comprising: repeating steps (i) through (ix) using two different drug compounds, determining differences between the A-POD values associated with each of the property parameters for the two different drug compounds, and displaying the differences on a two-dimensional graph, wherein the differences between the A-POD values are displayed on the y-axis and the property parameters are displayed on the x-axis.
3. The method of claim 1 wherein the A-POD value for biological property parameter (B) in micromolar range is determined by taking ten thousand minus biological activity and dividing by ten thousand and then multiplying the resulting number by five and finally adding zero.
4. The method of claim 1 wherein the A-POD value for biological property parameter (B) in nanomolar range is determined by taking one thousand minus biological activity and dividing by one thousand and then multiplying the resulting number by five and finally adding five.
5. The method of claim 1 wherein the ADMET rule for absorption (A) is based on favorable chemical properties octanol-water partition coefficient (L) and comprehensive polar surface area (S) and unfavorable chemical property molecular weight (W).
6. The method of claim 5 wherein the absorption (A) is determined using A-POD values by adding octanol-water partition coefficient (L) and comprehensive polar surface area (S) and subtracting molecular weight (W) and dividing the resulting number by three.
7. The method of claim 1 wherein the ADMET rule for distribution (D) is based on favorable chemical properties octanol-water partition coefficient (L) and comprehensive polar surface area (S).
8. The method of claim 7 wherein the distribution (D) is determined using A-POD values by adding octanol-water partition coefficient (L) and comprehensive polar surface area (S) and dividing the resulting number by two.
9. The method of claim 1 wherein the ADMET rule for metabolism (M) is based on favorable chemical property octanol-water partition coefficient (L).
10. The method of claim 9 wherein the metabolism (M) is determined using A-POD values by taking octanol-water partition coefficient (L) and dividing the resulting number by one.
11. The method of claim 1 wherein the ADMET rule for excretion (E) is based on favorable chemical properties octanol-water partition coefficient (L) and molecular weight (W).
12. The method of claim 11 wherein the excretion (E) is determined using A-POD values by adding octanol-water partition coefficient (L) and comprehensive molecular weight (W) and dividing the resulting number by two.
13. The method of claim 1 wherein the ADMET rule for toxicity (T) is based on favorable chemical property octanol-water partition coefficient (L).
14. The method of claim 13 wherein the toxicity (T) is determined using A-POD values by taking octanol-water partition coefficient (L) and dividing the resulting number by one.
15. The method of claim 1 wherein the ADMET rule for the A-POD value associated with any pharmacokinetic property parameter as a function of A-POD values associated with different chemical property parameters is defined using the weighted mean equation such as: wherein,
- Π(Γ)=[ΣαNβNΓN]/ΣβN=[α1β1Γ1+α2β2Γ2+... +αNβNΓN]/ΣβN
- Π is the A-POD value associated with pharmacokinetic property parameter;
- Γ is the A-POD value associated with chemical property parameter(s);
- N is the number of chemical property parameters dependencies;
- α is the favorability factor, which is plus one when favorable and minus one when unfavorable;
- β is the weighting factor for chemical property parameter; and
- Σ is the summation sign.
16. The method of claim 2 wherein the score based on the differences of A-POD values associated with each of the pharmacokinetic property parameters for the two different drug compounds is an indicator of the progress in the drug design, which is determined by adding difference of A-POD values associated with absorption (A) and distribution (D) and subtracting metabolism (M), excretion (E) and toxicity (T).
17. The method of claim 16 wherein the score is assigned a positive sign if the difference of A-POD values associated with biological activity (B) is positive, indicating that the drug design is going in “right” direction as suggested by the trends in the differences of A-POD values associated with the chemical property parameters; the score is assigned a negative sign if the difference of A-POD values associated with biological activity (B) is negative, indicating that the drug design is going in “wrong” direction as suggested by the trends in the differences of A-POD values associated with the chemical property parameters.
Type: Application
Filed: Dec 24, 2012
Publication Date: May 9, 2013
Inventor: CHAITANYA N. HIREMATH (WESTFORD, MA)
Application Number: 13/726,519
International Classification: G06F 19/12 (20060101);