Use of self-assembled monolayers to probe the structure of a target molecule
Weak binding motifs were transformed into a high affinity ligand surface by using a heterologous self-assembled monolayer (SAM) as a rigid scaffold to present discrete binding moieties, in a controlled geometry, to a target molecule. At a critical ligand density, the discrete binding moieties simulated a multivalent ligand and promoted high-affinity, cooperative binding of the target molecule. Statistical calculations were applied to SAM components in solution and gold-sulfur packing dimensions to extract the inter-ligand-distance within the SAM. This distance information is valuable to the rational design of multivalent drugs.
This application is a continuation of U.S. patent application Ser. No. 09/324,258, filed Jun. 2, 1999, which claims priority to U.S. Provisional Patent Application No. 60/087,766, filed on Jun. 2, 1998.
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH AND DEVELOPMENTThis work was supported, in part, by NIH Grants 5T32EM-07598-18 and GM-32308. The government of the United States of America may have some rights in this invention.
FIELD OF INVENTIONThe present invention relates to the use of self-assembled monolayers attached to surfaces for the detection and probing of target molecule structure and function.
BACKGROUND OF THE INVENTIONCombinatorial chemistry techniques are used to synthesize diverse “libraries” of unique chemical compounds. These small molecule libraries often yield drug candidates that are capable of binding a specific biological target but because of their small size and relative simple chemical makeup, they characteristically interact with the target in a low affinity interaction. These low affinity interactions cannot adequately compete with larger more diverse natural ligands, like proteins and protein complexes, and thus provide little therapeutic value. Natural products, which are naturally occurring organisms isolated from soils, yeast, marine organisms, and the like are larger and chemically more interesting than small molecules from combinatorial libraries. Natural products are routinely screened for therapeutic activity against disease-related organisms. Many cancer drugs have been identified in this way. The problem with developing a natural product for the drug marked is that they are large and chemically complicated, which means that elaborate and expensive schemes for their synthesis must be developed. Identifying a synthetic scheme that is commercially feasible is a technical challenge that at best takes years and millions of dollars to accomplish and at worst cannot be done. For this reason, there is interest in enhancing the affinity between small molecule drugs and their biologically relevant targets.
Knowles and colleagues, at Harvard, reported that they could enhance the binding affinity of a small molecule for a particular target by attaching a “greasy tail” to the small molecule. This hydrophobic tail was later shown to interact with a hydrophobic patch on the target molecule adjacent to the binding site.
Many biologically relevant target molecules present more than one binding site for a particular ligand. Some present pseudo identical binding sites with which they bind natural ligands that contain “repeats” of a binding motif. It is known that bivalent interactions (like antibody interactions) are higher affinity interactions than monovalent interactions, due to the cooperative binding effect. Therefore, one would like to link several small molecule drugs together to form a pseudo multivalent drug that would interact more strongly with a multi-binding-site target molecule. The problem with this logic is that the enthalpic advantage of the additional binding energy is offset by the large entropic energy cost of ordering the connected binding moieties. However, making the linker between the binding moieties a rigid linker would introduce order and thus minimize the entropic cost to yield a higher affinity interaction. In order to connect two binding moieties (the small molecule drugs) with a rigid linker, in a geometry that would encourage its binding to the target molecule, one would need to know apriori the distance between the binding sites on the target molecule. This inter-binding-site distance information is currently derived from X-ray or NMR structure determination of the target molecule. This process is time-consuming (years) and expensive.
The subject of this invention is how self-assembled monolayers (SAMs) can be used to present discrete binding moieties, at varying densities, in a rigid 2-dimensional array, to multivalent target molecules in order to promote a higher affinity, cooperative interaction. Ligand densities within the SAM are varied to determine the critical distance between binding moieties that will promote simultaneous, cooperative binding of the target molecule. By monitoring the kinetics of binding events between the target molecule and the variable density ligand surfaces, one can empirically determine the lowest surface density that prompts a large shift in affinity for the multivalent target molecule. One can then use Poisson statistics to infer the distance between surface-immobilized ligands and thus also the distance between the binding sites on the target molecule. Once this distance information has been deduced, it can be used to rationally design bi- or multi-valent drugs or rigid linkers to connect two binding moieties. Alternatively, the SAM itself can become a part of the “drug”; in this case, the SAM is used as the “rigid linker” between binding moieties to present multiple binding motifs, at the empirically determined critical density, to promote the higher affinity cooperative interaction. The SAM, presented ligands and underlying gold (may be gold colloids) are both the drug and the drug delivery system. Inert thiols of the SAMs can be terminated with lipid-like groups to facilitate drug delivery. Similarly, a biospecific ligand could be incorporated (at varying densities) into a liposome, at the critical presentation density determined, and used directly as a multivalent drug in its own delivery system.
SUMMARY OF THE INVENTIONSelf-assembled monolayers are used as a rigid 2-dimensional matrix for presenting binding moieties, at varying distances from each other, to a target molecule. Two-component SAMs incorporate an inert spacer molecule and a biospecific molecule that can directly or indirectly present a binding moiety to a target molecule. The distance between the biospecific molecules in the array, the ligand density, is controlled by manipulating the concentrations of the two component thiols in solution before deposition onto gold. The affinity of the interaction between the surface immobilized ligands and the multivalent target molecule is monitored as a function of ligand density. The lowest ligand surface density that elicits a jump in affinity for the target molecule contains the critical information needed to extract the distance between binding sites on the target molecule. The dimensions of the hexagonal tiling pattern formed when the sulfurs from the thiols bind to gold solid are known. Therefore, Poisson statistics can be used to infer the distance between surface immobilized ligands, and thus the inter-binding-site distance on the target molecule, from the concentrations of the thiols in solution. Further, the gold surface itself and the attached SAM can be used as a scaffold to present binding moieties, in a controlled, higher affinity geometry, to a target molecule.
In a preferred embodiment, SAMs are generated that incorporate two thiol types: 1) an inert tri-ethylene glycol-terminated thiol; and 2) a nitrilo tri-acetic acid (NTA) terminated thiol that when complexed with Ni, captures histidine-tagged proteins or peptides. The density of NTA-thiol within the SAM is varied to present varying densities of a histidine-tagged binding moiety to a multi-valent target molecule. The affinity of the interaction is plotted as a function of ligand density within the SAM. A dramatic increase in the binding affinity occurs at a critical surface density when the presented ligands are close enough to each other to simultaneously bind to a common target molecule. The solution concentrations of the two thiol types and the dimensions of the tiling pattern that the thiols form on the gold substrate are input into Poisson distribution equations to extract the probable distance between binding sites on a target molecule.
BRIEF DESCRIPTION OF THE DRAWINGS
Variable density nitrilotriacetic acid (NTA)-SAMs were used to probe the binding site(s) of a biologically important molecule, the human general transcription factor TATA box binding protein (hTBP) [Burley, S. K. and Roeder, R. G. (1996) Biochemistry and structural biology of transcription factor IID (TFIID). Annu. Rev. Biochem. 65:769-799]. This transcription factor has been implicated as a direct target of transcriptional activators such as VP16 [Ingles, J. C., M. Shales, W. D. Cress, S. J. Triezenberg and J. Greenblatt. (1991) Reduced binding of TFIID to transcriptionally compromised mutants of VP16. Nature. 351:588-590]. In fact, the need for an activator is eliminated when TBP is artificially tethered to a DNA promoter [Xiao, H., J. D. Friesen and J. T. Lis. 1995. Recruiting TATA-binding protein to a promoter: transcriptional activation without an upstream activator. Mol. and Cell. Biol. 15(10):5757-5761].
Transcriptional activator proteins are modular in that they have functionally separable domains [Brent, R. and M. Ptashne. (1985) A Eukaryotic transcriptional activator bearing the DNA specificity of a prokaryotic repressor. Cell. 43:729-736], a DNA binding domain, and an activating region. The structures of TBP [Nikolov, D. B., H. Chen, E. D. Halay, A. A. Usheva, K. Hisatake, D. K. Lee, R. G. Roeder and S. K. Burley. (1995) Crystal structure of a TFIIB-TBP-TATA element ternary complex. Nature. 377:119-128] and several activator DNA binding domains [Marmorstein, R., M. Carey, M. Ptashne, and S. C. Harrison. 1992. DNA recognition by Gal4: structure of a protein/DNA complex. Nature. 356:408-414; Ellenberger et al., 1992; and Baleja, J. D., R. Marmorstein, S. C. Harrison and G. Wagner. 1992]. The structure of the DNA-binding domain of Cd2-Gal4 from Saccaromyces cervisiae in solution has been solved, yet the structure of an activating region, alone or complexed with a target molecule has remained elusive. Fundamental questions as to how an activating region effects gene transcription remain unanswered. One mechanistic model of gene activation proposes that DNA-bound activators trigger transcription by merely “recruiting” some necessary factor, perhaps TBP, to the promoter through direct contact with the activating region [Triezenberg, S. J. 1995. Structure and function of activation domains. Curr. Opin. Genet. Dev. 5(2):190-196]. Another model proposes that activating regions induce a conformational change in a target protein(s) [Sheldon and Reinberg, 1995] or sequentially perform some function until a threshold is reached which catalyzes gene transcription.
In eukaryotes, more than one DNA-tethered activator is typically required to achieve activated transcription and that multiply bound activators transcribe synergistically [Lin, Y. S., M. Carey, M. Ptashne and M. R. Green. (1990) How different eukaryotic transcriptional activators can cooperate promiscuously. Nature 345:359-361]. Cryptic repeats of minimal activation motifs have been identified in eukaryotic activators that, when tandemly reiterated and tethered to DNA, efficiently activate transcription in vitro [Blair et al., 1994; Tanaka, M. and W. Herr, (1994) Reconstitution of transcriptional activation domains by reiteration of short peptide segments reveals the modular organization of a glutamine-rich activation domain. Mol. Cell. Biol. 14(9):6056-6067]. An eight amino acid minimal activation motif (DFDLDMLG) derived from the prototypic mammalian activator VP16 was recently identified [Tanaka, M. (1996) Modulation of promoter occupancy by cooperative DNA binding and activation-function is a major determinant of transcriptional regulation by activators in vivo. Proc. Natl. Acad Sci. USA. 93(9):4311-4315]. As an exemplary embodiment, this invention describes novel biophysical methods to quantitate the kinetics, as well as investigate the mechanism, of the interaction between hTBP and tandem repeats of the VP 16 minimal motif.
The interactions were characterized by SPR in a BIAcore instrument. SPR is a fairly new optical technique for the real time detection and kinetic analysis of intermolecular interactions [Liedberg, B., C. Nylander and L. Lundstrom. (1983) Surface plasmon resonance for gas detection and biosensing. Sens. Actuators. 4(2):299-304.; Daniels et al., 1988; Lofas, S. and Johnsson, B. (1990) A novel hydrogel matrix on gold surfaces in surface plasmon resonance sensors for fast and efficient covalent immobilization of ligands. J Chem. Soc., Chem. Commun.: 1526-1528]. The basis of the technology is as follows: ligands are immobilized on a surface; putative target molecules are flowed over this surface; the protein concentration at the solution-surface interface changes as target binds ligand. The increased protein mass at the interface causes a change in the optical properties of the system. The amount of new protein recruited to the interfacial region can be quantitated by measuring the change in the angle at which light reflected off the interface is a minimum [for a review see Bamdad, C. 1997. Surface plasmon resonance for measurements of biological interest. Current Protocols in Molecular Biology 20.4.1-20.4.12.]. Changes in this angle are measured in resonance units (RUs) where 1 RU is defined as a change of {fraction (1/10,000)}th of a degree. A rule of thumb is that for a distance of about 150 mn from the interface, 1 ng protein/mm2 registers 103 RUs.
SAMs were generated that incorporated an NTA group for the specific binding of histidine-tagged peptides. The density of NTA in the SAM was varied so that different amounts of a His-tagged activation motif could be presented to TBP, in solution. SPR was used to quantitate avidity effects between TBP and surface-bound peptides as a function of peptide density.
A panel of variable density NTA-SAMs were prepared by diluting the concentration of the active component, NTA-thiol, relative to that of the inert component, EG3-thiol, in ethanol solutions. Gold-coated glass slides were incubated in solutions containing 1.3%, 3.8%, 5.7%, or 11.4% NTA-thiol, with the total thiol concentration constant at 1 mM. The SAMs were glued onto blank CM-5 SPR chip cassettes and docked into a BIAcore instrument. A 16-mer peptide comprised of two repeats of the eight amino acid minimal activation motif (X=DFDLDMLG), derived from the human activator VP16, was fused to histidine-tagged GST (GST-2×). The fusion proteins were then immobilized on variable density SAMs through complexation of the NTA group by the protein's histidine tag. This generated a series of surfaces that displayed peptides at incrementally decreasing distances from each other. The core region of human TBP (hTBPc: residues 155-335) (Nikolov et al., 1995) was injected over the peptide surfaces. GST-2× immobilized at low density (1.3%-3.8%), was unable to bind hTBPc. In contrast, when the same concentration hTBPc was injected over a more dense (5.7%-11.4%) GST-2× surface, where the average distance between peptide motifs would be smaller, a high affinity interaction resulted (see
As the graph of
Note that at high NTA density, the chip surface acted as a rigid linker between two −2× modules to mimic a 4× module, thus creating a higher affinity ligand. Three possible models might explain why the 4× peptide is a higher affinity ligand for hTBPc than a 2× peptide (See
In order to compare dissociation rates, aliquots of hTBPc were pre-incubated at very high concentration (35 μM) with either buffer, 2× peptide (1:4 stoichiometry), or 4× peptide (1:2 stoichiometry), then diluted to the usual hTBPc concentration (124 nM) before injection over GST-4× surfaces. Synthetic 2× (16-mer) and 4× (32-mer) peptides were used to eliminate possible interference from GST.
The experiments tabulated in
Next the kinetics of the surface interaction to analogous interactions in solution were compared. A series of equilibrium inhibition experiments were performed to characterize the solution interactions between hTBPc and 2× or 4× peptides. Aliquots of hTBPc, (124 nM), were mixed with increasing amounts of synthetic 2× or 4× peptide then incubated at 4° for 1 hour prior to injection over GST-4× surfaces. Titration curves (see
The physiological relevance of the interaction between hTBP and the reiterated minimal motifs was investigated. It has been argued that the widely observed in vitro interactions between TBP and activation domains are artifacts resulting from a nonspecific interaction between TBP's basic DNA-binding region and the acidic peptides. To rule out this possibility, N-terminally histidine-tagged hTBP was immobilized on NTA-SAMs then separately incubated with either: a) TATA sequence DNA; or b) DNA that did not contain a hTBP recognition sequence. GST-4× was then injected over the derivatized surfaces. DNA that did not contain a TATA sequence did not bind to the immobilized hTBP significantly. DNA containing a TATA sequence bound to immobilized hTBP with approximate 1:1 stoichiometry but was in no way inhibitory to the subsequent binding of GST-4× (see
A competitive inhibition experiment was performed to determine whether the 4× peptide could block the interaction between hTBP and the native activation domain of VP16. A histidine-tagged Gal4(1-147)+VP16(413-490) fusion protein was immobilized on NTA-SAMs. hTBP was incubated with buffer or 4× peptide then injected over VP16 derviatized surfaces. The last two lines of
In conclusion, SAMs were used to form biospecific rigid, nano-scale probe arrays of known surface density and then utilized to determine the number of binding sites on a target molecule and an approximate distance between sites. This approach is not hampered by the vagaries of secondary or tertiary structures that would be encountered by using DNA or peptide spacers to determine distances between active sites. SPR was used to show that the avidity between TBP, in solution, and surface immobilized peptides was a non-linear function of peptide surface density.
Peptides immobilized on a 3.8% NTA-SAM were not able to bind hTBP, while peptides presented on a 5.7% NTA-SAM bound TBP with nano-molar affinity. The findings are consistent with the idea that this large increase in binding strength marks the transition between mono- and bivalent binding of the target protein. Individual 8 amino acid minimal activation motifs separated by a 15 amino acid flexible linker bound hTBP nearly as well as four tandem repeats of the motif, leading to the conclusion that hTBP has at least two discrete sites capable of simultaneously interacting with the 8 amino acid motif. Calculations based on an assumed Poisson distribution of NTA in the SAM indicate that the surfaces that did not bind hTBP (3.7% NTA) presented peptides an average distance of 29 Å apart while peptides in denser arrays (5.7% NTA) that bound hTBP with high avidity were on average 2 Å apart.
The crystal structure of hTBPc has been solved (Nikolov et al., 1995). The peptide consists of two imperfect repeats that form a two-domain saddle shaped DNA-binding protein with two-fold intramolecular symmetry. TBP binds DNA with the concave underside of its “saddle” shape. The general transcription factor TFIIB binds near the TBP/DNA complex at the downstream end leaving the convex “seat” of the saddle available for other intermolecular interactions. Quasi-identical structures composed of basic helices and P sheets flank the seat of the saddle. Mirror image helices H2 and H2′ are separated by distances on the order of 20 Å. It is conceivable that the minimal activation motifs, described herein, simultaneously bind to two-fold related pseudo-identical recognition sites that may be separated by approximately 23 Å.
Similar schemes can be devised to determine distances between active sites on other bivalent molecules or complexes. Of particular interest are dimeric hormone receptors whose signaling activity depends on its association state. Detailed knowledge of distances between active sites would allow for the rational design of agonist or antagonist drugs.
Experimental MethodsProtein preparation: hTBPc was prepared according to Nikolov et al., 1996 and full length histidine-tagged hTBP according to Lee et al. [Lee, W. S., C. C. Kao, G. O. Bryant, X. Liu and A. J. Berk. (1991) Adenovirus EIA activation domain binds the basic repeat in the TATA box transcription factor. Cell 67:365-376]. Glutathione S-transferase (GST) fusion proteins were prepared according to Tanaka, 1996. The preparation of Gal4-VP 16 is described by Hori, R., S. Pyo and M. Carey, 1995. Protease footprinting reveals a surface on transcription factor TFHB that serves as an interface for activators and co-activators. Proc. Natl. Acad Sci. USA. 92(13):6047-6051.
DNA: TATA sequence DNA was prepared according to Parvin et al. [Parvin, J. D., R. J. McCormick, P. A. Sharp, and D. E. Fisher. 1995. Pre-bending of a promoter sequence enhances affinity for the TATA-binding factor. Nature. 373:724-727] with the exception that it was not circularized. A 50 base-pair double stranded oligo containing 2 Gal4 binding sites, synthesized and quantitated by GibcoBRL, Life Technologies Inc., Grand Island, N.Y., was used as non-specific control DNA. Equal mass amounts of specific vs. non-specific DNA were added.
Synthetic peptides: Peptides were generated by F-MOC synthesis and quantitated by amino acid analysis, analytical HPLC and mass spectroscopy.
The preparation self-assembled monolayers: NTA-SAMs were prepared according to Sigal et al., 1996. A panel of incrementally different density NTA surfaces was generated by serial dilution of a stock solution containing 11.4% NTA-thiol, relative to tri-ethylene glycol terminated thiol, into solutions containing the tri-ethylene glycol terminated thiol alone. Total thiol concentration was kept constant at 1 mM. NTA-SAMs were stored under argon for up to 1 week prior to use. Background levels of binding were assessed by passing reactants over underivatized GST surfaces and subtracted.
Surface plasmon resonance: Experiments were carried out in a BIAcore instrument at room temperature in phosphate buffered saline (PBS) (137 mM NaCl) running at a constant flow rate of 5 μl/min. Sample injection volumes (plugs) were 35 ∥l. Association and dissociation rate constants were extracted from the data with BIAevaluation software, version 2.1, assuming a pseudo first order kinetics model: A+BAB. Error rates were taken from the deviation of measurements among multiple experiments performed on surfaces of different NTA densities with a range of protein concentrations and using several different protein preparations, of the same species, to account for variation of the active concentration of a component.
Statistical calculations: Sulfur atoms bind to gold to form a face-centered hexagonal tiling pattern 4.99 Å on edge. In an ordered monolayer, all the positions of the hexagon are occupied by a thiol. Each vertex is shared by three hexagons, so there are three possible positions for thiol deposition per hexagon. If the thiol solution is doped with a derivatized species of thiol, such as ours is, the average number of NTA-thiols deposited per some number of hexagons (λ), can be calculated, assuming Poisson statistics, for a given NTA-thiol concentration. (It was assumed that the concentration of NTA-thiol in solution was equal to its concentration in the SAM; see
Since there are equal numbers of nearest and next-nearest neighbors, the average of these two distances is a first order approximation of the average distance between ligands resulting from a random distribution. According to this model, NTA ligands on SAMs formed from a 3.8% NTA-thiol solution would be an average of 29 Å apart, while NTA ligands in a SAM formed from a 5.7% NTA-thiol solution would be 23 Å apart. Calculations were done to evaluate the contribution of clustering using Poisson statistics.
Equation 2 calculates the probability, P, of having n NTA ligands per unit area, where λ, equals the average number of NTAs per unit area. Equation 3 calculates the ratio of the probabilities of having one NTA ligand to two NTA ligands deposited per unit area. It is 17-times more likely to get one NTA than two, per unit area, for 3.8% NTA-thiol SAMs and 11 times more likely at 5.7% NTA concentration.
P(n)=e−λλn/n! (2)
All publications cited in this application are hereby incorporated by reference. The citation of any publication is for its disclosure prior to the filing date and should not be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention.
It is to be understood that the above invention is not limited to the particular embodiments described which are meant to be for illustrative purposes only. Variations and modifications of these embodiments may be made that are still included in the description of this invention and fall within the scope of the appended claims.
Claims
1. A method for presenting discrete binding moieties in a controlled geometry that promotes high-affinity, cooperative binding to a target molecule, comprising forming a self-assembled monolayer that incorporates at least one thiol species (the biospecific component) that is capable of directly or indirectly displaying a binding partner to said target molecule and at least one inert spacer thiol component by the process of:
- (a) mixing the biospecific and a second component of said self-assembled monolayer in defined proportions; and
- (b) forming said self-assembled monolayer on a suitable substrate.
2-39. (canceled)
Type: Application
Filed: Apr 6, 2004
Publication Date: Apr 28, 2005
Applicant: Minerva Biotechnologies, Inc. (Newton, MA)
Inventor: Cynthia Bamdad (Newton, MA)
Application Number: 10/818,694