METHOD FOR DETECTING CELLS
The present invention relates to methods for detecting the chromatin state of a cell based on recording a super resolution image of nucleosome organization and correlating said image with size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch. Additionally, the invention relates to a kit comprising a first antibody capable of specifically binding to a histone protein and a photoswitchable fluorophore-linked secondary antibody, and to the use of the kit of the invention for detecting the chromatin state of a cell and isolating a cell in an open chromatin state or in a close chromatin state. The invention also relates to a device adapted to detect the chromatin state of a cell.
The present invention belongs to the field of methods for cell identification.
BACKGROUND OF INVENTION
Pluripotent stem cells have potential to differentiate into any of the three germ layers: endoderm, mesoderm, or ectoderm and provide a chance to obtain a renewable source of healthy cells and tissues to treat a wide array of diseases.
Methods currently used to detect/isolate pluripotent cells have inherent experimental variability and low efficiency. They include: (1) mechanical isolation based on morphology, which requires experience and is laborious and inefficient; (2) quantification of the endogenous expression of stem cell transcription factors (OCT4, SOX2, etc.) in live cells, which requires genome modification; (3) fluorescence-activated cell sorting (FACS)-based analysis using cell surface markers (SSEA-4, TRA-1-60, etc.), which requires antibody-based staining that is inherently variable; and (4) more recently, a pluripotent stem cell-specific adhesion signature, which is dependent on the surface properties of cell clusters and thus interrogates the population and not individual cells. Additionally, the identification of high-grade pluripotent hiPSCs is time consuming, requiring the generation of teratomas and several additional pluripotency tests.
Several studies of chromosome territory occupation and genome distribution inside the nucleus show that the epigenome is dynamic and that, among other processes, it contributes to gene expression and cell differentiation.
Recent studies have revealed key differences in chromatin states of pluripotent cells as compared to differentiated cell types.
The spatial organization of chromatin inside the nucleus plays a key functional role. However, how nucleosomes are arranged to form the chromatin fiber is still highly debated.
The existence of a hierarchical organization of the chromatin fiber inside intact eukaryotic nuclei in vivo has recently been debated after cryo-electron microscopy, small-angle X-ray scattering (SAXS) and electron spectroscopic imaging experiments failed to detect the 30-nm fiber. The structural information obtained in these studies led to the overall conclusion that the eukaryotic nuclei are mainly composed of 10 nm fibers even though the core histone proteins could not be identified unequivocally using these methods due to their lack of molecular specificity. In addition, genome-wide analyses have revealed that nucleosomes are depleted at promoter and terminator regions and at many enhancers and that nucleosomes occupy preferred positions in genes and non-gene regions. Since the 30-nm fiber arrangement imposes specific constrains on nucleosome occupancy and positioning, these genome-wide analyses along with the latest imaging results argue against a hierarchical organization of nucleosomes along the chromatin fiber.
Conventional microscopy has shown that heterochromatin appears in large regions in pluripotent cells but is confined to small foci in differentiated cells, confirming that chromatin in pluripotent cells assumes a globally more open conformation (Meshorer E. et al., 2006).
To date, however, the super-resolution studies of DNA and histones have not addressed questions regarding the organization of single or groups of nucleosomes, the overall nucleosome occupancy level of DNA and whether these parameters are consistent with the 30 nm fiber model of chromatin. How the chromatin organization changes at the nanoscale level as a function of cell state such as pluripotency and differentiation, while of fundamental importance, has also not been studied. In general, what has been lacking is a quantitative approach that can count and determine the number of nucleosomes within the chromatin fiber and thus identify nucleosome spatial arrangement at the nanoscale level.
Given the current debate on nucleosome occupancy, positioning and organization, and the importance of these parameters for DNA accessibility and gene expression, novel methods are needed that allow quantitative visualization of nucleosome organization with high molecular specificity at nanometer length scales in individual intact nuclei, and that make it possible to determine the chromatin state of a cell without the disadvantages of harsh sample preparation, lack of molecular specificity or low spatial resolution.
SUMMARY OF THE INVENTION
Combining quantitative super-resolution nanoscopy with computer simulations, the inventors detected a striking heterogeneity in the nucleosome organization of intact eukaryotic nuclei. Nucleosomes formed groups of varying sizes, which they term “clutches”, and these were interspersed with nucleosome-depleted regions. Remarkably, the median number of nucleosomes and their compaction inside clutches highly correlated with cellular state, such that clutch size is predictive of pluripotency grade. Ground-state pluripotent stem cells had, on average, less dense clutches containing fewer nucleosomes. RNA polymerase II preferentially associated with the smallest clutches. These results provide novel insights into chromatin organization at the nanoscale level and open new possibilities for identification of stem cells through the structural organization of their chromatin fibers.
In a first aspect, the invention relates to a method for detecting the chromatin state of a cell comprising
- a) contacting a sample containing cells with a first antibody capable of specifically binding to a histone protein,
- b) contacting the antibody:histone complex formed in step a) with a secondary antibody having at least one photoswitchable fluorophore adapted to be optically excited at a certain wavelength λ1 and to emit light at a wavelength λ2 different from λ1,
- c) recording a super resolution image of nucleosome organization by means of a sensor being sensitive at least to the wavelength of emission of the photoswitchable fluorophore by exciting the sample with an optical radiation having a wavelength λ1,
- d) correlating the image obtained in step c) with size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutches, and
- e) comparing data obtained in step d) with a corresponding reference value to obtain a score based on size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch,
wherein smaller clutches, less densely compacted nucleosomes or fewer nucleosomes per clutch compared to the corresponding reference value are indicative that said cell is in an open chromatin state, and wherein bigger clutches, more densely compacted nucleosomes or more nucleosomes per clutch compared to the corresponding reference value are indicative that said cell is in a close chromatin state.
In a second aspect, the invention relates to a method for isolating a cell in an open chromatin state comprising
- a) detecting the chromatin state of a cell by a method according to the invention, and
- b) isolating a cell having smaller clutches, less densely compacted nucleosomes or fewer nucleosomes per clutch.
In a third aspect, the invention relates to a method for isolating a cell in a close chromatin state comprising
- a) detecting the chromatin state of a cell by a method according to the invention, and
- b) isolating a cell having bigger clutches, more densely compacted nucleosomes or more nucleosomes per clutch.
In a fourth aspect, the invention relates to a kit comprising a first antibody capable of specifically binding to a histone protein and a photoswitchable fluorophore-linked secondary antibody.
In a fifth aspect, the invention relates to the use of a kit of the invention for detecting and isolating a cell in an open chromatin state or in a close chromatin state.
In a sixth aspect, the invention relates to a device adapted to detect the chromatin state of a cell comprising
- a source of optical radiation adapted to emit light at a wavelength λ1 over an interrogation area adapted to receive a biological sample,
- an optical sensor sensitive to a second wavelength λ2 and adapted to measure the optical radiation at λ2,
- a control unit connected to the optical sensor and to the source of optical radiation wherein said control unit is adapted to carry out the method of the invention.
The authors of the present invention have resolved how nucleosomes are arranged along the chromatin fiber in a large number of different cell types. Their observations indicate that nucleosomes are grouped in discrete domains, which they termed “nucleosome clutches” in analogy with “egg clutches” (Example 1). They developed quantitative methods to assess clutch size, defined as the number of nucleosomes per clutch, and found that this number is very heterogeneous in a given nucleus, arguing against the existence of a well-organized and ordered fiber. By comparing experimental data to computer simulations they estimated the nucleosome occupancy of the chromatin fiber and found that nucleosome-depleted regions intersperse nucleosome clutches. Two-color super-resolution imaging showed increased levels of H1 in larger clutches containing more nucleosomes, suggesting that H1 might be responsible for bringing nucleosomes into close proximity inside the clutches. On the other hand, RNA Polymerase II associated more closely with smaller clutches containing fewer nucleosomes, suggesting that the chromatin fiber within these regions is more accessible (Example 5). Strikingly, despite the heterogeneity in the clutch size in a given nucleus, on average differentiated cells contained clutches with a larger number of nucleosomes compared to stem cells. Furthermore, there was a high degree of correlation between clutch size and pluripotency grade of wild-type and mutant mouse embryonic stem cells (mESC) cultured under different conditions and pluripotency grade of a number of different human induced pluripotent stem cell (hiPSC) clones. Therefore, nucleosome organization is predictive of cell pluripotency (Example 4). These results open up exciting possibilities for identifying stem cells by analyzing their nucleosome arrangement.
Thus, the inventors have developed a method for identifying the chromatin state of a cell by analyzing their nucleosome arrangement.
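By way of illustration only, the notion of clutch size described above can be sketched in Python as a grouping of localization coordinates followed by a count per group. The single-linkage grouping rule, the distance threshold `max_gap_nm`, and the toy coordinates are assumptions introduced for this sketch and are not taken from the Examples; in practice the number of localizations would also have to be calibrated to a number of nucleosomes.

```python
from math import hypot
from statistics import median


def group_into_clutches(points, max_gap_nm):
    """Group 2D localizations (in nm) by single linkage.

    Two localizations end up in the same group when a chain of neighbours,
    each closer than max_gap_nm to the next, connects them.
    The threshold is a hypothetical parameter for this sketch.
    """
    parent = list(range(len(points)))

    def find(i):
        # Follow parent links to the group representative (with path halving).
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, (xi, yi) in enumerate(points):
        for j in range(i + 1, len(points)):
            xj, yj = points[j]
            if hypot(xi - xj, yi - yj) <= max_gap_nm:
                parent[find(i)] = find(j)  # merge the two groups

    groups = {}
    for i, p in enumerate(points):
        groups.setdefault(find(i), []).append(p)
    return list(groups.values())


# Toy coordinates in nm, made up for illustration.
localizations = [(0, 0), (5, 0), (0, 5), (100, 100), (104, 100), (300, 300)]
clutches = group_into_clutches(localizations, max_gap_nm=20)
sizes = sorted(len(c) for c in clutches)
median_size = median(sizes)
print(sizes, median_size)
```

The quadratic pairwise loop is chosen for readability; a real data set with many localizations would likely need a spatial index.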
Method for Detecting the Chromatin State of a Cell
According to the “textbook picture”, chromatin compaction follows a hierarchical model where nucleosomes form a “beads-on-string” fiber of 10 nm in diameter, which folds into higher ordered fibers of 30 nm, which in turn compact progressively into larger fibers of 100-200 nm.
The structural information needs to be obtained using optical means having optical sensors combined with post-processing software configured to reveal internal structures having a length scale of about 10 nm.
In a first aspect, the invention relates to a method for detecting the chromatin state of a cell comprising,
- a) contacting a sample containing cells with a first antibody capable of specifically binding to a histone protein,
- b) contacting the antibody:histone complex formed in step a) with a secondary antibody having at least one photoswitchable fluorophore adapted to be optically excited at a certain wavelength λ1 and to emit light at a wavelength λ2 different from λ1,
- c) recording a super resolution image of nucleosome organization by means of a sensor being sensitive at least to the wavelength of emission of the photoswitchable fluorophore by exciting the sample with an optical radiation having a wavelength λ1,
- d) correlating the image obtained in step c) with size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutches, and
- e) comparing data obtained in step d) with a corresponding reference value to obtain a score based on size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch,
wherein smaller clutches, less densely compacted nucleosomes or fewer nucleosomes per clutch compared to the corresponding reference value are indicative that said cell is in an open chromatin state, and wherein bigger clutches, more densely compacted nucleosomes or more nucleosomes per clutch compared to the corresponding reference value are indicative that said cell is in a close chromatin state.
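Steps d) and e) can be sketched, under stated assumptions, as a comparison of one summary statistic against a reference value. The sketch below uses only the number of nucleosomes per clutch, takes the median as the summary, and defines the score as the difference from the reference; the function name `chromatin_state`, the choice of the median, and all numeric values are hypothetical and are not prescribed by the method.

```python
from statistics import median


def chromatin_state(nucleosomes_per_clutch, reference_median):
    """Compare a cell's median clutch size with a reference value.

    Returns a (label, score) pair. A negative score (fewer nucleosomes per
    clutch than the reference) is labelled "open", a positive score "close".
    The scoring rule is an assumption made for this sketch.
    """
    score = median(nucleosomes_per_clutch) - reference_median
    if score < 0:
        label = "open"
    elif score > 0:
        label = "close"
    else:
        label = "indeterminate"
    return label, score


# Made-up per-clutch counts for one cell and a made-up reference value.
cell_counts = [2, 3, 4, 5]
result = chromatin_state(cell_counts, reference_median=6.0)
print(result)
```

Clutch size in nm or nucleosomal density could be scored the same way by passing the corresponding per-clutch measurements and reference value.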
Detecting, as used herein, refers to determining and/or identifying whether a cell is in an open or close chromatin state. As will be understood by those skilled in the art, the detection, although preferred to be, need not be correct for 100% of the cells to be detected or evaluated. The term, however, requires that a statistically significant portion of cells can be identified as being in an open chromatin state or in a close chromatin state. Whether a portion is statistically significant can be determined without further ado by the person skilled in the art using various well-known statistical evaluation tools, e.g., determination of confidence intervals, p-value determination, Student's t-test, Mann-Whitney test, etc. Details are found in Dowdy and Wearden, Statistics for Research, John Wiley & Sons, New York 1983. Preferred confidence intervals are at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, or at least 95%. The p-values are, preferably, 0.05, 0.01, 0.005 or lower.
As the skilled person will understand, the method of the invention allows comparing the chromatin state between two cells, and thus it is possible to determine if two cells have a similar or different chromatin state.
“Chromatin state of a cell”, as used herein, relates to a condition of a cell showing open (active) chromatin or close (inactive) chromatin. Said terms are known to the skilled person. “Open chromatin” means a DNA in which histone modifications such as acetylation lead to exposure of a DNA sequence, thus allowing binding of transcription factors and transcription to take place. Open chromatin is structurally loose to allow access to RNA and DNA polymerases that transcribe and replicate the DNA. “Close chromatin” is found associated with structural proteins and includes modifications of the histone tails that lead to a more tightly packaged state of the chromatin, which is less accessible to the binding of the majority of transcription factors and polymerases.
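As an illustration of one of the statistical tools named above, the following sketch implements a two-sided Mann-Whitney test in plain Python using the normal approximation. It is a simplified version under stated assumptions: tied values receive midranks, but no tie correction or continuity correction is applied to the variance, so for small samples an exact test would be preferable. The two samples are made-up clutch sizes, not experimental data.

```python
from math import erfc, sqrt


def mann_whitney_u(a, b):
    """Two-sided Mann-Whitney U test (normal approximation).

    Returns (U statistic of sample a, approximate p-value).
    """
    # Pool both samples, remembering which group each value came from.
    pooled = sorted((v, g) for g, vals in enumerate((a, b)) for v in vals)
    rank_sum_a = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2  # average of ranks i+1 .. j for tied values
        rank_sum_a += midrank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    n1, n2 = len(a), len(b)
    u1 = rank_sum_a - n1 * (n1 + 1) / 2
    mean_u = n1 * n2 / 2
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u1 - mean_u) / sd_u
    p_value = erfc(abs(z) / sqrt(2))  # two-sided tail of the standard normal
    return u1, p_value


# Made-up nucleosomes-per-clutch values for two hypothetical cell populations.
population_a = [2, 3, 3, 4, 5, 4, 3, 2]
population_b = [6, 7, 8, 9, 7, 8, 10, 9]
u_stat, p = mann_whitney_u(population_a, population_b)
print(u_stat, p)
```

With these toy values the two samples do not overlap, so U is zero and the approximate p-value falls below the 0.01 threshold mentioned above.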
In a preferred embodiment the cell in an open chromatin state is selected from the group consisting of transcriptionally active cells, pluripotent cells, cancer cells and drug perturbed cells. In a more preferred embodiment, the cell in an open chromatin state is a pluripotent cell. The term “transcriptionally active cell” as used herein, relates to a cell having an active chromatin, which means a DNA in which histone modifications such as acetylation lead to exposure of a DNA sequence thus allowing binding of transcription factors and transcription to take place.
“Pluripotent cell”, as used herein, relates to a primordial cell that can differentiate into a sub-group of specialized types of cells, for example, a stem cell that has the potential to differentiate into any of the three germ layers: endoderm (interior stomach lining, gastrointestinal tract, the lungs), mesoderm (muscle, bone, blood, urogenital), or ectoderm (epidermal tissues and nervous system). Pluripotent stem cells can give rise to any fetal or adult cell type. However, alone they cannot develop into a fetal or adult animal because they lack the potential to contribute to extraembryonic tissue, such as the placenta. Illustrative, non-limitative examples of pluripotent cells include adipose-derived stem cells (ASCs), amniotic stem cells, bone marrow-derived stem cells (BMSCs), cord blood-derived stem cells (CBSCs), embryonic stem cells (ESCs), fetal stem cells (FSCs), endothelial stem cells, epidermal stem cells, haematopoietic stem and progenitor cells (HSPCs), mesenchymal stem cells (MSCs), neural stem cells (NSCs), retinal stem and progenitor cells (RSPCs), etc.
In a further embodiment, the pluripotent cell is an induced pluripotent stem cell, commonly abbreviated as iPS cell or iPSC, which is a type of pluripotent stem cell artificially derived from a non-pluripotent cell, typically an adult somatic cell, by inducing a “forced” expression of specific genes. iPSCs are similar to natural pluripotent stem cells in many respects, such as the expression of certain stem cell genes and proteins, chromatin methylation patterns, doubling time, embryoid body formation, teratoma formation, viable chimera formation, and potency and differentiability.
“Cancer cell” refers to a cell from a cancer or tumor or a cancer cell line. “Cancer” refers to a broad group of diseases involving unregulated cell growth and which are also referred to as malignant neoplasms. The term is usually applied to a disease characterized by uncontrolled cell division (or by an increase of survival or apoptosis resistance) and by the ability of said cells to invade other neighboring tissues (invasion) and spread to other areas of the body where the cells are not normally located (metastasis) through the lymphatic and blood vessels, circulate through the bloodstream, and then invade normal tissues elsewhere in the body. Depending on whether or not they can spread by invasion and metastasis, tumours are classified as being either benign or malignant: benign tumours are tumours that cannot spread by invasion or metastasis, i.e., they only grow locally; whereas malignant tumours are tumours that are capable of spreading by invasion and metastasis. Biological processes known to be related to cancer include angiogenesis, immune cell infiltration, cell migration and metastasis. Cancers usually share some of the following characteristics: sustaining proliferative signalling, evading growth suppressors, resisting cell death, enabling replicative immortality, inducing angiogenesis, and activating invasion and eventually metastasis. Cancers invade nearby parts of the body and may also spread to more distant parts of the body through the lymphatic system or bloodstream. Cancers are classified by the type of cell that the tumour cells resemble, which is therefore presumed to be the origin of the tumour. These types include:
- Carcinoma: Cancers derived from epithelial cells. This group includes many of the most common cancers, particularly in the aged, and include nearly all those developing in the breast, prostate, lung, pancreas, and colon.
- Sarcoma: Cancers arising from connective tissue (e.g. bone, cartilage, fat, nerve), each of which develops from cells originating in mesenchymal cells outside the bone marrow.
- Lymphoma and leukaemia: These two classes of cancer arise from hematopoietic (blood-forming) cells that leave the marrow and tend to mature in the lymph nodes and blood, respectively.
- Germ cell tumour: Cancers derived from pluripotent cells, most often presenting in the testicle or the ovary (seminoma and dysgerminoma, respectively).
- Blastoma: Cancers derived from immature “precursor” cells or embryonic tissue. Blastomas are more common in children than in older adults.
In a preferred embodiment the cancer cells are cells from a cancer selected from breast, ovarian, prostate, brain, pancreas, skin, bone, bone marrow, blood, thymus, uterus, testicles, hepatobiliary and liver tumors, adenoma, angiosarcoma, astrocytoma, epithelial carcinoma, germinoma, glioblastoma, glioma, hemangioendothelioma, hemangiosarcoma, hematoma, hepatoblastoma, leukaemia, lymphoma, medulloblastoma, melanoma, neuroblastoma, hepatobiliary cancer, osteosarcoma, retinoblastoma, rhabdomyosarcoma, sarcoma, teratoma, acral lentiginous melanoma, actinic keratosis, adenocarcinoma, adenoid cystic carcinoma, adenomas, adenosarcoma, adenosquamous carcinoma, astrocytic tumors, Bartholin gland carcinoma, basal cell carcinoma, bronchial gland carcinoma, capillary carcinoid, carcinoma, carcinosarcoma, cholangiocarcinoma, cystadenoma, endodermal sinus tumor, endometrial hyperplasia, endometrial stromal sarcoma, endometrioid adenocarcinoma, ependymal sarcoma, Ewing's sarcoma, focal nodular hyperplasia, germ cell tumors, glioblastoma, glucagonoma, hemangioblastoma, hemangioendothelioma, hemangioma, hepatic adenoma, hepatic adenomatosis, hepatocellular carcinoma, hepatobiliary cancer, insulinoma, intraepithelial neoplasia, interepithelial squamous cell neoplasia, invasive squamous cell carcinoma, large cell carcinoma, leiomyosarcoma, melanoma, malignant melanoma, malignant mesothelial tumor, medulloblastoma, medulloepithelioma, mucoepidermoid carcinoma, neuroblastoma, neuroepithelial adenocarcinoma, nodular melanoma, osteosarcoma, papillary serous adenocarcinoma, pituitary tumors, plasmacytoma, pseudosarcoma, pulmonary blastoma, renal cell carcinoma, retinoblastoma, rhabdomyosarcoma, sarcoma, serous carcinoma, small cell carcinoma, soft tissue carcinoma, somatostatin-secreting tumor, squamous carcinoma, squamous cell carcinoma, undifferentiated carcinoma, uveal melanoma, verrucous carcinoma, vipoma, Wilms' tumor, intracerebral cancer, head and neck cancer, rectal cancer, astrocytoma, glioblastoma, small cell cancer, and non-small cell cancer.
“Drug perturbed cell”, as used herein, relates to a cell treated with a compound that targets the cell machinery of transcription, the cell cycle or the proliferation process. Illustrative non-limitative examples of components of the transcription machinery are RNA polymerase; specificity factors (which alter the specificity of RNA polymerase for a given promoter or set of promoters, making it more or less likely to bind to them, e.g. sigma factors used in prokaryotic transcription); repressors (which bind to non-coding sequences on the DNA strand that are close to or overlapping the promoter region, impeding RNA polymerase's progress along the strand and thus impeding the expression of the gene); general transcription factors (which position RNA polymerase at the start of a protein-coding sequence and then release the polymerase to transcribe the mRNA); activators (which enhance the interaction between RNA polymerase and a particular promoter, encouraging the expression of the gene; activators do this by increasing the attraction of RNA polymerase for the promoter, through interactions with subunits of the RNA polymerase or indirectly by changing the structure of the DNA); enhancers (sites on the DNA helix that are bound by activators in order to loop the DNA, bringing a specific promoter to the initiation complex); and silencers (regions of DNA that are bound by transcription factors in order to silence gene expression). Chromatin remodeling through specific use of miRNA molecules presents one method by which euchromatin, typically associated with transcriptional activity, is converted to heterochromatin, reducing transcription. This occurs by means of the RNA-induced transcriptional silencing complex or “RITS”.
Illustrative non-limitative examples of such drugs are tamoxifen, bicalutamide and various types of anti-inflammatory and anabolic steroids, and enzyme inhibitors or activators such as kinase and acetylase inhibitors or activators. As a result of treatment with the drug, the cell may undergo a change in the transcription of a gene.
Step a) of the method for detecting the chromatin state of a cell comprises contacting a sample containing cells with a first antibody capable of specifically binding to a histone protein. Thus, according to an embodiment of the invention, an antibody:histone complex is formed by contacting a sample containing cells with a first antibody capable of specifically binding to a histone protein.
“Sample”, as used herein, refers to any biological sample that may contain cells, and it can be obtained by conventional methods known by those of average skill in the art, depending on the nature of the sample.
In a particular embodiment, said biological sample is a biopsy sample, tissue, cell or biofluid sample (plasma, serum, saliva, semen, sputum, cerebral spinal fluid (CSF), tears, mucus, sweat, milk, brain extracts and the like). Said biological samples can be obtained by any conventional method. In another aspect, the sample is a cell culture sample.
In a more preferred embodiment, the sample is a mouse or human commercial cell line. In another preferred embodiment, the sample is a biopsy sample from a human patient. In another preferred embodiment, the sample comprises primary cells purified from body parts of human donors.
As used herein, the term “antibody” refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i.e., molecules containing an antigen-binding site that binds specifically to (immunoreacts with) an antigen, such as a protein for example. There are 5 isotypes or main classes of immunoglobulins: immunoglobulin M (IgM), immunoglobulin D (IgD), immunoglobulin G (IgG), immunoglobulin A (IgA) and immunoglobulin E (IgE).
The antibodies that are going to be used in the present invention can be, for example, polyclonal sera, hybridoma supernatants or monoclonal antibodies, antibody fragments, Fv, Fab, Fab′ and F(ab′)2, scFv, diabodies, triabodies, tetrabodies and humanized antibodies.
The suitable conditions for the formation of the antibody:histone complex to take place are known to those skilled in the art. If the sample containing cells contains histone proteins, then the corresponding antibody:histone complex will be formed.
“Histone protein”, as used herein relates to a highly alkaline protein found in eukaryotic cell nuclei that packages and orders the DNA into structural units called nucleosomes. Five major families of histones exist: H1/H5, H2A, H2B, H3 and H4. Histones H2A, H2B, H3 and H4 are known as the core histones, while histones H1 and H5 are known as the linker histones. “Nucleosomes” are a repeating unit of the chromatin, formed by 146 base pairs (bp) of DNA wrapped around octamers of the four core histone proteins (H2A, H2B, H3 and H4).
As a person skilled in the art will appreciate, the histone protein can also be detected by detecting a functionally equivalent variant of a histone protein.
“Functionally equivalent variant” is understood to mean all those proteins derived from a histone sequence by modification, insertion and/or deletion of one or more amino acids, provided that the function is substantially maintained.
Assays to determine the function of an enzyme are known by the skilled person and include, without limitation, initial rate assays, progress curve assays, transient kinetics assays and relaxation assays. Continuous assays of enzymatic activity include, without limitation, spectrophotometric, fluorometric, calorimetric, chemiluminescent, light scattering and microscale thermophoresis assays. Discontinuous assays of enzymatic activity include, without limitation, radiometric and chromatographic assays. As the skilled person understands, factors that may influence enzymatic activity comprise salt concentration, temperature, pH, and substrate concentration.
The function of a histone can be determined by analyzing the compaction of DNA. The compaction of DNA can be assayed using several methods known in the art, for example, and without limitation, density gradient centrifugation of MNase-digested samples or the comet assay. Particularly, the function of H2B can be assayed by determining the phosphorylation of H2B at serine 14, which is linked to chromatin condensation. Additionally, by way of illustrative, non-limitative example, the function of H2B can be assayed by detecting acetylation at Lys12 and at Lys15 or ubiquitylation at Lys120, all of these modifications being associated with transcriptional activation, and thus with an open chromatin state.
Preferably, variants of a histone protein are (i) polypeptides in which one or more amino acid residues are substituted by a conserved or non-conserved amino acid residue (preferably a conserved amino acid residue), where such substituted amino acid may or may not be coded by the genetic code, (ii) polypeptides in which there are one or more modified amino acid residues, for example, residues modified by substituent bonding, (iii) polypeptides resulting from alternative processing of a similar mRNA, (iv) polypeptide fragments and/or (v) polypeptides resulting from the fusion of a histone, or of the polypeptide defined in (i) to (iii), with another polypeptide, such as a secretory leader sequence or a sequence being used for purification (for example, His tag) or for detection (for example, Sv5 epitope tag). The fragments include polypeptides generated through proteolytic cleavage (including multisite proteolysis) of an original sequence. The variants may be post-translationally or chemically modified. Such variants are supposed to be apparent to those skilled in the art.
As known in the art, the “similarity” between two polypeptides is determined by comparing the amino acid sequence and the conserved substituted amino acids of one polypeptide with the sequence of a second polypeptide. The variants are defined to include polypeptide sequences different from the original sequence, preferably different from the original sequence in less than 40% of residues per segment concerned, more preferably different from the original sequence in less than 25% of residues per segment concerned, more preferably different from the original sequence in less than 10% of residues per segment concerned, more preferably different from the original sequence in only a few residues per segment concerned and, at the same time, sufficiently homologous to the original sequence to preserve functionality of the original sequence. The present invention includes amino acid sequences which are at least 60%, 65%, 70%, 72%, 74%, 76%, 78%, 80%, 90%, or 95% similar or identical to the original amino acid sequence. The degree of identity between two polypeptides may be determined using computer algorithms and methods which are widely known to those skilled in the art. The identity between two amino acid sequences is preferentially determined using the BLASTP algorithm [BLAST Manual, Altschul, S. et al., NCBI NLM NIH Bethesda, Md. 20894, Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990)].
In a preferred embodiment, the histone protein is a core histone protein.
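To make the notion of percent identity concrete, the following minimal sketch counts identical positions between two sequences that have already been aligned to equal length. It is not BLASTP and performs no alignment itself; the function name, the gap symbol and the toy peptide strings are assumptions for illustration and are not real histone sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns holding the same residue.

    Both inputs must already be aligned (equal length, gaps marked with
    `gap`). Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (x, y) for x, y in zip(aligned_a, aligned_b)
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


# Toy aligned peptides differing at one position out of ten.
identity = percent_identity("PEPAKSAPAP", "PEPSKSAPAP")
print(identity)
```

A variant would meet, for example, the 90% threshold listed above when this value is at least 90 over the segment concerned; note that different tools define the denominator differently, so values are only comparable within one convention.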
“Core histone protein”, as used herein, refers to any histone selected from the group consisting of histone H2A, H2B, H3 and H4. In a more preferred embodiment, the core histone protein is H2B.
“H2B”, as used herein, refers to one of the 5 main histone proteins involved in the structure of chromatin in eukaryotic cells. Featuring a main globular domain and a long N-terminal tail, H2B is involved in the structure of the nucleosomes of the ‘beads on a string’ structure. H2B has 19 variants in humans. The detection of any variant of H2B can be used in the present invention.
As the person skilled in the art understands it may be necessary that, after contacting the sample with the first antibody, the sample is properly collected, fixed and/or sectioned. Cells in a sample can be fixed by any suitable process including perfusion or by submersion in a fixative. Fixatives can be classified as cross-linking agents (such as aldehydes, e.g., formaldehyde, paraformaldehyde, and glutaraldehyde, as well as non-aldehyde cross-linking agents), oxidizing agents (e.g., metallic ions and complexes, such as osmium tetroxide and chromic acid), protein-denaturing agents (e.g., acetic acid, methanol, and ethanol), fixatives of unknown mechanism (e.g., mercuric chloride, acetone, and picric acid), combination reagents (e.g., Carnoy's fixative, methacarn, Bouin's fluid, B5 fixative, Rossman's fluid, and Gendre's fluid), microwaves, and miscellaneous fixatives (e.g., excluded volume fixation and vapor fixation). Additives may also be included in the fixative, such as buffers, detergents, tannic acid, phenol, metal salts (such as zinc chloride, zinc sulfate, and lithium salts), and lanthanum. In a preferred embodiment, the fixative used in the present invention is a combination of methanol and ethanol, more particularly in a 1:1 ratio.
To reduce background staining, samples can be incubated with a buffer that blocks the reactive sites to which the primary or secondary antibodies may otherwise bind. Common blocking buffers include normal serum, non-fat dry milk, BSA, or gelatin. Commercial blocking buffers with proprietary formulations are available. Methods to eliminate background staining include dilution of the primary or secondary antibodies, changing the time or temperature of incubation, or using a different primary antibody. In a preferred embodiment, the blocking is carried out with a buffer comprising BSA.
The detection of the antibody:histone complex (step b) is carried out by contacting said complex with a secondary antibody, having at least one photoswitchable fluorophore adapted to be optically excited at a certain wavelength λ1 and to emit light at a wavelength λ2 different from λ1. When the sample having the antibody:histone complex is excited with optical energy, for instance by means of a laser beam of a wavelength λ1, those locations of the antibody:histone complex linked to the photoswitchable fluorophore emit light at the wavelength λ2.
“Fluorophore”, as used herein, refers to entities that can emit light of a certain emission wavelength when exposed to a stimulus, for example, an excitation wavelength.
“Photoswitchable” as used herein, relates to an entity which can be switched between different light-emitting or non-emitting states by incident light of different wavelengths. Typically, a “switchable” entity can be identified by one of ordinary skill in the art by determining conditions under which an entity in a first state can emit light when exposed to an excitation wavelength, switching the entity from the first state to the second state, e.g., upon exposure to light of a switching wavelength, then showing that the entity, while in the second state can no longer emit light (or emits light at a reduced intensity) or emits light at a different wavelength when exposed to the excitation wavelength. Examples of switchable entities are disclosed in WO 2008/091296. As a non-limiting example of a switchable fluorophore, Cy5 can be switched between a fluorescent and a dark state in a controlled and reversible manner by light of different wavelengths, e.g., 633 nm or 657 nm red light can switch or deactivate Cy5 to a stable dark state, while 405 nm or 532 nm light can switch or activate the Cy5 back to the fluorescent state.
In some cases, the fluorophore can be reversibly switched between the two or more states, e.g., upon exposure to the proper stimuli. For example, a first stimuli (e.g., a first wavelength of light) may be used to activate the switchable fluorophore, while a second stimuli (e.g., a second wavelength of light) may be used to deactivate the switchable fluorophore, for instance, to a non-emitting state. Any suitable method may be used to activate the fluorophore. For example, in one embodiment, incident light of a suitable wavelength may be used to activate the entity to emit light, i.e., the entity is photoswitchable. Thus, the photoswitchable fluorophore can be switched between different light-emitting or non-emitting states by incident light, e.g., of different wavelengths. The light may be monochromatic (e.g., produced using a laser) or polychromatic.
In another embodiment, the entity may be activated upon stimulation by electric field and/or magnetic field. In other embodiments, the entity may be activated upon exposure to a suitable chemical environment, e.g., by adjusting the pH, or inducing a reversible chemical reaction involving the entity, etc.
Similarly, any suitable method may be used to deactivate the entity, and the methods of activating and deactivating the entity need not be the same. For instance, the entity may be deactivated upon exposure to incident light of a suitable wavelength, or the entity may be deactivated by waiting a sufficient time.
In some embodiments, the switchable entity includes a first, light-emitting portion (e.g., a fluorophore), and a second portion that activates or “switches” the first portion.
Upon exposure to light, the second fluorophore may activate the first fluorophore, causing the first fluorophore to emit light. Examples of activator fluorophores include, but are not limited to, Alexa Fluor 405 (Invitrogen), Alexa 488 (Invitrogen), Cy2 (GE Healthcare), Cy3 (GE Healthcare), Cy3.5 (GE Healthcare), or Cy5 (GE Healthcare), or other suitable dyes. Examples of light-emitting portions include, but are not limited to, Cy5, Cy5.5 (GE Healthcare), or Cy7 (GE Healthcare), Alexa Fluor 647 (Invitrogen), or other suitable dyes. These may be linked together, e.g., covalently, for example, directly, or through a linker, e.g., forming compounds such as, but not limited to, Cy5-Alexa Fluor 405, Cy5-Alexa Fluor 488, Cy5-Cy2, Cy5-Cy3, Cy5-Cy3.5, Cy5.5-Alexa Fluor 405, Cy5.5-Alexa Fluor 488, Cy5.5-Cy2, Cy5.5-Cy3, Cy5.5-Cy3.5, Cy7-Alexa Fluor 405, Cy7-Alexa Fluor 488, Cy7-Cy2, Cy7-Cy3, Cy7-Cy3.5, or Cy7-Cy5. In a more preferred embodiment, the activator fluorophore is Alexa 405 and the light-emitting fluorophore is Alexa 647.
In another preferred embodiment, wavelength λ1 is 647 nm, wavelength λ2 is 670 nm and wavelength λ3 is 405 nm.
Any suitable method may be used to link the first, light-emitting fluorophore and the second, activation fluorophore. In some cases, a linker is chosen such that the distance between the first and second fluorophore is sufficiently close to allow the activator fluorophore to activate the light-emitting fluorophore as desired, e.g., whenever the light-emitting fluorophore has been deactivated in some fashion. Typically, the fluorophores will be separated by distances on the order of 500 nm or less, for example, less than about 300 nm, less than about 100 nm, less than about 50 nm, less than about 20 nm, less than about 10 nm, less than about 5 nm, less than about 2 nm, less than about 1 nm, etc. Examples of linkers include, but are not limited to, carbon chains (e.g., alkanes or alkenes), polymer units, or the like.
The switchable entity may comprise a first fluorophore directly bonded to the second fluorophore, or the first and second entity may be connected via a linker or a common entity. Whether a pair of light-emitting portion and activator portion produces a suitable switchable entity can be tested by methods known to those of ordinary skill in the art. For example, light of various wavelengths can be used to stimulate the pair and emission light from the light-emitting portion can be measured to determine whether the pair makes a suitable switch.
Additional details about fluorophores can be found in WO2009/085218.
Step c) of the method of the invention comprises recording a super resolution image of nucleosome organization by means of a sensor being sensitive at least to the wavelength of emission of the photoswitchable fluorophore by exciting the sample with an optical radiation having a wavelength λ1. Recording an image by means of an optical sensor aiming to the optically excited sample provides a bitmap image at certain resolution having information about the nucleosome organization. In particular, the image shows the projection over the focal plane of the sensor used for recording the image of the location of the photoswitchable fluorophores that have emitted light. This information will be used to provide characteristic length scales and density of some relevant structural parts of the protein that allows identifying nucleosomal organization and thus the chromatin state of a cell.
“Super resolution image”, as used herein, refers to an image with an axial and lateral resolution under 100 nm allowing single molecule localization. At present, super resolution images provide a resolution near the limit of the length scale defined by chromatin fibers, that is 10-30 nm.
In a preferred embodiment, the images obtained are characterized by a lateral (XY) resolution of approximately 20-30 nm and axial (Z) resolution of 50-60 nm.
The super resolution images can be obtained by any super resolution techniques known in the art. Super-resolution techniques allow the capture of images with a higher resolution than the diffraction limit. They fall into two broad categories, “true” super-resolution techniques, which capture information contained in evanescent waves, and “functional” super-resolution techniques, which use clever experimental techniques and known limitations on the matter being imaged to reconstruct a super-resolution image. There are two major groups of methods for functional super-resolution microscopy:
1. Deterministic super-resolution: The most commonly used emitters in biological microscopy, fluorophores, show a nonlinear response to excitation, and this nonlinear response can be exploited to enhance resolution. These methods include without limitation STED, GSD, RESOLFT and SSIM.
2. Stochastical super-resolution: The chemical complexity of many molecular light sources gives them a complex temporal behaviour, which can be used to make several close-by fluorophores emit light at separate times and thereby become resolvable in time. These methods include without limitation SOFI and all single-molecule localization methods (SMLM) such as SPDM, SPDMphymod, PALM, FPALM, STORM and dSTORM.
In a preferred embodiment, the super resolution image is obtained by a stochastical super resolution technique, preferably STORM, PALM or FPALM, and more preferably by STORM. STORM combines two concepts: single molecule localization and fluorophore photoswitching. The first concept allows one to localize the position of a single fluorophore with nanometer precision. Photoswitching makes it possible to “turn off” most fluorophores into a dark state and “turn on” only a small subset of them at a time. As a result, the images of the “active” fluorophores are isolated in space and their positions can be localized with high precision. Once all the fluorophores are imaged and their positions are localized, a high-resolution image can be reconstructed from these localizations. To date, the spatial resolution achieved by this technique is ˜20 nm in the lateral dimensions and ˜50 nm in the axial dimension. More details of STORM technology are described in WO2013090360, WO2009085218 and EP2378343.
In a preferred embodiment, a plurality of super resolution images is taken by means of a sensor being sensitive at least to the emission wavelength λ2 of the photoswitchable fluorophore, rendering a further super resolution image by collecting the sensed light emissions recorded in the plurality of images. According to said further embodiment, a plurality of images is recorded and post-processed in order to obtain a new image with the accumulated value of the optical radiation emitted by the sample. When a sample is excited with an optical radiation, some of the photoswitchable fluorophores are activated and other photoswitchable fluorophores are not. The new image provides information on a larger number of locations of photoswitchable fluorophores because, over many images, the probability of recording the light emission of any given photoswitchable fluorophore is higher.
According to an embodiment, a pair of different photoswitchable fluorophores is used. The first photoswitchable fluorophore is adapted to be optically excited at a certain wavelength λ1 and to emit light at a wavelength λ2 different from λ1; and, the second photoswitchable fluorophore is adapted to be optically excited at a wavelength λ3 and reactivate the first fluorophore by bringing it from its dark state back to its ground state.
Thus, in a preferred embodiment of the method of the invention, the secondary antibody further comprises a second fluorophore adapted to be optically excited at a wavelength λ3 and reactivate the first fluorophore by bringing it from its dark state back to its ground state, upon which the first fluorophore can be excited again at its excitation wavelength λ1 and emit light at its emission wavelength λ2.
In this case, in a first step the sample is excited with an optical radiation having a wavelength λ1, turning the first fluorophore to a dark state. A further optical radiation having a wavelength λ3 excites the second photoswitchable fluorophore, which reactivates the first fluorophore by bringing it from its dark state back to its ground state, upon which the first fluorophore can be excited again at its excitation wavelength λ1 and emit light at its emission wavelength λ2. This last excitation using an optical radiation at a wavelength λ1 provides the emission at an emission wavelength λ2 that is recorded in at least one image.
In a preferred embodiment, the power of the optical radiation having a wavelength λ3 is monotonically increased. In an example, the optical radiation at a wavelength λ3 has been gradually increased in a sigmoidal manner reaching a maximum power value, keeping this maximum value until the fluorophores are exhaustively imaged and photobleached.
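As a minimal sketch of such a monotonically increasing, sigmoidal power schedule, a logistic ramp over the acquisition frames may be used. The function name, the parameter names, the power units and the example values below are all assumptions made for illustration; the source does not specify the shape parameters of the ramp.

```python
import math


def activation_power(frame: int, max_power: float,
                     midpoint_frame: float, width_frames: float) -> float:
    """Hypothetical logistic ramp for the power of the lambda-3 radiation.

    The power rises monotonically with the frame index, equals half of
    max_power at midpoint_frame, and saturates at max_power afterwards.
    The tanh form is mathematically the logistic function and does not
    overflow for frames far from the midpoint.
    """
    x = (frame - midpoint_frame) / width_frames
    return max_power * 0.5 * (1.0 + math.tanh(x / 2.0))


# Example schedule with made-up values: 10000 frames, plateau at 2.0 (arbitrary units).
schedule = [activation_power(f, 2.0, 5000, 500) for f in range(0, 10001, 1000)]
print([round(p, 4) for p in schedule])
```

Once the plateau is reached, the power stays at its maximum for the remaining frames, which corresponds to holding the maximum value until the fluorophores are exhaustively imaged and photobleached.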
In another preferred embodiment before recording each super resolution image of the plurality of super resolution images, the sample is excited once or more times with an optical radiation having a wavelength λ1 and subsequently excited once or more times with an optical radiation having a wavelength λ3.
As the skilled person knows, during imaging, only an optically resolvable subset of fluorophores is activated to a fluorescent state at any given moment, such that the position of each fluorophore can be determined with high precision by finding the centroid position of the single-molecule image of that particular fluorophore. The fluorophore is subsequently deactivated, and another subset is activated and imaged. Iteration of this process allows numerous fluorophores to be localized and a super-resolution image to be constructed from the image data.
One fluorophore is recorded in the image by a plurality of pixels grouped in a region of the said image. The value of each pixel is associated to a certain value of radiation. The location of the fluorophore needs to be determined for the set of pixels having information of that fluorophore.
A more complex situation arises when two or more fluorophores are close enough that a plurality of pixels shows the accumulated radiation of the plurality of fluorophores. That is, the radiation value represented in a single pixel may be the contribution of the radiation from more than one fluorophore.
The individual locations of photoswitchable fluorophores and cluster information need to be identified over the image.
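The simple case, in which one fluorophore is recorded by a small group of pixels, can be illustrated with an intensity-weighted centroid over that group. This is only a sketch: the function name and the background parameter are assumptions, and the sketch does not handle the more complex case of overlapping emitters.

```python
def weighted_centroid(patch, background=0.0):
    """Sub-pixel (x, y) position of a single emitter in a small pixel patch.

    patch is a list of rows of pixel values; x runs along columns and y
    along rows, both in pixel units. Values at or below the assumed
    background level contribute nothing.
    """
    total = 0.0
    sum_x = 0.0
    sum_y = 0.0
    for row_idx, row in enumerate(patch):
        for col_idx, value in enumerate(row):
            signal = max(value - background, 0.0)
            total += signal
            sum_x += signal * col_idx
            sum_y += signal * row_idx
    if total == 0.0:
        raise ValueError("patch contains no signal above background")
    return sum_x / total, sum_y / total


# A symmetric spot is localized at the central pixel.
print(weighted_centroid([[0, 1, 0],
                         [1, 4, 1],
                         [0, 1, 0]]))
# A spot shared equally by two neighbouring pixels falls halfway between them.
print(weighted_centroid([[0, 0, 0],
                         [0, 2, 2],
                         [0, 0, 0]]))
```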
Step d) of the present invention, comprises correlating the image obtained in step c) with size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutches.
“Nucleosome clutch”, as used herein relates to a heterogeneous nucleosome group.
“Size of nucleosomal clutches”, as used herein, relates to the number of nucleosomes per clutch.
“Nucleosomal density”, as used herein, relates to the number of nucleosomes in a clutch divided by the unit area of that clutch.
Thus, according to the invention, the image obtained in step c) is converted to a list of “fluorescent probe positions”. Several known software packages can be used for obtaining fluorescent probe positions, as an illustrative, non-limiting example Insight 3, provided by Bo Huang, University of California, San Francisco. Briefly, peaks in single-molecule images are identified based on a threshold and fit to a simple Gaussian to determine the x and y positions. The final images are rendered by representing each x-y position (localization) as a Gaussian with a width that corresponds to the determined localization precision (9 nm). Sample drift during acquisition is calculated and subtracted by reconstructing STORM images from subsets of frames (typically 500-1000 frames, for which drift is assumed to be small) and correlating these images to a reference frame (typically one that is reconstructed at the initial time segment). For multicolor images, each peak is color coded based on whether the emission is recorded immediately after λ3 or another activation wavelength (λ4). The peaks coming from a frame other than the one right after an activation frame are coded as “non-specific”. A crosstalk algorithm as described previously is applied to correct for non-specific activations by the imaging laser (Dani et al., 2010). Briefly, the number of “apparent specific” activations is calculated from the frame immediately following the activation pulse and the number of “non-specific” activations from subsequent imaging frames in the imaging cycle. Assuming that the probability of “non-specific” activations is constant across all frames, the number of “actual specific” activations can be determined by subtracting the “non-specific” activation number from the “apparent specific” activation number. These numbers are then used to statistically subtract crosstalk due to “non-specific” activations in an unbiased way as previously described (Dani et al., 2010).
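The crosstalk subtraction described above reduces to simple arithmetic, sketched here under one stated assumption: the “non-specific” count collected over several subsequent imaging frames is normalized to a per-frame value before it is subtracted from the single post-activation frame. The function and parameter names are hypothetical and the sketch is not the algorithm of Dani et al., 2010.

```python
def actual_specific_activations(apparent_specific: float,
                                nonspecific_total: float,
                                nonspecific_frames: int) -> float:
    """Estimate the "actual specific" activations in one imaging cycle.

    apparent_specific: activations counted in the frame immediately
        following the activation pulse.
    nonspecific_total: activations counted in the subsequent imaging
        frames of the same cycle.
    nonspecific_frames: number of those subsequent frames.

    Assumption: the probability of a non-specific activation is the same
    in every frame, so its expected count in the post-activation frame
    equals the per-frame average of the subsequent frames.
    """
    if nonspecific_frames <= 0:
        raise ValueError("at least one subsequent imaging frame is required")
    nonspecific_per_frame = nonspecific_total / nonspecific_frames
    # A count cannot be negative, so the estimate is clipped at zero.
    return max(apparent_specific - nonspecific_per_frame, 0.0)


# Made-up counts: 120 apparent specific activations, 60 non-specific
# activations spread over 3 subsequent frames.
print(actual_specific_activations(120, 60, 3))
```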
Additionally, the position lists can be used to construct discrete localization images, such that each pixel has a value equal to the number of localizations falling within the pixel area. By way of illustrative, non-limitative example, the pixel size is ≥ the location accuracy; in a more preferred embodiment the pixel size is 10 nm. From the localization images, density maps may be obtained by 2-dimensional convolution with a square kernel, by way of illustrative, non-limitative example preferably ≥1×1 pixels², more preferably 5×5 pixels², although the kernel can have different shapes. A constant threshold may be used to digitize the density maps into binary images, such that pixels have a value of 1 where the density is larger than the threshold value and a value of 0 elsewhere. Localizations falling on zero-valued pixels of the binary images (low-density areas) may be discarded from further analysis. Connected components of the binary image, composed of adjacent non-zero pixels (4-connected neighborhood), are sequentially singled out and analyzed. Localization coordinates within each connected component can be grouped by means of a distance-based clustering algorithm. Initialization values for the number of clusters and the relative centroid coordinates can be obtained from local maxima of the density map within the connected region, calculated by means of a peak finding routine. Localizations may be associated to clusters based on their proximity to cluster centroids. New cluster centroid coordinates can be iteratively calculated as the average of localization coordinates belonging to the same cluster. The procedure is iterated until convergence of the sum of the squared distances between localizations and the associated cluster centroid, and provides cluster centroid positions and the number of localizations per cluster. Cluster sizes can be calculated as the standard deviation of localization coordinates from the relative cluster centroid.
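The iterative grouping of localizations around cluster centroids can be sketched as follows. The sketch assumes that the initial centroids (seeds) have already been obtained, for instance from local maxima of the density map, and that all localizations belong to a single connected component. All names are hypothetical, and the loop is a generic centroid-refinement scheme rather than a specific implementation taken from the source.

```python
import math


def sq_dist(p, q):
    """Squared Euclidean distance between two (x, y) points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def refine_clusters(points, seeds, max_iter=100, tol=1e-9):
    """Assign localizations to the nearest centroid and update centroids.

    Iterates until the sum of squared distances between localizations and
    their associated centroid stops decreasing (within tol).
    Returns (centroids, labels, localizations_per_cluster).
    """
    centroids = [(float(x), float(y)) for x, y in seeds]
    labels = [0] * len(points)
    previous_cost = math.inf
    for _ in range(max_iter):
        # Associate each localization with its closest centroid.
        labels = [min(range(len(centroids)),
                      key=lambda k: sq_dist(p, centroids[k]))
                  for p in points]
        cost = sum(sq_dist(p, centroids[k]) for p, k in zip(points, labels))
        # Recompute each centroid as the mean of its localizations.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centroids[k] = (sum(m[0] for m in members) / len(members),
                                sum(m[1] for m in members) / len(members))
        if previous_cost - cost <= tol:
            break
        previous_cost = cost
    counts = [labels.count(k) for k in range(len(centroids))]
    return centroids, labels, counts


# Two well separated toy groups of localizations (coordinates in nm).
pts = [(0, 0), (0, 2), (2, 0), (2, 2),
       (100, 100), (100, 102), (102, 100), (102, 102)]
print(refine_clusters(pts, seeds=[(1, 0), (99, 100)]))
```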
In an embodiment, a super resolution image is rendered from the list of locations (x,y) determined as the coordinates in the sample where an optical emission of a photoswitchable fluorophore adapted to emit light at a wavelength λ2 is present. In an example, peaks in single-molecule image are identified wherein only values over a predetermined threshold value are taken into account. The relevant values, those values over the threshold value, are fit to a simple Gaussian to determine the x and y positions over the image. The x and y position over the image can be correlated to the physical x and y coordinates over the sample for instance once the limits of the image over the sample are known. Then, the set of locations (x,y) may be provided as a list.
A further procedure uses data in a form of a list of coordinates (x, y), each coordinate (x, y) corresponding to one location of a photoswitchable fluorophore.
Starting from the information on the location (x, y) of the fluorophores obtained from the image or images, in an embodiment of the invention clutches and relevant parameters of said clutches are provided.
In a first step, a density image of resolution lower than or equal to the rendered high resolution image used for the determination of the locations (x, y) and representing the same area as said rendered high resolution image is provided wherein each pixel of the density image has a value proportional to the number of locations of the location list falling within the area represented by said pixel. In particular, the value is taken as the number of localizations falling within the pixel area represented by the pixel.
In a second step, a binary image representing the same area as the density image is provided, comprising zero value pixels if the corresponding value represented by the density image in the same location is lower than a predefined threshold, and nonzero if said value is higher. Zero and nonzero values (for example 1) are examples of binary values representing two different levels: a first level corresponding to pixel values under the threshold value and a second level corresponding to pixel values equal to or over the threshold value.
Regions of pixels corresponding to the second level comprise clutches, which are shown as clusters of pixels. A third step identifies connected regions of pixels representing values higher than the predefined threshold, that is, the binary value representing the second level.
In a fourth step, the localization of clutches is identified from the binary image and the list of localizations. For each connected region, the localization coordinates falling within said connected region are grouped according to a distance-based criterion. Each group of locations is deemed to belong to the same clutch.
The position of the clutch is taken as the centroid position of the localization coordinates associated with said clutch.
The fourth step provides a list of the positions of the clutches calculated as disclosed. Once the positions of the clutches in each region are known, the number of clutches per region, the density calculated using a distance-based criterion and other statistical values may be used as measurement parameters for determining criteria that allow discerning whether a cell is in an open chromatin state or in a close chromatin state according to particular embodiments of the invention.
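The first three steps (density image, binary image and connected regions) can be sketched in a few lines. The sketch makes several assumptions that are not in the source: the function names, a square pixel of 10 nm, localizations given in nm from the image origin, and a threshold applied directly to the raw counts without any prior smoothing.

```python
from collections import deque


def density_image(localizations, n_rows, n_cols, pixel_nm=10.0):
    """Count the localizations (x, y in nm) falling in each pixel."""
    image = [[0] * n_cols for _ in range(n_rows)]
    for x, y in localizations:
        row, col = int(y // pixel_nm), int(x // pixel_nm)
        if 0 <= row < n_rows and 0 <= col < n_cols:
            image[row][col] += 1
    return image


def binarize(image, threshold):
    """Second level (1) for values equal to or over the threshold, else 0."""
    return [[1 if value >= threshold else 0 for value in row] for row in image]


def connected_regions(binary):
    """Group nonzero pixels into 4-connected regions (lists of (row, col))."""
    n_rows = len(binary)
    n_cols = len(binary[0]) if binary else 0
    seen = [[False] * n_cols for _ in range(n_rows)]
    regions = []
    for r in range(n_rows):
        for c in range(n_cols):
            if not binary[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            region = []
            while queue:  # breadth-first flood fill
                cr, cc = queue.popleft()
                region.append((cr, cc))
                for nr, nc in ((cr - 1, cc), (cr + 1, cc),
                               (cr, cc - 1), (cr, cc + 1)):
                    if (0 <= nr < n_rows and 0 <= nc < n_cols
                            and binary[nr][nc] and not seen[nr][nc]):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            regions.append(region)
    return regions


# Toy localization list in nm on a 100 nm x 100 nm field.
locs = [(5, 5), (6, 4), (15, 5), (95, 95), (96, 94)]
image = density_image(locs, n_rows=10, n_cols=10)
print(connected_regions(binarize(image, threshold=1)))
```

The localization coordinates falling within each returned region would then be grouped by the distance-based criterion of the fourth step.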
Thus, in an embodiment, a density image of resolution lower than or equal to the rendered high resolution image and representing the same area as said rendered high resolution image is provided wherein each pixel of the density image has a value proportional to the number of locations of the list of location coordinates falling within the area represented by said pixel,
a binary image representing the same area as the density image comprising zero value pixels if the corresponding value represented by the density image in the same location is lower than a predefined threshold; and, nonzero if said value is higher, is provided,
identifying connected regions of pixels representing values higher than the predefined threshold,
for each connected region, providing a list of clutch positions by grouping the localization coordinates within said connected region according to a distance-based criterion being the position of the clutch the centroid position of the localization coordinates associated with said clutch.
In a preferred embodiment, the method comprises identifying connected regions of nonzero pixels.
In a preferred embodiment, the size of nucleosomal clutch is calculated as a measure of the spreading of the positions of all the localization coordinates associated with said clutch and/or the number of nucleosomes within said clutch.
In another preferred embodiment, the density of clutches within a connected region is calculated as the number of nucleosomes per clutch divided by the area occupied by said clutches.
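A minimal sketch of these two measurements follows. It assumes that the spreading is taken as the root-mean-square distance of the localizations from their centroid (one possible reading of a standard deviation about the centroid), and that both the number of nucleosomes and the occupied area are supplied by the caller, since the source does not fix how they are derived from the localizations. All names are hypothetical.

```python
import math


def clutch_spread_nm(localizations):
    """Root-mean-square distance (nm) of localizations from their centroid."""
    n = len(localizations)
    if n == 0:
        raise ValueError("a clutch needs at least one localization")
    cx = sum(x for x, _ in localizations) / n
    cy = sum(y for _, y in localizations) / n
    mean_sq = sum((x - cx) ** 2 + (y - cy) ** 2 for x, y in localizations) / n
    return math.sqrt(mean_sq)


def nucleosomal_density(n_nucleosomes, area_nm2):
    """Nucleosomes per nm^2: nucleosome count divided by the occupied area."""
    if area_nm2 <= 0:
        raise ValueError("area must be positive")
    return n_nucleosomes / area_nm2


# Toy clutch: four localizations on the corners of a 2 nm square.
print(clutch_spread_nm([(0, 0), (0, 2), (2, 0), (2, 2)]))
# Made-up values: 4 nucleosomes over 1000 nm^2.
print(nucleosomal_density(4, 1000.0))
```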
The method of the invention further comprises step e) comparing data obtained in step d) with a corresponding reference value to obtain a score based on size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch.
“Reference value”, as used herein relates to a laboratory value used as a reference for the values/data obtained from samples. The reference value (or reference level) can be an absolute value, a relative value, a value which has an upper and/or lower limit, a series of values, an average value, a median, a mean value, or a value expressed by reference to a control or reference value. A reference value can be based on the value obtained from an individual sample, such as, for example, a value obtained from a sample of study but obtained at a previous point in time. The reference value can be based on a high number of samples, such as the values obtained in a population of samples. In order to detect a cell in an open chromatin state, the reference value can be based on the clutches area, number of nucleosomes per nucleosomal clutch, or nucleosome density of clutches from a cell in a close chromatin state, by way of illustrative non-limitative example from a non-cancer cell, a terminally differentiated cell or from a cell wherein the machinery of transcription is inactive.
In another preferred embodiment, the reference value can be based on the clutch area, number of nucleosomes per nucleosomal clutch or nucleosome density of clutches from cells with an open chromatin state or alternatively with a more open chromatin state, by way of illustrative, non-limitative example highly transcriptionally activated cells, highly pluripotent cells, ESCs and iPSCs. Cells with a more open chromatin state may correspond to cells with a higher grade of pluripotency. The grade of pluripotency in a cell can be determined, for example, with a gene card technology (Bock et al, 2011).
In another preferred embodiment, in order to detect a cell in a close chromatin state, the reference value is based on the clutch area, number of nucleosomes per nucleosomal clutch or nucleosome density of clutches from cells known to be in a close chromatin state.
In a preferred embodiment, the reference value for discriminating among different cell types based on the number of nucleosomes per clutch is, by way of illustrative non-limitative example, <=5 nucleosomes per clutch. In another preferred embodiment, the reference value for discriminating among different cell types based on the density of nucleosomes per clutch is, by way of illustrative non-limitative example, <=0.005 nucleosomes/nm².
Once the reference value has been established, the size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch is compared with the reference value. As a consequence of this comparison, the size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch can be “greater than”, “bigger than” or “more than”; “less than” or “smaller than”; or “equal to” the corresponding reference value.
In the context of the present invention, the size of nucleosomal clutches, the nucleosomal density or the number of nucleosomes per nucleosomal clutch is “greater than”, “more than” or “bigger than” the corresponding reference value when it is, by way of illustrative, non-limitative example, at least 1.1-fold, 1.5-fold, 2-fold, 5-fold, 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold, 80-fold, 90-fold, 100-fold or even more when compared with the reference value for said parameter. On the other hand, the size of nucleosomal clutches, the nucleosomal density or the number of nucleosomes per nucleosomal clutch is “lower than” or “smaller than” the corresponding reference value when it is decreased, by way of illustrative, non-limitative example, by at least 5%, 10%, 25%, 50%, 75%, or even 100%.
According to the invention, if the cell comprises smaller clutches, less densely compacted nucleosomes or fewer nucleosomes per clutch compared to the corresponding reference value, this is indicative that said cell is in an open chromatin state.
According to the invention, if the cell comprises bigger clutches, more densely compacted nucleosomes or more nucleosomes per clutch compared to the corresponding reference value, this is indicative that said cell is in a close chromatin state.
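The comparison with the reference value can be illustrated with the following sketch. It uses the illustrative reference values given above (5 nucleosomes per clutch and 0.005 nucleosomes/nm²) as defaults. The function names, and the choice to report “indeterminate” when the two parameters disagree or equal the reference, are assumptions of this sketch and not part of the described method.

```python
def compare(value, reference):
    """Return 'smaller', 'greater' or 'equal' relative to the reference."""
    if value < reference:
        return "smaller"
    if value > reference:
        return "greater"
    return "equal"


def chromatin_state(nucleosomes_per_clutch, density_per_nm2,
                    ref_nucleosomes=5.0, ref_density=0.005):
    """Score a cell as 'open', 'close' or 'indeterminate'.

    Both parameters must fall on the same side of their reference value
    for a call to be made (an assumption of this sketch).
    """
    calls = {compare(nucleosomes_per_clutch, ref_nucleosomes),
             compare(density_per_nm2, ref_density)}
    if calls == {"smaller"}:
        return "open"
    if calls == {"greater"}:
        return "close"
    return "indeterminate"


# Made-up measurements for three hypothetical cells.
print(chromatin_state(3, 0.002))
print(chromatin_state(12, 0.010))
print(chromatin_state(3, 0.010))
```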
In another preferred embodiment, the method for detecting the chromatin state of a cell further comprises detecting the RNA polymerase II association to the nucleosome.
According to this aspect of the invention, a higher association of RNA polymerase II to the nucleosome is indicative that said cell is in an open chromatin state.
“RNA polymerase II”, as used herein, relates to an enzyme that catalyzes the transcription of DNA to synthesize precursors of mRNA and most snRNA and microRNA.
In a preferred embodiment, the RNA pol II subunit B1 is detected. The sequence of RNA pol II subunit B1 in humans corresponds to the sequence P24928 in the Uniprot database 3 Sep. 2014.
In another aspect, the invention further comprises detecting the linker histone H1.
“Histone H1”, as used herein, relates to a protein involved in the packing of the “beads on a string” sub-structures into a higher-order structure. The sequence of histone H1 in humans corresponds to the sequence Q02539 in the Uniprot database 3 Sep. 2014.
According to this aspect of the invention, a higher association of histone H1 to the nucleosome is indicative that said cell is in a close chromatin state.
The association of RNA polymerase II or H1 to the nucleosome can be detected by any method known in the art. In a preferred embodiment, the association is detected by multicolor super resolution imaging as described in Bates et al., 2007.
Method for Isolating a Cell in an Open or Close Chromatin State
In another aspect, the invention relates to a method for isolating a cell in an open chromatin state comprising
- a) detecting the chromatin state of a cell by a method according to the invention, and
- b) isolating a cell having smaller clutches, less densely compacted nucleosomes or fewer nucleosomes per clutch.
In a preferred embodiment the cell in an open chromatin state is selected from the group consisting of transcriptionally active cells, pluripotent cells, cancer cells and drug perturbed cells. In a more preferred embodiment, the cell in an open chromatin state is a pluripotent cell.
In another aspect, the invention relates to a method for isolating a cell in a close chromatin state comprising
- a) detecting the chromatin state of the cell by a method according to the invention, and
- b) isolating a cell having bigger clutches, more densely compacted nucleosomes or more nucleosomes per clutch.
All the terms and embodiments previously described are equally applicable to this aspect of the invention.
Kit of the Invention
In another aspect, the invention relates to a kit comprising a first antibody capable of specifically binding to a histone protein and a photoswitchable fluorophore linked-secondary antibody.
In the context of the present invention, “kit” is understood as a product containing the different reagents necessary for carrying out the methods of the invention packed so as to allow their transport and storage. Additionally, the kits of the invention can contain instructions for the simultaneous, sequential or separate use of the different components which are in the kit. Said instructions can be in the form of printed material or in the form of an electronic support capable of storing instructions susceptible of being read or understood, such as, for example, electronic storage media (e.g. magnetic disks, tapes), or optical media (e.g. CD-ROM, DVD), or audio materials. Additionally or alternatively, the media can contain internet addresses that provide said instruction.
In a preferred embodiment, the first antibody capable of specifically binding to a histone protein and a photoswitchable fluorophore linked-secondary antibody comprise at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90% or at least 100% of the total amount of reagents forming the kit.
In a preferred embodiment, the histone protein is a core histone protein, more preferably histone H2B.
All the terms and embodiments previously described are equally applicable to this aspect of the invention.
Use of the Kit
In another aspect, the invention relates to the use of the kit according to the invention for detecting the chromatin state of a cell and isolating a cell in an open chromatin state or in a close chromatin state.
In a preferred embodiment, the detection of the chromatin state of a cell and the isolation of a cell in an open chromatin state or in a close chromatin state is performed by a method of the invention.
All the terms and embodiments previously described are equally applicable to this aspect of the invention.
Device
According to another aspect of the invention, the method of the first and second aspect of the invention may be carried out by means of a device adapted to detect the chromatin state of a cell comprising:
- a source of optical radiation adapted to emit light at a wavelength λ1 over an interrogation area adapted to receive a biological sample,
- an optical sensor sensitive to a second wavelength λ2 and adapted to measure the optical radiation at λ2,
- a control unit connected to the optical sensor and to the source of optical radiation wherein said control unit is adapted to carry out the method according to the invention.
The source of the optical radiation may be in the form of a laser source. The interrogation area is the area where the sample is located, and it is the area at which the optical sensor is aimed so that the image taken by the optical sensor is focused on the sample. In particular, the optical sensor is sensitive to the second wavelength λ2, that is, the wavelength of the radiation emitted by the photoswitchable fluorophores linked to the antibody:histone complex, thereby determining its location.
The control unit is configured to have the control over the source of the optical radiation and the optical sensor to allow recording images of samples when the photoswitchable fluorophores are excited.
In an embodiment wherein the device further comprises a source of optical radiation adapted to emit light at a wavelength λ3 over an interrogation area adapted to receive a biological sample, the control unit is further adapted to carry out the method of the invention.
In an embodiment, the control unit is a programmable unit and is adapted to execute a computer program. According to a further embodiment, the control unit is an ASIC unit being programmed to carry out the control over the source of the optical radiation and the optical sensor to allow recording images of samples when the photoswitchable fluorophores are excited.
In a further embodiment, the control unit is adapted to carry out a post-processing of the image for the assessment of parameters over a sample. In a preferred example, the control unit is configured to carry out steps first, second, third and fourth for the calculation of position of clutches.
A computer program configured to carry out any of the disclosed methods for processing images and obtaining information on clutches and their spatial distribution also forms part of the invention.
The invention will be described by way of the following examples which are to be considered as merely illustrative and not limitative of the scope of the invention.
Material and Methods
Cells.
Human fibroblasts (hFb) (BJ, Skin Fibroblast, American Type Culture Collection, ATCC® CRL-2522™) were cultured in DMEM supplemented with 10% FBS, 1× Non-essential AA, 1× GlutaMax and 1× penicillin/streptomycin. Human fibroblasts were treated with 300 nM of TSA (Trichostatin A, Sigma-Aldrich) solution (TSA-hFbs) in complete growth medium for 24 hours before imaging experiments. Human fibroblasts expressing the Histone H2B-SNAP fusion protein were obtained after drug selection of nucleofected cells with the pSNAP-H2B Plasmid (N91795, New England BioLabs) using the Amaxa Human Dermal Fibroblast Nucleofector Kit (Lonza, VPD-1001).
mESCs and mESCsTcf3−/− were previously described (Merrill et al., 2004). mESCsH1tkO were a gift from Arthur I. Skoultchi (Fan et al., 2005). mESCs were cultured on gelatin in sLif medium composed of KO DMEM supplemented with 15% FBS (Hyclone), 1× Non-Essential Amino acid, 1× GlutaMax (Invitrogen), 1× penicillin/streptomycin, 1× 2-mercaptoethanol and 1,000 U/mL LIF ESGRO (Chemicon). mESCs were also cultured in 2iLif medium composed of N2B27 medium supplemented with 3 μM CHIR99021, 1 μM PD0325901, 1,000 U/mL LIF and 1× penicillin/streptomycin for eight passages before imaging experiments.
mNPCs were generated by culturing mESCs as cell aggregates with 5 μM retinoic acid (RA) as previously described (Bibel et al., 2007). Neuronal progenitor cells were fixed 2 days after plating dissociated cellular aggregates.
In-Vitro Polynucleosome Arrays.
The regular 12-mer and 24-mer DNA templates (gift from S. Grigoryev, (Grigoryev et al., 2009)) were isolated from Escherichia coli and reconstituted with native histone octamers from HeLa cells using the ‘In vitro Chromatin Assembly Kit’ (CA-vitro-003, DIAGENODE). Chromatin was purified over a column of 4% agarose beads (cat#: A-1040-M, ABT, Agarose Bead Technologies) in a 0.5×20 cm Econo-Column (BioRad) and immediately used for experiments. To induce highly compact folding before STORM imaging, the purified polynucleosomes were spotted on a coverglass and incubated overnight at 4° C. in the presence of 1 mM MgCl2 and 150 mM NaCl, then fixed with PFA 4% solution for 10 min at 4° C.
Mononucleosomes were reconstituted as described before (Workman and Kingston, 1992). Briefly, naked 200 bp DNA was mixed with HeLa octamers in a 1:1 w/w ratio in a reconstitution mix with 10 mM Tris-HCl, pH 8, 20 mM EDTA, 2M NaCl, 10 mM DTT, 2 mM 2-mercaptoethanol, 15 ng/μl BSA and left in mini dialysis chamber in a floater for dialysis in a high salt concentrated buffer for 2 h at 4° C. Then the samples were dialyzed over 20 h at 4° C. continuously diluting the concentration of NaCl from 2M to 0M. Mononucleosomes were collected from the mini dialysis chamber and centrifuged. Then they were spotted on a coverglass and left at 4° C. overnight and finally fixed in PFA 4% for 10 minutes at 4° C. HeLa's octamers were spotted on a coverglass and fixed in the same way after overnight incubation at 4° C. without addition of salts.
12-polynucleosome arrays were prepared for EM according to standard protocols (CA-vitro-003, DIAGENODE). Purified and undiluted samples were applied to air glow discharged continuous carbon (hydrophilic-negatively charged surface), contrasted with Uranyl Formate and examined in a Philips Biotwin microscope at 120 kV. Images were recorded on a KeenView CCD camera (SIS Olympus) (Electron Microscopy Core Facility of European Molecular Biology Laboratory, EMBL Heidelberg).
Human Induced Pluripotent Stem Cells Generation and Characterization.
Integration-free hiPSCs were generated as described previously (Okita et al., 2011). Briefly, a combination of episomal vectors encoding OCT3/4-shp53, SOX2, KLF4 and L-MYC (Addgene, #27077, #27078, #27080) was nucleofected into human skin fibroblast cells (BJ, American Type Culture Collection, ATCC® CRL-2522™) using the Amaxa Human Dermal Fibroblast Nucleofector Kit (Lonza, VPD-1001). Normal fibroblast medium (DMEM supplemented with 10% FBS, 1× GlutaMax and 1× penicillin/streptomycin) was changed every day. On day 7, the nucleofected fibroblasts were reseeded onto a monolayer of feeder cells and on day 8 the normal medium was changed to hiPSC medium (DMEM/F12, 20% KO-SR, 1× Minimum Non-Essential Amino acid, 1× GlutaMax (Invitrogen), 1× penicillin/streptomycin, 1000× 2-Mercaptoethanol, supplemented with 10 ng/mL fresh basic FGF just before feeding the cells). Medium was changed every day the first week and then every 2 days. hiPSC colonies appeared ˜20 days after nucleofection. 20 clones were picked and plated on human feeders, adding ROCK inhibitor (Y27632) at 10 μM to the medium. After some passages, cells were collected using trypsin (0.05%) and plated on matrigel coated plates. 5 different clones (#6, #8, #13, #16 and #20) were finally cultured and characterized.
hiPSC clones were plated on feeders and cultured in hiPSC medium. hiPSCs plated on matrigel were cultured with the MEF-conditioned hiPSC medium.
Alkaline Phosphatase.
The staining was carried out on cells fixed in 10% Neutral Formalin Buffer for 15 min at 4° C., and washed three times with distilled water. The samples were then incubated for 45 min at room temperature in 2 ml of the staining solution prepared as follows: 0.005 g Naphthol AS MX-PO4 (Sigma, N5000), 0.03 g Red Violet LB salt (Sigma, F1625), 200 ml N,N-Dimethylformamide (DMF, Fisher Scientific, D1191), 25 ml of Tris-HCl (MW=157.6, pH 8.3, 0.2M), and 25 ml of distilled water. The alkaline-phosphatase-positive cells showed a red color and were visible under phase-contrast microscopy.
Immunostaining of Stem Cell Markers.
The staining was carried out on cells fixed with 4% PFA for 15 min at room temperature and permeabilized with 0.1% Triton X-100 (Sigma) in PBS for 10 min. Samples were incubated in blocking buffer containing 10% BSA (Sigma) in PBS for 1 h and then were left overnight at 4° C. with primary antibodies diluted in blocking buffer.
Primary antibodies used were: mouse monoclonal anti-Human SSEA-4 clone MC-813-70 (STEMCELL technologies) diluted 1:50; mouse monoclonal anti-Human TRA1-60 clone TRA1-60R (STEMCELL technologies) diluted 1:50; mouse monoclonal anti-Oct3/4 (Santa Cruz Biotechnologies, sc-5279) diluted 1:100; rabbit polyclonal anti-Sox2 (SIGMA, s9072) diluted 1:200; rabbit polyclonal anti-Nanog (Abcam, ab21624) diluted 1:100. For each primary antibody, the respective secondary antibody conjugated to Alexa Fluor (Invitrogen) was used for 40 min at room temperature, diluted 1:1000 in blocking buffer. The cells were then counterstained with DAPI (Vector Laboratories).
Embryoid Bodies (EBs) Formation.
The cells were harvested by trypsinisation and seeded in 96 well plates with V-bottom (Corning Costar) in hiPSC medium supplemented with 10 ng/ml bFGF and 10 μM ROCK inhibitor (Y27632). 48 h later, the EBs were removed from the V-bottom well plates and transferred to 10 cm2 low attachment dishes in hiPSC medium. After 24 h, the formed EBs were divided into three parts for in vitro differentiation to meso-endo-ecto-lineages.
For differentiation to endoderm and mesoderm, EBs were propagated for 3 more days in suspension with EB medium (KO DMEM, 10% FBS (Hyclone), 1× GlutaMax (Invitrogen), 1× penicillin/streptomycin) before being plated on gelatine coated plates in EB medium. The medium was changed every 2-3 days until 15 days when samples were fixed and processed for immuno-fluorescence staining. For mesoderm differentiation, the medium was supplied with 0.5 mM ascorbic acid. For immuno-staining, rabbit polyclonal anti-Alpha Actin-Smooth Muscle (ThermoScientific, #RB-9010), 1:100 dilution and rabbit polyclonal Anti-FOXA2 antibody (Abcam, ab40874), 1:500 dilution were used.
For differentiation to ectoderm, the EBs were propagated for 3 additional days in suspension with N2B27 media (50% Neurobasal medium, 50% DMEM/F12 media, 1× GlutaMax (Invitrogen), 1× penicillin/streptomycin) supplemented with 10 ng/mL bFGF, 20 ng/mL EGF and 1,000 U/mL LIF and then for 4 more days with the addition of 1 μM RA to the medium. Then EBs were collected, washed and dissociated by incubating with trypsin (0.25) for 10 min at room temperature, pipetting up and down. Cells were then collected and plated into matrigel-coated plates in N2B27 supplemented with 10 ng/mL bFGF and 20 ng/mL EGF. 24 h later the medium was changed to N2B27 alone and cells were maintained in culture for 20 days, until neuronal connection was seen in the dish. Cells were fixed and stained as explained in previous sections with mouse monoclonal anti-beta III Tubulin, TU-20 (Abcam, ab7751), 1:500 dilution.
Teratoma Production.
Cells were collected, resuspended in matrigel and injected intratesticularly in SCID mice. After 7 weeks, formed teratomas were surgically dissected, fixed, embedded, sectioned and stained with hematoxylin and eosin.
TaqMan hPSC Scorecard Panel.
hiPSCs, previously grown for one passage on matrigel without feeders, were collected and RNA was extracted with the RNeasy Mini Kit (Qiagen), according to the manufacturer's instructions. Total RNA was treated with DNase (Qiagen) to prevent DNA contamination. RNA integrity was checked with a bioanalyzer instrument. High Capacity cDNA Reverse Transcription (Invitrogen) was used to prepare cDNA according to the TaqMan hPSC Scorecard Panel Workflow. qRT-PCR using the TaqMan hPSC Scorecard Panel was prepared according to the manufacturer's instructions and run in a ViiA 7 Real-Time PCR System. Raw data were analyzed using the web-based hPSC Scorecard™ Analysis Software v1.2, available at lifetechnologies.com/scorecardsoftware.
Immuno-Staining for STORM.
For the imaging experiments, cells were plated on 8-well Lab-tek 1 coverglass chambers (Nunc) at a seeding density of 20,000-50,000 cells per well, fixed and permeabilized with Methanol-Ethanol (1:1) solution at −20° C. for 6 min or fixed with PFA 4% in PBS for 10 min and then permeabilized with 0.1% v/v Triton X-100 (SIGMA) in PBS for 10 min at room temperature. As the distribution of H2B was independent of the fixation and permeabilization protocols, Methanol-Ethanol (1:1) was preferred to minimize the handling of the sample. After 1 h incubation at room temperature with blocking buffer containing 10% (wt/vol) BSA (Sigma) in PBS, samples were incubated overnight with the primary antibody diluted 1:50 in blocking buffer and then for 40 min with the appropriate dilution of dye-labeled secondary antibodies. Repeated washes were performed at every step. Primary antibodies used for immunostaining experiments were: rabbit polyclonal anti-H2B (Abcam, ab1790); mouse monoclonal anti-Histone H1 Antibody, clone AE-4 (Merck Millipore, 05-457); rabbit polyclonal anti-SNAP-H2B (New England Biolabs, P9310S); mouse monoclonal anti-Histone H2B, clone 5HH2-2A8 (Merck Millipore, 05-1352); rabbit polyclonal anti-Acetyl-histone H3 (Merck Millipore, 06-599); mouse monoclonal anti-RNA Polymerase II, clone H5-phosphoserine 2 version of pol II (Covance, MMS-129R); mouse monoclonal anti-RNA Polymerase II, clone H14-phosphoserine 5 version of pol II (Covance, MMS-134R); goat anti-GFP-Alexa Fluor 647 nanobody (gift from Jonas Ries), 1:1000 dilution.
Secondary antibodies used were donkey anti-mouse and donkey anti-rabbit, all from Jackson ImmunoResearch. For STORM imaging, the secondary antibodies were labeled in-house with different combinations of pairs of activator/reporter dyes, as previously described (Bates et al., 2007). Briefly, the dyes were purchased as NHS ester derivatives: Alexa Fluor 405 Carboxylic Acid Succinimidyl Ester (Invitrogen), Cy3 mono-Reactive Dye Pack (GE HealthCare), and Alexa Fluor 647 Carboxylic Acid Succinimidyl Ester (Invitrogen). Antibody labeling reactions were performed by incubating for 40 min at room temperature a mixture containing the secondary antibody, NaHCO3, and the appropriate pair of activator/reporter dyes diluted in DMSO. Purification of labeled antibodies was performed using NAP-5 Columns (GE HealthCare). The dye to antibody ratio was quantified using Nanodrop and only antibodies with a composition of 3-4 Alexa Fluor 405 and 0.9-1.2 Alexa Fluor 647 per antibody were used for imaging.
For all H2B quantification experiments, primary rabbit polyclonal anti-H2B (ab1790) and the secondary donkey anti-rabbit labeled with Alexa Fluor 405-Alexa Fluor 647 (Invitrogen) pair dyes were used after in vitro characterization using mononucleosome, 12- and 24-nucleosome array labeling.
hiPSCs grown on feeder layers were co-stained for Oct3/4 (sc-5279) and H2B (ab1790); only the cells positive for the pluripotency marker were then imaged for H2B by STORM.
STORM Imaging.
STORM imaging was carried out with a commercial STORM microscope system from Nikon Instruments (NSTORM). Laser light at 647 nm was used for exciting Alexa Fluor 647 (Invitrogen) and switching it to the dark state, and laser light at 405 nm was used for reactivating the Alexa Fluor 647 (Invitrogen) fluorescence in an activator dye (Alexa Fluor 405)-facilitated manner. An imaging cycle was used in which one frame belonging to the activating light pulse (405 nm) was alternated with three frames belonging to the imaging light pulse (647 nm). Dual color imaging was performed with two sets of secondary antibodies labeled with the same reporter dye (Alexa Fluor 647) but two different activator dyes (Alexa Fluor 405 and Cy3) (Bates et al., 2007). In addition to the 405 nm laser light, an additional imaging cycle with 561 nm laser light as the activating light pulse was used for reactivating Alexa Fluor 647 linked to the second activator dye (Cy3). The emitted light from Alexa Fluor 647 was collected by an oil immersion 100× objective with 1.49 NA, filtered by an emission filter (ET705/72m) and imaged onto an electron multiplying charge coupled device (EMCCD) camera (Andor Technology) at an exposure time of 15 ms per frame. For all single color and in vitro H2B imaging experiments, an identical ‘excitation-switching off-reactivation’ scheme was used by gradually increasing the 405 nm laser power in a sigmoidal manner starting with 0.5 μW at frame 800 and ending with 2000 μW at frame 44800 according to Table I. Up to frame 800, the 405 nm laser power was set to zero. When the final power of 2000 μW was reached, this power was kept until the fluorophores were exhaustively imaged and photobleached. Imaging was done using a previously described imaging buffer [Cysteamine MEA (SigmaAldrich, #30070-50G), Glox Solution: 0.5 mg mL⁻¹ glucose oxidase, 40 mg mL⁻¹ catalase (all Sigma), 10% Glucose in PBS] (Bates et al., 2007).
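By way of illustration only, the frame sequence of the imaging cycle described above (one activation frame followed by three imaging frames) may be sketched as follows. The function name `frame_schedule` and its parameters are hypothetical, and the strict alternation of the 405 nm and 561 nm cycles in the dual color case is an assumption, since the text only states that an additional cycle with 561 nm activation was used.

```python
from itertools import islice

def frame_schedule(activators=(405,), imaging_nm=647, imaging_frames=3):
    """Yield (frame_index, role, wavelength_nm) for a repeating imaging cycle.

    Each cycle is one activation frame followed by `imaging_frames` imaging
    frames. With several activator wavelengths, the cycles are assumed to
    alternate in the order given (hypothetical ordering).
    """
    index = 0
    while True:
        for activator_nm in activators:
            yield index, "activation", activator_nm
            index += 1
            for _ in range(imaging_frames):
                yield index, "imaging", imaging_nm
                index += 1

# First eight frames of a single color and of a dual color acquisition.
single = list(islice(frame_schedule(), 8))
dual = list(islice(frame_schedule(activators=(405, 561)), 8))
```

In this sketch the laser power ramp of Table I is not modelled; only the ordering of activation and imaging frames is represented.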
STORM Data Analysis.
STORM images were analyzed and rendered as previously described (Bates et al., 2007; Huang et al., 2008a; Huang et al., 2008b), using custom-written software (Insight3, kindly provided by Bo Huang, University of California, San Francisco). Briefly, peaks in single-molecule images were identified based on a threshold and fit to a simple Gaussian to determine the x and y positions. The final images were rendered by representing each x-y position (localization) as a Gaussian with a width that corresponds to the determined localization precision (9 nm). Sample drift during acquisition was calculated and subtracted by reconstructing STORM images from subsets of frames (typically 500-1000 frames, for which drift was assumed to be small) and correlating these images to a reference frame (typically one that is reconstructed at the initial time segment). For multicolor images, each peak was color coded based on whether the emission was recorded immediately after a 405 nm or a 561 nm activation cycle. The peaks coming from a frame other than the one right after an activation frame were coded as “non-specific”. A crosstalk algorithm as described previously was applied to correct for non-specific activations by the imaging laser (Dani et al., 2010). Briefly, the number of “apparent specific” activations was calculated from the frame immediately following the activation pulse and the number of “non-specific” activations from subsequent imaging frames in the imaging cycle. Assuming that the probability of “non-specific” activations is constant across all frames, the number of “actual specific” activations could then be determined by subtracting the “non-specific activation” number from the “apparent specific” activation number. These numbers were then used to statistically subtract crosstalk due to “non-specific” activations in an unbiased way as previously described (Dani et al., 2010).
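As a minimal sketch of the count-level subtraction just described, and not of the full statistical procedure of Dani et al. (2010), the number of “actual specific” activations may be estimated as shown below. The function name `actual_specific` and the example counts are hypothetical; the per-frame non-specific rate is assumed to be the mean count of the later imaging frames of the cycle.

```python
def actual_specific(counts_per_frame):
    """Estimate specific activations for one activation channel.

    counts_per_frame: localization counts for the imaging frames of the
    cycle, ordered by position after the activation frame, e.g.
    [first_frame, second_frame, third_frame].
    """
    apparent = counts_per_frame[0]   # frame right after the activation pulse
    later = counts_per_frame[1:]     # frames treated as purely non-specific
    # Assumption: non-specific activation is equally likely in every frame,
    # so its per-frame rate is the mean of the later frames.
    nonspecific_per_frame = sum(later) / len(later)
    return max(apparent - nonspecific_per_frame, 0.0)

# Hypothetical counts: 120 localizations in the first imaging frame,
# 20 and 24 in the two following frames.
estimate = actual_specific([120, 20, 24])
```

The estimate is clipped at zero so that a channel dominated by non-specific activations does not yield a negative count.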
Image Analysis and Cluster Quantification.
STORM data consisting of (x,y) localization lists were used to construct discrete localization images, such that each pixel has a value equal to the number of localizations falling within the pixel area (pixel size=10 nm). From the localization images, density maps were obtained by 2-dimensional convolution with a square kernel (5×5 pixels). A constant threshold was used to digitize the density maps into binary images, such that pixels have a value of 1 where the density is larger than the threshold value and a value of 0 elsewhere. For the determination of the threshold value, unlabeled samples were imaged. The images were analyzed as described, and digitized with increasing threshold values. For each threshold value, the ratio of nonzero to zero pixels was calculated. The threshold value (0.002 nm⁻²) giving a ratio <2×10⁻⁴ was used for image analysis. Localizations falling on zero-valued pixels of the binary images (low-density areas) were discarded from further analysis. For this threshold setting, the number of discarded localizations typically corresponded to <5% of the total number of localizations within a nuclear region.
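The construction of the localization image, the density map and the binary mask may be sketched as follows. This is an illustrative sketch in Python rather than the custom Matlab code referred to in this description; the function names are hypothetical, and the normalisation of the density (localizations in the 5×5 pixel window divided by the window area) is an assumption made so that the values are expressed in nm⁻².

```python
from collections import Counter

PIXEL_NM = 10.0    # pixel size stated in the text
KERNEL = 5         # 5 x 5 pixel square kernel
THRESHOLD = 0.002  # nm^-2, threshold value stated in the text

def pixel_of(point):
    """Pixel index (ix, iy) containing an (x, y) localization in nm."""
    x, y = point
    return int(x // PIXEL_NM), int(y // PIXEL_NM)

def localization_image(points):
    """Number of localizations per pixel, stored sparsely."""
    return Counter(pixel_of(p) for p in points)

def density_map(image):
    """Box-filter the counts with the square kernel.

    Assumed normalisation: counts inside the window / window area (nm^2).
    """
    half = KERNEL // 2
    area = (KERNEL * PIXEL_NM) ** 2
    sums = Counter()
    for (ix, iy), n in image.items():
        for dx in range(-half, half + 1):
            for dy in range(-half, half + 1):
                sums[(ix + dx, iy + dy)] += n
    return {pix: total / area for pix, total in sums.items()}

def keep_dense(points, threshold=THRESHOLD):
    """Discard localizations that fall on zero-valued pixels of the mask."""
    density = density_map(localization_image(points))
    mask = {pix for pix, d in density.items() if d > threshold}
    return [p for p in points if pixel_of(p) in mask]

# Hypothetical data: eight nearby localizations and one isolated one.
cluster_points = [(100.0 + i, 100.0 + i) for i in range(8)]
isolated = (500.0, 500.0)
kept = keep_dense(cluster_points + [isolated])
```

Under the assumed normalisation, the threshold of 0.002 nm⁻² corresponds to five localizations within a 50 nm × 50 nm window, so the isolated localization in the example is discarded while the eight clustered ones are retained.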
Connected components of the binary image, composed of adjacent non-zero pixels (4-connected neighborhood), were sequentially singled out and analyzed. Localization coordinates within each connected component were grouped by means of a distance-based clustering algorithm. Initialization values for the number of clusters and the corresponding centroid coordinates were obtained from local maxima of the density map within the connected region, calculated by means of a peak finding routine. Localizations were associated to clusters based on their proximity to cluster centroids. New cluster centroid coordinates were iteratively calculated as the average of localization coordinates belonging to the same cluster. The procedure was iterated until convergence of the sum of the squared distances between localizations and their associated cluster centroid, and it provided cluster centroid positions and the number of localizations per cluster. Cluster sizes were calculated as the standard deviation of localization coordinates from the corresponding cluster centroid.
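The iterative, centroid-based grouping described above may be sketched as follows for the localizations of a single connected component. The sketch assumes that the seed centroids have already been obtained from the peak finding routine, which is not reproduced here. The function names are hypothetical, and the cluster size is computed as the root mean square distance to the centroid, which is only one possible reading of “standard deviation of localization coordinates”.

```python
import math

def assign(points, centroids):
    """Index of the nearest centroid for every localization."""
    return [min(range(len(centroids)), key=lambda k: math.dist(p, centroids[k]))
            for p in points]

def mean_point(members):
    """Average (x, y) of a non-empty list of localizations."""
    n = len(members)
    return (sum(x for x, _ in members) / n, sum(y for _, y in members) / n)

def cluster(points, seeds, tol=1e-9, max_iter=100):
    """Group localizations around seed centroids until the summed squared
    distance to the assigned centroids stops decreasing."""
    centroids = [tuple(s) for s in seeds]
    previous = math.inf
    for _ in range(max_iter):
        labels = assign(points, centroids)
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = mean_point(members)
        cost = sum(math.dist(p, centroids[lab]) ** 2 for p, lab in zip(points, labels))
        if previous - cost <= tol:
            break
        previous = cost
    result = []
    for k, c in enumerate(centroids):
        members = [p for p, lab in zip(points, labels) if lab == k]
        size = math.sqrt(sum(math.dist(p, c) ** 2 for p in members) / len(members)) if members else 0.0
        result.append({"centroid": c, "n": len(members), "size": size})
    return result

# Hypothetical localizations (nm) forming two groups, with two seed peaks.
points = [(0, 0), (2, 0), (0, 2), (2, 2), (50, 50), (52, 50), (50, 52), (52, 52)]
clusters = cluster(points, seeds=[(1, 0), (49, 50)])
```

Each entry of `clusters` carries the centroid position, the number of localizations and the size of one cluster, which are the quantities used in the subsequent quantification.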
In order to further check the effect of the threshold on the quantification, a subset of data (hFbs, TSA-hFbs, mononucleosomes, 12- and 24-nucleosome array) was analyzed by applying different threshold values, ranging from 8×10⁻⁴ to 0.004 nm⁻². All the investigated data showed a similar linear dependence of the median number of localizations per cluster on the threshold value.
Analyses were performed by means of custom code written in Matlab.
Simulations of Synthetic Images.
Three-dimensional nucleosome sequences were simulated assuming nucleosomes as impenetrable spheres (r=10 nm) arranged in space according to a Gaussian chain model. Inter-nucleosome end-to-end distances were calculated by conversion of DNA linker lengths according to the worm-like chain (WLC) model for a polymer with a persistence length of 150 bp.
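As an illustration of the conversion of a linker length into an end-to-end distance, the standard WLC expression for the mean squared end-to-end distance, ⟨R²⟩ = 2PL[1 − (P/L)(1 − e^(−L/P))], may be evaluated as sketched below. Using the root mean square distance is an assumption about how the conversion was applied, and the conversion factor of 0.34 nm per base pair is an assumed B-DNA value that does not appear in this description.

```python
import math

PERSISTENCE_BP = 150.0  # persistence length stated in the text
NM_PER_BP = 0.34        # assumed B-DNA rise per base pair (not from the text)

def wlc_rms_distance_nm(linker_bp):
    """Root mean square end-to-end distance (nm) of a worm-like chain
    with contour length `linker_bp` base pairs."""
    if linker_bp <= 0:
        return 0.0
    contour = float(linker_bp)
    p = PERSISTENCE_BP
    mean_sq_bp = 2.0 * p * contour * (1.0 - (p / contour) * (1.0 - math.exp(-contour / p)))
    return math.sqrt(mean_sq_bp) * NM_PER_BP

# Linker of 50 bp (full occupancy case) and a much longer linker.
short_linker = wlc_rms_distance_nm(50)
long_linker = wlc_rms_distance_nm(3000)
```

For linkers much shorter than the persistence length the result approaches the contour length (rod-like DNA), whereas for very long linkers it approaches the random coil value √(2PL).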
It has been assumed that at full DNA occupancy (75% of DNA length covered by nucleosomes) nucleosomes have 146 bp DNA wrapped around them and are uniformly spaced by linker DNA fragments of 50 bp (Kornberg and Lorch, 1999).
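The full occupancy figure follows directly from the two lengths given above, as this short worked calculation shows:

```latex
\[
\mathrm{OCC}_{\mathrm{DNA}}^{\mathrm{full}}
  = \frac{146\ \mathrm{bp}}{146\ \mathrm{bp} + 50\ \mathrm{bp}}
  = \frac{146}{196}
  \approx 0.745 \approx 75\%
\]
```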
For comparison with the experimental data, distributions of linker DNA lengths were modified on the basis of two different models. In the first model (NR-model), the possibility was considered that nucleosomes can be randomly removed with a finite probability p. The removal of a nucleosome results in an increase in the DNA linker length between neighboring nucleosomes, caused by DNA unwrapping (146 bp). For this model, the DNA occupancy (OCC_DNA) depends on the nucleosome removal percentage p as:
In the linker length (LL) model, it is assumed that nucleosomes are spaced by linker-DNA lengths distributed according to a normal distribution with average length l_average, so that the DNA occupancy is a function of l_average:
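The expressions for the two models are not reproduced in this text. Under the assumptions stated above (146 bp wrapped per nucleosome and 50 bp linkers at full occupancy), one consistent form, offered only as a sketch and not necessarily identical to the original expressions, would be the following, where p is expressed as a fraction between 0 and 1 and l_average in base pairs:

```latex
\[
\text{NR-model:}\qquad
\mathrm{OCC}_{\mathrm{DNA}}(p) = (1 - p)\,\frac{146}{146 + 50}
\]
\[
\text{LL-model:}\qquad
\mathrm{OCC}_{\mathrm{DNA}}(l_{\mathrm{average}}) = \frac{146}{146 + l_{\mathrm{average}}}
\]
```

Both forms reduce to approximately 75% for p = 0 and for l_average = 50 bp, respectively, in agreement with the full occupancy case.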
Simulations were carried out for several nucleosome removal percentages (from 0 to 95%) and average linker lengths l_average (from 50 bp to ˜3000 bp). For each parameter value, thousands of simulations were generated and analyzed.
In order to obtain synthetic STORM images of nucleosome configurations, for each simulated nucleosome a number of localizations was randomly drawn from the distribution obtained from STORM images of mononucleosomes in vitro. To take into account the different efficiency in detecting localizations at various distances from the focal plane (z axis), the number of localizations was scaled with a z-dependent factor obtained by an independent calibration. This calibration consisted of repeatedly imaging the same sample at defined distances from the focal plane. The sample was moved by means of a piezoelectric stage. Quantification of the number of localizations versus the sample distance provided the z-dependent correction factor. Localizations were then randomly placed around the nucleosome centroid position according to a 2-d Gaussian distribution with a standard deviation equal to the one obtained from STORM images of mononucleosomes in vitro.
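The generation of synthetic localizations for one simulated nucleosome may be sketched as follows. The function name and all numerical values in the example (the pool of localization counts, the standard deviation and the z-dependent factor) are hypothetical placeholders for the experimentally calibrated quantities mentioned above.

```python
import random

def synthetic_localizations(centroid_xy, z_nm, counts_pool, sigma_nm, z_factor, rng):
    """Return a list of (x, y) localizations for one simulated nucleosome.

    counts_pool: empirical numbers of localizations per mononucleosome,
                 from which one value is drawn at random.
    z_factor:    callable giving the detection efficiency at distance z_nm
                 from the focal plane (calibrated independently).
    """
    drawn = rng.choice(counts_pool)
    n = round(drawn * z_factor(z_nm))  # scale by the z-dependent efficiency
    cx, cy = centroid_xy
    # Scatter the localizations with an isotropic 2-d Gaussian.
    return [(rng.gauss(cx, sigma_nm), rng.gauss(cy, sigma_nm)) for _ in range(n)]

# Hypothetical example: every mononucleosome gives 10 localizations in focus,
# and the detection efficiency is halved at this distance from the focal plane.
rng = random.Random(0)
locs = synthetic_localizations((0.0, 0.0), 200.0, [10], 10.0, lambda z: 0.5, rng)
```

Passing the random generator explicitly keeps the simulation reproducible, which is convenient when thousands of configurations are generated per parameter value.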
The localization coordinates were then analyzed in the same way as the regular STORM images and the number of localizations per cluster, cluster area and nearest neighbor distance were quantified.
Example 1
Nucleosomes in Interphase Nuclei of Human Somatic Cells are Organized in Discrete Nanodomains
To reveal the organization of chromatin at nanoscale resolution, the inventors recorded STORM images of the core histone protein H2B in interphase human fibroblast nuclei (hFb). An antibody that recognizes native H2B was used. STORM images revealed a striking organization of H2B inside the nucleus (
To rule out the possibility that the observed clustered distribution of H2B was due to sample preparation or labeling methods used, the inventors performed a series of control experiments. First, they showed that the clustered distribution of H2B was independent of the fixation and permeabilization protocols used (
They next aimed to analyze the nucleosome organization in cells undergoing massive epigenome modifications and chromatin rearrangements. Thus, hFbs were treated with Trichostatin A (TSA) (TSA-hFb), a potent inhibitor of histone deacetylase enzyme, which is known to lead to genome-wide decondensation of chromatin inside the nucleus through accumulation of acetylation groups on histone tails (Toth et al., 2004). As expected, there was a large increase in H3 acetylation after TSA treatment (
To gain quantitative insight into the H2B nanodomains, the inventors next developed a cluster identification algorithm to group the detected localizations in STORM images into nanodomains (
To assess the H2B organization of pluripotent stem cells, the inventors next imaged mouse embryonic stem cells (mESC) with STORM. mESCs were initially cultured in a medium containing serum and the Leukemia inhibitory factor (sLif) and H2B was labeled by immunofluorescence and imaged with STORM as before. STORM images of these mESCs showed two different categories of nuclei. The first category, Type 1, displayed nanodomains that appeared bright (i.e. contained a large number of localizations) similar to hFbs (
The activation of Wnt/β-catenin signaling pathway controls mESC pluripotency and self-renewal (Kuhl and Kuhl, 2013). Tcf3 acts as a key effector of this pathway by repressing Wnt target genes. Its deletion in mESCs maintains the ground state of pluripotency. H2B nanodomains in the STORM images of mESCs in which Tcf3 was knocked-out (mESCsTcf3−/−) resembled those found in naïve pluripotent mESCs (Type 2 and 2iLif) (
H1 is the linker histone that binds to the entry and exit sites of DNA that is wrapped around the histone octamer, keeping the nucleosome in place and leading to higher order compaction of the chromatin structure (Woodcock et al., 2006). Therefore, H1 is thought to play an important role in chromatin organization. For example, mESCs that lack three H1 isoforms were shown to have chromatin structural changes such as reduced local chromatin compaction (Fan et al., 2005). To test whether the nanodomain organization of mESCs depended on H1, they imaged mutant mESCs carrying a deletion of three H1 isoforms (mESCH1tKO). STORM imaging of H2B in these cells showed a large amount of dim nanodomains (
Quantification of the differences in nanodomain features among the various mESCs also confirmed that the number of localizations per nanodomain and the nanodomain nearest neighbor distances (nnds) were lower in ground-state mESCs with respect to somatic mNPCs (
The inventors next aimed to further quantify the changes they observed in the number of localizations (and hence brightness) of nanodomains in terms of the number of nucleosomes. There is not a one-to-one relationship between the number of localizations in STORM images and the number of nucleosomes mainly for two reasons: i) the antibody epitope labeling efficiency may not be 100%, ii) each fluorophore can undergo multiple photoswitching events, resulting in multiple localizations arising from a single fluorophore. However, the epitope labeling efficiency of the H2B antibody should be comparable across the human cells (hFbs, TSA-hFbs) and likewise across the different mESCs analyzed. In addition, the antibodies used were always labeled with a similar dye composition (Extended Experimental Procedures) and each cell was imaged in the same way (Table I) to obtain comparable number of localizations per antibody. Therefore, the number of nucleosomes should scale with the number of localizations. A similar approach has previously been used to quantify the receptor heterogeneity of synapses in brain slices (Dani et al., 2010).
Nanodomains in any given nucleus contained a large distribution of localizations spanning two orders of magnitude (˜3 to 300) (
In order to relate the median number of localizations to the median number of nucleosomes while taking into account the limitations mentioned above, the inventors generated an in vitro calibration curve. To this end, single nucleosomes were assembled, spotted on coverglass in vitro, labeled using the same cell immunostaining protocol and STORM images were obtained using identical imaging conditions (Table I). As expected, the mononucleosome images resembled small clusters of localizations (
The calibration curve was first validated by imaging nucleosomes assembled with circular DNA in vitro. The plasmid used had a DNA length allowing the assembly of ˜20 nucleosomes. The polynucleosomes assembled with this plasmid were immunostained, imaged and the number of localizations per polynucleosome was quantified. The median number of localizations obtained was then interpolated from the calibration curve into the median number of nucleosomes and corresponded to 19.5±2.0 nucleosomes, confirming that the calibration curve was indeed accurate (
The inventors next proceeded to convert the median number of localizations obtained from the in vivo STORM images (
The median number of localizations in hFbs corresponded to a median number of ˜8 nucleosomes per clutch whereas this number decreased to ˜2 nucleosomes after TSA treatment (
To determine whether the observed changes in nucleosome clutches corresponded to changes in the compaction of nucleosomes inside the clutches, the median nucleosome density was calculated by dividing the median number of nucleosomes per clutch with the median area of the clutches. Indeed, the nucleosomes in hFbs were more densely compacted inside the clutches compared to TSA-hFbs (
Given the correlation between clutch size and pluripotency level of different mESCs, the inventors next aimed to study whether the identification of the number of nucleosomes per clutch could be predictive of the pluripotency grade in hiPSC clones.
To this end, different hiPSC clones were generated from hFbs and were characterized using standard methods such as alkaline phosphatase (AP) staining, analysis of expression of stem cell genes, formation of embryoid bodies and in vivo teratoma formation in immune-compromised mice. Based on the results of this characterization, hiPSC clone 13 was the most pluripotent since it was AP positive, expressed high levels of the stem cell markers TRA-1-60, SSEA4, Oct4, Sox2 and Nanog, formed embryoid bodies, which differentiated into the three germ layers, and generated large and fully differentiated teratomas in mice (
In a double-blind fashion, the median number of localizations was quantified after STORM imaging of all the hiPSC clones (
The arrangement of nucleosomes in large and small clutches, with higher and lower compaction respectively, could potentially facilitate the binding of transcription factors, polymerases and other proteins to the DNA, which should be more accessible in regions containing a smaller number of nucleosomes. The compaction of the nucleosomes within the clutches, on the other hand, should be aided by the presence of linker histone protein H1, which is known to be involved in nucleosome compaction as well as to be enriched in heterochromatin (Fan et al., 2005; Shen et al., 1995; Woodcock et al., 2006). Thus, to evaluate differences in the heterochromatin content and accessibility of RNA Polymerase II (PolII), multi-color super-resolution imaging of H2B with histone H1 and of H2B with PolII was carried out. For H1, an antibody that recognizes all of its isoforms was used. In the case of PolII, an antibody against phosphoserine 5 of the carboxy-terminal domain of PolII (PolII11) was used to image both PolII at the initiation complex and the elongating PolII.
The inventors first recorded multi-color STORM images of H2B and H1 in hFbs and TSA-hFbs (
In mESCs cultured in sLif around 54±2% of H2B colocalized with H1 and the number of H1 localizations also increased with the number of H2B localizations (
Next, the inventors analyzed PolII and H2B multi-color STORM images of hFbs and TSA-hFbs. In both cases, PolII was found interspersed with the nucleosome clutches (
The organization of nucleosomes in discrete clutches that are separated in space implies that nucleosome-depleted regions likely exist in the chromatin fiber. The inventors hypothesized that these regions might be the result of two alternative mechanisms. First, removal of one or more nucleosomes in between nucleosome-rich regions can generate the clutch-like organization of nucleosomes observed in STORM images. Second, variations in the length of the linker-DNA between subsequent nucleosomes can generate nucleosome-depleted regions if the linker-DNA length becomes larger than the spatial resolution.
To gain more insight into nucleosome occupancy in human fibroblast cells, the inventors used coarse-grained computer simulations of nucleosome spatial arrangement, based on a simple model with only a few parameters. Nucleosomes were placed along the DNA fiber at regular intervals with 146 bp of DNA wrapped around each nucleosome and with 50 bp of linker-DNA separating them, which is the average linker-DNA length measured in previous studies (
To simulate nucleosome removal, the linker-DNA length was kept fixed at 50 bp but nucleosomes were randomly removed with a probability ranging from p=0 to p=95% (NR model) (
To simulate variations in linker-DNA length (l) between subsequent nucleosomes (LL model), the 146 bp of DNA wrapped around nucleosomes was maintained, but the linker-DNA lengths (l) were extracted from normal distributions with average linker-DNA lengths (l_average) ranging from 50 bp to ˜3000 bp (
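By way of non-limiting illustration, the placement of nucleosomes along the DNA fiber in the NR and LL models can be sketched as follows. Only the 146 bp of wrapped DNA and the 50 bp base linker length are taken from the text. The function names, the standard deviation of the linker-length distribution, the clipping of negative linker lengths to zero and the definition of relative occupancy are assumptions made for this example. The sketch does not include antibody labeling or the rendering of synthetic STORM images.

```python
import random

WRAPPED_BP = 146     # DNA wrapped around each nucleosome (from the text)
BASE_LINKER_BP = 50  # fixed linker-DNA length in the NR model (from the text)


def simulate_nr(n_sites, p_remove, rng):
    """NR model: regularly spaced nucleosomes, each removed with probability p_remove.

    Returns the start positions (in bp) of the nucleosomes that remain.
    """
    step = WRAPPED_BP + BASE_LINKER_BP
    return [i * step for i in range(n_sites) if rng.random() >= p_remove]


def simulate_ll(n_nucleosomes, mean_linker_bp, sd_linker_bp, rng):
    """LL model: linker lengths drawn from a normal distribution.

    sd_linker_bp is a hypothetical parameter; negative draws are clipped to 0
    so that nucleosomes never overlap (an assumption of this sketch).
    """
    positions, pos = [], 0.0
    for _ in range(n_nucleosomes):
        positions.append(pos)
        linker = max(0.0, rng.gauss(mean_linker_bp, sd_linker_bp))
        pos += WRAPPED_BP + linker
    return positions


def relative_occupancy(n_remaining, n_sites):
    """Occupancy relative to a fully occupied fiber (assumed definition)."""
    return n_remaining / n_sites
```

For example, `simulate_nr(10000, 0.5, random.Random(1))` draws one NR realization in which about half of the sites remain occupied, and `simulate_ll(10000, 500.0, 100.0, random.Random(1))` draws one LL realization with an average linker length of 500 bp.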
Finally, as a control, the potential effect of labeling efficiency was also simulated by decorating the nucleosomes with antibodies corresponding to an average of 1.6 antibodies per nucleosome (in vitro measured value,
Synthetic STORM images of the nucleosomes along the DNA fiber at different levels of nucleosome occupancy (
In the case of the number of localizations, both the NR and LL models intersected the experimental values at around 56% and 46% occupancy for the hFbs and TSA-hFbs, respectively (
Taken together, these results indicate that the nucleosome occupancy in TSA-hFbs is around 45% and that nucleosome removal is likely the dominant mechanism generating nucleosome-poor regions along the DNA fiber, since all three measured parameters of the experimental data can be recapitulated with this model. In hFbs, around 56% of the DNA fiber is occupied with nucleosomes, and both nucleosome removal and linker-DNA length modifications likely play a role in generating the nucleosome-depleted regions. For these nucleosome occupancy levels (45% and 56%), the simulation results not only reproduced the median values observed for the experimental data, but the full experimental distributions of the three parameters also fit well to the simulated distributions (
- Bates, M., Huang, B., Dempsey, G. T., and Zhuang, X. (2007). Multicolor super-resolution imaging with photo-switchable fluorescent probes. Science 317, 1749-1753.
- Bibel, M., Richter, J., Lacroix, E., and Barde, Y. A. (2007). Generation of a defined and uniform population of CNS progenitors and neurons from mouse embryonic stem cells. Nature protocols 2, 1034-1043.
- Bock, C., Kiskinis, E., Verstappen, G., Gu, H., Boulting, G., Smith, Z. D., Ziller, M., Croft, G. F., Amoroso, M. W., Oakley, D. H., et al. (2011). Reference Maps of human ES and iPS cell variation enable high-throughput characterization of pluripotent cell lines. Cell 144, 439-452.
- Dani, A., Huang, B., Bergan, J., Dulac, C., and Zhuang, X. (2010). Superresolution imaging of chemical synapses in the brain. Neuron 68, 843-856.
- Fan, Y., Nikitina, T., Zhao, J., Fleury, T. J., Bhattacharyya, R., Bouhassira, E. E., Stein, A., Woodcock, C. L., and Skoultchi, A. I. (2005). Histone H1 depletion in mammals alters global chromatin structure but causes specific changes in gene regulation. Cell 123, 1199-1212.
- Fussner, E., Ching, R. W., and Bazett-Jones, D. P. (2011a). Living without 30 nm chromatin fibers. Trends in biochemical sciences 36, 1-6.
- Fussner, E., Djuric, U., Strauss, M., Hotta, A., Perez-Iratxeta, C., Lanner, F., Dilworth, F. J., Ellis, J., and Bazett-Jones, D. P. (2011b). Constitutive heterochromatin reorganization during somatic cell reprogramming. The EMBO journal 30, 1778-1789.
- Grigoryev, S. A., Arya, G., Correll, S., Woodcock, C. L., and Schlick, T. (2009). Evidence for heteromorphic chromatin fibers from analysis of nucleosome interactions. Proceedings of the National Academy of Sciences of the United States of America 106, 13317-13322.
- Huang, B., Jones, S. A., Brandenburg, B., and Zhuang, X. (2008a). Whole-cell 3D STORM reveals interactions between cellular structures with nanometer-scale resolution. Nature methods 5, 1047-1052.
- Huang, B., Wang, W., Bates, M., and Zhuang, X. (2008b). Three-dimensional super-resolution imaging by stochastic optical reconstruction microscopy. Science 319, 810-813.
- Kornberg, R. D., and Lorch, Y. (1999). Twenty-five years of the nucleosome, fundamental particle of the eukaryote chromosome. Cell 98, 285-294.
- Kuhl, S. J., and Kuhl, M. (2013). On the role of Wnt/beta-catenin signaling in stem cells. Biochimica et biophysica acta 1830, 2297-2306.
- Marks, H., Kalkan, T., Menafra, R., Denissov, S., Jones, K., Hofemeister, H., Nichols, J., Kranz, A., Stewart, A. F., Smith, A., et al. (2012). The transcriptional and epigenomic foundations of ground state pluripotency. Cell 149, 590-604.
- Meshorer, E., Yellajoshula, D., George, E., Scambler, P. J., Brown, D. T., and Misteli, T. (2006). Hyperdynamic plasticity of chromatin proteins in pluripotent embryonic stem cells. Developmental cell 10, 105-116.
- Nieuwenhuizen, R. P., Lidke, K. A., Bates, M., Puig, D. L., Grunwald, D., Stallinga, S., and Rieger, B. (2013). Measuring image resolution in optical nanoscopy. Nature methods 10, 557-562.
- Rust, M. J., Bates, M., and Zhuang, X. (2006). Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM). Nature methods 3, 793-795.
- Struhl, K., and Segal, E. (2013). Determinants of nucleosome positioning. Nature structural & molecular biology 20, 267-273.
- Toth, K. F., Knoch, T. A., Wachsmuth, M., Frank-Stohr, M., Stohr, M., Bacher, C. P., Muller, G., and Rippe, K. (2004). Trichostatin A-induced histone acetylation causes decondensation of interphase chromatin. Journal of cell science 117, 4277-4287.
- Woodcock, C. L., and Ghosh, R. P. (2010). Chromatin higher-order structure and dynamics. Cold Spring Harbor perspectives in biology 2, a000596.
- Yi, F., Pereira, L., and Merrill, B. J. (2008). Tcf3 functions as a steady-state limiter of transcriptional programs of mouse embryonic stem cell self-renewal. Stem cells 26, 1951-1960.
- Ying, Q. L., Wray, J., Nichols, J., Batlle-Morera, L., Doble, B., Woodgett, J., Cohen, P., and Smith, A. (2008). The ground state of embryonic stem cell self-renewal. Nature 453, 519-523.
Claims
1. A method for detecting the chromatin state of a cell comprising
- a) contacting a sample containing cells with a first antibody capable of specifically binding to a histone protein,
- b) contacting the antibody:histone complex formed in step a) with a secondary antibody having at least one photoswitchable fluorophore adapted to be optically excited at a certain wavelength λ1 and to emit light at a wavelength λ2 different from λ1,
- c) recording a super resolution image of nucleosome organization by means of a sensor being sensitive at least to the wavelength of emission of the photoswitchable fluorophore by exciting the sample with an optical radiation having a wavelength λ1,
- d) correlating the image obtained in step c) with size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch, and
- e) comparing data obtained in step d) with a corresponding reference value to obtain a score based on size of nucleosomal clutches, nucleosomal density and/or number of nucleosomes per nucleosomal clutch,
wherein if the cell comprises smaller clutches, less densely compacted nucleosomes or fewer nucleosomes per clutch compared to the corresponding reference value then it is indicative that said cell is in an open chromatin state and wherein if the cell comprises bigger clutches, more densely compacted nucleosomes or more nucleosomes per clutch compared to the corresponding reference value then it is indicative that said cell is in a closed chromatin state.
2. The method according to claim 1, wherein the cell in an open chromatin state is selected from the group consisting of a transcriptionally active cell, a pluripotent stem cell, a cancer cell and a drug-perturbed cell.
3. The method according to claim 1, wherein the secondary antibody further comprises a second fluorophore adapted to be optically excited at a wavelength λ3 and reactivate the first fluorophore by bringing it from its dark state back to its ground state, upon which the first fluorophore can be excited again at its excitation wavelength λ1 and emit light at its emission wavelength λ2.
4. The method according to claim 3, wherein a plurality of super resolution images are taken by means of a sensor being sensitive at least to the wavelength of emission of the second fluorophore λ2 rendering a further super resolution image by collecting the sensed light emissions recorded in the plurality of images.
5. The method according to claim 4, wherein the power of the optical radiation having a wavelength λ3 is monotonically increased.
6. The method according to claim 4, wherein, before recording each super resolution image of the plurality of super resolution images, the sample is excited one or more times with an optical radiation having a wavelength λ3 and subsequently excited one or more times with an optical radiation having a wavelength λ1.
7. The method according to claim 4, wherein the super resolution image is rendered from a list of locations (x,y) determined as the coordinates in the sample where an optical emission of a photoswitchable fluorophore adapted to emit light at a wavelength λ2 is present.
8. The method according to claim 7, wherein
- a density image of resolution lower than or equal to the rendered high resolution image and representing the same area as said rendered high resolution image is provided wherein each pixel of the density image has a value proportional to the number of locations of the location list falling within the area represented by said pixel,
- a binary image representing the same area as the density image comprising zero value pixels if the corresponding value represented by the density image in the same location is lower than a predefined threshold; and, nonzero if said value is higher, is provided,
- identifying connected regions of pixels representing values higher than the predefined threshold,
- for each connected region, providing a list of clutch positions by grouping the localization coordinates within said connected region according to a distance-based criterion being the position of the clutch the centroid position of the localization coordinates associated with said clutch.
9. The method according to claim 8, wherein the size of each clutch is calculated as a measure of the spreading of the positions of all the localization coordinates associated with said clutch and/or the number of nucleosomes within said clutch.
10. The method according to claim 8, wherein the density of nucleosomes within a clutch is calculated as the number of nucleosomes within that clutch divided by the area occupied by said clutch.
11. The method according to claim 1, wherein the histone protein is H2B.
12. A method for isolating a cell in an open chromatin state comprising
- a) detecting the chromatin state of a cell by a method according to claim 1, and
- b) isolating a cell having smaller clutches, less densely compacted nucleosomes or fewer nucleosomes per clutch.
13. The method for isolating a cell in an open chromatin state according to claim 12 wherein the cell in an open chromatin state is selected from the group consisting of transcriptionally active cell, pluripotent cell, cancer cell and drug-perturbed cell.
14. A method for isolating a cell in a closed chromatin state comprising
- a) detecting the chromatin state of the cell by a method according to claim 1, and
- b) isolating a cell having bigger clutches, more densely compacted nucleosomes or more nucleosomes per clutch.
15. A kit comprising a first antibody capable of specifically binding to a histone protein and a photoswitchable fluorophore-linked secondary antibody.
16. The kit according to claim 15, wherein the histone protein is histone H2B.
17. Use of the kit according to claim 15 for detecting the chromatin state of a cell and isolating a cell in an open chromatin state or in a closed chromatin state.
18. A device adapted to detect the chromatin state of a cell comprising
- a source of optical radiation adapted to emit light at a wavelength λ1 over an interrogation area adapted to receive a biological sample,
- an optical sensor sensitive to a second wavelength λ2 adapted to measure the optical radiation at λ2,
- a control unit connected to the optical sensor and to the source of optical radiation wherein said control unit is adapted to carry out the method according to claim 1.
19. A device adapted to detect the chromatin state of a cell comprising
- a first source of optical radiation adapted to emit light at a wavelength λ1 over an interrogation area adapted to receive a biological sample,
- a further second source of optical radiation adapted to emit light at a wavelength λ3 over an interrogation area adapted to receive a biological sample,
- an optical sensor sensitive to a second wavelength λ2 adapted to measure the optical radiation at λ2,
- a control unit connected to the optical sensor and to the first and to the second source of optical radiation wherein said control unit is adapted to carry out the method according to claim 3.
Type: Application
Filed: Sep 10, 2014
Publication Date: Mar 10, 2016
Inventors: Melike LAKADAMYALI (Barcelona), Carlo MANZO (Barcelona), Maria Aurelia RICCI (Barcelona), Maria Pia COSMA (Barcelona)
Application Number: 14/482,586