TISSUE POTENCY DETERMINATION THROUGH QUANTITATIVE HISTOMORPHOLOGY ANALYSIS
Systems and methods for performing quantitative histopathology analysis for determining tissue potency are disclosed. According to some embodiments, a method training a tissue classifier is provided. According to the method, training the tissue classifier includes generating feature fingerprints of detected nuclei within slide images in a control library and clustering the slide images based on their corresponding feature fingerprints. According to some embodiments, a method for utilizing the trained tissue classifier is provided. According to the method, the trained tissue classifier determines whether tissue in an unknown slide image corresponds to slide images clustered during the training of the tissue classifier.
Latest Enzyvant Therapeutics, Inc. Patents:
This application claims the benefit to and priority of U.S. Provisional Application No. 62/694,829, filed Jul. 6, 2018, the subject matter of which is incorporated herein by reference.
FIELD OF INVENTIONThe described embodiments relate to a quantitative approach for performing quantitative histomorphology analysis of digital images of cells within a tissue sample for determining viable transplantation tissue candidates.
BACKGROUND OF THE INVENTIONThe invention may be understood by reference to the preparation of allogeneic cultured postnatal thymus tissue-derived product, although this disclosure of and claims to the invention are not limited to such an embodiment of the invention.
Allogeneic cultured postnatal thymus tissue-derived product has been shown to be useful for the treatment of T cell immunodeficiency (primary immune deficiency) resulting from congenital athymia, for example in the treatment of complete DiGeorge Anomaly (cDGA) associated with 22q11.2 deletion and CHARGE (coloboma, heart defect, choanal atresia, growth or mental retardation, genital hypoplasia and ear anomalies or deafness) syndrome associated with mutations in the chd7 (chromodomain-helicase-DNA-binding protein 7) gene and in athymic patients with forkhead box protein N1 (FOXN1) deficiency. Congenital athymia is a rare fatal condition and currently has no drug treatment options utilizing regulatory approved drug products.
Experimental transplantation of an allogeneic cultured postnatal thymus tissue-derived product that retains thymus epithelial cells (TECs) has been successfully applied to treat pediatric patients with congenital athymia (Markert M L, Devlin B H, McCarthy E A, 2010, “Thymus transplantation,” Clin Immunol., 135(2): 236-46; Markert M L, et al., 2004, “Postnatal thymus transplantation with immunosuppression as treatment for DiGeorge syndrome,” Blood 104(8):2574-2581; Markert M L, et al., 1999, “Transplantation of thymus tissue in complete DiGeorge syndrome,” N Engl J Med 341(16): 1180-1189 27).
Allogeneic cultured postnatal thymus tissue-derived product is a tissue-engineered product that is prepared, cultured and stored for up to 21 days to produce partially T cell-depleted thymus tissue slices and which is differentiated from native thymus by a conditioning process. The conditioning regimen partially depletes the donor thymocytes from the cultured thymus tissue slices. Based on in vitro data (immunohistochemistry) a culture period between 12 and 21 days preserves the epithelial network as assessed using cytokeratin antibodies. The culturing is preferably done at 37° C. in a 5% CO2 incubator.
The culturing process significantly modifies the biological characteristics of the donor thymus tissue and constituent cells contained therein in the following manner to optimize the effective therapeutic properties of the Allogeneic cultured postnatal thymus tissue-derived product slices. The culturing process assures that a defined composition of the cultured cells/tissue having the pre-requisite biological characteristics is obtained in a manner suitable for surgical implantation into a subject to enable reconstitution of the subject's immune system. The culturing process results in a loss of thymocytes and relative enrichment of thymic epithelial cells and other stromal cells in the donor thymus tissue slices. The culturing process further results in depletion of thymocytes and maintenance of TECs to enable reconstitution of the recipient's immune system and allows tolerance to develop in the recipient to HLA antigens in the donor thymus. Overall, the manufacturing process is designed to deplete thymocytes from the donor thymus tissue and to preserve the functional architecture of the thymic stroma (thymic epithelial cells and fibroblasts).
The surgical administration of allogeneic, cultured postnatal thymus tissue-derived product (e.g., “RVT-802”) in athymic patients leads to a cascade of events resulting in the development of a functional immune system. Following surgical placement of allogeneic, cultured postnatal thymus tissue-derived product in a recipient, T cells are educated by donor TECs and recipient dendritic cells (DCs). Donor TECs in conjunction with recipient DCs enable tolerance to the implanted donor thymus tissue, which is implanted as cultured thymus tissue slices. This is the same tolerance induction as in a normal thymus. The recipient TECs in conjunction with recipient DCs lead to tolerance to self.
Thymopoiesis has been documented by allograft biopsies and the presence of recipient naive T cells in the periphery (Markert M L, 2010; Markert M L, et al., 2008, “Use of allograft biopsies to assess thymopoiesis after thymus transplantation,” J Immunol 180(9):6354-6364; Markert M L, et al., 2007, “Review of 54 patients with complete DiGeorge anomaly enrolled in protocols for thymus transplantation: outcome of 44 consecutive transplants,” Blood 109(10):4539-454728), which are incorporated herein by reference.
Studies of children treated with investigational allogeneic, cultured postnatal thymus tissue-derived product show tolerance to donor major histocompatibility complex (MHC) by mixed lymphocyte reactions (Chinn I K, Devlin B H, Li Y J, & Markert M L, 2008, “Long-term tolerance to allogeneic thymus transplants in complete DiGeorge anomaly,” Clin Immunol 126(3):277-281). In addition, the infants with congenital athymia, after allogeneic, cultured postnatal thymus tissue-derived transplantation, are able to control infections such as Epstein Barr virus (Markert M L, 2014, Thymus Transplantation. Stiehm's Immune Deficiencies, eds Sullivan K E & Stiehm E R (Academic Press), 1st Ed, pp 1059-1067.
Historically, allogeneic cultured postnatal thymus tissue-derived product release criteria has included histopathological evaluation of H&E and immunostained sections of tissue at the mid-point of the manufacturing process, which was later refined to days 5-9 of the culture period. This histopathological evaluation has served as the potency assay, and has been performed as a qualitative analysis by a board-certified pathologist. Samples were prepared for evaluation either by freezing or formalin fixation prior to the tissue slice being sectioned and then fixed onto a slide. Samples prepared in this manner are stable over long periods of time, allowing for reanalysis to be performed.
The historical samples available from the 20+ years of development history can be linked to positive clinical outcomes, and thus provide a strong data set for development of a quantitative histology assay for evaluation of product quality.
A new digital histology assay was developed using scanned images of H&E slides from previous clinical lots and from experimental lots of allogeneic, cultured postnatal thymus-tissue derived product. These images were analyzed for development into a quantitative release assay, as described below.
SUMMARY OF THE INVENTIONThe embodiments fully disclosed herein describe a method for quantitatively assessing the overall nuclear characteristics of a tissue represented within a slide image.
One application of the disclosed embodiments is the development, through training of a tissue classifier, of an assay that is capable of determining the potency or the quality of the tissue prior to transplantation into patients. For example, the approaches described below could be applied to determining potency of allogeneic cultured postnatal thymus tissue-derived product slices for complete DiGeorge Syndrome patients. The described embodiments can be utilized as an allogeneic cultured postnatal thymus tissue-derived product slices tissue characterization assay that evaluates thymus tissue potency based upon histological slide image analysis and associating passing tissues with a pass classification and failing tissues with a fail classification for potency. As shown in
As will be discussed in additional detail below, the described embodiments describe a training phase that includes the training of the classifier based on a library of slide images and a classification phase that includes the utilization of the trained classifier on an unclassified slide image to determine the potency of the tissue represented in the unclassified slide image. The training phase trains the classifier through a quantitative assessment of nuclear characteristics that are associated with tissue composition and cell viability. The training phase includes analysis of both passing and failing slide images to train the classifier to recognize the corresponding nuclear characteristics in the passing and failing slide images. The resulting classifier implements an assay that recognizes passing and failing composition and viability characteristics based on the historical data that results in successful immune reconstitution following tissue implantation.
In an embodiment, the training phase includes two parts. The first part includes validation of a training dataset which is used to generate suitability criteria for appropriate metadata values, background values, and tissue entropy, and may define acceptable ranges or values that are considered to be acceptable for quantitative analysis. The training dataset includes slide images that are known to have a passing or failing classification for transplantation. These may be slide images that have been analyzed by a pathologist or other medical professional, for example. The second part includes applying the acceptable ranges or values generated in the first part to images within a control library, which results in clustering of the images into positive and negative control groups. As will be discussed further below in the specification and Examples, each control group includes slide images grouped based on having similar feature fingerprints. Positive control groups include slide images with similar feature fingerprints determined to pass the criteria for transplantation; negative control groups include slide images with similar feature fingerprints determined to fail the criteria for transplantation.
In an embodiment, the classification phase then analyzes new slide images based on the positive and negative control groups and clustering the new slide images based on the respective feature fingerprints. The classification phase then determines the potency of the new slide images based on the whether the new slide images are clustered with the positive or negative control groups.
In an embodiment, the feature fingerprints are based on four determinations of the cultured thymus cells. These include “area,” “circularity,” “integrated density,” and “perimeter.” As discussed below, “area” measures the nuclear area which is larger for thymic epithelial cells than for thymocytes. Cells undergoing apoptosis are also likely to be smaller. “Circularity” measures how circular the cells are. Circularity is measured on a scale of 0 to 1 with 1 being a perfect circle. Thymocytes have increased circularity compared to thymic epithelial cells. Non-viable cells have reduced circularity compared to viable cells. As such, circularity may be expected to decrease over culture time course, as both thymocytes are reduced and more non-viable cells can be expected to be observed in the tissue slices. Degraded samples are also expected to have decreased circularity as there would be more non-viable cells thereby shifting the distribution toward lower circularity. “Integrated Density” represents how dark a nucleus is stained. Integrated density is high for thymocytes, which shown uniformly dark staining. Thymic epithelial cells have dark-stained rims and mostly clear nucleoplasm with a prominent dark nucleolus. Finally, “Perimeter” represents the outline of the nuclei that are detected. The perimeter is related to cell viability; as cells degrade, the nuclear outline becomes irregular and its perimeter increases. Perimeter would also increase as the proportion of TEC cells to total cells increase as culture progresses. Perimeter changes are expected over time in culture as well as with degradation of tissue.
The described embodiments are an improvement over current approaches that rely solely on qualitative human-driven analysis, via immunohistochemistry (IHC) and hematoxylin and eosin (H&E) histopathology of slide images to determine potency. These current approaches suffer from a number of limitations that diminish the efficacy of the present qualitative analyses, including an inability to make such qualitative analyses either semi-quantitative or quantitative and that in the human assay, a pathologist cannot assess the entire tissue. Instead, the pathologist is looking at only a part of the tissue (an individual field of view) as opposed to the entire slice of tissue.
Moreover, the described embodiments are particularly more effective than conventional approaches for analyzing complex tissues, like the thymus. In complex tissues, the orientation of the sample tissue can markedly change outcome variations. For example, two separate slices with very different morphologies (e.g. corticomedullary ratio) could both be considered “good” samples (i.e., have a pass classification) with acceptable potency. The described embodiments avoid this limitation and, therefore, are effective in classifying all tissues.
The described embodiments rely on quantitative analysis by leveraging a library of slide images having known potency and efficacy. The slide images in the library include images of tissue that are known to have passed or failed the criteria for potency for that particular tissue by associating such tissues with good or poor clinical outcomes (e.g. survival or not). Slide images that have passed can be associated with a pass classification while slide images that have failed can be associated with a fail classification.
The described embodiments may be implemented on one or more processors. The one or more processors may be co-located or distributed over a network, such as a LAN, WAN, or the Internet (e.g., cloud). One or more of such processors may be a graphics processing unit (GPU). In an embodiment, a GPU may be a processor that is a specialized electronic circuit designed to process mathematically intensive applications. The GPU may have a parallel structure that is efficient for parallel processing of large blocks of data, such as mathematically intensive data common to computer graphics applications, images, videos, etc. The one or more processors may be coupled to a memory that includes one or more levels of cache, and which may have control logic (e.g., computer software) and/or data stored therein.
A first aspect of the present disclosure provides a method of training a tissue classifier for performing quantitative histopathological assessment, comprising:
converting a slide image into a binary slide image, wherein the slide image is selected from a library of control images, wherein each slide image in the library of control images is associated with a pass classification or a fail classification;
detecting one or more nuclei within the binary slide image;
for each detected nucleus, extracting a feature from the detected nucleus, wherein the feature represents a property of the detected nucleus within the binary slide image and comprises at least one of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, or a circularity of the detected nucleus;
for each detected nucleus, generating, based on the feature, a feature fingerprint associated with the binary slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
incorporating the binary slide image into a cluster wherein the cluster comprises a plurality of images, wherein each of the first plurality of images is associated with a corresponding feature fingerprint, and wherein the incorporating is based on comparing the feature fingerprint with the corresponding feature fingerprint;
applying a cutoff height to the cluster to form a plurality of groups, wherein the cutoff height minimizes a number of groups within the plurality of groups based on multivariate analysis of variance analysis of the cluster;
categorizing a first group within the plurality of groups as a positive control set if the first group comprises first slide images associated with the pass classification; and categorizing a second group within the plurality of groups as a negative control set if the second group comprises second slide images associated with the fail classification.
A second aspect of the present disclosure is a method for performing a quantitative histopathological assessment of an unclassified slide image of a tissue, comprising: detecting a hematoxylin channel from the unclassified slide image, wherein the hematoxylin channel is associated with a cellular nucleus within the unclassified slide image of a tissue;
extracting a feature from the detected hematoxylin channel, wherein the feature represents a property of the nucleus within the binary slide image and comprises at least one of an area of the nucleus, a perimeter of the nucleus, an integrated density of the nucleus, or a circularity of the nucleus;
generating, based on the feature, a feature fingerprint associated with the unclassified slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
co-clustering the feature fingerprint with a first set of fingerprints associated with one or more positive control sets and a second set of fingerprints associated with a negative control set, wherein the positive control set(s) comprises a first set of slide images associated with a pass classification and the negative control set comprises a second set of slide images associated with a fail classification; and
determining, based on the co-clustering, if the feature fingerprint is associated with the pass classification or the fail classification.
A third aspect of the present disclosure is a system for classifying objects within digital images of tissue, comprising:
means for converting a slide image into a binary slide image, wherein the slide image is selected from a library of control images, wherein each slide image in the library of control images is associated with a pass classification or a fail classification;
means for detecting a nucleus within the binary slide image;
means for extracting a feature from the detected nucleus, wherein the feature represents a property of the detected nucleus within the binary slide image and comprises at least one of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, or a circularity of the detected nucleus;
means for generating, based on the feature, a feature fingerprint associated with the binary slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
means for incorporating the binary slide image into a cluster wherein the cluster comprises a plurality of images, wherein each of the first plurality of images is associated with a corresponding feature fingerprint, and wherein the incorporating is based on comparing the feature fingerprint with the corresponding feature fingerprint;
means for applying a cutoff height to the cluster to form a plurality of groups, wherein the cutoff height minimizes a number of groups within the plurality of groups based on multivariate analysis of variance analysis of the cluster;
means for categorizing a first group within the plurality of groups as a positive control set if the first group comprises first slide images associated with the pass classification; and means for categorizing a second group within the plurality of groups as a negative control set if the second group comprises second slide images associated with the fail classification.
A fourth aspect of the present disclosure is a classifier, comprising:
means for detecting a hematoxylin channel from the unclassified slide image, wherein the hematoxylin channel is associated with a cellular nucleus within the unclassified slide image of a tissue;
means for extracting a feature from the detected hematoxylin channel, wherein the feature represents a property of the nucleus within the binary slide image and comprises at least one of an area of the nucleus, a perimeter of the nucleus, an integrated density of the nucleus, or a circularity of the nucleus;
means for generating, based on the feature, a feature fingerprint associated with the unclassified slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
means for co-clustering the feature fingerprint with a first set of fingerprints associated with a positive control set and a second set of fingerprints associated with a negative control set, wherein the positive control set comprises a first set of slide images associated with a pass classification and the negative control set comprises a second set of slide images associated with a fail classification; and
means for determining, based on the co-clustering, if the feature fingerprint is associated with the pass classification or the fail classification.
In certain embodiments of the first to fourth aspects of the present disclosure, the methods, system and classifier may comprise a tissue classifier or use of a tissue classifier capable of determining potency or the transplantability of a tissue of a subject, preferably a human subject.
In certain embodiments of the first to fourth aspects of the present disclosure, the tissue is a thymus tissue, preferably allogeneic cultured postnatal thymus tissue-derived product slices for implantation into a human subject.
In certain embodiments of the first to fourth aspects of the present disclosure, the human may be suffering from complete DiGeorge syndrome associated with 22q11.2 deletion; coloboma, heart defect, choanal atresia, growth or mental retardation, genital hypoplasia and ear anomalies or deafness syndrome (CHARGE), or athymia associated with forkhead box protein N1 (FOXN1) deficiency.
In certain embodiments of the first to fourth aspects of the present disclosure, the tissue classifier determines potency of an unknown tissue by generating feature fingerprints of detected nuclei within slide images in a control library and clustering the slide images based on their corresponding feature fingerprints, preferably a thymus tissue.
In certain embodiments of the first to fourth aspects of the present disclosure, the thymus tissue has been subjected to a culturing process in a thymus organ medium for a period of time to partially deplete the thymus tissue of thymocytes. In some embodiments, the period of time is up to 21 days. In other embodiments, the period of time is from about 12 to about 21 day, or from about 5 to about 9 days.
In certain embodiments of the first to the fourth aspects of the present disclosure, the culturing process preserves the functional architecture of the thymic stroma, preferably the thymic stroma comprises thymic epithelial cells and fibroblasts.
In certain embodiments of the first to the fourth aspects of the present disclosure, the tissue is selected from the group consisting of vascular tissue, skin tissue, hepatic tissue, pancreatic tissue, neural tissue, urogenital tissue, gastrointestinal tissue, skeletal tissue including bone and cartilage, adipose tissue, connective tissue including tendons and ligaments, amniotic tissue, chorionic tissue, dura, pericardia, muscle tissue, glandular tissue, facial tissue, ophthalmic tissue.
In certain embodiments of the first to the fourth aspects of the present disclosure, the feature fingerprint is generated from measurements comprising numerical values of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, and a circularity of the detected nucleus.
It is further appreciated that certain features described herein, which are, for clarity, described in the context of different aspects of the present disclosure and/or in separate embodiments, can also be provided in combination in a single embodiment. Conversely, various features which are, for brevity, described in the context of a single aspect of the present disclosure and/or in a single embodiment, can also be provided separately or in any suitable subcombination.
For a more complete understanding of the principles disclosed herein, and the advantages thereof, reference is made to the following descriptions taken in conjunction with the accompanying drawings, in which:
The titles, headings and subheadings provided herein should not be interpreted as limiting the various aspects of the disclosure. Accordingly, the terms defined below are more fully defined by reference to the specification in its entirety. All references cited herein are incorporated by reference in their entirety.
Unless otherwise defined, scientific and technical terms used herein shall have the meanings that are commonly understood by those of ordinary skill in the art. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. In this application, the use of “or” means “and/or” unless stated otherwise. In the context of a multiple dependent claim, the use of “or” refers back to more than one preceding independent or dependent claim in the alternative only.
It is further noted that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the,” and any singular use of any word, include plural referents unless expressly and unequivocally limited to one referent. As used herein, the term “include” and its grammatical variants are intended to be non-limiting, such that recitation of items in a list is not to the exclusion of other like items that can be substituted or added to the listed items.
The instant invention is most clearly understood with reference to the following definitions.
The term “about” is used herein to mean approximately, in the region of, roughly, or around. When the term “about” is used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. In general, the term “about” is used herein to modify a numerical value above and below the stated value by a variance of +/−10%. As used herein, the term about refers to a numeric value, including, for example, whole numbers, fractions, and percentages, whether or not explicitly indicated. The term about generally refers to a range of numerical values (e.g., +/−5-10% of the recited range) that one of ordinary skill in the art would consider equivalent to the recited value (e.g., having the same function or result). When terms such as at least and about precede a list of numerical values or ranges, the terms modify all of the values or ranges provided in the list. In some instances, the term about may include numerical values that are rounded to the nearest significant figure.
As used herein, the terms “comprising” (and any form of comprising, such as “comprise”, “comprises”, and “comprised”), “having” (and any form of having, such as “have” and “has”), “including” (and any form of including, such as “includes” and “include”), or “containing” (and any form of containing, such as “contains” and “contain”), are inclusive or open-ended and do not exclude additional, un-recited elements or method steps. Additionally, a term that is used in conjunction with the term “comprising” is also understood to be able to be used in conjunction with the term “consisting of” or “consisting essentially of.”
The term “tissue” as used herein refers to any type of tissue in human or animals, and includes, but is not limited to, vascular tissue, skin tissue, hepatic tissue, pancreatic tissue, neural tissue, urogenital tissue, gastrointestinal tissue, skeletal tissue including bone and cartilage, adipose tissue, connective tissue including tendons and ligaments, amniotic tissue, chorionic tissue, dura, pericardia, muscle tissue, glandular tissue, facial tissue, ophthalmic tissue.
As described herein, any concentration range, percentage range, ratio range or integer range is to be understood to include the value of any integer within the recited range and, when appropriate, fractions thereof (such as one tenth and one hundredth of an integer), unless otherwise indicated. Ranges are approximate and may vary by more than an integer.
Units, prefixes, and symbols are denoted in their Systeme International de Unites (SI) accepted form. Numeric ranges are inclusive of the numbers defining the range. Measured values are understood to be approximate, taking into account significant digits and the error associated with the measurement.
It is further appreciated that certain features described herein, which are, for clarity, described in the context of separate embodiments, can also be provided in combination in a single embodiment. Conversely, various features which are, for brevity, described in the context of a single embodiment, can also be provided separately or in any suitable subcombination.
“Area”—is reported in μm2. The nuclear area is larger for thymic epithelial cells than for thymocytes. Cells undergoing apoptosis are also likely to be smaller.
“Circularity”—is a measure of how circular the cells are. Circularity is measured on a scale of 0 to 1 with 1 being a perfect circle. Thymocytes have increased circularity compared to thymic epithelial cells. Non-viable cells have reduced circularity compared to viable cells. As such, circularity may be expected to decrease over culture time course, as both thymocytes are reduced and more non-viable cells can be expected to be observed in the tissue slices. Degraded samples are also expected to have decreased circularity as there would be more non-viable cells thereby shifting the distribution toward lower circularity.
“Integrated Density”—represents how dark a nucleus is stained. Integrated density is high for thymocytes which show uniformly dark staining. Thymic epithelial cells have dark-stained rims ad mostly clear nucleoplasm with a prominent dark nucleolus.
“Perimeter”—represents the outline of the nuclei that are detected and is reported in μm. The perimeter is related to cell viability; as cells degrade, the nuclear outline becomes irregular and its perimeter increases. Perimeter would also increase as the proportion of TE cells to total cells increase as culture progresses. Perimeter changes are expected over time in culture as well as with degradation of tissue.
Overview of the Quantitative Histology MethodThe quantitative histology method developed is an image based algorithm that clusters like images based on properties that were determined to have statistical and biological relevance to allogeneic cultured postnatal thymus tissue-derived product. Scanned H&E histology slides are created. The slide is uploaded into the validated thymus tissue analysis software as either an SCN or TIFF image. If the file uploaded is a SCN image, the algorithm will convert it into a TIFF image for analysis. The red channel of the image is extracted and then inverted such that the nuclei that are highlighted with the eosin stain are now black shapes on a white background.
The area, perimeter, integrated density (how dark the shape is), and circularity are then measured for each nuclei. The frequency distributions for each of these attributes are then able to be compared to known good and bad samples in a database. A statistical clustering comparison is then performed for the attributes to determine if the new input sample is statistically similar to the known samples and thus can be determined as “passing” or “failing” per previously identified criteria.
A selection of clinical and R&D H&E slides of allogeneic cultured postnatal thymus tissue-derived product were scanned at 40× or 20× magnification. The images were then uploaded for development of a quantitative histology method.
To quantify the slides, the images were first analyzed such that attribute data could be extracted from the images. To achieve this, the images were converted into the TIFF FGP format and then processed by an image processing algorithm through ImageJ where the images were calibrated to 1 μm/pixel, the red channel is extracted, and then the red channel is inverted such that the darker stained nuclei result in higher pixel intensity. Refer to
Thresholds were determined and set to appropriate values for selection of nuclei to ensure image analysis is consistent from image to image. Images were analyzed for particles (cell nuclei), here defined as contiguous regions of pixels, exceeding 10 μm2 in area. Parameters were then extracted for each particle including area, perimeter, width etc. (refer to Table 1 for full list of parameters evaluated).
Features that were determined to be of statistical significance through principle component analysis in the tissue population initially analyzed were area, perimeter, integrated density, and circularity. Other features were found incapable in aiding to distinguish between samples and were subsequently no longer analyzed.
Once the four parameters have been recorded for each nuclei, a frequency distribution is created for each parameter to show the distribution of nuclei over the entire slide. There are generally more than 100,000 data points for each slide. The bin width and cut-off for each parameter within a distribution were determined by selecting the fewest bins that still showed variability between the slides. The proportion of nuclei in each bin is determined and those values are used for the clustering analysis.
Overview of Quantitative Analysis and ClusteringThe quantitative analysis described herein is an unbiased/emergent approach to digital pathology. Both the training phase and the classification phase include computerized feature extraction to generate a feature fingerprint for each image. In an embodiment, the feature fingerprint represents the underlying nuclear features of each image. Utilization of cellular feature extraction enables quantitative characterization of the underlying cell population within the tissue that represented in each slide image. Both phases also include a statistical hierarchical agglomerative clustering technique to classify histology sections by quantitative features describing each cell within a slide image. The clustering analysis categorizes the slide images into different groups based on similarities between the fingerprints generated for each slide image. In an embodiment, hierarchical agglomerative clustering is used within the context of image analysis for both training a tissue classifier and utilizing the trained tissue classifier for determining potency of unknown tissue. Examples of clustering embodiments of the disclosure are set forth below in the specification, Figures and Examples. For example, and not by way of limitation, the clustering techniques described in connection with
Hierarchical agglomerative clustering, in the context of tissue classification, includes assembling clusters directly from data in order to reveal emergent properties of the underlying dataset. In the classification phase, according to an embodiment, the clustering technique allows for classification of unknown tissue based on the similarity of generated feature fingerprints of the unknown tissue to previously clustered feature fingerprints of known tissue (e.g., from a library of slide images). The clustering technique also relies on analysis of the relative height of a cluster dendrogram (see
The training and classification phase may include one or more of the following steps: suitability determination, image processing, feature extraction, and clustering.
Suitability DeterminationSuitability determination refers to assessing a slide image's characteristics to determine whether the slide image is suitable for further quantitative analysis as described in the disclosed embodiments. Suitability determination may be performed both for a training set of images as well as for new slide images. In an embodiment, suitability determination of a slide image includes metadata analysis, background pixel analysis (e.g., examination of the amount of background pixels present in the slide image), and tissue entropy (and nuclei segmentation) analysis (e.g., examination of the amount of entropy in the slide image).
Metadata AnalysisMetadata analysis of slide images determines whether a slide image has the appropriate metadata properties for the quantitative analysis. Examples of metadata that may be considered include but are not limited to: filename (e.g., whether the filename is unique), last modified date (date the file was last modified), file size (e.g., the size of the file in bytes), format (e.g., the file type such as TIF), image width (e.g., a value containing the width of the slide image in pixels), image height (e.g., a value containing the height of the slide image in pixels), bit depth (total number of bits for color channels in the slide image), color type (color type of the image, e.g., RGB), x-resolution (e.g., a value representing the resolution of the slide image in the X-direction), y-resolution (e.g., a value representing the resolution of the slide image in the Y direction), resolution units (e.g., a string containing the units of the x-resolution and y-resolution properties), image background ratio (ratio of the amount of background pixels to the total number of pixels), background label (label describing the amount of background pixels in the image), tissue entropy (entropy describing only the issue pixels segmented), and nuclei segmentation label (label describing the success of the nuclei segmentation analysis).
In an embodiment, a slide image may be considered suitable for quantitative analysis based on an analysis of one or more metadata properties described above. For example, in an embodiment, if all the above fields are present and readable within the header of the slide image, and the slide image has an x-resolution between 0.8 μm-1.2 μm and a y-resolution between 0.8 μm-1.2 μm, then slide image is suitable for quantitative analysis. In another embodiment, if any fields are missing or corrupted, the slide image may be excluded from quantitative analysis.
Background AnalysisAnalysis of background pixels is a step to ensure that the image of the tissue is suitable for further analysis that requires a view of the tissue to be analyzed on a background. In an embodiment, quantitative analysis utilizes a standardized amount of background pixels in a slide image to ensure accurate segmentation takes place during tissue segmentation and nuclei segmentation. Images with too many background pixels (empty images) or images with not enough background pixels (cropped images) will perform poorly during the segmentation analysis (described below) and therefore the range for background pixels must be determined such that images outside of this range are screened out prior to continuing with quantitative analysis of the disclosed embodiments.
The results of the background analysis may result in associating a background label (e.g., pass or fail) with the slide image. For example,
Exemplary steps of the background analysis will now be discussed. The image background for slide images will be classified using classification intervals based on the ratio of the amount of background pixels to the total number of pixels. For example, empty images (e.g.,
The background analysis is performed for images within a training set (e.g., to generate the appropriate ranges for values that are considered to be suitable for quantitative analysis), for images in a control set (e.g., to determine whether the background values of images in the control set fall within the generated appropriate ranges) and for new unspecified images. In an embodiment, determining the image background ratio classification for slide images includes but is not limited to the following steps.
In an embodiment, determining the image background ratio classification for slide images for the training set includes but is not limited to the following steps:
1. Calculate the image background ratio for all images in the training set.
-
- a. Calculate the normalized image histogram counts.
- b. Use an inversion function on all image pixels from the red color channel to segment tissue pixels from background pixels (e.g., otsu_dark function in application ImageJ)
- c. Generate histogram counts from all image pixels with N bins where N=2Bit depth of slide image
- d. Divide all histogram counts by the total number of pixels in the image to normalize the data.
- e. Calculate the proportion of pixels greater than the threshold. This proportion represents the image background ratio.
2. Calculate classification interval of image background ratio for images that are considered to have an appropriate number of background pixels (e.g., pass).
-
- a. Calculate the mean image background ratio and standard deviation for each of the first, second, and third background classes.
- b. Calculate the upper bound of the classification range.
- i. Calculate the midpoint between the mean image background ratio for the first and second background classes. This value may be assigned as the upper range bound of the accepted classification interval.
- c. Calculate the lower bound of the classification range.
- ii. Calculate the midpoint between the mean image background ratios for first, second and third background classes. This value is assigned as the lower range bound of the accepted classification interval.
In an embodiment, the results of the foregoing steps performed on a training class of slide images produce an acceptable background range. Slide images having background values within this acceptable background range may be considered suitable for further quantitative analysis. The acceptable background range may then be applied to classify new images outside of the training class, such as images within a control library. For example, new slide images determined to have a background ratio within the background range calculated for images with the first background class (e.g.,
Nuclei segmentation analysis ensures that nuclei pixels are suitably separated from other pixels in the slide image. Feature fingerprints of slide images are based, in part, on nuclei characteristics, so correctly segmented nuclei pixels are necessary for the quantitative analysis in embodiments. The results of the nuclei segmentation analysis may result in associating a nuclei segmentation label (e.g., pass or fail) with the slide image.
Nuclei segmentation analysis includes evaluating entropy of tissue pixels to ensure enough entropy exists such that accurate nuclei segmentation will take place. Image entropy, like thermodynamic entropy, corresponds to the number of states in a system. An image that has many different pixels values evenly distributed amongst the image has a high number of states, and therefore, a high entropy. An image with pixel values unevenly distributed amongst the image will have a low number of states, and therefore, a low entropy. Evaluation of entropy of tissue pixels in a slide image ensures proper contrast and sharpness exists in the image, which aid in the quantitative analysis of the slide image. Images with a low tissue contrast will have lower entropy compared to images with a normal amount of tissue contrast. The number and arrangement of pixels per nuclei corresponds to their “features” and for generation of feature fingerprints that are used to cluster the tissue samples. These features will vary from nuclei-to-nuclei in the slide image. In an embodiment, images that have passed the two previous steps (metadata analysis, image background) will proceed to this tissue entropy analysis.
When applied to a training set, the tissue entropy analysis creates a range for the accepted tissue entropy within an image. In an embodiment, tissue entropy analysis may include, but is not limited to, the following steps:
1. Segment all tissue pixels from background pixels.
-
- i. Use an inversion function on all image pixels from the red color channel to segment tissue pixels from background pixels (e.g., otsu_dark function in application ImageJ).
- ii. Calculate the entropy of all tissue pixels using an entropy equation and save the corresponding values. An example of an entropy equation is shown below where ET is the tissue entropy for all tissue pixels, p is the normalized histogram counts of the tissue pixels in bin i, and N is the number of histogram bins used in the histogram:
-
- iii. Create a binary mask of all segmented nuclei within the slide image (e.g., applying function otsu_dark with all tissue pixels). A binary mask corresponds to a particular image and points to the pixels that will be used. Binary masks are usually created when segmentation is performed. For example, a binary nuclei mask that corresponds to an H&E image would be a matrix with the same dimensions as the original image where the nuclei mask is only 1-bit and only contains predetermined values (e.g., 0 or 1). Pixel locations that match the location of nuclei in the original image will have a predetermined value (e.g., 1), where non-nuclei pixels will have a predetermined value of (e.g., 0).
2. Create Nuclei Segmentation labels for all images. This label describes the success of the nuclei segmentation performed in the image processing routine described above with respect to tissue entropy analysis. Nuclei segmentation is the separation of nuclei pixels from all other pixels in the original slide (e.g., H&E) image. Nuclei segmentation is performed so the object analysis is performed only on cell nuclei in the image.
-
- i. Determine accuracy of the nuclei segmentation. For example, this step may be performed using an application and comparing to an image key that provides examples of accurate and inaccurate segmentation.
1. Overlay the binary nuclei mask on the red channel image using a certain opacity value (
2. Examine a predetermined number of nuclei (e.g., 50) for correct segmentation that is defined as masks being properly overlaid on top of nuclei.
-
- a. If more than or equal to a certain number of nuclei (e.g., 45) of the predetermined number of nuclei are correctly segmented, the image may be given a first Nuclei Segmentation label (e.g., “0”) (
FIG. 4A ). - b. If less than the certain number of nuclei is correctly segmented, the image may be given a second Nuclei Segmentation label (e.g., “1”) (
FIGS. 4B , C).
- a. If more than or equal to a certain number of nuclei (e.g., 45) of the predetermined number of nuclei are correctly segmented, the image may be given a first Nuclei Segmentation label (e.g., “0”) (
3. Calculate the classification interval of tissue entropy for images with the first Nuclei Segmentation label.
-
- i. Calculate the mean tissue entropy and standard deviation for each of the images having the first and second Nuclei Segmentation labels.
- ii. Calculate the lower bound of the classification range.
1. Calculate the midpoint between the mean tissue entropy for the first and second Nuclei Segmentation labels. This value is assigned as the lower bound of the tissue entropy range.
2. The accepted range of tissue entropies may span from the lower bound of the range calculated in step 3.ii.1 to infinity.
In an embodiment, the results of the foregoing steps performed on a training class of slide images produce an acceptable classification range for tissue entropy in a slide image. The acceptable classification range may then be applied to classify new slide images. Slide images with a tissue entropy value greater than the lower bound of the classification range (see step 3.ii.1) are classified as having an appropriate amount of tissue entropy and are accepted for analysis using this method. Slide images with a tissue entropy value that are less than the lower bound of the classification range are rejected and will not move forward for further quantitative analysis.
The image processing feature is discussed further below with respect to analyzing hematoxylin channels of slide images since this channel depicts nuclear features of each cell. In an embodiment, the hematoxylin image is separated from the eosin image. In an embodiment, the image processing step also may include, as shown in
In an embodiment, the feature extraction step includes extracting features for each nucleus detected in the slide image and generating a fingerprint for each of the extracted features. In an embodiment, features are represented as numerical feature values. Table 1 below lists features that, for example and without limitation, are detectable and therefore capable of being extracted for each nucleus. One or more features may be extracted for each nucleus. For example, certain features may be more relevant for the purposes of different potency applications. That is, certain features may be determined to vary significantly between passing and failing tissues.
In an embodiment, when the tissue to be classified is thymus tissue (which may or may be part of the assay), the relevant features extracted are area of each detected nucleus, perimeter of each detected nucleus, integrated density of each detected nucleus, and circularity of each nucleus. With regard to area, the nuclear area is larger for thymic epithelial cells than for thymocytes. The perimeter is related to cell viability, as cells degrade, the nuclear outline becomes irregular and its perimeter increases. Integrated density is high for thymocytes, which show uniformly dark staining. Thymic epithelial cells have dark-stained rims and mostly clear nucleoplasm with a prominent dark nucleolus. Thymocytes have increased circularity compared to thymic epithelial cells. Non-viable cells have reduced circularity compared to viable cells.
In an embodiment, the integrated density is represented by the size of the detected nucleus and a darkness value associated with the detected nucleus. The circularly represents an assessment of the circular shape of the detected nucleus and, in an embodiment, may be represented by a range (e.g., 0.0 to 1.0 where 1.0 represents a perfect circle).
Determination of the feature fingerprint for a slide image according to an embodiment is now discussed. Initially, a numerical fingerprint is created for each of the extracted features. For example, in the embodiment described above, a feature fingerprint is generated for each the area, perimeter, integrated density, and circularity of nuclei within slide images. In an embodiment, histograms are generated for each of the extracted features (e.g., area, perimeter, integrated density and circularity). Each histogram shows the frequency of results in specific ranges (bins) for a feature. A feature fingerprint is generated from the combined histograms for each slide image. Clustering (discussed below and in the Examples set forth below) is based on the statistical analysis of the extracted features. Histograms are shown in
In an embodiment, clustering may include assembling clusters based on feature fingerprints of a slide image. In an embodiment, a Euclidian distance matrix for the generated feature fingerprint is tabulated. The distance matrix may then be used for hierarchical agglomerative clustering by looking for the optimum arrangement of slide images such that the within-group variance of slide images is minimized.
In an embodiment, feature fingerprints may be represented by a numerical value for each slide image generated from the combined histograms of extracted features as noted above. In the training phase, hierarchical agglomerative clustering is applied to the feature fingerprints of the slide images from a library of control images with known datasets such as Duke (Markert Lab), Forced Degradation (CT2), and Manufacturing (CT2) specimen slides. In an embodiment, clustering also includes performing statistical analysis to determine a cutoff height at which to segregate clusters into groups to ensure statistical significance between slide images within each group. In the classification phase, unknown samples are classified by co-clustering with the slide images from the library. The unknown samples are then classified based on the group within the cluster in which the unknown samples are co-clustered.
In an embodiment, the clustering step may include segregation of a cluster into groups. A cutoff height must be calculated by which to perform the segregation in a statistically significant manner. In an embodiment, the cutoff height is calculated through a systematic increase of a cut-off height with corresponding multiple analysis of variance (MANOVA) between and amongst groupings and the selected cutoff height minimizes the number of groups while maximizing the statistical significance of differences in feature fingerprints between the group populations. In an example experimental test where the tissue was thymus tissue and the extracted features included area, perimeter, integrated density, and circularity, a cutoff height (e.g., 0.4) was determined for segmentation of clusters which resulted in segregating the clusters into a certain number of groups (e.g., 9).
In an example experimental test, after groups were formed, groups were assigned as a positive control group if the population of samples within the group consisted of slide images from samples previously determined to have passed the potency criteria (i.e., “good” samples having a pass classification). Examples of these groups include the Duke and manufacturing datasets. Groups were assigned as a negative control group if the population of samples within the group consisted of slides previously determined to have failed the potency criteria (i.e., “bad” samples having a fail classification). Examples of these groups included the forced degradation dataset.
In an embodiment, the following description of the control groups in
Regarding the positive control groups shown in
Tables 2 and 3 below illustrate example results of a statistical analysis of forming groups after performing a co-clustering analysis. In an embodiment, MANOVA analysis is performed. Table 2 illustrates that groups based on a co-clustering analysis resulted in populations between the clustered groupings differing significantly from each other.
Table 3 illustrates that groups of randomly generated groups (i.e., without performing co-clustering) resulted in no statistical significance between the populations of each group.
In an embodiment, after the training phase, the classifier has established segmented groupings of the slide images from the library. The groupings are separated into positive control groups and negative control groups, where the positive control groups include slide images that have been previously determined to be candidates for transplantation (i.e., pass classification) and the negative control groups include slide images that have been previously determined to not be candidates for transplantation (i.e., fail classification).
In the classification phase, a sample slide image to be classified is analyzed to assess the potency of the tissue in the slide image. The slide image is processed in preparation for the analysis. In an embodiment, this includes conversion of the image into a binary image. Feature detection is next performed. In an embodiment, a hematoxylin channel (i.e., nuclei) is extracted from the slide image. Features are determined for the extracted channel. A feature fingerprint is generated based on the features of the extracted channel. In an embodiment, the feature fingerprint may be represented by a histogram as shown in
The generated feature fingerprint may then be compared with feature fingerprints of positive and negative control groups that were generated during the training phase. In an embodiment, the comparison step includes co-clustering the generated feature fingerprint with the feature fingerprints of the positive and negative control groups.
After co-clustering, statistical analysis (e.g., MANOVA) may be performed to evaluate the formed clusters for determining whether the generated feature fingerprint clusters correlate in a statistically significant manner with any of the positive or negative control groups. If the results of the statistical analysis indicate a lack of statistically significant clustering or that generated feature fingerprint clusters with a negative control group, the slide image is either considered to have failed the potency criteria and may be associated with a fail classification or it may be assessed qualitatively for disposition. If the results of the statistical analysis indicate clustering with a positive control group, the slide image is considered to have passed the potency criteria and may be associated with a pass classification.
Table 4 below describes corresponding qualitative characteristics of tissue from slide images in negative control groups and positive control groups in an experimental embodiment when the classification phase was applied to a library of slide images of thymus tissue.
Table 5 illustrates characteristics corresponding between negative control groups (labelled Negative 1-5) and positive control groups (labelled Positive 1-6).
In an embodiment, clustering is based on composition and features of nuclei for all cells within slide images. For thymus cells, thymocytes may be a contributing factor for the analysis, as they comprise a majority of population nuclei. Medulla and cortical thymocytes contribute differently to the feature fingerprints based on the features used for the fingerprints. Quantitative differences in the cell population lead to quantitative differences in the feature fingerprints that lead to differences in clustering. Thymocyte composition portrays tissue health as significant variations in composition indicate necrosis, fibrosis, and general degradation. Clustering groups samples that share a similar fingerprint pattern, and hence share similar features of their cellular populations. Sample images whose fingerprints differ statistically significantly will be located in different groups.
The disclosed embodiments for quantitative analysis and feature fingerprint generation enable a classifier to detect shifts in cell population, because feature fingerprinting evaluates every detectable nuclei in a slide image. During culturing, the overall number of thymocytes should reduce in tissues. However, the relative proportion of cortical to medullary thymocytes should stay consistent. Significant alterations to these proportions can indicate uneven tissue degradation or compromise of potency. Significant changes in cell population will be detected as shifts in the quantitative feature fingerprints. Significant change in corticalmedullary thymocyte ratio (increase or decrease) will cause a shift in feature fingerprints toward negative control groups, and hence the sample will cluster with negative control samples.
In a preferred embodiment of the present invention, the methods of performing quantitative histopathological assessment of an unclassified slide image of a tissue and methods of training a tissue classifier to perform quantitative histopathological assessments can be performed in the following exemplary manner.
A culturing time period for the tissue to be examined is selected based on results constituting positive clinical outcomes following the implantation of a desired tissue-engineered product. With regard to allogeneic cultured post-natal thymus tissue-derived product, the culturing time period allows the tissue to deplete thymocytes either through washing them out of the tissue and/or through apoptosis. TECs in the cultured thymus tissue are substantially maintained throughout the culturing time.
An assay examines a sample taken from between days 5-9 of culture to assess the suitability of the cultured thymus tissue for release and transplantability. To help understand the product better and to better assess the discerning capabilities of the assay, samples are taken from days 1 to 21 and beyond, data on which can be assessed to determine apparent trends.
For example, batches of clinical cultured thymus tissue samples and a degraded thymus tissue sample may be examined for histopathological features such as area, perimeter, integrated density, and circularity. The overall trend in the number of nuclei detected are examined, this is expected to decrease over time as thymocytes wash out of the tissues or undergo apoptosis. These data are then normalized for tissue area as different slices from multiple lots are examined on the different days. Tissue section size can differ between slices and can cause variability in the data set, but a general negative trend can be seen with increasing culture days, with a visual step change in the data around day 10. Previous data has shown that the majority of thymocytes are depleted earlier in the culture period when compared to the decrease in the total number of cells. The cultured tissue samples may then be analyzed in time course studies to assess, for example, inter-tissue variability. For illustrative purposes, the process will be described for assessing inter-thymus tissue variability. This will include an assessment of thymic epithelial cells and thymocytes. The data obtained in the inter-thymus variability step will be subjected to hierarchal cluster analysis. A training set of data will be established and then a group by group analysis of the cultured thymus tissue will be determined. These data will be compared to data obtained in either a forced degradation study of the tissue and/or by comparing the group by group determination to s set of data associated with failed specimens with negative clinical outcomes.
The general appearance of the tissue, and consequently the histology slides, changes over the course of the selected culture period, for example at 0, 5, 9, 12 and 21 days for cultured thymus tissue. Again, in the illustration of cultured thymus tissue, as thymocytes are washed out of the cultured thymus tissue. The tissue at Day 5, 9, 12 and 21 shows a marked decrease in integrated density and a profile more similar to the profile for thymic epithelial cells. The number of cells with very high circularity diminishes over time throughout the culturing process. This is likely due to apoptosis resulting in nuclei that are less circular as well as washing out of the very circular thymocytes. For samples at Day 0 with lower circularity it is likely due to clumping of thymocytes being measured as a single entity with lower circularity than a single nuclei. At Day 0 there are a large proportion of cells with high perimeters, which is likely due to clumps of cells being read in the program as a single shape resulting in the large perimeter values. The increase in perimeter is likely a combination of the thymocyte washing out as well as cells undergoing apoptosis over culture time and a resulting increase in perimeter from that event. Overall, nuclear characteristics over the course of culture time aligns with theoretical expectations for trends. Most of the apparent shifts in the data are prior to the intended release days of the lots. This suggests that many of the changes occur during the initial days of culture, after which the environment is able to be sustained for up to 21 days. This is supportive of the assay to detect true trends in the cell populations and how they change over time.
Inter-thymus variability is examined to better understand if samples are similar between thymuses. Inter-thymus variability may be assessed in a similar manner to intra-thymus variability where samples examined are from a single day, but instead of being restricted to a single thymus, the assessment includes samples across lots. The Euclidean distance of each sample can be calculated to the center point of all samples within the isolated data set. ANOVA's are then performed on the distances to center by thymus. Such an analysis is performed on samples on Days 5 and 9. Inter-thymus variability may be examined to better understand if samples are similar between thymuses. Ninety-five percent confidence intervals at each day are generated per thymus. A more specific analysis of only thymic epithelial cells is performed to help characterize the allogeneic, cultured postnatal thymus tissue-derived product. Thymic epithelial cells are hypothesized to be critical for the mechanism of action of the allogeneic, cultured postnatal thymus tissue-derived product. A pathologist selects nuclei for cells identified as thymic epithelial cells. The nuclei of thymic epithelial cells differ from other cells in the tissue population. TECs are generally larger, and have a nucleolus present, which appears as a darker purple dot within the center of the lighter purple outer nucleus. Known marked cells are then extracted as individual data points from the software. Using this data, a filter can be devised to enable the following steps to be performed in sequential order. Two size filters remove any cells outside of the size range window both pre- and post-splitting of conjoined cells. The following filtering steps were carried out.
The darkest cells are removed from the dataset by defining threshold of inclusion below darkest pixels. Cells below 50 μm in area are then removed. Holes are filled, and watershed applied to split conjoined cells. Cells outside a range of 30-250 μm2 and circularity <0.75 are then filtered out of the dataset. The foregoing filters permit analysis of images with data sets generated and restricted to characterize the TEC cells. When examining the total proportion of cells in the tissue over the course of the culture period, there is a general increase in TECs. The increase in TECs is because, as the thymocytes are washed out of the tissue, a larger number of the remaining cells are TECs. The foregoing analysis of TECs also provides a better view of the trends of the thymocytes washing out of the tissue through the culture period. A similar analysis can be inferred from the data, that the number of thymocytes is reduced during the course of the culture as the ratio of TECs increase. The foregoing data may be used to show the samples broken into groups. The groups are generated through hierarchal cluster analysis. This analysis systematically and statistically identifies samples with similar characteristics by iteratively grouping data based on similarities resulting in meaningful clusters of data of similar properties (referred to as “groups”). The process for grouping cells required the following steps:
The distance between groups is calculated. Distance is a measure of similarity between groups. The cost: of joining two groups is calculated. The cost here is how much error is added by joining the groups. The groups that have the least merging cost are joined. The process is repeated until all data are joined into one group. The resulting data set essentially shows a family tree of how “related” or similar the samples are to each other. The height between the branches shows how related two groups are to each other. Distance on the horizontal x-axis does is not indicative of any closer relationship when graphed. Samples within each group are considered to be statistically similar, while those in different groups are considered statistically different.
To determine where the cut height is between the different groups, a Scree Plot can be used to examine where the distances between the groups are most significant. This ensures that minimal differences between samples do not overly influence the algorithm. Too small of differences will result in more likely fracturing of future samples into independent groups as the samples have to be too similar to cluster together than is realistic for allogeneic, cultured postnatal thymus tissue-derived product. Alternatively, the groups must have an appropriate cut off to ensure that there is differentiation between samples. In the case of allogeneic, cultured postnatal thymus tissue-derived product, there is high likelihood of heterogeneity due to the nature of the tissue and the lot-to-lot variability that can present itself. By examining samples that showed previous positive clinical outcomes (here defined by survival) and comparing to tissues that were degraded, the appropriate level of differentiation can be determined.
A training set may be selected that contains the most information relating to clinical outcomes. Initially, samples are included from various representative R&D studies as well as multiple forced degradation conditions and clinical samples. These data may be clustered into good and bad groups. The training set may be deemed most informative if restricted to known “good” and known “bad” samples that are representative of future in-process samples to be examined for lot release. This step restricts the samples in the following manner.
Samples may be removed if they were not from the mid-point or Day 5-9 of the culture period. In the example of allogeneic cultured thymus tissue-derived tissue, this is when in-process samples for lot release based on quantitative histology may be taken. Samples may also be removed if they were from R&D lots that are believed to be representative but there are no associated clinical outcomes to examine. Furthermore, samples may be removed if they were associated with negative clinical outcomes. Samples may also be removed from the forced degradation arm if orthogonal methods were unable to confirm degradation.
The underlying data may be grouped to better understand what underlying features result in the various clusters. Groups may be associated with positive clinical outcomes and forced degraded samples and/or tissues associated with negative clinical outcomes. Different parameters may drive the differentiation of the groups with positive clinical outcomes (e.g. mid-sized area, high circularity, and high perimeter) from forced degraded samples and/or negative clinical outcomes.
For example, a group may be characterized by a larger proportion of nuclei that have large perimeters, high integrated densities, and high area with lower circularity. This may be due to the presence of clumps of nuclei that cannot be read as independent cells by the software. This would more likely occur in tissues that still have larger number of thymocytes present in the earlier days of the culture period.
Additional groups may be defined that have positive clinical outcomes. For example groups may be designated by having large numbers of cells with high circularity values and/or higher area when compared to other groups and/or mid-range integrated density. These values are expected for healthy viable tissues in the mid-range of the culture period. Histograms of the measurements of area, circularity, integrated density and perimeter may be constructed from the data.
For comparison a forced degradation study may be conducted where the samples are placed back in thymus organ medium (TOM) prior to the samples being removed on days 5 and 9. This shows how hardy allogeneic cultured postnatal thymus tissue-derived product is to variations in process conditions. Not all culturing condition may cause degradation detectable via any of the measures used. It is believed that these conditions do not permanently damage the tissue to a point where it was either not functional or not viable. As would be expected of degraded samples, there are large proportions of cells with lower circularity, and low area and higher perimeters. This shows cells that are no longer able to maintain viability and are therefore changed in morphology and shrinking with shriveled edges. To show the similarity within a group, analysis of the variability for each group is examined; “good” and “bad” (pass versus fail) classifications may be determined. Using the foregoing steps, the analysis of tissue specimens associated with pass and fail characteristics may be assessed.
EXAMPLESAllogeneic, cultured postnatal thymus tissue-derived product (e.g., “RVT-802”) is typically cultured from 12 to 21 days prior to implantation into the recipient. This culturing time period historically has been demonstrated to result in positive clinical outcomes following implantation of the tissue-engineered product. Not intending to be bound by theory, this culturing time period is believed to allow the tissue to deplete thymocytes either through washing them out of the tissue or through apoptosis. Thymic epithelial cells are maintained through the culture time.
The validated assay examines a sample taken from between days 5-9 of culture to assess the suitability of the cultured thymus tissue for release. To help understand the product better and to better assess the discerning capabilities of the assay, samples were run through the software program taken from days 1 to 21 and beyond and the resulting data were compared to each other to assess what trends were apparent.
The overall trend in the number of nuclei detected was also examined, this is expected to decrease over time as thymocytes wash out of the tissues or undergo apoptosis which can be observed in
The general appearance of the tissue, and consequently the histology slides, changes over the course of the culture period. Images presented in
Overall, nuclear characteristics over the course of culture time aligns with theoretical expectations for trends. Most of the apparent shifts in the data are prior to the intended release days of the lots. This suggests that many of the changes in occur during the initial days of culture, after which the environment is able to be sustained for up to 21 days. This is supportive of the assay to detect true trends in the cell populations and how they change over time.
Example 2: Inter-Thymus Variability StudyInter thymus variability was examined to better understand if samples are similar between thymuses. Inter thymus variability was assessed in a similar manner to intra thymus variability where samples examined are from a single day, but instead of being restricted to a single thymus, include samples across lots. The Euclidean distance of each sample was calculated to the center point of all samples within the isolated data set. ANOVA's were then performed on the distances to center by thymus. This analysis was performed on samples on Days 5 and 9.
Inter thymus variability was examined to better understand if samples are similar between thymuses. Inter thymus variability was assessed in a similar manner to intra thymus variability where samples examined are from a single day, but instead of being restricted to a single thymus, include samples across lots. The Euclidean distance of each sample was calculated to the center point of all samples within the isolated data set. ANOVA's were then performed on the distances to center by thymus. This analysis was performed on samples on Days 5 and 9.
From Table 6 below, there is no detectable differences between lots at Day 9. There are slight statistical differences between thymuses at Day 5. It is hard to determine if these result in practical differences that result in differences in product quality. To assess this in a different manner, 95% confidence intervals at each day were generated per thymus. As can be seen in
A more specific analysis of only thymic epithelial cells was performed to help characterize the allogeneic, cultured postnatal thymus tissue-derived product. Thymic epithelial cells are hypothesized to be critical for the mechanism of action of the allogeneic, cultured postnatal thymus tissue-derived product. Images were sent to a pathologist. The pathologist selected nuclei for cells identified as thymic epithelial cells (refer to
Known marked cells were extracted as individual data points from the software. Using this data, a filter was devised to enable the following steps to be performed in sequential order. Two size filters remove any cells outside of the size range window both pre- and post-splitting of conjoined cells. The following filtering steps were carried out.
The darkest cells were removed from the dataset by defining threshold of inclusion below darkest pixels (included pixels between 100-130 to 150-180 pixel intensity dependent on staining intensity).
Cells below 50 μm2 in area were removed.
Holes were filled, watershed applied to split conjoined cells.
Cells outside range of 30-250 μm2 and circularity <0.75 were filtered out of the dataset.
The foregoing filters permitted analysis of images with data sets generated and restricted to characterize the TEC cells. When examining the total proportion of cells in the tissue over the course of the culture period, there is a general increase in TECs. The increase in TECS is because, as the thymocytes are washed out of the tissue, a larger number of the remaining cells are TECs. This change can be visualized in
The TEC analysis of Example 3 also gives a better view of the trends of the thymocytes washing out of the tissue through the culture period. A similar analysis was not performed for the thymocytes, but it can be inferred from the above data, that the number of thymocytes is reduced during the course of the culture as the ratio of TECs increase. This change can also be visualized in images of the H&E stained slides at multiple magnifications (See, for example,
The data presented in Examples 1-4 shows the samples broken into 4 groups as illustrated in
The distance between groups is calculated. Distance is a measure of similarity between groups.
The cost: of joining two groups is calculated. The cost here is how much error is added by joining the groups.
The groups that have the least merging cost are joined.
The process is repeated until all data are joined into one group.
The resulting data set essentially shows a family tree of how “related” or similar the samples are to each other. The height between the branches shows how related two groups are to each other. Distance on the horizontal x-axis does is not indicative of any closer relationship. Refer to
To determine where the cut height is between the different groups, a Screen Plot can be used to examine where the distances between the groups are most significant. This ensures that minimal differences between samples do not overly influence the algorithm. Too small of differences will result in more likely fracturing of future samples into independent groups as the samples have to be too similar to cluster together than is realistic for allogeneic, cultured postnatal thymus tissue-derived product. Alternatively, the groups must have an appropriate cut off to ensure that there is differentiation between samples. In the case of allogeneic, cultured postnatal thymus tissue-derived product, there is high likelihood of heterogeneity due to the nature of the tissue and the lot-to-lot variability that can present itself. By examining samples that showed previous positive clinical outcomes (here defined by survival) and comparing to tissues that were degraded, the appropriate level of differentiation can be determined.
Example 6: Training Set DeterminationIn a preferred embodiment, a training set was selected that contained the most information relating to clinical outcomes. Initially, samples were included from various representative R&D studies as well as multiple forced degradation conditions and clinical samples. This resulted in a clustering analysis that included many good and bad groups (refer to
Samples were removed if they were not from the mid-point or Day 5-9 of the culture period. This is when in-process samples for lot release based on quantitative histology will be taken.
Samples were removed if they were from R&D lots that are believed to be representative but there are no associated clinical outcomes to examine.
Samples were removed if they were associated with negative clinical outcomes. These samples were not believed to be “bad” as they were released from the facility based on the qualitative histology assay in use, and there were no cases where it was believed that the lot itself was the cause of the clinical outcome. These cases were removed because there is no definitive proof that they did work.
Samples were removed from the forced degradation arm if orthogonal methods were unable to confirm degradation. These samples were examined by a pathologist as well as spent media samples examined for cytokines and chemokines and any sample that was not confirmed to be “bad” by another method was left out of the final training set Allogeneic cultured postnatal thymus tissue-derived product has been shown to be hardy to a variety of conditions, and the analysis should not be skewed to detect for cases that still likely result in “good” tissue. Conditions that remained in the analysis set were from samples that were frozen at −20° C. or where the media was replaced with 10×PBS. For a full listing of degradation conditions and results refer to Table 7.
The remaining samples that were used in the final training set are those that are either from clinical cases with shown survival at 1 year or were from cases that were confirmed to be degraded R&D tissues. The resulting clusters are depicted in
This resulted in five clusters initially, four clusters with clinically good samples and one cluster with confirmed bad samples. One of the good clusters contained only two samples; it did not have high variability within itself and therefore would cause leverage on the model to cause shifts. So, when new samples would be examined, these two samples will more likely cause the other clusters to move around to allow those samples to be grouped appropriately. To test this, a systematic removal of each sample and subsequent re-clustering was examined. It was found that either one of these samples, if isolated, would cause the resulting samples to shift. As such, these samples were removed. It was determined that while this may result in additional samples that are similar to these two tissues to cluster independently, the overall result would be a more robust algorithm and training set to compare against. These samples may be used in conjunction with future samples to reassess the buckets at a later date when the data set can be expanded. The final training library that was validated in the software is depicted in
The underlying data for the four groups depicted in
A graphical representation of all the clusters next to each other is shown in
As can be seen by the data plotted in
Group 1. Group 1 is the largest cluster. It is comprised of 19 different tissues with positive clinical outcomes Group 1 is characterized by a larger proportion of nuclei that have large perimeters, high integrated densities, and high area with lower circularity. This is likely due to the presence of clumps of nuclei that cannot be read as independent cells by the software. This would more likely occur in tissues that still have larger number of thymocytes present in the earlier days of the culture period. When a blinded study was run, a sample from a Day 0 tissue clustered into Group 1, thereby lending support to the above theory. While it does not appear that this assay will reject tissues that have large numbers of thymocytes present, the samples in Group 1 had positive clinical outcomes when implanted at the end of the culture period.
Group 2.
Group 2 contains 10 lots of tissues that had positive clinical outcomes. Groups 2 and 3 both have large proportions of cells with high circularity values. Group 2 also appears to have cells with higher area when compared to groups 3 and 4, and mid-range integrated density. These values are expected for healthy viable tissues in the mid-range of the culture period as shown by the samples that form Group 2. Histograms of the measurements of area, circularity, integrated density and perimeter appear in
Group 3.
Group 3 is comprised of 6 lots that showed positive clinical outcomes. Like group 2, group 3 has a large proportion of cells with high circularity, showing both thymocytes and overall cellular viability. Group 3 has the highest proportion of cells of the smallest area and also in the lower range of integrated density (2nd and 3rd bins).
Histograms of the measurements of area, circularity, integrated density and perimeter appear in
Group 4.
Group 4 is comprised of 4 forced degraded samples. The degradation conditions shown here are for samples that have been frozen at −20° C. for 4 hours and samples where the culture media was replaced with 10×PBS for 24 hrs. Both of these conditions showed significant degradation per qualitative histology and significant changes when examining CCL21, which is a biomarker for TEC health. A table of all forced degraded conditions and the results for each method is included below.
Histograms of the measurements of area, circularity, integrated density and perimeter appear in
A forced degradation study was run where the samples were placed back in thymus organ medium (TOM) prior to the samples being removed on days 5 and 9. This shows how hardy allogeneic cultured postnatal thymus tissue-derived product is to variations in process conditions. It is interesting to note that heating the samples to 55° C. was not detectable by the present method of traditional histology. This is likely because heating the tissue essentially fixed it from a histopathological perspective. However, the degradation was apparent via CCL21 analysis. Many of the tested conditions did not cause degradation detectable via any of the measures used. It is believed that these conditions did not permanently damage the tissue to a point where it was either not functional or not viable. There are process controls to ensure the product is never exposed to any of the above conditions. As would be expected of degraded samples, there are large proportions of cells with lower circularity, and low area and higher perimeters. This shows cells that are no longer able to maintain viability and are therefore changing morphology and shrinking with shriveled edges. The forced degradation conditions of the study appear in Table 7.
To show the similarity within a group, analysis of the variability for each bin was examined. Group 4 regularly exhibited the highest standard deviation within a bin across the samples in it. This is possibly due to the fewer samples within that group, but it is also not unexpected for tissue as it degrades to be less consistent (see
Abbreviations:
AE1/AE3: A cocktail of 2 antibodies that react with all cytokeratins to identify epithelial cells.
CD3: An antibody that reacts with T cells, including thymocytes. CK14: An antibody that detects only cytokeratin 14, a component of the cytoskeleton in a subset of epithelial cells hypothesized to have repopulating potential.
FFPE: Formalin-fixed paraffin embedded tissue sections.
H&E: Hematoxylin and eosin, a histologic stain that is most commonly used for light microscopy of mammalian tissue sections.
Ki-67: The Ki-67 antibody recognizes the Ki-67 antigen, a protein associated with cellular proliferation.
TEC: Thymic epithelial cells.
References discussed in the application, which are incorporated by reference in their entirety, for their intended purpose, which is clear based upon its context.
All patents, patent applications and publications cited herein are hereby incorporated by reference in their entirety. The disclosures of these publications in their entireties are hereby incorporated by reference into this application.
The disclosures of each and every patent, patent application, publication, and accession number cited herein are hereby incorporated herein by reference in their entirety.
It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
The foregoing embodiments and advantages are merely exemplary and are not to be construed as limiting the present invention. The present teachings can be readily applied to other types of methods, systems and classifiers, experiments and surgical procedures. Also, the description of the embodiments of the present invention is intended to be illustrative and not to limit the scope of the claims. Many alternatives, modifications, and variations will be apparent to those skilled in the art
While present disclosure has been disclosed with reference to various embodiments, it is apparent that other embodiments and variations of these may be devised by others skilled in the art without departing from the true spirit and scope of the disclosure. The appended claims are intended to be construed to include all such embodiments and equivalent variations.
The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the embodiments. The foregoing description and Examples detail certain embodiments and describes the best mode contemplated by the inventors. It will be appreciated, however, that no matter how detailed the foregoing may appear in text, the embodiment may be practiced in many ways and should be construed in accordance with the appended claims and any equivalents thereof.
REFERENCES
- Markert M L, Devlin B H, McCarthy E A, 2010, “Thymus transplantation,” Clin Immunol., 135(2): 236-46.
- Markert M L. et al., 2004, “Postnatal thymus transplantation with immunosuppression as treatment for DiGeorge syndrome,” Blood 104(8):2574-2581.
- Markert M L, et al., 1999, “Transplantation of thymus tissue in complete DiGeorge syndrome,” N Engl J Med 341(16): 1180-1189 27). Markert M L, et al., 2008, “Use of allograft biopsies to assess thymopoiesis after thymus transplantation,” J Immunol 180(9):6354-6364.
- Markert M L, et al., 2007, “Review of 54 patients with complete DiGeorge anomaly enrolled in protocols for thymus transplantation: outcome of 44 consecutive transplants,” Blood 109(10):4539-454728).
- Chinn I K, Devlin B H, Li Y J, & Markert M L, 2008, “Long-term tolerance to allogeneic thymus transplants in complete DiGeorge anomaly,” Clin Immunol 126(3):277-281).
- Markert M L, 2014, Thymus Transplantation. Stiehm's Immune Deficiences, eds Sullivan K E & Stiehm E R (Academic Press), 1st Ed, pp 1059-1067.
Claims
1. A method of training a tissue classifier for performing quantitative histopathological assessment, comprising:
- converting a slide image into a binary slide image, wherein the slide image is selected from a library of control images, wherein each slide image in the library of control images is associated with a pass classification or a fail classification;
- detecting one or more nuclei within the binary slide image;
- for each detected nucleus, extracting a feature from the detected nucleus, wherein the feature represents a property of the detected nucleus within the binary slide image and comprises at least one of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, or a circularity of the detected nucleus;
- for each detected nucleus, generating, based on the feature, a feature fingerprint associated with the binary slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
- incorporating the binary slide image into a cluster wherein the cluster comprises a plurality of images, wherein each of the first plurality of images is associated with a corresponding feature fingerprint, and wherein the incorporating is based on comparing the feature fingerprint with the corresponding feature fingerprint;
- applying a cutoff height to the cluster to form a plurality of groups, wherein the cutoff height minimizes a number of groups within the plurality of groups based on multivariate analysis of variance analysis of the cluster;
- categorizing a first group within the plurality of groups as a positive control set if the first group comprises first slide images associated with the pass classification; and
- categorizing a second group within the plurality of groups as a negative control set if the second group comprises second slide images associated with the fail classification.
2. The method according to claim 1, wherein the tissue classifier is capable of determining potency or the quality of the tissue for transplantation into a subject.
3. The method according to claim 2, wherein the tissue is thymus tissue.
4. The method according to claim 3, wherein the tissue is allogeneic cultured postnatal thymus tissue-derived product slices for implantation into a human subject.
5. The method according to claim 4, wherein the human subject is suffering from complete DiGeorge syndrome associated with 22q11.2 deletion; coloboma, heart defect, choanal atresia, growth or mental retardation, genital hypoplasia and ear anomalies or deafness syndrome (CHARGE), or athymia associated with forkhead box protein N1 (FOXN1) deficiency.
6. The method according to claim 2, wherein the tissue classifier determines potency or quality of a tissue for transplantation into a subject of an unknown tissue for transplantation by generating feature fingerprints of detected nuclei within slide images in a control library and clustering the slide images based on their corresponding feature fingerprints.
7. The method according to claim 6, wherein the unknown tissue is thymus tissue.
8. The method according to claim 7, where the thymus tissue has been subjected to a culturing process in a thymus organ medium for a period of time to partially deplete the thymus tissue of thymocytes.
9. The method according to claim 8, wherein the period of time is up to 21 days.
10. The method according to claim 8, wherein the period of time is from about 12 to about 21 days.
11. The method according to claim 8, wherein the period of time is from about 5 to about 9 days.
12. The method according to claim 8, wherein the culturing process preserves the functional architecture of the thymic stroma.
13. The method according to claim 12, wherein the thymic stroma comprises thymic epithelial cells and fibroblasts.
14. The method according to claim 1, wherein the tissue is selected from the group consisting of vascular tissue, skin tissue, hepatic tissue, pancreatic tissue, neural tissue, urogenital tissue, gastrointestinal tissue, skeletal tissue including bone and cartilage, adipose tissue, connective tissue including tendons and ligaments, amniotic tissue, chorionic tissue, dura, pericardia, muscle tissue, glandular tissue, facial tissue, ophthalmic tissue.
15. The method according to claim 1, wherein the feature fingerprint is generated from measurements comprising numerical values of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, and a circularity of the detected nucleus.
16. A method for performing quantitative histopathological assessment of an unclassified slide image of a tissue, comprising:
- detecting a hematoxylin channel from the unclassified slide image, wherein the hematoxylin channel is associated with a cellular nucleus within the unclassified slide image of a tissue;
- extracting a feature from the detected hematoxylin channel, wherein the feature represents a property of the nucleus within the binary slide image and comprises at least one of an area of the nucleus, a perimeter of the nucleus, an integrated density of the nucleus, or a circularity of the nucleus;
- generating, based on the feature, a feature fingerprint associated with the unclassified slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
- co-clustering the feature fingerprint with a first set of fingerprints associated with one or more positive control sets and a second set of fingerprints associated with a negative control set, wherein the positive control set(s) comprises a first set of slide images associated with a pass classification and the negative control set comprises a second set of slide images associated with a fail classification; and
- determining, based on the co-clustering, if the feature fingerprint is associated with the pass classification or the fail classification.
17. The method according to claim 16, wherein the feature fingerprint is capable of determining potency or the quality of the tissue for transplantation into a subject.
18. The method according to claim 17, wherein the tissue is thymus tissue.
19. The method according to claim 18, wherein the tissue is allogeneic cultured postnatal thymus tissue-derived product slices for implantation into a human subject.
20. The method according to claim 19, wherein the human subject is suffering from complete DiGeorge syndrome associated with 22q11.2 deletion; coloboma, heart defect, choanal atresia, growth or mental retardation, genital hypoplasia and ear anomalies or deafness syndrome (CHARGE), or athymia associated with forkhead box protein N1 (FOXN1) deficiency.
21. The method according to claim 17, wherein the feature fingerprint determines potency or quality of the tissue for transplantation into a subject of an unknown tissue by generating feature fingerprints of detected nuclei within slide images in a control library and clustering the slide images based on their corresponding feature fingerprints.
22. The method according to claim 21, wherein the unknown tissue is thymus tissue.
23. The method according to claim 22, where the thymus tissue has been subjected to a culturing process in a thymus organ medium for a period of time to partially deplete the thymus tissue of thymocytes.
24. The method according to claim 23, wherein the period of time is up to 21 days.
25. The method according to claim 23, wherein the period of time is from about 12 to about 21 days.
26. The method according to claim 23, wherein the period of time is from about 5 to about 9 days.
27. The method according to claim 23, wherein the culturing process preserves the functional architecture of the thymic stroma.
28. The method according to claim 27, wherein the thymic stroma comprises thymic epithelial cells and fibroblasts.
29. The method according to claim 16, wherein the tissue is selected from the group consisting of vascular tissue, skin tissue, hepatic tissue, pancreatic tissue, neural tissue, urogenital tissue, gastrointestinal tissue, skeletal tissue including bone and cartilage, adipose tissue, connective tissue including tendons and ligaments, amniotic tissue, chorionic tissue, dura, pericardia, muscle tissue, glandular tissue, facial tissue, ophthalmic tissue.
30. The method according to claim 17, wherein the feature fingerprint is generated from measurements comprising numerical values of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, and a circularity of the detected nucleus.
31. A system for classifying objects within digital images of tissue, comprising:
- means for converting a slide image into a binary slide image, wherein the slide image is selected from a library of control images, wherein each slide image in the library of control images is associated with a pass classification or a fail classification;
- means for detecting a nucleus within the binary slide image;
- means for extracting a feature from the detected nucleus, wherein the feature represents a property of the detected nucleus within the binary slide image and comprises at least one of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, or a circularity of the detected nucleus;
- means for generating, based on the feature, a feature fingerprint associated with the binary slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
- means for incorporating the binary slide image into a cluster wherein the cluster comprises a plurality of images, wherein each of the first plurality of images is associated with a corresponding feature fingerprint, and wherein the incorporating is based on comparing the feature fingerprint with the corresponding feature fingerprint;
- means for applying a cutoff height to the cluster to form a plurality of groups, wherein the cutoff height minimizes a number of groups within the plurality of groups based on multivariate analysis of variance analysis of the cluster;
- means for categorizing a first group within the plurality of groups as a positive control set if the first group comprises first slide images associated with the pass classification; and
- means for categorizing a second group within the plurality of groups as a negative control set if the second group comprises second slide images associated with the fail classification.
32. The system according to claim 31, wherein the system is capable of determining potency or the quality of a tissue for transplantation into a subject.
33. The system according to claim 32, wherein the tissue is thymus tissue.
34. The system according to claim 33, wherein the thymus tissue is allogeneic cultured postnatal thymus tissue-derived product slices for implantation into a human subject.
35. The system according to claim 34, wherein the human subject is suffering from complete DiGeorge syndrome associated with 22q11.2 deletion; coloboma, heart defect, choanal atresia, growth or mental retardation, genital hypoplasia and ear anomalies or deafness syndrome (CHARGE), or athymia associated with forkhead box protein N1 (FOXN1) deficiency.
36. The system according to claim 31, wherein the system classifies potency or quality of a tissue for transplantation into a subject of an unknown tissue by generating feature fingerprints of detected nuclei within slide images in a control library and clustering the slide images based on their corresponding feature fingerprints.
37. The system according to claim 36, wherein the unknown tissue is thymus tissue.
38. The system according to claim 37, where the thymus tissue has been subjected to a culturing process in a thymus organ medium for a period of time to partially deplete the thymus tissue of thymocytes.
39. The system according to claim 38, wherein the period of time is up to 21 days.
40. The system according to claim 38, wherein the period of time is from about 12 to about 21 days.
41. The system according to claim 38, wherein the period of time is from about 5 to about 9 days.
42. The system according to claim 38, wherein the culturing process preserves the functional architecture of the thymic stroma.
43. The system according to claim 42, wherein the thymic stroma comprises thymic epithelial cells and fibroblasts.
44. The system according to claim 31, wherein the tissue is selected from the group consisting of vascular tissue, skin tissue, hepatic tissue, pancreatic tissue, neural tissue, urogenital tissue, gastrointestinal tissue, skeletal tissue including bone and cartilage, adipose tissue, connective tissue including tendons and ligaments, amniotic tissue, chorionic tissue, dura, pericardia, muscle tissue, glandular tissue, facial tissue, ophthalmic tissue.
45. The system according to claim 36, wherein the feature fingerprint is generated from measurements comprising numerical values of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, and a circularity of the detected nucleus.
46. A classifier, comprising:
- means for detecting a hematoxylin channel from the unclassified slide image, wherein the hematoxylin channel is associated with a cellular nucleus within the unclassified slide image of a tissue;
- means for extracting a feature from the detected hematoxylin channel, wherein the feature represents a property of the nucleus within the binary slide image and comprises at least one of an area of the nucleus, a perimeter of the nucleus, an integrated density of the nucleus, or a circularity of the nucleus;
- means for generating, based on the feature, a feature fingerprint associated with the unclassified slide image, wherein the feature fingerprint is a numerical value calculated from processing the feature;
- means for co-clustering the feature fingerprint with a first set of fingerprints associated with a positive control set and a second set of fingerprints associated with a negative control set, wherein the positive control set comprises a first set of slide images associated with a pass classification and the negative control set comprises a second set of slide images associated with a fail classification; and
- means for determining, based on the co-clustering, if the feature fingerprint is associated with the pass classification or the fail classification.
47. The classifier according to claim 46, wherein the system is capable of determining potency or the quality of a tissue for transplantation into a subject.
48. The classifier according to claim 47, wherein the tissue is thymus tissue.
49. The classifier according to claim 48, wherein the thymus tissue is allogeneic cultured postnatal thymus tissue-derived product slices for implantation into a human subject.
50. The classifier according to claim 49, wherein the human subject is suffering from complete DiGeorge syndrome associated with 22q11.2 deletion; coloboma, heart defect, choanal atresia, growth or mental retardation, genital hypoplasia and ear anomalies or deafness syndrome (CHARGE), or athymia associated with forkhead box protein N1 (FOXN1) deficiency.
51. The classifier according to claim 46, wherein the system classifies potency of an unknown tissue by generating feature fingerprints of detected nuclei within slide images in a control library and clustering the slide images based on their corresponding feature fingerprints.
52. The classifier according to claim 51, wherein the unknown tissue is thymus tissue.
53. The classifier according to claim 52, where the thymus tissue has been subjected to a culturing process in a thymus organ medium for a period of time to partially deplete the thymus tissue of thymocytes.
54. The classifier according to claim 53, wherein the period of time is up to 21 days.
55. The classifier according to claim 53, wherein the period of time is from about 12 to about 21 days.
56. The classifier according to claim 53, wherein the period of time is from about 5 to about 9 days.
57. The classifier according to claim 53, wherein the culturing process preserves the functional architecture of the thymic stroma.
58. The classifier according to claim 57, wherein the thymic stroma comprises thymic epithelial cells and fibroblasts.
59. The classifier according to claim 46, wherein the tissue is selected from the group consisting of vascular tissue, skin tissue, hepatic tissue, pancreatic tissue, neural tissue, urogenital tissue, gastrointestinal tissue, skeletal tissue including bone and cartilage, adipose tissue, connective tissue including tendons and ligaments, amniotic tissue, chorionic tissue, dura, pericardia, muscle tissue, glandular tissue, facial tissue, ophthalmic tissue.
60. The classifier according to claim 46, wherein the feature fingerprint is generated from measurements comprising numerical values of an area of the detected nucleus, a perimeter of the detected nucleus, an integrated density of the detected nucleus, and a circularity of the detected nucleus.
Type: Application
Filed: Jul 2, 2019
Publication Date: Sep 10, 2020
Applicant: Enzyvant Therapeutics, Inc. (Cambridge, MA)
Inventors: Alex TRACY (Apex, NC), Kristin MARKS (Durham, NC), Michael Thomas JOHNSON (Bridgewater, NJ), Thomas Stephen VILLANI (Belle Mead, NJ)
Application Number: 16/460,288