METHOD FOR STRATIFYING AND SELECTING CANDIDATES FOR RECEIVING A SPECIFIC THERAPEUTIC APPROACH
The disclosure concerns a method for patient stratification and selection of patients who are candidates for a specific therapy is described which is based on quantifying one or more digital image analysis feature distributions from stained tissue. The method extends beyond the abilities of a manual observer and a microscope, and generally comprises: acquiring digital images of stained tissue sections from patients submitted for evaluation, applying an algorithm process to said images with a computer to extract the morphometric and staining features of image pixels and tissue objects, deriving one or more distribution function for one or more image analysis features, calculating a summary statistic of the one or more distribution functions, and using said summary statistic along with an associated predefined patient stratification paradigm to separate a patient cohort into distinct strata which correspond to a decision to include or exclude a patient for a specific therapy.
Latest Flagship Biosciences, Inc. Patents:
- Method for digitally generating scores for multiple diagnostic tests from tissue assayed with a single test
- Method for aligning digital images of tissue sections using tissue object correspondences
- Methods for Preparing Data from Tissue Sections for Machine Learning Using Both Brightfield and Fluorescent Imaging
- Method for Digitally Generating Scores for Multiple Diagnostic Tests from Tissue Assayed with a Single Test
- Method for aligning digital images of tissue sections using tissue object correspondences
This application is a continuation of application Ser. No. 14/986,512 filed Dec. 31, 2015, which is a continuation in part of application Ser. No. 14/189,864, filed Feb. 25, 2014; the contents of each are hereby incorporated by reference.
BACKGROUND Field of the InventionThis application relates to methods for evaluating tissue image analysis feature distribution functions with the intent to stratify patient cohorts into two or more distinct categories of interest. More specifically, the method utilizes digital tissue image analysis to extract staining and morphometric features from images of stained tissue sections, quantifies the distribution of one or more image analysis feature, and applies a patient selection paradigm based on a comparison of the patient-specific image analysis feature distribution to a reference distribution or value to identify patients as candidates for a specific therapy.
Description of the Related ArtThe majority of current in vitro diagnostic assays, laboratory developed tests, and research use only assays are based on measuring the staining intensity of biomarkers of interest, such as HER2. The biomarker is visualized by using antibodies, histologic dyes, or in situ hybridization probes to detect the biomarker of interest and detection reagents such as chromogenic and fluorescent stains or dyes. Typically, evaluation of the staining, as a surrogate for the biomarker, is assessed manually by a pathologist. In some instances, digital image analysis algorithms which are configured to mimic manual scoring paradigms can also be used to evaluate biomarker expression in tissue. In general, these approaches often condense, in an over simplified manner, the sometimes heterogeneous distribution of staining intensities into a single summary score for a tissue.
As an illustrative example, the H-Score paradigm consists of a pathologist assessing two different conditions in a given tissue to assign a score. The pathologist categorizes staining on a semi-quantitative scale of 0, 1, 2, and 3+ (negative, low, medium, and high expression, respectively). In addition, the pathologist must assign the percentage of tissue or cells within the tissue that falls into each category, with all four categories adding to 100%. Finally, an H-score is calculated by multiplying the score category (i.e. 0, 1, 2, 3+) by the percentage of tissue (i.e. 0-100%) in that category and results in a score ranging from 0 to 300.
Interestingly, with the H-Score paradigm, two different tissues with unique staining distributions (e.g. one tissue with 100% of cells staining in the 1+ category and a second tissue with 50% of cells staining in the 0 and the other 50% of cells staining in the 2+ categories) can have the same H-score (e.g. an H-Score of 100 for both tissues), and illustrates the conundrum with present scoring paradigms which overly simplify complex staining information into a single summary score.
Alternatively, a tissue can be scored, as a whole, on a 0, 1, 2, and 3+ scale. This scoring approach assigns a score to a tissue based on the maximum staining level observed in a specified percentage of the tissue (e.g. strong staining in >15% of tumor cells scores a tissue sample as 3+). In this scoring paradigm, only the staining intensity of a subset of cells or tissue area is considered, and the information contained in the remainder of the tissue is discarded in the summary score.
While these scoring approaches have demonstrated utility for evaluating biomarkers, there are instances where the scoring paradigms are insufficient to capture the necessary granularity of biomarker expression to guide insights or patient selection decisions. Digital tissue image analysis has evolved into a powerful tool for extracting a great deal of information from stained tissue. Digital image analysis can quantify many features, both morphometric and staining, related to biomarker and tissue presentation for use in developing novel scoring paradigms.
SUMMARYThe disclosure concerns a novel tissue scoring methodology which utilizes digital image analysis of tissue to assess the full distribution of a morphometric and staining features in a way that categorizes samples based on summary values which better capture the nuanced nature of image analysis feature distributions.
In accordance with the embodiments herein, a novel method for patient stratification and selection of patients who are candidates for a specific therapy is described which is based on quantifying one or more digital image analysis feature distributions from stained tissue. The method extends beyond the abilities of a manual observer and a microscope, and generally comprises: acquiring digital images of stained tissue sections from patients submitted for evaluation, applying an algorithm process to said images with a computer to extract the morphometric and staining features of image pixels and tissue objects, deriving one or more distribution function for one or more image analysis features, calculating a summary statistic of the one or more distribution functions, and using said summary statistic along with an associated predefined patient stratification paradigm to separate a patient cohort into distinct strata which correspond to a decision to include or exclude a patient for a specific therapy.
In the following description, for purposes of explanation and not limitation, details and descriptions are set forth in order to provide a thorough understanding of the present invention. However, it will be apparent to those skilled in the art that the present invention may be practiced in other embodiments that depart from these details and descriptions without departing from the spirit and scope of the invention.
In one embodiment, the method may be comprised of nine steps: 1) obtaining digital images of stained tissue sections; 2) extracting staining and morphometric features of cells or tissue structures within the image using a digital image analysis algorithm implemented by a computer; 3) storing image analysis feature values in computer memory or a database for future recall and processing; 4) calculating the probability density function or cumulative density function for one or more image analysis features; 5) deriving a summary score for the probability density function and cumulative density function; 6) evaluating the summary score(s) relative to reference value(s); 7) applying patient selection criteria to the evaluation of summary and reference values; 8) selecting or excluding patients for a particular therapy based on said patient selection criteria.
The following method described in this invention is utilized to evaluate one or more patient tissue samples to determine whether or not said patient or patients are candidates for a specified therapy. To perform the following invention, digital images of stained tissue sections from the one or more patients are obtained.
Patient tissue samples for evaluation are generated using standard histologic processes to produce tissue sections mounted on glass histology slides. Each tissue section for a patient is stained to highlight expression of one or more biomarkers by one or more of: standard immunohistochemistry (IHC), in situ hybridization (ISH), histological stains (i.e. H&E, trichrome, etc.) and immunofluorescent (IF) methods to generate tissue stained in a manner to evaluate said biomarker. One or more biomarker may be stained for in each tissue section (i.e. mono- and multiplexed assay formats) or on multiple sections from a patient's tissue sample (e.g. one biomarker per serial section for a single patient).
Stained tissues mounted on histology slides are digitized using standard practices (i.e. digital slide scanning, imaging with a digital camera mounted on a microscope, etc.) for chromogenic and/or fluorescent stains. The digital images of each patient tissue sample are stored in computer memory or in a database for future recall and analysis. The images can be bright-field, dark-field, bright- or dark-field equivalent images, or a combination image of bright- and dark-field images.
The one or more patient sample cohort is submitted for evaluation by the following invention once digital images are available.
In the preferred embodiment of this invention, a digital tissue image analysis algorithm implemented by a computer is applied to each image to extract the morphometric and staining features pertaining to staining presentation in each image. Image analysis features can be extracted in pixel-based (e.g. per-pixel staining intensity for biomarker staining color) or object-based (e.g. staining intensity for a biomarker staining color within identified cells) manner.
Morphometric features pertain to the size, shape, and texture of tissue stains within objects (i.e. cells, cell membranes, blood vessels, tumor epithelium regions, etc.) observed in a digital image. For example and not limitation, morphometric features can be the area of a cell nucleus, the completeness of biomarker staining in a cell membrane, the diameter of a cell nucleus, the roundness of a blood vessel etc. Staining features pertain to the pixel intensities of specified IHC, ISH, and IF stains or dyes. Staining features can be evaluated relative to objects (e.g. average staining intensity in each cell in an image) or relative to individual pixels across an image (e.g. average staining intensity of pixels in a sample).
The image analysis algorithm implemented by a computer captures morphometric and staining features for each pixel and/or object within an image and stores said values for further analysis in computer memory or to a database.
For example and not limitation,
In the preferred embodiment of this invention, the probability density function (PDF) or cumulative distribution function (CDF) is calculated using standard approaches for one or more selected image analysis feature. Capture of the PDF or CDF for an image analysis feature goes far beyond current tissue scoring paradigms by retaining the complete information of staining or morphometric feature distribution for pixels or objects identified within an image of a stained tissue section. Furthermore, summary statistics of either the PDF or CDF's for an image analysis feature retain greater information about the image analysis feature than existing paradigms.
In an illustrative example,
The CDF and PDF functions quantify the entire distribution of an image analysis feature (e.g. biomarker staining intensity from 0% to 100% of the possible dynamic range of the feature) in a standardized way that allows comparisons between distributions. Once the CDF and PDF are derived for an image analysis feature in each patient sample, summary statistics can be determined for either distribution. Similarly, a sub-range (i.e. biomarker staining intensity ranging from 0% to 50% of maximum, biomarker staining intensity ranging from 20% to 60% of the possible dynamic range, etc.) of the CDF and PDF can be evaluated and summary statistics can be calculated for the sub-range only.
PDF summary statistics are histogram statistics and can be one or more of: mean, median, mode, minimum, maximum, standard deviation of the mean, standard error of the mean, skewness, density at a specified point (e.g. 50% biomarker staining), full-width at half maximum, number of resolvable peaks, and kurtosis values. CDF summary statistics can be one or more of: the area under the curve (AUC), the slope of the curve, the slope over a defined range of the curve, the difference in image analysis parameter values between two defined cumulative distribution values, the difference in cumulative distribution values between two image analysis parameter values, the point of maximum deviation from a reference curve, and the sum of residuals or squared residuals (deviations) from a reference curve.
Summary statistics for both the PDF and CDF can be calculated for the entire range (e.g. feature value of 0 to the maximum possible dynamic range value) of a feature or for a defined sub-range (e.g. 20% to 60% of the possible dynamic range maximum value). In the instance where a sub-range is assessed, the AUC can be determined for a PDF as a summary statistic.
In an embodiment of the present invention, survival curve statistics can be utilized to determine a summary score for the distribution of an image analysis feature for a patient cohort. In this embodiment, the CDF is transformed (e.g. inverted) to generate a survival curve-like representation of the image analysis feature distribution. Once transformed, survival curve evaluation statistics can be derived which summarize the nature of the distribution or compare a patient-specific distribution to a reference curve.
In the preferred embodiment of the present invention, the summary statistic(s) derived from the PDF or CDF for one or more image analysis features are utilized to stratify patients into two or more groups for selection/exclusion as candidates for a specified therapy. The patient selection criteria will be pre-defined criteria which can be applied to a patient cohort to stratify patients into one or more groups selected to receive a therapy and one or more groups which are excluded from receiving a therapy.
In one embodiment of this invention, image analysis evaluation of one biomarker is used to determine a summary score for patient selection. In another embodiment of this invention, image analysis is utilized to evaluate multiple biomarkers to derive a summary score for use in patient selection.
The patient selection criteria paradigm can rely on one or more image analysis feature and summary statistic of said feature distributions within samples for a patient cohort. The patient selection criteria paradigm can rely on summary statistics information for image analysis features from a single biomarker or multiple biomarkers.
Once the patient selection criteria paradigm is identified, the method described herein is applied to a patient cohort submitted for evaluation. In the preferred embodiment of this invention, the extracted image analysis features for the patient cohort which are relevant to the defined patient selection paradigm are collected and one or more of the: PDF, CDF, and inverted CDF is derived. Statistical analysis is used to derive one or more relevant summary statistic for the image analysis feature distribution for each patient and pre-defined patient selection criteria are applied to stratify the patient cohort into distinct strata. The patient selection criteria will stratify patients into two or more groups which have previously been determined to correspond to a treatment approach.
Claims
1. A method comprising:
- obtaining at least one digital image of a stained tissue section;
- extracting data associated with at least one staining feature and at least one morphometric feature;
- calculating a distribution function which captures the distribution of data for at least one of the staining and morphometric features;
- deriving a summary score for a patient from the distribution function;
- evaluating the summary score relative to at least one reference value and
- selecting the patient as a candidate for a specified therapy based on the summary score.
2. The method of claim 1, wherein the tissue is stained by a method selected from the group consisting of immunohistochemistry, histological dyes, immunofluorescence, and in situ hybridization.
3. The method of claim 1, wherein the at least one morphometric feature is of at least one tissue object and are selected from the group consisting of size, shape, area, texture, organization, and organizational relationship, wherein the at least one tissue object is selected from the group consisting of cells, cell sub-compartments, cell neighborhood, and tissue compartment.
4. The method of claim 1, wherein the at least one staining feature is selected from the group consisting of stain appearance, stain intensity, stain completeness, stain shape, stain texture, and stain area.
5. The method of claim 1, wherein the distribution function is selected from the group consisting of a histogram distribution, a probability distribution function, a cumulative distribution function, and an inverted cumulative distribution function.
6. The method of claim 5, wherein the summary score calculated from the histogram distribution or probability distribution function is selected from the group consisting of minimum, maximum, mean, median, mode, skewness, kurtosis, full-width at half maximum, number of resolvable peaks, area under the curve, and normalness value.
7. The method of claim 5, wherein the summary score calculated from the cumulative density function is selected from the group consisting of an area under the curve (AUC), slope of the curve, slope over a defined range of the curve, difference in image analysis parameter values between two defined cumulative distribution values, difference in cumulative distribution values between two image analysis parameter values, point of maximum deviation from a reference curve, and sum of residuals or squared residuals from a reference curve.
8. The method of claim 5, wherein the summary score calculated from the histogram, probability density, and cumulative distributions can be derived from the group consisting of a measured dynamic range of image analysis feature values, possible dynamic range of an image analysis feature, and a sub-range of image analysis feature values.
9. The method of claim 5, wherein the summary score calculated from the inverted cumulative density function includes one or more Kaplan-Meier statistics values.
10. The method of claim 1, wherein the summary score is derived from the group consisting of a single distribution, multiple distributions describing a single biomarker, or multiple distributions describing multiple biomarkers.
11. The method of claim 1, further comprising applying at least one patient stratification criterium to the summary score to then determine the candidacy of the patient for the specified therapy.
12. The method of claim 11, wherein the patient is determined to be a member of a specific stratum based on the at least one patient stratification criterium.
13. The method of claim 11, wherein the at least one patient stratification criterium defines at least two patient strata based.
14. The method of claim 13, wherein at least one patient stratum is linked to positive response to the specified therapy.
Type: Application
Filed: Sep 5, 2017
Publication Date: Dec 21, 2017
Applicant: Flagship Biosciences, Inc. (Westminster, CO)
Inventors: Joshua Black (Denver, CO), Carsten Schnatwinkel (Boulder, CO), Kristin Wilson (Denver, CO), Nathan T. Martin (Boulder, CO), Joseph Krueger (Andover, MA), Holger Lange (Enger)
Application Number: 15/696,159