METHODS FOR PREDICTING PROSTATE CANCER AND USES THEREOF

- miR Scientific, LLC

The present invention relates to compositions and methods for diagnosing, prognosing, monitoring, and treating a patient with prostate cancer. In particular, the invention relates to the use of miRNA and snoRNA as expression signatures for identifying a clinically significant prostate cancer.

Skip to: Description  ·  Claims  · Patent History  ·  Patent History
Description
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Divisional of U.S. application Ser. No. 16/819,070, filed on Mar. 14, 2020, now allowed; that was filed concurrently with the PCT International Application, PCT/US2020/022862; both applications claim benefit to U.S. Provisional Application No. 62/978,184, filed Feb. 18, 2020, and U.S. Provisional Application No. 62/819,325, filed Mar. 15, 2019, which are incorporated by reference herein in their entirety.

SEQUENCE LISTING

A Sequence Listing conforming to the rules of WIPO Standard ST.26 is hereby incorporated by reference. Said Sequence Listing has been filed as an electronic document via Patent Center in ASCII format encoded as XML. The electronic document, created on Apr. 20, 2023, is entitled “P-586001-US1_ST26”, and is 775,985 bytes in size.

FIELD OF THE INVENTION

The present invention relates to compositions and methods for diagnosing, prognosing, monitoring, and treating a patient with prostate cancer. In particular, the invention relates to the use of small non-coding RNAs (sncRNAs) such as miRNA and snoRNA as expression signatures for identifying a clinically significant prostate cancer.

BACKGROUND

The current method of screening for prostate cancer includes a digital rectal examination followed by a prostate-specific antigen (PSA) test. The former is invasive and the latter requires drawing of a blood sample from the subject.

Patients with a suspicious DRE and/or an elevated PSA level are subjected to a systematic 12-needle core biopsy or Magnetic Resonance Imaging (MRI)-guided targeted needle biopsy. This standard diagnosis strategy is invasive, imprecise and associated with significant costly morbidities, most notably bacterial infections.

The PSA test has significant drawbacks. In addition to indicating prostate cancer, elevated PSA levels may also indicate urinary tract infection or prostatitis (an inflammation or the prostate or benign prostatic hyperplasia or BPH). The test overdiagnoses prostate cancer, and many men are unnecessarily subjected to core needle biopsies. The prostate tissue collected during the biopsy is then examined by a pathologist and assigned a Gleason score that assesses the grade of the disease. The Gleason score is the sum of two numbers: (1) a primary grade assigned by the pathologist based on the pathologist's determination of the grade of the tumor in the most common pathology (2) a secondary grade based on the determination of the grade of the tumor in the next most prominent pathology. For each area, a score of one to five is assigned based on how aggressive the tumor appears and the two numbers are added together to provide the final Gleason Score. A tumor with cells that appear close to normal is assigned a low Gleason score (six or below, reported as Gleason 3+3) whereas a tumor with cells that appear clearly different from those of a normal prostate is assigned a higher Gleason score (seven or above). Low-grade tumors based on low Gleason scores are less likely to be aggressive; whereas tumors with high Gleason score are more likely to be aggressive and metastasize. There are aspects of the Gleason Scoring System that have been problematic since they were implemented—most notably the fact that tumors that are Gleason 3+4 and Gleason 4+3 are both reported as Gleason 7, even though the clinical outcomes of these groups are clearly different. A recent refinement of the Gleason Score, referred to as Grade Grouping, has been adopted to eliminate this issue (Grade Group 1 (GG1) encompasses Gleason 3+3; GG2—Gleason 3+4; GG3—Gleason 4+3; GG4—Gleason 4+4 and GG5—Gleason 5+4 or higher. This change to the scoring system has simplified the reporting of the histopathology of prostate cancer and has eliminated the ambiguity associated with “Gleason 7” tumors, making classifying outcome more straight forward.

Approximately 50-70% of patients recommended for core needle biopsy on the basis of “elevated” PSA (>3 ng/mL) have negative biopsies, while 14% of men with PSA<3 ng/mL have prostate cancer but are not routinely biopsied because of their low PSA levels. The combination of PSA screening and core needle biopsy is both invasive and has poor performance characteristics, which leaves physicians and patients with no reliable measures on which to base their choices of treatment options. The result is that many men needlessly opt for clinical intervention, very often prostatectomy. It also hinders the development of new prognostic tools since the Gleason Score “gold standard” is not itself a reliable indicator of prostate tumor progression.

This problem has been recognized for at least 30 years. It remains a major issue today. The intervening years have seen many attempts to develop prognostic markers for aggressive disease, including ploidy, nuclear morphology and nuclear matrix architecture, microarray-based transcriptome analyses, DNA methylation status, and detection of gene fusions such as the TMPRSS2:ETS family fusions. None of these methodologies have proved to be significantly better than Gleason Scores as indicators of prostate tumor progression. Furthermore, they do not identify the cancer stage or grade adequately.

Tests have been developed that are designed to distinguish cancer states using mRNA expression profiles. However, each demonstrates significant shortcomings. First, with only a few exceptions, these assays used tumor material derived from radical prostatectomy specimens, and therefore are at best predictive of early tumor recurrence. While potentially useful for making post-surgical clinical decisions related to continuing clinical decisions, they do not help distinguish prostate cancer grades prior to surgery. Secondly, a number of these genomic approaches have focused on specific pathways that have been implicated in prostate cancer progression, including the androgen receptor (AR) modulated gene expression, epithelial-stromal interactions, and cell cycle. These assays assume that all prostate tumors progress along a common pathway. Other commercially available biomarker assays use mRNA expression profiles generated by real-time PCR of a small subset of genes.

To date there has been very few genome wide transcriptome studies of sncRNAs in prostate cancer. One study compared the miRNA and snoRNA signatures in (i) freshly frozen radical prostatectomy samples and (ii) adjacent normal tissue from the same patient using Illumina/Solexa deep sequencing and microarray analysis on the Affymetrix miRNA® v.2 microarrays that contains 723 human miRNAs catalogued in Sanger miRBase v.10.1. (Wellcome Sanger Institute). This study provides a valuable data set for comparing the complement of sncRNAs expressed in prostate cancer and peritumoral benign tissue but is not useful for the rational design of a panel of sncRNAs that is prognostic and/or predictive for tumor progression prior to clinical intervention. It is also handicapped as a general screening technology, since the technique requires micro-dissected flash frozen material that is only available after surgery, and therefore cannot be used for diagnosis.

Accordingly, an improved method of predicting screening and classifying prostate cancer is needed. The present disclosure relates to a non-invasive (by eliminating or reducing the unnecessary core needle biopsy) method for screening the presence or absence of prostate cancer that is highly sensitive and specific.

The method also provides a platform for disease management that is useful for the diagnosis, classification, prognosis, and monitoring of the progression and treatment of the disease. The disclosed method is based on the interrogation of a large set of at least 200 small non-coding RNAs (sncRNAs) isolated from urinary exosomes in combination with the Sentinel™ PCa, Sentinel™ CS and Sentinel™ HG tests. The Sentinel™ PCa, Sentinel™ CS and Sentinel™ HG tests are based on algorithmic analyses and comparisons of snRNA sequences catalogued from a large target population having no evidence of prostate cancer (NEPC) or having prostate cancer (GG1-GG5) for the Sentinel™ PCa Test; having low grade cancer (GG1) versus intermediate and high grade cancer (GG2-GG5) for the Sentinel™ CS Test; and having low and favorable intermediate grade cancer (GG1+GG2) versus unfavorable intermediate and high grade (GG3-GG5) cancer for the Sentinel™ HG Test. The three Sentinel™ Tests, that can be performed on a single urine sample, are used to sequentially determine whether a patient has prostate cancer or not, and whether patients with prostate cancer have low- or favorable intermediate-grade disease that can be monitored on active surveillance protocols or high-grade disease that needs immediate treatment.

SUMMARY OF THE DISCLOSURE

In one aspect, the disclosure provides a method of screening a subject for prostate cancer comprising: (i) obtaining a biological sample from the subject, (ii) detecting the aggregate expression profile of a signature collection of small non-coding RNAs (sncRNAs) from the biological sample wherein the collection of sncRNAs comprises SEQ ID NOs: 1-280, (iii) correlating the aggregate expression profile of SEQ ID NOs: 1-280 from the subject by comparing the aggregate expression level of SEQ ID NOs: 1-280 in a training data set from a target population having no evidence of prostate cancer (NEPC) or having prostate cancer; and (iv) classifying the subject as NEPC or has prostate cancer based on the results in (iii). This procedure is embodied in the Sentinel™ PCa test.

In yet another aspect, the disclosure provides a method of determining whether a patient diagnosed as having cancer has low-grade (GG1) or intermediate or high grade disease (GG2-GG5) or not, comprising: (i) obtaining a biological sample from a subject, (ii) detecting the aggregate expression profile of a signature collection of small non-coding RNAs (sncRNAs) from the biological sample wherein the collection of sncRNAs comprises SEQ ID NOs: 281-560, (iii) correlating the aggregate expression profile of SEQ ID NOs: 281-560 from the subject by comparing the aggregate expression profile of SEQ ID NOs: 281-560 in a training data set from a target population known to have low risk, low grade (GG1) or intermediate and high grade, intermediate and high risk prostate cancer (GG2-GG5) and (iv) classifying the subject as GG1 or GG2-GG5 based on the results obtained from (iii). This procedure is embodied in the Sentinel™ CS test.

In yet another aspect, the disclosure provides a method of determining whether a patient diagnosed as having cancer has high-grade (GG3-GG5) or not (low or intermediate grade disease (GG1+GG2), comprising: (i) obtaining a biological sample from a subject, (ii) detecting the aggregate expression profile of a signature collection of small non-coding RNAs (sncRNAs) from the biological sample wherein the collection of sncRNAs comprises SEQ ID NOs: 561-840, (iii) correlating the aggregate expression profile of SEQ ID NOs: 561-840 from the subject by comparing the aggregate expression profile of SEQ ID NOs: 561-840 in a training data set from a target population known to have high grade, high risk prostate cancer (GG3-GG5) or low- or intermediate-risk cancer (GG1+GG2) and (iv) classifying the subject as GG3-GG5 or GG1+GG2 based on the results obtained from (iii). This procedure is embodied in the Sentinel™ HG test.

In yet another aspect, the disclosure provides a method for treating a prostate cancer comprising: (i) obtaining a biological sample from a subject, (ii) detecting the aggregate expression profile of a signature collection of small non-coding RNAs (sncRNAs) from the biological sample wherein the collection of sncRNAs comprises SEQ ID NOs: 281-840, (iii) correlating the aggregate expression profile of SEQ ID NOs: 1-840 from the subject by comparing the aggregate expression profile of SEQ ID NOs: 281-840 in a training data set from a target populations having NEPC, GG1, GG2, GG3, GG4 or GG5 prostate cancer, (iv) classifying the subject as having low-intermediate-grade, prostate cancer (GG1-GG2) or high-grade prostate cancer (GG3-GG5) based on the results obtained from (iii), and (v) treating the subject classified as having high-risk prostate cancer by administering one or more chemotherapeutic agents, hormones, immunotherapeutic, radiation, cryotherapy, surgery or a combination thereof.

In a further aspect, the disclosure provides a method for determining the likelihood of survival, disease recurrence or response to treatment for a subject with prostate cancer comprising: (i) obtaining a biological sample from a patient, (ii) detecting the aggregate expression profile of a signature collection of small non-coding RNAs (sncRNAs) from the biological sample wherein the collection of sncRNAs comprises SEQ ID NOs: 1-840, (iii) comparing the aggregate expression profile of SEQ ID NOs: 1-840 after treatment with that prior to treatment, (iv) correlating the aggregate expression profile of SEQ ID Nos: 1-840 from the subject by comparing the aggregate expression profile of SEQ ID Nos: 1-840 in a training data set from a target populations having no evidence of prostate cancer (NEPC) or having prostate cancer and the aggregate expression profile of SEQ ID Nos: 1-840 in a training data set from a target populations having Grade Group 1, 2, 3 or 4-5; and (v) determining the likelihood of survival, disease recurrence or response to treatment in a subject treated for prostate cancer.

In one aspect, the disclosure provides a method for predicting future prostate cancer in a subject comprising: (i) obtaining a biological sample from a patient, (ii) detecting the aggregate expression profile of a signature collection of small non-coding RNAs (sncRNAs) from the biological sample wherein the collection of sncRNAs comprises SEQ ID NOs: 1-280, (iii) correlating the aggregate expression profile of SEQ ID Nos: 1-280 from the subject by comparing the aggregate expression profile of SEQ ID Nos: 1-280 in a training data set from a target population from a target populations having Grade Group 1, 2, 3 or 4-5; (iv) determining the likelihood of a subject at risk of having Grade Group 2-5 prostate cancer based on the results obtained from (iii), and (iv) treating the subject predicted with a high risk of developing aggressive prostate cancer by administering one or more chemotherapeutic agents, hormones, immunotherapeutic, radiation, cryotherapy, surgery or a combination thereof.

In another aspect, the disclosure provides a system for determining whether a patient has no cancer or has cancer and classifying the subject with cancer as (i) indolent (low grade, GG1), (ii) intermediate or high grade (GG2-GG5), (iii) low/intermediate risk (GG1-GG2) or (iv) aggressive (high grade, GG3-GG5) prostate cancer comprising at least three processors configured to (a) interrogate sncRNA sequences for informative sequences, (b) determine and compare a Sentinel Score to determine if the subject has prostate cancer or no prostate cancer and to classify the prostate cancer stage group.

BRIEF DESCRIPTION OF THE DRAWINGS

The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fees.

FIG. 1: Exosomes are small extracellular, membrane bound vesicles that are formed in the early endosomes and released from cells. Exosomes contain proteins, mRNAs and an array of sncRNAs [miRNAs and C/D box and H/ACA Box small nucleolar RNA (snoRNAs)] that reflect the biology of the cell.

FIG. 2: Using a commercial kit, cell-free urine is collected from the patient. The exosomes in the urine are captured, and the RNA is extracted from the exosomes. The RNA yield is measured using the Qubit assay (ThermoFisher) and the sample quality is assessed using the Agilent 2100 bioanalyzer. The resultant sncRNA levels are interrogated using custom made OpenArray™ (Thermo Fisher) plates that are specifically designed for the miR Scientific Sentinel™ PCa, Sentinel™ CS or Sentinel™ HG Tests. Interrogate is a common term of art for the simultaneous analysis of a large number of sequences in a biological sample. The resultant readout for amplification curves for snoRNA and microRNA (collectively called sncRNAs), are then analyzed and used to diagnose the patient, and when cancer is present to classify the disease, and monitor treatment accordingly.

FIG. 3 illustrates the complexity of using more than a single entity to establish an unbiased statistical approach to identify important interactions to identify individual and combinations of sncRNAs and to correlate the grade grouping or prostate cancer phenotype. (See, [00048]-[00051]).

FIG. 4 shows a schematic for screening and diagnosis of a patient suspected of having prostate cancer.

The patient first provides a urine sample for a three-layered Sentinel™ Analysis. Total urinary exosomal RNA is extracted and interrogated for expression of sncRNAs (SEQ ID NOs: 1-280) specific for the Sentinel™ PCa Test; sncRNAs (SEQ ID NOs: 281-560) specific for the Sentinel™ CS Test; and sncRNAs (SEQ ID NOs: 561-840) specific for the Sentinel™ HG Test. The expression signature will be used to classify patients into those with prostate cancer, and those without prostate cancer (Sentinel™ PCa Test, 1st layer). Patients with negative score will return every 12 months for monitoring.

In the second layer, patients with positive Sentinel™ PCa Score (those with prostate cancer) will be subjected to a secondary analysis that classifies them into having clinically insignificant (GG1) tumors or clinically significant tumors (GG2-GG5) using the Sentinel™ CS Test. Patients with clinically insignificant (GG1) tumors will be recommended for active surveillance (AS) and monitored continuously with quarterly Sentinel™ CS tests, to establish that the tumor has not progressed to GG2 or higher. Patients with clinically significant tumors (GG2-GG5) will be referred for immediate therapy.

For some patients, a third classification layer, the Sentinel™ HG Test, will further classify patients as having GG1-GG2 tumors or GG3-GG5 tumors. This test is designed to identify patients with GG3-GG5 cancers that need immediate intervention. Patients with GG1 or GG2 can be monitored by quarterly Sentinel™ HG Test to identify patients that progress to GG3 and therefore need therapeutic intervention. The availability of the Sentinel™ CS and Sentinel™ HG provides both patients and health care providers with the individualized information for treatment decision-making.

FIGS. 5A-5B show the output of the Discovery PCa Experiments. The PCa Discovery Studies use very carefully defined patient cohort [NEPC=89; Cancer (GG1-GG5)=146] with well characterized histopathology to identify the most informative sequences among the 6,599 sncRNAs interrogated on the miR4.0 microarrays.

FIG. 5A: Scatter plot of no cancer (NEPC) and cancer (GG1-GG5) status in the training data set. Positive Discovery PCa Score is indicative of having prostate cancer and a negative Discovery PCa Score is indicative of no cancer. The cancer status as determined by histopathology of core biopsies is shown in blue (no cancer) and red (cancer) circles.

FIG. 5B: Identification of informative sncRNAs for the Sentinel™ PCa Test Identification of the most informative sncRNA entities (top 35 are shown, each circle represents a single entity) for discriminating between no cancer and cancer status using the proprietary Selection Algorithm. The resultant Sentinel™ PCa Test interrogates 280 sncRNA including the 145 most informative sncRNA sequences, which comprises of 60 snoRNA and 85 miRNA entities as shown in the bar graph. Green: miRNA entities; Yellow: snoRNA entities.

FIG. 6A shows the output of the Discovery CS Experiments. The CS Discovery Studies use very carefully defined patient cohorts [GG1=90; GG2-GG5=56] with well characterized histopathology to identify the most informative sequences among the 6,599 sncRNAs interrogated on the miR4.0 microarrays. Positive Discovery CS Score is indicative of having GG2-GG5 cancer (yellow circles) and a negative Discovery CS Score is indicative of having GG1 cancer. (green circles)

FIG. 6B: Identification of informative sncRNAs for the Sentinel™ CS Test Identification of the most informative sncRNA entities (top 35 are shown, each circle represents a single entity) for discriminating between GG1 and GG2-GG5 cancer status using the proprietary Selection Algorithm. The resultant Sentinel™ CS Test interrogates 280 sncRNA including the 145 most informative sncRNA sequences, which comprises of 66 snoRNA and 130 miRNA entities as shown in the bar graph. Green: miRNA entities; Yellow: snoRNA entities.

FIG. 7A shows the output of the Discovery HG Experiments. The HG Discovery Studies use very carefully defined patient cohorts [GG1+GG2=181; GG3-GG5=55] with well characterized histopathology to identify the most informative sequences among the 6,599 sncRNAs interrogated on the miR4.0 microarrays. Positive Discovery HG Score is indicative of having GG3-GG5 cancer (purple circles) and a negative Discovery HG Score is indicative of having GG1+GG2 cancer (brown circles).

FIG. 7B: Identification of informative sncRNAs for the Sentinel™ HG Test. Identification of the most informative sncRNA entities (top 35 are shown, each circle represents a single entity) for discriminating between GG1 and GG2-GG5 cancer status using the proprietary Selection Algorithm. The resultant Sentinel™ CS Test interrogates 280 sncRNA including the 196 most informative sncRNA sequences, which comprises of 66 snoRNA and 130 miRNA entities as shown in the bar graph. Green: miRNA entities; Yellow: snoRNA entities.

FIGS. 8A-8C show the clinical validation of high throughput OpenArray™ interrogation of urinary exosomal sncRNA using the Sentinel™ PCa Test. The data from a case-control study of 1436 men (836 subjects in the training group used to cross-validate the interrogation of sncRNAs identified in the Discovery PCa phase and 600 independent subjects used in the validation study) are shown.

FIG. 8A: Scatter plot of cancer status in the validation group set that examines 600 patients (300 no cancer; 300 cancer). Classification of no cancer (black circle) and cancer (green circle) patients where a positive Sentinel™ PCa Score is indicative of having prostate cancer and a negative Sentinel™ PCa Score is indicative of no cancer.

FIG. 8B: Sorted plot of cancer status in the validation group set that examines 600 patients. Classification of no cancer (black circle) and cancer (green circle) patients where a positive Sentinel™ PCa Score is indicative of having prostate cancer and a negative Sentinel™ PCa Score is indicative of no cancer.

FIG. 8C: Receiver Operator Curve (ROC) for Sentinel™ PCa Test. The ROC curve for the analysis of 600 patients in testing group shown in FIGS. 8A and 8B was calculated by successively calculating (1-specificity) for different user-defined false negative rates. Performance characteristics reported in Table 6 (see, [000112]) were from the user defined false negative rate of 0.05.

FIGS. 9A-9C show the clinical validation of high throughput OpenArray™ interrogation of urinary exosomal sncRNA using the Sentinel™ CS Test. The data from a case-control study of 1436 men (836 subjects in the training group used to cross-validate the interrogation of sncRNAs identified in the Discovery CS phase and 600 independent subjects used in the validation study) are shown.

FIG. 9A: Scatter plot of cancer status in the validation group set that examines 300 prostate cancer patients (146 GG1-low grade and 154 GG2-GG5 intermediate and high grade). Classification of low grade (teal circle) and intermediate and high-grade cancer (orange circle) patients where a positive Sentinel™ CS Score is indicative of having high-grade prostate cancer and a negative Sentinel™ CS Score is indicative of low-grade cancer.

FIG. 9B: Sorted plot of cancer status in the validation group set as shown in FIG. 9A that examines 300 prostate cancer patients. Classification of low grade (teal circle) and high-grade cancer (orange circle) patients where a positive Sentinel™ CS Score is indicative of having high-grade prostate cancer and a negative Sentinel™ CS Score is indicative of low-grade cancer.

FIG. 9C: Receiver Operator Curve (ROC) for Sentinel™ CS Test. The ROC curve for the analysis of 300 prostate cancer patients, as shown in FIGS. 9A and 9B was calculated by successively calculating (1-specificity) for different user-defined false negative rates. Performance characteristics reported in Table 6 (see, [000112]) were from the user defined false negative rate of 0.05 (shown in red).

FIGS. 10A-10C show the clinical validation of high throughput OpenArray™ interrogation of urinary exosomal sncRNA using the Sentinel™ HG Test. The data from a case-control study of 1436 men (836 subjects in the training group used to cross-validate the interrogation of the same sncRNAs identified in the Discovery HG phase and 600 independent subjects used in the validation study) are shown.

FIG. 10A: Scatter plot of cancer status in the validation group set that examines 300 prostate cancer patients (200 GG1+GG2 low grade and 100 GG3-GG5 intermediate and high grade). Classification of low grade (teal circle) and intermediate and high-grade cancer (orange circle) patients where a positive Sentinel™ CS Score is indicative of having high-grade prostate cancer and a negative Sentinel™ CS Score is indicative of low-grade cancer.

FIG. 10B: Sorted plot of cancer status in the validation group set as shown in FIG. 10A that examines 300 prostate cancer patients. Classification of low grade (blue circle) and high-grade cancer (red circle) patients where a positive Sentinel™ HG Score is indicative of having high grade prostate cancer and a negative Sentinel™ HG Score is indicative of low-grade cancer.

FIG. 10C: Receiver Operator Curve (ROC) for Sentinel™ HG Test. The ROC curve for the analysis of 300 prostate cancer patients, as shown in FIGS. 10A and 10B was calculated by successively calculating (1-specificity) for different user-defined false negative rates. Performance characteristics reported in Table 6 (see, [000112]) were from the user defined false negative rate of 0.05 (shown in red).

DETAILED DESCRIPTION

The present subject matter may be understood more readily by reference to the following detailed description that forms a part of this disclosure. It is to be understood that this invention is not limited to the specific products, methods, conditions, or parameters described and/or shown herein, and that the terminology used is for the purpose of describing particular aspects and embodiments by way of example only and is not intended to be limiting of the claimed invention.

The disclosure relates to a method for screening, diagnosing and treating prostate cancer in a subject. The method provides robust tests for (1) classifying a male patient with unknown prostate cancer status and (2) accurately distinguishing between prostate cancer grades in biological samples from patients. The method is based on the detection and correlation of the aggregate expression profiles of a collection of sncRNAs from the patient's biological sample to determine if a patient has prostate cancer or not using Sentinel™ PCa Test. For the patient identified as having prostate cancer, the exosomal sncRNA is further interrogated using the Sentinel™ Clinical Significant (CS) Test to distinguish patients with clinically significant or aggressive (GG2-GG5) from those with clinically insignificant or indolent (GG1) prostate cancer, and the Sentinel™ High Grade (HG) Test to identify patients with high grade, high risk (GG3-GG5) prostate cancer.

The disclosed method is based on an unbiased statistical approach developed to identify important interactions of individual sequences and combinations of sequences that correlate best to a phenotype of interest. The approach is based on (i) the modulating effects of miRNA on mRNA and (ii) the influence of snoRNAs on mRNA translatability through post-transcriptional modification of ribosomal RNAs, tRNAs and other nuclear RNAs, which lead to new protein products that alters protein function and phenotype.

The disclosed computational/statistical approach analyses urinary exosomal sncRNAs to provide a very granular analysis of the critical associations between sncRNAs that leads to the identification of Sentinel sequences that accurately predict the prostate cancer phenotype. This is illustrated in the accompanying FIG. 3. For example, in the single entity analysis, the expression level of individual sncRNAs is correlated to the Grade Group of the prostate cancer (the phenotype). For each sncRNA entity there are two informative outcomes: either an increase in the expression level of the entity relative to the control pathology (e.g., no cancer) or a decreased expression level. No change of expression between the two phenotypes indicates that there is no association (1) with either phenotypes, and (2) the entity is not useful as a marker of either phenotype. Thus, when single entity is used in the analysis, there are only two informative outcomes, leaving all of the possible sncRNA interactions unexplored.

In the two entities analysis, examining the association of the expression changes for all possible interactions between two entities results in 8 different informative outcomes and 1 non-informative outcome (when neither entity is differentially expressed in their phenotype) (see, FIG. 3 “Interrogation of 2 Entities). Thus, in the context of the Sentinel™ Tests, when the association of all possible combinations of two sncRNA entities with a specific grade group are compared, there are 8 different ways that will lead to a meaningful association between pairs of sncRNAs and grade groups. This provides a more detailed analysis that uncovers hidden associations between expression levels of sncRNAs and Grade Grouping.

Using the same approach of three or four or more sncRNA entities provides a very granular analysis of the association between sncRNA expression and phenotype (grade grouping), making it possible to assess a patient of unknown disease status, and predict the individual's disease status using the expression levels of urinary exosomal sncRNA selected by the algorithm.

Development of Sentinel™ PCa, and Sentinel™ HG Test Sentinel™ CS Platforms

The Sentinel™ PCa Test is a classification platform or algorithm based on the analysis of a collection of signature sncRNA (i.e., miRNAs and snoRNAs sequences) levels. The predictive value of each sequence is defined via a data-driven Selection Algorithm that is independent of the a priori determined biological role of the sequences in prostate biology. The Selection Algorithm is trained on a dataset consisting of: (1) control subjects who presented in urology for conditions unrelated to prostate cancer; (2) subjects with suspicion of prostate cancer known to not have prostate cancer based on the biopsy results; and (3) patients diagnosed with prostate cancer and whose core needle biopsy histopathology was reported as Grade Groups 1 through 5 (GG1-GG5).

To establish robust datasets for the Sentinel™ Tests, exosomal sncRNAs obtained from the urinary exosomes of these training set of patients were interrogated using the Affymetrix miR 4.0 microarrays to define expression signatures. These studies using selected subjects with well characterized histopathology are referred to as the Discovery PCa, Discovery CS, and Discovery HG Tests. The patients included in the “no cancer” group were carefully selected from age-matched men who were seen at urology clinics for issues unrelated to urological oncology, and from men who had one or more 12-needle diagnostic core needle biopsy that showed no evidence of prostate cancer (NEPC). For patients in the “cancer” cohort, the pathological grade group classification of the core needle biopsies each tumor was thoroughly assessed. These carefully select groups of patients (no cancer and cancer group in different stages of cancer) form the training set in the development of the Discovery PCa, CS and HG Tests. The demographics of the 235 patients used for the Discovery experiments are shown in Table 4 (see, [000103]-[000104])

The Selection Algorithm

The most informative sncRNA sequences that discriminate between cancer and no cancer were identified using Selection algorithm which determines which sncRNA sequences are differentially represented between patients that do not have prostate cancer (NEPC) and those with prostate cancer (GG1-GG5). This is exemplified below for the Discovery PCa Test. The Selection algorithm tests how well the levels of the urinary exosomal sncRNAs correlate with pathological stage of the disease (cancer/no cancer) in a large population of participants [89 subjects with NEPC and 146 patients with cancer (GG1-GG5)] with carefully defined pathology. The Selection algorithm individually assesses how well each of the 6,599 sncRNAs interrogated on the miR4.0 arrays correlates the known pathology of the tumor. Since many sncRNA are coordinately modulated, the algorithm then assesses all combinations of 2 sncRNAs, 3 sncRNAs, or 4 sncRNAs, followed by examination of each individual sncRNA using a leave-one-out strategy to assess the importance of each individual sncRNA in the pathology of the disease. As would be expected leaving out most of the 6,599 sncRNA sequences from the Selection algorithm has no impact on the distinction between having prostate cancer and no prostate cancer because they are not differentially associated with either pathology. The impact of the sncRNAs assessment can be visualized using the importance plot shown in FIG. 5B. The importance plots show the following: (1) some exosomal sncRNAs are present in different levels in different pathologies, (2) the sncRNAs are snoRNAs and miRNAs indicating that one type of sncRNA is insufficient for the disclosed analysis, and (3) the diagnosis using the algorithm does not change with more than 280 sncRNA sequences in the classification assessment.

The informative sequences for the Discovery CS Test (which differentiates between low-risk (GG1) and intermediate- and high-risk prostate cancer (GG2-GG5), were identified using the appropriate Grade Groups and the same Selection strategy (FIG. 6A-6B). The patient population for this analysis included 89 subjects with NEPC and 146 patients with cancer (GG1-GG5)]

The informative sequences for the Discovery HG Test (which differentiates between low- and intermediate-risk (GG1+GG2) and high-grade, high-risk (GG3-GG5) prostate cancers were similarly determined (FIGS. 7A-7B). The patient population for this analysis included 181 patients with GG1+GG2 cancer and 55 patients with GG3-GG5 cancer. (FIGS. 7A-7B).

It is important to note that while some of the sncRNAs are common between the tests, their relative importance in the Classification of disease status varies from test to test (i.e., Discovery PCa Test, CS Test, and HG Test).

For each Test the most informative 280 sncRNAs SEQ. ID NOs: 1-840) were used to design customized OpenArray™ platforms. The OpenArray™ platform for each Sentinel™ Test was further validated in a large case-control study of 1436 patients. The demographics of the subjects used to train and validate the Sentinel™ PCa, Sentinel™ CS and Sentinel™ HG Tests are shown in Table 5 (see [000109]-[000110]). A stratified random sample of 600 subjects was selected to identify the validation dataset; the remaining 836 subjects served as the training dataset. The validation sample of 600 patients was stratified so that an equal number of subjects were biopsy negative versus biopsy positive (300 each), and of the biopsy positive cases, 200 were GG1+GG2 (146 GG1 and 54 GG2), and 100 GG3-GG5.

The Sentinel™ PCa Test to Identify Prostate Cancer

The Sentinel™ PCa, Sentinel™ CS and Sentinel™ HG Tests are based on a Classification Algorithm that takes as input the sncRNA expression signature for each patient with unknown disease status and produces a Sentinel™ Score; the participant is classified by comparing this Score to the pre-determined cutoff value (obtained from cross-validation in the training dataset) that controls sensitivity for classifying a future patient with unknown disease status (but known expression signature), at a user-defined level (typically 95% or greater).

The Sentinel PCa Score is compared to a calculated cutoff that controls sensitivity for a future patient at a desired level of, for example, 95% to distinguish between having prostate cancer and no prostate cancer for the PCa Test (FIG. 5A-5B). The Sentinel™ PCa Test utilizes 280 sncRNA (identified by the Discovery PCa Test), of which 145 unique sncRNAs: 60 miRNAs and 85 snoRNAs are highly informative. This defines the classification boundary used to dichotomize patients into prostate cancer or not. The cutoff is determined by the algorithm such that the Sentinel™ PCa score, which dichotomizes the patients into cancer/no cancer, correctly classify the patient as having cancer 19 times out of 20 (i.e., with 95% sensitivity).

TABLE 1 SEQ ID NOs: 1-280 Used in the PCa Test Analysis SEQ ID Sequence NO: Name Sequence 1 hsa-miR- CAAAAACCGGCAAUUACUUUUG 548ac 2 MIR6687 GAGAAUGGGGGGACAGAUGGAGAGGACACAGGCUGGCACUGAGGUCCCCUCC ACUUUCCUCCUAG 3 MIR3150B GAGGGAAAGCAGGCCAACCUCGAGGAUCUCCCCAGCCUUGGCGUUCAGGUGC UGAGGAGAUCGUCGAGGUUGGCCUGCUUCCCCUC 4 MIR1301 GGAUUGUGGGGGGUCGCUCUAGGCACCGCAGCACUGUGCUGGGGAUGUUGCA GCUGCCUGGGAGUGACUUCACACAGUCCUC 5 MIR548P AUUAGGUUGGUAUAAAAUUAAUUGCAGUUUUUGUCAUUACUUUCAAUAGCA AAAACUGCAGUUACUUUUGCACCAAUGUAAUAC 6 hsa-let- UGAGGUAGUAGGUUGUGUGGUU 7b-5p 7 ENSG00000222185 UGGACCAAUGAUGUGAAUGGAAUGCAUCUGAAUAAAAAUUAUGAUCAAUCA GUUUUUGGAACAACUGAGGUCCAC 8 MIR578 AGAUAAAUCUAUAGACAAAAUACAAUCCCGGACAACAAGAAGCUCCUAUAGC UCCUGUAGCUUCUUGUGCUCUAGGAUUGUAUUUUGUUUAUAUAU 9 hsa-miR- CACAUUACACGGUCGACCUCU 323a-3p 10 hsa-miR- UGGGGCUCAGCGAGUUU 4283 11 MIR4438 UAAGUGUAAACUUAAGGACUGUCUUUUCUAAGCCUGUGCCUUGCCUUUCCUU UGGCACAGGCUUAGAAAAGACAGUCUUUAAGUUUACACUUC 12 MIR1205 GAAGGCCUCUGCAGGGUUUGCUUUGAGGUACUUCCUUCCUGUCAACCCUGUU CUGGAGUCUGU 13 hsa-miR- GAUCCCUUUAUCUGUCCUCUAG 6866-3p 14 hsa-miR- UUGGGUUUUCUCUUCAAUCCAG 6839-3p 15 hsa-miR- CUUAGAUUAGAGGAUAUUGUU 8061 16 hsa-miR- CAGCCCCACAGCCUCAGA 4323 17 MIR6784 UACAGGCCGGGGCUUUGGGUGAGGGACCCCCGGAGUCUGUCACGGUCUCACC CCAACUCUGCCCCAG 18 SNORA22 UUGCACAGUGAACACCCAAGUGUGCUUUAUAGUUCCCUUGGCUUUGACCCUG UGCUAGAGCAUUGCCUGCUCUUCUCCUCUGCAUUAAAAGGAAUAUUUAUCCU UUUAAAUGUAUUCAGAAAGCCAGCACAUUA 19 MIR6803 CUCCUCUGGGGGUGGGGGGCUGGGCGUGGUGGACAGCGAUGCAUCCCUCGCC UUCUCACCCUCAG 20 MIR371A GUGGCACUCAAACUGUGGGGGCACUUUCUGCUCUCUGGUGAAAGUGCCGCCA UCUUUUGAGUGUUAC 21 MIR378E CUGACUCCAGUGUCCAGGCCAGGGGCAGACAGUGGACAGAGAACAGUGCCCA AGACCACUGGACUUGGAGUCAGGACAU 22 hsa-miR- UGGGGAGGUGUGGAGUCAGCAU 6825-5p 23 ENSG00000252204 AUGACCUGUGAAACCAAGGGCUCCUAAUGCUAUGACCAAAGACUGAAGCUCU CUAUGAGAUGCCAGCCACUCAAUAGUGCACUUUUUCUGAGAAGAUAUAAGA 24 MIR7847 GUGUCGGCUGUGGCGUGACUGUCCCUCUGUGUCCCCCACUAGGCCCACUGCU CAGUGGAGCGUGGAGGACGAGGAGGAGGCCGUCCACGAGCAAUGCCAGCAU 25 hsa-miR- GUGUGCGGAAAUGCUUCUGCUA 147b-3p 26 MIR181A1 UGAGUUUUGAGGUUGCUUCAGUGAACAUUCAACGCUGUCGGUGAGUUUGGA AUUAAAAUCAAAACCAUCGACCGUUGAUUGUACCCUAUGGCUAACCAUCAUC UACUCCA 27 hsa-miR- CAGCAGCAAUUCAUGUUUUGAA 424-5p 28 MIR4712 GACAGGAUUCCAGUACAGGUCUCUCAUUUCCUUCAUGAUUAGGAAUACUACU UUGAAAUGAGAGACCUGUACUGUAUCUGUU 29 hsa-miR- AGAGCUGGCUGAAGGGCAG 4487 30 hsa-miR- UGCCCUAAAUGCCCCUUCUGGC 18b-3p 31 ENSG00000238298 UUAUUUUUGUAGUUGAUGAAUGUGCUGAUUGGGUAUUCUCGUGUGUGUGUG AGGUGCCACCCUCAAACUUUGUUAUGAUGUUGGCACAUUACCCAUCUGAUA 32 hsa-miR- AUGGCCAGAGCUCACACAGAGG 4435 33 hsa-miR- CGAGCCUCAAGCAAGGGACUU 2114-3p 34 hsa-miR- UGCCCCACCUGCUGACCACCCUC 4758-3p 35 SNORD113- AAAGUGAGUGAUGAAUAGUUCUGUGGCAUAUGAAUCAUUAAUUUUGAUUAA 1 ACCCUAAACUCUGAAGUCC 36 hsa-miR- CAACACCAGUCGAUGGGCUGU 21-3p 37 ENSG00000239123 AUCCUUUUGUGGUUCAUAAGCAUGAUGAUCAGGUUUUCAGGCAUAUGUGUAC GAUGUGCCUCCUUCAAACUUUGUUAGGAUGCUACCACGCUACCCAUCUGACU 38 hsa-miR- AGAACUCUUGCAGUCUUAGAUGU 4680-5p 39 MIR5580 UGCUGGCUCAUUUCAUAUGUGUGCUGAGAAAAUUCACACAUAUGAAGUGAGC CAGCAC 40 SNORD42B GUGCAUAUGAUGGAAAAGUUUUAAUCUCCUGACACUUGUGAUGUCUUCAAAG GAACCACUGAUGCAC 41 MIR365A ACCGCAGGGAAAAUGAGGGACUUUUGGGGGCAGAUGUGUUUCCAUUCCACUA UCAUAAUGCCCCUAAAAAUCCUUAUUGCUCUUGCA 42 SNORD114- UGGAUCAAUGAUGACCACUGGUGGCGUAUGAGUCAUAUGUGAUGAAUACGU 21 GUCUGGAACUCUGAGGUCCA 43 ENSG00000252945 UGAGUUUUGGGAUGAGACCCUGGAAUAAGUGCUGGACACAGUGCCUGAAUCA GACUGUGGAAAUAUUAAUGUAUUUUAUUUUUACUUA 44 MIR4757 UUCCAGCCCGAGGCCUCUGUGACGUCACGGUGUCUGCGGGAGGAGACCAUGA CGUCACAGAGGCUUCGCGCUCUGAG 45 hsa-miR- CCUGUUGAAGUGUAAUCCCCA 1267 46 SNORD114- UGGAUCGAUGGUGACUGUUGAUGGCAUAUGACUCACAUAUGAUGAGUACGU 24 AUCUGGAACUCUGAGGUCUG 47 ENSG00000238676 AUCCUUUUGUACUUGGUAAGCAUGAUGAUUGGGUUUUUAUGCUUAUAUAUG AGACAUGCUUGUCUCAAAUCUUGUUACAGCACAUUACCCUUCCUACU 48 hsa-miR- UCUAGUAAGAGUGGCAGUCGA 628-3p 49 MIR7158 GGCUCAAUCUCUGGUCCUGCAGCCUUCUGCCUUUGGCUUUCUGAAGCGAGCU GAACUAGAGAUUGGGCCCA 50 ENSG00000238605 GUCCUUUUGUAGUCCAUAAGCAUGGUGAUUUGGUUUCAUGCUCAUGUGUCAG AUAUGCUUCCCUCAAACCUUGUUACAGCAUCAUCACAUUACCUGUUUGAUG 51 ENSG00000212618 UUCUUACAAAUCUAAAUGUGCUUUGAUGCAAGUAUAUUUGAAUCCCUUUCCA UCUGAUAACUGAGCAAAAUAAUA 52 hsa-miR- UAAUACUGUCUGGUAAAACCGU 429 53 hsa-miR- AACAAUAUCCUGGUGCUGAGUG 338-5p 54 hsa-miR- AGGUGUGUCUGUAGAGUCC 3650 55 hsa-miR- CUACAAAGGGAAGCCCUUUC 520d-5p 56 SNORD28 GUCAGAUGAUUUGAAUUGAUAAGCUGAUGUUCUGUGAGGUACAAAAGUUAA UAGCAUGUUAGAGUUCUGAUGGCA 57 hsa-miR- CAAUCACUAACUCCACUGCCAU 34b-3p 58 hsa-miR- AAAACUGCAGUUACUUUUGC 548av-3p 59 ENSG00000200042 AAAAUUAUACUUUCAGCAAUCAUCUCUAUAGUUUGUUACUAGAGAAGCUUCU GUGAAUGUGUAGAGCACCGGAAACCACAAGGCAAAGGCUCAGCAUUCUCUCC UAAGCGCGAAGCUGGCUCCUGGUGUUGGUUGGCCGCAACUGCCAUUUGCCAU UGAUGAUCAUUCUUCUCUUCCUGUGGUAGAGGAAGAGGGAGAGAAUGCAGU UUGAGUGG 60 ENSG00000238596 AUCCUUCUGUAGUUCGUGAGCAUGAUGAUUGGGUGCUCACACACAUAUGUGA GAUGUACCACCCUCAAACCUUGUUACAAUGUCAGCACAUUACCCAUCUGACC 61 hsa-miR- AGAGAAGAAGAUCAGCCUGCA 1253 62 MIR3910-2 UUUUUUUGUUGCUUGUCUUGGUUUUAUGCCUUUUAUGUGCCUUGAUAUAAA AGGCAUAAAACCAAGACAAGCAACAGAAAAA 63 MIR4690 GAGCAGGCGAGGCUGGGCUGAACCCGUGGGUGAGGAGUGCAGCCCAGCUGAG GCCUCUGC 64 hsa-miR- CUGACCUAUGAAUUGACAGCC 192-5p 65 hsa-miR- ACUUUAACAUGGAAGUGCUUUC 302b-5p 66 hsa-miR- UGUCACUCGGCUCGGCCCACUAC 668-3p 67 hsa-miR- CCUGGCAUAUUUGGUAUAACUU 4720-5p 68 hsa-miR- UGUCAGUUUGUCAAAUACCCCA 223-3p 69 ENSG00000239018 AUCCUGUUAUGAUUCAUAAGCAUGAUGACUGAGUUUUCACACUCCUGUGUGA GAUGUGCCUCCCUCUAACCUUAUUACAACAUUGACAUCUUACUCAUUUGACA 70 hsa-miR- AACCCGUAGAUCCGAACUUGUG 100-5p 71 hsa-miR- CGUGUUCACAGCGGACCUUGAU 124-5p 72 ENSG00000201151 CUAAAACAAUGUCAAUAGUUUUCAUCAACAGCAGUUGAACCUAGUAAGUGUC GAUACUUUGGGUCUGAGUGG 73 hsa-miR- AGUGCCUGAGGGAGUAAGAGCCC 550a-5p 74 SNORA38B CCCUCCUACAAAGGCAUGUCUAUAGUUCCUUGUCUUUGGACAUGUAAGAAUU GGAGGCAAAGAAAUGUGGACUUGGAGAAAUCUGGGGCCAGCUUGCUCUCCGC AGGCUCAAGAUCAACCAUCCCACAUAG 75 hsa-miR- UGUGCUUGCUCGUCCCGCCCGCA 636 76 hsa-miR- UGGGAAAGAGAAAGAACAAGUA 6739-5p 77 hsa-miR- UUAACUCCUUUCACACCCAUGG 4764-3p 78 hsa-miR- UACCUGGGAGACUGAGGUUGGA 7851-3p 79 ENSG00000212168 CUGUAAUGAUGUUGCUCAAAUAUCUGACCUGAAAUGAUUAUAUAGACCAAUU UAAUACUGAAGAA 80 SNORA9 UAGCAAGCCUCCAGCGUGCUUGGGUCUGCGGUGACCCUAUGCAUUCCUUCAG UGCUUGCUAGAACAGUUUUGAAACGGUUUGAGGCCUUGCCCUGCUCCAUCCA GAGCAAGGUUAUAGAAAUUUCAGACAAUG 81 hsa-miR- AUAUGGGUUUACUAGUUGGU 3115 82 ENSG00000252071 AACCUAUGUAUCAUAUCACAUUGGGUUUUCAUGCUCAUGUGUGAGAAGUGCC UCUUUCAAACCUUGUUCUGACACAUUAUCUUACA 83 hsa-miR- UGGGCUGCUGAGAAGGGGCA 6829-5p 84 MIR4693 GUUUAAAGAAUACUGUGAAUUUCACUGUCACAAAUUCAAAUAAAGUGAGAG UGGAAUUCACAGUAUUUAAGGAAU 85 MIR3162 CUGACUUUUUUAGGGAGUAGAAGGGUGGGGAGCAUGAACAAUGUUUCUCAC UCCCUACCCCUCCACUCCCCAAAAAAGUCAG 86 hsa-miR- UCUGGUCCUGGACAGGAGGC 7152-3p 87 ENSG00000252236 GGUCUCUUGUGUCGGCACCUGGGUGGCUUGCCGCCCACACAACCAAAUUAAA AAAUAACACAGAAGGGUAAGGUAAGUCUCCAUUAAACCCAGGAAAGAGACUG GAAAACUCCUCUUUGGAGCCUGUCUAUAGUCACAGGU 88 MIR153- AGCGGUGGCCAGUGUCAUUUUUGUGAUGUUGCAGCUAGUAAUAUGAGCCCAG 2 UUGCAUAGUCACAAAAGUGAUCAUUGGAAACUGUG 89 hsa-miR- AGGAGGACAAGUUGUGGGAU 7154-3p 90 hsa-miR- AGCAAAAUAAGCAAAUGGAAAA 1468-3p 91 MIR4446 CUGGUCCAUUUCCCUGCCAUUCCCUUGGCUUCAAUUUACUCCCAGGGCUGGC AGUGACAUGGGUCAA 92 hsa-miR- AGAAAGGGUGGCAAUACCUCUU 5681a 93 MIR592 UAUUAUGCCAUGACAUUGUGUCAAUAUGCGAUGAUGUGUUGUGAUGGCACA GCGUCAUCACGUGGUGACGCAACAUCAUGACGUAAGACGUCACAAC 94 MIR4521 UCGGCUAAGGAAGUCCUGUGCUCAGUUUUGUAGCAUCAAAACUAGGAUUUCU CUUGUUAC 95 MIR636 UGGCGGCCUGGGCGGGAGCGCGCGGGCGGGGCCGGCCCCGCUGCCUGGAAUU AACCCCGCUGUGCUUGCUCGUCCCGCCCGCAGCCCUAGGCGGCGUCG 96 ENSG00000238734 AUCCUUUUGUAAUACAUAAGCAUAAUGAUUGGGUUUUUAUGUUCACAUGUU UGAUAUGCCUCCCUCAAAUCCUCUUAUGAUGUCGGCACAUUACCCAUCUGAG G 97 hsa-miR- UGAGGCGGGGGGGCGAGC 6087 98 MIR4498 AGGGCUGGGCUGGCAGGGCAAGUGCUGCAGAUCUUUGUCUAAGCAGCCCCUG CCUUGGAUCUCCCA 99 hsa-miR- AGUUUGGGAUGGAGAGAGGAGA 6740-5p 100 hsa-miR- AGCGAGGUUGCCCUUUGUAUAU 381-5p 101 hsa-miR- UAAGUGCUUCCAUGCUU 302e 102 hsa-miR- UAGGGGCAGCAGAGGACCUGGG 4688 103 hsa-miR- CCUCCUGCCCUCCUUGCUGU 1976 104 ENSG00000252128 ACUCCAUGAUGAACCCAAAAUGCCAAGUAUAUGACUGAACUUACAAGUGAUA CCAUCUUACGACUGAAGAGU 105 ENSG00000238922 AUGCUUUUGUAGUUCGUAAGCAUGAUGAUUGGGUUUUCUUGCUCUUGUAUG AGAUGUGCCUCCGUCAUACCUUGGAAACCUGACUUGAAA 106 hsa-miR- UUGGAGGGUGUGGAAGACAUC 6515-5p 107 hsa-miR- AAGUCCUGCUUCUGUUGCAG 6838-3p 108 MIR892A GCAGUGCCUUACUCAGAAAGGUGCCAGUCACUUACACUACAUGUCACUGUGU CCUUUCUGCGUAGAGUAAGGCUC 109 ENSG00000251959 ACAAGGGUUCUAAUUUCACUACAUCCCCUCCAAUAUUUGGUAUCUUUCCUUU CUUAAAAAAAUAGCCAGCCUAGUGAGUGUGAAGUGGCAUCUCAAUGUGGUUU UGAUUU 110 ENSG00000207100 UGCACUGCGUGGUAUCUGCACUCAGCAGUUUACUCCUGCUAGGGUGUUCAAA GGUCAGUGCCAUAGAAAUCCAGUAUCUGGUUUCAUUGGUUUUCUUGGCUUUG UGCUUGUUAAACCUGGUAUUUCUAUUGAUACAGCA 111 ENSG00000238843 AUACUUUUGUAGGUCAUAAGCUGAGGAUUGGGUUUUCAUGCUCUUGUGUGA GAUAUGCUUCUCUCAAACCUUCUGACCUGGGCACAUUACCCAGCUAAUG 112 ENSG00000221496 UUUCUAUAGUUUAUUACCAGAAAAGUUUCUCAGAAUGUGUAGAGCACUGGA AACCAUGAGGAAGAGGCAUAGCGUUCUCUCUUGAGCAUCAAGUUGGCUGUUG GUGUUGCUUUGCUGCAAACGCCAUUUGUCAUUGUCUUCCUUGUCUUCCUUUA GGAGAGUAAGAGGGAGAGGACACAGUCUGGGUAG 113 hsa-miR- CACCCCCUGUUUCCUGGCCCAC 4640-3p 114 MIR2117 GCUCUGAUUUACUUCUGUCCGGCAUGGUGAACAGCAGGAUUGGCUGUAGCUG UUCUCUUUGCCAAGGACAGAUCUGAUCU 115 hsa-miR- GGUUCCCUCUCCAAAUGUGUCU 642b-5p 116 MIR25 GGCCAGUGUUGAGAGGCGGAGACUUGGGCAAUUGCUGGACGCUGCCCUGGGC AUUGCACUUGUCUCGGUCUGACAGUGCCGGCC 117 ENSG00000238748 AUCCUUUUGUAGUUCAUAAAUGUGAUAAUUGGGUGUUCACGUGCAUGUAUG AGAUGUCUGAGUCCCUCAAACCUUGUUACAACAUUGGUACAUUACCCAUUUU ACC 118 hsa-miR- GAUAUUCAGAGGCUAGGUGG 6074 119 MIRLET7F2 CUAUACAGUCUACUGUCUUUCC 120 hsa-miR- UUCCUUCUGUUGUCUGUGCAG 6868-3p 121 MIR4320 GACAUGUGGGGUUUGCUGUAGACAUUUCAGAUAACUCGGGAUUCUGUAGCUU CCUGGCAACUUUG 122 ENSG00000238651 AUCCUUUUGUAGUUUAUAAGCGUGAUGACUGGGGUUUCACGUGCAUGUGUG AAAUGUGCCUUCCCCAAGCCUUGUUAUGACCUCAUUGGAACAUUACCCCUUU GACA 123 hsa-miR- CACCGACUCUGUCUCCUGCAG 6510-3p 124 hsa-miR- GGCCUUGUUCCUGUCCCCA 4312 125 hsa-miR- UCUGCCAUCCUCCCUCCCCUAC 4769-3p 126 hsa-miR- UUGGGAUGGUAGGACCAGAGGGG 6728-5p 127 MIR5192 UUAGUUCCAGCCUCCUGGCUCACCUGGAACCAUUUCUCCUGGGAAGCAUGGU AGCCAGGAGAGUGGAUUCCAGGUGGUGAGGGCUUGGUACU 128 hsa-miR- CAACGGAAUCCCAAAAGCAGCUG 191-5p 129 hsa-miR- GGCGGAGGGAAGUAGGUCCGUUGGU 658 130 hsa-miR- UAGUUCUUCCCUUUGCCCAAUU 5584-3p 131 MIR155 CUGUUAAUGCUAAUCGUGAUAGGGGUUUUUGCCUCCAACUGACUCCUACAUA UUAGCAUUAACAG 132 hsa-miR- CCCGGACAGGCGUUCGUGCGACGU 3687 133 ENSG00000238807 AUCCUUUUGUAGUUCAUCAGUGUCAUGAGUGGGUUUUCACGCACAUGUGUCA AAUAUGCCUCCCUCAAACUGUUACGUCAUUGGCAUAUUACCUGACGUGAAG 134 hsa-miR- CCGGGGCAGAUUGGUGUAGGGUG 5090 135 hsa-miR- UCAGCACCAGGAUAUUGUUGGAG 3065-3p 136 hsa-miR- UCAAGUAGUUUCAUGAUAAAGG 5697 137 hsa-miR- UGCUCAGGUUGCACAGCUGGGA 3934-3p 138 ENSG00000200150 UGAAUCAAUGGUGACCACUGGUGGCAUAUAAGUCAUGGAUGAUGAAUAUGA GAAGAAAAGAAUCUAGGUUUUU 139 ENSG00000238947 AUCCUUUUGUGGUUCAUCCGCCUGAUGAUUGGGUUUUCAUGCAGACGUGUGA GCUGUGCCUCCCUCAAGCCUUGUUACAACAUCCGACAUCCGCACAUUACCUG UCUGAUG 140 MIR1289-2 CCACGGUCCUAGUUAAAAAGGCACAUUCCUAGACCCUGCCUCAGAACUACUG AACAGAGUCACUGGGUGUGGAGUCCAGGAAUCUGCAUUUUUACCCCUAUCGC CCCCGCC 141 hsa-miR- CCUCACCCAGCUCUCUGGCCCUCU 6862-3p 142 hsa-miR- AAUCAUACAGGGACAUCCAGUU 487a-3p 143 MIR590 UAGCCAGUCAGAAAUGAGCUUAUUCAUAAAAGUGCAGUAUGGUGAAGUCAA UCUGUAAUUUUAUGUAUAAGCUAGUCUCUGAUUGAAACAUGCAGCA 144 hsa-miR- ACUGGGAAGAGGAGCUGAGGGA 4646-5p 145 MIR4635 CCGGGACUUUGUGGGUUCUGACCCCACUUGGAUCACGCCGACAACACUGGUC UUGAAGUCAGAACCCGCAAAGUCCUGG 146 SNORD74 CUGCCUCUGAUGAAGCCUGUGUUGGUAGGGACAUCUGAGAGUAAUGAUGAAU GCCAACCGCUCUGAUGGUGG 147 hsa-miR- UGAUAUGUUUGAUAUUGGGUU 190b-5p 148 MIR4723 AGUUGGUGGGGGAGCCAUGAGAUAAGAGCACCUCCUAGAGAAUGUUGAACUA AAGGUGCCCUCUCUGGCUCCUCCCCAAAG 149 ENSG00000212397 AUUCUUAAAUGAAUGAUGAAAUACCAAAAAGAAAAAUAAGCAAAGAACAGA UAACAGAAAGAAGCACAGCAAAUACAACAUAAUACUGACAGUAAAAAU 150 MIR3123 AUGGAUUUGAUUGAAUGAUUCUCCCAUUUCCACAUGGAGAGUGGAGCCCAGA GAAUUGUUUAAUCAUGUAUCCAU 151 hsa-miR- GCUGGUUUCAUAUGGUGGUUUAGA 29b-1-5p 152 MIR4489 GGGGGUGGGGCUAGUGAUGCAGGACGCUGGGGACUGGAGAAGUCCUGCCUGA CCCUGUCCCA 153 ENSG00000239197 AAGCAUGGCACACUGGAUGGGCGUUCUGCUUCUCUUUAAAGAGCAUGGAUUU AUCCAUACCAUGUGACAUGAAUGAAAUGAGGAGUUUUCAGGGCUGCCAACCU CUUGGUUAAGGUUCUGUGUAGUAUAUUUCUCCUACAAUA 154 ENSG00000252380 UGGACCAAUGAUGAGAAUAUGUCAUGAACCAAGGAAUAUGAUUAAUCCAAU UCUGUGUACUGGAGGGUCAAA 155 hsa-miR- AGAAGUGGCUAAUAAUAUUGA 4795-5p 156 MIR4685 UAGCCCAGGGCUUGGAGUGGGGCAAGGUUGUUGGUGAUAUGGCUUCCUCUCC CUUCCUGCCCUGGCUAG 157 hsa-miR- ACGGAAUAUGUAUACGGAAUAUA 3669 158 MIR3202-2 AUUAAUAUGGAAGGGAGAAGAGCUUUAAUGCUCUGAAAAUGACUCCAAUCA UUAAAGCUCUUCUCCCUUCCAUAUUAAU 159 MIR5694 GCCAACUGCAGAUCAUGGGACUGUCUCAGCCCCAUAUGUAUCUGAAGGCUGA GAAGUCCCAUGAUCCGCACUUGGC 160 hsa-miR- AGUGGGGUGGGACCCAGCUGUU 6742-5p 161 hsa-miR- AAGCUGCCAGUUGAAGAACUGU 22-3p 162 hsa-miR- CCCUUCCCUCACUCUUCUCUCAG 6734-3p 163 ENSG00000252258 UCCCAUCUCUUAAAUAAAAAGAUUUUUUUUUUAAGAAGUUGUACAUGUGCA AUGGCUGCAAACAGCAGCUUCCUUGGCAGUGUGUGCAGCCUGUUUCUUGUAU GGGUUGCUCUAAGGGACCUUGGAGACAGGC 164 hsa-miR- GGGCUGGGGCGCGGGGAGGU 5787 165 hsa-miR- GCCGGACAAGAGGGAGG 4442 166 ENSG00000199713 UCAUCAGGUGGGAUAAUCCUUACCUGUUCCUCGUUUUGGAGGGCAGAUAGAA CAGGAUAAUUGGAGUUUGCAUGAUCCAUGAUUAAUGUCUCUGUGUAAUCAG GACUUGCAAACUCUGAUUGUUCAUAUCUGAU 167 hsa-miR- UUCACAGUGGCUAAGUUCCGC 27a-3p 168 hsa-miR- ACUGGGGAGCAGAAGGAGAACC 4667-5p 169 hsa-miR- UUGAUCUCGGAAGCUAAGC 7641 170 MIR320C2 CUUCUCUUUCCAGUUCUUCCCAGAAUUGGGAAAAGCUGGGUUGAGAGGGU 171 MIR6759 UAUUGUUGUGGGUGGGCAGAAGUCUGUUUUCUUCAUGGUUUUCUGACCUUU GCCUCUCCCCUCAG 172 hsa-miR- CAAGGAGACGGGAACAUGGAGC 4428 173 MIR3158-2 AUUCAGGCCGGUCCUGCAGAGAGGAAGCCCUUCCAAUACCUGUAAGCAGAAG GGCUUCCUCUCUGCAGGACCGGCCUGAAU 174 ENSG00000238306 AUCUUUUUGUGGUUCACAAGUGUGAUGAUUAGGUUUUCAGACUCAUGUGUG AGACAUGCCUUCCUCAAACCUUCUUAUGCUAUCAGCACAUAAUCUGGCUGAC A 175 hsa-miR- UAAAUUUCACCUUUCUGAGAAGG 513a-3p 176 ENSG00000206913 GAGCUUCCAGGAUCACCCCUGCAGAGUGGCUAAUAUUCUGCCAGCUUCGGAA AGGGAGGGGAAGCAAGCCUGGCAGAGGCACCCAUUCCAUUCCCAGCUUGCUU AGUAGCUGGCCAUGGGAAGACACUGUGCAACACUG 177 ENSG00000207199 UCUUAGUGAUGAAAACUUUGUCCAGUUCUGCUAAUGACUUUAAGUGAUGAU AAACUAUGUCUGAGGGGA 178 ENSG00000207094 AUCCAAGGCGAUUCCCUCUCCAAGGGGACAUCUAGUGCCCCUCUCAGGAAAG UAGCAACUUGGAAUAGAAUCUGGCAUGCCUAAGGUCUUUGAGGAACAGGGAU GCUUAUUUCCUCUGCCUUCCUUGGCUGCCUACAUAG 179 hsa-miR- AGCUCUGCUGCUCACUGGCAGU 3194-3p 180 ENSG00000206897 UAGCAAGCCUCCAGCGUGCUUGGGUCUGCAGUGACCCCGUGGAUUCCUACAG GGCUUGCCAGAACAGUUUUGAAAUGGUUUGAGGCCUUGCCGUGCUCCAUGUA GAGCAAGGUUAUAGAAAUUUCAGACAAUG 181 hsa-miR- UCACUCUCACCUUGCUUUGC 4639-3p 182 SNORD116- UGGAUCGAUGAUGACUUUCAUACAUGCAUUCCUUGGAAAGCUGAACAAAAUG 16 AGUGAAAACUCUAUACCGUCAUCCUCGUCGAACUGAGGUCCA 183 MIR1324 CCUGAAGAGGUGCAUGAAGCCUGGUCCUGCCCUCACUGGGAACCCCCUUCCC UCUGGGUACCAGACAGAAUUCUAUGCACUUUCCUGGAGGCUCCA 184 hsa-miR- GCGAGGACCCCUCGGGGUCUGAC 611 185 ENSG00000221043 AAGACUCUACUCUCAGGGCUCAUUUCUGUCAUUCAAUACUAGAGAAGUUUCU CUGAAUGUUUAGAGCACUGGAAACCAAACGGAGGAGGCGGGCAUUCUUUCCU GAGCAUGCAGCCAGCUCAUAGUGUUGUUUUGUUGCAGCUGCCGCUUGCCAUU GAUGAUCCUUCUUCUCUUCCUUCAGGGGAGUAAGGAGACGACGCGGUCUUAG UGG 186 hsa-miR- UCUAAAGACUAGACUUCGCUAUG 4744 187 ENSG00000252805 AAGACUACACUUUCAGGGAUAAUUUCUAUAGUUCAUUACUAGAGAAGUUUCU CUGAAUGUGUAGAGCACCAUAAAAUACAUUUUAUUUUUUAUUUGAGACAGG GUCUCACUCUGUCACCCAAGCUGGAGUGCAGUGG 188 MIR6500 CCUGCCUGCAGAAAGGAGCUAUCCACUCCAGGUGUCCUUUCUUCUGAGAGCU GGACACUUGUUGGGAUGACCUGCCUGCAGGUAGG 189 hsa-miR- AUUCUCUCUGGAUCCCAUGGAU 4768-5p 190 MIR4419B CUCAGGCUCAGUGGUGCAUGCUUAUAGUCCCAGCCACUCUGGAGGCUGAAGG AAGAUGGCUUGAGCCU 191 hsa-miR- UGGUCUGUUCAUUCUCUCUUUUUGGCC 7107-3p 192 ENSG00000239083 AUCCUUCUGUGGCUGAUAUGUGUGAUGAGGGGGUUUUCACACUCUUGCGUGG GACGUGCAACCUCUUUAGAACAGUGGCACAUUACCUGUCCUACA 193 MIR513C GCGUACAGUGCCUUUCUCAAGGAGGUGUCGUUUAUGUGAACUAAAAUAUAAA UUUCACCUUUCUGAGAAGAGUAAUGUACAGCA 194 hsa-miR- UCUUUUCUUUGAGACUCACU 627-3p 195 hsa-miR- UUGGGAGGGAAGACAGCUGGAGA 6780a-5p 196 ENSG00000212378 AUGUAAUAAUGUUCAUCAAAUGUCUGACCUGAAAUGAGCAUGUAGACAAGU UAAUUUAACACUGAAGAA 197 MIR559 GCUCCAGUAACAUCUUAAAGUAAAUAUGCACCAAAAUUACUUUUGGUAAAUA CAGUUUUGGUGCAUAUUUACUUUAGGAUGUUACUGGAGCUCCCA 198 SNORA70G CUGCAGCCUAUUAAGCCAACUGAGUUCCUUUCCUCAUGGGGGGGCCCAGUGU GCAAUGGCUGCAAACAGCAGCUUCCUUGGUAGUGUAUGCAGCCUGUGUGUUG UAUUGUAUGGGUUGCUCUAAGGGACCCUGGAGACAGUC 199 hsa-miR- AGCUACAGUUACUUUUGCACCA 548v 200 hsa-miR- AGACAGUAGUUCUUGCCUGGUU 4645-3p 201 hsa-miR- GGGAGGUGUGAUCUCACACUCG 3689d 202 hsa-miR- AACUCACGAAGUAUACCGAAGU 4798-3p 203 hsa-miR- UCUGAGGUGGAACAGCAGC 7162-3p 204 hsa-miR- GGAUUCCUGGAAAUACUGUUCU 145-3p 205 ENSG00000202268 AAUUGUAUACUUUCAGGGAUCAUUCCAUAGGUUGUUACUAGAGAAGUUUUU UUAGAUGUGUAGAACACUGGAAACCACGAGGAGGAGGCGCAGCAUUCUCUCU UGACCAUGAAGCCGGCUCUUGGUGUUGUUUCAUUGCAACUGUCAUUUGCCAU UGAUGAUCGUUCUCUUCCUCUGGGAGAGUAAGAGGGAGAGGACACAGUUUGA GUGG 206 ENSG00000238455 AUCCAUUUGUAGUUCAGAAACAUGACUAUUGUCUUUUCAAGCUUAUAUGAGA UCUGGCUCCCUCAAUCCUUGCUAUGAUAUCAGUACAUUACCUGUCUGAUG 207 MIR4420 CUCUUGGUAUGAACAUCUGUGUGUUCAUGUCUCUCUGUGCACAGGGGACGAG AGUCACUGAUGUCUGUAGCUGAGAC 208 SNORD115- GGGUCAAUGAUGAGAACCUUAUAUUGUUCUGAAGAGAGGUGAUGACUUAAA 6 AAUCAUGCUCAAUAGGAUUACGCUGAGGCCC 209 hsa-miR- AUAGUGGUUGUGAAUUUACCUU 4460 210 hsa-miR- ACUCCAUUUGUUUUGAUGAUGGA 136-5p 211 hsa-miR- AUACACAUACACGCAACACACAU 466 212 ENSG00000201847 ACCACCAGUGAUGAGUUGAAUACUGCCCCAGUCUGAUCAACAUGCGUGAAAG AUAUUUUCUGAGCUGUG 213 hsa-miR- UCGCGGUUUGUGCCAGAUGACG 579-5p 214 hsa-miR- UGUCUUACUCCCUCAGGCACAU 550a-3p 215 MIR3167 GGCUGUGGAGGCACCAGUAUUUCUGAAAUUCUUUUUUCUGAAAUUCUUCAGG AAGGAUUUCAGAAAUACUGGUGUCCCGACAGCC 216 MIR676 GCAUGACUCUUCAACCUCAGGACUUGCAGAAUUAAUGGAAUGCUGUCCUAAG GUUGUUGAGUUGUGC 217 hsa-miR- AUUUGUGCUUGGCUCUGUCAC 2113 218 ENSG00000212611 AUACAUGAUGACUUACAUGGACUCUCAUUCAGCUAAUGACUUGCUGCUGAAA CAUGGAAAUCUGAUUUUU 219 hsa-miR- UUCCUGGGCUUCUCCUCUGUAG 6783-3p 220 hsa-miR- CAGGAUGUGGUCAAGUGUUGUU 1265 221 MIR4493 CCAGAGAUGGGAAGGCCUUCCGGUGAUUAUCACAGCCAUGCCUUUACCUCCA GAAGGCCUUUCCAUCUCUGUC 222 hsa-miR- GAGACAGGUUCAUGCUGCUA 4524b-3p 223 ENSG00000238475 AUCCUUUUGUAGUUCAUAAGCUUGAUGUUUGAGUUUUCACACUUACGUGUGA AAUGUGCCUCCCUUAAACCUUGUUACUACGUCAGCACAUUACCCAUGAGACA 224 SNORD115- GUGUUGAUGAUGAGAACCUUAUAUUAUCCUGAAGAGAGGUGAUGACUUAAA 1 AAUCAUGCUCAAUAGGAUUACGCUGAGGCCC 225 MIR8085 CUAGGAGGGAUGGGAGAGAGGACUGUGAGGCAUGGGUGGCUCUAUGGUCAC GCCCAUCUUCCUAC 226 SNORD31 CUCACCAGUGAUGAGUUGAAUACCGCCCCAGUCUGAUCAAUGUGUGACUGAA AGGUAUUUUCUGAGCUGUG 227 SNORA73A UCCAACGUGGAUACACCCGGGAGGUCACUCUCCCCGGGCUCUGUCCAAGUGG CGUAGGGGAGCAUAGGGCUCUGCCCCAUGAUGUACAAGUCCCUUUCCACAAC GUUGGAAAUAAAGCUGGGCCUCGUGUCUGCGCCUGCAUAUUCCUACAGCUUC CCAGAGUCCUGUCGACAAUUACUGGGGAGACAAACCAUGCAGGAAACAGCC 228 MIR4435-1 AGGCAGCAAAUGGCCAGAGCUCACACAGAGGGAUGAGUGCACUUCACCUGCA GUGUGACUCAGCAGGCCAACAGAUGCUA 229 MIR29B2 CUUCUGGAAGCUGGUUUCACAUGGUGGCUUAGAUUUUUCCAUCUUUGUAUCU AGCACCAUUUGAAAUCAGUGUUUUAGGAG 230 MIR4777 UAGAAUAUUUCGGCAUUCUAGAUGAGAGAUAUAUAUAUACCUCAUAUGUAU AUGGUAUACCUCAUCUAGAAUGCUGUAAUAUUCUA 231 MIR635 CAGAGAGGAGCUGCCACUUGGGCACUGAAACAAUGUCCAUUAGGCUUUGUUA UGGAAACUUCUCCUGAUCAUUGUUUUGUGUCCAUUGAGCUUCCAAU 232 MIR508 CCACCUUCAGCUGAGUGUAGUGCCCUACUCCAGAGGGCGUCACUCAUGUAAA CUAAAACAUGAUUGUAGCCUUUUGGAGUAGAGUAAUACACAUCACGUAACGC AUAUUUGGUGG 233 ENSG00000201042 CCCUCCUACAAAGGCAUGUCUAUAAUUCCUUGUCUUUGGACAUGUAAGAAUU GGAGGGACAGAAAUGUGGACUUGGAGAAAUCUGGGGCCAGCUUUCUCAUCAC AGGCUCAACAUCAACCAUGCCACAUAG 234 hsa-miR- UCCUAAAUCUGAAAGUCCAAAA 5009-3p 235 MIR24-2 CUCUGCCUCCCGUGCCUACUGAGCUGAAACACAGUUGGUUUGUGUACACUGG CUCAGUUCAGCAGGAACAGGG 236 hsa-miR- UUAGGGCCCUGGCUCCAUCUCC 1296-5p 237 ENSG00000238914 AUCCUUUUGUGGUUCAUUAGCUUGAUAUUGGGUUUUCACACUAUUCUAUGAG AUGUGCCUCCCUCAAAACUUGUUACAACAUUGACACAUUACCCUUCUGAUG 238 hsa-miR- ACCUGAGGUUGUGCAUUUCUAA 544b 239 ENSG00000221398 GGGGUGCACUCAGGGCAGGGGGCUUGAAGAACGGCUCCUCUGUUUACGACAC ACUCAACAGGGGUGUGAGGUCACAGUGAUGAGAGGCCCAAACUUGUGGCCUC CCCGUGAACAAAUGCCCUACACAU 240 ENSG00000253072 UGAUGACACUCUCUGGAAUUGUUACACUACCAUAAUUAAAGUGCACUGAAUC UUUUUCUAUCUGAUGGGGGGGGAAUAAAAUAAUU 241 hsa-miR- UGAGCCUCUCCUUCCCUCCAG 6823-3p 242 hsa-miR- CCUGCGUGUUUUCUGUCCAA 4520a-5p 243 MIR6074 UACCAACCCCAUGGAAUUUUUACUCACCUUCAGUCAACUGAUUUGCUCUUUG GUGGAGAUAUUCAGAGGCUAGGUGGAGAUAGAGGUAGCCUUGAGGGUGGGU GUGG 244 hsa-miR- AGCGGUGCUCCUGCGGGCCGA 4746-3p 245 hsa-miR- CAGCGGAGCCUGGAGAGAAGG 7846-3p 246 hsa-miR- CUUGGAUUUUCCUGGGCCUCAG 5004-3p 247 ENSG00000238595 AUCCUCUUGCAGUUCAUAAGCAUGAUGAUUGGGUUUUCACACUCCUGUGUGA AAUGUACCUUCCUCAAACCUUUUUAUAACAUCAGCACAUUACCGAACAUGAA A 248 MIR4638 GACUCGGCUGCGGUGGACAAGUCCGGCUCCAGAACCUGGACACCGCUCAGCC GGCCGCGGCAGGGGUC 249 MIR1296 ACCUACCUAACUGGGUUAGGGCCCUGGCUCCAUCUCCUUUAGGAAAACCUUC UGUGGGGAGUGGGGCUUCGACCCUAACCCAGGUGGGCUGU 250 MIR1255B1 UACGGAUGAGCAAAGAAAGUGGUUUCUUAAAAUGGAAUCUACUCUUUGUGA AGAUGCUGUGAA 251 ENSG00000200051 GGUCAAUGAUGAGCUGACAUGUAUUCUGAAUCUAAAGUUGAUUAUUAGUAC UUUAGUUCUAGAAUUACUGAGACAUG 252 hsa-miR- CUGGAGUCUAGGAUUCCA 4309 253 hsa-miR- CUUCUUGUGCUCUAGGAUUGU 578 254 hsa-miR- UUAGUGAAGGCUAUUUUAAUU 3606-5p 255 hsa-miR- UCGGCUCUCUCCCUCACCCUAG 6741-3p 256 hsa-miR- UUUGGUCCCCUUCAACCAGCUA 133b 257 ENSG00000251858 UUAUUUAUCUGACAGACCUGCAGCAGUUACUGGAUGCUGUUAAAGUUUCCAC UACAGAUGCAAGAAAAGUGUCCCACACUUUCUGUCUGUCUGAUUGUGACAGC UAAGAUUAAAUCAGGUAGGACAGUA 258 ENSG00000252256 UAUUCCAAUGAUGCAAGUGUGUCGUGAACUAAGGAUUAUGAUUAAUCCAGU UUUGUAGCUAGAGGGAUUUU 259 hsa-miR- AGUGGCAAAGUCUUUCCAUAU 3688-5p 260 MIR3199-2 GUGACUCCCAGGGACUGCCUUAGGAGAAAGUUUCUGGAAUGUCAGAACUUCC AGAAACUUUCUCCUAAGGCAGUCCCUGGAGUCAC 261 MIR548W GGUUGGUGCAAAAGUAACUGCGGUUUUUGCCUUUCAACAUAAUGGCAAAACC CACAAUUACUUUUGCACCAAUC 262 SNORD60 AGUCUGUGAUGAAUUGCUUUGACUUCUGACACCUCGUAUGAAAACUGCACGU GCAGUCUGAUUAUUUAGCAAGACUGAGGCUU 263 MIR433 CCGGGGAGAAGUACGGUGAGCCUGUCAUUAUUCAGAGAGGCUAGAUCCUCUG UGUUGAGAAGGAUCAUGAUGGGCUCCUCGGUGUUCUCCAGG 264 MIR4447 GUUCUAGAGCAUGGUUUCUCAUCAUUUGCACUACUGAUACUUGGGGUCAGAU AAUUGUUUGUGGUGGGGGCUGUUGUUUGCAUUGUAGGAU 265 hsa-miR- UUGGGGAAACGGCCGCUGAGUG 2110 266 ENSG00000207274 CUGCAGCCAAUUAAGCCAACUGAGUUCCUUUCCUUGUGGGGGCCCAGUGUGC AAUGGCUGCACACAGCAGCUUCCUUGGUAGUGUACACAGCCUGUUGGUUGUA UGGGUUGCUCUGAGGGACCUUGGAGACAGGC 267 hsa-miR- UGCUUCCUUUCUCAGCUG 7162-5p 268 hsa-miR- UGGUUCUCUUGUGGCUCAAGCGU 597-3p 269 MIR147B UAUAAAUCUAGUGGAAACAUUUCUGCACAAACUAGAUUCUGGACACCAGUGU GCGGAAAUGCUUCUGCUACAUUUUUAGG 270 MIR4264 AAAGCUGGAUACUCAGUCAUGGUCAUUGUAACAUGAUAGUGACAGGUACUGG GUAAGACUGCAUAG 271 hsa-miR- AAGUGCUGUCAUAGCUGAGGUC 512-3p 272 ENSG00000201329 AAUGCUAUACUUUCAUGGGUCAUUUCUAUAGUUUGUUAUUAGAGAAGUUUC UCUGAAUGUGUUGAGCACCAGAAACCACGAGGAGAUGCAGCAUUCUCUCCUG AACGGGAAGCCAGCUUUUGGCAUUGCUUUGAUGCAACUACCAUUUGCCAUUG AUGGCAAUGCAUCGCUUCCUCUAGGAGUGUAAGAGGGAGUGGAUGCAGUCAG AGUGG 273 MIR548AH AGGUUGGUGCAAAAGUGAUUGCAGUGUUUGCCAAUAAAAGUAAUGACAAAA ACUGCAGUUACUUUUGCACCAGCCC 274 hsa-miR- AGGAGGUGGUACUAGGGGCCAGC 6851-5p 275 SNORD38B UCUCAGUGAUGAAAACUUUGUCCAGUUCUGCUACUGACAGUAAGUGAAGAUA AAGUGUGUCUGAGGAGA 276 MIR101- ACUGUCCUUUUUCGGUUAUCAUGGUACCGAUGCUGUAUAUCUGAAAGGUACA 2 GUACUGUGAUAACUGAAGAAUGGUGGU 277 MIR6866 CCAUUUUAGAGGCUGGAAUAGAGAUUCUUGAGGCUUGGAAGAGUAAGGAUC CCUUUAUCUGUCCUCUAG 278 hsa-miR- UCCUGCGUAGGAUCUGAGGAGU 3193 279 hsa-miR- UCGAGGACUGGUGGAAGGGCCUU 3131 280 ENSG00000223027 CCUCAUUUUCUUGGCAGGAACUUGUAGUCCCACUCCCUGUUAUGUACAGAGG CAAAGGGAAGAGCUCUGGCCCCCUUGGCAUGUCUUUGGAGCCAUGCAGCUUC CCGUCUGCCAGUUCUAUCCUCAAGCACCAGGACACCA

The Sentinel™ CS Test to Identify Low Grade (Indolent) Prostate Cancer

The Sentinel™ Clinically Significant (CS) Test uses a similar classification algorithm to produce a Sentinel™ CS Score that is compared to a calculated cutoff. The cutoff controls sensitivity for a future patient at a desired level (95%), to distinguish between Clinically Significant cancer (GG2-GG5) (if the Sentinel™ CS Score is greater than or equal to the cutoff) and Clinically Insignificant cancer (GG1) (if the Sentinel™ CS Score is less than the cutoff). The algorithm is trained using only the subset of patients known to have prostate cancer in the dataset used to train the Sentinel™ PCa Test. Similarly, using the Classification Algorithm, 280 sncRNAs were used as a basis to define an expression signature for the Sentinel™ CS Test. The Sentinel™ CS Test utilizes 280 sncRNA (identified by the Discovery CS Test, of which 135 unique sncRNAs: 130 miRNA and 66 snoRNAs are highly informative.

TABLE 2 SEQ ID NOs: 281-560 Used in the CS Test Analysis SEQ ID Sequence NO: Name Sequence 281 MIR539 AUACUUGAGGAGAAAUUAUCCUUGGUGUGUUCGCUUUAUUUAUGAUG AAUCAUACAAGGACAAUUUCUUUUUGAGUAU 282 hsa-miR- CCUGCAGCGACUUGAUGGCUUCC 1184 283 hsa-miR- AAAGUAGCUGUACCAUUUGC 562 284 ENSG00000207215 AAGACUAUACUCUCAGGAAUCAUUUCUAUAGUUUUUUACUAGAGAAA UUUCUCUGAACGUGUAGAGCACUGGAAACCGUGAGGAGAAGCUGCCU UCUCUUCUGAGCAUGAAGUGAGCUCUCAGUGUUGCUUCUCUGCAACUG CCAUUUGCCAUUGAUGAUCGUUCUUCUCUUCCUCUGGGAGAGUAAAA GGGUACAGGAUGCAGUCUGAG 285 SNORD116- UGGAUCGAUGAUGACUUUAAAAUGGAUCUCAUCGGAAUCUGAACAAA 25 AUGAGUGACCAAAUCACUUCUGUGCCACUUCUGUGAGCUGAGGUCCA 286 hsa-miR- GGACCCACCCGGCCGGGAAUA 1180-5p 287 hsa-miR- UUGAAUUCUUGGCCUUAAGUGAU 4452 288 hsa-miR- UGAGAUGACACUGUAGCU 4770 289 hsa-miR- UGUAAACAUCCCCGACUGGAAG 30d-5p 290 MIR518C GCGAGAAGAUCUCAUGCUGUGACUCUCUGGAGGGAAGCACUUUCUGU UGUCUGAAAGAAAACAAAGCGCUUCUCUUUAGAGUGUUACGGUUUGA GAAAAGC 291 MIR4315- UGGGCUUUGCCCGCUUUCUGAGCUGGACCCUCUCUCUACCUCUGGUGC 1 AGAACUACAGCGGAAGGAAUCUCUG 292 hsa-miR- GAGCUUAUUCAUAAAAGUGCAG 590-5p 293 hsa-miR- UGUAAACAUCCUACACUCUCAGC 30c-5p 294 SNORD114- UGGAUCAAUCAUGACUACUGGUAUUGGAUGGGUCUUCGUCAGUGAAU 30 GCCUAUCUGGAACUCUGAGGUCCA 295 MIR4450 UGUCUGGGGAUUUGGAGAAGUGGUGAGCGCAGGUCUUUGGCACCAUC UCCCCUGGUCCCUUGGCU 296 hsa-miR- UCUCUGAGUACCAUAUGCCUUGU 3921 297 hsa-miR- UUGAGAAUGAUGAAUCAUUAGG 580-3p 298 hsa-miR- GUUGGGACAAGAGGACGGUCUU 3122 299 hsa-miR- CCUGCGUGUUUUCUGUCCAA 4520a-5p 300 hsa-miR- AACACACCUGGUUAACCUCUUU 329-3p 301 hsa-miR- UGUGACAAUAGAGAUGAACAUG 4504 302 MIR4642 CACAACUGCAUGGCAUCGUCCCCUGGUGGCUGUGGCCUAGGGCAAGCC ACAAAGCCACUCAGUGAUGAUGCCAGCAGUUGUG 303 hsa-miR- UGCCCUGUGGACUCAGUUCUGG 146b-3p 304 ENSG00000206731 UUCUAAAGUGUUGAGUUCAGUCCAGGGUGGAUCCCCUGCUCUGUUAA UUGAACUGGAACAUUUAAACUGGCUAGGCAAAAUGCCUACAUAGAAA GCAUUACUCUUUAUUCAUCCCCAGCCUACAAAA 305 hsa-miR- GGCAGGAGGGCUGUGCCAGGUUG 4722-5p 306 hsa-miR- UUAAUGCUAAUCGUGAUAGGGGU 155-5p 307 hsa-miR- UCCCUUCUUCCUGGGCCCUCA 5088-3p 308 ENSG00000238334 ACUCUUUUGUAGUUCAUAAGUGUGAUGAUUUGGUGUUCAUGUGAACA UGUGAAACGUGCCACCCUCAAACCUUGUUACAAUGUGGGCAUAUUACC CAUCUGACA 309 MIR526B UCAGGCUGUGACCCUCUUGAGGGAAGCACUUUCUGUUGUCUGAAAGA AGAGAAAGUGCUUCCUUUUAGAGGCUUACUGUCUGA 310 hsa-miR- CCCUGUGCCCGGCCCACUUCUG 1914-5p 311 hsa-miR- UAGCAGCACAGAAAUAUUGGC 195-5p 312 MIR6742 GAGGGAGUGGGGUGGGACCCAGCUGUUGGCCAUGGCGACAACACCUG GGUUGUCCCCUCUAG 313 hsa-miR- CCCAAUACACGGUCGACCUCUU 323b-3p 314 hsa-miR- UAACGCAUAAUAUGGACAUGU 3912-3p 315 MIR203A GUGUUGGGGACUCGCGCGCUGGGUCCAGUGGUUCUUAACAGUUCAAC AGUUCUGUAGCGCAAUUGUGAAAUGUUUAGGACCACUAGACCCGGCG GGCGCGGCGACAGCGA 316 hsa-miR- CUCCUGGGGCCCGCACUCUCGC 1343-3p 317 hsa-miR- UGGGGAGGUGUGGAGUCAGCAU 6825-5p 318 hsa-miR- CAUCCCUUGCAUGGUGGAGGG 188-5p 319 MIR378H ACAGGAACACUGGACUUGGUGUCAGAUGGGAUGAGCCCUGGCUCUGU UUCCUAGCAGCAAUCUGAUCUUGAGCUAGUCACUGG 320 ENSG00000212626 UGCCUCUGACCUGGGUAGAGUGGCAUCUGGCUGUGACAUUCAUCUCAU AUCAGCCAGGGACAAAGCAACCCCUUGUUUAUUUCAGCUUGGCCUUUU GUCUGUGCCCAUGCCUGGUUCAUGCCUUGGACACACUA 321 ENSG00000212580 UUCUUAUUGAGCUCCUUUCUGUCUACUGGUGGCAGUCUAUGGAUUUG CACAAGACAAAACUAGCGCUAUUUUACCUUCUGUCUUUAAACAGGUA UAUUUGACUGUUUUGUGAGAAAUUC 322 MIR599 AAAGACAUGCUGUCCACAGUGUGUUUGAUAAGCUGACAUGGGACAGG GAUUCUUUUCACUGUUGUGUCAGUUUAUCAAACCCAUACUUGGAUGA C 323 hsa-miR- UCCCACUACUUCACUUGUGA 4301 324 hsa-miR- CCGCUUUCUGAGCUGGAC 4315 325 hsa-miR- CUGAUAAGAACAGAGGCCCAGAU 3161 326 MIR4502 AGCCUUUAGCAAGUUGUAAUCUUUUUGCUGAUGGAGGGUCUUGCCUC CAUGGGGAUGGCUGAUGAUGAUGGUGCUGAAGGC 327 SNORD23 UGCCCAGUGAUGACACCAUCCUUGCUCCCCGUGCCCCCCAGGGGCUAU GGGCGACACCAUGGCUGCCCCUGGGCUGGGCCAGUGGGGCCAAUGCCC AGGGGCUGAGGGCA 328 MIR320E GCCUUCUCUUCCCAGUUCUUCCUGGAGUCGGGGAAAAGCUGGGUUGAG AAGGU 329 ENSG00000238545 ACCCUUCUUAGUUCAUAAGCAUGAUGAUUGGGUUUUCAUACUCAUGU GUGAGAUGUGUCUCUCUCAAACUUUGUGAAAAGUCAGCACAUGACCC AUCUGAUG 330 MIR1294 CACCUAAUGUGUGCCAAGAUCUGUUCAUUUAUGAUCUCACCGAGUCCU GUGAGGUUGGCAUUGUUGUCUGGCAUUGUCUGAUAUACAACAGUGCC AACCUCACAGGACUCAGUGAGGUGAAACUGAGGAUUAGGAAGGUGUA 331 MIR519C UCUCAGCCUGUGACCCUCUAGAGGGAAGCGCUUUCUGUUGUCUGAAAG AAAAGAAAGUGCAUCUUUUUAGAGGAUUACAGUUUGAGA 332 ENSG00000212391 UGUUCUGACAUGGGAAGAGUAGCUUCUGGUUGGUGGAGCCCAUCUCA CAUUAGCCAGAGACAAAGCAACACCUUGUUUAUCCCGGCUUGGCUUUU GGCCUGUGUCCAUGACUGGUCCAUACCUUGGACACAUGG 333 ENSG00000238436 AUCCUUUUGUGGGUCAUAUGCAUGAUGAUUGGGUGUUCACGCACAAG UAUGAGAUGUGCCACCUUUUUACAGCAUUGGCACAUUACCUGUCUGA UG 334 hsa-miR- GCGACUCUGAAAACUAGAAGGU 4431 335 ENSG00000238948 AUCUUUUUGUGGUUCAUAAGCAUGAUGAUUGGGUUUUCAUACCAUUG UGUAAGAUGUGCCUUUCUCAGACCUUGCCAAAACACUGGCACAUUACC UGUCUGAUA 336 MIR518E UCUCAGGCUGUGACCCUCUAGAGGGAAGCGCUUUCUGUUGGCUAAAA GAAAAGAAAGCGCUUCCCUUCAGAGUGUUAACGCUUUGAGA 337 ENSG00000201500 UGGACCAAUGAUGAUGACUGGUGGUGUAUGAGUUAAAGGUGAUGAAU AGUAAGUGUCUUUGUUAGUGGCAAGUUCAGA 338 MIR3915 CAAGUUGGCACUGUAGAAUAUUGAGGAAAAGAUGGUCUUAUUGCAAA GAUUUUCAAUAAGACCAUCCUUUCCUCAAUAUUCUGUGGUGUCAUCU UUG 339 hsa-miR- CUGGGGUUCUGAGACAGACAGU 3170 340 hsa-miR- AACCCGUAGAUCCGAUCUUGUG 99a-5p 341 hsa-miR- UAUGUGACCUCGGAUGAAUCA 4501 342 hsa-miR- GAAGGCGCUUCCCUUUGGAGU 524-3p 343 hsa-miR- CAAUGUUUCCACAGUGCAUCAC 33a-3p 344 MIR4669 GCCUCCCUUCACUUCCUGGCCAUCCAGGCAUCUGUGUCUGUGUCCGGG AAGUGGAGGAGGGC 345 MIR1197 ACUUCCUGGUAUUUGAAGAUGCGGUUGACCAUGGUGUGUACGCUUUA UUUGUGACGUAGGACACAUGGUCUACUUCUUCUCAAUAUCA 346 ENSG00000252740 UGGAUCAAUGAUGACAAAGUAUCAUGAAUGAGGGAUUGUGAAUAAUC UAUUUUUAUGAACCUGUGGUCAAAU 347 ENSG00000238368 AUCCUUUUGUGGUUCAUAAGCAUGAUGAUUAGAUUUUCAUGCUAUUG GGUGAGAUAUGCCUUCCUCAGACUUUGUUACAGCAUAGGCACAUUAC AACCUGUCUGAUA 348 MIR4673 GUCCAGGCAGGAGCCGGACUGGACCUCAGGGAAGAGGCUGACCCGGCC CCUCUUGCGGC 349 MIR4720 AAGCCUGGCAUAUUUGGUAUAACUUAAGCACCAGGUAAAAUCUGGUG CUUAAGUUGUACCAAGUAUAGCCAAGUUU 350 ENSG00000212321 AAGGCUAUACUUUCAGGGAUCAUUUUUAUAGCUUAUUACUAGAGGAG UUAAUGUGAAUGUGUAGAGCACCAGAAACCUUGAGGAGGAGGUGCAG CGUUCUCUCCUGAGCAUAAAGCUGGCCCGCAGUAUUGUGUUGCCUCAC UGCAACUGCCAUUUGCCAUUGAUGAUGAUUGUUCUCUUUCACUGAGA GAGUAAGAGGACAGGAUGCAUUCUAACUGG 351 hsa-miR- AGGUUGUCUGUGAUGAGUUCG 410-5p 352 hsa-miR- UUCAUGAACUGGGUCUAGCUUGG 7154-5p 353 hsa-miR- UAGCCCCCAGGCUUCACUUGGCG 3943 354 ENSG00000238530 AUCCUUUUGUAGUUCAUAAGGAUGAUGACUGAGUGUUCACACUCGUG UGUGAGAUGUGCCACCCUCAGACCUUGAAAUCUUCAGUCACUCUUGUU AAGUGAAC 355 hsa-miR- GGCUCCUUGGUCUAGGGGUA 4448 356 hsa-miR- GAAGGACCUGCACCUUCG 6816-3p 357 hsa-miR- CUUUCAGUCGGAUGUUUACAGC 30e-3p 358 MIR525 CUCAAGCUGUGACUCUCCAGAGGGAUGCACUUUCUCUUAUGUGAAAA AAAAGAAGGCGCUUCCCUUUAGAGCGUUACGGUUUGGG 359 hsa-miR- GUGCAAAAGUCAUCACGGUU 548aw 360 hsa-miR- ACUAAAGGAUAUAGAAGGUUUU 4509 361 hsa-miR- AGGCCCUGUCCUCUGCCCCAG 6775-3p 362 hsa-miR- UAGCAGCGGGAACAGUUCUGCAG 503-5p 363 SNORD58B CUGCGAUGAUGGCAUUUCUUAGGACACCUUUGGAUUAAUAAUGAAAA CAACUACUCUCUGAGCAGC 364 hsa-miR- UGUAGGAACAGUUGAAUUUUGGCU 8065 365 ENSG00000200545 AAGACUGUGCUUUCAGGGAUCAUGUCUAUAGUUUGCCACUAGAGAAG UUUUUUUGAACAUGUAGUAGGGCACCAGAAGCACAAGGAAGAGGCAC AGCCUUCUCUCCUGAGCAUGAAUCUGGCUCUUGGUCUUGCUUUGUUCC AGCUACCAUUUGCCAUUGAUUAUGUCCUUCUCUUCCUUCCAGAAAGUA AAAGGGAGAGAAUGCAGUCUGAGUGG 366 ENSG00000238801 AUCCUUUUGUAGUUCAUAAGCUGAUGGUUGGGUUUUCACGCUCAUGU GUGAGAUGUGUUCCUUCAUAUCUGUCAACACACUACCGGGCUGUUG 367 hsa-miR- UUGAAGGGACAAGUCAGAUAUGCC 6864-5p 368 hsa-miR- AUGGUCACCUCCGGGACU 5587-5p 369 hsa-miR- GGAGGAACCUUGGAGCUUCGGC 3928-3p 370 hsa-miR- AGCUGUAAUUAGUCAGUUUUCU 1537-5p 371 MIR182 GAGCUGCUUGCCUCCCCCCGUUUUUGGCAAUGGUAGAACUCACACUGG UGAGGUAACAGGAUCCGGUGGUUCUAGACUUGCCAACUAUGGGGCGA GGACUCAGCCGGCAC 372 ENSG00000253047 CAAAGUUCUGGAAUUACAGGUGUGAGCCACCGUGCCCAGCAUUUAAA AUUUUAAUAUGUACUUUUUGCAACCCAGAACUCAUUGUUCAGUAUGA GUUUUGAUACAUAUAAGAAGGGAUAUU 373 hsa-miR- AAAAACUGAGACUACUUUUGCA 548e-3p 374 hsa-miR- UGAUGGAGCUGGGAAUACUCUG 8082 375 hsa-miR- ACUCGGCUGCGGUGGACAAGU 4638-5p 376 MIR8089 AAGGAGCACUCACUCCAAUUUCCCUGGACUGGGGGCAGGCUGCCACCU CCUGGGGACAGGGGAUUGGGGCAGGAUGUUCCAG 377 ENSG00000238339 AUCCUUUUGUAGUUUAUAAGCAUGAUGAUGGGUGCUCACACUCAUCU GAGAUGUGUCUCCCUCUAAGCCUUGUAACAACAUCAGCACGUUACCCU UCUGAUG 378 hsa-miR- GUGGUCUCUUGGCCCCCAG 6848-3p 379 ENSG00000239144 AGUUUAAAAAAUUUGUUAAGCAUGAUGAUUAACUUUUCACAAUAAUG CAAUAAUGUGUGAGCUAUGCCUCUCUCAAACCUUAUUAUGAUGUUGG CCCAUUACCCAUCUGAUG 380 hsa-miR- GGGUUUGUAGCUUUGCUGGCAUG 5087 381 MIR5583- AAACUAAUAUACCCAUAUUCUGGCUAGGUGAUCAUCAGAAUAUGGGU 1 AUAUUAGUUUGG 382 MIR8082 GCCUGUGUGAUGAUGGAGCUGGGAAUACUCUGGGGAGAGAGUCCUCU UUUCAGCUGUAUUUUGCUUCCUUCCCACACAGAC 383 hsa-miR- UUGCUAAGUAGGCUGAGAUUGA 4639-5p 384 hsa-miR- AAAAGUAAUUGUGGUUUUUGCC 548d-5p 385 ENSG00000252576 UUAAAUGAUGAUUUUUUUAAACAAAUGUAUCAGAGUGCAUUCAUUCA AAGGAAUGUUGUCUUCUGGCAAGUAAAAAUCCAUGCAG 386 SNORD19B UUUUGGUUGAAAUAUGAUGAGUGUACAAAAUCUUGAUUUAAGUGAAU GAAAAAUUACAAGAUCCAACUCUGAUUUCAGCCAGAG 387 ENSG00000238666 AUCCUUUUGUAGUUCAUGAGUGUAAUGAUUGAGUGUUCAUGCACAUG UGUGAGAUAUGCCACCCUUGAACCUUGUUACACCGUUGUCACAUUGCC CGUUUGACA 388 ENSG00000199363 AGGCAGGAUCUAGUUACAUUGUAGCUGUGAAGUGCUGCAUUGUCUUU GCCCCCUGCUCAAAAUAAAACUGUUACCUUUCAAGCCCUGUCUGCCAU GGUGCUGUAGCAGCAGGGAUGUUUGGUCUCAUACAU 389 hsa-miR- CUAUACAACUUACUACUUUCCC 98-3p 390 hsa-miR- GUAGCACCUUGCAGGAUAAGGU 5682 391 MIR7114 UCCGCUCUGUGGAGUGGGGUGCCUGUCCCCUGCCACUGGGUGACCCAC CCCUCUCCACCAG 392 SNORA16A UUGGCCCUUAUCGAAGCUGCAGCUGCUUCCGCAUAGCUGCUGUGGUCA AAAAGGAGCCCAGAGUGACAGUUUUCCUUGACGGUCGCCGUUCUGUU UGUUGUAACUGAUCUGCAACAUUUUGGGAAAAUACAGUU 393 hsa-miR- UGUGACUGGUUGACCAGAGGGG 134-5p 394 hsa-miR- CAAGGGACCAAGCAUUCAUUAU 4475 395 hsa-miR- UCCCAAGGGUGAGAUGCUGCCA 6814-5p 396 MIR323B UGGUACUCGGAGGGAGGUUGUCCGUGGUGAGUUCGCAUUAUUUAAUG AUGCCCAAUACACGGUCGACCUCUUUUCGGUAUCA 397 MIR4266 CCACUGCUGGCCGGGGCCCCUACUCAAGGCUAGGAGGCCUUGGCCAAG GACAGUC 398 hsa-miR- CUUCUGAUCAAGAUUUGUGGUG 4762-3p 399 ENSG00000252525 AAGCUUAUGAUGACGUAAGUGUGACGACAUUGGGUUUUCACGUUCAU GUGUGAGAUGUGCCUCCCUCAAGCCUUAUUACAAUGCCAGUACAUUUU UUUUCCACAUCUGAUG 400 MIR4742 UCAGGCAAAGGGAUAUUUACAGAUACUUUUUAAAAUUUGUUUGAGUU GAGGCAGAUUAAAUAUCUGUAUUCUCCUUUGCCUGCAG 401 hsa-miR- CGGCGCCCGUGUCUCCUCCAG 6789-3p 402 hsa-miR- AUGAAGCCUUCUCUGCCUUACG 7843-3p 403 hsa-miR- AAAAGUAAUUGCAGUUUUUGC 548ar-5p 404 ENSG00000238732 AUCUUUUCGUAGUUCAUAAGUGUGAUGACUGGGUAUUCAUGCAUGUA UGUGGGAUAUGCCACCCUUGACCCUUGUUACAACAUUAGCACAUUAAC CAUCUGACA 405 MIR8052 UGGAGGGCUGCGGGACUGUAGAGGGCAUGAGCUCAGGAGCUCAGGCC AGCUCAUGGUGCAAGGCCUCUG 406 SNORD114- UGGACCAGUGAUGGUGACUGGUGGUGUGUGAGUCAUGCACAGUGAAU 11 AUCAUGUGUCUGGAACUCUGAGGUCCA 407 hsa-miR- AUAGCAGCAUGAACCUGUCUCA 4524a-5p 408 hsa-miR- AAAAUUUCUUUCACUACUUAG 3606-3p 409 SNORD54 UGGCGAUGAGGAGGUACCUAUUGUGUUGAGUAACGGUGAUAAUUUUA UACGCUAUUCUGAGCC 410 SNORD32A GUCAGUGAUGAGCAACAUUCACCAUCUUUCGUUUGAGUCUCACGGCCA UGAGAUCAACCCCAUGCACCGCUCUGAGA 411 MIR3529 GGCACCAUUAGGUAGACUGGGAUUUGUUGUUGAGCGCAGUAAGACAA CAACAAAAUCACUAGUCUUCCAGAUGGGGCC 412 MIR520G UCCCAUGCUGUGACCCUCUAGAGGAAGCACUUUCUGUUUGUUGUCUGA GAAAAAACAAAGUGCUUCCCUUUAGAGUGUUACCGUUUGGGA 413 MIR4330 AAUUGUCAGCAGGCAAUUAUCUGAGGAUGCAGGAGAGGAAGGGGGCU UCUUUUUGACGCCUACUUCAUCAGCUGCUCCUCAGAUCAGAGCCUUGC AGGUCAGGCC 414 SNORD97 UUGCCCGAUGAUUAUAAAAAGACGCGUUAUUAAGAGGACUUUAUGCU GGAGUUCUUGACGUUUUUCUCUCUUUUCUAUACUUCUUUUUCUUUCU UUGAAUGUCCAGCGUCCUGUGAGCGAAGAUUAUGAGAUAUGAGGGCA A 415 MIR640 GUGACCCUGGGCAAGUUCCUGAAGAUCAGACACAUCAGAUCCCUUAUC UGUAAAAUGGGCAUGAUCCAGGAACCUGCCUCUACGGUUGCCUUGGG G 416 hsa-miR- UAGCACCAUCUGAAAUCGGUUA 29a-3p 417 MIR5739 GGUUGGCUAUAACUAUCAUUUCCAAGGUUGUGCUUUUAGGAAAUGUU GGCUGUCCUGCGGAGAGAGAAUGGGGAGCCAGG 418 MIR24-2 CUCUGCCUCCCGUGCCUACUGAGCUGAAACACAGUUGGUUUGUGUACA CUGGCUCAGUUCAGCAGGAACAGGG 419 hsa-miR- UCGUGUCCCUCUUGUCCACAG 6774-3p 420 MIR1224 GUGAGGACUCGGGAGGUGGAGGGUGGUGCCGCCGGGGCCGGGCGCUG UUUCAGCUCGCUUCUCCCCCCACCUCCUCUCUCCUCAG 421 hsa-miR- UCUCAGCUGCUGCCCUCUCCAG 6787-3p 422 hsa-miR- ACUUCACCUGGUCCACUAGCCGU 412-3p 423 hsa-miR- CAGCCUGACAGGAACAG 4293 424 MIR577 UGGGGGAGUGAAGAGUAGAUAAAAUAUUGGUACCUGAUGAAUCUGAG GCCAGGUUUCAAUACUUUAUCUGCUCUUCAUUUCCCCAUAUCUACUUA C 425 ENSG00000238294 AUUCUUUAGUAGUUCAUAAUGCUAUGAUUGGGUUUCCAUGUGCACAU GUAAGAUGUGCCUCUCUCAAGCCUUGUUGUGACAUCAGCACAUUACCC AUCUGAUG 426 ENSG00000252458 CUCAUACCUAAACCCAAGAAUCACUUUCUUAUAGUGAUGAUUUAAAC AGAUGCAAACAGCGAGCACAUCUUGUCACCUUUGCGGGACUGUGGCUG UGCCCCUCGCAGUAAAUUUGGAGGUUCUACAUCC 427 MIR3171 UAUAUAUAGAGAUGUAUGGAAUCUGUAUAUAUCUAUAUAUAUGUGUA UAUAUAGAUUCCAUAAAUCUAUAUAUG 428 ENSG00000253068 UGGAACAAUGAUGAGAGUGUGUCAUGAACCAAGGUUAUGAUUAAUCU AGUUCUGUGCAUCUGAAAUCCGUU 429 hsa-miR- GGUGAGGCUAGCUGGUG 4316 430 MIR379 AGAGAUGGUAGACUAUGGAACGUAGGCGUUAUGAUUUCUGACCUAUG UAACAUGGUCCACUAACUCU 431 ENSG00000239015 AUCCUUUAGUUCUUAAACAUGACAAUUGGAUGUUUAUGCAUAUGUGU GAGAUGUGUCACCCUUGAACCUUGUUACCAUGUCUGCACAUUACCUAU CUGACA 432 MIR4645 UGAUAGGGAAACCAGGCAAGAAAUAUUGUCUCCUCAAGUUGCGACGA GACAGUAGUUCUUGCCUGGUUUCUCUAUCA 433 hsa-miR- ACAAAGUACAGCAUUAGCCUUAG 3973 434 MIR184 CCAGUCACGUCCCCUUAUCACUUUUCCAGCCCAGCUUUGUGACUGUAA GUGUUGGACGGAGAACUGAUAAGGGUAGGUGAUUGA 435 ENSG00000221083 ACAGACUCACUUUGCACCUGGCUGCAGCCUCAUGGGGGUGCUUUUUCC AUGUGCCAGGGAAACAUUCUGGGGUGUUGUGGCUGCCUGACCUAUCA AGGGUGAUGCAGCUGUCUGGGGAUACAGGA 436 ENSG00000207171 GGCUUGCUGGUGCUUACCACAGGCUGAAUUCUUACACUGACUAUAUA GAAAAGGAGGUAGAGUAAACCUACCCAAUAUACCCCUCAGCCCAGGCU CUGUGCCUGAUCUAUAUUGUGAAUGUGGGAACAUAG 437 SNORA20 CUUCCCAUUUAUUUGCUGCUUGUAGUCUCACAGUGAUACGAGCAGUU AUACGCAUGGGAUAAAAUAACAUUGGGCCACUGUAAAUUGAGAUGAA GUAACCAUUUUCAUCUCUUCUGCAGGGACUAGACAUUG 438 MIR5002 UCUUCCUCUCUGUCCUCUGGAAUUUGGUUUCUGAGGCACUUAGUAGG UGAUAGCAUGACUGACUGCCUCACUGACCACUUCCAGAUGAGGGUUAC UC 439 MIR649 GGCCUAGCCAAAUACUGUAUUUUUGAUCGACAUUUGGUUGAAAAAUA UCUAUGUAUUAGUAAACCUGUGUUGUUCAAGAGUCCACUGUGUUUUG CUG 440 MIR302A CCACCACUUAAACGUGGAUGUACUUGCUUUGAAACUAAAGAAGUAAG UGCUUCCAUGUUUUGGUGAUGG 441 ENSG00000223111 UCCAGCAGUAGUCAGCUGUCUGGACAGAACCAUUCCUGGGAUCAUGUU ACACUGCUGGGAGAAGAAUGUCUUCUCUUCAUCCAGUUGCGUCCAUCA CUGUUCUGGUGGUGUCUGGCACUGGUGCAAGGCAGAACUGUGCUUCC UUGAGAGUGUGCUGAGCAUUCACCUUGGCUGCUUGGUUCUAGUCUAG GAGCAGACACA 442 hsa-miR- UUUGUUCGUUCGGCUCGCGUGA 375-3p 443 MIR3678 GAAUCCGGUCCGUACAAACUCUGCUGUGUUGAAUGAUUGGUGAGUUU GUUUGCUCAUUGAUUGAAUCACUGCAGAGUUUGUACGGACCGGAUUC 444 SNORD115- GUGUUGAUGAUGAGAACCUUAUAUUAUCCUGAAGAGAGGUGAUGACU 1 UAAAAAUCAUGCUCAAUAGGAUUACGCUGAGGCCC 445 hsa-miR- CUCUAGAGGGAAGCACUUUCUC 518f-5p 446 MIR16-2 GUUCCACUCUAGCAGCACGUAAAUAUUGGCGUAGUGAAAUAUAUAUU AAACACCAAUAUUACUGUGCUGCUUUAGUGUGAC 447 MIR3691 UUGAGGCACUGGGUAGUGGAUGAUGGAGACUCGGUACCCACUGCUGA GGGUGGGGACCAAGUCUGCGUCAUCCUCUCCUCAGUGCCUCAA 448 hsa-miR- AGGCAUGGGAGGUCAGGUGA 3622b-5p 449 MIR4441 CAGAGUCUCCUUCGUGUACAGGGAGGAGACUGUACGUGAGAGAUAGU CAGAUCCGCAUGUUAGAGCAGAGUCUCCUUCGUGUACAGGGAGGAGA UUGUAC 450 hsa-miR- AUUGAAACCUCUAAGAGUGGA 510-3p 451 MIR454 UCUGUUUAUCACCAGAUCCUAGAACCCUAUCAAUAUUGUCUCUGCUGU GUAAAUAGUUCUGAGUAGUGCAAUAUUGCUUAUAGGGUUUUGGUGUU UGGAAAGAACAAUGGGCAGG 452 hsa-miR- CUUGGCACCUAGCAAGCACUCA 1271-5p 453 SCARNA4 ACUGGAGGACUAAGAAGGCUGAGUCUGAUGAAGUAAGACUUUGCUGA UACAUUCCUCCUAGAAAAAAGGGUUGGAGAGAGCAGCCUUCACUGAA GAGUAUCACAGGGCUGACUGUACUACCCAACACUC 454 MIR302E UUGGGUAAGUGCUUCCAUGCUUCAGUUUCCUUACUGGUAAGAUGGAU GUAGUAAUAGCACCUACCUUAUAGA 455 hsa-miR- CGACCUCGGCGACCCCUCACU 6790-3p 456 MIR3613 UGGUUGGGUUUGGAUUGUUGUACUUUUUUUUUUGUUCGUUGCAUUUU UAGGAACAAAAAAAAAAGCCCAACCCUUCACACCACUUCA 457 SNORD57 UGGAGGUGAUGAACUGUCUGAGCCUGACCUUGUAGAAUGGAGGCAAA AAAACUGAUUUAAUGAGCCUGAUCC 458 hsa-miR- CAUGCUAGGAUAGAAAGAAUGG 3146 459 hsa-miR- CUCAGUAGCCAGUGUAGAUCCU 222-5p 460 MIR1302- UGCCCGGCCUCCCAUUAAAUUGGUUUUUCAGACAAAUCACAAAUUUGU 5 UUAGGUAUAAGUAUAUCCCAUGUAAUCUUUGGGACAUACUUAUGCUA AAAUAAUUGUUCCUUGUUGAUUGGAAAUUUUAAUUUUAAUUAGGUGU CCUGUAUU 461 hsa-miR- UGCGUUUCUCCUCUUGAGCAG 6854-3p 462 SNORA15 GCAUGGCCGAAUACUGUGUUUUUAUCAGUAGUUUACACAGCCAGACA CCAUGCAAAAGCAGUCUUCCCUUUAGAAUGACUGAUGGUAUGCUAAG GUUUUUCAUAGCAUAUCAUUAUUAAAGGUGAAUACAAAU 463 hsa-miR- AACAACAAAAUCACUAGUCUUCCA 3529-3p 464 MIR106B CCUGCCGGGGCUAAAGUGCUGACAGUGCAGAUAGUGGUCCUCUCCGUG CUACCGCACUGUGGGUACUUGCUGCUCCAGCAGG 465 SNORD115- GGGUCAAUGAUGAGAAUCUUAUAUUGUCCUGAAGAGAGGUGAUGACU 39 UAAAAAUCAUGCUCAAUAGGAUUACGCUGAGGCCC 466 hsa-miR- UAAUUUUAUGUAUAAGCUAGU 590-3p 467 ENSG000 GCAUGGGUUUGGAUUUAUGAUGGGCUGGAUUCCCUAGGCCUCUCAUA 00199787 GUACCCCAUGCCAGAGCAAACUGUAGCCCCAACCAUUGCCGGGCCUCU AUGCCUGUAGGCUGCUGGCACUGAAGUGGGUUGCACAGUA 468 hsa-miR- ACCUUCCUCUCCAUGGGUCUUU 3667-3p 469 SNORD15A CUUCGAUGAAGAGAUGAUGACGAGUCUGACUUGGGGAUGUUCUCUUU GCCCAGGUGGCCUACUCUGUGCUGCGUUCUGUGGCACAGUUUAAAGAG CCCUGGUUGAAGUAAUUUCCUAAAGAUGACUUAGAGGCAUUUGUCUG AGAAGG 470 hsa-miR- UGUGUCCCAUUAUUGGUGAUU 3657 471 MIR129-2 UGCCCUUCGCGAAUCUUUUUGCGGUCUGGGCUUGCUGUACAUAACUCA AUAGCCGGAAGCCCUUACCCCAAAAAGCAUUUGCGGAGGGCG 472 hsa-miR- AGGACUGGACUCCCGGCAGCCC 4515 473 hsa-miR- UGUAGUGUUUCCUACUUUAUGGA 142-3p 474 MIR6802 GAGGGCUAGGUGGGGGGCUUGAAGCCCCGAGAUGCCUCACGUCUUCAC CCCUCUCACCUAAGCAG 475 hsa-miR- ACUUGGGCACUGAAACAAUGUCC 635 476 hsa-miR- AUCAUAUGAACCAAACUCUAAU 5007-3p 477 MIR6835 UGAUGAGGGGGUAGAAAGUGGCUGAAGCGAGAUGUUUGUCUAAAAGC ACUUUUCUGUCUCCCAG 478 SNORD96B CCUGGUGAUGACAGACGACAUUGUCAGCCAAUCCCCAUGUGGUAGUGA GGACAUGUCCUGCAGUUCUGAAGG 479 MIR4755 AGAUUCAGCUUUCCCUUCAGAGCCUGGCUUUGGCAUCUAUGAAAGCCA GGCUCUGAAGGGAAAGUUGAAUCU 480 ENSG00000238488 AUCAUUUUGCAGCUUAUACAUGUGAUGACUGGGUUUUUUAACUCAUA AGUGAGAUGUGCCUUUCUUACAUCUUAUUAUGACAUUAGUACAUUAC CCAUUUGAUA 481 ENSG00000238624 AUCCUUCUGUAGUUUAUGAGUGUGAUGAUUGGCUGUUCAUGUGCAUG UAUGAGAUGUGCCACCCUUGAACCUUGUCAUGUCUGAUGUGAAA 482 hsa-miR- UCUCCCUUGAGGGCACUUU 4287 483 ENSG00000252154 UAGACCAGUGAUGAGAAUCUGUCAUGAACCAAGGAGUAUUAUUAAUC UAAUUCUGUUUACCUGAGAGUUUUAAA 484 SNORD91B AAGAGCCAAUGAUGUUUUUAUUCAAAAUGUCUGAACCUGUCUGAAGC AUCCCAGUGAUGCAACUUCUGUGUGAUACUGAGGCUUUU 485 ENSG00000238769 AUCCUUUUGUGGUUCAUAAGCAUGAUGAUUGGGUUUCCACAUUCUUG UGAGAUGGGCCUCCCUCCAACCUUGUUAUGAUGUCAGCACAUUACCCU UGACG 486 MIR3126 AUGAUUAUAUGAGGGACAGAUGCCAGAAGCACUGGUUAUGAUUUGCA UCUGGCAUCCGUCACACAGAUAAUUAU 487 hsa-miR- AGGUGCUCCAGGCUGGCUCACA 3907 488 hsa-miR- UCACAAAUCUAUAAUAUGCAGG 4719 489 MIR612 UCCCAUCUGGACCCUGCUGGGCAGGGCUUCUGAGCUCCUUAGCACUAG CAGGAGGGGCUCCAGGGGCCCUCCCUCCAUGGCAGCCAGGACAGGACU CUCA 490 SNORD114- UGGACCAAUGAUGACAACUGCCGGCGUAUGAGUGUUGGGUGAUGAAU 14 AAUACGUGUCUAGAACUCUGAGGUCCA 491 MIR4757 UUCCAGCCCGAGGCCUCUGUGACGUCACGGUGUCUGCGGGAGGAGACC AUGACGUCACAGAGGCUUCGCGCUCUGAG 492 MIR4738 GGUCGCAUUUCUCCUUCUUACCAGCGCGUUUUCAGUUUCAUAGGGAAG CCUUUCCAUGAAACUGGAGCGCCUGGAGGAGAAGGGGCC 493 hsa-miR- UGACCCCCAUGUCGCCUCUGUAG 6859-3p 494 MIR548AV AAAAGUACUUGCGGAUUUGCCAUCACCUUUACCUUUAAUGGCAAAAC UGCAGUUACUUUUGC 495 MIR4733 GGUCGCUUAAAUCCCAAUGCUAGACCCGGUGGCAAUCAAGGUCUAGCC ACCAGGUCUAGCAUUGGGAUUUAAGCCC 496 MIR3202- AUUAAUAUGGAAGGGAGAAGAGCUUUAAUGCUCUGAAAAUGACUCCA 2 AUCAUUAAAGCUCUUCUCCCUUCCAUAUUAAU 497 ENSG00000252728 UGCACUGAUGACAGUGAACCAUAAACCAAGAAUUAUGAUUUAUCCAG UUCUAUGAAUCUUAAGUCCAUU 498 ENSG00000238767 AUCCUUUUGUAGUUCAUAAGCGUGAUGACUGUGGUUUCAUGCUUGUG UGUGAGAGAUGGGUGGGCCUCCCUCAAACCUUGUUACGACGUAGGCCC AUUACCCAUCUGACA 499 ENSG00000199977 CCAAUGUGUAAUAUCCUGGGAUAUCAUUUUUUCUAGGCUUUGUCCAC AUGGCUUAGGGGAGCAUAGGGCUCUGCCCCAUGAUGUACAGUCCCUUU CCUCAGUGUUGGAGAUGAAGCUGGGUCUGGUGUUUGCACUUUCAUAU UCCUGUAGCUUCUCAGAAUCCUGUGGACAGUGACUGGGGAGACAAACC AUGCAGGAAAUAUGU 500 ENSG00000238564 AUCCUUCUAUAUAGUUCAUAAGCUUGAUGAUCGGUGUUCACACACAU GUGAGAUACGCCACCUGUGAACCUUGUUAGGACAUCAGCACAUUACCC AUCUGACA 501 ENSG00000238970 AUCCUUUUGUAGUUCAUAAGCACAAUGAUUGAAUUUUCAUGCUCAUG UGUGAGAUAUGCCUCACUCCAGUCUUGUUACAGUGUUAGCACAUUACC UAUCUGAUA 502 ENSG00000252298 UGGACAAAUGAUUAGAUUAGAUUGUGUUAUAAACCAAAGAUUAUAGU UAUUCCAAUUAUGUGCAUUUGAGAUCCACU 503 ENSG00000200398 CAGUCAGUGUCGAGAACCUUAUAUUGUUCUGAAGAGAGGUGGUGACU UAAAAAUCAUGCUCAAUAGGAUUACGCUGAGGCCC 504 SNORD116- UGGAUCGAUGAUGACUUUCAUACAUGCAUUCCUUGGAAAGCUGAACA 16 AAAUGAGUGAAAACUCUAUACCGUCAUCCUCGUCGAACUGAGGUCCA 505 ENSG00000212445 UGUCCUUGACUUGGGUAGAGUGAUGUCUGGUUGGUGCUGCCUAUCUC AUAUAAGCCAGGGACAAAUCAAUGCCUUAUUUAUUCCAGCUUGGCUU UUGGUCUGUGCCCAUACCUGGUUUAUGCCUUGGACACAUGG 506 MIR548I3 CAGAUGGCUCCGAAGUUUACAUCCUAUUAGGUUUGUGCAAAAGUAAU UGCGGAUUUUGCCAUUAAAAGUAAUGGCAAAAAUAGCAAUUAUUUUU GUACCAGCCUAGUAUCUUUUCUCCUUCUACCAAACUUUGUCCCUGAGC CAUCUCA 507 ENSG00000199851 AAGACUAUACUUUCAGGGAUCAUUUCUACAUUUCCGGGUAAUUUCUU UGAACAUGUGGAGCACCGGAAACCACCAGGAGGAGGCACAGCAUUUUC UCUGGAGCGUGAAGCCAGUUCUUGGUGUUGCUGCAUAGCAACUGCCA UUUGCCUUUGAUGAUCAUUCUUCUUUUCCUUUAGGAGAAUAAGAGGG GGAGAACCCAGUCUGAGGG 508 MIR6776 CGGGCUCUGGGUGCAGUGGGGGUUCCCACGCCGCGGCAACCACCACUG UCUCUCCCCAG 509 hsa-miR- UUUGUAUGGAUAUGUGUGUGUAU 3149 510 ENSG00000201701 AGGUCAUUUCAAAGAGGUCUUGUGAGGCUGUGAAACCAAGAGCUCUU AACACUGCGACCAAAGAUGGAAGUUCUCUAUAGGAUGCCAUGGCAUU UGAUGGUGCUAUGUUUUCUUGAGGAGAUAUAAGA 511 hsa-miR- CCAGCCACGGACUGAGAGUGCAU 4691-3p 512 ENSG00000238327 AUCCCUUUAUAGUUCCCGAGCAUGACGAUUGGGUGUUCACAUGCAUG UGUGAGAUGUACCACCCUCGCAUCUUGUUAGACGUUGGCACAUUACCC GUCUGACC 513 hsa-miR- GUUCCACACUGACACUGCAGAAGU 3692-3p 514 hsa-miR- AGUAUUCUGUACCAGGGAAGGU 630 515 MIR143 GCGCAGCGCCCUGUCUCCCAGCCUGAGGUGCAGUGCUGCAUCUCUGGU CAGUUGGGAGUCUGAGAUGAAGCACUGUAGCUCAGGAAGAGAGAAGU UGUUCUGCAGC 516 SNORD50A UAUCUGUGAUGAUCUUAUCCCGAACCUGAACUUCUGUUGAAAAAAAA AAACUUUUACGGAUCUGGCUUCUGAGAU 517 ENSG00000238625 AUAAUCUUGUAGUUCAUAAGCAUGAUGAUUGCCUUUUCACACUCGUA UGAGAUGUGCCUCCCUUGAACCUUGUUAUGAUGUUGGCACAUUACCCA UCUGAUG 518 SCARNA8 UGGGAGGCUGAUACACAAAUUGGGCUGAAAUACUGCUCUACUUGUCA CCAUGCCUCCCUAGAAUAAACUGCCUUUUGAUGACCGGGACGAAUUGA GUGAAAUCGUAACGGACAGAUACGGGGCAGACAGAU 519 hsa-miR- CCAGAGAUGGUUGCCUUCCUAU 4756-3p 520 hsa-miR- CGGGGCCAUGGAGCAGCCUGUGU 6762-5p 521 ENSG00000239135 AUUCUUUUGCUGUUCGUAAGCAUAAGGAUCAGGUAUUCAUGGUCAUG UGUAAGACGUGCCUCCCUCCAACCUUGUUACGAUGUGGACGUCAGCAC AUACCCAUUUGAUG 522 MIR222 GCUGCUGGAAGGUGUAGGUACCCUCAAUGGCUCAGUAGCCAGUGUAG AUCCUGUCUUUCGUAAUCAGCAGCUACAUCUGGCUACUGGGUCUCUGA UGGCAUCUUCUAGCU 523 ENSG00000222185 UGGACCAAUGAUGUGAAUGGAAUGCAUCUGAAUAAAAAUUAUGAUCA AUCAGUUUUUGGAACAACUGAGGUCCAC 524 ENSG00000238557 AGGCCCCUGUAGUUCCCGAGCACGAUGACUGGGUGUUCACGUGCACGU GUGGGAUGUGCCACCCUCUGAACCUUGUUACGAUGUUGGCACAUUACC CUGGACCUGACC 525 MIR592 UAUUAUGCCAUGACAUUGUGUCAAUAUGCGAUGAUGUGUUGUGAUGG CACAGCGUCAUCACGUGGUGACGCAACAUCAUGACGUAAGACGUCACA AC 526 ENSG00000251778 UUCAAAAAAGACCAUAUAUCCUUGAAGAGUAACUGCUGAACUUAUUC ACUGGCAGUGGGCCUUAUAGCACAGUGAAUGACCAGGUUAGAGACAU GC 527 ENSG00000200538 AAGACUAUACUUUCAGGGAUCAUUUCUAUAGUUAGUUGCUAGAGAAG UUUCUCUGGACAUGUGGAGCACCAGAAACCAUGAGAAGGAGAUGUAG UGUUCUCUCCGGAGCAUGAAGCUGGCUCUUGGUGUUGCUUCGCUGCAC CUGCCAUUUGCCAUUGACAAUCAUUCUUCUCUUCCUCUGGGAGAGUAA GGAGGAGAGGACACAGUCUGAGUGG 528 hsa-miR- CUGCAGCCACUUGGGGAACUGGU 7156-3p 529 MIR5580 UGCUGGCUCAUUUCAUAUGUGUGCUGAGAAAAUUCACACAUAUGAAG UGAGCCAGCAC 530 hsa-miR- CCUGACACCCCAUCUGCCCUCA 6730-3p 531 hsa-miR- UUACACAGCUGGACAGAGGCA 4672 532 MIR217 AGUAUAAUUAUUACAUAGUUUUUGAUGUCGCAGAUACUGCAUCAGGA ACUGAUUGGAUAAGAAUCAGUCACCAUCAGUUCCUAAUGCAUUGCCU UCAGCAUCUAAACAAG 533 hsa-miR- GGUGGGGGGUGUUGUUUU 4472 534 MIR770 AGGAGCCACCUUCCGAGCCUCCAGUACCACGUGUCAGGGCCACAUGAG CUGGGCCUCGUGGGCCUGAUGUGGUGCUGGGGCCUCAGGGGUCUGCUC UU 535 hsa-miR- UAGCACCAUUUGAAAUCGGUUA 29c-3p 536 ENSG00000239068 AUCCUUUUGUAGUUCAUGAGCAUGAUGAUUGGUUGUUCACGUACAUG UGUGAGAUGUGUCACCCUCGAACCUUGUGGCAAUGUUGGAAUAUUAC CUGUCUGACA 537 ENSG00000252443 GAUUCACAGCAGAAAGACAGCUAAUCUAGUGUGCUAGCUGUAGAGCA AGUUUGCUGCAAACACCUCAAGGAGGGUCUCUGGCCAAAUGAGUAGA AUCUGACAGUAAUCCUUGCUAAAAGU 538 ENSG00000207299 CUACAAUGAUGGCAAUAUGUUUCAUCGACAGCAGUUCACCCAUUGAG UGUUGAUACCGUGGGUCUGAGUGA 539 MIR3158- AUUCAGGCCGGUCCUGCAGAGAGGAAGCCCUUCCAAUACCUGUAAGCA 2 GAAGGGCUUCCUCUCUGCAGGACCGGCCUGAAU 540 SNORD114- UGGACCAAUGAUGACAAAUACCGGCGUAUGAGUCUUGGAUGAUGAAU 12 AAUACGUGUCUGGAACUCUGAGGUCCA 541 MIR4531 GCCUAGGAGUCCUUGGUCAGUGGGGACAUGGAGAAGGCUUCUGAGGA 542 MIR6834 GUGAGGGACUGGGAUUUGUGGGGCGAGGAGGGACCUGUACUAGCCAU GGUUCUGAUCACAUAUGUCCCAUCCCUCCAUCAG 543 ENSG00000201025 GUUUCUGAUGAAGCCUAUGUUGGUAGGGACAACUAAGGUUGUUGAUG AAUGCUAACAGCUCUAACACACACAC 544 MIR451A CUUGGGAAUGGCAAGGAAACCGUUACCAUUACUGAGUUUAGUAAUGG UAAUGGUUCUCUUGCUAUACCCAGA 545 hsa-miR- GUGAGCCAGUGGAAUGGAGAGG 6807-5p 546 MIR4477B ACCUCCUCCCGUGAAUCACAAAUGUCCUUAAUAGCAAUCCUUAAAUGC CAUUAAGGACAUUUGUGAUUGAUGGGAGGAGGA 547 MIR548AO AACUAUUCUUAGGUUGAUGCAGAAGUAACUACGGUUUUUGCAGUUGA AAGUAAUGGCAAAGACCGUGACUACUUUUGCAACAGCCUAAUAGUUU CU 548 ENSG00000207027 AUCCAAGGGGAUUCCCUCUCCAAGGGAACAUGCAGUGCCCCUCUCAGG AAAGUAACAACCUGGAAUAGAAUCUGGCAUGCCUAAGGUCUUUGAGG AAUAGAGGAAUAGAGGAUGCUUGUUUCCUCUGCCUUCCUUGGCUGCC UACAUGG 549 MIR1302- ACAACAUGUUUUUAGGACAUGUAUGUCUGGUGCAAUAAUUGGGACAU 7 ACUUAUGCUAAAAAAAUUAGUGUUC 550 MIR3605 ACUUUAUACGUGUAAUUGUGAUGAGGAUGGAUAGCAAGGAAGCCGCU CCCACCUGACCCUCACGGCCUCCGUGUUACCUGUCCUCUAGGUGGGAC GCUCG 551 MIR588 AGCUUAGGUACCAAUUUGGCCACAAUGGGUUAGAACACUAUUCCAUUG UGUUCUUACCCACCAUGGCCAAAAUUGGGCCUAAG 552 ENSG00000212532 GUCCUCUGAUGACUUCAUGUUAGUGCCACCUGUCUGGGCCACGGAGAA CCCAUGAUGGAACUGAGAAUCUGAGGAA 553 hsa-miR- UUGCUCUGAGCUCCGAGAAAGC 5691 554 ENSG00000238506 ACCCUUUUGUAGUUCAUAAGCAGGAUGACUGAGUUUUCAUGCACUUG UGUGAGAUGCGCCUCCCUCAAUGUUGGCACAUUACCUAUCUGAUG 555 MIR553 CUUCAAUUUUAUUUUAAAACGGUGAGAUUUUGUUUUGUCUGAGAAAA UCUCGCUGUUUUAGACUGAGG 556 SNORD3A AAGACUAUACUUUCAGGGAUCAUUUCUAUAGUGUGUUACUAGAGAAG UUUCUCUGAACGUGUAGAGCACCGAAAACCACGAGGAAGAGAGGUAG CGUUUUCUCCUGACGUGAAGCCGGCUUUCUGGCGUUGCUUGGCUGCAA CUGCCGUCAGCCAUUGAUGAUCGUUCUUCUCUCCGUAUUGGGGAGUGA GAGGGAGAGAACGCGGUCUGAGUGGU 557 hsa-miR- AAUGUGGACUGGUGUGACCAAA 4491 558 hsa-miR- UGGGAACUUAGUAGAGGUUUAA 4471 559 MIR4288 AUGGAGGUGGAGAGUCAUCAGCAGCACUGAGCAGGCAGUGUUGUCUG CUGAGUUUCCACGUCAUUUG 560 SNORA33 AAGCCAGCCAAUGAAUCUGCUUACCUGAUUGUGUUUGUGCAGACAUA CUUUAAAAACUGGCAAUAGUAAAGCCAUGUUACGAGCCUUAAGGACA UUGAAGUCGUUAAGGUCCCUGAGAAUGGCUAUAACAAAU

The Sentinel™ HG Test to Identify Patients with High Grade Prostate Cancer

For individuals classified as having prostate cancer, a similar approach was used to train and validate the Sentinel™ High Grade (HG) which discriminates between GG1+GG2 (low- and favorable-intermediate risk cancer) and unfavorable-intermediate and high-risk prostate cancer (GG3-GG5). These informative sequences identified form the basis of the Sentinel™ HG Test. Additional analyses demonstrated that the same cohort of sncRNAs can be used to dichotomize the patients with cancer into GG1+GG2 (low and intermediate risk cancer) from GG3-GG5 (high risk cancers). This biostatistical analysis forms the basis for the Sentinel™ HG Test, which utilizes 280 sncRNAs, identified by the Discovery HG Test, of which 280 unique sncRNAs: 191 miRNA and 89 snoRNAs are highly informative.

TABLE 3 SEQ ID NOs: 561-840 Used in the HG Test Analysis SEQ ID Sequence NO: Name Sequence 561 hsa-miR- UCUGCAAGUGUCAGAGGCGAGG 2276-3p 562 hsa-miR-19a- UGUGCAAAUCUAUGCAAAACUGA 3p 563 hsa-miR-34a- CAAUCAGCAAGUAUACUGCCCU 3p 564 MIR3924 UAAAUGAAAAAGUAGUAGUCAAAUAUGCAGAUCUAUGUCAUAUA UACAGAUAUGUAUAUGUGACUGCUACUUUUUUGUUUA 565 hsa-miR- CAGCCUGACAGGAACAG 4293 566 ENSG00000206731 UUCUAAAGUGUUGAGUUCAGUCCAGGGUGGAUCCCCUGCUCUGU UAAUUGAACUGGAACAUUUAAACUGGCUAGGCAAAAUGCCUACA UAGAAAGCAUUACUCUUUAUUCAUCCCCAGCCUACAAAA 567 hsa-miR-561- CAAAGUUUAAGAUCCUUGAAGU 3p 568 hsa-miR- CUUCUGAUCAAGAUUUGUGGUG 4762-3p 569 hsa-miR-409- GAAUGUUGCUCGGUGAACCCCU 3p 570 hsa-miR- GUGCAUGGCUGUAUAUAUAACA 5011-3p 571 MIR4315-1 UGGGCUUUGCCCGCUUUCUGAGCUGGACCCUCUCUCUACCUCUGG UGCAGAACUACAGCGGAAGGAAUCUCUG 572 MIR6505 GCAUUGGAAUAGGGGAUAUCUCAGCAUGUUGAGCCCUGUCUCUG GGGAGCUGACUUCUACCUCUUCCAAAG 573 ENSG00000212338 AUCCAAGGUGAUUCCUUCUCCAAGGGGGACAUCCAGUGACCCUCU CAGGAAGUAGCAACUUGGAAUAGAAUAGUCCAGGAGUUCCAGGA CCAGCCUGGCCAAUAUGG 574 hsa-miR- GCCCUCUGUCACCUUGCAGACG 2276-5p 575 hsa-miR- AAGCCUCUGUCCCCACCCCAG 6819-3p 576 hsa-miR- UUAGCCAAUUGUCCAUCUUUAG 4662a-5p 577 ENSG00000239145 AUCUUUUUGUAGUUCAUGAGCGUGAUGACUGAGUGUUCAUGUGC AUGUGUGAGGCGUGCCACCCUUAAACCUUGUUAUAACAUCAGCA CAUUACCCACAUGACA 578 hsa-miR-188- CAUCCCUUGCAUGGUGGAGGG 5p 579 ENSG00000252193 CUGGAGACUAAGAAACCAGUCCUUGAAGUCAAGCUGACUCUGCU UUUAGCCUCCUAAAUUAAAAGAUAGAUAGAAUAGGUCUUGUUUG CAAAAUAAAUUCAAGAUCUACUCAUCUAUCAAUAGCA 580 hsa-miR- CAGUGCAAUGUUAAAAGGGCAU 130a-3p 581 hsa-miR- CUACAGGCUGGAAUGGGCUCA 7151-3p 582 ENSG00000253065 AGCACUUGUGUUUGCUUUUGUUUGACUUGUGGACAAAGACUUAU AGUAGACAGGCACGAAAAAAUAAAUCCUCUUUUGCAACCCAUGA GUUGUUAUACAUGCAAGAAGGAAUAUU 583 hsa-miR- UGCCCUGUGGACUCAGUUCUGG 146b-3p 584 hsa-miR- CUUCUGCCUGCAUUCUACUCCCAG 6738-3p 585 hsa-miR-98- CUAUACAACUUACUACUUUCCC 3p 586 ENSG00000251838 CUUCUGCUAAGGUUUACACUAUAGAUGCAGGAAAAAAAAUGUCC UCACACUGUCUGUCUGAUUGUGGCAGCUGAGAUUGAAUAGAGAA AUAUAGGG 587 hsa-miR- CCGGGGCAGAUUGGUGUAGGGUG 5090 588 SNORD115- GGGUCAAUGAUGAGAACCUUAUAAUGUUCUGAAGAGAGGUGAUG 34 ACUUAAAAAUCAUGCUCAAUAGGAUUACGCUGAGGCCC 589 MIR382 UACUUGAAGAGAAGUUGUUCGUGGUGGAUUCGCUUUACUUAUGA CGAAUCAUUCACGGACAACACUUUUUUCAGUA 590 hsa-miR- UGACCCCCAUGUCGCCUCUGUAG 6859-3p 591 hsa-miR- CCUGCAGCGACUUGAUGGCUUCC 1184 592 hsa-miR- AUGCAGGCCUGUGUACAGCACU 6811-5p 593 MIR518C GCGAGAAGAUCUCAUGCUGUGACUCUCUGGAGGGAAGCACUUUC UGUUGUCUGAAAGAAAACAAAGCGCUUCUCUUUAGAGUGUUACG GUUUGAGAAAAGC 594 hsa-miR- UGUGACAAUAGAGAUGAACAUG 4504 595 MIR548S UUGCUGCAAAAAUAAUUGCAGUUUUUGCCAUUAUUUUUAAUAAU UAUAAUAAUGGCCAAAACUGCAGUUAUUUUUGCACCAA 596 MIR6761 UCUGCUCUGAGAGAGCUCGAUGGCAGGUGCCUCCGUGUUGCCGA ACCCUCCUACGCUGCUCUCUCACUCCAG 597 hsa-miR- UCGUUUGCCUUUUUCUGCUU 1282 598 hsa-miR- GUGACUGAUACCUUGGAGGCAU 4439 599 MIR 126 CGCUGGCGACGGGACAUUAUUACUUUUGGUACGCGCUGUGACAC UUCAAACUCGUACCGUGAGUAAUAAUGCGCCGUCCACGGCA 600 ENSG00000221125 AAGAUGACACUUUGAGGCAUCGUGUCUAUGGUUCAUUACUACAG AAGCUUCUCUGGAUGUGUAAAGCACAGGAAACCAGGCAGAGGAG GCACAGGGUGCUCUCCAGAACGAGAAGCCAGCUCCUGGAGUUGU UUGCUGCAACUGCCAUUCCCCGUUGAUGACCAUGCUCUUCCUUCA GAAGAGGGAGAGUGAGAGGACCAAGUCCAAGUGG 601 hsa-miR- UCCCAAGGGUGAGAUGCUGCCA 6814-5p 602 hsa-miR- AAUGUGGAAGUGGUCUGAGGCAU 4657 603 MIR31 GGAGAGGAGGCAAGAUGCUGGCAUAGCUGUUGAACUGGGAACCU GCUAUGCCAACAUAUUGCCAUCUUUCC 604 MIR6754 GGCUGCCAGGGAGGCUGGUUUGGAGGAGUCUGGUGGCCUGUUCU CUUCACCUGCCUCUGCCUGCAG 605 MIR376A2 GGUAUUUAAAAGGUAGAUUUUCCUUCUAUGGUUACGUGUUUGAU GGUUAAUCAUAGAGGAAAAUCCACGUUUUCAGUAUC 606 hsa-miR- UAACACUGUCUGGUAACGAUGU 200a-3p 607 hsa-miR- AUCAAAUAAGGACUAGUCUGCA 3671 608 hsa-miR- AGGCCCUGUCCUCUGCCCCAG 6775-3p 609 hsa-miR- CCUCACCCAGCUCUCUGGCCCUCU 6862-3p 610 ENSG00000238368 AUCCUUUUGUGGUUCAUAAGCAUGAUGAUUAGAUUUUCAUGCUA UUGGGUGAGAUAUGCCUUCCUCAGACUUUGUUACAGCAUAGGCA CAUUACAACCUGUCUGAUA 611 MIR183 CCGCAGAGUGUGACUCCUGUUCUGUGUAUGGCACUGGUAGAAUU CACUGUGAACAGUCUCAGUCAGUGAAUUACCGAAGGGCCAUAAA CAGAGCAGAGACAGAUCCACGA 612 hsa-miR-423- AGCUCGGUCUGAGGCCCCUCAGU 3p 613 hsa-miR- UGGAGAGAAAGGCAGUA 4306 614 ENSG00000252128 ACUCCAUGAUGAACCCAAAAUGCCAAGUAUAUGACUGAACUUAC AAGUGAUACCAUCUUACGACUGAAGAGU 615 ENSG00000238932 AUCCUUUUGUAGUUCAUGAGGAUGAUGGUUGGGUGUUUCACACA UGUGUGUGAAAUGUACCACCCUCAAACCUUGUUACAAUGUCAGC ACAUUACCUGCCUGACC 616 MIR96 UGGCCGAUUUUGGCACUAGCACAUUUUUGCUUGUGUCUCUCCGC UCUGAGCAAUCAUGUGCAGUGCCAAUAUGGGAAA 617 hsa-miR- ACUCCAGUUUUAGUUCUCUUG 3925-3p 618 hsa-miR- UCAGCACCAGGAUAUUGUUGGAG 3065-3p 619 ENSG00000200026 UCGAUGGGUGGGAUAAUCCUUACCUGUUCCUCGUUUUGGAGGGC AGAUAGAACAUGAUGAUUGGAGAUGCAUGAAAUGUGAUUAAUGC CUCUGCCUAAUCAGGACUUGCAACACCCUGAGUACUCCUCUCUGA U 620 ENSG00000239155 AUCCUUUUGUAGUUCAUAAGUGUGAUGAUUAGGUUUUCACAUUU GUGUGUGAGAUGUAUCUCCCUCAAACAUUUUAUGACAUCGGCAU AUUAUCCUUCUGAUG 621 hsa-miR-337- CUCCUAUAUGAUGCCUUUCUUC 3p 622 hsa-miR-559 UAAAGUAAAUAUGCACCAAAA 623 hsa-miR-580- UUGAGAAUGAUGAAUCAUUAGG 3p 624 ENSG00000200318 AAGACUAUACUUUCAGGGAUCAUUUGUAUAGUUCGUUACUAGAG AAUUUUCUCUGAAUGUGUAGAACACCAGAAACCACAAGGAGGAG GCGCAGCGUUCUCUCCUGAGCGUGAAGCCGGGUCCUGGUGUUGC UUCACUGCAACUGCCAUUUGCCAUUGAUGAUUGUUCUUCUCUUC CUUUGGGAGAGUAAGAGGCAAAGGAUGCAGUCUGAAUGG 625 hsa-miR- AACCAUCGACCGUUGAGUGGAC 181c-3p 626 MIR6808 GGGGCCAGGCAGGGAGGUGGGACCAUGGGGGCCUUGCUGUGUGA CCACCGUUCCUGCAG 627 ENSG00000238775 AUCUUUUUGUAGUUCAUAAGCAUGAUGAUUAUGUUUUUACAUUC AUGUGUAAGAUGUGCCUCCCUCAAACCUUGUUAUGAUGUCAGCA UAUUACCUGUCUGAUG 628 hsa-miR- ACUGACAGGAGAGCAUUUUGA 3660 629 SNORA33 AAGCCAGCCAAUGAAUCUGCUUACCUGAUUGUGUUUGUGCAGAC AUACUUUAAAAACUGGCAAUAGUAAAGCCAUGUUACGAGCCUUA AGGACAUUGAAGUCGUUAAGGUCCCUGAGAAUGGCUAUAACAAA U 630 hsa-miR-630 AGUAUUCUGUACCAGGGAAGGU 631 ENSG00000253028 UCCAUCUGUUUGGCAGACCUGGAGCAGUUAGUGUCUGCUGCUAA GGUUUCCAUUACAGAUGUGAGAAAAAAAAGUGUUCUUCUGCUUU CUGUCUGUCUCAGUGGCAACCAAGAUUGAAUGGGGGAUAUGAGA G 632 ENSG00000238618 AUUCUUUUGUAAUUCAUAAGCAUGAUGACUCGGUAUUCACGUGC AUGUGUGAGAUGUGCCACCCUGGAACCUUGUUGCAACGUCAGCA CAUUAUGGGUCUGACA 633 ENSG00000252844 AUGACCAAUGGUGAGAGUGUAUCAUGAAGCAAGGAAUGUGAUUA AUCCAGUUCUGUAAACCCAAGUUCCAGU 634 hsa-miR- UUGGAAGCUUGGACCAACUAGCUG 6079 635 ENSG00000238552 AUCCUUUUGUAGUUCAUGAGCAUGACCAUCGAGUGUUUACAUGC AUGUGUGAGAUAUGACACCUUCUGAACCUUGUUACGGAGUUGGC AUGUUACCCAUCUAACC 636 ENSG00000212165 AAGAUUAUAUUUUCAGGGAUCAUUUCUAUAGUUUGUCACUAGGG AAGUUCCUCUGAAUGUGUAGAGCACCAGAAACAUGAGGAAGAGG CACAGGGUUCUCUCCUGAGUGUGAAGCUGGCUCUUGGCGCUGCU UUCCUGCAACUGCUAUUUGCCAUUCGUGAUUGUGGAGAGUCAGA GGGAGAGGAUGAUGCAGUCUGAGUGG 637 ENSG00000238752 AUCCUUUUGUAGUUCAUGAGCAUGAUGAUUGGGUGUUCACGUGC AUGUGUGAGAUGUGACACCCUUGCACAUUACUCGCCUGACC 638 hsa-miR-504- AGACCCUGGUCUGCACUCUAUC 5p 639 hsa-miR- UAACAAACACCUGUAAAACAGC 5688 640 SNORD23 UGCCCAGUGAUGACACCAUCCUUGCUCCCCGUGCCCCCCAGGGGC UAUGGGCGACACCAUGGCUGCCCCUGGGCUGGGCCAGUGGGGCCA AUGCCCAGGGGCUGAGGGCA 641 MIR6797 CAGCCAGGAGGGAAGGGGCUGAGAACAGGACCUGUGCUCACUGG GGCCUGCAUGACCCUUCCCUCCCCACAG 642 ENSG00000252682 ACUCACUGAUGAGUAGCUUCUGACUUUCAUUCUGAGUUUGCUGA ACCCAGAUGCCAUUCCUGGGAAGG 643 ENSG00000238914 AUCCUUUUGUGGUUCAUUAGCUUGAUAUUGGGUUUUCACACUAU UCUAUGAGAUGUGCCUCCCUCAAAACUUGUUACAACAUUGACAC AUUACCCUUCUGAUG 644 MIR3202-1 UAUUAAUAUGGAAGGGAGAAGAGCUUUAAUGAUUGGAGUCAUUU UCAGAGCAUUAAAGCUCUUCUCCCUUCCAUAUUAAUG 645 MIR520F UCUCAGGCUGUGACCCUCUAAAGGGAAGCGCUUUCUGUGGUCAG AAAGAAAAGCAAGUGCUUCCUUUUAGAGGGUUACCGUUUGGGA 646 hsa-miR- UUCUUUGUUUUUAAUUCACAG 6844 647 hsa-miR- UUUUAAGGACACUGAGGGAUC 7856-5p 648 MIR6739 GAAUGUGGGAAAGAGAAAGAACAAGUAAAAGGAAUUUUCAUUUU CCAGCCCCUAAUUGUUCUGUCUUUCUCCCAG 649 MIR4521 UCGGCUAAGGAAGUCCUGUGCUCAGUUUUGUAGCAUCAAAACUA GGAUUUCUCUUGUUAC 650 SNORD114- UGGAUCGAUGAUGACUGCUGGUGGCGUAUGAGUCAUAUGCGAUG 28 AAUACGUGUCUAGAACUCUGAGGUCCA 651 MIR6867 CCCGGUGUGUGUGUAGAGGAAGAAGGGAAGCUGGGAACCUGACU GCCUCUCCCUCUUUACCCACUAG 652 hsa-miR- AAGUAGUUGGUUUGUAUGAGAUGGUU 1244 653 hsa-miR-23a- AUCACAUUGCCAGGGAUUUCC 3p 654 ENSG00000238889 AUUCUUUUGUAGUUCUUAGGCACGAUGAUUGGGUGUUCAUGUGC AUGUUUGAGAUGUGCCUCCCUCAAACCUUGUUCUUACAUCAGCA CCUUACACGUCUAACA 655 hsa-miR- CAUUCAACUAGUGAUUGU 4272 656 MIR320E GCCUUCUCUUCCCAGUUCUUCCUGGAGUCGGGGAAAAGCUGGGU UGAGAAGGU 657 ENSG00000238642 AGUUUUUCAUAGUUCAUAAGCAUGAUGAGUGGGUUUUCAUGUUC AUGUGUGAGGUGUGCCUCCCUCAAACCUUGUUAUGAUGUCAACA CAUUGCCCAUCUGAUG 658 ENSG00000266834 UGCAGAUGAUGUAAAAGAAUAUUUGCUAUCUGAGAGAUGGUGAU GACAUUUUAAACCACCAAGAUCGCUGAUGCA 659 hsa-miR- GCUGCACCGGAGACUGGGUAA 3130-3p 660 hsa-miR- CUACCCUCGGUCUGCUUACCACA 7848-3p 661 MIR4699 AGCAAUUGGAGAAGAUUGCAGAGUAAGUUCCUGAUUAAGAAAUG GAAUUUACUCUGCAAUCUUCUCCAAUUGCU 662 MIR770 AGGAGCCACCUUCCGAGCCUCCAGUACCACGUGUCAGGGCCACAU GAGCUGGGCCUCGUGGGCCUGAUGUGGUGCUGGGGCCUCAGGGG UCUGCUCUU 663 MIR3686 CUCACCUCAUUCAUUUACCUUCUCUUACAGAUCACUUUUCUGCAC UGGACAGUGAUCUGUAAGAGAAAGUAAAUGAAAGAGGUGAG 664 hsa-miR- AAAAGUAAUUGCAGUUUUUGC 548ar-5p 665 MIR1203 UCCUCCCCGGAGCCAGGAUGCAGCUCAAGCCACAGCAGGGUGUUU AGCGCUCUUCAGUGGCUCCAGAUUGUGGCGCUGGUGCAGG 666 SNORD115- GGGUCAAUGAUGAGAACCGUAUAUUGUCCUGAAGAGCGGUGAUG 33 ACUUAAAAAUAAUGCUCAAUAGGAUUACGCUGAGGCCC 667 hsa-miR- UGAGGUGGUAGGAUGUAGA 6134 668 ENSG00000252526 AAGACCCUUCAGCUGCAAACAACAGCUUCCUUGGUAGUUUAUGC AGCCUGUUUCUUGUAUGGGCUGCUCUAAGGGACCAUGGAGACAG GC 669 hsa-miR- AGCUGUAAUUAGUCAGUUUUCU 1537-5p 670 MIR7977 UUCCCAGCCAACGCACCAAAAAUGAUAUGGGUCUGUUGUCUGGA GAAAC 671 ENSG00000238843 AUACUUUUGUAGGUCAUAAGCUGAGGAUUGGGUUUUCAUGCUCU UGUGUGAGAUAUGCUUCUCUCAAACCUUCUGACCUGGGCACAUU ACCCAGCUAAUG 672 hsa-miR- CCUGCGUGUUUUCUGUCCAA 4520a-5p 673 ENSG00000238772 UUCCUGUUGGUUCCUAAGUGUGAUGAUUGGGUUUUCACAUUCAU GUGUGACAUGUGCCUCCCUCAAAUCUUGUGAUGAUGUCGGCACG UGACCCAUCUGACG 674 MIR593 CCCCCAGAAUCUGUCAGGCACCAGCCAGGCAUUGCUCAGCCCGUU UCCCUCUGGGGGAGCAAGGAGUGGUGCUGGGUUUGUCUCUGCUG GGGUUUCUCCU 675 hsa-miR-554 GCUAGUCCUGACUCAGCCAGU 676 SNORD87 ACAAUGAUGACUUAAAUUACUUUUUGCCGUUUACCCAGCUGAGG UUGUCUUUGAAGAAAUAAUUUUAAGACUGAGA 677 hsa-miR-624- CACAAGGUAUUGGUAUUACCU 3p 678 ENSG00000252777 UGGACAUUUAUUUUUAUUCAGUUUUUUCUCAAGGUGAAGGUAAC UGUUUGUAGAUGUCCUAGAGAAAUAUUGUAGCUUUCUGUUCACC CUUUGCAACUAAAAAGCAUGGACUGUUCCACUACUGAGAUUU 679 ENSG00000206649 CUUCCCAUUUAUUUGCUGCUAUAGUCUCAUAAUGAUACAAGCAG UUAUAUGCAUGGGAUAAAAUAAUAUUGGGACAUUGUAAAUUGAA AUGAAGUAACCAUUUUCAUCUUUUCUGCAUGGACAAGACAUUG 680 MIR4534 UGUGAAUGACCCCCUUCCAGAGCCAAAAUCACCAGGGAUGGAGG AGGGGUCUUGGGUACU 681 hsa-miR- UUGAAGAGGAGGUGCUCUGUAGC 4709-3p 682 MIR548F4 GAGUUCUAACGUAUUAGGUUGGUGCAAAAGUAAUAGUGGUUUUU GCCAUUAAAAGUAAUGACAAAAACUGUAAUUACUUUUGGAACAA UAUUAAUAGAAUUUCAG 683 ENSG00000200422 GGUCAAUGAUGUAAUGGCAUGUAUUAGCUGAAUCCAAAGUUGAA GUGAAUUCUAAAAUUACACCAAGACCUU 684 MIR16-1 GUCAGCAGUGCCUUAGCAGCACGUAAAUAUUGGCGUUAAGAUUC UAAAAUUAUCUCCAGUAUUAACUGUGCUGCUGAAGUAAGGUUGA C 685 ENSG00000212391 UGUUCUGACAUGGGAAGAGUAGCUUCUGGUUGGUGGAGCCCAUC UCACAUUAGCCAGAGACAAAGCAACACCUUGUUUAUCCCGGCUU GGCUUUUGGCCUGUGUCCAUGACUGGUCCAUACCUUGGACACAU GG 686 ENSG00000238983 AUCCCUUUGUAGUUCAUAAGCGUGAUGAUUGGGUGUUCAUGCUC AUACAUGAGCUGUGCCUCCCUCAAGCUUUGUUGUGACAUCAUCA UAUUACCUGUCCGAUG 687 hsa-miR- ACCUGCCAGCACCUCCCUGCAG 4722-3p 688 MIR6802 GAGGGCUAGGUGGGGGGCUUGAAGCCCCGAGAUGCCUCACGUCU UCACCCCUCUCACCUAAGCAG 689 hsa-miR-141- CAUCUUCCAGUACAGUGUUGGA 5p 690 MIR3124 GCGGGCUUCGCGGGCGAAGGCAAAGUCGAUUUCCAAAAGUGACU UUCCUCACUCCCGUGAAGUCGGC 691 hsa-miR-569 AGUUAAUGAAUCCUGGAAAGU 692 ENSG00000212604 UGCCCCUGACCUGGGAAGAGAGGGGCCUGGCUGGUGGUAUCCAU CUCAUACCAGCUAGGGAUGAAGAAACCGCUUGCUCAUCCCAGCCU GGCUCCUGGUCUAUGCCCAUGCCUGGUUCGUGCCUUGGACAUAUC A 693 hsa-miR- AGAGCUGGCUGAAGGGCAG 4487 694 MIR2110 CAGGGGUUUGGGGAAACGGCCGCUGAGUGAGGCGUCGGCUGUGU UUCUCACCGCGGUCUUUUCCUCCCACUCUUG 695 MIR5010 GAUCCAGGGAACCCUAGAGCAGGGGGAUGGCAGAGCAAAAUUCA UGGCCUACAGCUGCCUCUUGCCAAACUGCACUGGAUUUUGUGUC UCCCAUUCCCCAGAGCUGUCUGAGGUGCUUUG 696 ENSG00000251715 UUCACAAUGUCUAUUGAAGGAUCUCAUCACCUUUAGAGAGCUGU GGUCAUGCCCCUUAAAGUGAAUUUGGAGGUUUUAUACCC 697 MIR640 GUGACCCUGGGCAAGUUCCUGAAGAUCAGACACAUCAGAUCCCU UAUCUGUAAAAUGGGCAUGAUCCAGGAACCUGCCUCUACGGUUG CCUUGGGG 698 MIR4424 CUUACAUCACACACAGAGUUAACUCAAAAUGGACUAAUUUUUCC ACUAGUUAGUCCAUUUCAAGUUAACUCUGUGUGUGAUGUAGU 699 ENSG00000238685 AUUACUUGAAAAUCACUCCCAGGCUUUGGCCAUGGCAGCAGGUG AGAUUCAAGGCCCAGAGCCUCCAGGGCCUCAGCUCACCGCACACU GCCCCGUGUGUGGUGGGGAAACCCAGACCCCAACAGGU 700 hsa-miR-370- CAGGUCACGUCUCUGCAGUUAC 5p 701 MIR1307 CAUCAAGACCCAGCUGAGUCACUGUCACUGCCUACCAAUCUCGAC CGGACCUCGACCGGCUCGUCUGUGUUGCCAAUCGACUCGGCGUGG CGUCGGUCGUGGUAGAUAGGCGGUCAUGCAUACGAAUUUUCAGC UCUUGUUCUGGUGAC 702 ENSG00000252699 CCUCUUCUCAGAACACUUCCUGGGUCUGAUUGGUGGCCCAGGGA GCUGUCAGAGAAGAGCAGAGCAAAUGGCCUUCACUUUGUAGAUG AGAUGGCAGGAGGGUGGAUUGUUGGUCUCAGUCAGUGGUGGGAC AGAC 703 hsa-miR-433- AUCAUGAUGGGCUCCUCGGUGU 3p 704 ENSG00000201448 AAGCAGGAUUCAGACUACAAUAUAGCUGUUAAGUGCUGUAUUGU CAUUCCCCCUGCUCAAAUUAAAGUUGUUUCUUAACUAUACCCAUC UGCUAUUCUGUAGCAGCCAGGGAUGCUUGGUCACAUACAU 705 SNORA43 GCUGUCCUGGACCUGUUGGCACCACAGACAGUUGCUCUGCUGUGC CUGUGGCCUCGGGGCAAAGAGAAAGUGGCGAUUUCUACACUCAG UGCUCGGGAACCAGUGGGCACUGAGAAUGGUUUAUGGCCUGACA UUA 706 ENSG00000201398 UCGUCAGGUGGGAGAAUUCUUACAUGUUCCUCCUUUUGCAAGGC AGAUUAGAACAUGAUGAUUGGGGUUCGCAUAAUAUGUGAUUAAC GUUUCUGUGUAAUCAGGACUUGCAACAUCCCGAAUGCCCUUACC UGAC 707 ENSG00000252740 UGGAUCAAUGAUGACAAAGUAUCAUGAAUGAGGGAUUGUGAAUA AUCUAUUUUUAUGAACCUGUGGUCAAAU 708 hsa-miR-654- UAUGUCUGCUGACCAUCACCUU 3p 709 MIR3667 UGAGGAUGAAAGACCCAUUGAGGAGAAGGUUCUGCUGGCUGAGA ACCUUCCUCUCCAUGGGUCUUUCAUCCUCA 710 ENSG00000238978 AUCCUUUUGUGGUUCAUAAGAAUAGGGAUUGGGAUUUCACACUC AUGUGUGAGAUGUGCCUCCCUUAAACCUUAAGAUGUUGGCACAU UACCUAUUUGAUG 711 ENSG00000238494 AUCCUUUUGUAGUUCAUUAGCAUAAUGAUUGGGUUUUCACACUC AGGCGUGAGAUGUGCCUCUCUCAAACCUUGCUACGAUGUUGGCA CAUUGCCUAUCUGGCA 712 hsa-miR- AAAAUUUCUUUCACUACUUAG 3606-3p 713 hsa-miR-335- UUUUUCAUUAUUGCUCCUGACC 3p 714 ENSG00000252458 CUCAUACCUAAACCCAAGAAUCACUUUCUUAUAGUGAUGAUUUA AACAGAUGCAAACAGCGAGCACAUCUUGUCACCUUUGCGGGACU GUGGCUGUGCCCCUCGCAGUAAAUUUGGAGGUUCUACAUCC 715 MIR1297 UGUUUAUCUCUAGGGUUGAUCUAUUAGAAUUACUUAUCUGAGCC AAAGUAAUUCAAGUAAUUCAGGUGUAGUGAAAC 716 ENSG00000201329 AAUGCUAUACUUUCAUGGGUCAUUUCUAUAGUUUGUUAUUAGAG AAGUUUCUCUGAAUGUGUUGAGCACCAGAAACCACGAGGAGAUG CAGCAUUCUCUCCUGAACGGGAAGCCAGCUUUUGGCAUUGCUUU GAUGCAACUACCAUUUGCCAUUGAUGGCAAUGCAUCGCUUCCUC UAGGAGUGUAAGAGGGAGUGGAUGCAGUCAGAGUGG 717 MIR30B ACCAAGUUUCAGUUCAUGUAAACAUCCUACACUCAGCUGUAAUA CAUGGAUUGGCUGGGAGGUGGAUGUUUACUUCAGCUGACUUGGA 718 ENSG00000251778 UUCAAAAAAGACCAUAUAUCCUUGAAGAGUAACUGCUGAACUUA UUCACUGGCAGUGGGCCUUAUAGCACAGUGAAUGACCAGGUUAG AGACAUGC 719 hsa-miR-544a AUUCUGCAUUUUUAGCAAGUUC 720 ENSG00000201388 UUCUCACCUAAACCCAAGAAUCACUGUUUCUUAUAGCGGUGGUU UAAACAGAGGUGCAAACAGCAAGUGAAUCUCGUCGCCUUUGCGG GGCUGUGGCCAUGCCCCUCAAAGGAAAUUUGGAGGCUCUACAGC C 721 MIR767 GCUUUUAUAUUGUAGGUUUUUGCUCAUGCACCAUGGUUGUCUGA GCAUGCAGCAUGCUUGUCUGCUCAUACCCCAUGGUUUCUGAGCA GGAACCUUCAUUGUCUACUGC 722 MIR642A AUCUGAGUUGGGAGGGUCCCUCUCCAAAUGUGUCUUGGGGUGGG GGAUCAAGACACAUUUGGAGAGGGAACCUCCCAACUCGGCCUCU GCCAUCAUU 723 MIR454 UCUGUUUAUCACCAGAUCCUAGAACCCUAUCAAUAUUGUCUCUG CUGUGUAAAUAGUUCUGAGUAGUGCAAUAUUGCUUAUAGGGUUU UGGUGUUUGGAAAGAACAAUGGGCAGG 724 ENSG00000207094 AUCCAAGGCGAUUCCCUCUCCAAGGGGACAUCUAGUGCCCCUCUC AGGAAAGUAGCAACUUGGAAUAGAAUCUGGCAUGCCUAAGGUCU UUGAGGAACAGGGAUGCUUAUUUCCUCUGCCUUCCUUGGCUGCC UACAUAG 725 MIR4645 UGAUAGGGAAACCAGGCAAGAAAUAUUGUCUCCUCAAGUUGCGA CGAGACAGUAGUUCUUGCCUGGUUUCUCUAUCA 726 MIR3155B CCACUGCAGAGCCUGGGAAGGGAGCUGUCCGGCUCCCCAGGCUCU GCAGUGGGAGG 727 SNORD96B CCUGGUGAUGACAGACGACAUUGUCAGCCAAUCCCCAUGUGGUA GUGAGGACAUGUCCUGCAGUUCUGAAGG 728 ENSG00000212587 UGCACUUAUGUAUGUUUUUGUUUAACUUGUGGACAAAGACUUUA GGAAAGGUGCAAAAAAUAAAUCUUCUUUUGCAACCCAGAACUCA UUGUUCAGUAUGAGUUUUGAUACAUAUCAGAAUGGAUACU 729 hsa-miR- UCACGCGGAGAGAUGGCUUUG 3186-3p 730 ENSG00000212490 UGCCCCUUUUAAGGUUGACACAGUGCAUUAAGCAGAAGGGUUAA GUAAGUCUCCAUAAAACCCAGAGAAGAGAAUGUAAAGCUCCUCU UUGGAGGAGCUAGACUCCUGUCUGGAGUCACAGCU 731 MIR3622B AGUGAUAUAAUAGAGGGUGCACAGGCAUGGGAGGUCAGGUGAGC UCAGCUCCCUGCCUCACCUGAGCUCCCGUGCCUGUGCACCCUCUA UUGGCU 732 hsa-miR- CGGGGCCAUGGAGCAGCCUGUGU 6762-5p 733 MIR3938 AGGAAUUUUUAACCCGAUCACUAGAUUAUCUACAAGGGAAUUUU UUUUUAAUUUAAAAAAUUCCCUUGUAGAUAACCCGGUGGUCAGG UUGGAUGGCUCCAUG 734 ENSG00000207100 UGCACUGCGUGGUAUCUGCACUCAGCAGUUUACUCCUGCUAGGG UGUUCAAAGGUCAGUGCCAUAGAAAUCCAGUAUCUGGUUUCAUU GGUUUUCUUGGCUUUGUGCUUGUUAAACCUGGUAUUUCUAUUGA UACAGCA 735 hsa-miR- GUUGGGACAAGAGGACGGUCUU 3122 736 hsa-miR- AAAAACUGCAAUCACUUUUGC 548az-3p 737 hsa-miR- AGGUGGAUGCAAUGUGACCUCA 3165 738 MIR2277 GUGCUUCCUGCGGGCUGAGCGCGGGCUGAGCGCUGCCAGUCAGCG CUCACAUUAAGGCUGACAGCGCCCUGCCUGGCUCGGCCGGCGAAG CUC 739 MIR519C UCUCAGCCUGUGACCCUCUAGAGGGAAGCGCUUUCUGUUGUCUG AAAGAAAAGAAAGUGCAUCUUUUUAGAGGAUUACAGUUUGAGA 740 hsa-miR- UGCAACUUACCUGAGUCAUUGA 891b 741 hsa-miR- CACUGUUUCACCACUGGCUCUU 4676-3p 742 MIR5692C1 UAUAACAUUGUAUAUACCCACUGUGAUAUUAAGAGUAAUAGCUC UCUAGGUUAUUAUGAAUAAUAUCACAGUAGGUGUACACAAUGUU GUA 743 SCARNA15 CUGGAGACUAAGAAAAUAGAGUCCUUGAAAUCAAGCUGACUCUG CUUUUAGCCUCCUAAAUGAAAAGGUAGAUAGAACAGGUCUUGUU UGCAAAAUAAAUUCAAGACCUACUUAUCUACCAACAGCA 744 MIR3972 GCCCAUUUGCCUUGGCUUGGGGUGGCAGUCCUGUGGGAAUGAGA GAUGCCAAACUGGACCUGCCAGCCCCGUUCCAGGGCACAGCAU 745 ENSG00000201348 CCAAUGUGGAUACACCCAGGAGGUCACUCUCUCCCCAGGCUGUGU CCAAGUAGCAUAGGGGAGCACAGGGCUCUGUCCCCAUGAUGUAC UGUCCUUUUCCAUGACAUUGGAGAUGAAGCUGGACCUCAACUCU GCACAUGCAUAUUCCUACAACUUCUCAGAGUCCUGUGGAUAAUG ACGGAGGAGAGAAACCAUGCAGGAAACAGCC 746 ENSG00000252040 UGAGAUGAGAUCAUGCCAUUGCACUCCAGCCUGGACGACAGAGC GAGACUUCAUCUCAAAAAAAAAAAAGGAUCCUCAGGGCUGCCAA CCUUAUAGUAGAAGUUGAGGUGGUAGUGGAUUUCUCCUACACAA 747 MIR8074 CCAGUUCCUGAGUUUAUGCAAGAUGCCCAUGGGAGCCCAGAGAC GUCCUAUGGCGAGACUGGCAUGUACUCACACAACUGA 748 hsa-miR- AUUGCCUCUGUUCUAACACAAG 3152-5p 749 MIR619 CGCCCACCUCAGCCUCCCAAAAUGCUGGGAUUACAGGCAUGAGCC ACUGCGGUCGACCAUGACCUGGACAUGUUUGUGCCCAGUACUGU CAGUUUGCAG 750 ENSG00000202275 AUUGUUACAUUGAUAAAAUCAAAUCACCAUCUUUUAGCUAAGCU UGUGCUGGAUUUGCUUUUUUUCUGAUAAAGAUG 751 hsa-miR-759 GCAGAGUGCAAACAAUUUUGAC 752 MIR514A1 AACAUGUUGUCUGUGGUACCCUACUCUGGAGAGUGACAAUCAUG UAUAAUUAAAUUUGAUUGACACUUCUGUGAGUAGAGUAACGCAU GACACGUACG 753 MIRLET7A3 GGGUGAGGUAGUAGGUUGUAUAGUUUGGGGCUCUGCCCUGCUAU GGGAUAACUAUACAAUCUACUGUCUUUCCU 754 hsa-miR- AAAGUGCAUCCUUUUAGAGUGU 519a-3p 755 hsa-miR- CCGCACUGUGGGUACUUGCUGC 106b-3p 756 SNORD116- UGGAUCGAUGAUGACUUCCAUAUAUACAUUCCUUGGAAAGCUGA 20 ACAAAAUGAGUGAAAACUCUAUACUGUCAUCCUCGUCGAACUGA GGUCCA 757 hsa-miR- UCUGUGAGACCAAAGAACUACU 4677-3p 758 ENSG00000252298 UGGACAAAUGAUUAGAUUAGAUUGUGUUAUAAACCAAAGAUUAU AGUUAUUCCAAUUAUGUGCAUUUGAGAUCCACU 759 MIR493 CUGGCCUCCAGGGCUUUGUACAUGGUAGGCUUUCAUUCAUUCGU UUGCACAUUCGGUGAAGGUCUACUGUGUGCCAGGCCCUGUGCCA G 760 hsa-miR- GAGCUUGGAUGAGCUGGGCUGA 4538 761 ENSG00000212284 UCAAUAAUGAAAUCUUCUGAUUUGGUGAGAAAUAAUGCCUUAAA AUUACACUCAAUAGGAUUAUGCUGAGG 762 hsa-miR- AGACAUCAAGAUCAGUCCCAAA 3913-3p 763 MIR4714 AUUUUGGCCAACUCUGACCCCUUAGGUUGAUGUCAGAAUGAGGU GUACCAACCUAGGUGGUCAGAGUUGGCCAAAAU 764 hsa-miR- AAGUCCUGCUUCUGUUGCAG 6838-3p 765 hsa-miR- AUGGAGAAGGCUUCUGA 4531 766 hsa-miR-5481 AAAAGUAUUUGCGGGUUUUGUC 767 hsa-miR-326 CCUCUGGGCCCUUCCUCCAG 768 hsa-miR- CCAAGGAAGGAGGCUGGACAUC 6830-5p 769 MIR6780A GACACUUGGGAGGGAAGACAGCUGGAGAGUAUGGUCACAGCAGC AUCCUCCUCUGUUUUCUUUCCUAG 770 hsa-miR- AGGGAAGGAGGCUUGGUCUUAG 4747-5p 771 hsa-miR- CCAGAGAUGGUUGCCUUCCUAU 4756-3p 772 hsa-miR- AUCGUGCAUCCCUUUAGAGUGU 517a-3p 773 MIR3160-2 ACCUGCCCUGGGCUUUCUAGUCUCAGCUCUCCUGACCAGCUGAGC UGGAGGAGAGCUGAGACUAGAAAGCCCAGGGCAGGU 774 hsa-miR- AGCAGCAUUGUACAGGGCUAUGA 103a-3p 775 hsa-miR-588 UUGGCCACAAUGGGUUAGAAC 776 ENSG00000212378 AUGUAAUAAUGUUCAUCAAAUGUCUGACCUGAAAUGAGCAUGUA GACAAGUUAAUUUAACACUGAAGAA 777 MIR548BA AAAGGUAACUGUGAUUUUUGCUAUUAGAAAGUAAUGGCAAAAAC UGCAAUUACUUU 778 hsa-miR- CGCUUUGCUCAGCCAGUGUAG 1251-3p 779 ENSG00000238995 AACCAUGAAUGCAAGAAGCGUAUGAUUGGGUUUUCAUGCUCACG UGUGAAAUGGACCACCCUCAAACCUGGUUAUGCUAUCAGCACAU UACCUGUCUGAUG 780 SNORA64 ACUCUCUCGGCUCUGCAUAGUUGCACUUGGCUUCACCCGUGUGAC UUUCGUAACGGGGAGAGAGAGAAAAGAUCUCCUCAGGACCUCGG AUGGGCCUUACUGUGGCCUCUCUUUCCUUGAGGGGUGCAACAGG C 781 hsa-miR-206 UGGAAUGUAAGGAAGUGUGUGG 782 MIR5708 AUUACAGACAUGAGCGACUGUGCCUGACCAAAAGUCAACAUUAA ACAACAAAUCUUGGCCAGGCACAGUGGCUCAUGCCUGUAAU 783 hsa-miR- CCUCUGAAAUUCAGUUCUUCAG 146a-3p 784 ENSG00000238488 AUCAUUUUGCAGCUUAUACAUGUGAUGACUGGGUUUUUUAACUC AUAAGUGAGAUGUGCCUUUCUUACAUCUUAUUAUGACAUUAGUA CAUUACCCAUUUGAUA 785 MIR519D UCCCAUGCUGUGACCCUCCAAAGGGAAGCGCUUUCUGUUUGUUU UCUCUUAAACAAAGUGCCUCCCUUUAGAGUGUUACCGUUUGGGA 786 ENSG00000202498 AGAUCAUUGAUGACUUCCAUAUAUCCAUUCCUUGGAAAGCUGAA CAACAUGAGUGAAAACUCUACUGAAAAAAGAAAAGAAAUGGGAG GCCG 787 hsa-miR-429 UAAUACUGUCUGGUAAAACCGU 788 MIR5687 CCUCACUUAUCUGACUCUGAAAUCUUCUAAAUGGUACCCACUUU AUUUAGAACGUUUUAGGGUCAAAUAAGUACAGG 789 hsa-miR- UCAAGGCCAGAGGUCCCACAGCA 3922-5p 790 hsa-miR- UCGGGCGCAAGAGCACUGCAGU 6499-5p 791 MIR3683 GGGUGUACACCCCCUGCGACAUUGGAAGUAGUAUCAUCUCUCCCU UGGAUGCUACGAACAAUAUCACAGAAGGUGUACACCC 792 hsa-miR- AGGACUGGACUCCCGGCAGCCC 4515 793 hsa-miR- AGAGGCUGAGAAGGUGAUGUUG 6884-5p 794 ENSG00000202023 CAAAUACAUGAUGAUCUCACCUCAGUUUGAACUCUCUCACUGAU CACUUGAUGACAAUAAAAGAUCUGAUAUUGUG 795 SNORA17 ACUGCCCCUAGAGGCGUUGCAGCUGUGGCUGCCGUGUCACAUCUG UGUCAUUAGGUGGCAGAGAUUAGAGAGGCUAUGUCUACGCUCAG CGUUCUGCCCCGUGAACGUUUGAAUGUUUGAUAGUCUCACACUC 796 ENSG00000201811 GUCUGCAUUUGAAAGUGAUCAUCAGCUAGCCUGUGUCUUCGUCA UCGAUAGUACAGGCCGGUGAACUGCGCAAAGCAUUUUCUGCAUU UGGAGGGUCCAUCUCUAUCCUUGGAAAUGCUAGUGCUUUUCUCA CA 797 MIR3926-1 AAAAUGGAGCUGGCCAAAAAGCAGGCAGAGACUUUAAAAGCGUC UCUGCCUGCUUUUUGGCCAGCUCCGUUUU 798 ENSG00000200398 CAGUCAGUGUCGAGAACCUUAUAUUGUUCUGAAGAGAGGUGGUG ACUUAAAAAUCAUGCUCAAUAGGAUUACGCUGAGGCCC 799 MIR 1298 AGACGAGGAGUUAAGAGUUCAUUCGGCUGUCCAGAUGUAUCCAA GUACCCUGUGUUAUUUGGCAAUAAAUACAUCUGGGCAACUGACU GAACUUUUCACUUUUCAUGACUCA 800 hsa-miR- ACCGUGCAAAGGUAGCAUA 1973 801 ENSG00000238464 AUCCUUUUGUAAGUCAUAAGUGUGAUUGGGUUUUCAUGCUCUUG UGUCAAAUGUGCCUCCCUCAAACCUUGUUACGAAGUGGGCACAC UACCCACCUGAUG 802 SNORD114- AAGAUCAAUGAUGACUACUGUUAGUGUAUGAGUUACACAUGAUG 10 AAUACAUGUCUGAAACUCUGAGGUCCA 803 MIR6856 UGGAAAAGAGAGGAGCAGUGGUGCUGUGGCAGUGGCAGAGGUCG CUACAGCCCUGUGAUCUUUCCAG 804 ENSG00000239111 AACAUUUAAAAAAAUGUAUCAAGGCGUGGUGAUUAGGUUUUCAC ACUCAUGUGUGAGAUGUGCCUCCCUUGAACUUUGUUACAUUGGC ACUUUACCCAUUUGACA 805 hsa-miR-98- UGAGGUAGUAAGUUGUAUUGUU 5p 806 MIR1262 AUCUACAAUGGUGAUGGGUGAAUUUGUAGAAGGAUGAAAGUCAA AGAAUCCUUCUGGGAACUAAUUUUUGGCCUUCAACAAGAAUUGU GAUAU 807 ENSG00000199262 GUGCAUGUGAUGAAGCAAAUCAGUAUGAAUGAAUUCAUGAUACU GUAAACGCUUUCUGAUGUA 808 MIR548AY AGAAGAUGCUUACUACUAGGUUGGUGCAAAAGUAAUUGUGGUUU UUGCAUUUAAAGUAAUGGCCAAAACCGCGAUUACUCUUGCACGA ACCUAACGGUAACACUUCU 809 MIR4754 ACGCGCCUGAUGCGGACCUGGGUUAGCGGAGUGAGGCCCAGUGG UCACCGCCGCCCUCCGCAGGUCCAGGUUGCCGUGCGCAUGUGCCU 810 MIR2113 UUUUCAAAGCAAUGUGUGACAGGUACAGGGACAAAUCCCGUUAA UAAGUAAGAGGAUUUGUGCUUGGCUCUGUCACAUGCCACUUUGA AAA 811 MIR144 UGGGGCCCUGGCUGGGAUAUCAUCAUAUACUGUAAGUUUGCGAU GAGACACUACAGUAUAGAUGAUGUACUAGUCCGGGCACCCCC 812 ENSG00000212211 AAGAUUAUAUUUCCAGGGGUCAUUUCUGUGGUUCAUUACUUAAA GGAGUUUCCCCAAGUGUGUAGAGCACUGGAAACCACAGGAAGAU AUGCAAUGUUCUCUCCCGAGCACGAAGCUCGUUCUUGGUGUUGC UUCAUUGCAACUGCCAUUUGCCAUUGAUCAUUGUUUUUUUCUUC CUUUGGGGAGAUUAAGAGGAAGAGGACACAGUCUGAGUGA 813 MIR3135A UCACUUUGGUGCCUAGGCUGAGACUGCAGUGGUGCAAUCUCAGU UCACUGCAGCCUUGACCUCCUGGGCUCAGGUGA 814 ENSG00000207130 CUCCAUGUGUCUUUGGAACCUGUCAGCUGUGGCAGUUGCCCUUCC UAGCCAUGGAAGAGUAAGUAUAUUCUUGUUUAUUGGCAAAGCUG UCACCAUUUCAUUGGUAUCAGAUUCUGACUUGCACAAGUAACAU UC 815 ENSG00000238440 AUCCUUUUGAAGUUCAUAAGCAUGGUGAUUGGGUUUUCACACUC AUGUGUGAGAUGUACCACCCUUAAAGCUUGUUAUGAUGUAGGCA CAUUACCCAUCUGACA 816 MIR128-1 UGAGCUGUUGGAUUCGGGGCCGUAGCACUGUCUGAGAGGUUUAC AUUUCUCACAGUGAACCGGUCUCUUUUUCAGCUGCUUC 817 SCARNA5 AGGUCGAUGAUGAUUGGUAAAAGGUCUGAUUGCACUGAAUGUCA CGGUCCCUUUGUUGCCCUCAACUCCCAGCAGCCCAUUUUUUCCCU CCCGUCACAUUUAAGUCAUGUGUAUGGGAUCAUGGAGCAGCUGA UAAUUUGGGAUUCUGUCAGUGUGUGUUUCUGAGAGUGAUCGGCU CACAGCUGACGAGUAUCCAACAAAACCAGUUACACAGGAGACUG ACGAGUGGCAGUCAUGGGUGUGAUGGUGCAUGAUCUCAAGUUUU CAAUCUGAGACCU 818 MIR4432 GCAUCUUGCAGAGCCGUUCCAAUGCGACACCUCUAGAGUGUCAUC CCCUAGAAUGUCACCUUGGAAAGACUCUGCAAGAUGCCU 819 MIR569 GGUAUUGUUAGAUUAAUUUUGUGGGACAUUAACAACAGCAUCAG AAGCAACAUCAGCUUUAGUUAAUGAAUCCUGGAAAGUUAAGUGA CUUUAUUU 820 hsa-miR-586 UAUGCAUUGUAUUUUUAGGUCC 821 hsa-miR- UGAAGUUACAUCAUGGUCGCUU 4670-3p 822 ENSG00000212620 GACUUCUCACUGAGCUUCUUUCUGUCUGUUGCUGGCAGCUUAUG GAUUCAUAUGAGCAGAGAGAAUCACAGAACUAGCAUUACUUUUG UCUUUACAGGAGUAUAUUUGGCUGUCUUGUGAGAUAUUA 823 SNORD114- UGGUUCAGUGUUGACUACUGGUGUCGUGUGAGUCAUACAAUGAA 27 UACAUGUCUGGAACUCUGAGGCCCA 824 ENSG00000251737 AUCCUUUUGCGGUUCAUAAAGAACCAAGAUGACUGGGUUUCAUG CUAAUGCAUGACAUGUGCCUCCCUCAAAUCAUGUUGCCUCAUGG GCUUAUUGGCACAUUACCGUCUGAGG 825 hsa-miR- GGUGGGGGGUGUUGUUUU 4472 826 hsa-miR- UGCUGGCUCAUUUCAUAUGUGU 5580-5p 827 hsa-miR- CAAAAGUAAUUGUGGAUUUUGU 548n 828 MIR6841 GUGUUUAGGGUACUCAGAGCAAGUUGUGAAACACAGGUGUUUUU UAACCUCACCUUGCAUCUGCAUCCCCAG 829 MIR4434 UCACUUUAGGAGAAGUAAAGUAGAACUUUGGUUUUCAACUUUUC CUACAGUGU 830 ENSG00000212558 GUGCCUAAGGUUAACACAGCGCCUUAAGAGGCUAACACAGAAGG GCAAAGUAAGUCUCCAUAAAACCCAGAGAAGACUGUGAACCCCU CUCUGGAUCCUGUCUGGAGUCACAGCU 831 ENSG00000252517 CCUCAUUGAUUAGUAGCUUCUGACUUUUGUUCUGAGUUUGCUGA AGCUAGAUGCCAUUCCAGAUAAGA 832 hsa-miR- UGGGAGAGCAGGGUAUUGUGGA 6731-5p 833 ENSG00000212270 GUUCAUGAUAAGUAACAUUUCUUCAAUUUGACCUGAUGUGUAUU GAAGAAAACCAGCAUCUGAGG 834 hsa-miR- CGUCUUACCCAGCAGUGUUUGG 200c-5p 835 5 hsa-miR- UAUGGAGUGGACUUUCAGCUGGC 6514-5p 836 SNORD115- GGGUCAAUGAUGAGAUGUUACCUUGAAGAGAAAUGAUGACGUAA 48 AAAUUAAGUUCAGUUGGAUUACGCUGAGGCCC 837 MIR6779 GAGCUCUGGGAGGGGCUGGGUUUGGCAGGACAGUUUCCAAGCCC UGUCUCCUCCCAUCUUCCAG 838 hsa-miR- UUGCCAUGUCUAAGAAGAA 4659b-5p 839 MIR1915 UGAGAGGCCGCACCUUGCCUUGCUGCCCGGGCCGUGCACCCGUGG GCCCCAGGGCGACGCGGCGGGGGCGGCCCUAGCGA 840 MIR573 UUUAGCGGUUUCUCCCUGAAGUGAUGUGUAACUGAUCAGGAUCU ACUCAUGUCGUCUUUGGUAAAGUUAUGUCGCUUGUCAGGGUGAG GAGAGUUUUUG

The selection of the sncRNAs in the Sentinel™ PCa, CS and HG Tests is independent of PSA, Gleason Score, or biological pathway analysis, and as such is entirely unbiased. Because the algorithm was validated using sncRNA levels obtained from an independent training set made up of a cohort of participants whose core needle biopsy is positive or negative (PCa Test), or patients labeled as either having advanced disease (GG3-GG5) or not (No evidence of PCa or GG1-GG2) for the CS Test. (see Table 4, [000103]-[000104] and Table 5 [000109]-[000110]), this statistical methodology minimizes both Type 1 error (false negative) and Type 2 error (false positive), which ensure that the tests rigorously distinguish between none and low-grade cancer, low and intermediate grade cancer and between intermediate and high-grade disease. Based on the algorithm used in the analysis, the described invention has no false negatives and a very low (<5%) false positive rate.

Based on the three tests described above, the OpenArray™ platform sequentially interrogates the informative RNA entities present in a single sample of sncRNA extracted from urinary exosomes without compromising sensitivity and specificity of the three tests.

In one aspect, the disclosure provides a method for diagnosing prostate cancer comprising a platform that allows one to distinguish between clinically significant tumors and indolent tumors and in putting the data based on a subset sncRNAs interrogated into an algorithm that has been validated based on an independent training data set.

In one aspect, the method for diagnosing prostate cancer in a male patient comprising (1) obtaining a biological sample from the patient, (2) detecting the aggregate expression profile of a collection of signature small non-coding RNAs (sncRNAs) that bind to a plurality of nucleic acids or hybridizing probes selected from the group consisting of SEQ ID NOs: 1-280; and (3) correlating the aggregate expression profile of the collection of signature sncRNAs using the PCa test to determine whether the patient is at risk for prostate cancer, i.e., having no evidence of prostate cancer or having prostate cancer.

In another aspect, the disclosure provides a method for screening prostate cancer using the same. In yet another aspect, the disclosure provides a method for predicting the probability of prostate in a subject.

For patients identified to be at risk for prostate cancer (i.e., determined to have prostate cancer), the samples are re-analyzed using the Sentinel™ Clinical Significant (CS) Test to distinguish patients with clinically significant or aggressive prostate cancer (GG2-GG5) from patients with clinically insignificant or indolent (GG1) prostate cancer. In one embodiment, the patient is identified as having aggressive prostate cancer when the aggregate or combined expression profile of a plurality or a collection of signature sncRNAs is higher than or equal to the aggregate expression profile in a prostate cancer biological sample, or identifying the subject as having low-grade prostate cancer when the aggregate expression profile of a collection of signature sncRNAs is less than or equal to the aggregate expression profile in a low-grade prostate cancer biological sample.

In one embodiment the biological sample includes, but is not limited to, prostate tissue, blood, plasma, serum, urine, urine supernatant, urine cell pellet, cerebrospinal fluid, semen, prostatic secretions, and prostate cells. In some embodiments, the biological sample is a urine sample. In yet another embodiment, the samples are exosomes isolated from the urine sample. In a preferred embodiment, the sample is sncRNAs isolated from exosomes derived from the urinary sample.

Exosomes are small extracellular vesicles (EV) that originate in the endosomal compartment of eukaryotic cells. They are found in biological fluids including blood, urine, semen and cerebrospinal fluid. The biogenesis of exosomes is not well-understood; however, it is generally accepted that they arise at the point at which the early endosomal pathway bifurcates to form late endosomes and multi-vesicle endosomes, the first stages of the exosomal pathway. Exosomes contain sncRNAs including miRNAs and small nucleolar RNAs (snoRNAs) which are derived from the cytoplasm and nucleolar region of the cell respectively. The presence of exosomes and EVs in the tumor microenvironment have been associated with malignancy in a number of tumor types including prostate cancer and other cancers.

In certain embodiments, sncRNAs isolated from exosomes are derived from semen, blood, prostatic secretions and cerebral spinal fluid. In further embodiments, the exosomes are isolated from the cancer cells including prostate cancer cells, lymphocytes, and cells from prostate tissues.

The method for isolating exosomes is well known in the art and can be carried out using kits such as the Exosome RNA Isolation Kits (Norgen Biotek Corp., Ontario, CA). sncRNA yields can be quantified by fluorimetry (Qubit, Thermo Fisher Scientific), and the quality of the sncRNAs isolated is assessed using an Agilent 2100 bioanalyzer.

The RNAs extracted from the isolated exosomes are a mixture of small RNAs referred collectively as small non-coding RNAs (sncRNAs), which include miRNA, snoRNA, scaRNA, siRNA, snRNA, and exRNA. Due to their size (<200 nucleotides), the sncRNAs are readily extracted from biological samples, including, for example, Formalin-Fixed Paraffin-Embedded (FFPE) tissue or urine. The sncRNAs are not degraded during fixation or extraction, obviating the problems intrinsic in extraction of mRNA from FFPE tissues. Yield of sncRNAs of about 10 ng from the biological samples is sufficient for multiple analyses using the Exosome RNA Isolation Kits of Norgen disclosed above.

The extracted sncRNAs are reversed transcribed into cDNAs, which are more stable than RNA, thus allowing for longer storage. The resulting cDNAs are hybridized against a selected set or collection of signature sncRNAs probes or genome array or micro-array chips, such as the miR 4.0 arrays (ThermoFisher Scientific) for further analysis. The selection of the informative set of sncRNA probes is independent of PSA, Gleason Score or biological pathways. The selected set or collection of signature sncRNAs comprises SEQ ID NOs: 1-280, 281-560 and 561-840. The number of sncRNA sequences or probes in the set range from 145-196, preferably not more than 280 sncRNA sequences for in putting in the Sentinel™ PCa, Sentinel™ CS and Sentinel™ HG Tests. The tests are more precise with up to 280 sncRNA sequences added to the algorithm. Addition of more than 280 sncRNA sequences in the analysis does not increase the precision of the assay.

Real time-PCR or RT-PCR, often known as qPCR or RT-qPCR, is conducted to quantify the absolute amount of a target sequence or to compare relative amounts of a target sequence between samples. The RT-qPCR monitors amplification of the target in real-time via a target-specific (probes) fluorescent signal emitted during amplification. Because background fluorescence occurs during most RT-qPCR reactions despite the use of sequence specific probes against the targets, the issue of background fluorescence signal can be addressed by considering two values in real-time PCR: (1) the threshold line (Ct) and (2) the cycle quantification (Cq) value. The threshold line (CO is the level of detection when a reaction reaches a fluorescent intensity that is above background levels, that is a point where the reaction curve begins the exponential phase (inflexion point). The Cq or cycle quantification value is the PCR cycle number at which the sample's reaction curve intersects the threshold line. The Co value therefore, indicates how many cycles it took to detect a real signal from the samples, i.e., time to event, where the event is the saturation of the fluorescence, indicating the maximum level of detection. Because RT-qPCR runs provide a reaction curve for each sample, there will be many Cq values. The software in the PCR cycler will calculate and chart the Cq value for each of the samples. Values are inverse to the amount of target nucleic acid in the sample, and correlate to the number of target copies in the sample. Lower Cq values (typically below 29 cycles) indicate high amounts of target sequence. Conversely, higher Cq values (above 38 cycles) indicate lower amounts of the target nucleic acid in the sample. However, the time to event value can be obtained when the slope of the reaction curve becomes zero, rather than using the Cq when the slope is at a maximum. In one embodiment, the sncRNAs are interrogated using RT-qPCR. In another embodiment, the sncRNAs are interrogated using qPCR. In a further embodiment, the sncRNAs isolated from the urinary exosomes are interrogated using the Affymetrix GeneChip™ miRNA 4.0 Array following manufacturer's instructions.

These informative sequences obtained from the training set (usually 280 sequences) are then transferred to the OpenArray platform. Patient samples of unknown status are then interrogated on the OpenArray platform and the Sentinel Score is determined using the Classification Algorithm. The status of the patient is determined from the Sentinel Score.

In one embodiment, the sncRNAs levels from a patient of unknown prostate cancer status are interrogated on the OpenArray platform and Sentinel score from a patient of unknown disease status can then be compared to that of the training set to determine the status of the patient

In one embodiment, the data obtained from the analysis of sncRNAs levels in the test sample from a patient of unknown prostate disease status (test sample) using, for example, by RT-qPCR on the Affymetrix GeneChip™ miRNA 4.0 Array can be compared to those from healthy patients (no evidence of cancer) or healthy cells obtained from the subject with prostate cancer. The data obtained from the analysis of sncRNA levels from the test sample can also be compared to clinical baselines established by analyzing healthy (no cancer) and non-healthy (with genitourinary cancer) patients, and the non-healthy patients be further categorized into different specific cancer types, which can be further categorized into different stages or severity of the specific cancer type (for example, prostate cancer, etc.) and different stages of the specific disease. In one embodiment, the data obtained from the analysis of sncRNAs expression in the test sample using RT-qPCR on the Affymetrix GeneChip™ miRNA 4.0 Array are compared to those from healthy patients (no evidence of cancer) or the data can also be compared to clinical baselines established by analyzing healthy (no cancer) and non-healthy (with having prostate cancer) patients, and the non-healthy patients be further categorized into different stages of prostate cancer (GG1 and GG2-GG5 or GG1+2 and GG3-GG5).

In some embodiments, the method uses an OpenArray™ technology (ThermoFisher Scientific) to interrogate a panel of sncRNAs (e.g., miRNAs, snoRNAs). The OpenArray™ technology uses a microscope slide-sized plate having 48 subarrays. Each subarray has 64 through-holes, and each hole is 300 μm in diameter and 300 μm deep. The holes are treated with hydrophilic and hydrophobic coatings in order to retain reagents in the through-holes via surface tension. The OpenArray™ technology, with its 3,072 through-hole (48×64), provides a system for streamlining real-time PCR studies that use large number of samples, assays or both. The system thus allows processing of samples for gene expression in large number in short time periods using microquantities of samples and reagents. The method employs an algorithm that relies on the expression level of each of the sncRNAs and the grading of the biopsies (at least 12 core needle biopsies). In the case of prostate cancer, the methodology is independent of serum Prostate Specific Antigen (PSA) levels, Gleason Score (neither of which are meaningful markers of tumor progression), or patient age. The methodology is also independent of any analyses of biological pathways. The present method uses sncRNAs isolated from the subject (e.g., urinary sample, urinary exosome, or prostate tissue samples) to stratify men into those that have prostate cancer (both indolent (clinically insignificant) or aggressive (clinically significant)) and those that do not. This methodology can replace serum PSA as the major screening assay for prostate cancer.

In one aspect, the disclosure provides a method that distinguishes clinically significant prostate cancer based on the aggregate expression profile of a collection of signature sncRNAs interrogated that is subjected to a classification algorithm that is independent of pathology (Gleason Score), tumor volume or PSA. The RNA extracted from biological samples of patients with known cancer outcomes are reverse-transcribed and hybridized against a full-genome array (e.g., Affymetrix GeneChip miR 4.0) containing sncRNAs. Small non-coding RNAs that are differentially regulated in clinically significant prostate tumors are identified. In one embodiment, the absolute value of the signal from the Open Array identifying the sncRNAs that hybridize to the probes for SEQ ID NOs: 281-561, is compared to the aggregate expression profile found in clinically significant (GG2-5) prostate cancer tumors. In another embodiment, the aggregate value of the signal from the Open Array identifying the sncRNAs that hybridize to the probes for SEQ ID NOs: 561-840, is compared to the absolute expression profile found in clinically low and favorable intermediate grade (GG1+GG2) versus unfavorable intermediate and high grade (GG3-GG5) prostate cancer tumors.

The disclosed method provides a robust and accurate determination of prostate cancer prognosis in 72-96 hours from the time when the urine sample is received to obtaining a Sentinel Score. In another embodiment, the aggregate expression profile of the identified sncRNAs that bind to SEQ ID NOs: 281-560 and 561-840 is compared to the aggregate expression profile of sncRNAs in clinically significant prostate cancer. In another embodiment, the aggregate expression profile of the identified sncRNAs interrogated that bind to SEQ ID NOs: 281-560 and 561-840 is compared to the aggregate expression profile of sncRNA in clinically significant prostate cancer. In a further embodiment, the aggregate expression profile of the identified sncRNAs that bind to SEQ ID NOs: 281-560 and 561-840 is compared to the aggregate expression profile of sncRNA in clinically significant prostate cancer sample. As a result, appropriate treatment options (or the lack thereof) can be initiated.

The terms relative aggregate expression profile are used interchangeably. The aggregate expression profile of at least a plurality of sncRNAs are combined and compared to the same aggregate expression profile in clinically significant prostate cancer tissue. In some embodiments at least 40 sncRNAs are combined and compared to the same aggregate expression profile in clinically significant prostate cancer tissue. In some embodiments at least 90 sncRNAs are combined and compared to the same aggregate expression profile in clinically significant prostate cancer tissue. In some embodiments at least 150 sncRNAs are combined and compared to the same aggregate expression profile in clinically significant prostate cancer tissue. In some embodiments at least 200 sncRNAs are combined and compared to the same aggregate expression profile in clinically significant prostate cancer tissue. In a preferred embodiment, at least 224 sncRNAs and not more than 280 sncRNAs are combined and compared to the same aggregate expression profile in clinically significantly prostate cancer tissue. In certain embodiments, a higher aggregate expression profile as compared to the aggregate expression profile in a low-grade prostate cancer tissue indicates the patient has aggressive prostate cancer and treatment is required. In other embodiments, an aggregate expression profile equal to or lower than the aggregate expression profile in a low-grade prostate cancer tissue indicates the patient does not have aggressive prostate cancer and monitoring but not treatment may be required.

In some embodiments, the aggregate expression profile of selected sncRNAs is an aggregation of various types of modulated expression of the sncRNAs. The modulated expression can be decreased or increased expression profile relative to the same sncRNA in other tissue/tumor types, such as healthy prostate tissue, low-grade prostate cancer tissue, or high-grade prostate cancer tissue.

In other embodiments, the aggregate expression profile of selected sncRNAs can be an aggregation of the decreased aggregate expression profile of certain sncRNAs as well as an aggregation of the increased aggregate expression profile of other sncRNAs in the same tissue sample. For example, a progression score, or aggregate expression profile of a collection of signature sncRNAs may include one or more sncRNAs with decreased aggregate expression profiles relative to another tissue type or other sncRNAs in the same tissue sample, while one or more of the remaining sncRNAs exhibit increased aggregate expression levels relative to another tissue type or other sncRNAs in the same tissue sample. The aggregate expression profile of the collection of differently modulated sncRNAs provides a sophisticated, unbiased, indication of whether a prostate tumor is clinically significant. Unlike other methods that merely evaluate the presence or absence, or simple increase or decrease of individual target molecules, as compared to normal tissue, the methods disclosed provide a truly unbiased, independent, and multi-variable analysis of a prostate tissue sample thereby allowing for a surprisingly accurate diagnosis of whether a prostate cancer tumor is clinically significant.

In some aspect, the method provides for the use of the aggregate expression profile of the collection of signature sncRNAs for monitoring metastasis and cancer staging.

In another aspect, the disclosure provides a method for detecting a urological malignancy based on the aggregate expression profile of the collection of signature sncRNAs interrogated and subject to analysis using the classification algorithm disclosed. In some embodiments the malignancy is cancer of the prostate.

The present disclosure provides an algorithm-based molecular diagnostic assay for predicting a clinical outcome for a patient with prostate cancer. The expression level of one or more sncRNAs may be used alone or arranged into functional gene subsets to calculate a quantitative score that can be used to predict the likelihood of a clinical outcome.

A “quantitative score” is an arithmetically or mathematically calculated numerical value for aiding in simplifying or disclosing or informing the analysis of more complex quantitative information, such as the correlation of certain expression profile of the disclosed sncRNAs or sncRNAs subsets to a likelihood of a clinical outcome of a prostate cancer patient. A quantitative score may be determined by the application of a specific algorithm. The algorithm used to calculate the quantitative score in the methods disclosed may group the expression profile values of the sncRNAs. The grouping of sncRNAs may be performed at least in part based on knowledge of the relative contribution of the sncRNAs according to physiologic functions or component cellular characteristics, such as in the groups discussed herein. A quantitative score may be determined for a sncRNA group (“sncRNA group score” or the Sentinel™ Score). The formation of groups, in addition, can facilitate the mathematical weighting of the contribution of various aggregate expression profile of genes or gene subsets to the quantitative score. The weighting of a sncRNA or sncRNAs group representing a physiological process or component cellular characteristic can reflect the contribution of that process or characteristic to the pathology of the cancer and clinical outcome, such as recurrence or upgrading/upstaging of the cancer. The present invention provides a number of algorithms for calculating the quantitative scores. For example, the Classification algorithm in the present disclosure works the same way for developing the Sentinel Scores for distinguishing different disease states. The Classification algorithm selects different sncRNA sequences from the training data set for clinically significant and insignificant disease states.

On the other hand, the Selection algorithm test how well the aggregate expression profile of the urinary exosomal sncRNAs correlate with pathological stage of the disease (cancer/no cancer) in a large population of participants with known pathology. The Selection algorithm individually assessed how well each of the 6,599 sncRNAs interrogated on the miR4.0 arrays correlates the known pathology of the participant. It then iteratively assesses all combinations of 2 sncRNAs, 3 sncRNAs or 4 sncRNAs of the 6,599 sncRNAs interrogated by the miR 4.0 Arrays, followed by examination of each individual sncRNA using a leave-one-out strategy to assess the importance of each individual sncRNA in the pathology of the disease. The Sentinel Score for a patient with unknown disease status is then determined by interrogating selected sncRNAs using the Open Array and the clinical status is determined by comparing the score to that from the training data sets. In an embodiment of the invention, an increase in the quantitative score indicates an increased likelihood of a negative clinical outcome.

Based on the quantitative score and cumulative or absolute or aggregate expression profile, methods of treatment can also be decided. The methods of treating prostate cancer include surgery for complete surgical removal of prostate tissue, administering an effective dose of radiation, and administering a therapeutically effective amount of a medication for the treatment of prostate cancer, or a combination of the above.

The algorithm-based assay and associated information provided by the practice of the methods of the present invention facilitate optimal treatment decision-making in prostate cancer. For example, such a clinical tool would enable physicians to identify patients who have a low likelihood of having an aggressive cancer and therefore would require no further medical intervention except for a routine follow-up or active surveillance every 3 months, 6 months or 12 months. Patients with no cancer do not require medical intervention return for follow-up once every year. Patients at risk for developing aggressive cancer require medical intervention, which includes but is not limited to treatment with one or more chemotherapeutic agents (e.g., taxotere, cabazitaxel, docetaxel, mitoxantrone, epirubicin, paclitaxel and estramustine, etc.), hormone therapy (e.g., lutenizing hormone releasing hormone agonists to prevent production of testosterone such as leuprorelin, goserelin and triptorelin or anti-androgen drugs that prevent testosterone from reaching the cancer cells, e.g., bicalutaminde and nilutamide), immunotherapeutics, radiation, cryotherapy, surgery or a combination thereof.

Patients who undergo treatment are monitored using the disclosed method to determine the patients' response to treatment. In one aspect, the disclosure provide a method for determining the patient's response to treatment comprising: (i) obtaining a biological sample from a patient, (ii) detecting the aggregate expression profile of a signature collection of small non-coding RNAs (sncRNAs) from the biological sample wherein the collection of sncRNAs comprises SEQ ID NOs: 1-280, 281-560 and 561-840 (iii) correlating the aggregate expression profile of sncRNAs of SEQ ID NOs: 1-280, 281-560 and 561-840 from the subject after treatment by comparing the aggregate expression profile of SEQ ID NOs: 1-280, 281-560 and 561-840 to that prior to treatment, (iv) determining if the patient is responsive to the treatment, and if there is a need for modification of the treatment. In one embodiment, the method further compares the resulting aggregate expression profiles of a signature collection of sncRNA from (iii) above is then compared to the aggregate expression profile of a signature collection of sncRNA for the large training data set from a target population having prostate cancer with known Grade groups to determine if the (a) patient prostate cancer is stable (no apparent change compared to the Grade group), (b) the patient is responsive to the treatment, i.e., patient gets better (the results show tumors that resembles tumors with lower grade group) or (c) the patient is non-responsive (patient gets worse when the results show tumors that resembles tumor of higher Grade group) based on the aggregate expression profile of a collection of signature sncRNAs and Sentinel Score for that Grade group, and if there is a need for modification of the treatment. Treatment modification includes but not limited to adjusting the concentration or amounts of chemotherapeutic agents, radiation, immunotherapeutic or hormone administered, adding or removing one or more of agents used.

In another aspect, the disclosure provides a method for determining the disease recurrence, disease progression or likelihood of survival based on the aggregate expression profile of a collection of signature sncRNAs comprising SEQ. ID. NOs: 1-280, 281-560 and 561-840 by comparing the aggregate expression profile of SEQ. ID. NOs: 1-280, 281-560 and 561-840 in a training dataset and the patient's earlier profile.

In another aspect, the disclosure provides a system for determining whether a patient has no cancer or has cancer and classifying the subject with cancer as (i) indolent (low grade, GG1), (ii) intermediate or high grade (GG2-GG5), (iii) low/intermediate risk (GG1-GG2) or (iv) aggressive (high grade, GG3-GG5) prostate cancer comprising at least three processors configured to (a) interrogate sncRNA sequences for informative sequences, (b) determine and compare a Sentinel Score to determine if the subject has prostate cancer or no prostate cancer and to classify subject determined to have cancer to the various Grade groups, e.g., low grade, intermediate/high grade, low/intermediate risk or aggressive grade cancer. Subjects determined to have no evidence of cancer do not require medical intervention and would return for follow-up once every year. Subjects determined to have low grade or low/intermediate grade prostate cancer would require no medical intervention except for a routine follow-up or active surveillance every 3, 6, or 12 months, and subjects determined to have intermediate/high grade or aggressive prostate cancer require medical intervention.

EXAMPLES

This and other aspects of the present invention are further illustrated by the following non-limiting examples.

Example 1 Study Populations

Two independent patient cohorts were used for the development and validation of the Sentinel™ PCa and Sentinel™ CS Tests. The clinical and demographic characteristics of the 233 participants used to develop the Sentinel™ PCa to classify patients as having cancer or no cancer was based on the statistical analysis of a collection of signatures snRNAs. For patients classified as having cancer, patients with GG1 (indolent, low risk cancer) are distinguished from GG2-5 (respectively as intermediate, high-risk and aggressive cancers) using the Sentinel™ CS Tests, which is also based on the statistical analysis of another collection of signatures the sncRNAs using a second Classification algorithm to classify tumors into GG1 versus GG2-5. The sncRNAs in both tests are interrogated by the Affymetrix miR 4.0 array.

Urine Collection and Processing

Urine samples for the development of the Sentinel™ PCa and CS Tests and the US-based cohort of the retrospective study were collected on the day of visit for clinical workup at two clinical sites: Albany Medical Center (Albany, N.Y., USA) and SUNY Downstate Medical Center (Brooklyn, N.Y., USA). Remaining samples for the retrospective study were retrieved from the GUBioBank, University Health Network, Toronto CA and shipped frozen at −20° C. in bulk to the miR Scientific laboratories. Patient information was collected and anonymized as approved by Institutional Review Board at each participating site. Prostate cancer diagnosis was obtained by histopathological grading of core-needle biopsies; the percentage of tumor per core and number of positive cores were used to assess the grade group (GG).

Urine samples were centrifuged to remove free cells and debris. RNA was extracted using Exosome RNA Isolation Kits (Norgen Biotek, ON) according to the manufacturer's instructions. sncRNA yields were quantified by fluorimetry (Qubit, Thermo Fisher Scientific) and RNA samples were stored at −80° C. until analysis.

Microarray Analysis of Total Exosomal sncRNAs

sncRNAs were interrogated using the Affymetrix GeneChip™ miR 4.0 Array following the manufacturer's instructions. MAIME-compliant raw data files for the 235 patients analyzed on these arrays have been deposited in NCBI's Gene Expression Omnibus. (Edgar R et al. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acid Res 2002 30:207). Six thousand five hundred and ninety-nine sncRNAs in the training set were interrogated on the Affymetrix GeneChip™ miRNA 4.0 Array.

The small non-coding RNA entities interrogated for each participant were analyzed using proprietary Selection and Classification Algorithms. The most informative sequences for distinguishing between cancer and non-cancer subjects (SEQ ID NOs: 1-280) and between Grade Group 1 and Grade Group 2-5 patients were identified. (SEQ ID NOs: 281-842)

QuantStudio OpenArray™-Based Interrogation of Exosomal sncRNAs

cDNA synthesis, pre-amplification of selected miRNAs: For analysis of exosomal miRNA, total sncRNA was reverse transcribed in separate reactions with three specific miRNA stem-loop primer pools with the TaqMan™ MicroRNA Reverse Transcription Kit (Thermo Fisher Scientific) as recommended by the manufacturer. The miRNA cDNA pools were enriched individually with Pre-Amp primer pools for 16 cycles (95° C. for 10 min, 55° C. for 2 min, 72° C. for 2 min, 95° C. for 15 sec and 60° C. for 4 min repeated 16 cycles, 99.9° C. for 10 min), and interrogated on the QuantStudio OpenArray™ on three 56-entity sub-arrays following the manufacturer's recommendations.

cDNA synthesis, pre-amplification and interrogation of selected snoRNAs: Total sncRNA was reverse transcribed with High-Capacity cDNA Reverse Transcription Kit with a single Pre-Amp primer pool (Thermo Fisher Scientific) as recommended by the manufacturer. snoRNA cDNA products were enriched by preamplification (95° C. for 10 min, 95° C. for 15 sec and 60° C. for 4 min repeated for 14 and 18 cycles respectively, and 99° C. for 10 min) and interrogated on two 56-entity sub-arrays.

Statistical Analysis:

The Sentinel™ PCa Test is based on Classification Algorithm that has been trained on a cohort of participants whose core-needle biopsy is positive or negative. The Classification Algorithm takes as input the sncRNA expression signature for a participant with unknown disease status and produces a Sentinel™ Score; the participant is classified by comparing the Sentinel™ PCa Score to the pre-determined cutoff value that maintains the sensitivity for classifying a future patient with unknown disease status (but known expression signature), at a user-defined level (95% or greater). A second classification algorithm, the Sentinel™ CS Test operates analogously to the Sentinel™ PCa Test. However, the classification algorithm for the Sentinel™ CS Test is trained on a cohort of patients labeled as low grade (GG1) and a second cohort of patients labeled as favorable intermediate to high grade prostate cancer (GG2-GG5). The third classification algorithm, the Sentinel™ HG Test is trained on a cohort of patients determined to be low- and favorable intermediate-risk (GG1+GG2) prostate cancer and a second cohort characterized as unfavorable intermediate risk and high-risk (GG3-GG5)

The Sentinel™ Testing paradigm operates in two or three layers. First, it uses the sncRNA signature from the participant's urine to input to the classification rule of the Sentinel™ PCa Test to determine if cancer is present; second, for those patients diagnosed with cancer, the Sentinel™ CS Test determines whether the cancer is low risk (GG1) or not; thirdly the Sentinel™ HG Test determine whether the tumor is unfavorable intermediate or high risk (GG3-GG5) or not. (See FIG. 4.)

TABLE 4 Demographics and Clinical Characteristics of Cohort Used to Develop Classification Algorithm. Cancer (%) Control (%) GG1 GG2 GG3 GG4 GG5 Total 89 (37.9%) 90 (38.3%) 34 (14.5%) 9 (3.8%) 7 (3.0%) 6 (2.5%) Age Range 23-89 53-83 50-81 60-81 51-74 59-74 Mean ± SD 65.6 ± 13.0 67.3 ± 6.7 66.3 ± 6.6 72.1 ± 6.6 63.0 ± 7.5 67.2 ± 5.5 BMI Range 18.9-54.1 20.7-53.3 22.0-44.1 26.1-36.1 16.0-44.1 27.5-41.2 Mean ± SD 29.3 ± 5.6  28.4 ± 4.9 28.2 ± 3.8 30.1 ± 3.3 29.2 ± 8.5 32.7 ± 5.2 Race NHW 72 78 30 7 7 6 NHB 3 7 4 1 0 0 not reported 14 5 0 1 0 0 PSA Range N/A 0.55-28.2  2.1-49.9  5.6-28.2  6.9-40.8  5.5-17.8 Mean ± SD  6.3 ± 4.0  7.7 ± 7.9 10.5 ± 7.4 18.8 ± 10.7  8.7 ± 4.6 *Under exempt study status, PSA levels were not available for patients with no evidence of cancer.

Table 4 established the training datasets used to develop the Sentinel™ tests. Of the 235 patients, patients included in the “no cancer” cohort (89 patients) were carefully selected from age-matched men who were seen at urology clinics for issues unrelated to urological oncology (n=58), and from men who had one or more 12-needle diagnostic core needle biopsies that showed no evidence of prostate cancer (n=30).

Patients in the “cancer” cohort (n=146) were selected based on the histopathology of the core needle biopsies. Of the 146 “cancer” cohort, 90 patients were classified as GG1 cancer, 56 patients were classified as GG2-5.

Of the 6,599 microarray sequences from the training data set interrogated using the proprietary Selection Algorithm to separate that are “informative” for the outcome versus those that are not, only 400-600 are informative. By outcome, it is meant to mean sncRNA sequences that impact the algorithm when each sequence is added to predict whether the subject of unknown disease status has or has no prostate cancer and the stages of the cancer (indolent versus aggressive).

The statistical analysis used is based on the ability to identify sequences that have hidden associations with outcome that is only observed after conditioning on other sequences. Of the 400-600 informative sncRNA sequences, 280 sncRNA sequences were used in the Classification algorithm as a basis to define an expression signature for the Discovery PCa Test (FIGS. 5A-5B), and Discovery HG Test (FIGS. 7A-7B) and Discovery CS Test (FIGS. 6A-6B). The subset of informative sncRNAs considered to be of the highest importance were then identified for each Test (FIGS. 5B, 7B and 6B, respectively).

These 280 sncRNAs were combined to design an OpenArray™ platform that provides the basis for the Sentinel™ PCa and CS Tests. The Sentinel™ PCa Test incorporates the aggregate expression profiles of 84 unique sncRNAs: 60 miRNAs and 24 snoRNAs, for classifying a subject with unknown disease status as having prostate cancer or no prostate cancer. Similarly, the Sentinel™ CS Test utilizes 135 unique sncRNAs: 105 miRNA and 30 snoRNAs for classifying a subject having prostate cancer as having GG1 (indolent) prostate cancer or GG2-GG5 (aggressive) prostate cancer. In addition, 61 sncRNAs (25 miRNAs and 36 snoRNAs) are informative in both Tests. The OpenArray™ platform sequentially interrogates the informative RNA entities present in a single sample of sncRNA extracted from urinary exosomes without compromising sensitivity and specificity of the two tests.

Example 2 Validation of the Sentinel™ PCa, Sentinel™ CS and Sentinel™ HG Tests in a Case Control Patient Cohort

The performance characteristics of the Sentinel™ PCa and Sentinel™ CS Tests using the OpenArray™ platform was established in a case control study of 1436 patients (Table 5).

TABLE 5 Demographics and Clinical Characteristics of Case-Control Sample Used to Validate the Sentinel ™ PCa and Sentinel ™ HG Tests. Cancer (%) NEPC (%) GG1 GG2 GG3 GG4 GG5 Total 568 (39.5%) 437 (30.4%) 162 (11.3%) 131 (9.1%) 66 (4.6%) 72 (5.0%) Age Range 23-90 46-93 50-96 49-95 50-93 54-91 Mean ± SD 65.8 ± 9.0 70.3 ± 8.8  71.1 ± 8.2  74.2 ± 8.7  72.4 ± 9.7  72.7 ± 9.5  Race NHW 458 233 96 47 26 36 NHB 1 20 11 5 9 3 not reported 49 184 55 79 31 33 PSA Range N/A 0.21-108  1.24-32.0 1.67-138   1.93-1400 1.98-199  Mean ± SD 6.4 ± 6.2 7.4 ± 4.7 14.0 ± 19.4  57.6 ± 194.0 25.9 ± 36.0 *Under exempt study status, PSA levels were not available for individual patients with no evidence of cancer, all were less than 3.0 ng/mL.

The performance characteristics of the Sentinel™ PCa Test were determined in case control cohort of 600 men whose demographics are shown in Table 5. The scatter plot of the Sentinel™ PCa Scores is shown in FIGS. 8A-8B, with the corresponding Receiver Operator curve (ROC) curve in FIG. 8C. As summarized in Table 6, the Sentinel™ PCa Test correctly classifies 281/300 patients as having cancer and 275/300 patients as having no cancer (Sensitivity 93.7%, Specificity 91.7%).

The performance characteristics of the Sentinel™ CS Test were determined in a testing cohort of 600 men. The scatter plot of the Sentinel™ CS Scores is shown in FIGS. 9A-9B, with the corresponding Receiver Operator curve (ROC) curve in FIG. 9C. As summarized in Table 6, the Sentinel™ CS Test correctly classifies 143/154 patients as high grade (GG3-GG5) and 132/143 as not high grade (Sensitivity 92.9%, Specificity 90.4%).

The performance characteristics of the Sentinel™ HG Test were determined in a testing cohort of 600 men. The scatter plot of the Sentinel™ HG Scores is shown in FIGS. 10A-10B, with the corresponding Receiver Operator curve (ROC) curve in FIG. 10C. As summarized in Table 6, the Sentinel™ CS Test correctly classifies 94/100 patients as high grade (GG3-GG5) and 191/200 as not high grade (GG1+GG2) (Sensitivity 94%, Specificity 95.5%).

TABLE 6 Empirical Sensitivity, Specificity, PPV and NPV for Sentinel ™ PCa, Sentinel ™ CS and Sentinel ™ HG Tests 95% 95% 1-Error Rate Numerator Denominator Proportion lower CI upper CI Sentinel ™ PCa Sensitivity 281 300 0.937 0.905 0.960 Specificity 275 300 0.917 0.882 0.944 PPV 281 306 0.918 0.884 0.945 NPV 275 294 0.935 0.903 0.959 Sentinel ™ CS Sensitivity 143 154 0.929 0.880 0.962 Specificity 132 146 0.904 0.848 0.944 PPV 143 157 0.911 0.859 0.948 NPV 132 143 0.923 0.871 0.959 Sentinel ™ HG Sensitivity  94 100 0.940 0.880 0.975 Specificity 191 200 0.955 0.919 0.978 PPV  94 103 0.913 0.846 0.956 NPV 191 197 0.970 0.938 0.987 *NPV or Negative Predictive Value is the probability that following a negative test result, that individual will not have that specific disease. NPV = True Negative True Negative + False Negative **Sensitivity of a test is the proportion of people who test positive among all those who actually have the disease. Sensitivity = True Positive True Positive + False Negative ***The specificity of a test is the proportion of people who test negative among all those who do not actually have that disease. Specificity = True Negative True Negative + False Positive ****PPV or Positive Predictive Value is the probability that following a positive test result, that individual will truly have the specific disease. PPV = True Positive True Positive + False Positive

Example 3

Safety and Scientific Validity Study to Identify Clinically Insignificant PCa with Scientific Sentinel™ Platform.

The purpose the clinical study is to validate the performance characteristics of the Scientific Sentinel™ PCa Test and the Scientific Sentinel™ CS Test to (1) identify patients with prostate cancer in men of age 50-80 years with suspicion of prostate cancer for whom needle biopsy is performed, and (2) to distinguish men of ages 50-80 years with clinically significant prostate cancer (Grade 2 or above) from men with clinically insignificant prostate cancer (Grade Group 1). These classifications will be compared to the results of core needle biopsies, and of radical prostatectomy (where available). The sensitivity, specificity, positive and negative predictive values will be established. This study is a prospective, observational and non-interventional study. The informed participants will provide two or more urine samples over the course of the study and consent to share relevant anonymized clinical data with the study team.

Participants between the age of 50 and 80 years with suspicion of prostate cancer for whom a core-needle biopsy is performed, and otherwise meeting the inclusion and exclusion criteria, will be enrolled and will provide urine samples for the Sentinel™ PCa/CS Tests. The study will evaluate the properties of the Sentinel™ PCa Test and the Sentinel™ CS Test that is based on the disclosed method using Classification Algorithms to identify future patients with prostate cancer and to classify prostate cancer as clinically significant or clinically insignificant.

The “gold standard” assessment of cancer will be made from the results of core needle biopsies: participants with no positive cores will be designated “cancer-free”; participants with cancer in one or more cores will be designated as having “Clinically Insignificant” prostate cancer provided all cores with cancer have no greater than Grade Group 1 histopathology; participants will be designated as having “Clinically Significant” prostate cancer if any cores have Grade Groups 2-5.

Each study participant enrolled will be followed for one year. Participants will provide urine samples during each visit, and all relevant clinical data, including re-biopsies, PSA results and pathology report from radical prostatectomy (if administered as part of clinical care) will be obtained. The follow-up results, if available will be used for outcome analysis. For each urine sample provided, the Sentinel™ PCa and CS tests will be determined and compared with the available 1 year follow up outcome data to inform the sensitivity, specificity, positive and negative predictive values of the tests.

The Classification Algorithm employed functions by controlling sensitivity at, or above, a pre-specified level, denoted 1−α; for example, the value that has been assumed in this design is α=0.05, so that sensitivity is at least 95% in the population. Note that the value of a represents the false-negative rate of the test, i.e., the test is (incorrectly) negative for a patient who is truly positive.

To describe how the cutoff the Sentinel™ PCa Score is calculated to control sensitivity, for each participant in the training dataset, the Sentinel™ PCa Score will be calculated using the remaining members of the training dataset and only his small non-coding RNA (sncRNA) sequence; that is, the true disease status of each patient in the training dataset will be blinded, thereby mimicking the setting for classification of a future patient. The cutoff used in the Sentinel™ PCa Test is then calculated so that the empirical sensitivity over patients in the training dataset with prostate cancer corresponds to the value that provides an upper one-sided 95% confidence interval for population sensitivity for a future patient of at least 1−α.

With this cutoff for the Sentinel™ PCa Score determined from the training dataset a priori, the corresponding values of sensitivity, specificity, positive and negative predictive values will be calculated, along with a corresponding upper 95% confidence interval, on the prospective participants data accrued in this proposed study, with each biopsy result blinded, i.e., using only the participant's sncRNA sequence. Note that these error rates refer to the classification of a future patient with unknown disease status.

Any patent, patent application publication, or scientific publication, cited in this application, is incorporated by reference in its entirety.

Claims

1. A system for classifying a human test subject as having prostate cancer or no prostate cancer comprising:

a processor for performing a PCa (Prostate Cancer) Test configured to: (a) interrogate sncRNA sequences for a set of informative sequences comprising a panel of hybridizing probes of SEQ ID NOs: 1-280 in a first high throughput microarray platform; (b) determine a test score (PCa score) that examines the aggregate expression profile of all the interrogated sequences of SEQ ID NOs: 1-280 in a first Classification algorithm, wherein the first Classification algorithm is trained on a cohort of participants whose core-needle biopsy is positive (known to have prostate cancer) and a cohort of participant whose core-needle biopsy is negative (known to have no prostate cancer); (c) compare the test score (PCa score) in (b) for the interrogated informative sequences to a score obtained in training data sets from patients known to have prostate cancer or patients with no prostate cancer; and (d) classify with a specificity of at least 90-95% whether the human test subject has: prostate cancer when the test score (PCa score) from the human test subject is statistically greater or equal to the score in the training data set from a target population having prostate cancer; or no prostate cancer when the test score (PCa score) from the human test subject is statistically less than or equal to the score in the training data set from a target population having no prostate cancer.

2. The system of claim 1, wherein sncRNAs are extracted from a prostate derived biological sample and reversed transcribed to cDNAs.

3. The system of claim 2, wherein the prostate derived biological sample comprises sncRNA extracted from urinary exosome isolated from cell free urine.

4. The system of claim 3, wherein the sncRNA comprises miRNA, C/D box snoRNA, H/ACA box snoRNA, scaRNA, piRNA, and lncRNA.

5. The system of claim 3, wherein the sncRNA comprises a miRNA and a snoRNA.

6. The system of claim 1, wherein the hybridizing probes are specific for each of the cDNA derived from sncRNAs obtained from a prostate derived biological sample.

7. The system of claim 6, wherein the hybridizing probes detect at least SEQ ID NOs: 2, 6, 14, 20, 31, 56, 64, 67, 79, 82, 91, 92, and 161.

8. The system of claim 1, wherein the human test subject determined to have prostate cancer requires further testing to classify the human test subject as having indolent (low grade or GG1) or intermediate/high grade (GG2-GG5) prostate cancer; and the human subject determined to have no prostate cancer requires monitoring.

9. The system of claim 8, wherein the human test subject classified to have no prostate cancer requires routine follow-up urine test every 12 months.

10. A system for classifying a human test subject identified to have prostate cancer as having indolent (low grade, GG1), or intermediate/high grade (GG2-GG5) prostate cancer comprising:

a processor for performing a CS (Clinically Significant) Test configured to: (a) interrogate sncRNA sequences for a set of informative sequences comprising a panel of hybridizing probes specific for each of the cDNA derived from sncRNAs obtained from a prostate derived biological sample, wherein the hybridizing probes detect of SEQ ID NOs: 281-560 in a second high throughput microarray platform; (b) determine a test score (CS Score) that examine the aggregate expression profile of all the interrogated sequences of SEQ ID NOs: 281-560 in a second Classification algorithm, wherein the second Classification algorithm is trained on a cohort of participants known to have low grade (GG1) prostate cancer, and another cohort of participants known to have intermediate/high grade (GG2-GG5) prostate cancer; (c) compare the test score (CS Score) in (b) for the interrogated informative sequences to a score obtained in training data sets from patients known to have indolent (low grade, GG1) or intermediate/high grade (GG2-GG5) prostate cancer; and (d) determine with a specificity of at least 90-95% that the human test subject has: indolent prostate cancer (low grade, GG1) when the CS score from the human test subject is statistically less than or equal to the CS score in the training data set from a target population having indolent prostate cancer (low grade, GG1), or intermediate/high grade (GG2-GG5) prostate cancer when the CS score from the human test subject is statistically greater than or equal to the CS score in the training data set from a target population having intermediate/high grade (GG2-GG5) prostate cancer, wherein the human test subject determined to have high grade (GG2-GG5) prostate cancer is further tested to determine if the human test subject has low/intermediate risk (GG1-GG2) or aggressive (high grade, GG3-GG5) prostate cancer.

11. The system of claim 10, wherein the hybridizing probes detect at least SEQ ID NOs: 22, 281, 283, 284, 285, 286, 287, 313, 316, 321, 336, 337, 340, 363 and 371.

12. The system of claim 10, wherein the human test subject determined to have intermediate/high grade (GG2-GG5) prostate cancer requires further testing to classify the human test subject as having low/intermediate grade (GG1-GG2), or high grade/aggressive (GG3-GG5) prostate cancer; and the human test subject determined to have indolent (low grade or GG1) prostate cancer requires monitoring for a routine follow-up urine test every 6 to 12 months.

13. A system for classifying if a human test subject identified to have intermediate/high grade (GG2-GG5) prostate cancer as having low/intermediate grade (GG1-GG2) or high grade/aggressive (GG3-GG5) prostate cancer comprising:

a processor for performing a high grade (HG) test configured to (a) interrogate sncRNAs sequences for a set of informative sequences comprising a panel of hybridizing probes specific for each of the cDNA derived from sncRNAs obtained from a prostate derived biological sample, wherein the hybridizing probes detect SEQ ID NOs: 561-840 on a third high throughput microarray platform; (b) determine a test score (HG Score) that examines the aggregate expression profile of all the interrogated sequences of SEQ ID NOs: 561-840 in a third Classification algorithm, wherein the third Classification algorithm is trained on a cohort of participants known to have low/intermediate grade (GG1-GG2) prostate cancer, and another cohort of participants known to have high grade/aggressive (GG3-GG5) prostate cancer; (c) compare the test score (HG Score) in (b) for the interrogated informative sequences to a score obtained in training data sets from patients known to have low/intermediate risk (GG1-GG2) or high grade/aggressive (GG3-GG5) prostate cancer; and (d) determine with a sensitivity of at least 90-95% whether the human test subject has: low/intermediate risk (GG1-GG2) prostate cancer when the test score (HG Score) from the human test subject is statistically less than or equal to the HG score in the training date set from a target population low/intermediate grade (GG1-GG2) prostate cancer, or high grade/aggressive (GG3-GG5) prostate cancer when the test score (HG Score) from the human test subject is statistically greater than or equal to the HG score in the training date set from a target population high grade/aggressive (GG3-GG5) prostate cancer.

14. The system of claim 13, wherein the hybridizing probes detect at least SEQ ID NOs: 111, 178, 291, 318, 346, 347, 389, 503, 580, 620, 704, 708, 766, 791, and 797.

15. The system of claim 13, wherein the human test subject determined to have high grade/aggressive (GG3-GG5) prostate cancer requires treatment; and the human test subject determined to have low/intermediate grade (GG1-GG2) prostate cancer requires the urine test as part of an active surveillance every 3 or 6 months.

16. The system of claim 15, wherein the subject classified as having aggressive (intermediate or high grade, GG2-GG5) prostate cancer is treated with one or more of radical prostatectomy, brachytherapy of the prostate, radiotherapy of the prostate, neoadjuvant hormone therapy and adjuvant hormone therapy.

17. A system for determining if a human test subject has cancer or no cancer, and classifying the human test subject with cancer as having (i) indolent (low grade, GG1), (ii) intermediate or high grade (GG2-GG5), (iii) low/intermediate risk (GG1-GG2) or (iv) aggressive (high grade, GG3-GG5) prostate cancer comprising:

a first processor configured to: (a) interrogate sncRNA sequences for a first set of informative sequences comprising a panel of hybridizing probes comprising SEQ ID NOs: 1-280 in a first high throughput microarray platform; (b) determine a test score (PCa score) that examines the aggregate expression profile of all the interrogated sequences of SEQ ID NOs: 1-280 and interaction between the sequences in a first Classification algorithm, wherein the first Classification algorithm is trained on a cohort of target population whose core-needle biopsy is positive (known to have prostate cancer) and a cohort of target population whose core-needle biopsy is negative (known to have no prostate cancer); (c) compare the test score (PCa score) in (i)(b) for the interrogated informative sequences to a score obtained in training data sets from a target population known to have prostate cancer or no prostate cancer; and (d) determine with a specificity of at least 90-95% whether the human test subject has: prostate cancer when the test score (PCa score) from the test subject is statistically greater than or equal to the score in the training data set from a target population having prostate cancer, or no prostate cancer when the test score (PCa score) from the test subject is statistically less than or equal to the score in the training data set from a target population having no prostate cancer; wherein the human test subject determined to have prostate cancer is re-tested to classify if the human test subject has indolent (low grade, GG1) or intermediate or high grade (GG2-GG5) prostate cancer;
(ii) a second processor configured to: (a) interrogate sncRNA sequences for a second set of informative sequences comprising a panel of hybridizing probes comprising SEQ ID NOs: 281-560 in a second high throughput microarray platform; (b) determine a second test score (Clinically Significant (CS) score) that examines the aggregate expression profile of all the interrogated sequences of SEQ ID NOs: 281-560 and interaction between the sequences in a second Classification algorithm, wherein the second Classification algorithm is trained on a cohort of target population known to have indolent (low grade, GG1), and another cohort of target population known to have intermediate/high grade (GG2-GG5) prostate cancer; (c) compare the test score (CS score) in (ii)(b) for the interrogated informative sequences to a score obtained in training data sets from target population known to have indolent (low grade, GG1) or high grade (GG2-GG5) prostate cancer; and (d) determine with a specificity of at least 90-95% whether the human test subject has: indolent (low grade, GG1) prostate cancer when the CS score from the human test subject is statistically less than or equal to the CS score in the training data set from a target population having indolent prostate cancer (low grade, GG1), or high grade (GG2-GG5) prostate cancer when the CS score from the human test subject is statistically less than or equal to the CS score in the training data set from a target population having intermediate/high grade (GG2-GG5) prostate cancer, wherein the human test subject determined to have high grade (GG2-GG5) prostate cancer is retested to further classify if the human test subject has low/intermediate risk (GG1-GG2) or aggressive (high grade, GG3-GG5) prostate cancer;
(iii) a third processor configured to (a) interrogate sncRNAs sequences for a third set of informative sequences comprising a panel of hybridizing probes comprising SEQ ID NOs: 561-840 in a third high throughput microarray platform; (b) determine a third test score (HG score) that examine the aggregate expression profile of all the interrogated sequences of SEQ ID NOs: 561-840 and interaction between the sequences in a third Classification algorithm; (c) compare the HG score in (iii)(b) for the interrogated informative sequences to a score obtained in training data sets from target population known to have low/intermediate risk (GG1-GG2) or aggressive (high grade, GG3-GG5) prostate cancer; and (d) determine with a specificity of at least 90-95% whether the test subject has: low/intermediate risk (GG1-GG2) when the test score (HG Score) from the human test subject is statistically less than or equal to the HG score in the training date set from a target population low/intermediate grade (GG1-GG2) prostate cancer, or aggressive (high grade, GG3-GG5) prostate cancer when the test score (HG Score) from the human test subject is statistically greater than or equal to the HG score in the training date set from a target population high grade/aggressive (GG3-GG5) prostate cancer.

18. The system of claim 17, wherein sncRNAs are extracted from a prostate derived biological sample, and wherein the extracted sncRNA are reverse transcribed to cDNAs.

19. The system of claim 17, wherein the hybridizing probes are specific for each of the cDNA derived from sncRNAs obtained from a prostate derived biological sample.

20. The system of claim 17, where the human test subject classified as having aggressive (high grade, GG3-GG5) prostate cancer requires one or more treatments including radical prostatectomy, brachytherapy of the prostate, radiotherapy of the prostate, neoadjuvant hormone therapy and adjuvant hormone therapy; the test subject determined to have low/intermediate grade (GG1-GG2) prostate cancer requires routine follow-up urine test or active surveillance every 6-12 months; and test subject determined to have intermediate/high grade (GG2-GG5) requires routine follow-up urine test or active surveillance every 3-6 months.

Patent History
Publication number: 20230257826
Type: Application
Filed: Apr 24, 2023
Publication Date: Aug 17, 2023
Applicant: miR Scientific, LLC (Rensselaer, NY)
Inventors: Martin Tenniswood (Rensselaer, NY), Albert Gregory DiRienzo (Rensselaer, NY), Wei-Lin Winnie Wang (Rensselaer, NY)
Application Number: 18/138,207
Classifications
International Classification: C12Q 1/6886 (20060101);