CLINICAL DECISION SUPPORT TO AID INTERVENTIONAL VS MEDICAL THERAPIES FOR CARDIOVASCULAR DISEASE
Provided herein are methods and systems for making patient-specific therapy recommendations of a lipid-lowering therapy for patients with known or suspected cardiovascular disease, such as atherosclerosis.
This application is a continuation of U.S. patent application Ser. No. 17/838,148, filed on Jun. 10, 2022, which claims the benefit of U.S. Provisional Application Ser. No. 63/209,164, filed on Jun. 10, 2021 and U.S. patent application Ser. No. 17/693,229, filed on Mar. 11, 2022. The entire contents of the foregoing are incorporated herein by reference.
TECHNICAL FIELDThis disclosure relates to methods and systems for making patient-specific therapy recommendations for patients with known or suspected cardiovascular disease, such as atherosclerosis.
BACKGROUNDMyocardial infarction (MI) and ischemic stroke (IS), major consequences of unstable atherosclerotic lesions, are the most common causes of death worldwide worldwide (World Health Organization (WHO). Cardiovascular diseases (CVDs)Fact Sheet, 2017, 23 Apr. 2020; available online at who.int/en/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds)). Guidance for prevention of MI and IS is currently based on treatment efficacy at the group level.
According to the World Health Organization (WHO), cardiovascular disease (CVD), encompassing, coronary, and lower extremity artery disease, is the leading cause of death and disability globally (The Atlas of Heart Disease and Stroke, W.H. Organization, Editor, 2014), mainly by myocardial infarction and ischemic stroke from unstable atherosclerosis worldwide (World Health Organization (WHO). Cardiovascular diseases (CVDs)Fact Sheet, 2017, 23 Apr. 2020; Available from: www.who.int/en/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds)). New treatments have been revolutionary over the last 30 years, yet CVD still exerts an exorbitantly high financial costs (Bloom et al., The Global Economic Burden of Noncommunicable Diseases, W.E. Forum, Editor. 2011: Geneva), with a $320 billion annual burden on the U.S. economy alone (Mozaffarian et al., Heart Disease and Stroke Statistics-2015 Update: A Report from the American Heart Association. Circulation, 2015. 131 (4): p. e29). This is exacerbated by aging and changing ethnic mix (Gierada et al., Projected outcomes using different nodule sizes to define a positive CT lung cancer screening examination. Journal of the National Cancer Institute, 2014. 106 (11): p. dju284; Warner, J. Stroke Costs Reaching Trillions: Without Action, Financial Costs of Strokes to Reach $2.2 Trillion by 2050. Stroke Health Center 2006 (cited 2014 Nov. 14, 2014); Available from: www.webmd.com/stroke/news/20060816/stroke-costs-reaching-trillions), as well as affecting an increasing proportion of people globally as economic development continues to narrow the gap between the developed and developing word populations.
In the U.S., the American Heart Association (AHA) projects that over 9% of adults are at significant (more than 20%) risk of adverse events within 10 years and over 25% more are at a moderate risk (Association, A.H., AHA STATISTICAL UPDATE Heart Disease and Stroke Statistics—2018 Update. Circulation Journal, 2018. 137). This yields 23 million high risk patients and 57 million moderate risk people. Of these, approximately 30 million people in the U.S. are currently on statin therapy in an attempt to avoid new or recurrent CV events, and the 16.5 million with a current CVD diagnosis are almost all on maintenance medications (Ross, G., Too Few Americans Take Statins, CDC Study Reveals. American Council on Science and Health, 2015; Vishwanath, R. and L. C. Hemphill, Familial hypercholesterolemia and estimation of US patients eligible for low-density lipoprotein apheresis after maximally tolerated lipid-lowering therapy. Journal of Clinical Lipidology, 2014. 8: p. 18-28; Herper, M. How Many People Take Cholesterol Drugs? Forbes, 2008; Pearson et al., Markers of Inflammation and Cardiovascular Disease: Application to Clinical and Public Health Practice: A Statement for Healthcare Professionals From the Centers for Disease Control and Prevention and the American Heart Association. Circulation, 2003. 107 (3): p. 499-511).
According to the WHO, stroke accounts for 10% of all deaths across the globe, causing at least 5.5 million deaths annually (The Atlas of Heart Disease and Stroke, W.H. Organization, Editor. 2014). Of the approximately 800,000 annual strokes in the U.S., 87% are ischemic, and approximately 15% of all strokes are heralded by a transient ischemic attack (TIA) (Writing Group, M., D. Mozaffarian et al., Heart Disease and Stroke Statistics-2016 Update: A Report From the American Heart Association. Circulation, 2016. 133 (4): p. e38-360; Bruce Ovbiagele, Stroke Epidemiology: Advancing Our Understanding of Disease Mechanism and Therapy. Neurotherapeutics, 2011. 2011 (8): p. 319-329). Many ischemic stroke events are caused by atherosclerosis (Barrett et al., Stroke Caused by Extracranial Disease. Circ Res, 2017. 120 (3): p. 496-501). 2.3 million subjects in the US are believed to have clinically significant stenosis (>50%), 19% of which have over 70% stenosis (de Weerd et al., Prevalence of Asymptomatic Carotid Artery Stenosis in the General Population: An Individual Participant Data Meta-Analysis. Stroke, 2010. 41 (6): p. 1294-1297). Stroke also results in enormous costs for society, accounting for $36.5 (Go et al., Heart Disease and Stroke Statistics 2014 Update: A Report From the American Heart Association. Circulation, 2014. 129 (3): p. e28-e292) to $74 billion annually (D. L. Brown et al., Projected costs of ischemic stroke in the United States. Neurology, 2006), estimated to reach $2.2 trillion by 2050 (PTINR.com-Staff $2.2 trillion stroke cost projected. 2006; Brown et al., Projected costs of ischemic stroke in the United States. Neurology, 2006. 67 (8): p. 1390-1395).
According to the WHO, “coronary heart disease is now the leading cause of death worldwide. It is on the rise and has become a true pandemic that respects no borders” (The Atlas of Heart Disease and Stroke, W.H. Organization, Editor. 2014). Of the approximately 1.2 million annual coronary attacks in the U.S., ˜66,000 are new, ˜305,000 are recurrent, and ˜160,000 are silent myocardial infarctions (MIs) (Writing Group, Mozaffarian et al., Heart Disease and Stroke Statistics-2016 Update: A Report From the American Heart Association. Circulation, 2016. 133 (4): p. e38-360; Bruce Ovbiagele, Stroke Epidemiology: Advancing Our Understanding of Disease Mechanism and Therapy. Neurotherapeutics, 2011. 2011 (8): p. 319-329. Coronary heart disease caused by atherosclerosis is the most common type of heart disease, killing 365,914 people in 2017 (Benjamin et al., Heart Disease and Stroke Statistics-2019 Update: A Report From the American Heart Association. Circulation, 2019. 139 (10): p. e56-e528).
The relative risk levels for varying degrees of obstruction remains equivocal, with some reports seeming to support the notion that clinically non-obstructive coronary artery disease (CAD) actually harbors more high-risk plaque than more occlusive plaques, where others suggest that the stenotic plaques do have higher event rates (Chang et al., Coronary Atherosclerotic Precursors of Acute Coronary Syndromes. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY (JACC), 2018. 71 (22); Gaston A. Rodriguez-Granillo et al., Defining the non-vulnerable and vulnerable patients with computed tomography coronary angiography: evaluation of atherosclerotic plaque burden and composition. European Heart Journal—Cardiovascular Imaging, 2016. 2016 (17): p. 481-491; Ahmadi et al., Do plaques rapidly progress prior to myocardial infarction? The interplay between plaque vulnerability and progression. Circulation research, 2015. 117 (1): p. 99-104; Bittencourt et al., Prognostic Value of Nonobstructive and Obstructive Coronary Artery Disease Detected by Coronary Computed Tomography Angiography to Identify Cardiovascular Events. Circulation: Cardiovascular Imaging, 2014. 7 (2): p. 282-291; Virmani et al., Pathology of the Vulnerable Plaque. JACC, 2006. 47 (8): p. C13-8; F D Kolodgie et al., Pathologic assessment of the vulnerable human coronary plaque. Heart, 2004. 90; Virmani et al., Lessons from sudden coronary death: a comprehensive morphological classification scheme for atherosclerotic lesions. Arterioscler Thromb Vasc Biol, 2000. 20 (5): p. 1262-75).
There is a significant need to help healthcare providers make therapeutic recommendations that are tailored to specific patients rather than taking a “one size fits all” approach with the available and future therapies for cardiovascular disease.
SUMMARYThe present disclosure provides methods and systems for selecting and recommending a suitable therapeutic treatment plan for a patient with cardiovascular disease, such as atherosclerosis. For example, physicians and other healthcare providers can use the new methods and systems to analyze and process non-invasively obtained data, such as imaging data, e.g., computed tomography angiography (CTA) data, of arteries from patients with atherosclerosis to obtain predicted proteomic and genomic information. Based on this information, various potential therapies, e.g., pharmacotherapies and/or procedural interventions, can be simulated based on their mechanisms of action in in silico systems biology models as described herein to enable the health care provider to provide a report to the patient recommending one or more specific pharmacotherapies and/or procedural interventions to be used to treat the patient.
This disclosure also provides methods for obtaining proteomic and/or genetic information and methods for building in silico systems biology models.
The in silico systems biology models are initially generated or trained with two types of data. First, one uses experimentally determined data from biological specimens from development subjects. Development subjects are people for whom actual proteomic data is available that shows differentially expressed protein levels that are linked to the specific characteristics and morphology of the plaques in each of those subjects. Second, one uses results from searches of public literature, experimental results, and/or other databases to find journal articles and the like to obtain detailed information about the proteins in the model. These two sources of data are used to create the initial in silico systems biology model.
The initial in silico systems biology model is then updated with calibration data, such as 'omics data, from test subjects to validate and refine the initial model. The calibration data is again based on actual biological samples that show differentially expressed protein and/or transcription levels that are linked to the specific characteristics and morphology of the plaques in each of those test subjects. This update of the initial model provides a calibrated in silico systems biology model. This step confirms that the model works as intended and also augments and renders the model more robust, given the calibration data from many test subjects.
Then in operation, the calibrated in silico systems biology model is again updated, but now with patient-specific personalized data based on imaging of the patient's plaque without the need to perform an invasive blood test or biopsy. The calibrated in silico systems biology model is also updated with the predicted effects of two or more different therapies. The methods and systems described herein use the patient's non-invasively obtained data, e.g., imaging data to provide a therapy recommendation based upon an automated comparison of the two or more different therapies whose predicted effects are programmed into the model.
In one aspect, the disclosure features computer-implemented methods for clinical decision support, the methods including receiving non-invasively obtained data related to a plaque from a patient; updating a trained in silico systems biology model using personalized calibration data derived from the received data to generate an in silico patient-specific systems biology model, wherein (i) the trained in silico systems biology model comprises a set of networks, wherein each network comprises a plurality of nodes, each node representing a baseline level of a molecule, and a plurality of edges between pairs of nodes, each edge representing a molecule-molecule interaction, (ii) at least two of the nodes represent molecules whose levels are affected by the atherosclerotic cardiovascular disease, and (iii) at least one of the set of networks includes a disease-associated molecule level for each of the nodes in the network; perturbing the in silico patient-specific systems biology model to simulate a therapeutic effect of a lipid-lowering agent for the patient; and, optionally, providing an output indicating a level of improvement in the atherosclerotic cardiovascular disease by the lipid-lowering agent for the patient and a recommendation supporting a clinical decision as to whether the lipid-lowering agent would benefit the patient.
In some embodiments, the recommendation informs a decision that leads to a clinical action. In different embodiments, the recommendation enables a healthcare provider to tailor a therapy for the patient.
In various embodiments, the at least one of the set of networks includes nodes corresponding, respectively, to one or more of a glycosylated low-density lipoprotein (glyLDL), an oxidized LDL (oxLDL), a minimally-modified LDL (mmLDL), or a very-low-density lipoprotein (VLDL).
In various embodiments, the non-invasively obtained data is radiological imaging data. The radiological imaging data can be obtained by computed tomography (CT), dual energy computed tomography (DECT), spectral computed tomography (spectral CT), computed tomography angiography (CTA), cardiac computed tomography angiography (CCTA), magnetic resonance imaging (MRI), multi-contrast magnetic resonance imaging (multi-contrast MRI), ultrasound (US), positron emission tomography (PET), intra-vascular ultrasound (IVUS), optical coherence tomography (OCT), near-infrared radiation spectroscopy (NIRS), or single-photon emission tomography (SPECT) diagnostic images, or any combination thereof.
In different embodiments, the molecule is a gene (transcript), a protein, and/or a metabolite.
In some embodiments, the at least one of the set of networks further includes edges representing molecule-molecule interactions, e.g., protein-protein interactions, gene-gene interactions, protein-metabolite interactions, and/or protein-gene interactions. Interactions may represent any one of translation, activation, inhibition, indirect effect, state change, binding, dissociation, phosphorylation, dephosphorylation, glycosylation, ubiquitination, and methylation as a result of an interaction between two molecules.
In various embodiments, the lipid-lowering agent is a hyperlipidemia control medication. The hyperlipidemia control medication may be a statin, and in some embodiments the statin is atorvastatin. In different embodiments, the hyperlipidemia control medication is an intensive lipid-lowering agent. In some embodiments, the intensive lipid-lowering agent is a proprotein convertase subtilisin kexin type 9 (PCSK9) inhibitor or a cholesteryl ester transfer protein (CETP) inhibitor.
In different embodiments, the hyperlipidemia control medication is a hypertriglyceridemia lowering agent or a hypercholesterolemia lowering agent, and, the hypercholesterolemia lowering agent may be a statin, ezetimibe, bile acid sequestrant, adenosine triphosphate-citrate lyase (ACL) inhibitor, fibrate, niacin, omega-3 fatty acid ethyl ester, or omega-3 polyunsaturated fatty acids (PUFA).
In another aspect, the disclosure features clinical decision support systems, wherein the systems include a memory configured to store instructions, and, a processor to execute the instructions to perform operations including updating a trained in silico systems biology model using personalized calibration data derived from the received data to generate an in silico patient-specific systems biology model, wherein (i) the trained in silico systems biology model includes a set of networks, wherein each network includes a plurality of nodes, each node representing a baseline level of a molecule, such as a protein, gene (transcript), or metabolite, and a plurality of edges between pairs of nodes, each edge representing a molecule-molecule interaction, (ii) at least two of the nodes represent molecules whose levels are affected by the atherosclerotic cardiovascular disease, and (iii) at least one of the set of networks includes a disease-associated molecule level for each of the nodes in the network; perturbing the in silico patient-specific systems biology model to simulate a therapeutic effect of a lipid-lowering agent for the patient; and providing an output indicating a level of improvement in the atherosclerotic cardiovascular disease by the lipid-lowering agent for the patient and a recommendation supporting a clinical decision as to whether the lipid-lowering agent would benefit the patient.
In some embodiments, the at least one of the set of networks further includes edges representing molecule-molecule interactions, e.g., protein-protein interactions, gene-gene interactions, protein-metabolite interactions, and/or protein-gene interactions. Interactions may represent any one of translation, activation, inhibition, indirect effect, state change, binding, dissociation, phosphorylation, dephosphorylation, glycosylation, ubiquitination, and methylation as a result of an interaction between two molecules.
In another aspect, the disclosure features one or more non-transitory computer-readable media storing instructions that are executable by a processing device, and upon such execution cause the processing device to perform operations including updating a trained in silico systems biology model using personalized calibration data derived from non-invasively obtained data related to a plaque from a patient to generate an in silico patient-specific systems biology model, wherein (i) the trained in silico systems biology model comprises a set of networks, wherein each network comprises a plurality of nodes, each node representing a baseline level of a molecule, and a plurality of edges between pairs of nodes, each edge representing a molecule-molecule interaction, (ii) at least two of the nodes represent proteins whose levels are affected by the atherosclerotic cardiovascular disease, and (iii) at least one of the set of networks includes a disease-associated molecule level for each of the nodes in the network; perturbing the in silico patient-specific systems biology model to simulate a therapeutic effect of a lipid-lowering agent for the patient; and providing an output indicating a level of improvement in the atherosclerotic cardiovascular disease by the lipid-lowering agent for the patient and a recommendation supporting a clinical decision as to whether the lipid-lowering agent would benefit the patient.
In another aspect, the disclosure features computer-implemented methods for generating an in silico systems biology model of atherosclerotic cardiovascular disease, the methods including obtaining multiple first inputs representing biological pathways associated with the atherosclerotic cardiovascular disease; generating, based on the first inputs, a first set of networks, wherein each network comprises a plurality of nodes, each node representing a baseline level of a molecule, and a plurality of edges between pairs of nodes, each edge representing a molecule-molecule interaction; obtaining second inputs indicative of calibration data from multiple test subjects who have been diagnosed with the atherosclerotic cardiovascular disease; determining, from the second inputs, a disease-associated molecule level for nodes representing molecules in the first network; and generating, based on the first network and the disease-associated molecule levels, a second set of networks, wherein the second set of networks, updated using the second inputs, represent a calibrated in silico systems biology model of the atherosclerotic cardiovascular disease and includes the disease-associated molecule levels for nodes representing proteins in the second set of networks.
In various embodiments, at least two of the nodes represent molecules whose levels are affected by the atherosclerotic cardiovascular disease. In different embodiments, at least one of the second set of networks includes a disease-associated molecule level for each of the nodes in the network.
In various embodiments, the non-invasively obtained data is radiological imaging data. The radiological imaging data can be obtained by computed tomography (CT), dual energy computed tomography (DECT), spectral computed tomography (spectral CT), computed tomography angiography (CTA), cardiac computed tomography angiography (CCTA), magnetic resonance imaging (MRI), multi-contrast magnetic resonance imaging (multi-contrast MRI), ultrasound (US), positron emission tomography (PET), intra-vascular ultrasound (IVUS), optical coherence tomography (OCT), near-infrared radiation spectroscopy (NIRS), or single-photon emission tomography (SPECT) diagnostic images, or any combination thereof.
In various embodiments, the molecule is a gene (transcript), a protein, and/or a metabolite.
In some embodiments, the at least one of the set of networks further includes edges representing protein-protein interactions, gene-gene interactions, protein-metabolite interactions, and/or protein-gene interactions. Interactions may represent any one of translation, activation, inhibition, indirect effect, state change, binding, dissociation, phosphorylation, dephosphorylation, glycosylation, ubiquitination, and methylation as a result of an interaction between two molecules.
DefinitionsA “computational model” uses computer programs to simulate and study complex systems using an algorithmic or mechanistic approach.
A “predictive model” is a mathematical formulation often described as artificial intelligence, machine learning, or deep learning that computes one or more outputs (“response variables”) form one or more inputs (“predictors”). In the present application, predictive models may be used for characterizing tissue (as a “virtual tissue model”), for predicting molecular levels form characterized tissues, or predicting outcome form either tissue characterizations and/or virtual 'omics.
A “systems biology model” refers to a model that is used to represent a set of interconnected biological pathways potentially used to simulate changes across those pathways under defined conditions.
An “in silico systems biology model” refers to a computational representation of a biological system, e.g., wherein the biological system is atherosclerotic cardiovascular disease.
An “initial in silico systems biology model” refers to an in silico systems biology model generated or trained with actual proteomic data obtained from development subjects and information obtained from literature searches.
A “calibrated in silico systems biology model” refers to an initial in silico systems biology model that is updated using measured calibration data, such as 'omics data, from a given subject (e.g., a test subject) who has been diagnosed with cardiovascular disease or from a patient with known or suspected cardiovascular disease.
“Calibration data” refers to test subject-derived data or patient-specific data that can be used to update an in silico systems biology model. Examples include measured 'omics data, such as, transcriptomics data, proteomics data, and/or metabolomics data, e.g., obtained non-invasively. Calibration data can also be obtained from molecular or tissue assays, e.g., biopsies.
“'Omics data” refers to biologically relevant quantities of gene expression, transcriptomics, proteomics, or metabolomics, based on directly measured molecular expression levels, e.g., by blood tests, molecular assays, or tissue biopsy.
“Virtual 'omics data” refers to computationally predicted levels of biologically relevant quantities of gene expression, transcriptomics, proteomics, or metabolomics (e.g., based on patient-derived imaging data) instead of directly measured molecular expression levels, e.g., by blood tests, molecular assays, or tissue biopsy.
A “network” refers to a graphical representation of interactions (edges) between various molecules (nodes).
An “artificial neural network” refers to a type of computational model, wherein the computational model is structured analogously to the human brain, as a series of interconnected “neurons” or mathematically as summations by weights and thus providing means to represent complex relationships with high degrees of non-linearity.
“Direction (of the edge)” refers to an orientation of an interaction between a pair of molecules (e.g., when molecule A activates molecule B, the direction would be A to B).
A “biological pathway” refers to a series of actions among molecules that leads to a certain product or a change.
“Baseline level” (of a molecule) refers to the biological state (e.g., expression level) of a molecule before perturbation in a systems biology model (e.g., in a healthy person or subject, before a test subject or patient was afflicted with a disease, or before a patient started a new treatment for a diagnosed disease).
A “disease-associated level” (of a molecule) refers to the quantitative amount of a molecule (gene transcript, protein, or metabolite) from an individual test-subject who has been diagnosed with a specific disease. In some instances, disease-associated levels of a molecule can be determined based on virtual 'omics data, which can include data obtained from plaque tissue, and may also include data from minimal disease tissue, as long as the data is taken from a test subject who has been diagnosed with the disease, e.g., a cardiovascular disease. Note that during model generation, disease-associated levels from test subjects are utilized, but during operation in the clinic personalized levels are used, where the word “calibration” applies to both in context.
A “personalized level” (of a molecule) refers to the quantitative amount of a molecule (transcript, protein, or metabolite) from an individual patient. In some instances, personalized levels of a molecule can be determined based on virtual 'omics data. Note that during model generation, disease-associated levels from test subjects are utilized, but during operation in the clinic personalized levels are used, where the word “calibration” applies to both in context.
A “phenotype” refers to the set of observable characteristics of an individual resulting from the interaction of its genotype with the environment. In this specification, it can be understood as also referring to “endotype” (a subtype of a disease condition, which is defined by a distinct pathophysiological mechanism), or “theratype” (a means to group according to their response to specific therapeutic alternatives), terms that are sometimes used in the field of precision medicine pertaining to the categorization or typing performed without loss of generality by the methods and systems described herein.
A “biochemical reaction” refers to an interaction among molecular quantities such as molecules (e.g., transcripts, RNAs, proteins, metabolites, inorganic compounds, etc.). Specifically, it refers to the transformation of one molecule to a different molecule inside a cell, usually (though not necessarily) annotated with quantitative coefficients or terms that allow effects to propagate across networks.
A “biochemical relation” is a semi-quantitative approximation to a biochemical reaction. “Reaction” and “relation” are used as alternatives in this disclosure (i.e., interchangeably) without loss of generality.
The new methods and systems described herein provide numerous advantages and benefits as well as improvements in the ability to provide patient-specific recommendations of therapies for atherosclerotic cardiovascular disease.
The number of people with atherosclerosis is very high. Most patients are unaware of their disease progression until onset of symptoms. Risk management of patients is largely dependent on population-based scoring methods such as Framingham Risk Score (Newby et al., Coronary CT Angiography and 5-Year Risk of Myocardial Infarction. N Engl J Med, 2018. 379 (10): p. 924-933; Bergstrom et al., The Swedish CArdioPulmonary BioImage Study: objectives and design. J Intern Med, 2015. 278 (6): p. 645-59) and development of diagnostics for more precise patient categorization is warranted. As treatment options for patients with CVD have become available, stratifying patients increasingly needs to be based on per-patient rather than population-based risk factors/scoring or simplistic imaging methods. For example, accessing a degree of stenosis, calcium scoring, or even fractional flow reserve (FFR) are not sufficiently specific for determining individual patient disease category at a level necessary to identify what treatment will best serve them, that is, to select among waiting, pharmacotherapy, procedural intervention, surgery, or a specific treatment within one of these categories. This is important economically as well as clinically, because recent advances in pharmaceuticals targeting specific mechanisms with increasing efficacy are generally more expensive than earlier generation drugs such as statins and are too expensive for use in broad populations. These new drugs are also not necessarily the best therapy for all patients and the present methods and systems can be used to match the right patients with the best therapies.
One current difficulty is that an ability to measure a response to a specific drug therapy remains elusive, and both under-treatment as well as over-treatment remain common problems, which can result in high numbers of patients that are needlessly treated while at the same time consuming financial resources and causing patients to go through needlessly invasive procedures for the results obtained. Likewise, to the extent that methods are proposed to assess vulnerable plaque, there remains the issue that just because a vulnerable plaque can be found, the causes for it are systemic rather than focal; often resulting in focal treatment being mismatched with the actual cause of the plaque, which can rather warrant systemic treatment. The concept of the “vulnerable patient” has been discussed, but we need markers to identify such individuals, and we need the ability to categorize the specific mechanism causing their vulnerability at an individual level, if we are to make demonstrable improvements in outcomes for given societal cost, for example by tailored therapeutics. Each of these needs and opportunities presents a challenge to the methods that have been developed so far, but are addressed by the methods and systems described herein.
The present disclosure fills gaps in understanding the extent and rate of progression of atherosclerosis under differing potential treatment alternatives. Advanced software-based techniques to extract data embedded in images, which are otherwise not readily appreciated visually or quantitatively, provide biomarkers to identify patients with unstable atherosclerosis and imaging to localize unstable atherosclerotic plaques, and provide more accurate characterizations extending from clinical care to developing drugs that are more effective for patients at risk of ischemic events.
The new methods and systems described herein provide outcome and cost improvements including improved noninvasive diagnostics to identify which patients have progressing disease, and the ability to provide automated recommendations of the best therapy or combination therapy for each specific patient based on simulations of how a specific therapy is likely to affect the specific patient and how the patient will respond given a specific therapy. The methods and systems can also be used to select or modulate doses of specific medications based on simulated patient responses, as well as be used to simulate the effects of new drug candidates, i.e., virtual clinical trials.
The presently described virtual biomarkers can go beyond indicating that there is a problem, to categorizing patients specifically as to the most effective way to treat the problem. Moreover, manifestations of conditions are considered both in terms of dynamic insufficiency (e.g., stress-induced ischemia of perfused tissues) as well as disruptive events such as thrombosis and rupture (i.e., causing infarction). Plasma biomarkers serve an important role as a screening tool, but by themselves are neither as sensitive nor as specific as knowing what is happening within a tissue, e.g., within and surrounding a plaque (i.e., the transcriptomics and proteomics of the tissue and the blood).
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Methods and materials are described herein for use in the present invention; other, suitable methods and materials known in the art can also be used. The materials, methods, and examples are illustrative only and not intended to be limiting. All publications, patent applications, patents, sequences, database entries, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control.
Other features and advantages of the invention will be apparent from the following detailed description and figures, and from the claims.
The methods and systems described herein not only characterize atherosclerosis in terms of morphology and stability based on non-invasively obtained data, e.g., non-invasive imaging data, of a patient's arteries (using, e.g., CT angiography), but further provide therapy recommendations for individual patients, based on the nature and stability of their plaques, all using only non-invasively obtained data from the patient, e.g., imaging data, such as arterial imaging data. For instance, by obtaining genotypic and/or phenotypic information (i.e., through virtual 'omics modeling, or based on actual measurements) for a given patient, the new methods and systems described herein can be used to model a patient's expected response to various therapies, including medicinal/pharmaceutical and interventional or procedural therapies, to recommend the therapy that is predicted to provide a superior outcome for that specific patient.
Diagnostic accuracy is improved as the morphological and biological features of atherosclerotic plaques can be determined by non-invasive imaging. To do this, we have established a quantitative linkage between scales. Specifically, as shown in
The need for the new methods and systems is clear. Myocardial infarction (MI) and ischemic stroke (IS), major consequences of unstable atherosclerotic lesions, are the most common causes of death worldwide. However, any recommendations available for the prevention of MI and IS are currently based only on treatment efficacy at the group level, and practical means to tailor treatment for individual patients are a presently not available. To date, personalized treatment strategies for atherosclerotic cardiovascular disease (CVD) have not been possible. Other adverse outcomes from atherosclerosis include, without loss of generality, claudication, amputation, and various presentations of aorta disease such as aneurysm.
In the setting of CVD, we have used existing biobanks containing detailed disease-specific information at varying morphological and molecular scales to create dedicated in silico systems biology models, with applications including evaluation of drug side effects, consideration of drug combinations, and modelling of the effect of drugs and procedural interventions on a specific patient. The ability to identify in advance, whether an individual patient may or may not respond to a drug has a strong value. Our inclusion of broad molecular pathway analysis provides an advantage by addressing fundamental complexities needed for many clinical scenarios, when measurements of molecular species in plasma or tissue biopsies are not possible.
Incorporating molecular pathway analysis into an in silico setting however requires appreciation of the numerous structural and biological features that characterize the unstable atheroma, where a number of different pathways interleave in a complex set of interactions. For example: collagen fibers confer structural stability (World Health Organization (WHO). Cardiovascular diseases (CVDs) Fact Sheet, see, who.int/en/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) (2017)); collagen degradation the converse (Lambin et al., Radiomics: the bridge between medical imaging and personalized medicine. Nature Reviews Clinical Oncology 14, 749-762, doi:10.1038/nrclinonc.2017.141 (2017)). Reductions in atherogenic lipoproteins resulting from phospholipid and cholesterol efflux improve stability (Lee et al., Radiomics and imaging genomics in precision medicine. Precision and Future Medicine 1, 10-31 (2017)); endothelial to mesenchymal transition can influence tissue structure with both stabilizing and destabilizing effects (Buckler et al., Virtual Transcriptomics: Non-Invasive Phenotyping of Atherosclerosis by Decoding Plaque Biology From Computed Tomography Angiography Imaging. Arteriosclerosis, thrombosis, and vascular biology, Atvbaha121315969, doi:10.1161/atvbaha.121.315969 (2021); Peyvandipour et al., Novel computational approach for drug repurposing using systems biology. Bioinformatics 34, 2817-2825 (2018)); and perivascular adipose tissue has been suggested to increase plaque inflammation (Nguyen et al., Identifying significantly impacted pathways: a comprehensive review and assessment. Genome biology 20, 1-15 (2019); Réda et al., Machine learning applications in drug development. Computational and structural biotechnology journal 18, 241-252 (2020); Pai et al., netDx: interpretable patient classification using integrated patient similarity networks. Molecular systems biology 15, e8497 (2019)), resulting in atherothrombosis, MI, or IS (Adam et al., Machine learning approaches to drug response prediction: challenges and recent progress. NPJ precision oncology 4, 1-10 (2020)).
According to the present disclosure, given the complexity and multifactorial biology of atherosclerosis, comprehensive disease modelling as presented herein required the consideration of more complete biological networks than have been reported to date. To capture sufficiently granular information including prediction of disease-critical biological responses to different drugs, we included biological processes represented by pathway networks of molecular interactions essential for disease progression.
In the present disclosure, we describe comprehensive in silico systems biology models of atherosclerosis using curated networks of molecular pathways to effectively describe and predict unstable disease. Using molecular data from plaque specimens from test subjects, we incorporated disease-specific pathways across multiple cell types to develop an integrated in silico systems biology model, we can then use this calibrated in silico systems biology model to make therapy recommendations for individual patients. We evaluated the potential of the model by simulating the effects of different pharmacological treatments on molecular processes relevant for stabilization of atherosclerotic lesions, effectively predicting personalized pharmacological effects and highlighting a potential for clinical utility and tailored therapy for prevention or inhibition of adverse events such as MI and IS.
The present disclosure also provides systems and methods of using these models to provide patient-specific therapy recommendations for individual patients based only on non-invasive arterial imaging data.
I. Methods of Obtaining Phenotypic/Endotypic/Theratypic Data Based on Virtual 'Omics Modeling
Information about prevalent biological processes that are related to plaque characterization and stability can be obtained non-invasively through virtual 'omics methods. Briefly, methods include receiving a non-invasively obtained imaging dataset for an atherosclerotic plaque from a subject; processing the non-invasively obtained imaging dataset to obtain quantitative plaque morphology data; processing the quantitative plaque morphology data with a virtual expression model to obtain estimated protein and/or gene expression data for the plaque from the subject; and generating phenotypic data for the atherosclerotic plaque from the subject based on the molecular data.
The phenotypic data refers to the set of observable characteristics of an individual patient, test subject, or development subjects, resulting from the interaction of their genotype with the environment. In particular, the phenotypic data can include endotypic data, which relates to a subtype of a disease condition that is defined by a distinct pathophysiological mechanism, and/or theratypic data, which is used to group patients or test subjects according to their response to specific therapeutic alternatives.
Non-Invasively Obtained DataThe first step in obtaining patient or subject data for the methods and systems described herein is to obtain data non-invasively. For example, that data can be imaging data, i.e., image(s) of the plaques in arteries, and can be obtained by various methods that are well known in the art. In some embodiments the imaging dataset is obtained by radiological methods. For instance, any of the following can be employed: computed tomography (CT), dual energy computed tomography (DECT), spectral computed tomography (spectral CT), computed tomography angiography (CTA), cardiac computed tomography angiography (CCTA), magnetic resonance imaging (MRI), multi-contrast magnetic resonance imaging (multi-contrast MM), ultrasound (US), positron emission tomography (PET), intra-vascular ultrasound (IVUS), optical coherence tomography (OCT), near-infrared radiation spectroscopy (NIRS), or single-photon emission tomography (SPECT). In a particular embodiment, CTA is utilized.
For example, in one embodiment, CTA can be performed as a pre-operative routine procedure in the hospital using site-specific image acquisition protocols. CTA exams can be performed with 100 or 120 kVp, variation of CTDIvol16 cm between 13.9 and 36.9 mGy or CTDIvol32 cm 7.9-28.3 mGy. Contrast injection rates and amounts followed by a saline chaser can be used as required. In general, a caudocranial scanning direction can be selected from the aortic arch to the vertex, using intravenous contrast. An axial image reconstruction of about 0.5 to about 1.0 mm, e.g., 0.65 mm, 0.9 mm, or 1.0 mm can be used, and transferred into a digital workstation for vascular CTA image analysis.
Variations of these examples of non-invasive imaging are contemplated and could be used by those of skill in the art.
Tissue ModelsData, such as imaging data obtained from the non-invasive imaging methods described herein are loaded into an image processing software, e.g., ElucidVivo® (Elucid Bioimaging Inc., Boston, MA) software, which outlines (segments) the luminal and outer wall surfaces of the common, internal, and external arteries to provide quantitative plaque morphology data. See also, U.S. Pat. Nos. 10,176,408, 10,740,880, 11,094,058, and 11,087,460, each of which is incorporated herein by reference. Specifically, the software creates fully 3-dimensional segmentations of lumen, wall, and each tissue type at an effective resolution ≈3× higher than the reconstructed voxel size with improved soft tissue plaque component differentiation relative to manual inspection. The common and internal artery are defined as a target with lumen and wall evaluated automatically and, when needed, edited manually.
The software provides vessel structure measurements including the degree of stenosis (calculated both by area or diameter), wall thickness (distance between the lumen boundary to outer vessel wall boundary), and remodeling index (the ratio of vessel area with plaque to a vessel area without plaque used as reference). Investigations in animal models and histological analyses of human plaque lesions have characterized distinct, but common, structural and biological tissue characteristics such as enhanced inflammation, accumulation of a large lipid-rich and necrotic central core (LRNC), intra-plaque hemorrhage (IPH), a thin and rupture-prone fibrous cap from extracellular matrix (ECM) degradation, apoptosis of smooth muscle cells (SMCs), level of calcification (CALC), matrix/fibrous tissue (MATX), and fibrous cap/perivascular adipose tissue (FC/PVAT).
The software includes algorithms to decrease blur caused by image formation in the scanner. A patient-specific 3-dimensional point spread function is adaptively determined so that image intensities are restored to represent the original materials imaged more closely, which mitigates artefacts such as calcium blooming, and enables discrimination of less prominent tissue types. In particular, the image restoration is undertaken in concert with tissue characterization based on expert-annotated histology (which includes both proteome and transcriptome information), e.g., as described in U.S. Pat. Nos. 10,176,408, 10,740,880, 11,094,058, and 11,087,460, each of which is incorporated herein by reference.
As shown in
Processing of the CTA images allows for multiple objectively validated measurements to be made, thereby permitting the characterization of plaque morphology by CTA analysis software. These assessments included structural anatomy (“structure”) and tissue characterization (“composition”) as shown in
The overlapping densities of tissues such as LRNC and IPH, for example, necessitate a method for accurate classification. To avoid limitations of conventional analysis of CTA utilizing fixed thresholds, the accuracy required for elucidating molecular pathways was achieved by algorithms that account for distributions of tissue constituents rather than assuming constant material density ranges. In this way, the software makes mathematical judgments to interpret the Hounsfield units (HU) of adjacent voxels by maximizing criteria that mimic expert annotation at microscopy, simultaneously mitigating variation between scanners, reconstruction kernels, and contrast levels. In this way, the software fundamentally addresses subjectivity intrinsic to other analysis methods.
Processing the non-invasively obtained image data with the software provides output information relating to quantitative plaque morphology, such as structural anatomy data and tissue composition data. For example, structural anatomy data includes measuring any one or more of the following in the lumen and wall: remodeling, wall thickening, ulceration, stenosis, dilation, plaque burden, or any of the measurands listed in the Table 1 below.
As outlined in Table 1, vessel structure measurements include the degree of stenosis (calculated both by area or diameter), wall thickness (distance between the lumen boundary to outer vessel wall boundary), and remodeling index (the ratio of vessel area with plaque to a vessel area without plaque used as reference).
Tissue composition data includes calcification (CALC), lipid-rich necrotic core plaque (LRNC), intra-plaque hemorrhage (IPH), and matrix/fibrous tissue (MATX), see Table 2 below.
Volume measurements, either in place of or additive to area measurements can also be utilized. Likewise, various forms of spatially labelled data that represent these can also be used. These specific tissues types are provided as examples without loss of generality.
As described in further detail below, the virtual 'omics models are built from a variety of machine learning models. Briefly, any of several methods, devices, and/or other features are used to perform a specific informational task (such as classification or regression) using a number of examples of data of a given form, and are then capable of exercising this same task on unknown data of the same type and form from a new patient or subject. The machine (e.g., a computer or processor) will “learn,” for example, by identifying patterns, categories, statistical relationships, etc., exhibited by training data. The result of the learning is then used to predict whether new data exhibits the same patterns, categories, and statistical relationships.
Examples of such models include neural networks, support vector machines (SVMs), decision trees, hidden Markov models, Bayesian networks, Gram Schmidt models, reinforcement-based learning, genetic algorithms, and cluster-based learning. Multiple can be used to create the pool of trained machines from which the choice is made. These can include methods of feature selection and reduction, ranking of features, random generation of feature sets, correlations among features, PCA (Principal Component Analysis), ICA (Individual Component Analysis), parameter variation, and any methods known to those skilled in the art.
Supervised learning occurs when training data is labelled to reflect the “correct” result, i.e., that the data belongs to a certain class or exhibits a pattern. Supervised learning techniques include neural networks, SVMs, decision trees, hidden Markov models, Bayesian networks, etc. Test data sets encompassing known class(es) can be used to determine if a trained learning machine is able to identify patterns in data and/or classify data. The test data set is preferably generated independently from the training data set. Training Data sets (of known or unknown classes) are used to train a learning machine. Regardless of whether the class of the data is known or unknown, the data can be adequate for training a learning machine. Unsupervised learning occurs when training data is not labelled to reflect the “correct” result, i.e., there is no indication within the data itself as to whether the data belongs to a class or exhibits a pattern. Unsupervised learning techniques include Gram Schmidt, reinforcement-based learning, cluster-based learning, etc.
Thus, certain embodiments of the present invention can utilize machine learning methods and/or deep learning methods, although these methods are not always required in all embodiments.
In one embodiment, one or more neural network(s) can be generated and/or updated with virtual 'omics from vascular CT images processed as described in
This method overcomes two problems. First, the quantity of annotated data, required for training, is both low and high dimensional of the CT image volume. The present disclosure exploits the dimensionality reduction provided by the Virtual Tissue Models, which also provide an opportunity for objective validation. We also leverage the large quantity of unlabeled vessels which is enabled from the use of this validated image processing step, from which the virtual 'omics networks can learn a rich representation of vessel structure in a semi- or self-supervised manner. Second, the output has high dimensionality. We address this by employing a neural architecture that constructs a common representation of the input, which is shared across the components which predict individual 'omics levels.
In another embodiment, one or more deep learning network(s) can be used for adverse event prediction and/or drug interaction effects. The common representation described herein can be imported into a new model, which will use the features it provides to predict adverse events directly or after further fine-tuning with labelled data. These features can also be fused with numerical predictions from the systems biology model to estimate drug interaction effects.
In another embodiment, neural networks can be used to implement portions or the whole of the therapy effect simulations, noting that portions of the systems biology model itself may be differentiable. Reaction kinetics networks are comprised fundamentally of systems of coupled ODEs and PDEs which may be implemented within neural networks to enable speedups in both model training and model inference. Neural networks can be employed to find favorable initializations of such reaction networks to allow optimal solutions efficiently.
Generating Phenotypic, Endotypic, and/or Theratypic Data for Atherosclerotic Plaques
The quantitative plaque morphology data (which relates, e.g., to the profile, characterization, type of plaque) received from the processing of CTA images, as described in the section “Tissue Models” above, is processed against one or more virtual proteomic/transcriptomic models, as described above, to obtain estimated/predicted gene expression and/or protein level data for the plaque from the subject. In other words, the tissue models are further processed against known gene-expression and/or known protein levels patterns (that is, the tissue models based on the imaging data are correlated to gene-expression and/or protein levels patterns) to generate a predicted 'omics model.
The predicted 'omics model then, in turn, allows the clinician to predict which 1) gene transcript levels are likely elevated and which gene levels are likely decreased in the plaque and/or 2) protein levels are likely elevated and which protein levels are likely decreased in the plaque. 'Omics levels (elevated/decreased/unchanged) are in reference to a non-atherosclerotic patient. As a result, this data provides information about the mechanisms related to plaque pathophysiology, plaque instability, or other relevant biological insight, thereby generating phenotypic, endotypic, and/or theratypic data for the atherosclerotic plaque from the subject.
II. Methods of Generating an in Silico Systems Biology Models Generating and Training an in Silico Systems Biology Model
The in silico systems biology model is initially generated or trained with two types of data. First, we use experimentally determined data from biological specimens from development subjects. Development subjects are people for whom actual proteomic data is available that shows differentially expressed protein levels that are linked to the specific characteristics and morphology of the plaques in each of those subjects. Second, we use results from searches of public literature, experimental results, and/or other databases to find journal articles and the like to obtain detailed information about the proteins in the model. These two sources of data are used to create the initial model.
An example of a mathematical framework for multi-scale analysis is shown below:
The function y(t) refers to a phenotype y at time t. Function x is the cellular and molecular level at time t, and z represents the patient-level outcome or state at time t. The present disclosure provides systems of equations, or non-linear models, ƒ and g, where ƒ decreases in scale and g increases in scale. One example of function ƒ is a predictive modeling paradigm, where y can be expressed as scalar, vector, or multidimensional data as pictured, to derive expression profiles, protein concentrations, or other lower level information. One example of function g can also be a predictive model, but a different model than ƒ, one that increases in scale. Inverse functions for ƒ and g can also be derived.
Further detail is given in
The virtual 'omics models themselves can be utilized, without loss of generality, for example, as follows. All or a selection of probes from a microarray, or species from mass spectrometry, or other assay method for obtaining so-called 'omics data, can be selected. Single as well as multiple variable regression models covering linear and non-linear modelling techniques can be performed on predictor sets constructed from a development cohort including plaque morphology, demographics, clinical (laboratory) values, and/or other variables, in part to recognize that clinical factors can affect the expression data or models, and to inspect what is the added value of morphology over clinical and demographic data, and to identify when morphology and other variables have independent information content, different predictor sets can be used, some only using plaque morphology, but others also using lab values, demographic, and other values in composite models. Each model result can be output and tabulated to identify the highest-achieved performance on a species-by-species basis.
Predictive performance can be determined based on the accuracy of the prediction relative to the true or reference values. Models can be built with variations, for example, differing sets of morphological measurements according to hypothesized physiological rationale, automated optimization using for example cross validation while simultaneously varying tuning parameter values; and/or, partitioning data such that a training set on which the cross-validation was performed was strictly separated from a sequestered validation data set to test performance using locked-down models. Use of histologically validated plaque features, for example, can produce interpretable models, and when coupled with cross-validation, can mitigate overfitting.
Supervised model quality (MQ) can be determined, by way of example, but not only by this method, as the product of two measures for each model type. MQ for continuous estimation models was computed as the product of concordance correlation coefficient (CCC) and regression slope of predicted vs. observed for continuous value estimation (the former to measure the tightness of fit, but augmented by the latter to ensure proportional prediction relative to observed). MQ for dichotomized categorical prediction models was computed as the product of area under the receiver characteristic curve (AUC) times Kappa for dichotomized prediction (the former to measure the net classification performance, but augmented by the latter to ensure performance in both high and low expression classes).
The forgoing can be implemented using deep learning networks of various network topologies, and using either raw imagery, or enriched images identified with tissue type annotations, and/or that result from spatial normalizations such as, but not limited to, unwrapping.
Recognizing the existence of the various virtual 'omics processing steps, the present disclosure builds on that base with further steps that provide further utility. For example, models of the complex biological behavior sometimes referred to as pathways or cell signaling networks are described with mathematical formalisms using differential equations or other mathematical formalisms that capture behavior such as mass transfer, reaction dynamics that stem from enzymes, various inhibitory processes, and other approximations to biochemical reactions/relations.
In general, the numeric variables identified are descriptive of the behavior expected in groups of patients or animals, that is, in general, they are not applicable to a specific individual; but they do provide a structure and calibration levels for patient groups. One example embodiment is diagrammed in
On the right side of
The present disclosure moves beyond patient groups to provide the facility to reach individual patient-level results. As shown in
Specifically, first, as shown in
The initial model is then updated with calibration data, such as 'omics data, from test subjects to validate and refine the initial model. The calibration data is again based on actual biological samples that show differentially expressed protein and/or transcription levels that are linked to the specific characteristics and morphology of the plaques in each of those test subjects. This update of the initial model provides a calibrated model. This step confirms that the model works as intended and also augments and renders the model more robust, given the data from many test subjects.
Test data (e.g., from test subjects), are fed forward to obtain information about the plaque morphology as well as to obtain estimated gene and/or protein measurements (see right side of
More particularly, information obtained from CTA imaging is input into an in silico systems biology model, which is a (set of) network(s) characterizing atherosclerotic cardiovascular disease, where the (each) network includes nodes (each node representing a different protein) and edges between a pair of nodes (each edge representing protein-protein interactions in a given cell type, including “self-edges” as means to represent the transcription/translation process). Each node in the network has information representing a protein level, which can be calibrated based on data (e.g., computed tomography angiograph imaging data of a plaque and proteomics data) from multiple test subjects.
Using the Calibrated in Silico Systems Biology ModelThen in operation, the calibrated model is again updated, but now with patient-specific personalized data based on imaging of the patient's plaque without the need to perform an invasive blood test or biopsy. The calibrated model is also updated with the predicted effects of two or more different therapies. The methods and systems described herein use the patient's imaging data to provide a therapy recommendation based upon an automated comparison of the two or more different therapies whose predicted effects are programmed into the model.
For example, once the initial in silico systems biology model has been calibrated, the biological pathways contained in the in silico systems biology model can then be manipulated based on various drug mechanism of actions and the end outcome of treating a patient with a particular drug can be simulated. Ultimately, a patient's likelihood of survival can also be estimated based on the drug simulations and the system automatically provides a therapy recommendation as described in further detail below.
III. Systems for Generating in Silico Systems Biology Models for Atherosclerosis
Given the above, here we disclose one example of a system to generate such a systems biology model.
The input device 340 is configured to obtain pathway data 302a and test subject data 302b and provide the pathway data 302a and the test subject data 302b to another device across a network 320. The pathway data 302a include biological pathways (e.g., pathway names, identifiers) associated with atherosclerotic cardiovascular disease. The test subject data 302b include data (e.g., computed tomography angiograph imaging of a plague, proteomics, transcriptomics) from multiple test subjects who have been diagnosed with atherosclerotic cardiovascular disease. For example, the input device 340 can include a server 340a that is configured to obtain the pathway data 302a from a pathway database. In some implementations, the one or more other input devices can access the test subject data 302b obtained by the server 340a and transmit the obtained test subject data 302b to the computer 330 via the network 320. The network 320 represents a computer network (unlike biological networks such as a first network 306 and a second network 314) and can include one or more of a wired Ethernet network, a wired optical network, a wireless WiFi network, a LAN, a WAN, a Bluetooth network, a cellular network, the Internet, or other suitable network, or any combination thereof.
The computer 330 is configured to obtain the pathway data 302a and the test subject data 302b from the input device 340 and generate an in silico systems biology model of the disease represented by a network. In some implementations, the computer 330 stores the pathway data 302a and the test subject data 302b in a database 332 and access the database 332 to retrieve desired datasets. The database 332, such as a local database or a cloud-based database, can store the pathway data 302a, the test subject data 302b, a first network 306, a second network 314, or other suitable data.
In some implementations, the pathway data 302a is obtained from differential expression analysis. Each pathway in the pathway data 302a includes at least one differentially expressed molecule. For example, the computer 330 obtains first molecular expression data (e.g., gene expression data, protein expression data) of a first set of test subjects who have been diagnosed with the atherosclerotic cardiovascular disease and second molecular expression data of a second set of test subjects who do not have the atherosclerotic cardiovascular disease. Differential expression analysis identifies molecules, e.g., RNA, genes or proteins, that are differentially expressed between these two sets of test subjects. The gene expression data is obtained from microarray, RNA sequencing, single cell RNA sequencing, or reverse transcriptase PCR. Without loss of generality, protein levels can be measured by liquid chromatography mass spectrometry (LC-MS or LS-MS/MS, for example).
The network generation engine 304 is configured to define/train a systems biology model by receiving publicly available and/or experimentally determined data such as pathway data 302a and generating a first network 306. The first network 306 (also referred to as an initial or baseline network) characterizes a baseline of the disease, as the network is not yet calibrated using the test subject data 302b. In some implementations, the first network 306 is a data structure that represents nodes, edges between nodes, and information (e.g., a protein level) contained in each node. In some implementations, pathway data 302a are obtained from findings from scholarly literatures.
The network generation engine 304 can perform one or more tasks such as protein isolation by cell type 304a, pruning network 304b, compartmentalization 304c, and creating intima network 304d. The protein isolation by cell type 304a identifies a cell type in which each protein-protein interaction occurs. Referring to
The compartmentalization 304c aims to localize proteins by assigning a compartment to each protein, where the compartment includes an intracellular of each cell type (e.g., intracellular of VSMC), a cell membrane, an extracellular space, and a compartment for blood.
Creating the intima network 304d generates an intima network that represents topologically accurate plasma interfaces, as the intima network accounts for topological relationships between compartments. The resulting intima network is referred to as the first network 306. The first network 306 includes baseline levels of proteins. We note that other integrated networks, such as for the adventitia, media, or perivascular space can also be used without loss of generality.
The virtual 'omics engine 310 is configured to receive the test subject data 302b and generate virtual 'omics data 312. The test subject data 302b include computed tomography angiograph (CTA) imaging data of a plaque from the test subject, plaque morphology data, and proteomics data corresponding to the test subject. As shown in
The network calibration engine 308 is configured to receive the first network 306 and the virtual 'omics data 312 and generate a second network 314. The second network 314, updated from the first network 306 using virtual 'omics data 312 derived from the test subject data 302b, includes a disease-associated protein level for each protein in the second network. In some implementations, measured 'omics data, in addition to or instead of the virtual 'omics data, are used to update the first network. To update the first network, the network calibration engine 308 first identifies disease-associated protein levels for a set of proteins whose disease-associated protein levels are known from the virtual 'omics data 312. For proteins whose disease-associated protein levels are unknown, the network calibration engine 308 iteratively estimates a disease-associated protein level for a protein based on the protein's adjacent nodes in the first network. After disease-associated protein levels of all proteins in the first network are found (either from the virtual 'omics data 312 or estimation), the network calibration engine 308 outputs the second network 314. The computer 330 can store the second network in the database 332.
The computer 330 can generate rendering data that, when rendered by a device having a display such as a user device 350 (e.g., a computer having a monitor 350a, a mobile computing device such as a smart phone 350b, or another suitable user device), can cause the device to output data including the first network 306 and the second network 314. Such rendering data can be transmitted by the computer 330 to the user device 350 through the network 320 and processed by the user device 350 or associated processor to generate output data for display on the user device 350. In some implementations, the user device 350 can be coupled to the computer 330. In such instances, the rendered data can be processed by the computer 330 and can cause the computer 330, on a user interface, to output data, e.g., visualizing the second network 314.
The system obtains multiple first inputs indicative of biological pathways associated with the atherosclerotic cardiovascular disease (402). For example, the system queries a pathway database (e.g., the Kyoto Encyclopedia of Genes and Genomes (KEGG)) to identify biological pathways associated with the atherosclerotic cardiovascular disease. In some implementations, each pathway in the biological pathways includes at least one differentially expressed molecule.
To identify molecules that are differentially expressed, the system obtains first molecular expression data of a first set of test subjects who have been diagnosed with the atherosclerotic cardiovascular disease and second molecular expression data of a second set of test subjects who do not have the atherosclerotic cardiovascular disease. The system performs differential expression analysis on the first and the second molecular expression data and identifies molecules that are differentially expressed. In some implementations, the first and the second molecular expression data are gene expression data. In some implementations, the first and the second molecular expression data are protein expression data.
The system generates, based on the first inputs, a first network (404). The first network includes nodes representing baseline levels of proteins and edges representing protein-protein interactions in one or more cell types. The first network includes proteins, genes, mRNA, nutrients, cellular events, external signals, or combinations thereof found in the biological pathways. The system represents the proteins in the multiple first inputs as the nodes in a graph (also referred to as a state graph), initializes a baseline level for each of the proteins, represents the protein-protein interactions as the edges in the graph, and outputs the graph as the first network. The baseline level indicates a state of the node. The one or more cell types are associated with the atherosclerotic cardiovascular disease. In some implementations, the one or more cell types include cell types that include at least one protein whose level is altered by the atherosclerotic cardiovascular disease. The one or more cell types can include, for example endothelial cells, vascular smooth muscle cells, macrophages, and lymphocytes. Other cell types can be included without loss of generality.
In some implementations, each of the edges in the first network is directed with a weight, where directed edges indicate a direction of the protein-protein interaction, e.g., a molecule A activating a molecule B. The weight can indicate a type of the protein-protein interaction, e.g., activation, inhibition, dissociation, methylation, glycosylation, translation, repression, degradation, etc. The weight is positive for activation and translation. The weight is negative for inhibition, repression, and degradation. The edges in the first network can have information indicative of a dependency condition: a molecule A interacts with a molecule B under a certain condition, e.g., the baseline level of the molecule B meets a threshold. The first network can be displayed in a graphical form on a user interface, e.g., using cytoscape.
The first network includes (i) a “core network” representing protein-protein interactions unique to each respective cell type, (ii) a “mid network” representing protein-protein interactions that occur in multiple cell types, but not all cell types, and (iii) a “full network” representing protein-protein interactions that occur in all cell types. The edges represent protein-protein interactions representing any one of different types of interactions including, for example, activation, inhibition, indirect effect, state change, binding, dissociation, phosphorylation, dephosphorylation, glycosylation, ubiquitination, and/or methylation.
The system can separately calibrate the core network, the mid network, and the full network by using the second inputs to generate a calibrated sub-network. After calibration, the system generates the second network that includes the calibrated sub-networks. In particular, the protein-protein interaction of an ith molecule with a jth molecule is represented as Σjw(j,i)*sj(t−d(j,i)), wherein w(j,i) is a weight of the edge between the ith molecule and the jth molecule, sj is a baseline level of the ith molecule, t is a time step, and d(j,i) is a delay of the edge between the ith molecule and the jth molecule. The delay of the edge indicates the time step required for the protein-protein interaction to be effected.
The system obtains second inputs indicative of calibration data from multiple test subjects who have been diagnosed with the atherosclerotic cardiovascular disease (406). The second inputs include non-invasively obtained data, such as imaging data, for each test subject, of a plaque from the test subject, morphology data obtained from the plaque, and proteomics data corresponding to the plaque.
The imaging data can be obtained by computed tomography (CT), dual energy computed tomography (DECT), spectral computed tomography (spectral CT), computed tomography angiography (CTA), cardiac computed tomography angiography (CCTA), magnetic resonance imaging (MM), multi-contrast magnetic resonance imaging (multi-contrast MRI), ultrasound (US), positron emission tomography (PET), intra-vascular ultrasound (IVUS), optical coherence tomography (OCT), near-infrared radiation spectroscopy (NIRS), or single-photon emission tomography (SPECT) diagnostic images or any combination thereof.
In case that the proteomics data are not available, or in addition to the proteomics data, the system can obtain transcriptomics data. In some implementations, the system obtains, for at least some of the test subjects, transcriptomics data. The transcriptomics data is obtained by microarray, RNA sequencing (RNA-seq), single cell RNA sequencing (scRNA-seq), reverse transcriptase PCR (RT-PCR), or any combination thereof. In some implementations, the system obtains, for at least some of the test subjects, proteomics data, e.g., protein levels obtained from protein mass spectrometry. In some implementations, the system obtains, for at least some of the test subjects, liquid chromatography-mass spectrometry data of various molecules.
For the case where 'omics data are obtained, the first network includes nodes representing baseline levels of proteins and genes, and edges representing protein-protein interactions, gene-gene interactions, and protein-gene interactions in the one or more cell types.
The system determines, from the second inputs, a disease-associated protein level for proteins in the first network (408). The disease-associated protein level of a specific protein corresponds to one or more of a measured protein level from tissue samples from the test subjects, an estimated protein level based on one or more virtual 'omics models of the test subjects, or a protein level corresponding to non-invasively obtained imaging data from the test subjects. In different embodiments, the specific proteins can be one or more of lipopolysaccharide-binding protein (LBP), integrin subunit alpha 2b (ITGA2B), toll like receptor 4 (TLR4), lipocalin 2 (LCN2), S100 calcium binding protein A8 (S100A8), S100 calcium binding protein A9 (S100A9), cyclin dependent kinase inhibitor 1A (CDKN1A), matrix metallopeptidase 1 (MMP1), receptor for advanced glycation end products (RAGE), heme oxygenase 1 (HMOX1), SMAD family member 2 (SMAD2), and coagulation factor VIII (F8). Many other molecular species are utilized by the invention, without loss of generality; these are given by way of example rather than being considered definitive or limiting.
The system identifies disease-associated protein levels for a set of proteins from the second inputs, where the disease-associated protein levels of the set of proteins are obtained from the second inputs from the test subjects. The system estimates, for proteins in the first network other than the set of proteins, a disease-associated protein level based on the disease-associated protein levels of a subset of the set of proteins, where the subset of the set of proteins are represented by adjacent nodes in the first network.
The system generates, based on the first network and the disease-associated protein levels, a second network (410). The second network, an updated network from the first network using the second inputs, represents a calibrated in silico systems biology model of the atherosclerotic cardiovascular disease and includes the disease-associated protein level for each protein in the second network. To generate the second network, the system identifies a disease-associated protein level for each node whose disease-associated protein level is obtained from the calibration data from the test subjects; and identifies a disease-associated protein level for each node whose disease-associated protein level is estimated.
IV. Methods and Systems for Predicting Suitable Therapeutic/Treatment Plans for Specific Patients
In general, various therapies, e.g., pharmacotherapies and/or procedural interventions, can be used for the treatment of cardiovascular diseases, such as atherosclerosis. The in silico systems biology models described herein can simulate how an actual patient will react to a particular therapy (i.e., will the therapy have a beneficial effect, and if so, to what extent) based on the mechanism of action of that specific pharmacotherapy. Provided below are examples of embodiments of how personalized therapeutic treatment plans can be simulated in the in silico systems biology model described herein by manipulating/physically changing the levels of certain molecules, e.g., RNA, DNA, or all or parts of genes or proteins, in the models based on the mechanism of action of the therapy, e.g., pharmacotherapy. Accordingly, this disclosure provides methods of simulating a therapeutic response in an actual patient by modulating levels of specific gene transcripts and/or protein levels in the in silico systems biology models described herein.
The virtual 'omics engine 310 in
The network calibration engine is configured to receive the virtual 'omics data 312 (e.g., predicted protein levels of the patient, based on the CTA imaging dataset) and the first network 306 and generate a second network 314. The first network 306 is a trained in silico systems biology model of atherosclerotic cardiovascular disease, as described referring to
The therapeutic response simulation engine 316 simulates a therapeutic response for each potential therapy in the second network, the trained in silico systems biology model calibrated for a given patient. The therapeutic response simulation engine 316 determines a known set of molecules affected by the potential therapy, e.g., based on published scientific discoveries pertaining to mechanism of action and defines one or more therapeutic effect molecule level for each molecule in the known set of molecules (e.g., proteins, genes), e.g., based on known mechanisms of action of the potential therapy. The therapeutic response simulation engine 316 estimates a therapeutic effect molecule level based on a simulated effect of the defined therapeutic effect molecule levels and compares the defined and estimated therapeutic effect molecule levels in the second network, before and after the therapeutic response simulation for each potential therapy. As an output of the therapeutic response simulation engine 316, a therapeutic recommendation 318, a report indicating the preferred therapy for the patient, is generated. The therapeutic recommendation 318 is sent to the user device 350, e.g., the monitor 350a and the smartphone 350b. The therapeutic recommendation 318 can be stored in the database 332, for the computer 330 to access to retrieve.
The system receives a non-invasively obtained imaging data of a plaque from the patient (452). The non-invasively obtained imaging data is obtained by computed tomography (CT), dual energy computed tomography (DECT), spectral computed tomography (spectral CT), computed tomography angiography (CTA), cardiac computed tomography angiography (CCTA), magnetic resonance imaging (MRI), multi-contrast magnetic resonance imaging (multi-contrast MM), ultrasound (US), positron emission tomography (PET), intra-vascular ultrasound (IVUS), optical coherence tomography (OCT), near-infrared radiation spectroscopy (NIRS), or single-photon emission tomography (SPECT) diagnostic images or any combination thereof.
The system accesses a trained in silico systems biology model of cardiovascular disease (454). The trained in silico systems biology model includes a network characterizing the cardiovascular disease. The network includes a disease-associated molecule level for each of a plurality of nodes, wherein each node represents a different molecule, e.g., protein or gene or nucleic acid. In some implementations, the network includes proteins, and disease molecule levels represent disease-associated protein levels for proteins and disease-associated gene levels for genes. The network includes protein-protein interactions in one or more cell types including endothelial cells, vascular smooth muscle cells, macrophages, and lymphocytes. These cell types, in some implementations, are cell types that include at least one molecule whose level is altered by the cardiovascular disease. In some implementations, the trained in silico systems biology model is a baseline model built using the publicly available or otherwise known data. In some implementations, the trained in silico systems biology model is an updated model, from the baseline model, using calibration data from test subjects, as described herein.
The system updates the in silico systems biology model for the patient using personalized molecule levels derived from the non-invasively obtained data, e.g., imaging data (456). The system compares the imaging data of the patient with imaging data of multiple test subjects, where the imaging data of multiple test subjects were an input to update the in silico systems biology model. Based on the comparison, the system predicts personalized molecule levels for molecules in the network.
The system obtains information relating to two or more potential therapies for the patient, or compares one potential therapy against baseline levels (458). The potential therapies can include, for example, (i) a lipid lowering drug, (ii) an antidiabetic drug, (iii) an anti-inflammatory treatment, and (iv) any combination of (i)-(iii). For example, the system receives an identifier of the potential therapies.
For example, the lipid-lowering drug can be any one or more of a statin, a proprotein convertase subtilisin kexin type 9 (PCSK9) inhibitor, or a cholesteryl ester transfer protein (CETP). The antidiabetic drug can include, for example, metformin. The anti-inflammatory treatment can include, for example, anti-IL1β, anti-TNF, anti-IL 12/23, and anti-IL17 agents. These treatments are provided as examples without loss of generality.
The system simulates a therapeutic response for each potential therapy in the trained in silico systems biology model (460) by following sub processes. The system determines a known set of molecules affected by the potential therapy (460a). The system defines a therapeutic effect molecule level for each molecule in the known set of molecules based on one or more known mechanisms of action of the potential therapy on the known set of molecules (460b). To define the therapeutic effect level, the system sets therapeutic effect molecule levels of the set of proteins to a baseline level. The baseline level, in some implementations, is determined based on observed level of molecules from subjects or patients without disease, or a baseline can be developed for subjects or patients already on some form of pharmacotherapy where the simulation would be considered additive to that baseline therapy.
The system estimates a therapeutic effect level for other molecules represented in the in silico systems biology model other than the set of known molecules, based on a simulated effect of the defined therapeutic effect levels of the set of known molecules, e.g., proteins, on one or more of the other molecules represented in the network (460c). The system defines a simulated therapeutic effect level for each molecule represented in the in silico systems biology model based on the defined and estimated therapeutic effect levels (460d). In the cases where the molecule is a protein, a therapeutic effect level is a therapeutic effect protein level. When the molecule is a gene, a therapeutic effect molecule level is a therapeutic effect gene level.
The system compares the simulated therapeutic effect levels in the in silico systems biology model before and after the therapeutic response simulation for each potential therapy (462).
The system selects one or more of the potential therapies as a preferred therapy based on the comparison (464).
The system provides a report recommending the preferred therapy for the patient (466). The report includes predicted effectiveness of potential therapies and change in therapeutic effect molecule levels before and after the therapeutic response simulation for the preferred therapy. The report, as shown in
Operations of the system includes receiving non-invasively obtained data related to a plaque from a patient (472). For example, imaging data can be received by the system. Operations also include updating a trained in silico systems biology model using personalized calibration data derived from the received data to generate an in silico patient-specific systems biology model (474). The trained in silico systems biology model comprises a set of networks, wherein each network comprises a plurality of nodes, each node representing a baseline level of a molecule, and a plurality of edges between pairs of nodes, each edge representing a molecule-molecule interaction. At least two of the nodes represent molecules whose levels are affected by the atherosclerotic cardiovascular disease. At least one of the set of networks includes a disease-associated molecule level for each of the nodes in the network. In one implementation, the at least set of networks includes of nodes corresponding, respectively, to one or more of, for example, a glycosylated low-density lipoprotein (glyLDL), an oxidized LDL (oxLDL), a minimally-modified LDL (mmLDL), or a very-low-density lipoprotein (VLDL). Operations of the system with such nodes also include perturbing the in silico patient-specific systems biology model to simulate a therapeutic effect of, for example, a lipid-lowering agent for the patient (476). Operations of the system that has such a perturbation also include providing an output indicating a level of improvement in the atherosclerotic cardiovascular disease by the exemplary lipid-lowering agent for the patient and a recommendation supporting a clinical decision as to whether the exemplary lipid-lowering agent would benefit the patient (478).
Computing device 500 includes a processor 502, memory 504, a storage device 506, a high-speed controller 508 connecting to memory 504 and high-speed expansion ports 510, and a low speed controller 512 connecting to low speed bus 514 and storage device 506. Each of the components 502, 504, 508, 508, 510, and 512, are interconnected using various busses, and can be mounted on a common motherboard or in other manners as appropriate. The processor 502 can process instructions for execution within the computing device 500, including instructions stored in the memory 504 or on the storage device 506 to display graphical information for a GUI on an external input/output device, such as display 516 coupled to high speed controller 508. In other implementations, multiple processors and/or multiple buses can be used, as appropriate, along with multiple memories and types of memory. Also, multiple computing devices 500 can be connected, with each device providing portions of the necessary operations, e.g., as a server bank, a group of blade servers, or a multi-processor system.
The memory 504 stores information within the computing device 500. In one implementation, the memory 504 is a volatile memory unit or units. In another implementation, the memory 504 is a non-volatile memory unit or units. The memory 504 can also be another form of computer-readable medium, such as a magnetic or optical disk.
The storage device 506 is capable of providing mass storage for the computing device 500. In one implementation, the storage device 506 can be or contain a computer-readable medium, such as a floppy disk device, a hard disk device, an optical disk device, or a tape device, a flash memory or other similar solid state memory device, or an array of devices, including devices in a storage area network or other configurations. A computer program product can be tangibly embodied in an information carrier. The computer program product can also contain instructions that, when executed, perform one or more methods, such as those described above. The information carrier is a computer- or machine-readable medium, such as the memory 504, the storage device 506, or memory on processor 502.
The high-speed controller 508 manages bandwidth-intensive operations for the computing device 500, while the low speed controller 512 manages lower bandwidth intensive operations. Such allocation of functions is exemplary only. In one implementation, the high-speed controller 508 is coupled to memory 504, display 516, e.g., through a graphics processor or accelerator, and to high-speed expansion ports 510, which can accept various expansion cards (not shown). In the implementation, low speed controller 512 is coupled to storage device 506 and low speed bus 514. The low-speed expansion port, which can include various communication ports, e.g., USB, Bluetooth, Ethernet, wireless Ethernet can be coupled to one or more input/output devices, such as a keyboard, a pointing device, microphone/speaker pair, a scanner, or a networking device such as a switch or router, e.g., through a network adapter.
The computing device 500 can be implemented in a number of different forms, as shown in the figure. For example, it can be implemented as a standard server 520, or multiple times in a group of such servers. It can also be implemented as part of a rack server system 524. In addition, it can be implemented in a personal computer such as a laptop computer 522. Alternatively, components from computing device 500 can be combined with other components in a mobile device (not shown), such as device 550. Each of such devices can contain one or more of computing device 500, 550, and an entire system can be made up of multiple computing devices 500, 550 communicating with each other.
The computing device 500 can be implemented in a number of different forms, as shown in the figure. For example, it can be implemented as a standard server 520, or multiple times in a group of such servers. It can also be implemented as part of a rack server system 524. In addition, it can be implemented in a personal computer such as a laptop computer 522. Alternatively, components from computing device 500 can be combined with other components in a mobile device (not shown), such as device 550. Each of such devices can contain one or more of computing device 500, 550, and an entire system can be made up of multiple computing devices 500, 550 communicating with each other.
Computing device 550 includes a processor 552, memory 564, and an input/output device such as a display 554, a communication interface 566, and a transceiver 568, among other components. The device 550 can also be provided with a storage device, such as a micro-drive or other device, to provide additional storage. Each of the components 550, 552, 564, 554, 566, and 568, are interconnected using various buses, and several of the components can be mounted on a common motherboard or in other manners as appropriate.
The processor 552 can execute instructions within the computing device 550, including instructions stored in the memory 564. The processor can be implemented as a chipset of chips that include separate and multiple analog and digital processors. Additionally, the processor can be implemented using any of a number of architectures. For example, the processor can be a CISC (Complex Instruction Set Computers) processor, a RISC (Reduced Instruction Set Computer) processor, or a MISC (Minimal Instruction Set Computer) processor. The processor can provide, for example, for coordination of the other components of the device 550, such as control of user interfaces, applications run by device 550, and wireless communication by device 550.
Processor 552 can communicate with a user through control interface 558 and display interface 556 coupled to a display 554. The display 554 can be, for example, a TFT (Thin-Film-Transistor Liquid Crystal Display) display or an OLED (Organic Light Emitting Diode) display, or other appropriate display technology. The display interface 556 can comprise appropriate circuitry for driving the display 554 to present graphical and other information to a user. The control interface 558 can receive commands from a user and convert them for submission to the processor 552. In addition, an external interface 562 can be provide in communication with processor 552, so as to enable near area communication of device 550 with other devices. External interface 562 can provide, for example, for wired communication in some implementations, or for wireless communication in other implementations, and multiple interfaces can also be used.
The memory 564 stores information within the computing device 550. The memory 564 can be implemented as one or more of a computer-readable medium or media, a volatile memory unit or units, or a non-volatile memory unit or units. Expansion memory 574 can also be provided and connected to device 550 through expansion interface 572, which can include, for example, a SIMM (Single In Line Memory Module) card interface. The expansion memory 574 can provide extra storage space for device 550, or can also store applications or other information for device 550. Specifically, expansion memory 574 can include instructions to carry out or supplement the processes described above, and can include secure information also. Thus, for example, expansion memory 574 can be provide as a security module for device 550, and can be programmed with instructions that permit secure use of device 550. In addition, secure applications can be provided via the SIMM cards, along with additional information, such as placing identifying information on the SIMM card in a non-hackable manner.
The memory can include, for example, flash memory and/or NVRAM memory, as discussed below. In one implementation, a computer program product is tangibly embodied in an information carrier. The computer program product contains instructions that, when executed, perform one or more methods, such as those described above. The information carrier is a computer- or machine-readable medium, such as the memory 564, expansion memory 574, or memory on processor 552 that can be received, for example, over transceiver 568 or external interface 562.
Device 550 can communicate wirelessly through communication interface 566, which can include digital signal processing circuitry where necessary. Communication interface 566 can provide for communications under various modes or protocols, such as GSM voice calls, SMS, EMS, or MMS messaging, CDMA, TDMA, PDC, WCDMA, CDMA2000, or GPRS, among others. Such communication can occur, for example, through (radio-frequency) transceiver 568. In addition, short-range communication can occur, such as using a Bluetooth, Wi-Fi, or other such transceiver (not shown). In addition, GPS (Global Positioning System) receiver module 570 can provide additional navigation- and location-related wireless data to device 550, which can be used as appropriate by applications running on device 550.
Device 550 can also communicate audibly using audio codec 560, which can receive spoken information from a user and convert it to usable digital information. Audio codec 560 can likewise generate audible sound for a user, such as through a speaker, e.g., in a handset of device 550. Such sound can include sound from voice telephone calls, can include recorded sound, e.g., voice messages, music files, etc. and can also include sound generated by applications operating on device 550.
The computing device 550 can be implemented in a number of different forms, as shown in the figure. For example, it can be implemented as a cellular telephone 780. It can also be implemented as part of a smartphone 782, personal digital assistant, or other similar mobile device.
Various implementations of the systems and methods described here can be realized in digital electronic circuitry, integrated circuitry, specially designed ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations of such implementations. These various implementations can include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which can be special or general purpose, coupled to receive data and instructions from, and to transmit data and instructions to, a storage system, at least one input device, and at least one output device.
These computer programs (also known as programs, software, software applications or code) include machine instructions for a programmable processor, and can be implemented in a high-level procedural and/or object-oriented programming language, and/or in assembly/machine language. As used herein, the terms “machine-readable medium” “computer-readable medium” refers to any computer program product, apparatus and/or device, e.g., magnetic discs, optical disks, memory, Programmable Logic Devices (PLDs), used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term “machine-readable signal” refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the systems and techniques described here, or any combination of such back end, middleware, or front end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”), a wide area network (“WAN”), and the Internet.
The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
V. Types of Therapies
The in silico systems biology models described herein can be used to model the effects of any therapies, e.g., medical or procedural therapies, for which a mechanism of action is known or discovered, e.g., for which a mechanism of action is described in the public record or otherwise known, and is converted into data that can be used to update the calibrated model. Then, the systems biology model can be updated with data representing a specific patient's plaque characteristics, and then specific models of potential therapies can be added to the systems biology model updated with the specific patient's information. The results of applying the drug to the patient-specific systems biology model can be compared, and the best performing therapy, or no therapy, can be recommended to the patient.
At the outset, it is important to note that the drug therapies/procedural interventions listed below are merely examples. One skilled in the art, prior to performing the methods described herein would do a review of the literature for drug and/or procedural intervention therapies and would determine the necessary parameters to model the effectiveness of that specific drug and/or procedural intervention therapy. For example, one of skill in the art would determine which molecules represented in the trained in silico systems biology model to manipulate and by how much to alter their levels based on the literature search.
A current search of the literature would show that atherosclerosis has many different endotypes. For example, an endotype of increased LDL is associated with the following genetic factors: LDLR, PCSK9, APOE, APOB-100, SORT1, ANGPTL3, CELSR2, PSRC1, HMGCR; and the following biomarkers: Total cholesterol, LDL-C, ApoB, ApoB-100, ox-LDLs, modified LDL, sdLDL, and PCSK9. An endotype characterized by an increase in Lp(a) is mainly genetically determined by the LPA gene locus and is not significantly influenced by other genetic, dietary, or environmental factors.
Biomarkers associated with an increase in Lp(a) include the following: Lp(a), apolipoprotein isoform (a), and antibodies to Lp(a). An endotype associated with arterial injury (arterial hypertension) is associated with the following genetic factors: ADAMTS7, THBS2, CFDP1, NOX4, EDNRA, PHACTR1, GUCY1A3, CNNM2, CYP17A1; and the following biomarkers: endothelin, angiotensin, adrenomedul-lin, natriuretic peptides, von Willebrand factor, cell adhesion molecules, endothelial progenitor cells, endothelial micro-particles, nitric oxide, and asymmetric dimethylarginine.
An endotype characterized by inflammation is associated with the following genetics: CXCL12, MCP-1, TLRs, SH2B3, HLA, IL-6R, IL-5, PECAM1; and the following biomarkers: TNF, IL-1b, IL-6, IL-12, IL-18, IL-23, IFN-g, IL-17, IL-22, TH17 cells, hsCRP, pentraxin-3, sCD40L, VCAM, and ICAM.
Finally, the endotype characterized by metabolic risk factors is associated with the following genetics: TCF7L2, HNF1A, CTRB1/2, MRAS, ZC3HC1, MIR17HG, and CCDC92; and the following biomarkers: blood glucose, blood insulin, C-peptide, glycated hemoglobin, glycated albumin, sRAGE, fructosamine (Vadim V. Genkel, Igor I. Shaposhnik, “Conceptualization of Heterogeneity of Chronic Diseases and Atherosclerosis as a Pathway to Precision Medicine: Endophenotype, Endotype, and Residual Cardiovascular Risk”, International Journal of Chronic Diseases, vol. 2020, Article ID 5950813, 9 pages, 2020).
Examples of Drug TherapiesIn general, any suitable drug therapy is contemplated by the present application. For instance, any compound that targets (e.g., inhibits) a specific gene, protein, or metabolite. “Inhibits” refers to the compound's ability to control, prevent, restrain, arrest, regulate a molecule's function. Exemplary compounds include, small molecules, inucleic acids (e.g, interference RNA (RNAi), short interfering RNA (siRNA); a micro, interfering RNA (miRNA); a small, temporal RNA (stRNA); or a short, hairpin RNA (shRNA); small RNA-induced gene activation (RNAa); small activating RNAs (saRNAs); messenger RNA (mRNA), inhibitory antibodies.
Hyperlipidemia Control Medications
High levels of low-density lipoprotein cholesterol (LDL) is a characteristic feature of cardiovascular diseases, such as atherosclerosis. As such, these diseases can be treated with hyperlipidemia control medications (e.g., intensive lipid lowering therapies, fibrates, niacin, fish oil, statins (like, atorvastatin), ezetimibe, bile acid sequestrants, a proprotein convertase subtilisin kexin type 9 (PCSK9) inhibitor, a cholesteryl ester transfer protein (CETP), adenosine triphosphate-citrate lyase (ACL) inhibitors, omega-3 fatty acid ethyl esters, and marine-derived omega-3 polyunsaturated fatty acids (PUFA).
For example, the effect that an intensive lipid lowering drug would have on a subject can be represented in the in silico systems biology model, thereby allowing a clinician to predict whether an intensive lipid lowering drug would be beneficial to the patient. For instance, in some embodiments the levels, e.g., of the gene level, protein level, or both levels, of LDL are physically lowered in the in silico systems biology model by 75%, 50%, 40%, 30%, 25%, 20%, 10%, or 5%, depending on what is known about how the drug affects the LDL levels. For example, if a specific drug is considered in the literature to be effective in certain patients when the LDL level in the patient has been reduced by 25%, then the model is updated to show an effective reduction of 25%. In some embodiments, the gene level, protein level, or both, of LDL products, such as, glycosylated (glyLDL), oxidized (oxLDL), and minimally-modified (mmLDL), and VLDL are also manipulated (i.e., lowered) in the in silico systems biology model by, for example, 75%, 50%, 40%, 30%, 25%, 20%, 10%, or 5%.
Lowering the levels of these molecules in the in silico systems biology model shows the changes in the levels of one or more genes, proteins, or both, as well as of other molecules that are both directly and indirectly connected to the LDL mechanistic pathway. If the in silico systems biology model shows a reduced possibility of a stroke or myocardial infarction, then an intensive lipid lowering drug would be deemed as beneficial to a patient. If the in silico systems biology model shows no change, or a worsening of one of more conditions of the patient over time, then the intensive lipid lowering drug would not be deemed as beneficial to the patient and would not be recommended.
Anti-Inflammatory Drugs
Inflammation is highly associated with atherosclerosis. As such, therapies that inhibit IL-1, IL1β, TNF, IL12/23, IL17, or other agents that affect an inflammatory cascade can be beneficial in treating subjects with atherosclerosis. Examples of therapies include colchicine, canakinumab, an inhibitor of a pro-inflammatory cytokine induced on danger signaling, a pro-resolvin (e.g., omega-3 fatty acids, like, eicosapentaenoic acid (EPA), docosahexaenoic acid (DHA), or docosapentaenoic acid (DPA)). To date, however, it has been hard to identify which patients would benefit vs. which would not, the latter suffering potentially dangerous side effects until or unless likely response can be established. As a result these drugs are not yet widely used, despite their apparent promise.
Accordingly, the present disclosure provides, in some embodiments, methods for simulating the effect that an anti-inflammatory drug would have on a subject or patient. For example, in some embodiments, the gene level, protein level, or both, of inflammatory molecules (such as, but not limited to, IL-1, IL1β, TNF, IL12/23, or IL17) are also physically manipulated (i.e., lowered) in the in silico systems biology model by, for example, 75%, 50%, 40%, 30%, 25%, 20%, 10%, or 5%, depending on what is known in the literature about how a specific drug affects inflammation. For example, if a specific drug is considered in the literature to be effective in certain patients when the IL-1, IL1β, TNF, IL12/23, or IL17 level in the patient has been reduced by 25%, then the model is updated to show an effective reduction of 25%. Lowering the levels of these molecules in the in silico systems biology model simulates the changes in gene, protein, or both, of other molecules that are both directly and indirectly connected in the inflammatory molecule pathway. In some cases, molecular levels can be raised, for example in pro-resolvin therapies or therapies which raise HDL by way of example, without loss of generality.
Lower plaque instability is a desirable treatment outcome. That is, if the in silico systems biology model after an anti-inflammatory drug effect simulation shows improvement in stability, then an anti-inflammatory drug would be deemed as beneficial to a subject. Plaque stability is quantified based on molecule levels; if the molecule levels of a subject are similar to those from test subjects with stable atherosclerosis, the patient will likely have a relatively higher plaque stability. The relative change in plaque stability of the subject before and after the anti-inflammatory drug is quantified by change in molecule levels in the in silico systems biology model.
Anti-Diabetic Drugs
Metabolic diseases and diabetes are associated with a strongly elevated risk of developing cardiovascular diseases, such as, atherosclerosis. In some subjects, a critical aspect for the development and progression of cardiovascular disease is the impaired lowering of blood glucose levels. Accordingly, in some instances, treatment with an anti-diabetic drug would be beneficial to a subject or patient suffering from a cardiovascular disease.
Accordingly, the present disclosure provide, in some embodiments, methods for simulating the effect that an anti-diabetic drug would have on a subject. For example, in some embodiments, the gene level, protein level, or both, of glucose/metabolic-related molecules (such as, but not limited to, MTOR, NFκβ1, ICAM1, or VCAM1) are also physically manipulated (i.e., lowered) in the in silico systems biology model by, for example, 75%, 50%, 40%, 30%, 25%, 20%, 10%, or 5%, depending on what is known in the literature about how a specific drug affects glucose levels and/or metabolism. For example, if a specific drug is considered in the literature to be effective in certain patients when the MTOR, NFκβ1, ICAM1, or VCAM1 level in the patient has been reduced by 25%, then the model is updated to show an effective reduction of 25%. Lowering the levels of these molecules in the in silico systems biology model shows the changes in gene, protein, or both, of other molecules that are both directly and indirectly connected to the glucose/metabolic-related molecule. If the in silico systems biology model shows that the patient would have a reduced level of diabetes, then an anti-diabetic drug would be deemed as beneficial to a subject. If the in silico systems biology model shows no change or worsening in diabetes symptoms, then an anti-diabetic drug would not be deemed as beneficial to the patient and would not be recommended.
Other Drug Classes
Other drug classes are also contemplated. For example immunomodulating agents, such as those that trigger innate immunity, that are immune tolerance stimulating agents, or that increase Treg activity.
Hypertensive agent (such as, ACE inhibitors) and anti-coagulating agent (agents that reduce thrombin production and/or limits the activity of thrombin) are also envisioned.
Triggers of innate immunity and regulation of intracellular signal transduction suggests novel targets for therapeutic treatment, including the inhibition of the pro-inflammatory cytokines induced on danger signaling. As an example, stimulating immune tolerance with increased Treg activity is being explored. As another example, clearing chylomicron remnants (large triglyceride-rich lipoproteins) is atheroprotective since chylomicron particles and the triglyceride-rich particles are directly and indirectly implicated in plaque development.
Combination Therapies
In some instances, a subject can benefit from the combination of one or more of the above-referenced therapies. Accordingly, in some embodiments, provided are methods for simulating the effect that an intensive lipid lowering and an anti-inflammatory drug would have on a subject; intensive lipid lowering and an anti-diabetic drug would have on a subject; an anti-inflammatory drug and an anti-diabetic drug would have on a subject; or an intensive lipid lowering, an anti-inflammatory drug, and an anti-diabetic drug would have on a subject.
For combination therapies, in determining a known set of molecules affected, the therapeutic response simulation engine 316 considers a first set of molecules affected by a first therapy, a second set of molecules affected by a second therapy, and a third set of molecules affected by an interaction between the first therapy and the second therapy. After defining the known set of molecules, the therapeutic response simulation engine 316 defines a therapeutic effect molecule level for each molecule in the known set of molecules, based on known mechanisms of action of a given combination therapy. Additional steps after defining the therapeutic effect molecule level are described above, referring to
In some embodiments, a pharmacotherapy is not the suitable treatment plan for a given patient and a procedural intervention is the only choice. If the simulations in the in silico systems biology model for the various possible drug candidates for a given patient do not show any predicted benefit for the patient, then a procedural intervention should be considered. In general, procedural interventions can make larger-scale changes than pharmacotherapy, for example, outright tissue removal represented by a broad decrease in protein levels, or structural anatomic changes such as the inclusion of a stent, which can block or interfere with connections in the systems biology model. In either case, there can also be localized drug addition, such as drug-eluting stents (DES), which may not address a current condition, but a known consequent action by the biology, in reaction to the procedural intervention, which can be compensatory, but have its own undesired side effects. Perturbations or changes can be made in the trained systems biology model to represent various aspects of such procedural interventions.
Procedural interventions, include, but are not limited to surgery, DES, atherectomy devices, intravascular lithotripsy (IVL), drug coated balloons, variable temperature balloons, and/or prosthetic heart valves.
Drug-Eluting Stents
Stents can be developed for specific patient groups depending on atherosclerosis characteristics and patient co-morbidities. Diabetic patients may respond better to different drugs. In addition, determining the potential rejection or allergic reaction to a specific drug, polymer, or metal can be determined in advance if vessel wall biology and patient response is understood in advance of the intervention. DES are generally made up three components: metallic stent, polymer and drug. Any one of these variables can affect the long-term patency.
For patients with stent thrombosis elevation MI, perhaps DESs with BP are preferable. This has been further supported by the recently reported BIOSTEMI trial showing superiority of ultra-thin BP sirolimus-eluting stent ORSIRO® over DP everolimus-eluting stent XIENCE® with respect to TLF at 1 year. For patients with high bleeding risk, BioFreedom™ or Resolute Onyx™ with 1-month dual antiplatelet therapy (DAPT) have the most supportive data (Comparison of Contemporary Drug-eluting Coronary Stents—Is Any Stent Better than the Others? Available at www.touchcardio.com/interventional-cardiology/journal-articles/comparison-of-contemporary-drug-eluting-coronary-stents-is-any-stent-better-than-the-others, Accessed May 7, 2021).
Patients with diabetes represent a challenging cohort. Most comparative trials of different DESs have shown no difference in effect of stent type between those with and without diabetes. In PLATINUM PLUS, there was no difference the in risk of the primary endpoint between those stented with PROMUS™ versus XIENCE™ (3.5% versus 3.5%, RR 1.00, 95% CI 0.62-1.60). However, in the sub-group with diabetes, XIENCE was favored (7.8% versus 3.0%, RR 2.50, 95% CI 1.16-5.38, interaction p=0.05). This relationship, however, was not seen in the 5-year follow-up data of the preceding PLATINUM trial with a similar design. The comparison of BP DESs versus PP DESs in patients with diabetes was recently examined by Bavishi et al., who included 5,190 patients from 11 RCTs in a meta-analysis, focusing on current-generation stents. After a mean follow-up of 2.7 years, there were no differences in a range of outcomes, including target lesion revascularization (RR 1.02, 95% CI 0.85-1.24, p=0.80) and stent thrombosis (1.66% versus 1.83%, RR 0.84, 95% CI 0.54-1.31, p=0.45) between the two stent types. There was no difference in this relationship between those patients with diabetes treated with and without insulin ((Comparison of Contemporary Drug-eluting Coronary Stents—Is Any Stent Better than the Others? Available at www.touchcardio.com/interventional-cardiology/journal-articles/comparison-of-contemporary-drug-eluting-coronary-stents-is-any-stent-better-than-the-others, Accessed May 7, 2021)).
Atherectomy Devices
Four different methods of atherectomy have been utilized for treatment of femoropopliteal or small vessel infrapopliteal disease: plaque excision (directional) atherectomy, rotational atherectomy/aspiration, laser atheroablation, and orbital atherectomy.
Atherosclerotic plaque molecular signature, morphology proportions & volume can determine the ability of stents to fully expand and stay patent within the focal area which can affect long- and short-term outcomes.
Understanding the lipid volume, matrix proportion, calcium extent, arc, thickness, volume, area and their impact on long term outcomes can help determine if a patient will respond better acutely and if long term outcomes/patency are improved when selecting different atherectomy devices for lesion preparation.
Intravascular Lithotripsy (IVL)
Atherosclerotic plaque molecular signature, morphology proportions & volume can determine the effectiveness of IVL within the focal lesion area which can affect long- and short-term outcomes. The power and pulse of the lithotripsy can potentially be determined by the plaque morphology.
Drug Coated Balloons
Target lesion revascularization rates in coronary and peripheral arterial disease can be affected by plaque morphology and/or atherosclerotic molecular signature. Different drug coated balloons can be developed for specific patient groups depending on atherosclerosis characteristics and patient co-morbidities. Patients with diabetes combined with different proportions of biological substances of the plaque can determine which drug balloon/excipient combo would be best suited for a particular patient. The type of drug (currently either paclitaxel or sirolimus), the excipient and the timing of release (dosing) can be tailored depending on plaque morphology to extend target lesion patency. Highly lipidic lesions like in-stent restenosis can affect long term patency and warrant a patient specific drug. Highly calcified lesions can require a different kind of drug coated balloon. A combination of atherectomy, plus a specific drug coated balloon can be selected based on molecular signature of the plaque.
Variable Temperature Balloons
The atherosclerotic lesion molecular signature can help determine if a patient is not a good candidate (would not respond well) for drug coated balloon or drug eluting stent and requires an alternative interventional therapy. The patient may have co-morbidities or allergies to certain drugs requiring a different therapeutic approach. This can avoid catastrophic acute reactions and long-term implications of sometimes permanent implants. The use of a “hot balloon” or a “cold balloon” may be warranted for certain lesion morphology characteristics.
Cryoplasty combines the dilatation force of angioplasty with the simultaneous delivery of cold thermal energy to the arterial wall. Both mechanisms are achieved simultaneously by filling the angioplasty catheter with nitrous oxide instead of the usual contrast saline/solution mixture. Cryotherapy has been proven to biologically alter the behavior of arterial cellular components in a benign healing process (The Next-Generation PolarCath™ System Available at evtoday.com/articles/2018-jan-supplement/the-next-generation-polarcath-system, Accessed May 10, 2021).
Several scientific studies have demonstrated that this cooling process within the vessel results in: weakening of the plaque, promoting uniform dilation and reducing vessel trauma; alteration of elastin fibers to reduce vessel wall recoil, while collagen fibers remain undisturbed and capable of maintaining architectural integrity; induction of smooth muscle apoptosis, which is associated with reduced neointimal formation and, subsequently, less restenosis ((The Next-Generation PolarCath System Available at evtoday.com/articles/2018-jan-supplement/the-next-generation-polarcath-system, Accessed May 10, 2021)).
So-called Hot Balloons are currently in development and may alter the morphology and fibrous cap thickness while reducing neointimal hyperplasia seen with standard angioplasty balloons.
Prosthetic Heart Valves
Understanding the molecular signature of heart valve disease phenotypes can help determine which drugs can arrest the disease and potentially reverse it before progressing to an irreparable state. In addition, the pathology of patient specific valve disease can determine the long-term efficacy and patient response to a particular prosthetic heart valve (TAVR: self-expanding, balloon expandable or different surgically implanted valves.)
Heart valves are complex tri-layered structures that ensure the unidirectional flow of blood. Scientists are actively investigating how characteristics of the two major cell types, valvular endothelial cells (VECs) and valvular interstitial cells (VICs), and their mechanical relationships with the valvular extracellular matrix promote structural integrity and age-related remodeling. Abnormal changes in VECs, VICs, and the extracellular matrix at the molecular level lead to gross tissue malformations and dysfunction. Improving our understanding of heart valve biology, the impact of cardiovascular drugs, and remodeling changes will be critical to the development of novel therapies for heart valve diseases (Xu, S. and K. J. Grande-Allen (2010). “The role of cell biology and leaflet remodeling in the progression of heart valve disease.” Methodist Debakey Cardiovasc J 6(1): 2-7).
The clinical and pathological features of the most frequent intrinsic structural diseases that affect the heart valves are well established, but heart valve disease mechanisms are poorly understood, and effective treatment options are evolving. Major advances in the understanding of the structure, function and biology of native valves and the pathobiology, biomaterials and biomedical engineering, and the clinical management of valvular heart disease have occurred over the past several decades (Schoen, F. J. (2018). “Morphology, Clinicopathologic Correlations, and Mechanisms in Heart Valve Health and Disease.” Cardiovasc Eng Technol 9(2): 126-140).
Procedural interventions in CAD include coronary artery bypass grafts (CABG), percutaneous coronary intervention (PCI, e.g., balloon angioplasty with or without stent placement). Also of relevance are procedures for valve replacement or repair including transcatheter aortic valve replacement (TAVR), due to the need for coronary artery assessment in the pre-procedure work-up.
Optimal Medical Therapy (OMT)
Most subjects on statins are prescribed a relatively low dose, but as there are indications of plaque requiring more intensity, various approaches exist. One approach is to increase the dose, for example, high-dose atorvastatin is often prescribed for subjects with hypercholesterolemia. There is a growing consensus that hypertriglyceridemia vs. hypercholesterolemia differs (Le, N. A. and M. F. Walter, The role of hypertriglyceridemia in atherosclerosis. Curr Atheroscler Rep, 2007. 9 (2): p. 110-5), with at least one recent drug (Vascepa®) capturing current attention. For subjects with hypertriglyceridemia, improved outcomes have been reported in trials such as the Reduction of Cardiovascular Events with EPA— Intervention Trial (REDUCE-IT) trial (Bhatt et al., REDUCE-IT USA: Results From the 3146 Patients Randomized in the United States. Circulation, 2020. 141 (5): p. 367-375; Bhatt et al., Cardiovascular Risk Reduction with kosapent Ethyl for Hypertriglyceridemia. N Engl J Med, 2019. 380 (1): p. 11-22; Bhatt et al., Reduction in First and Total Ischemic Events With kosapent Ethyl Across Baseline Triglyceride Tertiles. J Am Coll Cardiol, 2019. 74 (8): p. 1159-1161; Bhatt, D.L., Reduce-It. Eur Heart J, 2019. 40 (15): p. 1174-1175; Bhatt et al., Effects of kosapent Ethyl on Total Ischemic Events: From REDUCE-IT. J Am Coll Cardiol, 2019. 73 (22): p. 2791-2802; Boden et al., Profound reductions in first and total cardiovascular events with icosapent ethyl in the REDUCE-IT trial: why these results usher in a new era in dyslipidaemia therapeutics. Eur Heart J, 2019). Detailed quantitative studies have yet to be done to determine how IPE affects tissues in the vessel wall because it has not been previously possible to quantitatively assess changes in plaque morphology non-invasively.
Other Emerging Drug Classes
Triggers of innate immunity and regulation of intracellular signal transduction suggests novel targets for therapeutic treatment, including the inhibition of the pro-inflammatory cytokines induced on danger signaling (Zimmer et al., Danger signaling in atherosclerosis. Circ Res, 2015. 116 (2): p. 323-40). As an example, stimulating immune tolerance with increased Treg activity is being explored (Herbin et al., Regulatory T-cell response to apolipoprotein B100-derived peptides reduces the development and progression of atherosclerosis in mice. Arterioscler Thromb Vasc Biol, 2012. 32 (3): p. 605-12). As another example, clearing chylomicron remnants (large triglyceride-rich lipoproteins) (Rahmany, S. and I. Jialal, Biochemistry, Chylomicron, in StatPearls. 2020: Treasure Island (FL)) is atheroprotective since chylomicron particles and the triglyceride-rich particles are directly and indirectly implicated in plaque development (Tomkin, G. H. and D. Owens, The chylomicron: relationship to atherosclerosis. Int J Vasc Med, 2012. 2012: p. 784536).
Drug candidates in other therapeutic areas, such as immuno-modulators in cancer can have side effects where atherosclerosis is aggravated, due to activation of T-cells in the plaques that can result in plaque rupture, but there are no accurate methods to track these effects. There is a widely recognized need for effective markers during drug development for atherosclerosis and even unrelated diseases as well as companion diagnostics post-marketing.
VI. Examples of Applications
Clinical Decision Support Systems
The present disclosure can be used as a clinical decision support system. The invention supports clinical decision making by informing the clinician on what the likely effect would be for different possible therapies, and also provides tools to help discuss these options with the patient. The disclosure provides a recommendation based on the statistical significance of the likely improvement, and can compare across potential recommendations to identify the one that has been considered which exceeds others in the degree of improvement provided. This recommendation can be understood as determining a clinical action, or informing a decision that leads to a clinical action.
Such recommendations and actions that proceed from use of the presently disclosed methods and systems allow therapy to be tailored to the individual rather than be based only on population statistics. Presently, clinical guidelines have not been able to use such diagnostic specificity because there have been no means to do so. Individuals have different genetic pre-disposition, environmental exposures, and differing lifestyle habits. Both modifiable and non-modifiable risk factors influence what is best for that patient. The in silico systems biology models described herein provide a description of the disease, and a way to process and calibrate it for individual patients. This then enables the actual expected effect of therapies to be evaluated more specifically than previously possible. The benefit is that rather than referring to the population as a whole or at best sub-populations, that actual molecular level effects may be considered.
This has been widely understood in cancer treatment and is increasingly the norm. However, whereas cancer is generally informed by molecular diagnostics run on biopsied tumor tissue, it is not possible to biopsy atherosclerotic plaque tissues because it can cause a disruption, which is not desired. As a result, the computer-based systems described herein, which utilize advanced techniques, including forms of artificial intelligence, can extend what the clinicians would otherwise be able to do by themselves. The features of the tissues are generally of too complex a nature as to be easily interested by a human observer but the present invention analyses data at a far more granular level. To make such a decision support system practical is a mix of mathematical formulations, knowledge representation, and architecture in terms of user interfaces, reporting systems, and backbone of computation, all as described herein.
The utility of any diagnostic system must address what can be done with the information. Presently numerous powerful therapies exist, both procedural, pharmaceutical, or combinations such as drug eluting stents. By evaluating the individualized response to these therapies, the current systems and methods make diagnostics actionable by identifying the degree of improvement and annotates that improvement level with the statistical significance of its computation. These recommendations can be presented, for example, on screen-based user interfaces or in printable PDF forms that may be used in communicating among groups of clinicians or with patients.
Identification of Likely Responses at an Individual Patient Level
Provided herein are methods and systems for identifying a likely response, at an individual patient level, for a potential therapeutic agent. More specifically, an in silico systems biology model is generated, trained, and updated to create a calibrated model, as described herein. Then the calibrated model is updated with patient-specific information (e.g., virtual 'omics or from histological analysis obtained from actual tissue and/or blood specimens), to create the baseline condition. The in silico systems biology model representing the baseline condition is then further updated to simulate one or more potential therapies based on the mechanism of action for each therapy to arrive at various in silico systems biology model representations of various simulated conditions for each potential therapy. Based on the results, the patient is provided with a recommendation, e.g., in the form of a report, of a suitable therapy or treatment regimen. The resulting absolute pathology as well as the relative improvement in the pathology can be quantified and expressed as a likely response for each simulated therapy.
Quantification of Actual Responses at an Individual Patient Level
Also provided herein are methods and systems for quantifying an actual response, at an individual patient level, for a potential therapeutic agent. More specifically, an in silico systems biology model is generated, trained, and updated to create the calibrated model, as described herein. Then the calibrated model is updated with patient-specific information (e.g., virtual 'omics or from histological analysis obtained from actual tissue and/or blood specimens), to create the baseline condition. The in silico systems biology model representing the baseline condition is then further updated to simulate each potential therapy based on the mechanism of action for each therapy to arrive at various in silico systems biology model representations of various simulated conditions for each potential therapy. Based on the results, the patient is provided with a recommendation of a suitable therapy or treatment regimen.
After the patient has been on the recommended treatment regimen for a time sufficient to elicit a therapeutic response, the in silico systems biology model (i.e., a calibrated model that has not been updated with new patient-specific information) is updated with new patient-specific information (e.g., new virtual 'omics data), to create a model that represents a simulation of the effect of the recommend therapy (after-treatment simulation).
The baseline condition is compared to the after-treatment simulation. If there has been an actual improvement in the pathology, that result provides an indication that the patient improved under the treatment, even if the specific changes to the protein levels were not exactly as originally simulated. Further, if the specific changes to the protein levels were approximately as simulated, then one can further determine that the treatment caused the improvement and the method can be considered a surrogate end-point for treatment effect. In other words, in some embodiments, the simulations need to be only approximately correct to provide the intended utility in clinical practice.
Quantification of Actual Responses at a Cohort Level
Also provided herein are methods and systems of determining the actual responses to a specific treatment at a cohort level of patients or test subjects.
For example, an in silico systems model can be built. More specifically, an in silico systems biology model can be generated, trained, and updated to create the calibrated system, as described here. Then, for each patient or test subject in a cohort, information from each patient (e.g., virtual 'omics or from histological analysis obtained from actual tissue and/or blood specimens) is used to update the model for each patient/test subject to form the baseline condition. For each patient/test subject in the cohort and for each therapy to be simulated, the calibrated model is perturbed based on the mechanism of action for the therapy to arrive at a simulated condition.
After an interval where each patient/test subject in the cohort has received the (adjusted) recommended treatment, e.g., after a time sufficient to elicit a therapeutic response, the in silico systems biology model, i.e., a calibrated model that has not been updated with new patient-specific information, is updated with new patient-specific information (e.g., new virtual 'omics or new histological analysis obtained from actual tissue and/or blood specimens), to create a model that represents an after-treatment simulation. If there has been an actual improvement in the pathology across the cohort of patients, one can conclude that the patients improved under the treatment, even if the specific changes to the protein levels were not exactly as simulated. Further, if the specific changes to the protein levels were approximately as simulated, then it can further be said that the treatment caused the improvement and the method may be considered to be a surrogate end point for treatment effect. This can be performed in the context of an observational study, a randomized clinical trial, or other study designs.
Detecting Contraindications at an Individual Patient Level
Also provided herein are methods and systems wherein after the simulated conditions are generated for each potential therapy, contra-indications at the individual patient level are detected.
For example, provided herein are methods for identifying a likely response, at an individual patient level, for a potential therapeutic agent. More specifically, an in silico systems biology model is generated, trained, and updated to create the calibrated system, as described above. Then the calibrated model is updated with patient-specific information (e.g., virtual 'omics or from histological analysis obtained from actual tissue and/or blood specimens), to create the baseline condition. The in silico systems biology model representing the baseline condition, also as described above, is then further updated to simulate each potential therapy based on the mechanism of action for each treatment to arrive at various in silico systems biology model representing various simulated condition for each potential treatment. Deleterious side effects in the simulated condition are determined by looking at how molecules are perturbed in the model. That is, even if there is an apparent improvement in the condition with respect to the pathology, there may be inadvertent other effects that are worse for the patient than the intended improvement.
Once determined, those other effects can also be provided to the patient, e.g., in a report.
Identification of Likely Adverse Reactions, Current Actual Toxicity, or Likely Future Negative Reactions, at an Individual Patient Level
Also provided herein are methods and systems, wherein after the simulated conditions are generated for each potential treatment, likely adverse reactions, current actual toxicity, or likely future negative reactions, at an individual patient level are identified.
For example, provided herein are methods and systems for identifying a likely response, at an individual patient level, for a potential therapeutic agent. More specifically, an in silico systems biology model is generated, trained, and updated to create the calibrated model, as described herein. Then the calibrated model is updated with patient-specific information (e.g., virtual 'omics or from histological analysis obtained from actual tissue and/or blood specimens), to create the baseline condition. The in silico systems biology model representing the baseline condition, also as described herein, is then further updated to simulate each potential therapy based on the mechanism of action for each therapy to arrive at various in silico systems biology model representations of various simulated conditions for each potential therapy.
Deleterious side effects are determined (adverse reaction) in the simulated condition, that is, even if there is an apparent improvement in the condition with respect to the pathology, there may be inadvertent other effects that are worse for the patient than the intended improvement. One can use this information to modify the therapy recommendations, that is, for example, one may downgrade a recommendation for treatments that improve the pathology, but also have one or more adverse reactions.
After an interval where the patient(s) has received the (adjusted) recommended treatment, e.g., after a time sufficient to elicit a therapeutic response, the in silico systems biology model (i.e., one that has not been updated with new patient-specific information) is updated with new patient-specific information, e.g., information either obtained through collection of tissue and/or blood specimens from the patient using transcriptomics and/or proteomics and/or metabolomics, or from non-invasive prediction (virtual 'omics)), to create a model that represents an after-treatment simulation.
If there has been an actual improvement in the pathology, one can conclude that the patient or patients improved under the therapy, even if the specific changes to the protein levels were not exactly as simulated. Further, if the specific changes to the protein levels were approximately as simulated, then one can further determine that the therapy caused the improvement and the method can be considered to be a surrogate end point for a treatment effect.
If there has been an adverse effect, one can determine that the patient failed to improve under the treatment, even if the specific changes to the protein levels were not exactly as simulated.
In some instances, the in silico model can be rebuilt (i.e., step one) with additional information regarding adverse events. All subsequent steps can then be repeated to determine additional improvement, adverse effects, or both for modifying treatments or for conducting dynamic, combination, multi-stage, or adaptive clinical trial designs or individual patient management.
Screening Tools for Clinical Trial Enrichment to “Select In” Cases that Increase the Statistical Power of a Clinical Trial
Also provided herein are methods and systems for creating and using screening tools for clinical trials to determine “select in” cases. More specifically, an in silico systems biology model is generated, trained, and updated to create the calibrated system, as described herein. Then the calibrated model is updated with patient-specific information (e.g., virtual 'omics or from histological analysis obtained from actual tissue and/or blood specimens), to create the baseline condition. The in silico systems biology model representing the baseline condition, also as described herein, is then further updated to simulate each potential treatment based on the mechanism of action for each treatment to arrive at various in silico systems biology model representing various simulated condition for each potential treatment. The resulting pathology as well as the relative improvement in the pathology is quantified and expressed as a likely response for each simulated treatment.
If the likely improvement of the patient is above an inclusion criteria threshold, one would select the patient for the clinical trial. Otherwise, one would not select the patient for the clinical trial, if there are no other exclusion or inclusion criteria issues.
Screening Tools for Clinical Trial Enrichment to “Select Out” Cases that Decrease the Statistical Power of a Clinical Trial
Also provided herein are methods and systems for creating and using screening tools for clinical trials to determine “select out” cases. More specifically, an in silico systems biology model is generated, trained, and calibrated to create the calibrated system, as described above. Then the calibrated model is updated with patient-specific information (e.g., virtual 'omics or from histological analysis obtained from actual tissue and/or blood specimens), to create the baseline condition. The in silico systems biology model representing the baseline condition, also as described herein, is then further updated to simulate each potential treatment based on the mechanism of action for each treatment to arrive at various in silico systems biology model representing various simulated condition for each potential treatment. Any deleterious side effects (adverse reaction) in the simulated condition are flagged, that is, even if there is an apparent improvement in the condition with respect to the pathology, there may be an inadvertent other effect that is worse for the patient than the intended improvement.
If the adverse reaction of the patient is above an exclusion criteria threshold, then one would not select the patient for the clinical trial. Otherwise, one would select the patient for the clinical trial, if there are no other exclusion or inclusion criteria issues.
EXAMPLESThe invention is further described in the following examples, which do not limit the scope of the invention described in the claims.
Example 1: Creation of an in Silico Systems Biology Model MethodsCohort Assembly and Proteomic Processing
A total of 22 male patients on statin therapy undergoing stroke-preventive carotid endarterectomy (CEA) for high-grade (>50% NASCET (Golriz Khatami, S. et al. Using predictive machine learning models for drug response simulation by calibrating patient-specific pathway signatures. npj Systems Biology and Applications 7, 1-9 (2021))) stenosis were prospectively enrolled to represent the differences in protein levels between unstable and stable atherosclerosis, yielding 18 patients with data from CTA, histology, and plaque proteomics for complete characterization (comprising three spatial scales) (see
Study cohort demographics are summarized in
Table 3, below. Briefly, CEAs were collected at surgery and retained within a biobank, with details of sample collection and processing previously described.11,12 All samples were collected with informed consent from patients and the study was approved by the Ethical Review Board. Continuous variables are presented as medians (inter-quartile range). No variable was found to be significantly different between stable and unstable phenotypes.
Demographic variables were summarized to characterize the cohort and identify significantly different values across plaque subgroups. Categoric variables with less than 25% missing data were tabulated with fractions and significance analysed with Fisher Exact test. Continuous variables were tabulated as medians with inter-quartile range and significance analysed by Wilcoxon non-parametric test (using a confidence level of p=0.05).
Excised plaques were divided transversally at the most stenotic part; the proximal half used for protein analysis and the distal half fixed in 4% formaldehyde and prepared for histology. Histological analysis was performed on Masson-Tri-Chrome stained sections to assess presence of instability features such as lipid-rich necrotic core (LRNC), intra-plaque haemorrhage (IPH), fibrous cap thickness and integrity, and other factors according to the Virmani classification (Barrett, T. J. Macrophages in Atherosclerosis Regression. Arteriosclerosis, thrombosis, and vascular biology 40, 20-33, doi:10.1161/ATVBAHA.119.312802 (2020)) categorizing symptomatic and asymptomatic patients based on plaque stability (minimal, stable, or unstable) and resulting in 18 patients appropriately matched with respect to symptomatology and plaque morphology features. We further characterized the patients utilizing analyses from CTA by ElucidVivo (Boston, MA USA) for plaque morphology comprising structural anatomy and tissue characteristics as well as non-invasive plaque stability classification (see, e.g.,
LC-MS/MS Analysis and Protein Identification
Using methods previously described (Evrard, S. M. et al. Corrigendum: Endothelial to mesenchymal transition is common in atherosclerotic lesions and is associated with plaque instability. Nat Commun 8, 14710, doi:10.1038/ncomms14710 (2017)) plaques from selected patients were processed for proteomic analysis. Briefly, 4 mm thick sections were retrieved from the proximal half of the lesion, one from the peripheral end and one from the central core. Proteomic processing was performed using high-resolution isoelectric focusing (HiRIEF (Newby, A. C. et al. Vulnerable atherosclerotic plaque metalloproteinases and foam cell phenotypes. Thrombosis and haemostasis 101, 1006-1011 (2009))) with median normalization of ratios on the peptide spectrum match (PSM) level. FTMS master scans were followed by data-dependent MS/MS. Spectra were searched using MSGF+ (v10072) (Bittner et al., P6164 High level of EPA is associated with lower perivascular coronary attenuation as measured by coronary CTA. European heart journal 40, ehz746. 0770 (2019)) and Percolator (v2.08) (Antonopoulos, A. S. et al. Detecting human coronary inflammation by imaging perivascular fat. Science translational medicine 9, doi:10.1126/scitranslmed.aa12658 (2017)), where search results were grouped for Percolator target/decoy analysis. PSMs found at 1% PSM- and peptide-level FDR (false discovery rate) were used to infer gene identities, and median normalization of ratios on the PSM level was performed. Protein level FDRs were calculated using the picked-FDR method (Rajsheker, S. et al. Crosstalk between perivascular adipose tissue and blood vessels. Curr Opin Pharmacol 10, 191-196, doi:10.1016/j.coph.2009.11.005 (2010)).
Cell Network Pathway Selection
A systems biology model was created from a combination of proteomic pathways based on the differences in plaque stability, which represented late-stage disease, augmented with literature-based and data base retrieval, e.g., from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database, to ensure coverage of earlier stages of atherogenesis. Keywords were used to search the KEGG database (see, e.g., Table 4 below).
KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism, and the ecosystem, from genomic and molecular-level information. It is a computer representation of the biological system, consisting of molecular building blocks of genes and proteins (genomic information) and chemical substances (chemical information) that are integrated with the knowledge on molecular wiring diagrams of interaction, reaction, and relation networks (systems information). It also contains disease and drug information (health information) as perturbations to the biological system. In KEGG, reference pathway maps of molecular interaction/reaction network diagrams are represented in terms of the KEGG Orthology (KO) groups, so that experimental evidence in specific organisms can be generalized to other organisms through genomic information. In other words, maps (such as the ones referred to in Tables 5 and 6 below) are reference maps and are noted with a “mapxxxxx” identification number. These maps can then be generalized to Homo sapiens (i.e., humans) and are noted with a “hsaxxxxx” identification number. For example, map05417 refers to the reference pathway for Lipid and Atherosclerosis, and HSA05417 refers to the Lipid and Atherosclerosis pathway in Homo sapiens.
Selected pathways were assigned according to their applicability to four primary cell types: endothelial cells (ECs), vascular smooth muscle cells (VSMCs), macrophages, and lymphocytes (Table 5). In Table 5, a “1” is placed to signify that the pathway has a more than trivial participation in the given cell type. In general, pathways were deemed either fully included or fully excluded relative to a cell type (Table 6). In Table 6, the table comprises pathways which are generally common to mammalian cells of many types, including those identified. Some pathways contained cell-type specific portions. In such cases, pathways were split prior to inclusion.
Table 7 below lists pathways important for lipid lowering. A high number listed in the “lipid significance” column means that the pathway is highly significant, a low number means low significance.
Table 8 below lists pathways important for anti-inflammation. A high number listed in the “inflammation significance” column means that the pathway is highly significant, a low number means low significance.
Table 9 below lists pathways important for anti-diabetes. A high number listed in the “diabetes significance” column means that the pathway is highly significant, a low number means low significance.
To exemplify, the KEGG pathway HSA05417 contains unique pathways for three of the cell types modelled in this work (ECs, VSMCs, and macrophages), plus the plasma compartment. In other words, pathway HSA5417 is one of the ones that is broken into cell-type specific pieces. In particular, the relations summarizing the products from low-density lipoprotein (LDL) into oxidized LDL (oxLDL), glycated LDL (glyLDL), and minimally modified LDL (mmLDL) were identified in terms of relations with proteins in the tissue (see, Kanehisa, M.; “Post-genome Informatics”, Oxford University Press (2000); Otsuka et al., Pathology of coronary atherosclerosis and thrombosis. Cardiovasc Diagn Ther 6, 396-408, doi:10.21037/cdt.2016.06.01 (2016)).
HSA04514 (“Cell Adhesion Molecules”) similarly contains pathway information for three modelled cell types (ECs, lymphocytes, and macrophages) with content split accordingly. HSA04514 is another pathway that is broken into cell-type specific pieces.
HSA04640, “Hematopoietic Cell Lineage” was split to remove content irrelevant for the cell types modelled in our work.
HSA04670, “Leukocyte Transendothelial Migration” splits the EC portion from the leukocyte portion, where two of the cell types modelled in this study were leukocytes (macrophages and lymphocytes).
HSA04931, “Insulin Resistance,” is included both in VSMCs, needed for our study, and also in liver, not used in our study.
Likewise, some pathways included content relating to the plasma-tissue boundary as noted.
The resulting set of pathways was integrated into cell networks at three scopes: “core,” “mid,” “full,” utilizing a program to split .kgml files by cell type. “Core” networks included pathways unique to each respective cell type. “Mid” included pathways shared by one other cell type. “Full” included pathways shared by these and other human cell types, being in general associated with mammalian cell function. The selected pathways at each scope for each cell type were merged into a cytoscape representation using BioNSi (Biological Network Simulator)(Shalhoub, J. et al. Systems biology of human atherosclerosis. Vascular and endovascular surgery 48, 5-17 (2014); Fava, C. & Montagnana, M., Atherosclerosis is an inflammatory disease, which lacks a common anti-inflammatory therapy: how human genetics can help to this issue. A narrative review. Frontiers in pharmacology 9, 55 (2018)), however, overriding the edge weights to allowed a richer set of relations than otherwise supported by BioNSi. The generated node lists were then compared against available plaque protein measurements from our cohort. Proteins where no direct experimental measurement was available and having no incoming edges were pruned.
BioNSi is a tool for modeling biological networks and simulating their discrete-time dynamics, implemented as a Cytoscape app. BioNSi includes a visual representation of the network that enables researchers to construct, set the parameters, and observe network behavior under various conditions. In particular, specifics on the use of BioNSi names to signify LDL products in the methods described herein include (this is not the way BioNSi is normally intended, but is used here as a means to represent a more granular biochemistry as needed to support the simulations described herein):
-
- 1. glyLDL is reflected as glycosylation of LDL, as a correct representation
- 2. oxLDL is reflected as binding/association, not because it is the correct name, but rather that the weight is 1, indicating that a fraction gets converted, and that oxLDL is the smallest fraction
- 3. mmLDL is reflected as state change, not because it is the correct name, but rather that the weight is 3, justified as Levitan 2010 indicating that a fraction gets converted, and that mmLDL is a higher fraction than ox
- 4. VLDL is reflected as indirect effect, not because it is the correct name, but rather that the weight is 2, justified as VLDL being estimated as TG/5, with 93 patients having both TG and LDL, had an average level of 15% of LDL (as a best-effort approximation).
Table 10, below shows detailed BioNSi edge mappings on import.
BioNSi import also adds self-inhibition loops (−9) but they can be deleted when used without transcriptomic data, or can represent the transcription/translation process when both proteomics and transcriptomics data are used.
In addition, networks for the integrated intima were created by compartmentalizing proteins into the intracellular of each cell type, the cell membranes, the extracellular space, with a separate compartment for the blood (See,
Specifically,
Given the network definitions thus created, the proteomic data was used to update networks using calibration data from each patient. Approximately 50% of the proteins in the networks were actually measured within the proteomic dataset. Since the pathways cover all selected protein-protein interactions in the pathway, estimation of protein levels for those lacking measurement in the dataset required interpolation. A total of 540 personalized networks were calibrated: 2 for each of the 18 patients at each cell type, integrated intima, and at each of the three scopes, respectively, comprising the entire database of protein level vectors referred to as “exemplars.” The database of exemplars demonstrated large variation in proteomic signatures after individual test patient calibration, corresponding to an estimated range of 39-96% plaque instability in the baseline condition.
Pseudo-code for the algorithm is outlined as follows:
-
- Set true plaque phenotype from histology (minimal, stable, unstable)
- Load protein levels
- Interpolate missing protein levels by iterating until achieving a high similarity (cosine similarity metric as a measure of convergence):
- For each node that is not fixed:
- For each edge, record suggestion (negating weights from outgoing edges):
- If the weight is negative (e.g., inhibition) if the source is less than the mean, unweighted suggestion is formed to pull down the target by a modest amount, or if the source is above the mean, to pull the target correspondingly down
- Else (e.g., activation), if the source is less than the mean, unweighted suggestion is formed to raise the target modestly, or if greater than the mean, to raise it correspondingly more
- Create a weighted mean (handling missing values)
- For each edge, record suggestion (negating weights from outgoing edges):
- Record the result and iterate for overall convergence (handling lack of convergence)
- For each node that is not fixed:
- Save protein levels
An example of a visualization of individual patient calibration molecules is shown in
Clustering analysis performed on the calibrated networks identified proteins with high variance for each cell type and scope. By way of example, the proteins with the highest variance between unstable plaque, stable plaque, and minimal disease at the core scope for ECs were interstitial collagenase (MMP1), lipopolysaccharide-binding protein (LBP), advanced glycosylation end product-specific receptor (RAGE), and integrin alpha-IIb (ITGA2B). At the mid scope, proteins such as TLR4 and HMOX1 also demonstrated large differences. For the mid scope networks, VSMCs showed strong separation in proteins such as tumour protein (p53), mothers against decapentaplegic homolog 2 (SMAD2), and coagulation factor VIII (F8), macrophages in proteins such as lipocalin 2 (LCN2), S100 calcium binding protein (S100A8/9), and cyclin dependent kinase inhibitor 1A (CDKN1A). In lymphocytes, matrix metalloproteinases (MMP1/9), insulin like growth factor binding protein acid labile subunit (IGFALS), and solute carrier family 2 (SLC2A1) were separated, whereas the integrated intima showed strong separation in proteins such as SMAD2 and S100A9 across cell types and interleukin 23 Receptor (IL23R) in lymphocytes.
Specifically,
Based on the identified proteins from the clustering results, plaque instability in this cohort was found to be mainly driven by networks coupled to endothelial dysfunction, modulated immune system responses, and inflammation at a range of degrees. Consequently, we simulated treatments with intensive lipid lowering (Sawada et al., From unbiased transcriptomics to understanding the molecular basis of atherosclerosis. Current Opinion in Lipidology 32, 328-329, doi:10.1097/mol.0000000000000773 (2021)) an IL1β antagonist as an example anti-inflammatory drugs (Alimohammadi et al., Development of a Patient-Specific Multi-Scale Model to Understand Atherosclerosis and Calcification Locations: Comparison with In vivo Data in an Aortic Dissection. Front Physiol 7, 238, doi:10.3389/fphys.2016.00238 (2016)), and an anti-diabetic agent, with hypothesized effects in treatment of atherosclerosis (Corti, A. et al. Multiscale Computational Modeling of Vascular Adaptation: A Systems Biology Approach Using Agent-Based Models. Front Bioeng Biotechnol 9, 744560, doi:10.3389/fbioe.2021.744560 (2021); Casarin et al., A Computational Model-Based Framework to Plan Clinical Experiments—an Application to Vascular Adaptation Biology. Comput Sci ICCS 10860, 352-362, doi:10.1007/978-3-319-93698-7_27 (2018)).
The intensive lipid lowering treatment was modelled by decreasing the patient's LDL level by 25% constrained by a minimal value to represent clinically reported effects of such therapies (Morgan et al., Mathematically modelling the dynamics of cholesterol metabolism and ageing. Biosystems 145, 19-32, doi:10.1016/j.biosystems. 2016.05.001 (2016)). For plasma lipids, we modelled LDL products (Otsuka et al., Pathology of coronary atherosclerosis and thrombosis. Cardiovasc Diagn Ther 6, 396-408, doi:10.21037/cdt.2016.06.01 (2016)) including glycosylated (glyLDL), oxidised (oxLDL), and minimally-modified (mmLDL), and VLDL. Specifics on LDL products are outlined above.
The anti-inflammatory treatment was modeled by holding IL10 level to a minimum level observed across proteins in the dataset. The anti-diabetic treatment was modelled by holding MTOR, NFKβ1, ICAM1, and VCAM1 (based on documented effects of Metformin) to the minimum level observed across proteins in the dataset (Ally et al., Role of neuronal nitric oxide synthase on cardiovascular functions in physiological and pathophysiological states. Nitric Oxide 102, 52-73 (2020); Parton et al., New models of atherosclerosis and multi-drug therapeutic interventions. Bioinformatics 35, 2449-2457, doi:10.1093/bioinformatics/ bty980 (2018)). “Minimum level” refers to the lowest number in the test subject data across molecules, determined as a function of the process.
The results from this specific example showed that simulation with intensive lipid lowering therapy was generally the most effective at decreasing plaque instability, with marginal improvement in simulated combination therapy. Anti-inflammatory and anti-diabetic therapies provided mixed results from patient to patient, manifesting as overall inferior performance compared with intensive lipid lowering. The combination therapy that included intensive lipid lowering and an anti-diabetic drug was in general the best for patients starting out from highly unstable proteomic signatures. This example illustrates that the invention can be an effective strategy for selected patients. Moreover, the fact that some initially unstable patients did not show appreciable response to the simulated pharmacotherapies suggested an ability of the modelling approach to identify individuals best treated surgically rather than medically. Patients with initially stable signatures showed less improvement by the simulated therapies, indicating sufficient preventive efficacy on standard medical treatment alone. In addition, some patients starting with unstable signatures did not benefit from simulated pharmacotherapy and should likely receive preventive surgery, suggesting a potential of the modelling approach to identify high-risk individuals and improve decision making between procedural intervention and pharmacotherapy. The individualized patient treatment recommendations differed widely across patients, highlighting the importance of individual predictions and more refined patient stratifications, as enabled by the defined systems biology model of our study. Given the dominating inflammatory proteomic signature of unstable plaques, the subtle effects observed by simulation with anti-inflammatory therapy are worth considering. This finding may be due to the fact that only a single dose of treatment was simulated whereas effective inhibition of inflammatory pathways would possibly require not only sustained presence of the antagonist but also reduction in the driving cause. In addition, the chosen treatment targeted IL10, as this strategy has been shown to be effective at the group level and even more effectively in subgroups with enhanced systemic inflammation, which were not included in our cohort and may thus not be well represented in the resulting model. In different cohorts or settings, response to anti-inflammatory treatment may exceed intensive lipid lowering on patients with CVD as a comorbidity rather than as a primary indication. Nevertheless, for clinical applicability, the model should ideally capture such phenotypes. Inclusion of patients from these subgroups would improve efficacy, and if necessary, the model could be revised using indicators of enhanced systemic inflammation such as CRP. In any case, the demonstrated superior effect of combination therapy over intensive lipid lowering alone, suggests an ability of the modelling approach of the study to adequately simulate effects of drugs targeting different pathways in disease pathophysiology.
Prediction of Subject-Specific Drug ResponseDrug response was then simulated in silico. In our study, the first category of simulated treatment was intensive lipid lowering, anti-inflammatory drugs (i.e., canakinumab), anti-diabetics (i.e., metformin), and a combination of intensive lipid lowering and anti-diabetic.
Two control simulations for each subject were also computed as a check on the mathematical formalism to prevent inadvertent design or coding defects. The first control simulation represented no change in treatment, where the expected result was to be the same as the baseline case but derived as if it was a treatment and running through the same simulation; if the output was found to differ from the baseline case, a logic or mathematical error would be detected. The second control simulation was named “multiple insult,” which simulated a condition of a “perfect storm” of atherosclerosis risk factors causing know disease drivers. In this control, the expected result was to see degraded stability, roughly in proportion to the original stability, that is, the farther the subject started form these adverse conditions the worse their relative impact should be. If this did not result, a logic and/or mathematical error would be detected.
Multi-Level Analysis of Simulated Treatment EffectThe simulated treated and baseline conditions were evaluated using multi-level analysis. Mean absolute cohort-level instability demonstrated coherent estimation across cell types and scopes. The variation across individuals is shown in
Further, the distribution of absolute baseline instability demonstrated a wide range across the experimental cohort (
Additionally, results demonstrated the mean relative treatment effects (positive indicate decreased instability our improved by treatment), across cohort, cell types, and network scopes (
In
The multi-level analysis across cell types and scopes also demonstrated coherent estimation of mean absolute cohort-level response to treatment in the mathematical controls.
Treatment effects ranged from an improvement of 20% to no improvement. Not only did improvement vary from patient to patient, but the range of improvement observed differed based on how instability was estimated. Whereas improvement in clinically symptomatic patients ranged from −8% to +20% and from −22% to +13% for asymptomatic patients; these ranges tighten to −2% to +20% for patients with relatively unstable protein levels vs. −22% to +7% for patients with more stable protein levels. There are two important points raised by this; first, the ability to distinguish for a given patient rather than a group is motivated, and second, this is critically important as an improvement vs. standard clinical practice of using symptomatology to guide treatment.
Intensive lipid lowering had the strongest effect particularly in patients starting out with unstable plaque signatures and morphology. The simulated treatment predictions showed distinct variation between subjects. For example, patients P491 and P773 were initially characterized by highly unstable proteomic signatures, and where the best effect of treatment simulation would be expected. Indeed, whereas simulation with intensive lipid lowering exceeded the other monotherapies, both anti-inflammatory and anti-diabetic therapies conferred improvement, as well as simulation with the combination therapy that exceeded the benefit of intensive lipid lowering (Table 11,
Table 11, below, shows the absolute and relative improvement for baseline and treated cases. Bold patient IDs were annotated using histology and clinical symptomatology as unstable. Key: Bas=baseline; ILL=Intensive Lipid Lowering; −IL1B=anti-IL1B (anti-inflammatory); Met=Metformin (anti-diabetic); Comb=Combination; Imp=improvement. p values: ****<0.0001, ***<0.001, **<0.01. Each patient is represented as a row, with quantitative assessment of absolute instability for the baseline conditions and each simulated condition followed by the quantitative relative improvement. The relative improvement cells are based on the significance of improvement, as judged by a net decrease of instability; +7% and above signifying statistically significant improvement relative to baseline, −5% to +6% signifying the absence of any statistically significant effect, and −7% and below signifying statistically significant degradation relative to baseline state. Rows are sorted by baseline instability. Patients P834, P821, P298, P187, and P491 (all categorized as unstable from histological reference) are seen to benefit the most from treatment, all starting from a position of very unstable and ending up at stable post treatment. Patients P853, P450, and P737 represent highly unstable plaques showing a lack of treatment effects, indicating that these cases could be viewed as the most to benefit from surgical intervention, given their highly unstable phenotypes and lack of improvement by pharmacotherapy. Patients P472, P265, and P682 represent patients for which neither pharmacotherapy will help nor can be needed, given the stability of the plaque.
Radar charts, shown in
Radar charts, shown in
We next observed a relatively well-defined threshold of absolute instability level of approximately 76%, where subjects with greater instability in the baseline state showed benefit from intensive lipid lowering and improved further with combination therapy but did not benefit when anti-inflammatory or anti-diabetic agents alone were simulated.
One set of patients that were initially characterized as highly unstable, did not show any response to the simulated pharmacotherapies. Importantly, this finding suggested an ability of the modelling approach of the study to identify individuals with a high risk, rather suitable for surgical treatment than pharmacotherapy. Further, we found that patients with initially more stable plaque signatures either did not improve, regardless of treatment category simulated. In this proof-of-concept setting, combinatory treatment or intensive lipid lowering alone had a generally beneficial effect on stability, with combination therapy providing incremental benefit for many patients.
Full results including summary of mean effects, confidence intervals, and assessment of contribution variance are provided in Table 12 and Table 13, shown below.
Personalized treatment recommendations were then composed based on the in-silico results for each patient using an automated decision algorithm where simulations of different pharmacotherapies were incorporated. The recommendations combined the level of instability achieved on the selected drug choices and the controls, with a text statement automatically generated to reflect the best therapy for that patient (
Patient “John Doe” is an example of a patient with a highly unstable initial condition that can be improved with high confidence by pharmacotherapy (
Patient “Bill Smith” represents a patient starting from a more stable initial condition where pharmacotherapy would not be recommended (
Patient “David Jones” represents a patient that would receive only marginal improvement from pharmacotherapy, but based on the highly unstable starting point, one would choose a procedural intervention as the best course (
Heatmaps were also included for specific protein level signatures, including the protein expression for the baseline condition, and adding the treated condition in cases where statistically significant treatment improvement was found. The degree to which this would be clinically significant is determined from the difference in clinical presentation; treatments demonstrate a strength commensurate with the difference between being asymptomatic vs. symptomatic. The results for P491 and P265 illustrate the range over which the simulation capability can be applied (
It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. The following are numbered embodiments intended to further illustrate, but not limit, the scope of the invention.
Other aspects, advantages, and modifications are within the scope of the following claims.
Claims
1. A computer-implemented method comprising:
- receiving non-invasively obtained data related to a plaque from a patient;
- inputting the non-invasively obtained data from the patient to an estimation model;
- generating, as an output of the estimation model, virtual 'omics data that include estimated pathway activations or molecule levels, or both, of the patient by applying the estimation model to the non-invasively obtained data from the patient;
- updating an in silico systems biology model using the generated virtual 'omics data to generate an in silico patient-specific systems biology model, wherein (i) the in silico systems biology model comprises a set of networks, wherein each network comprises a plurality of nodes, each node representing a baseline level of a molecule or pathway, and a plurality of edges between pairs of nodes, each edge representing a node-node interaction, (ii) at least two of the nodes of the plurality of nodes in each network represent molecules or pathways whose levels are affected by an atherosclerotic cardiovascular disease, and (iii) at least one network of the set of networks includes a disease-associated pathway activation or molecule level, or both, for each of the nodes in the network;
- perturbing the in silico patient-specific systems biology model to simulate, for the patient, a therapeutic effect of one or more pharmaceutical agents singly or in combination; and
- providing an output indicating a level of improvement in the atherosclerotic cardiovascular disease for the patient and a recommendation regarding whether the magnitude of the improvement relative to the stability of the patient's atherosclerosis warrants consideration of procedural intervention with or without medical therapy.
2. The computer-implemented method of claim 1, wherein the recommendation informs a decision that leads to a clinical action.
3. The computer-implemented method of claim 1, wherein the recommendation enables a healthcare provider to tailor a therapy for the patient.
4. The computer-implemented method of claim 1, wherein the at least one network of the set of networks includes nodes corresponding, respectively, to one or more of a glycosylated low-density lipoprotein (glyLDL), an oxidized LDL (oxLDL), a minimally-modified LDL (mmLDL), a very-low-density lipoprotein (VLDL), a byproduct of a triglyceride, a byproduct of a high-density lipoprotein, IL-1, IL1β, IL-6, TNF, IL12/23, IL17, IL18, interleukin, another cytokine molecule, MTOR, SGLT2, GLP1, NFκβ1, ICAM1, or VCAM1.
5. The computer-implemented method of claim 1, wherein the non-invasively obtained data comprises imaging data.
6. The computer-implemented method of claim 5, wherein the non-invasively obtained imaging data is obtained by computed tomography (CT), dual energy computed tomography (DECT), spectral computed tomography (spectral CT), computed tomography angiography (CTA), cardiac computed tomography angiography (CCTA), magnetic resonance imaging (MM), multi-contrast magnetic resonance imaging (multi-contrast MRI), ultrasound (US), positron emission tomography (PET), intra-vascular ultrasound (IVUS), optical coherence tomography (OCT), near-infrared radiation spectroscopy (NIRS), or single-photon emission tomography (SPECT) diagnostic images or any combination thereof.
7. The computer-implemented method of claim 1, wherein the estimated pathway activations or molecule levels, or both, comprise an alteration in a level of a gene, a transcript, a protein, or a metabolite or dysregulation of a pathway.
8. The computer-implemented method of claim 7, wherein the at least one network of the set of networks further comprises edges representing protein-protein interactions, gene-gene interactions, gene-transcript interactions, transcript-protein interactions, protein-metabolite interactions, and/or protein-gene interactions.
9. The computer-implemented method of claim 8, wherein interactions represent any one of translation, activation, inhibition, indirect effect, state change, binding, dissociation, phosphorylation, dephosphorylation, glycosylation, ubiquitination, and methylation as a result of an interaction between two molecules.
10. The computer-implemented method of claim 1, wherein the dyslipidemia management agent is an agent controlling hypertriglyceridemia or hypercholesterolemia, an agent raising atheroprotective lipoproteins, or other dyslipidemia control medication.
11. The computer-implemented method of claim 1, wherein the dyslipidemia management agent is a statin.
12. The computer-implemented method of claim 11, wherein the statin is atorvastatin.
13. The computer-implemented method of claim 1, wherein the dyslipidemia management agent is an intensive lipid-lowering agent.
14. The computer-implemented method of claim 1, wherein the dyslipidemia management agent is a proprotein convertase subtilisin kexin type 9 (PCSK9) inhibitor, a Lp(a)-lowering therapy, or a cholesteryl ester transfer protein (CETP) inhibitor.
15. The computer-implemented method of claim 1, wherein the dyslipidemia management agent is a hypertriglyceridemia lowering agent or a hypercholesterolemia lowering agent.
16. The computer-implemented method of claim 15, wherein the hypercholesterolemia lowering agent is a statin, ezetimibe, bile acid sequestrant, adenosine triphosphate-citrate lyase (ACL) inhibitor, fibrate, niacin, omega-3 fatty acid ethyl ester, or omega-3 polyunsaturated fatty acids (PUFA).
17. The computer-implemented method of claim 1, wherein the agent that affects the inflammatory cascade is an anti-inflammatory or pro-resolving drug.
18. The computer-implemented method of claim 17, wherein the anti-inflammatory or pro-resolving drug is an inhibitor of IL-1.
19. The computer-implemented method of claim 18, wherein the inhibitor of IL-1 is canakinumab.
20. The computer-implemented method of claim 17, wherein the anti-inflammatory or pro-resolving drug inhibits TNF activity.
21. The computer-implemented method of claim 17, wherein the anti-inflammatory or pro-resolving drug inhibits IL12/23.
22. The computer-implemented method of claim 17, wherein the anti-inflammatory or pro-resolving drug inhibits IL17.
23. The computer-implemented method of claim 1, wherein the agent that affects the inflammatory cascade is an inhibitor of a pro-inflammatory cytokine induced on danger signaling.
24. The computer-implemented method of claim 1, wherein the agent that affects the inflammatory cascade is a pro-resolvin.
25. The computer-implemented method of claim 24, wherein the pro-resolvin is an omega-3 fatty acid.
26. The computer-implemented method of claim 1, wherein the glycemic treatment agent is metformin.
27. The computer-implemented method of claim 1, wherein the plaque comprises atherosclerotic plaque.
28. A system comprising:
- a memory configured to store instructions; and
- one or more processors configured to execute the instructions to perform operations comprising: inputting, to an estimation model, non-invasively obtained data related to a plaque from a patient; generating virtual 'omics data that include estimated pathway activations or molecule levels, or both, of the patient, by applying the estimation model to the non-invasively obtained data from the patient; updating an in silico systems biology model using the generated virtual 'omics data to generate an in silico patient-specific systems biology model, wherein (i) the in silico systems biology model comprises a set of networks, wherein each network comprises a plurality of nodes, each node representing a baseline level of a molecule or pathway, and a plurality of edges between pairs of nodes, each edge representing a node-node interaction, (ii) at least two of the nodes of the plurality of nodes in each network represent molecules or pathways whose levels are affected by an atherosclerotic cardiovascular disease, and (iii) at least one network of the set of networks includes a disease-associated pathway activation or molecule level, or both, for each of the nodes in the network; perturbing the in silico patient-specific systems biology model to simulate, for the patient, a therapeutic effect of one or more pharmaceutical agents singly or in combination; and providing an output indicating a level of improvement in the atherosclerotic cardiovascular disease for the patient and a recommendation regarding whether the magnitude of the improvement relative to the stability of the patient's atherosclerosis warrants consideration of procedural intervention with or without medical therapy.
29. The system of claim 28, wherein the estimated pathway activations or molecule levels, or both, comprise an alteration in a level of a gene, a transcript, a protein, or a metabolite or dysregulation of a pathway.
30. The system of claim 29, wherein the at least one network of the set of networks further comprises edges representing protein-protein interactions, gene-gene interactions, gene-transcript interactions, transcript-protein interactions, protein-metabolite interactions, and/or protein-gene interactions.
31. The system of claim 30, wherein interactions represent any one of translation, activation, inhibition, indirect effect, state change, binding, dissociation, phosphorylation, dephosphorylation, glycosylation, ubiquitination, and methylation as a result of an interaction between two molecules.
32. The system of claim 28, wherein the plaque comprises atherosclerotic plaque.
33. One or more non-transitory computer-readable media storing instructions that are executable by one or more processing devices, and upon such execution cause the one or more processing devices to perform operations comprising:
- inputting, to an estimation model, non-invasively obtained data related to a plaque from a patient;
- generating virtual 'omics data that include estimated pathway activations or molecule levels, or both, of the patient, by applying the estimation model to the non-invasively obtained data related from the patient;
- updating an in silico systems biology model using the generated virtual omics data to generate an in silico patient-specific systems biology model, wherein (i) the in silico systems biology model comprises a set of networks, wherein each network comprises a plurality of nodes, each node representing a baseline level of a molecule or pathway, and a plurality of edges between pairs of nodes, each edge representing a node-node interaction, (ii) at least two of the nodes of the plurality of nodes in each network represent proteins whose levels are affected by an atherosclerotic cardiovascular disease, and (iii) at least one network of the set of networks includes a disease-associated pathway activation or molecule level, or both, for each of the nodes in the network;
- perturbing the in silico patient-specific systems biology model to simulate, for the patient, a therapeutic effect of one or more pharmaceutical agents singly or in combination; and
- providing an output indicating a level of improvement in the atherosclerotic cardiovascular disease for the patient and a recommendation regarding whether the magnitude of the improvement relative to the stability of the patient's atherosclerosis warrants consideration of procedural intervention with or without medical therapy.
34. The one or more non-transitory computer-readable media of claim 33, wherein the plaque comprises atherosclerotic plaque.
Type: Application
Filed: Sep 7, 2023
Publication Date: Jan 4, 2024
Inventors: Andrew J. Buckler (Boston, MA), Ulf Hedin (Ronninge), Ljubica Matic (Solna), Matthew Phillips (Boston, MA)
Application Number: 18/243,138