METHOD AND APPARATUS FOR DETERMINING HEALTH STATUS
A system and method for monitoring the state of an individual. The method includes providing a stimulus to the individual, measuring a response to the provided stimulus, comparing the measured response to an expected response, and diagnosing one or more aspects of disease in accordance with the result of the comparison between the measured response and the expected response. The stimulus may be a predetermined test sequence, such as a visually displayed predetermined sequence of images, or may include observation of the physical response of the individual while performing one or more predetermined activities. Images or video of the individual responding to one or more test sequences may be stored in a lossy or lossless state, and thus security and de-identification may be provided to stored data. This stored data may also be de-identified in a manner that allows for the answering of the greatest number of future questions.
This application is a continuation of U.S. patent application Ser. No. 15/682,366, filed Aug. 21, 2017, which claims the benefit of the following U.S. Provisional Patent Applications, the entire contents thereof being incorporated herein by reference:
U.S. Provisional Patent Application Ser. No. 62/377,818, filed Aug. 22, 2016, to Hanina, titled “Method and Apparatus for Determining Health Status”;
U.S. Provisional Patent Application Ser. No. 62/419,763, filed Nov. 9, 2016, to Hanina et al., titled “Method and Apparatus for Storing Information”;
U.S. Provisional Patent Application Ser. No. 62/505,627, filed May 12, 2017, to Hanina et al., titled “Method and Apparatus for Storing and Processing Information”; and
U.S. Provisional Patent Application Ser. No. 62/531,703, filed Jul. 12, 2017, to Hanina et al., titled “Method and Apparatus for Visual Diagnostics.”
This application also incorporates by reference the entire contents of the material presented in U.S. patent application Ser. No. 13/189,518, filed Jul. 24, 2011, to Hanina et al., titled “Method and Apparatus for Monitoring Medication Adherence” and published as U.S. Patent App. Pub. No. 2012/0316897.
FIELD
The subject matter of the present disclosure relates generally to monitoring patient health status and to the diagnosis and monitoring of disease employing visual images and image analysis, and more particularly to visually monitoring health status by visually or otherwise recognizing one or more characteristics of the patient, and to the use of particular visual analysis tools and models in both active and passive monitoring techniques to diagnose and monitor disease and symptoms of disease as related to particular disease states and patient populations.
The subject matter of the present disclosure additionally relates generally to storing personally identifiable visual information, and more particularly to storing such video information in a manner that allows for analysis of activity, clinical parameters, diagnosis or monitoring of disease, or other desired features, in addition to any desired future analysis of such data, while maintaining the security and privacy of the data. The subject matter of the present disclosure also relates to the use of such stored secured information in the diagnosis and treatment of disease, as well as more generally determining status of an individual based upon the stored data, as noted above.
BACKGROUND
Monitoring patient health status, as well as the diagnosis and monitoring of disease, whether, e.g., during a clinical drug trial, during disease management by a physician, or in home care scenarios, may entail confirming that a patient has administered required medication.
SUMMARY
In U.S. Pat. Nos. 8,731,856, 8,731,961, 8,666,781, 9,454,645, and 9,183,601, the contents of these five patents being incorporated herein by reference in their entirety, the inventors of the subject matter of the present disclosure have proposed a system, method and apparatus that allow for complete control and verification of adherence to a prescribed medication protocol or machine or apparatus use in a clinical trial setting, whether in a healthcare provider's care, or when self-administered in a homecare situation by a patient.
In U.S. Pat. No. 9,256,776 (the '776 patent), the content of this patent also being incorporated herein by reference in its entirety, the inventors of the subject matter of the present disclosure describe a system for de-identification of one or more images of an individual. The techniques for determination of one or more portions of an image to be de-identified may be applied to the subject matter of the present disclosure, thus reducing required computing load. As will be described below, application of the feature extraction/keypoint method and system may be applied not necessarily to complete images, but additionally to one or more portions of images as identified in accordance with the '776 patent.
These patents and other patents attributable to the inventors of the subject matter of the present disclosure present the only medication management system that may determine whether a user is actually following a protocol, and that may provide additional assistance to a user, starting with instructions preferably including one or more interactive and real-time audio, visual, textual or the like prompts based upon one or more actions detected of the user, and moving up to contact from a medication administrator if it is determined that the user needs such assistance, in any medication adherence situation, including clinical trial settings, home care settings, and healthcare administration locations such as nursing homes, clinics and hospitals. They also present the only system designed to contextually de-identify images of the face of an individual, allowing for review of these images while maintaining the security of these images.
The subject matter disclosed herein builds on these initial inventions and provides one or more features that may be employed in accordance with these systems to use further visual information collected by a video camera or other sensor to determine additional characteristics associated with medication administration or otherwise the health of the patient administering the medication, or any other individual.
The subject matter disclosed herein builds on these initial systems, devices and techniques, and additionally provides one or more mechanisms for collecting and storing visual data related to the users of the system, and in a particular embodiment, video of the users administering medication. In this context, such video may provide a view of the face of a particular user, thus allowing their identity to be determined from these video images. Such determinations may be made on the local device of a user, or at a remote processing location, such as a cloud computing location, or dedicated remote server location. While this information may be stored in a secure manner, through encryption and the like, the inventors of the subject matter disclosed herein have determined that it would be beneficial to implement a system that balances the need to secure the video data with the ability to allow for future analysis of the data in one or more manners not currently contemplated.
Therefore, in accordance with one or more embodiments of the present disclosure, a system and method are provided in which a video sequence is preferably analyzed to determine a number of features that may be representative of one or more images in a video sequence, such as keypoints or landmarks of the face of a user indicative of the identity, a current health or other state of the user, or other visual features in one or more additional images. Once such features are determined, a subsequent determination may be preferably made to determine a subset of these features that may be maintained which allow for analysis of the video sequence, while being “lossy” enough to de-identify the images by precluding reverse processing of the features to allow identification of the user. Images for use in accordance with the invention may be captured using a camera capture device, such as that in a dedicated camera, a mobile device such as a smartphone, or the like. Analysis processing may employ any methods, such as computer vision analysis, neural networks, deep (reinforcement) learning, machine learning, or the like. Processing may be provided by a processor embedded within such a camera, mobile device, or dedicated computing system. Data transmission preferably takes place over a cellular, Wi-Fi enabled or other wireless or wired communication system.
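By way of illustration only, a minimal sketch of the subset-selection step might proceed as follows (Python; the landmark coordinates are assumed to come from any off-the-shelf face-landmark detector, and the priority ordering here is a placeholder):

```python
import numpy as np

def deidentify(landmarks: np.ndarray, priority: np.ndarray, keep: int) -> np.ndarray:
    """Retain only the `keep` highest-priority landmark points.

    landmarks: (N, 2) array of facial keypoint coordinates from any
    landmark detector. priority: a fixed ordering over the N indices,
    chosen offline so that the retained subset still supports the
    intended analyses (gaze, affect, motion) while being too sparse
    to reconstruct an identifiable face.
    """
    return landmarks[priority[:keep]]

# Example: 468 detected points, of which only the 40 highest-priority
# points are stored; the rest are discarded, making the stored
# representation "lossy" with respect to identity.
landmarks = np.random.rand(468, 2)
priority = np.arange(468)  # placeholder ordering, for illustration only
sparse = deidentify(landmarks, priority, keep=40)
```

The value of `keep` acts as the security/utility dial discussed throughout this disclosure: fewer retained points means stronger de-identification but less future analytical value.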
Additionally, information retrieved from one or more users of the system, whether provided in identifiable or de-identified format, may be used in order to make determinations of diagnosis or monitoring of disease, or other detailed status of an individual, such as determination of pulse rate or other bodily states. Thus, any images obtained of such users may be relied upon in order to determine one or more statuses of a user, whether the images are identifiable, or de-identified. The ability to perform such analysis from a de-identified image ensures proper data protection while allowing for extensive post-hoc analysis of these de-identified images. These images to be analyzed may further be analyzed in other than collected chronological order, thus allowing for the analysis of individual frames, or frames having similar characteristics, together regardless of when these images were acquired.
The subject matter of the present disclosure further provides one or more mechanisms for collecting and storing visual data related to the users of a system, and in a particular embodiment, video of users or patients performing one or more predetermined activities (active monitoring), or performing their daily routine activities (passive monitoring). Active monitoring may comprise presentation of a particular visual sequence on a display, and monitoring of a user's response to this visual sequence, such as by tracking or otherwise analyzing the eyes, gaze direction or other facial characteristic of the user. Passive monitoring may comprise observing the user when performing a sequence as instructed by a device, but for the purpose of another activity, rather than the visual sequence displayed as noted above, or by monitoring daily routines that are known to the system in order to measure a response. In this context, such video may provide a view of the face of a particular user, thus allowing for one or more desired characteristics to be viewed and tracked over time. Such characteristic review may be made on the local device of a user, or at a remote processing location, such as a cloud computing location, or dedicated remote server location. This information is preferably stored in a secure manner, through encryption and the like.
Therefore, in accordance with one or more embodiments of the present disclosure, a system and method are provided in which a video sequence of a user performing one or more predetermined activity sequences, or performing routine activities, is analyzed to determine a number of features that may be representative of one or more diagnostic attributes, such as eye movement, affect, heartrate, skin tone and the like. Once such features are recognized and tracked, a subsequent determination may preferably be made to determine a subset or combination of these features that are indicative of diagnosis or monitoring of disease, and may be analyzed over time to determine changes in a particular disease progression.
The subject matter of the present disclosure provides one or more features that may be employed in accordance with these systems to use further visual information collected by a video camera or other sensor to determine additional characteristics associated with the health of a patient, or any other individual.
Images for use in accordance with the subject matter of the present disclosure may be captured using a camera capture device, such as that in a dedicated camera, a mobile device such as a smartphone, or the like. Analysis processing may employ any methods, such as computer vision analysis, neural networks, deep learning, machine learning, or the like. Processing may be provided by a processor embedded within such a camera, mobile device, or dedicated computing system, either local or remote, such as in a cloud-based system. Data transmission preferably takes place over a cellular, Wi-Fi enabled or other wireless or wired communication system.
Additionally, information retrieved from one or more users of the system may be used in order to make determinations of diagnosis of disease, or other detailed status of an individual, such as determination of pulse rate, eye movement, or other bodily states. Thus, any images obtained of such users may be relied upon in order to determine one or more statuses of a user.
Implementations of the subject matter of the present disclosure may have advantages relative to existing systems and techniques. For example, U.S. Pat. No. 7,359,214 includes a device that provides instruction to a patient regarding medications to take. This system, however, provides no mechanism for actually confirming that a patient is in fact properly administering required medication, including, e.g., placing a medication pill into their mouth, or injecting or inhaling medication following a predetermined series of steps. Such confirmation may be required in a clinical drug trial; as prescribed by a prescribing physician in the case where adherence to a particular regimen may prove to be critical to efficacy of the prescription regimen; in various public health scenarios; in situations where failure to keep up a prescription regimen can potentially harm a population as a whole, such as the generation of antibiotic-resistant bacteria strains; in various disease management scenarios; or in home care situations where maintaining proper control of administering healthcare professionals is critical. U.S. patent application Ser. No. 11/839,723 (now U.S. Pat. No. 8,538,775), filed Aug. 16, 2007, titled “Mobile Wireless Medication Management System,” provides a medication management system employing mobile devices and an imaging technology so that a user is able to show a pill to be taken to the system, and the system can then identify the medication. Here too, however, there is in fact no particular manner in which to ensure actual adherence, including ingestion, inhalation, or injection of the medication, or to assess the relationship of adherence to the efficacy or safety of the drug over time.
The inventors of the subject matter disclosed herein have determined that diagnosis and monitoring of disease traditionally requires subjective determination of disease state by a healthcare professional. Application of known disease parameters to a currently observed set of disease states from a patient results in a diagnosis of disease. Continued monitoring of these disease states allows for monitoring of disease, and determinations of progression thereof, over time. Medical diagnosis and monitoring systems typically rely on such subjective determinations, or upon measurements made in a controlled environment, such as blood draws in a clinic, x-rays and the like.
The inventors of the subject matter disclosed herein have further determined, however, that prior art systems fail to describe the use of advanced visual analysis to properly diagnose and monitor disease, collecting information from across populations, and determining critical characteristics indicative of such diagnoses. These systems similarly fail to take into account accurate determinations of medication adherence in order to further diagnose and monitor disease.
U.S. Pat. No. 8,408,706 entitled “3D Gaze Tracker” describes a system for determining a gaze of a person, comprising a 3D camera that acquires a range image of a person, a picture camera that acquires a picture of the person, and a controller that processes images acquired by the cameras to determine a gaze direction and origin for the gaze vector of an eye of the person. This complex system is one that is difficult or impossible to implement on a simple mobile device, and requires complex processing of an image being viewed by the person in order to determine the input stimulus being provided to the person.
U.S. Patent App. Pub. No. 2013/0321772, entitled “Medical Diagnostic Gaze Tracker,” describes a system for detecting a gaze behavior of a user in response to a stimulus, where the stimulus is a non-diagnostic stimulus, determining a type of the stimulus, and generating an indication of the gaze behavior and the type of the stimulus. Thus, while the system is not an active system (i.e., does not provide a predetermined stimulus to the subject), it does require a complex system for imaging the field of view being viewed by a person, and determining a type of input stimulus being observed by the person.
International Patent Application Serial No. WO/2001/074236 describes a system for diagnosing Attention Deficit Hyperactivity Disorder (ADHD). The application in turn relies upon a complex eye tracker system described in U.S. Pat. No. 4,852,988, and comprises a complex system including a helmet to be worn by the person being tested. While this application describes a number of characteristics to be reviewed in diagnosing ADHD, the system suffers from complexity, and the inability to monitor eye movement when viewing a passive, non-predetermined set of visual images, as the system is primarily designed to track the eyes of the person while performing a physical task.
Additionally, existing systems fail to further incorporate the monitoring of any further patient characteristics to aid in determining proper medication adherence, and to further determine a status of the patient.
The inventors of the subject matter of the present disclosure have further determined that these and other existing systems fail to consider that sensitive or other individually identifiable information may be captured and stored by these systems. These existing systems fail to consider confidentiality of any acquired information, and the ability to store this information in a secure, confidential manner while still allowing for future analysis of any such acquired and stored data.
The inventors of the subject matter of the present disclosure have further determined that prior art systems fail to describe the use of such stored data for the purpose of diagnosing or monitoring disease or current status of an individual.
Still other objects and advantages of the invention will in part be obvious and will in part be apparent from the specification and drawings.
The subject matter of the present disclosure accordingly comprises several steps and the relation of one or more of such steps with respect to each of the others, and the apparatus embodying features of construction, combinations of elements and arrangement of parts that are configured to effect such steps, all as exemplified in the following detailed disclosure, and the scope of the invention will be indicated in the claims.
Reference is made to the following description and accompanying drawings, in which:
In accordance with an embodiment of the subject matter of the present disclosure, a visual motion capture device, camera or the like is used to capture motion information related to the administration of pill- or film-based oral medications, injectable, inhaler-based or other non-pill-based medications, or any other form of patient administration task that may be performed, and may be utilized in accordance with one or more of the methods, devices, and systems noted in the above-referenced applications.
Information Capture System
Referring first to
Referring next to
In accordance with an embodiment of the present disclosure, apparatus 1000 is preferably configured to be part of a system that improves adherence to a medical protocol and supports analysis of collected visual data of user medication adherence to determine the health status of users. Users of apparatus 1000 in accordance with this system give administrators a tangible and concrete manner in which to review activities and collected information. Apparatus 1000 of the invention is configured to receive instructions for patients from remote data and computing location 3000 and provide these instructions to patients. Such instructions may comprise written, audio or visual instructions for guiding a user to perform one or more activities, such as performing a sequence of actions to test a particular action of the user, or to confirm whether a user is adhering to a prescribed medication protocol.
The system, in accordance with an embodiment of the present disclosure, is also applicable to monitoring of patient activities when being requested to perform particular actions, or when performing routine actions not specifically requested. Therefore, in accordance with an embodiment of the present disclosure, a method and apparatus may be provided for analyzing captured patient motion data, preferably in near real time to provide feedback to the user, to determine a number of times a participant performs some action that is, for example, considered suspicious, or to determine one or more elements of diagnosing or monitoring disease.
In accordance with a further embodiment of the present disclosure, the visual capture device 1110 may be used to capture visual information related to one or more subjects. Any standard camera or image capture device may be employed, including but not limited to a camera on a mobile phone, tablet, other computing device, standalone camera, or any other image acquisition apparatus that is able to record one or more (video) images of a subject. In a preferred embodiment of the present disclosure, the subject may comprise a face of a human, but may comprise any other desirable subject, including one or more other body parts of the user, or another object. Analysis of these recorded images may be performed in real time, or the images may be stored for future analysis. Storage of such images may be performed local to the image capture apparatus, at a remote location, such as a dedicated storage location, or in a cloud-based storage system. Storage of these images, however, may present a security problem, as the identity of the subjects or other confidential information in the captured visual information may be discernible from these images. De-identification of this stored information will be described below.
Additionally, visual representations of a user can be further used to determine a status of the user. Visual determination of one or more parameters, such as motion, eye motion, skin tone, emotions, heart rate, blood pressure, body mass, GPS location, proximity, or other measurements (such as non-visual measurements) that may be provided in accordance with one or more incorporated or coupled sensors, may be measured visually or otherwise, at once or over time, to determine changes in one or more of such parameters in order to identify changes in the health of the user. In accordance with an embodiment of the present disclosure, by way of example, display 1140 preferably displays one or more bits of information to a user. Such information may preferably comprise a specific video sequence designed to test the reaction of the user, or may comprise interactive or other instructions to the user to perform a predetermined activity. The information capture apparatus preferably captures information monitoring the user upon viewing the displayed information and performing one or more activities in response thereto. Other devices for capturing information in response to presented visual or other stimuli may include diverse sensors, such as glucose meters, blood pressure cuffs, radar systems, visual capture devices, thermometers, accelerometers (measuring the shake of the hand of a user, for example), or the like. One or more of such measured parameters may be used to identify particular characteristics of one or more disease states. In such a manner, while monitoring adherence or other activities, or when performing actions in response to a presented test script, such parameters may be measured, and reported back to one or more healthcare professionals, one or more care providers, or other individuals, or may be collected in order to be analyzed automatically, perhaps over time, to diagnose and monitor disease. Thus, these parameters may be measured over time without reference to adherence, allowing for diagnosis of disease, measurement of progression of disease once diagnosed, or measurement of various health indicators to gauge the overall health of an individual.
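By way of a hedged example only, one such visually measured parameter, pulse rate, might be estimated from a short video of a skin region roughly as follows (Python with NumPy; this crude remote-photoplethysmography sketch is illustrative and not a clinical method):

```python
import numpy as np

def estimate_pulse_bpm(frames: np.ndarray, fps: float) -> float:
    """Rough pulse estimate from a stack of face/skin-region frames.

    frames: (T, H, W, 3) array of video frames; fps: frame rate.
    The mean green-channel intensity varies slightly with blood
    volume; its dominant frequency in the 0.7-4 Hz band (42-240 bpm)
    approximates the heart rate.
    """
    signal = frames[:, :, :, 1].mean(axis=(1, 2))  # mean green per frame
    signal = signal - signal.mean()                # remove the DC offset
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fps)
    band = (freqs >= 0.7) & (freqs <= 4.0)         # plausible pulse band
    peak = freqs[band][np.argmax(spectrum[band])]
    return float(peak * 60.0)
```

Repeated estimates of this kind, taken over days or weeks, form the longitudinal parameter series from which changes in health may be identified.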
Furthermore, a database or other repository of such measurements may be collected over time and over users at remote data and computing location 3000. Such database may be characterized by disease state or other demographic characteristics. In this manner future measurements of one or more users may be compared against such a database, and allow for diagnosis of one or more diseases, or changes in these characteristics when monitoring these diseases. Furthermore, expected progression of such parameters over time may be determined for a population as a whole, or for various portions of a population, defined by demographics, disease state or the like. So, by way of example, it may be possible to determine expected progression of one or more visual characteristics, such as weight gain, of a female aged 40-60 suffering from diabetes, or to determine expected changes in response to a visual presentation of a script to be followed by a user. Of course, other sub-populations may also be determined from such a database.
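A simple illustration (Python; the data are hypothetical) of comparing a new measurement against such a population database might look like the following, here using a z-score against a demographically matched cohort:

```python
import numpy as np

def flag_outlier(cohort_values: np.ndarray, measurement: float,
                 z_threshold: float = 2.0) -> bool:
    """Compare one user's measurement against a demographic cohort.

    cohort_values: the same parameter (e.g., 30-day weight change)
    measured across users sharing the relevant demographics and
    disease state. Returns True when the new measurement deviates
    enough from the expected progression to warrant review.
    """
    mean, std = cohort_values.mean(), cohort_values.std()
    if std == 0:
        return False
    return abs(measurement - mean) / std > z_threshold

# e.g., 30-day weight change (kg) for a cohort of females aged 40-60
# suffering from diabetes (illustrative values only)
cohort = np.array([0.4, 0.9, 0.2, 1.1, 0.7, 0.5])
print(flag_outlier(cohort, measurement=3.2))  # True: atypical gain
```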
In an additional embodiment of the present disclosure, the measured characteristics may comprise one or more de-identified video assets that may be determined from capture of images of a user from connected, integrated or other visual capture apparatus 1110, such as a still or video camera. De-identification of one or more of such captured images allows for further analysis of these images, generating a more robust database, while protecting the identity of any individual originally pictured in the images. Thus, through the generation of derivative images, portions of images, characteristics or the like, information may be stored in a manner that allows for future analysis while protecting the identity of the users. One or more mechanisms for generating such de-identified images will be described below.
In yet a further embodiment of the present disclosure, whether a particular user has modified disease characteristics may be determined in accordance with one or more unsupervised learning systems, such as a neural network or the like. In such a manner, the database of collected images may be employed to train such a system, identifying one or more characteristics from the training images that may be used to identify similar characteristics in future images. Furthermore, the use of such unsupervised learning techniques preferably provides an additional layer of de-identification of images, allowing for the further collection of images of users, and the subsequent maintenance of these images in a de-identified state through the generation of derivative images, image portions, extracted characteristics and the like that may be stored for comparison to future images processed in a similar manner. Furthermore, data may be further encoded in these images, such as one or more codes associated with a camera device, or other identifying information. Additional information collected from one or more external sensors, such as accelerometers or voice recorders associated with the camera device, or one or more external medical devices, such as glucose meters, heartrate meters, or other measurement devices, or any of the sensors noted in
By way of example, pulse oximeters, heartrate monitors and the like may be employed with collected video information to allow for more precise determinations. Additionally, micro movements associated with movement of a mobile device or the like may also be employed. Micro eye movements, gaze tracking, analysis of expression, or any other micro gestures, micro movements, or other recognizable conditions and the like may further be employed. These additional measured features may be further employed to identify changes in characteristics along a number of alternative dimensions in such an unsupervised or supervised learning system, ultimately diagnosing or monitoring disease. Analysis of the accumulated information may allow for identification of one or more common characteristics among or between various disease states, demographic states, or other common identifying characteristic.
Longitudinal analysis of such data and of changes in visual and other characteristics over time may be further correlated to negative health outcomes such as hospitalization events or death, and may give rise to relationships that can then act as the basis to trigger interventions in advance of a negative health outcome occurring. Through such monitoring, early warning signs may be extracted from visual images of users in a manner not previously possible. Thus, any number of visual analysis techniques may be employed to generate a video asset base by therapeutic area over time, thus allowing for the use of such assets to evaluate the health of future users having similar characteristics and residing in similar therapeutic areas.
In accordance with one or more embodiments of the present disclosure, it is anticipated that the use of one or more sections of the electromagnetic spectrum will allow for an in-depth analysis of facial or other visible user features. For example, as will be described below, rather than simply noting external facial features, the techniques and systems disclosed herein allow for the determination of the location of various blood vessels under the skin of a user in the field of view of a camera. Over time, differences determined in the various images provide information about the performance of the user, and may further indicate changes in disease, physical ability, or the like. Such changes, for example, may be more visible under near-infrared light, or other wavelength of energy, thus resulting in additional information being extracted based upon the use of multiple types of light, energy, or other data extraction mechanisms.
As will similarly be described below, by overlaying multiple layers of information, irrespective of the amount of information provided in each of those layers, a more concrete picture of the status of an individual may be provided. Information collected from lower power devices may include lower resolution information, such as images with fewer pixels, or may include different mechanisms for data collection. As more information is acquired, either through different collection mechanisms, or as the power of collection improves over time, correlations between collected data and negative outcomes (or positive outcomes) in one or more medical therapeutic areas may be provided. The system may therefore be employed as a diagnostic tool for predicting disease or other diagnosis based upon processing of visual images of a user. By allowing for the use of such varied layers, the system is robust to methods of collection, timing of collection, and storage formats of the information, thus “future proofing” the system to rely on older data (perhaps at lower resolutions), and newer collected data, having a higher resolution, or a newer or new data collection technique.
The inventive system may therefore learn various correlations between one or more observed features, and health status, health outcomes, disease progression, symptom progression, or one or more changes in overall health. By analyzing and correlating these changes in features and ultimate health status, the system provides a mechanism for determining yet unknown relationships between measurable quantities and the health of an individual. Once established, these relationships can then be used to predict future medical situations. By way of example, one or more sub-segments of the population may be targeted for observation. If such population is a post-stroke population, it is known that rapid weight gain may be a symptom of failure to take proper medication, or may predict a more urgent medical situation. In accordance with an embodiment of the present disclosure, daily images of an individual may allow for a determination of such rapid weight gain without the use of a body-weight scale. In such a situation, a healthcare provider may be immediately notified to follow up with the individual. While visual-spectrum features may be used to determine weight gain, determinations of changes in pulse, blood pressure or other measurements may rely on the above-mentioned other areas of the electromagnetic spectrum, audio pulses, or any other type of desirable sensor, whether alone or in concert with a visual analysis.
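As a sketch only (Python; the window and threshold are hypothetical placeholders), the rapid-weight-gain trigger described above might be expressed as a rolling check over daily visual estimates:

```python
from collections import deque

def monitor_weight(estimates, window: int = 7, gain_threshold_kg: float = 2.0):
    """Flag rapid weight gain from a stream of daily estimates.

    estimates: iterable of (day_index, estimated_weight_kg) pairs,
    e.g., derived from daily images of the individual. Yields the day
    indices on which the gain over the trailing window exceeds the
    threshold, at which point a healthcare provider may be notified.
    """
    history = deque(maxlen=window)
    for day, weight in estimates:
        if history and weight - min(history) > gain_threshold_kg:
            yield day
        history.append(weight)

daily = [(1, 81.0), (2, 81.2), (3, 81.1), (4, 83.5), (5, 84.2)]
print(list(monitor_weight(daily)))  # [4, 5]: rapid gain flagged
```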
In accordance with alternative embodiments of the present disclosure, accumulated images of one or more users, associated sensor data information, visually extracted information, and one or more additional inputs may be incorporated into a comprehensive database. Analysis of the accumulated information may allow for identification of one or more common characteristics among or between various disease states, demographic states, or other common identifying characteristic. Thus, in accordance with one or more embodiments of the present disclosure, collected information may be stored in a standard or de-identified format, and used to determine not only current conditions and actions, but also to diagnose disease, changes in health conditions, or to segregate individuals from populations to identify one or more potential medical or other unique conditions. The techniques and systems disclosed herein provide the ability to perform these diagnoses from de-identified information, thus allowing for the protection of personal information while retaining sufficient information to allow for subsequent diagnosis.
As noted above, this accumulated information may be employed to train one or more learning systems, thus further extracting common characteristics or elements to be extracted. These extracted characteristics comprise one or more derivative elements of the accumulated images and other data, and allow for the storage of these derivative elements in a de-identified manner, without the need to store and maintain the images including identifying information. In such a manner, large databases of patient information may be processed, and characteristics extracted therefrom may be used to identify further common characteristics from later acquired images related to medication adherence, or other disease state characterization. The system will therefore be adaptable to diagnose disease, or other changes in patient status simply by taking images of a new user, either in a single shot, or noting changes in any measured characteristics over time.
Longitudinal analysis of such data and of changes in visual and other characteristics over time may be further correlated to negative health outcomes such as hospitalization events or death, and may give rise to relationships that can then act as the basis to trigger interventions in advance of a negative health outcome occurring. Through such monitoring, early warning signs may be extracted from visual images of users in a manner not previously possible. Thus, any number of visual analysis techniques may be employed to generate a de-identified video asset base by therapeutic area over time, thus allowing for the use of such assets to evaluate the health of future users having similar characteristics and residing in similar therapeutic areas.
The system may process information at remote system 3000 housing a database of collected information. New images acquired by an image acquisition camera 1110 in local mobile device 1000 may be transmitted to the remote location 3000, one or more of the above-noted identification or processing techniques may be applied, and then the results of such analysis may be provided as feedback to a user or other healthcare provider to provide advanced notice for possible adverse events, changes in health, and predicted outcomes, such as future hospitalizations, worsening of illnesses, particular symptoms or the like. Alternatively, processing may be performed in advance of image acquisition, and then a system for identifying characteristics of a future image may be provided to the local device to further process acquired images. Such a system may be reduced in processing requirements to allow for appropriate processing on the local mobile device. By providing either of these systems in accordance with a medication monitoring process, when monitoring medication administration, other health issues of the user may be diagnosed. Such diagnosis may identify changes in characteristics of a particular disease state, or may identify new diseases or one or more characteristics of any of such diseases.
De-Identification of Information
Referring next to
As is further shown in
A second option denoted by lossy triangle 130 allows for the computation of some generic level data layers that have identifying information removed, and also to use the collected original data to answer some known questions before removal of the identifying information. As can be seen in
A third option 135 goes a bit further and simply computes answers to known questions and then discards all of the initially-acquired information. While this provides a further improvement along the security axis, the ability to answer any future questions that might arise is minimal, as there is no further representation of the initially-acquired data to be processed. Computing answers to all known tasks provides more privacy since the data stored is restricted to a minimal set of information. On the other hand, it is unlikely that all queries will be determined up front, and therefore the ability to address many unknown future tasks is low.
Finally, option 140 simply discards all known data after any current processing. While this situation provides the highest level of security, there is no ability to use the information in the future.
The inventors of the present disclosure have determined that option 130 provides the best tradeoff between future processing and security. The subject matter of the present disclosure therefore presents a number of solutions in order to achieve this option along with a system allowing for selectability of the security/future processing tradeoff.
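The selectable tradeoff might be represented in software as a simple retention policy, sketched below (Python; the enumeration mirrors the four options described above, and all names are illustrative):

```python
from enum import Enum

class RetentionPolicy(Enum):
    """Storage options ordered by increasing security and decreasing
    ability to answer future questions."""
    LOSSLESS = 1        # keep original data: least secure, most flexible
    LOSSY_FEATURES = 2  # keep de-identified feature layers (option 130)
    ANSWERS_ONLY = 3    # keep only answers to known questions (option 135)
    DISCARD = 4         # keep nothing after current processing (option 140)

def retain(original, features, answers, policy: RetentionPolicy):
    """Return what the system persists under the chosen policy."""
    return {
        RetentionPolicy.LOSSLESS: original,
        RetentionPolicy.LOSSY_FEATURES: features,
        RetentionPolicy.ANSWERS_ONLY: answers,
        RetentionPolicy.DISCARD: None,
    }[policy]
```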
Referring next to
As is therefore shown in
Such testing or processing may provide one or more guidelines as to the number of features to be extracted from one or more images and to be stored in order to support any future processing. The images to be processed may be further classified by complexity or the like to determine the number and type of features to be extracted (i.e., more complex images may result in more features to be extracted, representing the richness of information available, while less complex images may warrant extraction of only a smaller number of features). Once such classifications are determined, they may similarly be used to classify and process future images in a more streamlined process. Thus, for images of a certain complexity, features determined to be helpful in the processing of prior images of similar complexity may again be used. Alternatively, testing may be performed during live processing on each image, or on a subset of images, to adjust on the fly the number and type of features to be extracted.
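One automated way to tie the feature budget to image complexity is sketched here with a crude gradient-energy proxy (Python; the normalization constant and bounds are arbitrary placeholders):

```python
import numpy as np

def feature_budget(image: np.ndarray, min_pts: int = 20,
                   max_pts: int = 200) -> int:
    """Choose how many features to extract for a given image.

    Complexity is approximated by the mean gradient energy of the
    grayscale image; richer images earn a larger budget, simpler
    images a smaller one, as suggested above.
    """
    gray = image.mean(axis=2) if image.ndim == 3 else image
    gy, gx = np.gradient(gray.astype(float))
    energy = np.hypot(gx, gy).mean()
    complexity = min(energy / 50.0, 1.0)  # crude normalization to [0, 1]
    return int(min_pts + complexity * (max_pts - min_pts))
```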
As noted with respect to
In an alternative embodiment of the present disclosure, rather than predetermining a number of features to be extracted from an image, or making such determination in an automated fashion during processing of an image, a mechanism for determining a number of features/generic layers to be extracted may be provided to a user, such as a slider, dial, or number selector. Thus, the user may be provided with a visual mechanism for selecting the relationship between security and level of information. For each image or each batch of images, based upon the subject matter thereof, the user is able to select the number of features stored, and thus the tradeoff between security and data availability. The system may preferably determine a most useful set of stored features, or a level of granularity may be provided to users to identify any desired bias in the selection of such features. For example, if the user is likely to want to answer questions in the future focusing on a particular area, features may be chosen that are more likely to help in answering such questions.
In accordance with a preferred embodiment of the present disclosure, various processing may be performed on the one or more images to identify one or more features of the subject of the image. In a more preferred embodiment, the subject may be the face of a person, and the features may comprise one or more keypoints or landmarks, and may further comprise one or more known predetermined and mapped keypoints or landmark points, or time-domain features such as significant activities. Alternatively, such features may be determined through analysis employing an unsupervised learning process, a supervised learning process, or other teaching or learning process.
It should be noted that the ability to process these features, including one or more keypoints or landmark points, may be limited by the available processing power of the associated computer processor. Thus, for example, while a processor associated with a mobile device may be able to extract “x” features or points, a dedicated processor may be able to extract 10×, 100× or more features or points. Therefore, it is important that any processing system be able to address these varying numbers of points.
Referring next to
Moving next to image 530, another post-processing image derived from image 510 is shown. The number of points stored associated with the image has been reduced from the number shown in image 520. As can be seen, while the major features are still visible, some of the identifying detail has been removed. It may still, however, be possible to identify the individual in image 510 by looking at image 530. Additionally, with further computer processing, it may be possible to regenerate something approximating image 510 from image 530.
Fewer points are stored and used to generate image 540, and indeed, one can see that the ability to determine the identity of the subject in the image is reduced. Image 550 relies on even fewer points, and may be considered de-identified, as there is not enough stored information to go back to the image 510.
It is important to note that the points stored in image 550 are preferably a subset of the points stored in image 540, which are a subset of points in image 530, and then 520. Alternatively, and more generally, a relationship of some sort is preferably defined between the keypoints/features in the different images. By determining the most critical points for determining action, recognition or the like, these points can be prioritized in image 550, but also stored in images 540, 530 and 520 to allow for consistency of processing.
As noted, in a situation where different numbers of features or points are extracted, the fewer points are preferably a subset of the extracted larger number of points. Thus, when processing is to proceed, it is possible to combine images having a different number of extracted points. Processing may be variable based upon the number of points, or may be based upon the minimum number of points in one of the images to be processed, these same minimum number of points being processed from each stored set of points representative of an image, and irrespective of whether additional points may be available for processing.
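A minimal sketch (Python; illustrative names) of this nested-subset arrangement and of processing at the minimum common point count follows:

```python
import numpy as np

def extract_nested(landmarks: np.ndarray, priority: np.ndarray, keep: int):
    """Keep the first `keep` points of one fixed priority order, so a
    smaller extraction is always a subset of any larger one."""
    return landmarks[priority[:keep]]

def compare(points_a: np.ndarray, points_b: np.ndarray) -> float:
    """Compare two stored point sets of different sizes by truncating
    both to the smaller count; valid because both were extracted with
    the same priority order, so the shorter set is a prefix of the
    longer one."""
    n = min(len(points_a), len(points_b))
    return float(np.linalg.norm(points_a[:n] - points_b[:n], axis=1).mean())
```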
As described with reference to
Thus, the selection of the number and types of features or points to be selected may be performed in a simple manner (i.e., limiting extraction to a particular predefined subset of points), or may be performed in a more complex iterative manner, such as by evaluating the ability to reconstitute information given a set of features, and then adjusting whether to extract a greater or fewer number of points, or even whether to extract different points, regardless of the number of points. Additionally, various external information may be used in order to guide point selection. For example, if a particular population is included, this may result in particular keypoints and features being selected. For instance, if a patient population has an issue with facial twitching, then in accordance with a preferred embodiment of the present disclosure, the features selected may be one or more that allow for an in-depth analysis of any twitching on the face. Other desired features may also be employed in order to extract particular information for use in a particular analysis.
In addition to selecting one or more keypoints or other features in the visible spectrum, non-visible light, or other areas of the electromagnetic spectrum, may preferably be employed. The use of such non-visible and other electromagnetic radiation allows for the extraction of additional information from an image, for example from the face of a person in an image. It is also contemplated in accordance with one or more embodiments of the present disclosure that different features or other keypoints may be selected in accordance with the different forms of electromagnetic radiation. Various information and analysis may be performed in accordance with each of these multiple forms of radiation, the results thereof interacting to provide yet additional information in determining one or more features. Various computational photography techniques may be employed, utilizing so-called “edge of camera spectrum” techniques to formulate a greater and much higher resolution understanding of, for example, blood flow and blood pressure. Such computational photography techniques can thus be utilized to build upon derivative data sets. Indeed, any applicable features may be employed, either alone or in combination, to extract valuable data.
In accordance with one or more embodiments of the present disclosure, it is anticipated that the use of one or more sections of the electromagnetic spectrum will allow for an in-depth analysis of facial or other visible user features. For example, rather than simply noting external facial features, use of the techniques and systems disclosed herein allows for the determination of the location of various blood vessels under the skin of a user in the field of view of a camera. Over time, differences determined in the various images provide information about the performance of the user, and may further indicate changes in disease, physical ability, or the like. Such changes, for example, may be more visible under near-infrared light, or other wavelength of energy, thus resulting in additional information being extracted based upon the use of multiple types of light, energy, or other data extraction mechanisms.
Thus, by overlaying multiple layers of information, irrespective of the amount of information provided in each of those layers, a more concrete picture of the status of an individual may be provided. Information collected from lower power devices may include lower resolution information, such as images with fewer pixels, or may include different mechanisms for data collection. As more information is acquired, either through different collection mechanisms, or as the power of collection improves over time, correlations between collected data and negative outcomes (or positive outcomes) in one or more medical therapeutic areas may be provided. The system may therefore be employed as a diagnostic tool for predicting disease or other diagnosis based upon processing of visual images of a user. By allowing for the use of such varied layers, the system is robust to methods of collection, timing of collection, and storage formats of the information, thus “future proofing” the system to rely on older data (perhaps at lower resolutions), and newer collected data, having a higher resolution, or a newer or new data collection technique.
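A rough sketch of such layer fusion follows (Python; for simplicity it assumes each coarser layer's dimensions evenly divide the finest grid):

```python
import numpy as np

def overlay_layers(layers, weights=None):
    """Fuse evidence layers of differing resolution into one map.

    layers: list of 2-D arrays, e.g., a visible-light feature map, a
    near-infrared map, and an older low-resolution capture. Each is
    upsampled by nearest-neighbor repetition to the finest grid and
    then averaged, so lower-power or older captures still contribute.
    """
    h = max(layer.shape[0] for layer in layers)
    w = max(layer.shape[1] for layer in layers)
    weights = weights or [1.0] * len(layers)
    fused = np.zeros((h, w))
    for wt, layer in zip(weights, layers):
        ry, rx = h // layer.shape[0], w // layer.shape[1]
        fused += wt * np.kron(layer, np.ones((ry, rx)))
    return fused / sum(weights)
```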
As is shown in
Processing included in
Therefore, in accordance with the embodiment of the present disclosure noted in
In accordance with an embodiment of the present disclosure, a user may be monitored performing one or more activities, such activities having been determined to be indicative of existence or progression of disease. Thus, by monitoring a user in accordance with a visual record, or other non-visual record, one or more characteristics may be monitored to determine whether a particular user may be diagnosed with a disease, or whether a user with a particular disease has one or more characteristics that are progressing over time, indicative of a progression of disease.
Such monitoring may take place in an active or passive monitoring situation. In an active monitoring situation, as is shown in
In an alternative embodiment, a user may be asked to focus on a particular marker on the display at step 1710, and a second marker may be provided on the display, and the ability for the user to continue to focus on the initial marker may be measured at step 1730. This feature thus measures ability to maintain focus, and how easily the user is distracted. Again, monitoring over time allows for a determination of progression of disease at step 1740.
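By way of illustration (Python; the sampling scheme is hypothetical), the active gaze tests above reduce to comparing the displayed marker path with the measured gaze path, and watching the per-session error over time:

```python
import numpy as np

def tracking_error(marker_path: np.ndarray, gaze_path: np.ndarray) -> float:
    """Mean distance between the on-screen marker and the measured
    gaze, both given as (T, 2) screen coordinates sampled at the same
    instants."""
    return float(np.linalg.norm(marker_path - gaze_path, axis=1).mean())

def progression(session_errors) -> float:
    """Linear trend (error units per session) across repeated tests;
    a rising slope may indicate progression of disease."""
    x = np.arange(len(session_errors))
    return float(np.polyfit(x, session_errors, 1)[0])
```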
In a still further embodiment, the user may be monitored without providing a particular stimulus on the display. For example, the ability for a user to consistently read text on a screen, or otherwise focus on items in the world around them may further be monitored in a passive manner.
In yet another embodiment, an augmented reality or virtual reality scene may be presented to a user, and a response to the presented information may be monitored. Such an augmented or virtual reality schema may be employed with either an active or passive monitoring situation, and may preferably be employed when encouraging a user to follow one or more steps associated with monitoring and encouraging proper medication administration (see one or more of U.S. Pat. Nos. 8,731,856, 8,731,961, 8,666,781, 9,454,645, and 9,183,601, previously incorporated herein by reference). The augmented reality system may similarly be employed to provide one or more instructions or other guidance as noted in these applications incorporated by reference, and may further, after analysis of collected visual or other data, be used to provide additional instructions to the user to properly perform one or more desired sequences, such as properly administering medication, in response to determination of an error. The system may be applicable to oral, inhalable, injectable, or any other method of medication administration, or other actions associated with health or health status.
Movement may also be tracked, and similarly may be tracked both actively and passively. Active monitoring may ask the user to perform a predetermined sequence of steps, and confirm proper performance, and changes in ability of performance over time. Similarly, passive monitoring may track gait changes over time, ability to stay seated without fidgeting, etc. The monitoring of either of these movement issues may be employed to diagnose or monitor progression of disease.
Additionally, making reference to U.S. patent application Ser. No. 13/189,518, previously incorporated herein by reference, output from any of the schema for presenting and monitoring a user may be input into a system for determining current and potential future medication adherence status of a user. Therefore, any of the features of the present disclosure may be employed as inputs into the system as described in the '518 application, allowing for a more robust set of inputs to drive determinations of adherence and health status, and to drive potential interventions, monitoring and the like of any particular user.
The following additional features, parameters or characteristics may be actively tracked in accordance with one or more embodiments of the present disclosure: observing reactions while performing a task; tracking a point; looking at pictures of emotional faces; coordination tests; balance tests; reaction speed tests; reflex/reflex suppression tests; memory tests; keeping appointments (engaging with the phone at a designated time); etc.
The following additional features, parameters or characteristics may be passively tracked in accordance with one or more embodiments of the present disclosure: observing reactions while the user is engaged in other activities (gaze; action units; head motion/pose; pupil dilation; etc.).
In a further embodiment of the present disclosure, when administering or performing a therapeutic or recovery exercise, the following additional features, parameters or characteristics may be actively tracked in accordance with one or more embodiments of the present disclosure: tracking motion; concentration; memory; ability to focus one's eyes (e.g., after eye surgery, to monitor the healing timeline). Remote therapy may also be incorporated when administering or performing a remote therapeutic or recovery exercise, or during passive monitoring of patients remotely. When administering therapeutic/recovery exercises with remote guidance, the system may track markers while the user is conversing, and may track symptoms such as tremors; involuntary motions; jaundice; temperature; heart rate; and BMI. Recovery tracking may also review one or more of skin/cosmetic surgery, eye surgery (pupil size, etc.), hair implants, etc. Of course, other characteristics may also be monitored to the extent that these characteristics are determined to be indicative of diagnosis or progression of disease.
Furthermore, to the extent any such relationship between a measured characteristic and disease has not yet been defined, in accordance with an alternative embodiment of the present disclosure, collected data may be processed to determine any such relationships to allow for use in the future. Different demographic groups may be employed to determine characteristics in these particular demographic groups, and thus allow for targeted diagnosis and monitoring of disease. The use of supervised or unsupervised learning techniques may be employed to analyze such data to determine any applicable relationships.
Referring next to
By dividing populations by demographics, disease state, or other mechanism, it is also desirable to provide input into the system as to one or more medical or other conditions that may be more likely or more dangerous in the identified population. While older individuals may be more likely to suffer a stroke, for example, younger individuals may be more prone to overuse injuries, or other disease states. By providing such information as an input to the system, a reduced set of disease states or other changes in parameters may be examined, thus improving the accuracy, repeatability and precision of the inventive system. Desired sensors may be employed in order to watch for a particular disease state, or other indication of disease of a user.
Any such readings determined in accordance with the inventive system may also be correlated with one or more known, validated scales or question sets. Correlations between responses to the scales and feature determination may be provided, resulting in yet additional predictive analytics, and less reliance on the actual scales going forward. Thus, in accordance with
Once various feature sets have been determined to have a predictive capability (for predicting, for example, existence of a particular disease), one or more data masks may be generated to allow for a review of incoming future data to similarly determine existence of that same particular disease. Such a data mask may comprise one or more feature values indicative of a state of an individual. Thus, for example, rapid weight gain, an increase in blood pressure, and a change in skin color may be defined as indicative of the progression of a particular disease. By providing such a mask, a simple diagnostic tool may be provided that allows for an immediate and automated determination of potential medical conditions of an individual, and may further indicate a preferred path for addressing these conditions. For example, the individual may be given instruction to rest, to drink fluids, to eat, to call their doctor, to take medication, or to go immediately to an emergency room. The system may further be provided with adjunct systems for performing some of these functions, such as calling a family member, or dialing 911 and providing GPS information for the location of the individual, and may also provide an initial diagnostic analysis of the health of the individual to those responding to the alert, allowing quicker response and triage on site, or when the individual reaches the hospital.
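Such a data mask might be realized, purely illustratively, as a set of named feature thresholds (Python; the feature names and limits are hypothetical):

```python
# Hypothetical feature names and thresholds, for illustration only.
DISEASE_MASK = {
    "weight_gain_kg_per_week": lambda v: v > 1.5,
    "systolic_bp_change_mmhg": lambda v: v > 10.0,
    "skin_tone_delta": lambda v: v > 0.2,
}

def apply_mask(features: dict, mask: dict = DISEASE_MASK) -> bool:
    """A measurement set matches the mask when every masked feature is
    present and crosses its threshold, indicating possible progression
    of the target disease and triggering the follow-up paths above."""
    return all(name in features and test(features[name])
               for name, test in mask.items())

sample = {"weight_gain_kg_per_week": 2.1,
          "systolic_bp_change_mmhg": 14.0,
          "skin_tone_delta": 0.3}
print(apply_mask(sample))  # True: all three indicators crossed
```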
By way of example only, understanding the artery, capillary and vein structure of the body may allow for a determination of changes therein, in order to allow for visual measurement of heart rate, temperature, or other bodily measurements, as will be described below with respect to
As noted by the inventors of the present disclosure, and as a further example, by understanding the artery, capillary and vein structure of the body, changes in any such structure may be more easily determined. By knowing where to look (for example, at a capillary at the temple of a user), it is easier to determine changes in that capillary, as the system can be properly focused and need not scan the entire body to determine where to look. In order to perform such a task, the inventors of the present disclosure have determined that it is possible to overlay a data mask indicative of the locations of, for example, arteries and veins on a typical face, on an image of an individual. As is shown in
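By way of illustration only, a minimal sketch, assuming NumPy, of confining measurement to such a pre-identified region; the frame, the mask location, and the use of the green channel as a pulse-sensitive signal are hypothetical choices.

```python
import numpy as np

def region_mean_intensity(frame: np.ndarray, region_mask: np.ndarray) -> float:
    """Mean green-channel intensity inside the masked region; subtle periodic
    variation of this value across successive frames can reflect pulse."""
    green = frame[:, :, 1].astype(float)
    return float(green[region_mask].mean())

# Hypothetical 480x640 frame and a mask marking where a capillary typically
# sits (e.g. the temple) on a typical face registered to this individual.
frame = np.random.randint(0, 256, size=(480, 640, 3), dtype=np.uint8)
temple_mask = np.zeros((480, 640), dtype=bool)
temple_mask[100:140, 420:470] = True

print(region_mean_intensity(frame, temple_mask))
```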
In accordance with a preferred embodiment of the present disclosure, one or more particular facial or other features may be captured from the images in question. As is shown in
The layers of
Therefore, in accordance with various embodiments of the present disclosure, layer data may be extracted, analyzed, and evaluated to determine whether it is helpful in a particular situation. Additionally, if one is looking for a particular type of information, the desired layers for analysis may be pre-identified. Any of this information may be reviewed over time to determine, for example, weight gain or loss, blood pressure or pulse change, dehydration, frostbite, hypothermia, or any other measurable quantity. In addition, one or more of the visual layers may take advantage of high speed photography, allowing for the collection of information at up to 400 frames per second or higher, for example, to allow further analysis of very subtle changes in skin movement, color, or other micro gestures that may be of interest. Images may be considered in chronological order, or may be considered simultaneously, relying on commonality of structure rather than time. Such processing preferably groups incoming images along one or more dimensions other than chronological. In such a manner, images may be grouped in a manner providing the most information for determining the existence of a particular characteristic. Such a characteristic may be indicative of the existence or progression of disease, or may be relevant to any other measurable quantity or quality of disease or other characteristic. By grouping incoming images along such non-chronological dimensions, as more images are collected, the number of images in each grouping may be increased, thus improving the precision of any processes dependent upon these groupings. Any of the layers shown in
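By way of illustration only, a minimal sketch, assuming scikit-learn, of grouping frames along a non-chronological dimension; the per-frame descriptors (e.g. head pose, illumination) are hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
frame_features = rng.normal(size=(200, 4))  # hypothetical per-frame descriptors

# Group frames by similarity of structure rather than by timestamp; each
# cluster accumulates comparable images as more are collected, improving the
# precision of per-group statistics.
group_ids = KMeans(n_clusters=5, n_init=10, random_state=0).fit_predict(frame_features)

for g in range(5):
    print(f"group {g}: {np.sum(group_ids == g)} frames")
```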
Therefore, further in accordance with a preferred embodiment of the present disclosure, the one or more layers (either visual or having a temporal dimension, such as layers working with various non-visual sensors, or even metadata) may be examined to determine whether one or more features thereof are relevant to a particular set of images. Once determined to contain important information, these features, within a single layer or across multiple layers, may be grouped and selected as a subset of keypoints or other information, in the fashion noted above, in order to select a subset of the possible information, allowing for storage and processing of the data in the future, but preferably in a lossy manner in order to protect identity and reduce the ability to re-identify an image. Of course, this process may also be employed even if an image is to be stored in a lossless manner, or in a lossy manner that still allows for re-identification of information stored therein. In addition to a formal visual layer, it is also possible to further process additional types of electromagnetic radiation, such as ultraviolet, infrared, etc. In such a manner, it is possible to extract features that may not be visible under electromagnetic radiation in the visual portion of the spectrum, but may be more prominent once other areas of the spectrum are used.
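By way of illustration only, a minimal sketch of such subset selection across layers; the layer names, the relevance score, and the retained count k are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Keypoint:
    layer: str        # e.g. "visual", "infrared", "pose"
    x: float
    y: float
    relevance: float  # task-driven importance score for this feature

def select_subset(keypoints: list[Keypoint], k: int) -> list[Keypoint]:
    """Keep only the k most relevant keypoints across all layers; storing this
    subset, rather than the raw images, yields the lossy record described above."""
    return sorted(keypoints, key=lambda p: p.relevance, reverse=True)[:k]

points = [Keypoint("visual", 10, 20, 0.9),
          Keypoint("pose", 30, 40, 0.2),
          Keypoint("infrared", 12, 22, 0.7)]
stored = select_subset(points, k=2)  # only these are persisted; raw video is not
print(stored)
```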
Referring next to
Such processing may be applied, for example, to images of a user administering pill or film based oral medications, or injectable, inhaler-based, or other non-pill based medications, or performing any other form of patient administration task, in accordance with one or more of the disclosures noted in the above-referenced applications. Therefore, in accordance with an embodiment of the present disclosure, a method and apparatus may be provided for analyzing captured patient image data.
The system may process information at a remote system housing the database of collected information. New images acquired by a camera in a local mobile device may be transmitted to the remote location, where one or more of the above-noted processing techniques, including extraction of keypoint/feature data, may be applied; the results of such analysis may then be provided as feedback to a user in any number of scenarios. Such keypoint/feature processing data may be used to determine one or more features of medication adherence, facial recognition, facial images, or the like based upon differing levels of stored information and keypoint/feature data. Alternatively, as noted above, processing may be performed in advance of new image acquisition to identify a desired number of keypoints/features to be extracted, and then a system for identifying characteristics of a future image may be provided to the local device to further process acquired images. Such a system may be reduced in processing requirements to allow for appropriate processing on the local mobile device. By providing either of these systems in accordance with a medication monitoring process, when monitoring medication administration, various features of the administration process may be extracted.
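By way of illustration only, a minimal sketch of the local-processing variant, in which only extracted keypoint/feature data, and never the raw frame, is prepared for transmission; the extractor and payload format here are hypothetical placeholders.

```python
import json

def extract_keypoints(frame_bytes: bytes, num_points: int) -> list[dict]:
    # Placeholder for the reduced on-device extractor provided in advance by
    # the remote system; a real implementation would detect features in the frame.
    return [{"x": 0.0, "y": 0.0, "score": 1.0} for _ in range(num_points)]

def build_payload(frame_bytes: bytes, num_points: int) -> str:
    """Serialize only the extracted points; the raw image never leaves the device."""
    return json.dumps({"keypoints": extract_keypoints(frame_bytes, num_points)})

print(build_payload(b"\x00" * 100, num_points=3))
```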
A mechanism for determining a number of features to be extracted may be provided to a user, such as a slider, dial, or number selector. Thus, the user may be provided with a visual mechanism for selecting the relationship between security and level of information. For each image or each batch of images, based upon the subject matter thereof, the user is able to select the number of features stored, and thus the tradeoff between security and data availability.
As noted above, in one embodiment of the present disclosure, in a situation where different numbers of features or points are extracted, the smaller set of points is preferably a subset of the larger set of extracted points/features, as described above. Thus, when processing is to proceed, it is possible to combine images having a different number of extracted points/features. Processing may be variable based upon the number of points, or may be based upon the minimum number of points/features in one of the images to be processed, with this same minimum number of points/features being processed from each stored set of points representative of an image.
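By way of illustration only, a minimal sketch, assuming NumPy and that points are stored in a fixed relevance order (so a smaller record is a prefix of a larger one), of comparing two records over their shared minimum subset:

```python
import numpy as np

def compare_records(points_a: np.ndarray, points_b: np.ndarray) -> float:
    """Mean distance over the first min(len(a), len(b)) stored points, i.e.
    the minimum number of points/features shared by the two records."""
    n = min(len(points_a), len(points_b))
    return float(np.linalg.norm(points_a[:n] - points_b[:n], axis=1).mean())

a = np.random.rand(40, 2)     # record stored with 40 points
b = np.random.rand(25, 2)     # record stored with 25 points
print(compare_records(a, b))  # comparison proceeds over the shared 25-point subset
```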
By way of example, if the selected features are representative of the face of an individual (
In U.S. Pat. No. 9,256,776, and U.S. patent application Ser. No. 14/990,389, the contents thereof being incorporated herein by reference, the owners of the present disclosure have described a system that allows for the partial blurring of images to de-identify one or more images in a video sequence. In accordance with the present disclosure, as set forth in
Referring next to
As is shown in
Referring next to
As is further shown in
The de-identified image at step 1125 may further be combined at step 1135 with a public identity image 1140 (or itself a synthesized image) retrieved from a database or other storage location 1145, and provided to a discriminative model element 1150. This element receives the combined image from step 1135 (which may be a real public identity image or a synthesized de-identified image) and outputs a binary label at step 1155 indicating whether the discriminative model element 1150 has determined that the image is a real or a synthesized image. The de-identification model is trained to generate realistic images which fool the discriminative model. The discriminative model is trained to distinguish between the real and synthesized images.
The discriminator performance is then evaluated in accordance with a calculation of a binary classification loss at step 1160. The discriminator output is compared with the known provenance (real or synthesized) of each image presented at step 1135, and settings are modified in a feedback loop, as appropriate, until the system can consistently fool the discriminator.
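By way of illustration only, a minimal sketch, assuming PyTorch, of this adversarial training scheme; the network architectures, image size, and learning rates are hypothetical, and real and synthesized batches are presented separately rather than via an explicit combination step.

```python
import torch
import torch.nn as nn

# Hypothetical de-identification model (image-to-image) and discriminator.
deid_model = nn.Sequential(nn.Conv2d(3, 3, 3, padding=1), nn.Tanh())
discriminator = nn.Sequential(nn.Conv2d(3, 8, 4, stride=2), nn.ReLU(),
                              nn.Flatten(), nn.Linear(8 * 31 * 31, 1))
bce = nn.BCEWithLogitsLoss()  # the binary classification loss of step 1160
opt_g = torch.optim.Adam(deid_model.parameters(), lr=1e-4)
opt_d = torch.optim.Adam(discriminator.parameters(), lr=1e-4)

for _ in range(3):  # toy loop; real training iterates over the stored datasets
    source = torch.rand(8, 3, 64, 64)   # images to be de-identified
    public = torch.rand(8, 3, 64, 64)   # public identity images (element 1140)
    synthesized = deid_model(source)    # de-identified images (step 1125)

    # Discriminator step: label public images as real (1), synthesized as fake (0).
    logits_real = discriminator(public)
    logits_fake = discriminator(synthesized.detach())
    loss_d = (bce(logits_real, torch.ones_like(logits_real)) +
              bce(logits_fake, torch.zeros_like(logits_fake)))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # De-identification step: train the model to fool the discriminator.
    logits_fake = discriminator(deid_model(source))
    loss_g = bce(logits_fake, torch.ones_like(logits_fake))
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```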
Referring next to
As is shown in
Referring next to
A pair of images (image_1 and image_2) with known identities are retrieved from a dataset of identity annotated facial images 1305 at steps 1310 and 1315. At step 1320, a flag is set to indicate whether the images at steps 1310 and 1315 are of the same person or not. Next, at step 1325 a de-identification model is applied to each of the images image_1 and image_2, outputting de-identified representations 1 and 2 at steps 1330 and 1335, respectively. These two de-identified representations 1330, 1335 are fed into a same-not-same model 1340. Same-not-same model 1340 estimates whether representations 1330, 1335 are of images of the same person or not, outputting its decision at step 1345. At binary classification loss step 1350, the output at step 1345 is compared with the label generated at step 1320 to determine whether there is a match (i.e. if the images are determined to be of the same person in step 1345, are they actually of the same person as noted in step 1320). If there is no same-not-same model that performs better than random chance, then it can be determined that the de-identification is perfect.
The same-not-same model and the de-identification model described above "compete" with each other. The de-identification model tries to mask identity (making it harder to determine whether two images depict the same person), while the same-not-same model tries to determine whether the two de-identified representations depict the same person.
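By way of illustration only, a minimal sketch, assuming PyTorch as above, of the same-not-same adversary and its binary classification loss; the embedding and network shapes are hypothetical.

```python
import torch
import torch.nn as nn

# Hypothetical embedding of a de-identified representation into a vector, and
# a same-not-same model operating on a concatenated pair of such vectors.
embed = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 32))
same_not_same = nn.Sequential(nn.Linear(64, 16), nn.ReLU(), nn.Linear(16, 1))
bce = nn.BCEWithLogitsLoss()

rep_1 = embed(torch.rand(8, 3, 64, 64))  # de-identified representation 1 (step 1330)
rep_2 = embed(torch.rand(8, 3, 64, 64))  # de-identified representation 2 (step 1335)
same_flag = torch.randint(0, 2, (8, 1)).float()  # label set at step 1320

logit = same_not_same(torch.cat([rep_1, rep_2], dim=1))  # decision at step 1345
loss = bce(logit, same_flag)  # binary classification loss (step 1350)
# Training the adversary minimizes this loss; training the de-identification
# model drives it back toward chance, the condition for perfect de-identification.
```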
Thus, models may be trained for known tasks 1435 (task_1 . . . task_n) using supervised data, i.e. data which includes annotations for the tasks of interest. The task models are optimized to minimize losses 1445 (loss_1 . . . loss_n), which are appropriate losses for each one of the tasks of interest. A possible strategy to jointly optimize the task models and the de-identification model is block-wise coordinate optimization. The de-identification model parameters may be fixed, and the task models trained to minimize the sum or weighted sum of the individual task losses. Given fixed task models, the parameters of the de-identification model may be adjusted to improve performance on the task losses.
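By way of illustration only, a minimal sketch, assuming PyTorch as above, of one round of block-wise coordinate optimization; the models, data, and use of an unweighted sum of task losses are hypothetical.

```python
import torch
import torch.nn as nn

deid = nn.Conv2d(3, 3, 3, padding=1)  # hypothetical de-identification model
tasks = [nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 1)) for _ in range(2)]
opt_tasks = torch.optim.Adam([p for t in tasks for p in t.parameters()], lr=1e-4)
opt_deid = torch.optim.Adam(deid.parameters(), lr=1e-4)
mse = nn.MSELoss()

x = torch.rand(8, 3, 64, 64)
targets = [torch.rand(8, 1) for _ in tasks]  # annotations for task_1 ... task_n

# Phase 1: de-identification parameters fixed; train the task models to
# minimize the sum of the individual task losses (loss_1 ... loss_n).
with torch.no_grad():
    rep = deid(x)
loss_tasks = sum(mse(t(rep), y) for t, y in zip(tasks, targets))
opt_tasks.zero_grad(); loss_tasks.backward(); opt_tasks.step()

# Phase 2: task models fixed; adjust the de-identification model so its
# output still supports the tasks of interest.
for t in tasks:
    for p in t.parameters():
        p.requires_grad_(False)
rep = deid(x)
loss_deid = sum(mse(t(rep), y) for t, y in zip(tasks, targets))
opt_deid.zero_grad(); loss_deid.backward(); opt_deid.step()
```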
Therefore, in accordance with various embodiments of the present disclosure, a system and method are provided in which, among other results, images of faces can be de-identified in a manner that retains relevant information so that a maximum number of future questions can be properly answered.
Use of the system may be employed in a controlled or natural setting, and therefore may improve patient engagement, as the system may be used at any location. Additionally, the user's environment may be measured in, for example, a recovery situation, to determine whether the user is in the most desirable environment to aid such recovery.
Therefore, in accordance with various embodiments of the present disclosure, a system and method are provided in which, among other results, images of faces can be stored in a manner that retains relevant information so that a maximum number of future questions can be properly answered.
This specification uses the term "configured" in connection with apparatuses, processors, modules, systems and computer program components. For a system of one or more computers to be configured to perform particular operations or actions means that the system has installed on it software, firmware, hardware, or a combination of them that in operation cause the system to perform the operations or actions. For one or more computer programs to be configured to perform particular operations or actions means that the one or more programs include instructions that, when executed by data processing apparatus, cause the apparatus to perform the operations or actions.
Embodiments of the subject matter and the functional operations described in this specification can be implemented in digital electronic circuitry, in tangibly-embodied computer software or firmware, in computer hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them.
Embodiments of the subject matter described in this specification can be implemented as one or more computer programs, i.e., one or more modules of computer program instructions encoded on a tangible non-transitory program carrier for execution by, or to control the operation of, data processing apparatus. Alternatively, or in addition, the program instructions can be encoded on an artificially generated propagated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to suitable receiver apparatus for execution by a data processing apparatus. The computer storage medium can be a machine-readable storage device, a machine-readable storage substrate, a random or serial access memory device, or a combination of one or more of them. The computer storage medium is not, however, a propagated signal.
The term “data processing apparatus” encompasses all kinds of apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, or multiple processors or computers. The apparatus can include special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit). The apparatus can also include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them.
A computer program (which may also be referred to or described as a program, software, a software application, a module, a software module, a script, or code) can be written in any form of programming language, including compiled or interpreted languages, or declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data, e.g., one or more scripts stored in a markup language document, in a single file dedicated to the program in question, or in multiple coordinated files, e.g., files that store one or more modules, sub programs, or portions of code. A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
As used in this specification, an “engine,” or “software engine,” refers to a software implemented input/output system that provides an output that is different from the input. An engine can be an encoded block of functionality, such as a library, a platform, a software development kit (“SDK”), or an object. Each engine can be implemented on any appropriate type of computing device, e.g., servers, mobile phones, tablet computers, notebook computers, music players, e-book readers, laptop or desktop computers, PDAs, smart phones, or other stationary or portable devices, that includes one or more processors and computer readable media. Additionally, two or more of the engines may be implemented on the same computing device, or on different computing devices.
The processes and logic flows described in this specification can be performed by one or more programmable computers executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit). For example, the processes and logic flows can be performed by and apparatus can also be implemented as a graphics processing unit (GPU).
Computers suitable for the execution of a computer program include, by way of example, general or special purpose microprocessors or both, or any other kind of central processing unit. Generally, a central processing unit will receive instructions and data from a read only memory or a random access memory or both. The essential elements of a computer are a central processing unit for performing or executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks. However, a computer need not have such devices. Moreover, a computer can be embedded in another device, e.g., a mobile telephone, a personal digital assistant (PDA), a mobile audio or video player, a game console, a Global Positioning System (GPS) receiver, or a portable storage device, e.g., a universal serial bus (USB) flash drive, to name just a few.
Computer readable media suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
To provide for interaction with a user, embodiments of the subject matter described in this specification can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input. In addition, a computer can interact with a user by sending documents to and receiving documents from a device that is used by the user; for example, by sending web pages to a web browser on a user's client device in response to requests received from the web browser.
Embodiments of the subject matter described in this specification can be implemented in a computing system that includes a back end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back end, middleware, or front end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), e.g., the Internet.
The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any invention or of what may be claimed, but rather as descriptions of features that may be specific to particular embodiments of particular inventions. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or variation of a subcombination.
Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system modules and components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.
Particular embodiments of the subject matter have been described. Other embodiments are within the scope of the following claims. For example, the actions recited in the claims can be performed in a different order and still achieve desirable results. As one example, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous.
It should be noted that any of the above-noted techniques and systems may be provided in combination or individually. Furthermore, the system may be employed in mobile devices, computing devices, and cloud-based storage and processing. Camera images may be acquired by an associated camera, or by an independent camera situated at a remote location. Processing may similarly be provided locally on a mobile device, or remotely at a cloud-based location or other remote location. Additionally, such processing and storage locations may be situated at a similar location, or at remote locations.
Claims
1.-18. (canceled)
19. A method for monitoring a state of an individual, the method comprising:
- obtaining a video record of the individual performing a physical activity;
- de-identifying the video record, wherein de-identifying the video record comprises identifying a subset of points to be extracted from the video record, the subset of points allowing for future analysis of the physical activity, and storing the subset of points without storing the video record to inhibit re-identification of the individual in the video record;
- measuring the physical activity as recorded by the stored subset of points;
- comparing the measured physical activity to an expected physical activity to obtain a comparison result; and
- diagnosing one or more aspects of disease based on the comparison result.
20. The method of claim 19, comprising storing non-visual information associated with the video record and selected from one or more sensor outputs.
21. The method of claim 19, comprising:
- identifying a subset of sensors of a plurality of sensors applicable to recognize a particular disease;
- recording a second physical activity by the subset of sensors; and
- analyzing the recorded second physical activity to confirm a presence or absence of the particular disease.
22. The method of claim 19, comprising:
- recording a second physical activity by a plurality of sensors; and
- analyzing the recorded second physical activity to determine a presence or absence of one or more possible recognized diseases.
23. The method of claim 19, wherein the subset of points comprises facial keypoints.
24. The method of claim 19, wherein the subset of points corresponds to one or more portions of a face of the individual.
25. The method of claim 19, comprising displaying, to a user, an interface by which the user is able to modify a number of points to be included in the subset of points.
26. The method of claim 19, comprising:
- blurring a portion of the video record; and
- storing the blurred portion in association with the stored subset of points.
27. A system for monitoring a state of an individual, comprising:
- a video capture device;
- one or more processors; and
- a memory storing one or more non-transitory, computer-readable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising: obtaining, from the video capture device, a video record of the individual performing a physical activity; de-identifying the video record, wherein de-identifying the video record comprises identifying a subset of points to be extracted from the video record, the subset of points allowing for future analysis of the physical activity, and storing the subset of points without storing the video record to inhibit re-identification of the individual in the video record; measuring the physical activity as recorded by the stored subset of points; comparing the measured physical activity to an expected physical activity to obtain a comparison result; and diagnosing one or more aspects of disease based on the comparison result.
28. The system of claim 27, wherein the operations comprise storing non-visual information associated with the video record and selected from one or more sensor outputs.
29. The system of claim 27, comprising a plurality of sensors, wherein the operations comprise:
- identifying a subset of sensors of the plurality of sensors applicable to recognize a particular disease;
- recording a second physical activity by the subset of sensors; and
- analyzing the recorded second physical activity to confirm a presence or absence of the particular disease.
30. The system of claim 27, comprising a plurality of sensors, wherein the operations comprise:
- recording a second physical activity by the plurality of sensors; and
- analyzing the recorded second physical activity to determine a presence or absence of one or more possible recognized diseases.
31. The system of claim 27, wherein the subset of points comprises facial keypoints.
32. The system of claim 27, wherein the subset of points corresponds to one or more portions of a face of the individual.
33. The system of claim 27, comprising a display, wherein the operations comprise displaying, to a user, by the display, an interface by which the user is able to modify a number of points to be included in the subset of points.
34. The system of claim 27, wherein the operations comprise:
- blurring a portion of the video record; and
- storing the blurred portion in association with the stored subset of points.