Patents by Inventor Philip S. Yu
Philip S. Yu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9237192Abstract: One embodiment of the present method and apparatus adaptive in-operator load shedding includes receiving at least two data streams (each comprising a plurality of tuples, or data items) into respective sliding windows of memory. A throttling fraction is then calculated based on input rates associated with the data streams and on currently available processing resources. Tuples are then selected for processing from the data streams in accordance with the throttling fraction, where the selected tuples represent a subset of all tuples contained within the sliding window.Type: GrantFiled: May 17, 2013Date of Patent: January 12, 2016Assignee: International Business Machines CorporationInventors: Bugra Gedik, Kun-Lung Wu, Philip S. Yu
-
Publication number: 20150262185Abstract: Various embodiments for maintaining security and confidentiality of data and operations within a fraud detection system. Each of these embodiments utilizes a secure architecture in which: (1) access to data is limited to only approved or authorized entities; (2) confidential details in received data can be readily identified and concealed; and (3) confidential details that have become non-confidential can be identified and exposed.Type: ApplicationFiled: May 13, 2015Publication date: September 17, 2015Inventors: Naoki Abe, Carl E. Abrams, Chidanand V. Apte, Bishwaranjan Bhattacharjee, Kenneth A. Goldman, Matthias Gruetzner, Matthew A. Hilbert, John Langford, Sriram K. Padmanabhan, Charles P. Tresser, Kathleen M. Troidle, Philip S. Yu
-
Patent number: 9064364Abstract: Various embodiments for maintaining security and confidentiality of data and operations within a fraud detection system. Each of these embodiments utilizes a secure architecture in which: (1) access to data is limited to only approved or authorized entities; (2) confidential details in received data can be readily identified and concealed; and (3) confidential details that have become non-confidential can be identified and exposed.Type: GrantFiled: October 22, 2003Date of Patent: June 23, 2015Assignee: International Business Machines CorporationInventors: Naoki Abe, Carl E. Abrams, Chidanand V. Apte, Bishwaranjan Bhattacharjee, Kenneth A. Goldman, Matthias Gruetzner, Matthew A. Hilbert, John Langford, Sriram K. Padmanabhan, Charles P. Tresser, Kathleen M. Troidle, Philip S. Yu
-
Patent number: 8891814Abstract: Systems and methods for embedding metadata such as personal patient information within actual medical data signals obtained from a patient are provided wherein two watermarks, a robust watermark and a fragile watermark are embedded in a given medical data signal. The robust watermark includes a binary coded representation of the metadata that is incorporated into the frequency domain of the medical data signal using discrete Fourier transformations and additive embedding. Error correcting code can also be added to the binary representation of the metadata using Hamming coding. A given robust watermark can be incorporated multiple times in the medical data signal. The fragile watermark is added on top of the modified medical signal containing the robust watermark in the spatial domain of the modified medical signal. The fragile watermark utilizes hash function to generate random sequences that are incorporated through the medical data signal.Type: GrantFiled: June 7, 2012Date of Patent: November 18, 2014Assignee: International Business Machines CorporationInventors: Michail Vlachos, Philip S. Yu
-
Publication number: 20130254350Abstract: One embodiment of the present method and apparatus adaptive in-operator load shedding includes receiving at least two data streams (each comprising a plurality of tuples, or data items) into respective sliding windows of memory. A throttling fraction is then calculated based on input rates associated with the data streams and on currently available processing resources. Tuples are then selected for processing from the data streams in accordance with the throttling fraction, where the selected tuples represent a subset of all tuples contained within the sliding window.Type: ApplicationFiled: May 17, 2013Publication date: September 26, 2013Applicant: International Business Machines CorporationInventors: Bugra Gedik, Kun-Lung Wu, Philip S. Yu
-
Patent number: 8494036Abstract: Streaming environments typically dictate incomplete or approximate algorithm execution, in order to cope with sudden surges in the data rate. Such limitations are even more accentuated in mobile environments (such as sensor networks) where computational and memory resources are typically limited. Introduced herein is a novel “resource adaptive” algorithm for spectrum and periodicity estimation on a continuous stream of data. The formulation is based on the derivation of a closed-form incremental computation of the spectrum, augmented by an intelligent load-shedding scheme that can adapt to available CPU resources. Experimentation indicates that the proposed technique can be a viable and resource efficient solution for real-time spectrum estimation.Type: GrantFiled: July 22, 2008Date of Patent: July 23, 2013Assignee: International Business Machines CorporationInventors: Deepak Srinivac Turaga, Michail Vlachos, Philip S. Yu
-
Patent number: 8478875Abstract: One embodiment of the present method and apparatus adaptive in-operator load shedding includes receiving at least two data streams (each comprising a plurality of tuples, or data items) into respective sliding windows of memory. A throttling fraction is then calculated based on input rates associated with the data streams and on currently available processing resources. Tuples are then selected for processing from the data streams in accordance with the throttling fraction, where the selected tuples represent a subset of all tuples contained within the sliding window.Type: GrantFiled: June 30, 2008Date of Patent: July 2, 2013Assignee: International Business Machines CorporationInventors: Bugra Gedik, Kun-Lung Wu, Philip S. Yu
-
Patent number: 8311959Abstract: An object and attributes that describe that object are identified. The attributes are grouped into attribute patterns, and classification classes are identified. For each identified class a sketch table containing a plurality of parallel hash tables is created. For the object to be classified, each attribute pattern is processed using the all of the hash functions for each sketch table, resulting in a plurality of values under each sketch table for a single attribute pattern. The lowest value is selected for each sketch table. The distribution of values across all sketch tables is evaluated for each attribute pattern, producing a discriminatory power for each attribute pattern. Attribute patterns having a discriminatory power above a given threshold are selected and added to the associated sketch table values. The sketch table with the largest overall sum is identified, and the associated class is assigned to the object belonging to the attribute patterns.Type: GrantFiled: February 21, 2012Date of Patent: November 13, 2012Assignee: International Business Machines CorporationInventors: Charu C Aggarwal, Philip S Yu
-
Publication number: 20120281894Abstract: Systems and methods for embedding metadata such as personal patient information within actual medical data signals obtained from a patient are provided wherein two watermarks, a robust watermark and a fragile watermark are embedded in a given medical data signal. The robust watermark includes a binary coded representation of the metadata that is incorporated into the frequency domain of the medical data signal using discrete Fourier transformations and additive embedding. Error correcting code can also be added to the binary representation of the metadata using Hamming coding. A given robust watermark can be incorporated multiple times in the medical data signal. The fragile watermark is added on top of the modified medical signal containing the robust watermark in the spatial domain of the modified medical signal. The fragile watermark utilizes hash function to generate random sequences that are incorporated through the medical data signal.Type: ApplicationFiled: June 7, 2012Publication date: November 8, 2012Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Michail Vlachos, Philip S. Yu
-
Patent number: 8301584Abstract: Disclosed in a method and structure for searching data in databases using an ensemble of models. First the invention performs training. This training orders models within the ensemble in order of prediction accuracy and joins different numbers of models together to form sub-ensembles. The models are joined together in the sub-ensemble in the order of prediction accuracy. Next in the training process, the invention calculates confidence values of each of the sub-ensembles. The confidence is a measure of how closely results form the sub-ensemble will match results from the ensemble. The size of each of the sub-ensembles is variable depending upon the level of confidence, while, to the contrary, the size of the ensemble is fixed. After the training, the invention can make a prediction. First, the invention selects a sub-ensemble that meets a given level of confidence.Type: GrantFiled: December 16, 2003Date of Patent: October 30, 2012Assignee: International Business Machines CorporationInventors: Wei Fan, Haixun Wang, Philip S. Yu
-
Patent number: 8286153Abstract: A system and method are provided for optimizing component composition in a distributed stream-processing environment having a plurality of nodes capable of being associated with one or more of a plurality of stream processing components. The system includes an adaptive composition probing (ACP) module and a hierarchical state manager. The ACP module probes a subset of the plurality of stream processing components to determine the optimal component composition in response to a stream processing request. The hierarchical state manager manages local and global information for use by said ACP module in determining the optimal component composition.Type: GrantFiled: April 2, 2008Date of Patent: October 9, 2012Assignee: International Business Machines CorporationInventors: Xiaohui Gu, Philip S. Yu
-
Patent number: 8229191Abstract: Systems and methods for embedding metadata such as personal patient information within actual medical data signals obtained from a patient are provided wherein two watermarks, a robust watermark and a fragile watermark are embedded in a given medical data signal. The robust watermark includes a binary coded representation of the metadata that is incorporated into the frequency domain of the medical data signal using discrete Fourier transformations and additive embedding. Error correcting code can also be added to the binary representation of the metadata using Hamming coding. A given robust watermark can be incorporated multiple times in the medical data signal. The fragile watermark is added on top of the modified medical signal containing the robust watermark in the spatial domain of the modified medical signal. The fragile watermark utilizes hash function to generate random sequences that are incorporated through the medical data signal.Type: GrantFiled: March 5, 2008Date of Patent: July 24, 2012Assignee: International Business Machines CorporationInventors: Michail Vlachos, Philip S. Yu
-
Publication number: 20120166382Abstract: An object and attributes that describe that object are identified. The attributes are grouped into attribute patterns, and classification classes are identified. For each identified class a sketch table containing a plurality of parallel hash tables is created. For the object to be classified, each attribute pattern is processed using the all of the hash functions for each sketch table, resulting in a plurality of values under each sketch table for a single attribute pattern. The lowest value is selected for each sketch table. The distribution of values across all sketch tables is evaluated for each attribute pattern, producing a discriminatory power for each attribute pattern. Attribute patterns having a discriminatory power above a given threshold are selected and added to the associated sketch table values. The sketch table with the largest overall sum is identified, and the associated class is assigned to the object belonging to the attribute patterns.Type: ApplicationFiled: February 21, 2012Publication date: June 28, 2012Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Charu C. Aggarwal, Philip S. Yu
-
Publication number: 20120124233Abstract: One embodiment of the present method and apparatus adaptive load shedding includes receiving at least one data stream (comprising a plurality of tuples, or data items) into a first sliding window of memory. A subset of tuples from the received data stream is then selected for processing in accordance with at least one data stream operation, such as a data stream join operation. Tuples that are not selected for processing are ignored. The number of tuples selected and the specific tuples selected depend at least in part on a variety of dynamic parameters, including the rate at which the data stream (and any other processed data streams) is received, time delays associated with the received data stream, a direction of a join operation performed on the data stream and the values of the individual tuples with respect to an expected output.Type: ApplicationFiled: January 3, 2012Publication date: May 17, 2012Applicant: International Business Machines CorporationInventors: BUGRA GEDIK, Kun-Lung Wu, Philip S. Yu
-
Patent number: 8165979Abstract: A system and method for resource adaptive classification of data streams. Embodiments of systems and methods provide classifying data received in a computer, including discretizing the received data, constructing an intermediate data structure from said received data as training instances, performing subspace sampling on said received data as test instances and adaptively classifying said received data based on statistics of said subspace sampling.Type: GrantFiled: April 1, 2011Date of Patent: April 24, 2012Assignee: International Business Machines CorporationInventors: Charu C. Aggarwal, Philip S. Yu
-
Patent number: 8140448Abstract: An object and attributes that describe that object are identified. The attributes are grouped into attribute patterns, and classification classes are identified. For each identified class a sketch table containing a plurality of parallel hash tables is created. For the object to be classified, each attribute pattern is processed using the all of the hash functions for each sketch table, resulting in a plurality of values under each sketch table for a single attribute pattern. The lowest value is selected for each sketch table. The distribution of values across all sketch tables is evaluated for each attribute pattern, producing a discriminatory power for each attribute pattern. Attribute patterns having a discriminatory power above a given threshold are selected and added to associated sketch table values. The sketch table with the largest overall sum is identified, and the associated class is assigned to the object belonging to the attribute patterns.Type: GrantFiled: May 9, 2008Date of Patent: March 20, 2012Assignee: International Business Machines CorporationInventors: Charu C Aggarwal, Philip S Yu
-
Patent number: 8117331Abstract: One embodiment of the present method and apparatus adaptive load shedding includes receiving at least one data stream (comprising a plurality of tuples, or data items) into a first sliding window of memory. A subset of tuples from the received data stream is then selected for processing in accordance with at least one data stream operation, such as a data stream join operation. Tuples that are not selected for processing are ignored. The number of tuples selected and the specific tuples selected depend at least in part on a variety of dynamic parameters, including the rate at which the data stream (and any other processed data streams) is received, time delays associated with the received data stream, a direction of a join operation performed on the data stream and the values of the individual tuples with respect to an expected output.Type: GrantFiled: June 30, 2008Date of Patent: February 14, 2012Assignee: International Business Machines CorporationInventors: Bugra Gedik, Kun-Lung Wu, Philip S. Yu
-
Patent number: 8112247Abstract: Introduced herein is a “resource adaptive” algorithm for spectrum and periodicity estimation on a continuous stream of data. The formulation is based on the derivation of a closed-form incremental computation of the spectrum, augmented by a load-shedding scheme that can adapt to available CPU resources to provide a resource efficient solution for real time spectrum estimation.Type: GrantFiled: March 24, 2006Date of Patent: February 7, 2012Assignee: International Business Machines CorporationInventors: Deepak Srinivao Turaga, Michail Vlachos, Philip S. Yu
-
Patent number: 8086550Abstract: Uncertain data is classified by constructing an error adjusted probability density estimate for the data, and applying a subspace exploration process to the probability density estimate to classify the data.Type: GrantFiled: August 28, 2007Date of Patent: December 27, 2011Assignee: International Business Machines CorporationInventors: Charu Aggarwal, Philip S. Yu
-
Patent number: 8060461Abstract: Load shedding schemes for mining data streams. A scoring function is used to rank the importance of stream elements, and those elements with high importance are investigated. In the context of not knowing the exact feature values of a data stream, the use of a Markov model is proposed herein for predicting the feature distribution of a data stream. Based on the predicted feature distribution, one can make classification decisions to maximize the expected benefits. In addition, there is proposed herein the employment of a quality of decision (QoD) metric to measure the level of uncertainty in decisions and to guide load shedding. A load shedding scheme such as presented herein assigns available resources to multiple data streams to maximize the quality of classification decisions. Furthermore, such a load shedding scheme is able to learn and adapt to changing data characteristics in the data streams.Type: GrantFiled: February 17, 2009Date of Patent: November 15, 2011Assignee: International Business Machines CorporationInventors: Yun Chi, Haixun Wang, Philip S. Yu