Patents by Inventor Philip Yu

Philip Yu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20070220219
    Abstract: A method (and system) of storing data in a value-based storage system, includes optimizing a value of data stored in the value-based storage system.
    Type: Application
    Filed: March 16, 2006
    Publication date: September 20, 2007
    Applicant: International Business Machines Corporation
    Inventors: Nikhil Bansal, Frederick Douglis, Lisa Fleischer, Kirsten Hildrum, Akshay Kumar Katta, John Palmer, Elizabeth Richards, David Tao, William Tetzlaff, Joel Wolf, Philip Yu
  • Publication number: 20070214163
    Abstract: There are provided methods, computer program products, and systems for indexing a data stream. A method for indexing a data stream having attribute values includes the steps of parsing the data stream, and forming an index of tuples for a subset of attribute values of the data stream. The index is configured for retrieving the top-K tuples that optimize linearly weighted sums of at least some of the attribute values in the subset.
    Type: Application
    Filed: March 10, 2006
    Publication date: September 13, 2007
    Inventors: Gang Luo, Kun-Lung Wu, Philip Yu
  • Publication number: 20070211703
    Abstract: A system, method, and computer program product for establishing multi-party VoIP conference audio calls in a distributed, peer-to-peer network where any number of nodes are able to arbitrarily and asynchronously start or stop producing audio output to be mixed into a single composite audio stream that is distributed to all nodes. A single distribution tree is used that has optimal communications characteristics to distribute the composite audio signal to all nodes. An audio mixing tree is established and maintained by adaptively and dynamically adding and merging intermediate mixing nodes operating between user nodes and the root of the single distribution tree. The intermediate mixing nodes and the root of the single distribution tree are all hosted, in an exemplary embodiment, on user nodes that are endpoints of the distribution tree.
    Type: Application
    Filed: March 10, 2006
    Publication date: September 13, 2007
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Xiaohui Gu, Zon-Yin Shae, Zhen Wen, Philip Yu
  • Publication number: 20070118539
    Abstract: Techniques for community discovery in a network are disclosed. For example, a technique for discovering a community around a given entity in an interaction graph, wherein nodes in the graph represent entities and edges connecting nodes in the graph represent interactions between connected nodes, comprises the following steps/operations. Nodes in the interaction graph are partitioned into different sets of nodes based on interaction information associated with each node to minimize a number of interaction pairs that need to be considered. An objective function is minimized by moving entities between the different sets such that the community is discovered once a measure associated with the objective function is minimized.
    Type: Application
    Filed: November 18, 2005
    Publication date: May 24, 2007
    Applicant: International Business Machines Corporation
    Inventors: Kirsten Hildrum, Philip Yu
  • Publication number: 20070043565
    Abstract: Systems and methods are provided for real-time classification of streaming data. In particular, systems and methods for real-time classification of continuous data streams implement micro-clustering methods for offline and online processing of training data to build and dynamically update training models that are used for classification, as well as incrementally clustering the data over contiguous segments of a continuous data stream (in real-time) into a plurality of micro-clusters from which target profiles are constructed which define/model the behavior of the data in individual segments of the data stream.
    Type: Application
    Filed: August 22, 2005
    Publication date: February 22, 2007
    Inventors: Charu Aggarwal, Philip Yu
  • Publication number: 20070016560
    Abstract: A computer implemented method, apparatus, and computer usable program code for performing load diffusion to process data stream pairs. A data stream pair is received for correlation. The data stream pair is partitioned into portions to meet correlation constraints for correlating data in the data stream pair to form a partitioned data stream pair. The partitioned data stream pair is sent to a set of nodes for correlation processing to perform the load diffusion.
    Type: Application
    Filed: July 15, 2005
    Publication date: January 18, 2007
    Applicant: International Business Machines Corporation
    Inventors: Xiaohui Gu, Philip Yu
  • Publication number: 20060287984
    Abstract: Range query techniques are disclosed for use in accordance with data stream processing systems. In one aspect of the invention, a technique is provided for indexing continual range queries for use in data stream processing. For example, a technique for use in processing a data stream comprises obtaining at least one range query to be associated with the data stream, and building a range query index based on the at least one range query using one or more virtual constructs such that the query index is adaptive to one or more changes in a distribution of range query sizes. The step/operation of building the range query index may further comprise building the range query index such that the range query index accommodates one or more changes in query positions outside a monitoring area of the at least one range query. In another aspect of the invention, a technique is provided for incrementally processing continual range queries against moving objects.
    Type: Application
    Filed: June 17, 2005
    Publication date: December 21, 2006
    Applicant: International Business Machines Corporation
    Inventors: Shyh-Kwei Chen, Kun-Lung Wu, Philip Yu
  • Publication number: 20060282425
    Abstract: Techniques are disclosed for clustering and classifying stream data. By way of example, a technique for processing a data stream comprises the following steps/operations. A cluster structure representing one or more clusters in the data stream is maintained. A set of projected dimensions is determined for each of the one or more clusters using data points in the cluster structure. Assignments are determined for incoming data points of the data stream to the one or more clusters using distances associated with each set of projected dimensions for each of the one or more clusters. Further, the cluster structure may be used for classification of data in the data stream.
    Type: Application
    Filed: April 20, 2005
    Publication date: December 14, 2006
    Applicant: International Business Machines Corporation
    Inventors: Charu Aggarwal, Philip Yu
  • Publication number: 20060271304
    Abstract: A method which identifies different types of substructures within a graph and encodes them using techniques suitable to the characteristics of each of them. The method is embodied by an efficient two-phase algorithm, where the first phase identifies and encodes strongly connected components as well as tree substructures, and the second phase encodes the remaining reachability relationships by compressing dense rectangular submatrices in the transitive closure matrix.
    Type: Application
    Filed: May 31, 2005
    Publication date: November 30, 2006
    Applicant: IBM Corporation
    Inventors: Hao He, Haixun Wang, Philip Yu
  • Publication number: 20060224562
    Abstract: Techniques for similarity searching are provided. In one aspect, a method of searching structural data in a database against one or more structural queries comprises the following steps. A desired minimum degree of similarity between the one or more queries and the structural data in the database is first specified. One or more indices are then used to exclude from consideration any structural data in the database that does not share the minimum degree of similarity with one or more of the queries.
    Type: Application
    Filed: March 31, 2005
    Publication date: October 5, 2006
    Applicant: International Business Machines Corporation
    Inventors: Xifeng Yan, Philip Yu
  • Publication number: 20060224356
    Abstract: Arrangements and methods for performing structural clustering between different time series. Time series data relating to a plurality of time series is accepted, structural features relating to the time series data are ascertained, and at least one distance between different time series via employing the structural features is determined. The different time series may be partitioned into clusters based on the at least one distance, and/or the k closest matches to a given time series query based on the at least one distance may be returned.
    Type: Application
    Filed: March 31, 2005
    Publication date: October 5, 2006
    Applicant: IBM Corporation
    Inventors: Vittorio Castelli, Michail Vlaschos, Philip Yu
  • Publication number: 20060212337
    Abstract: A method (and system) of assigning a sales opportunity, includes creating an assignment model based on clustering historical sales opportunities, and providing a scoring mechanism on a plurality of sales agents for automatically optimizing an assignment of at least one sales opportunity to at least one of the plurality of sales agents.
    Type: Application
    Filed: March 16, 2005
    Publication date: September 21, 2006
    Applicant: International Business Machines Corporation
    Inventors: Jamshid Vayghan, Philip Yu
  • Publication number: 20060200251
    Abstract: A system and method are provided for optimizing component composition in a distributed stream-processing environment having a plurality of nodes capable of being associated with one or more of a plurality of stream processing components. The system includes an adaptive composition probing (ACP) module and a hierarchical state manager. The ACP module probes a subset of the plurality of stream processing components to determine the optimal component composition in response to a stream processing request. The hierarchical state manager manages local and global information for use by said ACP module in determining the optimal component composition.
    Type: Application
    Filed: March 1, 2005
    Publication date: September 7, 2006
    Inventors: Xiaohui Gu, Philip Yu
  • Publication number: 20060195599
    Abstract: One embodiment of the present method and apparatus adaptive load shedding includes receiving at least one data stream (comprising a plurality of tuples, or data items) into a first sliding window of memory. A subset of tuples from the received data stream is then selected for processing in accordance with at least one data stream operation, such as a data stream join operation. Tuples that are not selected for processing are ignored. The number of tuples selected and the specific tuples selected depend at least in part on a variety of dynamic parameters, including the rate at which the data stream (and any other processed data streams) is received, time delays associated with the received data stream, a direction of a join operation performed on the data stream and the values of the individual tuples with respect to an expected output.
    Type: Application
    Filed: February 28, 2005
    Publication date: August 31, 2006
    Inventors: Bugra Gedik, Kun-Lung Wu, Philip Yu
  • Publication number: 20060190430
    Abstract: Systems and methods are provided for resource adaptive workload management. In a method thereof, at least one execution objective is received for at least one of a plurality of queries under execution. A progress status of, and an amount of resource consumed by, each of the plurality of queries are monitored. A remaining resource requirement for each of the plurality of queries is estimated, based on the progress status of, and the amount of resource consumed by, each of the plurality of queries. Resource allocation is adjusted based on the at least one execution objective and the estimates of the remaining resource requirements.
    Type: Application
    Filed: February 22, 2005
    Publication date: August 24, 2006
    Inventors: Gang Luo, Philip Yu
  • Publication number: 20060184527
    Abstract: Load shedding schemes for mining data streams. A scoring function is used to rank the importance of stream elements, and those elements with high importance are investigated. In the context of not knowing the exact feature values of a data stream, the use of a Markov model is proposed herein for predicting the feature distribution of a data stream. Based on the predicted feature distribution, one can make classification decisions to maximize the expected benefits. In addition, there is proposed herein the employment of a quality of decision (QoD) metric to measure the level of uncertainty in decisions and to guide load shedding. A load shedding scheme such as presented herein assigns available resources to multiple data streams to maximize the quality of classification decisions. Furthermore, such a load shedding scheme is able to learn and adapt to changing data characteristics in the data streams.
    Type: Application
    Filed: February 16, 2005
    Publication date: August 17, 2006
    Applicant: IBM Corporation
    Inventors: Yun Chi, Haixun Wang, Philip Yu
  • Publication number: 20060174024
    Abstract: Towards mining closed frequent itemsets over a sliding window using limited memory space, a synopsis data structure to monitor transactions in the sliding window so that one can output the current closed frequent itemsets at any time. Due to time and memory constraints, the synopsis data structure cannot monitor all possible itemsets, but monitoring only frequent itemsets makes it difficult to detect new itemsets when they become frequent. Herein, there is introduced a compact data structure, the closed enumeration tree (CET), to maintain a dynamically selected set of itemsets over a sliding-window. The selected itemsets include a boundary between closed frequent itemsets and the rest of the itemsets Because the boundary is relatively stable, the cost of mining closed frequent itemsets over a sliding window is dramatically reduced to that of mining transactions that can possibly cause boundary movements in the CET.
    Type: Application
    Filed: January 31, 2005
    Publication date: August 3, 2006
    Applicant: IBM Corporation
    Inventors: Yun Chi, Haixun Wang, Philip Yu
  • Publication number: 20060161575
    Abstract: Sequence-based XML indexing aims at avoiding expensive join operations in query processing. It transforms structured XML data into sequences so that a structured query can be answered holistically through subsequence matching. Herein, there is addresed the problem of query equivalence with respect to this transformation, and thereis introduced a performance-oriented principle for sequencing tree structures. With query equivalence, XML queries can be performed through subsequence matching without join operations, post-processing, or other special handling for problems such as false alarms. There is identified a class of sequencing methods for this purpose, and there is presented a novel subsequence matching algorithm that observe query equivalence. Also introduced is a performance-oriented principle to guide the sequencing of tree structures.
    Type: Application
    Filed: January 14, 2005
    Publication date: July 20, 2006
    Applicant: IBM Corporation
    Inventors: Wei Fan, Haixun Wang, Philip Yu
  • Publication number: 20060132326
    Abstract: An improved universal remote control unit (URC) for controlling electronic appliance units. The URC unit has the typical remote controller module for controlling appliances such as TV, stereo, VCR or DVD. Additionally, the URC has a scratch pad memory for storing telephone numbers and web site information entered through the URC unit's alphanumeric keys. When activated, the key pad entries are stored in the memory, instead of being used to control the appliance. The URC unit further has a digital recorder module that can be implemented with a microphone, a voice recorder chip and a speaker, all integrated with the URC unit. The digital recorder module can even use the battery that is typically used by the URC unit. The URC unit further has a display screen to display the information stored in and recalled from the memory.
    Type: Application
    Filed: December 23, 2005
    Publication date: June 22, 2006
    Inventors: Calvin Fang, Philip Yu
  • Publication number: 20060106666
    Abstract: A method for implementing a multi-stage, multi-classification sales opportunity modeling system. The method includes receiving operational data relating to past sales activities and receiving parameters identified as being relevant in determining a likelihood of whether exploitation of a sales opportunity will be successful. The method also includes generating a multi-stage model by applying the operational data and the parameters to an analytic engine for evaluating different factors affecting success of the sales opportunity.
    Type: Application
    Filed: November 15, 2004
    Publication date: May 18, 2006
    Applicant: International Business Machines Corporation
    Inventors: Jamshid Vayghan, Philip Yu