Patents by Inventor Tanveer Syeda-Mahmood

Tanveer Syeda-Mahmood has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

FINDING STRUCTURES IN MULTI-DIMENSIONAL SPACES USING IMAGE-GUIDED CLUSTERING

Publication number: 20090175544

Abstract: A data processing system is provided that comprises a processor, a random access memory for storing data and programs for execution by the processor, and computer readable instructions stored in the random access memory for execution by the processor to perform a method for clustering data points in a multidimensional dataset in a multidimensional image space. The method comprises generating a multidimensional image from the multidimensional dataset; generating a pyramid of multidimensional images having varying resolution levels by successively performing a pyramidal sub-sampling of the multidimensional image; identifying data clusters at each resolution level of the pyramid by applying a set of perceptual grouping constraints; and determining levels of a clustering hierarchy by identifying each salient bend in a variation curve of a magnitude of identified data clusters as a function of pyramid resolution level.

Type: Application

Filed: June 20, 2008

Publication date: July 9, 2009

Applicant: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy Lohman
Finding structures in multi-dimensional spaces using image-guided clustering

Patent number: 7558425

Abstract: A data processing system is provided that comprises a processor, a random access memory for storing data and programs for execution by the processor, and computer readable instructions stored in the random access memory for execution by the processor to perform a method for clustering data points in a multidimensional dataset in a multidimensional image space. The method comprises generating a multidimensional image from the multidimensional dataset; generating a pyramid of multidimensional images having varying resolution levels by successively performing a pyramidal sub-sampling of the multidimensional image; identifying data clusters at each resolution level of the pyramid by applying a set of perceptual grouping constraints; and determining levels of a clustering hierarchy by identifying each salient bend in a variation curve of a magnitude of identified data clusters as a function of pyramid resolution level.

Type: Grant

Filed: June 20, 2008

Date of Patent: July 7, 2009

Assignee: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
Data classification by kernel density shape interpolation of clusters

Patent number: 7542954

Abstract: A method for representing a dataset comprises clustering the dataset using an unsupervised, non-parametric clustering method to generate a set of clusters each comprising a set of data points in an image; clustering the data points of each cluster using a supervised, partitional clustering method to partition each cluster into a specified number of sub-clusters; generating a density estimate value of each grid point of a set of grid points sampled from the image at a specified resolution for each sub-cluster using a kernel density function; identifying a maximum density estimate value and a sub-cluster associated with the maximum density estimate value for the grid point; adding each grid point for which the maximum density estimate value exceeds a specified threshold to the sub-cluster associated with the maximum density estimate value; and, for each cluster, merging the sub-clusters of the cluster into a corresponding cluster region in the image.

Type: Grant

Filed: June 30, 2008

Date of Patent: June 2, 2009

Assignee: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
Data classification by kernel density shape interpolation of clusters

Patent number: 7542953

Abstract: A data processing system is provided that comprises a processor, a random access memory for storing data and programs for execution by the processor, and computer readable instructions stored in the random access memory for execution by the processor to perform a method for obtaining a shape interpolated representation of shapes of clusters in an image of a clustered dataset. The method comprises generating a density estimate value of each grid point of a set of grid points sampled from the image at a specified resolution for each cluster using a kernel density function; evaluating the density estimate value of each grid point for each cluster to identify a maximum density estimate value of each grid point and a cluster associated with the maximum density estimate value; and adding each grid point for which the maximum density estimate value exceeds a specified threshold to the associated cluster to form a shape interpolated representation.

Type: Grant

Filed: June 20, 2008

Date of Patent: June 2, 2009

Assignee: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
DATA CLASSIFICATION BY KERNEL DENSITY SHAPE INTERPOLATION OF CLUSTERS

Publication number: 20090132594

Abstract: A data processing system is provided that comprises a processor, a random access memory for storing data and programs for execution by the processor, and computer readable instructions stored in the random access memory for execution by the processor to perform a method for obtaining a shape interpolated representation of shapes of clusters in an image of a clustered dataset. The method comprises generating a density estimate value of each grid point of a set of grid points sampled from the image at a specified resolution for each cluster using a kernel density function; evaluating the density estimate value of each grid point for each cluster to identify a maximum density estimate value of each grid point and a cluster associated with the maximum density estimate value; and adding each grid point for which the maximum density estimate value exceeds a specified threshold to the associated cluster to form a shape interpolated representation.

Type: Application

Filed: June 20, 2008

Publication date: May 21, 2009

Applicant: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
DATA CLASSIFICATION BY KERNEL DENSITY SHAPE INTERPOLATION OF CLUSTERS

Publication number: 20090132568

Abstract: A method for representing a dataset comprises clustering the dataset using an unsupervised, non-parametric clustering method to generate a set of clusters each comprising a set of data points in an image; clustering the data points of each cluster using a supervised, partitional clustering method to partition each cluster into a specified number of sub-clusters; generating a density estimate value of each grid point of a set of grid points sampled from the image at a specified resolution for each sub-cluster using a kernel density function; identifying a maximum density estimate value and a sub-cluster associated with the maximum density estimate value for the grid point; adding each grid point for which the maximum density estimate value exceeds a specified threshold to the sub-cluster associated with the maximum density estimate value; and, for each cluster, merging the sub-clusters of the cluster into a corresponding cluster region in the image.

Type: Application

Filed: June 30, 2008

Publication date: May 21, 2009

Applicant: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
Finding structures in multi-dimensional spaces using image-guided clustering

Patent number: 7519227

Abstract: A method executed on a computer for determining a hierarchical clustering of a multidimensional dataset in a multidimensional image space comprises receiving a pyramid of multidimensional images of the multidimensional dataset in which the images of the pyramid representing a first multidimensional image of the multidimensional dataset at successively lower resolution levels; identifying data clusters at each resolution level of the pyramid by applying a set of perceptual grouping constraints; plotting a variation curve of a magnitude of data clusters identified at each resolution level of the pyramid as a function of resolution level; and generating a clustering hierarchy for the multidimensional dataset by identifying the resolution level at each salient bend in the variation curve as a level of the clustering hierarchy.

Type: Grant

Filed: July 7, 2008

Date of Patent: April 14, 2009

Assignee: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
Method for data classification by kernel density shape interpolation of clusters

Patent number: 7412429

Abstract: A method for obtaining a shape interpolated representation of shapes of one or more clusters in an image of a dataset that has been clustered comprises generating a density estimate value of each grid point of a set of grid points sampled from the image at a specified resolution for each cluster in the image using a kernel density function; evaluating the density estimate value of each grid point for each cluster to identify a maximum density estimate value of each grid point and a cluster associated with the maximum density estimate value of each grid point; and adding each grid point for which the maximum density estimate value exceeds a specified threshold to the cluster associated with the maximum density estimate value for the grid point to form a shape interpolated representation of the one or more clusters.

Type: Grant

Filed: November 15, 2007

Date of Patent: August 12, 2008

Assignee: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
Method and system for finding structures in multi-dimensional spaces using image-guided clustering

Patent number: 7406200

Abstract: A method is provided clustering data points in a multidimensional dataset in a multidimensional image space that comprises generating a multidimensional image from the multidimensional dataset; generating a pyramid of multidimensional images having varying resolution levels by successively performing a pyramidal sub-sampling of the multidimensional image; identifying data clusters at each resolution level of the pyramid by applying a set of perceptual grouping constraints; and determining levels of a clustering hierarchy by identifying each salient bend in a variation curve of a magnitude of identified data clusters as a function of pyramid resolution level.

Type: Grant

Filed: January 8, 2008

Date of Patent: July 29, 2008

Assignee: International Business Machines Corporation

Inventors: Tanveer Syeda-Mahmood, Peter J. Haas, John M. Lake, Guy M. Lohman
Method and apparatus for semantic search of schema repositories

Publication number: 20070185868

Abstract: Mechanisms for searching XML repositories for semantically related schemas from a variety of structured metadata sources, including web services, XSD documents and relational tables, in databases and Internet applications. A search is formulated as a problem of computing a maximum matching in pairwise bipartite graphs formed from query and repository schemas. The edges of such a bipartite graph capture the semantic similarity between corresponding attributes of the schema based on their name and type semantics. Tight upper and lower bounds are also derived on the maximum matching that can be used for fast ranking of matchings whilst still maintaining specified levels of precision and recall. Schema indexing is performed by ‘attribute hashing’, in which matching schemas of a database are found by indexing using query attributes, performing lower bound computations for maximum matching and recording peaks in the resulting histogram of hits.

Type: Application

Filed: February 8, 2006

Publication date: August 9, 2007

Inventors: Mary Roth, Gauri Shah, Tanveer Syeda-Mahmood, Willi Urban, Lingling Yan
Method and system to compose software applications by combining planning with semantic reasoning

Publication number: 20070156622

Abstract: A system and method for composing application services includes an indexing module configured to index words in a request and available application descriptions to create a semantic similarity map. A semantic matcher is configured to determine semantic similarity between concepts/terms in both domain-independent and domain-specific ontologies for the semantic similarity map. A prefiltering module is configured to determine candidate compositions for the request based on the semantic similarity map and the available descriptions. A metric guided composition method is configured to run algorithms to generate a set of alternative compositions by determining which applications can be composed with which others using the semantic similarity map.

Type: Application

Filed: January 5, 2006

Publication date: July 5, 2007

Inventors: Rama Akkiraju, Richard Goodwin, Anca-Andreea Ivan, Biplav Srivastava, Tanveer Syeda-Mahmood
Technique for relationship discovery in schemas using semantic name indexing

Publication number: 20060253476

Abstract: Techniques are provided for semantic matching. A semantic index is created for one or more schemas, wherein each of the one or more schemas includes one or more word attributes, and wherein each of the one or more word attributes includes one or more tokens, wherein the semantic index identifies one or more keys and one or more values for each key, wherein each value specifies one of the one or more schemas, a word attribute from the specified schema, and a token of the specified word attribute, and wherein the specified token is a synonym of the key. For a source word attribute from one of the one or more schemas, the source word attribute is used as a key to index the semantic index to identify one or more matching word attributes.

Type: Application

Filed: May 9, 2005

Publication date: November 9, 2006

Inventors: Mary Roth, Tanveer Syeda-Mahmood, Lingling Yan
Method and apparatus for order-preserving clustering of multi-dimensional data

Publication number: 20060155394

Abstract: A method of clustering ordered data sets, wherein the method comprises forming n-dimensional curvilinear representations from an ordered data set; formulating a n+1-dimensional curvilinear representation from a pair of ordered data sets; computing a similarity of the pair of ordered data sets using a similarity between the n-dimensional curvilinear representations and the n+1-dimensional curvilinear representation; and clustering ordered data sets based on the similarity between the n-dimensional curvilinear representations and the n+1-dimensional curvilinear representation. In the n-dimensional curvilinear representations, a first dimension of space corresponds with a common ordering dimension and the remaining dimension of space corresponds with the ordered data set. The process of computing the similarity comprises comparing a shape of the n+1-dimensional curvilinear representation to a shape of each component n-dimensional curvilinear representation.

Type: Application

Filed: December 16, 2004

Publication date: July 13, 2006

Applicant: International Business Machines Corporation

Inventor: Tanveer Syeda-Mahmood
Automatic composition of services through semantic attribute matching

Publication number: 20060136428

Abstract: A method of automatically matching schemas begins by extracting schemas from sources and targets. Then, source and target attributes are extracted from the schemas. Each source schema will have multiple source attributes and each target schema will also have multiple target attributes. The source attributes and the target attributes are presented as nodes in a bipartite graph. This bipartite graph has edges between nodes that are related to each other. A plurality of similarity scores are defined between each set of related nodes. Each of the similarity scores is based on a different context-specific cue of the attributes that the nodes represent. These context-specific cues can comprise lexical name, semantic name, type, structure, functional mappings, etc. An overall weight is computed for each edge in the bipartite graph by combining the similarity scores of each set of nodes that form an edge.

Type: Application

Filed: December 16, 2004

Publication date: June 22, 2006

Applicant: International Business Machines Corporation

Inventor: Tanveer Syeda-Mahmood
Method, program product and apparatus for discovering functionally similar gene expression profiles

Publication number: 20050027460

Abstract: Genes to be compared are listed by their gene expression profiles and processed with a similar sequences algorithm that is a time and intensity invariant correlation function to obtain a data set of gene expression pairs and a match fraction for each pair. A threshold match fraction is chosen and a null set is created to hold indices of genes accounted for. Genes are then assigned to clusters by match fraction value if they have a match fraction greater than the threshold. Genes are then removed from clusters if they are represented in more than one cluster by removing a first gene from a cluster when another cluster has another gene with a higher match fraction with the first gene.

Type: Application

Filed: July 29, 2003

Publication date: February 3, 2005

Inventors: Bhooshan Kelkar, Tanveer Syeda-Mahmood, Gregor Meyer

prev 1 2 3 4