Patents by Inventor Sudipto Guha

Sudipto Guha has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Outlier detection for streaming data

Patent number: 12174807

Abstract: Random cut trees are generated with respect to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.

Type: Grant

Filed: December 13, 2021

Date of Patent: December 24, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Nina Mishra, Daniel Blick, Sudipto Guha, Okke Joost Schrijvers
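
The technique in the abstract above can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the names `Node`, `build_forest`, `insertion_depth`, and `outlier_score`, the parameter choices, and the score formula `1 / (1 + average depth)` are all hypothetical. The sketch reads "selected based on its value range" as "chosen with probability proportional to its value range", which is one possible interpretation. Each tree is built on a sample of the baseline records, and a new record is scored from where it would be inserted, while the tree itself is only read.

```python
import random


class Node:
    """One node of a random cut tree over a list of equal-length tuples."""

    def __init__(self, points, rng):
        dims = list(range(len(points[0])))
        self.lo = [min(p[d] for p in points) for d in dims]
        self.hi = [max(p[d] for p in points) for d in dims]
        self.left = self.right = None
        ranges = [h - l for l, h in zip(self.lo, self.hi)]
        if len(points) < 2 or sum(ranges) == 0:
            return  # leaf: a single record, or only duplicates
        while True:
            # Assumption: split attribute chosen with probability
            # proportional to its value range, cut drawn uniformly.
            self.dim = rng.choices(dims, weights=ranges)[0]
            self.cut = rng.uniform(self.lo[self.dim], self.hi[self.dim])
            left = [p for p in points if p[self.dim] <= self.cut]
            right = [p for p in points if p[self.dim] > self.cut]
            if left and right:  # retry the rare cut that lands on the edge
                break
        self.left, self.right = Node(left, rng), Node(right, rng)


def build_forest(baseline, n_trees, sample_size, rng):
    """Build each tree from its own random sample of the baseline records."""
    return [Node(rng.sample(baseline, sample_size), rng) for _ in range(n_trees)]


def insertion_depth(node, point, rng, depth=0):
    """Depth at which `point` would be attached; the tree is not modified."""
    if node.left is None:
        return depth
    # Bounding box of the node, extended to include the new point.
    lo = [min(l, x) for l, x in zip(node.lo, point)]
    hi = [max(h, x) for h, x in zip(node.hi, point)]
    ranges = [h - l for l, h in zip(lo, hi)]
    dim = rng.choices(list(range(len(point))), weights=ranges)[0]
    cut = rng.uniform(lo[dim], hi[dim])
    # If the cut separates the point from the node's box, the point
    # would become a sibling of this node.
    if point[dim] <= cut < node.lo[dim] or node.hi[dim] <= cut < point[dim]:
        return depth
    child = node.left if point[node.dim] <= node.cut else node.right
    return insertion_depth(child, point, rng, depth + 1)


def outlier_score(forest, point, rng):
    """Hypothetical score: shallow insertion locations give higher scores."""
    depths = [insertion_depth(tree, point, rng) for tree in forest]
    return 1.0 / (1.0 + sum(depths) / len(depths))
```

In this sketch a record far from the baseline tends to be separated near the root, so its average insertion depth is small and its score is high. A record inside the bulk of the data has to descend further before a cut isolates it.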
Anomaly detection with feedback

Patent number: 11308407

Abstract: Examples of techniques for anomaly detection with feedback are described. One example technique includes receiving a plurality of unlabeled data points from an input stream; performing anomaly detection on a point of the unlabeled data points using an anomaly detection engine; pre-processing the unlabeled data point that was subjected to anomaly detection; classifying the pre-processed unlabeled data point; determining that the anomaly detection was not proper based on a comparison of a result of the anomaly detection and a result of the classifying of the pre-processed unlabeled data point; and, in response to determining that the anomaly detection was not proper, providing feedback to the anomaly detection engine to change at least one emphasis used in anomaly detection.

Type: Grant

Filed: December 14, 2017

Date of Patent: April 19, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Sudipto Guha, Tal Wagner, Shiva Prasad Kasiviswanathan, Nina Mishra
OUTLIER DETECTION FOR STREAMING DATA

Publication number: 20220100721

Abstract: Random cut trees are generated with respect to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.

Type: Application

Filed: December 13, 2021

Publication date: March 31, 2022

Applicant: Amazon Technologies, Inc.

Inventors: Nina Mishra, Daniel Blick, Sudipto Guha, Okke Joost Schrijvers
Outlier detection for streaming data

Patent number: 11232085

Abstract: Random cut trees are generated with respect to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.

Type: Grant

Filed: January 7, 2016

Date of Patent: January 25, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Nina Mishra, Daniel Blick, Sudipto Guha, Okke Joost Schrijvers
Anomaly detection in streaming graphs

Patent number: 11003717

Abstract: Techniques for detecting anomalies in streaming graph data are described. For example, an embedding technique of generating a multi-dimensional vector of summations of each weighted edge found in both a random source bounding proper subset and a random destination bounding proper subset associated with a dimension of the epoch graph is detailed. Anomaly detection is performed on the generated multi-dimensional vectors.

Type: Grant

Filed: February 8, 2018

Date of Patent: May 11, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Dhivya Eswaran, Sudipto Guha, Nina Mishra
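
The embedding step in the abstract above can be sketched in a few lines of Python. This is a hedged illustration only: the function names `make_subsets` and `embed_epoch`, the `(source, destination, weight)` edge tuples, and the choice to draw each subset as a random half of the node list are assumptions made for the example, not details taken from the patent. For every dimension the sketch draws one random source subset and one random destination subset, then sums the weights of the epoch's edges whose source and destination both fall in those subsets.

```python
import random


def make_subsets(nodes, n_dims, rng):
    """Draw one (source subset, destination subset) pair per dimension.

    `nodes` must be a list. Sampling half of it (an assumption) yields a
    proper subset whenever there are at least two nodes.
    """
    half = max(1, len(nodes) // 2)
    return [
        (set(rng.sample(nodes, half)), set(rng.sample(nodes, half)))
        for _ in range(n_dims)
    ]


def embed_epoch(edges, subsets):
    """Map one epoch graph to a vector with one coordinate per dimension.

    `edges` is a list of (source, destination, weight) tuples. Coordinate d
    is the total weight of edges whose source is in the d-th source subset
    and whose destination is in the d-th destination subset.
    """
    return [
        sum(w for u, v, w in edges if u in src and v in dst)
        for src, dst in subsets
    ]
```

Reusing the same subsets for every epoch keeps the vectors comparable over time, so any vector-based anomaly detector could then be run on the sequence of embeddings.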
Anomaly detection with missing values and forecasting data streams

Patent number: 10972491

Abstract: Techniques for seasonality-based anomaly detection and forecast are described. For example, a method of receiving a request to generate a forecast for received time series data; performing a seasonality-based anomaly detection and forecast for the received time series data based upon the received request, the seasonality-based anomaly detection and forecasting to utilize a second data structure that reflects anomalies found in a first data structure on the input from the received time series data; and providing a result of the performed seasonality-based anomaly detection and forecast is described.

Type: Grant

Filed: May 11, 2018

Date of Patent: April 6, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Sudipto Guha, Santosh Kalki, Akshay Satish
Artificial intelligence system providing dimension-level anomaly score attributions for streaming data

Patent number: 10902062

Abstract: At an artificial intelligence system, a random cut tree corresponding to a sample of a multi-dimensional data set is traversed to determine a tree-specific vector indicating respective contributions of individual dimensions to an anomaly score of a particular data point. Level-specific vectors of per-dimension contributions obtained using bounding-box analyses at each level during the traversal are aggregated to obtain the tree-specific vector. An overall anomaly score contribution for at least one dimension is obtained using respective tree-specific vectors generated from one or more random cut trees, and an indication of the overall anomaly score contribution is provided.

Type: Grant

Filed: August 24, 2017

Date of Patent: January 26, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Sudipto Guha, Nina Mishra
CHIP ASSEMBLIES EMPLOYING SOLDER BONDS TO BACK-SIDE LANDS INCLUDING AN ELECTROLYTIC NICKEL LAYER

Publication number: 20190237391

Abstract: A stacked-chip assembly including a plurality of IC chips or die that are stacked, and electrically coupled by solder bonds. In accordance with some embodiments described further below, the solder bonds are to contact a back-side land that includes a diffusion barrier to reduce intermetallic formation and/or other solder-induced reliability issues. The back-side land may include an electrolytic nickel (Ni) barrier layer separating solder from a back-side redistribution layer trace. This electrolytic Ni may be of high purity, which, at least in part, may enable the backside metallization stack to be of minimal thickness while still functioning as a diffusion barrier. In some embodiments, the back-side land composition and architecture is distinct from a front-side land composition and/or architecture.

Type: Application

Filed: October 27, 2016

Publication date: August 1, 2019

Applicant: Intel Corporation

Inventors: Seshu V. SATTIRAJU, Krishna Prakash GANESAN, Ashish BHATIA, Vinay SRIRAM, John MUIRHEAD, Hiten KOTHARI, Aloysius A. GUNAWAN, Lavanya ARYASOMAYAJULA, Shravan GOWRISHANKAR, Sriram PATTABHIRAMAN, Sudipto GUHA
OUTLIER DETECTION FOR STREAMING DATA

Publication number: 20170199902

Abstract: Random cut trees are generated with respect to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.

Type: Application

Filed: January 7, 2016

Publication date: July 13, 2017

Applicant: Amazon Technologies, Inc.

Inventors: NINA MISHRA, DANIEL BLICK, SUDIPTO GUHA, OKKE JOOST SCHRIJVERS
Apparatus and method for correlating synchronous and asynchronous data streams

Patent number: 8131792

Abstract: Certain exemplary embodiments provide a method comprising: automatically: receiving a plurality of elements for each of a plurality of continuous data streams; treating the plurality of elements as a first data stream matrix that defines a first dimensionality; reducing the first dimensionality of the first data stream matrix to obtain a second data stream matrix; computing a singular value decomposition of the second data stream matrix; and based on the singular value decomposition of the second data stream matrix, quantifying approximate linear correlations between the plurality of elements.

Type: Grant

Filed: May 23, 2008

Date of Patent: March 6, 2012

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Nikolaos Koudas, Sudipto Guha
Method and apparatus for using histograms to produce data summaries

Patent number: 7965643

Abstract: A system and method are provided for summarizing dynamic data from distributed sources through the use of histograms. In particular, the method comprises receiving a first data signal at a first location, determining a first array sketch of the first data signal, and constructing a first output histogram from the first array sketch and a first robust histogram via a first hybrid histogram. Array sketches of a number of data signals may be calculated, and added to yield a single vector sum. The histogram is constructed from the vector sum. In that way, the vector sum may be analyzed without revealing the individual data signals that form the basis of the sum.

Type: Grant

Filed: July 10, 2008

Date of Patent: June 21, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Anna C. Gilbert, Sudipto Guha, Piotr Indyk, Ioannis Kotidis, Shanmugavelayutham Muthukrishnan, Martin J. Strauss
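
The abstract above relies on sketches that can be added together so that only the sum needs to be analyzed. The Python sketch below illustrates that additivity with a generic linear sketch built from random +1/-1 projections. This is an assumption chosen for illustration, not the patent's array sketch, and the names `make_signs`, `sketch`, and `add_sketches` are hypothetical. Histogram construction from the summed sketch is not shown.

```python
import random


def make_signs(signal_length, sketch_size, seed):
    """Shared random +1/-1 rows; every location must use the same seed."""
    rng = random.Random(seed)
    return [
        [rng.choice((-1, 1)) for _ in range(signal_length)]
        for _ in range(sketch_size)
    ]


def sketch(signal, signs):
    """Linear sketch: one signed sum of the signal per row of `signs`."""
    return [sum(s * x for s, x in zip(row, signal)) for row in signs]


def add_sketches(sketches):
    """Coordinate-wise sum of sketches computed at different locations."""
    return [sum(values) for values in zip(*sketches)]
```

Because the sketch is linear, the sum of the per-location sketches equals the sketch of the summed signal. An aggregator can therefore work from the vector sum without seeing any individual signal.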
Method and apparatus for optimizing queries under parametric aggregation constraints

Patent number: 7904458

Abstract: The present invention relates to a method and apparatus for optimizing queries. The present invention discloses an efficient method for providing answers to queries under parametric aggregation constraints.

Type: Grant

Filed: December 26, 2009

Date of Patent: March 8, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Nikolaos Koudas, Divesh Srivastava, Sudipto Guha, Dimitrios Gunopulos, Michail Vlachos
METHOD AND APPARATUS FOR OPTIMIZING QUERIES UNDER PARAMETRIC AGGREGATION CONSTRAINTS

Publication number: 20100100538

Abstract: The present invention relates to a method and apparatus for optimizing queries. The present invention discloses an efficient method for providing answers to queries under parametric aggregation constraints.

Type: Application

Filed: December 26, 2009

Publication date: April 22, 2010

Inventors: NIKOLAOS KOUDAS, Divesh Srivastava, Sudipto Guha, Dimitrios Gunopulos, Michail Vlachos
Method and apparatus for optimizing queries under parametric aggregation constraints

Patent number: 7668801

Abstract: The present invention relates to a method and apparatus for optimizing queries. The present invention discloses an efficient method for providing answers to queries under parametric aggregation constraints.

Type: Grant

Filed: April 21, 2004

Date of Patent: February 23, 2010

Assignee: AT&T Corp.

Inventors: Nikolaos Koudas, Divesh Srivastava, Sudipto Guha, Dimitrios Gunopulos, Michail Vlachos
Apparatus and method for correlating synchronous and asynchronous data streams

Patent number: 7437397

Abstract: Certain exemplary embodiments provide a method comprising: automatically: receiving a plurality of elements for each of a plurality of continuous data streams; treating the plurality of elements as a first data stream matrix that defines a first dimensionality; reducing the first dimensionality of the first data stream matrix to obtain a second data stream matrix; computing a singular value decomposition of the second data stream matrix; and based on the singular value decomposition of the second data stream matrix, quantifying approximate linear correlations between the plurality of elements.

Type: Grant

Filed: April 12, 2004

Date of Patent: October 14, 2008

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Nikolaos Koudas, Sudipto Guha
Apparatus and method for merging results of approximate matching operations

Patent number: 7415461

Abstract: A device and a method are provided. Approximate match operations are performed for each of a group of attributes for each of a group of tuples with respect to a query to create a respective ranking for each of the group of attributes. The rankings of the group of attributes are combined to provide a ranking score for each of the group of tuples. Data representing a ranking score of each of the group of tuples is generated according to a position of a respective ranking of each one of the group of tuples for a first k positions of the ranking. K of top ranked ones of the group of tuples are identified based at least in part on the generated data, wherein a number of the group of tuples is n and k<n.

Type: Grant

Filed: August 3, 2005

Date of Patent: August 19, 2008

Assignee: AT&T Corp.

Inventors: Sudipto Guha, Nikolaos Koudas, Amit Marathe, Divesh Srivastava
METHOD AND APPARATUS FOR OPTIMIZING QUERIES UNDER PARAMETRIC AGGREGATION CONSTRAINTS

Publication number: 20080052268

Abstract: The present invention relates to a method and apparatus for optimizing queries. The present invention discloses an efficient method for providing answers to queries under parametric aggregation constraints.

Type: Application

Filed: October 29, 2007

Publication date: February 28, 2008

Inventors: NIKOLAOS KOUDAS, Divesh Srivastava, Sudipto Guha, Dimitrios Gunopulos, Michail Vlachos
Method and apparatus for using histograms to produce data summaries

Patent number: 7177282

Abstract: A system and method are provided for monitoring dynamic data from distributed sources through the use of histograms. In the method, an array sketch of the digital signal is determined, a robust histogram is constructed from the array sketch, and an output histogram is constructed from the array sketch and the robust histogram via a hybrid histogram. Dyadic intervals of a representation of the array sketch are used in constructing the robust histogram.

Type: Grant

Filed: April 2, 2002

Date of Patent: February 13, 2007

Assignee: AT&T Corp.

Inventors: Anna C. Gilbert, Sudipto Guha, Piotr Indyk, Ioannis Kotidis, Shanmugavelayutham Muthukrishnan, Martin J. Strauss
Computer implemented scalable, incremental and parallel clustering based on weighted divide and conquer

Patent number: 6907380

Abstract: A technique that uses a weighted divide and conquer approach for clustering a set S of n data points to find k final centers. The technique comprises 1) partitioning the set S into P disjoint pieces S1, . . . , Sp; 2) for each piece Si, determining a set Di of k intermediate centers; 3) assigning each data point in each piece Si to the nearest one of the k intermediate centers; 4) weighting each of the k intermediate centers in each set Di by the number of points in the corresponding piece Si assigned to that center; and 5) clustering the weighted intermediate centers together to find said k final centers, the clustering performed using a specific error metric and a clustering method A.

Type: Grant

Filed: December 1, 2003

Date of Patent: June 14, 2005

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Nina Mishra, Liadan O'Callaghan, Sudipto Guha, Rajeev Motwani
Computer implemented scalable, incremental and parallel clustering based on weighted divide and conquer

Publication number: 20040122797

Abstract: A technique that uses a weighted divide and conquer approach for clustering a set S of n data points to find k final centers. The technique comprises 1) partitioning the set S into P disjoint pieces S1, . . . , Sp; 2) for each piece Si, determining a set Di of k intermediate centers; 3) assigning each data point in each piece Si to the nearest one of the k intermediate centers; 4) weighting each of the k intermediate centers in each set Di by the number of points in the corresponding piece Si assigned to that center; and 5) clustering the weighted intermediate centers together to find said k final centers, the clustering performed using a specific error metric and a clustering method A.

Type: Application

Filed: December 1, 2003

Publication date: June 24, 2004

Inventors: Nina Mishra, Liadan O'Callaghan, Sudipto Guha, Rajeev Motwani
