Patents by Inventor Nina Mishra

Nina Mishra has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

PRIVACY-PRESERVING DATASET SKETCHES THAT CAN BE JOINED NON-INTERACTIVELY

Publication number: 20240005022

Abstract: Computer systems and associated methods are disclosed to implement the non-interactive join of privacy-preserving dataset sketches. In some embodiments, an entity can publish a one-time sketch of their dataset that would enable another entity to join their data without exposing private information. The sketch can map, using a hash function, the identities associated with a first value of the dataset to a data structure, in some embodiments. A same or different entity can join the first sketch with a privacy-preserving second sketch of a second dataset that includes added noise, and can determine an estimate of a number of identities that correspond with specific values of the first and second datasets from the joined dataset. The sketch can be published just one time, and therefore does not require separate new private computations with privacy budgeting for each additional party when a join is desired, in some embodiments.

Type: Application

Filed: June 30, 2022

Publication date: January 4, 2024

Applicant: Amazon Technologies, Inc.

Inventors: James Alexander Cook, Nina Mishra
Determining causal insights

Patent number: 11853912

Abstract: Described are systems and methods for determining causal connections between various metrics collected by wearable devices and using those causal connections to provide causal insights to other users. For example, some users may elect to perform one or more self-experiments to explore the impact certain changes in their behavior may have on metrics measured by the user's wearable device. Causal connections determined from those experiments may be used to provide causal insights relating to those metrics to other users who have not performed the same or similar experiments.

Type: Grant

Filed: January 30, 2020

Date of Patent: December 26, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Shiva Prasad Kasiviswanathan, Nina Mishra, Yonatan Naamad
Systems, methods, and apparatus for hotspot detection

Patent number: 11797572

Abstract: Techniques for hotspot detection in a dataset are described. A hotspot being a region (or a collection of points) where the value of a function of given any region in the space measures the concentration of points in that region is significantly higher than its other regions of the dataspace. As such, a region that has a denser concentration of points than other regions of the dataspace may be considered a hotspot. In some implementations, hotspot detection includes finding two or more regions to evaluate for high-density in the dataset, a high-density region indicating a potential hotspot and extending a size of the manipulated found two or more regions to determine borders for these regions.

Type: Grant

Filed: August 7, 2018

Date of Patent: October 24, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Yonatan Naamad, Shiva Prasad Kasiviswanathan, Nina Mishra, Morteza Monemizadeh, Lauren Anne Moos, Joshua M. Tokle
Anomaly detection with feedback

Patent number: 11308407

Abstract: Examples of techniques for anomaly detection with feedback are described. An instance includes a technique is receiving a plurality of unlabeled data points from an input stream; performing anomaly detection on a point of the unlabeled data points using an anomaly detection engine; pre-processing the unlabeled data point that was subjected to anomaly detection; classifying the pre-processed unlabeled data point; determining the anomaly detection was not proper based on a comparison of a result of the anomaly detection and a result of the classifying of the pre-processed unlabeled data point; and in response to determining the anomaly detection was not proper, providing feedback to the anomaly detection engine to change at least one emphasis used in anomaly detection.

Type: Grant

Filed: December 14, 2017

Date of Patent: April 19, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Sudipto Guha, Tal Wagner, Shiva Prasad Kasiviswanathan, Nina Mishra
OUTLIER DETECTION FOR STREAMING DATA

Publication number: 20220100721

Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.

Type: Application

Filed: December 13, 2021

Publication date: March 31, 2022

Applicant: Amazon Technologies, Inc.

Inventors: Nina Mishra, Daniel Blick, Sudipto Guha, Okke Joost Schrijvers
Outlier detection for streaming data

Patent number: 11232085

Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.

Type: Grant

Filed: January 7, 2016

Date of Patent: January 25, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Nina Mishra, Daniel Blick, Sudipto Guha, Okke Joost Schrijvers
Anomaly detection in streaming graphs

Patent number: 11003717

Abstract: Techniques for detecting anomalies in streaming graph data are described. For example, an embedding technique of generating a multi-dimensional vector of summations of each weighted edge found in both a random source bounding proper subset and a random destination bounding proper subset associated with a dimension of the epoch graph is detailed. Anomaly detection is performed on the generated multi-dimensional vectors.

Type: Grant

Filed: February 8, 2018

Date of Patent: May 11, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Dhivya Eswaran, Sudipto Guha, Nina Mishra
Artificial intelligence system providing dimension-level anomaly score attributions for streaming data

Patent number: 10902062

Abstract: At an artificial intelligence system, a random cut tree corresponding to a sample of a multi-dimensional data set is traversed to determine a tree-specific vector indicating respective contributions of individual dimensions to an anomaly score of a particular data point. Level-specific vectors of per-dimension contributions obtained using bounding-box analyses at each level during the traversal are aggregated to obtain the tree-specific vector. An overall anomaly score contribution for at least one dimension is obtained using respective tree-specific vectors generated from one or more random cut trees, and an indication of the overall anomaly score contribution is provided.

Type: Grant

Filed: August 24, 2017

Date of Patent: January 26, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Sudipto Guha, Nina Mishra
Question answering system

Patent number: 10713289

Abstract: Systems, methods, and devices for performing interactive question answering using data source credibility and conversation entropy are disclosed. A speech-controlled device captures audio including a spoken question, and sends audio data corresponding thereto to a server(s). The server(s) performs speech processing on the audio data, and determines various stored data that can be used to determine an answer to the question. The server(s) determines which stored data to use based on the credibility of the source from which the stored data was received. The server(s) may also determine a number of user interactions needed to obtain data in order to fully answer the question and may select a question for a dialog soliciting further data based on the number of user interactions.

Type: Grant

Filed: March 31, 2017

Date of Patent: July 14, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Nina Mishra, Yonatan Naamad
Ranking relevant discussion groups

Patent number: 10637807

Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.

Type: Grant

Filed: October 12, 2017

Date of Patent: April 28, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Nina Mishra, Abhimanyu Das, Krishnaram Kenthapadi
RANKING RELEVANT DISCUSSION GROUPS

Publication number: 20180034752

Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.

Type: Application

Filed: October 12, 2017

Publication date: February 1, 2018

Inventors: Nina MISHRA, Abhimanyu DAS, Krishnaram KENTHAPADI
Ranking relevant discussion groups

Patent number: 9819618

Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.

Type: Grant

Filed: June 18, 2014

Date of Patent: November 14, 2017

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Nina Mishra, Abhimanyu Das, Krishnaram Kenthapadi
OUTLIER DETECTION FOR STREAMING DATA

Publication number: 20170199902

Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.

Type: Application

Filed: January 7, 2016

Publication date: July 13, 2017

Applicant: Amazon Technologies, Inc.

Inventors: NINA MISHRA, DANIEL BLICK, SUDIPTO GUHA, OKKE JOOST SCHRIJVERS
RANKING RELEVANT DISCUSSION GROUPS

Publication number: 20150370797

Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.

Type: Application

Filed: June 18, 2014

Publication date: December 24, 2015

Inventors: Nina Mishra, Abhimanyu Das, Krishnaram Kenthapadi
Recommending queries according to mapping of query communities

Patent number: 9171045

Abstract: A set of queries, such as a search log, is divided into commercial queries and non-commercial queries. A first set of query communities is determined from the non-commercial queries and a second set is determined from the commercial queries. The query communities are correlated based on the users who submitted the queries and instances where a query from the first set of query communities was followed by a query from the second set to generate a mapping between the first set of query communities and the second set. Later, a non-commercial query is received from a user, and the mapping is used to predict one or more commercial queries that the user is likely to submit in the future based on the non-commercial query. One or more of the commercial queries are presented to the user according to the mapping with search results responsive to the non-commercial query.

Type: Grant

Filed: November 11, 2010

Date of Patent: October 27, 2015

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Nina Mishra, Sreenivas Gollapudi, Srikanth Jagabathula
DISCUSSION SUMMARY

Publication number: 20150120680

Abstract: One or more techniques and/or systems are provided for providing a discussion summary corresponding to a search query and/or for providing discussion session search results. For example, discussion data (e.g., corresponding to real-time messaging, such as a microblog discussion) may be evaluated to identify a discussion topic for a discussion sessions (e.g., a kitchen renovation topic may be assigned to a 1 hour exchange of kitchen renovation messages by a discussion group). A discussion summary of a discussion session may be provided based upon the discussion session having a discussion topic corresponding to a search query topic of a search query. The discussion summary may be provided along with other results for the query and may describe the discussion group, identifiers such as hashtags used by the discussion group, meeting dates/times, average number(s) of participants, other discussion sessions hosted by the discussion group, future discussion sessions, and/or other information.

Type: Application

Filed: October 24, 2013

Publication date: April 30, 2015

Applicant: Microsoft Corporation

Inventors: Omar Alonso, Kartikay Khandelwal, Mohamed Mansour, Paul Ko, Nina Mishra, Krishnaram Kenthapadi, Abhimanyu Das
TOPIC IDENTIFIERS ASSOCIATED WITH GROUP CHATS

Publication number: 20140324982

Abstract: Text messages over some period of time are collected. Topic identifiers, such as hashtags, are extracted from the text messages. The text messages associated with each topic identifier are processed to identify which topic identifiers are associated with group chats based on information associated with the text messages such as the times when the text messages were generated and whether the text messages identify user accounts. The topic identifiers that are determined to be associated with the group chats are incorporated into applications that allow users to search for group chats, and to view text messages from past group chats.

Type: Application

Filed: April 29, 2013

Publication date: October 30, 2014

Applicant: Microsoft Corporation

Inventors: Rakesh Agrawal, James A. Cook, Krishnaram Kenthapadi, Nina Mishra
PRESERVING GEOMETRIC PROPERTIES OF DATASETS WHILE PROTECTING PRIVACY

Publication number: 20140196151

Abstract: The privacy of a dataset is protected. A private dataset is received that includes multiple rows of multidimensional data. Each row may correspond to a user, and each dimension may be an attribute of the user. A projection matrix is applied to each row to generate a lower dimensional sketch of the row. Noise is added to each of the lower dimensional sketches. The sketches with the added noise may be published together with the projection matrix. The sketches preserve geometric relationships of the original dataset including clustering, distances, and nearest neighbor, and therefore may be useful for data mining purposes while still protecting the privacy of the users.

Type: Application

Filed: January 10, 2013

Publication date: July 10, 2014

Applicant: Microsoft Corporation

Inventors: Nina Mishra, Krishnaram Kenthapadi, IIya Mironov
System and method for determining an element value in private datasets

Patent number: 8738387

Abstract: A system and method is disclosed for determining a value of an element, having a k-th rank. The method discloses: calculating a total number of elements in a first and second dataset; prohibiting parties from accessing each other's dataset; ranking the elements within each dataset; computing a total number of elements in the datasets each having a value less than a test value; computing a total number of elements in the datasets each having a value greater than the test value; and setting the value of the element, having the k-th rank, equal to the test value, if the total number of elements having values less than the test value is ? the k-th rank minus one, and the total number of elements having values greater than the test value is ? the total number of elements minus the k-th rank. The system discloses means for practicing the method.

Type: Grant

Filed: October 29, 2004

Date of Patent: May 27, 2014

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Gagan Aggarwal, Nina Mishra, Binyamin Pinkas
Synopsis of a search log that respects user privacy

Patent number: 8601024

Abstract: Described is releasing output data representing a search log, in which the data is suitable for most data mining/analysis applications, but is safe to publish by preserving user privacy. The search log is processed such that a query is only included if a sufficient count of that query is present; noise may be added. User contributions that are considered may be limited to a maximum number of queries. The output may indicate how often (possibly plus noise) that each query appeared. Other output may comprise a query-action graph, a query-inaction graph and/or a query-reformulation graph, with nodes representing queries and nodes representing actions, inactions or reformulations (e.g., clicked URLs, skipped URLs, or selected related queries), and edges between nodes representing action, skip or selection counts (possibly plus noise). The output may correspond to the top results/related queries returned from a search.

Type: Grant

Filed: June 16, 2009

Date of Patent: December 3, 2013

Assignee: Microsoft Corporation

Inventors: Krishnaram Kenthapadi, Aleksandra Korolova, Nina Mishra, Alexandros Ntoulas

1 2 3 next