Patents by Inventor Nina Mishra
Nina Mishra has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240005022Abstract: Computer systems and associated methods are disclosed to implement the non-interactive join of privacy-preserving dataset sketches. In some embodiments, an entity can publish a one-time sketch of their dataset that would enable another entity to join their data without exposing private information. The sketch can map, using a hash function, the identities associated with a first value of the dataset to a data structure, in some embodiments. A same or different entity can join the first sketch with a privacy-preserving second sketch of a second dataset that includes added noise, and can determine an estimate of a number of identities that correspond with specific values of the first and second datasets from the joined dataset. The sketch can be published just one time, and therefore does not require separate new private computations with privacy budgeting for each additional party when a join is desired, in some embodiments.Type: ApplicationFiled: June 30, 2022Publication date: January 4, 2024Applicant: Amazon Technologies, Inc.Inventors: James Alexander Cook, Nina Mishra
-
Patent number: 11853912Abstract: Described are systems and methods for determining causal connections between various metrics collected by wearable devices and using those causal connections to provide causal insights to other users. For example, some users may elect to perform one or more self-experiments to explore the impact certain changes in their behavior may have on metrics measured by the user's wearable device. Causal connections determined from those experiments may be used to provide causal insights relating to those metrics to other users who have not performed the same or similar experiments.Type: GrantFiled: January 30, 2020Date of Patent: December 26, 2023Assignee: Amazon Technologies, Inc.Inventors: Shiva Prasad Kasiviswanathan, Nina Mishra, Yonatan Naamad
-
Patent number: 11797572Abstract: Techniques for hotspot detection in a dataset are described. A hotspot being a region (or a collection of points) where the value of a function of given any region in the space measures the concentration of points in that region is significantly higher than its other regions of the dataspace. As such, a region that has a denser concentration of points than other regions of the dataspace may be considered a hotspot. In some implementations, hotspot detection includes finding two or more regions to evaluate for high-density in the dataset, a high-density region indicating a potential hotspot and extending a size of the manipulated found two or more regions to determine borders for these regions.Type: GrantFiled: August 7, 2018Date of Patent: October 24, 2023Assignee: Amazon Technologies, Inc.Inventors: Yonatan Naamad, Shiva Prasad Kasiviswanathan, Nina Mishra, Morteza Monemizadeh, Lauren Anne Moos, Joshua M. Tokle
-
Patent number: 11308407Abstract: Examples of techniques for anomaly detection with feedback are described. An instance includes a technique is receiving a plurality of unlabeled data points from an input stream; performing anomaly detection on a point of the unlabeled data points using an anomaly detection engine; pre-processing the unlabeled data point that was subjected to anomaly detection; classifying the pre-processed unlabeled data point; determining the anomaly detection was not proper based on a comparison of a result of the anomaly detection and a result of the classifying of the pre-processed unlabeled data point; and in response to determining the anomaly detection was not proper, providing feedback to the anomaly detection engine to change at least one emphasis used in anomaly detection.Type: GrantFiled: December 14, 2017Date of Patent: April 19, 2022Assignee: Amazon Technologies, Inc.Inventors: Sudipto Guha, Tal Wagner, Shiva Prasad Kasiviswanathan, Nina Mishra
-
Publication number: 20220100721Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.Type: ApplicationFiled: December 13, 2021Publication date: March 31, 2022Applicant: Amazon Technologies, Inc.Inventors: Nina Mishra, Daniel Blick, Sudipto Guha, Okke Joost Schrijvers
-
Patent number: 11232085Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.Type: GrantFiled: January 7, 2016Date of Patent: January 25, 2022Assignee: Amazon Technologies, Inc.Inventors: Nina Mishra, Daniel Blick, Sudipto Guha, Okke Joost Schrijvers
-
Patent number: 11003717Abstract: Techniques for detecting anomalies in streaming graph data are described. For example, an embedding technique of generating a multi-dimensional vector of summations of each weighted edge found in both a random source bounding proper subset and a random destination bounding proper subset associated with a dimension of the epoch graph is detailed. Anomaly detection is performed on the generated multi-dimensional vectors.Type: GrantFiled: February 8, 2018Date of Patent: May 11, 2021Assignee: Amazon Technologies, Inc.Inventors: Dhivya Eswaran, Sudipto Guha, Nina Mishra
-
Patent number: 10902062Abstract: At an artificial intelligence system, a random cut tree corresponding to a sample of a multi-dimensional data set is traversed to determine a tree-specific vector indicating respective contributions of individual dimensions to an anomaly score of a particular data point. Level-specific vectors of per-dimension contributions obtained using bounding-box analyses at each level during the traversal are aggregated to obtain the tree-specific vector. An overall anomaly score contribution for at least one dimension is obtained using respective tree-specific vectors generated from one or more random cut trees, and an indication of the overall anomaly score contribution is provided.Type: GrantFiled: August 24, 2017Date of Patent: January 26, 2021Assignee: Amazon Technologies, Inc.Inventors: Sudipto Guha, Nina Mishra
-
Patent number: 10713289Abstract: Systems, methods, and devices for performing interactive question answering using data source credibility and conversation entropy are disclosed. A speech-controlled device captures audio including a spoken question, and sends audio data corresponding thereto to a server(s). The server(s) performs speech processing on the audio data, and determines various stored data that can be used to determine an answer to the question. The server(s) determines which stored data to use based on the credibility of the source from which the stored data was received. The server(s) may also determine a number of user interactions needed to obtain data in order to fully answer the question and may select a question for a dialog soliciting further data based on the number of user interactions.Type: GrantFiled: March 31, 2017Date of Patent: July 14, 2020Assignee: Amazon Technologies, Inc.Inventors: Nina Mishra, Yonatan Naamad
-
Patent number: 10637807Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.Type: GrantFiled: October 12, 2017Date of Patent: April 28, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Nina Mishra, Abhimanyu Das, Krishnaram Kenthapadi
-
Publication number: 20180034752Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.Type: ApplicationFiled: October 12, 2017Publication date: February 1, 2018Inventors: Nina MISHRA, Abhimanyu DAS, Krishnaram KENTHAPADI
-
Patent number: 9819618Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.Type: GrantFiled: June 18, 2014Date of Patent: November 14, 2017Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Nina Mishra, Abhimanyu Das, Krishnaram Kenthapadi
-
Publication number: 20170199902Abstract: Random cut trees are generated with respective to respective samples of a baseline set of data records of a data set for which outlier detection is to be performed. To construct a particular random cut tree, an iterative splitting technique is used, in which the attribute along which a given set of data records is split is selected based on its value range. With respect to a newly-received data record of the stream, an outlier score is determined based at least partly on a potential insertion location of a node representing the data record in a particular random cut tree, without necessarily modifying the random cut tree.Type: ApplicationFiled: January 7, 2016Publication date: July 13, 2017Applicant: Amazon Technologies, Inc.Inventors: NINA MISHRA, DANIEL BLICK, SUDIPTO GUHA, OKKE JOOST SCHRIJVERS
-
Publication number: 20150370797Abstract: Messages are collected and processed to determine topic identifiers that correspond to discussion groups. Queries are received and multiple discussion groups that are relevant to the query are determined based on the messages that are associated with the discussion groups and the topic identifiers associated with the discussion groups. The relevant discussion groups are ranked using a group preference model that simulates the behavior of a hypothetical seeker that considers discussion groups by selecting a message author who is an authority in a particular group, and exploring the discussion groups that are preferred by the selected author. The behavior of the seeker is simulated using a stationary Markov process and is used to generate a probability distribution that is used to rank the relevant discussion groups. The ranked relevant discussion groups are provided in response to the query.Type: ApplicationFiled: June 18, 2014Publication date: December 24, 2015Inventors: Nina Mishra, Abhimanyu Das, Krishnaram Kenthapadi
-
Patent number: 9171045Abstract: A set of queries, such as a search log, is divided into commercial queries and non-commercial queries. A first set of query communities is determined from the non-commercial queries and a second set is determined from the commercial queries. The query communities are correlated based on the users who submitted the queries and instances where a query from the first set of query communities was followed by a query from the second set to generate a mapping between the first set of query communities and the second set. Later, a non-commercial query is received from a user, and the mapping is used to predict one or more commercial queries that the user is likely to submit in the future based on the non-commercial query. One or more of the commercial queries are presented to the user according to the mapping with search results responsive to the non-commercial query.Type: GrantFiled: November 11, 2010Date of Patent: October 27, 2015Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Nina Mishra, Sreenivas Gollapudi, Srikanth Jagabathula
-
Publication number: 20150120680Abstract: One or more techniques and/or systems are provided for providing a discussion summary corresponding to a search query and/or for providing discussion session search results. For example, discussion data (e.g., corresponding to real-time messaging, such as a microblog discussion) may be evaluated to identify a discussion topic for a discussion sessions (e.g., a kitchen renovation topic may be assigned to a 1 hour exchange of kitchen renovation messages by a discussion group). A discussion summary of a discussion session may be provided based upon the discussion session having a discussion topic corresponding to a search query topic of a search query. The discussion summary may be provided along with other results for the query and may describe the discussion group, identifiers such as hashtags used by the discussion group, meeting dates/times, average number(s) of participants, other discussion sessions hosted by the discussion group, future discussion sessions, and/or other information.Type: ApplicationFiled: October 24, 2013Publication date: April 30, 2015Applicant: Microsoft CorporationInventors: Omar Alonso, Kartikay Khandelwal, Mohamed Mansour, Paul Ko, Nina Mishra, Krishnaram Kenthapadi, Abhimanyu Das
-
Publication number: 20140324982Abstract: Text messages over some period of time are collected. Topic identifiers, such as hashtags, are extracted from the text messages. The text messages associated with each topic identifier are processed to identify which topic identifiers are associated with group chats based on information associated with the text messages such as the times when the text messages were generated and whether the text messages identify user accounts. The topic identifiers that are determined to be associated with the group chats are incorporated into applications that allow users to search for group chats, and to view text messages from past group chats.Type: ApplicationFiled: April 29, 2013Publication date: October 30, 2014Applicant: Microsoft CorporationInventors: Rakesh Agrawal, James A. Cook, Krishnaram Kenthapadi, Nina Mishra
-
Publication number: 20140196151Abstract: The privacy of a dataset is protected. A private dataset is received that includes multiple rows of multidimensional data. Each row may correspond to a user, and each dimension may be an attribute of the user. A projection matrix is applied to each row to generate a lower dimensional sketch of the row. Noise is added to each of the lower dimensional sketches. The sketches with the added noise may be published together with the projection matrix. The sketches preserve geometric relationships of the original dataset including clustering, distances, and nearest neighbor, and therefore may be useful for data mining purposes while still protecting the privacy of the users.Type: ApplicationFiled: January 10, 2013Publication date: July 10, 2014Applicant: Microsoft CorporationInventors: Nina Mishra, Krishnaram Kenthapadi, IIya Mironov
-
Patent number: 8738387Abstract: A system and method is disclosed for determining a value of an element, having a k-th rank. The method discloses: calculating a total number of elements in a first and second dataset; prohibiting parties from accessing each other's dataset; ranking the elements within each dataset; computing a total number of elements in the datasets each having a value less than a test value; computing a total number of elements in the datasets each having a value greater than the test value; and setting the value of the element, having the k-th rank, equal to the test value, if the total number of elements having values less than the test value is ? the k-th rank minus one, and the total number of elements having values greater than the test value is ? the total number of elements minus the k-th rank. The system discloses means for practicing the method.Type: GrantFiled: October 29, 2004Date of Patent: May 27, 2014Assignee: Hewlett-Packard Development Company, L.P.Inventors: Gagan Aggarwal, Nina Mishra, Binyamin Pinkas
-
Patent number: 8601024Abstract: Described is releasing output data representing a search log, in which the data is suitable for most data mining/analysis applications, but is safe to publish by preserving user privacy. The search log is processed such that a query is only included if a sufficient count of that query is present; noise may be added. User contributions that are considered may be limited to a maximum number of queries. The output may indicate how often (possibly plus noise) that each query appeared. Other output may comprise a query-action graph, a query-inaction graph and/or a query-reformulation graph, with nodes representing queries and nodes representing actions, inactions or reformulations (e.g., clicked URLs, skipped URLs, or selected related queries), and edges between nodes representing action, skip or selection counts (possibly plus noise). The output may correspond to the top results/related queries returned from a search.Type: GrantFiled: June 16, 2009Date of Patent: December 3, 2013Assignee: Microsoft CorporationInventors: Krishnaram Kenthapadi, Aleksandra Korolova, Nina Mishra, Alexandros Ntoulas