Patents by Inventor Charles Menguy
Charles Menguy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11914665Abstract: Multi-modal machine-learning model training techniques for search are described that overcome conventional challenges and inefficiencies to support real time output, which is not possible in conventional training techniques. In one example, a search system is configured to support multi-modal machine-learning model training. This includes use of a preview mode and an expanded mode. In the preview mode, a preview segment is generated as part of real time training of a machine learning model. In the expanded mode, the preview segment is persisted as an expanded segment that is used to train and utilize an expanded machine-learning model as part of search.Type: GrantFiled: February 18, 2022Date of Patent: February 27, 2024Assignee: Adobe Inc.Inventors: Matvey Kapilevich, Margarita R. Savova, Anup Bandigadi Rao, Tung Thanh Mai, Lakshmi Shivalingaiah, Liron Goren Snai, Charles Menguy, Vijeth Lomada, Moumita Sinha, Harleen Sahni
-
Patent number: 11899693Abstract: A cluster generation system identifies data elements, from a first binary record, that each have a particular value and correspond to respective binary traits. A candidate description function describing the binary traits is generated, the candidate description function including a model factor that describes the data elements. Responsive to determining that a second record has additional data elements having the particular value and corresponding to the respective binary traits, the candidate description function is modified to indicate that the model factor describes the additional elements. The candidate description function is also modified to include a correction factor describing an additional binary trait excluded from the respective binary traits. Based on the modified candidate description function, the cluster generation system generates a data summary cluster, which includes a compact representation of the binary traits of the data elements and additional data elements.Type: GrantFiled: February 22, 2022Date of Patent: February 13, 2024Assignee: Adobe Inc.Inventors: Yeuk-yin Chan, Tung Mai, Ryan Rossi, Moumita Sinha, Matvey Kapilevich, Margarita Savova, Fan Du, Charles Menguy, Anup Rao
-
Patent number: 11809455Abstract: Systems, methods, and non-transitory computer-readable media (systems) are disclosed for generating meaningful and insightful user segment reports based on a high dimensional data space. In particular, in one or more embodiments, the disclosed systems utilize a relaxed bi-clustering model to automatically identify user segments in a data space including datasets of features specific to individual users. In at least one embodiment, the disclosed systems identify and include users in automatically generated user segments even though those users are associated with some, but perhaps not all, of the features as other members in the automatically generated user segments.Type: GrantFiled: April 30, 2021Date of Patent: November 7, 2023Assignee: Adobe Inc.Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
-
Publication number: 20230267132Abstract: A cluster generation system identifies data elements, from a first binary record, that each have a particular value and correspond to respective binary traits. A candidate description function describing the binary traits is generated, the candidate description function including a model factor that describes the data elements. Responsive to determining that a second record has additional data elements having the particular value and corresponding to the respective binary traits, the candidate description function is modified to indicate that the model factor describes the additional elements. The candidate description function is also modified to include a correction factor describing an additional binary trait excluded from the respective binary traits. Based on the modified candidate description function, the cluster generation system generates a data summary cluster, which includes a compact representation of the binary traits of the data elements and additional data elements.Type: ApplicationFiled: February 22, 2022Publication date: August 24, 2023Inventors: Yeuk-yin Chan, Tung Mai, Ryan Rossi, Moumita Sinha, Matvey Kapilevich, Margarita Savova, Fan Du, Charles Menguy, Anup Rao
-
Publication number: 20230267158Abstract: Multi-modal machine-learning model training techniques for search are described that overcome conventional challenges and inefficiencies to support real time output, which is not possible in conventional training techniques. In one example, a search system is configured to support multi-modal machine-learning model training. This includes use of a preview mode and an expanded mode. In the preview mode, a preview segment is generated as part of real time training of a machine learning model. In the expanded mode, the preview segment is persisted as an expanded segment that is used to train and utilize an expanded machine-learning model as part of search.Type: ApplicationFiled: February 18, 2022Publication date: August 24, 2023Applicant: Adobe Inc.Inventors: Matvey Kapilevich, Margarita R. Savova, Anup Bandigadi Rao, Tung Thanh Mai, Lakshmi Shivalingaiah, Liron Goren Snai, Charles Menguy, Vijeth Lomada, Moumita Sinha, Harleen Sahni
-
Machine-learning techniques for evaluating suitability of candidate datasets for target applications
Patent number: 11704598Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.Type: GrantFiled: September 2, 2022Date of Patent: July 18, 2023Assignee: ADOBE INC.Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy -
Patent number: 11630854Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.Type: GrantFiled: April 22, 2022Date of Patent: April 18, 2023Assignee: Adobe Inc.Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
-
MACHINE-LEARNING TECHNIQUES FOR EVALUATING SUITABILITY OF CANDIDATE DATASETS FOR TARGET APPLICATIONS
Publication number: 20230004869Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.Type: ApplicationFiled: September 2, 2022Publication date: January 5, 2023Inventors: Kourosh MODARRESI, Hongyuan YUAN, Charles MENGUY -
Machine-learning techniques for evaluating suitability of candidate datasets for target applications
Patent number: 11481668Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.Type: GrantFiled: February 13, 2019Date of Patent: October 25, 2022Assignee: ADOBE INC.Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy -
Publication number: 20220253463Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.Type: ApplicationFiled: April 22, 2022Publication date: August 11, 2022Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
-
Patent number: 11355035Abstract: Certain embodiments involve generating a real-time notification to facilitate the delivery of customized content, based on detecting that a subject activity occurs within a customized subject region. For instance, a computing system updates a graphical interface to display, as a layer on a map, detected instances of a subject activity performed within a geographic area of the map. The computing system determines a polygon corresponding to a region of the geographic area, where the polygon is determined from a graphical input representing lines at locations on the map on which the detected instances are overlaid. The computing system determines that a location of a detected instance of the subject activity performed by a user device falls within the polygon. The computing system transmits a notification to a content provider, such that the content provider delivers customized content to the user device that performed the detected instance of the subject activity.Type: GrantFiled: August 7, 2019Date of Patent: June 7, 2022Assignee: Adobe Inc.Inventors: Charles Menguy, Navin Chaganti, Aditya Satishrao Borde
-
Patent number: 11328002Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.Type: GrantFiled: April 17, 2020Date of Patent: May 10, 2022Assignee: Adobe Inc.Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
-
Publication number: 20210326361Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.Type: ApplicationFiled: April 17, 2020Publication date: October 21, 2021Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
-
Publication number: 20210311969Abstract: Systems, methods, and non-transitory computer-readable media (systems) are disclosed for generating meaningful and insightful user segment reports based on a high dimensional data space. In particular, in one or more embodiments, the disclosed systems utilize a relaxed bi-clustering model to automatically identify user segments in a data space including datasets of features specific to individual users. In at least one embodiment, the disclosed systems identify and include users in automatically generated user segments even though those users are associated with some, but perhaps not all, of the features as other members in the automatically generated user segments.Type: ApplicationFiled: April 30, 2021Publication date: October 7, 2021Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
-
Patent number: 11023495Abstract: Systems, methods, and non-transitory computer-readable media (systems) are disclosed for generating meaningful and insightful user segment reports based on a high dimensional data space. In particular, in one or more embodiments, the disclosed systems utilize a relaxed bi-clustering model to automatically identify user segments in a data space including datasets of features specific to individual users. In at least one embodiment, the disclosed systems identify and include users in automatically generated user segments even though those users are associated with some, but perhaps not all, of the features as other members in the automatically generated user segments.Type: GrantFiled: March 19, 2018Date of Patent: June 1, 2021Assignee: ADOBE INC.Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
-
Patent number: 10997264Abstract: Systems and techniques for delivery of contextual interest from interaction information are described that process user interactions with digital content to generate user interest scores for various topics. A contextual user interest system uses user interaction data to identify and contextualize content, and assigns propensity scores to the contextualized content. By dynamically contextualizing pages of content, the contextual user interest system may adapt to changes in the content and provide more accurate and robust information over time, which is not possible using conventional techniques. The contextualized pages of content are used to assign user interest scores across a number of topics to users who have visited the pages of content, and the user interest scores are normalized in a manner that allows a user's degree of interest in a topic to be compared to that of another user.Type: GrantFiled: August 21, 2018Date of Patent: May 4, 2021Assignee: Adobe Inc.Inventors: Charles Menguy, Sudhakar Pandey, Roshan Singh, Ravi Prakash Singh, David Meier Weinstein, Ankush Sharma
-
Publication number: 20210043116Abstract: Certain embodiments involve generating a real-time notification to facilitate the delivery of customized content, based on detecting that a subject activity occurs within a customized subject region. For instance, a computing system updates a graphical interface to display, as a layer on a map, detected instances of a subject activity performed within a geographic area of the map. The computing system determines a polygon corresponding to a region of the geographic area, where the polygon is determined from a graphical input representing lines at locations on the map on which the detected instances are overlaid. The computing system determines that a location of a detected instance of the subject activity performed by a user device falls within the polygon. The computing system transmits a notification to a content provider, such that the content provider delivers customized content to the user device that performed the detected instance of the subject activity.Type: ApplicationFiled: August 7, 2019Publication date: February 11, 2021Inventors: Charles Menguy, Navin Chaganti, Aditya Satishrao Borde
-
Patent number: 10785134Abstract: Techniques are disclosed that provide more accurate clustering of devices by forming clusters of devices and merging or changing clusters based on predetermined criteria. The technique starts with a large number of clusters (e.g., one for each account) and refines the clusters, for example, by merging clusters or determining which cluster a given device should be in when the device is associated with multiple clusters. One technique iteratively adjusts clusters of devices by merging clusters determined to be associated with a single user until a cluster contains all of the devices and accounts expected to be associated with a single user.Type: GrantFiled: September 25, 2018Date of Patent: September 22, 2020Assignee: Adobe Inc.Inventors: Virgil-Artimon Palanciuc, Mihai Daniel Fecioru, Catalin Costache, Charles Menguy
-
MACHINE-LEARNING TECHNIQUES FOR EVALUATING SUITABILITY OF CANDIDATE DATASETS FOR TARGET APPLICATIONS
Publication number: 20200258002Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.Type: ApplicationFiled: February 13, 2019Publication date: August 13, 2020Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy -
Publication number: 20200065425Abstract: Systems and techniques for delivery of contextual interest from interaction information are described that process user interactions with digital content to generate user interest scores for various topics. A contextual user interest system uses user interaction data to identify and contextualize content, and assigns propensity scores to the contextualized content. By dynamically contextualizing pages of content, the contextual user interest system may adapt to changes in the content and provide more accurate and robust information over time, which is not possible using conventional techniques. The contextualized pages of content are used to assign user interest scores across a number of topics to users who have visited the pages of content, and the user interest scores are normalized in a manner that allows a user's degree of interest in a topic to be compared to that of another user.Type: ApplicationFiled: August 21, 2018Publication date: February 27, 2020Applicant: Adobe Inc.Inventors: Charles Menguy, Sudhakar Pandey, Roshan Singh, Ravi Prakash Singh, David Meier Weinstein, Ankush Sharma