Patents by Inventor Charles Menguy

Charles Menguy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multi-modal machine-learning model training for search

Patent number: 11914665

Abstract: Multi-modal machine-learning model training techniques for search are described that overcome conventional challenges and inefficiencies to support real time output, which is not possible in conventional training techniques. In one example, a search system is configured to support multi-modal machine-learning model training. This includes use of a preview mode and an expanded mode. In the preview mode, a preview segment is generated as part of real time training of a machine learning model. In the expanded mode, the preview segment is persisted as an expanded segment that is used to train and utilize an expanded machine-learning model as part of search.

Type: Grant

Filed: February 18, 2022

Date of Patent: February 27, 2024

Assignee: Adobe Inc.

Inventors: Matvey Kapilevich, Margarita R. Savova, Anup Bandigadi Rao, Tung Thanh Mai, Lakshmi Shivalingaiah, Liron Goren Snai, Charles Menguy, Vijeth Lomada, Moumita Sinha, Harleen Sahni
Trait expansion techniques in binary matrix datasets

Patent number: 11899693

Abstract: A cluster generation system identifies data elements, from a first binary record, that each have a particular value and correspond to respective binary traits. A candidate description function describing the binary traits is generated, the candidate description function including a model factor that describes the data elements. Responsive to determining that a second record has additional data elements having the particular value and corresponding to the respective binary traits, the candidate description function is modified to indicate that the model factor describes the additional elements. The candidate description function is also modified to include a correction factor describing an additional binary trait excluded from the respective binary traits. Based on the modified candidate description function, the cluster generation system generates a data summary cluster, which includes a compact representation of the binary traits of the data elements and additional data elements.

Type: Grant

Filed: February 22, 2022

Date of Patent: February 13, 2024

Assignee: Adobe Inc.

Inventors: Yeuk-yin Chan, Tung Mai, Ryan Rossi, Moumita Sinha, Matvey Kapilevich, Margarita Savova, Fan Du, Charles Menguy, Anup Rao
Automatically generating user segments

Patent number: 11809455

Abstract: Systems, methods, and non-transitory computer-readable media (systems) are disclosed for generating meaningful and insightful user segment reports based on a high dimensional data space. In particular, in one or more embodiments, the disclosed systems utilize a relaxed bi-clustering model to automatically identify user segments in a data space including datasets of features specific to individual users. In at least one embodiment, the disclosed systems identify and include users in automatically generated user segments even though those users are associated with some, but perhaps not all, of the features as other members in the automatically generated user segments.

Type: Grant

Filed: April 30, 2021

Date of Patent: November 7, 2023

Assignee: Adobe Inc.

Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
Trait Expansion Techniques in Binary Matrix Datasets

Publication number: 20230267132

Abstract: A cluster generation system identifies data elements, from a first binary record, that each have a particular value and correspond to respective binary traits. A candidate description function describing the binary traits is generated, the candidate description function including a model factor that describes the data elements. Responsive to determining that a second record has additional data elements having the particular value and corresponding to the respective binary traits, the candidate description function is modified to indicate that the model factor describes the additional elements. The candidate description function is also modified to include a correction factor describing an additional binary trait excluded from the respective binary traits. Based on the modified candidate description function, the cluster generation system generates a data summary cluster, which includes a compact representation of the binary traits of the data elements and additional data elements.

Type: Application

Filed: February 22, 2022

Publication date: August 24, 2023

Inventors: Yeuk-yin Chan, Tung Mai, Ryan Rossi, Moumita Sinha, Matvey Kapilevich, Margarita Savova, Fan Du, Charles Menguy, Anup Rao
Multi-Modal Machine-Learning Model Training for Search

Publication number: 20230267158

Abstract: Multi-modal machine-learning model training techniques for search are described that overcome conventional challenges and inefficiencies to support real time output, which is not possible in conventional training techniques. In one example, a search system is configured to support multi-modal machine-learning model training. This includes use of a preview mode and an expanded mode. In the preview mode, a preview segment is generated as part of real time training of a machine learning model. In the expanded mode, the preview segment is persisted as an expanded segment that is used to train and utilize an expanded machine-learning model as part of search.

Type: Application

Filed: February 18, 2022

Publication date: August 24, 2023

Applicant: Adobe Inc.

Inventors: Matvey Kapilevich, Margarita R. Savova, Anup Bandigadi Rao, Tung Thanh Mai, Lakshmi Shivalingaiah, Liron Goren Snai, Charles Menguy, Vijeth Lomada, Moumita Sinha, Harleen Sahni
Machine-learning techniques for evaluating suitability of candidate datasets for target applications

Patent number: 11704598

Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.

Type: Grant

Filed: September 2, 2022

Date of Patent: July 18, 2023

Assignee: ADOBE INC.

Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
Segmenting users with sparse data utilizing hash partitions

Patent number: 11630854

Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.

Type: Grant

Filed: April 22, 2022

Date of Patent: April 18, 2023

Assignee: Adobe Inc.

Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
MACHINE-LEARNING TECHNIQUES FOR EVALUATING SUITABILITY OF CANDIDATE DATASETS FOR TARGET APPLICATIONS

Publication number: 20230004869

Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.

Type: Application

Filed: September 2, 2022

Publication date: January 5, 2023

Inventors: Kourosh MODARRESI, Hongyuan YUAN, Charles MENGUY
Machine-learning techniques for evaluating suitability of candidate datasets for target applications

Patent number: 11481668

Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.

Type: Grant

Filed: February 13, 2019

Date of Patent: October 25, 2022

Assignee: ADOBE INC.

Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
SEGMENTING USERS WITH SPARSE DATA UTILIZING HASH PARTITIONS

Publication number: 20220253463

Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.

Type: Application

Filed: April 22, 2022

Publication date: August 11, 2022

Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
Customized geospatial population segmentation based on a received polygon definition

Patent number: 11355035

Abstract: Certain embodiments involve generating a real-time notification to facilitate the delivery of customized content, based on detecting that a subject activity occurs within a customized subject region. For instance, a computing system updates a graphical interface to display, as a layer on a map, detected instances of a subject activity performed within a geographic area of the map. The computing system determines a polygon corresponding to a region of the geographic area, where the polygon is determined from a graphical input representing lines at locations on the map on which the detected instances are overlaid. The computing system determines that a location of a detected instance of the subject activity performed by a user device falls within the polygon. The computing system transmits a notification to a content provider, such that the content provider delivers customized content to the user device that performed the detected instance of the subject activity.

Type: Grant

Filed: August 7, 2019

Date of Patent: June 7, 2022

Assignee: Adobe Inc.

Inventors: Charles Menguy, Navin Chaganti, Aditya Satishrao Borde
Dynamic clustering of sparse data utilizing hash partitions

Patent number: 11328002

Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.

Type: Grant

Filed: April 17, 2020

Date of Patent: May 10, 2022

Assignee: Adobe Inc.

Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
DYNAMIC CLUSTERING OF SPARSE DATA UTILIZING HASH PARTITIONS

Publication number: 20210326361

Abstract: The present disclosure describes systems, non-transitory computer-readable media, and methods for utilizing hash partitions to determine local densities and distances among users (or among other represented data points) for clustering sparse data into segments. For instance, the disclosed systems can generate hash signatures for users in a sparse dataset and can map users to hash partitions based on the hash signatures. The disclosed systems can further determine local densities and separation distances for particular users (or other represented data points) within the hash partitions. Upon determining local densities and separation distances for datapoints from the dataset, the disclosed systems can select a segment (or cluster of data points) grouped according to a hierarchy of a clustering algorithm, such as a density-peaks-clustering algorithm.

Type: Application

Filed: April 17, 2020

Publication date: October 21, 2021

Inventors: Fan Du, Yeuk-Yin Chan, Eunyee Koh, Ryan Rossi, Margarita Savova, Charles Menguy, Anup Rao
AUTOMATICALLY GENERATING USER SEGMENTS

Publication number: 20210311969

Abstract: Systems, methods, and non-transitory computer-readable media (systems) are disclosed for generating meaningful and insightful user segment reports based on a high dimensional data space. In particular, in one or more embodiments, the disclosed systems utilize a relaxed bi-clustering model to automatically identify user segments in a data space including datasets of features specific to individual users. In at least one embodiment, the disclosed systems identify and include users in automatically generated user segments even though those users are associated with some, but perhaps not all, of the features as other members in the automatically generated user segments.

Type: Application

Filed: April 30, 2021

Publication date: October 7, 2021

Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
Automatically generating meaningful user segments

Patent number: 11023495

Abstract: Systems, methods, and non-transitory computer-readable media (systems) are disclosed for generating meaningful and insightful user segment reports based on a high dimensional data space. In particular, in one or more embodiments, the disclosed systems utilize a relaxed bi-clustering model to automatically identify user segments in a data space including datasets of features specific to individual users. In at least one embodiment, the disclosed systems identify and include users in automatically generated user segments even though those users are associated with some, but perhaps not all, of the features as other members in the automatically generated user segments.

Type: Grant

Filed: March 19, 2018

Date of Patent: June 1, 2021

Assignee: ADOBE INC.

Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
Delivery of contextual interest from interaction information

Patent number: 10997264

Abstract: Systems and techniques for delivery of contextual interest from interaction information are described that process user interactions with digital content to generate user interest scores for various topics. A contextual user interest system uses user interaction data to identify and contextualize content, and assigns propensity scores to the contextualized content. By dynamically contextualizing pages of content, the contextual user interest system may adapt to changes in the content and provide more accurate and robust information over time, which is not possible using conventional techniques. The contextualized pages of content are used to assign user interest scores across a number of topics to users who have visited the pages of content, and the user interest scores are normalized in a manner that allows a user's degree of interest in a topic to be compared to that of another user.

Type: Grant

Filed: August 21, 2018

Date of Patent: May 4, 2021

Assignee: Adobe Inc.

Inventors: Charles Menguy, Sudhakar Pandey, Roshan Singh, Ravi Prakash Singh, David Meier Weinstein, Ankush Sharma
CUSTOMIZED GEOSPATIAL POPULATION SEGMENTATION BASED ON A RECEIVED POLYGON DEFINITION

Publication number: 20210043116

Abstract: Certain embodiments involve generating a real-time notification to facilitate the delivery of customized content, based on detecting that a subject activity occurs within a customized subject region. For instance, a computing system updates a graphical interface to display, as a layer on a map, detected instances of a subject activity performed within a geographic area of the map. The computing system determines a polygon corresponding to a region of the geographic area, where the polygon is determined from a graphical input representing lines at locations on the map on which the detected instances are overlaid. The computing system determines that a location of a detected instance of the subject activity performed by a user device falls within the polygon. The computing system transmits a notification to a content provider, such that the content provider delivers customized content to the user device that performed the detected instance of the subject activity.

Type: Application

Filed: August 7, 2019

Publication date: February 11, 2021

Inventors: Charles Menguy, Navin Chaganti, Aditya Satishrao Borde
Identifying multiple devices belonging to a single user

Patent number: 10785134

Abstract: Techniques are disclosed that provide more accurate clustering of devices by forming clusters of devices and merging or changing clusters based on predetermined criteria. The technique starts with a large number of clusters (e.g., one for each account) and refines the clusters, for example, by merging clusters or determining which cluster a given device should be in when the device is associated with multiple clusters. One technique iteratively adjusts clusters of devices by merging clusters determined to be associated with a single user until a cluster contains all of the devices and accounts expected to be associated with a single user.

Type: Grant

Filed: September 25, 2018

Date of Patent: September 22, 2020

Assignee: Adobe Inc.

Inventors: Virgil-Artimon Palanciuc, Mihai Daniel Fecioru, Catalin Costache, Charles Menguy
MACHINE-LEARNING TECHNIQUES FOR EVALUATING SUITABILITY OF CANDIDATE DATASETS FOR TARGET APPLICATIONS

Publication number: 20200258002

Abstract: Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.

Type: Application

Filed: February 13, 2019

Publication date: August 13, 2020

Inventors: Kourosh Modarresi, Hongyuan Yuan, Charles Menguy
Delivery of Contextual Interest from Interaction Information

Publication number: 20200065425

Abstract: Systems and techniques for delivery of contextual interest from interaction information are described that process user interactions with digital content to generate user interest scores for various topics. A contextual user interest system uses user interaction data to identify and contextualize content, and assigns propensity scores to the contextualized content. By dynamically contextualizing pages of content, the contextual user interest system may adapt to changes in the content and provide more accurate and robust information over time, which is not possible using conventional techniques. The contextualized pages of content are used to assign user interest scores across a number of topics to users who have visited the pages of content, and the user interest scores are normalized in a manner that allows a user's degree of interest in a topic to be compared to that of another user.

Type: Application

Filed: August 21, 2018

Publication date: February 27, 2020

Applicant: Adobe Inc.

Inventors: Charles Menguy, Sudhakar Pandey, Roshan Singh, Ravi Prakash Singh, David Meier Weinstein, Ankush Sharma

1 2 next