Patents by Inventor Matvey Kapilevich

Matvey Kapilevich has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multi-modal machine-learning model training for search

Patent number: 11914665

Abstract: Multi-modal machine-learning model training techniques for search are described that overcome conventional challenges and inefficiencies to support real time output, which is not possible in conventional training techniques. In one example, a search system is configured to support multi-modal machine-learning model training. This includes use of a preview mode and an expanded mode. In the preview mode, a preview segment is generated as part of real time training of a machine learning model. In the expanded mode, the preview segment is persisted as an expanded segment that is used to train and utilize an expanded machine-learning model as part of search.

Type: Grant

Filed: February 18, 2022

Date of Patent: February 27, 2024

Assignee: Adobe Inc.

Inventors: Matvey Kapilevich, Margarita R. Savova, Anup Bandigadi Rao, Tung Thanh Mai, Lakshmi Shivalingaiah, Liron Goren Snai, Charles Menguy, Vijeth Lomada, Moumita Sinha, Harleen Sahni
Trait expansion techniques in binary matrix datasets

Patent number: 11899693

Abstract: A cluster generation system identifies data elements, from a first binary record, that each have a particular value and correspond to respective binary traits. A candidate description function describing the binary traits is generated, the candidate description function including a model factor that describes the data elements. Responsive to determining that a second record has additional data elements having the particular value and corresponding to the respective binary traits, the candidate description function is modified to indicate that the model factor describes the additional elements. The candidate description function is also modified to include a correction factor describing an additional binary trait excluded from the respective binary traits. Based on the modified candidate description function, the cluster generation system generates a data summary cluster, which includes a compact representation of the binary traits of the data elements and additional data elements.

Type: Grant

Filed: February 22, 2022

Date of Patent: February 13, 2024

Assignee: Adobe Inc.

Inventors: Yeuk-yin Chan, Tung Mai, Ryan Rossi, Moumita Sinha, Matvey Kapilevich, Margarita Savova, Fan Du, Charles Menguy, Anup Rao
Multi-Modal Machine-Learning Model Training for Search

Publication number: 20230267158

Abstract: Multi-modal machine-learning model training techniques for search are described that overcome conventional challenges and inefficiencies to support real time output, which is not possible in conventional training techniques. In one example, a search system is configured to support multi-modal machine-learning model training. This includes use of a preview mode and an expanded mode. In the preview mode, a preview segment is generated as part of real time training of a machine learning model. In the expanded mode, the preview segment is persisted as an expanded segment that is used to train and utilize an expanded machine-learning model as part of search.

Type: Application

Filed: February 18, 2022

Publication date: August 24, 2023

Applicant: Adobe Inc.

Inventors: Matvey Kapilevich, Margarita R. Savova, Anup Bandigadi Rao, Tung Thanh Mai, Lakshmi Shivalingaiah, Liron Goren Snai, Charles Menguy, Vijeth Lomada, Moumita Sinha, Harleen Sahni
Trait Expansion Techniques in Binary Matrix Datasets

Publication number: 20230267132

Abstract: A cluster generation system identifies data elements, from a first binary record, that each have a particular value and correspond to respective binary traits. A candidate description function describing the binary traits is generated, the candidate description function including a model factor that describes the data elements. Responsive to determining that a second record has additional data elements having the particular value and corresponding to the respective binary traits, the candidate description function is modified to indicate that the model factor describes the additional elements. The candidate description function is also modified to include a correction factor describing an additional binary trait excluded from the respective binary traits. Based on the modified candidate description function, the cluster generation system generates a data summary cluster, which includes a compact representation of the binary traits of the data elements and additional data elements.

Type: Application

Filed: February 22, 2022

Publication date: August 24, 2023

Inventors: Yeuk-yin Chan, Tung Mai, Ryan Rossi, Moumita Sinha, Matvey Kapilevich, Margarita Savova, Fan Du, Charles Menguy, Anup Rao
Generating overlap estimations between high-volume digital data sets based on multiple sketch vector similarity estimators

Patent number: 11720592

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that estimate the overlap between sets of data samples. In particular, in one or more embodiments, the disclosed systems utilize a sketch-based sampling routine and a flexible, accurate estimator to determine the overlap (e.g., the intersection) between sets of data samples. For example, in some implementations, the disclosed systems generate a sketch vector—such as a one permutation hashing vector—for each set of data samples. The disclosed systems further compare the sketch vectors to determine an equal bin similarity estimator, a lesser bin similarity estimator, and a greater bin similarity estimator. The disclosed systems utilize one or more of the determined similarity estimators in generating an overlap estimation for the sets of data samples.

Type: Grant

Filed: August 10, 2022

Date of Patent: August 8, 2023

Assignee: Adobe Inc.

Inventors: Anup Rao, Tung Mai, Matvey Kapilevich
GENERATING OVERLAP ESTIMATIONS BETWEEN HIGH-VOLUME DIGITAL DATA SETS BASED ON MULTIPLE SKETCH VECTOR SIMILARITY ESTIMATORS

Publication number: 20220391407

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that estimate the overlap between sets of data samples. In particular, in one or more embodiments, the disclosed systems utilize a sketch-based sampling routine and a flexible, accurate estimator to determine the overlap (e.g., the intersection) between sets of data samples. For example, in some implementations, the disclosed systems generate a sketch vector—such as a one permutation hashing vector—for each set of data samples. The disclosed systems further compare the sketch vectors to determine an equal bin similarity estimator, a lesser bin similarity estimator, and a greater bin similarity estimator. The disclosed systems utilize one or more of the determined similarity estimators in generating an overlap estimation for the sets of data samples.

Type: Application

Filed: August 10, 2022

Publication date: December 8, 2022

Inventors: Anup Rao, Tung Mai, Matvey Kapilevich
Generating overlap estimations between high-volume digital data sets based on multiple sketch vector similarity estimators

Patent number: 11449523

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that estimate the overlap between sets of data samples. In particular, in one or more embodiments, the disclosed systems utilize a sketch-based sampling routine and a flexible, accurate estimator to determine the overlap (e.g., the intersection) between sets of data samples. For example, in some implementations, the disclosed systems generate a sketch vector—such as a one permutation hashing vector—for each set of data samples. The disclosed systems further compare the sketch vectors to determine an equal bin similarity estimator, a lesser bin similarity estimator, and a greater bin similarity estimator. The disclosed systems utilize one or more of the determined similarity estimators in generating an overlap estimation for the sets of data samples.

Type: Grant

Filed: November 5, 2020

Date of Patent: September 20, 2022

Assignee: Adobe Inc.

Inventors: Anup Rao, Tung Mai, Matvey Kapilevich
GENERATING OVERLAP ESTIMATIONS BETWEEN HIGH-VOLUME DIGITAL DATA SETS BASED ON MULTIPLE SKETCH VECTOR SIMILARITY ESTIMATORS

Publication number: 20220138218

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that estimate the overlap between sets of data samples. In particular, in one or more embodiments, the disclosed systems utilize a sketch-based sampling routine and a flexible, accurate estimator to determine the overlap (e.g., the intersection) between sets of data samples. For example, in some implementations, the disclosed systems generate a sketch vector—such as a one permutation hashing vector—for each set of data samples. The disclosed systems further compare the sketch vectors to determine an equal bin similarity estimator, a lesser bin similarity estimator, and a greater bin similarity estimator. The disclosed systems utilize one or more of the determined similarity estimators in generating an overlap estimation for the sets of data samples.

Type: Application

Filed: November 5, 2020

Publication date: May 5, 2022

Inventors: Anup Rao, Tung Mai, Matvey Kapilevich
Utilizing one hash permutation and populated-value-slot-based densification for generating audience segment trait recommendations

Patent number: 11109085

Abstract: The present disclosure relates to training a recommendation model to generate trait recommendations using one permutation hashing and populated-value-slot-based densification. In particular, the disclosed systems can train the recommendation model by computing sketch vectors corresponding to traits using one permutation hashing. The disclosed systems can then fill in unpopulated value slots of the sketch vectors using populated-value-slot-based densification. The disclosed systems can combine the resulting densified sketches to generate the trained recommendation model. For example, in some embodiments, the disclosed systems can combine the sketches by generating a plurality of locality sensitive hashing tables based on the sketches. In some embodiments, the disclosed systems generate a count sketch matrix based on the sketches and generate trait embeddings based on the count sketch matrix using spectral embedding.

Type: Grant

Filed: March 28, 2019

Date of Patent: August 31, 2021

Assignee: ADOBE INC.

Inventors: Anup Rao, Yasin Abbasi Yadkori, Tung Mai, Ryan Rossi, Ritwik Sinha, Matvey Kapilevich, Alexandru Ionut Hodorogea
Methods and systems for identifying multiple devices belonging to a single user by merging deterministic and probabilistic data to generate a cross device data structure

Patent number: 11093565

Abstract: Systems and methods are disclosed for clustering multiple devices that are associated with particular users by utilizing both probabilistic and deterministic data derived from analytics information on the users. An analytics computing system generates at least one deterministic device cluster that groups a first set of devices associated with a first user. The first set of devices share deterministic user identifiers specific to the first user. The analytics computing system also identifies a probabilistic link between a device in the first set of devices and additional devices. The probabilistic link indicates common usage patterns between two devices. Based on the probabilistic link, the analytics computing system generates a data structure that includes the deterministic device cluster and the additional devices.

Type: Grant

Filed: February 27, 2019

Date of Patent: August 17, 2021

Assignee: ADOBE INC.

Inventors: Karthik Raman, Nedim Lipka, Matvey Kapilevich
PREDICTING A PERSONA CLASS BASED ON OVERLAP-AGNOSTIC MACHINE LEARNING MODELS FOR DISTRIBUTING PERSONA-BASED DIGITAL CONTENT

Publication number: 20210056458

Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for intelligently predicting a persona class of a client device and/or target user utilizing an overlap-agnostic machine learning model and distributing persona-based digital content to the client device. In particular, in one or more embodiments, the persona classification system can learn overlap-agnostic machine learning model parameters to apply to user traits in real-time or in offline batches. For example, the persona classification system can train and utilize an overlap-agnostic machine learning model that includes an overlap-agnostic embedding model, a trained user-embedding generation model, and a trained persona prediction model. By applying the learned overlap-agnostic machine learning model parameters to the target user traits, the persona classification system can predict a persona class for sending digital content based on the predicted persona class.

Type: Application

Filed: August 20, 2019

Publication date: February 25, 2021

Inventors: Margarita Savova, Matvey Kapilevich, Lakshmi Shivalingaiah, Anup Rao, Alexandru Ionut Hodorogea, Harleen Singh Sahni
Audience size estimation and complex segment logic

Patent number: 10803471

Abstract: Selection of a trait may be received. A complex segment rule may be created that is usable to evaluate one or more qualification events. For example, the segment rule may be usable to evaluate a combined recency and frequency of the one or more qualification events. The qualification events may be based on collected network data associated with the plurality of visitors with each qualification event corresponding to a separate qualification of visitor according to the trait. The qualification events may be evaluated together according to the segment rule. For example, the combined recency and frequency of the one or more qualification events may be evaluated according to the segment rule. Evaluating the segment rule may include estimating a segment population size in real-time.

Type: Grant

Filed: September 27, 2012

Date of Patent: October 13, 2020

Assignee: Adobe Inc.

Inventors: David M. Weinstein, Matvey Kapilevich, Harleen S. Sahni, Margarita R. Savova, Nicholas M. Jordan, David A. Jared
UTILIZING ONE HASH PERMUTATION AND POPULATED-VALUE-SLOT-BASED DENSIFICATION FOR GENERATING AUDIENCE SEGMENT TRAIT RECOMMENDATIONS

Publication number: 20200314472

Abstract: The present disclosure relates to training a recommendation model to generate trait recommendations using one permutation hashing and populated-value-slot-based densification. In particular, the disclosed systems can train the recommendation model by computing sketch vectors corresponding to traits using one permutation hashing. The disclosed systems can then fill in unpopulated value slots of the sketch vectors using populated-value-slot-based densification. The disclosed systems can combine the resulting densified sketches to generate the trained recommendation model. For example, in some embodiments, the disclosed systems can combine the sketches by generating a plurality of locality sensitive hashing tables based on the sketches. In some embodiments, the disclosed systems generate a count sketch matrix based on the sketches and generate trait embeddings based on the count sketch matrix using spectral embedding.

Type: Application

Filed: March 28, 2019

Publication date: October 1, 2020

Inventors: Anup Rao, Yasin Abbasi Yadkori, Tung Mai, Ryan Rossi, Ritwik Sinha, Matvey Kapilevich, Alexandru Ionut Hodorogea
Attributing online activities to internet protocol addresses of routers for customizing content to different networks

Patent number: 10477382

Abstract: Certain embodiments involve associating a device with an identifier of a router that is assigned an IP address based on DHCP. For example, connection data that includes the IP address is analyzed to associate devices with the identifier of the router. A determination is made as to whether any usage of the IP address by the devices overlap in time. Devices with overlapping usage are determined to be connecting to online resources via a same router, while devices with non-overlapping usage are determined to be connecting from other routers. A single router identifier is associated with the devices using the same router. This identifier is then used to track the online activity of the associated devices.

Type: Grant

Filed: April 18, 2018

Date of Patent: November 12, 2019

Assignee: Adobe Inc.

Inventor: Matvey Kapilevich
ATTRIBUTING ONLINE ACTIVITIES TO INTERNET PROTOCOL ADDRESSES OF ROUTERS FOR CUSTOMIZING CONTENT TO DIFFERENT NETWORKS

Publication number: 20190327600

Abstract: Certain embodiments involve associating a device with an identifier of a router that is assigned an IP address based on DHCP. For example, connection data that includes the IP address is analyzed to associate devices with the identifier of the router. A determination is made as to whether any usage of the IP address by the devices overlap in time. Devices with overlapping usage are determined to be connecting to online resources via a same router, while devices with non-overlapping usage are determined to be connecting from other routers. A single router identifier is associated with the devices using the same router. This identifier is then used to track the online activity of the associated devices.

Type: Application

Filed: April 18, 2018

Publication date: October 24, 2019

Inventor: Matvey Kapilevich
Tunable algorithmic segments

Patent number: 10373197

Abstract: Tunable algorithmic segment techniques are described. In one or more implementations, a target audience definition is obtained that is input to initiate creation of a look-alike model. The target audience definition indicates traits associated with a baseline group of consumers who have interacted with online resources in a designated manner, such as by buying a product, visiting a website, using a service, and so forth. Tuning parameters designated for the look-alike model are ascertained and the look-alike model is built based on the target audience definition and the tuning parameters. The tuning parameters may include at least a setting selectable to control reach versus accuracy for the look-alike model. Segment data indicative of market segments generated according to the look-alike model may then be exposed for manipulation by a client. The manipulation may include selectable control over the tuning parameters to generate different look-alike groups from the segment data.

Type: Grant

Filed: December 24, 2012

Date of Patent: August 6, 2019

Assignee: Adobe Inc.

Inventors: Nicholas M. Jordon, Margarita R. Savova, Matvey Kapilevich, Paul Mackles, David M. Weinstein
METHODS AND SYSTEMS FOR IDENTIFYING MULTIPLE DEVICES BELONGING TO A SINGLE USER BY MERGING DETERMINISTIC AND PROBABILISTIC DATA TO GENERATE A CROSS DEVICE DATA STRUCTURE

Publication number: 20190197072

Abstract: Systems and methods are disclosed for clustering multiple devices that are associated with particular users by utilizing both probabilistic and deterministic data derived from analytics information on the users. An analytics computing system generates at least one deterministic device cluster that groups a first set of devices associated with a first user. The first set of devices share deterministic user identifiers specific to the first user. The analytics computing system also identifies a probabilistic link between a device in the first set of devices and additional devices. The probabilistic link indicates common usage patterns between two devices. Based on the probabilistic link, the analytics computing system generates a data structure that includes the deterministic device cluster and the additional devices.

Type: Application

Filed: February 27, 2019

Publication date: June 27, 2019

Inventors: Karthik Raman, Nedim Lipka, Matvey Kapilevich
Methods and systems for identifying multiple devices belonging to a single user by merging deterministic and probabilistic data to generate a cross device data structure

Patent number: 10255371

Abstract: Systems and methods are disclosed for clustering multiple devices that are associated with particular users by utilizing both probabilistic and deterministic data derived from analytics information on the users. An analytics computing system generates at least one deterministic device cluster that groups a first set of devices associated with a first user. The first set of devices share deterministic user identifiers specific to the first user. The analytics computing system also identifies a probabilistic link between a device in the first set of devices and additional devices. The probabilistic link indicates common usage patterns between two devices. Based on the probabilistic link, the analytics computing system generates a data structure that includes the deterministic device cluster and the additional devices.

Type: Grant

Filed: September 19, 2016

Date of Patent: April 9, 2019

Assignee: Adobe Systems Incorporated

Inventors: Karthik Raman, Nedim Lipka, Matvey Kapilevich
METHODS AND SYSTEMS FOR IDENTIFYING MULTIPLE DEVICES BELONGING TO A SINGLE USER BY MERGING DETERMINISTIC AND PROBABILISTIC DATA TO GENERATE A CROSS DEVICE DATA STRUCTURE

Publication number: 20180081960

Abstract: Systems and methods are disclosed for clustering multiple devices that are associated with particular users by utilizing both probabilistic and deterministic data derived from analytics information on the users. An analytics computing system generates at least one deterministic device cluster that groups a first set of devices associated with a first user. The first set of devices share deterministic user identifiers specific to the first user. The analytics computing system also identifies a probabilistic link between a device in the first set of devices and additional devices. The probabilistic link indicates common usage patterns between two devices. Based on the probabilistic link, the analytics computing system generates a data structure that includes the deterministic device cluster and the additional devices.

Type: Application

Filed: September 19, 2016

Publication date: March 22, 2018

Inventors: Karthik Raman, Nedim Lipka, Matvey Kapilevich
Tunable Algorithmic Segments

Publication number: 20140180804

Abstract: Tunable algorithmic segment techniques are described. In one or more implementations, a target audience definition is obtained that is input to initiate creation of a look-alike model. The target audience definition indicates traits associated with a baseline group of consumers who have interacted with online resources in a designated manner, such as by buying a product, visiting a website, using a service, and so forth. Tuning parameters designated for the look-alike model are ascertained and the look-alike model is built based on the target audience definition and the tuning parameters. The tuning parameters may include at least a setting selectable to control reach versus accuracy for the look-alike model. Segment data indicative of market segments generated according to the look-alike model may then be exposed for manipulation by a client. The manipulation may include selectable control over the tuning parameters to generate different look-alike groups from the segment data.

Type: Application

Filed: December 24, 2012

Publication date: June 26, 2014

Applicant: ADOBE SYSTEMS INCORPORATED

Inventors: Nicholas M. Jordan, Margarita R. Savova, Matvey Kapilevich, Paul Mackles, David M. Weinstein

1 2 next