Patents by Inventor Jinfeng Yi

Jinfeng Yi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Location context inference based on user mobile data with uncertainty

Patent number: 10003923

Abstract: Methods and a system are provided that is performed by a computer server for inferring location context categories for a set of mobile users having at least two members. A method includes, for each mobile user in the set, obtaining at least one location context category therefor from publically available information responsive to uncertain mobile device location data. The method further includes applying multi-user collaborative machine learning to the at least one location context category for each mobile user in the set to infer a single refined location context category for each mobile user in the set.

Type: Grant

Filed: May 31, 2017

Date of Patent: June 19, 2018

Assignee: International Business Machines Corporation

Inventors: Hongfei Li, Anshul Sheopuri, Jinfeng Yi, Qi Yu
CUSTOMER PROFILE LEARNING BASED ON SEMI-SUPERVISED RECURRENT NEURAL NETWORK USING PARTIALLY LABELED SEQUENCE DATA

Publication number: 20170293836

Abstract: A method and system are provided. The method includes receiving by a computer having a processor and a memory, sequence data that includes labeled data and unlabeled data. The method further includes generating, by the computer having the processor and the memory, a recurrent neural network model of the sequence data, the recurrent neural network model having a recurrent layer and an aggregate layer. The recurrent neural network model feeds sequences generated from the recurrent layer into the aggregate layer for aggregation, stores temporal dependencies in the sequence data, and generates labels for at least some of the unlabeled data.

Type: Application

Filed: April 11, 2016

Publication date: October 12, 2017

Inventors: Hongfei Li, Anshul Sheopuri, Jinfeng Yi, Qi Yu
FEATURE LEARNING ON CUSTOMER JOURNEY USING CATEGORICAL SEQUENCE DATA

Publication number: 20170293919

Abstract: A method and system are provided. The method includes converting, by a computer having a processor and a memory, categorical sequence data for a customer journey into a numerical similarity matrix. The method further includes learning, by the computer, features of the customer journey by applying a distance metric learning based matrix factorization approach to the numerical similarity matrix.

Type: Application

Filed: April 11, 2016

Publication date: October 12, 2017

Inventors: Hongfei Li, Anshul Sheopuri, Jinfeng Yi, Qi Yu
LOCATION CONTEXT INFERENCE BASED ON USER MOBILE DATA WITH UNCERTAINTY

Publication number: 20170272908

Abstract: Methods and a system are provided that is performed by a computer server for inferring location context categories for a set of mobile users having at least two members. A method includes, for each mobile user in the set, obtaining at least one location context category therefor from publically available information responsive to uncertain mobile device location data. The method further includes applying multi-user collaborative machine learning to the at least one location context category for each mobile user in the set to infer a single refined location context category for each mobile user in the set.

Type: Application

Filed: May 31, 2017

Publication date: September 21, 2017

Inventors: Hongfei Li, Anshul Sheopuri, Jinfeng Yi, Qi Yu
Location context inference based on user mobile data with uncertainty

Patent number: 9743243

Abstract: Methods and a system are provided that is performed by a computer server for inferring location context categories for a set of mobile users having at least two members. A method includes, for each mobile user in the set, obtaining at least one location context category therefor from publically available information responsive to uncertain mobile device location data. The method further includes applying multi-user collaborative machine learning to the at least one location context category for each mobile user in the set to infer a single refined location context category for each mobile user in the set.

Type: Grant

Filed: March 16, 2016

Date of Patent: August 22, 2017

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Hongfei Li, Anshul Sheopuri, Jinfeng Yi, Qi Yu
Large-scale data clustering with dynamic social context

Patent number: 9639598

Abstract: A system and method for dynamic, semi-supervised clustering comprises receiving data attributes, generating a set of ensemble partitions using the data attributes, forming a convex hull using the set of ensemble partitions, generating a simplex vector by performing ensemble clustering on the convex hull, receiving dynamic links, deriving an optimal simplex vector using the simplex vector and the dynamic links, computing a current optimal clustering result using the optimal simplex vector, and outputting the current optimal clustering result.

Type: Grant

Filed: July 31, 2014

Date of Patent: May 2, 2017

Assignee: International Business Machines Corporation

Inventors: Jun Wang, Jinfeng Yi
PRIVACY AND MODELING PRESERVED DATA SHARING

Publication number: 20160283735

Abstract: A system, method and computer program product for generating a classification model using original data that is sensitive or private to a data owner. The method includes: receiving, from one or more entities, a masked data set having masked data corresponding to the original sensitive data, and further including a masked feature label set for use in classifying the masked data contents; forming a shared data collection of the masked data and the masked feature label sets received; and training, by a second entity, a classification model from the shared masked data and feature label sets, wherein the classification model learned from the shared masked data and feature label sets is the same as a classification model learned from the original sensitive data. The sensitive features and labels cannot be reliably recovered even when both the masked data and the learning algorithm are known.

Type: Application

Filed: March 24, 2015

Publication date: September 29, 2016

Inventors: Jun Wang, Jinfeng Yi
PRIVACY AND MODELING PRESERVED DATA SHARING

Publication number: 20160283738

Abstract: A method for generating a classification model using original data that is sensitive or private to a data owner. The method includes: receiving, from one or more entities, a masked data set having masked data corresponding to the original sensitive data, and further including a masked feature label set for use in classifying the masked data contents; forming a shared data collection of the masked data and the masked feature label sets received; and training, by a second entity, a classification model from the shared masked data and feature label sets, wherein the classification model learned from the shared masked data and feature label sets is the same as a classification model learned from the original sensitive data. The sensitive features and labels cannot be reliably recovered even when both the masked data and the learning algorithm are known.

Type: Application

Filed: July 10, 2015

Publication date: September 29, 2016

Inventors: Jun Wang, Jinfeng Yi
Similar document detection and electronic discovery

Patent number: 9418144

Abstract: Systems and methods are disclosed for performing duplicate document analyses to identify texturally identical or similar documents, which may be electronic documents stored within an electronic discovery platform. A process is described which includes representing each of the documents, including a target document, as a relatively large n-tuple vector and also as a relatively small m-tuple vector, performing a series of calculations on the set of m-tuple vectors to identify a set of documents which are candidate near-duplicates to the target document, and then filtering the candidate set of near-duplicate documents based upon the distance of their n-tuple vectors from the n-tuple vector of the target document.

Type: Grant

Filed: September 24, 2015

Date of Patent: August 16, 2016

Assignee: STROZ FRIEDBERG, LLC

Inventors: Michael Sperling, Rong Jin, Illya Rayvych, Jianghong Li, Jinfeng Yi
LARGE-SCALE DATA CLUSTERING WITH DYNAMIC SOCIAL CONTEXT

Publication number: 20160034554

Abstract: A system and method for dynamic, semi-supervised clustering comprises receiving data attributes, generating a set of ensemble partitions using the data attributes, forming a convex hull using the set of ensemble partitions, generating a simplex vector by performing ensemble clustering on the convex hull, receiving dynamic links, deriving an optimal simplex vector using the simplex vector and the dynamic links, computing a current optimal clustering result using the optimal simplex vector, and outputting the current optimal clustering result.

Type: Application

Filed: July 31, 2014

Publication date: February 4, 2016

Applicant: International Business Machines Corporation

Inventors: Jun Wang, Jinfeng Yi
SIMILAR DOCUMENT DETECTION AND ELECTRONIC DISCOVERY

Publication number: 20160012061

Abstract: Systems and methods are disclosed for performing duplicate document analyses to identify texturally identical or similar documents, which may be electronic documents stored within an electronic discovery platform. A process is described which includes representing each of the documents, including a target document, as a relatively large n-tuple vector and also as a relatively small m-tuple vector, performing a series of calculations on the set of m-tuple vectors to identify a set of documents which are candidate near-duplicates to the target document, and then filtering the candidate set of near-duplicate documents based upon the distance of their n-tuple vectors from the n-tuple vector of the target document.

Type: Application

Filed: September 24, 2015

Publication date: January 14, 2016

Inventors: Michael Sperling, Rong Jin, Illya Rayvych, Jianghong Li, Jinfeng Yi
Similar document detection and electronic discovery

Patent number: 9208219

Abstract: Systems and methods are disclosed for performing duplicate document analyses to identify texturally identical or similar documents, which may be electronic documents stored within an electronic discovery platform. A process is described which includes representing each of the documents, including a target document, as a relatively large n-tuple vector and also as a relatively small m-tuple vector, performing a series of one-dimensional searches on the set of m-tuple vectors to identify a set of documents which are near-duplicates to the target document, and then filtering the near set of near duplicate documents based upon the distance of their n-tuple vectors from that of the target document.

Type: Grant

Filed: February 8, 2013

Date of Patent: December 8, 2015

Assignee: STROZ FRIEDBERG, LLC

Inventors: Michael Sperling, Rong Jin, Illya Rayvych, Jianghong Li, Jinfeng Yi

prev 1 2