Patents by Inventor Patrice Y. Simard

Patrice Y. Simard has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

INTERACTIVE CONCEPT EDITING IN COMPUTER-HUMAN INTERACTIVE LEARNING

Publication number: 20240135098

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Application

Filed: December 1, 2023

Publication date: April 25, 2024

Inventors: Patrice Y. SIMARD, David G. GRANGIER, Leon BOTTOU, Saleema A. AMERSHI
Interactive semantic data exploration for error discovery

Patent number: 11803763

Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.

Type: Grant

Filed: February 3, 2022

Date of Patent: October 31, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
INTERACTIVE SEMANTIC DATA EXPLORATION FOR ERROR DISCOVERY

Publication number: 20220156598

Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations may of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.

Type: Application

Filed: February 3, 2022

Publication date: May 19, 2022

Inventors: Gonzalo A Ramos, Jin A. Suh, Johannes H. Verwey, Patrice Y. Simard, Steven M. Drucker, Nan-Chen Chen
Interactive semantic data exploration for error discovery

Patent number: 11270211

Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations may of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.

Type: Grant

Filed: February 5, 2018

Date of Patent: March 8, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
Interactive feature selection for training a machine learning system and displaying discrepancies within the context of the document

Patent number: 11023677

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Grant

Filed: July 13, 2016

Date of Patent: June 1, 2021

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Aparna Lakshmiratan, Saleema A. Amershi
INTERACTIVE SEMANTIC DATA EXPLORATION FOR ERROR DISCOVERY

Publication number: 20190244113

Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations may of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.

Type: Application

Filed: February 5, 2018

Publication date: August 8, 2019

Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
Interactive concept editing in computer-human interactive learning

Patent number: 10372815

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Grant

Filed: November 8, 2013

Date of Patent: August 6, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Patrice Y. Simard, David G. Grangier, Leon Bottou, Saleema A. Amershi
Interactive Concept Editing In Computer-Human Interactive Learning

Publication number: 20190213252

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Application

Filed: March 19, 2019

Publication date: July 11, 2019

Inventors: PATRICE Y. SIMARD, DAVID G. GRANGIER, LEON BOTTOU, SALEEMA A. AMERSHI
Active machine learning

Patent number: 10262272

Abstract: Technologies are described herein for active machine learning. An active machine learning method can include initiating active machine learning through an active machine learning system configured to train an auxiliary machine learning model to produce at least one new labeled observation, refining a capacity of a target machine learning model based on the active machine learning, and retraining the auxiliary machine learning model with the at least one new labeled observation subsequent to refining the capacity of the target machine learning model. Additionally, the target machine learning model is a limited-capacity machine learning model according to the description provided herein.

Type: Grant

Filed: December 7, 2014

Date of Patent: April 16, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: David Maxwell Chickering, Christopher A. Meek, Patrice Y. Simard, Rishabh Krishnan Iyer
Error-driven feature ideation in machine learning

Patent number: 10068185

Abstract: Disclosed herein are technologies directed to a feature ideator. The feature ideator can initiate a classifier that analyzes a training set of data in a classification process. The feature ideator can generate one or more suggested features relating to errors generated during the classification process. The feature ideator can generate an output to cause the errors to be rendered in a format that provides for an interaction with a user. A user can review the summary of the errors or the individual errors and select one or more features to increase the accuracy of the classifier.

Type: Grant

Filed: December 7, 2014

Date of Patent: September 4, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventors: Saleema Amershi, Michael J. Brooks, Bongshin Lee, Steven M. Drucker, Patrice Y. Simard, Jin A. Suh, Ashish Kapoor
Feature completion in computer-human interactive learning

Patent number: 9779081

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Grant

Filed: April 21, 2016

Date of Patent: October 3, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Carlos Garcia Jurado Suarez
Backup and archival of selected items as a composite object

Patent number: 9594759

Abstract: An archive of items, which are computing data accessed by a user, is created at a semantic object level. The object archiving may group seemingly disparate items as a composite object, which may then be stored to enable retrieval by the user at a later point in time. The composite object may include metadata from the various items to enable identifying the composite object and providing retrieval capabilities. In some aspects, an archiving process may extract item data from an item that is accessed by a computing device. Next, the item may be selected by a schema for inclusion in a composite object when the item data meets criteria specified in the schema. The composite object(s) may then be stored in an object store as an archive.

Type: Grant

Filed: June 16, 2009

Date of Patent: March 14, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Elissa E.S. Murphy, Patrice Y. Simard, Navjot Virk, Kamal Jain, Mathew J. Dickson
Active labeling for computer-human interactive learning

Patent number: 9582490

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Grant

Filed: November 8, 2013

Date of Patent: February 28, 2017

Assignee: Microsoft Technolog Licensing, LLC

Inventors: Patrice Y. Simard, David Max Chickering, Aparna Lakshmiratan, Denis X. Charles, Leon Bottou
ACTIVE FEATURING IN COMPUTER-HUMAN INTERACTIVE LEARNING

Publication number: 20170039486

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Application

Filed: July 13, 2016

Publication date: February 9, 2017

Inventors: Patrice Y. SIMARD, David Max CHICKERING, David G. GRANGIER, Aparna LAKSHMIRATAN, Saleema A. AMERSHI
Interactive segment extraction in computer-human interactive learning

Patent number: 9489373

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Grant

Filed: November 8, 2013

Date of Patent: November 8, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Saleema A. Amershi, Aparna Lakshmiratan, Carlos Garcia Jurado Suarez
Active featuring in computer-human interactive learning

Patent number: 9430460

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Grant

Filed: November 8, 2013

Date of Patent: August 30, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Aparna Lakshmiratan, Saleema A. Amershi
FEATURE COMPLETION IN COMPUTER-HUMAN INTERACTIVE LEARNING

Publication number: 20160239761

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Application

Filed: April 21, 2016

Publication date: August 18, 2016

Inventors: PATRICE Y. SIMARD, DAVID MAX CHICKERING, DAVID G. GRANGIER, DENIS X. CHARLES, LEON BOTTOU, CARLOS GARCIA JURADO SUAREZ
Active Machine Learning

Publication number: 20160162802

Abstract: Technologies are described herein for active machine learning. An active machine learning method can include initiating active machine learning through an active machine learning system configured to train an auxiliary machine learning model to produce at least one new labeled observation, refining a capacity of a target machine learning model based on the active machine learning, and retraining the auxiliary machine learning model with the at least one new labeled observation subsequent to refining the capacity of the target machine learning model. Additionally, the target machine learning model is a limited-capacity machine learning model according to the description provided herein.

Type: Application

Filed: December 7, 2014

Publication date: June 9, 2016

Inventors: David Maxwell Chickering, Christopher A. Meek, Patrice Y. Simard, Rishabh Krishnan Iyer
ERROR-DRIVEN FEATURE IDEATION IN MACHINE LEARNING

Publication number: 20160162803

Abstract: Disclosed herein are technologies directed to a feature ideator. The feature ideator can initiate a classifier that analyzes a training set of data in a classification process. The feature ideator can generate one or more suggested features relating to errors generated during the classification process. The feature ideator can generate an output to cause the errors to be rendered in a format that provides for an interaction with a user. A user can review the summary of the errors or the individual errors and select one or more features to increase the accuracy of the classifier.

Type: Application

Filed: December 7, 2014

Publication date: June 9, 2016

Inventors: Saleema Amershi, Michael J. Brooks, Bongshin Lee, Steven M. Drucker, Patrice Y. Simard, Jin A. Suh, Ashish Kapoor
Feature completion in computer-human interactive learning

Patent number: 9355088

Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.

Type: Grant

Filed: November 8, 2013

Date of Patent: May 31, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Carlos Garcia Jurado Suarez

1 2 3 4 5 … next