Patents by Inventor Patrice Y. Simard
Patrice Y. Simard has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240135098
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Application
Filed: December 1, 2023
Publication date: April 25, 2024
Inventors: Patrice Y. Simard, David G. Grangier, Leon Bottou, Saleema A. Amershi
-
Patent number: 11803763
Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining of the concept classifier based on added training data or new features.
Type: Grant
Filed: February 3, 2022
Date of Patent: October 31, 2023
Assignee: Microsoft Technology Licensing, LLC
Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
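The abstract above says item positions in the star coordinate space are based on the anchor-concept scores but does not give a formula. As a rough illustration only, the sketch below uses the conventional star-coordinates mapping, in which each anchor is an axis direction in the plane and an item sits at the score-weighted sum of those directions. The function name, the anchor angles, and the example scores are all hypothetical and are not taken from the patent.

```python
import math

def star_coordinates(scores, anchor_angles):
    """Place one item in 2-D as the sum of score * unit vector over anchors.

    Assumption: the standard star-coordinates layout, not necessarily the
    mapping claimed in the patent.
    """
    x = sum(s * math.cos(a) for s, a in zip(scores, anchor_angles))
    y = sum(s * math.sin(a) for s, a in zip(scores, anchor_angles))
    return (x, y)

# Three hypothetical anchor concepts, evenly spaced around the origin.
anchor_angles = [2 * math.pi * k / 3 for k in range(3)]

# Hypothetical per-anchor scores for two data items.
items = {
    "doc_a": [1.0, 0.0, 0.0],   # scores only on the first anchor
    "doc_b": [0.5, 0.5, 0.5],   # equal scores on all anchors
}

positions = {name: star_coordinates(s, anchor_angles) for name, s in items.items()}

# "Moving" an anchor in the interface amounts to changing its angle and
# recomputing every position.
moved_angles = [math.pi / 2] + anchor_angles[1:]
moved_positions = {name: star_coordinates(s, moved_angles) for name, s in items.items()}
```

With this mapping an item that scores on a single anchor lies along that anchor's axis, and an item with equal scores on evenly spaced anchors falls at the center, which is why repositioning anchors can help separate items that initially overlap.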
-
Publication number: 20220156598
Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining of the concept classifier based on added training data or new features.
Type: Application
Filed: February 3, 2022
Publication date: May 19, 2022
Inventors: Gonzalo A. Ramos, Jin A. Suh, Johannes H. Verwey, Patrice Y. Simard, Steven M. Drucker, Nan-Chen Chen
-
Patent number: 11270211
Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining of the concept classifier based on added training data or new features.
Type: Grant
Filed: February 5, 2018
Date of Patent: March 8, 2022
Assignee: Microsoft Technology Licensing, LLC
Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
-
Patent number: 11023677
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Grant
Filed: July 13, 2016
Date of Patent: June 1, 2021
Assignee: Microsoft Technology Licensing, LLC
Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Aparna Lakshmiratan, Saleema A. Amershi
-
Publication number: 20190244113
Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining of the concept classifier based on added training data or new features.
Type: Application
Filed: February 5, 2018
Publication date: August 8, 2019
Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
-
Patent number: 10372815
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Grant
Filed: November 8, 2013
Date of Patent: August 6, 2019
Assignee: Microsoft Technology Licensing, LLC
Inventors: Patrice Y. Simard, David G. Grangier, Leon Bottou, Saleema A. Amershi
-
Publication number: 20190213252
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Application
Filed: March 19, 2019
Publication date: July 11, 2019
Inventors: Patrice Y. Simard, David G. Grangier, Leon Bottou, Saleema A. Amershi
-
Patent number: 10262272
Abstract: Technologies are described herein for active machine learning. An active machine learning method can include initiating active machine learning through an active machine learning system configured to train an auxiliary machine learning model to produce at least one new labeled observation, refining a capacity of a target machine learning model based on the active machine learning, and retraining the auxiliary machine learning model with the at least one new labeled observation subsequent to refining the capacity of the target machine learning model. Additionally, the target machine learning model is a limited-capacity machine learning model according to the description provided herein.
Type: Grant
Filed: December 7, 2014
Date of Patent: April 16, 2019
Assignee: Microsoft Technology Licensing, LLC
Inventors: David Maxwell Chickering, Christopher A. Meek, Patrice Y. Simard, Rishabh Krishnan Iyer
-
Patent number: 10068185
Abstract: Disclosed herein are technologies directed to a feature ideator. The feature ideator can initiate a classifier that analyzes a training set of data in a classification process. The feature ideator can generate one or more suggested features relating to errors generated during the classification process. The feature ideator can generate an output to cause the errors to be rendered in a format that provides for an interaction with a user. A user can review the summary of the errors or the individual errors and select one or more features to increase the accuracy of the classifier.
Type: Grant
Filed: December 7, 2014
Date of Patent: September 4, 2018
Assignee: Microsoft Technology Licensing, LLC
Inventors: Saleema Amershi, Michael J. Brooks, Bongshin Lee, Steven M. Drucker, Patrice Y. Simard, Jin A. Suh, Ashish Kapoor
-
Patent number: 9779081
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Grant
Filed: April 21, 2016
Date of Patent: October 3, 2017
Assignee: Microsoft Technology Licensing, LLC
Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Carlos Garcia Jurado Suarez
-
Patent number: 9594759
Abstract: An archive of items, which are computing data accessed by a user, is created at a semantic object level. The object archiving may group seemingly disparate items as a composite object, which may then be stored to enable retrieval by the user at a later point in time. The composite object may include metadata from the various items to enable identifying the composite object and providing retrieval capabilities. In some aspects, an archiving process may extract item data from an item that is accessed by a computing device. Next, the item may be selected by a schema for inclusion in a composite object when the item data meets criteria specified in the schema. The composite object(s) may then be stored in an object store as an archive.
Type: Grant
Filed: June 16, 2009
Date of Patent: March 14, 2017
Assignee: Microsoft Technology Licensing, LLC
Inventors: Elissa E.S. Murphy, Patrice Y. Simard, Navjot Virk, Kamal Jain, Mathew J. Dickson
-
Patent number: 9582490
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Grant
Filed: November 8, 2013
Date of Patent: February 28, 2017
Assignee: Microsoft Technology Licensing, LLC
Inventors: Patrice Y. Simard, David Max Chickering, Aparna Lakshmiratan, Denis X. Charles, Leon Bottou
-
Publication number: 20170039486
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Application
Filed: July 13, 2016
Publication date: February 9, 2017
Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Aparna Lakshmiratan, Saleema A. Amershi
-
Patent number: 9489373
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Grant
Filed: November 8, 2013
Date of Patent: November 8, 2016
Assignee: Microsoft Technology Licensing, LLC
Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Saleema A. Amershi, Aparna Lakshmiratan, Carlos Garcia Jurado Suarez
-
Patent number: 9430460
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Grant
Filed: November 8, 2013
Date of Patent: August 30, 2016
Assignee: Microsoft Technology Licensing, LLC
Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Aparna Lakshmiratan, Saleema A. Amershi
-
Publication number: 20160239761
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Application
Filed: April 21, 2016
Publication date: August 18, 2016
Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Carlos Garcia Jurado Suarez
-
Publication number: 20160162802
Abstract: Technologies are described herein for active machine learning. An active machine learning method can include initiating active machine learning through an active machine learning system configured to train an auxiliary machine learning model to produce at least one new labeled observation, refining a capacity of a target machine learning model based on the active machine learning, and retraining the auxiliary machine learning model with the at least one new labeled observation subsequent to refining the capacity of the target machine learning model. Additionally, the target machine learning model is a limited-capacity machine learning model according to the description provided herein.
Type: Application
Filed: December 7, 2014
Publication date: June 9, 2016
Inventors: David Maxwell Chickering, Christopher A. Meek, Patrice Y. Simard, Rishabh Krishnan Iyer
-
Publication number: 20160162803
Abstract: Disclosed herein are technologies directed to a feature ideator. The feature ideator can initiate a classifier that analyzes a training set of data in a classification process. The feature ideator can generate one or more suggested features relating to errors generated during the classification process. The feature ideator can generate an output to cause the errors to be rendered in a format that provides for an interaction with a user. A user can review the summary of the errors or the individual errors and select one or more features to increase the accuracy of the classifier.
Type: Application
Filed: December 7, 2014
Publication date: June 9, 2016
Inventors: Saleema Amershi, Michael J. Brooks, Bongshin Lee, Steven M. Drucker, Patrice Y. Simard, Jin A. Suh, Ashish Kapoor
-
Patent number: 9355088
Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
Type: Grant
Filed: November 8, 2013
Date of Patent: May 31, 2016
Assignee: Microsoft Technology Licensing, LLC
Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Carlos Garcia Jurado Suarez