Patents by Inventor Patrice Y. Simard

Patrice Y. Simard has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240135098
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Application
    Filed: December 1, 2023
    Publication date: April 25, 2024
    Inventors: Patrice Y. SIMARD, David G. GRANGIER, Leon BOTTOU, Saleema A. AMERSHI
  • Patent number: 11803763
    Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.
    Type: Grant
    Filed: February 3, 2022
    Date of Patent: October 31, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
  • Publication number: 20220156598
    Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations may of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.
    Type: Application
    Filed: February 3, 2022
    Publication date: May 19, 2022
    Inventors: Gonzalo A Ramos, Jin A. Suh, Johannes H. Verwey, Patrice Y. Simard, Steven M. Drucker, Nan-Chen Chen
  • Patent number: 11270211
    Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations may of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.
    Type: Grant
    Filed: February 5, 2018
    Date of Patent: March 8, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
  • Patent number: 11023677
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Grant
    Filed: July 13, 2016
    Date of Patent: June 1, 2021
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Aparna Lakshmiratan, Saleema A. Amershi
  • Publication number: 20190244113
    Abstract: Classification predictions made by a concept classifier may be interactively visualized and explored in a user interface that displays visual representations of a plurality of data items in a star coordinate space spanned by a plurality of anchor concepts each mapping the data items onto respective finite real-valued scores. Positions of the visual representations of the data items in the star coordinate space are based on the scores for the plurality of anchor concepts, and may be updated responsive to a user manipulating the anchor concepts in the user interface, e.g., by moving or modifying definitions of anchor concepts, or by adding or deleting anchor concepts. The visual representations may of the data items may reflect labels and/or classification predictions, and may be updated based on updated classification predictions following retraining the of the concept classifier based on added training data or new features.
    Type: Application
    Filed: February 5, 2018
    Publication date: August 8, 2019
    Inventors: Gonzalo A Ramos, Jin A Suh, Johannes H Verwey, Patrice Y Simard, Steven M Drucker, Nan-Chen Chen
  • Patent number: 10372815
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Grant
    Filed: November 8, 2013
    Date of Patent: August 6, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Patrice Y. Simard, David G. Grangier, Leon Bottou, Saleema A. Amershi
  • Publication number: 20190213252
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Application
    Filed: March 19, 2019
    Publication date: July 11, 2019
    Inventors: PATRICE Y. SIMARD, DAVID G. GRANGIER, LEON BOTTOU, SALEEMA A. AMERSHI
  • Patent number: 10262272
    Abstract: Technologies are described herein for active machine learning. An active machine learning method can include initiating active machine learning through an active machine learning system configured to train an auxiliary machine learning model to produce at least one new labeled observation, refining a capacity of a target machine learning model based on the active machine learning, and retraining the auxiliary machine learning model with the at least one new labeled observation subsequent to refining the capacity of the target machine learning model. Additionally, the target machine learning model is a limited-capacity machine learning model according to the description provided herein.
    Type: Grant
    Filed: December 7, 2014
    Date of Patent: April 16, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: David Maxwell Chickering, Christopher A. Meek, Patrice Y. Simard, Rishabh Krishnan Iyer
  • Patent number: 10068185
    Abstract: Disclosed herein are technologies directed to a feature ideator. The feature ideator can initiate a classifier that analyzes a training set of data in a classification process. The feature ideator can generate one or more suggested features relating to errors generated during the classification process. The feature ideator can generate an output to cause the errors to be rendered in a format that provides for an interaction with a user. A user can review the summary of the errors or the individual errors and select one or more features to increase the accuracy of the classifier.
    Type: Grant
    Filed: December 7, 2014
    Date of Patent: September 4, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Saleema Amershi, Michael J. Brooks, Bongshin Lee, Steven M. Drucker, Patrice Y. Simard, Jin A. Suh, Ashish Kapoor
  • Patent number: 9779081
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Grant
    Filed: April 21, 2016
    Date of Patent: October 3, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Carlos Garcia Jurado Suarez
  • Patent number: 9594759
    Abstract: An archive of items, which are computing data accessed by a user, is created at a semantic object level. The object archiving may group seemingly disparate items as a composite object, which may then be stored to enable retrieval by the user at a later point in time. The composite object may include metadata from the various items to enable identifying the composite object and providing retrieval capabilities. In some aspects, an archiving process may extract item data from an item that is accessed by a computing device. Next, the item may be selected by a schema for inclusion in a composite object when the item data meets criteria specified in the schema. The composite object(s) may then be stored in an object store as an archive.
    Type: Grant
    Filed: June 16, 2009
    Date of Patent: March 14, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Elissa E.S. Murphy, Patrice Y. Simard, Navjot Virk, Kamal Jain, Mathew J. Dickson
  • Patent number: 9582490
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Grant
    Filed: November 8, 2013
    Date of Patent: February 28, 2017
    Assignee: Microsoft Technolog Licensing, LLC
    Inventors: Patrice Y. Simard, David Max Chickering, Aparna Lakshmiratan, Denis X. Charles, Leon Bottou
  • Publication number: 20170039486
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Application
    Filed: July 13, 2016
    Publication date: February 9, 2017
    Inventors: Patrice Y. SIMARD, David Max CHICKERING, David G. GRANGIER, Aparna LAKSHMIRATAN, Saleema A. AMERSHI
  • Patent number: 9489373
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Grant
    Filed: November 8, 2013
    Date of Patent: November 8, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Saleema A. Amershi, Aparna Lakshmiratan, Carlos Garcia Jurado Suarez
  • Patent number: 9430460
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Grant
    Filed: November 8, 2013
    Date of Patent: August 30, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Aparna Lakshmiratan, Saleema A. Amershi
  • Publication number: 20160239761
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Application
    Filed: April 21, 2016
    Publication date: August 18, 2016
    Inventors: PATRICE Y. SIMARD, DAVID MAX CHICKERING, DAVID G. GRANGIER, DENIS X. CHARLES, LEON BOTTOU, CARLOS GARCIA JURADO SUAREZ
  • Publication number: 20160162802
    Abstract: Technologies are described herein for active machine learning. An active machine learning method can include initiating active machine learning through an active machine learning system configured to train an auxiliary machine learning model to produce at least one new labeled observation, refining a capacity of a target machine learning model based on the active machine learning, and retraining the auxiliary machine learning model with the at least one new labeled observation subsequent to refining the capacity of the target machine learning model. Additionally, the target machine learning model is a limited-capacity machine learning model according to the description provided herein.
    Type: Application
    Filed: December 7, 2014
    Publication date: June 9, 2016
    Inventors: David Maxwell Chickering, Christopher A. Meek, Patrice Y. Simard, Rishabh Krishnan Iyer
  • Publication number: 20160162803
    Abstract: Disclosed herein are technologies directed to a feature ideator. The feature ideator can initiate a classifier that analyzes a training set of data in a classification process. The feature ideator can generate one or more suggested features relating to errors generated during the classification process. The feature ideator can generate an output to cause the errors to be rendered in a format that provides for an interaction with a user. A user can review the summary of the errors or the individual errors and select one or more features to increase the accuracy of the classifier.
    Type: Application
    Filed: December 7, 2014
    Publication date: June 9, 2016
    Inventors: Saleema Amershi, Michael J. Brooks, Bongshin Lee, Steven M. Drucker, Patrice Y. Simard, Jin A. Suh, Ashish Kapoor
  • Patent number: 9355088
    Abstract: A collection of data that is extremely large can be difficult to search and/or analyze. Relevance may be dramatically improved by automatically classifying queries and web pages in useful categories, and using these classification scores as relevance features. A thorough approach may require building a large number of classifiers, corresponding to the various types of information, activities, and products. Creation of classifiers and schematizers is provided on large data sets. Exercising the classifiers and schematizers on hundreds of millions of items may expose value that is inherent to the data by adding usable meta-data. Some aspects include active labeling exploration, automatic regularization and cold start, scaling with the number of items and the number of classifiers, active featuring, and segmentation and schematization.
    Type: Grant
    Filed: November 8, 2013
    Date of Patent: May 31, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Patrice Y. Simard, David Max Chickering, David G. Grangier, Denis X. Charles, Leon Bottou, Carlos Garcia Jurado Suarez