Patents by Inventor Furu Wei

Furu Wei has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11877016
    Abstract: The present disclosure provides a technical solution of live comments generating, which may acquire candidate texts highly similar with segments of video as live comments of corresponding segments by matching the candidate texts with the segments, and further generate new live comments based on video segments and existed live comments to enrich the live comments information of related video.
    Type: Grant
    Filed: March 16, 2020
    Date of Patent: January 16, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Lei Cui, Furu Wei, Shaohan Huang, Ming Zhou
  • Publication number: 20230368782
    Abstract: Systems and methods are provided for training a machine learning model to learn speech representations. Labeled speech data or both labeled and unlabeled data sets is applied to a feature extractor of a machine learning model to generate latent speech representations. The latent speech representations are applied to a quantizer to generate quantized latent speech representations and to a transformer context network to generate contextual representations. Each contextual representation included in the contextual representations is aligned with a phoneme label to generate phonetically-aware contextual representations. Quantized latent representations are aligned with phoneme labels to generate phonetically aware latent speech representations.
    Type: Application
    Filed: July 3, 2023
    Publication date: November 16, 2023
    Inventors: Yao QIAN, Yu WU, Kenichi KUMATANI, Shujie LIU, Furu WEI, Nanshan ZENG, Xuedong David HUANG, Chengyi WANG
  • Publication number: 20230315969
    Abstract: A system and method for converting a document is described. The system accesses a document comprising one or more section breaks. The system detects sections of the text document demarked by the one or more section breaks and generates a section title metadata and a section summary metadata for each section of the plurality of sections. The system inserts the section title metadata and the section summary metadata at the corresponding section breaks in the text document. The system modifies the text document into slides. Each slide being formed for each section based on the corresponding section title metadata and the section summary metadata. The system generates a presentation document based on the slides.
    Type: Application
    Filed: May 20, 2021
    Publication date: October 5, 2023
    Inventors: Tomasz L. Religa, Utsab Bose, Si-Qing CHEN, Lei CUI, Tao Ge, Huitian JIAO, Ravi Mandliya, Kaushik Ramaiah Narayanan, Max Wang, Furu WEI
  • Patent number: 11735171
    Abstract: Systems and methods are provided for training a machine learning model to learn speech representations. Labeled speech data or both labeled and unlabeled data sets is applied to a feature extractor of a machine learning model to generate latent speech representations. The latent speech representations are applied to a quantizer to generate quantized latent speech representations and to a transformer context network to generate contextual representations. Each contextual representation included in the contextual representations is aligned with a phoneme label to generate phonetically-aware contextual representations. Quantized latent representations are aligned with phoneme labels to generate phonetically aware latent speech representations.
    Type: Grant
    Filed: May 14, 2021
    Date of Patent: August 22, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yao Qian, Yu Wu, Kenichi Kumatani, Shujie Liu, Furu Wei, Nanshan Zeng, Xuedong David Huang, Chengyi Wang
  • Patent number: 11727270
    Abstract: A method and system for training a text-to-content recommendation ML model includes training a first ML model using a first training data set, utilizing the trained first ML model to infer information about the data contained in the first training data set, collecting the inferred information to generate a second training data set, and utilizing the first training data set and the second training data set to train a second ML model. The second ML model may be a text-to-content recommendation ML model.
    Type: Grant
    Filed: February 24, 2020
    Date of Patent: August 15, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ji Li, Amit Srivastava, Xingxing Zhang, Furu Wei, Ming Zhou
  • Patent number: 11705096
    Abstract: Implementations of the subject matter described herein provide a solution that enables a machine to automatically generate a melody. In this solution, user emotion and/or environment information is used to select a first melody feature parameter from a plurality of melody feature parameters, wherein each of the plurality of melody feature parameters corresponds to a music style of one of a plurality of reference melodies. The first melody feature parameter is further used to generate a first melody that conforms to the music style and is different from the reference melody. Thus, a melody that matches user emotions and/or environmental information may be automatically created.
    Type: Grant
    Filed: May 24, 2019
    Date of Patent: July 18, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Shaohan Huang, Lei Cui, Tao Ge, Furu Wei, Ming Zhou
  • Publication number: 20230206670
    Abstract: There is provided a solution for semantic representation of text in a document. In this solution, textual information comprising a sequence of text elements (220) and layout information (230) of the text element are determined from a document. The layout information (230) indicates a spatial arrangement of the plurality of text elements (220) presented within the document. Based at least in part on the plurality of text elements (220) and the layout information (230), respective semantic feature representations (180) of the plurality of text elements (220) are generated. By jointly using both the textual information and the layout information (230), rich semantics of the text elements (220) in the document can be effectively captured in the feature representations.
    Type: Application
    Filed: June 12, 2020
    Publication date: June 29, 2023
    Inventors: Lei Cui, Shaohan HUANG, Li DONG, Furu WEI
  • Publication number: 20230129314
    Abstract: A data processing system implements obtaining first textual content in a first language from a first client device; determining that the first language is supported by a first machine learning model; obtaining a guard list of prohibited terms associated with the first language; determining that the textual content does not include one or more prohibited terms associated based on the guard list; providing the first textual content as an input to the first machine learning model responsive to the textual content not including the one or more prohibited terms; analyzing the first textual content with the first machine learning model to obtain a first content recommendation; obtaining a first content recommendation policy that identifies content associated with the first language that may not be provided as a content recommendation; determining that the first content recommendation is not prohibited; and providing the first content recommendation to the first client device.
    Type: Application
    Filed: October 26, 2021
    Publication date: April 27, 2023
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Ji LI, Amit SRIVASTAVA, XingXing ZHANG, Furu WEI
  • Publication number: 20230076387
    Abstract: Methods and systems for linking comments to portions of content items. An example computing device receives information associated with a content item produced by a source system, the content item being accessible to other the computing devices via a network and receives a comment associated with the content item, the comment produced by one of the other computing devices. In response to receiving the information and the comment, the computing device predicts a subsection of the content item to link to the received comment based at least on details associated with the content item and the comment, then makes information associated with the predicted subsection of the content item available to other computing devices requesting access to the content item.
    Type: Application
    Filed: October 27, 2022
    Publication date: March 9, 2023
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Furu WEI, Ming ZHOU, Yang LIU, Ziqiang CAO, Shaohan HUANG, Li DONG, Lei CUI
  • Patent number: 11516159
    Abstract: Methods and systems for linking comments to portions of content items. An example computing device receives information associated with a content item produced by a source system, the content item being accessible to other the computing devices via a network and receives a comment associated with the content item, the comment produced by one of the other computing devices. In response to receiving the information and the comment, the computing device predicts a subsection of the content item to link to the received comment based at least on details associated with the content item and the comment, then makes information associated with the predicted subsection of the content item available to other computing devices requesting access to the content item.
    Type: Grant
    Filed: May 29, 2015
    Date of Patent: November 29, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Furu Wei, Ming Zhou, Yang Liu, Ziqiang Cao, Shaohan Huang, Li Dong, Lei Cui
  • Publication number: 20220366898
    Abstract: Systems and methods are provided for training a machine learning model to learn speech representations. Labeled speech data or both labeled and unlabeled data sets is applied to a feature extractor of a machine learning model to generate latent speech representations. The latent speech representations are applied to a quantizer to generate quantized latent speech representations and to a transformer context network to generate contextual representations. Each contextual representation included in the contextual representations is aligned with a phoneme label to generate phonetically-aware contextual representations. Quantized latent representations are aligned with phoneme labels to generate phonetically aware latent speech representations.
    Type: Application
    Filed: May 14, 2021
    Publication date: November 17, 2022
    Inventors: Yao QIAN, Yu WU, Kenichi KUMATANI, Shujie LIU, Furu WEI, Nanshan ZENG, Xuedong David HUANG, Chengyi WANG
  • Patent number: 11455466
    Abstract: A method and system for providing an application-specific embedding for an entire text-to-content suggestions service is disclosed. The method includes accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an unsupervised ML model to the dataset to generate a pretrained embedding; and utilizing the pretrained embedding to train the text-to-content suggestion ML model utilized by the application.
    Type: Grant
    Filed: May 1, 2019
    Date of Patent: September 27, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Xingxing Zhang, Ji Li, Furu Wei, Ming Zhou, Amit Srivastava
  • Patent number: 11429787
    Abstract: Method and system for training a text-to-content suggestion ML model include accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an ML model to the dataset to generate a pretrained embedding, and applying a supervised ML model to a labeled dataset to train the text-to-content suggestion ML model utilized by the application by utilizing the pretrained embedding generated by the supervised ML model.
    Type: Grant
    Filed: May 1, 2019
    Date of Patent: August 30, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ji Li, Xingxing Zhang, Furu Wei, Ming Zhou, Amit Srivastava
  • Publication number: 20220141503
    Abstract: The present disclosure provides a technical solution of live comments generating, which may acquire candidate texts highly similar with segments of video as live comments of corresponding segments by matching the candidate texts with the segments, and further generate new live comments based on video segments and existed live comments to enrich the live comments information of related video.
    Type: Application
    Filed: March 16, 2020
    Publication date: May 5, 2022
    Inventors: Lei CUI, Furu WEI, Shaohan HUANG, Ming ZHOU
  • Publication number: 20210334457
    Abstract: A method and system for providing an application-specific embedding for an entire text-to-content suggestions service is disclosed. The method includes accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an unsupervised ML model to the dataset to generate a pretrained embedding; and utilizing the pretrained embedding to train the text-to-content suggestion ML model utilized by the application.
    Type: Application
    Filed: May 1, 2019
    Publication date: October 28, 2021
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Xingxing ZHANG, Ji LI, Furu WEI, Ming ZHOU, Amit SRIVASTAVA
  • Publication number: 20210334708
    Abstract: Method and system for training a text-to-content suggestion ML model include accessing a dataset containing unlabeled training data collected from an application, the unlabeled training data being collected under user privacy constraints, applying an ML model to the dataset to generate a pretrained embedding, and applying a supervised ML model to a labeled dataset to train the text-to-content suggestion ML model utilized by the application by utilizing the pretrained embedding generated by the supervised ML model.
    Type: Application
    Filed: May 1, 2019
    Publication date: October 28, 2021
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Ji LI, Xingxing ZHANG, Furu WEI, Ming ZHOU, Amit SRIVASTAVA
  • Publication number: 20210264106
    Abstract: A method and system for training a text-to-content recommendation ML model includes training a first ML model using a first training data set, utilizing the trained first ML model to infer information about the data contained in the first training data set, collecting the inferred information to generate a second training data set, and utilizing the first training data set and the second training data set to train a second ML model. The second ML model may be a text-to-content recommendation ML model.
    Type: Application
    Filed: February 24, 2020
    Publication date: August 26, 2021
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Ji LI, Amit SRIVASTAVA, Xingxing ZHANG, Furu WEI, Ming ZHOU
  • Publication number: 20210166014
    Abstract: According to implementations of the subject matter described herein, there is provided a solution for generating a summary of a document. In this solution, feature information of pages comprised in a document is extracted, which characterizes at least one type of content contained in each page. Respective importance of the pages is determined at least based on the extracted feature information. A summary of the document is generated for the document by selecting a predetermined number of pages less than the number of the pages based on the respective importance. Through the solution, instead of providing all the pages, pages containing important content may be determined automatically to serve as the summary of the document. This summary allows the user to learn quickly main content of the document, shorten the time consumed in browsing all documents and/or facilitate location of a document of interest as soon as possible.
    Type: Application
    Filed: June 17, 2019
    Publication date: June 3, 2021
    Inventors: Xingxing ZHANG, Shaohan HUANG, Lei CUI, Tao GE, Furu WEI, Ming ZHOU
  • Publication number: 20210158790
    Abstract: Implementations of the subject matter described herein provide a solution that enables a machine to automatically generate a melody. In this solution, user emotion and/or environment information is used to select a first melody feature parameter from a plurality of melody feature parameters, wherein each of the plurality of melody feature parameters corresponds to a music style of one of a plurality of reference melodies. The first melody feature parameter is further used to generate a first melody that conforms to the music style and is different from the reference melody. Thus, a melody that matches user emotions and/or environmental information may be automatically created.
    Type: Application
    Filed: May 24, 2019
    Publication date: May 27, 2021
    Inventors: Shaohan HUANG, Lei CUI, Tao GE, Furu WEI, Ming ZHOU
  • Publication number: 20210142002
    Abstract: In embodiments of the present disclosure, there is provided a method of generating a slide for presentation. Upon a target passage for presentation is obtained, a plurality of sentences are generated based on the target passage, and a label associated with each sentence and an icon corresponding to each label are determined. Then, the sentences, labels and icons are displayed in association in a user interface of an application for presentation. According to embodiments of the present disclosure, the illustrated slides can be automatically generated for a passage to be presented, which can improve efficiency of slide making and improve user experience for slide presentation.
    Type: Application
    Filed: June 18, 2019
    Publication date: May 13, 2021
    Inventors: Lei Cui, Shaohan HUANG, Xingxing ZHANG, Furu WEI, Ming ZHOU