Patents by Inventor Junzhe Miao

Junzhe Miao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11461353
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system extracts text windows of varying length from text in one or more content items associated with an entity. Next, the system applies a machine learning model to features for the text windows to produce scores representing the likelihoods that the text windows contain addresses. The system then identifies, based on the scores and validation rules applied to the text windows, one of the text windows as an address for the entity. Finally, the system stores the selected text window as the address for the entity.
    Type: Grant
    Filed: December 27, 2018
    Date of Patent: October 4, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Paul D. Bergeron, Ted J. Tomlinson, Junzhe Miao, Gurbir Singh
  • Patent number: 11436532
    Abstract: The disclosed embodiments provide a system that identifies duplicate entities. During operation, the system selects training data for a first machine learning model based on confidence scores representing likelihoods that pairs of entities in an online system are duplicates. Next, the system updates parameters of the first machine learning model based on features and labels in the training data. The system then identifies a first subset of additional pairs of the entities as duplicate entities based on scores generated by the first machine learning model from values of the features for the additional pairs and a first threshold associated with the scores. The system also determines a canonical entity in each of the duplicate entities based on additional features. Finally, the system updates content outputted in a user interface of the online system based on the identified first subset of the additional pairs.
    Type: Grant
    Filed: December 4, 2019
    Date of Patent: September 6, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Tianhao Lu, Junzhe Miao, Yunpeng Xu, Dan Shacham, Hong H. Tam, Tao Xiong
  • Publication number: 20210173825
    Abstract: The disclosed embodiments provide a system that identifies duplicate entities. During operation, the system selects training data for a first machine learning model based on confidence scores representing likelihoods that pairs of entities in an online system are duplicates. Next, the system updates parameters of the first machine learning model based on features and labels in the training data. The system then identifies a first subset of additional pairs of the entities as duplicate entities based on scores generated by the first machine learning model from values of the features for the additional pairs and a first threshold associated with the scores. The system also determines a canonical entity in each of the duplicate entities based on additional features. Finally, the system updates content outputted in a user interface of the online system based on the identified first subset of the additional pairs.
    Type: Application
    Filed: December 4, 2019
    Publication date: June 10, 2021
    Inventors: Tianhao Lu, Junzhe Miao, Yunpeng Xu, Dan Shacham, Hong H. Tam, Tao Xiong
  • Publication number: 20200311156
    Abstract: Techniques of inferring an organic website URL of an organization based on a web search result are provided. A query that includes an organization name is sent to a search engine and a set of search results is received from the search engine as a result of the query. Each search result in the set of search results includes a URL for the organization website address. For each search result in the set of search results, a set of feature values that is associated with each search result is identified. The set of feature values is inputted to a prediction model that generates a prediction, and based on the prediction, a determination of whether to associate the URL of each search result with the organization name is made.
    Type: Application
    Filed: March 29, 2019
    Publication date: October 1, 2020
    Inventors: Junzhe Miao, Yunpeng Xu, Wenxuan Gao
  • Publication number: 20200210442
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system extracts text windows of varying length from text in one or more content items associated with an entity. Next, the system applies a machine learning model to features for the text windows to produce scores representing the likelihoods that the text windows contain addresses. The system then identifies, based on the scores and validation rules applied to the text windows, one of the text windows as an address for the entity. Finally, the system stores the selected text window as the address for the entity.
    Type: Application
    Filed: December 27, 2018
    Publication date: July 2, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Paul D. Bergeron, Ted J. Tomlinson, Junzhe Miao, Gurbir Singh