Patents by Inventor Haoyuan Gao

Haoyuan Gao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240070445
    Abstract: The present disclosure discloses a data processing circuit, a data processing method, and related products. The data processing circuit is implemented as a computing apparatus included in a combined processing apparatus. The combined processing apparatus further includes an interface apparatus and other processing apparatus. The computing apparatus interacts with other processing apparatus to jointly complete a user specified computation operation. The combined processing apparatus further includes a storage apparatus. The storage apparatus is connected to the computing apparatus and other processing apparatus, respectively. The storage apparatus is used to store data of the computing apparatus and other processing apparatus. The solution disclosed in the present disclosure provides hardware implementation for operations related to structured sparsity, which can simplify processing and improve processing efficiency of a machine.
    Type: Application
    Filed: September 23, 2021
    Publication date: February 29, 2024
    Inventors: Yufeng GAO, Shibing ZHU, Haoyuan HE
  • Patent number: 10909329
    Abstract: Embodiments of a multimodal question answering (mQA) system are presented to answer a question about the content of an image. In embodiments, the model comprises four components: a Long Short-Term Memory (LSTM) component to extract the question representation; a Convolutional Neural Network (CNN) component to extract the visual representation; an LSTM component for storing the linguistic context in an answer, and a fusing component to combine the information from the first three components and generate the answer. A Freestyle Multilingual Image Question Answering (FM-IQA) dataset was constructed to train and evaluate embodiments of the mQA model. The quality of the generated answers of the mQA model on this dataset is evaluated by human judges through a Turing Test.
    Type: Grant
    Filed: April 25, 2016
    Date of Patent: February 2, 2021
    Assignee: Baidu USA LLC
    Inventors: Haoyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, Wei Xu
  • Patent number: 10706314
    Abstract: The present disclosure provides an image recognition method and apparatus, a device and a non-volatile computer storage medium. In embodiments of the present disclosure, it is feasible to obtain the to-be-recognized image of the designated space, then perform image segmentation processing for the to-be-recognized image, to obtain at least one area image of the designated space, and then perform image matching processing for each area image in said at least one area image, to obtain a reference image corresponding to said each area image, so that it is possible to perform recognition processing for said each area image according to image information of the reference image corresponding to said each area image to obtain article information of said each area image. The so doing does not require manual participation and exhibits simple operations and a high rate of correctness, and thereby improves the recognition efficiency and reliability.
    Type: Grant
    Filed: May 23, 2016
    Date of Patent: July 7, 2020
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Chen Zhao, Haoyuan Gao, Ji Liang
  • Publication number: 20190019053
    Abstract: The present disclosure provides an image recognition method and apparatus, a device and a non-volatile computer storage medium. In embodiments of the present disclosure, it is feasible to obtain the to-be-recognized image of the designated space, then perform image segmentation processing for the to-be-recognized image, to obtain at least one area image of the designated space, and then perform image matching processing for each area image in said at least one area image, to obtain a reference image corresponding to said each area image, so that it is possible to perform recognition processing for said each area image according to image information of the reference image corresponding to said each area image to obtain article information of said each area image. The so doing does not require manual participation and exhibits simple operations and a high rate of correctness, and thereby improves the recognition efficiency and reliability.
    Type: Application
    Filed: May 23, 2016
    Publication date: January 17, 2019
    Applicant: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Chen ZHAO, Haoyuan GAO, Ji LIANG
  • Patent number: 10146796
    Abstract: A method and an apparatus for photograph classification and storage by matching an image characteristic of a first photograph with an image characteristic of a second photograph in a directory, and calculating the similarity between the first photograph and the second photograph and presenting the first photograph and the second photograph in a front-end page as located in a same subdirectory when the similarity between the first photograph and the second photograph is larger than a preset threshold. The beneficial effects being that a number of similar images in a user's photo album can be sorted efficiently and placed into the same directory to facilitate the user's management and viewing of the photographs.
    Type: Grant
    Filed: January 14, 2015
    Date of Patent: December 4, 2018
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Haoyuan Gao, Xi Wu
  • Publication number: 20170154054
    Abstract: A method and an apparatus for photograph classification and storage by matching an image characteristic of a first photograph with an image characteristic of a second photograph in a directory, and calculating the similarity between the first photograph and the second photograph and presenting the first photograph and the second photograph in a front-end page as located in a same subdirectory when the similarity between the first photograph and the second photograph is larger than a preset threshold. The beneficial effects being that a number of similar images in a user's photo album can be sorted efficiently and placed into the same directory to facilitate the user's management and viewing of the photographs.
    Type: Application
    Filed: January 14, 2015
    Publication date: June 1, 2017
    Inventors: Haoyuan Gao, Xi Wu
  • Publication number: 20160342895
    Abstract: Embodiments of a multimodal question answering (mQA) system are presented to answer a question about the content of an image. In embodiments, the model comprises four components: a Long Short-Term Memory (LSTM) component to extract the question representation; a Convolutional Neural Network (CNN) component to extract the visual representation; an LSTM component for storing the linguistic context in an answer, and a fusing component to combine the information from the first three components and generate the answer. A Freestyle Multilingual Image Question Answering (FM-IQA) dataset was constructed to train and evaluate embodiments of the mQA model. The quality of the generated answers of the mQA model on this dataset is evaluated by human judges through a Turing Test.
    Type: Application
    Filed: April 25, 2016
    Publication date: November 24, 2016
    Applicant: Baidu USA LLC
    Inventors: Haoyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, Wei Xu