Patents by Inventor Xiatian ZHU

Xiatian ZHU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12322198
    Abstract: Method and system for building a machine learning model for finding visual targets from text queries, the method comprising the steps of receiving a set of training data comprising text attribute labelled images, wherein each image has more than one text attribute label. Receiving a first vector space comprising a mapping of words, the mapping defining relationships between words. Generating a visual feature vector space by grouping images of the set of training data having similar attribute labels. Mapping each attribute label within the training data set on to the first vector space to form a second vector space. Fusing the visual feature vector space and the second vector space to form a third vector space. Generating a similarity matching model from the third vector space.
    Type: Grant
    Filed: August 5, 2020
    Date of Patent: June 3, 2025
    Assignee: VERITONE, INC.
    Inventors: Shaogang Gong, Qi Dong, Xiatian Zhu
  • Patent number: 11837025
    Abstract: Broadly speaking, the present techniques relate to a method and apparatus for performing action recognition, and in particular to a computer-implemented method for performing action recognition on resource-constrained or lightweight devices such as smartphones. The ML model may be adjusted to achieve required accuracy and efficiency levels, while also taking into account the computational capability of the apparatus that is being used to implement the ML model. One way is to adjust the number of channels assigned to the first set of channels, i.e. the full temporal resolution channels. Another way is to adjust the point in the ML model where the temporal pooling layer or layers are applied.
    Type: Grant
    Filed: February 16, 2021
    Date of Patent: December 5, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Brais Martinez, Tao Xiang, Victor Augusto Escorcia, Juan Perez-Rua, Xiatian Zhu, Antoine Toisoul
  • Patent number: 11797824
    Abstract: An electronic apparatus and a method for controlling the electronic apparatus are disclosed. The method includes: obtaining a neural network model trained to detect an object corresponding to at least one class; obtaining a user command for detecting a first object corresponding to a first class; and based on the first object not corresponding to the at least one class, obtaining a new neural network model based on the neural network model and information of the first object.
    Type: Grant
    Filed: June 15, 2020
    Date of Patent: October 24, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Juan Manuel Perez Rua, Tao Xiang, Timothy Hospedales, Xiatian Zhu
  • Patent number: 11755888
    Abstract: A method for accelerating score-based generative models (SGM) is provided, including setting a frequency mask (R) and a space mask (A) and a target sampling iteration number (T); sampling an initial sample (x0); conducting iteration comprising steps as follows: sampling a noise term; applying a preconditioned diffusion sampling (PDS) operator (M) to the noise term and thus generate a preconditioned noise term; calculating a drift term; applying the transpose of the PDS operator (MT) and then applying the PDS operator (M) to the drift term, and thus generate a preconditioned drift term; diffusing the sample of each iteration (xt); and outputting the result.
    Type: Grant
    Filed: January 9, 2023
    Date of Patent: September 12, 2023
    Assignee: FUDAN UNIVERSITY
    Inventors: Li Zhang, Hengyuan Ma, Xiatian Zhu, Jianfeng Feng
  • Publication number: 20230145150
    Abstract: Broadly speaking, the present techniques relate to a method and apparatus for performing action recognition, and in particular to a computer-implemented method for performing action recognition on resource-constrained or lightweight devices such as smartphones. The ML model may be adjusted to achieve required accuracy and efficiency levels, while also taking into account the computational capability of the apparatus that is being used to implement the ML model. One way is to adjust the number of channels assigned to the first set of channels, i.e. the full temporal resolution channels. Another way is to adjust the point in the ML model where the temporal pooling layer or layers are applied.
    Type: Application
    Filed: February 16, 2021
    Publication date: May 11, 2023
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Brais MARTINEZ, Tao XIANG, Victor Augusto ESCORCIA, Juan PEREZ-RUA, Xiatian ZHU, Antoine TOISOUL
  • Publication number: 20220343626
    Abstract: Method and system for building a machine learning model for finding visual targets from text queries, the method comprising the steps of receiving a set of training data comprising text attribute labelled images, wherein each image has more than one text attribute label. Receiving a first vector space comprising a mapping of words, the mapping defining relationships between words. Generating a visual feature vector space by grouping images of the set of training data having similar attribute labels. Mapping each attribute label within the training data set on to the first vector space to form a second vector space. Fusing the visual feature vector space and the second vector space to form a third vector space. Generating a similarity matching model from the third vector space.
    Type: Application
    Filed: August 5, 2020
    Publication date: October 27, 2022
    Applicants: Vision Semantics Limited, Vision Semantics Limited
    Inventors: Shaogang Gong, Qi Dong, Xiatian Zhu
  • Patent number: 11430261
    Abstract: A computer implemented method and system for training a machine to identify a target within video data, the method comprising the steps of providing a training data set including identified labelled targets within video data having the same target within different video views. Generating, using a learning model, a bounding box action policy for determining required adjustments to a bounding box around a target in the video data by: generating a bounding box around a labelled target within a first view of the video data. Converting the target bounded by the bounding box to a quantitative representation. Determining a matching level between the quantitative representation and a quantitative representation of a further labelled target within the video data from a second view different to the first view. Looping the following steps one or more times, the looped steps comprising: using the bounding box action policy to determine an action to change the bounding box.
    Type: Grant
    Filed: July 17, 2018
    Date of Patent: August 30, 2022
    Assignee: Vision Semantics Limited
    Inventors: Shaogang Gong, Xiatian Zhu, Hanxiao Wang, Xu Lan
  • Publication number: 20210125026
    Abstract: An electronic apparatus and a method for controlling the electronic apparatus are disclosed. The method includes: obtaining a neural network model trained to detect an object corresponding to at least one class; obtaining a user command for detecting a first object corresponding to a first class; and based on the first object not corresponding to the at least one class, obtaining a new neural network model based on the neural network model and information of the first object.
    Type: Application
    Filed: June 15, 2020
    Publication date: April 29, 2021
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Juan Manuel PEREZ RUA, Tao XIANG, Timothy HOSPEDALES, Xiatian ZHU
  • Publication number: 20200218888
    Abstract: A computer implemented method and system for training a machine to identify a target within video data, the method comprising the steps of providing a training data set including identified labelled targets within video data having the same target within different video views. Generating, using a learning model, a bounding box action policy for determining required adjustments to a bounding box around a target in the video data by: generating a bounding box around a labelled target within a first view of the video data. Converting the target bounded by the bounding box to a quantitative representation. Determining a matching level between the quantitative representation and a quantitative representation of a further labelled target within the video data from a second view different to the first view. Looping the following steps one or more times, the looped steps comprising: using the bounding box action policy to determine an action to change the bounding box.
    Type: Application
    Filed: July 17, 2018
    Publication date: July 9, 2020
    Inventors: Shaogang GONG, Xiatian ZHU, Hanxiao WANG, Xu LANG