Patents by Inventor Xiatian ZHU

Xiatian ZHU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Text based image search

Patent number: 12322198

Abstract: Method and system for building a machine learning model for finding visual targets from text queries, the method comprising the steps of receiving a set of training data comprising text attribute labelled images, wherein each image has more than one text attribute label. Receiving a first vector space comprising a mapping of words, the mapping defining relationships between words. Generating a visual feature vector space by grouping images of the set of training data having similar attribute labels. Mapping each attribute label within the training data set on to the first vector space to form a second vector space. Fusing the visual feature vector space and the second vector space to form a third vector space. Generating a similarity matching model from the third vector space.

Type: Grant

Filed: August 5, 2020

Date of Patent: June 3, 2025

Assignee: VERITONE, INC.

Inventors: Shaogang Gong, Qi Dong, Xiatian Zhu
Method and apparatus for action recognition

Patent number: 11837025

Abstract: Broadly speaking, the present techniques relate to a method and apparatus for performing action recognition, and in particular to a computer-implemented method for performing action recognition on resource-constrained or lightweight devices such as smartphones. The ML model may be adjusted to achieve required accuracy and efficiency levels, while also taking into account the computational capability of the apparatus that is being used to implement the ML model. One way is to adjust the number of channels assigned to the first set of channels, i.e. the full temporal resolution channels. Another way is to adjust the point in the ML model where the temporal pooling layer or layers are applied.

Type: Grant

Filed: February 16, 2021

Date of Patent: December 5, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Brais Martinez, Tao Xiang, Victor Augusto Escorcia, Juan Perez-Rua, Xiatian Zhu, Antoine Toisoul
Electronic apparatus and method for controlling thereof

Patent number: 11797824

Abstract: An electronic apparatus and a method for controlling the electronic apparatus are disclosed. The method includes: obtaining a neural network model trained to detect an object corresponding to at least one class; obtaining a user command for detecting a first object corresponding to a first class; and based on the first object not corresponding to the at least one class, obtaining a new neural network model based on the neural network model and information of the first object.

Type: Grant

Filed: June 15, 2020

Date of Patent: October 24, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Juan Manuel Perez Rua, Tao Xiang, Timothy Hospedales, Xiatian Zhu
Method and system for accelerating score-based generative models with preconditioned diffusion sampling

Patent number: 11755888

Abstract: A method for accelerating score-based generative models (SGM) is provided, including setting a frequency mask (R) and a space mask (A) and a target sampling iteration number (T); sampling an initial sample (x0); conducting iteration comprising steps as follows: sampling a noise term; applying a preconditioned diffusion sampling (PDS) operator (M) to the noise term and thus generate a preconditioned noise term; calculating a drift term; applying the transpose of the PDS operator (MT) and then applying the PDS operator (M) to the drift term, and thus generate a preconditioned drift term; diffusing the sample of each iteration (xt); and outputting the result.

Type: Grant

Filed: January 9, 2023

Date of Patent: September 12, 2023

Assignee: FUDAN UNIVERSITY

Inventors: Li Zhang, Hengyuan Ma, Xiatian Zhu, Jianfeng Feng
METHOD AND APPARATUS FOR ACTION RECOGNITION

Publication number: 20230145150

Abstract: Broadly speaking, the present techniques relate to a method and apparatus for performing action recognition, and in particular to a computer-implemented method for performing action recognition on resource-constrained or lightweight devices such as smartphones. The ML model may be adjusted to achieve required accuracy and efficiency levels, while also taking into account the computational capability of the apparatus that is being used to implement the ML model. One way is to adjust the number of channels assigned to the first set of channels, i.e. the full temporal resolution channels. Another way is to adjust the point in the ML model where the temporal pooling layer or layers are applied.

Type: Application

Filed: February 16, 2021

Publication date: May 11, 2023

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Brais MARTINEZ, Tao XIANG, Victor Augusto ESCORCIA, Juan PEREZ-RUA, Xiatian ZHU, Antoine TOISOUL
Text Based Image Search

Publication number: 20220343626

Abstract: Method and system for building a machine learning model for finding visual targets from text queries, the method comprising the steps of receiving a set of training data comprising text attribute labelled images, wherein each image has more than one text attribute label. Receiving a first vector space comprising a mapping of words, the mapping defining relationships between words. Generating a visual feature vector space by grouping images of the set of training data having similar attribute labels. Mapping each attribute label within the training data set on to the first vector space to form a second vector space. Fusing the visual feature vector space and the second vector space to form a third vector space. Generating a similarity matching model from the third vector space.

Type: Application

Filed: August 5, 2020

Publication date: October 27, 2022

Applicants: Vision Semantics Limited, Vision Semantics Limited

Inventors: Shaogang Gong, Qi Dong, Xiatian Zhu
Target re-identification

Patent number: 11430261

Abstract: A computer implemented method and system for training a machine to identify a target within video data, the method comprising the steps of providing a training data set including identified labelled targets within video data having the same target within different video views. Generating, using a learning model, a bounding box action policy for determining required adjustments to a bounding box around a target in the video data by: generating a bounding box around a labelled target within a first view of the video data. Converting the target bounded by the bounding box to a quantitative representation. Determining a matching level between the quantitative representation and a quantitative representation of a further labelled target within the video data from a second view different to the first view. Looping the following steps one or more times, the looped steps comprising: using the bounding box action policy to determine an action to change the bounding box.

Type: Grant

Filed: July 17, 2018

Date of Patent: August 30, 2022

Assignee: Vision Semantics Limited

Inventors: Shaogang Gong, Xiatian Zhu, Hanxiao Wang, Xu Lan
ELECTRONIC APPARATUS AND METHOD FOR CONTROLLING THEREOF

Publication number: 20210125026

Abstract: An electronic apparatus and a method for controlling the electronic apparatus are disclosed. The method includes: obtaining a neural network model trained to detect an object corresponding to at least one class; obtaining a user command for detecting a first object corresponding to a first class; and based on the first object not corresponding to the at least one class, obtaining a new neural network model based on the neural network model and information of the first object.

Type: Application

Filed: June 15, 2020

Publication date: April 29, 2021

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Juan Manuel PEREZ RUA, Tao XIANG, Timothy HOSPEDALES, Xiatian ZHU
Target Re-Identification

Publication number: 20200218888

Abstract: A computer implemented method and system for training a machine to identify a target within video data, the method comprising the steps of providing a training data set including identified labelled targets within video data having the same target within different video views. Generating, using a learning model, a bounding box action policy for determining required adjustments to a bounding box around a target in the video data by: generating a bounding box around a labelled target within a first view of the video data. Converting the target bounded by the bounding box to a quantitative representation. Determining a matching level between the quantitative representation and a quantitative representation of a further labelled target within the video data from a second view different to the first view. Looping the following steps one or more times, the looped steps comprising: using the bounding box action policy to determine an action to change the bounding box.

Type: Application

Filed: July 17, 2018

Publication date: July 9, 2020

Inventors: Shaogang GONG, Xiatian ZHU, Hanxiao WANG, Xu LANG