Patents by Inventor Xiatian ZHU
Xiatian ZHU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12322198Abstract: Method and system for building a machine learning model for finding visual targets from text queries, the method comprising the steps of receiving a set of training data comprising text attribute labelled images, wherein each image has more than one text attribute label. Receiving a first vector space comprising a mapping of words, the mapping defining relationships between words. Generating a visual feature vector space by grouping images of the set of training data having similar attribute labels. Mapping each attribute label within the training data set on to the first vector space to form a second vector space. Fusing the visual feature vector space and the second vector space to form a third vector space. Generating a similarity matching model from the third vector space.Type: GrantFiled: August 5, 2020Date of Patent: June 3, 2025Assignee: VERITONE, INC.Inventors: Shaogang Gong, Qi Dong, Xiatian Zhu
-
Patent number: 11837025Abstract: Broadly speaking, the present techniques relate to a method and apparatus for performing action recognition, and in particular to a computer-implemented method for performing action recognition on resource-constrained or lightweight devices such as smartphones. The ML model may be adjusted to achieve required accuracy and efficiency levels, while also taking into account the computational capability of the apparatus that is being used to implement the ML model. One way is to adjust the number of channels assigned to the first set of channels, i.e. the full temporal resolution channels. Another way is to adjust the point in the ML model where the temporal pooling layer or layers are applied.Type: GrantFiled: February 16, 2021Date of Patent: December 5, 2023Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Brais Martinez, Tao Xiang, Victor Augusto Escorcia, Juan Perez-Rua, Xiatian Zhu, Antoine Toisoul
-
Patent number: 11797824Abstract: An electronic apparatus and a method for controlling the electronic apparatus are disclosed. The method includes: obtaining a neural network model trained to detect an object corresponding to at least one class; obtaining a user command for detecting a first object corresponding to a first class; and based on the first object not corresponding to the at least one class, obtaining a new neural network model based on the neural network model and information of the first object.Type: GrantFiled: June 15, 2020Date of Patent: October 24, 2023Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Juan Manuel Perez Rua, Tao Xiang, Timothy Hospedales, Xiatian Zhu
-
Patent number: 11755888Abstract: A method for accelerating score-based generative models (SGM) is provided, including setting a frequency mask (R) and a space mask (A) and a target sampling iteration number (T); sampling an initial sample (x0); conducting iteration comprising steps as follows: sampling a noise term; applying a preconditioned diffusion sampling (PDS) operator (M) to the noise term and thus generate a preconditioned noise term; calculating a drift term; applying the transpose of the PDS operator (MT) and then applying the PDS operator (M) to the drift term, and thus generate a preconditioned drift term; diffusing the sample of each iteration (xt); and outputting the result.Type: GrantFiled: January 9, 2023Date of Patent: September 12, 2023Assignee: FUDAN UNIVERSITYInventors: Li Zhang, Hengyuan Ma, Xiatian Zhu, Jianfeng Feng
-
Publication number: 20230145150Abstract: Broadly speaking, the present techniques relate to a method and apparatus for performing action recognition, and in particular to a computer-implemented method for performing action recognition on resource-constrained or lightweight devices such as smartphones. The ML model may be adjusted to achieve required accuracy and efficiency levels, while also taking into account the computational capability of the apparatus that is being used to implement the ML model. One way is to adjust the number of channels assigned to the first set of channels, i.e. the full temporal resolution channels. Another way is to adjust the point in the ML model where the temporal pooling layer or layers are applied.Type: ApplicationFiled: February 16, 2021Publication date: May 11, 2023Applicant: SAMSUNG ELECTRONICS CO., LTD.Inventors: Brais MARTINEZ, Tao XIANG, Victor Augusto ESCORCIA, Juan PEREZ-RUA, Xiatian ZHU, Antoine TOISOUL
-
Publication number: 20220343626Abstract: Method and system for building a machine learning model for finding visual targets from text queries, the method comprising the steps of receiving a set of training data comprising text attribute labelled images, wherein each image has more than one text attribute label. Receiving a first vector space comprising a mapping of words, the mapping defining relationships between words. Generating a visual feature vector space by grouping images of the set of training data having similar attribute labels. Mapping each attribute label within the training data set on to the first vector space to form a second vector space. Fusing the visual feature vector space and the second vector space to form a third vector space. Generating a similarity matching model from the third vector space.Type: ApplicationFiled: August 5, 2020Publication date: October 27, 2022Applicants: Vision Semantics Limited, Vision Semantics LimitedInventors: Shaogang Gong, Qi Dong, Xiatian Zhu
-
Patent number: 11430261Abstract: A computer implemented method and system for training a machine to identify a target within video data, the method comprising the steps of providing a training data set including identified labelled targets within video data having the same target within different video views. Generating, using a learning model, a bounding box action policy for determining required adjustments to a bounding box around a target in the video data by: generating a bounding box around a labelled target within a first view of the video data. Converting the target bounded by the bounding box to a quantitative representation. Determining a matching level between the quantitative representation and a quantitative representation of a further labelled target within the video data from a second view different to the first view. Looping the following steps one or more times, the looped steps comprising: using the bounding box action policy to determine an action to change the bounding box.Type: GrantFiled: July 17, 2018Date of Patent: August 30, 2022Assignee: Vision Semantics LimitedInventors: Shaogang Gong, Xiatian Zhu, Hanxiao Wang, Xu Lan
-
Publication number: 20210125026Abstract: An electronic apparatus and a method for controlling the electronic apparatus are disclosed. The method includes: obtaining a neural network model trained to detect an object corresponding to at least one class; obtaining a user command for detecting a first object corresponding to a first class; and based on the first object not corresponding to the at least one class, obtaining a new neural network model based on the neural network model and information of the first object.Type: ApplicationFiled: June 15, 2020Publication date: April 29, 2021Applicant: SAMSUNG ELECTRONICS CO., LTD.Inventors: Juan Manuel PEREZ RUA, Tao XIANG, Timothy HOSPEDALES, Xiatian ZHU
-
Publication number: 20200218888Abstract: A computer implemented method and system for training a machine to identify a target within video data, the method comprising the steps of providing a training data set including identified labelled targets within video data having the same target within different video views. Generating, using a learning model, a bounding box action policy for determining required adjustments to a bounding box around a target in the video data by: generating a bounding box around a labelled target within a first view of the video data. Converting the target bounded by the bounding box to a quantitative representation. Determining a matching level between the quantitative representation and a quantitative representation of a further labelled target within the video data from a second view different to the first view. Looping the following steps one or more times, the looped steps comprising: using the bounding box action policy to determine an action to change the bounding box.Type: ApplicationFiled: July 17, 2018Publication date: July 9, 2020Inventors: Shaogang GONG, Xiatian ZHU, Hanxiao WANG, Xu LANG