Patents by Inventor Juwei Lu

Juwei Lu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20260134643
    Abstract: Methods and devices for inpainting a three-dimensional scene in response to object removal. A 3D scene is stored in the form of Gaussian splats. When object removal occurs, a never-before-seen (NBS) area may be revealed that requires inpainting as a result of pruning of the Gaussian splats. Object masks are dilated and remapped to the pruned scene to identify pixels bordering the NBS area. A reference image is inpainted and the geography of its NBS area reconstructed using depth prediction and smoothing. The reference image is then warped to other viewpoints and the warped images and their inpainting masks are input to a multi-view restoration model created by modifying a pre-trained diffusion-based inpainting model to use sparse space-time attention layers to ensure consistency among views. Those refined views are then used to refine the pruned 3D Gaussian model.
    Type: Application
    Filed: April 8, 2025
    Publication date: May 14, 2026
    Inventors: Zhihao Shi, Dong Huo, Yuhongze Zhou, Yan Min, Xinxin Zuo, Juwei Lu
  • Patent number: 12524078
    Abstract: Methods and devices for machine vision-based selection of content are described. One or more hands are detected in a current frame of video data. A respective fingertip location is determined for each of up to two of the detected hands. A content selection gesture is determined corresponding to the up to two detected hands. Selected content is extracted, as indicated by the content selection gesture and based on the up to two fingertip locations. The device may be a smartphone, a tablet, a laptop, a smart light device, a reader device, etc.
    Type: Grant
    Filed: June 30, 2023
    Date of Patent: January 13, 2026
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Juwei Lu, Sayem Mohammad Siam, Deepak Sridhar, Sidharth Singla, Yannick Verdie, Xiaofei Wu, Srikanth Muralidharan, Zihao Yang, Peng Dai, Songcen Xu
  • Patent number: 12511866
    Abstract: Systems and methods for temporal action localization of video data are described. A feature representation extracted from video data has a temporal dimension and a spatial dimension. The feature representation is self-aligned in the spatial dimension. Spatial multi-sampling is performed to obtain a plurality of sparse samples of the self-aligned representation along the spatial dimension, and the multi-sampled representation is fused with the self-aligned representation. Attention-based context information aggregation is applied on the fused representation to obtain a spatially refined representation. Local temporal information aggregation is applied on the self-aligned representation to obtain a temporally refined representation. Action localization is performed on a concatenation of the spatially refined representation and the temporally refined representation.
    Type: Grant
    Filed: June 1, 2023
    Date of Patent: December 30, 2025
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Yanhui Guo, Deepak Sridhar, Peng Dai, Juwei Lu
  • Publication number: 20250371790
    Abstract: System, method, and computer readable medium for generating a 3D texture for a 3D object are disclosed. A 3D mesh and a text prompt for a desired texture are obtained. A sequence of texture sampling steps is performed, where each given texture sampling step includes iterating over a plurality of 2D views of the 3D mesh to generate an intermediate texture map. For a given iteration, a given 2D view and the text prompt are processed using a pre-trained 2D image generation diffusion model to fill in a portion of an intermediate texture map based on the given 2D view. A noise estimation generated by the diffusion model is refined, adding the intermediate texture map as guidance, to generate a latent variable to be inputted to a subsequent texture sampling step, enabling generation of a 3D texture, based on a text prompt, with fewer artifacts.
    Type: Application
    Filed: May 28, 2024
    Publication date: December 4, 2025
    Inventors: Dong HUO, Xinxin ZUO, Zhihao SHI, Peng DAI, Juwei LU, Songcen XU
  • Patent number: 12430905
    Abstract: Method and devices for training a keypoint estimation network are described. In each training iteration, synthetic images are generated by a generator, each synthetic image being assigned respective assigned keypoints by the generator. Using a prior-iteration of the keypoint estimation network, a set of predicted keypoints is obtained for each synthetic image. Based on an error score between the predicted keypoints and the assigned keypoints, poor quality synthetic images are discarded. The remaining synthetic images, together with real world images, are used to train an updated keypoint estimation network. The performance of the updated keypoint estimation network is validated, and the training iterations are performed until a convergence criteria is satisfied.
    Type: Grant
    Filed: May 11, 2023
    Date of Patent: September 30, 2025
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Xin Ding, Deepak Sridhar, Juwei Lu, Sidharth Singla, Peng Dai, Xiaofei Wu
  • Patent number: 12424029
    Abstract: Methods and devices are described for computer vision-based gesture detection. From a frame of image data, extracted locations of keypoints of a detected hand are obtained. The extracted locations are normalized to obtain normalized features. The normalized features are processed using a trained decision tree ensemble to generate a probability of a valid gesture for the detected hand. The generated probability is compared with a defined decision threshold to generate a binary classification to classify the detected hand as a valid gesture or invalid gesture.
    Type: Grant
    Filed: June 22, 2022
    Date of Patent: September 23, 2025
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Yannick Verdie, Zi Hao Yang, Deepak Sridhar, Juwei Lu
  • Publication number: 20250285236
    Abstract: Methods, devices, and processor-readable media for method for performing a 3D reconstruction from a single view image, comprising progressively denoising a randomly initialized set of 3D-gaussian representations with continuous guidance from an input image.
    Type: Application
    Filed: October 7, 2024
    Publication date: September 11, 2025
    Inventors: Yuxuan MU, Xinxin ZUO, Peng DAI, Juwei LU
  • Publication number: 20250239053
    Abstract: A computer-implemented method for vision transforming includes, for each of one or more channels of each of a set of tiles of an image: splitting the channel into at least a first channel portion and a second channel portion; processing the first channel portion using depthwise convolution; processing the second channel portion with multi-head self-attention; and combining the processed first channel portion and the processed second channel portion; and identifying an object in the image at least partially based on the combined processed first channel portion and processed second channel portion for each of the one or more channels of each of the set of tiles of the image.
    Type: Application
    Filed: January 19, 2024
    Publication date: July 24, 2025
    Inventors: Deepak SRIDHAR, Jizong PENG, Md Ibrahim KHALIL, Peng DAI, Juwei LU, Renjing PEI, Songcen XU
  • Patent number: 12217187
    Abstract: Methods, systems, and media for training deep neural networks for cross-domain few-shot classification are described. The methods comprise an encoder and a decoder of a deep neural network. The training of the autoencoder comprises two training stages. For each iteration in the first training stage, a batch of data samples from the source dataset are sampled and fed to the encoder to generate a plurality of source feature maps, then determining a first training stage loss, which updates the autoencoder's parameters. For each iteration in the second training stage, the novel dataset is split into a support set and a query set. The support set is fed to the encoder to determine a prototype for each class label. The query set is also fed to the encoder to calculate a query set metric classification loss. The query set metric classification loss updates the autoencoder's parameters.
    Type: Grant
    Filed: March 17, 2021
    Date of Patent: February 4, 2025
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Hanwen Liang, Peng Dai, Qiong Zhang, Juwei Lu
  • Patent number: 12148097
    Abstract: Methods and systems for estimation of a 3D hand pose are disclosed. A 2D image containing a detected hand is processed using a U-net network to obtain a global feature vector and a heatmap for the keypoints of the hand. Information from the global feature vector and the heatmap are concatenated to obtain a set of input tokens that are processed using a transformer encoder to obtain a first set of 2D keypoints representing estimated 2D locations of the keypoints in a first view. The first set of 2D keypoints are inputted as a query to a transformer decoder, to obtain a second set of 2D keypoints representing estimated 2D locations of the keypoints in a second view. The first and second sets of 2D keypoints are aggregated to output the set of estimated 3D keypoints.
    Type: Grant
    Filed: December 9, 2022
    Date of Patent: November 19, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Yannick Verdie, Zihao Yang, Deepak Sridhar, Steven George McDonagh, Juwei Lu
  • Patent number: 12093465
    Abstract: Methods and systems for gesture-based control of a device are described. An input frame is processed to determine a location of a distinguishing anatomical feature in the input frame. A virtual gesture-space is defined based on the location of the distinguishing anatomical feature, the virtual gesture-space being a defined space for detecting a gesture input. The input frame is processed in only the virtual gesture-space, to detect and track a hand. Using information generated from detecting and tracking the at least one hand, a gesture class is determined for the at least one hand. The device may be a smart television, a smart phone, a tablet, etc.
    Type: Grant
    Filed: September 22, 2022
    Date of Patent: September 17, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Juwei Lu, Sayem Mohammad Siam, Wei Zhou, Peng Dai, Xiaofei Wu, Songcen Xu
  • Publication number: 20240292073
    Abstract: Methods and devices for generating a customized video segment from a video are disclosed. The video is partitioned into video segments. For each respective video segment, a respective set of scores is computed, where each score represents a respective content feature in the respective video segment. A respective weighted aggregate score is computed for each respective video segment by applying, to each respective set of scores, a common set of weight values. A selected video segment is outputted as the customized video segment, where the selected video segment is selected from one or more high-ranked video segments having high-ranked weighted aggregate scores.
    Type: Application
    Filed: May 7, 2024
    Publication date: August 29, 2024
    Inventors: Md Ibrahim KHALIL, Peng DAI, Hanwen LIANG, Lizhe CHEN, Varshanth Ravindra RAO, Juwei LU, Songcen XU
  • Publication number: 20240193866
    Abstract: Methods and systems for estimation of a 3D hand pose are disclosed. A 2D image containing a detected hand is processed using a U-net network to obtain a global feature vector and a heatmap for the keypoints of the hand. Information from the global feature vector and the heatmap are concatenated to obtain a set of input tokens that are processed using a transformer encoder to obtain a first set of 2D keypoints representing estimated 2D locations of the keypoints in a first view. The first set of 2D keypoints are inputted as a query to a transformer decoder, to obtain a second set of 2D keypoints representing estimated 2D locations of the keypoints in a second view. The first and second sets of 2D keypoints are aggregated to output the set of estimated 3D keypoints.
    Type: Application
    Filed: December 9, 2022
    Publication date: June 13, 2024
    Inventors: Yannick VERDIE, Zihao YANG, Deepak SRIDHAR, Steven George MCDONAGH, Juwei LU
  • Patent number: 12001613
    Abstract: Methods and systems for gesture-based control of a device are described. A virtual gesture-space is determined in a received input frame. The virtual gesture-space is associated with a primary user from a ranked user list of users. The received input frame is processed in only the virtual gesture-space, to detect and track a hand. Using a hand bounding box generated by detecting and tracking the hand, gesture classification is performed to determine a gesture input associated with the hand. A command input associated with the determined gesture input is processed. The device may be a smart television, a smart phone, a tablet, etc.
    Type: Grant
    Filed: May 30, 2022
    Date of Patent: June 4, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Juwei Lu, Sayem Mohammad Siam, Wei Zhou, Peng Dai, Xiaofei Wu, Songcen Xu
  • Patent number: 11966516
    Abstract: Methods and systems for gesture-based control of a device are described. A virtual gesture-space is determined in a received input frame. The virtual gesture-space is associated with a primary user from a ranked user list of users. The received input frame is processed in only the virtual gesture-space, to detect and track a hand. Using a hand bounding box generated by detecting and tracking the hand, gesture classification is performed to determine a gesture input associated with the hand. A command input associated with the determined gesture input is processed. The device may be a smart television, a smart phone, a tablet, etc.
    Type: Grant
    Filed: May 30, 2022
    Date of Patent: April 23, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Juwei Lu, Sayem Mohammad Siam, Wei Zhou, Peng Dai, Xiaofei Wu, Songcen Xu
  • Patent number: 11954145
    Abstract: Methods, systems, and media for image searching are described. Images comprising one query image and a plurality of candidate images are received. For each candidate image, a first model similarity measure from an output of a first model configured for scene classification to perceive scenes in the images is determined. Further, for each candidate image of the plurality of candidate images, a second model similarity measure from the output of a second model configured for attribute classification to perceive attributes in the images is determined. For each candidate image of the plurality of candidate images, a similarity agglomerate index of a weighted aggregate of the first model similarity measure and the second model similarity measure is computed. The plurality of candidate images based on the respective similarity agglomerate index of each candidate image are ranked and a first ranked candidate images corresponding to the searched images are generated.
    Type: Grant
    Filed: June 22, 2021
    Date of Patent: April 9, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Varshanth Ravindra Rao, Md Ibrahim Khalil, Peng Dai, Juwei Lu
  • Patent number: 11914788
    Abstract: Methods and systems for gesture-based control of a device are described. A virtual gesture-space is determined in a received input frame. The virtual gesture-space is associated with a primary user from a ranked user list of users. The received input frame is processed in only the virtual gesture-space, to detect and track a hand. Using a hand bounding box generated by detecting and tracking the hand, gesture classification is performed to determine a gesture input associated with the hand. A command input associated with the determined gesture input is processed. The device may be a smart television, a smart phone, a tablet, etc.
    Type: Grant
    Filed: May 30, 2022
    Date of Patent: February 27, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Juwei Lu, Sayem Mohammad Siam, Wei Zhou, Peng Dai, Xiaofei Wu, Songcen Xu
  • Publication number: 20240054757
    Abstract: Systems and methods for temporal action localization of video data are described. A feature representation extracted from video data has a temporal dimension and a spatial dimension. The feature representation is self-aligned in the spatial dimension. Spatial multi-sampling is performed to obtain a plurality of sparse samples of the self-aligned representation along the spatial dimension, and the multi-sampled representation is fused with the self-aligned representation. Attention-based context information aggregation is applied on the fused representation to obtain a spatially refined representation. Local temporal information aggregation is applied on the self-aligned representation to obtain a temporally refined representation. Action localization is performed on a concatenation of the spatially refined representation and the temporally refined representation.
    Type: Application
    Filed: June 1, 2023
    Publication date: February 15, 2024
    Inventors: Yanhui GUO, Deepak SRIDHAR, Peng DAI, Juwei LU
  • Patent number: 11900260
    Abstract: Methods, devices and processor-readable media for an integrated teacher-student machine learning system. One or more teacher-student modules are trained as part of the teacher neural network training. Each student sub-network uses a portion of the teacher neural network to generate an intermediate feature map, then provides the intermediate feature map to a student sub-network to generate inferences. The student sub-network may use a feature enhancement block to map the intermediate feature map to a subsequent feature map. A compression block may be used to compress intermediate feature map data for transmission in some embodiments.
    Type: Grant
    Filed: March 5, 2020
    Date of Patent: February 13, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Deepak Sridhar, Juwei Lu
  • Patent number: 11902548
    Abstract: Systems, methods, and computer media of processing a video are disclosed. An example method may include: receiving a plurality of video frames of a video; generating a plurality of first input features based on the plurality of video frames; generating a plurality of second input features based on reversing a temporal order of the plurality of first input features; generating a first set of joint attention features based on the plurality of first input features; generating a second set of joint attention features based on the plurality of second input features; and concatenating the first set of joint attention features and the second set of joint attention features to generate a final set of joint attention features.
    Type: Grant
    Filed: March 16, 2021
    Date of Patent: February 13, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Deepak Sridhar, Niamul Quader, Srikanth Muralidharan, Yaoxin Li, Juwei Lu, Peng Dai