Patents Examined by Feng Niu

High-performance and low-complexity neural compression from a single image, video or audio data

Patent number: 12382068

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium for encoding input data comprising input data values corresponding to respective input data grid points of an input data grid, such as image, video or audio data.

Type: Grant

Filed: November 15, 2024

Date of Patent: August 5, 2025

Assignee: DeepMind Technologies Limited

Inventors: Emilien Dupont, Hyun Jik Kim, Matthias Stephan Bauer, Lucas Marvin Theis
Systems and methods for video models with procedure understanding

Patent number: 12367662

Abstract: Embodiments described herein provide systems and methods for training video models to perform a task from an input instructional video. A procedure knowledge graph (PKG) may be generated with nodes representing procedure steps, and edges representing relationships between the steps. The PKG may be generated based on text and/or video training data which includes procedures (e.g., instructional videos). Using the PKG, a video model may be trained using the PKG to provide supervisory training signals for a number of tasks. Once the model is trained, it may be fine-tuned for a specific task which benefits from the model being trained in a way that makes the model embed procedural information when encoding videos.

Type: Grant

Filed: January 25, 2023

Date of Patent: July 22, 2025

Assignee: Salesforce, Inc.

Inventors: Roberto Martin-Martin, Silvio Savarese, Honglu Zhou, Juan Carlos Niebles Duque
Recycling methods and systems, and related plastic containers

Patent number: 12348840

Abstract: A plastic item, such as a beverage bottle, conveys two distinct digital watermarks, encoded using two distinct signaling protocols. A first, printed label watermark conveys a retailing payload, including a Global Trade Item Number (GTIN) used by a point-of-sale scanner in a retail store to identify and price the item when presented for checkout. A second, plastic texture watermark conveys a recycling payload, including data identifying the composition of the plastic. The use of two different signaling protocols assures that a point-of-sale scanner will not spend its limited time and computational resources working to decode the recycling watermark, which lacks the data needed for retail checkout. In some embodiments, a recycling apparatus makes advantageous use of both types of watermarks to identify the plastic composition of the item (e.g., relating GTIN to plastic type using an associated database), thereby increasing the fraction of items that are correctly identified for sorting and recycling.

Type: Grant

Filed: April 15, 2024

Date of Patent: July 1, 2025

Assignee: Digimarc Corporation

Inventors: Ravi K. Sharma, Tomas Filler, Vojtech Holub, Osama M. Alattar, Hugh L. Brunk, John D. Lord, William Y. Conwell
Learning data generation device, learning data generation method, and program

Patent number: 12333785

Abstract: A learning data generation device for generating learning data for learning a recognizer capable of estimating a contour of a sphere making spinning motion, with high accuracy, the sphere being recorded in a single camera video image, is provided.

Type: Grant

Filed: June 24, 2019

Date of Patent: June 17, 2025

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Dan Mikami, Mariko Isogawa, Hiroko Yabushita, Yoshinori Kusachi
Generating a fused object bounding box based on uncertainty

Patent number: 12327364

Abstract: This document describes techniques and systems for generating a fused object bounding box based on uncertainty. At least two bounding boxes, each associated with a different sensor, is generated. A fused center point and yaw angle as well as length, width, and velocity can be found by mixing the distributions of the parameters from each bounding box. A discrepancy between the center points of each bounding box can be used to determine whether to refine the fused bounding box (e.g., find an intersection between at least two bounding boxes) or consolidate the fused bounding box (e.g., find a union between at least two bounding boxes). This results in the fused bounding box having a confidence level of the uncertainty associated with the fused bounding box. In this manner, better estimations of the uncertainty of the fused bounding box may be achieved to improve tracking performance of a sensor fusion system.

Type: Grant

Filed: May 25, 2022

Date of Patent: June 10, 2025

Assignee: Aptiv Technologies AG

Inventor: Bin Jia
Sensor calibration validation

Patent number: 12323574

Abstract: Techniques for determining a probability that a first sensor is miscalibrated with respect a second sensor are discussed herein. For example, a computing device may receive calibrated extrinsics of a camera to a lidar, determine a plurality of sets of perturbed extrinsics based on the calibrated extrinsics, determine respective costs for perturbed extrinsics of the plurality of sets of perturbed extrinsics based on image data captured by the camera, the plurality of sets of perturbed extrinsics, and lidar data captured by the lidar, and determine a local maxima score for the calibrated extrinsics based at least in part on the respective costs for the perturbed extrinsics of the plurality of sets of perturbed extrinsics and a cost of the calibrated extrinsics. The computing device may then determine a probability that the camera is miscalibrated based on a Bayes probability and the local maxima score.

Type: Grant

Filed: May 9, 2022

Date of Patent: June 3, 2025

Assignee: Zoox, Inc.

Inventors: Derek Adams, Zakieh Sadat Hashemifar, Agis Iakovos Mesolongitis
Image defect detection method, electronic device using the same

Patent number: 12293502

Abstract: A method for detecting defects in products from images thereof and an electronic device applying the method inputs a defect image repair data set into an autoencoder to train the autoencoder, and generates a reconstructed image, calculates a reference error value between the sample image and the reconstructed image by a preset error function, and set a threshold value based on the reference error value. The electronic device inputs an image possibly revealing a defect into the autoencoder and generates the reconstructed image corresponding to the image to be detected, and uses the preset error function to calculate the reconstruction error between the image and the reconstructed image, thereby determining whether the image being analyzed does reveal defects. When the reconstruction error is greater than the threshold value, a determination is made that a defect is revealed.

Type: Grant

Filed: December 30, 2021

Date of Patent: May 6, 2025

Assignee: HON HAI PRECISION INDUSTRY CO., LTD.

Inventors: Jung-Hao Yang, Chin-Pin Kuo, Chih-Te Lu, Tzu-Chen Lin, Wan-Jhen Lee, Wei-Chun Wang
Utilizing machine-learning based object detection to improve optical character recognition

Patent number: 12288406

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately enhancing optical character recognition with a machine learning approach for determining words from reverse text, vertical text, and atypically-sized text. For example, the disclosed systems segment a digital image into text regions and non-text regions utilizing an object detection machine learning model. Within the text regions, the disclosed systems can determine reverse text glyphs, vertical text glyphs, and/or atypically-sized text glyphs utilizing an edge based adaptive binarization model. Additionally, the disclosed systems can utilize respective modification techniques to manipulate reverse text glyphs, vertical text glyphs, and/or atypically-sized glyphs for analysis by an optical character recognition model.

Type: Grant

Filed: September 30, 2021

Date of Patent: April 29, 2025

Assignee: Adobe Inc.

Inventors: Ankit Bal, Mohit Gupta, Ram Bhushan Agrawal, Tarun Verma, Uttam Dwivedi
Image processing method and apparatus based on machine learning

Patent number: 12277676

Abstract: An image processing method and apparatus based on machine learning are disclosed. The image processing method based on machine learning, according to the present invention, may comprise the steps of: generating a first corrected image by inputting an input image to a first convolution neural network; generating an intermediate image on the basis of the input image; performing machine learning on a first loss function of the first convolution neural network on the basis of the first corrected image and the intermediate image; and performing machine learning on a second loss function of the first convolution neural network on the basis of the first corrected image and a natural image.

Type: Grant

Filed: November 11, 2020

Date of Patent: April 15, 2025

Assignee: POSTECH RESEARCH AND BUSINESS DEVELOPMENT FOUNDATION

Inventors: Seung Yong Lee, Sung Hyun Cho, Hyeong Seok Son
Medical monitoring system with automatic dosing equipment

Patent number: 12277798

Abstract: There is provided a recognition system adaptable to a portable device or a wearable device. The recognition system senses a body heat using a thermal sensor, and performs functions such as the living body recognition, image denoising and body temperature prompting according to detected results.

Type: Grant

Filed: April 25, 2024

Date of Patent: April 15, 2025

Assignee: PIXART IMAGING INC.

Inventors: Nien-Tse Chen, Yi-Hsien Ko, Yen-Min Chang
Body temperature monitoring device and wearable accessary having thermal sensor

Patent number: 12277797

Abstract: There is provided a recognition system adaptable to a portable device or a wearable device. The recognition system senses a body heat using a thermal sensor, and performs functions such as the living body recognition, image denoising and body temperature prompting according to detected results.

Type: Grant

Filed: April 18, 2024

Date of Patent: April 15, 2025

Assignee: PIXART IMAGING INC.

Inventors: Nien-Tse Chen, Yi-Hsien Ko, Yen-Min Chang
Scale-aware monocular localization and mapping

Patent number: 12260575

Abstract: Disclosed is an image processing device comprising a processor configured to estimate the scale of image features by the steps of: processing multiple images of a scene by means of a first trained model to identify features in the images and to estimate the depths of those features in the images; processing the multiple images by a second trained model to estimate a scaling for the images; and estimating the scales of the features by adjusting the estimated depths in dependence on the estimated scaling. A method for training an image processing model is also disclosed.

Type: Grant

Filed: October 28, 2021

Date of Patent: March 25, 2025

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Benjamin Busam, Gregory Slabaugh
Method and apparatus for data efficient semantic segmentation

Patent number: 12242964

Abstract: A method and system for training a neural network are provided. The method includes receiving an input image, selecting at least one data augmentation method from a pool of data augmentation methods, generating an augmented image by applying the selected at least one data augmentation method to the input image, and generating a mixed image from the input image and the augmented image.

Type: Grant

Filed: June 26, 2023

Date of Patent: March 4, 2025

Assignee: Samsung Electronics Co., Ltd

Inventors: Qingfeng Liu, Mostafa El-Khamy, Jungwon Lee, Behnam Babagholami Mohamadabadi
Systems for multi-task joint training of neural networks using multi-label datasets

Patent number: 12243292

Abstract: Systems and methods for multi-task joint training of a neural network including an encoder module and a multi-headed attention mechanism are provided. In one aspect, the system includes a processor configured to receive input data including a first set of labels and a second set of labels. Using the encoder module, features are extracted from the input data. Using a multi-headed attention mechanism, training loss metrics are computed. A first training loss metric is computed using the extracted features and the first set of labels, and a second training loss metric is computed using the extracted features and the second set of labels. A first mask is applied to filter the first training loss metric, and a second mask is applied to filter the second training loss metric. A final training loss metric is computed based on the filtered first and second training loss metrics.

Type: Grant

Filed: September 2, 2022

Date of Patent: March 4, 2025

Assignee: LEMON INC.

Inventors: Shuo Cheng, Wanchun Ma, Linjie Luo
Interactive video surveillance as an edge service using unsupervised feature queries

Patent number: 12211277

Abstract: A method for querying data obtained from a distributed sensor network, comprising: receiving sensor data representing an aspect of an environment with a sensor of the distributed sensor network; communicating a representation of the sensor data to a fog node through an automated communication network; determining, by the fog node, a correspondence of a query received through the automated communication network to characteristics of the representation of the sensor data; and selectively communicating, in response to the query, at least one of: the sensor data having the determined characteristics corresponding to the query, an identification of the sensor data having the determined characteristics corresponding to the query, and the data representing the sensor data having the determined characteristics corresponding to the query.

Type: Grant

Filed: March 9, 2022

Date of Patent: January 28, 2025

Assignee: The Research Foundation for The State University of New York

Inventors: Yu Chen, Seyed Yahya Nikouei
Qualitative or quantitative characterization of a coating surface

Patent number: 12203868

Abstract: A method for qualitative and/or quantitative characterization of a coating surface is provided, comprising: providing a program recognizing coating surface defect types; determining, by the program, whether a camera(s) coupled to the program is within a predefined distance range and/or within a predefined image acquisition angle range relative to a currently presented coating surface; depending on the determination: generating a feedback signal indicative of whether adjustment of the position of the camera(s) is within predefined distance range and/or within the predefined image acquisition angle range; and/or automatically adjusting the relative distance of the camera and and/or automatically adjusting the angle of the camera; enabling the camera to acquire an image of the coating surface only when the camera(s) is/are within the predefined distance range and/or image acquisition angle range; processing the digital image for recognizing coating surface defects; and outputting a characterization of the

Type: Grant

Filed: September 16, 2021

Date of Patent: January 21, 2025

Assignee: EVONIK OPERATIONS GMBH

Inventors: Philipp Isken, Sandra Bittorf, Oliver Kroehl, Claudia Bramlage, Markus Vogel, Stefan Silber, Gaetano Blanda, Olivia Lewis, Daniel Haake
System and method for using artificial intelligence (AI) to analyze social media content

Patent number: 12205387

Abstract: Systems and methods for determining attributes in media content. A computing device may be configured to obtain the media content based on a received media content identifier and segment the media content into scenes. The computing device may analyze viewer engagement metrics to identify a scene associated with a viewer engagement score that exceeds a threshold, select video frames from the identified scene, and identify primary objects in the series of images in the scene. The computing device may add a bounding box around the identified primary objects in one or more selected frames and perform text extraction within the bounding box. The computing device may determine object attributes of the identified primary objects, querying a database to identify topics of interest (ToIs) based on the extracted text and the determined object attributes, and performing a responsive action in response to identifying the one or more ToIs.

Type: Grant

Filed: January 31, 2024

Date of Patent: January 21, 2025

Assignee: Social Voice Ltd.

Inventor: Allen O'Neill
Automatic dolly zoom image processing device

Patent number: 12205238

Abstract: A system produces a dolly zoom effect by utilizing side view information. The system first captures a main image at a main location. The main image includes at least a foreground object of a given size and a background. The system calculates one or more side view locations based on a zoom-in factor to be applied to the background and an estimated size of the foreground object. The system then guides a user to capture one or more side view images at the one or more side view locations. The foreground object of the given size is superimposed onto a zoomed-in background. Then the side view information is used by the system to perform image inpainting.

Type: Grant

Filed: April 18, 2022

Date of Patent: January 21, 2025

Assignee: MediaTek Inc.

Inventors: Chih-Wei Chen, Pei-Kuei Tsung, Yao-Sheng Wang, Chun Chen Lin, Chia-Ching Lin
Mask for satellite image data

Patent number: 12198347

Abstract: A computing device is provided, including a processor configured to receive imaging relevance data for a geographic area. The processor may be further configured to generate, based at least in part on the imaging relevance data, image mask instructions specifying a region of interest included in the geographic area. The processor may be further configured to transmit the image mask instructions to a satellite. The processor may be further configured to receive, from the satellite, filtered satellite image data of the region of interest. One or more deprioritized regions of the geographic area outside the region of interest may be excluded from the filtered satellite image data.

Type: Grant

Filed: January 13, 2022

Date of Patent: January 14, 2025

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shadi Abdollahian Noghabi, Ranveer Chandra, Krishna Kant Chintalapudi, Peder Andreas Olsen
Systems for generating indications of relationships between electronic documents

Patent number: 12198459

Abstract: In implementations of systems for generating indications of relationships between electronic documents, a processing device implements a relationship system to segment text of electronic documents included in a document corpus into segments. The relationship system determines a subset of the electronic documents that includes electronic document pairs having a number of similar segments that is greater than a threshold number. The similar segments are identified using locality sensitive hashing. The electronic document pairs are classified as related documents or unrelated documents using a machine learning model that receives a pair of electronic documents as an input and generates an indication of a classification for the pair of electronic documents as an output. Indications of relationships between particular electronic documents included in the subset are generated based at least partially on the electronic document pairs that are classified as related documents.

Type: Grant

Filed: November 24, 2021

Date of Patent: January 14, 2025

Assignee: Adobe Inc.

Inventors: Natwar Modani, Vaidehi Ramesh Patil, Inderjeet Jayakumar Nair, Gaurav Verma, Anurag Maurya, Anirudh Kanfade

1 2 3 4 5 … next