Patents Examined by Bernard Krasnic
-
Patent number: 12217465
Abstract: According to an aspect of the disclosure, a method of point cloud geometry encoding in a point cloud encoder is provided. In the method, geometry coding is performed on a point cloud at a first partition depth. Further, a plurality of largest coding units (LCUs) of the point cloud is determined at a second partition depth. A coding state of a LCU of the plurality of LCUs of the point cloud is set at the second partition depth. The geometry coding is performed on the plurality of LCUs of the point cloud at the second partition depth based on the coding state of the LCU at the second partition depth.
Type: Grant
Filed: September 3, 2021
Date of Patent: February 4, 2025
Assignee: TENCENT AMERICA LLC
Inventors: Wen Gao, Xiang Zhang, Shan Liu
-
Patent number: 12175744
Abstract: Methods and systems for providing an interactive image scene graph pattern search are provided. A user is provided with an image having a plurality of selectable segmented regions therein. The user selects one or more of the segmented regions to build a query graph. Via a graph neural network, matching target graphs are retrieved that contain the query graph from a target graph database. Each matching target graph has matching target nodes that match with the query nodes of the query graph. Matching target images from an image database are associated with the matching target graphs. Embeddings of each of the query nodes and the matching target nodes are extracted. A comparison of the embeddings of each query node with the embeddings of each matching target node is performed. The user interface displays the matching target images that are associated with the matching target graphs.
Type: Grant
Filed: September 17, 2021
Date of Patent: December 24, 2024
Assignee: Robert Bosch GmbH
Inventors: Zeng Dai, Huan Song, Panpan Xu, Liu Ren
-
Patent number: 12174883
Abstract: The present disclosure provides a retrieval and push method based on a fine art image tag, including the following steps: establishing a tag model database: training different tag contents with training samples of different subjects and different categories, and categorizing the training samples in the database according to knowledge point tags to obtain the tag model database; retrieval and push: uploading fine art work image samples to the trained tag model database, then extracting knowledge point tags of the fine art work image samples, retrieving associated fine art works, and then pushing the associated fine art works; real-time updating: recording knowledge point tags of input fine art work image samples by the tag model database in real time, and establishing a common tag and a model, and subsequently increasingly pushing contents relevant to the common tag; and generating a user portrait.
Type: Grant
Filed: December 1, 2023
Date of Patent: December 24, 2024
Assignee: Guangdong University of Technology
Inventors: Yi Ji, Zhenni Li, Yinghe Xiao, Yihui Cai, Junxian Lin, Jiayin Xiao, Shaolong Zheng, Jiaqi Xie, Xiaoying Guo
-
Patent number: 12169912
Abstract: An image processor receives first image data representing an image. The first image data comprises a plurality of color values corresponding to a plurality of pixels in the image. The image processor determines, using a trained machine learning model, second image data based on the first image data. The second image data comprises surface spectral reflection values corresponding to the plurality of pixels in the image, where the surface spectral reflection values are distributed across a plurality of wavelengths of visible light in the image. The image processor then performs at least one image processing operation with respect to the image using the second image data.
Type: Grant
Filed: June 21, 2021
Date of Patent: December 17, 2024
Assignee: Microsoft Technology Licensing, LLC
Inventor: Hamidreza Vaezi Joze
-
Patent number: 12165299
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods that implement an image filter for enhancing light text and removing document shadows. In particular embodiments, the disclosed systems use a modified adaptive thresholding approach that relies on image gradients to efficiently guide the thresholding process. In addition, the disclosed systems use a machine-learning model to generate a document shadow map. The document shadow map can include text reflections. Accordingly, the disclosed systems remove text reflections from the document shadow map (e.g., by using an interpolated shadow intensity value of neighboring shadow map pixels). In turn, the disclosed systems use the document text mask and the document shadow map cleaned of text reflections to remove shadows from the digital image. Further, the disclosed systems enhance text in the shadow-removed digital image based on contrast stretching.
Type: Grant
Filed: February 14, 2022
Date of Patent: December 10, 2024
Assignee: Adobe Inc.
Inventors: Prasenjit Mondal, Sachin Soni
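For readers unfamiliar with the contrast stretching mentioned in the abstract of patent 12165299, the following is a minimal sketch of the generic min-max form of that technique. It is not taken from the patent: the function name `contrast_stretch`, the flat list of grayscale values, and the 0 to 255 output range are assumptions made for illustration.

```python
def contrast_stretch(pixels, out_min=0, out_max=255):
    """Linearly rescale grayscale values so they span [out_min, out_max].

    Hypothetical helper for illustration; `pixels` is a flat list of
    intensity values. This is generic min-max stretching, not the
    specific procedure claimed in the patent.
    """
    lo = min(pixels)
    hi = max(pixels)
    if hi == lo:
        # A flat image has no contrast to stretch; map everything to out_min.
        return [out_min for _ in pixels]
    span = out_max - out_min
    # Map lo -> out_min and hi -> out_max, interpolating linearly in between.
    return [round(out_min + (p - lo) * span / (hi - lo)) for p in pixels]
```

With this sketch, a low-contrast input such as `[100, 125, 200]` is spread across the full output range, which is the effect that makes faint text easier to read.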
-
Patent number: 12167015
Abstract: A method, device and computer program product, the method comprising: obtaining access to a classifier trained upon a multiplicity of sets of decoded coefficients; obtaining a set of block coefficients associated with at least a part of the compressed image; and applying the classifier to the set of block coefficients, to obtain a classification of the compressed image.
Type: Grant
Filed: February 2, 2021
Date of Patent: December 10, 2024
Assignee: DSP Group Ltd.
Inventor: Tal Hendel
-
Patent number: 12159452
Abstract: Systems and methods for detecting and predicting text within images. An image is passed to a feature-extraction module. Each image typically contains at least one text object, and each text object contains at least one character. Based on the image, the feature-extraction module generates at least one feature map indicating text object(s) in the image. The feature map(s) is then passed to a decoder module. In some implementations, the decoder module applies a weighted mask to the feature map(s). Based on the feature map(s), the decoder module predicts a sequence of characters in the text object(s). In some embodiments, that prediction is based on previous known data. The decoder module is directed by a query that indicates at least one desired characteristic of the text object(s). An output module then refines the predicted content. At least one neural network may be used.
Type: Grant
Filed: November 14, 2019
Date of Patent: December 3, 2024
Assignee: ServiceNow Canada Inc.
Inventors: Perouz Taslakian, Negin Sokhandan Asl
-
Patent number: 12142005
Abstract: Systems, methods, and computer readable media that store instructions for distance measurement. The method may include obtaining, from a camera of a vehicle, an image of a surroundings of the vehicle; searching, within the image, for an anchor, wherein the anchor is associated with at least one physical dimension of a known value; and when finding the anchor, determining a distance between the camera and the anchor based on (a) the at least one physical dimension of a known value, (b) an appearance of the at least one physical dimension of a known value in the image, and (c) a distance-to-appearance relationship that maps appearances to distances, wherein the distance-to-appearance relationship is generated by a calibration process that comprises obtaining one or more calibration images of the anchor, and obtaining one or more distance measurements to the anchor.
Type: Grant
Filed: October 13, 2021
Date of Patent: November 12, 2024
Assignee: AUTOBRAINS TECHNOLOGIES LTD
Inventor: Igal Raichelgauz
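One way to picture the distance-to-appearance relationship described in the abstract of patent 12142005 is a pinhole-camera approximation, in which the apparent size of an object in pixels is inversely proportional to its distance. The sketch below assumes that model; it is an illustration, not the patented method. The function names `fit_scale` and `estimate_distance`, the single scale factor `k`, and the least-squares fit are all hypothetical.

```python
def fit_scale(calibration):
    """Fit k in the assumed model: distance = k * real_size / pixel_size.

    `calibration` is a list of (real_size_m, pixel_size_px, distance_m)
    tuples, standing in for the calibration images and distance
    measurements mentioned in the abstract. Uses least squares through
    the origin: k = sum(x * d) / sum(x * x), with x = real_size / pixel_size.
    """
    num = 0.0
    den = 0.0
    for real_size, pixel_size, distance in calibration:
        x = real_size / pixel_size
        num += x * distance
        den += x * x
    return num / den


def estimate_distance(k, real_size, pixel_size):
    """Estimate camera-to-anchor distance from the anchor's known physical
    size and its apparent size in the image (assumed pinhole model)."""
    return k * real_size / pixel_size
```

For example, calibrating on an anchor 0.5 m wide that appears 100 px wide at 5 m and 50 px wide at 10 m gives a scale factor that can then be applied to a new observation via `estimate_distance(k, 0.5, 25)`. A real system would likely need a richer mapping to handle lens distortion and viewing angle.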
-
Patent number: 12136262
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing instance segmentation by detecting and segmenting individual objects in an image. In one aspect, a method comprises: processing an image to generate data identifying a region of the image that depicts a particular object; obtaining data defining a plurality of example object segmentations; generating a respective weight value for each of the example object segmentations; for each of a plurality of pixels in the region of the image, determining a score characterizing a likelihood that the pixel is included in the particular object depicted in the region of the image using: (i) the example object segmentations, and (ii) the weight values for the example object segmentations; and generating a segmentation of the particular object depicted in the region of the image using the scores for the pixels in the region of the image.
Type: Grant
Filed: October 12, 2023
Date of Patent: November 5, 2024
Assignee: Google LLC
Inventors: Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin
-
Patent number: 12131436
Abstract: The present disclosure provides a target image generation method. The method includes obtaining a first parsed image and a first pose image based on an original image, the first parsed image being an image labeled with parts of an object in the original image, the first pose image representing a pose of the object in the original image; inputting the first parsed image, the first pose image, and a second pose image representing a target pose into a first image generation model, and determining a first transformation parameter and adjusting the first parsed image based on the first transformation parameter to obtain a target parsed image, a pose of the object in the target parsed image being the target pose; and inputting a first combined image and a second combined image into a second image generation model, and adjusting the first combined image to obtain a target image.
Type: Grant
Filed: November 23, 2021
Date of Patent: October 29, 2024
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Liying Lu, Shu Liu, Jiaya Jia
-
Patent number: 12131543
Abstract: A semantic-based method and apparatus for retrieving a perspective image, an electronic device and a computer-readable storage medium are provided. A method includes obtaining a perspective image for a space containing an inspected object therein. A semantic division on the perspective image is performed using a first method, to obtain a plurality of semantic region units. A feature extraction network is constructed using a second method. Based on the perspective image and each of the plurality of semantic region units, a feature of each semantic region unit is extracted using the feature extraction network. Based on the feature of each semantic region unit, an image most similar to the semantic region unit is retrieved from an image feature database, to assist in determining an inspected object in the semantic region unit.
Type: Grant
Filed: March 15, 2021
Date of Patent: October 29, 2024
Assignees: Tsinghua University, Nuctech Company Limited
Inventors: Li Zhang, Zhiqiang Chen, Yuanjing Li, Yuxiang Xing, Fanhua Meng, Qiang Li, Wei Li, Gang Fu
-
Patent number: 12124503
Abstract: According to one embodiment, a system includes a determination unit, a first storage, a second storage, a search unit and a display. The determination unit determines a feature quantity of the process-targeted manufacturing data. The first storage stores cause-unidentified manufacturing data. The second storage stores cause-identified manufacturing data. The search unit searches, based on the feature quantity of the process-targeted manufacturing data, the first storage and the second storage for the cause-unidentified manufacturing data and the cause-identified manufacturing data that have a feature quantity similar to that of the process-targeted manufacturing data. The display displays the search result.
Type: Grant
Filed: February 26, 2021
Date of Patent: October 22, 2024
Assignee: KABUSHIKI KAISHA TOSHIBA
Inventors: Takahiro Takimoto, Kouta Nakata, Kazunori Imoto, Ayana Yamamoto, Shun Hirao
-
Patent number: 12118699
Abstract: An image processing apparatus includes an image processing unit configured to generate a second image obtained by performing predetermined image processing on a first image representing an input image, a determination unit configured to determine a luminance correction range on the basis of image information in one of a high luminance range and a low luminance range determined in accordance with the image processing, and a luminance correction unit configured to correct a luminance value in the luminance correction range determined by the determination unit for the second image.
Type: Grant
Filed: December 7, 2021
Date of Patent: October 15, 2024
Assignee: SHARP KABUSHIKI KAISHA
Inventor: Yuichi Yoshida
-
Patent number: 12112537
Abstract: A group captioning system includes computing hardware, software, and/or firmware components in support of the enhanced group captioning contemplated herein. In operation, the system generates a target embedding for a group of target images, as well as a reference embedding for a group of reference images. The system identifies information in-common between the group of target images and the group of reference images and removes the joint information from the target embedding and the reference embedding. The result is a contrastive group embedding that includes a contrastive target embedding and a contrastive reference embedding with which to construct a contrastive group embedding, which is then input to a model to obtain a group caption for the target group of images.
Type: Grant
Filed: October 16, 2023
Date of Patent: October 8, 2024
Assignee: ADOBE INC.
Inventors: Quan Hung Tran, Long Thanh Mai, Zhe Lin, Zhuowan Li
-
Patent number: 12086995
Abstract: Techniques related to video background estimation inclusive of generating a final background picture absent foreground objects based on input video are discussed. Such techniques include generating first and second estimated background pictures using temporal and spatial background picture modeling, respectively, and fusing the first and second estimated background pictures based on first and second confidence maps corresponding to the first and second estimated background pictures to generate the final estimated background picture.
Type: Grant
Filed: September 3, 2020
Date of Patent: September 10, 2024
Assignee: Intel Corporation
Inventors: Itay Benou, Yevgeny Priziment, Tzachi Herskovich
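The abstract of patent 12086995 does not say how the two background estimates are fused. One plausible reading, assumed here purely for illustration, is a per-pixel average weighted by the two confidence maps. The function name `fuse_backgrounds`, the nested-list image representation, and the fallback to a plain average when both confidences are near zero are hypothetical choices, not details from the patent.

```python
def fuse_backgrounds(bg_a, conf_a, bg_b, conf_b, eps=1e-6):
    """Fuse two background estimates with a confidence-weighted average.

    All four arguments are 2D lists (rows of pixel values) of equal shape.
    Assumed rule: fused = (ca * a + cb * b) / (ca + cb) at each pixel.
    """
    fused = []
    for row_a, row_ca, row_b, row_cb in zip(bg_a, conf_a, bg_b, conf_b):
        fused_row = []
        for a, ca, b, cb in zip(row_a, row_ca, row_b, row_cb):
            total = ca + cb
            if total < eps:
                # Neither estimate is trusted; fall back to a plain average.
                fused_row.append((a + b) / 2.0)
            else:
                fused_row.append((ca * a + cb * b) / total)
        fused.append(fused_row)
    return fused
```

Under this assumed rule, a pixel where only the temporal estimate is confident takes the temporal value, a pixel where only the spatial estimate is confident takes the spatial value, and pixels with comparable confidence blend the two.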
-
Patent number: 12080009
Abstract: The present invention discloses a system and a method for providing multi-channel high-quality depth estimation from a monocular camera for providing augmented reality (AR) and virtual reality (VR) features to an image. The invention further includes the method to enhance generalization on deployment-friendly monocular depth inference pipeline with semantic information. Furthermore, a vivid and intact reconstruction is guaranteed by inpainting the missing depth and context within the single image input.
Type: Grant
Filed: August 31, 2021
Date of Patent: September 3, 2024
Assignee: Black Sesame Technologies Inc.
Inventors: Fangwen Tu, Bo Li
-
Patent number: 12075190
Abstract: A machine learning system can generate an image mask (e.g., a pixel mask) comprising pixel assignments for pixels. The pixels can be assigned to classes, including, for example, face, clothes, body skin, or hair. The machine learning system can be implemented using a convolutional neural network that is configured to execute efficiently on computing devices having limited resources, such as mobile phones. The pixel mask can be used to more accurately display video effects interacting with a user or subject depicted in the image.
Type: Grant
Filed: July 13, 2023
Date of Patent: August 27, 2024
Assignee: Snap Inc.
Inventors: Lidiia Bogdanovych, William Brendel, Samuel Edward Hare, Fedir Poliakov, Guohui Wang, Xuehan Xiong, Jianchao Yang, Linjie Yang
-
Patent number: 12067744
Abstract: Systems and methods detect the posture of a user of an IHS (Information Handling System). An image is generated of a user and of the physical environment in which the user is operating the IHS. The image is processed to generate segregated images of the user and of the physical environment. The segregated image of the physical environment is classified as corresponding to a known environment in which the user has operated the IHS and that is associated with a probability of an ergonomic posture being used while in that particular environment. The segregated image of the user is processed to determine a physical posture of the user relative to the IHS. An ergonomic risk score is generated based on deviations of the user's posture from an ideal posture. The ergonomic risk score is scaled based on the probability of an ergonomic posture being used, due to the environment.
Type: Grant
Filed: February 18, 2022
Date of Patent: August 20, 2024
Assignee: Dell Products, L.P.
Inventors: Loo Shing Tan, Seng Khoon Teh, Ruizhi Joyce Lu
-
Patent number: 12045967
Abstract: Systems and methods are disclosed for model based document image enhancement. Instead of requiring paired dirty and clean images for training a model to clean document images (which may cause privacy concerns), two models are trained on the unpaired images such that only the dirty images are accessed or only the clean images are accessed at one time. One model is a first implicit model to translate the dirty images from a source space to a latent space, and the other model is a second implicit model to translate the images from the latent space to clean images in a target space. The second implicit model is trained based on translating electronic document images in the target space to the latent space. In some implementations, the implicit models are diffusion models, such as denoising diffusion implicit models based on solving ordinary differential equations.
Type: Grant
Filed: August 16, 2023
Date of Patent: July 23, 2024
Assignee: Intuit Inc.
Inventors: Jiaxin Zhang, Tharathorn Joy Rimchala, Lalla Mouatadid, Kamalika Das, Sricharan Kallur Palli Kumar
-
Patent number: 12033428
Abstract: A method of verifying a user for transportation purposes is disclosed. The method may include using a communication apparatus to detect a face of the user. The method may include using the communication apparatus to instruct the user to perform a specific action, to validate that the specific action is performed by the user, to extract a frame from the specific action to use as an image, to obtain image parameters from the frame and to use the communication apparatus to send the image to a server for the server to determine whether the image is a genuine face by comparing the image parameters of the image with parameters in a database to obtain a comparison result and to use the comparison result to determine if the user should be verified.
Type: Grant
Filed: February 4, 2020
Date of Patent: July 9, 2024
Assignee: GRABTAXI HOLDINGS PTE. LTD.
Inventors: Cheuk Lun Dong, Chun Tung Wong, Munirul Abedin, Yee Won Nyon