Distinguishing Text From Other Regions Patents (Class 382/176)

Utilizing machine-learning based object detection to improve optical character recognition

Patent number: 12288406

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately enhancing optical character recognition with a machine learning approach for determining words from reverse text, vertical text, and atypically-sized text. For example, the disclosed systems segment a digital image into text regions and non-text regions utilizing an object detection machine learning model. Within the text regions, the disclosed systems can determine reverse text glyphs, vertical text glyphs, and/or atypically-sized text glyphs utilizing an edge based adaptive binarization model. Additionally, the disclosed systems can utilize respective modification techniques to manipulate reverse text glyphs, vertical text glyphs, and/or atypically-sized glyphs for analysis by an optical character recognition model.

Type: Grant

Filed: September 30, 2021

Date of Patent: April 29, 2025

Assignee: Adobe Inc.

Inventors: Ankit Bal, Mohit Gupta, Ram Bhushan Agrawal, Tarun Verma, Uttam Dwivedi
Encoding a video frame using different compression ratios for text blocks and non-text blocks

Patent number: 12289457

Abstract: This document describes systems and techniques for encoding a video frame using different compression ratios or compression algorithms for text blocks and non-text blocks. The described systems and techniques can determine, using a machine-learned model, which blocks of a frame include and do not include text. The described systems and techniques can then use a different compression ratio or compression algorithm for text blocks than the compression ratio or compression algorithm used for non-text blocks. For example, the systems and techniques can encode the text blocks using a first compression ratio that results in higher video quality than a second compression ratio used on at least some non-text blocks. In this way, the described systems and techniques can improve text legibility in a video file without significantly increasing the bandwidth requirements to transmit the video file to remote computing devices.

Type: Grant

Filed: November 9, 2020

Date of Patent: April 29, 2025

Assignee: Google LLC

Inventors: Daniele Moro, Claudionor Coelho, Sean R. Purser-Haskell, Hao Zhuang, Stan Vitvitskyy
Image guided video thumbnail generation for e-commerce applications

Patent number: 12283083

Abstract: Systems and methods are provided for automatically generating a thumbnail for a video on an online shopping site. The disclosed technology automatically generates a thumbnail for a video, where the thumbnail represents an item but not necessarily content of the video. A thumbnail generator receives a video that describes the item and an ordered list of item images associated with the item used in an item listing. The thumbnail generator extracts video frames from the video based on sampling rules and determines similarity scores for the sampled video frames. A similarity score indicates a degree of similarity between content of a video frame and an item image. The thumbnail generator determines weighted similarity scores based item images and occurrences of sampled video frames in the video. The disclosed technology generates a thumbnail for the video by selecting a sample video frame based on the weighted similarity scores.

Type: Grant

Filed: December 8, 2021

Date of Patent: April 22, 2025

Assignee: eBay Inc.

Inventor: Berkan Solmaz
Method of identifying characters in images, electronic device, and storage medium

Patent number: 12236697

Abstract: A method of identifying characters in images extracts features of a detection image including characters. Enhancement processing is performed on the detection image according to the features to obtain an enhanced image. Closed edges of the characters are detected in the enhanced image. First rectangular outlines of the characters are determined according to the closed edges. The first rectangular outlines are corrected to obtain second rectangular outlines. The characters are cropped from the detection image according to the second rectangular outlines. The method identifies characters in images accurately and rapidly.

Type: Grant

Filed: May 18, 2022

Date of Patent: February 25, 2025

Assignee: HON HAI PRECISION INDUSTRY CO., LTD.

Inventors: Cheng-Feng Wang, Li-Che Lin, Hui-Xian Yang
Method and apparatus for recognizing subtitle region, device, and storage medium

Patent number: 12236696

Abstract: A method and an apparatus for recognizing a subtitle region, a device, and a storage medium are provided, relating to the field of computer vision technologies of artificial intelligence. The method includes: recognizing a video to obtain n candidate subtitle regions, the candidate subtitle regions being regions in which text contents are displayed in the video, and n being a positive integer; and screening the n candidate subtitle regions according to a subtitle region screening policy to obtain the subtitle region, the subtitle region screening policy being used for determining a candidate subtitle region in which text contents have a repetition rate being lower than a repetition rate threshold and have a longest total display duration as the subtitle region. By using the method and apparatus, device, and system, labor resources required for subtitle region recognition can be saved.

Type: Grant

Filed: October 4, 2022

Date of Patent: February 25, 2025

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Jie Huang, Shupei Wang
System and method for automatically curating and displaying images

Patent number: 12229462

Abstract: The present disclosure is directed to automatically curating a set of images according to a predetermined set of compliance rules and displaying those images. The machine learning and artificial intelligence technology can differentiate between images and identify features within images. The features may include a desired perspective, inconsistent backgrounds, low detail, blurriness, shadows, glare, reflections, unwanted information, unwanted elements, poles, trees, lack or focus, poor resolution, rain, snow, fumes, smoke, mud, unwanted banners, unwanted overlays, etc. The technology is trained to identify these features in the images and automatically tag the images with information relating to the features. A user selects a set of predetermined rules to be applied to the images. The technology then uses the tags to apply these selected rules to the images to modify the images and display the modified images in a predetermined sequence and arrangement.

Type: Grant

Filed: April 10, 2023

Date of Patent: February 18, 2025

Assignee: Freddy Technologies LLC

Inventors: Sudheer Kumar Pamuru, Madhukiran Dandamudi, Vineel Kurma
Image processing apparatus, image processing method, and storage medium

Patent number: 12223261

Abstract: An image processing apparatus includes at least one memory that stores instructions; and at least one processor that execute the instructions to perform: detecting text blocks in an input image; determining a registered document corresponding to the input image among a plurality of registered documents; determining the text block in the input image that corresponds to a processing target item, based on a partial layout defined in the determined registered document and including a first text block corresponding to the processing target item and at least one second text block present near the first text block; and obtaining a character string corresponding to the processing target item by performing character recognition processing on the determined text block.

Type: Grant

Filed: March 5, 2021

Date of Patent: February 11, 2025

Assignee: Canon Kabushiki Kaisha

Inventor: Takashi Miyauchi
Image processing apparatus and image processing method for contrast enhancement

Patent number: 12198314

Abstract: An image processing method includes: receiving an input image; performing a low-frequency image regulating operation to regulate the local intensity of the image of pixel unit(s) according to low-frequency information of the image of pixel unit(s) of the input image; performing a high-frequency image regulating operation to improve the details of the image of pixel unit(s) according to high-frequency information of the image of pixel unit (s) of the input image; and, generating an output image according to the input image, the low-frequency image regulating operation, and the high-frequency image regulating operation.

Type: Grant

Filed: April 13, 2022

Date of Patent: January 14, 2025

Assignee: Realtek Semiconductor Corp.

Inventors: Yu-Hsuan Kuan, Tsung-Hsuan Li, Shih-Tse Chen
Video text tracking method and electronic device

Patent number: 12190612

Abstract: A video text tracking method and an electronic device are disclosed. In the method, a text line region is split into sub-regions, the sub-regions are tracked and then processed, and processed sub-regions are combined into a new text line. The technical solutions provided in this application are not only applicable to a straight-line text scenario or a curved text scenario, but also present a good tracking effect for a deformable text line.

Type: Grant

Filed: January 14, 2021

Date of Patent: January 7, 2025

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Qisen Tang, Hengzhi Yao
Image forming apparatus

Patent number: 12170748

Abstract: An image forming apparatus configured to form an image on a recording medium includes a display unit configured to perform display relating to a setting of image processing performed by the image forming apparatus, and a detection unit configured to detect a user's finger at a position separate from a surface of the display unit by a first distance in a vertical direction perpendicular to the surface of the display unit, and detect the user's finger at a position separate from the surface by a second distance longer than the first distance, wherein in a case where the display unit is in an off state, the display unit turns on when the detection unit detects the user's finger at the position separate from the surface by the second distance.

Type: Grant

Filed: November 15, 2023

Date of Patent: December 17, 2024

Assignee: Canon Kabushiki Kaisha

Inventor: Takuya Uemura
Machine learning enabled document deskewing

Patent number: 12165298

Abstract: A method may include determining, based at least on an image of a document, a plurality of text bounding boxes enclosing lines of text present in the document. A machine learning model may be trained to determine, based at least on the coordinates defining the text bounding boxes, the coordinates of a document bounding box enclosing the text bounding boxes. The document bounding box may encapsulate the visual aberrations that are present in the image of the document. As such, one or more transformations may be determined based on the coordinates of the document bounding box. The image of the document may be deskewed by applying the transformations. One or more downstream tasks may be performed based on the deskewed image of the document. Related methods and articles of manufacture are also disclosed.

Type: Grant

Filed: January 7, 2022

Date of Patent: December 10, 2024

Assignee: SAP SE

Inventors: Marek Polewczyk, Marco Spinaci
Display device and method of driving the same

Patent number: 12165611

Abstract: The present disclosure provides a display device with a display panel and driver. The driver includes a logo compensator to generate a logo map data with respect to a logo area through which the logo image is displayed and compensates for a brightness of the logo area using the logo map data. The logo compensator includes an extractor, a logo calculator, a logo determination unit, and a brightness compensation block. The extractor extracts logo area data. The logo calculator calculates the logo map data with first data corresponding to a first image recognized as the logo image and second data corresponding to a second image recognized as a logo background image. The logo determination unit sets a boundary area and determining whether a first area corresponding to the first image overlaps the boundary area to output determination data. The brightness compensation block compensates for the brightness of the logo area.

Type: Grant

Filed: May 28, 2021

Date of Patent: December 10, 2024

Assignee: SAMSUNG DISPLAY CO., LTD.

Inventors: Byung Ki Chun, Hyeonmin Kim, Youngwook Yoo, Jungyu Lee, Hyunjun Lim
Determination of the decision-relevant image components for an image classifier using binary masks

Patent number: 12148135

Abstract: A method for measuring the components of an input image on which an image classifier bases its decision regarding the assignment of this input image to one or multiple class(es) of a predefined classification. The method includes: providing binary masks, which indicate which pixels of the input image and/or of an intermediate product formed in the image classifier are considered relevant; assessing the binary masks using a quality function, which is a measure of the extent to which at least one classification score, supplied by the image classifier, with respect to at least one target class changes when the pixels of the input image or of the intermediate product which are relevant according to the binary mask are changed; and ascertaining the sought-after components of the input image relevant for the decision of the image classifier from the combination of the binary masks with respective assessments by the quality function.

Type: Grant

Filed: December 9, 2021

Date of Patent: November 19, 2024

Assignee: ROBERT BOSCH GMBH

Inventor: Andres Mauricio Munoz Delgado
Anomaly and fraud detection with fake event detection using pixel intensity testing

Patent number: 12136089

Abstract: The present disclosure involves systems, software, and computer implemented methods for transaction auditing. One example method includes determining valid pixel-based pattern(s) that are included in valid reference images. Fraudulent pixel-based pattern(s) that are included in fraudulent reference images are determined. A request to classify an image is received. A determination is made as to whether pixel values in the image match a valid pixel-based pattern or a fraudulent pixel-based pattern. In response to determining that the pixel values match a valid pixel-based pattern, a likelihood of classifying the first image as a valid image is increased. In response to determining that the pixel values match a fraudulent pixel-based pattern, a likelihood that the image as a fraudulent image is increased. The image is classified in response to the request as either a valid image or a fraudulent image based on the likelihoods.

Type: Grant

Filed: April 11, 2022

Date of Patent: November 5, 2024

Assignee: SAP SE

Inventors: Jesper Lind, Suchitra Sundararaman
Image processing system, image processing apparatus, control method

Patent number: 12113938

Abstract: An image processing apparatus includes a display device configured to display information, a reading device configured to read a document, and one or more controllers configured to function as a unit configured to input an image read by the reading device to a trained model trained based on an image that does not contain text and orientation information about the image that does not contain text, and a unit configured to display information about the image read by the reading device on the display device based on at least an output result from the trained model.

Type: Grant

Filed: December 2, 2021

Date of Patent: October 8, 2024

Assignee: Canon Kabushiki Kaisha

Inventor: Satoru Ikeda
Display apparatus, display system, display control method, and non-transitory recording medium

Patent number: 12112720

Abstract: A display apparatus includes circuitry to display, on a display, an image including a table, receive an operation of specifying a range to be edited in the image, acquire coordinates of lines of the table in the range, and change a color of pixels other than pixels corresponding to the lines in the range to a predetermined color.

Type: Grant

Filed: July 21, 2022

Date of Patent: October 8, 2024

Assignee: Ricoh Company, Ltd.

Inventor: Kohdai Asanuma
Device and a method of encoding images including a privacy filter

Patent number: 12096163

Abstract: An image processing device a camera and a method for of encoding images captured by a camera are disclosed. For each image of an image sequence captured by the camera, the image is pre-processed by filtering the image by applying a privacy filter, the privacy filter being configured to distort the image in such a way that privacy is achieved in the filtered image, and, for at least a subset of the filtered images, by colour revising the filtered image by changing colour of pixels of a plurality of scattered areas of the filtered image such that a respective colour of one or more pixels of each area of the plurality of scattered areas represents one or more original colours of one or more pixels before filtering at a location of that area in the filtered image. The pre-processed images are then encoded into an encoded video stream.

Type: Grant

Filed: November 17, 2021

Date of Patent: September 17, 2024

Assignee: AXIS AB

Inventors: Carl-Axel Alm, Stefan Lundberg
Adding a watermark on a document for printing in a virtual desktop infrastructure (VDI) environment

Patent number: 12093586

Abstract: Example methods and systems are described to add a watermark for printing in a virtual desktop environment having an agent side and a client side. A watermark can be configured at the agent side for printing at the client side. At the agent side, a fallback font can be determined for text of the watermark, and coordinate space calculation can be performed, so that the watermark prints correctly at the client side.

Type: Grant

Filed: May 4, 2023

Date of Patent: September 17, 2024

Assignee: VMware LLC

Inventors: Hui Yuan, Kun Shi
Systems, methods, and apparatuses for implementing medical image segmentation using interactive refinement

Patent number: 12094190

Abstract: Medical image segmentation using interactive refinement, in which the trained deep models are then utilized for the processing of medical imaging are described. Operating a two-step deep learning training framework including receiving original input images at the deep learning training framework; generating an initial prediction image specifying image segmentation by base segmentation model; receiving user input guidance signals; routing each of (i) the original input images, (ii) the initial prediction image, and (iii) the user input guidance signals to an InterCNN; generating a refined prediction image specifying refined image segmentation by processing each of the (i) the original input images, (ii) the initial prediction image, and (iii) the user input guidance signals through the InterCNN to render the refined prediction image incorporating the user input guidance signals; and outputting a refined segmentation mask to the deep learning training framework as a guidance signal.

Type: Grant

Filed: February 18, 2022

Date of Patent: September 17, 2024

Assignee: Arizona Board of Regents on behalf of Arizona State University

Inventors: Diksha Goyal, Jianming Liang
Computer vision based document parsing

Patent number: 12094231

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for machine learning. One of the methods includes receiving one or more page images from a document; for each page image: providing the page image to a computer vision neural network model, wherein the neural network model is trained for the particular page type and is configured to output predictions of coordinates for one or more regions within the image and corresponding labels for the respective regions; and generating an output data structure associating each labeled region with text content located within the labeled region.

Type: Grant

Filed: October 1, 2021

Date of Patent: September 17, 2024

Assignee: States Title, LLC

Inventors: James P. Buban, Allen Ko
Image modification based on text recognition

Patent number: 12081901

Abstract: A text detection process may involve comparing high-contrast pixel densities of areas of images of a video to determine quantities of text-containing areas in the images. Based on a difference between quantities of text-containing areas of subsets of the images, an image of the video may be selected for modification.

Type: Grant

Filed: August 20, 2021

Date of Patent: September 3, 2024

Assignee: Comcast Cable Communications, LLC

Inventors: Oliver Jojic, David F. Houghton
Systems and methods for packaging reusable generative artificial intelligence pipelines

Patent number: 12079570

Abstract: The systems and methods described herein relate to improvements to generative artificial intelligence systems through the use of generative artificial intelligence pipelines to supply external information to pre-trained large language models for use in answering queries. To improve the efficiency and accuracy of large language models in responding to user queries, according to various aspects described herein, such queries may be modified and augmented with additional relevant information and may be divided into multiple queries for parallel handling, the results of which may then be combined into a response. The additional relevant information may include portions of documents or other data sets to be used in generating the response. Additional aspects may further improve resilience and flexibility by managing the generation or implementation of such modified and augmented queries.

Type: Grant

Filed: October 24, 2023

Date of Patent: September 3, 2024

Assignee: MCKINSEY & COMPANY, INC.

Inventors: Peter Mondlock, Catarina Aleixo, Oleksandr Lobunets
Computer vision systems and methods for information extraction from text images using evidence grounding techniques

Patent number: 12056941

Abstract: Computer vision systems and methods for text classification are provided. The system detects a plurality of text regions in an image and generates a bounding box for each detected text region. The system utilizes a neural network to recognize text present within each bounding box and classifies the recognized text, based on at least one extracted feature of each bounding box and the recognized text present within each bounding box, according to a plurality of predefined tags. The system can associate a key with a value and return a key-value pair for each predefined tag.

Type: Grant

Filed: January 24, 2023

Date of Patent: August 6, 2024

Assignee: Insurance Services Office, Inc.

Inventors: Khoi Nguyen, Maneesh Kumar Singh
Feature detection methods and systems using deconstructed color image data

Patent number: 12051226

Abstract: An illustrative image processing system extracts a first color-field image from an original color image associated with a set of color-field components. The first color-field image is associated with a first subset of the set of color-field components. The image processing system also extracts a second color-field image from the original color image. The second color-field image is associated with a second subset of the set of color-field components that is different from the first subset. The image processing system detects a first set of features within the first color-field image and a second set of features within the second color-field image. At least one feature is detected within the first color-field image and included in the first set of features while not being detected within the second color-field image or included in the second set of features. Corresponding methods and systems are also disclosed.

Type: Grant

Filed: August 25, 2021

Date of Patent: July 30, 2024

Assignee: Verizon Patent and Licensing Inc.

Inventors: Tom Hsi Hao Shang, Elena Dotsenko
System and method for automatic document management

Patent number: 12045244

Abstract: A system for managing documents, comprising: interfaces to a user interface, proving an application programming interface, a database of document images, a remote server, configured to communicate a text representation of the document from the optical character recognition engine to the report server, and to receive from the remote server a classification of the document; and logic configured to receive commands from the user interface, and to apply the classifications received from the remote server to the document images through the interface to the database. A corresponding method is also provided.

Type: Grant

Filed: April 24, 2023

Date of Patent: July 23, 2024

Assignee: Autoflie Inc.

Inventors: Eitan Dub, Adam O. Dub, Alfredo J. Miro
Hierarchical histogram calculation with application to palette table derivation

Patent number: 12047565

Abstract: Systems, apparatuses, and methods for calculating multi-pass histograms for palette table derivation are disclosed. An encoder calculates a first histogram for a first portion of most significant bits (MSBs) of pixel component values of a block of an image or video frame. Then, the encoder selects a given number of the highest pixel count bins from the first histogram. The encoder then increases the granularity of these selected highest pixel count bins by evaluating one or more additional bits from the pixel component values. A second histogram is calculated for the concatenation of the original first portion MSBs from the highest pixel count bins and the one or more additional bits, and the highest pixel count bins are selected from the second histogram. A palette table is derived based on these highest pixel count bins selected from the second histogram, and the block is encoded using the palette table.

Type: Grant

Filed: July 26, 2021

Date of Patent: July 23, 2024

Assignee: ATI Technologies ULC

Inventors: Feng Pan, Wei Gao, Yang Liu, Crystal Yeong-Pian Sau, Haibo Liu, Edward A. Harold, Ying Luo, Ihab Amer, Gabor Sines
Recognizing text in image data

Patent number: 12019675

Abstract: A device may receive image data representing a document, the document including: text, and edges. Based on the edges, the device may identify, a segment of interest within the image data and crop the segment of interest to obtain a portion of the image data. In addition, the device may perform optical character recognition on the portion of the image data, the optical character recognition producing recognized text. The device may obtain, based on the recognized text, validation data that includes verification text, and determine whether the recognized text is verified based on the verification text. Based on a result of the determination, the device may perform an action.

Type: Grant

Filed: March 5, 2021

Date of Patent: June 25, 2024

Assignee: Capital One Services, LLC

Inventors: Subhashini Tripuraneni, Joseph R. Barco, Jr.
Neural network-based optical character recognition

Patent number: 12020152

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural network-based optical character recognition. An embodiment of the system may generate a set of bounding boxes based on reshaped image portions that correspond to image data of a source image. The system may merge any intersecting bounding boxes into a merged bounding box to generate a set of merged bounding boxes indicative of image data portions that likely portray one or more words. Each merged bounding box may be fed by the system into a neural network to identify one or more words of the source image represented in the respective merged bounding box. The one or more identified words may be displayed by the system according to a standardized font and a confidence score.

Type: Grant

Filed: August 9, 2021

Date of Patent: June 25, 2024

Assignee: Vannevar Labs, Inc.

Inventors: Daniel Goodman, Nathaniel Honka, Eleony Moorhead, Nathanial Hartman, Brett Granberg
Data extraction from form images

Patent number: 12014560

Abstract: An image processing system accesses an image of a completed form document. The image of the form document includes one or more features, such as form text, at particular locations within the image. The image processing system accesses a template of the form document and computes a rotation and zoom of the image of the form document relative to the template of the form document based on the locations of the features within the image of the form document relative to the locations of the corresponding features within the template of the form document. The image processing system performs a rotation operation and a zoom operation on the image of the form document, and extracts data entered into fields of the modified image of the form document. The extracted data can be then accessed or stored for subsequent use.

Type: Grant

Filed: April 18, 2023

Date of Patent: June 18, 2024

Assignee: ZENPAYROLL, INC.

Inventor: Quentin Louis Raoul Balin
Information processing apparatus, control method, and program

Patent number: 11966435

Abstract: An information processing apparatus (2000) displays, on a display apparatus (60), a first display (30) that represents a partial image (14) where an object contained therein was not recognized as a product. The information processing apparatus (2000) receives input for selecting one or more first displays (30). The information processing apparatus (2000), upon receiving a predetermined input from a user, ends receiving selection of a first display (30). The information processing apparatus (2000) stores, in a storage apparatus (120), product identification information input to a product information input area (54) and feature information based on a partial image(s) (14) corresponding to a selected first display(s) in association with each other.

Type: Grant

Filed: March 1, 2019

Date of Patent: April 23, 2024

Assignees: NEC CORPORATION, NEC Solution Innovators, Ltd.

Inventors: Yaeko Yonezawa, Akiko Kubo, Hiroki Iiduka
Image processing apparatus, image processing method, and non-transitory storage medium for determining extraction target pixel

Patent number: 11948342

Abstract: A first binary image is generated by binarizing an input image based on a threshold, a second binary image is generated by changing a pixel that has predetermined high luminance in the input image into a black pixel, and whether a black pixel cluster in the second binary image is made to be an extraction target is determined based on a position of a character image identified based on a black pixel cluster in the first binary image, and a position of the black pixel cluster in the second binary image.

Type: Grant

Filed: June 30, 2021

Date of Patent: April 2, 2024

Assignee: CANON KABUSHIKI KAISHA

Inventor: Satoru Yamanaka
System and method for recreating image with repeating patterns of graphical image file to reduce storage space

Patent number: 11915389

Abstract: A system may include a computer readable medium and a processor communicatively coupled to the computer readable medium. The processor may be configured to: obtain a graphical image file, the graphical image file including an image, wherein the image includes at least one sequence of repeating pattern elements, each of the at least one sequence including the repeating pattern elements that are repeated along a linear direction; and convert the graphical image file to at least one file including hardware directives that when executed cause a recreation of the image of the graphical image file to be drawn, wherein a file size of the at least one file is smaller than the graphical image file.

Type: Grant

Filed: November 12, 2021

Date of Patent: February 27, 2024

Assignee: Rockwell Collins, Inc.

Inventors: Jeff M. Henry, Kyle R. Peters, Reed A. Kovach
Methods and systems for colour-based image analysis and search

Patent number: 11907992

Abstract: Computer-implemented methods and systems for colour-based image tagging and colour-based searching. The method may include identifying, using image analysis, one or more dominant colours of a product based on an image of the product and receiving selection of at least one of the one or more dominant colours. In response to receiving the selection of the at least one of the one or more dominant colours, a search for products matching the at least one of the one or more dominant colours may be initiated to obtain one or more results of the searching, the one or more results including at least one product matching the at least one of the one or more dominant colours.

Type: Grant

Filed: April 4, 2022

Date of Patent: February 20, 2024

Assignee: Shopify Inc.

Inventors: Niklas Itaenen, Kshetrajna Raghavan, Xiaoxiao Li, Kyle Bruce Tate, Siphumelele Langeni, Peng Yu
Document image analysis apparatus, document image analysis method and program thereof

Patent number: 11900644

Abstract: Disclosed herein is a document image analysis apparatus including: a document image acquisition unit configured to acquire a document image; a region detection unit configured to detect a plurality of regions from the document image acquired by the document image acquisition unit; a clustering unit configured to cluster the plurality of regions detected by the region detection unit to integrate into a cluster; and a reading order assignment unit configured to assign a reading order to a plurality of regions belonging to the cluster within the cluster integrated by the clustering unit.

Type: Grant

Filed: October 31, 2019

Date of Patent: February 13, 2024

Assignee: Rakuten Group, Inc.

Inventors: Simona Maggio, Alois De La Comble, Ken Prepin
Method and apparatus for recognizing imaged information-bearing medium, computer device and medium

Patent number: 11893765

Abstract: A method and apparatus for recognizing an imaged information-bearing medium, a computer-readable storage device and a computer device are provided. The method comprising: acquiring a first image of the imaged information-bearing medium; performing text recognition on the first image to acquire a text content of the imaged information-bearing medium; classifying the imaged information-bearing medium to acquire a type of the imaged information-bearing medium; and archiving the text content according to the type.

Type: Grant

Filed: May 20, 2020

Date of Patent: February 6, 2024

Assignee: BOE TECHNOLOGY GROUP CO., LTD.

Inventors: Guangwei Huang, Ruibin Xue, Bingchuan Shi, Yue Li, Jibo Zhao
Self-supervised document representation learning

Patent number: 11886815

Abstract: One example method involves operations for a processing device that include receiving, by a machine learning model trained to generate a search result, a search query for a text input. The machine learning model is trained by receiving pre-training data that includes multiple documents. Pre-training the machine learning model by generating, using an encoder, feature embeddings for each of the documents included in the pre-training data. The feature embeddings are generated by applying a masking function to visual and textual features in the documents. Training the machine learning model also includes generating, using the feature embeddings, output features for the documents by concatenating the feature embeddings and applying a non-linear mapping to the feature embeddings. Training the machine learning model further includes applying a linear classifier to the output features. Additionally, operations include generating, for display, a search result using the machine learning model based on the input.

Type: Grant

Filed: May 28, 2021

Date of Patent: January 30, 2024

Assignee: ADOBE INC.

Inventors: Jiuxiang Gu, Vlad Morariu, Varun Manjunatha, Tong Sun, Rajiv Jain, Peizhao Li, Jason Kuen, Handong Zhao
Parallel object analysis for efficiently generating layouts in digital design documents

Patent number: 11829703

Abstract: This disclosure covers methods, non-transitory computer readable media, and systems analyze a digital design document having an initial layout of digital objects and automatically generate candidate layouts by concurrently performing operations on the digital objects within the initial layout. By iteratively performing concurrent operations, in some implementations, the methods, non-transitory computer readable media, and systems produce multiple candidate layouts that the systems evaluate by generating design scores. Based on a comparison of such design scores, the methods, non-transitory computer readable media, and systems generate one or more modified layouts (from among the candidate layouts) for presentation to a user.

Type: Grant

Filed: January 9, 2018

Date of Patent: November 28, 2023

Assignee: Adobe Inc.

Inventors: Vineet Batra, Ankit Phogat, Tarun Beri
3D object camera customization system

Patent number: 11823341

Abstract: Systems and methods are provided for capturing by a camera of a user device, a first image depicting a first environment of the user device; overlaying a first virtual object on a portion of the first image depicting the first environment; modifying a surface of the first virtual object using content captured by the user device; storing a second virtual object comprising the first virtual object with the modified surface; and generating for display the second virtual object on a portion of a second image depicting a second environment.

Type: Grant

Filed: August 4, 2022

Date of Patent: November 21, 2023

Assignee: Snap Inc.

Inventors: Samuel Edward Hare, Andrew James McPhee, Maxim Maximov Lazarov, Wentao Shang, Kyle Goodrich, Tony Mathew
Automated communication design construction system

Patent number: 11816911

Abstract: An automated communication design analysis and construction system that includes one or more intelligent communication design servers, comprising: a normalization module that converts communication content files for different recipients to normalized intermediate format files; an objects identification and quantification module that identifies text objects and image objects in the normalized intermediate format files; a cross-recipient group analysis module configured to identify static global objects that are invariant between recipients, data variables, and variable global objects that vary between recipients in the normalized intermediate format files; and an intelligent communication content learning and constructing engine that can construct standard communication design files based on the static global objects, the data variables, and the variable global objects. A data storage stores the communication content files and the standard communication design files.

Type: Grant

Filed: January 21, 2022

Date of Patent: November 14, 2023

Assignee: Shutterfly, LLC

Inventors: Aaron P. Reihl, Sairam Vangapally, Aaron Gregory Rasset
System and method for determination of label values in unstructured documents

Patent number: 11810383

Abstract: This disclosure relates generally to method and system for determining label value for labels in unstructured documents. Typical systems have challenge in understanding variations in layout of unstructured documents and extract information therefrom. The disclosed method and system facilitate systematically identifying sections and bounding boxes in the page images, taking image portion of the bounding boxes and extracting labels and label values therefrom. In case the label values are not present in the same bounding box having the label, the neighboring labels are examined for the matching label values. The system also obtains label-label value pairs from the document by utilizing a trained deep learning model, and compares the output with the label-label value pairs extracted earlier. An aggregated confidence score is assigned to the text in the bounding box.

Type: Grant

Filed: November 20, 2020

Date of Patent: November 7, 2023

Assignee: TATA CONSULTANCY SERVICES LIMITED

Inventors: Devang Jagdishchandra Patel, Prabhat Ranjan Mishra, Ketkee Pandit, Ankita Gupta, Chirabrata Bhaumik, Dinesh Yadav, Amit Kumar Agrawal
Document spatial layout feature extraction to simplify template classification

Patent number: 11804056

Abstract: Image encoded documents are identified by recognizing known objects in each document with an object recognizer. The objects in each page are filtered to remove lower order objects. Known features in the objects are recognized by sequentially organizing each object in each filtered page into a one-dimensional array, where each object is positioned in a corresponding one-dimensional array as a function of location in the corresponding filtered page. The one-dimensional array is then compared to known arrays to classify the image document corresponding to the one-dimensional array.

Type: Grant

Filed: May 30, 2022

Date of Patent: October 31, 2023

Assignee: Automation Anywhere, Inc.

Inventors: Michael Sundell, Vibhas Gejji
Digital content design system using baseline units to control arrangement and sizing of digital content

Patent number: 11768992

Abstract: Digital content design system techniques are described using baseline units to control arrangement and sizing of digital content. In one example, a digital content design system receives a user input specifying a number of baselines to be included within an available display area of a page. Baselines are used to align digital content to control arrangement of the digital content within the page, e.g., text. From this, the digital content design system then calculates a baseline unit from a distance used to space adjacent baselines of the number of baselines from each other. This baseline unit is then leveraged by the system as a fundamental unit of measure to control arrangement and/or sizing of digital content in relation to each other.

Type: Grant

Filed: May 5, 2020

Date of Patent: September 26, 2023

Assignee: Adobe Inc.

Inventors: Aman Arora, Rohit Kumar Dubey, Anurag Singh
Image segmentation confidence determination

Patent number: 11763460

Abstract: Examples for determining a confidence level associated with image segmentation are disclosed. A confidence level associated with a collective image segmentation result can be determined by generating multiple individual segmentation results each from the same image data. These examples can then aggregate the individual segmentation results to form the collective image segmentation result and measure the spread of each individual segmentation result from the collective image segmentation result. The measured spread of each individual segmentation result can then be used to determine the confidence level associated with the collective image segmentation result. This can allow a confidence level associated with the collective image segmentation result to be determined. This confidence level may be determined without needing a ground truth to compare to the collective image segmentation result.

Type: Grant

Filed: April 30, 2021

Date of Patent: September 19, 2023

Inventors: Jonathan Tung, Jung W Suh, Advit Bhatt
Systems for generating snap guides relative to glyphs of editable text

Patent number: 11755817

Abstract: In implementations of systems for generating snap guides relative to glyphs of editable text rendered in a user interface using a font, a computing device implements a snap guide system to receive input data describing a position of a cursor relative to the glyphs of the editable text in the user interface. The glyphs of the editable text are enclosed within a bounding box having a height that is less than a height of an em-box of the font. The snap guide system generates a first group of snap guides for the glyphs of the editable text which includes a snap guide for each side of the bounding box and a snap guide for an x-height of the font. The snap guide system generates an indication of a particular snap guide of the first group of snap guides for display in the user interface based on the position of the cursor.

Type: Grant

Filed: August 2, 2021

Date of Patent: September 12, 2023

Assignee: Adobe Inc.

Inventors: Praveen Kumar Dhanuka, Arushi Jain, Shivi Pal
System and method for recreating image with repeating patterns of graphical image file to reduce storage space

Patent number: 11741573

Abstract: A system may include a computer readable medium and a processor communicatively coupled to the computer readable medium. The processor may be configured to: obtain a graphical image file, the graphical image file including an image, wherein the image includes at least one sequence of repeating pattern elements, each of the at least one sequence including the repeating pattern elements that are repeated along a linear direction; and convert the graphical image file to at least one file including hardware directives that when executed cause a recreation of the image of the graphical image file to be drawn, wherein a file size of the at least one file is smaller than the graphical image file.

Type: Grant

Filed: November 12, 2021

Date of Patent: August 29, 2023

Assignee: Rockwell Collins, Inc.

Inventors: Jeff M. Henry, Kyle R. Peters, Reed A. Kovach
Image search in walkthrough videos

Patent number: 11734338

Abstract: A spatial indexing system receives a set of walkthrough videos of an environment taken over a period of time and receives an image search query that includes an image of an object. The spatial indexing system searches the set of walkthrough videos for instances of the object. The spatial indexing system presents search results in a user interface, displaying in a first portion a 2D map associated with one walkthrough video with marked locations of instances of the object and a second portion with a histogram of instances of the object over time in the set of walkthrough videos.

Type: Grant

Filed: May 30, 2022

Date of Patent: August 22, 2023

Assignee: OPEN SPACE LABS, INC.

Inventors: Michael Ben Fleischman, Gabriel Hein, Thomas Friel Allen, Philip DeCamp
Methods for mobile image capture of vehicle identification numbers in a non-document

Patent number: 11734938

Abstract: Various embodiments disclosed herein are directed to methods of capturing Vehicle Identification Numbers (VIN) from images captured by a mobile device. Capturing VIN data can be useful in several applications, for example, insurance data capture applications. There are at least two types of images supported by this technology: (1) images of documents and (2) images of non-documents.

Type: Grant

Filed: May 31, 2022

Date of Patent: August 22, 2023

Assignee: MITEK SYSTEMS, INC.

Inventors: Grigori Nepomniachtchi, Nikolay Kotovich
Method and apparatus for image recognition in mobile communication device to identify and weigh items

Patent number: 11727678

Abstract: In some embodiments, a method can include executing a first model to extract a first region of interest (ROI) image and a second ROI image from an image that shows an item and an indication of information associated to the item. The first ROI image can include a portion of the image showing the item and the second ROI image can include a portion of the image showing the indication of information. The method can further include executing a second model to identify the item from the first ROI image and generate a representation of the item. The method can further include executing a third model to read the indication of information associated to the item from the second ROI image and generate a representation of information.

Type: Grant

Filed: October 30, 2020

Date of Patent: August 15, 2023

Assignee: Tiliter Pty Ltd.

Inventors: Marcel Herz, Christopher Bradley Rodney Sampson
Apparatus for detecting contextually-anomalous sentence in document, method therefor, and computer-readable recording medium having program for performing same method recorded thereon

Patent number: 11727703

Abstract: Disclosed are an apparatus and a method for detecting whether an anomalous sentence having a context different from that of other sentences exists in a document. The apparatus for detecting a contextually-anomalous sentence in a document according to the present invention includes: a sentence encoder for encoding individual sentences constituting document data by means of a predetermined rule (function) to generate encoding vectors; a context embedder neural network for converting the generated encoding vector into embedding vectors corresponding thereto; and a context anomaly detector neural network for detecting whether an anomalous sentence exists in the converted document data.

Type: Grant

Filed: November 14, 2019

Date of Patent: August 15, 2023

Assignee: ESTSOFT CORP.

Inventors: Hyeong Jin Byeon, Min Gwan Seo, Hae Bin Shin
Interactive technique for using a user-provided image of a document to collect information

Patent number: 11727316

Abstract: In a collection technique, a user (such as a taxpayer) provides information (such as income-tax information) by submitting an image of a document, such as an income-tax summary or form. In particular, the user may provide a description of the document. In response, the user is prompted for the information associated with the field in the document. Then, the user provides the image of a region in the document that includes the field. Based on the image, the information is extracted, and the field in the form is populated using the extracted information. The prompting, receiving, extracting and populating operations may be repeated for one or more additional fields in the document.

Type: Grant

Filed: August 7, 2020

Date of Patent: August 15, 2023

Assignee: INTUIT, INC.

Inventors: Amir Eftekhari, Alan Tifford

1 2 3 4 5 … next