Distinguishing Text From Other Regions Patents (Class 382/176)

Image processing apparatus, image processing method, and non-transitory storage medium for determining extraction target pixel

Patent number: 11948342

Abstract: A first binary image is generated by binarizing an input image based on a threshold, a second binary image is generated by changing a pixel that has predetermined high luminance in the input image into a black pixel, and whether a black pixel cluster in the second binary image is made to be an extraction target is determined based on a position of a character image identified based on a black pixel cluster in the first binary image, and a position of the black pixel cluster in the second binary image.

Type: Grant

Filed: June 30, 2021

Date of Patent: April 2, 2024

Assignee: CANON KABUSHIKI KAISHA

Inventor: Satoru Yamanaka
System and method for recreating image with repeating patterns of graphical image file to reduce storage space

Patent number: 11915389

Abstract: A system may include a computer readable medium and a processor communicatively coupled to the computer readable medium. The processor may be configured to: obtain a graphical image file, the graphical image file including an image, wherein the image includes at least one sequence of repeating pattern elements, each of the at least one sequence including the repeating pattern elements that are repeated along a linear direction; and convert the graphical image file to at least one file including hardware directives that when executed cause a recreation of the image of the graphical image file to be drawn, wherein a file size of the at least one file is smaller than the graphical image file.

Type: Grant

Filed: November 12, 2021

Date of Patent: February 27, 2024

Assignee: Rockwell Collins, Inc.

Inventors: Jeff M. Henry, Kyle R. Peters, Reed A. Kovach
Methods and systems for colour-based image analysis and search

Patent number: 11907992

Abstract: Computer-implemented methods and systems for colour-based image tagging and colour-based searching. The method may include identifying, using image analysis, one or more dominant colours of a product based on an image of the product and receiving selection of at least one of the one or more dominant colours. In response to receiving the selection of the at least one of the one or more dominant colours, a search for products matching the at least one of the one or more dominant colours may be initiated to obtain one or more results of the searching, the one or more results including at least one product matching the at least one of the one or more dominant colours.

Type: Grant

Filed: April 4, 2022

Date of Patent: February 20, 2024

Assignee: Shopify Inc.

Inventors: Niklas Itaenen, Kshetrajna Raghavan, Xiaoxiao Li, Kyle Bruce Tate, Siphumelele Langeni, Peng Yu
Document image analysis apparatus, document image analysis method and program thereof

Patent number: 11900644

Abstract: Disclosed herein is a document image analysis apparatus including: a document image acquisition unit configured to acquire a document image; a region detection unit configured to detect a plurality of regions from the document image acquired by the document image acquisition unit; a clustering unit configured to cluster the plurality of regions detected by the region detection unit to integrate into a cluster; and a reading order assignment unit configured to assign a reading order to a plurality of regions belonging to the cluster within the cluster integrated by the clustering unit.

Type: Grant

Filed: October 31, 2019

Date of Patent: February 13, 2024

Assignee: Rakuten Group, Inc.

Inventors: Simona Maggio, Alois De La Comble, Ken Prepin
Method and apparatus for recognizing imaged information-bearing medium, computer device and medium

Patent number: 11893765

Abstract: A method and apparatus for recognizing an imaged information-bearing medium, a computer-readable storage device and a computer device are provided. The method comprising: acquiring a first image of the imaged information-bearing medium; performing text recognition on the first image to acquire a text content of the imaged information-bearing medium; classifying the imaged information-bearing medium to acquire a type of the imaged information-bearing medium; and archiving the text content according to the type.

Type: Grant

Filed: May 20, 2020

Date of Patent: February 6, 2024

Assignee: BOE TECHNOLOGY GROUP CO., LTD.

Inventors: Guangwei Huang, Ruibin Xue, Bingchuan Shi, Yue Li, Jibo Zhao
Self-supervised document representation learning

Patent number: 11886815

Abstract: One example method involves operations for a processing device that include receiving, by a machine learning model trained to generate a search result, a search query for a text input. The machine learning model is trained by receiving pre-training data that includes multiple documents. Pre-training the machine learning model by generating, using an encoder, feature embeddings for each of the documents included in the pre-training data. The feature embeddings are generated by applying a masking function to visual and textual features in the documents. Training the machine learning model also includes generating, using the feature embeddings, output features for the documents by concatenating the feature embeddings and applying a non-linear mapping to the feature embeddings. Training the machine learning model further includes applying a linear classifier to the output features. Additionally, operations include generating, for display, a search result using the machine learning model based on the input.

Type: Grant

Filed: May 28, 2021

Date of Patent: January 30, 2024

Assignee: ADOBE INC.

Inventors: Jiuxiang Gu, Vlad Morariu, Varun Manjunatha, Tong Sun, Rajiv Jain, Peizhao Li, Jason Kuen, Handong Zhao
Parallel object analysis for efficiently generating layouts in digital design documents

Patent number: 11829703

Abstract: This disclosure covers methods, non-transitory computer readable media, and systems analyze a digital design document having an initial layout of digital objects and automatically generate candidate layouts by concurrently performing operations on the digital objects within the initial layout. By iteratively performing concurrent operations, in some implementations, the methods, non-transitory computer readable media, and systems produce multiple candidate layouts that the systems evaluate by generating design scores. Based on a comparison of such design scores, the methods, non-transitory computer readable media, and systems generate one or more modified layouts (from among the candidate layouts) for presentation to a user.

Type: Grant

Filed: January 9, 2018

Date of Patent: November 28, 2023

Assignee: Adobe Inc.

Inventors: Vineet Batra, Ankit Phogat, Tarun Beri
3D object camera customization system

Patent number: 11823341

Abstract: Systems and methods are provided for capturing by a camera of a user device, a first image depicting a first environment of the user device; overlaying a first virtual object on a portion of the first image depicting the first environment; modifying a surface of the first virtual object using content captured by the user device; storing a second virtual object comprising the first virtual object with the modified surface; and generating for display the second virtual object on a portion of a second image depicting a second environment.

Type: Grant

Filed: August 4, 2022

Date of Patent: November 21, 2023

Assignee: Snap Inc.

Inventors: Samuel Edward Hare, Andrew James McPhee, Maxim Maximov Lazarov, Wentao Shang, Kyle Goodrich, Tony Mathew
Automated communication design construction system

Patent number: 11816911

Abstract: An automated communication design analysis and construction system that includes one or more intelligent communication design servers, comprising: a normalization module that converts communication content files for different recipients to normalized intermediate format files; an objects identification and quantification module that identifies text objects and image objects in the normalized intermediate format files; a cross-recipient group analysis module configured to identify static global objects that are invariant between recipients, data variables, and variable global objects that vary between recipients in the normalized intermediate format files; and an intelligent communication content learning and constructing engine that can construct standard communication design files based on the static global objects, the data variables, and the variable global objects. A data storage stores the communication content files and the standard communication design files.

Type: Grant

Filed: January 21, 2022

Date of Patent: November 14, 2023

Assignee: Shutterfly, LLC

Inventors: Aaron P. Reihl, Sairam Vangapally, Aaron Gregory Rasset
System and method for determination of label values in unstructured documents

Patent number: 11810383

Abstract: This disclosure relates generally to method and system for determining label value for labels in unstructured documents. Typical systems have challenge in understanding variations in layout of unstructured documents and extract information therefrom. The disclosed method and system facilitate systematically identifying sections and bounding boxes in the page images, taking image portion of the bounding boxes and extracting labels and label values therefrom. In case the label values are not present in the same bounding box having the label, the neighboring labels are examined for the matching label values. The system also obtains label-label value pairs from the document by utilizing a trained deep learning model, and compares the output with the label-label value pairs extracted earlier. An aggregated confidence score is assigned to the text in the bounding box.

Type: Grant

Filed: November 20, 2020

Date of Patent: November 7, 2023

Assignee: TATA CONSULTANCY SERVICES LIMITED

Inventors: Devang Jagdishchandra Patel, Prabhat Ranjan Mishra, Ketkee Pandit, Ankita Gupta, Chirabrata Bhaumik, Dinesh Yadav, Amit Kumar Agrawal
Document spatial layout feature extraction to simplify template classification

Patent number: 11804056

Abstract: Image encoded documents are identified by recognizing known objects in each document with an object recognizer. The objects in each page are filtered to remove lower order objects. Known features in the objects are recognized by sequentially organizing each object in each filtered page into a one-dimensional array, where each object is positioned in a corresponding one-dimensional array as a function of location in the corresponding filtered page. The one-dimensional array is then compared to known arrays to classify the image document corresponding to the one-dimensional array.

Type: Grant

Filed: May 30, 2022

Date of Patent: October 31, 2023

Assignee: Automation Anywhere, Inc.

Inventors: Michael Sundell, Vibhas Gejji
Digital content design system using baseline units to control arrangement and sizing of digital content

Patent number: 11768992

Abstract: Digital content design system techniques are described using baseline units to control arrangement and sizing of digital content. In one example, a digital content design system receives a user input specifying a number of baselines to be included within an available display area of a page. Baselines are used to align digital content to control arrangement of the digital content within the page, e.g., text. From this, the digital content design system then calculates a baseline unit from a distance used to space adjacent baselines of the number of baselines from each other. This baseline unit is then leveraged by the system as a fundamental unit of measure to control arrangement and/or sizing of digital content in relation to each other.

Type: Grant

Filed: May 5, 2020

Date of Patent: September 26, 2023

Assignee: Adobe Inc.

Inventors: Aman Arora, Rohit Kumar Dubey, Anurag Singh
Image segmentation confidence determination

Patent number: 11763460

Abstract: Examples for determining a confidence level associated with image segmentation are disclosed. A confidence level associated with a collective image segmentation result can be determined by generating multiple individual segmentation results each from the same image data. These examples can then aggregate the individual segmentation results to form the collective image segmentation result and measure the spread of each individual segmentation result from the collective image segmentation result. The measured spread of each individual segmentation result can then be used to determine the confidence level associated with the collective image segmentation result. This can allow a confidence level associated with the collective image segmentation result to be determined. This confidence level may be determined without needing a ground truth to compare to the collective image segmentation result.

Type: Grant

Filed: April 30, 2021

Date of Patent: September 19, 2023

Inventors: Jonathan Tung, Jung W Suh, Advit Bhatt
Systems for generating snap guides relative to glyphs of editable text

Patent number: 11755817

Abstract: In implementations of systems for generating snap guides relative to glyphs of editable text rendered in a user interface using a font, a computing device implements a snap guide system to receive input data describing a position of a cursor relative to the glyphs of the editable text in the user interface. The glyphs of the editable text are enclosed within a bounding box having a height that is less than a height of an em-box of the font. The snap guide system generates a first group of snap guides for the glyphs of the editable text which includes a snap guide for each side of the bounding box and a snap guide for an x-height of the font. The snap guide system generates an indication of a particular snap guide of the first group of snap guides for display in the user interface based on the position of the cursor.

Type: Grant

Filed: August 2, 2021

Date of Patent: September 12, 2023

Assignee: Adobe Inc.

Inventors: Praveen Kumar Dhanuka, Arushi Jain, Shivi Pal
System and method for recreating image with repeating patterns of graphical image file to reduce storage space

Patent number: 11741573

Abstract: A system may include a computer readable medium and a processor communicatively coupled to the computer readable medium. The processor may be configured to: obtain a graphical image file, the graphical image file including an image, wherein the image includes at least one sequence of repeating pattern elements, each of the at least one sequence including the repeating pattern elements that are repeated along a linear direction; and convert the graphical image file to at least one file including hardware directives that when executed cause a recreation of the image of the graphical image file to be drawn, wherein a file size of the at least one file is smaller than the graphical image file.

Type: Grant

Filed: November 12, 2021

Date of Patent: August 29, 2023

Assignee: Rockwell Collins, Inc.

Inventors: Jeff M. Henry, Kyle R. Peters, Reed A. Kovach
Image search in walkthrough videos

Patent number: 11734338

Abstract: A spatial indexing system receives a set of walkthrough videos of an environment taken over a period of time and receives an image search query that includes an image of an object. The spatial indexing system searches the set of walkthrough videos for instances of the object. The spatial indexing system presents search results in a user interface, displaying in a first portion a 2D map associated with one walkthrough video with marked locations of instances of the object and a second portion with a histogram of instances of the object over time in the set of walkthrough videos.

Type: Grant

Filed: May 30, 2022

Date of Patent: August 22, 2023

Assignee: OPEN SPACE LABS, INC.

Inventors: Michael Ben Fleischman, Gabriel Hein, Thomas Friel Allen, Philip DeCamp
Methods for mobile image capture of vehicle identification numbers in a non-document

Patent number: 11734938

Abstract: Various embodiments disclosed herein are directed to methods of capturing Vehicle Identification Numbers (VIN) from images captured by a mobile device. Capturing VIN data can be useful in several applications, for example, insurance data capture applications. There are at least two types of images supported by this technology: (1) images of documents and (2) images of non-documents.

Type: Grant

Filed: May 31, 2022

Date of Patent: August 22, 2023

Assignee: MITEK SYSTEMS, INC.

Inventors: Grigori Nepomniachtchi, Nikolay Kotovich
Method and apparatus for image recognition in mobile communication device to identify and weigh items

Patent number: 11727678

Abstract: In some embodiments, a method can include executing a first model to extract a first region of interest (ROI) image and a second ROI image from an image that shows an item and an indication of information associated to the item. The first ROI image can include a portion of the image showing the item and the second ROI image can include a portion of the image showing the indication of information. The method can further include executing a second model to identify the item from the first ROI image and generate a representation of the item. The method can further include executing a third model to read the indication of information associated to the item from the second ROI image and generate a representation of information.

Type: Grant

Filed: October 30, 2020

Date of Patent: August 15, 2023

Assignee: Tiliter Pty Ltd.

Inventors: Marcel Herz, Christopher Bradley Rodney Sampson
Interactive technique for using a user-provided image of a document to collect information

Patent number: 11727316

Abstract: In a collection technique, a user (such as a taxpayer) provides information (such as income-tax information) by submitting an image of a document, such as an income-tax summary or form. In particular, the user may provide a description of the document. In response, the user is prompted for the information associated with the field in the document. Then, the user provides the image of a region in the document that includes the field. Based on the image, the information is extracted, and the field in the form is populated using the extracted information. The prompting, receiving, extracting and populating operations may be repeated for one or more additional fields in the document.

Type: Grant

Filed: August 7, 2020

Date of Patent: August 15, 2023

Assignee: INTUIT, INC.

Inventors: Amir Eftekhari, Alan Tifford
Apparatus for detecting contextually-anomalous sentence in document, method therefor, and computer-readable recording medium having program for performing same method recorded thereon

Patent number: 11727703

Abstract: Disclosed are an apparatus and a method for detecting whether an anomalous sentence having a context different from that of other sentences exists in a document. The apparatus for detecting a contextually-anomalous sentence in a document according to the present invention includes: a sentence encoder for encoding individual sentences constituting document data by means of a predetermined rule (function) to generate encoding vectors; a context embedder neural network for converting the generated encoding vector into embedding vectors corresponding thereto; and a context anomaly detector neural network for detecting whether an anomalous sentence exists in the converted document data.

Type: Grant

Filed: November 14, 2019

Date of Patent: August 15, 2023

Assignee: ESTSOFT CORP.

Inventors: Hyeong Jin Byeon, Min Gwan Seo, Hae Bin Shin
Validation method and system to improve data accuracy

Patent number: 11720961

Abstract: An automated method and system for validating (cross-validating) data fields in an electronic document, such as a document that has been passed through an optical character recognition (“OCR”) or Intelligent Document Recognition (“IDR”) system or software, to improve accuracy of the electronic document.

Type: Grant

Filed: August 30, 2021

Date of Patent: August 8, 2023

Assignee: SOFTWORKS AI, LLC

Inventors: Ari Gross, Matthew Joshua Khan Persad, Yunhao Shi, Perry Kangoun, Talya Klein
Display device and system

Patent number: 11722650

Abstract: An image processing engine and method of forming a hologram of a target image for projection using data streaming. An input or primary image is sub-sampled using a kernel and the secondary image output used to generate a hologram of the target image. A technique of kernel sub-sampling using a plurality of two or more data streams provides improvements in efficiency, including reduced data storage requirements and increased processing speed.

Type: Grant

Filed: April 30, 2021

Date of Patent: August 8, 2023

Assignee: ENVISICS LTD

Inventor: Stig Mikael Collin
Text recognition for a neural network

Patent number: 11710304

Abstract: Image data having text associated with a plurality of text-field types is received, the image data including target image data and context image data. The target image data including target text associated with a text-field type. The context image data providing a context for the target image data. A trained neural network that is constrained to a set of characters for the text-field type is applied to the image data. The trained neural network identifies the target text of the text-field type using a vector embedding that is based on learned patterns for recognizing the context provided by the context image data. One or more predicted characters are provided for the target text of the text-field type in response to identifying the target text using the trained neural network.

Type: Grant

Filed: August 23, 2022

Date of Patent: July 25, 2023

Assignee: BILL.COM, LLC

Inventor: Eitan Anzenberg
Systems and methods for digitized document image data spillage recovery

Patent number: 11704925

Abstract: Systems and methods for digitized document image data spillage recovery are provided. One or more memories may be coupled to one or more processors, the one or more memories including instructions operable to be executed by the one or more processors. The one or more processors may be configured to capture an image; process the image through at least a first pass to generate a first contour; remove a preprinted bounding region of the first contour to retain text; generate one or more pixel blobs by applying one or more filters to smudge the text; identify the one or more pixel blobs that straddle one or more boundaries of the first contour; resize the first contour to enclose spillage of the one or more pixel blobs; overlay the text from the image within the resized contour; and apply pixel masking to the resized contour.

Type: Grant

Filed: February 18, 2021

Date of Patent: July 18, 2023

Assignee: CAPITAL ONE SERVICES, LLC

Inventor: Douglas Slattery
Image analysis based document processing for inference of key-value pairs in non-fixed digital documents

Patent number: 11699297

Abstract: An online system extracts information from non-fixed form documents. The online system receives an image of a form document and obtains a set of phrases and locations of the set of phrases on the form image. For at least one field, the online system determines key scores for the set of phrases. The online system identifies a set of candidate values for the field from the set of identified phrases and identifies a set of neighbors for each candidate value from the set of identified phrases. The online system determines neighbor scores, where a neighbor score for a candidate value and a respective neighbor is determined based on the key score for the neighbor and a spatial relationship of the neighbor to the candidate value. The online system selects a candidate value and a respective neighbor based on the neighbor score as the value and key for the field.

Type: Grant

Filed: January 4, 2021

Date of Patent: July 11, 2023

Assignee: Salesforce, Inc.

Inventors: Mingfei Gao, Zeyuan Chen, Le Xue, Ran Xu, Caiming Xiong
Image processing apparatus, imaging device, moving body device control system, image processing method, and program product

Patent number: 11691585

Abstract: An image processing apparatus includes one or more processors; and a memory, the memory storing instructions, which when executed by the one or more processors, cause the one or more processors to generate vertical direction distribution data indicating a frequency distribution of distance values with respect to a vertical direction of a range image, from the range image having distance values according to distance of a road surface in a plurality of captured images captured by a plurality of imaging parts; set a search range corresponding to a predetermined reference point in the vertical direction distribution data and extract a plurality of pixels from the search range; and detect a road surface, based on the plurality of extracted pixels.

Type: Grant

Filed: September 5, 2018

Date of Patent: July 4, 2023

Assignee: RICOH COMPANY, LTD.

Inventor: Naoki Motohashi
Optical character recognition method and apparatus, electronic device and storage medium

Patent number: 11694461

Abstract: The present application discloses a method and an apparatus for optical character recognition, an electronic device and a storage medium, and relates to the fields of artificial intelligence and deep learning. The method may include: determining, for a to-be-recognized image, a text bounding box of a text area therein, and extracting a text area image from the to-be-recognized image according to the text bounding box; determining a bounding box of text lines in the text area image, and extracting a text-line image from the text area image according to the bounding box; and performing text sequence recognition on the text-line image, and obtaining a recognition result. The application of the solution in the present application can improve a recognition speed and the like.

Type: Grant

Filed: March 11, 2021

Date of Patent: July 4, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Mengyi En, Shanshan Liu, Xuan Li, Chengquan Zhang, Hailun Xu, Xiaoqiang Zhang
Systems and methods for processing a stream of data values using data value subset groups

Patent number: 11688047

Abstract: Data processing systems (e.g. image processing systems) and methods are provided for processing a stream of data values (e.g. pixel values). In each of a plurality of iterations, a respective particular data value of the stream is processed by operating on a respective particular subset of data values of the stream. In each iteration: group indication data for at least one group is retrieved and used to define a set of groups into which data values within the particular subset can be grouped; each of the data values within the particular subset is grouped into one of the groups of the set of groups; the particular data value is processed using one or more of the data values of the particular subset in dependence on the classification of the data values into the groups; and group indication data is stored for a group, for use in a subsequent iteration.

Type: Grant

Filed: August 25, 2021

Date of Patent: June 27, 2023

Assignee: Imagination Technologies Limited

Inventor: Timothy Lee
Systems and methods for generating medical image data for longitudinal studies

Patent number: 11682145

Abstract: In a method for generating synthetic medical image data, first image data of an object under examination including a first value for a property is acquired, second image data of the object under examination including a second value for the property is acquired, the second value of the property of the second image data is matched to the first value to modify the second image data to generate synthetic image data, and the synthetic image data is provided (e.g. in electronic form as a data file). The first image data can be captured with a first magnetic resonance device at a first point in time, and the second image data can be captured with a second magnetic resonance device at a second point in time.

Type: Grant

Filed: December 18, 2020

Date of Patent: June 20, 2023

Assignee: Siemens Healthcare GmbH

Inventor: Mario Zeller
Information processing apparatus and non-transitory computer readable medium

Patent number: 11682224

Abstract: An information processing apparatus includes a memory and a processor configured to acquire an image of a digitized document and execute a first verification process by using image processing using artificial intelligence. The first verification process verifies whether a first requirement is satisfied. The first requirement is a specific requirement among multiple requirements that are required when the acquired image of the document is stored. The processor is also configured to execute a second verification process by using a determination process not using the artificial intelligence. The second verification process verifies whether a second requirement among the multiple requirements is satisfied. The second requirement is other than the first requirement.

Type: Grant

Filed: January 15, 2021

Date of Patent: June 20, 2023

Assignee: FUJIFILM Business Innovation Corp.

Inventors: Michinori Masumoto, Yusuke Hariya
Scalable video coding system with parameter signaling

Patent number: 11677967

Abstract: A method is provided for encoding a digital video to provide for improved color mapping.

Type: Grant

Filed: April 21, 2016

Date of Patent: June 13, 2023

Assignee: ARRIS Enterprises LLC

Inventors: Koohyar Minoo, Zhouye Gu, David M. Baylon, Ajay Luthra
Machine learning systems and methods for automatically tagging documents to enable accessibility to impaired individuals

Patent number: 11675970

Abstract: Systems, methods, and products for auto tagging structured PDF documents that do not have accessibility tags. In one embodiment, structured PDF documents having accessibility tags are first parsed and analyzed to organize the visual components of the documents. The relationships of the identified objects to DOM elements (e.g., tags) are determined, and the objects and related DOM elements are stored in training files. The training files are used to train various classifiers. Untagged PDF documents are then parsed to identify included visual objects, and the classifiers are used to determine DOM elements that should be associated with visual objects identified in the untagged PDF documents. This information is used to construct a DOM structure corresponding to each untagged document. A new PDF is then generated corresponding to each untagged document using the generated DOM structure and visual object information.

Type: Grant

Filed: February 12, 2021

Date of Patent: June 13, 2023

Assignee: OPEN TEXT CORPORATION

Inventors: David Comeau, Jeffrey Williams, Evgeny Kolesnikov, Michael Itkin, June Qiang, James Relunia, Brian Sue
Information processing apparatus, system, display control method, and recording medium

Patent number: 11669534

Abstract: An information processing apparatus includes: a network interface to communicate with a server for managing content data generated during an event, the content data including at least text data converted from audio data collected during the event and screenshot data of a screen captured during the event; and circuitry to control a display to display one or more items of text data, and one or more images of screenshot data, side by side, in a temporal order.

Type: Grant

Filed: April 19, 2019

Date of Patent: June 6, 2023

Assignee: RICOH COMPANY, LTD.

Inventor: Takuro Mano
Efficient bounding box merging

Patent number: 11670102

Abstract: A system can merge text bounding boxes such as Optical Character Recognition (OCR) bounding boxes. A document can comprise a plurality of the text bounding boxes. Distance thresholds between text bounding boxes can be utilized for comparison against a distance threshold. Distance thresholds can vary depending on context information associated with the document. In response to a determination that text bounding boxes satisfy the distance threshold, the text bounding boxes can be assigned to a bounding box group.

Type: Grant

Filed: September 2, 2021

Date of Patent: June 6, 2023

Assignee: PayPal, Inc.

Inventor: Xiaodong Yu
Information processing apparatus and non-transitory computer readable medium for changing display order of recognition results based on previous checking order

Patent number: 11671540

Abstract: An information processing apparatus includes a processor. The processor is programmed to: control a display to display a plurality of recognition results, each recognition result being a recognition result of a document, the document having a plurality of items and an entry field for each item, each recognition result being displayed for each corresponding item of the document; acquire a checking order for each item, the checking order being an order in which each of the displayed recognition results has been checked by a user viewing the displayed recognition results; and change a display order by using the acquired checking order, the display order being an order in which to display a subsequent set of recognition result.

Type: Grant

Filed: March 26, 2020

Date of Patent: June 6, 2023

Assignee: FUJIFILM Business Innovation Corp.

Inventor: Shigekazu Sasagawa
Data extraction from form images

Patent number: 11663841

Abstract: An image processing system accesses an image of a completed form document. The image of the form document includes one or more features, such as form text, at particular locations within the image. The image processing system accesses a template of the form document and computes a rotation and zoom of the image of the form document relative to the template of the form document based on the locations of the features within the image of the form document relative to the locations of the corresponding features within the template of the form document. The image processing system performs a rotation operation and a zoom operation on the image of the form document, and extracts data entered into fields of the modified image of the form document. The extracted data can be then accessed or stored for subsequent use.

Type: Grant

Filed: August 15, 2022

Date of Patent: May 30, 2023

Assignee: ZENPAYROLL, INC.

Inventor: Quentin Louis Raoul Balin
Scalable, flexible and robust template-based data extraction pipeline

Patent number: 11657631

Abstract: A computer-implemented method for extracting information from a document, for example an official document, is disclosed. The method comprises acquiring an input image comprising a document portion; performing image segmentation on the input image to form a binary input image that distinguishes the document portion from the remaining portion of the input image; estimating a first image transform to align the binary input image to a binary template image, using the first image transform on the input image to form an intermediate image; estimating a second image transform to align the intermediate image to a template image; using the second image transform on the intermediate image to form an output image; and extracting a field from the output image using a predetermined field of the template image.

Type: Grant

Filed: April 28, 2021

Date of Patent: May 23, 2023

Assignee: Onfido Ltd.

Inventors: Christos Sagonas, Karolina Dabkowska, Zhiyuan Shi, Edward Fieri Soler, Mohan Mahadevan, Iona Grace Vincent, Luca Peric, Alessandro Lenzi, Alvaro Fernando Lara, James Stonehill
Generation of an electronic document capable of receiving user input

Patent number: 11659104

Abstract: An image of a document is received from an image capture device, the image being in a format of an image file. At least one location of a user input field is automatically detected within the image based on patterns previously detected in a set of other images that were annotated to identify locations of user input fields within the individual images of the set. Coordinates are determined for the at least one location, and an electronic document is generated based on the received image. Generation of the electronic document includes addition of a software user input component at the location within the image with use of the coordinates, the software user input component configured to receive input from a user in electronic form.

Type: Grant

Filed: December 28, 2020

Date of Patent: May 23, 2023

Inventors: Maanusri Balasubramanian, Arjun Ashok Kumar
Automatic sizing and placement of text within a digital image

Patent number: 11657510

Abstract: This disclosure involves the automatic sizing and placement of text within an image background. For example, a computing system obtains reference font size information for a font type to be applied to message text for display on a digital image. The computing system detects, within an image background of the digital image, a target region having proportions that enclose the message text based on the reference font size information. The computing system determines a target font size for the message text. The target font size allows the message text, when rendered in the font type at the target font size, to fit within the target region of the image background. The computing system generates a combined digital image by rendering the message text in the font type at the target font size within the target region of the image background.

Type: Grant

Filed: January 27, 2021

Date of Patent: May 23, 2023

Assignee: Adobe Inc.

Inventor: Pin Zhang
System and method for automatic document management

Patent number: 11636121

Abstract: A system for managing documents, comprising: interfaces to a user interface, proving an application programming interface, a database of document images, a remote server, configured to communicate a text representation of the document from the optical character recognition engine to the report server, and to receive from the remote server a classification of the document; and logic configured to receive commands from the user interface, and to apply the classifications received from the remote server to the document images through the interface to the database. A corresponding method is also provided.

Type: Grant

Filed: May 3, 2021

Date of Patent: April 25, 2023

Assignee: DUB SOFTWARE GROUP INC.

Inventors: Eitan Dub, Adam O. Dub, Alfredo J. Miro
Method and system for document classification and text information extraction

Patent number: 11631233

Abstract: Variation in received documents types and templates used for each document type poses challenge in developing a generic background noise removal approach for automatic text information extraction technique. Embodiments herein provide a method and a system for document classification and text information extraction. Time efficient and accurate text detection engine-based Region of Interest (ROI) technique is provided to accurately identify text region followed by a multi-layered neural network based architecture for enhanced classification accuracy to identify the type of document. A multistage image pre-processing approach is provided for efficient, effective, and accurate background noise removal from the classified document, which includes unsupervised clustering, identification, segmentation, masking, contour approximation, selective subtraction, and dynamic thresholding.

Type: Grant

Filed: March 19, 2021

Date of Patent: April 18, 2023

Assignee: TATA CONSULTANCY SERVICES LIMITED

Inventors: Devang Jagdishchandra Patel, Prosenjit Mondal, Rajdeep Chatterjee, Prabhat Ranjan Mishra, Pushp Kumar Jain, Harinakshi Raina, Amit Kumar Agrawal, Anshika Jain, Ankita Gupta, Ketkee Pandit
Information processing apparatus, information processing method, and storage medium that provide a highlighting feature of highlighting a displayed character recognition area

Patent number: 11620434

Abstract: Results of character recognition processing for a scanned image of a document and a setting item set to a property attached to the scanned image of a document are obtained. Displaying on a screen having a preview area where the scanned image of a document is displayed and an editing area where information input in the setting item is edited, that is, displaying the scanned image of a document in the preview area and displaying the setting item and the information in the editing area are controlled. A selection for the setting item displayed in the editing area is detected. A verification rule set to the detected setting item is obtained. A character recognition area satisfying the verification rule is extracted from the results of the character recognition processing. A character recognition area displayed on the preview area and extracted is highlighted.

Type: Grant

Filed: March 7, 2022

Date of Patent: April 4, 2023

Assignee: CANON KABUSHIKI KAISHA

Inventor: Kenichi Shiraishi
Apparatuses, methods, and systems for 3-channel dynamic contextual script recognition using neural network image analytics and 4-tuple machine learning with enhanced templates and context data

Patent number: 11610084

Abstract: In some embodiments, a method includes training a first machine learning model based on multiple documents and multiple templates associated with the multiple documents. The method further includes executing the first machine learning model to generate multiple relevancy masks, the multiple relevancy masks to remove a visual structure of the multiple templates from a visual structure of the multiple documents. The method further includes generating multiple multichannel field images to include the multiple relevancy masks and at least one of the multiple documents or the multiple templates. The method further includes training a second machine learning model based on the multiple multichannel field images and multiple non-native texts associated with the multiple documents. The method further includes executing the second machine learning model to generate multiple non-native texts from the multiple multichannel field images.

Type: Grant

Filed: June 1, 2020

Date of Patent: March 21, 2023

Assignee: Hyper Labs, Inc.

Inventors: Boris Nikolaev Daskalov, Daniel Biser Balchev
Performing electronic document segmentation using deep neural networks

Patent number: 11600091

Abstract: Techniques for document segmentation. In an example, a document processing application segments an electronic document image into strips. A first strip overlaps a second strip. The application generates a first mask indicating one or more elements and element types in the first strip by applying a predictive model network to image content in the first strip and a prior mask generated from image content of the first strip. The application generates a second mask indicating one or more elements and element types in the second strip by applying the predictive model network to image content in the second strip and the first mask. The application computes, from a combined mask derived from the first mask and the second mask, an output electronic document that identifies elements in the electronic document and the respective element types.

Type: Grant

Filed: May 21, 2021

Date of Patent: March 7, 2023

Assignee: Adobe Inc.

Inventors: Mausoom Sarkar, Arneh Jain
Image processing device, control method, and non-transitory recording medium

Patent number: 11588954

Abstract: An image processing device includes an image reader that reads an image, a first determiner that determines whether character crushing occurs when the image is binarized, a second determiner that determines a rate of a photographic region in the image, and a controller that performs conversion into monochrome N-gradation image data based on determination results of the first and second determiners.

Type: Grant

Filed: November 24, 2021

Date of Patent: February 21, 2023

Assignee: SHARP KABUSHIKI KAISHA

Inventors: Daisaku Imaizumi, Teruhiko Matsuoka, Akihito Yoshida, Chiharu Hirayama
Method and apparatus for determining an icon position

Patent number: 11574415

Abstract: Disclosed are a method and device for determining an icon position. The method includes: detecting a target object in a target image and determining the reference position of the target object in the target image, and detecting a salient position in the target image, thereby obtaining the reference position of a key target or object in the target image, and a salient position possibly requiring more attention in the target image; and selecting, according to the distance between the reference position or salient position and preset candidate positions, an icon position from the candidate positions.

Type: Grant

Filed: November 22, 2021

Date of Patent: February 7, 2023

Assignee: Beijing Dajia Internet Information Technology Co., Ltd.

Inventors: Mading Li, Yunfei Zheng, Jiajie Zhang, Xiaodong Ning, Yuyan Song, Bing Yu
Computer vision systems and methods for information extraction from text images using evidence grounding techniques

Patent number: 11562591

Abstract: Computer vision systems and methods for text classification are provided. The system detects a plurality of text regions in an image and generates a bounding box for each detected text region. The system utilizes a neural network to recognize text present within each bounding box and classifies the recognized text, based on at least one extracted feature of each bounding box and the recognized text present within each bounding box, according to a plurality of predefined tags. The system can associate a key with a value and return a key-value pair for each predefined tag.

Type: Grant

Filed: December 23, 2020

Date of Patent: January 24, 2023

Assignee: Insurance Services Office, Inc.

Inventors: Khoi Nguyen, Maneesh Kumar Singh
System and method for verifying whether text will be properly rendered in a target area of a user interface and/or a graphics file

Patent number: 11551463

Abstract: A system and method are capable of ensuring that one or more text strings will be able to be fully rendered in a target area of a user interface or a target area of a graphics file. The system and method determine the number of pixels of first and second reference text that fit in the target area in the horizontal direction and the vertical direction, respectively, determine the number of pixels of string text in the horizontal direction and the vertical direction, and compare the number of pixels in the horizontal direction of the first reference text and the vertical direction of the second reference text respectively to the number of pixels in the horizontal direction and the vertical direction of the text string that is desired to be rendered in the target area to determine whether the text string will fit in the target area.

Type: Grant

Filed: May 19, 2021

Date of Patent: January 10, 2023

Inventor: Gregory Mark Henninger
Information processing system, information processing method, and information processing apparatus for assisting input of information

Patent number: 11532146

Abstract: An information processing system includes circuitry configured to accept a selection of specification information from a list of the specification information displayed on a display, the specification information being included in form information acquired by performing form recognition; and display, on the display, an input field in which journal information based on the selected specification information is input.

Type: Grant

Filed: October 26, 2020

Date of Patent: December 20, 2022

Assignee: Ricoh Company, Ltd.

Inventors: Ryoh Aruga, Hiroshi Kobayashi, Fumihiro Teshima
Determination of root causes of customer returns

Patent number: 11526665

Abstract: Root cause estimation for a data set corresponding to customer returns of a product may use a probabilistic model to associate customer-entered product return data with probability distributions relating to possible root causes for the returns. A particular application relates to applying a Bayesian network to customer-selected return reason codes and customer-entered return reason comments to estimate a probability distribution for root causes of a plurality of returns and uncertainties relating to the probability distribution estimation. A bag-of-n-grams can be used to enable the Bayesian network to process natural language portions of the customer-entered product return data. The output of the model and other data relating to the root cause estimation can be conveyed to a seller of the returned products via a user interface.

Type: Grant

Filed: December 11, 2019

Date of Patent: December 13, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Karen Hovsepian, Mingwei Shen, Srikar Appalaraju, Andrew Shanley, Vijay Patha

1 2 3 4 5 … next