Segmenting Individual Characters Or Words Patents (Class 382/177)

Separating touching or overlapping characters (Class 382/178)

Segmenting hand-printed characters (Class 382/179)

System and method for data extraction

Patent number: 12354156

Abstract: A system and method of data extraction is disclosed. An image file of a scan of a printed information list is received at a server via a network connection. When not in portable document format, the received image file is processed with an optical character recognition (OCR) engine at the server to identify all text therein and then the processed image file is stored in a memory. When in portable document format, the image file is processed using metadata and positional data at the server to generate sentences; process the sentences to identify prices, descriptions, items, and categories; link each identified item to an associated price, description, and category; and extract and link all modifiers from identified description, and then all identified and extracted information is stored in memory. A user interface is provided to a user via the server for graphically visualizing and editing a stored processed file.

Type: Grant

Filed: October 14, 2022

Date of Patent: July 8, 2025

Assignee: NCR Voyix Corporation

Inventors: Mihir Joshi, Kyle Wade, Brian A. Klotzman, Eric Smith
Automatic training data sample collection

Patent number: 12327371

Abstract: A method in a mobile computing device includes: controlling a camera to capture an image; tracking a pose of the mobile computing device, corresponding to the image, in a coordinate system; detecting an item in the image; determining a location of the detected item in the coordinate system, based on the tracked pose; obtaining an item identifier corresponding to the detected item, based on the location of the detected item in the coordinate system; generating a training data sample including (i) a payload based on the detected item, and (ii) a label including the obtained item identifier; and storing the training data sample.

Type: Grant

Filed: October 29, 2021

Date of Patent: June 10, 2025

Assignee: Zebra Technologies Corporation

Inventor: Patrenahalli M. Narendra
Content extraction based on graph modeling

Patent number: 12277787

Abstract: Methods and systems are presented for extracting categorizable information from an image using a graph that models data within the image. Upon receiving an image, a data extraction system identifies characters in the image. The data extraction system then generates bounding boxes that enclose adjacent characters that are related to each other in the image. The data extraction system also creates connections between the bounding boxes based on locations of the bounding boxes. A graph is generated based on the bounding boxes and the connections such that the graph can accurately represent the data in the image. The graph is provided to a graph neural network that is configured to analyze the graph and produce an output. The data extraction system may categorize the data in the image based on the output.

Type: Grant

Filed: April 13, 2023

Date of Patent: April 15, 2025

Assignee: PAYPAL, INC.

Inventors: Xiaodong Yu, Hewen Wang
Resolving obfuscated sections of an image with selectively captured new imagery

Patent number: 12249045

Abstract: In an embodiment of the present invention, imagery of a geographic area is received from an image capture system. The imagery includes an obscurity impeding visibility. A geographic region within the geographic area is determined corresponding to a location of the obscurity. The image capture system is requested to capture new imagery of the geographic region at a time determined based on information including weather patterns. The new imagery of the geographic region is combined with the imagery of the geographic area to produce resulting imagery resolving the obscurity.

Type: Grant

Filed: December 14, 2021

Date of Patent: March 11, 2025

Assignee: International Business Machines Corporation

Inventors: Lisa Seacat DeLuca, Rachel Kutok, Andrew T. Penrose, Massimiliano Gallo
Simulated handwriting image generator

Patent number: 12229399

Abstract: Techniques are provided for generating a digital image of simulated handwriting using an encoder-decoder neural network trained on images of natural handwriting samples. The simulated handwriting image can be generated based on a style of a handwriting sample and a variable length coded text input. The style represents visually distinctive characteristics of the handwriting sample, such as the shape, size, slope, and spacing of the letters, characters, or other markings in the handwriting sample. The resulting simulated handwriting image can include the text input rendered in the style of the handwriting sample. The distinctive visual appearance of the letters or words in the simulated handwriting image mimics the visual appearance of the letters or words in the handwriting sample image, whether the letters or words in the simulated handwriting image are the same as in the handwriting sample image or different from those in the handwriting sample image.

Type: Grant

Filed: January 23, 2024

Date of Patent: February 18, 2025

Assignee: Adobe Inc.

Inventors: Christopher Alan Tensmeyer, Rajiv Jain, Curtis Michael Wigington, Brian Lynn Price, Brian Lafayette Davis
Electronic device and method for identifying sentence indicated by strokes

Patent number: 12223755

Abstract: A processor of an electronic device is configured to display a plurality of strokes in the display. The processor is configured to display, in response to a first input indicating selection of at least one character distinguished by the strokes, a first visual object for identifying a first sentence including the at least one character. The processor is configured to identify, in response to a second input indicating selection of the first visual object, strokes included in the first sentence among the strokes, based on a spacing between a first word including the at least one character and a second word, and moments of a plurality of words including the first word, and the second word are inputted. The processor is configured to display, in the display based on identification of the strokes included in the first sentence, a second visual object corresponding to the identified strokes among the strokes.

Type: Grant

Filed: April 14, 2023

Date of Patent: February 11, 2025

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Taewon Kwak
Method for displaying live streaming interface

Patent number: 12206948

Abstract: The disclosure provided live streaming interface display method, device, apparatus, storage medium and program product. The method comprises displaying a live streaming interface, wherein the live streaming interface comprises an identification area, the identification area comprises at least one scene identification, and each scene identification is used for indicating a live streaming scene; in response to a first input on a first scene identification of the at least one scene identification, displaying at least one first data source identification corresponding to the first scene identification in the identification area, wherein the first scene identification is used for indicating a first live streaming scene, and each first data source identification is used for indicating one data source in the first live streaming scene.

Type: Grant

Filed: December 21, 2023

Date of Patent: January 21, 2025

Assignee: Beijing Zitiao Network Technology Co., Ltd.

Inventors: Hongfu Li, Sibo Li
Extracting structured information from document images

Patent number: 12205391

Abstract: An example method of extracting structured information from document images comprises: receiving a document image; detecting a tabular structure within the document image; identifying a plurality of rows of the tabular structure, wherein each row of the plurality of rows comprises one or more lines; for each row of the plurality of rows, identifying a set of field types of one or more fields comprised by each line of the one or more lines comprised by the respective row; detecting, in each line of the one or more lines, a set of fields corresponding to a respective set of field types; and extracting information from the set of fields.

Type: Grant

Filed: December 27, 2021

Date of Patent: January 21, 2025

Assignee: ABBYY Development Inc.

Inventors: Mikhail Lanin, Stanislav Semenov
Method for displaying live streaming interface

Patent number: 12160639

Abstract: The disclosure provided live streaming interface display method, device, apparatus, storage medium and program product. The method comprises displaying a live streaming interface, wherein the live streaming interface comprises an identification area, the identification area comprises at least one scene identification, and each scene identification is used for indicating a live streaming scene; in response to a first input on a first scene identification of the at least one scene identification, displaying at least one first data source identification corresponding to the first scene identification in the identification area, wherein the first scene identification is used for indicating a first live streaming scene, and each first data source identification is used for indicating one data source in the first live streaming scene.

Type: Grant

Filed: December 21, 2023

Date of Patent: December 3, 2024

Assignee: Beijing Zitiao Network Technology Co., Ltd.

Inventors: Hongfu Li, Sibo Li
Image processing device and image forming apparatus capable of detecting and correcting mis-converted character in text extracted from document image

Patent number: 12022043

Abstract: An image processing device includes a storage device that previously stores a document image, a plurality of registered words, and a plurality of font characters, and a control device that functions as: a character region identifier that identifies a character region in the document image; an image acquirer that acquires an image of the character region; a text extractor that extracts a text from the image of the character region; a word identifier that identifies each of words in the text; a word determiner that determines whether each of the words is matched with one of the registered words; and a generator that generates a corrected text by replacing a target character of a non-matching word in the text with, among the font characters, a font character having a first degree of matching not lower than a first rate with the target character and a highest first degree of matching.

Type: Grant

Filed: October 25, 2021

Date of Patent: June 25, 2024

Assignee: KYOCERA Document Solutions Inc.

Inventor: Jezza Vinalon
Training a neural network model for recognizing handwritten signatures based on different cursive fonts and transformations

Patent number: 11995545

Abstract: A device receives information indicating first names and last names of individuals and applies different cursive fonts to each of the first names and the last names to generate images of different cursive first names and different cursive last names. The device applies different transformations to the images of the different cursive first names and the different cursive last names to generate a set of first name images and a set of last name images. The device combines each first name image with each last name image to form a set of signature images and trains a neural network model, with the set of signature images, to generate a trained neural network model. The device receives an image of a signature and processes the image of the signature, with the trained neural network model, to recognize a first name and a last name in the signature.

Type: Grant

Filed: November 19, 2021

Date of Patent: May 28, 2024

Assignee: Capital One Services, LLC

Inventors: Reza Farivar, Fardin Abdi Taghi Abad, Anh Truong, Mark Watson, Austin Walters, Jeremy Goodsitt, Vincent Pham
Method, apparatus, device, storage medium and program product of performing text matching

Patent number: 11989962

Abstract: A method, an apparatus, a device, a storage medium and a program product of performing a text matching are provided, which relate to a field of a computer technology, and in particular to natural language processing and deep learning technologies. The method includes: determining a word set and a plurality of semantic units from a text set, the word set is associated with a first predetermined attribute, and the text set contains a plurality of first texts indicating an object information and a plurality of second texts indicating an object demand information; generating a graph; and generating a final feature representation associated with the text set and the word set based on the graph and a graph convolution model, so as to perform the text matching.

Type: Grant

Filed: December 22, 2021

Date of Patent: May 21, 2024

Assignee: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventors: Chao Ma, Jingshuai Zhang, Qifan Huang, Kaichun Yao, Peng Wang, Hengshu Zhu
Information processing device and information processing method

Patent number: 11972208

Abstract: The present disclosure determines whether or not a character string of a result obtained by a character recognition process matches a word of a word dictionary; and when a pattern that is similar to a predefined arrangement pattern of a character type is present in the character string of the result obtained by the character recognition process that is determined not to match a word of the word dictionary, changes the character recognition process for the character string based on the pattern.

Type: Grant

Filed: July 14, 2020

Date of Patent: April 30, 2024

Assignee: CANON KABUSHIKI KAISHA

Inventor: Satoshi Kawara
Multimodal transmission of packetized data

Patent number: 11930050

Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.

Type: Grant

Filed: July 1, 2022

Date of Patent: March 12, 2024

Assignee: GOOGLE LLC

Inventors: Justin Lewis, Richard Rapp, Gaurav Bhaya, Robert Stets
Method and device for processing information, electronic device, and storage medium

Patent number: 11908219

Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.

Type: Grant

Filed: April 29, 2021

Date of Patent: February 20, 2024

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Zihan Ni, Yipeng Sun, Kun Yao, Junyu Han, Errui Ding, Jingtuo Liu, Haifeng Wang
Simulated handwriting image generator

Patent number: 11899927

Abstract: Techniques are provided for generating a digital image of simulated handwriting using an encoder-decoder neural network trained on images of natural handwriting samples. The simulated handwriting image can be generated based on a style of a handwriting sample and a variable length coded text input. The style represents visually distinctive characteristics of the handwriting sample, such as the shape, size, slope, and spacing of the letters, characters, or other markings in the handwriting sample. The resulting simulated handwriting image can include the text input rendered in the style of the handwriting sample. The distinctive visual appearance of the letters or words in the simulated handwriting image mimics the visual appearance of the letters or words in the handwriting sample image, whether the letters or words in the simulated handwriting image are the same as in the handwriting sample image or different from those in the handwriting sample image.

Type: Grant

Filed: January 24, 2022

Date of Patent: February 13, 2024

Assignee: Adobe Inc.

Inventors: Christopher Alan Tensmeyer, Rajiv Jain, Curtis Michael Wigington, Brian Lynn Price, Brian Lafayette Davis
Text recognition method and apparatus

Patent number: 11893767

Abstract: A text recognition method and apparatus disclosed. The text recognition method includes: obtaining a to-be-detected image; determining a target text detection area in the to-be-detected image, where the target text detection area includes target text in the to-be-detected image, and the target text detection area is a polygonal area including m vertex pairs, m being a positive integer greater than 2; correcting the polygonal area to m?1 rectangular areas to obtain a corrected target text detection area; and performing text recognition on the corrected target text detection area to determine the target text, and outputting the target text.

Type: Grant

Filed: June 10, 2022

Date of Patent: February 6, 2024

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Jieming Li, Jianchao Huang, Xing Zhou, Yongfei Pu, Yuanlin Chen, Lifei Zhu
Method for error detection by using top-down method

Patent number: 11841737

Abstract: Disclosed is a method for error detection, performed by one or more processors of a computing device according to an example embodiment of the present disclosure. The method includes evaluating an error rate for a sentence to be evaluated, in a first language unit. the method includes evaluating an error rate in a second language unit which is smaller than the first language unit, based on the first language unit error.

Type: Grant

Filed: October 20, 2022

Date of Patent: December 12, 2023

Assignee: ACTIONPOWER CORP.

Inventors: Hwanbok Mun, Sangyoun Paik, Subong Choi, Dongchan Shin, Jihwa Lee
Multi-device audio adjustment coordination

Patent number: 11838734

Abstract: This relates to intelligent automated assistants and, more specifically, to the intelligent coordination of audio signal output adjustments among multiple electronic devices.

Type: Grant

Filed: October 25, 2022

Date of Patent: December 5, 2023

Assignee: Apple Inc.

Inventors: Yifeng Gui, Benjamin S. Phipps
Generating assistive indications based on detected characters

Patent number: 11769323

Abstract: Methods, systems, devices, and tangible non-transitory computer readable media for generating assistive indications are provided. The disclosed technology can include accessing image data that includes at least one image. Character data can be generated based at least in part on the image data and one or more optical character recognition operations. Further, the character data can include one or more characters associated with the at least one image. One or more characters that are associated with one or more recognized words and the one or more characters that are associated with one or more unrecognized words can be determined based on the character data. One or more auditory indications including a synthetic voice reciting the one or more recognized words and the one or more unrecognized words can be generated. Furthermore, the synthetic voice can recite each of the one or more unrecognized words one character at a time.

Type: Grant

Filed: February 2, 2021

Date of Patent: September 26, 2023

Assignee: GOOGLE LLC

Inventors: Sneha Ashok, Huize Shi, Andreina Reyna
Text line normalization systems and methods

Patent number: 11704476

Abstract: A method for estimating text heights of text line images includes estimating a text height with a sequence recognizer. The method further includes normalizing a vertical dimension and/or position of text within a text line image based on the text height. The method may also further include calculating a feature of the text line image. In some examples, the sequence recognizer estimates the text height with a machine learning model.

Type: Grant

Filed: July 12, 2021

Date of Patent: July 18, 2023

Assignee: LEVERTON HOLDING LLC

Inventors: Florian Kuhlmann, Michael Kieweg, Saurabh Shekhar Verma
Character recognizing apparatus and non-transitory computer readable medium

Patent number: 11568659

Abstract: A character recognizing apparatus includes an acquiring unit, an identifying unit, and a character recognizing unit. The acquiring unit acquires a string image that is an image of a string generated in accordance with one of multiple string generation schemes. The identifying unit identifies a range specified for a result of character recognition in each of the multiple string generation schemes. The character recognizing unit performs first character recognition on the string image, and if a result of the first character recognition has a feature of a particular string generation scheme of the multiple string generation schemes, the character recognizing unit performs second character recognition on the string image within the range specified for a result of character recognition in the particular string generation scheme.

Type: Grant

Filed: August 30, 2019

Date of Patent: January 31, 2023

Assignee: FUJIFILM Business Innovation Corp.

Inventor: Yusuke Suzuki
Adversarial network for transforming handwritten text

Patent number: 11551034

Abstract: Described herein are systems, methods, and other techniques for training a generative adversarial network (GAN) to perform an image-to-image transformation for recognizing text. A pair of training images are provided to the GAN. The pair of training images include a training image containing a set of characters in handwritten form and a reference training image containing the set of characters in machine-recognizable form. The GAN includes a generator and a discriminator. The generated image is generated using the generator based on the training image. Update data is generated using the discriminator based on the generated image and the reference training image. The GAN is trained by modifying one or both of the generator and the discriminator using the update data.

Type: Grant

Filed: October 8, 2020

Date of Patent: January 10, 2023

Assignee: Ancestry.com Operations Inc.

Inventors: Mostafa Karimi, Gopalkrishna Veni, Yen-Yun Yu
Methods and apparatus to determine the dimensions of a region of interest of a target object from an image using target object landmarks

Patent number: 11538235

Abstract: Methods and apparatus to determine the dimensions of a region of interest of a target object and a class of the target object from an image using target object landmarks are disclosed herein. An example method includes identifying a landmark of a target object in an image based on a match between the landmark and a template landmark; classifying a target object based on the identified landmark; projecting dimensions of the template landmark based on a location of the landmark in the image; and determining a region of interest based on the projected dimensions, the region of interest corresponding to text printed on the target object.

Type: Grant

Filed: December 7, 2020

Date of Patent: December 27, 2022

Assignee: The Nielsen Company (US), LLC

Inventor: Kevin Deng
Distinguishing humans from computers

Patent number: 11461782

Abstract: Systems and methods are provided that distinguish humans from computers. In one implementation, a computer-implemented method selects, from a storage device, a plurality of images. The method further generates a document comprising the plurality of images for the security challenge. At least one image included in the plurality of images is oriented for display in a different direction than the other images. The method further receives a selection of one or more images included in the plurality of images and determines whether the selected one or more images is oriented for display in a different direction than the other images.

Type: Grant

Filed: June 11, 2009

Date of Patent: October 4, 2022

Assignee: Amazon Technologies, Inc.

Inventor: William Randolph Zettler, Jr.
NLP-based context-aware log mining for troubleshooting

Patent number: 11409754

Abstract: A method for context-aware data mining of a text document includes receiving a list of words parsed and preprocessed from an input query; computing a related distributed embedding representation for each word in the list of words using a word embedding model of the text document being queried; aggregating the related distributed embedding representations of all words in the list of words to represent the input query with a single embedding, by using one of an average of all the related distributed embedding representations or a maximum of all the related distributed embedding representations; retrieving a ranked list of document segments of N lines that are similar to the aggregated word embedding representation of the query, where N is a positive integer provided by the user; and returning the list of retrieved segments to a user.

Type: Grant

Filed: June 11, 2019

Date of Patent: August 9, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Giacomo Domeniconi, Eun Kyung Lee, Alessandro Morari
Information processing apparatus, information processing method, and information processing program for displaying consecutive characters in alignment

Patent number: 11393079

Abstract: There is provided an image processing apparatus including an input device configured to receive a stroke input, and a display controller configured to control a displaying of a modified stroke, wherein the modified stroke is synthesized based on characteristic parameters of the received stroke input and characteristic parameters of a reference stroke that has been matched to the received stroke input.

Type: Grant

Filed: January 19, 2018

Date of Patent: July 19, 2022

Assignee: SONY CORPORATION

Inventors: Yoshihito Ohki, Yasuyuki Koga, Tsubasa Tsukahara, Ikuo Yamano, Hiroyuki Mizunuma, Miwa Ichikawa
Method and terminal for performing word segmentation on text information, and storage medium

Patent number: 11373038

Abstract: The present disclosure relates to a method and a terminal for performing word segmentation on text information, and a storage medium. The method includes: acquiring the text information and configuration information, in which the configuration information includes at least two first word segmentation rules; converting the first word segmentation rules into second word segmentation rules according to a predetermined rule; in response to determining that an intersection exists between character strings of the text information matched by two of the second word segmentation rules, determining that two first word segmentation rules corresponding to the two of the second word segmentation rules associated with the intersection conflict; and processing the text information according to the configuration information, and outputting a result of the word segmentation on the text information.

Type: Grant

Filed: May 12, 2020

Date of Patent: June 28, 2022

Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventors: Shuo Wang, Liang Shi, Yupeng Chen, Qun Guo
Clustering topics for data visualization

Patent number: 11354345

Abstract: Systems and methods for receiving a set analyzing case records by extracting case text, performing natural language processing, and allocating each case text to a topic. Topics may be clustered to identify meaningful patterns that are reflected in numerous case records. The data resulting from the analysis may be visualized on a dashboard to allow users to identify and explore these patterns.

Type: Grant

Filed: June 22, 2020

Date of Patent: June 7, 2022

Assignee: JPMORGAN CHASE BANK, N.A.

Inventors: Philip Jacob, Brandon Chihkai Yang, Maria Beltran, Sohajpal Shergill, Chienchung Chen
Method, system and non-transitory computer-readable recording medium for searching for document comprising formula

Patent number: 11334578

Abstract: According to one aspect of the invention, there is provided a method for searching for documents containing mathematical expressions, the method comprising the steps of: dividing a first document containing mathematical expressions into a plurality of components; comparing the plurality of components with a plurality of other components extracted from a plurality of other documents, with reference to weights respectively assigned to the plurality of components according to types of the components; and determining a document associated with the first document among the plurality of other documents, with reference to a result of the comparison, wherein the weights are adaptively adjusted according to a result of the determination of the document associated with the first document.

Type: Grant

Filed: April 5, 2018

Date of Patent: May 17, 2022

Assignee: CLASSCUBE CO., LTD.

Inventor: Seong Chan Ahn
Contextualized character recognition system

Patent number: 11301627

Abstract: System, method, and various embodiments for providing contextualized character recognition system are described herein. An embodiment operates by determining a plurality of predicted words of an image. An accuracy measure or each of the plurality of predicted words is identified and a replaceable word with an accuracy measure below a threshold is identified. A plurality of candidate words associated with the replaceable word are identified and a probability for each of the candidate words is calculated based on a contextual analysis. One of the candidate words with a highest probability is selected. The plurality of predicted words including the selected candidate word with the highest probability replacing the replaceable word is output.

Type: Grant

Filed: January 6, 2020

Date of Patent: April 12, 2022

Assignee: SAP SE

Inventors: Rohit Kumar Gupta, Johannes Hoehne, Anoop Raveendra Katti
Picture obtaining method and apparatus and picture processing method and apparatus

Patent number: 11302286

Abstract: A picture obtaining method and apparatus and a picture processing method and apparatus are provided. The method includes: obtaining a grayscale image corresponding to a first picture and a first image, where a size of the first picture is equal to a size of the first image, the first image includes N parallel lines, a spacing between two adjacent lines does not exceed a spacing threshold, and N is an integer greater than 1; translating a pixel included in each line in the first image based on the grayscale image, to obtain a second image, where the second image includes a contour of an image in the first picture; and set a pixel value of each pixel included in each line in the second image, to obtain a second picture.

Type: Grant

Filed: September 25, 2020

Date of Patent: April 12, 2022

Assignee: Huawei Technologies Co., Ltd.

Inventors: Simon Ekstrand, Sha Qian, Johan Larsby, Haitao Dai, Fredrik Andreasson, Jonas Hans Andreas Fredriksson, Tim Jeppsson, Guolang Li, Rubin Cai, Xueyan Huang
Text location method and apparatus

Patent number: 11270146

Abstract: Aspects of the present invention provide a new text location technique, which can be applied to general handwriting detection at a variety of levels, including characters, words, and sentences. The inventive technique is efficient in training deep learning systems to locate text. The technique works for different languages, for text in different orientations, and for overlapping text. In one aspect, the technique's ability to separate overlapping text also makes the technique useful in application to overlapping objects. Embodiments take advantage of a so-called skyline appearance that text tends to have. Recognizing a skyline appearance for text can facilitate the proper identification of bounding boxes for the text. Even in the case of overlapping text, discernment of a skyline appearance for words can help with the proper identification of bounding boxes for each of the overlapping text words/phrases, thereby facilitating the separation of the text for purposes of recognition.

Type: Grant

Filed: March 31, 2020

Date of Patent: March 8, 2022

Assignee: KONICA MINOLTA BUSINESS SOLUTIONS U.S.A., INC.

Inventor: Junchao Wei
Simulated handwriting image generator

Patent number: 11250252

Abstract: Techniques are provided for generating a digital image of simulated handwriting using an encoder-decoder neural network trained on images of natural handwriting samples. The simulated handwriting image can be generated based on a style of a handwriting sample and a variable length coded text input. The style represents visually distinctive characteristics of the handwriting sample, such as the shape, size, slope, and spacing of the letters, characters, or other markings in the handwriting sample. The resulting simulated handwriting image can include the text input rendered in the style of the handwriting sample. The distinctive visual appearance of the letters or words in the simulated handwriting image mimics the visual appearance of the letters or words in the handwriting sample image, whether the letters or words in the simulated handwriting image are the same as in the handwriting sample image or different from those in the handwriting sample image.

Type: Grant

Filed: December 3, 2019

Date of Patent: February 15, 2022

Assignee: ADOBE INC.

Inventors: Christopher Alan Tensmeyer, Rajiv Jain, Curtis Michael Wigington, Brian Lynn Price, Brian Lafayette Davis
End-to-end text recognition method and apparatus, computer device and readable medium

Patent number: 11210546

Abstract: The present disclosure proposes an end-to-end text recognition method and apparatus, computer device and readable medium. The method comprises: obtaining a to-be-recognized picture containing a text region; recognizing a position of the text region in the to-be-recognized picture and text content included in the text region with a pre-trained end-to-end text recognition model; the end-to-end text recognition model comprising a region of interest perspective transformation processing module for performing perspective transformation processing for the text region. The technical solution of the present disclosure does not need to serially arrange a plurality of steps, and may avoid introducing the accumulated errors and may effectively improve the accuracy of the text recognition.

Type: Grant

Filed: March 18, 2020

Date of Patent: December 28, 2021

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Yipeng Sun, Chengquan Zhang, Zuming Huang, Jiaming Liu, Junyu Han, Errui Ding
Method and system for generating parsed document from digital document

Patent number: 11200412

Abstract: A method and system for generating a parsed document from a digital document. The method includes segmenting the digital document into at least one section; classifying the at least one section of the digital document into at least one of a class: text class, table class, figure class, noise class; identifying a reading order of the digital document; and processing each of the at least one section of the digital document. Furthermore, processing each of the at least one section of the digital document comprises extracting content from each of the at least one section based on the class; and structuring the extracted content based on the reading order to generate the parsed document.

Type: Grant

Filed: December 27, 2017

Date of Patent: December 14, 2021

Assignee: Innoplexus AG

Inventors: Gaurav Tripathi, Rohit Kewalramani, Jijeesh KR, Vatsal Agarwal
System and method for creating an image and/or automatically interpreting images

Patent number: 11176675

Abstract: A method of identifying contiguities in images is disclosed. The contiguities are indicative features and various qualities of an image, which may be used for identifying objects and/or relationships in images. Alternatively, the contiguities may be helpful in ensuring that an image has a desired switching factor, so as to create a desired effect when combined with other images in a composite image. The contiguity may be a group of picture elements that are adjacent to one another that form a continuous image element that extends generally horizontally (e.g., diagonally, horizontally) across the image.

Type: Grant

Filed: January 30, 2019

Date of Patent: November 16, 2021

Assignee: CONFLU3NCE LTD

Inventor: Tami Robyn Ellison
Computing system for extraction of textual elements from a document

Patent number: 11176364

Abstract: Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text within a two-dimensional area represented by the document. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.

Type: Grant

Filed: March 19, 2019

Date of Patent: November 16, 2021

Assignee: HYLAND SOFTWARE, INC.

Inventors: Ralph Meier, Thorsten Wanschura, Johannes Hausmann, Harry Urbschat
Information processing apparatus and non-transitory computer readable medium

Patent number: 11163992

Abstract: An information processing apparatus includes a first designation unit, a second designation unit, a position acquisition unit, a memory, and an extraction unit. The first designation unit designates an extensive area from a first read image, the extensive area including an output area and an object area. The second designation unit designates the output area from the designated extensive area. The position acquisition unit acquires positional information regarding the extensive area with respect to the first read image and positional information regarding the output area with respect to the extensive area. The memory stores the positional information regarding the extensive area and the positional information regarding the output area. The extraction unit identifies a position of the extensive area in a second read image in a format identical to a format of the first read image on a basis of the positional information regarding the extensive area stored by the memory.

Type: Grant

Filed: September 6, 2018

Date of Patent: November 2, 2021

Assignee: FUJIFILM Business Innovation Corp.

Inventors: Kunihiko Kobayashi, Shintaro Adachi, Shigeru Okada, Akinobu Yamaguchi, Junichi Shimizu, Kazuhiro Oya, Shinya Nakamura, Akane Abe
Apparatus and method for matching line item textual entries with individual image objects from a file with multiple image objects

Patent number: 11126838

Abstract: A computer implemented method includes receiving a document with line item textual entries and an attachment containing images of different objects characterizing different transactions. The images of the different objects are split into individual image objects. Attributes from the individual image objects are extracted. The line item textual entries are matched with the individual image objects to form matched image objects. The matched image objects include ambiguous matches with multiple individual image objects assigned to a single line item textual entry or a single individual image object assigned to multiple line item textual entries. An assignment model is applied to resolve the ambiguous matches. The assignment model defines priority constraints, assigns pairs of line item textual entries and individual image objects that meet highest priority constraints, removes highest priority constraints when ambiguous matches remain, and repeats these operations until no ambiguous matches remain.

Type: Grant

Filed: February 7, 2020

Date of Patent: September 21, 2021

Assignee: APPZEN, INC.

Inventors: Edris Naderan, Thomas James White, Deepti Chafekar, Debashish Panigrahi, Kunal Verma, Snigdha Purohit
Apparatus and methods for extracting data from lineless tables using Delaunay triangulation and excess edge removal

Patent number: 11113518

Abstract: A method for extracting data from lineless tables includes storing an image including a table in a memory. A processor operably coupled to the memory identifies a plurality of text-based characters in the image, and defines multiple bounding boxes based on the characters. Each of the bounding boxes is uniquely associated with at least one of the text-based characters. A graph including multiple nodes and multiple edges is generated based on the bounding boxes, using a graph construction algorithm. At least one of the edges is identified for removal from the graph, and removed from the graph to produce a reduced graph. The reduced graph can be sent to a neural network to predict row labels and column labels for the table.

Type: Grant

Filed: June 28, 2019

Date of Patent: September 7, 2021

Inventors: Freddy Chongtat Chua, Tigran Ishkhanov, Nigel Paul Duffy
Information processing device, information processing method, and computer program product

Patent number: 11113569

Abstract: An information processing device according to an embodiment includes a determination unit and a first training unit. The determination unit determines whether an unlabeled data point whose class label is unknown is a non-targeted data point that is not targeted for pattern recognition. The first training unit trains a first classifier for use in the pattern recognition through semi-supervised learning using a first training dataset including unlabeled data determined not to be the non-targeted data and not including unlabeled data determined to be the non-targeted data.

Type: Grant

Filed: August 7, 2019

Date of Patent: September 7, 2021

Assignees: Kabushiki Kaisha Toshiba, Toshiba Digital Solutions Corporation

Inventor: Ryohei Tanaka
Method and device for displaying explanation of reference numeral in patent drawing image using artificial intelligence technology based machine learning

Patent number: 11080910

Abstract: The present invention relates to a device and a method for placing an original or translated explanation of a reference numeral around the reference numeral in a patent drawing, by recognizing a reference numeral included in a patent drawing, searching for a space to place an explanation corresponding to the recognized reference numeral, generating a placement information set including position information for displaying the explanation of the reference numeral in the searched empty space, and providing the placement information set to a corresponding patent drawing image. Utilization of the present invention makes it possible to recognize clearly and quickly what is represented by a reference numeral included in a patent drawing, thereby increasing the readability of a drawing, and facilitating understanding of the technical idea of a patent through patent drawings.

Type: Grant

Filed: March 22, 2018

Date of Patent: August 3, 2021

Assignee: KWANGGETOCO., LTD.

Inventors: Min Soo Kang, Jae Sung Hwang, Seok Hyoun Noe
Text line normalization systems and methods

Patent number: 11062164

Abstract: A method for estimating text heights of text line images includes estimating a text height with a sequence recognizer. The method further includes normalizing a vertical dimension and/or position of text within a text line image based on the text height. The method may also further include calculating a feature of the text line image. In some examples, the sequence recognizer estimates the text height with a machine learning model.

Type: Grant

Filed: July 16, 2019

Date of Patent: July 13, 2021

Assignee: LEVERTON HOLDING LLC

Inventors: Florian Kuhlmann, Michael Kieweg, Saurabh Shekhar Verma
Alignment of user input on a screen

Patent number: 11017258

Abstract: A system for automated user input alignment receives the user input at a touchscreen display. A skew of the user input is identified as the user input is being received at a touchscreen display. A skew correction is determined based on the identified skew. The skew correction is applied to the user input to align the user input on the touchscreen display. The skew correction applied in an automated alignment process that. The user input is displayed with the applied skew correction on the touchscreen display with improved efficiency and without user manipulation to perform the alignment.

Type: Grant

Filed: June 5, 2018

Date of Patent: May 25, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Arie Y. Gur, Amir Zyskind
Populating data fields in electronic documents

Patent number: 11010423

Abstract: Examples of systems and methods for automatic population of electronic documents are described. In an example, a digital base document having the information to be populated in a data field of the electronic document may be obtained. From the digital base document a data item to provide the information may be extracted. Further, for the digital base document, a similarity score may be computed with respect to each document type defined in predefined mapping data, the predefined mapping data including, for each document type, a weight associated with data items occurring in the document type, the weight being associated based on the importance of the data item to the document. Based on the similarity score, a document type of the digital base document may be identified. Further, based on a position of the data item in the digital base document and the identified document type, the data field may be populated.

Type: Grant

Filed: August 20, 2018

Date of Patent: May 18, 2021

Assignee: ACCENTURE GLOBAL SOLUTIONS LIMITED

Inventors: Subhasish Roy, Inderpreet Singh, Anobika Roy, Yabesh Jebaraj
Electronic device

Patent number: 10970581

Abstract: An image forming apparatus includes an image reading unit, an extraction section, a character recognition section, a search section, an attachment section, and file storage. The image reading unit generates first image information. The extraction section extracts a specific area from the image base on the first image information. The character recognition section generates text information corresponding to information of a character string image included in the specific area. The search section searches for a webpage containing information relating to a meaning of a text indicated by the text information. The attachment section attaches link information of the webpage to the information of the character string image to generate second image information. The file storage section stores the second image information as a file therein. The specific area is an area with a specific mark applied thereto.

Type: Grant

Filed: May 29, 2019

Date of Patent: April 6, 2021

Assignee: KYOCERA Document Solutions Inc.

Inventor: Daigo Tashiro
Method and apparatus for training a character detector based on weak supervision, system and medium

Patent number: 10963693

Abstract: A method and apparatus for training a character detector based on weak supervision, a character detection system and a computer readable storage medium are provided, wherein the method includes: inputting coarse-grained annotation information of a to-be-processed object, wherein the coarse-grained annotation information including a whole bounding outline of a word, text bar or line of the to-be-processed object; dividing the whole bounding outline of the coarse-grained annotation information, to obtain a coarse bounding box of a character of the to-be-processed object; obtaining a predicted bounding box of the character of the to-be-processed object through a neural network model from the coarse-grained annotation information; and determining a fine bounding box of the character of the to-be-processed object as character-based annotation of the to-be-processed object, according to the coarse bounding box and the predicted bounding box.

Type: Grant

Filed: April 21, 2020

Date of Patent: March 30, 2021

Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.

Inventors: Chengquan Zhang, Jiaming Liu, Junyu Han, Errui Ding
Information processing device and non-transitory computer readable medium storing information processing program

Patent number: 10902301

Abstract: An information processing device includes a display controller that displays a term expression expressing a term which appears in target data, on a display in a display mode based on a level of liveliness of the target data when the term appears.

Type: Grant

Filed: August 10, 2018

Date of Patent: January 26, 2021

Assignee: FUJI XEROX CO., LTD.

Inventor: Aoi Takahashi
Word segmentation system, method and device

Patent number: 10817741

Abstract: In an optical character recognition system, a word segmentation method, comprising: acquiring a sample image comprising a word spacing marker or a non-word spacing marker; processing the sample image with a convolutional neural network to obtain a first eigenvector corresponding to the sample image, a word spacing probability value and/or a non-word spacing probability value corresponding to the first eigenvector; acquiring a to-be-tested image, and processing the to-be-tested image with the convolutional neural network to obtain a second eigenvector corresponding to the to-be-tested image, a word spacing probability value or a non-word spacing probability value corresponding to the second eigenvector; and performing word segmentation on the to-be-tested image by using the just obtained word spacing probability value or the non-word spacing probability value.

Type: Grant

Filed: February 16, 2017

Date of Patent: October 27, 2020

Assignee: Alibaba Group Holding Limited

Inventors: Wenmeng Zhou, Mengli Cheng, Xudong Mao, Xing Chu

1 2 3 4 5 … next