Patents by Inventor Homa Foroughi

Homa Foroughi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Preprocessor training for optical character recognition

Patent number: 11830263

Abstract: A method includes executing a Optical Character Recognition (OCR) preprocessor on training images to obtain OCR preprocessor output, executing an OCR engine on the OCR preprocessor output to obtain OCR engine output, and executing an approximator on the OCR preprocessor output to obtain approximator output. The method further includes iteratively adjusting the approximator to simulate the OCR engine using the OCR engine output and the approximator output, and generating OCR preprocessor losses using the approximator output and target labels. The method further includes iteratively adjusting the OCR preprocessor using the OCR preprocessor losses to obtain a customized OCR preprocessor.

Type: Grant

Filed: June 16, 2021

Date of Patent: November 28, 2023

Assignee: Intuit Inc.

Inventors: Xiao Xiao, Sricharan Kallur Palli Kumar, Ayantha Randika Ponnamperuma Arachchige, Nilanjan Ray, Homa Foroughi, Allegra Latimer
Time aware self-attentive models for personalized conversational messages

Patent number: 11831588

Abstract: Systems and methods for personalizing messages in a conversational chatbot are disclosed. An example method may include receiving clickstream event data corresponding to click events by users of an application, generating featurized clickstream data based at least in part on the received clickstream event data, determining one or more predicted intentions for a first user based at least in part on the featurized clickstream data, and generating one or more personalized messages for the first user based at least in part on the one or more predicted user intentions.

Type: Grant

Filed: December 3, 2020

Date of Patent: November 28, 2023

Assignee: Intuit Inc.

Inventors: Homa Foroughi, Chang Liu, Pankaj Gupta
Region proposal networks for automated bounding box detection and text segmentation

Patent number: 11816883

Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.

Type: Grant

Filed: May 25, 2022

Date of Patent: November 14, 2023

Assignee: INTUIT INC.

Inventors: Terrence J. Torres, Homa Foroughi
Learning form-based information classification

Patent number: 11636287

Abstract: A method may include extracting training tokens from a training corpus, generating training token features and training contextual features for each of the training tokens, training a probabilistic graphical model using classes, the training token features, and the training contextual features, extracting, from actual data, actual tokens including an actual token, generating actual token features and actual contextual features for each of the actual tokens, and classifying the actual token as a class using the probabilistic graphical model, the actual token features, and the actual contextual features.

Type: Grant

Filed: March 28, 2018

Date of Patent: April 25, 2023

Assignee: Intuit Inc.

Inventors: Homa Foroughi, Joy Rimchala
REGION PROPOSAL NETWORKS FOR AUTOMATED BOUNDING BOX DETECTION AND TEXT SEGMENTATION

Publication number: 20220414335

Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.

Type: Application

Filed: May 25, 2022

Publication date: December 29, 2022

Applicant: INTUIT INC.

Inventors: Terrence J. Torres, Homa Foroughi
PREPROCESSOR TRAINING FOR OPTICAL CHARACTER RECOGNITION

Publication number: 20220405520

Abstract: A method includes executing a Optical Character Recognition (OCR) preprocessor on training images to obtain OCR preprocessor output, executing an OCR engine on the OCR preprocessor output to obtain OCR engine output, and executing an approximator on the OCR preprocessor output to obtain approximator output. The method further includes iteratively adjusting the approximator to simulate the OCR engine using the OCR engine output and the approximator output, and generating OCR preprocessor losses using the approximator output and target labels. The method further includes iteratively adjusting the OCR preprocessor using the OCR preprocessor losses to obtain a customized OCR preprocessor.

Type: Application

Filed: June 16, 2021

Publication date: December 22, 2022

Applicant: Intuit Inc.

Inventors: Xiao Xiao, Sricharan Kallur Palli Kumar, Ayantha Randika Ponnamperuma Arachchige, Nilanjan Ray, Homa Foroughi, Allegra Latimer
Region proposal networks for automated bounding box detection and text segmentation

Patent number: 11366968

Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.

Type: Grant

Filed: July 29, 2019

Date of Patent: June 21, 2022

Assignee: Intuit Inc.

Inventors: Terrence J. Torres, Homa Foroughi
TIME AWARE SELF-ATTENTIVE MODELS FOR PERSONALIZED CONVERSATIONAL MESSAGES

Publication number: 20220182342

Abstract: Systems and methods for personalizing messages in a conversational chatbot are disclosed. An example method may include receiving clickstream event data corresponding to click events by users of an application, generating featurized clickstream data based at least in part on the received clickstream event data, determining one or more predicted intentions for a first user based at least in part on the featurized clickstream data, and generating one or more personalized messages for the first user based at least in part on the one or more predicted user intentions.

Type: Application

Filed: December 3, 2020

Publication date: June 9, 2022

Applicant: Intuit Inc.

Inventors: Homa Foroughi, Chang Liu, Pankaj Gupta
Region proposal networks for automated bounding box detection and text segmentation

Patent number: 11138423

Abstract: Arbitrary image data may be transformed into data suitable for optical character recognition (OCR) processing. A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate at least one text proposal using a region proposal network (RPN). The at least one text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may merge the text proposals with one another to form a patch of the image that is predicted to contain text. The processor may determine outer coordinates of the patch. The outer coordinates may comprise at least leftmost, rightmost, topmost, and bottommost coordinates. The processor may generate a quadrilateral of the image that is a smallest quadrilateral including the leftmost, rightmost, topmost, and bottommost coordinates.

Type: Grant

Filed: July 29, 2019

Date of Patent: October 5, 2021

Assignee: Intuit Inc.

Inventors: Terrence J. Torres, Homa Foroughi
SYSTEM AND METHOD FOR PROVIDING AUTOMATED AND UNSUPERVISED INLINE QUESTION ANSWERING

Publication number: 20210240775

Abstract: Systems and methods configured to provide automated and unsupervised inline question-answering in an online community.

Type: Application

Filed: February 3, 2020

Publication date: August 5, 2021

Applicant: Intuit Inc.

Inventors: Chang LIU, Pankaj GUPTA, Homa FOROUGHI
System and method for information extraction with character level features

Patent number: 11055527

Abstract: A system and method for information extraction character level features. The system and method may be used for data extraction for various types of content including a receipt or a tax form.

Type: Grant

Filed: February 1, 2019

Date of Patent: July 6, 2021

Assignee: INTUIT INC.

Inventors: Terrence J. Torres, Homa Foroughi
REGION PROPOSAL NETWORKS FOR AUTOMATED BOUNDING BOX DETECTION AND TEXT SEGMENTATION

Publication number: 20210034856

Abstract: Arbitrary image data may be transformed into data suitable for optical character recognition (OCR) processing. A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate at least one text proposal using a region proposal network (RPN). The at least one text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may merge the text proposals with one another to form a patch of the image that is predicted to contain text. The processor may determine outer coordinates of the patch. The outer coordinates may comprise at least leftmost, rightmost, topmost, and bottommost coordinates. The processor may generate a quadrilateral of the image that is a smallest quadrilateral including the leftmost, rightmost, topmost, and bottommost coordinates.

Type: Application

Filed: July 29, 2019

Publication date: February 4, 2021

Applicant: Intuit Inc.

Inventors: Terrence J. TORRES, Homa FOROUGHI
REGION PROPOSAL NETWORKS FOR AUTOMATED BOUNDING BOX DETECTION AND TEXT SEGMENTATION

Publication number: 20210034700

Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.

Type: Application

Filed: July 29, 2019

Publication date: February 4, 2021

Applicant: Intuit Inc.

Inventors: Torrence J. TORRES, Homa FOROUGHI
Method and system for background removal from documents

Patent number: 10740644

Abstract: The invention relates to a method for background removal from documents. The method includes obtaining an image of a document, performing a clustering operation on the image to obtain a plurality of image segments, and performing, for each image segment, a foreground/background classification to determine whether the image segment includes foreground. The method further includes obtaining an augmented image by combining the image segments that include foreground, and obtaining a background-treated image by cropping the image of the document, based on the foreground in the augmented image.

Type: Grant

Filed: February 27, 2018

Date of Patent: August 11, 2020

Assignee: Intuit Inc.

Inventor: Homa Foroughi
SYSTEM AND METHOD FOR INFORMATION EXTRACTION WITH CHARACTER LEVEL FEATURES

Publication number: 20200250417

Abstract: A system and method for information extraction character level features. The system and method may be used for data extraction for various types of content including a receipt or a tax form.

Type: Application

Filed: February 1, 2019

Publication date: August 6, 2020

Applicant: INTUIT INC.

Inventors: Terrence J. TORRES, Homa FOROUGHI
System and method for synthetic form image generation

Patent number: 10546054

Abstract: A method and system for generating synthetic form image involves obtaining a multitude of field value data and associated field labels for a chosen type of form document from an electronic data source, classifying the multitude of field value data into a multitude of data categories, where the multitude of data categories, learning statistical data distributions for categorical and numerical data types using the classified categorical and numerical data, and sampling data elements randomly using the learned data distributions to generate synthetic data for categorical and numerical data. The method also involves assembling the synthetic data for the multitude of data categories with the associated field labels to generate a labeled synthetic textual data set, rendering the labeled synthetic textual data set over a structured form layout image to produce a synthetic form image, and storing the synthetic form image and the labeled synthetic textual data set.

Type: Grant

Filed: February 28, 2018

Date of Patent: January 28, 2020

Assignee: Intuit Inc.

Inventors: Homa Foroughi, Joy Rimchala
LEARNING FORM-BASED INFORMATION CLASSIFICATION

Publication number: 20190303727

Abstract: A method may include extracting training tokens from a training corpus, generating training token features and training contextual features for each of the training tokens, training a probabilistic graphical model using classes, the training token features, and the training contextual features, extracting, from actual data, actual tokens including an actual token, generating actual token features and actual contextual features for each of the actual tokens, and classifying the actual token as a class using the probabilistic graphical model, the actual token features, and the actual contextual features.

Type: Application

Filed: March 28, 2018

Publication date: October 3, 2019

Applicant: Intuit Inc.

Inventors: Homa Foroughi, Joy Rimchala
METHOD AND SYSTEM FOR BACKGROUND REMOVAL FROM DOCUMENTS

Publication number: 20190266433

Abstract: The invention relates to a method for background removal from documents. The method includes obtaining an image of a document, performing a clustering operation on the image to obtain a plurality of image segments, and performing, for each image segment, a foreground/background classification to determine whether the image segment includes foreground. The method further includes obtaining an augmented image by combining the image segments that include foreground, and obtaining a background-treated image by cropping the image of the document, based on the foreground in the augmented image.

Type: Application

Filed: February 27, 2018

Publication date: August 29, 2019

Applicant: Intuit Inc.

Inventor: Homa Foroughi