Patents by Inventor Homa Foroughi

Homa Foroughi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11830263
    Abstract: A method includes executing a Optical Character Recognition (OCR) preprocessor on training images to obtain OCR preprocessor output, executing an OCR engine on the OCR preprocessor output to obtain OCR engine output, and executing an approximator on the OCR preprocessor output to obtain approximator output. The method further includes iteratively adjusting the approximator to simulate the OCR engine using the OCR engine output and the approximator output, and generating OCR preprocessor losses using the approximator output and target labels. The method further includes iteratively adjusting the OCR preprocessor using the OCR preprocessor losses to obtain a customized OCR preprocessor.
    Type: Grant
    Filed: June 16, 2021
    Date of Patent: November 28, 2023
    Assignee: Intuit Inc.
    Inventors: Xiao Xiao, Sricharan Kallur Palli Kumar, Ayantha Randika Ponnamperuma Arachchige, Nilanjan Ray, Homa Foroughi, Allegra Latimer
  • Patent number: 11831588
    Abstract: Systems and methods for personalizing messages in a conversational chatbot are disclosed. An example method may include receiving clickstream event data corresponding to click events by users of an application, generating featurized clickstream data based at least in part on the received clickstream event data, determining one or more predicted intentions for a first user based at least in part on the featurized clickstream data, and generating one or more personalized messages for the first user based at least in part on the one or more predicted user intentions.
    Type: Grant
    Filed: December 3, 2020
    Date of Patent: November 28, 2023
    Assignee: Intuit Inc.
    Inventors: Homa Foroughi, Chang Liu, Pankaj Gupta
  • Patent number: 11816883
    Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.
    Type: Grant
    Filed: May 25, 2022
    Date of Patent: November 14, 2023
    Assignee: INTUIT INC.
    Inventors: Terrence J. Torres, Homa Foroughi
  • Patent number: 11636287
    Abstract: A method may include extracting training tokens from a training corpus, generating training token features and training contextual features for each of the training tokens, training a probabilistic graphical model using classes, the training token features, and the training contextual features, extracting, from actual data, actual tokens including an actual token, generating actual token features and actual contextual features for each of the actual tokens, and classifying the actual token as a class using the probabilistic graphical model, the actual token features, and the actual contextual features.
    Type: Grant
    Filed: March 28, 2018
    Date of Patent: April 25, 2023
    Assignee: Intuit Inc.
    Inventors: Homa Foroughi, Joy Rimchala
  • Publication number: 20220414335
    Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.
    Type: Application
    Filed: May 25, 2022
    Publication date: December 29, 2022
    Applicant: INTUIT INC.
    Inventors: Terrence J. Torres, Homa Foroughi
  • Publication number: 20220405520
    Abstract: A method includes executing a Optical Character Recognition (OCR) preprocessor on training images to obtain OCR preprocessor output, executing an OCR engine on the OCR preprocessor output to obtain OCR engine output, and executing an approximator on the OCR preprocessor output to obtain approximator output. The method further includes iteratively adjusting the approximator to simulate the OCR engine using the OCR engine output and the approximator output, and generating OCR preprocessor losses using the approximator output and target labels. The method further includes iteratively adjusting the OCR preprocessor using the OCR preprocessor losses to obtain a customized OCR preprocessor.
    Type: Application
    Filed: June 16, 2021
    Publication date: December 22, 2022
    Applicant: Intuit Inc.
    Inventors: Xiao Xiao, Sricharan Kallur Palli Kumar, Ayantha Randika Ponnamperuma Arachchige, Nilanjan Ray, Homa Foroughi, Allegra Latimer
  • Patent number: 11366968
    Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.
    Type: Grant
    Filed: July 29, 2019
    Date of Patent: June 21, 2022
    Assignee: Intuit Inc.
    Inventors: Terrence J. Torres, Homa Foroughi
  • Publication number: 20220182342
    Abstract: Systems and methods for personalizing messages in a conversational chatbot are disclosed. An example method may include receiving clickstream event data corresponding to click events by users of an application, generating featurized clickstream data based at least in part on the received clickstream event data, determining one or more predicted intentions for a first user based at least in part on the featurized clickstream data, and generating one or more personalized messages for the first user based at least in part on the one or more predicted user intentions.
    Type: Application
    Filed: December 3, 2020
    Publication date: June 9, 2022
    Applicant: Intuit Inc.
    Inventors: Homa Foroughi, Chang Liu, Pankaj Gupta
  • Patent number: 11138423
    Abstract: Arbitrary image data may be transformed into data suitable for optical character recognition (OCR) processing. A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate at least one text proposal using a region proposal network (RPN). The at least one text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may merge the text proposals with one another to form a patch of the image that is predicted to contain text. The processor may determine outer coordinates of the patch. The outer coordinates may comprise at least leftmost, rightmost, topmost, and bottommost coordinates. The processor may generate a quadrilateral of the image that is a smallest quadrilateral including the leftmost, rightmost, topmost, and bottommost coordinates.
    Type: Grant
    Filed: July 29, 2019
    Date of Patent: October 5, 2021
    Assignee: Intuit Inc.
    Inventors: Terrence J. Torres, Homa Foroughi
  • Publication number: 20210240775
    Abstract: Systems and methods configured to provide automated and unsupervised inline question-answering in an online community.
    Type: Application
    Filed: February 3, 2020
    Publication date: August 5, 2021
    Applicant: Intuit Inc.
    Inventors: Chang LIU, Pankaj GUPTA, Homa FOROUGHI
  • Patent number: 11055527
    Abstract: A system and method for information extraction character level features. The system and method may be used for data extraction for various types of content including a receipt or a tax form.
    Type: Grant
    Filed: February 1, 2019
    Date of Patent: July 6, 2021
    Assignee: INTUIT INC.
    Inventors: Terrence J. Torres, Homa Foroughi
  • Publication number: 20210034856
    Abstract: Arbitrary image data may be transformed into data suitable for optical character recognition (OCR) processing. A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate at least one text proposal using a region proposal network (RPN). The at least one text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may merge the text proposals with one another to form a patch of the image that is predicted to contain text. The processor may determine outer coordinates of the patch. The outer coordinates may comprise at least leftmost, rightmost, topmost, and bottommost coordinates. The processor may generate a quadrilateral of the image that is a smallest quadrilateral including the leftmost, rightmost, topmost, and bottommost coordinates.
    Type: Application
    Filed: July 29, 2019
    Publication date: February 4, 2021
    Applicant: Intuit Inc.
    Inventors: Terrence J. TORRES, Homa FOROUGHI
  • Publication number: 20210034700
    Abstract: A processor may generate a plurality of intermediate feature layers of an image using convolutional neural network (CNN) processing. For each intermediate feature layer, the processor may generate a plurality of text proposals using a region proposal network (RPN). Each text proposal may comprise a portion of the intermediate feature layer that is predicted to contain text. The processor may perform OCR processing on image data within a plurality of regions of the image to generate a text result for each region. Each region may comprise at least one of the text proposals. The processor may assemble the text results into a text string comprising the text results ordered according to a spatial order in which the plurality of regions appear within the image.
    Type: Application
    Filed: July 29, 2019
    Publication date: February 4, 2021
    Applicant: Intuit Inc.
    Inventors: Torrence J. TORRES, Homa FOROUGHI
  • Patent number: 10740644
    Abstract: The invention relates to a method for background removal from documents. The method includes obtaining an image of a document, performing a clustering operation on the image to obtain a plurality of image segments, and performing, for each image segment, a foreground/background classification to determine whether the image segment includes foreground. The method further includes obtaining an augmented image by combining the image segments that include foreground, and obtaining a background-treated image by cropping the image of the document, based on the foreground in the augmented image.
    Type: Grant
    Filed: February 27, 2018
    Date of Patent: August 11, 2020
    Assignee: Intuit Inc.
    Inventor: Homa Foroughi
  • Publication number: 20200250417
    Abstract: A system and method for information extraction character level features. The system and method may be used for data extraction for various types of content including a receipt or a tax form.
    Type: Application
    Filed: February 1, 2019
    Publication date: August 6, 2020
    Applicant: INTUIT INC.
    Inventors: Terrence J. TORRES, Homa FOROUGHI
  • Patent number: 10546054
    Abstract: A method and system for generating synthetic form image involves obtaining a multitude of field value data and associated field labels for a chosen type of form document from an electronic data source, classifying the multitude of field value data into a multitude of data categories, where the multitude of data categories, learning statistical data distributions for categorical and numerical data types using the classified categorical and numerical data, and sampling data elements randomly using the learned data distributions to generate synthetic data for categorical and numerical data. The method also involves assembling the synthetic data for the multitude of data categories with the associated field labels to generate a labeled synthetic textual data set, rendering the labeled synthetic textual data set over a structured form layout image to produce a synthetic form image, and storing the synthetic form image and the labeled synthetic textual data set.
    Type: Grant
    Filed: February 28, 2018
    Date of Patent: January 28, 2020
    Assignee: Intuit Inc.
    Inventors: Homa Foroughi, Joy Rimchala
  • Publication number: 20190303727
    Abstract: A method may include extracting training tokens from a training corpus, generating training token features and training contextual features for each of the training tokens, training a probabilistic graphical model using classes, the training token features, and the training contextual features, extracting, from actual data, actual tokens including an actual token, generating actual token features and actual contextual features for each of the actual tokens, and classifying the actual token as a class using the probabilistic graphical model, the actual token features, and the actual contextual features.
    Type: Application
    Filed: March 28, 2018
    Publication date: October 3, 2019
    Applicant: Intuit Inc.
    Inventors: Homa Foroughi, Joy Rimchala
  • Publication number: 20190266433
    Abstract: The invention relates to a method for background removal from documents. The method includes obtaining an image of a document, performing a clustering operation on the image to obtain a plurality of image segments, and performing, for each image segment, a foreground/background classification to determine whether the image segment includes foreground. The method further includes obtaining an augmented image by combining the image segments that include foreground, and obtaining a background-treated image by cropping the image of the document, based on the foreground in the augmented image.
    Type: Application
    Filed: February 27, 2018
    Publication date: August 29, 2019
    Applicant: Intuit Inc.
    Inventor: Homa Foroughi