Patents by Inventor Dinei Afonso Ferreira Florencio

Dinei Afonso Ferreira Florencio has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Techniques for Pretraining Document Language Models for Example-Based Document Classification

Publication number: 20230401386

Abstract: A data processing system implements a method for training machine learning modes, including receiving a set of one or more unlabeled documents associated one or more first categories of documents to be used to train machine learning models to analyze the one or more unlabeled documents, and fine-tuning a first machine learning model and a second machine learning model based on the one or more unlabeled document to enable the first machine learning model to determine a semantic representation of the one or more first categories of document, and to enable the second machine learning model to classify the semantic representations according to the one or more first categories of documents, the first machine learning model and the second machine learning model having been trained using first unlabeled training data including a second plurality of categories of documents that do not include the one or more first categories of documents.

Type: Application

Filed: June 9, 2022

Publication date: December 14, 2023

Applicant: Microsoft Technology Licensing, LLC

Inventors: Guoxin WANG, Dinei Afonso Ferreira FLORENCIO, Wenfeng CHENG
ENTRY DETECTION AND RECOGNITION FOR CUSTOM FORMS

Publication number: 20230084845

Abstract: The disclosure herein describes providing signature data of an input document. Text data of the input document is obtained (e.g., OCR data generated from image data) and a first set of signature fields are identified using signature key-value pairs of the text data. A first subset of signed signature fields and a first subset of unsigned signature fields are determined based on mapping to a set of predicted values. A second set of signature fields are determined using a region prediction model applied to image data of the input document. Region images associated with the first subset of unsigned signature fields and with second set of signature fields are obtained and a second set of signed signature fields and a second set of unsigned signature fields are determined using a signature recognition model. Signature output data is provided including signed signature fields and/or unsigned signature fields.

Type: Application

Filed: September 13, 2021

Publication date: March 16, 2023

Inventors: Yijuan LU, Lynsey LIU, Andrei A. GAIVORONSKI, Yu CHENG, Dinei Afonso Ferreira FLORENCIO, Cha ZHANG, John Richard CORRING
Enhanced supervised form understanding

Patent number: 11562588

Abstract: Interfaces and systems are provided for harvesting ground truth from forms to be used in training models based on key-value pairings in the forms and to later use the trained models to identify related key-value pairings in new forms. Initially, forms are identified and clustered to identify a subset of forms to label with the key-value pairings. Users provide input to identify keys to use in labeling and then select/highlight text from forms that are presented concurrently with the keys in order to associate the highlighted text with the key(s) as the corresponding key-value pairing(s). After labeling the forms with the key-value pairings, the key-value pairing data is used as ground truth for training a model to independently identify the key-value pairing(s) in new forms. Once trained, the model is used to identify the key-value pairing(s) in new forms.

Type: Grant

Filed: March 26, 2020

Date of Patent: January 24, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Dinei Afonso Ferreira Florencio, Yu-Yun Dai, Cha Zhang, Shih Chia Wang
APPLICATION-SPECIFIC OPTICAL CHARACTER RECOGNITION CUSTOMIZATION

Publication number: 20220391647

Abstract: A method for customizing an optical character recognition system is disclosed. The optical character recognition system includes a general-purpose decoder configured to convert character images, recognized in a digital image, into text based on a general-purpose text structure. An application-specific customization is received. The application-specific customization includes an application-specific text structure that differs from the general-purpose text structure. A customized model is generated based on the application-specific customization. An enhanced application-specific decoder is generated by modifying the general-purpose decoder to, during run-time execution of the optical character recognition system, leverage the customized model to convert character images demonstrating the application-specific text structure into text.

Type: Application

Filed: June 3, 2021

Publication date: December 8, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Baoguang SHI, Dinei Afonso Ferreira FLORENCIO
Supervised OCR training for custom forms

Patent number: 11093740

Abstract: The disclosed technology is generally directed to optical character recognition for forms. In one example of the technology, optical character recognition is performed on a plurality of forms. The forms of the plurality of forms include at least one type of form. Anchors are determined for the forms, including corresponding anchors for each type of form of the plurality of forms. Feature rules are determined, including corresponding feature rules for each type of form of the plurality of forms. Features and labels are determined for each form of the plurality of forms. A training model is generated based on a ground truth that includes a plurality of key-value pairs corresponding to the plurality of forms, and further based on the determined features and labels for the plurality of forms.

Type: Grant

Filed: November 9, 2018

Date of Patent: August 17, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Dinei Afonso Ferreira Florencio, Cha Zhang, Gil Moshe Nahmias, Yu-Yun Dai
Unsupervised domain adaptation from generic forms for new OCR forms

Patent number: 11055560

Abstract: The disclosed technology is generally directed to optical text recognition for forms. In one example of the technology, line grouping rules are generated based on the generic forms and a ground truth for the generic forms. Line groupings are applied to the generic forms based on the line grouping rules. Feature extraction rules are generated. Features are extracted from the generic forms based on the feature extraction rules. A key-value classifier model is generated, such that the key-value classifier model is configured to determine, for each line of a form: a probability that the line is a value, and a probability that the line is a key. A key-value pairing model is generated, such that the key-value pairing model is configured to predict, for each key in a form, which value in the form corresponds to the key.

Type: Grant

Filed: May 15, 2019

Date of Patent: July 6, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Dinei Afonso Ferreira Florencio, Cha Zhang, Gil Moshe Nahmias, Yu-Yun Dai, Sean Louis Goldberg
ENHANCED SUPERVISED FORM UNDERSTANDING

Publication number: 20210133438

Abstract: Interfaces and systems are provided for harvesting ground truth from forms to be used in training models based on key-value pairings in the forms and to later use the trained models to identify related key-value pairings in new forms. Initially, forms are identified and clustered to identify a subset of forms to label with the key-value pairings. Users provide input to identify keys to use in labeling and then select/highlight text from forms that are presented concurrently with the keys in order to associate the highlighted text with the key(s) as the corresponding key-value pairing(s). After labeling the forms with the key-value pairings, the key-value pairing data is used as ground truth for training a model to independently identify the key-value pairing(s) in new forms. Once trained, the model is used to identify the key-value pairing(s) in new forms.

Type: Application

Filed: March 26, 2020

Publication date: May 6, 2021

Inventors: Dinei Afonso Ferreira Florencio, Yu-Yun Dai, Cha Zhang, Shih Chia Wang
UNSUPERVISED DOMAIN ADAPTATION FROM GENERIC FORMS FOR NEW OCR FORMS

Publication number: 20200160086

Abstract: The disclosed technology is generally directed to optical text recognition for forms. In one example of the technology, line grouping rules are generated based on the generic forms and a ground truth for the generic forms. Line groupings are applied to the generic forms based on the line grouping rules. Feature extraction rules are generated. Features are extracted from the generic forms based on the feature extraction rules. A key-value classifier model is generated, such that the key-value classifier model is configured to determine, for each line of a form: a probability that the line is a value, and a probability that the line is a key. A key-value pairing model is generated, such that the key-value pairing model is configured to predict, for each key in a form, which value in the form corresponds to the key.

Type: Application

Filed: May 15, 2019

Publication date: May 21, 2020

Inventors: Dinei Afonso Ferreira FLORENCIO, Cha ZHANG, Gil Moshe NAHMIAS, Yu-Yun DAI, Sean Louis GOLDBERG
SUPERVISED OCR TRAINING FOR CUSTOM FORMS

Publication number: 20200151443

Abstract: The disclosed technology is generally directed to optical character recognition for forms. In one example of the technology, optical character recognition is performed on a plurality of forms. The forms of the plurality of forms include at least one type of form. Anchors are determined for the forms, including corresponding anchors for each type of form of the plurality of forms. Feature rules are determined, including corresponding feature rules for each type of form of the plurality of forms. Features and labels are determined for each form of the plurality of forms. A training model is generated based on a ground truth that includes a plurality of key-value pairs corresponding to the plurality of forms, and further based on the determined features and labels for the plurality of forms.

Type: Application

Filed: November 9, 2018

Publication date: May 14, 2020

Inventors: Dinei Afonso Ferreira FLORENCIO, Cha ZHANG, Gil Moshe NAHMIAS, Yu-Yun DAI
Audio data transmission using frequency hopping

Patent number: 10397287

Abstract: A method includes obtaining data representing multiple characters, determining a code for each character wherein each code corresponds to a different audio frequency, and transmitting the codes at the corresponding audio frequencies.

Type: Grant

Filed: March 1, 2017

Date of Patent: August 27, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Zhengyou Zhang, Dinei Afonso Ferreira Florencio, Sasa Junuzovic
AUTOMATED PRESENTATION EQUIPMENT TESTING

Publication number: 20180332261

Abstract: An apparatus that automatically monitors a display device includes a photo sensor configured to receive light from a display screen of the display device. The photo sensor provides signals representing detected light levels to a processor. The processor is coupled to the display device and is configured to cause the display device to present a test sequence including a plurality of images on the display screen. The processor is configured to capture data from the photo sensor during the presentation of the test sequence and to compare the captured data to an expected sequence corresponding to the test sequence displayed by a well-functioning display. The processor is further configured to report any mismatch between the captured data and the expected sequence as a possible malfunction of the display device.

Type: Application

Filed: May 9, 2017

Publication date: November 15, 2018

Inventors: Zhengyou Zhang, Zicheng Liu, Dinei Afonso Ferreira Florencio, Sasa Junuzovic
Audio Data Transmission Using Frequency Hopping

Publication number: 20180255111

Abstract: A method includes obtaining data representing multiple characters, determining a code for each character wherein each code corresponds to a different audio frequency, and transmitting the codes at the corresponding audio frequencies.

Type: Application

Filed: March 1, 2017

Publication date: September 6, 2018

Inventors: Zhengyou Zhang, Dinei Afonso Ferreira Florencio, Sasa Junuzovic
Adaptive meeting management

Patent number: 9111263

Abstract: A template and/or knowledge associated with a synchronous meeting are obtained by a computing device. The computing device then adaptively manages the synchronous meeting based at least in part on the template and/or knowledge.

Type: Grant

Filed: June 15, 2009

Date of Patent: August 18, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jin Li, James E. Oker, Rajesh K. Hegde, Dinei Afonso Ferreira Florencio, Michel Pahud, Sharon K. Cunnington, Philip A. Chou, Zhengyou Zhang
Method and system for deterring product counterfeiting

Publication number: 20140324716

Abstract: The claimed subject matter relates to an architecture to produce disincentives to wearing counterfeit or stolen merchandise in public. In particular, the architecture utilizes a unique identifier associated with each unit of the product, and provides both a registration channel for receiving ownership registration and a verification channel to receive requests for verification. By way of illustration, the architecture can include associating a brand logotype that includes unique markings with each unit of a product, a private web service where the retailer may upload customer information at the time of sale, and a publicly available web service, where a third party may inquire about the ownership of a product containing a certain unique identifier.

Type: Application

Filed: April 29, 2013

Publication date: October 30, 2014

Inventors: Carolina Haber Florencio, Dinei Afonso Ferreira Florencio
Three-dimensional (3D) imaging based on MotionParallax

Patent number: 8743187

Abstract: Techniques and technologies are described herein for motion parallax three-dimensional (3D) imaging. Such techniques and technologies do not require special glasses, virtual reality helmets, or other user-attachable devices. More particularly, some of the described motion parallax 3D imaging techniques and technologies generate sequential images, including motion parallax depictions of various scenes derived from clues in views obtained of or created for the displayed scene.

Type: Grant

Filed: June 6, 2012

Date of Patent: June 3, 2014

Assignee: Microsoft Corporation

Inventors: Dinei Afonso Ferreira Florencio, Cha Zhang
Three-Dimensional (3D) Imaging Based on MotionParallax

Publication number: 20120242810

Abstract: Techniques and technologies are described herein for motion parallax three-dimensional (3D) imaging. Such techniques and technologies do not require special glasses, virtual reality helmets, or other user-attachable devices. More particularly, some of the described motion parallax 3D imaging techniques and technologies generate sequential images, including motion parallax depictions of various scenes derived from clues in views obtained of or created for the displayed scene.

Type: Application

Filed: June 6, 2012

Publication date: September 27, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Dinei Afonso Ferreira Florencio, Cha Zhang
Three-dimensional (3D) imaging based on motionparallax

Patent number: 8199186

Abstract: Techniques and technologies are described herein for motion parallax three-dimensional (3D) imaging. Such techniques and technologies do not require special glasses, virtual reality helmets, or other user-attachable devices. More particularly, some of the described motion parallax 3D imaging techniques and technologies generate sequential images, including motion parallax depictions of various scenes derived from clues in views obtained of or created for the displayed scene.

Type: Grant

Filed: March 5, 2009

Date of Patent: June 12, 2012

Assignee: Microsoft Corporation

Inventors: Dinei Afonso Ferreira Florencio, Cha Zhang
SOUND SOURCE LOCALIZATION BASED ON REFLECTIONS AND ROOM ESTIMATION

Publication number: 20110317522

Abstract: Described is modeling a room to obtain estimates for walls and a ceiling, and using the model to improve sound source localization by incorporating reflection (reverberation) data into the location estimation computations. In a calibration step, reflections of a known sound are detected at a microphone array, with their corresponding signals processed to estimate wall (and ceiling) locations. In a sound source localization step, when an actual sound (including reverberations) is detected, the signals are processed into hypotheses that include reflection data predictions based upon possible locations, given the room model. The location corresponding to the hypothesis that matches (maximum likelihood) the actual sound data is the estimated location of the sound source.

Type: Application

Filed: June 28, 2010

Publication date: December 29, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Dinei Afonso Ferreira Florencio, Cha Zhang, Flavio Protasio Ribeiro, Demba Elimane Ba
Adaptive Meeting Management

Publication number: 20100318399

Abstract: A template and/or knowledge associated with a synchronous meeting are obtained by a computing device. The computing device then adaptively manages the synchronous meeting based at least in part on the template and/or knowledge.

Type: Application

Filed: June 15, 2009

Publication date: December 16, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Jin Li, James E. Oker, Rajesh K. Hegde, Dinei Afonso Ferreira Florencio, Michel Pahud, Sharon K. Cunnington, Philip A. Chou, Zhengyou Zhang
Three-Dimensional (3D) Imaging Based on MotionParallax

Publication number: 20100225743

Abstract: Techniques and technologies are described herein for motion parallax three-dimensional (3D) imaging. Such techniques and technologies do not require special glasses, virtual reality helmets, or other user-attachable devices. More particularly, some of the described motion parallax 3D imaging techniques and technologies generate sequential images, including motion parallax depictions of various scenes derived from clues in views obtained of or created for the displayed scene.

Type: Application

Filed: March 5, 2009

Publication date: September 9, 2010

Applicant: Microsoft Corporation

Inventors: Dinei Afonso Ferreira Florencio, Cha Zhang

1 2 next