Patents by Inventor Paul Viola

Paul Viola has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Processing an electronic document for information extraction

Patent number: 7672940

Abstract: The present invention relates generally to automatically processing electronic documents. In one aspect, features and/or properties of words are identified from a set of training documents to aid in extracting information from documents to be processed. The features and/or properties relate to text of the words, position of the words and the relationship to other words. A classifier is developed to express these features and/or properties. During information extraction, documents are processed and analyzed based on the classifier and information is extracted based on correspondence of the documents and the features/properties expressed by the classifier.

Type: Grant

Filed: April 29, 2004

Date of Patent: March 2, 2010

Assignee: Microsoft Corporation

Inventors: Paul Viola, Hiu Chung Law, James Rinker
Systems and methods that facilitate improved display of electronic documents

Patent number: 7661065

Abstract: A computer-implemented word processing system comprises an interface component that receives a features vector associated with an electronic document. An analysis component communicatively coupled to the interface component analyzes the features vector and determines a viewing mode in which to display the electronic document. In accordance with one aspect of the subject invention, the viewing mode can be one of a conventional viewing mode and a viewing mode associated with enhanced readability.

Type: Grant

Filed: May 24, 2005

Date of Patent: February 9, 2010

Assignee: Microsoft Corporation

Inventors: Radoslav Petrov Nickolov, Kumar H. Chellapilla, David M. Bargeron, Patrice Y. Simard, Paul A. Viola
Methods and apparatus for reconfiguring protocol data when reducing multiplexed data streams

Patent number: 7653089

Abstract: The present invention provides methods and apparatus for reconfiguring protocol data for a multiplexed data stream which is reduced to carry fewer services for cable-side transmission in a cable television plant or the like. More particularly, the present invention provides methods and apparatus for reconfiguring protocol data for a desired combination of data streams contained within an incoming high data rate multiplexed data stream, such as a high data rate Quadrature Phase Shift Keying (QPSK) modulated multiplexed data stream, when the incoming multiplexed data stream is reduced.

Type: Grant

Filed: December 23, 2004

Date of Patent: January 26, 2010

Assignee: General Instrument Corporation

Inventors: Arthur P. Jost, Erik Elstermann, Jeffrey D. Kuczynski-Brown, Richard DiColli, Jeffrey Paul Viola
Application of grammatical parsing to visual recognition tasks

Patent number: 7639881

Abstract: Image recognition is utilized to facilitate in scoring parse trees for two-dimensional recognition tasks. Trees and subtrees are rendered as images and then utilized to determine parsing scores. Other instances of the subject invention can incorporate additional features such as stroke curvature and/or nearby white space as rendered images as well. Geometric constraints can also be employed to increase performance of a parsing process, substantially improving parsing speed, some even resolvable in polynomial time. Additional performance enhancements can be achieved in yet other instances of the subject invention by employing constellations of integral images and/or integral images of document features.

Type: Grant

Filed: June 13, 2005

Date of Patent: December 29, 2009

Assignee: Microsoft Corporation

Inventors: Paul A. Viola, Michael Shilman
FEATURE SELECTION AND EXTRACTION

Publication number: 20090316986

Abstract: Image feature selection and extraction (e.g., for image classifier training) is accomplished in an integrated manner, such that higher-order features are merely developed from first-order features selected for image classification. That is, first-order image features are selected for image classification from an image feature pool, initially populated with pre-extracted first-order image features. The selected first-order classifying features are paired with previously selected first-order classifying features to generate higher-order features. The higher-order features are placed into the image feature pool as they are developed or “on-the-fly” (e.g., for use in image classifier training).

Type: Application

Filed: April 25, 2008

Publication date: December 24, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Gang Hua, Paul Viola, David Liu
System and method for tracking movement of individuals

Patent number: 7619513

Abstract: A device for monitoring movement of an object is provided. A first module is configured to secure to the object. A second module, capable of electrically connecting to the first module, includes at least a rechargeable battery and a memory capable of storing a history of movement data. A third module, capable of electrically connecting with the second module, includes a data modem capable of connecting to a remote station, and a battery charger. When the second module is connected to the first module, the memory periodically records available location data representing a position of the device at the time of recording. When the second module is connected to the third module, the memory downloads through the data modem and the battery charger charges the battery.

Type: Grant

Filed: November 14, 2005

Date of Patent: November 17, 2009

Assignee: Satellite Tracking of People LLC

Inventors: Maurice L. Hill, Michael Mocenter, Joeseph S. Reiter, Paul Viola, Brian Moran
IMAGE CLASSIFICATION

Publication number: 20090252413

Abstract: Images are classified as photos (e.g., natural photographs) or graphics (e.g., cartoons, synthetically generated images), such that when searched (online) with a filter, an image database returns images corresponding to the filter criteria (e.g., either photos or graphics will be returned). A set of image statistics pertaining to various visual cues (e.g., color, texture, shape) are identified in classifying the images. These image statistics, combined with pre-tagged image metadata defining an image as either a graphic or a photo, may be used to train a boosting decision tree. The trained boosting decision tree may be used to classify additional images as graphics or photos based on image statistics determined for the additional images.

Type: Application

Filed: April 4, 2008

Publication date: October 8, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Gang Hua, Paul Viola
Systems and methods for detecting text

Patent number: 7570816

Abstract: The subject invention relates to facilitating text detection. The invention employs a boosted classifier and a transductive classifier to provide accurate and efficient text detection systems and/or methods. The boosted classifier is trained through features generated from a set of training connected components and labels. The boosted classifier utilizes the features to classify the training connected components, wherein inferred labels are conveyed to a transductive classifier, which generates additional properties. The initial set of features and the properties are utilized to train the transductive classifier. Upon training, the system and/or methods can be utilized to detect text in data under text detection, wherein unlabeled data is received, and connected components are extracted therefrom and utilized to generate corresponding feature vectors, which are employed to classify the connected components using the initial boosted classifier.

Type: Grant

Filed: March 31, 2005

Date of Patent: August 4, 2009

Assignee: Microsoft Corporation

Inventors: David M Bargeron, Patrice Y Simard, Paul A Viola
Continuous inference for sequence data

Patent number: 7551784

Abstract: Dynamic inference is leveraged to provide online sequence data labeling. This provides real-time alternatives to current methods of inference for sequence data. Instances estimate an amount of uncertainty in a prediction of labels of sequence data and then dynamically predict a label when an uncertainty in the prediction is deemed acceptable. The techniques utilized to determine when the label can be generated are tunable and can be personalized for a given user and/or a system. Employed decoding techniques can be dynamically adjusted to tradeoff system resources for accuracy. This allows for fine tuning of a system based on available system resources. Instances also allow for online inference because the inference does not require knowledge of a complete set of sequence data.

Type: Grant

Filed: June 1, 2006

Date of Patent: June 23, 2009

Assignee: Microsoft Corporation

Inventors: Mukund Narasimhan, Paul A. Viola, Michael Shilman
Low resolution OCR for camera acquired documents

Patent number: 7499588

Abstract: A global optimization framework for optical character recognition (OCR) of low-resolution photographed documents that combines a binarization-type process, segmentation, and recognition into a single process. The framework includes a machine learning approach trained on a large amount of data. A convolutional neural network can be employed to compute a classification function at multiple positions and take grey-level input which eliminates binarization. The framework utilizes preprocessing, layout analysis, character recognition, and word recognition to output high recognition rates. The framework also employs dynamic programming and language models to arrive at the desired output.

Type: Grant

Filed: May 20, 2004

Date of Patent: March 3, 2009

Assignee: Microsoft Corporation

Inventors: Charles E. Jacobs, James R. Rinker, Patrice Y. Simard, Paul A. Viola
HISTOGRAM-BASED CLASSIFIERS HAVING VARIABLE BIN SIZES

Publication number: 20090018985

Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.

Type: Application

Filed: July 13, 2007

Publication date: January 15, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Cha Zhang, Paul Viola
MULTIPLE-INSTANCE PRUNING FOR LEARNING EFFICIENT CASCADE DETECTORS

Publication number: 20090018980

Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.

Type: Application

Filed: July 13, 2007

Publication date: January 15, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Cha Zhang, Paul Viola
LEARNING CLASSIFIERS USING COMBINED BOOSTING AND WEIGHT TRIMMING

Publication number: 20090018981

Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.

Type: Application

Filed: July 13, 2007

Publication date: January 15, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Cha Zhang, Paul Viola
Face Recognition Using Discriminatively Trained Orthogonal Tensor Projections

Publication number: 20080310687

Abstract: Systems and methods are described for face recognition using discriminatively trained orthogonal rank one tensor projections. In an exemplary system, images are treated as tensors, rather than as conventional vectors of pixels. During runtime, the system designs visual features—embodied as tensor projections—that minimize intraclass differences between instances of the same face while maximizing interclass differences between the face and faces of different people. Tensor projections are pursued sequentially over a training set of images and take the form of a rank one tensor, i.e., the outer product of a set of vectors. An exemplary technique ensures that the tensor projections are orthogonal to one another, thereby increasing ability to generalize and discriminate image features over conventional techniques.

Type: Application

Filed: June 15, 2007

Publication date: December 18, 2008

Applicant: Microsoft Corporation

Inventors: Gang Hua, Paul A. Viola, Steven M. Drucker, Michael Revow
Recognition of mathematical expressions

Publication number: 20080260251

Abstract: In embodiments consistent with the subject matter of this disclosure, a user may input strokes as digital ink to a processing device. The processing device may partition the input strokes into multiple regions of strokes. A first recognizer and a second recognizer may score grammar objects included in regions and represented by chart entries. The scores may be converted to a converted score, which may have at least a near standard normal distribution. The processing device may present a recognition result based on highest converted scores according to a recurrence formula. The processing device may receive a correction hint with respect to misrecognized strokes and may add a penalty score with respect to chart entries representing grammar objects breaking the correction hint. Incremental recognition may be performed when a pause is detected during inputting of strokes.

Type: Application

Filed: April 19, 2007

Publication date: October 23, 2008

Applicant: Microsoft Corporation

Inventors: Goran Predovic, Ahmad Abdulkader, Bodin Dresevic, Paul A. Viola, Milan Vukosavljevic
Learning A* priority function from unlabeled data

Publication number: 20080256007

Abstract: A technique for increasing efficiency of inference of structure variables (e.g., an inference problem) using a priority-driven algorithm rather than conventional dynamic programming. The technique employs a probable approximate underestimate which can be used to compute a probable approximate solution to the inference problem when used as a priority function (“a probable approximate underestimate function”) for a more computationally complex classification function. The probable approximate underestimate function can have a functional form of a simpler, easier to decode model. The model can be learned from unlabeled data by solving a linear/quadratic optimization problem. The priority function can be computed quickly, and can result in solutions that are substantially optimal. Using the priority function, computation efficiency of a classification function (e.g., discriminative classifier) can be increased using a generalization of the A* algorithm.

Type: Application

Filed: April 10, 2007

Publication date: October 16, 2008

Applicant: Microsoft Corporation

Inventors: Mukund Narasimhan, Paul A. Viola, Gregory Druck
Image Organization

Publication number: 20080226174

Abstract: A system for organizing images includes an extraction component that extracts visual information (e.g., faces, scenes, etc.) from the images. The extracted visual information is provided to a comparison component which computes similarity confidence data between the extracted visual information. The similarity confidence data is an indication of the likelihood that items of extracted visual information are similar. The comparison component then generates a visual distribution of the extracted visual information based upon the similarity confidence data. The visual distribution can include groupings of the extracted visual information based on computed similarity confidence data. For example, the visual distribution can be a two-dimensional layout of faces organized based on the computed similarity confidence data—with faces in closer proximity faces computed to have a greater probability of representing the same person.

Type: Application

Filed: March 15, 2007

Publication date: September 18, 2008

Applicant: Microsoft Corporation

Inventors: Gang Hua, Steven M. Drucker, Michael Revow, Paul A. Viola, Richard Zemel
Parsing of ink annotations

Publication number: 20080195931

Abstract: Annotation recognition and parsing is accomplished by first recognizing and grouping shapes such that relationships between the annotations and the underlying text and/or images can be determined. The recognition and grouping is followed by categorization of recognized annotations according to predefined types. The classification may be according to functionality, relation to content, and the like. In a third phase, the annotations are anchored to the underlying text or images they are found to be related to.

Type: Application

Filed: October 27, 2006

Publication date: August 14, 2008

Applicant: Microsoft Corporation

Inventors: Sashi Raghupathy, Paul A. Viola, Michael Shilman, Xin Wang
Fast Landmark Detection Using Regression Methods

Publication number: 20080187213

Abstract: A landmark detection technique that can quickly detect both objects of interest and landmarks within the objects in an image using regression methods. The present fast landmark detection scheme reuses existing feature values used for object detection (e.g., face detection) to find the landmarks in an object (e.g., the eyes and mouth of the face). Hence, the technique provides landmark detection functionality at almost no cost.

Type: Application

Filed: February 6, 2007

Publication date: August 7, 2008

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Paul Viola, Sang Min Oh
Identification Of People Using Multiple Types Of Input

Publication number: 20070297682

Abstract: Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may be evaluated to detect people or speakers.

Type: Application

Filed: June 22, 2006

Publication date: December 27, 2007

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Paul A. Viola, Pei Yin, Ross G. Cutler, Xinding Sun, Yong Rui

prev 1 2 3 4 5 6 next