Patents by Inventor Paul Viola

Paul Viola has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7672940
    Abstract: The present invention relates generally to automatically processing electronic documents. In one aspect, features and/or properties of words are identified from a set of training documents to aid in extracting information from documents to be processed. The features and/or properties relate to text of the words, position of the words and the relationship to other words. A classifier is developed to express these features and/or properties. During information extraction, documents are processed and analyzed based on the classifier and information is extracted based on correspondence of the documents and the features/properties expressed by the classifier.
    Type: Grant
    Filed: April 29, 2004
    Date of Patent: March 2, 2010
    Assignee: Microsoft Corporation
    Inventors: Paul Viola, Hiu Chung Law, James Rinker
  • Patent number: 7661065
    Abstract: A computer-implemented word processing system comprises an interface component that receives a features vector associated with an electronic document. An analysis component communicatively coupled to the interface component analyzes the features vector and determines a viewing mode in which to display the electronic document. In accordance with one aspect of the subject invention, the viewing mode can be one of a conventional viewing mode and a viewing mode associated with enhanced readability.
    Type: Grant
    Filed: May 24, 2005
    Date of Patent: February 9, 2010
    Assignee: Microsoft Corporation
    Inventors: Radoslav Petrov Nickolov, Kumar H. Chellapilla, David M. Bargeron, Patrice Y. Simard, Paul A. Viola
  • Patent number: 7653089
    Abstract: The present invention provides methods and apparatus for reconfiguring protocol data for a multiplexed data stream which is reduced to carry fewer services for cable-side transmission in a cable television plant or the like. More particularly, the present invention provides methods and apparatus for reconfiguring protocol data for a desired combination of data streams contained within an incoming high data rate multiplexed data stream, such as a high data rate Quadrature Phase Shift Keying (QPSK) modulated multiplexed data stream, when the incoming multiplexed data stream is reduced.
    Type: Grant
    Filed: December 23, 2004
    Date of Patent: January 26, 2010
    Assignee: General Instrument Corporation
    Inventors: Arthur P. Jost, Erik Elstermann, Jeffrey D. Kuczynski-Brown, Richard DiColli, Jeffrey Paul Viola
  • Patent number: 7639881
    Abstract: Image recognition is utilized to facilitate in scoring parse trees for two-dimensional recognition tasks. Trees and subtrees are rendered as images and then utilized to determine parsing scores. Other instances of the subject invention can incorporate additional features such as stroke curvature and/or nearby white space as rendered images as well. Geometric constraints can also be employed to increase performance of a parsing process, substantially improving parsing speed, some even resolvable in polynomial time. Additional performance enhancements can be achieved in yet other instances of the subject invention by employing constellations of integral images and/or integral images of document features.
    Type: Grant
    Filed: June 13, 2005
    Date of Patent: December 29, 2009
    Assignee: Microsoft Corporation
    Inventors: Paul A. Viola, Michael Shilman
  • Publication number: 20090316986
    Abstract: Image feature selection and extraction (e.g., for image classifier training) is accomplished in an integrated manner, such that higher-order features are merely developed from first-order features selected for image classification. That is, first-order image features are selected for image classification from an image feature pool, initially populated with pre-extracted first-order image features. The selected first-order classifying features are paired with previously selected first-order classifying features to generate higher-order features. The higher-order features are placed into the image feature pool as they are developed or “on-the-fly” (e.g., for use in image classifier training).
    Type: Application
    Filed: April 25, 2008
    Publication date: December 24, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Gang Hua, Paul Viola, David Liu
  • Patent number: 7619513
    Abstract: A device for monitoring movement of an object is provided. A first module is configured to secure to the object. A second module, capable of electrically connecting to the first module, includes at least a rechargeable battery and a memory capable of storing a history of movement data. A third module, capable of electrically connecting with the second module, includes a data modem capable of connecting to a remote station, and a battery charger. When the second module is connected to the first module, the memory periodically records available location data representing a position of the device at the time of recording. When the second module is connected to the third module, the memory downloads through the data modem and the battery charger charges the battery.
    Type: Grant
    Filed: November 14, 2005
    Date of Patent: November 17, 2009
    Assignee: Satellite Tracking of People LLC
    Inventors: Maurice L. Hill, Michael Mocenter, Joeseph S. Reiter, Paul Viola, Brian Moran
  • Publication number: 20090252413
    Abstract: Images are classified as photos (e.g., natural photographs) or graphics (e.g., cartoons, synthetically generated images), such that when searched (online) with a filter, an image database returns images corresponding to the filter criteria (e.g., either photos or graphics will be returned). A set of image statistics pertaining to various visual cues (e.g., color, texture, shape) are identified in classifying the images. These image statistics, combined with pre-tagged image metadata defining an image as either a graphic or a photo, may be used to train a boosting decision tree. The trained boosting decision tree may be used to classify additional images as graphics or photos based on image statistics determined for the additional images.
    Type: Application
    Filed: April 4, 2008
    Publication date: October 8, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Gang Hua, Paul Viola
  • Patent number: 7570816
    Abstract: The subject invention relates to facilitating text detection. The invention employs a boosted classifier and a transductive classifier to provide accurate and efficient text detection systems and/or methods. The boosted classifier is trained through features generated from a set of training connected components and labels. The boosted classifier utilizes the features to classify the training connected components, wherein inferred labels are conveyed to a transductive classifier, which generates additional properties. The initial set of features and the properties are utilized to train the transductive classifier. Upon training, the system and/or methods can be utilized to detect text in data under text detection, wherein unlabeled data is received, and connected components are extracted therefrom and utilized to generate corresponding feature vectors, which are employed to classify the connected components using the initial boosted classifier.
    Type: Grant
    Filed: March 31, 2005
    Date of Patent: August 4, 2009
    Assignee: Microsoft Corporation
    Inventors: David M Bargeron, Patrice Y Simard, Paul A Viola
  • Patent number: 7551784
    Abstract: Dynamic inference is leveraged to provide online sequence data labeling. This provides real-time alternatives to current methods of inference for sequence data. Instances estimate an amount of uncertainty in a prediction of labels of sequence data and then dynamically predict a label when an uncertainty in the prediction is deemed acceptable. The techniques utilized to determine when the label can be generated are tunable and can be personalized for a given user and/or a system. Employed decoding techniques can be dynamically adjusted to tradeoff system resources for accuracy. This allows for fine tuning of a system based on available system resources. Instances also allow for online inference because the inference does not require knowledge of a complete set of sequence data.
    Type: Grant
    Filed: June 1, 2006
    Date of Patent: June 23, 2009
    Assignee: Microsoft Corporation
    Inventors: Mukund Narasimhan, Paul A. Viola, Michael Shilman
  • Patent number: 7499588
    Abstract: A global optimization framework for optical character recognition (OCR) of low-resolution photographed documents that combines a binarization-type process, segmentation, and recognition into a single process. The framework includes a machine learning approach trained on a large amount of data. A convolutional neural network can be employed to compute a classification function at multiple positions and take grey-level input which eliminates binarization. The framework utilizes preprocessing, layout analysis, character recognition, and word recognition to output high recognition rates. The framework also employs dynamic programming and language models to arrive at the desired output.
    Type: Grant
    Filed: May 20, 2004
    Date of Patent: March 3, 2009
    Assignee: Microsoft Corporation
    Inventors: Charles E. Jacobs, James R. Rinker, Patrice Y. Simard, Paul A. Viola
  • Publication number: 20090018985
    Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.
    Type: Application
    Filed: July 13, 2007
    Publication date: January 15, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Cha Zhang, Paul Viola
  • Publication number: 20090018980
    Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.
    Type: Application
    Filed: July 13, 2007
    Publication date: January 15, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Cha Zhang, Paul Viola
  • Publication number: 20090018981
    Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.
    Type: Application
    Filed: July 13, 2007
    Publication date: January 15, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Cha Zhang, Paul Viola
  • Publication number: 20080310687
    Abstract: Systems and methods are described for face recognition using discriminatively trained orthogonal rank one tensor projections. In an exemplary system, images are treated as tensors, rather than as conventional vectors of pixels. During runtime, the system designs visual features—embodied as tensor projections—that minimize intraclass differences between instances of the same face while maximizing interclass differences between the face and faces of different people. Tensor projections are pursued sequentially over a training set of images and take the form of a rank one tensor, i.e., the outer product of a set of vectors. An exemplary technique ensures that the tensor projections are orthogonal to one another, thereby increasing ability to generalize and discriminate image features over conventional techniques.
    Type: Application
    Filed: June 15, 2007
    Publication date: December 18, 2008
    Applicant: Microsoft Corporation
    Inventors: Gang Hua, Paul A. Viola, Steven M. Drucker, Michael Revow
  • Publication number: 20080260251
    Abstract: In embodiments consistent with the subject matter of this disclosure, a user may input strokes as digital ink to a processing device. The processing device may partition the input strokes into multiple regions of strokes. A first recognizer and a second recognizer may score grammar objects included in regions and represented by chart entries. The scores may be converted to a converted score, which may have at least a near standard normal distribution. The processing device may present a recognition result based on highest converted scores according to a recurrence formula. The processing device may receive a correction hint with respect to misrecognized strokes and may add a penalty score with respect to chart entries representing grammar objects breaking the correction hint. Incremental recognition may be performed when a pause is detected during inputting of strokes.
    Type: Application
    Filed: April 19, 2007
    Publication date: October 23, 2008
    Applicant: Microsoft Corporation
    Inventors: Goran Predovic, Ahmad Abdulkader, Bodin Dresevic, Paul A. Viola, Milan Vukosavljevic
  • Publication number: 20080256007
    Abstract: A technique for increasing efficiency of inference of structure variables (e.g., an inference problem) using a priority-driven algorithm rather than conventional dynamic programming. The technique employs a probable approximate underestimate which can be used to compute a probable approximate solution to the inference problem when used as a priority function (“a probable approximate underestimate function”) for a more computationally complex classification function. The probable approximate underestimate function can have a functional form of a simpler, easier to decode model. The model can be learned from unlabeled data by solving a linear/quadratic optimization problem. The priority function can be computed quickly, and can result in solutions that are substantially optimal. Using the priority function, computation efficiency of a classification function (e.g., discriminative classifier) can be increased using a generalization of the A* algorithm.
    Type: Application
    Filed: April 10, 2007
    Publication date: October 16, 2008
    Applicant: Microsoft Corporation
    Inventors: Mukund Narasimhan, Paul A. Viola, Gregory Druck
  • Publication number: 20080226174
    Abstract: A system for organizing images includes an extraction component that extracts visual information (e.g., faces, scenes, etc.) from the images. The extracted visual information is provided to a comparison component which computes similarity confidence data between the extracted visual information. The similarity confidence data is an indication of the likelihood that items of extracted visual information are similar. The comparison component then generates a visual distribution of the extracted visual information based upon the similarity confidence data. The visual distribution can include groupings of the extracted visual information based on computed similarity confidence data. For example, the visual distribution can be a two-dimensional layout of faces organized based on the computed similarity confidence data—with faces in closer proximity faces computed to have a greater probability of representing the same person.
    Type: Application
    Filed: March 15, 2007
    Publication date: September 18, 2008
    Applicant: Microsoft Corporation
    Inventors: Gang Hua, Steven M. Drucker, Michael Revow, Paul A. Viola, Richard Zemel
  • Publication number: 20080195931
    Abstract: Annotation recognition and parsing is accomplished by first recognizing and grouping shapes such that relationships between the annotations and the underlying text and/or images can be determined. The recognition and grouping is followed by categorization of recognized annotations according to predefined types. The classification may be according to functionality, relation to content, and the like. In a third phase, the annotations are anchored to the underlying text or images they are found to be related to.
    Type: Application
    Filed: October 27, 2006
    Publication date: August 14, 2008
    Applicant: Microsoft Corporation
    Inventors: Sashi Raghupathy, Paul A. Viola, Michael Shilman, Xin Wang
  • Publication number: 20080187213
    Abstract: A landmark detection technique that can quickly detect both objects of interest and landmarks within the objects in an image using regression methods. The present fast landmark detection scheme reuses existing feature values used for object detection (e.g., face detection) to find the landmarks in an object (e.g., the eyes and mouth of the face). Hence, the technique provides landmark detection functionality at almost no cost.
    Type: Application
    Filed: February 6, 2007
    Publication date: August 7, 2008
    Applicant: Microsoft Corporation
    Inventors: Cha Zhang, Paul Viola, Sang Min Oh
  • Publication number: 20070297682
    Abstract: Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may be evaluated to detect people or speakers.
    Type: Application
    Filed: June 22, 2006
    Publication date: December 27, 2007
    Applicant: Microsoft Corporation
    Inventors: Cha Zhang, Paul A. Viola, Pei Yin, Ross G. Cutler, Xinding Sun, Yong Rui