Patents by Inventor David W. Steinkraus

David W. Steinkraus has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7634137
    Abstract: Systems and methods are described that facilitate performing feature extraction across multiple received input features to reduce computational overhead associated with feature processing related to, for instance, optical character recognition. Input feature information can be unfolded and concatenated to generate an aggregated input matrix, which can be convolved with a kernel matrix to produce output feature information for multiple output features concurrently.
    Type: Grant
    Filed: October 14, 2005
    Date of Patent: December 15, 2009
    Assignee: Microsoft Corporation
    Inventors: Patrice Y. Simard, David W. Steinkraus, Kumar H. Chellapilla
  • Patent number: 7593574
    Abstract: Systems and methods are disclosed that facilitate normalizing and beautifying digitally generated handwriting, such as can be generated on a tablet PC or via scanning a handwritten document. A classifier can identify extrema in the digital handwriting and label such extrema according to predefined categories (e.g., bottom, baseline, midline, top, other, . . . ). Multi-linear regression, polynomial regression, etc., can be performed to align labeled extrema to respective and corresponding desired points as indicated by the labels. Additionally, displacement techniques can be applied to the regressed handwriting to optimize legibility for reading by a human viewer and/or for character recognition by a handwriting recognition application. The displacement techniques can comprise a “rubber sheet” displacement algorithm in conjunction with a “rubber rod” displacement algorithm, which can collectively preserve spatial features of the handwriting during warping thereof.
    Type: Grant
    Filed: July 1, 2005
    Date of Patent: September 22, 2009
    Assignee: Microsoft Corporation
    Inventors: Patrice Y. Simard, Maneesh Agrawala, David W. Steinkraus
  • Patent number: 7567252
    Abstract: A system and method for optimizing the performance of a graphics processing unit (GPU) for processing and execution of general matrix operations such that the operations are accelerated and optimized. The system and method describes the layouts of operands and results in graphics memory, as well as partitioning the processes into a sequence of passes through a macro step. Specifically, operands are placed in memory in a pattern, results are written into memory in a pattern appropriate for use as operands in a later pass, data sets are partitioned to insure that each pass fits into fixed sized memory, and the execution model incorporates generally reusable macro steps for use in multiple passes. These features enable greater efficiency and speed in processing and executing general matrix operations.
    Type: Grant
    Filed: June 25, 2004
    Date of Patent: July 28, 2009
    Assignee: Microsoft Corporation
    Inventors: Ian Andrew Buck, David W. Steinkraus, Richard S. Szeliski
  • Patent number: 7548892
    Abstract: A system and method for processing machine learning techniques (such as neural networks) and other non-graphics applications using a graphics processing unit (GPU) to accelerate and optimize the processing. The system and method transfers an architecture that can be used for a wide variety of machine learning techniques from the CPU to the GPU. The transfer of processing to the GPU is accomplished using several novel techniques that overcome the limitations and work well within the framework of the GPU architecture. With these limitations overcome, machine learning techniques are particularly well suited for processing on the GPU because the GPU is typically much more powerful than the typical CPU. Moreover, similar to graphics processing, processing of machine learning techniques involves problems with solving non-trivial solutions and large amounts of data.
    Type: Grant
    Filed: May 14, 2007
    Date of Patent: June 16, 2009
    Assignee: Microsoft Corporation
    Inventors: Ian Andrew Buck, Patrice Y. Simard, David W. Steinkraus
  • Patent number: 7519590
    Abstract: Text classification has become an important aspect of information technology. Present text classification techniques range from simple text matching to more complex clustering methods. Clustering describes a process of discovering structure in a collection of characters. The invention automatically analyzes a text string and either updates an existing cluster or creates a new cluster. To that end, the invention may use a character n-gram matching process in addition to other heuristic-based clustering techniques. In the character n-gram matching process, each text string is first normalized using several heuristics. It is then divided into a set of overlapping character n-grams, where n is the number of adjacent characters. If the commonality between the text string and the existing cluster members satisfies a pre-defined threshold, the text string is added to the cluster. If, on the other hand, the commonality does not satisfy the pre-defined threshold, a new cluster may be created.
    Type: Grant
    Filed: June 9, 2003
    Date of Patent: April 14, 2009
    Assignee: Microsoft Corporation
    Inventors: Raman Chandrasekar, David W. Steinkraus
  • Patent number: 7418128
    Abstract: A system that facilitates generation of data that can be employed in connection with training a classifier. The system comprises a component that receives a data set that is employed in connection with training the classifier, and an expansion component that applies elastic distortion algorithm(s) to a subset of the data set to generate additional labeled training data.
    Type: Grant
    Filed: July 31, 2003
    Date of Patent: August 26, 2008
    Assignee: Microsoft Corporation
    Inventors: Patrice Y. Simard, David W. Steinkraus
  • Patent number: 7219085
    Abstract: A system and method for processing machine learning techniques (such as neural networks) and other non-graphics applications using a graphics processing unit (GPU) to accelerate and optimize the processing. The system and method transfers an architecture that can be used for a wide variety of machine learning techniques from the CPU to the GPU. The transfer of processing to the GPU is accomplished using several novel techniques that overcome the limitations and work well within the framework of the GPU architecture. With these limitations overcome, machine learning techniques are particularly well suited for processing on the GPU because the GPU is typically much more powerful than the typical CPU. Moreover, similar to graphics processing, processing of machine learning techniques involves problems with solving non-trivial solutions and large amounts of data.
    Type: Grant
    Filed: April 30, 2004
    Date of Patent: May 15, 2007
    Assignee: Microsoft Corporation
    Inventors: Ian Andrew Buck, Patrice Y. Simard, David W. Steinkraus
  • Publication number: 20030200198
    Abstract: Text classification has become an important aspect of information technology. Present text classification techniques range from simple text matching to more complex clustering methods. Clustering describes a process of discovering structure in a collection of characters. The invention automatically analyzes a text string and either updates an existing cluster or creates a new cluster. To that end, the invention may use a character n-gram matching process in addition to other heuristic-based clustering techniques. In the character n-gram matching process, each text string is first normalized using several heuristics. It is then divided into a set of overlapping character n-grams, where n is the number of adjacent characters. If the commonality between the text string and the existing cluster members satisfies a pre-defined threshold, the text string is added to the cluster. If, on the other hand, the commonality does not satisfy the pre-defined threshold, a new cluster may be created.
    Type: Application
    Filed: June 9, 2003
    Publication date: October 23, 2003
    Inventors: Raman Chandrasekar, David W. Steinkraus
  • Patent number: 6578032
    Abstract: Text classification has become an important aspect of information technology. Present text classification techniques range from simple text matching to more complex clustering methods. Clustering describes a process of discovering structure in a collection of characters. The invention automatically analyzes a text string and either updates an existing cluster or creates a new cluster. To that end, the invention may use a character n-gram matching process in addition to other heuristic-based clustering techniques. In the character n-gram matching process, each text string is first normalized using several heuristics. It is then divided into a set of overlapping character n-grams, where n is the number of adjacent characters. If the commonality between the text string and the existing cluster members satisfies a pre-defined threshold, the text string is added to the cluster. If, on the other hand, the commonality does not satisfy the pre-defined threshold, a new cluster may be created.
    Type: Grant
    Filed: June 28, 2000
    Date of Patent: June 10, 2003
    Assignee: Microsoft Corporation
    Inventors: Raman Chandrasekar, David W. Steinkraus
  • Patent number: 6363373
    Abstract: Concept searching using a Boolean or keyword search engine. Documents are preprocessed before being passed to a search engine by identifying, on a word-by-word basis, the “word tokens” contained in the document. Once the word tokens have been extracted, each word token is referenced in a concept database that maps word tokens to concept identifiers. The concept identifiers associated with the word tokens are converted into unique non-word concept tokens and arranged into a list. The list is then inserted into the document as invisible but searchable text. The document is then transferred to the server monitored by the search engine. Search queries are preprocessed before being passed to the search engine in the same manner. The query is first broken into word tokens and the word tokens are then referenced in the concept database. All associated concept identifiers are retrieved and converted to unique concept tokens.
    Type: Grant
    Filed: October 1, 1998
    Date of Patent: March 26, 2002
    Assignee: Microsoft Corporation
    Inventor: David W. Steinkraus