Patents by Inventor David W. Steinkraus

David W. Steinkraus has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Unfolded convolution for fast feature extraction

Patent number: 7634137

Abstract: Systems and methods are described that facilitate performing feature extraction across multiple received input features to reduce computational overhead associated with feature processing related to, for instance, optical character recognition. Input feature information can be unfolded and concatenated to generate an aggregated input matrix, which can be convolved with a kernel matrix to produce output feature information for multiple output features concurrently.

Type: Grant

Filed: October 14, 2005

Date of Patent: December 15, 2009

Assignee: Microsoft Corporation

Inventors: Patrice Y. Simard, David W. Steinkraus, Kumar H. Chellapilla
Ink warping for normalization and beautification / ink beautification

Patent number: 7593574

Abstract: Systems and methods are disclosed that facilitate normalizing and beautifying digitally generated handwriting, such as can be generated on a tablet PC or via scanning a handwritten document. A classifier can identify extrema in the digital handwriting and label such extrema according to predefined categories (e.g., bottom, baseline, midline, top, other, . . . ). Multi-linear regression, polynomial regression, etc., can be performed to align labeled extrema to respective and corresponding desired points as indicated by the labels. Additionally, displacement techniques can be applied to the regressed handwriting to optimize legibility for reading by a human viewer and/or for character recognition by a handwriting recognition application. The displacement techniques can comprise a “rubber sheet” displacement algorithm in conjunction with a “rubber rod” displacement algorithm, which can collectively preserve spatial features of the handwriting during warping thereof.

Type: Grant

Filed: July 1, 2005

Date of Patent: September 22, 2009

Assignee: Microsoft Corporation

Inventors: Patrice Y. Simard, Maneesh Agrawala, David W. Steinkraus
Optimizing performance of a graphics processing unit for efficient execution of general matrix operations

Patent number: 7567252

Abstract: A system and method for optimizing the performance of a graphics processing unit (GPU) for processing and execution of general matrix operations such that the operations are accelerated and optimized. The system and method describes the layouts of operands and results in graphics memory, as well as partitioning the processes into a sequence of passes through a macro step. Specifically, operands are placed in memory in a pattern, results are written into memory in a pattern appropriate for use as operands in a later pass, data sets are partitioned to insure that each pass fits into fixed sized memory, and the execution model incorporates generally reusable macro steps for use in multiple passes. These features enable greater efficiency and speed in processing and executing general matrix operations.

Type: Grant

Filed: June 25, 2004

Date of Patent: July 28, 2009

Assignee: Microsoft Corporation

Inventors: Ian Andrew Buck, David W. Steinkraus, Richard S. Szeliski
Processing machine learning techniques using a graphics processing unit

Patent number: 7548892

Abstract: A system and method for processing machine learning techniques (such as neural networks) and other non-graphics applications using a graphics processing unit (GPU) to accelerate and optimize the processing. The system and method transfers an architecture that can be used for a wide variety of machine learning techniques from the CPU to the GPU. The transfer of processing to the GPU is accomplished using several novel techniques that overcome the limitations and work well within the framework of the GPU architecture. With these limitations overcome, machine learning techniques are particularly well suited for processing on the GPU because the GPU is typically much more powerful than the typical CPU. Moreover, similar to graphics processing, processing of machine learning techniques involves problems with solving non-trivial solutions and large amounts of data.

Type: Grant

Filed: May 14, 2007

Date of Patent: June 16, 2009

Assignee: Microsoft Corporation

Inventors: Ian Andrew Buck, Patrice Y. Simard, David W. Steinkraus
Method and system for performing phrase/word clustering and cluster merging

Patent number: 7519590

Abstract: Text classification has become an important aspect of information technology. Present text classification techniques range from simple text matching to more complex clustering methods. Clustering describes a process of discovering structure in a collection of characters. The invention automatically analyzes a text string and either updates an existing cluster or creates a new cluster. To that end, the invention may use a character n-gram matching process in addition to other heuristic-based clustering techniques. In the character n-gram matching process, each text string is first normalized using several heuristics. It is then divided into a set of overlapping character n-grams, where n is the number of adjacent characters. If the commonality between the text string and the existing cluster members satisfies a pre-defined threshold, the text string is added to the cluster. If, on the other hand, the commonality does not satisfy the pre-defined threshold, a new cluster may be created.

Type: Grant

Filed: June 9, 2003

Date of Patent: April 14, 2009

Assignee: Microsoft Corporation

Inventors: Raman Chandrasekar, David W. Steinkraus
Elastic distortions for automatic generation of labeled data

Patent number: 7418128

Abstract: A system that facilitates generation of data that can be employed in connection with training a classifier. The system comprises a component that receives a data set that is employed in connection with training the classifier, and an expansion component that applies elastic distortion algorithm(s) to a subset of the data set to generate additional labeled training data.

Type: Grant

Filed: July 31, 2003

Date of Patent: August 26, 2008

Assignee: Microsoft Corporation

Inventors: Patrice Y. Simard, David W. Steinkraus
System and method for accelerating and optimizing the processing of machine learning techniques using a graphics processing unit

Patent number: 7219085

Abstract: A system and method for processing machine learning techniques (such as neural networks) and other non-graphics applications using a graphics processing unit (GPU) to accelerate and optimize the processing. The system and method transfers an architecture that can be used for a wide variety of machine learning techniques from the CPU to the GPU. The transfer of processing to the GPU is accomplished using several novel techniques that overcome the limitations and work well within the framework of the GPU architecture. With these limitations overcome, machine learning techniques are particularly well suited for processing on the GPU because the GPU is typically much more powerful than the typical CPU. Moreover, similar to graphics processing, processing of machine learning techniques involves problems with solving non-trivial solutions and large amounts of data.

Type: Grant

Filed: April 30, 2004

Date of Patent: May 15, 2007

Assignee: Microsoft Corporation

Inventors: Ian Andrew Buck, Patrice Y. Simard, David W. Steinkraus
Method and system for performing phrase/word clustering and cluster merging

Publication number: 20030200198

Abstract: Text classification has become an important aspect of information technology. Present text classification techniques range from simple text matching to more complex clustering methods. Clustering describes a process of discovering structure in a collection of characters. The invention automatically analyzes a text string and either updates an existing cluster or creates a new cluster. To that end, the invention may use a character n-gram matching process in addition to other heuristic-based clustering techniques. In the character n-gram matching process, each text string is first normalized using several heuristics. It is then divided into a set of overlapping character n-grams, where n is the number of adjacent characters. If the commonality between the text string and the existing cluster members satisfies a pre-defined threshold, the text string is added to the cluster. If, on the other hand, the commonality does not satisfy the pre-defined threshold, a new cluster may be created.

Type: Application

Filed: June 9, 2003

Publication date: October 23, 2003

Inventors: Raman Chandrasekar, David W. Steinkraus
Method and system for performing phrase/word clustering and cluster merging

Patent number: 6578032

Abstract: Text classification has become an important aspect of information technology. Present text classification techniques range from simple text matching to more complex clustering methods. Clustering describes a process of discovering structure in a collection of characters. The invention automatically analyzes a text string and either updates an existing cluster or creates a new cluster. To that end, the invention may use a character n-gram matching process in addition to other heuristic-based clustering techniques. In the character n-gram matching process, each text string is first normalized using several heuristics. It is then divided into a set of overlapping character n-grams, where n is the number of adjacent characters. If the commonality between the text string and the existing cluster members satisfies a pre-defined threshold, the text string is added to the cluster. If, on the other hand, the commonality does not satisfy the pre-defined threshold, a new cluster may be created.

Type: Grant

Filed: June 28, 2000

Date of Patent: June 10, 2003

Assignee: Microsoft Corporation

Inventors: Raman Chandrasekar, David W. Steinkraus
Method and apparatus for concept searching using a Boolean or keyword search engine

Patent number: 6363373

Abstract: Concept searching using a Boolean or keyword search engine. Documents are preprocessed before being passed to a search engine by identifying, on a word-by-word basis, the “word tokens” contained in the document. Once the word tokens have been extracted, each word token is referenced in a concept database that maps word tokens to concept identifiers. The concept identifiers associated with the word tokens are converted into unique non-word concept tokens and arranged into a list. The list is then inserted into the document as invisible but searchable text. The document is then transferred to the server monitored by the search engine. Search queries are preprocessed before being passed to the search engine in the same manner. The query is first broken into word tokens and the word tokens are then referenced in the concept database. All associated concept identifiers are retrieved and converted to unique concept tokens.

Type: Grant

Filed: October 1, 1998

Date of Patent: March 26, 2002

Assignee: Microsoft Corporation

Inventor: David W. Steinkraus