Patents by Inventor Joshua T. Goodman

Joshua T. Goodman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Feedback loop for spam prevention

Patent number: 7558832

Abstract: The subject invention provides for a feedback loop system and method that facilitate classifying items in connection with spam prevention in server and/or client-based architectures. The invention makes use of a machine-learning approach as applied to spam filters, and in particular, randomly samples incoming email messages so that examples of both legitimate and junk/spam mail are obtained to generate sets of training data. Users who are identified as spam-fighters are asked to vote on whether a selection of their incoming email messages is individually either legitimate mail or junk mail. A database stores the properties for each mail and voting transaction such as user information, message properties and content summary, and polling results for each message to generate training data for machine learning systems. The machine learning systems facilitate creating improved spam filter(s) that are trained to recognize both legitimate mail and spam mail and to distinguish between them.

Type: Grant

Filed: May 2, 2007

Date of Patent: July 7, 2009

Assignee: Microsoft Corporation

Inventors: Robert L. Rounthwaite, Joshua T. Goodman, David E. Heckerman, John D. Mehr, Nathan D. Howell, Micah C. Rupersburg, Dean A. Slawson

User-controlled profile sharing

Patent number: 7552862

Abstract: Architecture is provided that facilitates user-controlled access to user profile information. A user is allowed to selectively expose (or mask) portions of his/her profile to third parties. Additionally, advertisers and/or content providers can offer incentives or enticement in response to the acceptance of which a user exposes larger portions of their profile. The architecture comprises a system that facilitates profile management utilizing a profile component that facilitates creation and storage of an electronic profile of a user, and a control component under control of the user for controlling access to the profile. Machine learning and reasoning is provided to make inferences and automate aspects thereof.

Type: Grant

Filed: June 29, 2006

Date of Patent: June 30, 2009

Assignee: Microsoft Corporation

Inventors: Gary W. Flake, Eric J. Horvitz, Joshua T. Goodman, Eric D. Brill, Bradly A. Brunell, Susan T. Dumais, Alexander G. Gounares, Trenholme J. Griffin, Oliver Hurst-Hiller, Raymond E. Ozzie

Reducing unwanted and unsolicited electronic messages by exchanging electronic message transmission policies and solving and verifying solutions to computational puzzles

Patent number: 7552176

Abstract: The present invention provides for generating inputs that can be provided to a message classification module to facilitate more reliable classification of electronic messages, such as, for example, as unwanted and/or unsolicited. In one embodiment, a sending messaging server provides an appropriate response to address verification data thereby indicating a reduced likelihood of the sending messaging server using a forged network address. In another embodiment, it is determined if a messaging server is authorized to send electronic messages for a domain. In yet another embodiment, electronic message transmission policies adhered to by a domain are identified. In yet a further embodiment, a sending computer system expends computational resources to solve a computational puzzle and includes an answer document in an electronic message. A receiving computer system receives the electronic message and verifies the answer document.

Type: Grant

Filed: October 10, 2003

Date of Patent: June 23, 2009

Assignee: Microsoft Corporation

Inventors: Robert George Atkinson, Joshua T. Goodman, James M. Lyon, Roy Williams, Khaja E. Ahmed, Harry Simon Katz, Robert L. Rounthwaite, Andrew V. Goldberg, Cynthia Dwork
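
The last embodiment in the abstract above (a sender spends computation on a puzzle, and the receiver cheaply verifies the answer) can be illustrated with a hash-based puzzle. The abstract does not say which puzzle is used, so everything here is an assumption for illustration: the SHA-256 hash, the leading-zero-bits target, and the names `solve_puzzle`, `verify_puzzle`, `message_id`, and `difficulty_bits`.

```python
import hashlib
from itertools import count


def _digest_value(message_id: str, nonce: int) -> int:
    """Hash the message identifier together with a candidate nonce."""
    digest = hashlib.sha256(f"{message_id}:{nonce}".encode()).digest()
    return int.from_bytes(digest, "big")


def solve_puzzle(message_id: str, difficulty_bits: int = 12) -> int:
    """Sender side: search for a nonce whose hash has `difficulty_bits` leading zero bits."""
    target = 1 << (256 - difficulty_bits)
    for nonce in count():
        if _digest_value(message_id, nonce) < target:
            return nonce


def verify_puzzle(message_id: str, nonce: int, difficulty_bits: int = 12) -> bool:
    """Receiver side: a single hash is enough to check the sender's answer."""
    return _digest_value(message_id, nonce) < (1 << (256 - difficulty_bits))
```

In this sketch the sender's expected work doubles with each extra difficulty bit, while verification always costs one hash.
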
Intelligent quarantining for spam prevention

Patent number: 7543053

Abstract: The subject invention provides for an intelligent quarantining system and method that facilitates a more robust classification system in connection with spam prevention. The invention involves holding back some messages that appear to be questionable, suspicious, or untrustworthy from classification (as spam or good). In particular, the filter lacks information about these messages and thus classification is temporarily delayed. This provides more time for a filter update to arrive with a more accurate classification. The suspicious messages can be quarantined for a determined time period to allow more data to be collected regarding these messages. A number of factors can be employed to determine whether messages are more likely to be flagged for further analysis. User feedback by way of a feedback loop system can also be utilized to facilitate classification of the messages. After some time period, classification of the messages can be resumed.

Type: Grant

Filed: February 13, 2004

Date of Patent: June 2, 2009

Assignee: Microsoft Corporation

Inventors: Joshua T. Goodman, Robert L. Rounthwaite, Geoffrey J. Hulten, Derek Hazeur
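
As a rough illustration of the quarantining idea in the abstract above, the sketch below holds back messages whose filter score falls in an uncertain middle band and releases them for reclassification after a fixed delay. The thresholds, the hold period, and the `Quarantine` class itself are hypothetical; the abstract only says that suspicious messages are held for a determined time period.

```python
from dataclasses import dataclass, field


@dataclass
class Quarantine:
    low: float = 0.3              # assumed: scores at or below this are delivered
    high: float = 0.7             # assumed: scores at or above this are spam
    hold_seconds: float = 3600.0  # assumed quarantine period
    held: list = field(default_factory=list)  # (release_time, message) pairs

    def route(self, message: str, score: float, now: float) -> str:
        """Classify confident cases immediately; delay the uncertain ones."""
        if score <= self.low:
            return "good"
        if score >= self.high:
            return "spam"
        self.held.append((now + self.hold_seconds, message))
        return "quarantined"

    def release_due(self, now: float) -> list:
        """Return messages whose hold period has expired so they can be rescored."""
        due = [m for t, m in self.held if t <= now]
        self.held = [(t, m) for t, m in self.held if t > now]
        return due
```

Messages returned by `release_due` would then be scored again, ideally by a filter that has been updated in the meantime.
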
Order-based human interactive proofs (HIPs) and automatic difficulty rating of HIPs

Patent number: 7533411

Abstract: The present invention involves a system and method that facilitate identifying human interaction by utilizing HIPs such as order-based HIPs and determining a difficulty rating of any type of HIPs in an automated fashion. Order-based HIPs require a user to identify elements in the sequence as well as to identify a correct order of the elements in the sequence. The invention involves presenting a user with at least two HIPs such that the HIP can be of known and/or unknown difficulty. A user that correctly answers the HIP of known difficulty gains access to the HIP-controlled resource, action or application. The user's response to the HIP of unknown difficulty can then be examined and employed to determine whether that HIP is too difficult for humans to solve. Alternatively, at least one HIP can be presented. Difficulty of individual HIP parameters can also be determined.

Type: Grant

Filed: September 23, 2003

Date of Patent: May 12, 2009

Assignee: Microsoft Corporation

Inventors: Joshua T. Goodman, Robert L. Rounthwaite

EXECUTIVE REPORTING

Publication number: 20090106059

Abstract: Providing for generating an executive report of business or personal activity is described herein. By way of example, such executive report can identify a change and related cause with respect to a prior report. As a particular example, an inference engine can receive an activity report and reference prior reports to identify the change and related cause. A set of results containing such information can be provided to a synthesis component that can include and highlight such information in the executive report. In addition, additional sources of data can be referenced in order to include and/or customize the report to a particular individual, organization, culture, or the like. As described, aspects of the subject innovation can provide an executive report highlighting important aspects of data and tailoring those aspects to interests of one or more users.

Type: Application

Filed: October 17, 2007

Publication date: April 23, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Eran Megiddo, Richard J. Wolf, Susan T. Dumais, Jensen M. Harris, Joshua T. Goodman

Obfuscation of spam filter

Patent number: 7519668

Abstract: The subject invention provides systems and methods that facilitate obfuscating a spam filtering system to hinder reverse engineering of the spam filters and/or to mitigate spammers from finding a message that consistently gets through the spam filters almost every time. The system includes a randomization component that randomizes a message score before the message is classified as spam or non-spam so as to obscure the functionality of the spam filter. Randomizing the message score can be accomplished in part by adding a random number or pseudo-random number to the message score before it is classified as spam or non-spam. The number added thereto can vary depending on at least one of several types of input such as time, user, message content, hash of message content, and hash of particularly important features of the message, for example. Alternatively, multiple spam filters can be deployed rather than a single best spam filter.

Type: Grant

Filed: June 20, 2003

Date of Patent: April 14, 2009

Assignee: Microsoft Corporation

Inventors: Joshua T. Goodman, Robert L. Rounthwaite, John C. Platt
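
The score randomization described in the abstract above can be sketched as follows. This is a minimal illustration, not the patented method: the noise is derived from a SHA-256 hash of the user and message content (two of the inputs the abstract lists), and the function names, the `max_noise` bound, and the 0.5 threshold are all assumptions.

```python
import hashlib


def randomized_score(score: float, message: str, user: str,
                     max_noise: float = 0.05) -> float:
    """Add pseudo-random noise in [-max_noise, +max_noise) to a filter score.

    The noise depends only on the user and the message content, so the same
    message always receives the same offset for a given user.
    """
    digest = hashlib.sha256(f"{user}\x00{message}".encode()).digest()
    unit = int.from_bytes(digest[:8], "big") / 2**64  # value in [0, 1)
    return score + (2 * unit - 1) * max_noise


def is_spam(score: float, message: str, user: str, threshold: float = 0.5) -> bool:
    """Classify using the perturbed score instead of the raw one."""
    return randomized_score(score, message, user) >= threshold
```

Only messages scoring close to the threshold can change class under this perturbation, which is what makes probing the exact decision boundary harder.
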
Exponential priors for maximum entropy models

Patent number: 7483813

Abstract: The subject invention provides for systems and methods that facilitate optimizing one or more sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.

Type: Grant

Filed: October 19, 2006

Date of Patent: January 27, 2009

Assignee: Microsoft Corporation

Inventor: Joshua T. Goodman
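
To make the role of the prior concrete, the sketch below computes the training objective for a binary logistic regression model with an Exponential prior on each weight. It is an illustration under stated assumptions rather than the patent's formulation: the function name `log_posterior`, the single rate `alpha` shared by all weights, and the binary-label setup are hypothetical. Weights must be non-negative because the Exponential density is zero for negative values.

```python
import math


def log_posterior(weights, examples, alpha=1.0):
    """Log-likelihood of the labels plus the log of an Exponential(alpha) prior.

    `examples` is a list of (features, label) pairs with label in {0, 1}.
    """
    if any(w < 0 for w in weights):
        return float("-inf")  # outside the support of the Exponential prior
    total = 0.0
    for features, label in examples:
        z = sum(w * x for w, x in zip(weights, features))
        p = 1.0 / (1.0 + math.exp(-z))  # model probability of label 1
        total += math.log(p if label == 1 else 1.0 - p)
    # log of alpha * exp(-alpha * w), summed over the weights
    total += sum(math.log(alpha) - alpha * w for w in weights)
    return total
```

The prior term subtracts `alpha * w` for every weight, a linear penalty that pulls weights toward zero; larger values of `alpha` penalize large weights more strongly.
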
Message rendering for identification of content features

Patent number: 7483947

Abstract: Architecture for detecting and removing obfuscating clutter from the subject and/or body of a message, e.g., e-mail, prior to filtering of the message, to identify junk messages commonly referred to as spam. The technique utilizes the powerful features built into an HTML rendering engine to strip the HTML instructions for all non-substantive aspects of the message. Pre-processing includes pre-rendering of the message into a final format, which final format is that which is displayed by the rendering engine to the user. The final format message is then converted to a text-only format to remove graphics, color, non-text decoration, and spacing that cannot be rendered as ASCII-style or Unicode-style characters. The result is essentially to reduce each message to its common denominator essentials so that the junk mail filter can view each message on an equal basis.

Type: Grant

Filed: May 2, 2003

Date of Patent: January 27, 2009

Assignee: Microsoft Corporation

Inventors: Bryan T. Starbuck, Robert L. Rounthwaite, David E. Heckerman, Joshua T. Goodman
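
A simplified stand-in for the pre-processing step described above: the sketch uses Python's standard `html.parser` module to drop tags and comments and keep only the text, then normalizes spacing. A real rendering engine does far more (for example, it knows which text is actually visible), so this only illustrates the reduce-to-text idea; `to_plain_text` and `_TextExtractor` are hypothetical names.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects character data and ignores tags, attributes, and comments."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def to_plain_text(html: str) -> str:
    """Strip markup from a message body and collapse runs of whitespace."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join("".join(parser.parts).split())
```

For example, `to_plain_text("V<!-- x -->ia<b></b>gra")` yields `"Viagra"`, so a word split up by invisible markup is presented to the filter as a single token.
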
MACHINE ASSISTED QUERY FORMULATION

Publication number: 20090006343

Abstract: Architecture for completing search queries by using artificial intelligence based schemes to infer search intentions of users. Partial queries are completed dynamically in real time. Additionally, search aliasing can also be employed. Custom tuning can be performed based on at least query inputs in the form of text, graffiti, images, handwriting, voice, audio, and video signals. Natural language processing occurs, along with handwriting recognition and slang recognition. The system includes a classifier that receives a partial query as input, accesses a query database based on contents of the query input, and infers an intended search goal from query information stored on the query database. A query formulation engine receives search information associated with the intended search goal and generates a completed formal query for execution.

Type: Application

Filed: June 28, 2007

Publication date: January 1, 2009

Applicant: MICROSOFT CORPORATION

Inventors: John C. Platt, Gary W. Flake, Ramez Naam, Anoop Gupta, Oliver Hurst-Hiller, Trenholme J. Griffin, Joshua T. Goodman

CLASSIFICATION USING A CASCADE APPROACH

Publication number: 20080319932

Abstract: A system and method that facilitates and effectuates optimizing a classifier for greater performance in a specific region of classification that is of interest, such as a low false positive rate or a low false negative rate. A two-stage classification model can be trained and employed, where the first stage classification is optimized over the entire classification region and the second stage classifier is optimized for the specific region of interest. During training the entire set of training data is employed by a first stage classifier. Only data that is classified by the first stage classifier or by cross validation to fall within a region of interest is used to train the second stage classifier. During classification, data that is classified within the region of interest by the first classification is given the first stage classifier's classification value, otherwise the classification value for the instance of data from the second stage classifier is used.

Type: Application

Filed: June 21, 2007

Publication date: December 25, 2008

Applicant: MICROSOFT CORPORATION

Inventors: Wen-tau Yih, Joshua T. Goodman, Geoffrey J. Hulten

Training filters for detecting spasm based on IP addresses and text-related features

Patent number: 7464264

Abstract: The subject invention provides for an intelligent quarantining system and method that facilitates detecting and preventing spam. In particular, the invention employs a machine learning filter specifically trained using origination features such as an IP address as well as destination features such as a URL. Moreover, the system and method involve training a plurality of filters using specific feature data for each filter. The filters are trained independently of each other, thus one feature may not unduly influence another feature in determining whether a message is spam. Because multiple filters are trained and available to scan messages either individually or in combination (at least two filters), the filtering or spam detection process can be generalized to new messages having slightly modified features (e.g., IP address). The invention also involves locating the appropriate IP addresses or URLs in a message as well as guiding filters to weigh origination or destination features more than text-based features.

Type: Grant

Filed: March 25, 2004

Date of Patent: December 9, 2008

Assignee: Microsoft Corporation

Inventors: Joshua T. Goodman, Robert L. Rounthwaite, Geoffrey J. Hulten, Wen-tau Yih

Context-Sensitive Searches and Functionality for Instant Messaging Applications

Publication number: 20080201434

Abstract: In one or more embodiments, in the context of an instant messaging application, a conversation is analyzed and contextually or textually relevant keywords and/or phrases are identified. These keywords or phrases are then highlighted in a visually-identifiable manner for selection by an individual participating in the conversation. Once selected by an individual, a user interface is presented and exposes the individual or individuals in the conversation to various contextually- or textually-relevant material or functionality that pertains to the selected word or phrase. In one or more embodiments, an individual can also manually select a word or phrase to access the user interface that exposes contextually or textually-relevant material or functionality.

Type: Application

Filed: February 16, 2007

Publication date: August 21, 2008

Applicant: Microsoft Corporation

Inventors: John S. Holmes, Heather Ferguson, Adam C. Czeisler, Joshua T. Goodman

Advanced URL and IP features

Patent number: 7409708

Abstract: Disclosed are systems and methods that facilitate spam detection and prevention at least in part by building or training filters using advanced IP address and/or URL features in connection with machine learning techniques. A variety of advanced IP address related features can be generated from performing a reverse IP lookup. Similarly, many different advanced URL based features can be created from analyzing at least a portion of any one URL detected in a message.

Type: Grant

Filed: May 28, 2004

Date of Patent: August 5, 2008

Assignee: Microsoft Corporation

Inventors: Joshua T. Goodman, Robert L. Rounthwaite, Geoffrey J. Hulten, John A. Deurbrouck, Manav Mishra, Anthony P. Penta
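
The abstract above does not list the specific features, so the following is only a guess at what URL-derived features might look like. It uses Python's standard `urllib.parse` and `ipaddress` modules; the feature names and the `url_features` function are hypothetical, and the reverse IP lookup is left out because it requires network access.

```python
import ipaddress
from urllib.parse import urlsplit


def url_features(url: str) -> dict:
    """Derive a few simple features from one URL found in a message."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    try:
        ipaddress.ip_address(host)
        host_is_ip = True  # the URL points at a raw IP address
    except ValueError:
        host_is_ip = False
    labels = host.split(".") if host and not host_is_ip else []
    return {
        "host": host,
        "host_is_ip": host_is_ip,
        "num_labels": len(labels),  # e.g. 3 for mail.example.com
        "has_port": parts.port is not None,
        "path_depth": len([s for s in parts.path.split("/") if s]),
    }
```

Each returned key could serve as one input to a machine learning filter, alongside text-based features.
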
Reducing unwanted and unsolicited electronic messages by preventing connection hijacking and domain spoofing

Patent number: 7398315

Abstract: The present invention provides for generating inputs that can be provided to a message classification module to facilitate more reliable classification of electronic messages, such as, for example, as unwanted and/or unsolicited. In one embodiment, a sending messaging server provides an appropriate response to address verification data thereby indicating a reduced likelihood of the sending messaging server using a forged network address. In another embodiment, it is determined if a messaging server is authorized to send electronic messages for a domain. In yet another embodiment, electronic message transmission policies adhered to by a domain are identified. In yet a further embodiment, a sending computer system expends computational resources to solve a computational puzzle and includes an answer document in an electronic message. A receiving computer system receives the electronic message and verifies the answer document.

Type: Grant

Filed: October 10, 2003

Date of Patent: July 8, 2008

Assignee: Workman Nydegger

Inventors: Robert George Atkinson, Joshua T. Goodman, James M. Lyon, Roy Williams, Khaja E. Ahmed, Harry Simon Katz, Robert L. Rounthwaite

SPAM FILTRATION UTILIZING SENDER ACTIVITY DATA

Publication number: 20080140781

Abstract: Spam is identified by computing sender reputation derived from historical activity data across counts for various categories. A spam filter or machine learning system can be trained utilizing pre-categorized data in conjunction with activity data associated with a sender aggregated across at least one time period. This sender activity filter can be employed alone or in combination with other filters to facilitate classification of messages as spam or non-spam.

Type: Application

Filed: December 6, 2006

Publication date: June 12, 2008

Applicant: MICROSOFT CORPORATION

Inventors: Alexei V. Bocharov, Joshua T. Goodman

Out-of-vocabulary word determination and user interface for text input via reduced keypad keys

Patent number: 7385591

Abstract: Out-of-vocabulary (OOV) word determination corresponding to a key sequence entered by the user on a (typically numeric) keypad, and a user interface for the user to select one of the words, are disclosed. A word-determining logic determines letter sequences corresponding to the entered key sequence, and presents the sequences within the user interface in which the user can select one of the letter sequences as the intended word, or select the first letter of the intended word. When letters are selected, the word-determining logic determines new letter sequences, consistent with the key sequence and the selected letters, and presents the new letter sequences. The user again selects one of the letter sequences as the intended word, or selects the second letter of the intended word. This process is repeated until the user has selected the intended word.

Type: Grant

Filed: March 31, 2001

Date of Patent: June 10, 2008

Assignee: Microsoft Corporation

Inventor: Joshua T. Goodman
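
The narrowing loop described in the abstract above can be sketched with the standard telephone keypad mapping. The function below lists every letter sequence consistent with the entered keys and with the letters the user has selected so far; the names `KEYPAD`, `letter_sequences`, and `chosen_prefix` are hypothetical, and no ranking of candidates is attempted.

```python
from itertools import product

# Standard telephone keypad letter assignments.
KEYPAD = {"2": "abc", "3": "def", "4": "ghi", "5": "jkl",
          "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz"}


def letter_sequences(keys: str, chosen_prefix: str = "") -> list:
    """Letter sequences that match `keys` and start with the letters chosen so far."""
    pools = [KEYPAD[k] for k in keys]
    candidates = ("".join(combo) for combo in product(*pools))
    return [c for c in candidates if c.startswith(chosen_prefix)]
```

For the key sequence `"43"`, selecting the first letter `h` narrows nine candidates to `hd`, `he`, and `hf`; selecting `e` next leaves only `he`.
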
Document summarization by maximizing informative content words

Publication number: 20080109425

Abstract: Document summarization is performed by scoring individual words in sentences in a document or document cluster. Sentences from the document or document cluster are selected to form a summary based on the scores of the words contained in those sentences.

Type: Application

Filed: November 2, 2006

Publication date: May 8, 2008

Applicant: Microsoft Corporation

Inventors: Wen-tau Yih, Joshua T. Goodman, Lucretia H. Vanderwende, Hisami Suzuki
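
A minimal sketch of word-score-based sentence selection follows. The abstract above says only that words are scored and sentences are chosen from those scores; the particular choices here (a word's score is its relative frequency in the document, a sentence's score is the average of its word scores, and the `summarize` function itself) are assumptions for illustration.

```python
import re
from collections import Counter


def summarize(document: str, max_sentences: int = 1) -> list:
    """Pick the sentences whose words have the highest average score."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    tokenized = [re.findall(r"[a-z']+", s.lower()) for s in sentences]
    counts = Counter(w for words in tokenized for w in words)
    total = sum(counts.values())

    def score(words):
        # Average relative frequency of the words in one sentence.
        return sum(counts[w] / total for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(tokenized[i]), reverse=True)
    chosen = sorted(ranked[:max_sentences])  # restore document order
    return [sentences[i] for i in chosen]
```

Averaging keeps long sentences from winning on length alone; a sum would instead favor sentences that pack in many high-scoring words.
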
Exponential priors for maximum entropy models

Patent number: 7340376

Abstract: The subject invention provides for systems and methods that facilitate optimizing one or more sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.

Type: Grant

Filed: July 21, 2005

Date of Patent: March 4, 2008

Assignee: Microsoft Corporation

Inventor: Joshua T. Goodman

EXTENSIBLE EMAIL

Publication number: 20080022097

Abstract: A computer-implemented method and system for obtaining data is provided. In the method, to obtain data pertaining to another party, a request for an authentication key is made. Upon receiving the requested authentication key in an email, the method and system automatically send the authentication key as part of an HTTP, HTTPS or SMTP request for data. Then, in response to the request for data containing the authentication key, the requested data is received.

Type: Application

Filed: June 15, 2006

Publication date: January 24, 2008

Applicant: MICROSOFT CORPORATION

Inventors: Eliot C. Gillum, Joshua T. Goodman