Patents by Inventor Joshua T. Goodman
Joshua T. Goodman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7558832Abstract: The subject invention provides for a feedback loop system and method that facilitate classifying items in connection with spam prevention in server and/or client-based architectures. The invention makes uses of a machine-learning approach as applied to spam filters, and in particular, randomly samples incoming email messages so that examples of both legitimate and junk/spam mail are obtained to generate sets of training data. Users which are identified as spam-fighters are asked to vote on whether a selection of their incoming email messages is individually either legitimate mail or junk mail. A database stores the properties for each mail and voting transaction such as user information, message properties and content summary, and polling results for each message to generate training data for machine learning systems. The machine learning systems facilitate creating improved spam filter(s) that are trained to recognize both legitimate mail and spam mail and to distinguish between them.Type: GrantFiled: May 2, 2007Date of Patent: July 7, 2009Assignee: Microsoft CorporationInventors: Robert L. Rounthwaite, Joshua T. Goodman, David E. Heckerman, John D. Mehr, Nathan D. Howell, Micah C. Rupersburg, Dean A. Slawson
-
Patent number: 7552862Abstract: Architecture is provided that facilitates user-controlled access to user profile information. A user is allowed to selectively expose (or mask) portions of his/her profile to third parties. Additionally, advertisers and/or content providers can offer incentives or enticement in response to the acceptance of which a user exposes larger portions of their profile. The architecture comprises a system that facilitates profile management utilizing a profile component that facilitates creation and storage of an electronic profile of a user, and a control component under control of the user for controlling access to the profile. Machine learning and reasoning is provided to make inferences and automate aspects thereof.Type: GrantFiled: June 29, 2006Date of Patent: June 30, 2009Assignee: Microsoft CorporationInventors: Gary W. Flake, Eric J. Horvitz, Joshua T. Goodman, Eric D. Brill, Bradly A. Brunell, Susan T. Dumais, Alexander G. Gounares, Trenholme J. Griffin, Oliver Hurst-Hiller, Raymond E. Ozzie
-
Patent number: 7552176Abstract: The present invention provides for generating inputs that can be provided to a message classification module to facilitate more reliable classification of electronic messages, such as, for example, as unwanted and/or unsolicited. In one embodiment, a sending messaging server provides an appropriate response to address verification data thereby indicating a reduced likelihood of the sending messaging server using a forged network address. In another embodiment, it is determined if a messaging server is authorized to send electronic messages for a domain. In yet another embodiment, electronic message transmission policies adhered to by a domain are identified. In yet a further embodiment, a sending computer system expends computational resources to solve a computational puzzle and includes an answer document in an electronic message. A receiving computer system receives the electronic message and verifies the answer document.Type: GrantFiled: October 10, 2003Date of Patent: June 23, 2009Assignee: Microsoft CorporationInventors: Robert George Atkinson, Joshua T. Goodman, James M. Lyon, Roy Williams, Khaja E. Ahmed, Harry Simon Katz, Robert L. Rounthwaite, Andrew V. Goldberg, Cynthia Dwork
-
Patent number: 7543053Abstract: The subject invention provides for an intelligent quarantining system and method that facilitates a more robust classification system in connection with spam prevention. The invention involves holding back some messages that appear to be questionable, suspicious, or untrustworthy from classification (as spam or good). In particular, the filter lacks information about these messages and thus classification is temporarily delayed. This provides more time for a filter update to arrive with a more accurate classification. The suspicious messages can be quarantined for a determined time period to allow more data to be collected regarding these messages. A number of factors can be employed to determine whether messages are more likely to be flagged for further analysis. User feedback by way of a feedback loop system can also be utilized to facilitate classification of the messages. After some time period, classification of the messages can be resumed.Type: GrantFiled: February 13, 2004Date of Patent: June 2, 2009Assignee: Microsoft CorporationInventors: Joshua T. Goodman, Robert L. Rounthwaite, Geoffrey J. Hulten, Derek Hazeur
-
Patent number: 7533411Abstract: The present invention involves a system and method that facilitate identifying human interaction by utilizing HIPs such as order-based HIPs and determining a difficulty rating of any type of HIPs in an automated fashion. Order-based HIPs require a user to identify elements in the sequence as well as to identify a correct order of the elements in the sequence. The invention involves presenting a user with at least two HIPs such that the HIP can be of known and/or unknown difficulty. A user that correctly answers the HIP of known difficulty gains access to the HIP-controlled resource, action or application. The user's response to the HIP of unknown difficulty can then be examined and employed to determine whether that HIP is too difficult for humans to solve. Alternatively, at least one HIP can be presented. Difficulty of individual HIP parameters can also be determined.Type: GrantFiled: September 23, 2003Date of Patent: May 12, 2009Assignee: Microsoft CorporationInventors: Joshua T. Goodman, Robert L. Rounthwaite
-
Publication number: 20090106059Abstract: Providing for generating an executive report of business or personal activity is described herein. By way of example, such executive report can identify a change and related cause with respect to a prior report. As a particular example, an inference engine can receive an activity report and reference prior reports to identify the change and related cause. A set of results containing such information can be provided to a synthesis component that can include and highlight such information in the executive report. In addition, additional sources of data can be referenced in order to include and/or customize the report to a particular individual, organization, culture, or the like. As described, aspects of the subject innovation can provide an executive report highlighting important aspects of data and tailoring those aspects to interests of one or more users.Type: ApplicationFiled: October 17, 2007Publication date: April 23, 2009Applicant: MICROSOFT CORPORATIONInventors: Eran Megiddo, Richard J. Wolf, Susan T. Dumais, Jensen M. Harris, Joshua T. Goodman
-
Patent number: 7519668Abstract: The subject invention provides systems and methods that facilitate obfuscating a spam filtering system to hinder reverse engineering of the spam filters and/or to mitigate spammers from finding a message that consistently gets through the spam filters almost every time. The system includes a randomization component that randomizes a message score before the message is classified as spam or non-spam so as to obscure the functionality of the spam filter. Randomizing the message score can be accomplished in part by adding a random number or pseudo-random number to the message score before it is classified as spam or non-spam. The number added thereto can vary depending on at least one of several types of input such as time, user, message content, hash of message content, and hash of particularly important features of the message, for example. Alternatively, multiple spam filters can be deployed rather than a single best spam filter.Type: GrantFiled: June 20, 2003Date of Patent: April 14, 2009Assignee: Microsoft CorporationInventors: Joshua T. Goodman, Robert L. Rounthwaite, John C. Platt
-
Patent number: 7483813Abstract: The subject invention provides for systems and methods that facilitate optimizing one or mores sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.Type: GrantFiled: October 19, 2006Date of Patent: January 27, 2009Assignee: Microsoft CorporationInventor: Joshua T. Goodman
-
Patent number: 7483947Abstract: Architecture for detecting and removing obfuscating clutter from the subject and/or body of a message, e.g., e-mail, prior to filtering of the message, to identify junk messages commonly referred to as spam. The technique utilizes the powerful features built into an HTML rendering engine to strip the HTML instructions for all non-substantive aspects of the message. Pre-processing includes pre-rendering of the message into a final format, which final format is that which is displayed by the rendering engine to the user. The final format message is then converted to a text-only format to remove graphics, color, non-text decoration, and spacing that cannot be rendered as ASCII-style or Unicode-style characters. The result is essentially to reduce each message to its common denominator essentials so that the junk mail filter can view each message on an equal basis.Type: GrantFiled: May 2, 2003Date of Patent: January 27, 2009Assignee: Microsoft CorporationInventors: Bryan T. Starbuck, Robert L. Rounthwaite, David E. Heckerman, Joshua T. Goodman
-
Publication number: 20090006343Abstract: Architecture for completing search queries by using artificial intelligence based schemes to infer search intentions of users. Partial queries are completed dynamically in real time. Additionally, search aliasing can also be employed. Custom tuning can be performed based on at least query inputs in the form of text, graffiti, images, handwriting, voice, audio, and video signals. Natural language processing occurs, along with handwriting recognition and slang recognition. The system includes a classifier that receives a partial query as input, accesses a query database based on contents of the query input, and infers an intended search goal from query information stored on the query database. A query formulation engine receives search information associated with the intended search goal and generates a completed formal query for execution.Type: ApplicationFiled: June 28, 2007Publication date: January 1, 2009Applicant: MICROSOFT CORPORATIONInventors: John C. Platt, Gary W. Flake, Ramez Naam, Anoop Gupta, Oliver Hurst-Hiller, Trenholme J. Griffin, Joshua T. Goodman
-
Publication number: 20080319932Abstract: A system and method that facilitates and effectuates optimizing a classifier for greater performance in a specific region of classification that is of interest, such as a low false positive rate or a low false negative rate. A two-stage classification model can be trained and employed, where the first stage classification is optimized over the entire classification region and the second stage classifier is optimized for the specific region of interest. During training the entire set of training data is employed by a first stage classifier. Only data that is classified by the first stage classifier or by cross validation to fall within a region of interest is used to train the second stage classifier. During classification, data that is classified within the region of interest by the first classification is given the first stage classifier's classification value, otherwise the classification value for the instance of data from the second stage classifier is used.Type: ApplicationFiled: June 21, 2007Publication date: December 25, 2008Applicant: MICROSOFT CORPORATIONInventors: Wen-tau Yih, Joshua T. Goodman, Geoffrey J. Hulten
-
Patent number: 7464264Abstract: The subject invention provides for an intelligent quarantining system and method that facilitates detecting and preventing spam. In particular, the invention employs a machine learning filter specifically trained using origination features such as an IP address as well as destination feature such as a URL. Moreover, the system and method involve training a plurality of filters using specific feature data for each filter. The filters are trained independently each other, thus one feature may not unduly influence another feature in determining whether a message is spam. Because multiple filters are trained and available to scan messages either individually or in combination (at least two filters), the filtering or spam detection process can be generalized to new messages having slightly modified features (e.g., IP address). The invention also involves locating the appropriate IP addresses or URLs in a message as well as guiding filters to weigh origination or destination features more than text-based features.Type: GrantFiled: March 25, 2004Date of Patent: December 9, 2008Assignee: Microsoft CorporationInventors: Joshua T. Goodman, Robert L. Rounthwaite, Geoffrey J. Hulten, Wen-tau Yih
-
Publication number: 20080201434Abstract: In one or more embodiments, in the context of an instant messaging application, a conversation is analyzed and contextually or textually relevant keywords and/or phrases are identified. These keywords or phrases are then highlighted in a visually-identifiable manner for selection by an individual participating in the conversation. Once selected by an individual, a user interface is presented and exposes the individual or individuals in the conversation to various contextually- or textually-relevant material or functionality that pertains to the selected word or phrase. In one or more embodiments, an individual can also manually select a word or phrase to access the user interface that exposes contextually or textually-relevant material or functionality.Type: ApplicationFiled: February 16, 2007Publication date: August 21, 2008Applicant: Microsoft CorporationInventors: John S. Holmes, Heather Ferguson, Adam C. Czeisler, Joshua T. Goodman
-
Patent number: 7409708Abstract: Disclosed are systems and methods that facilitate spam detection and prevention at least in part by building or training filters using advanced IP address and/or URL features in connection with machine learning techniques. A variety of advanced IP address related features can be generated from performing a reverse IP lookup. Similarly, many different advanced URL based features can be created from analyzing at least a portion of any one URL detected in a message.Type: GrantFiled: May 28, 2004Date of Patent: August 5, 2008Assignee: Microsoft CorporationInventors: Joshua T Goodman, Robert L Rounthwaite, Geoffrey J Hulten, John A Deurbrouck, Manav Mishra, Anthony P Penta
-
Patent number: 7398315Abstract: The present invention provides for generating inputs that can be provided to a message classification module to facilitate more reliable classification of electronic messages, such as, for example, as unwanted and/or unsolicited. In one embodiment, a sending messaging server provides an appropriate response to address verification data thereby indicating a reduced likelihood of the sending messaging server using a forged network address. In another embodiment, it is determined if a messaging server is authorized to send electronic messages for a domain. In yet another embodiment, electronic message transmission policies adhered to by a domain are identified. In yet a further embodiment, a sending computer system expends computational resources to solve a computational puzzle and includes an answer document in an electronic message. A receiving computer system receives the electronic message and verifies the answer document.Type: GrantFiled: October 10, 2003Date of Patent: July 8, 2008Assignee: Workman NydeggerInventors: Robert George Atkinson, Joshua T. Goodman, James M. Lyon, Roy Williams, Khaja E. Ahmed, Harry Simon Katz, Robert L. Rounthwaite
-
Publication number: 20080140781Abstract: Spam is identified by computing sender reputation derived from historical activity data across counts for various categories. A spam filter or machine learning system can be trained utilizing pre-categorized data in conjunction with activity data associated with a sender aggregated across at least one time period. This sender activity filter can be employed alone or in combination with other filters to facilitate classification of messages as spam or non-spam.Type: ApplicationFiled: December 6, 2006Publication date: June 12, 2008Applicant: MICROSOFT CORPORATIONInventors: Alexei V. Bocharov, Joshua T. Goodman
-
Patent number: 7385591Abstract: Out-of-vocabulary (OOV) word determination corresponding to a key sequence entered by the user on a (typically numeric) keypad, and a user interface for the user to select one of the words, are disclosed. A word-determining logic determines letter sequences corresponding to the entered key sequence, and presents the sequences within the user interface in which the user can select one of the letter sequences as the intended word, or select the first letter of the intended word. When letters are selected, the word-determining logic determines new letter sequences, consistent with the key sequence and the selected letters, and presents the new letter sequences. The user again selects one of the letter sequences as the intended word, or selects the second letter of the intended word. This process is repeated until the user has selected the intended word.Type: GrantFiled: March 31, 2001Date of Patent: June 10, 2008Assignee: Microsoft CorporationInventor: Joshua T. Goodman
-
Publication number: 20080109425Abstract: Document summarization is performed by scoring individual words in sentences in a document or document cluster. Sentences from the document or document cluster are selected to form a summary based on the scores of the words contained in those sentences.Type: ApplicationFiled: November 2, 2006Publication date: May 8, 2008Applicant: Microsoft CorporationInventors: Wen-tau Yih, Joshua T. Goodman, Lucretia H. Vanderwende, Hisami Suzuki
-
Patent number: 7340376Abstract: The subject invention provides for systems and methods that facilitate optimizing one or mores sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.Type: GrantFiled: July 21, 2005Date of Patent: March 4, 2008Assignee: Microsoft CorporationInventor: Joshua T. Goodman
-
Publication number: 20080022097Abstract: A computer-implemented method and system for obtaining data is provided. In the method, to obtain data pertaining to another party, a request for an authentication key is made. Upon receiving the requested authentication key in an email, the method and system automatically send the authentication key as part of a HTTP, HTTPS or SMTP request for data. Then, in response to the request for data containing the authentication key, the requested data is received.Type: ApplicationFiled: June 15, 2006Publication date: January 24, 2008Applicant: MICROSOFT CORPORATIONInventors: Eliot C. Gillum, Joshua T. Goodman