Patents by Inventor Joshua T. Goodman

Joshua T. Goodman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7558832
    Abstract: The subject invention provides for a feedback loop system and method that facilitate classifying items in connection with spam prevention in server and/or client-based architectures. The invention makes uses of a machine-learning approach as applied to spam filters, and in particular, randomly samples incoming email messages so that examples of both legitimate and junk/spam mail are obtained to generate sets of training data. Users which are identified as spam-fighters are asked to vote on whether a selection of their incoming email messages is individually either legitimate mail or junk mail. A database stores the properties for each mail and voting transaction such as user information, message properties and content summary, and polling results for each message to generate training data for machine learning systems. The machine learning systems facilitate creating improved spam filter(s) that are trained to recognize both legitimate mail and spam mail and to distinguish between them.
    Type: Grant
    Filed: May 2, 2007
    Date of Patent: July 7, 2009
    Assignee: Microsoft Corporation
    Inventors: Robert L. Rounthwaite, Joshua T. Goodman, David E. Heckerman, John D. Mehr, Nathan D. Howell, Micah C. Rupersburg, Dean A. Slawson
  • Patent number: 7552862
    Abstract: Architecture is provided that facilitates user-controlled access to user profile information. A user is allowed to selectively expose (or mask) portions of his/her profile to third parties. Additionally, advertisers and/or content providers can offer incentives or enticement in response to the acceptance of which a user exposes larger portions of their profile. The architecture comprises a system that facilitates profile management utilizing a profile component that facilitates creation and storage of an electronic profile of a user, and a control component under control of the user for controlling access to the profile. Machine learning and reasoning is provided to make inferences and automate aspects thereof.
    Type: Grant
    Filed: June 29, 2006
    Date of Patent: June 30, 2009
    Assignee: Microsoft Corporation
    Inventors: Gary W. Flake, Eric J. Horvitz, Joshua T. Goodman, Eric D. Brill, Bradly A. Brunell, Susan T. Dumais, Alexander G. Gounares, Trenholme J. Griffin, Oliver Hurst-Hiller, Raymond E. Ozzie
  • Patent number: 7552176
    Abstract: The present invention provides for generating inputs that can be provided to a message classification module to facilitate more reliable classification of electronic messages, such as, for example, as unwanted and/or unsolicited. In one embodiment, a sending messaging server provides an appropriate response to address verification data thereby indicating a reduced likelihood of the sending messaging server using a forged network address. In another embodiment, it is determined if a messaging server is authorized to send electronic messages for a domain. In yet another embodiment, electronic message transmission policies adhered to by a domain are identified. In yet a further embodiment, a sending computer system expends computational resources to solve a computational puzzle and includes an answer document in an electronic message. A receiving computer system receives the electronic message and verifies the answer document.
    Type: Grant
    Filed: October 10, 2003
    Date of Patent: June 23, 2009
    Assignee: Microsoft Corporation
    Inventors: Robert George Atkinson, Joshua T. Goodman, James M. Lyon, Roy Williams, Khaja E. Ahmed, Harry Simon Katz, Robert L. Rounthwaite, Andrew V. Goldberg, Cynthia Dwork
  • Patent number: 7543053
    Abstract: The subject invention provides for an intelligent quarantining system and method that facilitates a more robust classification system in connection with spam prevention. The invention involves holding back some messages that appear to be questionable, suspicious, or untrustworthy from classification (as spam or good). In particular, the filter lacks information about these messages and thus classification is temporarily delayed. This provides more time for a filter update to arrive with a more accurate classification. The suspicious messages can be quarantined for a determined time period to allow more data to be collected regarding these messages. A number of factors can be employed to determine whether messages are more likely to be flagged for further analysis. User feedback by way of a feedback loop system can also be utilized to facilitate classification of the messages. After some time period, classification of the messages can be resumed.
    Type: Grant
    Filed: February 13, 2004
    Date of Patent: June 2, 2009
    Assignee: Microsoft Corporation
    Inventors: Joshua T. Goodman, Robert L. Rounthwaite, Geoffrey J. Hulten, Derek Hazeur
  • Patent number: 7533411
    Abstract: The present invention involves a system and method that facilitate identifying human interaction by utilizing HIPs such as order-based HIPs and determining a difficulty rating of any type of HIPs in an automated fashion. Order-based HIPs require a user to identify elements in the sequence as well as to identify a correct order of the elements in the sequence. The invention involves presenting a user with at least two HIPs such that the HIP can be of known and/or unknown difficulty. A user that correctly answers the HIP of known difficulty gains access to the HIP-controlled resource, action or application. The user's response to the HIP of unknown difficulty can then be examined and employed to determine whether that HIP is too difficult for humans to solve. Alternatively, at least one HIP can be presented. Difficulty of individual HIP parameters can also be determined.
    Type: Grant
    Filed: September 23, 2003
    Date of Patent: May 12, 2009
    Assignee: Microsoft Corporation
    Inventors: Joshua T. Goodman, Robert L. Rounthwaite
  • Publication number: 20090106059
    Abstract: Providing for generating an executive report of business or personal activity is described herein. By way of example, such executive report can identify a change and related cause with respect to a prior report. As a particular example, an inference engine can receive an activity report and reference prior reports to identify the change and related cause. A set of results containing such information can be provided to a synthesis component that can include and highlight such information in the executive report. In addition, additional sources of data can be referenced in order to include and/or customize the report to a particular individual, organization, culture, or the like. As described, aspects of the subject innovation can provide an executive report highlighting important aspects of data and tailoring those aspects to interests of one or more users.
    Type: Application
    Filed: October 17, 2007
    Publication date: April 23, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Eran Megiddo, Richard J. Wolf, Susan T. Dumais, Jensen M. Harris, Joshua T. Goodman
  • Patent number: 7519668
    Abstract: The subject invention provides systems and methods that facilitate obfuscating a spam filtering system to hinder reverse engineering of the spam filters and/or to mitigate spammers from finding a message that consistently gets through the spam filters almost every time. The system includes a randomization component that randomizes a message score before the message is classified as spam or non-spam so as to obscure the functionality of the spam filter. Randomizing the message score can be accomplished in part by adding a random number or pseudo-random number to the message score before it is classified as spam or non-spam. The number added thereto can vary depending on at least one of several types of input such as time, user, message content, hash of message content, and hash of particularly important features of the message, for example. Alternatively, multiple spam filters can be deployed rather than a single best spam filter.
    Type: Grant
    Filed: June 20, 2003
    Date of Patent: April 14, 2009
    Assignee: Microsoft Corporation
    Inventors: Joshua T. Goodman, Robert L. Rounthwaite, John C. Platt
  • Patent number: 7483813
    Abstract: The subject invention provides for systems and methods that facilitate optimizing one or mores sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.
    Type: Grant
    Filed: October 19, 2006
    Date of Patent: January 27, 2009
    Assignee: Microsoft Corporation
    Inventor: Joshua T. Goodman
  • Patent number: 7483947
    Abstract: Architecture for detecting and removing obfuscating clutter from the subject and/or body of a message, e.g., e-mail, prior to filtering of the message, to identify junk messages commonly referred to as spam. The technique utilizes the powerful features built into an HTML rendering engine to strip the HTML instructions for all non-substantive aspects of the message. Pre-processing includes pre-rendering of the message into a final format, which final format is that which is displayed by the rendering engine to the user. The final format message is then converted to a text-only format to remove graphics, color, non-text decoration, and spacing that cannot be rendered as ASCII-style or Unicode-style characters. The result is essentially to reduce each message to its common denominator essentials so that the junk mail filter can view each message on an equal basis.
    Type: Grant
    Filed: May 2, 2003
    Date of Patent: January 27, 2009
    Assignee: Microsoft Corporation
    Inventors: Bryan T. Starbuck, Robert L. Rounthwaite, David E. Heckerman, Joshua T. Goodman
  • Publication number: 20090006343
    Abstract: Architecture for completing search queries by using artificial intelligence based schemes to infer search intentions of users. Partial queries are completed dynamically in real time. Additionally, search aliasing can also be employed. Custom tuning can be performed based on at least query inputs in the form of text, graffiti, images, handwriting, voice, audio, and video signals. Natural language processing occurs, along with handwriting recognition and slang recognition. The system includes a classifier that receives a partial query as input, accesses a query database based on contents of the query input, and infers an intended search goal from query information stored on the query database. A query formulation engine receives search information associated with the intended search goal and generates a completed formal query for execution.
    Type: Application
    Filed: June 28, 2007
    Publication date: January 1, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: John C. Platt, Gary W. Flake, Ramez Naam, Anoop Gupta, Oliver Hurst-Hiller, Trenholme J. Griffin, Joshua T. Goodman
  • Publication number: 20080319932
    Abstract: A system and method that facilitates and effectuates optimizing a classifier for greater performance in a specific region of classification that is of interest, such as a low false positive rate or a low false negative rate. A two-stage classification model can be trained and employed, where the first stage classification is optimized over the entire classification region and the second stage classifier is optimized for the specific region of interest. During training the entire set of training data is employed by a first stage classifier. Only data that is classified by the first stage classifier or by cross validation to fall within a region of interest is used to train the second stage classifier. During classification, data that is classified within the region of interest by the first classification is given the first stage classifier's classification value, otherwise the classification value for the instance of data from the second stage classifier is used.
    Type: Application
    Filed: June 21, 2007
    Publication date: December 25, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Wen-tau Yih, Joshua T. Goodman, Geoffrey J. Hulten
  • Patent number: 7464264
    Abstract: The subject invention provides for an intelligent quarantining system and method that facilitates detecting and preventing spam. In particular, the invention employs a machine learning filter specifically trained using origination features such as an IP address as well as destination feature such as a URL. Moreover, the system and method involve training a plurality of filters using specific feature data for each filter. The filters are trained independently each other, thus one feature may not unduly influence another feature in determining whether a message is spam. Because multiple filters are trained and available to scan messages either individually or in combination (at least two filters), the filtering or spam detection process can be generalized to new messages having slightly modified features (e.g., IP address). The invention also involves locating the appropriate IP addresses or URLs in a message as well as guiding filters to weigh origination or destination features more than text-based features.
    Type: Grant
    Filed: March 25, 2004
    Date of Patent: December 9, 2008
    Assignee: Microsoft Corporation
    Inventors: Joshua T. Goodman, Robert L. Rounthwaite, Geoffrey J. Hulten, Wen-tau Yih
  • Publication number: 20080201434
    Abstract: In one or more embodiments, in the context of an instant messaging application, a conversation is analyzed and contextually or textually relevant keywords and/or phrases are identified. These keywords or phrases are then highlighted in a visually-identifiable manner for selection by an individual participating in the conversation. Once selected by an individual, a user interface is presented and exposes the individual or individuals in the conversation to various contextually- or textually-relevant material or functionality that pertains to the selected word or phrase. In one or more embodiments, an individual can also manually select a word or phrase to access the user interface that exposes contextually or textually-relevant material or functionality.
    Type: Application
    Filed: February 16, 2007
    Publication date: August 21, 2008
    Applicant: Microsoft Corporation
    Inventors: John S. Holmes, Heather Ferguson, Adam C. Czeisler, Joshua T. Goodman
  • Patent number: 7409708
    Abstract: Disclosed are systems and methods that facilitate spam detection and prevention at least in part by building or training filters using advanced IP address and/or URL features in connection with machine learning techniques. A variety of advanced IP address related features can be generated from performing a reverse IP lookup. Similarly, many different advanced URL based features can be created from analyzing at least a portion of any one URL detected in a message.
    Type: Grant
    Filed: May 28, 2004
    Date of Patent: August 5, 2008
    Assignee: Microsoft Corporation
    Inventors: Joshua T Goodman, Robert L Rounthwaite, Geoffrey J Hulten, John A Deurbrouck, Manav Mishra, Anthony P Penta
  • Patent number: 7398315
    Abstract: The present invention provides for generating inputs that can be provided to a message classification module to facilitate more reliable classification of electronic messages, such as, for example, as unwanted and/or unsolicited. In one embodiment, a sending messaging server provides an appropriate response to address verification data thereby indicating a reduced likelihood of the sending messaging server using a forged network address. In another embodiment, it is determined if a messaging server is authorized to send electronic messages for a domain. In yet another embodiment, electronic message transmission policies adhered to by a domain are identified. In yet a further embodiment, a sending computer system expends computational resources to solve a computational puzzle and includes an answer document in an electronic message. A receiving computer system receives the electronic message and verifies the answer document.
    Type: Grant
    Filed: October 10, 2003
    Date of Patent: July 8, 2008
    Assignee: Workman Nydegger
    Inventors: Robert George Atkinson, Joshua T. Goodman, James M. Lyon, Roy Williams, Khaja E. Ahmed, Harry Simon Katz, Robert L. Rounthwaite
  • Publication number: 20080140781
    Abstract: Spam is identified by computing sender reputation derived from historical activity data across counts for various categories. A spam filter or machine learning system can be trained utilizing pre-categorized data in conjunction with activity data associated with a sender aggregated across at least one time period. This sender activity filter can be employed alone or in combination with other filters to facilitate classification of messages as spam or non-spam.
    Type: Application
    Filed: December 6, 2006
    Publication date: June 12, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Alexei V. Bocharov, Joshua T. Goodman
  • Patent number: 7385591
    Abstract: Out-of-vocabulary (OOV) word determination corresponding to a key sequence entered by the user on a (typically numeric) keypad, and a user interface for the user to select one of the words, are disclosed. A word-determining logic determines letter sequences corresponding to the entered key sequence, and presents the sequences within the user interface in which the user can select one of the letter sequences as the intended word, or select the first letter of the intended word. When letters are selected, the word-determining logic determines new letter sequences, consistent with the key sequence and the selected letters, and presents the new letter sequences. The user again selects one of the letter sequences as the intended word, or selects the second letter of the intended word. This process is repeated until the user has selected the intended word.
    Type: Grant
    Filed: March 31, 2001
    Date of Patent: June 10, 2008
    Assignee: Microsoft Corporation
    Inventor: Joshua T. Goodman
  • Publication number: 20080109425
    Abstract: Document summarization is performed by scoring individual words in sentences in a document or document cluster. Sentences from the document or document cluster are selected to form a summary based on the scores of the words contained in those sentences.
    Type: Application
    Filed: November 2, 2006
    Publication date: May 8, 2008
    Applicant: Microsoft Corporation
    Inventors: Wen-tau Yih, Joshua T. Goodman, Lucretia H. Vanderwende, Hisami Suzuki
  • Patent number: 7340376
    Abstract: The subject invention provides for systems and methods that facilitate optimizing one or mores sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.
    Type: Grant
    Filed: July 21, 2005
    Date of Patent: March 4, 2008
    Assignee: Microsoft Corporation
    Inventor: Joshua T. Goodman
  • Publication number: 20080022097
    Abstract: A computer-implemented method and system for obtaining data is provided. In the method, to obtain data pertaining to another party, a request for an authentication key is made. Upon receiving the requested authentication key in an email, the method and system automatically send the authentication key as part of a HTTP, HTTPS or SMTP request for data. Then, in response to the request for data containing the authentication key, the requested data is received.
    Type: Application
    Filed: June 15, 2006
    Publication date: January 24, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Eliot C. Gillum, Joshua T. Goodman