Patents by Inventor Mark Usher

Mark Usher has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Domain clustering for malicious campaign identification

Patent number: 11689548

Abstract: A method for identification of malicious domains is provided. The method extracts a set of domain information from one or more input streams. The set of domain information includes a set of domains and a set of domain characteristics describing each domain. The method clusters the set of domains to generate a set of campaign clusters of related domains. The clusters are based on the set of domain characteristics. The method modifies the set of campaign clusters with a set of threat intelligence ratings to generate a set of enriched campaign clusters. A portion of the set of threat intelligence ratings correspond to one or more domains within the set of campaign clusters. The method determines a cluster designation for each campaign cluster of the set of enriched campaign clusters and distributes the cluster designations for each campaign cluster to one or more threat intelligence resource.

Type: Grant

Filed: July 11, 2019

Date of Patent: June 27, 2023

Assignee: International Business Machines Corporation

Inventors: Mark Usher, Johannes Noll, Uwe Küllmar, Dirk Harz, Marc Noske
DOMAIN CLUSTERING FOR MALICIOUS CAMPAIGN IDENTIFICATION

Publication number: 20210014252

Abstract: A method for identification of malicious domains is provided. The method extracts a set of domain information from one or more input streams. The set of domain information includes a set of domains and a set of domain characteristics describing each domain. The method clusters the set of domains to generate a set of campaign clusters of related domains. The clusters are based on the set of domain characteristics. The method modifies the set of campaign clusters with a set of threat intelligence ratings to generate a set of enriched campaign clusters. A portion of the set of threat intelligence ratings correspond to one or more domains within the set of campaign clusters. The method determines a cluster designation for each campaign cluster of the set of enriched campaign clusters and distributes the cluster designations for each campaign cluster to one or more threat intelligence resource.

Type: Application

Filed: July 11, 2019

Publication date: January 14, 2021

Inventors: Mark Usher, Johannes Noll, Uwe Küllmar, Dirk Harz, Marc Noske
Unsolicited bulk email detection using URL tree hashes

Patent number: 10810176

Abstract: According to one exemplary embodiment, a method for detecting unsolicited bulk emails (UBE) is provided. The method may include receiving an email. The method may also include identifying a uniform resource locator (URL) contained in the received email. The method may then include dividing the identified URL into a plurality of component parts. The method may further include generating a tree structure based on the plurality of component parts. The method may also include generating an input string based on the generated tree structure. The method may then include calculating a hash value based on the generated input string. The method may further include determining if the calculated hash value matches a UBE hash value within a plurality of UBE hash values. The method may also include identifying the received email as a UBE based on determining that the calculated hash value matches the UBE hash value.

Type: Grant

Filed: April 28, 2015

Date of Patent: October 20, 2020

Assignee: International Business Machines Corporation

Inventors: Astrid Granacher, Dirk Harz, Juergen Kader, Johannes Noll, Mark Usher
Automatic genre classification determination of web content to which the web content belongs together with a corresponding genre probability

Patent number: 10764353

Abstract: A mechanism is provided for automatic genre determination of web content. For each type of web content genre, a set of relevant feature types are extracted from collected training material, where genre features and non-genre features are represented by tokens and an integer counts represents a frequency of appearance of the token in both a first type of training material and a second type of training material. In a classification process, fixed length tokens are extracted for relevant features types from different text and structural elements of web content. For each relevant feature type, a corresponding feature probability is calculated. The feature probabilities are combined to an overall genre probability that the web content belongs to a specific trained web content genre. A genre classification result is then output comprising at least one specific trained web content genre to which the web content belongs together with a corresponding genre probability.

Type: Grant

Filed: October 18, 2018

Date of Patent: September 1, 2020

Assignee: International Business Machines Corporation

Inventors: Dirk Harz, Ralf Iffert, Mark Keinhoerster, Mark Usher
Unsolicited bulk email detection using URL tree hashes

Patent number: 10706032

Abstract: According to one exemplary embodiment, a method for detecting unsolicited bulk emails (UBE) is provided. The method may include receiving an email. The method may also include identifying a uniform resource locator (URL) contained in the received email. The method may then include dividing the identified URL into a plurality of component parts. The method may further include generating a tree structure based on the plurality of component parts. The method may also include generating an input string based on the generated tree structure. The method may then include calculating a hash value based on the generated input string. The method may further include determining if the calculated hash value matches a UBE hash value within a plurality of UBE hash values. The method may also include identifying the received email as a UBE based on determining that the calculated hash value matches the UBE hash value.

Type: Grant

Filed: June 3, 2015

Date of Patent: July 7, 2020

Assignee: International Business Machines Corporation

Inventors: Astrid Granacher, Dirk Harz, Juergen Kader, Johannes Noll, Mark Usher
Intrusion detection system

Patent number: 10686807

Abstract: A method for classification of suspicious activities is provided. In the method, a first intrusion detection system comprising a normal operation mode and which is connected to a second intrusion detection system by a first communications connection is implemented. In response to detecting a malfunction of the first communications connection, the first intrusion detection system is switched from the normal operation mode to a limited operation mode for receiving first data from one or more honeypot systems and second data from the second intrusion detection system. A prediction model for representing malicious attacks is generated by execution of a predefined classification algorithm with respect to the received data, wherein the predefined classification algorithm further determine a model evaluation metric with respect to the prediction model. The prediction model is deployed to detect the malicious attacks if the model evaluation metric meets a predefined validation condition.

Type: Grant

Filed: June 12, 2018

Date of Patent: June 16, 2020

Assignee: International Business Machines Corporation

Inventors: Gideon Zenz, Volker Vogeley, Dirk Harz, Mark Usher, Astrid Granacher
INTRUSION DETECTION SYSTEM

Publication number: 20190379677

Abstract: A method for classification of suspicious activities is provided. In the method, a first intrusion detection system comprising a normal operation mode and which is connected to a second intrusion detection system by a first communications connection is implemented. In response to detecting a malfunction of the first communications connection, the first intrusion detection system is switched from the normal operation mode to a limited operation mode for receiving first data from one or more honeypot systems and second data from the second intrusion detection system. A prediction model for representing malicious attacks is generated by execution of a predefined classification algorithm with respect to the received data, wherein the predefined classification algorithm further determine a model evaluation metric with respect to the prediction model. The prediction model is deployed to detect the malicious attacks if the model evaluation metric meets a predefined validation condition.

Type: Application

Filed: June 12, 2018

Publication date: December 12, 2019

Inventors: Gideon Zenz, Volker Vogeley, Dirk Harz, Mark Usher, Astrid Granacher
Automatic Genre Classification Determination of Web Content to which the Web Content Belongs Together with a Corresponding Genre Probability

Publication number: 20190052694

Abstract: A mechanism is provided for automatic genre determination of web content. For each type of web content genre, a set of relevant feature types are extracted from collected training material, where genre features and non-genre features are represented by tokens and an integer counts represents a frequency of appearance of the token in both a first type of training material and a second type of training material. In a classification process, fixed length tokens are extracted for relevant features types from different text and structural elements of web content. For each relevant feature type, a corresponding feature probability is calculated. The feature probabilities are combined to an overall genre probability that the web content belongs to a specific trained web content genre. A genre classification result is then output comprising at least one specific trained web content genre to which the web content belongs together with a corresponding genre probability.

Type: Application

Filed: October 18, 2018

Publication date: February 14, 2019

Inventors: Dirk Harz, Ralf Iffert, Mark Keinhoerster, Mark Usher
Automatic genre classification determination of web content to which the web content belongs together with a corresponding genre probability

Patent number: 10110658

Abstract: A mechanism is provided for automatic genre determination of web content. For each type of web content genre, a set of relevant feature types are extracted from collected training material, where genre features and non-genre features are represented by tokens and an integer counts represents a frequency of appearance of the token in both a first type of training material and a second type of training material. In a classification process, fixed length tokens are extracted for relevant features types from different text and structural elements of web content. For each relevant feature type, a corresponding feature probability is calculated. The feature probabilities are combined to an overall genre probability that the web content belongs to a specific trained web content genre. A genre classification result is then output comprising at least one specific trained web content genre to which the web content belongs together with a corresponding genre probability.

Type: Grant

Filed: June 3, 2015

Date of Patent: October 23, 2018

Assignee: International Business Machines Corporation

Inventors: Dirk Harz, Ralf Iffert, Mark Keinhoerster, Mark Usher
Automatic genre classification determination of web content to which the web content belongs together with a corresponding genre probability

Patent number: 9565236

Abstract: A mechanism is provided for automatic genre determination of web content. For each type of web content genre, a set of relevant feature types are extracted from collected training material, where genre features and non-genre features are represented by tokens and an integer counts represents a frequency of appearance of the token in both a first type of training material and a second type of training material. In a classification process, fixed length tokens are extracted for relevant features types from different text and structural elements of web content. For each relevant feature type, a corresponding feature probability is calculated. The feature probabilities are combined to an overall genre probability that the web content belongs to a specific trained web content genre. A genre classification result is then output comprising at least one specific trained web content genre to which the web content belongs together with a corresponding genre probability.

Type: Grant

Filed: December 4, 2013

Date of Patent: February 7, 2017

Assignee: International Business Machines Corporation

Inventors: Dirk Harz, Ralf Iffert, Mark Keinhoerster, Mark Usher
UNSOLICITED BULK EMAIL DETECTION USING URL TREE HASHES

Publication number: 20160321254

Abstract: According to one exemplary embodiment, a method for detecting unsolicited bulk emails (UBE) is provided. The method may include receiving an email. The method may also include identifying a uniform resource locator (URL) contained in the received email. The method may then include dividing the identified URL into a plurality of component parts. The method may further include generating a tree structure based on the plurality of component parts. The method may also include generating an input string based on the generated tree structure. The method may then include calculating a hash value based on the generated input string. The method may further include determining if the calculated hash value matches a UBE hash value within a plurality of UBE hash values. The method may also include identifying the received email as a UBE based on determining that the calculated hash value matches the UBE hash value.

Type: Application

Filed: April 28, 2015

Publication date: November 3, 2016

Inventors: Astrid Granacher, Dirk Harz, Juergen Kader, Johannes Noll, Mark Usher
UNSOLICITED BULK EMAIL DETECTION USING URL TREE HASHES

Publication number: 20160321255

Abstract: According to one exemplary embodiment, a method for detecting unsolicited bulk emails (UBE) is provided. The method may include receiving an email. The method may also include identifying a uniform resource locator (URL) contained in the received email. The method may then include dividing the identified URL into a plurality of component parts. The method may further include generating a tree structure based on the plurality of component parts. The method may also include generating an input string based on the generated tree structure. The method may then include calculating a hash value based on the generated input string. The method may further include determining if the calculated hash value matches a UBE hash value within a plurality of UBE hash values. The method may also include identifying the received email as a UBE based on determining that the calculated hash value matches the UBE hash value.

Type: Application

Filed: June 3, 2015

Publication date: November 3, 2016

Inventors: Astrid Granacher, Dirk Harz, Juergen Kader, Johannes Noll, Mark Usher
Automatic Genre Determination of Web Content

Publication number: 20150264107

Abstract: A mechanism is provided for automatic genre determination of web content. For each type of web content genre, a set of relevant feature types are extracted from collected training material, where genre features and non-genre features are represented by tokens and an integer counts represents a frequency of appearance of the token in both a first type of training material and a second type of training material. In a classification process, fixed length tokens are extracted for relevant features types from different text and structural elements of web content. For each relevant feature type, a corresponding feature probability is calculated. The feature probabilities are combined to an overall genre probability that the web content belongs to a specific trained web content genre. A genre classification result is then output comprising at least one specific trained web content genre to which the web content belongs together with a corresponding genre probability.

Type: Application

Filed: June 3, 2015

Publication date: September 17, 2015

Inventors: Dirk Harz, Ralf Iffert, Mark Keinhoerster, Mark Usher
Method for identifying pairs of derivative and original images

Patent number: 8879837

Abstract: The invention provides for a computer-implemented method for detecting one or more archive images matching a search image, each matching archive image being a derivative of the search image or being an original image the search image was derived from accessing a plurality of the archive images. For each of said archive images, a respective archive image histogram may be calculated, wherein each archive image histogram includes a plurality of combination micro-feature values. The archive image histogram may be stored to a database.

Type: Grant

Filed: June 26, 2012

Date of Patent: November 4, 2014

Assignee: International Business Machines Corporation

Inventor: Mark Usher
Automatic Genre Determination of Web Content

Publication number: 20140201113

Abstract: A mechanism is provided for automatic genre determination of web content. For each type of web content genre, a set of relevant feature types are extracted from collected training material, where genre features and non-genre features are represented by tokens and an integer counts represents a frequency of appearance of the token in both a first type of training material and a second type of training material. In a classification process, fixed length tokens are extracted for relevant features types from different text and structural elements of web content. For each relevant feature type, a corresponding feature probability is calculated. The feature probabilities are combined to an overall genre probability that the web content belongs to a specific trained web content genre. A genre classification result is then output comprising at least one specific trained web content genre to which the web content belongs together with a corresponding genre probability.

Type: Application

Filed: December 4, 2013

Publication date: July 17, 2014

Applicant: International Business Machines Corporation

Inventors: Dirk Harz, Ralf Iffert, Mark Keinhoerster, Mark Usher
Method for Identifying Pairs of Derivative and Original Images

Publication number: 20120328189

Abstract: The invention provides for a computer-implemented method for detecting one or more archive images matching a search image, each matching archive image being a derivative of the search image or being an original image the search image was derived from accessing a plurality of the archive images. For each of said archive images, a respective archive image histogram may be calculated, wherein each archive image histogram includes a plurality of combination micro-feature values. The archive image histogram may be stored to a database.

Type: Application

Filed: June 26, 2012

Publication date: December 27, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Mark Usher
Generating a fingerprint of a bit sequence

Patent number: 8086675

Abstract: A method of generating a fingerprint of a bit sequence includes determining a relative occurrence frequency of each bit combination of a set of bit combinations in the bit sequence, wherein the set of bit combinations comprises all possible non-redundant sub-sequences of bits having at least one bit and at most a preset maximal number of bits. The method further includes determining for each bit combination of the set of bit combinations a difference value between the relative occurrence frequency of the bit combination and a random occurrence frequency, the random occurrence frequency relating to the expected random occurrence of the bit combination in the bit sequence. Moreover, the method includes allocating a set of bins, each bin of the set of bins being associated with a predetermined interval of difference values, each bin further relating to a bin value.

Type: Grant

Filed: May 13, 2008

Date of Patent: December 27, 2011

Assignee: International Business Machines Corporation

Inventor: Mark Usher
Method and system for filtering spam using an adjustable reliability value

Patent number: 7552186

Abstract: A spam detection system can monitor incoming and outgoing email messages and prevent email messages from being delivered. This spam detection system incorporates a sender ranking system that maintains prior sender's email addresses and an associated reliability value in a database. If an email message is categorized as spam, the system searches to see if the sender is located in the database. If the sender is located in the database and their reliability value is above a certain threshold, the sender's reliability value is decreased and the email message is treated as not spam. If the sender is not located in the database, the email message is discarded as spam. If an email message is not categorized as spam, prior users located in the database will have their reliability values increased, while new users will be entered into the database at a default level.

Type: Grant

Filed: June 10, 2005

Date of Patent: June 23, 2009

Assignee: International Business Machines Corporation

Inventors: Carsten Werner, Mark Usher
GENERATING A FINGERPRINT OF A BIT SEQUENCE

Publication number: 20090030994

Abstract: A method of generating a fingerprint of a bit sequence includes determining a relative occurrence frequency of each bit combination of a set of bit combinations in the bit sequence, wherein the set of bit combinations comprises all possible non-redundant sub-sequences of bits having at least one bit and at most a preset maximal number of bits. The method further includes determining for each bit combination of the set of bit combinations a difference value between the relative occurrence frequency of the bit combination and a random occurrence frequency, the random occurrence frequency relating to the expected random occurrence of the bit combination in the bit sequence. Moreover, the method includes allocating a set of bins, each bin of the set of bins being associated with a predetermined interval of difference values, each bin further relating to a bin value.

Type: Application

Filed: May 13, 2008

Publication date: January 29, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION (IBM)

Inventor: Mark Usher
Spam filter with sender ranking system

Publication number: 20060031373

Abstract: A spam detection system can monitor incoming and outgoing email messages and prevent email messages from being delivered. This spam detection system incorporates a sender ranking system that maintains prior sender's email addresses and an associated reliability value in a database. If an email message is categorized as spam, the system searches to see if the sender is located in the database. If the sender is located in the database and their reliability value is above a certain threshold, the sender's reliability value is decreased and the email message is treated as not spam. If the sender is not located in the database, the email message is discarded as spam. If an email message is not categorized as spam, prior users located in the database will have their reliability values increased, while new users will be entered into the database at a default level.

Type: Application

Filed: June 10, 2005

Publication date: February 9, 2006

Applicant: Internet Security Systems, Inc.

Inventors: Carsten Werner, Mark Usher