Patents by Inventor Huihsin Tseng

Huihsin Tseng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for intelligent machine learning-based malware detection

Patent number: 12346445

Abstract: The methods described herein include receiving a plurality of packets associated with a file, each of the plurality of packets comprising content, and a source domain; extracting one or more features from content of a first packet of the plurality of packets; applying a trained machine learning model to the extracted one or more features to determine a probability of maliciousness associated with the first packet; responsive to determining that the probability maliciousness of the first packet is between a first threshold value and a second threshold value, labeling the first packet as having an uncertain maliciousness; extracting one or more features from content of a second packet of the plurality of packets; and applying the trained machine learning model to the extracted one or more features of the first packet and the second packet to determine a probability of maliciousness associated with the second packet.

Type: Grant

Filed: October 17, 2023

Date of Patent: July 1, 2025

Assignee: Zscaler, Inc.

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
Systems and methods for intelligent machine learning-based malware detection

Publication number: 20250156546

Abstract: Systems and methods for training a machine learning model for malware detection include steps of collecting a training dataset comprising a plurality of malicious files and a plurality of benign files from one or more sources; extracting features from each file in the training dataset, wherein the features include at least one of n-gram features, entropy features, or domain features; labeling each file in the training dataset as malicious or benign based on a predefined criterion; and applying a supervised machine learning technique to learn patterns in the extracted features and generate a trained machine learning model configured to predict whether a file is malicious or benign based on an incremental packet-based analysis.

Type: Application

Filed: January 16, 2025

Publication date: May 15, 2025

Applicant: Zscaler, Inc.

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
Systems and methods for intelligent machine learning-based malware detection

Publication number: 20240045963

Abstract: The methods described herein include receiving a plurality of packets associated with a file, each of the plurality of packets comprising content, and a source domain; extracting one or more features from content of a first packet of the plurality of packets; applying a trained machine learning model to the extracted one or more features to determine a probability of maliciousness associated with the first packet; responsive to determining that the probability maliciousness of the first packet is between a first threshold value and a second threshold value, labeling the first packet as having an uncertain maliciousness; extracting one or more features from content of a second packet of the plurality of packets; and applying the trained machine learning model to the extracted one or more features of the first packet and the second packet to determine a probability of maliciousness associated with the second packet.

Type: Application

Filed: October 17, 2023

Publication date: February 8, 2024

Applicant: Zscaler, Inc.

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
Machine learning-based malware detection system and method

Patent number: 11822657

Abstract: Disclosed is a computer implemented method for malware detection that analyses a file on a per packet basis. The method receives a packet of one or more packets associated a file, and converting a binary content associated with the packet into a digital representation and tokenizing plain text content associated with the packet. The method extracts one or more n-gram features, an entropy feature, and a domain feature from the converted content of the packet and applies a trained machine learning model to the one or more features extracted from the packet. The output of the machine learning method is a probability of maliciousness associated with the received packet. If the probability of maliciousness is above a threshold value, the method determines that the file associated with the received packet is malicious.

Type: Grant

Filed: April 20, 2022

Date of Patent: November 21, 2023

Assignee: Zscaler, Inc.

Inventors: Huihsin Tseng, Hao Xu, Jian L Zhen
Machine learning-based malware detection system and method

Publication number: 20220245248

Abstract: Disclosed is a computer implemented method for malware detection that analyses a file on a per packet basis. The method receives a packet of one or more packets associated a file, and converting a binary content associated with the packet into a digital representation and tokenizing plain text content associated with the packet. The method extracts one or more n-gram features, an entropy feature, and a domain feature from the converted content of the packet and applies a trained machine learning model to the one or more features extracted from the packet. The output of the machine learning method is a probability of maliciousness associated with the received packet. If the probability of maliciousness is above a threshold value, the method determines that the file associated with the received packet is malicious.

Type: Application

Filed: April 20, 2022

Publication date: August 4, 2022

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
System and method for malware detection on a per packet basis

Patent number: 11341242

Abstract: Disclosed is a computer implemented method for malware detection that analyses a file on a per packet basis. The method receives a packet of one or more packets associated a file, and converting a binary content associated with the packet into a digital representation and tokenizing plain text content associated with the packet. The method extracts one or more n-gram features, an entropy feature, and a domain feature from the converted content of the packet and applies a trained machine learning model to the one or more features extracted from the packet. The output of the machine learning method is a probability of maliciousness associated with the received packet. If the probability of maliciousness is above a threshold value, the method determines that the file associated with the received packet is malicious.

Type: Grant

Filed: October 12, 2020

Date of Patent: May 24, 2022

Assignee: Zscaler, Inc.

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
System and method for malware detection on a per packet basis

Publication number: 20210026962

Abstract: Disclosed is a computer implemented method for malware detection that analyses a file on a per packet basis. The method receives a packet of one or more packets associated a file, and converting a binary content associated with the packet into a digital representation and tokenizing plain text content associated with the packet. The method extracts one or more n-gram features, an entropy feature, and a domain feature from the converted content of the packet and applies a trained machine learning model to the one or more features extracted from the packet. The output of the machine learning method is a probability of maliciousness associated with the received packet. If the probability of maliciousness is above a threshold value, the method determines that the file associated with the received packet is malicious.

Type: Application

Filed: October 12, 2020

Publication date: January 28, 2021

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
System and method for malware detection on a per packet basis

Patent number: 10817608

Abstract: Disclosed is a computer implemented method for malware detection that analyses a file on a per packet basis. The method receives a packet of one or more packets associated a file, and converting a binary content associated with the packet into a digital representation and tokenizing plain text content associated with the packet. The method extracts one or more n-gram features, an entropy feature, and a domain feature from the converted content of the packet and applies a trained machine learning model to the one or more features extracted from the packet. The output of the machine learning method is a probability of maliciousness associated with the received packet. If the probability of maliciousness is above a threshold value, the method determines that the file associated with the received packet is malicious.

Type: Grant

Filed: April 5, 2018

Date of Patent: October 27, 2020

Assignee: Zscaler, Inc.

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
SYSTEM AND METHOD FOR MALWARE DETECTION ON A PER PACKET BASIS

Publication number: 20180293381

Abstract: Disclosed is a computer implemented method for malware detection that analyses a file on a per packet basis. The method receives a packet of one or more packets associated a file, and converting a binary content associated with the packet into a digital representation and tokenizing plain text content associated with the packet. The method extracts one or more n-gram features, an entropy feature, and a domain feature from the converted content of the packet and applies a trained machine learning model to the one or more features extracted from the packet. The output of the machine learning method is a probability of maliciousness associated with the received packet. If the probability of maliciousness is above a threshold value, the method determines that the file associated with the received packet is malicious.

Type: Application

Filed: April 5, 2018

Publication date: October 11, 2018

Inventors: Huihsin Tseng, Hao Xu, Jian L. Zhen
Query-URL N-Gram Features in Web Ranking

Publication number: 20110040769

Abstract: In one embodiment, access one or more pairs of search query and clicked Uniform Resource Locator (URL). For each of the pairs of search query and clicked URL, segment the search query into one or more query segments and the clicked URL into one or more URL segments; construct one or more query-URL n-grams, each of which comprises a query part comprising at least one of the query segments and a URL part comprising at least one of the URL segments; and calculate one or more association scores, each of which for one of the query-URL n-grams and represents a similarity between the query part and the URL part of the query-URL n-gram and is based on a first frequency of the query part and the URL part, a second frequency of the query part, and a third frequency of the URL part.

Type: Application

Filed: August 13, 2009

Publication date: February 17, 2011

Applicant: Yahoo! Inc.

Inventors: Huihsin Tseng, Longbin Chen, Yumao Lu, Fachun Peng