Patents by Inventor Henghui Zhu

Henghui Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Programmatic theme detection in contacts analytics service

Patent number: 12062368

Abstract: Systems and methods to detect themes in contacts data. Contacts data may be encoded as text (e.g., chat logs), audio (e.g., audio recordings), and various other modalities. Text-based transcripts of contacts data may be parsed into turns, an issue turn may be detected using a machine learning model, a key phrase may be extracted from the issue turn. Key phrases from across multiple contacts data may be clustered to identify themes.

Type: Grant

Filed: September 30, 2020

Date of Patent: August 13, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Anuroop Arora, Atul Deo, Ramesh M. Nallapati, Henghui Zhu, Arvind Arikatla, Sai Bharadwaj Kanduri, Srikanth Prabala, Dejiao Zhang
Method and system for creating a domain-specific training corpus from generic domain corpora

Patent number: 11874864

Abstract: A method (100) for generating a domain-specific training set, comprising: generating (130) a generic corpus comprising a plurality of tokenized documents, comprising: (i) parsing (132) a document retrieved from the generic corpus; (ii) preprocessing (134) the parsed document; (iii) tokenizing (136) the preprocessed document; and (iv) storing (138) the tokenized document in the generic corpus; generating (140) an ontology database of tokenized entries, comprising: (i) parsing (142) an ontology entry retrieved from an ontology; (ii) preprocessing (144) the parsed entry; (iii) tokenizing (146) the preprocessed entry; and (iv) storing (148) the tokenized entry in the ontology database; querying (150), using domain-specific tokenized entries from the ontology database, the tokenized documents in the generic corpus; identifying (160), based on the query, a plurality of tokenized documents specific to the domain; and storing (170), in a training set database, the identified tokenized documents as a training set spec

Type: Grant

Filed: November 26, 2019

Date of Patent: January 16, 2024

Assignee: Koninklijke Philips N.V.

Inventors: Henghui Zhu, Amir Mohammad Tahmasebi Maraghoosh, Ioannis Paschalidis
METHOD AND SYSTEM FOR CREATING A DOMAIN-SPECIFIC TRAINING CORPUS FROM GENERIC DOMAIN CORPORA

Publication number: 20210383066

Abstract: A method (100) for generating a domain-specific training set, comprising: generating (130) a generic corpus comprising a plurality of tokenized documents, comprising: (i) parsing (132) a document retrieved from the generic corpus; (ii) preprocessing (134) the parsed document; (iii) tokenizing (136) the preprocessed document; and (iv) storing (138) the tokenized document in the generic corpus; generating (140) an ontology database of tokenized entries, comprising: (i) parsing (142) an ontology entry retrieved from an ontology; (ii) preprocessing (144) the parsed entry; (iii) tokenizing (146) the preprocessed entry; and (iv) storing (148) the tokenized entry in the ontology database; querying (150), using domain-specific tokenized entries from the ontology database, the tokenized documents in the generic corpus; identifying (160), based on the query, a plurality of tokenized documents specific to the domain; and storing (170), in a training set database, the identified tokenized documents as a training set spec

Type: Application

Filed: November 26, 2019

Publication date: December 9, 2021

Inventors: Henghui Zhu, Amir Mohammad Tahmasebi Maraghoosh, Ioannis Paschalidis

Programmatic theme detection in contacts analytics service

Method and system for creating a domain-specific training corpus from generic domain corpora

METHOD AND SYSTEM FOR CREATING A DOMAIN-SPECIFIC TRAINING CORPUS FROM GENERIC DOMAIN CORPORA