Patents by Inventor Henghui Zhu

Henghui Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12062368
    Abstract: Systems and methods to detect themes in contacts data. Contacts data may be encoded as text (e.g., chat logs), audio (e.g., audio recordings), and various other modalities. Text-based transcripts of contacts data may be parsed into turns, an issue turn may be detected using a machine learning model, a key phrase may be extracted from the issue turn. Key phrases from across multiple contacts data may be clustered to identify themes.
    Type: Grant
    Filed: September 30, 2020
    Date of Patent: August 13, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Anuroop Arora, Atul Deo, Ramesh M. Nallapati, Henghui Zhu, Arvind Arikatla, Sai Bharadwaj Kanduri, Srikanth Prabala, Dejiao Zhang
  • Patent number: 11874864
    Abstract: A method (100) for generating a domain-specific training set, comprising: generating (130) a generic corpus comprising a plurality of tokenized documents, comprising: (i) parsing (132) a document retrieved from the generic corpus; (ii) preprocessing (134) the parsed document; (iii) tokenizing (136) the preprocessed document; and (iv) storing (138) the tokenized document in the generic corpus; generating (140) an ontology database of tokenized entries, comprising: (i) parsing (142) an ontology entry retrieved from an ontology; (ii) preprocessing (144) the parsed entry; (iii) tokenizing (146) the preprocessed entry; and (iv) storing (148) the tokenized entry in the ontology database; querying (150), using domain-specific tokenized entries from the ontology database, the tokenized documents in the generic corpus; identifying (160), based on the query, a plurality of tokenized documents specific to the domain; and storing (170), in a training set database, the identified tokenized documents as a training set spec
    Type: Grant
    Filed: November 26, 2019
    Date of Patent: January 16, 2024
    Assignee: Koninklijke Philips N.V.
    Inventors: Henghui Zhu, Amir Mohammad Tahmasebi Maraghoosh, Ioannis Paschalidis
  • Publication number: 20210383066
    Abstract: A method (100) for generating a domain-specific training set, comprising: generating (130) a generic corpus comprising a plurality of tokenized documents, comprising: (i) parsing (132) a document retrieved from the generic corpus; (ii) preprocessing (134) the parsed document; (iii) tokenizing (136) the preprocessed document; and (iv) storing (138) the tokenized document in the generic corpus; generating (140) an ontology database of tokenized entries, comprising: (i) parsing (142) an ontology entry retrieved from an ontology; (ii) preprocessing (144) the parsed entry; (iii) tokenizing (146) the preprocessed entry; and (iv) storing (148) the tokenized entry in the ontology database; querying (150), using domain-specific tokenized entries from the ontology database, the tokenized documents in the generic corpus; identifying (160), based on the query, a plurality of tokenized documents specific to the domain; and storing (170), in a training set database, the identified tokenized documents as a training set spec
    Type: Application
    Filed: November 26, 2019
    Publication date: December 9, 2021
    Inventors: Henghui Zhu, Amir Mohammad Tahmasebi Maraghoosh, Ioannis Paschalidis