Patents by Inventor Ho-Hsiang Wu

Ho-Hsiang Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250124292
    Abstract: A method for a machine learning network includes receiving one or more sound segments and one or more associated text labels indicating captions associated with the sound segments, generating, utilizing a large language model of the machine learning network, one or more counterfactual captions associated with the one or more sound segments, wherein the one or more counterfactual captions are adversarial captions, determining a loss associated with the one or more sound segments, one or more associated text labels, and one or more counterfactual captions, updating parameters associated with an audio encoder or text encoder of the machine learning network, in response to falling below a threshold, repeating the steps listed above, and in response to meeting the threshold and utilizing a ranking, updating final parameters associated with the machine learning network.
    Type: Application
    Filed: October 12, 2023
    Publication date: April 17, 2025
    Inventors: Luca BONDI, Mohammad Ali VOSOUGHI, Ho-Hsiang WU, Samarjit DAS
  • Publication number: 20250099067
    Abstract: Methods and systems for training an audio-based machine learning model to predict a health condition based on biological sounds emitted by a person. Audio data corresponding to biological sounds produced by the person is generated from a microphone. The audio data is segmented into a plurality of segments, each segment associated with a respective sound event. An audio-based machine learning model is executed on the plurality of segments. The audio-based machine learning model is configured to output, for each segment, a label of a medical condition and an associated confidence score. The model is trained via active learning, in which a subset of the plurality of segments is selected based on their confidence scores being below a threshold, and provided to a human for annotation.
    Type: Application
    Filed: September 26, 2023
    Publication date: March 27, 2025
    Inventors: Shabnam GHAFFARZADEGAN, Samarjit DAS, Luca BONDI, Ho-Hsiang WU, Joseph ARACRI, Kelly J. SHIELDS, Sirajum MUNIR
  • Publication number: 20240362269
    Abstract: Systems and methods for cross-modal retrieval are provided. According to one aspect, a method for cross-modal retrieval includes obtaining a query describing a sound using a query modality other than a sound modality; encoding the query to obtain a query embedding using a query encoder network for the query modality and a query projection network, wherein the query projection network includes a self-attention layer, and wherein the query embedding is in a joint embedding space for the query modality and the sound modality; and providing a response including an audio sample based on the query embedding, wherein the audio sample includes the sound.
    Type: Application
    Filed: April 28, 2023
    Publication date: October 31, 2024
    Inventors: Ho-Hsiang Wu, Oriol Nieto, Justin Jonathan Salamon
  • Patent number: 9753925
    Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.
    Type: Grant
    Filed: November 4, 2015
    Date of Patent: September 5, 2017
    Assignee: Gracenote, Inc.
    Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu
  • Publication number: 20160124953
    Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.
    Type: Application
    Filed: November 4, 2015
    Publication date: May 5, 2016
    Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu
  • Patent number: 9213747
    Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.
    Type: Grant
    Filed: February 26, 2015
    Date of Patent: December 15, 2015
    Assignee: Gracenote, Inc.
    Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu
  • Publication number: 20150234833
    Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.
    Type: Application
    Filed: February 26, 2015
    Publication date: August 20, 2015
    Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu
  • Patent number: 8996538
    Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.
    Type: Grant
    Filed: January 6, 2011
    Date of Patent: March 31, 2015
    Assignee: Gracenote, Inc.
    Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu
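
Illustrative note on publication 20250099067: the abstract describes an active-learning step in which segments whose confidence score falls below a threshold are routed to a human for annotation. The sketch below shows only that selection step, under stated assumptions. The `Segment` class, the `select_for_annotation` function, the example labels, and the 0.6 threshold are all hypothetical and are not taken from the application, which does not specify an implementation.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One sound-event segment with a model prediction (hypothetical structure)."""
    segment_id: int
    label: str         # predicted condition label
    confidence: float  # model confidence in [0, 1]


def select_for_annotation(segments, threshold=0.6):
    """Return the segments whose confidence is strictly below the threshold.

    The threshold value is an assumption chosen for illustration only.
    """
    return [s for s in segments if s.confidence < threshold]


# Made-up predictions, used only to exercise the selection step.
segments = [
    Segment(0, "cough", 0.92),
    Segment(1, "wheeze", 0.41),
    Segment(2, "cough", 0.58),
    Segment(3, "snore", 0.77),
]

to_annotate = select_for_annotation(segments, threshold=0.6)
print([s.segment_id for s in to_annotate])  # prints [1, 2]
```

In a full active-learning loop, the selected segments would be labeled by a human and added back to the training set before the model is retrained; that part is omitted here.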