Patents by Inventor Ho-Hsiang Wu

Ho-Hsiang Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND SYSTEM TO TRAIN AUDIO RETRIEVAL AND ZERO SHOT CLASSIFICATION SYSTEMS WITH COUNTER-FACTUAL PROMPTS

Publication number: 20250124292

Abstract: A method for a machine learning network includes receiving one or more sound segments and one or more associated text labels indicating captions associated with the sound segments, generating, utilizing a large language model of the machine learning network, one or more counterfactual captions associated with the one or more sound segments, wherein the one or more counterfactual captions are adversarial captions, determining a loss associated with the one or more sound segments, the one or more associated text labels, and the one or more counterfactual captions, updating parameters associated with an audio encoder or text encoder of the machine learning network, in response to falling below a threshold, repeating the steps listed above, and in response to meeting the threshold and utilizing a ranking, updating final parameters associated with the machine learning network.

Type: Application

Filed: October 12, 2023

Publication date: April 17, 2025

Inventors: Luca BONDI, Mohammad Ali VOSOUGHI, Ho-Hsiang WU, Samarjit DAS
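
The abstract above does not say which loss is used. As a rough illustration only, the sketch below assumes a contrastive (InfoNCE-style) loss in which the counterfactual captions act as hard negatives for the audio embedding. The function names, the temperature value, and the toy embeddings are hypothetical and are not taken from the patent application.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def counterfactual_contrastive_loss(audio, caption, counterfactuals, temperature=0.1):
    """Assumed loss: negative log-probability of the true caption when it
    competes against the counterfactual captions for the same audio."""
    pos = cosine(audio, caption) / temperature
    negs = [cosine(audio, c) / temperature for c in counterfactuals]
    logits = [pos] + negs
    # Log-sum-exp with the maximum subtracted for numerical stability.
    m = max(logits)
    log_denom = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_denom - pos


if __name__ == "__main__":
    audio_emb = [1.0, 0.0]             # toy audio embedding
    true_caption_emb = [1.0, 0.0]      # toy embedding of the real caption
    counterfactual_embs = [[0.0, 1.0]]  # toy embedding of an adversarial caption
    print(counterfactual_contrastive_loss(audio_emb, true_caption_emb, counterfactual_embs))
```

Under this assumption the loss is small when the audio embedding sits closer to its real caption than to any counterfactual caption, and grows when a counterfactual caption scores higher.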

ACTIVE LEARNING ON BIOLOGICAL SOUNDS FOR DETERMINING PRESENCE OF MEDICAL CONDITION

Publication number: 20250099067

Abstract: Methods and systems for training an audio-based machine learning model to predict a health condition based on biological sounds emitted by a person. Audio data corresponding to biological sounds produced by the person is generated from a microphone. The audio data is segmented into a plurality of segments, each segment associated with a respective sound event. An audio-based machine learning model is executed on the plurality of segments. The audio-based machine learning model is configured to output, for each segment, a label of a medical condition and an associated confidence score. The model is trained via active learning, in which a subset of the plurality of segments is selected based on their confidence score being below a threshold, and provided to a human for annotation.

Type: Application

Filed: September 26, 2023

Publication date: March 27, 2025

Inventors: Shabnam GHAFFARZADEGAN, Samarjit DAS, Luca BONDI, Ho-Hsiang WU, Joseph ARACRI, Kelly J. SHIELDS, Sirajum MUNIR
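
The selection step described in the abstract above (segments whose confidence score falls below a threshold go to a human annotator) can be sketched as follows. This is a minimal illustration, not the application's implementation: the `SegmentPrediction` type, the threshold value of 0.6, and the example labels are all assumptions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SegmentPrediction:
    segment_id: int    # index of the audio segment (hypothetical field)
    label: str         # predicted medical-condition label
    confidence: float  # model confidence in [0, 1]


def select_for_annotation(predictions: List[SegmentPrediction],
                          threshold: float = 0.6) -> List[SegmentPrediction]:
    """Return the segments whose confidence is below the threshold.
    These are the ones a human would be asked to annotate."""
    return [p for p in predictions if p.confidence < threshold]


if __name__ == "__main__":
    preds = [
        SegmentPrediction(0, "cough", 0.95),
        SegmentPrediction(1, "wheeze", 0.40),
        SegmentPrediction(2, "cough", 0.55),
    ]
    for p in select_for_annotation(preds):
        print(p.segment_id, p.label, p.confidence)
```

The annotated segments would then be added to the training set before the model is retrained; that loop is omitted here.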

SYSTEMS AND METHODS FOR CROSS-MODAL RETRIEVAL BASED ON A SOUND MODALITY AND A NON-SOUND MODALITY

Publication number: 20240362269

Abstract: Systems and methods for cross-modal retrieval are provided. According to one aspect, a method for cross-modal retrieval includes obtaining a query describing a sound using a query modality other than a sound modality; encoding the query to obtain a query embedding using a query encoder network for the query modality and a query projection network, wherein the query projection network includes a self-attention layer, and wherein the query embedding is in a joint embedding space for the query modality and the sound modality; and providing a response including an audio sample based on the query embedding, wherein the audio sample includes the sound.

Type: Application

Filed: April 28, 2023

Publication date: October 31, 2024

Inventors: Ho-Hsiang Wu, Oriol Nieto, Justin Jonathan Salamon
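
Once a query and the audio samples share a joint embedding space, as in the abstract above, retrieval reduces to a nearest-neighbor search. The sketch below covers only that final ranking step and assumes cosine similarity as the ranking score. The query encoder and the projection network with its self-attention layer are not modeled, and the file names and vectors are made up for illustration.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def retrieve(query_embedding: Sequence[float],
             audio_index: Dict[str, Sequence[float]],
             top_k: int = 1) -> List[str]:
    """Rank audio samples by similarity to the query in the joint space
    and return the names of the top_k closest samples."""
    ranked = sorted(
        audio_index.items(),
        key=lambda item: cosine_similarity(query_embedding, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:top_k]]


if __name__ == "__main__":
    # Hypothetical precomputed audio embeddings in the joint space.
    index = {"dog_bark.wav": [0.9, 0.1], "rain.wav": [0.1, 0.9]}
    # Hypothetical embedding of a text query such as "a dog barking".
    print(retrieve([1.0, 0.0], index))
```

A production system would typically replace the linear scan with an approximate nearest-neighbor index, but the ranking criterion stays the same.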

Systems, methods, and apparatus for generating an audio-visual presentation using characteristics of audio, visual and symbolic media objects

Patent number: 9753925

Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.

Type: Grant

Filed: November 4, 2015

Date of Patent: September 5, 2017

Assignee: Gracenote, Inc.

Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu

SYSTEMS, METHODS, AND APPARATUS FOR GENERATING AN AUDIO-VISUAL PRESENTATION USING CHARACTERISTICS OF AUDIO, VISUAL AND SYMBOLIC MEDIA OBJECTS

Publication number: 20160124953

Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.

Type: Application

Filed: November 4, 2015

Publication date: May 5, 2016

Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu

Systems, methods, and apparatus for generating an audio-visual presentation using characteristics of audio, visual and symbolic media objects

Patent number: 9213747

Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.

Type: Grant

Filed: February 26, 2015

Date of Patent: December 15, 2015

Assignee: Gracenote, Inc.

Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu

SYSTEMS, METHODS, AND APPARATUS FOR GENERATING AN AUDIO-VISUAL PRESENTATION USING CHARACTERISTICS OF AUDIO, VISUAL AND SYMBOLIC MEDIA OBJECTS

Publication number: 20150234833

Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.

Type: Application

Filed: February 26, 2015

Publication date: August 20, 2015

Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu

Systems, methods, and apparatus for generating an audio-visual presentation using characteristics of audio, visual and symbolic media objects

Patent number: 8996538

Abstract: In an embodiment, a method and apparatus for generating a presentation is provided. The method considers characteristics of audio works and visual works when constructing the presentation. In some embodiments, the presentation may be automatically constructed.

Type: Grant

Filed: January 6, 2011

Date of Patent: March 31, 2015

Assignee: Gracenote, Inc.

Inventors: Markus K. Cremer, Ching-Wei Chen, Peter C. DiMaria, Ho-Hsiang Wu