Patents by Inventor Xuehong Mao

Xuehong Mao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

MULTI-TIME-SCALE NEURAL AUDIO CODEC STREAMS

Publication number: 20250131940

Abstract: A data-driven audio codec system that involves producing multiple compressed streams comprising encoded information (e.g., codeword indices) at different time scales (time intervals or frequency). This may allow for separation of different properties of speech, such as content and aspects of style (prosody), into the different compressed streams without explicitly enforcing it, i.e., in an unsupervised manner. Speech audio is encoded to produce a plurality of encoded streams comprising encoded information for the speech audio at different time scales. The plurality of encoded streams are decoded to generate output audio.

Type: Application

Filed: December 14, 2023

Publication date: April 24, 2025

Inventors: Rafal Pilarczyk, Amir Salah Abdelsamie Abdelwahed, Hui-Ling Lu, Ivana Balic, Yusuf Ziya Isik, David Guoqing Zhang, Xuehong Mao, Samer Lutfi Hijazi

PACKET LOSS CONCEALMENT IN AN AUDIO DECODER

Publication number: 20250131933

Abstract: A method of performing packet loss concealment in a neural audio encoder/decoder (codec) system. The method includes receiving an indication of a lost audio packet at a receive side of a neural network audio codec system that includes an audio encoder and an audio decoder, wherein the lost audio packet comprises an index of a codeword that is representative of a portion of speech audio presented to the audio encoder, predicting the index of the codeword in the lost packet to obtain a predicted index, deriving a predicted embedding vector from the predicted index, and decoding, by the audio decoder, the embedding vector to generate an audio output.

Type: Application

Filed: December 14, 2023

Publication date: April 24, 2025

Inventors: Amir Salah Abdelsamie Abdelwahed, Yusuf Ziya Isik, Xuehong Mao, Samir Ouelha, Samer Lutfi Hijazi
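
The concealment steps named in the abstract above (predict the lost codeword index, derive an embedding from it, decode the embedding) can be illustrated with a minimal Python sketch. This is not the method claimed in the application: the codebook contents, the `predict_index` heuristic (most frequent recent index, standing in for whatever predictor a real system would use), and the `decode` stub are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical codebook: codeword index -> embedding vector.
CODEBOOK = {0: (0.0, 0.0), 1: (1.0, 0.5), 2: (0.2, 0.9)}


def predict_index(history):
    """Guess the lost index as the most frequent recently received index.

    Assumption: a real system would likely use a trained predictor here.
    """
    return Counter(history).most_common(1)[0][0]


def decode(embedding):
    """Placeholder decoder: returns the embedding unchanged."""
    return embedding


def conceal_lost_packet(history):
    """Predict the lost index, derive its embedding, and decode it."""
    predicted = predict_index(history)
    return decode(CODEBOOK[predicted])


# Indices received before the loss; the next packet never arrived.
print(conceal_lost_packet([1, 2, 1]))
```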

GENERATIVE SPEECH MODEL FOR COMPACT DATA-DRIVEN SPEECH VECTORS FOR VERSATILE SPEECH APPLICATIONS

Publication number: 20250131919

Abstract: A neural network audio codec system and related methods are provided. In one example, a method is provided comprising: obtaining speech audio to be encoded; applying the speech audio to an audio encoder that is part of a neural network audio codec system that includes the audio encoder and an audio decoder. The audio encoder and the audio decoder have been trained in an end-to-end manner. The speech audio is encoded with the audio encoder to generate embedding vectors that represent a snapshot of speech audio attributes over successive timeframes of the raw speech audio, and from the embedding vectors, codeword indices are generated to entries in a codebook. The codeword indices are then transmitted or stored for later retrieval and processing by the audio decoder.

Type: Application

Filed: December 14, 2023

Publication date: April 24, 2025

Inventors: Xuehong Mao, Samer Lutfi Hijazi, Christopher Rowen, Mathew Shaji Kavalekalam, Ivana Balic, Mengjun Leng, Yusuf Ziya Isik, Adam Ali Sabra, Amir Salah Abdelsamie Abdelwahed, Samir Ouelha, Mihailo Kolundzija

Computationally efficient and bitrate scalable soft vector quantization

Patent number: 12149263

Abstract: In some aspects, the techniques described herein relate to a method including: obtaining data to be compressed; determining a distance between the data to be compressed and each codeword of a plurality of codewords; selecting a predetermined number of codewords of the plurality of codewords based on the distance between the data to be compressed and each of the predetermined number of codewords; and generating compressed data, where the compressed data includes an indication of the predetermined number of codewords of the plurality of codewords.

Type: Grant

Filed: December 12, 2022

Date of Patent: November 19, 2024

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Yusuf Ziya Isik, Amir Salah Abdelsamie Abdelwahed, Xuehong Mao, Ivana M. Balic, Samer Lutfi Hijazi
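
The selection step described in the abstract above (compute a distance to every codeword, then keep a predetermined number of them) can be sketched in Python as follows. This is only an illustration under stated assumptions: Euclidean distance, a tiny hand-written codebook, and the function name `select_nearest_codewords` are choices made for this example and are not taken from the patent.

```python
import math


def select_nearest_codewords(vector, codebook, k):
    """Return the indices of the k codewords closest to `vector`.

    Assumption: closeness is measured with Euclidean distance.
    """
    distances = [
        (math.dist(vector, codeword), index)
        for index, codeword in enumerate(codebook)
    ]
    distances.sort()  # nearest first; ties broken by lower index
    return [index for _, index in distances[:k]]


# Hypothetical 2-D codebook with four codewords.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (5.0, 5.0)]

# The "compressed data" here is just the list of selected indices.
compressed = select_nearest_codewords((0.9, 0.1), codebook, k=2)
print(compressed)
```

In this sketch, raising `k` sends more indices per input, which is one plausible reading of how such a scheme could trade bitrate for fidelity.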

DATA DRIVEN AUDIO ENHANCEMENT

Publication number: 20240371392

Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

Type: Application

Filed: July 15, 2024

Publication date: November 7, 2024

Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen
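
The pipeline in the abstract above (classify a window of samples, pick an enhancement network for that class, apply it) can be illustrated with a toy Python sketch. Everything here is a stand-in: the abstract specifies neural networks for both the classifier and the enhancers, whereas `classify` below is a simple amplitude threshold and the entries of `ENHANCERS` are plain functions, all hypothetical and chosen only to show the dispatch structure.

```python
def classify(window):
    """Stand-in classifier: label a window by its mean absolute amplitude."""
    energy = sum(abs(sample) for sample in window) / len(window)
    return "speech" if energy > 0.1 else "noise"


def boost_and_clip(window):
    """Stand-in 'speech' enhancer: double the gain, clip to [-1, 1]."""
    return [max(-1.0, min(1.0, 2.0 * sample)) for sample in window]


def mute(window):
    """Stand-in 'noise' enhancer: silence the window."""
    return [0.0 for _ in window]


# One enhancer per class, selected by the classification result.
ENHANCERS = {"speech": boost_and_clip, "noise": mute}


def enhance(window):
    """Classify the window, then apply the enhancer chosen for that class."""
    return ENHANCERS[classify(window)](window)


print(enhance([0.5, -0.75]))
print(enhance([0.01, -0.02]))
```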

BANDWIDTH UTILIZATION TECHNIQUES FOR IN-BAND REDUNDANT DATA

Publication number: 20240322942

Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.

Type: Application

Filed: May 31, 2024

Publication date: September 26, 2024

Inventors: Amir Salah Abdelsamie Abdelwahed, Ivana Balic, Yusuf Ziya Isik, Xuehong Mao, Samer Lutfi Hijazi
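
The packet layout described in the abstract above (a current data portion plus redundancy for a previous portion, with the redundancy depending on the current portion) can be sketched in Python. This is a loose illustration, not the claimed technique: it assumes "based upon content of the current data portion" can be modeled as "sized by the room the current portion leaves in a fixed packet budget", and it uses byte truncation as a stand-in for real encoding and forward error correction. The names `build_packet` and `packet_budget` are hypothetical.

```python
def build_packet(current: bytes, previous: bytes, packet_budget: int) -> dict:
    """Pack the current portion, then fill spare room with redundancy.

    Assumptions: truncation stands in for a real encoder, and a truncated
    copy of the previous portion stands in for real FEC data.
    """
    encoded_current = current[:packet_budget]
    spare = packet_budget - len(encoded_current)
    redundant_previous = previous[:spare]
    return {"current": encoded_current, "fec_previous": redundant_previous}


# A small current portion leaves more room for redundancy.
print(build_packet(b"abcd", b"wxyz", packet_budget=6))
# A large current portion leaves none.
print(build_packet(b"abcdef", b"wxyz", packet_budget=6))
```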

Data driven audio enhancement

Patent number: 12073850

Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

Type: Grant

Filed: March 25, 2021

Date of Patent: August 27, 2024

Assignee: Cisco Technology, Inc.

Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen

Bandwidth utilization techniques for in-band redundant data

Patent number: 12040894

Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.

Type: Grant

Filed: January 9, 2023

Date of Patent: July 16, 2024

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Amir Salah Abdelsamie Abdelwahed, Ivana Balic, Yusuf Ziya Isik, Xuehong Mao, Samer Lutfi Hijazi

BANDWIDTH UTILIZATION TECHNIQUES FOR IN-BAND REDUNDANT DATA

Publication number: 20240235727

Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.

Type: Application

Filed: January 9, 2023

Publication date: July 11, 2024

Inventors: Amir Salah Abdelsamie Abdelwahed, Ivana Balic, Yusuf Ziya Isik, Xuehong Mao, Samer Lutfi Hijazi

COMPUTATIONALLY EFFICIENT AND BITRATE SCALABLE SOFT VECTOR QUANTIZATION

Publication number: 20240195438

Abstract: In some aspects, the techniques described herein relate to a method including: obtaining data to be compressed; determining a distance between the data to be compressed and each codeword of a plurality of codewords; selecting a predetermined number of codewords of the plurality of codewords based on the distance between the data to be compressed and each of the predetermined number of codewords; and generating compressed data, where the compressed data includes an indication of the predetermined number of codewords of the plurality of codewords.

Type: Application

Filed: December 12, 2022

Publication date: June 13, 2024

Inventors: Yusuf Ziya Isik, Amir Salah Abdelsamie Abdelwahed, Xuehong Mao, Ivana M. Balic, Samer Lutfi Hijazi

TRANSFORMING SPEECH SIGNALS TO ATTENUATE SPEECH OF COMPETING INDIVIDUALS AND OTHER NOISE

Publication number: 20240161765

Abstract: In one example embodiment, speech signals are received from a user during a communication session. The received speech signals contain noise including speech of other individuals. The received speech signals are transformed by a machine learning model to produce transformed speech signals corresponding to the received speech signals with a reduced amount of the noise. The machine learning model is trained with speech of the user satisfying a noise threshold and collected during one or more communication sessions.

Type: Application

Filed: November 16, 2022

Publication date: May 16, 2024

Inventors: Kamil Krzysztof Wojcicki, Xuehong Mao, David Guoqing Zhang, Samer Hijazi, Raul Alejandro Casas

SPEECH ENHANCEMENT TECHNIQUES THAT MAINTAIN SPEECH OF NEAR-FIELD SPEAKERS

Publication number: 20220392478

Abstract: An endpoint selectively enhances a captured audio signal based on an operating mode. The endpoint obtains an audio input signal of multiple users in a physical location. The audio input signal is captured by a microphone. The endpoint separates voice signals from the audio input signal and determines an operating mode for an audio output signal. The endpoint selectively adjusts each of the voice signals based on the operating mode to generate the audio output signal.

Type: Application

Filed: September 10, 2021

Publication date: December 8, 2022

Inventors: Samer Lutfi Hijazi, Christopher Rowen, Xuehong Mao, Ivana M. Balic, Raul Alejandro Casas, Savita Kini

Filtering in trainable networks

Patent number: 11132619

Abstract: Some embodiments perform, in a multi-layer neural network in a computing device, a convolution operation on input feature maps with multiple convolutional filters. The convolutional filters have multiple filter precisions. In other embodiments, electronic design automation (EDA) systems, methods, and computer-readable media are presented for adding such a multi-layer neural network into an integrated circuit (IC) design.

Type: Grant

Filed: February 24, 2017

Date of Patent: September 28, 2021

Assignee: Cadence Design Systems, Inc.

Inventors: Raúl Alejandro Casas, Samer Lutfi Hijazi, Piyush Kaul, Rishi Kumar, Xuehong Mao, Christopher Rowen

DATA DRIVEN AUDIO ENHANCEMENT

Publication number: 20210217436

Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

Type: Application

Filed: March 25, 2021

Publication date: July 15, 2021

Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen

Complexity optimization of trainable networks

Patent number: 10997502

Abstract: Some embodiments perform, in a multi-layer neural network in a computing device, optimization of the multi-layer neural network, for example by making a convolutional change with a first plurality of convolutional filters, or by making a connection change of a first plurality of convolutional filters. In other embodiments, electronic design automation (EDA) systems, methods, and computer-readable media are presented for adding such a multi-layer neural network into an integrated circuit (IC) design.

Type: Grant

Filed: April 13, 2017

Date of Patent: May 4, 2021

Assignee: Cadence Design Systems, Inc.

Inventors: Raúl Alejandro Casas, Samer Lutfi Hijazi, Piyush Kaul, Rishi Kumar, Xuehong Mao, Christopher Rowen

Data driven audio enhancement

Patent number: 10991379

Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

Type: Grant

Filed: June 22, 2018

Date of Patent: April 27, 2021

Assignee: BabbleLabs LLC

Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen

System and method for hyper-parameter analysis for multi-layer computational structures

Patent number: 10534994

Abstract: The present disclosure relates to a computer-implemented method for analyzing one or more hyper-parameters for a multi-layer computational structure. The method may include accessing, using at least one processor, input data for recognition. The input data may include at least one of an image, a pattern, a speech input, a natural language input, a video input, and a complex data set. The method may further include processing the input data using one or more layers of the multi-layer computational structure and performing matrix factorization of the one or more layers. The method may also include analyzing one or more hyper-parameters for the one or more layers based upon, at least in part, the matrix factorization of the one or more layers.

Type: Grant

Filed: November 11, 2015

Date of Patent: January 14, 2020

Assignee: Cadence Design Systems, Inc.

Inventors: Piyush Kaul, Samer Lutfi Hijazi, Raul Alejandro Casas, Rishi Kumar, Xuehong Mao, Christopher Rowen

DATA DRIVEN AUDIO ENHANCEMENT

Publication number: 20190392852

Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

Type: Application

Filed: June 22, 2018

Publication date: December 26, 2019

Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen

Transform domain regression convolutional neural network for image segmentation

Patent number: 10290107

Abstract: Aspects of the present disclosure involve a transform domain regression convolutional neural network for image segmentation. Example embodiments include a system comprising a machine-readable storage medium storing instructions and computer-implemented methods for classifying one or more pixels in an image. The method may include analyzing the image to estimate one or more transform domain coefficients using a multi-layered function such as a convolutional neural network. The method may further include generating a segmented image by applying a change of basis transformation to the estimated one or more transform domain coefficients.

Type: Grant

Filed: June 19, 2017

Date of Patent: May 14, 2019

Assignee: Cadence Design Systems, Inc.

Inventors: Raúl Alejandro Casas, Samer Lutfi Hijazi, Rishi Kumar, Piyush Kaul, Xuehong Mao, Christopher Rowen, Himanshu Charaya

Method for allocating resources in cell-edge bands of OFDMA networks

Patent number: 8165098

Abstract: A method allocates bandwidth from a radio frequency spectrum in a cellular network including a set of cells. Each cell includes a base station for serving a set of mobile stations in the cell. An area around each base station is partitioned into a center region and an edge region. In each base station, cell-center bandwidth for use by the mobile stations in the center region is reserved according to an inter-cell interference coordination (ICIC) protocol, and cell-edge bandwidth for use by the mobile stations in the edge region is reserved according to the ICIC protocol. The bandwidth can be fixed or adaptive to reduce the signaling overhead. The adaptive bandwidth can be further partitioned into reserved and the free bands. Mobile stations are classified as primary and secondary users, depending on whether they use or are assigned the fixed or adaptive band radio resources.

Type: Grant

Filed: December 15, 2008

Date of Patent: April 24, 2012

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Koon Hoo Teo, Zhifeng Tao, Xuehong Mao, Amine Maaref
