Patents by Inventor Xuehong Mao
Xuehong Mao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250131940Abstract: A data-driven audio codec system that involves producing multiple compressed streams comprising encoded information (e.g., codeword indices) at different time scales (time intervals or frequency). This may allow for separation of different properties of speech, such as content and aspects of style (prosody), into the different compressed streams without explicitly enforcing it, i.e., in an unsupervised manner. Speech audio is encoded to produce a plurality of encoded streams comprising encoded information for the speech audio at different time scales. The plurality of encoded streams are decoded to generate output audio.Type: ApplicationFiled: December 14, 2023Publication date: April 24, 2025Inventors: Rafal Pilarczyk, Amir Salah Abdelsamie Abdelwahed, Hui-Ling Lu, Ivana Balic, Yusuf Ziya Isik, David Guoqing Zhang, Xuehong Mao, Samer Lutfi Hijazi
-
Publication number: 20250131933Abstract: A method of performing packet loss concealment in a neural audio encoder/decoder (codec) system. The method includes receiving an indication of a lost audio packet at a receive side of a neural network audio codec system that includes an audio encoder and an audio decoder, wherein the lost audio packet comprises an index of a codeword that is representative of a portion of speech audio presented to the audio encoder, predicting the index of the codeword in the lost packet to obtain a predicted index, deriving a predicted embedding vector from the predicted index, and decoding, by the audio decoder, the embedding vector to generate an audio output.Type: ApplicationFiled: December 14, 2023Publication date: April 24, 2025Inventors: Amir Salah Abdelsamie Abdelwahed, Yusuf Ziya Isik, Xuehong Mao, Samir Ouelha, Samer Lutfi Hijazi
-
Publication number: 20250131919Abstract: A neural network audio codec system and related methods are provided. In one example, a method is provided comprising: obtaining speech audio to be encoded; applying the speech audio to an audio encoder that is part of a neural network audio codec system that includes the audio encoder and an audio decoder. The audio encoder and the audio decoder have been trained in an end-to-end manner. The speech audio is encoded with the audio encoder to generate embedding vectors that represent a snapshot of speech audio attributes over successive timeframes of the raw speech audio, and from the embedding vectors, codeword indices are generated to entries in a codebook. The codeword indices are then transmitted or stored for later retrieval and processing by the audio decoder.Type: ApplicationFiled: December 14, 2023Publication date: April 24, 2025Inventors: Xuehong Mao, Samer Lutfi Hijazi, Christopher Rowen, Mathew Shaji Kavalekalam, Ivana Balic, Mengjun Leng, Yusuf Ziya Isik, Adam Ali Sabra, Amir Salah Abdelsamie Abdelwahed, Samir Ouelha, Mihailo Kolundzija
-
Patent number: 12149263Abstract: In some aspects, the techniques described herein relate to a method including: obtaining data to be compressed; determining a distance between the data to be compressed and each codeword of a plurality of codewords; selecting a predetermined number of codewords of the plurality of codewords based on the distance between the data to be compressed and each of the predetermined number of codewords; and generating compressed data, where the compressed data includes an indication of the predetermined number of codewords of the plurality of codewords.Type: GrantFiled: December 12, 2022Date of Patent: November 19, 2024Assignee: CISCO TECHNOLOGY, INC.Inventors: Yusuf Ziya Isik, Amir Salah Abdelsamie Abdelwahed, Xuehong Mao, Ivana M. Balic, Samer Lutfi Hijazi
-
Publication number: 20240371392Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.Type: ApplicationFiled: July 15, 2024Publication date: November 7, 2024Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen
-
Publication number: 20240322942Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.Type: ApplicationFiled: May 31, 2024Publication date: September 26, 2024Inventors: Amir Salah Abdelsamie Abdelwahed, Ivana Balic, Yusuf Ziya Isik, Xuehong Mao, Samer Lutfi Hijazi
-
Patent number: 12073850Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.Type: GrantFiled: March 25, 2021Date of Patent: August 27, 2024Assignee: Cisco Technology, Inc.Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen
-
Patent number: 12040894Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.Type: GrantFiled: January 9, 2023Date of Patent: July 16, 2024Assignee: CISCO TECHNOLOGY, INC.Inventors: Amir Salah Abdelsamie Abdelwahed, Ivana Balic, Yusuf Ziya Isik, Xuehong Mao, Samer Lutfi Hijazi
-
Publication number: 20240235727Abstract: In some aspects, the techniques described herein relate to a method including: encoding a current data portion to generate an encoded current data portion for inclusion in a data packet; encoding, based upon content of the current data portion, a forward error correction data portion for a previous data portion to generate an encoded forward error correction data portion; generating the data packet including the encoded current data portion and the encoded forward error correction data portion; and providing the data packet to a receiver.Type: ApplicationFiled: January 9, 2023Publication date: July 11, 2024Inventors: Amir Salah Abdelsamie Abdelwahed, Ivana Balic, Yusuf Ziya Isik, Xuehong Mao, Samer Lutfi Hijazi
-
Publication number: 20240195438Abstract: In some aspects, the techniques described herein relate to a method including: obtaining data to be compressed; determining a distance between the data to be compressed and each codeword of a plurality of codewords; selecting a predetermined number of codewords of the plurality of codewords based on the distance between the data to be compressed and each of the predetermined number of codewords; and generating compressed data, where the compressed data includes an indication of the predetermined number of codewords of the plurality of codewords.Type: ApplicationFiled: December 12, 2022Publication date: June 13, 2024Inventors: Yusuf Ziya Isik, Amir Salah Abdelsamie Abdelwahed, Xuehong Mao, Ivana M. Balic, Samer Lutfi Hijazi
-
Publication number: 20240161765Abstract: In one example embodiment, speech signals are received from a user during a communication session. The received speech signals contain noise including speech of other individuals. The received speech signals are transformed by a machine learning model to produce transformed speech signals corresponding to the received speech signals with a reduced amount of the noise. The machine learning model is trained with speech of the user satisfying a noise threshold and collected during one or more communication sessions.Type: ApplicationFiled: November 16, 2022Publication date: May 16, 2024Inventors: Kamil Krzysztof Wojcicki, Xuehong Mao, David Guoqing Zhang, Samer Hijazi, Raul Alejandro Casas
-
Publication number: 20220392478Abstract: An endpoint selectively enhances a captured audio signal based on an operating mode. The endpoint obtains an audio input signal of multiple users in a physical location. The audio input signal is captured by a microphone. The endpoint separates voice signals from the audio input signal and determines an operating mode for an audio output signal. The endpoint selectively adjusts each of the voice signals based on the operating mode to generate the audio output signal.Type: ApplicationFiled: September 10, 2021Publication date: December 8, 2022Inventors: Samer Lutfi Hijazi, Christopher Rowen, Xuehong Mao, Ivana M. Balic, Raul Alejandro Casas, Savita Kini
-
Patent number: 11132619Abstract: Some embodiments perform, in a multi-layer neural network in a computing device, a convolution operation on input feature maps with multiple convolutional filters. The convolutional filters have multiple filter precisions. In other embodiments, electronic design automation (EDA) systems, methods, and computer-readable media are presented for adding such a multi-layer neural network into an integrated circuit (IC) design.Type: GrantFiled: February 24, 2017Date of Patent: September 28, 2021Assignee: Cadence Design Systems, Inc.Inventors: Raúl Alejandro Casas, Samer Lutfi Hijazi, Piyush Kaul, Rishi Kumar, Xuehong Mao, Christopher Rowen
-
Publication number: 20210217436Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.Type: ApplicationFiled: March 25, 2021Publication date: July 15, 2021Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen
-
Patent number: 10997502Abstract: Some embodiments perform, in a multi-layer neural network in a computing device, optimization of the multi-layer neural network, for example by making a convolutional change with a first plurality of convolutional filters, or by making a connection change of a first plurality of convolutional filters. In other embodiments, electronic design automation (EDA) systems, methods, and computer-readable media are presented for adding such a multi-layer neural network into an integrated circuit (IC) design.Type: GrantFiled: April 13, 2017Date of Patent: May 4, 2021Assignee: Cadence Design Systems, Inc.Inventors: Raúl Alejandro Casas, Samer Lutfi Hijazi, Piyush Kaul, Rishi Kumar, Xuehong Mao, Christopher Rowen
-
Patent number: 10991379Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.Type: GrantFiled: June 22, 2018Date of Patent: April 27, 2021Assignee: BabbleLabs LLCInventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen
-
Patent number: 10534994Abstract: The present disclosure relates to a computer-implemented method for analyzing one or more hyper-parameters for a multi-layer computational structure. The method may include accessing, using at least one processor, input data for recognition. The input data may include at least one of an image, a pattern, a speech input, a natural language input, a video input, and a complex data set. The method may further include processing the input data using one or more layers of the multi-layer computational structure and performing matrix factorization of the one or more layers. The method may also include analyzing one or more hyper-parameters for the one or more layers based upon, at least in part, the matrix factorization of the one or more layers.Type: GrantFiled: November 11, 2015Date of Patent: January 14, 2020Assignee: Cadence Design Systems, Inc.Inventors: Piyush Kaul, Samer Lutfi Hijazi, Raul Alejandro Casas, Rishi Kumar, Xuehong Mao, Christopher Rowen
-
Publication number: 20190392852Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.Type: ApplicationFiled: June 22, 2018Publication date: December 26, 2019Inventors: Samer Hijazi, Xuehong Mao, Raul Alejandro Casas, Kamil Krzysztof Wojcicki, Dror Maydan, Christopher Rowen
-
Patent number: 10290107Abstract: Aspects of the present disclosure involve a transform domain regression convolutional neural network for image segmentation. Example embodiments include a system comprising a machine-readable storage medium storing instructions and computer-implemented methods for classifying one or more pixels in an image. The method may include analyzing the image to estimate one or more transform domain coefficients using a multi-layered function such as a convolutional neural network. The method may further include generating a segmented image by applying a change of basis transformation to the estimated one or more transform domain coefficients.Type: GrantFiled: June 19, 2017Date of Patent: May 14, 2019Assignee: Cadence Design Systems, Inc.Inventors: Raúl Alejandro Casas, Samer Lutfi Hijazi, Rishi Kumar, Piyush Kaul, Xuehong Mao, Christopher Rowen, Himanshu Charaya
-
Patent number: 8165098Abstract: A method allocates bandwidth from a radio frequency spectrum in a cellular network including a set of cells. Each cell includes a base station for serving a set of mobile stations in the cell. An area around each base station is partitioned into a center region and an edge region. In each base station, cell-center bandwidth for use by the mobile stations in the center region is reserved according to an inter-cell interference coordination (ICIC) protocol, and cell-edge bandwidth for use by the mobile stations in the edge region is reserved according to the ICIC protocol. The bandwidth can be fixed or adaptive to reduce the signaling overhead. The adaptive bandwidth can be further partitioned into reserved and the free bands. Mobile stations are classified as primary and secondary users, depending on whether they use or are assigned the fixed or adaptive band radio resources.Type: GrantFiled: December 15, 2008Date of Patent: April 24, 2012Assignee: Mitsubishi Electric Research Laboratories, Inc.Inventors: Koon Hoo Teo, Zhifeng Tao, Xuehong Mao, Amine Maaref