Patents Examined by Feng-Tzer Tzeng

Stereo signal encoding method and apparatus, and stereo signal decoding method and apparatus

Patent number: 11790923

Abstract: A stereo signal encoding method includes performing spectrum broadening on a quantized line spectral frequency (LSF) parameter of a primary channel signal in a current frame in a stereo signal to obtain a spectrum-broadened LSF parameter of the primary channel signal, determining a prediction residual of an LSF parameter of a secondary channel signal in the current frame based on an original LSF parameter of the secondary channel signal and the spectrum-broadened LSF parameter of the primary channel signal, and performing a quantization on the prediction residual of the LSF parameter of the secondary channel signal.

Type: Grant

Filed: August 23, 2022

Date of Patent: October 17, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Eyal Shlomot, Jonathan Alastair Gibbs, Halting Li

Two stage user customizable wake word detection

Patent number: 11783818

Abstract: Described herein are devices, methods, and systems for detecting a phrase from uttered speech. A processing device may determine a first model for phrase recognition based on a likelihood ratio using a set of training utterances. The set of utterances may be analyzed by the first model to determine a second model, the second model comprising a training state sequence for each of the set of training utterances, and wherein each training state sequence indicates a likely state for each time interval of a corresponding training utterance. A determination of whether a detected utterance corresponds to the phrase may be based on a concatenation of the first model and the second model.

Type: Grant

Filed: September 25, 2020

Date of Patent: October 10, 2023

Assignee: Cypress Semiconductor Corporation

Inventors: Robert Zopf, Ashutosh Pandey

Audio quality feedback during live transmission from a source

Patent number: 11785136

Abstract: A method and system are provided for audio quality feedback during live transmission from a source that is received at multiple audience devices. The method carried out at a server includes: obtaining audio information of an audio signal as received by at least some of the audience devices in a transmission session; classifying one or more subsets of the audience devices by one or more common factors per subset; and analyzing the obtained audio information from the audience devices in conjunction with the classifications of the subsets of the audience devices to determine one or more common factors that affect received audio quality at an identified subset of the audience devices classified by the one or more common factors. The method provides feedback of the one or more common factors to at least one of the audience devices in the identified subset or to the source device, or to both.

Type: Grant

Filed: October 29, 2020

Date of Patent: October 10, 2023

Assignee: International Business Machines Corporation

Inventors: Jenny Jing He, Adrian Kyte, Joseph R Winchester, Cheng Fang Wang, Ping Xiao

Audio signal encoding method and apparatus

Patent number: 11776553

Abstract: An encoding method includes determining an adaptive broadening factor based on a quantized line spectral frequency (LSF) vector of a first channel of a current frame of an audio signal and an LSF vector of a second channel of the current frame, and writing the quantized LSF vector and the adaptive broadening factor into a bitstream.

Type: Grant

Filed: October 10, 2022

Date of Patent: October 3, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Eyal Shlomot, Jonathan Alastair Gibbs, Halting Li

Convolutional neural network with phonetic attention for speaker verification

Patent number: 11776548

Abstract: Embodiments may include determination, for each of a plurality of speech frames associated with an acoustic feature, of a phonetic feature based on the associated acoustic feature, generation of one or more two-dimensional feature maps based on the plurality of phonetic features, input of the one or more two-dimensional feature maps to a trained neural network to generate a plurality of speaker embeddings, and aggregation of the plurality of speaker embeddings into a speaker embedding based on respective weights determined for each of the plurality of speaker embeddings, wherein the speaker embedding is associated with an identity of the speaker.

Type: Grant

Filed: February 7, 2022

Date of Patent: October 3, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yong Zhao, Tianyan Zhou, Jinyu Li, Yifan Gong, Jian Wu, Zhuo Chen

Electronic device and method for processing user utterance in the electronic device

Patent number: 11769503

Abstract: According to an embodiment, an electronic device comprises: a communication module comprising communication circuitry, a memory, and at least one processor.

Type: Grant

Filed: June 8, 2021

Date of Patent: September 26, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Hyunjin Kim, Hyunduk Cho, Heeyoung Choo, Jaeyoung Lee, Jungkun Lee

Automatically recognizing and surfacing important moments in multi-party conversations

Patent number: 11763823

Abstract: A system and a method are disclosed for identifying a subjectively interesting moment in a transcript. In an embodiment, a device receives a transcription of a conversation, and identifies a participant of the conversation. The device accesses a machine learning model corresponding to the participant, and applies, as input to the machine learning model, the transcription. The device receives as output from the machine learning model a portion of the transcription having relevance to the participant, and generates for display, to the participant, information pertaining to the portion.

Type: Grant

Filed: February 18, 2021

Date of Patent: September 19, 2023

Assignee: Outreach Corporation

Inventors: Krishnamohan Reddy Nareddy, Abhishek Abhishek, Rohit Ganpat Mane, Rajiv Garg

Hybrid voice command processing

Patent number: 11763814

Abstract: Digitized audio command is decoded to generate audio features. An in-domain confidence score is calculated for a model trained by a limited set of peripheral device commands. An out-domain confidence score is calculated for a model trained without the peripheral device commands. The best score determines whether to process the audio locally or at a remote server. In some embodiments, a likelihood ratio (LR) is calculated of the in-domain and out-domain confidence scores. Based on the likelihood ratio, a locally decoded audio command is performed, or the audio features are sent to a remote server for processing to determine the audio command.
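
The routing decision described in this abstract can be illustrated with a minimal sketch. It assumes the two confidence scores are positive likelihood-style values and that the likelihood ratio is compared against a fixed threshold; the function name `route_command`, the threshold value, and the small `eps` guard are hypothetical and do not come from the patent.

```python
def route_command(in_domain_score: float,
                  out_domain_score: float,
                  threshold: float = 1.0) -> str:
    """Decide where to process an audio command (illustrative only).

    in_domain_score:  confidence from the model trained on the limited
                      set of peripheral device commands.
    out_domain_score: confidence from the model trained without them.
    threshold:        assumed cut-off on the likelihood ratio.
    """
    eps = 1e-12  # assumed guard against division by zero
    likelihood_ratio = in_domain_score / max(out_domain_score, eps)
    # A high ratio suggests a known device command, so handle it locally;
    # otherwise the audio features would be sent to a remote server.
    return "local" if likelihood_ratio >= threshold else "remote"


if __name__ == "__main__":
    print(route_command(0.8, 0.2))
    print(route_command(0.1, 0.6))
```

In this sketch a single threshold stands in for whatever decision rule an actual embodiment uses.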

Type: Grant

Filed: June 21, 2021

Date of Patent: September 19, 2023

Assignee: Logitech Europe S.A.

Inventors: Arash Salarian, Milos Cernak, Pablo Mainar, Jean-Michael Chardon, Niccolò Antonello

Human-machine interfaces for utterance-based playlist selection

Patent number: 11755283

Abstract: Systems, methods, and devices for human-machine interfaces for utterance-based playlist selection are disclosed. In one method, a list of playlists is traversed and a portion of each is audibly output until a playlist command is received. Based on the playlist command, the traversing is stopped and a playlist is selected for playback. In examples, the list of playlists is modified based on a modification input.

Type: Grant

Filed: April 14, 2022

Date of Patent: September 12, 2023

Assignee: Spotify AB

Inventors: Daniel Bromand, Richard Mitic, Horia-Dragos Jurcut, Henriette Susanne Martine Cramer, Ruth Brillman

High resolution audio coding for improving package loss concealment

Patent number: 11749290

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing long-term prediction (LTP) are described. One example of the methods includes determining a pitch gain and a pitch lag of an input audio signal for at least a predetermined number of frames. It is determined that the pitch gain of the input audio signal has exceeded a predetermined threshold and that a change of the pitch lag of the input audio signal has been within a predetermined range for at least the predetermined number of frames. In response to determining that the pitch gain of the input audio signal has exceeded the predetermined threshold and that the change of the pitch lag has been within the predetermined range for at least the predetermined number of frames, a pitch gain is set for a current frame of the input audio signal.

Type: Grant

Filed: July 12, 2021

Date of Patent: September 5, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventor: Yang Gao

Multi-persona social agent

Patent number: 11748558

Abstract: A system providing a multi-persona social agent includes a computing platform having a hardware processor, a system memory storing a software code, and multiple neural network (NN) based predictive models accessible by the software code. The hardware processor executes the software code to receive input data corresponding to an interaction with a user, determine a generic expression for use in the interaction, and identify one of the character personas as a persona to be assumed by the multi-persona social agent. The software code also generates, using the generic expression and one of the NN based predictive models corresponding to the persona to be assumed by the multi-persona social agent, a sentiment driven personified response for the interaction with the user based on a vocabulary, phrases, and one or more syntax rules idiosyncratic to the persona to be assumed, and renders the sentiment driven personified response using the multi-persona social agent.

Type: Grant

Filed: October 27, 2020

Date of Patent: September 5, 2023

Assignee: Disney Enterprises, Inc.

Inventors: Sanchita Tiwari, Xiuyang Yu, Brian Kazmierczak, Dirk Van Dall

Custom convolutional neural network architectures for exposure detection

Patent number: 11741344

Abstract: A system is typically configured for customizing interconnectivity of one or more layers associated with a neural network architecture, wherein the neural network architecture is associated with an application, customizing functional transformation of the one or more layers associated with the neural network architecture, wherein each of the one or more layers comprises a custom transformation function, and generating a custom neural network architecture based on customizing the interconnectivity and the functional transformation of the one or more layers.

Type: Grant

Filed: December 9, 2019

Date of Patent: August 29, 2023

Assignee: BANK OF AMERICA CORPORATION

Inventors: Eren Kursun, Hongda Shen

Modifying follow on actions based on user activity

Patent number: 11721332

Abstract: In various embodiments, a voice command is transmitted while the user is performing an activity. The user’s activity is determined, along with follow on actions for the request. If the user is performing an interaction-limiting activity, subsequent actions may be presented in a simplified display to reduce the likelihood that the user interacts with the display. Additionally, certain follow on actions may be considered complex. If a complex follow on action occurs during an interaction-limiting activity, the request may be flagged and delayed until the user has completed the interaction-limiting activity.

Type: Grant

Filed: April 28, 2020

Date of Patent: August 8, 2023

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Ran Mokady, Kris Jones, Vishnu Vijay, Mark Elliott, John Stephen Mintoft

Speech model parameter estimation and quantization

Patent number: 11715477

Abstract: Quantizing speech model parameters includes, for each of multiple vectors of quantized excitation strength parameters, determining first and second errors between first and second elements of a vector of excitation strength parameters and, respectively, first and second elements of the vector of quantized excitation strength parameters, and determining a first energy and a second energy associated with, respectively, the first and second errors. First and second weights for, respectively, the first error and the second error, are determined and are used to produce first and second weighted errors, which are combined to produce a total error. The total errors of each of the multiple vectors of quantized excitation strength parameters are compared and the vector of quantized excitation strength parameters that produces the smallest total error is selected to represent the vector of excitation strength parameters.
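
The selection step in this abstract (compute a weighted error for each candidate vector, then keep the candidate with the smallest total) can be sketched as follows. This is a simplified illustration: it takes the energy of each error to be the squared error and accepts the weights as a precomputed input, since the abstract does not say how the weights are derived. The name `select_codevector` and the sample values are hypothetical.

```python
from typing import Sequence, Tuple


def select_codevector(target: Sequence[float],
                      codebook: Sequence[Sequence[float]],
                      weights: Sequence[float]) -> Tuple[int, float]:
    """Return (index, total_error) of the best candidate vector.

    target:   unquantized excitation strength parameters.
    codebook: candidate vectors of quantized parameters.
    weights:  one weight per element (assumed given, not derived here).
    """
    best_index, best_total = -1, float("inf")
    for index, candidate in enumerate(codebook):
        total = 0.0
        for t, c, w in zip(target, candidate, weights):
            error = t - c
            energy = error * error   # assumed energy measure of the error
            total += w * energy      # weighted error, summed into the total
        if total < best_total:
            best_index, best_total = index, total
    return best_index, best_total


if __name__ == "__main__":
    codebook = [(1.0, 0.0), (0.5, 0.5), (1.0, 0.25)]
    print(select_codevector((0.9, 0.2), codebook, (1.0, 2.0)))
```

The exhaustive search is the simplest way to show the comparison; a real codec might organize the search differently.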

Type: Grant

Filed: April 8, 2022

Date of Patent: August 1, 2023

Assignee: Digital Voice Systems, Inc.

Inventors: Daniel W. Griffin, John C. Hardwick

Locally distributed keyword detection

Patent number: 11710487

Abstract: In one aspect, a playback device includes at least one microphone configured to detect a voice input and generate sound input data. The playback device detects a first command keyword in the detected sound and, in response, makes a first determination, via a first local natural language unit (NLU), whether the input sound data includes at least one keyword within a first predetermined library of keywords. The playback device receives an indication of a second determination made by a second NLU that the input sound data includes at least one keyword from a second predetermined library of keywords. The playback device compares the results of the first determination and the second determination and, based on the comparison, foregoes further processing of the input sound data.

Type: Grant

Filed: October 4, 2021

Date of Patent: July 25, 2023

Assignee: Sonos, Inc.

Inventors: Nick D'Amato, Connor Kristopher Smith

Forecasting routines utilizing a mixer to combine deep neural network (DNN) forecasts of multi-variate time-series datasets

Patent number: 11704539

Abstract: Deep Neural Networks (DNNs) for forecasting future data are provided. In one embodiment, a non-transitory computer-readable medium is configured to store computer logic having instructions that, when executed, cause one or more processing devices to receive, at each of a plurality of Deep Neural Network (DNN) forecasters, an input corresponding to a time-series dataset of a plurality of input time-series datasets. The instructions further cause the one or more processing devices to produce, from each of the plurality of DNN forecasters, a forecast output and provide the forecast output from each of the plurality of DNN forecasters to a DNN mixer for combining the forecast outputs to produce one or more output time-series datasets.

Type: Grant

Filed: March 30, 2020

Date of Patent: July 18, 2023

Assignee: Ciena Corporation

Inventors: Maryam Amiri, Petar Djukic, Todd Morris

Systems and methods of audio decoder determination and selection

Patent number: 11699450

Abstract: Playback devices can support audio encoded using various encoding schemes. Playing back such content includes receiving, at a playback device, audio data from an audio source; and receiving an indication from the audio source that the audio data is encoded in the compressed audio format. The device determines, independently of receiving the indication from the audio source that the audio data is encoded in the compressed audio format, whether the audio data is encoded in a compressed audio format. If the audio data is determined to be encoded in the compressed audio format: the device selects a decoder from among a plurality of decoders; decodes the audio data using the selected decoder; and plays back the decoded audio data via the playback device. If the audio data is determined not to be encoded in the compressed audio format, the device inhibits playback of the audio data.

Type: Grant

Filed: May 3, 2022

Date of Patent: July 11, 2023

Assignee: Sonos, Inc.

Inventors: Nicholas Maniskas, Cameron Korb, Govind Jeyaram

Acoustic quality evaluation apparatus, acoustic quality evaluation method, and program

Patent number: 11699458

Abstract: To obtain an appropriate evaluation value in an acoustic quality evaluation by a conversational test. An acoustic quality evaluation apparatus 3 evaluates the acoustic quality of a call performed between a near-end terminal 1 and a far-end terminal 2 via a voice communication network 4. An evaluation value presenting unit 31 displays, on a display unit 13, evaluation categories obtained by classifying each of a plurality of evaluation viewpoints into a predetermined number of levels. An input unit 14 transmits the evaluation category selected by the evaluator for each of the evaluation viewpoints, to an evaluation value determination unit 32. The evaluation value determination unit 32 determines the lowest evaluation value among evaluation values assigned to the evaluation category received from the input unit 14 as a subjective evaluation value for acoustic quality.
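
The final step of this abstract, taking the lowest value across the evaluation viewpoints as the subjective evaluation value, can be sketched in a few lines. The viewpoint names and the 1 to 5 scale are assumptions made for illustration; the abstract only says that each viewpoint is classified into a predetermined number of levels.

```python
from typing import Mapping


def subjective_evaluation(selected: Mapping[str, int]) -> int:
    """Return the lowest evaluation value among the selected categories.

    selected maps each evaluation viewpoint (hypothetical names) to the
    value of the category the evaluator chose for it.
    """
    if not selected:
        raise ValueError("at least one evaluation viewpoint is required")
    return min(selected.values())


if __name__ == "__main__":
    ratings = {"echo": 4, "noise": 2, "delay": 5}
    print(subjective_evaluation(ratings))
```

Taking the minimum means one poorly rated viewpoint determines the overall value, which matches the rule stated in the abstract.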

Type: Grant

Filed: May 7, 2019

Date of Patent: July 11, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Sachiko Kurihara, Noboru Harada

Audio signal encoding and decoding method using learning model, training method of learning model, and encoder and decoder that perform the methods

Patent number: 11694703

Abstract: An audio signal encoding and decoding method using a learning model, a training method of the learning model, and an encoder and decoder that perform the methods, are disclosed. The audio signal decoding method may include extracting a first residual signal and a first linear prediction coefficient by decoding a bitstream received from an encoder, generating a first audio signal from the first residual signal using the first linear prediction coefficient, generating a second linear prediction coefficient and a second residual signal from the first audio signal, obtaining a third linear prediction coefficient by inputting the second linear prediction coefficient into a trained learning model, and generating a second audio signal from the second residual signal using the third linear prediction coefficient.

Type: Grant

Filed: February 15, 2022

Date of Patent: July 4, 2023

Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventors: Woo-taek Lim, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Inseon Jang

Audio processing in a low-bandwidth networked system

Patent number: 11694676

Abstract: The present disclosure is generally directed to a system to detect activation phrases within input audio signals transmitted over a low-bandwidth network. The system can use a two-stage activation phrase detection process. First, a sensing device, which can include a plurality of microphones for detecting an input audio signal, can detect an input audio signal that includes a candidate activation phrase. Second, the sensing device can transmit the recordings of the input audio signal to a client device for confirmation that the input audio signal includes the activation phrase.

Type: Grant

Filed: September 3, 2021

Date of Patent: July 4, 2023

Assignee: GOOGLE LLC

Inventors: Jeremy Payne, Tomer Amarilio
