Specialized Information Patents (Class 704/206)

Pitch (Class 704/207)

Voiced or unvoiced (Class 704/208)

Formant (Class 704/209)

Silence decision (Class 704/210)

Method and system for generating multimedia content

Patent number: 12347452

Abstract: A method for generating multimedia content is provided. The method includes receiving voice data that includes a recording of a voice of a user, receiving a selection of one character from among a plurality of characters from the user, and transmitting a multimedia content generated based on the voice data, a state of the user identified based on the voice data, and the selected character to another user.

Type: Grant

Filed: December 6, 2021

Date of Patent: July 1, 2025

Assignee: LY Corporation

Inventors: Marc Adrian Chua Lihan, Mao-Yuan Kao, Jing-Ya Huang, Cheng-Ho Chen, Kai-Ju Chang
Methods and systems for providing insights in real-time during a conversation

Patent number: 12315505

Abstract: The disclosure describes systems, methods, and media for generating real-time insights in a voice over internet protocol (VoIP) conversation. According to the methods, an application server receives a transcript of one or more voice utterances of a participant in the VoIP conversation, and identifies a context of the VoIP conversation and a first state of the context based on the transcript. The application server further receives an intent of the participant from a conversation artificial intelligence (AI) engine based on the transcript provided to the conversation AI engine. The application server further formulates one or more queries based on the intent, the context, and the first state of the context to retrieve one or more insights from one or more backend databases, and transmits the one or more insights to a terminal of at least one participants of the VoIP conversation for display.

Type: Grant

Filed: August 30, 2022

Date of Patent: May 27, 2025

Assignee: CLARI INC.

Inventors: Harsha Kudligi Anantha, Subodh Kishorilal Sah, Rashmi Shekar, Shailesh Patil, Shreyas Shankar, Kyle Buza, Jayanth Mohana Krishna
Emergency communication system with contextual snippets

Patent number: 11825020

Abstract: Systems and methods for processing emergency communications are provided. A system may receive an emergency communication initiated by an emergency communicator. The system may detect a data field action in response to an emergency receiver entering a data input based on the emergency communication. The system may capture a timestamp of when the data field action occurred. The system may generate a communication snippet based on the action timestamp and a snippet length. The communication snippet may be configured to provide context from the emergency communication to the data input. The system may transmit the communication snippet and the data input to an emergency responder.

Type: Grant

Filed: December 22, 2020

Date of Patent: November 21, 2023

Assignee: Axon Enterprise, Inc.

Inventors: Anshuman Srivastava, Michael Bauer, Joseph Pepper
Emergency communication system with contextual snippets

Patent number: 11758045

Abstract: Systems and methods for processing emergency communications are provided. A system may receive an emergency communication initiated by an emergency communicator. The system may detect a data field action in response to an emergency receiver entering a data input based on the emergency communication. The system may capture a timestamp of when the data field action occurred. The system may generate a communication snippet based on the action timestamp and a snippet length. The communication snippet may be configured to provide context from the emergency communication to the data input. The system may transmit the communication snippet and the data input to an emergency responder.

Type: Grant

Filed: December 22, 2020

Date of Patent: September 12, 2023

Assignee: Axon Enterprise, Inc.

Inventors: Anshuman Srivastava, Michael Bauer, Joseph Pepper
Adaptive noise cancelling for conferencing communication systems

Patent number: 11657829

Abstract: A communication system with a noise cancellation (NC) assembly providing adaptive or dynamic noise cancellation. The NC assembly includes a localizer module determining, during a communication session (active speaking or during idle times), a location of the active talker. The NC assembly includes a beam generator forming a beam in the determined direction of the active talker to enhance the active talker speech. Once the NC assembly has determined the position of the active talker, the NC assembly assigns a microphone of the microphone array or generated beam in that active direction to be the “active signal” source. The NC assembly assigns a second microphone or beam to be the noise source for NC purposes, and this source may be selected to be in acoustic shadow of the first microphone used as the active signal source or may be the farthest away in its position from the active talker's position.

Type: Grant

Filed: April 28, 2021

Date of Patent: May 23, 2023

Assignee: Mitel Networks Corporation

Inventors: Mirjana Popovic, Dieter Schulz, Roger Bastin, Andrew Wu, Logendra Naidoo
Robustness score for an opaque model

Patent number: 11636284

Abstract: A method, system and computer-readable storage medium for performing a cognitive information processing operation.

Type: Grant

Filed: August 5, 2022

Date of Patent: April 25, 2023

Assignee: Tecnotree Technologies, Inc.

Inventors: Joydeep Ghosh, Jessica Henderson, Matthew Sanchez
Pulse oximetry system

Patent number: 11602311

Abstract: In one aspect, a computer-implemented method includes receiving signals corresponding to wavelengths of light detected by an optical sensor placed in proximity to a patient's body, and for each received signal: separating the signal into an AC signal and a DC signal; separating the AC signal into component signals; analyzing the component signals through a fractional phase transformation to identify a desired component signal and harmonic signals associated with the desired component signal; smoothing the desired component signal, the harmonic signals, and the DC signal; and combining the smoothed desired component signal, the smoothed harmonic signals, and the smoothed DC signal to generate a modulation signal. A modulation ratio signal is generated based on the modulation signals derived from the signals, and a peripheral oxygen saturation (SpO2) of the patient's body is determined based on the modulation ratio signal.

Type: Grant

Filed: January 29, 2019

Date of Patent: March 14, 2023

Assignee: MURATA VIOS, INC.

Inventors: Scott Thomas Mazar, Carlos A. Ricci, Vladimir V. Kovtun
Device for processing data of rolling stock

Patent number: 11606433

Abstract: A device configured to process data comprised in data messages passing on message buses of a rolling stock comprises: a universal input interface receiving data messages complying with the three following physical layers: RS232; RS485; CAN. From the message buses, the data messages comprise data; a processing engine receiving a remote requested configuration comprising one or more processing rules; a standardizing unit decoding the data messages into standardized data streams in function of the remote requested configuration; and wherein the processing engine further applies one or more of the one or more processing rules of the standardized data streams in function of the remote requested configuration.

Type: Grant

Filed: March 12, 2019

Date of Patent: March 14, 2023

Assignee: RAILNOVA SA

Inventor: Charles-Henri Mousset
Detecting user identity in shared audio source contexts

Patent number: 11361770

Abstract: Computerized systems are provided for determining an identity of one or more users that use a same audio source, such as a microphone. The identity of one or more users that use a same audio source can based on generating a list of participant candidates who are likely to participate in an associated event, such as a meeting. For instance, embodiments can generate one or more network graphs of a meeting invitee any only voice input samples of the meeting invitee's N closest connections are compared to an utterance to determine the identity of the user associated with the utterance. One or more indicators that identify the users who are using the same audio source, as well as additional information or metadata associated with the identified user can be caused to be presented.

Type: Grant

Filed: June 30, 2020

Date of Patent: June 14, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Tom Neckermann, Alexander J. Wilson, Romain Gabriel Paul Rey
Determination of spatial audio parameter encoding and associated decoding

Patent number: 11328735

Abstract: An apparatus for spatial audio signal encoding, the apparatus comprising at least one processor and at least one memory including a computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to: determine, for two or more audio signals, at least one spatial audio parameter for providing spatial audio reproduction, the at least one spatial audio parameter comprising a direction parameter with an elevation and an azimuth component; define a spherical grid generated by covering a sphere with smaller spheres, wherein the centres of the smaller spheres define points of the spherical grid; and convert the elevation and azimuth component of the direction parameter to an index value based on the defined spherical grid.

Type: Grant

Filed: November 10, 2017

Date of Patent: May 10, 2022

Assignee: NOKIA TECHNOLOGIES OY

Inventors: Lasse Juhani Laaksonen, Anssi Sakari Rämö, Adriana Vasilache, Mikko Tammi, Miikka Vilermo
Information processing device using recognition difficulty score and information processing method

Patent number: 11301615

Abstract: [Object] To achieve displaying of a text in a more flexible and highly readable manner in accordance with a situation. [Solution] According to the present disclosure, an information processing device is provided. The information processing device includes a calculator that calculates, on the basis of context data to be entered, a recognition difficulty score used for display control of a target text. An information processing method is further provided. The information processing method includes allowing a processor to calculate, on the basis of context data to be entered, a recognition difficulty score used for display control of a target text.

Type: Grant

Filed: January 23, 2018

Date of Patent: April 12, 2022

Assignee: SONY CORPORATION

Inventors: Shinichi Kawano, Yuhei Taki, Masaki Takase, Akira Miyashita, Naoki Tokiwa, Nodoka Tokunaga
Audio encoder and decoder for interleaved waveform coding

Patent number: 11145318

Abstract: There is provided methods and apparatuses for decoding and encoding of audio signals. In particular, a method for decoding includes receiving a waveform-coded signal having a spectral content corresponding to a subset of the frequency range above a cross-over frequency. The waveform-coded signal is interleaved with a parametric high frequency reconstruction of the audio signal above the cross-over frequency. In this way an improved reconstruction of the high frequency bands of the audio signal is achieved.

Type: Grant

Filed: October 24, 2018

Date of Patent: October 12, 2021

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Robin Thesing, Harald Mundt, Heiko Purnhagen, Karl Jonas Roeden
Interactive music audition method, apparatus and terminal

Patent number: 11114079

Abstract: An interactive music audition method, apparatus and terminal are provided. The method includes: generating audition inquiry information according to audition requirement information, wherein the audition inquiry information includes a plurality of audition music options associated with the audition requirement information; generating a plurality of audition inquiry voices corresponding to the respective audition music options based on the audition inquiry information, and playing the generated audition inquiry voices; acquiring music selection information for the generated audition inquiry voices; and playing audition music according to the music selection information. Not only the interaction experience between a user and a smart device is improved, but also the accuracy of mining a user's interest is increased.

Type: Grant

Filed: November 18, 2019

Date of Patent: September 7, 2021

Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.

Inventors: Jianlong Li, Shiquan Ye, Xiangtao Jiang, Hao Yang, Zhendong Ma, Huajian Liu
Parallel analysis of different sampling rates in a touch screen controller

Patent number: 11086448

Abstract: A touch screen controller disclosed herein includes a circuit configured to generate a digital touch voltage comprises of samples, at a base sampling rate. The touch screen controller also includes a digital processing unit configured to analyze a first subset of samples of the digital touch voltage samples to determine noise content thereof, the first subset of samples corresponding to samples at a first investigated sampling rate that is a first function of the base sampling rate. The digital processing unit is also configured to analyze a second subset of samples of the digital touch voltage to determine noise content thereof, with the second subset of samples corresponding to samples at a second investigated sampling rate that is a second function of the base sampling rate, and determine a preferred sampling rate from among the first and second investigated sampling rates as a function of determined noise content thereof.

Type: Grant

Filed: December 14, 2016

Date of Patent: August 10, 2021

Assignee: STMicroelectronics Asia Pacific Pte Ltd

Inventors: Leonard Liviu Dinu, Hugo Gicquel
Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters

Patent number: 11043226

Abstract: An apparatus for encoding an audio signal includes: a converter for converting the audio signal into a spectral representation; a scale parameter calculator for calculating a first set of scale parameters from the spectral representation: a downsampler for downsampling the first set of scale parameters to obtain a second set of scale parameters, a second number of scale parameters in the second set of scale parameters being lower than a first number of scale parameters in the first set of scale parameters; a scale parameter encoder for generating an encoded representation of the second set of scale parameters; a spectral processor for processing the spectral representation using a third set of scale parameters, the third set of scale parameters having a third number of scale parameters being greater than the second number of scale parameters, the spectral processor being configured to use the first set of scale parameters or to derive the third set of scale parameters from the second set of scale parameters o

Type: Grant

Filed: April 27, 2020

Date of Patent: June 22, 2021

Assignee: Fraunhofer-Gesellschaft zur Forderung der angewandten Forschung e.V.

Inventors: Emmanuel Ravelli, Markus Schnell, Conrad Benndorf, Manfred Lutzky, Martin Dietz, Srikanth Korse
Method for determining sound and device therefor

Patent number: 10839827

Abstract: A sound discriminating method comprises sensing a sound signal; changing the sensed sound signal into an electrical signal; and determining whether the electrical signal is a predetermined sound by analyzing the electrical signal.

Type: Grant

Filed: June 26, 2015

Date of Patent: November 17, 2020

Assignee: Samsung Electronics Co., Ltd.

Inventors: Do-hyung Kim, Seok-hwan Jo, Jae-hyun Kim
Sound collection apparatus, method of controlling sound collection apparatus, and non-transitory computer-readable storage medium

Patent number: 10812898

Abstract: A sound collection direction is decided based upon an area of an object in a captured image obtained by image capturing of a periphery and a sound collection target position input as a position of a sound collection target. A noise direction is decided based upon an arrangement of the object in the captured image. A sound collected from the periphery is separated into a sound in the sound collection direction and a sound in the noise direction, and noise canceling on the sound in the sound collection direction is performed using the sound in the noise direction.

Type: Grant

Filed: June 20, 2019

Date of Patent: October 20, 2020

Assignee: CANON KABUSHIKI KAISHA

Inventor: Tomohiko Kuroki
Method of training a sound event recognition system

Patent number: 10783434

Abstract: A method of training a non-verbal sound class detection machine learning system, the non-verbal sound class detection machine learning system comprising a machine learning model configured to: receive data for each frame of a sequence of frames of audio data obtained from an audio signal; for each frame of the sequence of frames: process the data for multiple frames; and output data for at least one sound class score representative of a degree of affiliation of the frame with at least one sound class of a plurality of sound classes, wherein the plurality of sound classes comprises: one or more target sound classes; and a non-target sound class representative of an absence of each of the one or more target sound classes; wherein the method comprises: training the machine learning model using a loss function.

Type: Grant

Filed: October 7, 2019

Date of Patent: September 22, 2020

Assignee: AUDIO ANALYTIC LTD

Inventors: Christopher James Mitchell, Sacha Krstulovic, Cagdas Bilen, Juan Azcarreta Ortiz, Giacomo Ferroni, Arnoldas Jasonas, Francesco Tuveri
Encoding method, decoding method, encoding apparatus, and decoding apparatus

Patent number: 10770085

Abstract: An encoding method, a decoding method, an encoding apparatus, a decoding apparatus, a transmitter, a receiver, and a communications system, where the encoding method includes dividing a to-be-encoded time-domain signal into a low band signal and a high band signal, performing encoding on the low band signal to obtain a low frequency encoding parameter, performing encoding on the high band signal to obtain a high frequency encoding parameter, obtaining a synthesized high band signal; performing short-time post-filtering processing on the synthesized high band signal to obtain a short-time filtering signal, and calculating a high frequency gain based on the high band signal and the short-time filtering signal.

Type: Grant

Filed: January 3, 2019

Date of Patent: September 8, 2020

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Bin Wang, Zexin Liu, Lei Miao
Real time popularity based audible content acquisition

Patent number: 10762889

Abstract: A personalized news service provides personalized news programs for its users by generating personalized combinations of audible versions of news stories derived from text-based based versions of the news stories. The audible versions may be generated from the text-based version by a text-to-speech system, or may by recording a person reading aloud the text-based version. To acquire recordings, the personalized news service can make a determination that a particular news story has a threshold extent of popularity. The news service can then transmit a request to a remote recording station for a recording of a verbal reading of the particular news story. The news service can then receive the requested recording from the remote recording station.

Type: Grant

Filed: December 31, 2018

Date of Patent: September 1, 2020

Assignee: Gracenote Digital Ventures, LLC

Inventors: Venkatarama Anilkumar Panguluri, Venkata Sunil Kumar Yarram, Lalit Kumar, Gregory P. Defouw
Encoding or decoding of audio signals

Patent number: 10734001

Abstract: A device includes a receiver and a decoder. The receiver is configured to receive one or more upmix parameters, one or more inter-channel bandwidth extension parameters, one or more inter-channel prediction gain parameters, and an encoded audio signal. The encoded audio signal includes an encoded mid signal. The decoder is configured to generate a synthesized mid signal based on the encoded mid signal. The decoder is also configured to generate a synthesized side signal based on the synthesized mid signal and the one or more inter-channel prediction gain parameters. The decoder is further configured to generate one or more output signals based on the synthesized mid signal, the synthesized side signal, the one or more upmix parameters, and the one or more inter-channel bandwidth extension parameters.

Type: Grant

Filed: September 28, 2018

Date of Patent: August 4, 2020

Assignee: Qualcomm Incorporated

Inventors: Venkatraman Atti, Venkata Subramanyam Chandra Sekhar Chebiyyam
Smart nasometer

Patent number: 10692397

Abstract: A smart nasometer according to an embodiment of the present invention includes: a hardware unit worn on a head of a user for measuring nasal and oral sounds and providing feedback for the user; and a computational unit for receiving and processing speech signals of the nasal and oral sounds measured by the hardware unit, wherein the hardware unit includes: a microphone unit for separately measuring the nasal and oral sounds in a non-touched state of the user's philtrum, wherein the computational unit includes: a nasalance adjustment unit for adjusting a nasalance of the nasal and oral sounds measured by the microphone unit.

Type: Grant

Filed: August 25, 2017

Date of Patent: June 23, 2020

Assignees: POSTECH ACADEMY-INDUSTRY FOUNDATION, INDUSTRIAL COOPERATION FOUNDATION OF CHONBUK NATIONAL UNIVERSITY, CHONBUK NATIONAL UNIVERSITY HOSPITAL

Inventors: Heecheon You, Myoung-Hwan Ko, Jong-Kwan Park, Younggeun Choi, Hyun Gi Kim, Han Soo Lee, Gradiyan Budi Pratama, Min-Jung Yu, Ki Wook Kim, Yun Ju Jo, Jin Kook Lee
Regeneration of wideband speech

Patent number: 10657984

Abstract: A method of regenerating wideband speech from narrowband speech, the method comprising: receiving samples of a narrowband speech signal having a first range of frequencies; identifying, based on a characteristic of the narrowband speech signal, frequencies in the first range of frequencies to translate into a target band of a regenerated speech signal; modulating the identified frequencies in the first range of frequencies of the received samples of the narrowband speech signal with a modulation signal, the modulation signal having a modulating frequency adapted to upshift the identified frequencies in the first range of frequencies into the target band; filtering the modulated samples, using a target band filter, to form the regenerated speech signal in the target band; and combining the narrowband speech signal with the regenerated speech signal to produce a new wideband speech signal.

Type: Grant

Filed: March 12, 2018

Date of Patent: May 19, 2020

Assignee: SKYPE

Inventors: Mattias Nilsson, Soren Vang Andersen, Koen Bernard Vos
Speech signal processing method and speech signal processing apparatus

Patent number: 10600405

Abstract: A speech signal processing method of a user terminal includes: receiving a speech signal, detecting a personalized information section including personal information in the speech signal, performing data processing on the personalized information section of the speech signal by using a personalized model generated based on the personal information, and receiving, from a server, a result of the data processing performed by the server on a general information section of the speech signal that is different than the personalized information section of the speech signal.

Type: Grant

Filed: April 30, 2019

Date of Patent: March 24, 2020

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Tae-yoon Kim, Sang-ha Kim, Sung-Soo Kim, Jin-sik Lee, Chang-woo Han, Eun-kyoung Kim, Jae-won Lee
Speech analysis and synthesis method based on harmonic model and source-vocal tract decomposition

Patent number: 10586526

Abstract: This invention discloses a speech analysis/synthesis method and a simplified form of such a method. Based on a harmonic model, the present method decomposes the parameters of the harmonic model into glottal source characteristics and vocal tract characteristics in its analysis stage and recombines the glottal source and vocal tract characteristics into harmonic model parameters in its synthesis stage.

Type: Grant

Filed: December 10, 2015

Date of Patent: March 10, 2020

Inventor: Kanru Hua
Pitch extraction device and pitch extraction method by encoding a bitstream organized into equal sections according to bit values

Patent number: 10515656

Abstract: A pitch extraction device includes a processor configured to perform a process including: dividing a first bit stream in encoded data into a plurality of sections each having a prescribed section length, the encoded data being obtained by performing entropy encoding on a residual signal calculated by performing linear prediction analysis on a sound signal; allocating a first value or a second value to each of the plurality of sections in the first bit stream in accordance with a bit value in each of the plurality of sections; generating a second bit stream obtained by re-encoding the first bit stream according to the first value and the second value that have been allocated to each of the plurality of sections in the first bit stream; and calculating a fundamental frequency of the sound signal in accordance with an autocorrelation of the second bit stream.

Type: Grant

Filed: September 28, 2017

Date of Patent: December 24, 2019

Assignee: FUJITSU LIMITED

Inventors: Akira Kamano, Yohei Kishi, Takeshi Otani
Speech audio encoding device, speech audio decoding device, speech audio encoding method, and speech audio decoding method

Patent number: 10510354

Abstract: A speech/audio coding apparatus includes a receiver that receives a time-domain speech input signal. The apparatus also includes a processor that transforms a time-domain speech input signal into a frequency-domain spectrum, and divides a frequency region of the spectrum in an extended band into a plurality of bands. The processor sets a limited band for each divided band in the current frame, a width of the limited band in the current frame being narrower than the divided band and the limited band including a first frequency. The processor further encodes the spectrum in the limited band within each divided band in the current frame, wherein the width of the limited band is predetermined and is set to 31.

Type: Grant

Filed: January 9, 2019

Date of Patent: December 17, 2019

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Takuya Kawashima, Masahiro Oshikiri
Method and device for processing speech based on artificial intelligence

Patent number: 10475484

Abstract: The present disclosure discloses a method including: performing a silence detection on a speech to be decoded; cutting the speech to be decoded off to obtain a target speech if detecting that the speech to be detected is a silent speech; resetting tail features of the target speech with preset tail features of silent frames; and performing a CTC decoding process on the target speech reset. In embodiments, when a large number of blank frames are carried in the speech to be decoded, the speech to be decoded is cut off, and the tail features of the target speech is placed with the tail features of the silent frames such that there may be one CTC peak when the CTC decoding process is performed on the tail features of the target speech. Therefore, a last word of text content may be displayed rapidly on a screen.

Type: Grant

Filed: June 30, 2017

Date of Patent: November 12, 2019

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.

Inventors: Zhijian Wang, Sheng Qian
Image data transfer system, transmitter circuit and receiver circuit

Patent number: 10446111

Abstract: An image data transfer system includes a receiver and a transmitter configured to sequentially receive compressed image data and sequentially transmit transmission data corresponding to the compressed image data to the receiver. The transmitter is configured to, in transmitting a specific transmission data, perform data comparison of bits of a compressed image body data of a specific compressed image data with bits of a previous transmission data transmitted over signal lines allocated to the compressed image body data, incorporate the compressed image body data of the specific compressed image data or the bit-inverted data corresponding thereto into the specific transmission data, in response to the result of the data comparison, and incorporate the compression code of the specific compressed image data into the specific transmission data independently of the result of the data comparison.

Type: Grant

Filed: January 23, 2017

Date of Patent: October 15, 2019

Assignee: Synaptics Japan GK

Inventors: Hirobumi Furihata, Masashige Harada, Iori Shiraishi, Takashi Nose
Speech/audio encoding apparatus and method thereof

Patent number: 10446159

Abstract: A speech/audio encoding device for selectively allocating bits for higher precision encoding. The speech/audio encoding device receives a time-domain speech/audio input signal, transforms the speech/audio input signal into a frequency domain, and quantizes an energy envelope corresponding to an energy level for a frequency spectrum of the speech/audio input signal. The speech/audio encoding device further groups quantized energy envelopes into a plurality of groups, determines a perceptual significant group including one or more significant bands and a local-peak frequency, and allocates bits to a plurality of subbands corresponding to the grouped quantized energy envelopes, in which each of the subbands is obtained by splitting the frequency spectrum of the speech/audio input signal. The speech/audio encoding device encodes the frequency spectrum using the bits allocated to the subbands.

Type: Grant

Filed: November 22, 2016

Date of Patent: October 15, 2019

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Takuya Kawashima, Masahiro Oshikiri
Noise reference estimation for noise reduction

Patent number: 10418048

Abstract: A device for noise estimation comprises a first microphone capturing a nominal speech signal, and a second microphone capturing a nominal noise signal. A generalized sidelobe canceller of the device applies spatial noise reduction, and comprises a blocking matrix filter to adaptively process the nominal speech signal to produce a speech cancellation signal, a node for subtracting the speech cancellation signal from the nominal noise signal to produce a noise reference signal, a noise cancellation filter to adaptively filter the noise reference signal to produce a noise cancellation signal; and a node for subtracting the noise cancellation signal from the nominal speech signal to produce a speech reference signal.

Type: Grant

Filed: April 30, 2018

Date of Patent: September 17, 2019

Assignee: Cirrus Logic, Inc.

Inventors: Benjamin Hutchins, Brenton Robert Steele
Diagram building system and method for a signal data decomposition and analysis

Patent number: 10354422

Abstract: The present invention provides a diagram building system adapted for processing a signal with a time period. The diagram building system comprises a inputting device for receiving the signal; a computing device, dividing the signal into a plurality of window scales according to one of time interval scales; decomposing the window scales via HHT algorithm to generate a plurality of quantized windows according to different components; then, calculating the value of quantized windows with the same single-frequency component through a quantifying function to generate a plurality of specific frequency values; an outputting device, sequentially arranging the specific frequency values according to the time interval scales and the single-frequency components to form a visual diagram.

Type: Grant

Filed: April 4, 2016

Date of Patent: July 16, 2019

Assignee: NATIONAL CENTRAL UNIVERSITY

Inventors: Norden E. Huang, Bo-Jau Kuo, Yu-Cheng Lin, Chung-Kang Peng, Men-Tzung Lo
Filter coefficient updating in time domain filtering

Patent number: 10332540

Abstract: Example embodiments disclosed herein relate to filter coefficient updating in time domain filtering. A method of processing an audio signal is disclosed. The method includes obtaining a predetermined number of target gains for a first portion of the audio signal by analyzing the first portion of the audio signal. Each of the target gains is corresponding to a linear subband of the audio signal. The method also includes determining a filter coefficients for time domain filtering the first portion of the audio signal so as to approximate a frequency response given by the target gains. The filter coefficients are determined by iteratively selecting at least one target gain from the target gains and updating the filter coefficient based on the selected at least one target gain. Corresponding system and computer program product for processing an audio signal are also disclosed.

Type: Grant

Filed: September 15, 2016

Date of Patent: June 25, 2019

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Dong Shi, Xuejing Sun
Customizing actions based on contextual data and voice-based inputs

Patent number: 10319383

Abstract: Methods and systems are provided for customizing an action. In some implementations, voice input is received from a user and a context is determined from the voice input. Potential contextual data is identified based on the context and the voice input. A level of confidence is determined for an association of the potential contextual data and the context. An action is performed based on the voice input, the potential contextual data, and the level of confidence. The potential contextual data is used to customize the action.

Type: Grant

Filed: August 24, 2018

Date of Patent: June 11, 2019

Assignee: Google LLC

Inventors: Zoltan Stekkelpak, Gyula Simonyi
Speech audio encoding device, speech audio decoding device, speech audio encoding method, and speech audio decoding method

Patent number: 10210877

Abstract: A speech/audio decoding apparatus is provided that includes a receiver that receives encoded data including a limited-band mode flag, and a memory that stores information on a position of a maximum amplitude spectrum frequency of a previous frame in a divided band. The speech/audio decoding apparatus also includes a processor that identifies whether a decoding band is encoded using a limited-band mode based on the decoded limited-band mode flag. Additionally, the processor decodes the spectrum in a limited band within each of the divided bands in a current frame using the stored information. Furthermore, the limited-band mode is set at an encoder side, when a difference between a first frequency with a first maximum amplitude in a spectrum of the divided band in a preceding frame and a second frequency with a second maximum amplitude in a spectrum of the divided band in the current frame is below a threshold.

Type: Grant

Filed: December 20, 2017

Date of Patent: February 19, 2019

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Takuya Kawashima, Masahiro Oshikiri
Method and apparatus for encoding/decoding an audio signal

Patent number: 10186273

Abstract: Provided are a method and apparatus for encoding an audio signal and a method and apparatus for decoding an audio signal, in which errors generated during encoding and decoding of the audio signal are reduced to enhance the audio quality of a reconstructed audio signal. The method of encoding the audio signal includes detecting a pitch of the audio signal, determining a filter coefficient based on the detected pitch, performing second filtering on the audio signal, based on the determined filter coefficient; and encoding an audio signal resulting from the second filtering.

Type: Grant

Filed: November 25, 2014

Date of Patent: January 22, 2019

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Nam-suk Lee, Hyun-wook Kim
Audio encoder, audio decoder and related methods using two-channel processing within an intelligent gap filling framework

Patent number: 10134404

Abstract: An apparatus for generating a decoded two-channel signal includes: an audio processor for decoding an encoded two-channel signal to obtain a first set of first spectral portions; a parametric decoder for providing parametric data for a second set of second spectral portions and a two-channel identification identifying either a first or a second different two-channel representation for the second spectral portions; and a frequency regenerator for regenerating a second spectral portion depending on a first spectral portion of the first set of first spectral portions, the parametric data for the second portion and the two-channel identification for the second portion.

Type: Grant

Filed: January 19, 2016

Date of Patent: November 20, 2018

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Frederik Nagel, Ralf Geiger, Balaji Nagendran Thoshkahna, Konstantin Schmidt, Stefan Bayer, Christian Neukam, Bernd Edler, Christian Helmrich
Windowing methods for efficient channel aggregation and deaggregation

Patent number: 10117247

Abstract: A method implemented in a fronthaul communication unit, comprising applying, via a processor of the fronthaul communication unit, a plurality of first frequency-domain windowing (FDW) functions on a plurality of first communication channel signals to produce a plurality of first windowed signals, aggregating, via the processor, the plurality of first windowed signals to produce a first aggregated signal, and transmitting, via a frontend of the fronthaul communication unit, the first aggregated signal to a corresponding fronthaul communication unit over a fronthaul communication link to facilitate fronthaul communication.

Type: Grant

Filed: March 1, 2016

Date of Patent: October 30, 2018

Assignee: Futurewei Technologies, Inc.

Inventors: Huaiyu Zeng, Xiang Liu
Method and apparatus for concealing frame error and method and apparatus for audio decoding

Patent number: 10096324

Abstract: A frame error concealment (FEC) method is provided. The method includes: selecting an FEC mode based on states of a current frame and a previous frame of the current frame in a time domain signal generated after time-frequency inverse transform processing; and performing corresponding time domain error concealment processing on the current frame based on the selected FEC mode, wherein the current frame is an error frame or the current frame is a normal frame when the previous frame is an error frame.

Type: Grant

Filed: January 30, 2017

Date of Patent: October 9, 2018

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Ho-sang Sung, Nam-suk Lee
System, method and computer program product for creating a summarization from recorded audio of meetings

Patent number: 10089290

Abstract: A meeting summarization method, system, and computer program product, include recording meeting audio of a meeting, capturing notes including a time stamp from each of a plurality of users associated with the meeting, synchronizing the recorded meeting audio of the meeting and each of the notes of each of the plurality of users based on a correlation between the time stamp, and analyzing the synchronized meeting audio and notes to determine highlights of the meeting based on a co-occurrence of notes between the plurality of users.

Type: Grant

Filed: October 17, 2017

Date of Patent: October 2, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Keith William Grueneberg, Jason Crawford, Jonathan Lenchner, Satya V. Nitta, Christian Makaya, Sharad C. Sundararajan
Customizing actions based on contextual data and voice-based inputs

Patent number: 10062383

Abstract: Methods and systems are provided for customizing an action. In some implementations, voice input is received from a user and a context is determined from the voice input. Potential contextual data is identified based on the context and the voice input. A level of confidence is determined for an association of the potential contextual data and the context. An action is performed based on the voice input, the potential contextual data, and the level of confidence. The potential contextual data is used to customize the action.

Type: Grant

Filed: November 20, 2017

Date of Patent: August 28, 2018

Assignee: Google LLC

Inventors: Zoltan Stekkelpak, Gyula Simonyi
Frequency envelope vector quantization method and apparatus

Patent number: 10032460

Abstract: Embodiments of the present application proposes a frequency envelope vector quantization method and apparatus, where the method includes: dividing N frequency envelopes in one frame into N1 vectors; quantizing a first vector in the N1 vectors by using a first codebook, to obtain a code word corresponding to the quantized first vector, where the first codebook is divided into 2B1 portions; determining, according to the code word corresponding to the quantized first vector; determining a second codebook according to the codebook of the ith portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.

Type: Grant

Filed: September 26, 2017

Date of Patent: July 24, 2018

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Chen Hu, Lei Miao, Zexin Liu
Method and apparatus for encoding stereo phase parameter

Patent number: 10008211

Abstract: Present disclosure discloses a method and an apparatus for encoding a stereo phase parameter, which relate to the field of information technologies and can improve an effect of stereo audio phase information. The method includes: first, acquiring a global stereo phase parameter of a current frame; then, determining a value of the global stereo phase parameter of the current frame, and adjusting the value of the global stereo phase parameter of the current frame according to a determining result of the value of the global stereo phase parameter of the current frame; and finally, encoding an adjusted value of the global stereo phase parameter of the current frame. The embodiments of the present disclosure are applicable to recovering stereo phase information.

Type: Grant

Filed: May 13, 2016

Date of Patent: June 26, 2018

Assignee: Huawei Technologies Co., Ltd.

Inventors: Xingtao Zhang, Lei Miao, Wenhai Wu
Apparatus and method for transmitting and receiving streaming data using multiple paths

Patent number: 9973555

Abstract: The present invention relates to an apparatus and method for transmitting/receiving streaming data using multiple paths, in which the streaming data is smoothly reproduced without being interrupted, and more particularly, to an apparatus and method for transmitting/receiving streaming data using multiple paths, in which exchange of the streaming data is performed in real-time using the multiple paths regardless of obstacles. The method for transmitting streaming data using multiple paths includes managing and maintaining a path list including sequence information about a transmission path capable of transmitting data, framing the streaming data, and transmitting the framed streaming data via the transmission path according to the sequence information.

Type: Grant

Filed: April 20, 2016

Date of Patent: May 15, 2018

Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventors: Hyoung Jin Kwon, Jin Kyeong Kim, Woo Yong Lee, Kyeongpyo Kim
Method and apparatus for encoding/decoding speech signal using coding mode

Patent number: 9928843

Abstract: An apparatus and a method to encode and decode a speech signal using an encoding mode are provided. An encoding apparatus may select an encoding mode of a frame included in an input speech signal, and encode a frame having an unvoiced mode for an unvoiced speech as the selected encoding mode.

Type: Grant

Filed: November 18, 2013

Date of Patent: March 27, 2018

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Ho Sang Sung, Ki Hyun Choo, Jung Hoe Kim, Eun Mi Oh
System and methods for providing voice transcription

Patent number: 9871916

Abstract: A system and methods is provided for providing SIP based voice transcription services. A computer implemented method includes: transcribing a Session Initiation Protocol (SIP) based conversation between one or more users from voice to text transcription; identifying each of the one or more users that are speaking using a device SIP_ID of the one or more users; marking the identity of the one or more users that are speaking in the text transcription; and providing the text transcription of the speaking user to non-speaking users.

Type: Grant

Filed: March 5, 2009

Date of Patent: January 16, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: John R. Dingler, Sri Ramanathan, Matthew A. Terry, Matthew B. Trevathan
Methods and apparatus for dynamic low frequency noise suppression

Patent number: 9865277

Abstract: Methods and apparatus for dynamically suppressing low frequency non-speech audio events, such as road bumps, without suppressing speech formants. In exemplary embodiments of the invention, maximum powers in first and second windows are computed and used to determine whether dampening should be applied, and if so, to what extent.

Type: Grant

Filed: July 10, 2013

Date of Patent: January 9, 2018

Assignee: Nuance Communications, Inc.

Inventors: Friedrich Faubel, Patrick B. Hannon, Kai Wenzler
Method for recognizing a voice context for a voice control function, method for ascertaining a voice control signal for a voice control function, and apparatus for executing the method

Patent number: 9865258

Abstract: A method for recognizing a voice context for a voice control function in a vehicle. The method encompasses reading in a gaze direction datum regarding a current gaze direction of an occupant of the vehicle; allocating the gaze direction datum to a viewing zone in an interior of the vehicle in order to obtain a viewing zone datum regarding a viewing zone currently being viewed by the occupant; and determining, by utilization of the viewing zone datum, a voice context datum regarding a predetermined voice context allocated to the viewing zone currently being viewed.

Type: Grant

Filed: May 17, 2016

Date of Patent: January 9, 2018

Assignee: ROBERT BOSCH GMBH

Inventor: Philippe Dreuw
Devices and methods for use of phase information in speech synthesis systems

Patent number: 9865247

Abstract: A device may receive a speech signal. The device may determine acoustic feature parameters for the speech signal. The acoustic feature parameters may include phase data. The device may determine circular space representations for the phase data based on an alignment of the phase data with given axes of the circular space representations. The device may map the phase data to linguistic features based on the circular space representations. The linguistic features may be associated with linguistic content that includes phonemic content or text content. The device may provide a synthetic audio pronunciation of the linguistic content based on the mapping.

Type: Grant

Filed: February 25, 2015

Date of Patent: January 9, 2018

Assignee: Google Inc.

Inventors: Ioannis Agiomyrgiannakis, Byung Ha Chun
Accurate extraction of chroma vectors from an audio signal

Patent number: 9830929

Abstract: A matrix is generated that stores sinusoidal components evaluated for a given sample rate corresponding to the matrix. The matrix is then used to convert an audio signal to chroma vectors representing of a set of “chromae” (frequencies of interest). The conversion of an audio signal portion into its chromae enables more meaningful analysis of the audio signal than would be possible using the signal data alone. The chroma vectors of the audio signal can be used to perform analyzes such as comparisons with the chroma vectors obtained from other audio signals in order to identify audio matches.

Type: Grant

Filed: June 29, 2015

Date of Patent: November 28, 2017

Assignee: GOOGLE INC.

Inventor: Pedro Gonnet Anders

1 2 3 4 5 … next