Excitation Patterns Patents (Class 704/223)
  • Patent number: 11922958
    Abstract: Various embodiments provide a method and an apparatus for determining a weighting factor during stereo signal encoding. In those embodiments, a parameter value corresponding to the encoding mode of the to-be-encoded signal is determined based on an encoding mode of a to-be-encoded signal in a stereo signal and a correspondence between an encoding mode and a parameter value. Based on the determined parameter value and an energy spectrum of a linear prediction filter corresponding to an original line spectral frequency parameter of the to-be-encoded signal, a weighting factor for calculating a distance between the original line spectral frequency parameter and a target original line spectral frequency parameter is calculated.
    Type: Grant
    Filed: December 13, 2022
    Date of Patent: March 5, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Eyal Shlomot, Haiting Li, Zexin Liu
  • Patent number: 11915687
    Abstract: Systems and methods for generating training data are described herein. Pieces of metadata captured by a plurality of networked sensor systems can be received, where each piece of metadata is associated with a specific set of sensor data captured by one of the plurality of networked sensor systems and includes a set of characteristics for the specific set of captured sensor data. A probabilistic model can be generated based on the received metadata and simulations can be performed based upon a training corpus by generating multiple scenarios, and, for each scenario, a scenario specific version of a particular annotated sample is generated by performing a simulation using the particular annotated sample. The scenario specific versions of annotated samples from the training corpus can be stored as a training data set on the at least one network device.
    Type: Grant
    Filed: January 9, 2023
    Date of Patent: February 27, 2024
    Assignee: Sonos, Inc.
    Inventors: Connor Kristopher Smith, Kurt Thomas Soto, Charles Conor Sleith
  • Patent number: 11887607
    Abstract: A stereo encoding method and apparatus, and a stereo decoding method and apparatus are disclosed. The stereo encoding method includes: performing downmix processing on a left channel signal of a current frame and a right channel signal of the current frame, to obtain a primary channel signal of the current frame and a secondary channel signal of the current frame; and when determining that a frame structure similarity value falls within a frame structure similarity interval, performing differential encoding on a pitch period of the secondary channel signal by using an estimated pitch period value of the primary channel signal, to obtain a pitch period index value of the secondary channel signal, where the pitch period index value of the secondary channel signal is used to generate a to-be-sent stereo encoded bitstream.
    Type: Grant
    Filed: December 15, 2021
    Date of Patent: January 30, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Eyal Shlomot, Yuan Gao, Bin Wang
  • Patent number: 11811489
    Abstract: Techniques for motion mitigation for uplink power control are disclosed. In one embodiment, a method for use in a satellite communication system comprises: generating a power margin associated with motion of an antenna of a satellite terminal; and generating a first power limit representing a maximum transmit power for the antenna based, at least in part, on the power margin.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: November 7, 2023
    Assignee: KYMETA CORPORATION
    Inventors: Eric Hultman, Turner Noel, Alexander L. Bautista, Jr.
  • Patent number: 11700127
    Abstract: The present disclosure provides an authentication method, an authentication device, an electronic device and a storage medium. The authentication method includes: receiving target voice data; obtaining a first voiceprint feature parameter corresponding to the target voice data from a device voiceprint model library; performing a first encryption process on the first voiceprint feature parameter with a locally stored private key to generate to-be-verified data; transmitting the to-be-verified data to a server, so that the server uses a public key which matches the private key to decrypt the to-be-verified data to obtain the first voiceprint feature parameter, and performs authentication on the first voiceprint feature parameter to obtain an authentication result; receiving the authentication result returned by the server.
    Type: Grant
    Filed: July 24, 2020
    Date of Patent: July 11, 2023
    Assignee: BOE TECHNOLOGY GROUP CO., LTD.
    Inventor: Ran Wang
  • Patent number: 11700430
    Abstract: Systems and methods are provided for applying attributes to subtitles. One example method includes accessing a subtitle file, wherein the subtitle file comprises one or more subtitles, and identifying an attribute to apply to at least a subset of the subtitles. The subtitle file is amended to indicate an attribute to apply to at least a subset of the subtitles to create an amended subtitle file. At a computing device, the subtitles of the amended subtitle file are generated for display, wherein the attribute is applied to the subset of the subtitles.
    Type: Grant
    Filed: April 30, 2021
    Date of Patent: July 11, 2023
    Assignee: Rovi Guides, Inc.
    Inventors: Padmassri Chandrashekar, Reda Harb
  • Patent number: 11677477
    Abstract: Methods, systems, and devices for a decision directed multi-modulus searching algorithm are described. A receiver may receive a signal including a set of data symbols. The receiver may iteratively determine a set of centroids for demodulating the set of data symbols (e.g., as part of a training procedure). The centroids may be used to demodulate the set of data symbols according to a modulation constellation associated with the set of data symbols. The training procedure may include, for each data symbol of a subset of data symbols, assigning a centroid of the set of centroids to each data symbol and updating the set of centroids based on assigning the centroid to each data symbol. The receiver may demodulate the set of data symbols based on the updated set of centroids.
    Type: Grant
    Filed: October 11, 2021
    Date of Patent: June 13, 2023
    Assignee: Cable Television Laboratories, Inc.
    Inventors: Mu Xu, Zhensheng Jia
  • Patent number: 11669097
    Abstract: The present disclosure relates to systems and methods for autonomous driving. The systems may obtain driving information associated with a vehicle; determine a state of the vehicle; determine one or more candidate control signals and one or more evaluation values corresponding to the one or more candidate control signals based on the driving information and the state of the vehicle by using a trained control model; select a target control signal from the one or more candidate control signals based on the one or more evaluation values; and transmit the target control signal to a control component of the vehicle.
    Type: Grant
    Filed: February 26, 2021
    Date of Patent: June 6, 2023
    Assignee: BEIJING VOYAGER TECHNOLOGY CO., LTD.
    Inventor: Wei Luo
  • Patent number: 11669667
    Abstract: Systems and methods for automatic test pattern generation (ATPG) for parametric faults are described. A model may be constructed to predict a measurement margin for an integrated circuit (IC) design based on a random sample of random variables. A set of failure events may be determined for the IC design using the model, where each failure event may correspond to a set of values of the random variables that is expected to cause a metric for the IC design to violate a threshold.
    Type: Grant
    Filed: February 19, 2021
    Date of Patent: June 6, 2023
    Assignee: Synopsys, Inc.
    Inventors: Peilin Jiang, Mayukh Bhattacharya, Chih Ping Antony Fan
  • Patent number: 11621011
    Abstract: Described herein is a method of decoding an audio or speech signal, the method including the steps of: (a) receiving, by a decoder, a coded bitstream including the audio or speech signal and conditioning information; (b) providing, by a bitstream decoder, decoded conditioning information in a format associated with a first bitrate; (c) converting, by a converter, the decoded conditioning information from the format associated with the first bitrate to a format associated with a second bitrate; and (d) providing, by a generative neural network, a reconstruction of the audio or speech signal according to a probabilistic model conditioned by the conditioning information in the format associated with the second bitrate. Described are further an apparatus for decoding an audio or speech signal, a respective encoder, a system of the encoder and the apparatus for decoding an audio or speech signal as well as a respective computer program product.
    Type: Grant
    Filed: October 29, 2019
    Date of Patent: April 4, 2023
    Assignee: Dolby International AB
    Inventors: Janusz Klejsa, Per Hedelin
  • Patent number: 11551670
    Abstract: Systems and methods for generating training data are described herein. Pieces of metadata captured by a plurality of networked sensor systems can be received, where each piece of metadata is associated with a specific set of sensor data captured by one of the plurality of networked sensor systems and includes a set of characteristics for the specific set of captured sensor data. A probabilistic model can be generated based on the received metadata and simulations can be performed based upon a training corpus by generating multiple scenarios, and, for each scenario, a scenario specific version of a particular annotated sample is generated by performing a simulation using the particular annotated sample. The scenario specific versions of annotated samples from the training corpus can be stored as a training data set on the at least one network device.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: January 10, 2023
    Assignee: Sonos, Inc.
    Inventors: Connor Kristopher Smith, Kurt Thomas Soto, Charles Conor Sleith
  • Patent number: 11551701
    Abstract: Various embodiments provide a method and an apparatus for determining a weighting factor during stereo signal encoding. In those embodiments, a parameter value corresponding to the encoding mode of the to-be-encoded signal is determined based on an encoding mode of a to-be-encoded signal in a stereo signal and a correspondence between an encoding mode and a parameter value. Based on the determined parameter value and an energy spectrum of a linear prediction filter corresponding to an original line spectral frequency parameter of the to-be-encoded signal, a weighting factor for calculating a distance between the original line spectral frequency parameter and a target original line spectral frequency parameter is calculated.
    Type: Grant
    Filed: December 29, 2020
    Date of Patent: January 10, 2023
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Eyal Shlomot, Haiting Li, Zexin Liu
  • Patent number: 11551702
    Abstract: A spectrum filler for filling non-coded residual sub-vectors of a transform coded audio signal includes a sub-vector compressor configured to compress actually coded residual sub-vectors. A sub-vector rejecter is configured to reject compressed residual sub-vectors that do not fulfill a predetermined sparseness criterion. A sub-vector collector is configured to concatenate the remaining compressed residual sub-vectors to form a first virtual codebook. A coefficient combiner is configured to combine pairs of coefficients of the first virtual codebook to form a second virtual codebook. A sub-vector filler is configured to fill non-coded residual sub-vectors below a predetermined frequency with coefficients from the first virtual codebook, and to fill non-coded residual sub-vectors above the predetermined frequency with coefficients from the second virtual codebook.
    Type: Grant
    Filed: May 28, 2021
    Date of Patent: January 10, 2023
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Volodya Grancharov, Sebastian Näslund, Sigurdur Sverrisson
  • Patent number: 11501759
    Abstract: Disclosed are a method and a system for speech recognition, an electronic device and a storage medium, which relate to the technical field of speech recognition. Embodiments of the application comprise performing encoded representation on an audio to be recognized to obtain an acoustic encoded state vector sequence of the audio to be recognized; performing sparse encoding on the acoustic encoded state vector sequence of the audio to be recognized to obtain an acoustic encoded sparse vector; determining a text prediction vector of each label in a preset vocabulary; recognizing the audio to be recognized and determining a text content corresponding to the audio to be recognized according to the acoustic encoded sparse vector and the text prediction vector. The acoustic encoded sparse vector of the audio to be recognized is obtained by performing sparse encoding on the acoustic encoded state vector of the audio to be recognized.
    Type: Grant
    Filed: July 19, 2022
    Date of Patent: November 15, 2022
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Zhengkun Tian, Jiangyan Yi
  • Patent number: 11488613
    Abstract: Disclosed are a method for coding a residual signal of LPC coefficients based on collaborative quantization and a computing device for performing the method. The residual signal coding method may include: generating encoded LPC coefficients and LPC residual signals by performing LPC analysis and quantization on an input speech; determining a predicted LPC residual signal by applying the LPC residual signal to cross module residual learning; performing LPC synthesis using the encoded LPC coefficients and the predicted LPC residual signal; and determining an output speech that is a synthesized output according to a result of performing the LPC synthesis.
    Type: Grant
    Filed: November 13, 2020
    Date of Patent: November 1, 2022
    Assignees: Electronics and Telecommunications Research Institute, The Trustees of Indiana University
    Inventors: Minje Kim, Kai Zhen, Mi Suk Lee, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Jin Soo Choi
  • Patent number: 11482232
    Abstract: Concealing a lost audio frame of a received audio signal is provided by performing a sinusoidal analysis (81) of a part of a previously received or reconstructed audio signal, wherein the sinusoidal analysis involves identifying frequencies of sinusoidal components of the audio signal, applying a sinusoidal model on a segment of the previously received or reconstructed audio signal, wherein said segment is used as a prototype frame in order to create a substitution frame for a lost audio frame, and creating the substitution frame (83) for the lost audio frame by time-evolving sinusoidal components of the prototype frame, up to the time instance of the lost audio frame, in response to the corresponding identified frequencies.
    Type: Grant
    Filed: May 16, 2019
    Date of Patent: October 25, 2022
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventor: Stefan Bruhn
  • Patent number: 11373633
    Abstract: During text-to-speech processing, a speech model creates synthesized speech that corresponds to input data. The speech model may include an encoder for encoding the input data into a context vector and a decoder for decoding the context vector into spectrogram data. The speech model may further include a voice decoder that receives vocal characteristic data representing a desired vocal characteristic of synthesized speech. The voice decoder may process the vocal characteristic data to determine configuration data, such as weights, for use by the speech decoder.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: June 28, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Roberto Barra Chicote, Vatsal Aggarwal, Andrew Paul Breen, Javier Gonzalez Hernandez, Nishant Prateek
  • Patent number: 11343155
    Abstract: The present disclosure generally relates to apparatus, software and methods for predicting future network traffic. The disclosed apparatus, software and methods alleviate congestion and/or increase overall traffic flow by providing methods for reallocating future idle capacity.
    Type: Grant
    Filed: September 12, 2019
    Date of Patent: May 24, 2022
    Assignee: Cable Television Laboratories, Inc.
    Inventors: Bernardo Huberman, Scott H. Clearwater
  • Patent number: 11270715
    Abstract: An audio signal processing device comprises a discontinuity detector configured to determine an occurrence of a discontinuity from a sudden increase of an amplitude of decoded audio obtained by decoding the first audio packet which is received correctly after an occurrence of a packet loss, and a discontinuity corrector for correcting the discontinuity of the decoded audio by changing, in a state buffer, a distance between elements of Immittance Spectral Frequency/Line Spectral Frequency (ISF/LSF) parameters of a past frame.
    Type: Grant
    Filed: April 9, 2020
    Date of Patent: March 8, 2022
    Assignee: NTT DOCOMO, INC.
    Inventors: Kimitaka Tsutsumi, Kei Kikuiri, Atsushi Yamaguchi
  • Patent number: 11264043
    Abstract: An apparatus for encoding a speech signal by determining a codebook vector of a speech coding algorithm is provided. The apparatus includes a matrix determiner for determining an autocorrelation matrix R, and a codebook vector determiner for determining the codebook vector depending on the autocorrelation matrix R. The matrix determiner is configured to determine the autocorrelation matrix R by determining vector coefficients of a vector r, wherein the autocorrelation matrix R includes a plurality of rows and a plurality of columns, wherein the vector r indicates one of the columns or one of the rows of the autocorrelation matrix R, wherein R(i, j) = r(|i − j|), wherein R(i, j) indicates the coefficients of the autocorrelation matrix R, wherein i is a first index indicating one of a plurality of rows of the autocorrelation matrix R, and wherein j is a second index indicating one of the plurality of columns of the autocorrelation matrix R.
    Type: Grant
    Filed: December 4, 2018
    Date of Patent: March 1, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Tom Baeckstroem, Markus Multrus, Guillaume Fuchs, Christian Helmrich, Martin Dietz
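The relation in the abstract above, where every entry of R depends only on the distance between its row and column index, describes a symmetric Toeplitz matrix. A minimal Python sketch of that construction follows; the function name `autocorrelation_matrix` and the sample values are illustrative assumptions, not taken from the patent.

```python
def autocorrelation_matrix(r):
    """Build the square matrix whose entry (i, j) is r[|i - j|].

    Hypothetical helper for illustration only: `r` is assumed to hold the
    first column (equivalently, first row) of the autocorrelation matrix.
    """
    n = len(r)
    return [[r[abs(i - j)] for j in range(n)] for i in range(n)]


# Illustrative values (an assumption, not data from the patent).
R = autocorrelation_matrix([4.0, 2.0, 1.0])
for row in R:
    print(row)
```

Because the whole matrix is determined by the single vector r, only n coefficients need to be computed and stored rather than n × n.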
  • Patent number: 11240069
    Abstract: A communication device is configured to correlate a first signal with a second signal at a designated interval, the second signal corresponding to the first signal and being received by the communication device where the other communication device transmits a signal including a pulse as the first signal, convert a correlation computation result that is a result of correlating the first signal with the second signal at the designated interval into a format including a matrix product of an expanded modal matrix and an expanded signal vector, the expanded modal matrix including a plurality of elements indicating the correlation computation result obtained when assuming that the signals are received at respective set times, the expanded signal vector being a vector including a plurality of elements, each of which indicates whether or not there is a signal received at each of the set times and amplitude and phase of the signal.
    Type: Grant
    Filed: January 22, 2021
    Date of Patent: February 1, 2022
    Assignees: KABUSHIKI KAISHA TOKAI RIKA DENKI SEISAKUSHO, NAGOYA INSTITUTE OF TECHNOLOGY
    Inventors: Yoshiki Oishi, Kenichi Koga, Tatsuya Koike, Nobuyoshi Kikuma
  • Patent number: 11227613
    Abstract: A frame error concealment method based on frames including transform coefficient vectors includes the following steps: tracking sign changes between corresponding transform coefficients of predetermined sub-vectors of consecutive good stationary frames; accumulating the number of sign changes in corresponding sub-vectors of a predetermined number of consecutive good stationary frames; and reconstructing an erroneous frame with the latest good stationary frame, but with reversed signs of transform coefficients in sub-vectors having an accumulated number of sign changes that exceeds a predetermined threshold.
    Type: Grant
    Filed: January 20, 2020
    Date of Patent: January 18, 2022
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Sebastian Näslund, Volodya Grancharov, Jonas Svedberg
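The three steps of the concealment abstract above (track sign changes, accumulate them per sub-vector, reconstruct with selective sign reversal) can be sketched as follows. This is a rough illustration under stated assumptions, not the patented implementation: the function name `conceal_frame`, the fixed sub-vector length `sub_len`, and the sample frames are all hypothetical, and the caller is assumed to pass only consecutive good stationary frames.

```python
def conceal_frame(good_frames, sub_len, threshold):
    """Reconstruct a lost frame from the latest good frame.

    Assumptions (not from the patent): frames are equal-length lists of
    transform coefficients, oldest first, and sub-vectors are contiguous
    blocks of `sub_len` coefficients.
    """
    latest = good_frames[-1]
    n_sub = len(latest) // sub_len
    sign_changes = [0] * n_sub

    # Steps 1 and 2: count sign changes between corresponding coefficients
    # of consecutive frames, accumulated per sub-vector.
    for prev, cur in zip(good_frames, good_frames[1:]):
        for k in range(n_sub * sub_len):
            if prev[k] * cur[k] < 0:
                sign_changes[k // sub_len] += 1

    # Step 3: copy the latest good frame, reversing signs in sub-vectors
    # whose accumulated count exceeds the threshold.
    out = list(latest)
    for s in range(n_sub):
        if sign_changes[s] > threshold:
            for k in range(s * sub_len, (s + 1) * sub_len):
                out[k] = -out[k]
    return out


# Illustrative frames: the first sub-vector flips sign every frame,
# the second one never does.
frames = [[1, 1, 1, 1], [-1, -1, 1, 1], [1, 1, 1, 1]]
substitute = conceal_frame(frames, sub_len=2, threshold=2)
```

In this toy input the first sub-vector accumulates four sign changes and is reversed in the substitute frame, while the second is copied unchanged.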
  • Patent number: 11176954
    Abstract: A technique for encoding a multichannel audio signal is provided that includes quantizing a set of first LP filter coefficients for an audio signal in a first channel using a predefined first quantizer; and quantizing a set of second LP filter coefficients for an audio signal in a second channel on the basis of the quantized set of first LP filter coefficients. The quantization of the set of second LP filter coefficients includes: deriving, on the basis of the quantized set of first LP filter coefficients by using a predefined predictor, a set of predicted LP filter coefficients for the audio signal in said second channel, computing a prediction error as a difference between respective LP coefficients of the set of second LP filter coefficients and the set of predicted LP filter coefficients, and quantizing the prediction error.
    Type: Grant
    Filed: April 10, 2017
    Date of Patent: November 16, 2021
    Assignee: NOKIA TECHNOLOGIES OY
    Inventors: Adriana Vasilache, Anssi Ramo, Lasse Laaksonen
  • Patent number: 11069349
    Abstract: The speech command issued to a voice activated/controlled system is anonymized so that biometric voice data of the speaker may not be received by the voice activated/controlled system. A spoken audio command is converted to text, which is then converted to a synthesized voice signal. The synthesized voice signal is then provided to the voice-activated/controlled device. The synthesized voice signal may be provided to the voice-activated device within a sound shield or enclosure so that the original speech command issued by the speaker may not be received by the voice-activated/controlled system. In this way, the speaker's actual voice and related data may be kept private and secure.
    Type: Grant
    Filed: October 29, 2018
    Date of Patent: July 20, 2021
    Assignee: DILLARD-APPLE, LLC
    Inventors: Margaret Dillard, Logan Apple
  • Patent number: 10923137
    Abstract: A computer-implemented system and method provide an audio label for a noise signal. The computer-implemented system and method include receiving an audio input and obtaining the noise signal from the audio input. The computer-implemented system and method include extracting audio features of the noise signal. The computer-implemented system and method include determining and outputting an audio label for the extracted audio features of the noise signal based on machine learning data.
    Type: Grant
    Filed: May 4, 2017
    Date of Patent: February 16, 2021
    Assignee: Robert Bosch GmbH
    Inventor: Taufiq Hasan al Banna
  • Patent number: 10885089
    Abstract: A method enables identification of a similarity level between a user-provided data item and a data item within a set of data documents. The method includes a representation generator determining, for each term in an enumeration of terms, occurrence information. The representation generator generates, for each term, a sparse distributed representation (SDR) using the occurrence information. The method includes receiving, by a filtering module, a filtering criterion. The method includes generating, by the representation generator, for the filtering criterion, at least one SDR. The method includes generating, by the representation generator, for a first of a plurality of streamed documents received from a data source, a compound SDR. The method includes determining, by a similarity engine executing on the second computing device, a distance between the filtering criterion SDR and the generated compound SDR. The method includes acting on the first streamed document, based upon the determined distance.
    Type: Grant
    Filed: July 26, 2016
    Date of Patent: January 5, 2021
    Assignee: cortical.io AG
    Inventor: Francisco Eduardo De Sousa Webber
  • Patent number: 10861480
    Abstract: Embodiments of the present disclosure provide a method and a device for generating far-field speech data, a computer device and a computer readable storage medium. The method includes obtaining environmental noise in a real environment and adjusting near-field speech data in a near-field speech data set based on the environmental noise. The method further includes generating far-field speech data based on the adjusted near-field speech data and the environmental noise.
    Type: Grant
    Filed: December 20, 2018
    Date of Patent: December 8, 2020
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Jianwei Sun, Chao Li, Xin Li, Weixin Zhu, Ming Wen
  • Patent number: 10811021
    Abstract: A technology of accurately coding and decoding coefficients which are convertible into linear prediction coefficients even for a frame in which the spectrum variation is great while suppressing an increase in the code amount as a whole is provided. A coding device includes: a first coding unit that obtains a first code by coding coefficients which are convertible into linear prediction coefficients of more than one order; and a second coding unit that obtains a second code by coding at least quantization errors of the first coding unit if (A-1) an index Q commensurate with how high the peak-to-valley height of a spectral envelope is, the spectral envelope corresponding to the coefficients which are convertible into the linear prediction coefficients of more than one order, is larger than or equal to a predetermined threshold value Th1 and/or (B-1) an index Q′ commensurate with how short the peak-to-valley height of the spectral envelope is, is smaller than or equal to a predetermined threshold value Th1′.
    Type: Grant
    Filed: November 22, 2019
    Date of Patent: October 20, 2020
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10762392
    Abstract: Systems, apparatuses, and methods for converting data to a tiling format when implementing convolutional neural networks are disclosed. A system includes at least a memory, a cache, a processor, and a plurality of compute units. The memory stores a first buffer and a second buffer in a linear format, where the first buffer stores convolutional filter data and the second buffer stores image data. The processor converts the first and second buffers from the linear format to third and fourth buffers, respectively, in a tiling format. The plurality of compute units load the tiling-formatted data from the third and fourth buffers in memory to the cache and then perform a convolutional filter operation on the tiling-formatted data. The system generates a classification of a first dataset based on a result of the convolutional filter operation.
    Type: Grant
    Filed: December 28, 2018
    Date of Patent: September 1, 2020
    Assignee: Advanced Micro Devices, Inc.
    Inventors: Song Zhang, Jiantan Liu, Hua Zhang, Min Yu
  • Patent number: 10757507
    Abstract: In accordance with some embodiments, an apparatus for privacy protection is provided. The apparatus includes an audio output device arranged to output sound directed to an audio input device of a second device. The apparatus further includes an audio coupling interface arranged to provide a cavity for the audio output device and the audio input device of the second device. The apparatus also includes a spectral shaper, coupled to the audio output device, operable to apply a spectral envelope to an audio signal in order to produce a shaped audio signal, wherein the shaped audio signal is selectively coupled to the audio output device.
    Type: Grant
    Filed: February 11, 2019
    Date of Patent: August 25, 2020
    Assignee: PPIP, LLC
    Inventors: Michael Fong, Neric Hsin-wu Fong, Teddy David Thomas
  • Patent number: 10706866
    Abstract: An audio signal encoding method and a mobile phone, where the audio signal encoding method includes obtaining a digital audio signal in time domain; transforming the digital audio signal in time domain to an audio signal in frequency domain, where the audio signal in frequency domain comprises a current frame comprising a plurality of subbands; obtaining reference parameters of the plurality of subbands; encoding, using a HQ algorithm, the current frame to obtain an encoded audio signal when the reference parameters meet a preset parameter condition; and transmitting the encoded audio signal via a network. The audio signal encoding method and the mobile phone help improve encoding quality or encoding efficiency in audio signal encoding.
    Type: Grant
    Filed: October 30, 2019
    Date of Patent: July 7, 2020
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Zexin Liu, Lei Miao
  • Patent number: 10657980
    Abstract: A computer-implemented method according to one embodiment includes creating a clean dictionary, utilizing a clean signal, creating a noisy dictionary, utilizing a first noisy signal, determining a time varying projection, utilizing the clean dictionary and the noisy dictionary, and denoising a second noisy signal, utilizing the time varying projection.
    Type: Grant
    Filed: October 25, 2017
    Date of Patent: May 19, 2020
    Assignee: International Business Machines Corporation
    Inventors: Dimitrios B. Dimitriadis, Samuel Thomas, Colin C. Vaz
  • Patent number: 10621999
    Abstract: An audio signal processing device comprises a discontinuity detector configured to determine an occurrence of a discontinuity from a sudden increase of an amplitude of decoded audio obtained by decoding the first audio packet which is received correctly after an occurrence of a packet loss, and a side information encoder configured to encode side information about the discontinuity.
    Type: Grant
    Filed: October 17, 2018
    Date of Patent: April 14, 2020
    Assignee: NTT DOCOMO, INC.
    Inventors: Kimitaka Tsutsumi, Kei Kikuiri, Atsushi Yamaguchi
  • Patent number: 10553229
    Abstract: A technology of accurately coding and decoding coefficients which are convertible into linear prediction coefficients even for a frame in which the spectrum variation is great while suppressing an increase in the code amount as a whole is provided. A coding device includes: a first coding unit that obtains a first code by coding coefficients which are convertible into linear prediction coefficients of more than one order; and a second coding unit that obtains a second code by coding at least quantization errors of the first coding unit if (A-1) an index Q commensurate with how high the peak-to-valley height of a spectral envelope is, the spectral envelope corresponding to the coefficients which are convertible into the linear prediction coefficients of more than one order, is larger than or equal to a predetermined threshold value Th1 and/or (B-1) an index Q′ commensurate with how short the peak-to-valley height of the spectral envelope is, is smaller than or equal to a predetermined threshold value Th1′.
    Type: Grant
    Filed: June 3, 2019
    Date of Patent: February 4, 2020
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10529350
    Abstract: A technology of accurately coding and decoding coefficients which are convertible into linear prediction coefficients even for a frame in which the spectrum variation is great while suppressing an increase in the code amount as a whole is provided. A coding device includes: a first coding unit that obtains a first code by coding coefficients which are convertible into linear prediction coefficients of more than one order; and a second coding unit that obtains a second code by coding at least quantization errors of the first coding unit if (A-1) an index Q commensurate with how high the peak-to-valley height of a spectral envelope is, the spectral envelope corresponding to the coefficients which are convertible into the linear prediction coefficients of more than one order, is larger than or equal to a predetermined threshold value Th1 and/or (B-1) an index Q′ commensurate with how short the peak-to-valley height of the spectral envelope is, is smaller than or equal to a predetermined threshold value Th1′.
    Type: Grant
    Filed: June 3, 2019
    Date of Patent: January 7, 2020
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10438613
    Abstract: A time-varying pitch of a signal may be estimated by processing a sequence of frames of the speech signal. An estimated fractional chirp rate may be computed for each frame of the sequence of frames, and the estimated fractional chirp rates may be used to compute a pitch template for the sequence, where the pitch template indicates the time-varying pitch of the signal subject to a scale factor. A first pitch estimate for each frame of the sequence of frames may be computed by computing a scale factor and multiplying the pitch template by the scale factor. A second pitch estimate may be computed from the first pitch estimate by identifying peaks in the frequency representations using the first pitch estimates and fitting a parametric function to the peaks.
    Type: Grant
    Filed: May 3, 2019
    Date of Patent: October 8, 2019
    Assignee: Friday Harbor LLC
    Inventors: David C. Bradley, Jeremy Semko
  • Patent number: 10381015
    Abstract: A technology of accurately coding and decoding coefficients which are convertible into linear prediction coefficients even for a frame in which the spectrum variation is great while suppressing an increase in the code amount as a whole is provided. A coding device includes: a first coding unit that obtains a first code by coding coefficients which are convertible into linear prediction coefficients of more than one order; and a second coding unit that obtains a second code by coding at least quantization errors of the first coding unit if (A-1) an index Q commensurate with how high the peak-to-valley height of a spectral envelope is, the spectral envelope corresponding to the coefficients which are convertible into the linear prediction coefficients of more than one order, is larger than or equal to a predetermined threshold value Th1 and/or (B-1) an index Q′ commensurate with how short the peak-to-valley height of the spectral envelope is, is smaller than or equal to a predetermined threshold value Th1′.
    Type: Grant
    Filed: July 25, 2018
    Date of Patent: August 13, 2019
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10339948
    Abstract: Disclosed are a method and apparatus for encoding and decoding a high frequency for bandwidth extension. The method includes: estimating a weight; and generating a high frequency excitation signal by applying the weight between random noise and a decoded low frequency spectrum.
    Type: Grant
    Filed: September 11, 2017
    Date of Patent: July 2, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Ki-hyun Choo
  • Patent number: 10290300
    Abstract: Embodiments are disclosed for recognizing speech in a computing system. An example speech recognition method includes receiving metadata at a generation unit that includes a database of accented substrings, generating, via the generation unit, accent-corrected phonetic data for words included in the metadata, the accent-corrected phonetic data representing different pronunciations of the words included in the metadata based on the accented substrings stored in the database, receiving, at a voice recognition engine, extracted speech data derived from utterances input by a user to the speech recognition system, and receiving, at the voice recognition engine, the accent-corrected phonetic data.
    Type: Grant
    Filed: July 24, 2015
    Date of Patent: May 14, 2019
    Assignee: Harman International Industries, Incorporated
    Inventor: Rajat Pashine
  • Patent number: 10283143
    Abstract: A time-varying pitch of a signal may be estimated by processing a sequence of frames of the speech signal. An estimated fractional chirp rate may be computed for each frame of the sequence of frames, and the estimated fractional chirp rates may be used to compute a pitch template for the sequence, where the pitch template indicates the time-varying pitch of the signal subject to a scale factor. A first pitch estimate for each frame of the sequence of frames may be computed by computing a scale factor and multiplying the pitch template by the scale factor. A second pitch estimate may be computed from the first pitch estimate by identifying peaks in the frequency representations using the first pitch estimates and fitting a parametric function to the peaks.
    Type: Grant
    Filed: March 20, 2017
    Date of Patent: May 7, 2019
    Assignee: Friday Harbor LLC
    Inventors: David C. Bradley, Jeremy Semko
  • Patent number: 10236007
    Abstract: An audio encoder for encoding an audio signal, includes: a first encoding processor for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor includes: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor, so that the second encoding processing is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal.
    Type: Grant
    Filed: January 24, 2017
    Date of Patent: March 19, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Martin Dietz, Markus Multrus, Guillaume Fuchs, Emmanuel Ravelli, Matthias Neusinger, Markus Schnell, Benjamin Schubert, Bernhard Grill
  • Patent number: 10217453
    Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.
    Type: Grant
    Filed: October 14, 2016
    Date of Patent: February 26, 2019
    Assignee: SoundHound, Inc.
    Inventors: Mark Stevans, Monika Almudafar-Depeyrot, Keyvan Mohajer
  • Patent number: 10152982
    Abstract: An audio signal processing device comprises a discontinuity detector configured to determine an occurrence of a discontinuity from a sudden increase of an amplitude of decoded audio obtained by decoding the first audio packet which is received correctly after an occurrence of a packet loss, and a discontinuity corrector for correcting the discontinuity of the decoded audio.
    Type: Grant
    Filed: September 15, 2017
    Date of Patent: December 11, 2018
    Assignee: NTT DOCOMO, INC.
    Inventors: Kimitaka Tsutsumi, Kei Kikuiri, Atsushi Yamaguchi
  • Patent number: 10147435
    Abstract: An audio signal, having first and second regions of frequency spectrum, is coded. Spectral peaks in the first region are encoded by a first coding method. For a segment of the audio signal, a relation between energy of bands in the first and second regions is determined. A relation between the energy of the band in the second region and energy of neighboring bands in the second region is determined. A determination is made whether available bits are sufficient for encoding at least one non-peak segment of the first region and the band in the second region. Responsive to first and second relations fulfilling a respective predetermined criterion and a sufficient number of bits, encoding the band in the second region using a second coding method different from the first coding method, and otherwise, subjecting the band in the second region to BandWidth Extension (BWE) or noise fill.
    Type: Grant
    Filed: July 20, 2017
    Date of Patent: December 4, 2018
    Assignee: Telefonaktiebolaget L M Ericsson (publ)
    Inventors: Erik Norvell, Volodya Grancharov
  • Patent number: 10083698
    Abstract: A speech coding method of reducing error propagation due to voice packet loss is achieved by limiting or reducing a pitch gain only for the first subframe or the first two subframes within a speech frame; the excitation of a next frame is obtained according to the reduced or limited pitch gain value of the first subframe, and the next frame is encoded according to the obtained excitation. The method is used for a voiced speech class.
    Type: Grant
    Filed: August 15, 2017
    Date of Patent: September 25, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventor: Yang Gao
  • Patent number: 10074376
    Abstract: A technology of accurately coding and decoding coefficients which are convertible into linear prediction coefficients even for a frame in which the spectrum variation is great while suppressing an increase in the code amount as a whole is provided. A coding device includes: a first coding unit that obtains a first code by coding coefficients which are convertible into linear prediction coefficients of more than one order; and a second coding unit that obtains a second code by coding at least quantization errors of the first coding unit if (A-1) an index Q commensurate with how high the peak-to-valley height of a spectral envelope is, the spectral envelope corresponding to the coefficients which are convertible into the linear prediction coefficients of more than one order, is larger than or equal to a predetermined threshold value Th1 and/or (B-1) an index Q′ commensurate with how short the peak-to-valley height of the spectral envelope is, is smaller than or equal to a predetermined threshold value Th1′.
    Type: Grant
    Filed: March 16, 2015
    Date of Patent: September 11, 2018
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada
  • Patent number: 10044547
    Abstract: A digital code recovery circuit includes a data transmitter that outputs either input data or a preamble code as transmitter data. A radio frequency interconnect (RFI) transmitter modulates carrier signals based on the transmitter data and transmits the modulated carrier signals over a channel to an RFI receiver that demodulates the carrier signals to obtain recovered transmitter data. A calibration storage device stores preamble data and a calibration circuit receives the recovered transmitter data. If the recovered transmitter data originated from the preamble code, the calibration circuit determines a set of digital calibration adjustments from the recovered transmitter data and the preamble data. If the recovered transmitter data originated from the input data, the calibration circuit applies the set of digital calibration adjustments to the recovered transmitter data to obtain adjusted digital code and outputs the adjusted digital code.
    Type: Grant
    Filed: October 30, 2015
    Date of Patent: August 7, 2018
    Assignee: TAIWAN SEMICONDUCTOR MANUFACTURING COMPANY, LTD.
    Inventors: Fu-Lung Hsueh, William Wu Shen, Lan-Chou Cho
  • Patent number: 10038485
    Abstract: A codebook C is provided in a MIMO transmitter as well as a MIMO receiver. The codebook C will include M codewords ci, where i is a unique codeword index for each codeword ci. Each codeword defines weighting factors to apply to the MIMO signals, and may correspond to channel matrices or vectors to apply to the MIMO signals prior to transmission from the respective antennas of the MIMO transmitter. The present invention creates codeword subsets Si for each codeword ci of the codebook C. Each codeword subset Si defines L codewords cj, which are selected from all the codewords ci in the codebook C. The codewords cj in a codeword subset Si are the L codewords in the entire codebook that best correlate with the corresponding codeword ci.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: July 31, 2018
    Assignee: Apple Inc.
    Inventors: Wen Tong, Hosein Nikopour, Amir Khandani, Hua Xu, Ming Jia, Peiying Zhu, Dong-sheng Yu
  • Patent number: 10002605
    Abstract: A method and system for achieving emotional text to speech. The method includes: receiving text data; generating emotion tag for the text data by a rhythm piece; and achieving TTS to the text data corresponding to the emotion tag, where the emotion tags are expressed as a set of emotion vectors; where each emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories. A system for the same includes: a text data receiving module; an emotion tag generating module; and a TTS module for achieving TTS, wherein the emotion tag is expressed as a set of emotion vectors; and wherein emotion vector includes a plurality of emotion scores given based on a plurality of emotion categories.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: June 19, 2018
    Assignee: International Business Machines Corporation
    Inventors: Shenghua Bao, Jian Chen, Yong Qin, Qin Shi, Zhiwei Shuang, Zhong Su, Liu Wen, Shi Lei Zhang
  • Patent number: 9972325
    Abstract: In accordance with an embodiment, a method of encoding an audio/speech signal includes determining a mixed codebook vector based on an incoming audio/speech signal, where the mixed codebook vector includes a sum of a first codebook entry from a first codebook and a second codebook entry from a second codebook. The method further includes generating an encoded audio signal based on the determined mixed codebook vector, and transmitting a coded excitation index of the determined mixed codebook vector.
    Type: Grant
    Filed: February 15, 2013
    Date of Patent: May 15, 2018
    Assignee: Huawei Technologies Co., Ltd.
    Inventor: Yang Gao