Silence Decision Patents (Class 704/215)
  • Patent number: 11810543
    Abstract: A method and an apparatus for audio signal processing selection are provided. In the method, multiple audio signal processing operations are performed on a synthesized audio signal to generate multiple processed audio signals, the audio signal processing operations are evaluated according to the comparison results between the processed audio signals and the primary signal, and the audio signal processing operation corresponding to a designated application and the designated audio output mode is selected according to the evaluation result of the audio signal processing operations. The synthesized audio signal is generated by adding a secondary signal into a primary signal. The signal processing is related to remove the secondary signal from the synthesized audio signal. Those processed audio signals are used by the designated application at the designated audio output mode. The comparison result is related to signal similarity. The evaluation result is related to the highest signal similarity.
    Type: Grant
    Filed: October 4, 2021
    Date of Patent: November 7, 2023
    Assignee: Acer Incorporated
    Inventors: Po-Jen Tu, Jia-Ren Chang, Kai-Meng Tzeng, Ming-Chun Fang
  • Patent number: 11785382
    Abstract: Various aspects include active noise reduction (ANR) devices and approaches, one approach including: receiving an input signal representing audio captured by a feedforward microphone of an ANR headphone; receiving an error signal representing audio captured by an error measurement sensor; generating an anti-noise signal configured to reduce a noise signal over a frequency range; and applying a gain to at least one of the input signal or the anti-noise signal over the frequency range based on the error signal, wherein the applied gain is configured to enhance noise reduction for a plurality of users having distinct fits for the ANR headphone.
    Type: Grant
    Filed: October 4, 2022
    Date of Patent: October 10, 2023
    Inventors: Michael P. O'Connell, Joseph H. Cattell, Dale Ian McElhone
  • Patent number: 11736120
    Abstract: The disclosure provides a method and a device in a User Equipment (UE) and a base station for wireless communication. A first node generates a first bit block, performs channel coding and then transmits a first radio signal. The first bit block comprising all bits in a second bit block and all bits in a third bit block is used for an input of the channel coding, and an output of the channel coding is used for generating the first radio signal. A Cyclic Redundancy Check (CRC) bit block of a fourth bit block is used for generating the third bit block. The fourth bit block comprises all bits in the second bit block and all bits in a fifth bit block, the bits in the fifth bit block are of fixed values, and the fifth bit block is composed of K bits, the K being a positive integer.
    Type: Grant
    Filed: December 22, 2021
    Date of Patent: August 22, 2023
    Assignee: SHANGHAI LANGBO COMMUNICATION TECHNOLOGY COMPANY LIMITED
    Inventors: KeYing Wu, Xiaobo Zhang
  • Patent number: 11651157
    Abstract: Introduced here are computer programs and associated computer-implemented techniques for discovering the presence of filler words through tokenization of a transcript derived from audio content. When audio content is obtained by a media production platform, the audio content can be converted into text content as part of a speech-to-text operation. The text content can then be tokenized and labeled using a Natural Language Processing (NLP) library. Tokenizing/labeling may be performed in accordance with a series of rules associated with filler words. At a high level, these rules may examine the text content (and associated tokens/labels) to determine whether patterns, relationships, verbatim, and context indicate that a term is a filler word. Any filler words that are discovered in the text content can be identified as such so that appropriate action(s) can be taken.
    Type: Grant
    Filed: November 10, 2020
    Date of Patent: May 16, 2023
    Assignee: Descript, Inc.
    Inventors: Alexandre de Brébisson, Antoine d'Andigné
  • Patent number: 11636936
    Abstract: The present disclosure relates to the field of medical data processing based on natural language processing. Embodiments of the present disclosure disclose a method and apparatus for verifying a medical fact. The method may include: acquiring a description text of the medical fact; selecting a relevant paragraph related to the description text of the medical fact from a medical document; and inputting the description text of the medical fact and the corresponding relevant paragraph into a trained discrimination model for authenticity judgment, to obtain a verification result of the medical fact, the discrimination model being pre-trained based on a medical text paragraph pair extracted from the medical document, and being iteratively adjusted using a medical fact sample set including authenticity labeling information after the pre-training.
    Type: Grant
    Filed: September 17, 2020
    Date of Patent: April 25, 2023
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Zhou Fang, Shuangjie Li, Yabing Shi, Ye Jiang
  • Patent number: 11483655
    Abstract: Various aspects include active noise reduction (ANR) devices and approaches, one approach including: receiving an input signal representing audio captured by a feedforward microphone of an ANR headphone; receiving an error signal representing audio captured by an error measurement sensor; generating an anti-noise signal configured to reduce a noise signal over a frequency range; and applying a gain to at least one of the input signal or the anti-noise signal over the frequency range based on the error signal, where the gain is calculated by: filtering the anti-noise signal over the frequency range to generate a filtered feedforward signal, and filtering the error signal over the frequency range to generate a filtered error signal; estimating a feedforward path contribution to the error signal; and determining the gain based on a correlation between the filtered error signal and the filtered feedforward signal with the assigned feedforward path contribution to the error signal.
    Type: Grant
    Filed: March 31, 2021
    Date of Patent: October 25, 2022
    Assignee: Bose Corporation
    Inventors: Michael P. O'Connell, Joseph H. Cattell, Dale Ian McElhone
  • Patent number: 11368420
    Abstract: In one embodiment, a method includes, by one or more computing systems, receiving, from a client system, an input in a multi-turn message thread, parsing the input to identify a plurality of hypothesis dialog states associated with the input, generating a plurality of functions corresponding to the plurality of hypothesis dialog states, calculating a plurality of probability scores for the plurality of functions, respectively, based on a prior dialog state associated with the message thread and a current user context, wherein each probability score represents a likelihood that the respective function is associated with an intended dialog state, and storing the plurality of functions if each of the probability scores is below a threshold probability score, wherein the plurality of functions is stored as a hierarchical set of functions that is used as the prior dialog state in response to a next input in the message thread.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: June 21, 2022
    Assignee: Facebook Technologies, LLC.
    Inventors: Paul Anthony Crook, Baiyang Liu, Rajen Subba
  • Patent number: 11336602
    Abstract: In one embodiment, a method includes, by one or more computing systems, receiving, from a client system, an input in a multi-turn message thread, parsing the input to identify a plurality of hypothesis dialog states associated with the input, generating a plurality of functions corresponding to the plurality of hypothesis dialog states, calculating a plurality of probability scores for the plurality of functions, respectively, based on a prior dialog state associated with the message thread and a current user context, wherein each probability score represents a likelihood that the respective function is associated with an intended dialog state, and storing the plurality of functions if each of the probability scores is below a threshold probability score, wherein the plurality of functions is stored as a hierarchical set of functions that is used as the prior dialog state in response to a next input in the message thread.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: May 17, 2022
    Assignee: Facebook Technologies, LLC.
    Inventors: Paul Anthony Crook, Baiyang Liu, Rajen Subba
  • Patent number: 11232808
    Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
    Type: Grant
    Filed: April 25, 2019
    Date of Patent: January 25, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Zhaoqing Ma, Tony Roy Hardie, Christo Frank Devaraj
  • Patent number: 11201961
    Abstract: The present disclosure generally relates to interfaces and techniques for media playback on one or more devices. In accordance with some embodiments, an electronic device includes a display, one or more processors, and memory. The electronic device receives user input and, in response to receiving the user input, displays, on the display, a multi-device interface that includes: one or more indicators associated with a plurality of available playback devices that are connected to the device and available to initiate playback of media from the device, and a media playback status of the plurality of available playback devices.
    Type: Grant
    Filed: September 24, 2020
    Date of Patent: December 14, 2021
    Assignee: Apple Inc.
    Inventors: Patrick L. Coffman, David Chance Graham
  • Patent number: 11147922
    Abstract: This invention relates to a feedback predictive controller, systems comprising and methods employing the same. Preferably the feedback predictive controller and/or systems comprising the feedback predictive controller are part of an automatic insulin delivery system. The methods described herein can be used to control blood glucose concentration in a patient with diabetes. Preferably, the insulin delivery system is an artificial pancreas.
    Type: Grant
    Filed: July 11, 2019
    Date of Patent: October 19, 2021
    Assignee: Iowa State University Research Foundation, Inc.
    Inventors: Derrick K. Rollins, Yong Mei
  • Patent number: 11068518
    Abstract: A query including a question from a user is received by a device. It is determined that the question and an associated answer is not found in a local cache of the device. At least one category for the question is determined. At least one cloud service for answering the question is determined. The at least one cloud service is in communication with the device via a network. A request including the question is sent to the at least one cloud service. Selecting a behavior is selected based upon the at least one category. The behavior includes a feedback to the user to reduce a perceived waiting time by the user. The behavior is executed.
    Type: Grant
    Filed: May 17, 2018
    Date of Patent: July 20, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Shang Q. Guo, Maharaj Mukherjee, Jonathan Lenchner, John S. Murdock, Jr.
  • Patent number: 10855841
    Abstract: A first communication device includes a memory and a processor coupled to the memory. The processor is configured to receive, during a first call with a second communication device, an indication of a second call from a third communication device. The processor is further configured to monitor the first call, in response to the indication of the second call, for a pause in speech of at least a threshold duration. The processor is further configured to initiate, in response to detecting the pause in speech of at least the threshold duration, an audio message indicating call information associated with the second call.
    Type: Grant
    Filed: October 24, 2019
    Date of Patent: December 1, 2020
    Assignee: QUALCOMM Incorporated
    Inventors: Arpit Agrawal, Arun Kumar Singh, Shankar Anand, Murali Krishna Busanaboyina
  • Patent number: 10741192
    Abstract: A method and an apparatus for estimating speech signal in split-domain is disclosed. The method includes performing LP analysis on a noisy speech signal to generate a first plurality of LPC and a first residual signal. The method also includes estimating speech LPC spectrum to generate cleaned LPC. The method further includes estimating speech residual spectrum to generate cleaned residual signal. The method also includes synthesizing output signals based on the cleaned LPC and the cleaned residual signal.
    Type: Grant
    Filed: May 7, 2018
    Date of Patent: August 11, 2020
    Assignee: Qualcomm Incorporated
    Inventors: Vivek Rajendran, Duminda Dewasurendra, Daniel Jared Sinder
  • Patent number: 10735508
    Abstract: Described are system, apparatus, article of manufacture, method, or computer program product embodiments for controlling streaming of media content. An embodiment operates by halting a presentation of future content from a buffer upon determining that the buffer is exhausted of content to present. The embodiment includes receiving one or more packets over a network connection, the one or more packets including media information corresponding to a first portion of streaming media content, in which the first portion corresponds to a second portion of the streaming media content. The one or more packets in a buffer are stored as buffered content. Responsive to determining that the network connection is not experiencing a burst condition, the buffer is trimmed. Then, presentation of buffered content is resumed and the first portion is caused to be presented in sync with the second portion.
    Type: Grant
    Filed: July 27, 2016
    Date of Patent: August 4, 2020
    Assignee: ROKU, INC.
    Inventors: Gregory S. Gates, Michael Chin-Ming Fu
  • Patent number: 10679182
    Abstract: A method, system and computer program product for facilitating a meeting is implemented in a server computer and provides a graphical user interface to one or more client devices. The server displays a mind map of a center topic on the clients. The server collects statements of participants, and extracts words and phrases therefrom, to identify topics that are currently being discussed. The server determines whether a transition has occurred between the topics, and then calculates a correlation between the topics that are currently being discussed and the center topic. The server determines a location of the topics that are currently being discussed on the mind map, based on the calculated correlation, and displays an updated mind map on the clients. The server generates a warning on the clients when the server determines that the meeting proceedings are being retarded, based on the calculated correlation.
    Type: Grant
    Filed: May 17, 2017
    Date of Patent: June 9, 2020
    Assignee: International Business Machines Corporation
    Inventors: Isao Kadowaki, Xueting Lin, Shigetoshi Nogami, Munetaka Ohtani, Terue Watanabe, Fumiyoshi Yamada
  • Patent number: 10629215
    Abstract: An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal includes a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also includes an audio stream provider for providing the audio stream such that the audio stream includes information describing an audio content of the frequency bands and information describing the multi-band quantization error. A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal includes a noise filler for introducing noise into spectral components of a plurality of frequency bands to which separate frequency band gain information is associated on the basis of a common multi-band noise intensity value.
    Type: Grant
    Filed: September 15, 2016
    Date of Patent: April 21, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Nikolaus Rettelbach, Bernhard Grill, Guillaume Fuchs, Stefan Geyersberger, Markus Multrus, Harald Popp, Juergen Herre, Stefan Wabnik, Gerald Schuller, Jens Hirschfeld
  • Patent number: 10381014
    Abstract: A comfort noise controller for generating CN (Comfort Noise) control parameters is described. A buffer of a predetermined size is configured to store CN parameters for SID (Silence Insertion Descriptor) frames and active hangover frames. A subset selector is configured to determine a CN parameter subset relevant for SID frames based on the age of the stored CN parameters and on residual energies. A comfort noise control parameter extractor (50B) is configured to use the determined CN parameter subset to determine the CN control parameters for a first SID frame following an active signal frame.
    Type: Grant
    Filed: August 22, 2017
    Date of Patent: August 13, 2019
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventor: Tomas Jansson Toftgård
  • Patent number: 10311868
    Abstract: A device performs a method for using image data to aid voice recognition. The method includes the device capturing (302) image data of a vicinity of the device and adjusting (304), based on the image data, a set of parameters for voice recognition performed by the device (102). The set of parameters for the device performing voice recognition include, but are not limited to: a trigger threshold of a trigger for voice recognition; a set of beamforming parameters; a database for voice recognition; and/or an algorithm for voice recognition. The algorithm may include using noise suppression or using acoustic beamforming.
    Type: Grant
    Filed: March 21, 2017
    Date of Patent: June 4, 2019
    Assignee: Google Technology Holdings LLC
    Inventors: Robert A. Zurek, Adrian M. Schuster, Fu-Lin Shau, Jincheng Wu
  • Patent number: 10269375
    Abstract: The disclosed embodiments illustrate a method for classifying one or more audio segments of an audio signal. The method includes determining one or more first features of a first audio segment of the one or more audio segments. The method further includes determining one or more second features based on the one or more first features. The method includes determining one or more third features of the first audio segment, wherein each of the one or more third features is determined based on a second feature of the one or more second features of the first audio segment and at least one second feature associated with a second audio segment. Additionally, the method includes classifying the first audio segment either in an interrogative category or a non-interrogative category based on one or more of the one or more second features and the one or more third features.
    Type: Grant
    Filed: April 22, 2016
    Date of Patent: April 23, 2019
    Assignee: CONDUENT BUSINESS SERVICES, LLC
    Inventors: Harish Arsikere, Arunasish Sen, Prathosh Aragulla Prasad
  • Patent number: 10248917
    Abstract: Systems and methods for developing and utilizing a contactability profile are disclosed. Contact information may be managed by receiving activity data associated with an entity, processing the received activity data, generating at least one contactability profile associated with the entity based upon a result of the processing, and storing the generated contactability profile.
    Type: Grant
    Filed: October 14, 2005
    Date of Patent: April 2, 2019
    Assignee: CAPITAL ONE SERVICES, LLC
    Inventors: Anthony Paul Reynolds, Daniel J. Welker
  • Patent number: 10237647
    Abstract: A beamformer system that can isolate a desired portion of an audio signal resulting from a microphone array. A combination of beamformers is used to dampen undesired noise, whether diffuse or coherent. A fixed beamformer is used to dampen diffuse noise while an adaptive beamformer is used to cancel directional coherent noise. The adaptive beamformer isolates and weights audio from various directions. The weights may vary depending on the isolated desired audio signal, dynamically adjusting the step-size adjustments to the weights.
    Type: Grant
    Filed: March 1, 2017
    Date of Patent: March 19, 2019
    Assignee: Amazon Technologies, Inc.
    Inventor: Amit Singh Chhetri
  • Patent number: 10157359
    Abstract: A method for using oral feedback to facilitate user selection among a plurality of service devices. The method comprises a plurality of service devices. Each service device has a dynamically-updatable quality control profile. A first service device receives a request from the first user to perform a first service job. The first service device performs the first service job. A first recording device located proximate to the first service device obtains oral feedback from the first user regarding the quality of the first service job. The first service device analyzes the oral feedback using natural language processing to form an analysis. The first service device modifies the first quality control profile of the first service device based on the analysis.
    Type: Grant
    Filed: February 15, 2017
    Date of Patent: December 18, 2018
    Assignee: International Business Machines Corporation
    Inventors: Joshi M. Chirayil, Rajendra S. Kanyal, Dhaval K. Shah, Anupam Varma
  • Patent number: 10089993
    Abstract: An apparatus for encoding audio information is provided. The apparatus for encoding audio information includes a selector for selecting a comfort noise generation mode from two or more comfort noise generation modes depending on a background noise characteristic of an audio input signal, and an encoding unit for encoding the audio information, wherein the audio information includes mode information indicating the selected comfort noise generation mode.
    Type: Grant
    Filed: January 27, 2017
    Date of Patent: October 2, 2018
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Martin Dietz, Wolfgang Jaegers, Christian Neukam, Stefan Reuschl
  • Patent number: 9953272
    Abstract: A machine learning system for evaluating at least one characteristic of a heart valve, an inflow tract, an outflow tract or a combination thereof may include a training mode and a production mode. The training mode may be configured to train a computer and construct a transformation function to predict an unknown anatomical characteristic and/or an unknown physiological characteristic of a heart valve, inflow tract and/or outflow tract, using a known anatomical characteristic and/or a known physiological characteristic the heart valve, inflow tract and/or outflow tract. The production mode may be configured to use the transformation function to predict the unknown anatomical characteristic and/or the unknown physiological characteristic of the heart valve, inflow tract and/or outflow tract, based on the known anatomical characteristic and/or the known physiological characteristic of the heart valve, inflow tract and/or outflow tract.
    Type: Grant
    Filed: April 26, 2016
    Date of Patent: April 24, 2018
    Assignee: Stenomics, Inc.
    Inventor: Michael A. Singer
  • Patent number: 9906795
    Abstract: When removing a block distortion occurring in a local decoded image, a loop filtering part 11 of an image coding device carries out a filtering process on each of signal components (a luminance signal component and color difference signal components) after setting the intensity of a filter for removing the block distortion for each of the signal components according to a coding mode (an intra coding mode or an inter coding mode) selected by a coding controlling part 1.
    Type: Grant
    Filed: April 5, 2017
    Date of Patent: February 27, 2018
    Assignee: MITSUBISHI ELECTRIC CORPORATION
    Inventors: Shunichi Sekiguchi, Kazuo Sugimoto, Hiroharu Sakate, Tokumichi Murakami, Akira Minezawa
  • Patent number: 9866972
    Abstract: Described herein is a MEMS acoustic transducer device provided with a micromechanical detection structure that detects acoustic-pressure waves and supplies a transduced electrical quantity, and with an integrated circuit operatively coupled to the micromechanical detection structure and having a reading module that generates at output an audio signal as a function of the transduced electrical quantity. The integrated circuit is further provided with a recognition module, which recognizes a sound activity event associated to the transduced electrical quantity. The MEMS acoustic transducer has an output that supplies at output a data signal that carries information regarding recognition of the sound activity event.
    Type: Grant
    Filed: November 4, 2014
    Date of Patent: January 9, 2018
    Assignee: STMICROELECTRONICS S.R.L.
    Inventors: Marco Veneri, Alessandro Morcelli
  • Patent number: 9830925
    Abstract: An automatic speech recognition engine and a method of using the engine is described. The method pertains to front-end processing an audio signal and includes the steps of: identifying a plurality of voiced-frames of the audio signal; determining that one or more of the plurality of voiced-frames have a signal-to-noise (SNR) value greater than a first predetermined threshold; and based on the determination, bypassing noise suppression for the one or more of the plurality of voiced-frames.
    Type: Grant
    Filed: October 22, 2014
    Date of Patent: November 28, 2017
    Assignee: GM Global Technology Operations LLC
    Inventors: Gaurav Talwar, Xufang Zhao, III, Robert D. Sims, III, Md Foezur Rahman Chowdhury
  • Patent number: 9775110
    Abstract: Methods and apparatus for reducing power consumption in a wireless device operating in a discontinuous transmission (DTX) mode while using a voice over Long Term Evolution (VoLTE) service. The wireless device detects a period of voice inactivity and transmits one or more silence descriptor (SID_UPDATE) frames to a second wireless device in place of encoded speech frames. The SID_UPDATE frames are transmitted periodically based on measurements of comfort noise parameters. The wireless device determines a difference between weighted averages of comfort noise (CN) parameters of two sequences of encoded speech frames. When the difference exceeds a difference threshold, a SID_UPDATE frame is transmitted. Additionally, in some embodiments, a SID_UPDATE frame is transmitted when the weighted average of CN parameters exceeds a parameter threshold and/or when a time between SID_UPDATE frames or time elapsed after entering a silence state exceeds one or more time thresholds.
    Type: Grant
    Filed: September 25, 2014
    Date of Patent: September 26, 2017
    Assignee: Apple Inc.
    Inventors: Sreevalsan Vallath, Carola Faronius
  • Patent number: 9773505
    Abstract: An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a different coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MDCT-based coding scheme, when switching occurs between the MDCT-based coder and the different coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
    Type: Grant
    Filed: September 18, 2009
    Date of Patent: September 26, 2017
    Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
    Inventors: Seung Kwon Beack, Tae Jin Lee, Min Je Kim, Dae Young Jang, Kyeongok Kang, Jin Woo Hong, Ho Chong Park, Young-cheol Park
  • Patent number: 9734835
    Abstract: A voice decoding apparatus includes an MBE-type decoder, a sampling convertor, a non-linear components generator and an adder. The decoder decodes digital voice-encoded information to generate a first decoded voice signal. The convertor converts the first decoded voice signal to a second decoded voice signal with a higher sampling frequency. The generator performs a non-linear process to the first or second decoded voice signal to generate an additional voice signal with the same sampling frequency as the second decoded voice signal. The additional voice signal has components in a frequency band in which the first decoded voice signal has no component and continuing to another frequency band of the first decoded voice signal. The adder adds the second decoded voice signal to the additional voice signal.
    Type: Grant
    Filed: February 5, 2015
    Date of Patent: August 15, 2017
    Assignee: Oki Electric Industry Co., Ltd.
    Inventor: Masaru Fujieda
  • Patent number: 9715883
    Abstract: In an embodiment, bitstream elements of sub-frames are encoded differentially to a global gain value so that a change of the global gain value results in an adjustment of an output level of the decoded representation of the audio content. Concurrently, the differential coding saves bits. Even further, the differential coding enables the lowering of the burden of globally adjusting the gain of an encoded bitstream. In another embodiment, a global gain control across CELP coded frames and transform coded frames is achieved by co-controlling the gain of the codebook excitation of the CELP codec, along with a level of the transform or inverse transform of the transform coded frames. In another embodiment, the gain value determination in CELP coding is performed in the weighted domain of the excitation signal.
    Type: Grant
    Filed: May 12, 2016
    Date of Patent: July 25, 2017
    Assignee: Fraundhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Ralf Geiger, Guillaume Fuchs, Markus Multrus, Bernhard Grill
  • Patent number: 9711157
    Abstract: An encoder for providing an audio stream on the basis of a transform-domain representation of an input audio signal includes a quantization error calculator configured to determine a multi-band quantization error over a plurality of frequency bands of the input audio signal for which separate band gain information is available. The encoder also includes an audio stream provider for providing the audio stream such that the audio stream includes information describing an audio content of the frequency bands and information describing the multi-band quantization error. A decoder for providing a decoded representation of an audio signal on the basis of an encoded audio stream representing spectral components of frequency bands of the audio signal includes a noise filler for introducing noise into spectral components of a plurality of frequency bands to which separate frequency band gain information is associated on the basis of a common multi-band noise intensity value.
    Type: Grant
    Filed: December 24, 2014
    Date of Patent: July 18, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Nikolaus Rettelbach, Bernhard Grill, Guillaume Fuchs, Stefan Geyersberger, Markus Multrus, Harald Popp, Juergen Herre, Stefan Wabnik, Gerald Schuller, Jens Hirschfeld
  • Patent number: 9614488
    Abstract: An information processing apparatus includes an audio processing unit that performs a predetermined process for an input audio signal, and a power saving control unit that allows the audio processing unit to transition into a power saving state, depending on the length of a silence section in the audio signal.
    Type: Grant
    Filed: March 20, 2015
    Date of Patent: April 4, 2017
    Assignee: SONY CORPORATION
    Inventors: Takashi Kobayashi, Tomohiro Katori
  • Patent number: 9515748
    Abstract: A decoding apparatus includes a sound code input unit that receives a sound code output from an encoding apparatus through a sound wave reception device; a frame division unit that divides the sound code depending on a predetermined time interval to generate a plurality of frames; a frequency identification unit that identifies a frequency corresponding to each of the plurality of the frames through frequency analysis for each of the plurality of the frames; and an information generation unit that determines a frequency band, to which each of the identified frequencies corresponds, from an audible sound wave frequency band and a non-audible sound wave frequency band, and a plurality of partial information based on the frequency band and each of the identified frequencies, and generates information corresponding to the sound code based on the plurality of the partial information.
    Type: Grant
    Filed: September 24, 2014
    Date of Patent: December 6, 2016
    Assignee: POWERVOICE CO., LTD.
    Inventors: Hee Suk Jeong, Se Hun Chin, Hyung Yup Lee, Jong Sang Tack
  • Patent number: 9454959
    Abstract: Speech recognition systems often process speech by employing models and analyzing audio data. An embodiment of the method and corresponding system described herein allow for passive monitoring of, for example, conversation between user(s) to determine context to use to prime model(s) for later speech recognition requests submitted to the speech recognition system. The embodiment improves the results of the speech recognition system by updating speech recognition model(s) with contextual information of the conversation. This increases the probability that the speech recognition system interprets the conversation to contextually relevant information.
    Type: Grant
    Filed: November 2, 2012
    Date of Patent: September 27, 2016
    Assignee: Nuance Communications, Inc.
    Inventors: Nils Lenke, William F. Ganong, III
  • Patent number: 9449609
    Abstract: Acoustic noise in an audio signal is reduced by calculating a speech probability presence (SPP) factor using minimum mean square error (MMSE). The SPP factor, which has a value typically ranging between zero and one, is modified or warped responsive to a value obtained from the evaluation of a sigmoid function, the shape of which is determined by a signal-to-noise ratio (SNR), which is obtained by an evaluation of the signal energy and noise energy output from a microphone over time. The shape and aggressiveness of the sigmoid function is determined using an extrinsically-determined SNR, not determined by the MMSE determination. The extrinsically-determined SNR is obtained from a long term history of previously-determined speech presence probabilities and a long term history of previously-determined noise histories.
    Type: Grant
    Filed: November 7, 2013
    Date of Patent: September 20, 2016
    Assignee: Continental Automotive Systems, Inc.
    Inventors: Guillaume Lamy, Bijal Joshi
  • Patent number: 9424854
    Abstract: A method for processing audio data includes determining a first common scalefactor value for representing quantized audio data in a frame. A second common scalefactor value is determined for representing the quantized audio data in the frame. A line equation common scalefactor value is determined from the first and second common scalefactor values.
    Type: Grant
    Filed: October 7, 2013
    Date of Patent: August 23, 2016
    Assignee: Intel Corporation
    Inventors: Dmitry N. Budnikov, Igor Igor Chikalov, Sergey N. Zheltov
  • Patent number: 9424531
    Abstract: A machine learning system for evaluating at least one characteristic of a heart valve, an inflow tract, an outflow tract or a combination thereof may include a training mode and a production mode. The training mode may be configured to train a computer and construct a transformation function to predict an unknown anatomical characteristic and/or an unknown physiological characteristic of a heart valve, inflow tract and/or outflow tract, using a known anatomical characteristic and/or a known physiological characteristic the heart valve, inflow tract and/or outflow tract. The production mode may be configured to use the transformation function to predict the unknown anatomical characteristic and/or the unknown physiological characteristic of the heart valve, inflow tract and/or outflow tract, based on the known anatomical characteristic and/or the known physiological characteristic of the heart valve, inflow tract and/or outflow tract.
    Type: Grant
    Filed: April 7, 2015
    Date of Patent: August 23, 2016
    Assignee: STENOMICS, INC.
    Inventor: Michael A. Singer
  • Patent number: 9031619
    Abstract: A method includes, during operation of a mobile handset when a voice call is in progress with a remote party, detecting an occurrence of a start of a discontinuous transmission period that results from a cessation of speech by the remote party; and activating a visual indicator of the mobile handset to assume a first state that indicates that a speech signal is not being received from the remote party.
    Type: Grant
    Filed: September 30, 2010
    Date of Patent: May 12, 2015
    Assignee: Nokia Corporation
    Inventor: Francis Quiers
  • Patent number: 9020816
    Abstract: A method, system and apparatus are shown for identifying non-language speech sounds in a speech or audio signal. An audio signal is segmented and feature vectors are extracted from the segments of the audio signal. The segment is classified using a hidden Markov model (HMM) that has been trained on sequences of these feature vectors. Post-processing components can be utilized to enhance classification. An embodiment is described in which the hidden Markov model is used to classify a segment as a language speech sound or one of a variety of non-language speech sounds. Another embodiment is described in which the hidden Markov model is trained using discriminative learning.
    Type: Grant
    Filed: August 13, 2009
    Date of Patent: April 28, 2015
    Assignee: 21CT, Inc.
    Inventor: Matthew McClain
  • Patent number: 9009034
    Abstract: A Voice Activity Detection/Silence Suppression (VAD/SS) system is connected to a channel of a transmission pipe. The channel provides a pathway for the transmission of energy. A method for operating a VAD/SS system includes detecting the energy on the channel, and activating or suppressing activation of the VAD/SS system depending upon the nature of the energy detected on the channel.
    Type: Grant
    Filed: November 12, 2014
    Date of Patent: April 14, 2015
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Bing Chen, James H. James
  • Patent number: 8983851
    Abstract: A noise filler for providing a noise-filled spectral representation of an audio signal on the basis of an input spectral representation of the audio signal has a spectral region identifier configured to identify spectral regions of the input spectral representation spaced from non-zero spectral regions of the input spectral representation by at least one intermediate spectral region, to obtain identified spectral regions, and a noise inserter configured to selectively introduce noise into the identified spectral regions to obtain the noise-filled spectral representation of the audio signal. A noise filling parameter calculator for providing a noise filling parameter on the basis of a quantized spectral representation of an audio signal has a spectral region identifier, as mentioned above, and a noise value calculator configured to selectively consider quantization errors of the identified spectral regions for a calculation of the noise filling parameter.
    Type: Grant
    Filed: January 11, 2011
    Date of Patent: March 17, 2015
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Nikolaus Rettelbach, Bernhard Grill, Guillaume Fuchs, Stefan Geyersberger, Markus Multrus, Harald Popp, Juergen Herre, Stefan Wabnik, Gerald Schuller, Jens Hirschfeld
  • Patent number: 8977556
    Abstract: Embodiments of the present invention relate to a voice detector receiving an input signal that is divided into sub-signals that represent a frequency sub-band. The voice detector calculates, for each sub-band, a signal-to-noise (SNR) value based on a corresponding sub-signal for each sub-band and a background signal for each sub-band. The voice detector also calculates a power SNR value for each sub-band, where at least one of the power SNR values is calculated based on a non-linear function. The voice detector forms a single value based on the calculated power SNR values and compares the single value and a given threshold value to make a voice activity decision presented on an output port.
    Type: Grant
    Filed: March 26, 2012
    Date of Patent: March 10, 2015
    Assignee: Telefonaktiebolaget LM Ericsson (Publ)
    Inventor: Martin Sehlstedt
  • Patent number: 8954324
    Abstract: Voice activity detection using multiple microphones can be based on a relationship between an energy at each of a speech reference microphone and a noise reference microphone. The energy output from each of the speech reference microphone and the noise reference microphone can be determined. A speech to noise energy ratio can be determined and compared to a predetermined voice activity threshold. In another embodiment, the absolute value of the autocorrelation of the speech and noise reference signals are determined and a ratio based on autocorrelation values is determined. Ratios that exceed the predetermined threshold can indicate the presence of a voice signal. The speech and noise energies or autocorrelations can be determined using a weighted average or over a discrete frame size.
    Type: Grant
    Filed: September 28, 2007
    Date of Patent: February 10, 2015
    Assignee: QUALCOMM Incorporated
    Inventors: Song Wang, Samir Kumar Gupta, Eddie L. T. Choy
  • Patent number: 8942987
    Abstract: A clear picture of who is speaking in a setting where there are multiple input sources (e.g., a conference room with multiple microphones) can be obtained by comparing input channels against each other. The data from each channel can not only be compared, but can also be organized into portions which logically correspond to statements by a user. These statements, along with information regarding who is speaking, can be presented in a user friendly format via an interactive timeline which can be updated in real time as new audio input data is received.
    Type: Grant
    Filed: March 21, 2014
    Date of Patent: January 27, 2015
    Assignee: Jefferson Audio Video Systems, Inc.
    Inventors: Matthew David Bader, Nathan David Cole
  • Patent number: 8938389
    Abstract: A frame extracting means 71 extracts frames from sample data as voice data in which whether each frame is an active voice frame or a non-active voice frame is already known. A feature quantity calculating means 72 calculates multiple feature quantities of each of the frames. A feature quantity integrating means 73 calculates an integrated feature quantity of the multiple feature quantities. A judgment means 74 judges whether each of the frames is an active voice frame or a non-active voice frame. An erroneous feature quantity calculation value calculating means 75 obtains a first erroneous feature quantity calculation value and a second erroneous feature quantity calculation value by executing prescribed calculations. A weight updating means 76 updates weights used for weighting so that the rate between the first erroneous feature quantity calculation value and the second erroneous feature quantity calculation value approaches a prescribed value.
    Type: Grant
    Filed: December 7, 2009
    Date of Patent: January 20, 2015
    Assignee: NEC Corporation
    Inventors: Takayuki Arakawa, Masanori Tsujikawa
  • Patent number: 8930184
    Abstract: A signal bandwidth extending apparatus including: a bandwidth extending section configured to extend a frequency bandwidth of a target signal, the target signal included in an input signal; a calculating section configured to calculate a degree of the target signal included in the input signal; and a controller configured to change a method of extending the frequency bandwidth by the bandwidth extending section according to a result of the calculating section.
    Type: Grant
    Filed: September 14, 2009
    Date of Patent: January 6, 2015
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Takashi Sudo, Masataka Osada
  • Patent number: RE46388
    Abstract: There is provided an audio coding device which appropriately sets the quantization bit number by a small calculation amount in each stage when coding an input audio signal by performing multi-stage normalization/quantization. A quantization information calculation section determines total quantization information idwl0, based on normalization information idsf, and allocates the total quantization information idwl0 for quantization information idwl1 and quantization information idwl2. At this time, the quantization information calculation section limits the quantization information idwl1 by a limiter lim1, and allocates the total quantization information idwl0 for quantization information idwl1. If the quantization information idwl1 exceeds the limiter lim1, the excess is allocated for the quantization information idwl2. A first normalization section and a first quantization section normalizes and quantizes a frequency spectrum mdspec1 in the first stage.
    Type: Grant
    Filed: August 25, 2015
    Date of Patent: May 2, 2017
    Assignee: SONY CORPORATION
    Inventors: Yuuki Matsumura, Shiro Suzuki, Keisuke Toyama, Mitsuyuki Hatanaka, Yuhki Mitsufuji
  • Patent number: RE48272
    Abstract: There is provided an audio coding device which appropriately sets the quantization bit number by a small calculation amount in each stage when coding an input audio signal by performing multi-stage normalization/quantization. A quantization information calculation section determines total quantization information idwl0, based on normalization information idsf, and allocates the total quantization information idwl0 for quantization information idwl1 and quantization information idwl2. At this time, the quantization information calculation section limits the quantization information idwl1 by a limiter lim1, and allocates the total quantization information idwl0 for quantization information idwl1. If the quantization information idwl1 exceeds the limiter lim1, the excess is allocated for the quantization information idwl2. A first normalization section and a first quantization section normalizes and quantizes a frequency spectrum mdspec1 in the first stage.
    Type: Grant
    Filed: February 16, 2017
    Date of Patent: October 20, 2020
    Assignee: SONY CORPORATION
    Inventors: Yuuki Matsumura, Shiro Suzuki, Keisuke Toyama, Mitsuyuki Hatanaka, Yuhki Mitsufuji