Voiced Or Unvoiced Patents (Class 704/208)
  • Patent number: 11011166
    Abstract: In embodiments, a method includes: defining a plurality of visual icons; receiving a voice message, metadata and voice tagging criteria from a first client device; converting the voice message to text; generating a summary of the converted text; generating keywords based on the summary; categorizing the voice message into categories based on the keywords and the metadata; selecting visual icons based on the categories and the tagging criteria; tagging the voice message with the selected visual icons; and sending the tagged voice message to a second client device.
    Type: Grant
    Filed: November 29, 2018
    Date of Patent: May 18, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Si Bin Fan, Yu Gu, Su Liu, Yun Xiang
  • Patent number: 11010491
    Abstract: A security code input may be obfuscated from a thermal imaging device by randomly heating a random set of inputs of an input device. The security code is inputted on an input device, which communicates with a security system to grant or deny access to a user based on an entry of the security code. The input device includes a plurality of heating elements. The input device may receive an input from the user. A random set of heating elements, including one or more heating elements, is generated from the plurality of heating elements. A temperature is determined for the one or more heating elements of the random set of heating elements. The temperature is then applied to the one or more heating elements of the random set of heating elements of the input device.
    Type: Grant
    Filed: April 9, 2019
    Date of Patent: May 18, 2021
    Assignee: International Business Machines Corporation
    Inventor: Brendan Murray
  • Patent number: 10984810
    Abstract: An audio decoder provides a decoded audio information on the basis of an encoded audio information including linear prediction coefficients (LPC) and includes a tilt adjuster to adjust a tilt of a noise using linear prediction coefficients of a current frame to acquire a tilt information and a noise inserter configured to add the noise to the current frame in dependence on the tilt information. Another audio decoder includes a noise level estimator to estimate a noise level for a current frame using a linear prediction coefficient of at least one previous frame to acquire a noise level information; and a noise inserter to add a noise to the current frame in dependence on the noise level information provided by the noise level estimator. Thus, side information about a background noise in the bit-stream may be omitted. Methods and computer programs serve a similar purpose.
    Type: Grant
    Filed: February 26, 2019
    Date of Patent: April 20, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Christian Helmrich, Manuel Jander, Benjamin Schubert, Yoshikazu Yokotani
  • Patent number: 10978083
    Abstract: A wireless audio system for encoding and decoding an audio signal using spectral bandwidth replication is provided. Bandwidth extension is performed in the time-domain, enabling low-latency audio coding.
    Type: Grant
    Filed: November 13, 2019
    Date of Patent: April 13, 2021
    Assignee: Shure Acquisition Holdings, Inc.
    Inventors: Wenshun Tian, Michael Ryan Lester
  • Patent number: 10923136
    Abstract: A speech extraction method based on the supervised learning auditory attention includes: converting an original overlapping speech signal into a two-dimensional time-frequency signal representation by a short-time Fourier transform to obtain a first overlapping speech signal; performing a first sparsification on the first overlapping speech signal, mapping intensity information of a time-frequency unit of the first overlapping speech signal to preset D intensity levels, and performing a second sparsification on the first overlapping speech signal based on information of the preset D intensity levels to obtain a second overlapping speech signal; converting the second overlapping speech signal into a pulse signal by a time coding method; extracting a target pulse from the pulse signal by a trained target pulse extraction network; converting the target pulse into a time-frequency representation of the target speech to obtain the target speech by an inverse short-time Fourier transform.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: February 16, 2021
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jiaming Xu, Yating Huang, Bo Xu
  • Patent number: 10878835
    Abstract: Techniques for reducing the time used to output audio or video include increasing the output rate for portions of the content to reduce the time needed for a user to consume the content. Audio data is analyzed to determine portions that are silent and portions that include speech, such as by determining amplitude and other signal characteristics. Portions that include speech are analyzed to determine an initial rate of speech associated with each portion. Speech characteristics of the portions, such as language, volume, and pitch are used to determine a target maximum comprehensible rate of speech for a user. When audio output is generated, the portions that represent speech are each output at a rate that results in the speech being presented within a threshold of the target rate. The portions that represent silence may be removed, shortened, or output at a faster rate than the portions that represent speech.
    Type: Grant
    Filed: November 16, 2018
    Date of Patent: December 29, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Pedro Rodrigues Nacione Pedruzzi, Marcos Vinicius Romero, Gabriel Pereira Borges, Arthur Silva Freire, Rafael Jessen Werneck De Almeida Martins
  • Patent number: 10861458
    Abstract: A response sentence generation apparatus includes a conversion device for converting an input voice of a user into text information, an extraction device for extracting prosodic information from the input voice, a specifying device for specifying an emotion occurrence word indicating an occurrence of an emotion of the user based on the text information and the prosodic information, and a generation device for selecting a character string including the specified emotion occurrence word from the text information and generating a response sentence by performing predetermined processing on the selected character string.
    Type: Grant
    Filed: November 9, 2018
    Date of Patent: December 8, 2020
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventor: Narimasa Watanabe
  • Patent number: 10839854
    Abstract: Methods, apparatus, systems and articles of manufacture to perform speed-enhanced playback of recorded media are disclosed. Example media playback devices disclosed herein include a target frames calculator to determine a target number of frames of recorded media to drop during playback of the recorded media based on a target playback rate and a total number of frames of the recorded media. Disclosed example media playback devices also include a frames selector to select a subset of the frames of the recorded media to drop during the playback of the recorded media, the frames selector to select the subset of the frames based on the target number of frames to drop and skip bytes included in the subset of frames.
    Type: Grant
    Filed: April 6, 2018
    Date of Patent: November 17, 2020
    Assignee: The Nielsen Company (US), LLC
    Inventor: Venugopal Srinivasan
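
    Illustration (not taken from the patent): the abstract above derives a target number of frames to drop from a target playback rate and the total frame count. A minimal sketch of that arithmetic follows, assuming a rate of 1.0 means normal speed. The function names are hypothetical, and the evenly spaced selection is only a stand-in for the patent's skip-byte-based frame selection, which the abstract does not detail.

```python
def target_frames_to_drop(total_frames, playback_rate):
    """Number of frames to drop so playback finishes playback_rate times faster.

    Assumption: playback_rate >= 1.0, where 1.0 is normal speed.
    """
    if total_frames < 0 or playback_rate < 1.0:
        raise ValueError("need total_frames >= 0 and playback_rate >= 1.0")
    frames_to_keep = round(total_frames / playback_rate)
    return total_frames - frames_to_keep


def evenly_spaced_drop_indices(total_frames, n_drop):
    """Pick n_drop frame indices spread evenly over the recording.

    Stand-in selection rule (hypothetical): the patent selects frames using
    skip bytes, which is not modeled here.
    """
    if n_drop <= 0:
        return []
    step = total_frames / n_drop
    return [int(i * step + step / 2) for i in range(n_drop)]


if __name__ == "__main__":
    n_drop = target_frames_to_drop(1000, 1.25)
    print(n_drop, evenly_spaced_drop_indices(1000, n_drop)[:5])
```

    Dropping 200 of 1000 frames leaves 800, which play back in 1/1.25 of the original time.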
  • Patent number: 10819485
    Abstract: A channel information transmission apparatus and method, and a system are disclosed to improve channel information feedback precision. A first device includes: a receiving module, configured to receive a reference signal from a second device; a processing module, configured to obtain a precoding matrix based on the reference signal, and generate first channel information and second channel information that are used to indicate the precoding matrix; and a sending module, configured to send the first channel information and the second channel information to the second device.
    Type: Grant
    Filed: January 29, 2019
    Date of Patent: October 27, 2020
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Di Zhang, Kunpeng Liu
  • Patent number: 10789941
    Abstract: Techniques are provided for efficient acoustic event detection with reduced resource consumption. A methodology implementing the techniques according to an embodiment includes calculating frames of power spectra based on segments of received acoustic signals. The method further includes two processes, one for detecting impulsive acoustic events and another for detecting continuous acoustic events. The first process includes generating impulsive acoustic event features associated with first and second power spectrum frames, applying a neural network classifier to the impulsive acoustic event features to generate event scores, and detecting an impulsive acoustic event based on those event scores.
    Type: Grant
    Filed: September 28, 2018
    Date of Patent: September 29, 2020
    Assignee: Intel Corporation
    Inventors: Kuba Lopatka, Mateusz Kotarski, Tobias Bocklet, Marek Zabkiewicz
  • Patent number: 10762915
    Abstract: A headphone system is provided that includes a first earpiece, a first front microphone coupled to the first earpiece to receive a first acoustic signal, a first rear microphone coupled to the first earpiece to receive a second acoustic signal, the second acoustic signal being toward the rear of a user's head relative to the first acoustic signal, and a detection circuit coupled to the first front and rear microphones and configured to compare a front signal derived from the first front microphone to a rear signal derived from the first rear microphone and to selectively indicate that the user is speaking based at least in part upon the comparison.
    Type: Grant
    Filed: June 17, 2019
    Date of Patent: September 1, 2020
    Assignee: BOSE CORPORATION
    Inventors: Xiang-Ern Yeo, Mehmet Ergezer, Alaganandan Ganeshkumar
  • Patent number: 10636436
    Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the original input narrowband audio signal. Other embodiments are disclosed.
    Type: Grant
    Filed: July 27, 2018
    Date of Patent: April 28, 2020
    Assignee: Staton Techiya, LLC
    Inventors: John Usher, Dan Ellis
  • Patent number: 10535364
    Abstract: A head-mounted wearable device incorporates a transducer that operates as a bone conduction (BC) microphone. Vibrations from a user's speech are transferred through the head of the user to the BC microphone. An air conduction (AC) microphone detects sound transferred via air. Signals from the BC microphone and the AC microphone are compared to determine if a common signal is present in both. For example, both signals may have a cross-correlation that exceeds a threshold value. Based on the comparison, voice activity data is generated that indicates the user wearing the device is speaking.
    Type: Grant
    Filed: September 8, 2016
    Date of Patent: January 14, 2020
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Xuan Zhong, Bozhao Tan, Jianchun Dong, Chia-Jean Wang
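
    Illustration (not taken from the patent): the abstract above compares the bone conduction and air conduction signals and reports speech when their cross-correlation exceeds a threshold. A minimal sketch of that comparison follows, assuming time-aligned, equal-length sample frames and a zero-lag normalized correlation. The function names and the threshold of 0.6 are hypothetical.

```python
import math


def normalized_correlation(a, b):
    """Zero-lag normalized cross-correlation of two equal-length sample lists."""
    if len(a) != len(b) or not a:
        raise ValueError("signals must be non-empty and of equal length")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    da = [x - mean_a for x in a]
    db = [y - mean_b for y in b]
    num = sum(x * y for x, y in zip(da, db))
    den = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    return num / den if den > 0 else 0.0


def user_is_speaking(bc_frame, ac_frame, threshold=0.6):
    """True when the BC and AC frames share a common signal (assumed threshold)."""
    return normalized_correlation(bc_frame, ac_frame) > threshold


# Synthetic frames: the wearer's speech reaches both microphones,
# while an unrelated ambient tone reaches only the air microphone.
speech = [math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
ambient = [math.cos(2 * math.pi * 23 * n / 100) for n in range(100)]
bc = [0.5 * s for s in speech]
ac_with_speech = [0.8 * s + 0.05 * a for s, a in zip(speech, ambient)]
ac_ambient_only = ambient

if __name__ == "__main__":
    print(user_is_speaking(bc, ac_with_speech))
    print(user_is_speaking(bc, ac_ambient_only))
```

    A practical detector would likely search over a range of lags, since bone and air paths have different delays; the zero-lag form is used here only to keep the sketch short.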
  • Patent number: 10510361
    Abstract: An audio processing apparatus is provided that includes an acquirer that acquires a surrounding audio signal indicating a sound surrounding a user. The audio processing apparatus also includes an audio extractor that extracts, from the acquired surrounding audio signal, a providing audio signal indicating a sound to be provided to the user. The audio processing apparatus further includes an output that outputs a first audio signal, indicating a main sound, and the providing audio signal.
    Type: Grant
    Filed: March 3, 2016
    Date of Patent: December 17, 2019
    Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.
    Inventor: Kazuya Nomura
  • Patent number: 10475471
    Abstract: In accordance with embodiments of the present disclosure, an integrated circuit for implementing at least a portion of an audio device may include an audio output configured to reproduce audio information by generating an audio output signal for communication to at least one transducer of the audio device, a microphone input configured to receive an input signal indicative of ambient sound external to the audio device, and a processor configured to implement an impulsive noise detector. The impulsive noise detector may comprise a plurality of processing blocks for determining a feature vector based on characteristics of the input signal and a neural network for determining based on the feature vector whether the impulsive event comprises a speech event or a noise event.
    Type: Grant
    Filed: May 1, 2017
    Date of Patent: November 12, 2019
    Assignee: Cirrus Logic, Inc.
    Inventor: Samuel Pon Varma Ebenezer
  • Patent number: 10453469
    Abstract: A signal processor comprising: a modelling block, configured to receive a frequency-domain-input-signal, a fundamental-frequency-signal representative of a fundamental frequency of the frequency-domain-input-signal; and configured to provide a pitch-model-signal based on a periodic function, the pitch-model-signal spanning a plurality of discrete frequency bins, each discrete frequency bin having a respective discrete frequency bin index, wherein within each discrete frequency bin the pitch-model-signal is defined by: the periodic function; the fundamental frequency; the frequency-domain-input-signal; and the respective discrete frequency bin index. The signal processor further comprises a manipulation block, configured to provide an output-signal based on the frequency-domain-input-signal and the pitch-model-signal.
    Type: Grant
    Filed: March 26, 2018
    Date of Patent: October 22, 2019
    Assignee: NXP B.V.
    Inventors: Nilesh Madhu, Wouter Joos Tirry
  • Patent number: 10431232
    Abstract: A method and an apparatus for synthesizing an audio signal are described. A spectral tilt is applied to the code of a codebook used for synthesizing a current frame of the audio signal. The spectral tilt is based on the spectral tilt of the current frame of the audio signal. Further, an audio decoder operating in accordance with the inventive approach is described.
    Type: Grant
    Filed: July 28, 2015
    Date of Patent: October 1, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Tom Baeckstroem, Ralf Geiger, Wolfgang Jaegers, Emmanuel Ravelli
  • Patent number: 10431241
    Abstract: A speech enhancement method is provided. The speech enhancement method includes: estimating a direction of a speaker by using an input signal, generating direction information indicating the estimated direction, detecting speech of a speaker based on a result of the estimating the direction, and enhancing the speech of the speaker by using the direction information based on a result of the detecting the speech.
    Type: Grant
    Filed: May 30, 2014
    Date of Patent: October 1, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jae-youn Cho, Weiwei Cui, Seung-yeol Lee
  • Patent number: 10388302
    Abstract: A method for processing an initial signal that includes a useful signal and added noise comprises a step of frequency-selective analysis that provides, starting from the initial signal, a plurality of wideband analysis signals, each corresponding to one of the analyzed frequencies. The analysis comprises the following actions: zero or more complex frequency translations, one or more undersampling operations, and computation of the instantaneous amplitude, instantaneous phase, and instantaneous frequency of the wideband analysis signals. This information then makes it possible to detect modulations of signals buried in high levels of noise and to detect, with good probability, the presence of a signal in a high level of noise.
    Type: Grant
    Filed: December 23, 2015
    Date of Patent: August 20, 2019
    Inventor: Yves Reza
  • Patent number: 10373608
    Abstract: A sound recognition system including time-dependent analog filtered feature extraction and sequencing. An analog front end (AFE) in the system receives input analog signals, such as signals representing an audio input to a microphone. Features in the input signal are extracted, by measuring such attributes as zero crossing events and total energy in filtered versions of the signal with different frequency characteristics at different times during the audio event. In one embodiment, a tunable analog filter is controlled to change its frequency characteristics at different times during the event. In another embodiment, multiple analog filters with different filter characteristics filter the input signal in parallel, and signal features are extracted from each filtered signal; a multiplexer selects the desired features at different times during the event.
    Type: Grant
    Filed: October 22, 2015
    Date of Patent: August 6, 2019
    Assignee: TEXAS INSTRUMENTS INCORPORATED
    Inventors: Zhenyong Zhang, Wei Ma
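
    Illustration (not taken from the patent): the abstract above extracts features such as zero crossing events and total energy at different times during an audio event. The patent does this on analog filtered signals; the sketch below is a digital stand-in on sampled data, with hypothetical function names and a fixed frame length as an assumption.

```python
def zero_crossings(samples):
    """Count sign changes between consecutive samples."""
    return sum(
        1 for prev, cur in zip(samples, samples[1:]) if (prev < 0) != (cur < 0)
    )


def total_energy(samples):
    """Sum of squared sample values."""
    return sum(x * x for x in samples)


def frame_features(samples, frame_len):
    """Return a (zero_crossings, total_energy) pair for each full frame.

    Trailing samples that do not fill a whole frame are ignored.
    """
    features = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        features.append((zero_crossings(frame), total_energy(frame)))
    return features


if __name__ == "__main__":
    signal = [1, -1, 1, -1, 2, 2, 2, 2]
    print(frame_features(signal, 4))
```

    A high zero-crossing count with low energy is the classic signature of unvoiced sound, while voiced sound tends to show the opposite, which is why these two features are common in this patent class.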
  • Patent number: 10332535
    Abstract: An audio encoder for encoding an audio signal has: a first encoding processor for encoding a first audio signal portion in a frequency domain, having: a time frequency converter for converting the first audio signal portion into a frequency domain representation; an analyzer for analyzing the frequency domain representation to determine first spectral portions to be encoded with a first spectral resolution and second regions to be encoded with a second resolution; and a spectral encoder for encoding the first spectral portions with the first spectral resolution and encoding the second portions with the second resolution; a second encoding processor for encoding a second different audio signal portion in the time domain; a controller for analyzing and determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal havi
    Type: Grant
    Filed: January 24, 2017
    Date of Patent: June 25, 2019
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Martin Dietz, Markus Multrus, Guillaume Fuchs, Emmanuel Ravelli, Matthias Neusinger, Markus Schnell, Benjamin Schubert, Bernhard Grill
  • Patent number: 10311883
    Abstract: A transient detector (100) analyzes (110) a given frame n of the input audio signal to determine, based on audio signal characteristics of the given frame n, a transient hangover indicator for a following frame n+1, and signals (120) the determined transient hangover indicator to an associated audio encoder (10) to enable proper encoding of the following frame n+1.
    Type: Grant
    Filed: October 18, 2016
    Date of Patent: June 4, 2019
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Anisse Taleb, Gustaf Ullberg
  • Patent number: 10269365
    Abstract: An audio decoder provides a decoded audio information on the basis of an encoded audio information including linear prediction coefficients (LPC) and includes a tilt adjuster to adjust a tilt of a noise using linear prediction coefficients of a current frame to acquire a tilt information and a noise inserter configured to add the noise to the current frame in dependence on the tilt information. Another audio decoder includes a noise level estimator to estimate a noise level for a current frame using a linear prediction coefficient of at least one previous frame to acquire a noise level information; and a noise inserter to add a noise to the current frame in dependence on the noise level information provided by the noise level estimator. Thus, side information about a background noise in the bit-stream may be omitted. Methods and computer programs serve a similar purpose.
    Type: Grant
    Filed: July 28, 2015
    Date of Patent: April 23, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Christian Helmrich, Manuel Jander, Benjamin Schubert, Yoshikazu Yokotani
  • Patent number: 10249288
    Abstract: An approach is provided that assists visually impaired users. The approach analyzes a document that is being utilized by the visually impaired user. The analysis derives a sensitivity of the document. A vocal characteristic corresponding to the derived sensitivity is retrieved. Text from the document is audibly read to the visually impaired user with a text to speech process that utilizes the retrieved vocal characteristic. The retrieved vocal characteristic conveys the derived sensitivity of the document to the visually impaired user.
    Type: Grant
    Filed: June 16, 2018
    Date of Patent: April 2, 2019
    Assignee: International Business Machines Corporation
    Inventors: Maureen E. Kraft, Fang Lu, Azadeh Salehi, Weisong Wang
  • Patent number: 10248379
    Abstract: A process at an electronic computing device (device) for automatic and selective context-based gating of an electronic digital assistant speech-output function includes detecting, via a sensor communicatively coupled to the device or via a notification received at the device, an event associated with a need to reduce a cognitive load on the user. Responsively, temporarily gating a portion of an electronic digital assistant speech-output function to prevent reproduction to the user of one or more first speech-output events generated during a first period of time associated with the event. Subsequently, and after detecting that the event has ended, ending the temporary gating to allow reproduction of one or more second speech-output events generated during a second period of time after the event has ended.
    Type: Grant
    Filed: July 27, 2017
    Date of Patent: April 2, 2019
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventor: Paul D. Steinberg
  • Patent number: 10242696
    Abstract: In accordance with embodiments of the present disclosure, an integrated circuit for implementing at least a portion of an audio device may include an audio output configured to reproduce audio information by generating an audio output signal for communication to at least one transducer of the audio device, a microphone input configured to receive an input signal indicative of ambient sound external to the audio device and a processor configured to implement an impulsive noise detector. The impulsive noise detector may include a sudden onset detector for predicting an occurrence of a signal burst event of the input signal and an impulsive detector for determining whether the signal burst event comprises a speech event or a noise event.
    Type: Grant
    Filed: October 11, 2016
    Date of Patent: March 26, 2019
    Assignee: Cirrus Logic, Inc.
    Inventor: Samuel Pon Varma Ebenezer
  • Patent number: 10224049
    Abstract: An encoding apparatus is an encoding apparatus for encoding a time-series signal for each of predetermined time sections in a frequency domain, wherein a parameter η is a positive number, the parameter η corresponding to a time-series signal is a shape parameter of generalized Gaussian distribution that approximates a histogram of a whitened spectral sequence, which is a sequence obtained by dividing a frequency domain sample sequence corresponding to the time-series signal by a spectral envelope estimated by regarding the η-th power of absolute values of the frequency domain sample sequence as a power spectrum, and any of a plurality of parameters η is selective or the parameter η is variable for each of the predetermined time sections; and the encoding apparatus comprises an encoding portion encoding the time-series signal for each of the predetermined time sections by an encoding process with a configuration identified at least based on the parameter η for each of the predetermined time sections.
    Type: Grant
    Filed: January 27, 2016
    Date of Patent: March 5, 2019
    Assignees: NIPPON TELEGRAPH AND TELEPHONE CORPORATION, The University of Tokyo
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada, Takahito Kawanishi, Hirokazu Kameoka, Ryosuke Sugiura
  • Patent number: 10210886
    Abstract: A voice starting end detection apparatus includes a first detector that detects a starting end of a voice segment from input signals that are input in a time series, a first transmitting unit that transmits, when the starting end is detected, input signals subsequent to the starting end, and a first receiving unit that receives a terminal end detection signal indicating that a terminal end of the voice segment has been detected. The voice terminal end detection apparatus includes a second receiving unit that receives input signals subsequent to the starting end, a second detector that detects the terminal end from the received input signals, a second transmitting unit that transmits, when the terminal end is detected, the terminal end detection signal. The first transmitting unit stops transmitting the input signals when the first receiving unit receives the terminal end detection signal.
    Type: Grant
    Filed: December 27, 2016
    Date of Patent: February 19, 2019
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Shoko Miyamori, Mitsuyoshi Tachimori, Kouji Ueno
  • Patent number: 10134373
    Abstract: Apparatus, methods, and systems that operate to perform machine-control of a device based on machine-detected transitions are disclosed.
    Type: Grant
    Filed: September 9, 2015
    Date of Patent: November 20, 2018
    Assignee: GRACENOTE, INC.
    Inventors: Michael Jeffrey, Markus K. Cremer, Dong-In Lee
  • Patent number: 10102862
    Abstract: Embodiments of the present disclosure provide a decoding method and a decoding apparatus. The decoding method includes: in a case in which it is determined that a current frame is a lost frame, synthesizing a high frequency band signal; determining subframe gains of multiple subframes of the current frame; determining a global gain of the current frame; and adjusting, according to the global gain and the subframe gains of the multiple subframes, the synthesized high frequency band signal to obtain a high frequency band signal of the current frame. A subframe gain of the current frame is obtained according to a gradient between subframe gains of subframes previous to the current frame, so that transition before and after frame loss is more continuous, thereby reducing noise during signal reconstruction, and improving speech quality.
    Type: Grant
    Filed: December 31, 2015
    Date of Patent: October 16, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Bin Wang, Lei Miao, Zexin Liu
  • Patent number: 10091583
    Abstract: A home audio system that includes an audio receiver and one or more loudspeaker arrays is described. The audio receiver measures the acoustic properties of the room in which the loudspeaker arrays reside and the audio characteristics of the sound program content to be played through the loudspeaker arrays. Based on these measurements, the audio receiver assigns a directivity ratio and potentially various beam patterns to one or more segments of the sound program content. The assigned directivity ratio is used by the receiver to play the segment of the sound program content through the loudspeaker arrays. Other embodiments are also described.
    Type: Grant
    Filed: March 6, 2014
    Date of Patent: October 2, 2018
    Assignee: Apple Inc.
    Inventor: Tomlinson M. Holman
  • Patent number: 10044521
    Abstract: The present disclosure provides for devices, systems, and methods which optimize throughput of bonded connections over multiple variable bandwidth logical paths by adjusting a tunnel bandwidth weighting schema during a data transfer session in response to a change in bandwidth capabilities of one or more tunnels. By making such adjustments, embodiments of the present invention are able to optimize the bandwidth potential of multiple connections being used in a session, while minimizing the adverse consequences of reduced bandwidth issues which may occur during the data transfer session.
    Type: Grant
    Filed: February 27, 2017
    Date of Patent: August 7, 2018
    Assignee: PISMO LABS TECHNOLOGY LIMITED
    Inventors: Patrick Ho Wai Sung, Kam Chiu Ng, Ho Ming Chan
  • Patent number: 10037766
    Abstract: An apparatus for generating a bandwidth extended signal includes an anti-sparseness processing unit to perform anti-sparseness processing on a low-frequency spectrum; and a frequency domain high-frequency extension decoding unit to perform high-frequency extension encoding in the frequency domain on the low-frequency spectrum on which the anti-sparseness processing is performed.
    Type: Grant
    Filed: August 14, 2017
    Date of Patent: July 31, 2018
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Ki-hyun Choo
  • Patent number: 9978398
    Abstract: Disclosed are a voice activity detection method and device, wherein the method includes: obtaining a combined VAD judgment result according to the number of continuous active frames, an average total SNR of all sub-bands, a tonal signal flag, and at least two existing VAD judgment results (107). The method and device are suitable for voice service, solve the problem of inaccurate VAD detection and realize highly accurate VAD judgment.
    Type: Grant
    Filed: May 16, 2014
    Date of Patent: May 22, 2018
    Assignee: ZTE Corporation
    Inventors: Changbao Zhu, Hao Yuan
  • Patent number: 9886963
    Abstract: A device includes a first classifier and a second classifier coupled to the first classifier. The first classifier is configured to output first decision data that indicates a classification of an audio frame as a speech frame or a non-speech frame, the first decision data determined based on first probability data associated with a first likelihood of the audio frame being the speech frame and based on second probability data associated with a second likelihood of the audio frame being the non-speech frame. The second classifier is configured to output second decision data based on the first probability data, the second probability data, and the first decision data, the second decision data includes an indication of a selection of a particular encoder of multiple encoders available to encode the audio frame.
    Type: Grant
    Filed: March 29, 2016
    Date of Patent: February 6, 2018
    Assignee: QUALCOMM Incorporated
    Inventors: Venkatraman S. Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam, Vivek Rajendran, Subasingha Shaminda Subasingha
  • Patent number: 9870779
    Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
    Type: Grant
    Filed: July 16, 2015
    Date of Patent: January 16, 2018
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
  • Patent number: 9852733
    Abstract: A device control method includes acquiring voice information, obtaining a spoken command indicating a control instruction as to a device based on the acquired voice information, identifying speaker information relating to a speaker which has uttered the acquired voice information, based on the acquired voice information, identifying, out of a plurality of devices, a device to be controlled, based on the spoken command and the speaker information, and controlling the identified device to be controlled.
    Type: Grant
    Filed: July 1, 2015
    Date of Patent: December 26, 2017
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventor: Mariko Yamada
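As a rough illustration of identifying the device from both the spoken command and the speaker, the sketch below uses a lookup table. The table contents, the speaker names, and the word-matching rule are assumptions made for this example; the abstract does not specify how the identification is performed.

```python
# Hypothetical registry: (speaker, device type named in the command) -> device id.
DEVICE_TABLE = {
    ("alice", "light"): "light-bedroom-1",
    ("bob", "light"): "light-bedroom-2",
    ("alice", "tv"): "tv-living-room",
}


def identify_device(speaker, spoken_command):
    """Return the device id for this speaker and command, or None if no match."""
    words = spoken_command.lower().split()
    for (owner, device_type), device_id in DEVICE_TABLE.items():
        if owner == speaker and device_type in words:
            return device_id
    return None
```

The same command ("turn on the light") resolves to a different device depending on who spoke it, which is the behavior the abstract describes.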
  • Patent number: 9842603
    Abstract: The present technology relates to an encoding device and an encoding method, a decoding device and a decoding method, and a program, configured to obtain high-quality audio with a smaller encoding amount. A number-of-sections determining feature amount calculating circuit calculates a number-of-sections determining feature amount for determining the number of divisions to divide a process target section into continuous frame sections each including a frame for which the same estimation coefficient is selected, based on sub-band signals of a plurality of sub-bands constituting an input signal.
    Type: Grant
    Filed: August 14, 2012
    Date of Patent: December 12, 2017
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen
  • Patent number: 9818411
    Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal, which may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristic signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.
    Type: Grant
    Filed: November 6, 2014
    Date of Patent: November 14, 2017
    Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
    Inventors: Tae Jin Lee, Seung Kwon Baek, Min Je Kim, Dae Young Jang, Jeongil Seo, Kyeongok Kang, Jin Woo Hong, Hochong Park, Young Cheol Park
  • Patent number: 9812168
    Abstract: The present disclosure provides an electronic device and method for playing image data. The method for playing back image data in an electronic device includes storing audiovisual (A/V) data for a predetermined period of time in a memory of the electronic device. The electronic device plays back the A/V data, wherein playing back the A/V data comprises dynamically analyzing an audio signal of the A/V data to select one of a plurality of sound effects based on the analyzed audio signal. The A/V data is played back by applying the selected sound effect to at least a part of the A/V signal.
    Type: Grant
    Filed: February 16, 2016
    Date of Patent: November 7, 2017
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Doo-Hyun Kim, Jae-Hyun Kim, Byeong-Jun Kim, Sang-Soo Park, Jun-Soo Lee, Ho-Chul Hwang
  • Patent number: 9805732
    Abstract: Embodiments of the present application propose a frequency envelope vector quantization method and apparatus, where the method includes: dividing N frequency envelopes in one frame into N1 vectors; quantizing a first vector in the N1 vectors by using a first codebook, to obtain a code word corresponding to the quantized first vector, where the first codebook is divided into 2^B1 portions; determining, according to the code word corresponding to the quantized first vector, that the quantized first vector is associated with the ith portion of the 2^B1 portions of the first codebook; determining a second codebook according to the codebook of the ith portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.
    Type: Grant
    Filed: December 29, 2015
    Date of Patent: October 31, 2017
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Chen Hu, Lei Miao, Zexin Liu
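The idea of quantizing the second vector with a smaller codebook derived from one portion of the first codebook can be sketched as follows. This is a minimal illustration assuming that the second codebook is simply the portion that contains the first code word, that the codebook length is divisible by the number of portions, and that squared error is the distance measure; the abstract only says the second codebook is determined "according to" that portion, so these choices and all function names are hypothetical.

```python
def nearest(codebook, vector):
    """Return the index of the code word closest to `vector` (squared error)."""
    def dist(code_word):
        return sum((a - b) ** 2 for a, b in zip(code_word, vector))
    return min(range(len(codebook)), key=lambda k: dist(codebook[k]))


def quantize_pair(first_codebook, num_portions, v1, v2):
    """Quantize v1 with the full codebook, then v2 with only one portion of it."""
    portion_size = len(first_codebook) // num_portions
    idx1 = nearest(first_codebook, v1)
    i = idx1 // portion_size  # portion that holds the first code word
    second_codebook = first_codebook[i * portion_size:(i + 1) * portion_size]
    idx2 = nearest(second_codebook, v2)  # index is local to the portion
    return idx1, i, idx2
```

Because `idx2` only addresses one portion, it needs fewer bits than an index into the full codebook; with four code words split into two portions, the second index fits in one bit instead of two.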
  • Patent number: 9756185
    Abstract: A system and method for automated call analysis using context specific lexicons. A system includes memory and a processor configured to execute instructions. The system includes a recording component, a lexicon component, an analysis component, and a display component. The lexicon component defines a plurality of context specific lexicons, with each context specific lexicon having elements associated with one of a plurality of unique conversation segments. The analysis component is configured to identify elements of the context specific lexicons, and associate each identified element with a time location in a telephonic conversation. The display component is configured to graphically present a multi-line graph such that the intersections of the lines indicate transitions between the unique conversation segments.
    Type: Grant
    Filed: December 28, 2016
    Date of Patent: September 5, 2017
    Assignee: TETON1, LLC
    Inventors: Robert T. Madden, Jr., Christopher P. Derikart, Edward A. Donnelly
  • Patent number: 9747920
    Abstract: An echo cancellation system performs audio beamforming to separate audio input into multiple directions and determines a target signal and a reference signal from the multiple directions. For example, the system may detect a strong signal associated with a loudspeaker and select the strong signal as a reference signal, selecting another direction as a target signal. The system may determine a speech position and may select the speech position as a target signal and an opposite direction as a reference signal. The system may create pairwise combinations of opposite directions, with an individual direction being selected as a target signal and a reference signal. The system may select a fixed beamformer output for the target signal and an adaptive beamformer output for the reference signal, or vice versa. The system may remove the reference signal (e.g., audio output by the loudspeaker) to isolate speech included in the target signal.
    Type: Grant
    Filed: December 17, 2015
    Date of Patent: August 29, 2017
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Robert Ayrapetian, Philip Ryan Hilmes
  • Patent number: 9736047
    Abstract: The present invention discloses methods and systems carried out at a network device for reducing network congestion by establishing an aggregated connection, wherein the aggregated connection comprises a plurality of virtual private network (VPN) tunnels. The system further comprises assigning default weights to the plurality of VPN tunnels along with transmitting and receiving data packets of a data session through the aggregated connection. When there is a missing data packet in the received data packets, the missing data packet is recreated based on a previous or a next data packet. Furthermore, the present invention discloses methods and systems for determining whether a VPN tunnel from the plurality of VPN tunnels is experiencing an unacceptable packet drop rate. If a VPN tunnel is experiencing an unacceptable packet drop rate, an effective weight of the at least one VPN tunnel is decreased in order to overcome the setback.
    Type: Grant
    Filed: December 23, 2016
    Date of Patent: August 15, 2017
    Assignee: PISMO LABS TECHNOLOGY LIMITED
    Inventors: Ho Ming Chan, Patrick Ho Wai Sung, Kam Chiu Ng, Alex Wing Hong Chan, Kit Wai Chau
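The weight adjustment described in the abstract above could look roughly like the sketch below. The drop-rate limit (`max_drop_rate`), the reduction factor, and the dictionary-based bookkeeping are assumptions made for illustration; the abstract does not state what counts as "unacceptable" or by how much the weight is decreased.

```python
def adjust_weights(default_weights, sent, dropped, max_drop_rate=0.05, factor=0.5):
    """Return effective tunnel weights, reduced for tunnels dropping too many packets.

    default_weights: tunnel name -> default weight
    sent, dropped:   tunnel name -> packet counts observed on that tunnel
    """
    effective = {}
    for tunnel, weight in default_weights.items():
        n_sent = sent.get(tunnel, 0)
        # A tunnel with no traffic yet is treated as having a zero drop rate.
        rate = dropped.get(tunnel, 0) / n_sent if n_sent else 0.0
        effective[tunnel] = weight * factor if rate > max_drop_rate else weight
    return effective
```

A scheduler that distributes packets in proportion to the effective weights would then steer traffic away from the lossy tunnel while leaving the default weights untouched for later recovery.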
  • Patent number: 9721585
    Abstract: A processing buffer unit stores an audio signal. A pitch calculation unit and a pitch cycle correction unit calculate a multiple of N as the number of samples in a pitch cycle of the audio signal, in which N is an integer equal to or more than 1. A processing control unit and a start-position movement amount correction unit sequentially determine, as a sample in a start position of a compression process in a time axis domain of the audio signal, a (multiple of N)-th sample from a start position immediately before the start position. An operation unit compresses samples in a predetermined number times the pitch cycle from the sample in the start position in a time axis domain, and sets the number of samples after the compression to be the multiple of N. The present technology, for example, may be applied to an audio signal processing apparatus.
    Type: Grant
    Filed: May 24, 2012
    Date of Patent: August 1, 2017
    Assignee: Sony Corporation
    Inventors: Akira Inoue, Akihiro Mukai
  • Patent number: 9704504
    Abstract: The voice analysis device includes: a voice information acquiring unit that acquires a voice signal generated by plural voice acquiring units disposed at different distances from a speaking section of a speaker and acquiring voice of the speaker; and an identification unit that identifies the speaker corresponding to the voice having been acquired, on the basis of intensities of respective peaks in a frequency spectrum of a first enhanced waveform and a frequency spectrum of a second enhanced waveform. The first enhanced waveform is a waveform where a voice signal of a predetermined target speaker has been enhanced, and the second enhanced waveform is a waveform where a voice signal of a speaker other than the target speaker has been enhanced.
    Type: Grant
    Filed: July 6, 2015
    Date of Patent: July 11, 2017
    Assignee: FUJI XEROX CO., LTD.
    Inventors: Seiya Inagi, Haruo Harada, Hirohito Yoneyama, Kei Shimotani, Akira Fujii, Kiyoshi Iida
  • Patent number: 9685156
    Abstract: A low-power voice command detection method uses an audio monitoring device to capture sound. The captured sound is analyzed in steps to determine if it fulfills a number of criteria regarding sound level, voice content and identifiable voice commands. For each step the processing is more complex and power demanding. A threshold between the first and subsequent steps is used to gate further processing. This threshold is dynamically adjusted, based on the outcome of the analysis, to avoid unnecessary processing and increase system performance.
    Type: Grant
    Filed: March 12, 2015
    Date of Patent: June 20, 2017
    Assignee: Sony Mobile Communications Inc.
    Inventors: Henrik Borjeson, Magnus Abrahamsson
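The staged analysis with an adaptive gate can be sketched as follows. This is a minimal illustration, assuming a mean-absolute-level measure for the first step, caller-supplied callbacks for the voice and command steps, and a simple raise-on-false-trigger, lower-on-success update; none of these specifics are given in the abstract, and all names are hypothetical.

```python
class StagedDetector:
    """Three-stage detector whose level gate adapts to later-stage outcomes."""

    def __init__(self, is_voice, find_command, threshold=0.25, step=0.125,
                 min_threshold=0.125, max_threshold=1.0):
        self.is_voice = is_voice          # hypothetical stage-2 callback
        self.find_command = find_command  # hypothetical stage-3 callback
        self.threshold = threshold
        self.step = step
        self.min_threshold = min_threshold
        self.max_threshold = max_threshold

    def process(self, samples):
        """Analyze one non-empty block of samples; return a command or None."""
        # Stage 1 (cheapest): mean absolute level against the gate.
        level = sum(abs(s) for s in samples) / len(samples)
        if level < self.threshold:
            return None
        # Stages 2 and 3 (costlier): voice content, then command matching.
        command = self.find_command(samples) if self.is_voice(samples) else None
        if command is None:
            # False trigger: raise the gate so similar sounds are skipped.
            self.threshold = min(self.threshold + self.step, self.max_threshold)
        else:
            # Useful trigger: relax the gate slightly.
            self.threshold = max(self.threshold - self.step, self.min_threshold)
        return command
```

In this sketch, a noisy environment that repeatedly passes the level gate without producing a command pushes the threshold up, so the expensive stages run less often.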
  • Patent number: 9672813
    Abstract: Systems, devices and methods are provided for configuring matching rules related to voice input commands. For example, a first mapping relation between one or more first original terms in a preset term database and one or more first identification terms is established; the first mapping relation is stored in a first mapping relation table; one or more first voice input commands are configured for the first identification terms or one or more first statements including the first identification terms; and a second mapping relation between the first identification terms or the first statements and the first voice input commands is stored into a second mapping relation table.
    Type: Grant
    Filed: July 14, 2015
    Date of Patent: June 6, 2017
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventors: Pan Chu, Hao Xu, Fei Wang, Guiping Han, Ling Liao
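The two mapping relation tables can be illustrated with plain dictionaries. The terms, identification terms, and command names below are invented for the example, and the substring matching is an assumption; the abstract describes only the existence of the two tables, not how lookups are performed.

```python
# First mapping table: original term -> identification term (all entries hypothetical).
first_table = {"turn on": "ENABLE", "switch on": "ENABLE", "turn off": "DISABLE"}

# Second mapping table: identification term -> voice input command.
second_table = {"ENABLE": "cmd_power_on", "DISABLE": "cmd_power_off"}


def resolve_command(utterance):
    """Map a recognized utterance to a command through both tables."""
    text = utterance.lower()
    # Check longer terms first so that a longer term wins over its substrings.
    for term in sorted(first_table, key=len, reverse=True):
        if term in text:
            return second_table.get(first_table[term])
    return None
```

Keeping the tables separate means several original terms can share one identification term, so a new synonym only needs an entry in the first table.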
  • Patent number: 9640189
    Abstract: An apparatus for generating a frequency enhancement signal has: a calculator for calculating a value describing an energy distribution with respect to frequency in a core signal; and a signal generator for generating an enhancement signal having an enhancement frequency range not included in the core signal, from the core signal, wherein the signal generator is configured for shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal.
    Type: Grant
    Filed: July 28, 2015
    Date of Patent: May 2, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Ralf Geiger, Christian Helmrich, Markus Multrus, Konstantin Schmidt
  • Patent number: 9633663
    Abstract: An audio encoding apparatus includes an encoder for encoding a time segment of an input audio signal to be encoded to obtain a corresponding encoded signal segment. The audio encoding apparatus further includes a decoder for decoding the encoded signal segment to obtain a re-decoded signal segment. A clipping detector is provided for analyzing the re-decoded signal segment with respect to at least one of an actual signal clipping or a perceptible signal clipping and for generating a corresponding clipping alert. The encoder is further configured to again encode the time segment of the audio signal with at least one modified encoding parameter resulting in a reduced clipping probability in response to the clipping alert.
    Type: Grant
    Filed: June 13, 2014
    Date of Patent: April 25, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Albert Heuberger, Bernd Edler, Nikolaus Rettelbach, Stefan Geyersberger, Johannes Hilpert
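The encode, re-decode, check, and re-encode loop in the abstract above can be sketched with a toy quantizer. Everything here is an assumption made for illustration: the uniform quantizer stands in for a real codec, the gain is used as the "modified encoding parameter", and the step size, full-scale value, and retry limit are arbitrary.

```python
def encode(segment, gain, step=0.375):
    """Toy encoder: apply a gain, then quantize to integer multiples of `step`."""
    return [round(s * gain / step) for s in segment]


def decode(codes, step=0.375):
    """Toy decoder: map the integer codes back to sample values."""
    return [c * step for c in codes]


def clips(decoded, full_scale=1.0):
    """Clipping detector: does any re-decoded sample exceed full scale?"""
    return any(abs(x) > full_scale for x in decoded)


def encode_without_clipping(segment, gain=1.0, gain_factor=0.5, max_tries=8):
    """Re-encode with a lower gain until the re-decoded segment no longer clips."""
    for _ in range(max_tries):
        codes = encode(segment, gain)
        if not clips(decode(codes)):
            return codes, gain
        gain *= gain_factor  # modified encoding parameter for the next attempt
    raise ValueError("segment still clips after max_tries attempts")
```

The sketch shows why the check runs on the re-decoded signal: an input sample of exactly 1.0 is within full scale, yet quantization rounds it up to 1.125, which only becomes visible after decoding.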