Voiced Or Unvoiced Patents (Class 704/208)
  • Patent number: 10431241
    Abstract: A speech enhancement method is provided. The speech enhancement method includes: estimating a direction of a speaker by using an input signal, generating direction information indicating the estimated direction, detecting speech of a speaker based on a result of the estimating the direction, and enhancing the speech of the speaker by using the direction information based on a result of the detecting the speech.
    Type: Grant
    Filed: May 30, 2014
    Date of Patent: October 1, 2019
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jae-youn Cho, Weiwei Cui, Seung-yeol Lee
  • Patent number: 10431232
    Abstract: A method and an apparatus for synthesizing an audio signal are described. A spectral tilt is applied to the code of a codebook used for synthesizing a current frame of the audio signal. The spectral tilt is based on the spectral tilt of the current frame of the audio signal. Further, an audio decoder operating in accordance with the inventive approach is described.
    Type: Grant
    Filed: July 28, 2015
    Date of Patent: October 1, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Tom Baeckstroem, Ralf Geiger, Wolfgang Jaegers, Emmanuel Ravelli
  • Patent number: 10388302
    Abstract: A method for processing an initial signal that includes a useful signal and added noise comprises a step of frequency-selective analysis providing, starting from the initial signal, a plurality of wideband analysis signals, each corresponding to one of the analyzed frequencies, and comprising the following actions: zero or more complex frequency translations, one or more undersampling operations, and computation of the instantaneous amplitude, the instantaneous phase, and the instantaneous frequency of the wideband analysis signals. This information then makes it possible to detect modulations of signals embedded in high levels of noise and to detect, with good probability, the presence of a signal in a high level of noise.
    Type: Grant
    Filed: December 23, 2015
    Date of Patent: August 20, 2019
    Inventor: Yves Reza
  • Patent number: 10373608
    Abstract: A sound recognition system including time-dependent analog filtered feature extraction and sequencing. An analog front end (AFE) in the system receives input analog signals, such as signals representing an audio input to a microphone. Features in the input signal are extracted, by measuring such attributes as zero crossing events and total energy in filtered versions of the signal with different frequency characteristics at different times during the audio event. In one embodiment, a tunable analog filter is controlled to change its frequency characteristics at different times during the event. In another embodiment, multiple analog filters with different filter characteristics filter the input signal in parallel, and signal features are extracted from each filtered signal; a multiplexer selects the desired features at different times during the event.
    Type: Grant
    Filed: October 22, 2015
    Date of Patent: August 6, 2019
    Assignee: TEXAS INSTRUMENTS INCORPORATED
    Inventors: Zhenyong Zhang, Wei Ma
  • Patent number: 10332535
    Abstract: An audio encoder for encoding an audio signal has: a first encoding processor for encoding a first audio signal portion in a frequency domain, having: a time frequency converter for converting the first audio signal portion into a frequency domain representation; an analyzer for analyzing the frequency domain representation to determine first spectral portions to be encoded with a first spectral resolution and second regions to be encoded with a second resolution; and a spectral encoder for encoding the first spectral portions with the first spectral resolution and encoding the second portions with the second resolution; a second encoding processor for encoding a second different audio signal portion in the time domain; a controller for analyzing and determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal havi
    Type: Grant
    Filed: January 24, 2017
    Date of Patent: June 25, 2019
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Martin Dietz, Markus Multrus, Guillaume Fuchs, Emmanuel Ravelli, Matthias Neusinger, Markus Schnell, Benjamin Schubert, Bernhard Grill
  • Patent number: 10311883
    Abstract: A transient detector (100) analyzes (110) a given frame n of the input audio signal to determine, based on audio signal characteristics of the given frame n, a transient hangover indicator for a following frame n+1, and signals (120) the determined transient hangover indicator to an associated audio encoder (10) to enable proper encoding of the following frame n+1.
    Type: Grant
    Filed: October 18, 2016
    Date of Patent: June 4, 2019
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Anisse Taleb, Gustaf Ullberg
  • Patent number: 10269365
    Abstract: An audio decoder provides a decoded audio information on the basis of an encoded audio information including linear prediction coefficients (LPC) and includes a tilt adjuster to adjust a tilt of a noise using linear prediction coefficients of a current frame to acquire a tilt information and a noise inserter configured to add the noise to the current frame in dependence on the tilt information. Another audio decoder includes a noise level estimator to estimate a noise level for a current frame using a linear prediction coefficient of at least one previous frame to acquire a noise level information; and a noise inserter to add a noise to the current frame in dependence on the noise level information provided by the noise level estimator. Thus, side information about a background noise in the bit-stream may be omitted. Methods and computer programs serve a similar purpose.
    Type: Grant
    Filed: July 28, 2015
    Date of Patent: April 23, 2019
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Guillaume Fuchs, Christian Helmrich, Manuel Jander, Benjamin Schubert, Yoshikazu Yokotani
  • Patent number: 10248379
    Abstract: A process at an electronic computing device (device) for automatic and selective context-based gating of an electronic digital assistant speech-output function includes detecting, via a sensor communicatively coupled to the device or via a notification received at the device, an event associated with a need to reduce a cognitive load on the user. Responsively, temporarily gating a portion of an electronic digital assistant speech-output function to prevent reproduction to the user of one or more first speech-output events generated during a first period of time associated with the event. Subsequently, and after detecting that the event has ended, ending the temporary gating to allow reproduction of one or more second speech-output events generated during a second period of time after the event has ended.
    Type: Grant
    Filed: July 27, 2017
    Date of Patent: April 2, 2019
    Assignee: MOTOROLA SOLUTIONS, INC.
    Inventor: Paul D. Steinberg
  • Patent number: 10249288
    Abstract: An approach is provided that assists visually impaired users. The approach analyzes a document that is being utilized by the visually impaired user. The analysis derives a sensitivity of the document. A vocal characteristic corresponding to the derived sensitivity is retrieved. Text from the document is audibly read to the visually impaired user with a text to speech process that utilizes the retrieved vocal characteristic. The retrieved vocal characteristic conveys the derived sensitivity of the document to the visually impaired user.
    Type: Grant
    Filed: June 16, 2018
    Date of Patent: April 2, 2019
    Assignee: International Business Machines Corporation
    Inventors: Maureen E. Kraft, Fang Lu, Azadeh Salehi, Weisong Wang
  • Patent number: 10242696
    Abstract: In accordance with embodiments of the present disclosure, an integrated circuit for implementing at least a portion of an audio device may include an audio output configured to reproduce audio information by generating an audio output signal for communication to at least one transducer of the audio device, a microphone input configured to receive an input signal indicative of ambient sound external to the audio device and a processor configured to implement an impulsive noise detector. The impulsive noise detector may include a sudden onset detector for predicting an occurrence of a signal burst event of the input signal and an impulsive detector for determining whether the signal burst event comprises a speech event or a noise event.
    Type: Grant
    Filed: October 11, 2016
    Date of Patent: March 26, 2019
    Assignee: Cirrus Logic, Inc.
    Inventor: Samuel Pon Varma Ebenezer
  • Patent number: 10224049
    Abstract: An encoding apparatus is an encoding apparatus for encoding a time-series signal for each of predetermined time sections in a frequency domain, wherein a parameter η is a positive number, the parameter η corresponding to a time-series signal is a shape parameter of generalized Gaussian distribution that approximates a histogram of a whitened spectral sequence, which is a sequence obtained by dividing a frequency domain sample sequence corresponding to the time-series signal by a spectral envelope estimated by regarding the η-th power of absolute values of the frequency domain sample sequence as a power spectrum, and any of a plurality of parameters η is selective or the parameter η is variable for each of the predetermined time sections; and the encoding apparatus comprises an encoding portion encoding the time-series signal for each of the predetermined time sections by an encoding process with a configuration identified at least based on the parameter η for each of the predetermined time sections.
    Type: Grant
    Filed: January 27, 2016
    Date of Patent: March 5, 2019
    Assignees: NIPPON TELEGRAPH AND TELEPHONE CORPORATION, The University of Tokyo
    Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada, Takahito Kawanishi, Hirokazu Kameoka, Ryosuke Sugiura
  • Patent number: 10210886
    Abstract: A voice starting end detection apparatus includes a first detector that detects a starting end of a voice segment from input signals that are input in a time series, a first transmitting unit that transmits, when the starting end is detected, input signals subsequent to the starting end, and a first receiving unit that receives a terminal end detection signal indicating that a terminal end of the voice segment has been detected. The voice terminal end detection apparatus includes a second receiving unit that receives input signals subsequent to the starting end, a second detector that detects the terminal end from the received input signals, a second transmitting unit that transmits, when the terminal end is detected, the terminal end detection signal. The first transmitting unit stops transmitting the input signals when the first receiving unit receives the terminal end detection signal.
    Type: Grant
    Filed: December 27, 2016
    Date of Patent: February 19, 2019
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Shoko Miyamori, Mitsuyoshi Tachimori, Kouji Ueno
  • Patent number: 10134373
    Abstract: Apparatus, methods, and systems that operate to perform machine-control of a device based on machine-detected transitions are disclosed.
    Type: Grant
    Filed: September 9, 2015
    Date of Patent: November 20, 2018
    Assignee: GRACENOTE, INC.
    Inventors: Michael Jeffrey, Markus K. Cremer, Dong-In Lee
  • Patent number: 10102862
    Abstract: Embodiments of the present disclosure provide a decoding method and a decoding apparatus. The decoding method includes: in a case in which it is determined that a current frame is a lost frame, synthesizing a high frequency band signal; determining subframe gains of multiple subframes of the current frame; determining a global gain of the current frame; and adjusting, according to the global gain and the subframe gains of the multiple subframes, the synthesized high frequency band signal to obtain a high frequency band signal of the current frame. A subframe gain of the current frame is obtained according to a gradient between subframe gains of subframes previous to the current frame, so that transition before and after frame loss is more continuous, thereby reducing noise during signal reconstruction, and improving speech quality.
    Type: Grant
    Filed: December 31, 2015
    Date of Patent: October 16, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Bin Wang, Lei Miao, Zexin Liu
  • Patent number: 10091583
    Abstract: A home audio system that includes an audio receiver and one or more loudspeaker arrays is described. The audio receiver measures the acoustic properties of the room in which the loudspeaker arrays reside and the audio characteristics of the sound program content to be played through the loudspeaker arrays. Based on these measurements, the audio receiver assigns a directivity ratio and potentially various beam patterns to one or more segments of the sound program content. The assigned directivity ratio is used by the receiver to play the segment of the sound program content through the loudspeaker arrays. Other embodiments are also described.
    Type: Grant
    Filed: March 6, 2014
    Date of Patent: October 2, 2018
    Assignee: Apple Inc.
    Inventor: Tomlinson M. Holman
  • Patent number: 10044521
    Abstract: The present disclosure provides for devices, systems, and methods which optimize throughput of bonded connections over multiple variable bandwidth logical paths by adjusting a tunnel bandwidth weighting schema during a data transfer session in response to a change in bandwidth capabilities of one or more tunnels. By making such adjustments, embodiments of the present invention are able to optimize the bandwidth potential of multiple connections being used in a session, while minimizing the adverse consequences of reduced bandwidth issues which may occur during the data transfer session.
    Type: Grant
    Filed: February 27, 2017
    Date of Patent: August 7, 2018
    Assignee: PISMO LABS TECHNOLOGY LIMITED
    Inventors: Patrick Ho Wai Sung, Kam Chiu Ng, Ho Ming Chan
  • Patent number: 10037766
    Abstract: An apparatus for generating a bandwidth extended signal includes an anti-sparseness processing unit to perform anti-sparseness processing on a low-frequency spectrum; and a frequency domain high-frequency extension decoding unit to perform high-frequency extension encoding in the frequency domain on the low-frequency spectrum on which the anti-sparseness processing is performed.
    Type: Grant
    Filed: August 14, 2017
    Date of Patent: July 31, 2018
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Ki-hyun Choo
  • Patent number: 9978398
    Abstract: Disclosed are a voice activity detection method and device, wherein the method includes: obtaining a combined VAD judgment result according to the number of continuous active frames, an average total SNR of all sub-bands, a tonal signal flag, and at least two existing VAD judgment results (107). The method and device are suitable for voice service, solve the problem of inaccurate VAD detection and realize highly accurate VAD judgment.
    Type: Grant
    Filed: May 16, 2014
    Date of Patent: May 22, 2018
    Assignee: ZTE Corporation
    Inventors: Changbao Zhu, Hao Yuan
  • Patent number: 9886963
    Abstract: A device includes a first classifier and a second classifier coupled to the first classifier. The first classifier is configured to output first decision data that indicates a classification of an audio frame as a speech frame or a non-speech frame, the first decision data determined based on first probability data associated with a first likelihood of the audio frame being the speech frame and based on second probability data associated with a second likelihood of the audio frame being the non-speech frame. The second classifier is configured to output second decision data based on the first probability data, the second probability data, and the first decision data, the second decision data including an indication of a selection of a particular encoder of multiple encoders available to encode the audio frame.
    Type: Grant
    Filed: March 29, 2016
    Date of Patent: February 6, 2018
    Assignee: QUALCOMM Incorporated
    Inventors: Venkatraman S. Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam, Vivek Rajendran, Subasingha Shaminda Subasingha
  • Patent number: 9870779
    Abstract: According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
    Type: Grant
    Filed: July 16, 2015
    Date of Patent: January 16, 2018
    Assignee: KABUSHIKI KAISHA TOSHIBA
    Inventors: Kentaro Tachibana, Takehiko Kagoshima, Masatsune Tamura, Masahiro Morita
  • Patent number: 9852733
    Abstract: A device control method includes acquiring voice information, obtaining a spoken command indicating a control instruction as to a device based on the acquired voice information, identifying speaker information relating to a speaker which has uttered the acquired voice information, based on the acquired voice information, identifying, out of a plurality of devices, a device to be controlled, based on the spoken command and the speaker information, and controlling the identified device to be controlled.
    Type: Grant
    Filed: July 1, 2015
    Date of Patent: December 26, 2017
    Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA
    Inventor: Mariko Yamada
  • Patent number: 9842603
    Abstract: The present technology relates to an encoding device and an encoding method, a decoding device and a decoding method, and a program, configured to obtain a high quality audio with less encoding amount. A number-of-sections determining feature amount calculating circuit calculates a number-of-sections determining feature amount for determining the number of divisions to divide a process target section into continuous frame sections each including a frame for which the same estimation coefficient is selected, based on sub-band signals of a plurality of sub-bands constituting an input signal.
    Type: Grant
    Filed: August 14, 2012
    Date of Patent: December 12, 2017
    Assignee: Sony Corporation
    Inventors: Yuki Yamamoto, Toru Chinen
  • Patent number: 9818411
    Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and an audio signal, and may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristics signal; an audio signal encoder to encode the input signal using an audio encoding module when the input signal is an audio characteristic signal; and a bitstream generator to generate a bitstream.
    Type: Grant
    Filed: November 6, 2014
    Date of Patent: November 14, 2017
    Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
    Inventors: Tae Jin Lee, Seung Kwon Baek, Min Je Kim, Dae Young Jang, Jeongil Seo, Kyeongok Kang, Jin Woo Hong, Hochong Park, Young Cheol Park
  • Patent number: 9812168
    Abstract: The present disclosure provides an electronic device and method for playing image data. The method for playing back image data in an electronic device includes storing audiovisual (A/V) data for a predetermined period of time in a memory of the electronic device. The electronic device plays back the A/V data, wherein playing back the A/V data comprises dynamically analyzing an audio signal of the A/V data to select one of a plurality of sound effects based on the analyzed audio signal. The A/V data is played back by applying the selected sound effect to at least a part of the A/V signal.
    Type: Grant
    Filed: February 16, 2016
    Date of Patent: November 7, 2017
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Doo-Hyun Kim, Jae-Hyun Kim, Byeong-Jun Kim, Sang-Soo Park, Jun-Soo Lee, Ho-Chul Hwang
  • Patent number: 9805732
    Abstract: Embodiments of the present application propose a frequency envelope vector quantization method and apparatus, where the method includes: dividing N frequency envelopes in one frame into N1 vectors; quantizing a first vector in the N1 vectors by using a first codebook, to obtain a code word corresponding to the quantized first vector, where the first codebook is divided into 2^B1 portions; determining, according to the code word corresponding to the quantized first vector, that the quantized first vector is associated with an ith portion of the 2^B1 portions of the first codebook; determining a second codebook according to the codebook of the ith portion; and quantizing a second vector in the N1 vectors based on the second codebook. In the embodiments of the present application, vector quantization can be performed on frequency envelope vectors by using a codebook with a smaller quantity of bits. Therefore, complexity of vector quantization can be reduced, and an effect of vector quantization can also be ensured.
    Type: Grant
    Filed: December 29, 2015
    Date of Patent: October 31, 2017
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Chen Hu, Lei Miao, Zexin Liu
  • Patent number: 9756185
    Abstract: A system and method for automated call analysis using context specific lexicons. A system includes memory and a processor configured to execute instructions. The system includes a recording component, a lexicon component, an analysis component, and a display component. The lexicon component defines a plurality of context specific lexicons, with each context specific lexicon having elements associated with one of a plurality of unique conversation segments. The analysis component is configured to identify elements of the context specific lexicons, and associate each identified element with a time location in a telephonic conversation. The display component is configured to graphically present a multi-line graph such that the intersections of the lines indicate transitions between the unique conversation segments.
    Type: Grant
    Filed: December 28, 2016
    Date of Patent: September 5, 2017
    Assignee: TETON1, LLC
    Inventors: Robert T. Madden, Jr., Christopher P. Derikart, Edward A. Donnelly
  • Patent number: 9747920
    Abstract: An echo cancellation system that performs audio beamforming to separate audio input into multiple directions and determines a target signal and a reference signal from the multiple directions. For example, the system may detect a strong signal associated with a speaker and select the strong signal as a reference signal, selecting another direction as a target signal. The system may determine a speech position and may select the speech position as a target signal and an opposite direction as a reference signal. The system may create pairwise combinations of opposite directions, with an individual direction being selected as a target signal and a reference signal. The system may select a fixed beamformer output for the target signal and an adaptive beamformer output for the reference signal, or vice versa. The system may remove the reference signal (e.g., audio output by the loudspeaker) to isolate speech included in the target signal.
    Type: Grant
    Filed: December 17, 2015
    Date of Patent: August 29, 2017
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Robert Ayrapetian, Philip Ryan Hilmes
  • Patent number: 9736047
    Abstract: The present invention discloses methods and systems carried out at a network device for reducing network congestion by establishing an aggregated connection, wherein the aggregated connection comprises a plurality of virtual private network (VPN) tunnels. The system further comprises assigning default weights to the plurality of VPN tunnels along with transmitting and receiving data packets of a data session through the aggregated connection. When there is a missing data packet in the received data packets, the missing data packet is recreated based on a previous or a next data packet. Furthermore, the present invention discloses methods and systems for determining whether a VPN tunnel from the plurality of VPN tunnels is experiencing an unacceptable packet drop rate. If a VPN tunnel is experiencing an unacceptable packet drop rate, an effective weight of the at least one VPN tunnel is decreased in order to overcome the setback.
    Type: Grant
    Filed: December 23, 2016
    Date of Patent: August 15, 2017
    Assignee: PISMO LABS TECHNOLOGY LIMITED
    Inventors: Ho Ming Chan, Patrick Ho Wai Sung, Kam Chiu Ng, Alex Wing Hong Chan, Kit Wai Chau
  • Patent number: 9721585
    Abstract: A processing buffer unit stores an audio signal. A pitch calculation unit and a pitch cycle correction unit calculate a multiple of N as the number of samples in a pitch cycle of the audio signal, in which N is an integer equal to or more than 1. A processing control unit and a start-position movement amount correction unit sequentially determine, as a sample in a start position of a compression process in a time axis domain of the audio signal, a (multiple of N)-th sample from a start position immediately before the start position. An operation unit compresses samples in a predetermined number times the pitch cycle from the sample in the start position in a time axis domain, and sets the number of samples after the compression to be the multiple of N. The present technology, for example, may be applied to an audio signal processing apparatus.
    Type: Grant
    Filed: May 24, 2012
    Date of Patent: August 1, 2017
    Assignee: Sony Corporation
    Inventors: Akira Inoue, Akihiro Mukai
  • Patent number: 9704504
    Abstract: The voice analysis device includes: a voice information acquiring unit that acquires a voice signal generated by plural voice acquiring units disposed at different distances from a speaking section of a speaker and acquiring voice of the speaker; and an identification unit that identifies the speaker corresponding to the voice having been acquired, on the basis of intensities of respective peaks in a frequency spectrum of a first enhanced waveform and a frequency spectrum of a second enhanced waveform. The first enhanced waveform is a waveform where a voice signal of a predetermined target speaker has been enhanced, and the second enhanced waveform is a waveform where a voice signal of a speaker other than the target speaker has been enhanced.
    Type: Grant
    Filed: July 6, 2015
    Date of Patent: July 11, 2017
    Assignee: FUJI XEROX CO., LTD.
    Inventors: Seiya Inagi, Haruo Harada, Hirohito Yoneyama, Kei Shimotani, Akira Fujii, Kiyoshi Iida
  • Patent number: 9685156
    Abstract: A low-power voice command detection method uses an audio monitoring device to capture sound. The captured sound is analyzed in steps to determine if it fulfills a number of criteria regarding sound level, voice content and identifiable voice commands. For each step the processing is more complex and power demanding. A threshold between the first and subsequent steps is used to gate further processing. This threshold is dynamically adjusted, based on the outcome of the analysis, to avoid unnecessary processing and increase system performance.
    Type: Grant
    Filed: March 12, 2015
    Date of Patent: June 20, 2017
    Assignee: Sony Mobile Communications Inc.
    Inventors: Henrik Borjeson, Magnus Abrahamsson
  • Patent number: 9672813
    Abstract: Systems, devices and methods are provided for configuring matching rules related to voice input commands. For example, a first mapping relation between one or more first original terms in a preset term database and one or more first identification terms is established; the first mapping relation is stored in a first mapping relation table; one or more first voice input commands are configured for the first identification terms or one or more first statements including the first identification terms; and a second mapping relation between the first identification terms or the first statements and the first voice input commands is stored into a second mapping relation table.
    Type: Grant
    Filed: July 14, 2015
    Date of Patent: June 6, 2017
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventors: Pan Chu, Hao Xu, Fei Wang, Guiping Han, Ling Liao
  • Patent number: 9640189
    Abstract: An apparatus for generating a frequency enhancement signal has: a calculator for calculating a value describing an energy distribution with respect to frequency in a core signal; and a signal generator for generating an enhancement signal having an enhancement frequency range not included in the core signal, from the core signal, wherein the signal generator is configured for shaping the enhancement signal or the core signal so that a spectral envelope of the enhancement signal or of the core signal depends on the value describing the energy distribution with respect to frequency in the core signal.
    Type: Grant
    Filed: July 28, 2015
    Date of Patent: May 2, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Ralf Geiger, Christian Helmrich, Markus Multrus, Konstantin Schmidt
  • Patent number: 9633663
    Abstract: An audio encoding apparatus includes an encoder for encoding a time segment of an input audio signal to be encoded to obtain a corresponding encoded signal segment. The audio encoding apparatus further includes a decoder for decoding the encoded signal segment to obtain a re-decoded signal segment. A clipping detector is provided for analyzing the re-decoded signal segment with respect to at least one of an actual signal clipping or a perceptible signal clipping and for generating a corresponding clipping alert. The encoder is further configured to again encode the time segment of the audio signal with at least one modified encoding parameter resulting in a reduced clipping probability in response to the clipping alert.
    Type: Grant
    Filed: June 13, 2014
    Date of Patent: April 25, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Albert Heuberger, Bernd Edler, Nikolaus Rettelbach, Stefan Geyersberger, Johannes Hilpert
  • Patent number: 9589568
    Abstract: A method and a device for extending a signal band of a voice or audio signal are provided. The bandwidth extension method includes the steps of: performing a modified discrete cosine transform (MDCT) process on an input signal to generate a first transform signal; generating a second transform signal and a third transform signal on the basis of the first transform signal; generating normalized components and energy components of the first transform signal, the second transform signal, and the third transform signal therefrom; generating an extended normalized component from the normalized components and generating an extended energy component from the energy components; generating an extended transform signal on the basis of the extended normalized component and the extended energy component; and performing an inverse MDCT (IMDCT) process on the extended transform signal.
    Type: Grant
    Filed: February 8, 2012
    Date of Patent: March 7, 2017
    Assignees: LG Electronics Inc., Gwangju Institute Of Science and Technology
    Inventors: Gyu Hyeok Jeong, Young Han Lee, Hye Jeong Jeon, Hong Kook Kim, In Gyu Kang, Lag Young Kim
  • Patent number: 9570060
    Abstract: A music signal processing apparatus includes a frequency spectrum transform unit, a filter, a frequency feature amount generation unit, and a melody feature amount sequence acquisition unit. The frequency spectrum transform unit is configured to transform a music signal into a frequency spectrum, the music signal being a signal of a musical piece containing a part with a melody. The filter is configured to remove a steep peak of the frequency spectrum. The frequency feature amount generation unit is configured to generate, from a signal output from the filter, a frequency feature amount in which a fundamental frequency component of the part is emphasized. The melody feature amount sequence acquisition unit is configured to acquire, based on the frequency feature amount, a melody feature amount sequence that specifies a fundamental frequency of the part at each time.
    Type: Grant
    Filed: May 2, 2014
    Date of Patent: February 14, 2017
    Assignee: Sony Corporation
    Inventor: Emiru Tsunoo
  • Patent number: 9558755
    Abstract: Noise suppression information is used to optimize or improve automatic speech recognition performed for a signal. Noise suppression can be performed on a noisy speech signal using a gain value. The gain to apply to the noisy speech signal is selected to optimize speech recognition analysis of the resulting signal. The gain may be selected based on one or more features for a current sub band and time frame, as well as one or more features for other sub bands and/or time frames. Noise suppression information can be provided to a speech recognition module to improve the robustness of the speech recognition analysis. Noise suppression information can also be used to encode and identify speech.
    Type: Grant
    Filed: December 7, 2010
    Date of Patent: January 31, 2017
    Assignee: Knowles Electronics, LLC
    Inventors: Jean Laroche, Carlo Murgia
  • Patent number: 9531508
    Abstract: The present invention discloses methods and systems for recreating missing data packets of a data session established between a first communications router and a second communications router through an aggregated connection. The first communications router receives data packets belonging to a data session from the second communications router through the aggregated connection. The data packets are destined to a first host reachable through the first communications router. The first communications router transmits the data packets to the first host and determines whether one or more data packets are missing. If so, the first communications router determines global sequence number(s) (GSN) and per tunnel sequence numbers (PTSN) of the one or more missing data packets and recreates payload(s) of the one or more missing data packet(s). One or more new data packets are then transmitted to the first host.
    Type: Grant
    Filed: April 24, 2015
    Date of Patent: December 27, 2016
    Assignee: PISMO LABS TECHNOLOGY LIMITED
    Inventors: Ho Ming Chan, Patrick Ho Wai Sung, Kam Chiu Ng, Alex Wing Hong Chan, Kit Wai Chau
  • Patent number: 9484044
    Abstract: Voice enhancement and/or speech features extraction may be performed on noisy audio signals using successively refined transforms. Downsampled versions of an input signal may be obtained, which include a first downsampled signal with a lower sampling rate than a second downsampled signal. Successive transforms may be performed on the input signal to obtain a corresponding sound model of the input signal. The successive transforms performed may include: (1) performing a first transform on the first downsampled signal to yield a first pitch estimate; (2) performing a second transform on the second downsampled signal to yield a second pitch estimate and a first harmonics estimate based on the first pitch estimate; and (3) performing a third transform on the input signal to yield a third pitch estimate and a second harmonics estimate based on the second pitch estimate and the first harmonics estimate.
    Type: Grant
    Filed: July 17, 2013
    Date of Patent: November 1, 2016
    Assignee: KnuEdge Incorporated
    Inventors: Massimo Mascaro, David C. Bradley
  • Patent number: 9449616
    Abstract: Provided are a noise reduction system that highly precisely estimates noise contained in an input signal and highly precisely reduces the noise contained in the input signal using the estimated noise, a speech detection system, a speech recognition system, a noise reduction method, and a noise reduction program.
    Type: Grant
    Filed: December 25, 2013
    Date of Patent: September 20, 2016
    Assignee: NEC CORPORATION
    Inventors: Masanori Tsujikawa, Ken Hanazawa, Shuji Komeiji
  • Patent number: 9431023
    Abstract: The present technology provides a robust noise suppression system that may concurrently reduce noise and echo components in an acoustic signal while limiting the level of speech distortion. A time-domain acoustic signal may be received and be transformed to frequency-domain sub-band signals. Features, such as pitch, may be identified and tracked within the sub-band signals. Initial speech and noise models may then be estimated at least in part from a probability analysis based on the tracked pitch sources. Speech and noise models may be resolved from the initial speech and noise models and noise reduction may be performed on the sub-band signals. An acoustic signal may be reconstructed from the noise-reduced sub-band signals.
    Type: Grant
    Filed: April 9, 2013
    Date of Patent: August 30, 2016
    Assignee: Knowles Electronics, LLC
    Inventors: Carlos Avendano, Jean Laroche, Michael M. Goodwin, Ludger Solbach
  • Patent number: 9424831
    Abstract: A voice synthesizing apparatus includes a manipulation determiner configured to determine a manipulation position which is moved according to a manipulation of a user, and a voice synthesizer configured to generate, in response to an instruction to generate a voice in which a second phoneme follows a first phoneme, a voice signal so that vocalization of the first phoneme starts before the manipulation position reaches a reference position and that vocalization from the first phoneme to the second phoneme is made when the manipulation position reaches the reference position.
    Type: Grant
    Filed: February 20, 2014
    Date of Patent: August 23, 2016
    Assignee: Yamaha Corporation
    Inventor: Yuji Hisaminato
  • Patent number: 9412396
    Abstract: A Voice Activity Detection/Silence Suppression (VAD/SS) system is connected to a channel of a transmission pipe. The channel provides a pathway for the transmission of energy. A method for operating a VAD/SS system includes detecting the energy on the channel, and activating or suppressing activation of the VAD/SS system depending upon the nature of the energy detected on the channel.
    Type: Grant
    Filed: November 25, 2015
    Date of Patent: August 9, 2016
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Bing Chen, James H. James
  • Patent number: 9318122
    Abstract: A signal processing apparatus generates a window signal, transforms the window signal into a frequency spectrum, and adjusts an amplitude component of the frequency spectrum. Then, the signal processing apparatus applies inverse transform to the amplitude component after adjustment and to a phase component of the frequency spectrum to generate a frame signal, and identifies an overlap segment such that the absolute value of the amplitude of the frame signal at at least one end of the overlap segment becomes smaller than the absolute value of the amplitude of the frame signal at a corresponding end of an overlapping section. Then, in the identified segment, the signal processing apparatus adds and compounds the frame signal corresponding to an immediately preceding frame and the frame signal corresponding to a processing-target frame.
    Type: Grant
    Filed: March 24, 2015
    Date of Patent: April 19, 2016
    Assignee: FUJITSU LIMITED
    Inventor: Takeshi Otani
  • Patent number: 9224405
    Abstract: A Voice Activity Detection/Silence Suppression (VAD/SS) system is connected to a channel of a transmission pipe. The channel provides a pathway for the transmission of energy. A method for operating a VAD/SS system includes detecting the energy on the channel, and activating or suppressing activation of the VAD/SS system depending upon the nature of the energy detected on the channel.
    Type: Grant
    Filed: March 13, 2015
    Date of Patent: December 29, 2015
    Assignee: AT&T INTELLECTUAL PROPERTY II, L.P.
    Inventors: Bing Chen, James H. James
  • Patent number: 9203943
    Abstract: A method for detecting time division duplex (TDD) noise of a communication device is provided. The communication device includes a radio frequency (RF) unit, a voice generation unit, a voice signal acquisition unit, an audio processing unit, and a processor. The RF unit emits a plurality of radio signals. The voice generation unit generates voice signals and outputs the voice signals. The voice signal acquisition unit acquires the voice signals. The audio processing unit converts the acquired voice signals into digital audio signals, and analyzes the digital audio signals to detect whether the digital audio signals contain TDD noise. The processor generates a result indicating whether the digital audio signals contain the TDD noise according to the analysis of the digital audio signals, and displays the result on a display of the communication device.
    Type: Grant
    Filed: October 16, 2012
    Date of Patent: December 1, 2015
    Assignees: Fu Tai Hua Industry (Shenzhen) Co., Ltd., HON HAI PRECISION INDUSTRY CO., LTD.
    Inventor: Yang Xin
  • Patent number: 9171548
    Abstract: A system for confirming that a subject is the source of spoken audio and the identity of the subject providing the spoken audio is described. The system includes at least one motion sensor operable to capture physical motion of at least one articulator that contributes to the production of speech, at least one acoustic signal sensor to receive acoustic signals, and a processing device comprising a memory and communicatively coupled to the at least one motion sensor and the at least one acoustic signal sensor. The processing device is programmed to correlate physical motion data with acoustical signal data to uniquely characterize the subject for purposes of verifying the subject is the source of the acoustical signal data and the identity of the subject.
    Type: Grant
    Filed: August 19, 2011
    Date of Patent: October 27, 2015
    Assignee: The Boeing Company
    Inventors: George A. Velius, David A. Whelan
  • Patent number: 9141601
    Abstract: According to one embodiment, a learning device includes an input receiving unit, an inferring unit, and a learning unit. The input receiving unit receives an input of first data representing a sentence, rear boundaries of elements having an anaphoric relation within the sentence, and a correspondence relation between a first element that is an antecedent and a second element that is an anaphor. The inferring unit infers a range of the first element and a range of the second element that are determined by front boundaries and the rear boundaries by inferring the front boundaries of the first element and the second element based on the first data and a predetermined rule. The learning unit learns criteria used for determining whether or not there is the anaphoric relation in an arbitrary sentence based on the range of the first element and the range of the second element.
    Type: Grant
    Filed: November 23, 2011
    Date of Patent: September 22, 2015
    Assignees: Kabushiki Kaisha Toshiba, Toshiba Solutions Corporation
    Inventor: Shinichiro Hamada
  • Patent number: 9142218
    Abstract: Disclosed herein are systems, methods, and tangible computer-readable media for detecting synthetic speaker verification. The method comprises receiving a plurality of speech samples of the same word or phrase for verification, comparing each of the plurality of speech samples to each other, denying verification if the plurality of speech samples demonstrates little variance over time or the samples are the same, and verifying the plurality of speech samples if the plurality of speech samples demonstrates sufficient variance over time. One embodiment further adds that each of the plurality of speech samples is collected at different times or in different contexts. In other embodiments, variance is based on a pre-determined threshold or the threshold for variance is adjusted based on a need for authentication certainty. In another embodiment, if the initial comparison is inconclusive, additional speech samples are received.
    Type: Grant
    Filed: August 7, 2014
    Date of Patent: September 22, 2015
    Assignee: AT&T Intellectual Property I, L.P.
    Inventor: Horst J. Schroeter
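The variance check described in the abstract of patent 9142218 can be illustrated with a small sketch. It is an assumption-laden illustration, not the patented method: the feature vectors (one per speech sample), the use of mean pairwise Euclidean distance as the measure of variance, and the `min_variation` threshold are all hypothetical.

```python
from itertools import combinations
from math import dist
from typing import List, Sequence


def verify_samples(features: List[Sequence[float]], min_variation: float) -> bool:
    """Return True if the samples vary enough to look like live speech.

    features: one equal-length feature vector per speech sample (hypothetical
    representation; the abstract does not say how samples are compared).
    min_variation: the pre-determined threshold; raise it when more
    authentication certainty is needed.
    """
    if len(features) < 2:
        raise ValueError("need at least two samples to compare")
    # Compare each sample with every other sample.
    distances = [dist(a, b) for a, b in combinations(features, 2)]
    mean_distance = sum(distances) / len(distances)
    # Identical or nearly identical repetitions suggest synthetic or replayed speech.
    return mean_distance >= min_variation


replayed = [[1.0, 2.0], [1.0, 2.0], [1.0, 2.0]]
live = [[1.0, 2.0], [1.3, 2.4], [0.8, 1.7]]
print(verify_samples(replayed, 0.1), verify_samples(live, 0.1))
```

This covers only the "too little variance" test. A complete verifier would also have to confirm that the samples match the claimed speaker, and could request additional samples when the comparison is inconclusive, as the abstract notes.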
  • Patent number: 9104250
    Abstract: A real-time quantifying system for use in quantifying input device user behavior is introduced. The real-time quantifying system performs real-time quantification on user behavior of at least an input device operating and controlling a computer apparatus. The real-time quantifying system includes an input device and a portable electronic device connected thereto by wireless connection. The input device includes a sensing module and a wireless communication module. Various data detected while a user operates the computer apparatus are sent by the input device to the portable electronic device in real time. Then, the data are processed by the portable electronic device to generate graphical data for display. Accordingly, the real-time quantifying system displays the user's control status in real time and assesses the user's attributes and user behavior according to the graphical data.
    Type: Grant
    Filed: July 11, 2013
    Date of Patent: August 11, 2015
    Assignee: THERMALTAKE TECHNOLOGY CO., LTD.
    Inventor: Pei-Hsi Lin