Time Patents (Class 704/211)

Pulse code modulation (pcm) (Class 704/212)

Zero crossing (Class 704/213)

Voiced or unvoiced (Class 704/214)

Silence decision (Class 704/215)

Correlation function (Class 704/216)

Fast playback in media files with reduced impact to speech quality

Patent number: 10629223

Abstract: The present invention is a system and method for increasing the playback speed of audio waves. The system analyzes an audio wave to detect a first silent section that has a length greater than a minimum short pause length required to distinguish between words. The system then calculates a new playback speed of the first silent section so that the total playback time for the first silent section is less than or equal to the minimum short pause length and controls an audio playback device to play the audio wave in a manner so that the first silent section is played back at the new playback speed. In another embodiment, the system analyzes spoken words, phonemes by phonemes, and increases the spoken word playback speed by dynamically reducing the length of each phoneme and inter-syllable silent pauses. Thus, the system functions equally well for all languages and accents.

Type: Grant

Filed: May 31, 2017

Date of Patent: April 21, 2020

Assignee: International Business Machines Corporation

Inventor: Deepa Jain
Method and apparatus to determine and use audience affinity and aptitude

Patent number: 10614829

Abstract: One embodiment is a method of presenting an audio or audio-visual work which includes: (a) detecting media work content properties in an audio portion of the audio or audio-visual work using a media work content properties detection apparatus; (b) associating a presentation rate of the audio of the audio portion of the audio or audio-visual work with the detected media work content properties; and (c) presenting the portion of the audio or audio-visual work using the media work content properties detection apparatus so that the audio is presented at the presentation rate; wherein the media work content properties comprise one or more indicia of words of interest; and wherein the audio or audio-visual work includes conversations.

Type: Grant

Filed: October 7, 2015

Date of Patent: April 7, 2020

Assignee: Virentem Ventures, LLC

Inventor: Donald J. Hejna, Jr.
Analog voice activity detection

Patent number: 10535365

Abstract: According to some embodiments, an analog processing portion may receive an audio signal from a microphone. The analog processing portion may then convert the audio signal into sub-band signals and estimate an energy statistic value, such as a Signal-to-Noise Ratio (“SNR”) value, for each sub-band signal. A classification element may classify the estimated energy statistic values with analog processing such that a wakeup signal is generated when voice activity is detected. The wakeup signal may be associated with, for example, a battery-powered, always-listening audio application.

Type: Grant

Filed: August 27, 2018

Date of Patent: January 14, 2020

Inventors: Brandon David Rumberg, David W. Graham
Waveform data structure, waveform data storage device, waveform data storing method, waveform data extracting device, waveform data extracting method and electronic musical instrument

Patent number: 10515618

Abstract: A waveform data structure includes a plurality of types of frames having different data sizes. Each of the plurality of types of frames includes an auxiliary information area and a data area. The auxiliary information area includes an area for storing common effective-bit length data for a section of waveform samples, and an area for storing an identifier for identifying one of the plurality of types of frames. The data area is an area for storing extracted waveform samples which are extracted from the waveform samples based on the common effective-bit length. The number of the extracted waveform samples is determined based on the common effective-bit length.

Type: Grant

Filed: January 3, 2019

Date of Patent: December 24, 2019

Assignee: CASIO COMPUTER CO., LTD.

Inventor: Goro Sakata
Selecting a packet loss concealment procedure

Patent number: 10476769

Abstract: In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus thereof for selecting a packet loss concealment procedure for a lost audio frame of a received audio signal. A method for selecting a packet loss concealment procedure comprises detecting an audio type of a received audio frame and determining a packet loss concealment procedure based on the audio type. In the method, detecting an audio type comprises determining a stability of a spectral envelope of signals of received audio frames.

Type: Grant

Filed: September 12, 2018

Date of Patent: November 12, 2019

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventor: Stefan Bruhn
Post-quantization gain correction in audio coding

Patent number: 10460739

Abstract: A gain adjustment apparatus for use in decoding of audio that has been encoded with separate gain and shape representations includes an accuracy meter configured to estimate an accuracy measure of the shape representation, and to determine a gain correction based on the estimated accuracy measure. An envelope adjuster further included in the apparatus is configured to adjust the gain representation based on the determined gain correction.

Type: Grant

Filed: August 4, 2017

Date of Patent: October 29, 2019

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Erik Norvell, Volodya Grancharov
Digital filterbank for spectral envelope adjustment

Patent number: 10460742

Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.

Type: Grant

Filed: January 22, 2018

Date of Patent: October 29, 2019

Assignee: Dolby International AB

Inventor: Per Ekstrand
Interpreting assistant system

Patent number: 10453459

Abstract: An interpreting assistant system which provides to a user captions of auditory communications in the user's vicinity. The interpreting assistant system includes a smart microphone transmitter that defines an input device which converts auditory communications into audio signals and transmit the signals a translation device, with a smart phone defining the translation device which generates a text transcript from the audio signals and send the transcript file to a display device, with the display device being defined by a wearable display interface which displays the transcript for a user to see. When in use, the interpreting assistant system provides for the display of a real time transcription and display of auditory communications such as spoken words for a user that may have hearing difficulties.

Type: Grant

Filed: January 31, 2018

Date of Patent: October 22, 2019

Inventor: Saida Ashley Florexil
Reduced complexity transform for a low-frequency-effects channel

Patent number: 10410644

Abstract: The computational resources that are needed to apply a transform-based filterbank to a limited-bandwidth audio signals are reduced by performing an integrated process of combining real-valued input data into complex-valued data and applying a short transform to the complex-valued data, applying a bank of very short transforms to the output of the integrated process, and deriving a sequence of real-valued output data from the outputs of the bank of very short transforms.

Type: Grant

Filed: March 19, 2012

Date of Patent: September 10, 2019

Assignee: Dolby Laboratories Licensing Corporation

Inventor: Matthew C. Fellers
Exploiting visual information for enhancing audio signals via source separation and beamforming

Patent number: 10402651

Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

Type: Grant

Filed: February 26, 2018

Date of Patent: September 3, 2019

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Dimitrios Dimitriadis, Donald J. Bowen, Lusheng Ji, Horst J. Schroeter
Transliteration of text entry across scripts

Patent number: 10402489

Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Grant

Filed: December 21, 2016

Date of Patent: September 3, 2019

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
Compliance mechanism for messaging

Patent number: 10305831

Abstract: This disclosure describes systems, methods, and apparatus for precluding transmission of messages that breach one or more compliance or legal framework. In particular, messages, whether digital, written, or verbal, can be stopped from leaving a device on which they are created, thereby preventing non-compliant messages from reaching intermediary servers that could constitute a compliance violation even if the message never reached a recipient.

Type: Grant

Filed: December 16, 2014

Date of Patent: May 28, 2019

Assignee: FairWords, Inc.

Inventors: Anish Parikh, Evan M. Caron, Vadim Polosatov
Systems and methods of providing modified media content

Patent number: 10291966

Abstract: A method includes receiving, at a content server from a media device, a request for media content at a first playback rate. The media content is available to the content server at a second playback rate that is different from the first playback rate. The method includes generating modified media content by modifying a first portion of the media content to have a second format corresponding to a third media playback rate. The first portion having a first media characteristic. The third playback rate is different than the first playback rate and is different than the second playback rate. The third playback rate is selected such that the modified media content has a third format corresponding to the first playback rate. The method further includes sending the modified media content from the content server to a media device.

Type: Grant

Filed: March 23, 2017

Date of Patent: May 14, 2019

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Andrej Ljolje, Ann Syrdal, Alistair Conkie
Automatic conversion of speech into song, rap or other audible expression having target meter or rhythm

Patent number: 10290307

Abstract: Captured vocals may be automatically transformed using advanced digital signal processing techniques that provide captivating applications, and even purpose-built devices, in which mere novice user-musicians may generate, audibly render and share musical performances. In some cases, the automated transformations allow spoken vocals to be segmented, arranged, temporally aligned with a target rhythm, meter or accompanying backing tracks and pitch corrected in accord with a score or note sequence. Speech-to-song music applications are one such example. In some cases, spoken vocals may be transformed in accord with musical genres such as rap using automated segmentation and temporal alignment techniques, often without pitch correction. Such applications, which may employ different signal processing and different automated transformations, may nonetheless be understood as speech-to-rap variations on the theme.

Type: Grant

Filed: May 26, 2017

Date of Patent: May 14, 2019

Assignee: SMULE, INC.

Inventors: Parag Chordia, Mark Godfrey, Alexander Rae, Prerna Gupta, Perry R. Cook
Hearing device adjustment based on categorical perception

Patent number: 10129671

Abstract: Hearing device configuration and hearing treatment using categorical perception; systems and methods for categorical perception based configuration of hearing devices and hearing treatment.

Type: Grant

Filed: February 24, 2014

Date of Patent: November 13, 2018

Assignee: Securboration, Inc.

Inventors: Lee Krause, Rahul Shrivastav
Audio encoding device, method and program, and audio decoding device, method and program

Patent number: 10115402

Abstract: An audio packet error concealment system includes an encoding unit for encoding an audio signal consisting of a plurality of frames, and an auxiliary information encoding unit for estimating and encoding auxiliary information about a temporal change of power of the audio signal. The auxiliary information is used in packet loss concealment in decoding of the audio signal. The auxiliary information about the temporal change of power may contain a parameter that functionally approximates a plurality of powers of subframes shorter than one frame, or may contain information about a vector obtained by vector quantization of a plurality of powers of subframes shorter than one frame.

Type: Grant

Filed: October 20, 2016

Date of Patent: October 30, 2018

Assignee: NTT DOCOMO, INC.

Inventors: Kimitaka Tsutsumi, Kei Kikuiri
Audio classifier that includes analog signal voice activity detection and digital signal voice activity detection

Patent number: 10115399

Abstract: The disclosure relates to an audio classifier comprising: a first processor having hard-wired logic configured to receive an audio signal and detect audio activity from the audio signal; and a second processor having reconfigurable logic configured to classify the audio signal as a type of audio signal in response to the first processor detecting audio activity.

Type: Grant

Filed: July 20, 2016

Date of Patent: October 30, 2018

Assignee: NXP B.V.

Inventors: Ludovick Dominique Joel Lepauloux, Laurent Le Faucheur
Selecting a packet loss concealment procedure

Patent number: 10103958

Abstract: In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus thereof for selecting a packet loss concealment procedure for a lost audio frame of a received audio signal. A method for selecting a packet loss concealment procedure comprises detecting an audio type of a received audio frame and determining a packet loss concealment procedure based on the audio type. In the method, detecting an audio type comprises determining a stability of a spectral envelope of signals of received audio frames.

Type: Grant

Filed: June 21, 2017

Date of Patent: October 16, 2018

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventor: Stefan Bruhn
Data processor

Patent number: 10098146

Abstract: A processor is disclosed. The processor includes a first-receiver-node for receiving a first-receiver-signal, a second-receiver-node for receiving a second-receiver-signal, a first-output-node for coupling to a digital-baseband-processor, a second-output-node for coupling to the digital-baseband-processor and a first-active-data-pipe extending between the first-receiver-node and the first-output-node. The first-active-data-pipe includes a first-analog-to-digital-converter comprising a first-ADC-input coupled to the first-receiver-node and a first-ADC-output coupled to the first-output-node. The first-analog-to-digital-converter is configured to provide a first-digital-signal to the first-output-node. The processor comprises a first-reference-node and a configurable-data-pipe extending between the second-receiver-node and the second-output-node.

Type: Grant

Filed: November 18, 2016

Date of Patent: October 9, 2018

Assignee: NXP B.V.

Inventors: Jan Niehof, Shagun Bajoria, Muhammed Bolatkale, Robert Rutten, Lucien Johannes Breems, Johannes Hubertus Antonius Brekelmans
Apparatus and method for processing an input audio signal using cascaded filterbanks

Patent number: 10032458

Abstract: An apparatus for processing an input audio signal relies on a cascade of filterbanks, the cascade having a synthesis filterbank for synthesizing an audio intermediate signal from the input audio signal, the input audio signal being represented by a plurality of first subband signals generated by an analysis filterbank, wherein a number of filterbank channels of the synthesis filterbank is smaller than a number of channels of the analysis filterbank. The apparatus furthermore has a further analysis filterbank for generating a plurality of second subband signals from the audio intermediate signal, wherein the further analysis filterbank has a number of channels being different from the number of channels of the synthesis filterbank, so that a sampling rate of a subband signal of the plurality of second subband signals is different from a sampling rate of a first subband signal of the plurality of first subband signals.

Type: Grant

Filed: March 15, 2017

Date of Patent: July 24, 2018

Assignees: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V., Dolby International AB

Inventors: Lars Villemoes, Per Ekstrand, Sascha Disch, Frederik Nagel, Stephan Wilde
Sound detection method for recognizing hazard situation

Patent number: 10014003

Abstract: A method of detecting a particular abnormal sound in an environment with background noise is provided. The method includes acquiring a sound from a microphone, separating abnormal sounds from the input sound based on non-negative matrix factorization (NMF), extracting Mel-frequency cepstral coefficient (MFCC) parameters according to the separated abnormal sounds, calculating hidden Markov model (HMM) likelihoods according to the separated abnormal sounds, and comparing the likelihoods of the separated abnormal sounds with a reference value to determine whether or not an abnormal sound has occurred. According to the method, based on NMF, a sound to be detected is compared with ambient noise in a one-to-one basis and classified so that the sound may be stably detected even in an actual environment with multiple noises.

Type: Grant

Filed: February 11, 2016

Date of Patent: July 3, 2018

Assignee: Gwangju Institute of Science and Technology

Inventors: Hong-Kook Kim, Dong Yun Lee, Kwang Myung Jeon
Mobile terminal with a built-in voice message searching function and corresponding searching method

Patent number: 9992321

Abstract: A mobile terminal with a built-in voice message searching function includes: a voice recording module configured to record a voice searching signal from a user and to send the voice searching signal to the pre-processing module for pre-processing, a pre-processing module configured to pre-process the voice searching signal, and to send the pre-processed signal to the matching module for signal matching, a matching module configured to extract a characteristic parameter of the pre-processed signal, to calculate a similarity of the extracted characteristic parameter with a characteristic parameter of a stored voice message, and to send the voice message with a similarity higher than or equal to a threshold to the result outputting module, and a result outputting module configured to display the voice message with the similarity higher than or equal to the threshold on a screen of the mobile terminal.

Type: Grant

Filed: July 9, 2013

Date of Patent: June 5, 2018

Assignee: ZTE CORPORATION

Inventor: Zheng Dang
Method and system of random access compression of transducer data for automatic speech recognition decoding

Patent number: 9990917

Abstract: A system, article, and method of random access compression of transducer data for automatic speech recognition decoding.

Type: Grant

Filed: April 13, 2015

Date of Patent: June 5, 2018

Assignee: Intel Corporation

Inventor: Joachim Hofer
Voice filter system

Patent number: 9978065

Abstract: Embodiments of the invention are directed to systems and methods for voice filtering. In some embodiments, an original voice segment from a user may be received. The received original voice segment may be modified using a first predetermined algorithm. The modified voice segment may be sent to an authentication server. At the authentication server, the modified voice segment may be reconstructed into the original voice segment using a second predetermined algorithm. The user may be authenticated for a transaction based at least in part on the reconstructed original voice segment.

Type: Grant

Filed: June 25, 2014

Date of Patent: May 22, 2018

Assignee: Visa International Service Association

Inventors: Shaw Li, Dhiraj Sharda, Douglas Fisher
Systems and methods for audio based synchronization using sound harmonics

Patent number: 9972294

Abstract: Multiple audio files may be synchronized using harmonic sound included in audio content obtained from audio tracks. Individual audio tracks are partitioned into multiple temporal windows of a first and second temporal window length. Individual audio waveforms for individual temporal windows of the first and second window length are transformed into frequency space in which energy is represented as a function of frequency. Individual pitches and magnitudes of harmonic sound determined for individual temporal windows may be compared using a multi-resolution framework to correlate pitches and harmonic energy of multiple audio tracks to one another.

Type: Grant

Filed: March 14, 2017

Date of Patent: May 15, 2018

Assignee: GoPro, Inc.

Inventor: David Tcheng
Resampling of an audio signal interrupted with a variable sampling frequency according to the frame

Patent number: 9940943

Abstract: A method for resampling an audio-frequency signal with an output sampling frequency, for a current signal frame. The method is used when the preceding frame is sampled at a first sampling frequency which is different from a second sampling frequency of the current frame. The method includes: determining a first and second segments of the signal by adding samples at zero at the end of stored samples of the preceding frame and at the start of samples of the current frame, respectively; obtaining the first resampled segment and the second resampled segment by applying at least one resampling filter respectively to the first segment resampling the first frequency at the output frequency, and to the second segment resampling the second frequency at the output frequency; and combining the overlapping portion of the first and second resampled segments to obtain at least one portion of the resampled current frame.

Type: Grant

Filed: December 11, 2014

Date of Patent: April 10, 2018

Assignee: ORANGE

Inventors: Stephane Ragot, Jerome Daniel, Balazs Kovesi
Exploiting visual information for enhancing audio signals via source separation and beamforming

Patent number: 9904851

Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

Type: Grant

Filed: June 11, 2014

Date of Patent: February 27, 2018

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Dimitrios Dimitriadis, Donald J. Bowen, Lusheng Ji, Horst J. Schroeter
Synchronization of multimedia streams

Patent number: 9877066

Abstract: This method for synchronizing a first multimedia stream rendered on a first terminal and a second multimedia stream rendered on a second terminal, comprises a step of generation, from an original audio sequence of the first stream, of original audio fingerprints, and further comprises steps of: a) generation from a first sequence of the first stream first audio fingerprints; b) comparison between the first fingerprints and the original fingerprints in order to obtain one or more first synchronization positions; c) correlation of the first sequence with one or more pieces of the original sequence located around the first synchronization positions in order to obtain a second synchronization position; d) rendering of the second stream on the second terminal using the second synchronization position.

Type: Grant

Filed: April 2, 2013

Date of Patent: January 23, 2018

Assignee: THOMSON LICENSING DTV

Inventors: Quang Khanh Ngoc Duong, Yvon Legallais, Christopher Howson
Lossless embedded additional data

Patent number: 9870777

Abstract: Methods are disclosed for an encoder to embed a data stream into a quantized PCM digital audio signal and for a corresponding decoder to both retrieve the data stream and losslessly reconstruct the exact original audio. Some methods employ complimentary amplification and attenuation, while others employ gain redistribution. Pre-emphasis and soft clipping techniques are described as methods of losslessly reducing the peak excursion of the PCM audio signal. Also described is the lossless placing of data at predetermined positions within an audio stream.

Type: Grant

Filed: October 24, 2012

Date of Patent: January 16, 2018

Inventors: Peter Graham Craven, Malcolm Law
Selective phase compensation in high band coding of an audio signal

Patent number: 9858941

Abstract: A method includes determining, at an encoder, phase adjustment parameters based on a high-band residual signal. The method also includes inserting the phase adjustment parameters into an encoded version of the audio signal to enable phase adjustment during reconstruction of the audio signal from the encoded version of the audio signal.

Type: Grant

Filed: November 21, 2014

Date of Patent: January 2, 2018

Assignee: QUALCOMM Incorporated

Inventors: Venkatraman S. Atti, Venkata Subrahmanyam Chandra Sekhar Chebiyyam
Voice recognition of commands extracted from user interface screen devices

Patent number: 9858039

Abstract: A method, system, and computer program product for human interface design. Embodiments proceed upon receiving a markup language description of user interface pages (e.g., HTML pages), then, without modifying the user interface page, parsing the markup language description to identify user interface objects configured to perform an operation responsive to a keyboard or mouse or pointing device. One or more mapping techniques serve to relate the parsed-out operation(s) to one or more voice commands. In some embodiments, the parser recognizes interface objects in forms such as a button, a textbox, a checkbox, or an option menu, and the voice commands correspond to an aspect that is displayed when rendering the interface object (e.g., a button label, a menu option, etc.). After receiving a user utterance, the utterance is converted into a text representation which in turn is mapped to voice commands that were parsed from the user interface page.

Type: Grant

Filed: January 28, 2014

Date of Patent: January 2, 2018

Assignee: Oracle International Corporation

Inventors: Saurabh Kumar, Srinivasa Rao Kowdeed, Kavin Kumar Kuppusamy
Audio signal processing system for discontinuity correction

Patent number: 9799344

Abstract: An audio signal processing device comprises a discontinuity detector configured to determine an occurrence of a discontinuity from a sudden increase of an amplitude of decoded audio obtained by decoding the first audio packet which is received correctly after an occurrence of a packet loss, and a discontinuity corrector for correcting the discontinuity of the decoded audio.

Type: Grant

Filed: April 28, 2016

Date of Patent: October 24, 2017

Assignee: NTT DoCoMo, Inc.

Inventors: Kimitaka Tsutsumi, Kei Kikuiri, Atsushi Yamaguchi
Apparatus and method for processing an input audio signal using cascaded filterbanks

Patent number: 9792915

Abstract: An apparatus for processing an input audio signal relies on a cascade of filterbanks, the cascade having a synthesis filterbank for synthesizing an audio intermediate signal from the input audio signal, the input audio signal being represented by a plurality of first subband signals generated by an analysis filterbank, wherein a number of filterbank channels of the synthesis filterbank is smaller than a number of channels of the analysis filterbank. The apparatus furthermore has a further analysis filterbank for generating a plurality of second subband signals from the audio intermediate signal, wherein the further analysis filterbank has a number of channels being different from the number of channels of the synthesis filterbank, so that a sampling rate of a subband signal of the plurality of second subband signals is different from a sampling rate of a first subband signal of the plurality of first subband signals.

Type: Grant

Filed: September 5, 2012

Date of Patent: October 17, 2017

Assignees: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V., Dolby International AB

Inventors: Lars Villemoes, Per Ekstrand, Sascha Disch, Frederik Nagel, Stephan Wilde
Selecting a packet loss concealment procedure

Patent number: 9712414

Abstract: In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus thereof for selecting a packet loss concealment procedure for a lost audio frame of a received audio signal. A method for selecting a packet loss concealment procedure comprises detecting an audio type of a received audio frame and determining a packet loss concealment procedure based on the audio type. In the method, detecting an audio type comprises determining a stability of a spectral envelope of signals of received audio frames.

Type: Grant

Filed: May 12, 2015

Date of Patent: July 18, 2017

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventor: Stefan Bruhn
Systems and methods for audio based synchronization using sound harmonics

Patent number: 9640159

Abstract: Multiple audio files may be synchronized using harmonic sound included in audio content obtained from audio tracks. Individual audio tracks are partitioned into multiple temporal windows of a first and second temporal window length. Individual audio waveforms for individual temporal windows of the first and second window length are transformed into frequency space in which energy is represented as a function of frequency. Individual pitches and magnitudes of harmonic sound determined for individual temporal windows may be compared using a multi-resolution framework to correlate pitches and harmonic energy of multiple audio tracks to one another.

Type: Grant

Filed: August 25, 2016

Date of Patent: May 2, 2017

Assignee: GoPro, Inc.

Inventor: David Tcheng
Frame loss recovering method, and audio decoding method and device using same

Patent number: 9633662

Abstract: The present invention relates to a frame loss recovering method, an audio decoding method, and an apparatus using the method. A method of recovering a frame loss of an audio signal according to the present invention includes: grouping transform coefficients of at least one frame into a predetermined number of bands among previous frames of a current frame; deriving an attenuation constant according to a tonality of the bands; and recovering transform coefficients of the current frame by applying the attenuation constant to the previous frame of the current frame.

Type: Grant

Filed: September 11, 2013

Date of Patent: April 25, 2017

Assignee: LG Electronics Inc.

Inventors: Gyuhyeok Jeong, Hyejeong Jeon, Ingyu Kang
Systems and methods of providing modified media content

Patent number: 9628741

Abstract: A method includes processing media content. The media content includes audio data corresponding to a first audio playback rate and video data corresponding to a first video playback rate. Processing the media content includes identifying a speech portion of the audio data. The speech portion includes a consonant portion. The method further includes producing modified media content. The modified media content is produced based on modifying the video data and modifying the audio data. Modifying the audio data includes applying a non-linear transformation to the speech portion identified in the audio data. The method further includes storing the modified media content.

Type: Grant

Filed: October 11, 2012

Date of Patent: April 18, 2017

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Andrej Ljolje, Ann Syrdal, Alistair Conkie
System and method of improving speech recognition using context

Patent number: 9626963

Abstract: A system and method are provided for improving speech recognition accuracy. Contextual information about user speech may be received, and then speech recognition analysis can be performed on the user speech using the contextual information. This allows the system and method to improve accuracy when performing tasks like searching and navigating using speech recognition.

Type: Grant

Filed: April 30, 2013

Date of Patent: April 18, 2017

Assignee: PAYPAL, INC.

Inventor: Eric J. Farraro
Method and apparatus for increasing capacity of air interface

Patent number: 9614611

Abstract: The disclosure provides a method and an apparatus for increasing the capacity of an air interface. The method comprises the following steps that: after a traffic channel is established on a base station side, a base station transmitting a ? rate frame to the air interface in a continuous transmission mode if not capturing a traffic channel frame prefix from a terminal, so as to ensure that the terminal can receive a forward frame of the base station; the base station only reducing the transmission of the ? rate frame after capturing the prefix from the terminal; and the terminal keeping calling and does not release the call if continuous good frames are received during the demodulation of forward traffic frame from the base station. Through the disclosure, the interference between forward channels is reduced, and the capacity of the air interface is increased.

Type: Grant

Filed: July 16, 2012

Date of Patent: April 4, 2017

Assignee: ZTE CORPORATION

Inventor: Haiwei Liu
Signal processing device, signal processing method and signal processing program

Patent number: 9613632

Abstract: To achieve sufficient noise cancellation when a reference signal cannot be captured in the proximity of a noise source. The present invention is characterized by comprising: a first input means for obtaining a first mixed signal in which a first signal and a second signal are mixed; a second input means for obtaining a second mixed signal in which the first signal and the second signal are mixed in a different ratio from the first mixed signal; a delay means for generating a delayed first mixed signal by delaying the first mixed signal with a delay amount based on transmission distance from a generation source of the second signal to the second input means; a subtracting means for outputting an estimated first signal in which a pseudo second signal is subtracted from the delayed first mixed signal; and an adaptive filtering means for generating the pseudo second signal applying coefficients which are updated based on the estimated first signal to the second mixed signal.

Type: Grant

Filed: September 26, 2011

Date of Patent: April 4, 2017

Assignee: NEC CORPORATION

Inventor: Akihiko Sugiyama
Apparatus and method for recognizing voice and text

Patent number: 9613618

Abstract: A method for recognizing a voice includes receiving, as an input, a voice involving multiple languages, recognizing a first voice of the voice by using a voice recognition algorithm matched to a preset primary language, identifying the preset primary language and a non-primary language different from the preset primary language, which are included in the multiple languages, determining a type of the non-primary language based on context information, recognizing a second voice of the voice in the non-primary language by applying a voice recognition algorithm, which is matched to the non-primary language of the determined type, to the second voice, and outputting a result of recognizing the voice which is based on a result of recognizing the first voice and a result of recognizing the second voice.

Type: Grant

Filed: July 3, 2014

Date of Patent: April 4, 2017

Assignee: SAMSUNG ELECTRONICS CO., LTD

Inventor: Subhojit Chakladar
Voiced sound interval classification device, voiced sound interval classification method and voiced sound interval classification program

Patent number: 9530435

Abstract: The voiced sound interval classification device comprises a vector calculation unit which calculates, from a power spectrum time series of voice signals, a multidimensional vector series as a vector series of a power spectrum having as many dimensions as the number of microphones, a difference calculation unit which calculates, with respect to each time of the multidimensional vector series, a vector of a difference between the time and the preceding time, a sound source direction estimation unit which estimates, as a sound source direction, a main component of the differential vector, and a voiced sound interval determination unit which determines whether each sound source direction is in a voiced sound interval or a voiceless sound interval by using a predetermined voiced sound index indicative of a likelihood of a voiced sound interval of the voice signal applied at each time.

Type: Grant

Filed: January 25, 2012

Date of Patent: December 27, 2016

Assignee: NEC CORPORATION

Inventor: Yoshifumi Onishi
Dynamic insertion of a quality enhancement gateway

Patent number: 9521015

Abstract: Methods, systems, and computer readable media for enhancing media quality by dynamically inserting a quality enhancement gateway are disclosed. According to one method, steps are performed at a service node. The method includes detecting a condition associated with a session between two endpoints that indicates that the session could benefit from media quality enhancement processing. The method also includes signaling the endpoints to route at least a portion of the session through a quality enhancement gateway (QEG) capable of providing the media quality enhancement processing for the session.

Type: Grant

Filed: December 21, 2010

Date of Patent: December 13, 2016

Assignee: GENBAND US LLC

Inventors: Dany Sylvain, Richard Charles Taylor
Vehicle voice recognition systems and methods

Patent number: 9418661

Abstract: A voice recognition system includes a microphone for receiving speech from a user and processing electronics. The processing electronics are in communication with the microphone and are configured to use a plurality of rules to evaluate user interactions with the voice recognition system. The processing electronics automatically determine and set an expertise level in response to and based on the evaluation. The processing electronics are configured to automatically adjust at least one setting of the voice recognition system in response to the set expertise level.

Type: Grant

Filed: May 11, 2012

Date of Patent: August 16, 2016

Assignee: Johnson Controls Technology Company

Inventors: William Fay, Brian L. Douthitt, David J. Hughes, Brian Hannum
Editing device

Patent number: 9330717

Abstract: An editing device for connecting first and second streams containing variable length coded parameters, includes a decision unit operable to compare an encoded first parameter contained in the first stream with an encoded second parameter contained in the second stream, and decide whether at least one of the first parameter and the second parameter is to be changed, and a replacing unit operable to replace at least one of the first and second parameters with the other code word of the same length as the code words assigned to the first and second parameters, when the decision unit decides that at least one of the parameters is to be changed.

Type: Grant

Filed: September 17, 2013

Date of Patent: May 3, 2016

Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.

Inventors: Toshihiro Tanaka, Hiroshi Saito
Automatic conversion of speech into song, rap or other audible expression having target meter or rhythm

Patent number: 9324330

Abstract: Captured vocals may be automatically transformed using advanced digital signal processing techniques that provide captivating applications, and even purpose-built devices, in which mere novice user-musicians may generate, audibly render and share musical performances. In some cases, the automated transformations allow spoken vocals to be segmented, arranged, temporally aligned with a target rhythm, meter or accompanying backing tracks and pitch corrected in accord with a score or note sequence. Speech-to-song music applications are one such example. In some cases, spoken vocals may be transformed in accord with musical genres such as rap using automated segmentation and temporal alignment techniques, often without pitch correction. Such applications, which may employ different signal processing and different automated transformations, may nonetheless be understood as speech-to-rap variations on the theme.

Type: Grant

Filed: March 29, 2013

Date of Patent: April 26, 2016

Assignee: Smule, Inc.

Inventors: Parag Chordia, Mark Godfrey, Alexander Rae, Prerna Gupta, Perry R. Cook
Methods and systems for determining instructions for applications that are recognizable by a voice interface

Patent number: 9318128

Abstract: Methods and systems for facilitating development of voice-enabled applications are provided. The method may comprise receiving, at a computing device, a plurality of actions associated with a given application, parameters associated with each respective action, and example instructions responsive to respective actions. The method may also comprise determining candidate instructions based on the actions, parameters, and example instructions. Each candidate instruction may comprise one or more grammars recognizable by a voice interface for the given application. The method may further comprise the computing device receiving respective acceptance information for each candidate instruction, and comparing at least a portion of the respective acceptance information with a stored acceptance information log comprising predetermined acceptance information so as to determine a correlation.

Type: Grant

Filed: February 1, 2013

Date of Patent: April 19, 2016

Assignee: Google Inc.

Inventors: Mark Edward Epstein, Pedro J. Moreno Mengibar, Fadi Biadsy
Data compression and decompression using SIMD instructions

Patent number: 9274802

Abstract: Compression and decompression of numerical data utilizing single instruction, multiple data (SIMD) instructions is described. The numerical data includes integer and floating-point samples. Compression supports three encoding modes: lossless, fixed-rate, and fixed-quality. SIMD instructions for compression operations may include attenuation, derivative calculations, bit packing to form compressed packets, header generation for the packets, and packed array output operations. SIMD instructions for decompression may include packed array input operations, header recovery, decoder control, bit unpacking, integration, and amplification. Compression and decompression may be implemented in a microprocessor, digital signal processor, field-programmable gate array, application-specific integrated circuit, system-on-chip, or graphics processor, using SIMD instructions. Compression and decompression of numerical data can reduce memory, networking, and storage bottlenecks.

Type: Grant

Filed: January 22, 2013

Date of Patent: March 1, 2016

Assignee: Altera Corporation

Inventor: Albert W. Wegener
System and method for concatenate speech samples within an optimal crossing point

Patent number: 9251782

Abstract: A method for identifying an optimal crossing point for concatenation of speech samples within an overlap area is provided. The method includes retrieving a first speech sample and a second speech sample, the second speech sample is concatenated immediately after the first speech sample is concatenated; determining a first region within the ending of the first speech sample and a second region within the beginning of the second speech sample, the first region and the second region are determined respective of relatively high spectral similarity over time between the first speech sample and the second speech sample; identifying an overlap region between the first region and the second region; determining an optimal crossing point between the first speech sample and the second speech sample, the optimal crossing point has a maximum correlation over time; and concatenating the first speech sample and the second speech sample at the optimal crossing point.

Type: Grant

Filed: June 23, 2014

Date of Patent: February 2, 2016

Assignee: VivoText Ltd.

Inventors: Yossef Ben Ezra, Shai Nissim, Gershon Silbert, Moti Zilberman
Systems and methods for data retry using averaging process

Patent number: 9190104

Abstract: Embodiments are related to systems and methods for data processing, and more particularly to systems and methods for calibration during data processing. As an example, a data processing system is discussed that includes a sample averaging circuit operable to average digital samples from an analog to digital converter circuit over multiple instances of an analog input to yield an X-average output, and a selector circuit operable to select one of the digital samples or the X-average output as a processing output.

Type: Grant

Filed: March 13, 2013

Date of Patent: November 17, 2015

Assignee: Avago Technologies General IP (Singapore) Pte. Ltd.

Inventors: Shaohua Yang, Kapil Gaba, Yoon L. Liow, Xuebin Wu, Qi Zuo, YuQing Yang, Lei Wang

prev 1 2 3 4 5 6 … next