Speech Signal Processing Patents (Class 704/200)

Psychoacoustic (Class 704/200.1)

For storage or transmission (Class 704/201)

Recognition (Class 704/231)

Synthesis (Class 704/258)

Application (Class 704/270)

Method and apparatus for watermarking successive sections of an audio signal

Patent number: 9542954

Abstract: Audio watermarking is the process of embedding watermark information items into an audio signal in an inaudible manner. In a first embodiment, in case the original audio signal has parts of low signal energy, an alternative signal having a level or strength given by the psycho-acoustic model is combined with the original audio signal. The combined signal is watermarked with watermark data to be embedded. In a second embodiment, in case the original audio signal has parts of low signal energy, an alternative signal having a level or strength given by the psycho-acoustic model is watermarked with watermark data to be embedded, and the audio signal is watermarked with the watermark data to be embedded. The watermarked alternative signal is combined with the watermarked audio signal.

Type: Grant

Filed: February 4, 2015

Date of Patent: January 10, 2017

Assignee: THOMSON LICENSING

Inventors: Peter Georg Baum, Xiaoming Chen, Michael Arnold, Ulrich Gries

Audio signal transient detection

Patent number: 9536532

Abstract: Provided are, among other things, systems, methods and techniques for detecting whether a transient exists within an audio signal. According to one representative embodiment, a segment of a digital audio signal is divided into blocks, and a norm value is calculated for each of a number of the blocks, resulting in a set of norm values for such blocks, each such norm value representing a measure of signal strength within a corresponding block. A maximum norm value is then identified across such blocks, and a test criterion is applied to the norm values. If the test criterion is not satisfied, a first signal indicating that the segment does not include any transient is output, and if the test criterion is satisfied, a second signal indicating that the segment includes a transient is output. According to this embodiment, the test criterion involves a comparison of the maximum norm value to a different second maximum norm value, subject to a specified constraint, within the segment.
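The block-norm test described in this abstract can be illustrated with a minimal sketch. It is not the patented method: the L2 norm, the `min_gap` constraint (the second maximum must lie at least that many blocks away from the peak), and the `ratio` threshold are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math


def block_norms(segment, block_size):
    """Return the L2 norm of each full block of the segment."""
    return [
        math.sqrt(sum(x * x for x in segment[start:start + block_size]))
        for start in range(0, len(segment) - block_size + 1, block_size)
    ]


def has_transient(segment, block_size=4, min_gap=2, ratio=4.0):
    """Compare the largest block norm with the largest norm found at least
    min_gap blocks away from it (assumed constraint and threshold)."""
    norms = block_norms(segment, block_size)
    if not norms:
        return False
    # Index of the block with the maximum norm.
    peak = max(range(len(norms)), key=lambda i: norms[i])
    # Second maximum, restricted to blocks far enough from the peak.
    candidates = [n for i, n in enumerate(norms) if abs(i - peak) >= min_gap]
    if not candidates:
        return False
    second = max(candidates)
    return norms[peak] > ratio * second


quiet = [0.01] * 32
click = list(quiet)
click[16:20] = [1.0] * 4  # a short burst in an otherwise steady segment
print(has_transient(quiet), has_transient(click))
```

A steady segment yields nearly equal block norms, so the ratio test fails; a short burst makes one block dominate the others and the test passes.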

Type: Grant

Filed: May 20, 2016

Date of Patent: January 3, 2017

Assignee: Digital Rise Technology Co., Ltd.

Inventor: Yuli You

System and method for user selectable audible event indication for a vehicle

Patent number: 9533616

Abstract: A sound generation system for a vehicle includes a sound generator for operating a speaker of the vehicle to produce an audible sound. A controller detects a vehicle operation event and controls the sound generator to generate an audible indication of the event, an association of the audible indication with the event being programmable by a user.

Type: Grant

Filed: March 12, 2014

Date of Patent: January 3, 2017

Inventor: Shahar Feldman

Audio encoder and decoder for interleaved waveform coding

Patent number: 9514761

Abstract: Methods and apparatuses are provided for decoding and encoding audio signals. In particular, a method for decoding includes receiving a waveform-coded signal having a spectral content corresponding to a subset of the frequency range above a cross-over frequency. The waveform-coded signal is interleaved with a parametric high frequency reconstruction of the audio signal above the cross-over frequency. In this way an improved reconstruction of the high frequency bands of the audio signal is achieved.

Type: Grant

Filed: April 4, 2014

Date of Patent: December 6, 2016

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Robin Thesing, Harald Mundt, Heiko Purnhagen, Karl Jonas Roeden

System for integrating video calls in telephone call centers

Patent number: 9516265

Abstract: A communication system particularly for managing voice, video and data services between the station of an operator and the station of a user, the system including at least one device controlled by the operator to receive telephone calls forwarded by a call routing center and at least one unit that is controlled by the user and provided with elements for entering and displaying information and generating telephone calls. The device further includes elements for disabling the audio component of a telephone call generated by the at least one unit and elements for establishing with the unit a video call associated with the telephone call.

Type: Grant

Filed: December 29, 2014

Date of Patent: December 6, 2016

Assignee: PHONETICA LAB S.R.L.

Inventors: Marco Durante, Giuseppe Durante, Raoul Trevisi

Sound field spatial stabilizer

Patent number: 9516418

Abstract: In a system and method for maintaining the spatial stability of a sound field, a balance gain may be calculated for two or more microphone signals. The balance gain may be associated with a spatial image in the sound field. Signal values may be calculated for each of the microphone signals. The signal values may be signal estimates or signal gains calculated to improve a characteristic of the microphone signals. The differences between the signal values associated with each microphone signal may be limited although some difference between signal values may be allowable. One or more microphone signals are adjusted responsive to the two or more balance gains and the signal gains to maintain the spatial stability of the sound field. The adjustments of one or more microphone signals may include mixing of two or more microphone signals. The signal gains are applied to the two or more microphone signals.

Type: Grant

Filed: January 29, 2013

Date of Patent: December 6, 2016

Assignee: 2236008 Ontario Inc.

Inventors: Shreyas Paranjpe, Phillip Alan Hetherington

Acoustic echo preprocessing for speech enhancement

Patent number: 9508359

Abstract: A method for cancelling/reducing acoustic echoes in speech/audio signal enhancement processing comprises selecting a long-term filter based on an echo tail length detection or an echo reverberation time detection of a microphone input signal; a reference signal is pre-processed with the selected long-term filter; the pre-processed reference signal is used to excite an adaptive filter wherein the output of the adaptive filter forms a replica signal of acoustic echo and/or acoustic echo tail; the replica signal of acoustic echo and/or acoustic echo tail is subtracted from a microphone input signal to suppress the acoustic echo and/or acoustic echo tail in the microphone input signal. The echo tail length or the echo reverberation time is detected by analyzing and comparing the microphone input signal and a received signal which is sent to a speaker.

Type: Grant

Filed: June 16, 2015

Date of Patent: November 29, 2016

Inventor: Yang Gao

Wireless caption communication service system

Patent number: 9502037

Abstract: A Wireless Caption Communication Service (“WCCS”) System includes a relay center, a wireless caption communication device, and a wireless captioning service server. The wireless caption communication device has a voice collecting device and a wireless caption communication terminal. Text entered by a first user is transmitted to the wireless captioning service server and converted into a speech. Then, the speech is transmitted to the voice collecting device and the sound of the speech comes out of a speaker of the voice collecting device so that a second user can hear the speech. The voice of the second user is transmitted to the wireless captioning service server and then to the relay center. The voice is converted into a caption data and transmitted to the wireless caption communication device, and the caption data is displayed on the wireless caption communication terminal so that the first user can read the caption data.

Type: Grant

Filed: January 14, 2015

Date of Patent: November 22, 2016

Assignee: Miracom USA, Inc.

Inventor: Wonjae Cha

Acoustic activity detection apparatus and method

Patent number: 9502028

Abstract: Streaming audio is received. The streaming audio includes a frame having a plurality of samples. An energy estimate is obtained for the plurality of samples. The energy estimate is compared to at least one threshold. In addition, a band pass estimate of the signal is determined. An energy estimate is obtained for the band-passed plurality of samples. The two energy estimates are compared to at least one threshold each. Based upon the comparison operation, a determination is made as to whether speech is detected.
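A rough sketch of the two-energy comparison described in this abstract follows. The abstract does not specify the band-pass filter, the thresholds, or how the two comparisons are combined, so the filter `y[n] = (x[n] - x[n-2]) / 2` (which rejects DC and the Nyquist frequency), the threshold values, and the AND rule below are assumptions for illustration only.

```python
def mean_energy(samples):
    """Mean squared amplitude of a list of samples."""
    return sum(x * x for x in samples) / len(samples)


def crude_band_pass(samples):
    """Assumed stand-in filter: y[n] = (x[n] - x[n-2]) / 2.
    It has zero gain at DC and at the Nyquist frequency."""
    return [(samples[n] - samples[n - 2]) / 2.0 for n in range(2, len(samples))]


def speech_detected(frame, full_threshold=0.01, band_threshold=0.01):
    """Declare speech only if both energy estimates exceed their thresholds
    (the AND combination is an assumption). Needs at least 3 samples."""
    full = mean_energy(frame)
    band = mean_energy(crude_band_pass(frame))
    return full > full_threshold and band > band_threshold


dc_offset = [0.5] * 64               # high energy, but all of it at DC
tone = [0.0, 1.0, 0.0, -1.0] * 16    # tone at a quarter of the sample rate
print(speech_detected(dc_offset), speech_detected(tone))
```

The DC-only frame passes the full-band energy check but fails the band-passed one, which is the kind of false trigger the second estimate is meant to reject.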

Type: Grant

Filed: October 13, 2014

Date of Patent: November 22, 2016

Assignee: Knowles Electronics, LLC

Inventors: Dibyendu Nandy, Yang Li, Henrik Thomsen, Claus Furst

Method for processing the output of a speech recognizer

Patent number: 9502027

Abstract: A method for processing speech, comprising semantically parsing a received natural language speech input with respect to a plurality of predetermined command grammars in an automated speech processing system; determining if the parsed speech input unambiguously corresponds to a command and is sufficiently complete for reliable processing, then processing the command; if the speech input ambiguously corresponds to a single command or is not sufficiently complete for reliable processing, then prompting a user for further speech input to reduce ambiguity or increase completeness, in dependence on a relationship of previously received speech input and at least one command grammar of the plurality of predetermined command grammars, reparsing the further speech input in conjunction with previously parsed speech input, and iterating as necessary. The system also monitors abort, fail or cancel conditions in the speech input.

Type: Grant

Filed: July 29, 2014

Date of Patent: November 22, 2016

Assignee: Great Northern Research, LLC

Inventors: Philippe Roy, Paul J. Lagassey

System and method of synthetic voice generation and modification

Patent number: 9495954

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of a first text-to-speech voice and a second database of a second text-to-speech voice to generate a combined database, selects from the combined database, based on a policy, voice units of a phonetic category for the synthetic voice to yield selected voice units, and synthesizes speech based on the selected voice units. The system can synthesize speech without parameterizing the first text-to-speech voice and the second text-to-speech voice. A policy can define, for a particular phonetic category, from which text-to-speech voice to select voice units. The combined database can include multiple text-to-speech voices from different speakers. The combined database can include voices of a single speaker speaking in different styles. The combined database can include voices of different languages.

Type: Grant

Filed: February 22, 2016

Date of Patent: November 15, 2016

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Alistair D. Conkie, Ann K. Syrdal

Method and system for capturing reading assessment data

Patent number: 9478146

Abstract: A method and system for assessing a student's reading ability is disclosed. An image-capturing device detects, from a worksheet comprising a position-identifying pattern, a first mark in a first region of the worksheet. The first mark is in a first indicator portion of the position-identifying pattern contained within a first indicator region that is associated with a first word. The image-capturing device detects a first note in a note region of the worksheet. Based on whether the first mark, the first note, or both indicates that the first word was read incorrectly or correctly, a processor determines a first reading assessment result for the first word and stores, in a memory, a digital document file comprising the first reading assessment result.

Type: Grant

Filed: March 4, 2013

Date of Patent: October 25, 2016

Assignee: Xerox Corporation

Inventors: Gary W. Skinner, Robert M. Lofthus, Dusan G. Lysy, Michael Robert Furst

Schema evolution via transition information

Patent number: 9471617

Abstract: Disclosed herein are system, method, and computer program product embodiments for transforming data from a first version, for example an initial version of a database, to a second version, for example a subsequent version of a database. An embodiment operates by modifying the metadata of the data to include transformational clauses, each of which describes how a portion of the data in the first version is transformed to data required by the second version.

Type: Grant

Filed: May 8, 2014

Date of Patent: October 18, 2016

Assignee: SAP AG

Inventor: Bjoern Mielenhausen

System and method for calculating similarity of audio file

Patent number: 9466315

Abstract: A method for calculating a similarity of audio files includes constituting a pitch sequence of a first audio file and a pitch sequence of a second audio file; calculating an eigenvector of the first audio file according to the pitch sequence of the first audio file, and calculating an eigenvector of the second audio file according to the pitch sequence of the second audio file; calculating a similarity between the first audio file and the second audio file according to the eigenvector of the first audio file and the eigenvector of the second audio file.
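The pipeline in this abstract (pitch sequence, then a per-file vector, then a similarity between the two vectors) can be sketched as below. The abstract does not say how the "eigenvector" is derived, so this sketch substitutes a 12-bin pitch-class histogram as the per-file feature vector and cosine similarity as the comparison; both are assumptions, as is the use of MIDI note numbers for the pitch sequence.

```python
import math


def pitch_class_vector(pitches):
    """Assumed feature vector: count of each of the 12 pitch classes,
    with pitches given as integer MIDI note numbers."""
    vector = [0.0] * 12
    for pitch in pitches:
        vector[pitch % 12] += 1.0
    return vector


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def audio_similarity(pitches_a, pitches_b):
    """Similarity of two audio files represented by their pitch sequences."""
    return cosine_similarity(pitch_class_vector(pitches_a),
                             pitch_class_vector(pitches_b))


melody = [60, 62, 64, 60]
octave_up = [72, 74, 76, 72]
print(audio_similarity(melody, octave_up))
```

With this choice of feature, a melody and its octave transposition compare as near-identical, while sequences sharing no pitch classes score zero.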

Type: Grant

Filed: August 4, 2014

Date of Patent: October 11, 2016

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Weifeng Zhao, Shenyuan Li, Liwei Zhang, Jianfeng Chen

Method, medium, and apparatus encoding and/or decoding extension data for surround

Patent number: 9460725

Abstract: A method, medium, and apparatus encoding and/or decoding an audio signal to surround data. While encoding spatial information, which can up-mix an audio signal to a surround signal, to extension data, a length of a payload corresponding to the spatial information is encoded and a payload of the spatial information is decoded using the length of the payload. Accordingly, compatibility of the spatial information can be provided, and the spatial information can be transmitted by effectively embedding the spatial information.

Type: Grant

Filed: July 17, 2012

Date of Patent: October 4, 2016

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Jung-hoe Kim, Eun-Mi Oh

Named entity resolution in spoken language processing

Patent number: 9454957

Abstract: Features are disclosed for determining an element of a user utterance or user intent in conjunction with one or more related elements of the user utterance or user intent. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a natural language understanding (“NLU”) module. The NLU module may perform named entity recognition, intent classification, and/or other processes on the ASR results. In addition, the NLU module may determine or verify the values associated with the recognized named entities using a data store of known values. When two or more named entities are related, their values may be determined and/or verified in conjunction with each other in order to preserve the relationship between them.

Type: Grant

Filed: March 5, 2013

Date of Patent: September 27, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Lambert Mathias, Weam Abu Zaki, Ying Shi

Systems and methods for evaluating difficulty of spoken text

Patent number: 9449522

Abstract: Systems and methods are provided for assigning a difficulty score to a speech sample. Speech recognition is performed on a digitized version of the speech sample using an acoustic model to generate word hypotheses for the speech sample. Time alignment is performed between the speech sample and the word hypotheses to associate the word hypotheses with corresponding sounds of the speech sample. A first difficulty measure is determined based on the word hypotheses, and a second difficulty measure is determined based on acoustic features of the speech sample. A difficulty score for the speech sample is generated based on the first difficulty measure and the second difficulty measure.

Type: Grant

Filed: November 15, 2013

Date of Patent: September 20, 2016

Assignee: Educational Testing Service

Inventors: Su-Youn Yoon, Yeonsuk Cho, Klaus Zechner, Diane Napolitano

Method, application, and device for audio signal transmission

Patent number: 9437205

Abstract: The current invention discloses methods, applications, and devices for audio transmission from a mobile terminal. After receiving an audio signal transmission request from a user, the mobile terminal may initiate a recording session to record audio signals into audio frames. During the recording session, the terminal may adjust the audio codecs used for encoding the audio frames based on the workload and the performance of the terminal. By measuring and evaluating the encoding time, the terminal may change between using a floating-point AMR audio codec and a fixed-point AMR audio codec. The encoded audio frames are transmitted to a remote server. The current invention provides a flexible and efficient approach for audio signal encoding and transmission, balancing signal integrity and encoding speed at the same time.
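The codec switching described in this abstract can be pictured as a small decision rule driven by measured encoding time. The sketch below is illustrative only: it assumes the fixed-point codec is the cheaper one on the terminal, uses the 20 ms AMR frame duration as the time budget, and adds hysteresis thresholds (`high`, `low`) that the abstract does not mention. The codec labels and function name are hypothetical.

```python
FLOAT_CODEC = "amr_float"   # hypothetical label for the floating-point AMR codec
FIXED_CODEC = "amr_fixed"   # hypothetical label for the fixed-point AMR codec


def next_codec(current, encode_ms, frame_ms=20.0, high=0.8, low=0.4):
    """Pick the codec for the next frame from the last measured encode time.

    load is the fraction of the frame duration spent encoding. Above `high`
    the terminal falls back to the (assumed cheaper) fixed-point codec;
    below `low` it returns to the floating-point codec.
    """
    load = encode_ms / frame_ms
    if current == FLOAT_CODEC and load > high:
        return FIXED_CODEC
    if current == FIXED_CODEC and load < low:
        return FLOAT_CODEC
    return current


codec = FLOAT_CODEC
for measured_ms in [5.0, 18.0, 9.0, 6.0]:
    codec = next_codec(codec, measured_ms)
    print(measured_ms, codec)
```

The gap between the two thresholds keeps the terminal from flipping codecs on every frame when the load hovers near a single cutoff.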

Type: Grant

Filed: December 16, 2013

Date of Patent: September 6, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Xiaolong Zhang, Yuan Zhao, Ganrong Yang

Bandwidth extension of harmonic audio signal

Patent number: 9437202

Abstract: Methods and arrangements in a codec for supporting bandwidth extension, BWE, of a harmonic audio signal. The method in the decoder part of the codec comprises receiving a plurality of gain values associated with a frequency band b and a number of adjacent frequency bands of band b. The method further comprises determining whether a reconstructed corresponding frequency band b′ comprises a spectral peak. When the band b′ comprises a spectral peak, a gain value associated with the band b′ is set to a first value based on the received plurality of gain values; and otherwise the gain value is set to a second value based on the received plurality of gain values. The suggested technology enables bringing gain values into agreement with peak positions in a bandwidth extended frequency region.

Type: Grant

Filed: December 21, 2012

Date of Patent: September 6, 2016

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Sebastian Näslund, Volodya Grancharov, Tomas Jansson Toftgård

Methods and apparatuses for automatic speech recognition

Patent number: 9431006

Abstract: Exemplary embodiments of methods and apparatuses for automatic speech recognition are described. First model parameters associated with a first representation of an input signal are generated. The first representation of the input signal is a discrete parameter representation. Second model parameters associated with a second representation of the input signal are generated. The second representation of the input signal includes a continuous parameter representation of residuals of the input signal. The first representation of the input signal includes discrete parameters representing first portions of the input signal. The second representation includes discrete parameters representing second portions of the input signal that are smaller than the first portions. Third model parameters are generated to couple the first representation of the input signal with the second representation of the input signal. The first representation and the second representation of the input signal are mapped into a vector space.

Type: Grant

Filed: July 2, 2009

Date of Patent: August 30, 2016

Assignee: Apple Inc.

Inventor: Jerome R. Bellegarda

Apparatus and method for managing advertisement application

Patent number: 9406070

Abstract: A method and apparatus for managing an advertisement application in a mobile advertising system is provided. When an advertisement application for representing an advertisement is installed in an advertisement-receiving terminal, a registration request for the installed advertisement application is made. The advertisement-receiving terminal assigns an application Identifier (ID) to the advertisement application in response to the registration request, and stores a profile of the advertisement application in association with the assigned application ID.

Type: Grant

Filed: October 19, 2009

Date of Patent: August 2, 2016

Assignee: Samsung Electronics Co., Ltd.

Inventors: Ji-Hye Lee, Hae-Young Jun, Seok-Hoon Choi

Method and device for acoustic language model training

Patent number: 9396723

Abstract: A method and a device for training an acoustic language model, include: conducting word segmentation for training samples in a training corpus using an initial language model containing no word class labels, to obtain initial word segmentation data containing no word class labels; performing word class replacement for the initial word segmentation data containing no word class labels, to obtain first word segmentation data containing word class labels; using the first word segmentation data containing word class labels to train a first language model containing word class labels; using the first language model containing word class labels to conduct word segmentation for the training samples in the training corpus, to obtain second word segmentation data containing word class labels; and in accordance with the second word segmentation data meeting one or more predetermined criteria, using the second word segmentation data containing word class labels to train the acoustic language model.

Type: Grant

Filed: December 17, 2013

Date of Patent: July 19, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Duling Lu, Lu Li, Feng Rao, Bo Chen, Li Lu, Xiang Zhang, Eryu Wang, Shuai Yue

Synchronizing robot motion with social interaction

Patent number: 9375845

Abstract: A method of synchronizing robot motion with a social interaction. The method comprises storing in the robot a map that associates keywords with at least one robot motion, composing by the robot a dialogue based on a context of a social interaction with a human being, searching the dialogue for keywords, parsing the dialogue to determine its syntax, and analyzing the syntax. The method further comprises generating, by the robot, a robot motion script synchronized with the dialogue based on mapping one or more keywords located in the dialogue to robot motions, based on the syntax of the dialogue, and based on a physical cadence, wherein the robot motion script comprises a sequence of separate robot motions. The method further comprises playing aloud the dialogue by the robot and performing the robot motion script by the robot in synchronization with the playing aloud of the dialogue.

Type: Grant

Filed: September 30, 2014

Date of Patent: June 28, 2016

Assignee: Sprint Communications Company, L.P.

Inventors: Brandon C. Annan, Joshua R. Cole, Deborah L. Gilbert, Dhananjay Indurkar

Voice interactive service system and method for providing different speech-based services

Patent number: 9350860

Abstract: Systems and methods are provided for rendering different speech-based services to a plurality of users. A service-providing system may be accessed via a plurality of connectivity ports. Each of the connectivity ports may be associated with at least one of a plurality of different speech-related services. The association of the connectivity ports with the different speech-related services may be performed before receiving user service requests. The service-providing system may comprise a plurality of processing components, each of which may be configurable to provide one or more of a plurality of different speech-related services. The service-providing system may further comprise a connection component, which may be operable to establish a connection between the respective connectivity port and a processing component having a configuration suitable for performing a service requested through the respective connectivity port.

Type: Grant

Filed: October 8, 2014

Date of Patent: May 24, 2016

Assignee: SWISSCOM AG

Inventors: Roger Lagadec, Patrik Estermann, Luciano Butera

Speech translation method and apparatus utilizing prosodic information

Patent number: 9342509

Abstract: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.

Type: Grant

Filed: October 30, 2009

Date of Patent: May 17, 2016

Assignee: Nuance Communications, Inc.

Inventors: Fan Ping Meng, Yong Qin, Zhi Wei Shuang, Shi Lei Zhang

Coding device, communication processing device, and coding method

Patent number: 9324331

Abstract: Provided are a coding device, a communication processing device, and a coding method, whereby processing operation load (computational load) is significantly reduced for a configuration which computes either frame energy or sub-frame energy of an input signal, using auto-correlation operations, without causing a decline in the precision of either the frame energy or the sub-frame energy. In a coding device (101), a sub-frame energy computation unit (201) computes the sub-frame energy by substituting the sum of input signal auto-correlation operations in a first range with the sum of auto-correlation operations in a second range which differs at least partially from the first range.

Type: Grant

Filed: December 14, 2011

Date of Patent: April 26, 2016

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Tomofumi Yamanashi, Toshiyuki Morii

Feature sequence generating device, feature sequence generating method, and feature sequence generating program

Patent number: 9299338

Abstract: Spread level parameter correcting means 501 receives a contour parameter as information representing the contour of a feature sequence (a sequence of features of a signal considered as the object of generation) and a spread level parameter as information representing the level of a spread of the distribution of the features in the feature sequence. The spread level parameter correcting means 501 corrects the spread level parameter based on a variation of the contour parameter represented by a sequence of the contour parameters. Feature sequence generating means 502 generates the feature sequence based on the contour parameters and the corrected spread level parameters.

Type: Grant

Filed: October 28, 2011

Date of Patent: March 29, 2016

Assignee: NEC CORPORATION

Inventor: Masanori Kato

Model training for automatic speech recognition from imperfect transcription data

Patent number: 9280969

Abstract: Techniques and systems for training an acoustic model are described. In an embodiment, a technique for training an acoustic model includes dividing a corpus of training data that includes transcription errors into N parts, and on each part, decoding an utterance with an incremental acoustic model and an incremental language model to produce a decoded transcription. The technique may further include inserting silence between a pair of words into the decoded transcription and aligning an original transcription corresponding to the utterance with the decoded transcription according to time for each part. The technique may further include selecting a segment from the utterance having at least Q contiguous matching aligned words, and training the incremental acoustic model with the selected segment. The trained incremental acoustic model may then be used on a subsequent part of the training data. Other embodiments are described and claimed.

Type: Grant

Filed: June 10, 2009

Date of Patent: March 8, 2016

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Jinyu Li, Yifan Gong, Chaojun Liu, Kaisheng Yao

Customized voice action system

Patent number: 9275411

Abstract: Systems, methods, and computer-readable media that may be used to modify a voice action system to include voice actions provided by advertisers or users are provided. One method includes receiving electronic voice action bids from advertisers to modify the voice action system to include a specific voice action (e.g., a triggering phrase and an action). One or more bids may be selected. The method includes, for each of the selected bids, modifying data associated with the voice action system to include the voice action associated with the bid, such that the action associated with the respective voice action is performed when voice input from a user is received that the voice action system determines to correspond to the triggering phrase associated with the respective voice action.

Type: Grant

Filed: May 23, 2012

Date of Patent: March 1, 2016

Assignee: Google Inc.

Inventor: Pedro J. Moreno Mengibar

System and methods for reducing silence descriptor frame transmit rate to improve performance in a multi-SIM wireless communication device

Patent number: 9258413

Abstract: Methods and devices are disclosed for enabling improved transmission performance on a multi-SIM wireless communication device. The wireless device may detect a voice communication in a held state on a modem stack associated with the first SIM and an active voice communication on a modem stack associated with the second SIM. The wireless device may detect a conflict between at least one silence descriptor (SID) frame scheduled for transmission by the modem stack associated with the first SIM and a transmit opportunity for the modem stack associated with the second SIM. Once the wireless device identifies a SID frame transmission rate for the modem stack associated with the first SIM, the wireless device may apply a reduction scheme to the SID frames scheduled to be transmitted by the modem stack associated with the first SIM.

Type: Grant

Filed: September 29, 2014

Date of Patent: February 9, 2016

Assignee: QUALCOMM Incorporated

Inventors: Divaydeep Sikri, Neha Goel, Jafar Mohseni, Mungal Singh Dhanda

Variable bit rate LPC filter quantizing and inverse quantizing device and method

Patent number: 9245532

Abstract: A device and a method for quantizing a LPC filter in the form of an input vector in a quantization domain, comprises a calculator of a first-stage approximation of the input vector, a subtractor of the first-stage approximation from the input vector to produce a residual vector, a calculator of a weighting function from the first-stage approximation, a warper of the residual vector with the weighting function, and a quantizer of the weighted residual vector to supply a quantized weighted residual vector.
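The chain of stages named in this abstract (first-stage approximation, residual, weighting function derived from the approximation, warping, quantization) is sketched below under stated assumptions. The tiny codebook, the weighting rule (distance from each coefficient to its nearest neighbour in the approximation), and the uniform scalar quantizer with step `step` are all hypothetical choices, not the patented design. The sketch assumes codebook vectors are positive and strictly increasing so that every weight is positive.

```python
def nearest_index(codebook, vector):
    """First stage: index of the codebook entry closest to the input vector."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vector)),
    )


def weights_from(approx):
    """Assumed weighting function: gap between each coefficient and its
    nearest neighbour in the first-stage approximation."""
    weights = []
    for i, value in enumerate(approx):
        left = value - approx[i - 1] if i > 0 else value
        right = approx[i + 1] - value if i < len(approx) - 1 else left
        weights.append(min(left, right))
    return weights


def quantize(vector, codebook, step=0.05):
    """Return (first-stage index, quantized weighted-residual levels)."""
    index = nearest_index(codebook, vector)
    approx = codebook[index]
    residual = [v - a for v, a in zip(vector, approx)]
    warped = [r / w for r, w in zip(residual, weights_from(approx))]
    return index, [round(x / step) for x in warped]


def dequantize(index, levels, codebook, step=0.05):
    """Inverse: rebuild the weights from the approximation and un-warp."""
    approx = codebook[index]
    return [a + level * step * w
            for a, level, w in zip(approx, levels, weights_from(approx))]


codebook = [[0.1, 0.3, 0.6], [0.2, 0.5, 0.9]]
index, levels = quantize([0.22, 0.47, 0.93], codebook)
print(index, levels, dequantize(index, levels, codebook))
```

Because the weights depend only on the first-stage approximation, the decoder can recompute them from the transmitted index and needs no extra side information to undo the warping.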

Type: Grant

Filed: July 10, 2009

Date of Patent: January 26, 2016

Assignee: VoiceAge Corporation

Inventors: Philippe Gournay, Bruno Bessette, Redwan Salami

Bio-phonetic multi-phrase speaker identity verification

Patent number: 9236051

Abstract: Systems and methods for bio-phonetic multi-phrase speaker identity verification are disclosed. Generally, a speaker identity verification engine generates a dynamic phrase including at least one dynamically-generated word. The speaker identity verification engine prompts a user to speak the dynamic phrase and receives a dynamic phrase utterance. The speaker identity verification engine extracts at least one voice characteristic from the dynamic phrase utterance and compares the at least one voice characteristic with a voice profile to generate a score. The speaker identity verification engine then determines whether to accept a speaker identity claim based on the score.

Type: Grant

Filed: June 24, 2009

Date of Patent: January 12, 2016

Assignee: AT&T Intellectual Property I, L.P.

Inventor: Hisao M. Chang
Video quality of service management and constrained fidelity constant bit rate video encoding systems and methods

Patent number: 9225980

Abstract: A constrained variable rate coding technique limits the number of bits used in an encoding process. A quality setting indicates a maximum level of quality to be used in the encoding process which limits the number of bits used in the encoding process. A bandwidth reclamation factor which indicates an amount of bandwidth to conserve may also be used with the quality setting. The constrained variable rate coding technique uses a lower quality encoding process for less complex video data and a higher quality encoding technique for higher quality video data.

Type: Grant

Filed: June 13, 2014

Date of Patent: December 29, 2015

Assignee: ARRIS Technology, Inc.

Inventors: Neil W. Brydon, Danny R. Hunt, Sean T. McCarthy
Playback device, television receiver, apparatus selection method, program and recording medium

Patent number: 9225933

Abstract: A television (1) including a function of a call using Internet Protocol (IP) includes: a communicating section to transmit and receive a call signal over an IP communication network; an incoming call destination identifying section (11) to identify a user who is designated as an incoming call destination; a judging section to detect a person who is present around the television (1); and a communication control section (13) to transfer the call signal to a mobile phone (4) of the user in a case where a plurality of persons containing the user has been detected. This offers a television capable of a call and ensuring the privacy of the call.

Type: Grant

Filed: August 30, 2012

Date of Patent: December 29, 2015

Assignee: SHARP KABUSHIKI KAISHA

Inventor: Mitsuru Nakamura
Alias cancelling during audio coding mode transitions

Patent number: 9214160

Abstract: An apparatus for processing an audio signal and method thereof are disclosed. The present invention includes receiving, by an audio processing apparatus, an audio signal including a first data of a first block encoded with rectangular coding scheme and a second data of a second block encoded with non-rectangular coding scheme; receiving a compensation signal corresponding to the second block; estimating a prediction of an aliasing part using the first data; and, obtaining a reconstructed signal for the second block based on the second data, the compensation signal and the prediction of aliasing part.

Type: Grant

Filed: August 6, 2013

Date of Patent: December 15, 2015

Assignee: Industry-Academic Cooperation Foundation, Yonsei University

Inventors: Hyen-O Oh, Chang Heon Lee, Hong Goo Kang, Jung Wook Song
Electronic device and dictionary data display method

Patent number: 9208143

Abstract: An electronic device includes a display module and a dictionary storage module which stores dictionary data that causes a plurality of entry words including compound words obtained by connecting a plurality of words to correspond to explanatory information on the entry words. When the user retrieves a dictionary, entry words for compound words are retrieved from the entry words in the dictionary storage module and words common to the retrieved compound words are listed and displayed on the display module. Entry words for compound words connecting with a word specified by a user operation in the displayed list are read from the dictionary data and displayed in list form on the display module.

Type: Grant

Filed: September 14, 2012

Date of Patent: December 8, 2015

Assignee: CASIO COMPUTER CO., LTD.

Inventor: Yukihiro Nakano
Methods and apparatus to generate a speech recognition library

Patent number: 9202460

Abstract: Methods and apparatus to generate a speech recognition library for use by a speech recognition system are disclosed. An example method comprises identifying a plurality of video segments having closed caption data corresponding to a phrase, the plurality of video segments associated with respective ones of a plurality of audio data segments, computing a plurality of difference metrics between a baseline audio data segment associated with the phrase and respective ones of the plurality of audio data segments, selecting a set of the plurality of audio data segments based on the plurality of difference metrics, identifying a first one of the audio data segments in the set as a representative audio data segment, determining a first phonetic transcription of the representative audio data segment, and adding the first phonetic transcription to a speech recognition library when the first phonetic transcription differs from a second phonetic transcription associated with the phrase in the speech recognition library.

Type: Grant

Filed: May 14, 2008

Date of Patent: December 1, 2015

Assignee: AT&T INTELLECTUAL PROPERTY I, LP

Inventor: Hisao M. Chang
Random linear coding approach to distributed data storage

Patent number: 9165013

Abstract: A method and computer program product for providing a random linear coding approach to distributed data storage is presented. A file is broken into a plurality of pieces. For every peer (a peer is a storage location with limited storage space), the number of coded-pieces the peer can store is determined. Each coded-piece is determined by taking a random linear combination of all the pieces of the entire file. The associated code-vector is stored for every coded-piece. The file is retrieved by collecting code-vectors and the coded-pieces from the peers and viewing the collected code-vectors as a matrix. When a dimension of the matrix is equal to the number of pieces of the file, the file is recovered using the collection of code vectors in the matrix.

Type: Grant

Filed: November 16, 2012

Date of Patent: October 20, 2015

Assignee: MASSACHUSETTS INSTITUTE OF TECHNOLOGY

Inventors: Muriel Medard, Supratim Deb, Ralf Koetter
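
The storage and retrieval steps in the abstract of patent 9165013 can be sketched as follows. This is an illustrative sketch under assumptions the abstract does not state: arithmetic is done modulo the prime 257 so that byte values fit, pieces are lists of integers, and the names encode and decode are hypothetical. Each coded-piece is a random linear combination of all pieces, stored with its code-vector; recovery solves the resulting linear system once enough independent code-vectors have been collected.

```python
import random

P = 257  # assumed prime modulus; byte values 0..255 fit in GF(257)

def encode(pieces, rng):
    """Return (code_vector, coded_piece) for one random linear combination."""
    coeffs = [rng.randrange(P) for _ in pieces]
    length = len(pieces[0])
    coded = [sum(c * piece[j] for c, piece in zip(coeffs, pieces)) % P
             for j in range(length)]
    return coeffs, coded

def decode(collected, k):
    """Recover the k original pieces by Gauss-Jordan elimination mod P.

    Returns None when the collected code-vectors do not yet have rank k.
    """
    rows = [list(coeffs) + list(coded) for coeffs, coded in collected]
    rank = 0
    for col in range(k):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            return None  # not enough independent code-vectors
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], P - 2, P)  # modular inverse via Fermat
        rows[rank] = [(v * inv) % P for v in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(a - factor * b) % P
                           for a, b in zip(rows[r], rows[rank])]
        rank += 1
    # The first k rows now hold an identity matrix followed by the pieces.
    return [rows[i][k:] for i in range(k)]

rng = random.Random(0)
pieces = [[72, 101, 108], [108, 111, 32], [119, 111, 114]]
stored = [encode(pieces, rng) for _ in range(5)]  # spread across peers
recovered = decode(stored, k=len(pieces))
```

Collecting a few more coded-pieces than there are original pieces makes a rank-deficient matrix very unlikely, since each code-vector is drawn at random.
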
Device for operating an automated machine for handling, assembling or machining workpieces

Patent number: 9158298

Abstract: A device for operating an automated machine for handling, assembling or machining workpieces, comprising: a display apparatus having a screen for displaying a graphic user interface for controlling and/or monitoring machine functions of the machine, an operating apparatus for inputting command-triggering operator actions for controlling functions of the machine and controlling functions of the graphic user interface, and a controller for implementing input command-triggering operator actions into control commands for controlling functions of the machine and/or functions of the graphic user interface. The operating apparatus comprises an apparatus for inputting manual operator actions and an apparatus for inputting contact-free operator actions.

Type: Grant

Filed: May 4, 2012

Date of Patent: October 13, 2015

Assignee: DECKEL MAHO PFRONTEN GMBH

Inventor: Hans Gronbach
Manipulating spatial processing in an audio system

Patent number: 9161151

Abstract: A vehicle audio system that includes a source of audio signals, which may include both entertainment audio signals and announcement audio signals, speakers for radiating audio signals, and spatial enhancement circuitry comprising circuitry to avoid applying spatial enhancement processing to the announcement audio signals.

Type: Grant

Filed: May 21, 2012

Date of Patent: October 13, 2015

Assignee: Bose Corporation

Inventors: Davis Y. Pan, Shiufun Cheung, Darby Edward Hadley, Ryo Maiguma, Takao Nakayma, Bruce C. Po, Katsumi Tomida, Petr Vicherek, Tobe Z. Barksdale, Ronald A. Fowler
Spatial audio

Patent number: 9137603

Abstract: In summary, this application describes a psycho-acoustically motivated, parametric description of the spatial attributes of multichannel audio signals. This parametric description allows strong bitrate reductions in audio coders, since only one monaural signal has to be transmitted, combined with (quantized) parameters which describe the spatial properties of the signal. The decoder can form the original amount of audio channels by applying the spatial parameters. For near-CD-quality stereo audio, a bitrate associated with these spatial parameters of 10 kbit/s or less seems sufficient to reproduce the correct spatial impression at the receiving end.

Type: Grant

Filed: November 13, 2012

Date of Patent: September 15, 2015

Assignee: Koninklijke Philips N.V.

Inventors: Dirk Jeroen Breebaart, Steven Leonardus Josephus Dimphina Elizabeth Van De Par
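
The idea in the abstract of patent 9137603 (one monaural signal plus a few spatial parameters) can be illustrated with a deliberately reduced sketch. The abstract does not name the parameters, so a single inter-channel level difference in decibels is assumed here, and the function names are hypothetical. The decoder forms two channels from the mono signal by applying gains derived from that one parameter.

```python
import math

EPS = 1e-12  # guards against division by zero for silent channels

def encode(left, right):
    """Downmix to mono and measure the level difference (assumed parameter)."""
    mono = [(l + r) / 2.0 for l, r in zip(left, right)]
    energy_left = sum(l * l for l in left)
    energy_right = sum(r * r for r in right)
    level_diff_db = 10.0 * math.log10((energy_left + EPS) / (energy_right + EPS))
    return mono, level_diff_db

def decode(mono, level_diff_db):
    """Rebuild two channels whose amplitude ratio matches the parameter."""
    ratio = 10.0 ** (level_diff_db / 20.0)  # left/right amplitude ratio
    gain_right = 2.0 / (1.0 + ratio)        # chosen so (L + R) / 2 == mono
    gain_left = ratio * gain_right
    return [gain_left * m for m in mono], [gain_right * m for m in mono]

left = [0.5, -0.25, 1.0, 0.0]
right = [0.5 * sample for sample in left]   # same source, panned left
mono, level_diff_db = encode(left, right)
left_out, right_out = decode(mono, level_diff_db)
```

With only a level parameter, reconstruction is exact only when the two channels are scaled copies of each other with the same sign; describing time differences or correlation between channels would need additional parameters.
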
Audio processing device, audio processing method, audio processing program and audio processing integrated circuit

Patent number: 9113269

Abstract: Provided is an audio processing device comprising: a feature data generation unit which generates, for each unit section of an audio signal, section feature data expressing features of the audio signal in the unit section; a feature variation calculation unit which calculates, for each unit section, a feature variation value quantifying temporal variation of the features in the unit section, by setting the unit section as a target section and using section feature data of unit sections close to the target section; and a section judgment unit which judges, for each unit section, whether the unit section is a feature unit section including a variation point of the features, based on comparison of a threshold value and the feature variation value. Through the above, the audio processing device can detect feature unit sections from an audio signal of an AV content or the like.

Type: Grant

Filed: November 8, 2012

Date of Patent: August 18, 2015

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Tomohiro Konuma, Tsutomu Uenoyama
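
The three units in the abstract of patent 9113269 (section feature data, feature variation value, threshold judgment) can be sketched as follows. The variation measure used here, the Euclidean distance between the mean feature vectors just before and just after the target section, is an assumption for illustration, as are the function names and the radius parameter.

```python
import math

def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]

def variation_values(features, radius=2):
    """Feature variation value for each unit section (assumed measure)."""
    values = []
    for i in range(len(features)):
        before = features[max(0, i - radius):i]
        after = features[i:i + radius]
        if not before:
            values.append(0.0)  # no earlier sections to compare against
            continue
        a, b = mean_vector(before), mean_vector(after)
        values.append(math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b))))
    return values

def feature_sections(features, threshold, radius=2):
    """Indices of unit sections judged to contain a variation point."""
    return [i for i, value in enumerate(variation_values(features, radius))
            if value > threshold]

# Four quiet sections followed by four sections with different features.
features = [[0.0, 0.0]] * 4 + [[1.0, 1.0]] * 4
change_points = feature_sections(features, threshold=1.0)
```

Averaging over neighbouring sections makes the measure less sensitive to a single outlying section than a plain difference between adjacent sections would be.
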
Metadata time marking information for indicating a section of an audio object

Patent number: 9105300

Abstract: The application relates to a method for encoding time marking information within audio data. According to the method, time marking information is encoded as audio metadata within the audio data. The time marking information indicates at least one section of an audio object encoded in the audio data. E.g. the time marking information may specify a start position and an end position of the section or only a start position. The at least one section may be a characteristic part of the audio object, which allows instant recognition by listening. The time marking information encoded in the audio data enables instantaneous browsing to a certain section of the audio object. The application further relates to a method for decoding the time marking information encoded in the audio data.

Type: Grant

Filed: October 14, 2010

Date of Patent: August 11, 2015

Assignee: Dolby International AB

Inventors: Barbara Resch, Jonas Engdegård
Data extraction from a call

Patent number: 9071949

Abstract: A network component comprising a communications portion and a processor portion is disclosed. The communications portion may be configured to detect a signal indicative of a call associated with a mobile device. The processor portion may be configured to detect at least one record cue in the signal. The processor portion may also be configured to respectively capture at least one portion of the call upon the at least one record cue being detected. The processor portion may also be configured to respectively associate at least one identifier with the at least one captured portion of the call. The identifier may respectively identify the at least one captured portion of the call.

Type: Grant

Filed: April 28, 2009

Date of Patent: June 30, 2015

Assignee: AT&T Mobility II LLC

Inventors: Jeffrey Mikan, Justin McNamara, John Lewis, Fulvio Arturo Cenciarelli
User device with access behavior tracking and favorite passage identifying functionality

Patent number: 9031961

Abstract: A user device presents passages of an electronic publication. The user device tracks a user's access behavior for the passages of the electronic publication. The user device identifies the user's favorite passages of the electronic publication based on the user's access behavior and stores an identification of the user's favorite passages.

Type: Grant

Filed: March 17, 2011

Date of Patent: May 12, 2015

Assignee: Amazon Technologies, Inc.

Inventor: Christian R. Cabanero
Methods and arrangements for loudness and sharpness compensation in audio codecs

Patent number: 9031835

Abstract: A method of improving perceived loudness and sharpness of a reconstructed speech signal delimited by a predetermined bandwidth comprises the steps of providing (S10) the speech signal, and separating (S20) the provided signal into at least a first and a second signal portion. Subsequently, the first signal portion is adapted (S30) to emphasize at least a predetermined frequency or frequency interval within the first bandwidth portion. Finally, the second signal portion is reconstructed (S40) based on at least the first signal portion, and the adapted first signal portion and the reconstructed second signal portion are combined (S50) to provide a reconstructed speech signal with an overall improved perceived loudness and sharpness.

Type: Grant

Filed: June 29, 2010

Date of Patent: May 12, 2015

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventors: Volodya Grancharov, Sigurdur Sverrisson
Speech quality evaluation system and storage medium readable by computer therefor

Patent number: 9031837

Abstract: In prediction of a speech quality evaluation score such as a phone speech, even when a background noise exists, a subjective opinion score is predicted with high precision. A speech quality evaluation system that outputs a predicted value of the subjective opinion score for an evaluation speech such as a far-end speech of a phone, includes a speech distortion calculation unit that conducts, after calculating frequency characteristics of the evaluation speech, a process of subtracting given frequency characteristics from frequency characteristics of the evaluation speech, and calculates the speech distortion on the basis of the frequency characteristics after the subtracting process has been conducted, and a subjective evaluation prediction unit that calculates the predicted value of the subjective opinion score on the basis of the speech distortion.

Type: Grant

Filed: February 11, 2011

Date of Patent: May 12, 2015

Assignee: Clarion Co., Ltd.

Inventor: Takeshi Homma
Method and apparatus for voice clarity and speech intelligibility detection and correction

Patent number: 9031838

Abstract: Systems, methods and apparatus are described herein for continuously measuring voice clarity and speech intelligibility by evaluating a plurality of telecommunications channels in real time. Voice clarity and speech intelligibility measurements may be formed from chained, configurable DSPs that can be added, subtracted, reordered, or configured to target specific audio features. Voice clarity and speech intelligibility may be enhanced by altering the media in one or more of the plurality of telecommunications channels. Analytics describing the measurements and enhancements may be displayed in reports, or in real time via a dashboard.

Type: Grant

Filed: July 14, 2014

Date of Patent: May 12, 2015

Assignee: Vail Systems, Inc.

Inventors: Alex Nash, Mariano Tan, David Fruin, Todd Whiteley, Jon Wotman
Method and system for determining a perceived quality of an audio system

Patent number: 9025780

Abstract: The invention relates to a method for determining a quality indicator representing a perceived quality of an output signal of an audio device with respect to a reference signal. Such audio device may for example be a speech processing system. In the method the reference signal and the output signal are processed and compared. The processing includes dividing the reference signal and the output signal into mutually corresponding time frames. The processing further includes scaling the reference signal towards a fixed intensity level. Time frames of the output signal are selected based on measurements performed on the scaled reference signal. Then, a noise contrast parameter is calculated based on the selected time frames of the output signal. A noise suppression is applied on at least one of the reference signal and the output signal based on the noise contrast parameter.

Type: Grant

Filed: August 9, 2010

Date of Patent: May 5, 2015

Assignees: Koninklijke KPN N.V., Nederlandse Organisatie voor Toegepast-Natuurwetenschappelijk Onderzoek TNO

Inventors: John Gerard Beerends, Jeroen van Vugt
Methods and apparatus to monitor media exposure in vehicles

Patent number: RE45786

Abstract: Methods and apparatus to monitor media exposure in vehicles are disclosed. An example implementation includes collecting audience measurement data with a media monitoring device fixed in a vehicle and transmitting the audience measurement data from the media monitoring device to a shuttle located within the vehicle, the shuttle being incapable of collecting audience measurement data independent of the media monitoring device.

Type: Grant

Filed: April 24, 2014

Date of Patent: October 27, 2015

Assignee: THE NIELSEN COMPANY (US), LLC

Inventors: Arun Ramaswamy, Fred Martensen, Robert A Luff, Kendall Shirilla
