Patents Issued in February 16, 2016

Wireless electric guitar

Patent number: 9263015

Abstract: An electronics module for an electric guitar is provided. The electronics module includes a processor, a plurality of controls, an antenna, and a computer-readable medium. The processor receives an audio signal generated by a vibration of a plurality of strings of the electric guitar. The plurality of controls are operably coupled to the processor and provide a mechanism for adjusting a sound created from the audio signal. The computer-readable medium is operably coupled to the processor and configured to cause the electric guitar to determine a control of the plurality of controls associated with the received effects parameter; adjust a state of the determined control based on the received effects parameter; modify the audio signal based on the plurality of controls and on the received effects parameter; and output the modified audio signal through the antenna to a second device.

Type: Grant

Filed: October 28, 2011

Date of Patent: February 16, 2016

Assignee: Gibson Brands, Inc.

Inventor: Henry E. Juszkiewicz
Sorting a plurality of inputted sound generation instructions to generate tones corresponding to the sound generation instruction in a sorted order

Patent number: 9263016

Abstract: Provided is an electronic musical instrument comprising: an input device for inputting sound generation instructions of tones at predetermined pitches; a tone generation device that generates tones with predetermined pitches based on sound generation instructions inputted by the input device; a specifying device that specifies a plurality of sound generation instructions inputted by the input device in a predetermined period as a sound generation instruction group; a sorting device that sorts the plurality of sound generation instructions composing the sound generation instruction group specified by the specifying device in a predetermined pitch order; and a control device that controls generation of tones by the tone generation device such that tones corresponding to the sound generation instruction group are generated in the order sorted by the sorting device.

Type: Grant

Filed: February 23, 2012

Date of Patent: February 16, 2016

Assignee: ROLAND CORPORATION

Inventors: Mizuki Nakagawa, Shun Takai
Modular electronic musical keyboard instrument

Patent number: 9263017

Abstract: In one aspect, the present invention is directed to a modular electronic musical keyboard instrument, comprising: an array of separate keyboard segments (30, 32), each comprising a piano keyboard (20) in a range of one octave; a control system, for converting a keystroke of each key in the array of keyboard segments to a note sound in a level corresponding to the order of the key in the array of keyboard segments; octave order setting means (22), for determining to the control system the order of each keyboard segment in the array of keyboard segments; communication means of each of the keyboard segments with the control system; and connection means (12, 14) with another keyboard segment.

Type: Grant

Filed: January 29, 2014

Date of Patent: February 16, 2016

Inventors: Ronen Lifshitz, Sharon Lerner
System and method for modifying musical data

Patent number: 9263018

Abstract: A computer-implemented method comprises receiving musical data including reference timing data, and a succession of musical notes arranged with respect to the reference timing data, receiving input corresponding to a selection of a groove template, and altering the arrangement of the notes (shifting in a positive or negative direction) with respect to the reference timing data based on the selected groove template. Altering the arrangement of notes can further include adding additional musical notes to the succession of musical notes to add stylistic embellishments particular to the selected groove template, where one of the selectable groove templates includes adding a shuffle dynamic to the succession of musical notes by determining a positive offset associated with each of the musical notes along the musical bar, and applying the positive offset to each of the musical notes, wherein the positive offset corresponds to the position of the musical note.

Type: Grant

Filed: July 13, 2013

Date of Patent: February 16, 2016

Assignee: Apple Inc.

Inventors: Christoph Buskies, Matthias Gros, Thomas Sauer, Oliver Ludecke, Tobias Bade, Alexander H. Little
Percussion instrument and method of manufacture

Patent number: 9263019

Abstract: A percussion instrument including a first acoustic chamber housing having a tapered shape, a second acoustic chamber housing having the tapered shape, a sound board having the tapered shape arranged between the first acoustic chamber housing and the second acoustic chamber housing to form a first acoustic chamber defined by the first acoustic chamber housing and the sound board, and a second acoustic chamber defined by the second acoustic chamber housing and the sound board.

Type: Grant

Filed: July 10, 2013

Date of Patent: February 16, 2016

Inventor: Jon Greg Dahl
Control information generating apparatus and method for percussion instrument

Patent number: 9263020

Abstract: Provided is a sound source control information generating apparatus, adapted for performing slapping techniques. According to the present invention, information based on an output value of a first sensor that detects striking on the housing is stored in a memory means. If striking on the struck head of a percussion instrument is detected based on an output value of a second sensor that detects striking on the struck head, whether an output value equal to or greater than a predetermined value is obtained from the first sensor in a predetermined time interval before a timing of detecting the striking on the struck head is determined based on the information stored in the memory means.

Type: Grant

Filed: September 25, 2014

Date of Patent: February 16, 2016

Assignee: ROLAND CORPORATION

Inventor: Ryo Takasaki
Method for generating a musical compilation track from multiple takes

Patent number: 9263021

Abstract: An apparatus for creating a musical composition comprising an audio interface, and audio converter module, and a multi-track compositor module is disclosed. The audio interface operably receives audio from an audio input device and outputting audio to an audio output device. The audio converter module is operably connected to the audio interface to convert audio received via the audio interface into an audio track having one or more partitions. The multi-track compositor module is configured to receive a first audio track and a second audio track and automatically score each partition of the first and second audio tracks based on one or more criteria. The multi-track compositor module is then configured to construct a third audio track from the partitions of the first and second audio tracks based on the scores for each partition. A method is also provided.

Type: Grant

Filed: April 5, 2013

Date of Patent: February 16, 2016

Assignee: ZYA, INC.

Inventors: Travis Robert Savo, Francesco Gerald Capodieci, Reza Rassool, Michael Winter
Systems and methods for transcoding music notation

Patent number: 9263022

Abstract: A method for transcoding music, according to various aspects of the present invention, includes in any practical order: (a) reading pitches and respective durations; (b) reading indicia of a quantity of beats per measure; (c) determining a word for each beat wherein: each word has one or more syllables, each syllable is associated with each pitch having duration that is within the duration of the beat; each syllable for a pitch, when preceded by a rest, comprises an initial consonant selected from the set consisting of ‘d’ and ‘t’; and each syllable comprises a vowel corresponding to an ordinal of the beat, wherein the vowel is selected from a set of vowels in accordance with the respective duration of the pitch associated with the syllable; and (d) outputting, for use by a music engraving engine, indicia of the pitches and words, in a manner that each syllable will be engraved in vertical alignment with the indicia of the associated pitch.

Type: Grant

Filed: July 8, 2015

Date of Patent: February 16, 2016

Inventor: William R Bachand
Audio speaker with spatially selective sound cancelling

Patent number: 9263023

Abstract: A system and method for reproducing audio sound. Audio sound, based on an audio signal, is emitted within a space comprising an audio beamwidth. Modulated ultrasonic sound energy based on a modulated ultrasonic sound signal is emitted in an ultrasonic sound direction within an ultrasonic beamwidth that is less than and within the audio beamwidth. The modulated ultrasonic sound signal is generated such that the emitted modulated ultrasonic sound energy creates an audible cancellation sound with an amplitude substantially equal to an amplitude of the audio sound at a point within the ultrasonic beamwidth so as to combine with and cancel the audio sound at the point by being substantially out of phase with the audio sound along the ultrasonic sound direction.

Type: Grant

Filed: October 25, 2013

Date of Patent: February 16, 2016

Assignee: BlackBerry Limited

Inventor: Isao Ginn Anazawa
Circuit arrangement and method for active noise cancellation

Patent number: 9263024

Abstract: In an embodiment, a circuit arrangement for active noise cancellation, comprises a first input (E1) for supplying a playback signal (Spb), a second input (E2) for supplying a sensor signal (Sanc), a first and a second terminal (A1, A2) of an output that is designed for being connected to a loudspeaker (Lsp) and a compensating device for respectively generating a first and a second noise signal (Sanc1, Sanc2) as a function of the sensor signal (Sanc), wherein the first and the second input (E1, E2) are coupled to the first and the second terminal of the output (A1, A2) by means of the compensating device (Komp) in such a way that a virtual playback signal (Ssp1) is provided at the first terminal (A1) of the output (A1, A2) and a superposition signal (Ssp2) is provided at the second terminal (A2) of the output (A1, A2) such that a differential signal between the virtual playback signal (Ssp1) and the superposition signal (Ssp2) can be fed to the loudspeaker.

Type: Grant

Filed: November 9, 2011

Date of Patent: February 16, 2016

Assignee: ams AG

Inventor: Helmut Theiler
Method for speech coding, method for speech decoding and their apparatuses

Patent number: 9263025

Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In speech coding method according to a code-excited linear prediction (CELP) speech coding, a noise level of a speech in a concerning coding period is evaluated by using a code or coding result of at least one of spectrum information, power information, and pitch information, and various excitation codebooks are used based on an evaluation result.

Type: Grant

Filed: February 25, 2014

Date of Patent: February 16, 2016

Assignee: BlackBerry Limited

Inventor: Tadashi Yamaura
Screen reader having concurrent communication of non-textual information

Patent number: 9263026

Abstract: A screen reader software product for low-vision users, the software having a reader module collecting textual and non-textual display information generated by a web browser or word processor. Font styling, interface layout information and the like are communicated to the end user by sounds broadcast simultaneously rather than serially with the synthesized speech to improve the speed and efficiency in which information may be digested by the end user.

Type: Grant

Filed: July 11, 2014

Date of Patent: February 16, 2016

Assignee: Freedom Scientific, Inc.

Inventors: Christian D. Hofstader, Glen Gordon, Eric Damery, Ralph Ocampo, David Baker, Joseph K. Stephen
Broadcast system using text to speech conversion

Patent number: 9263027

Abstract: A broadcast signal receiver comprises a text data receiver for receiving broadcast text data for display to a user in relation to a user interface; a text-to-speech (TTS) converter for converting received text data into an audio speech signal, the TTS converter being operable to detect whether a word for conversion is included in a stored list of words for conversion and, if so, to convert that word according to a conversion defined by the stored list; and if not, to convert that word according to a set of predetermined conversion rules; a conversion memory storing the list of words for conversion by the TTS converter; and an update receiver for receiving additional words and associated conversions for storage in the conversion memory.

Type: Grant

Filed: June 1, 2011

Date of Patent: February 16, 2016

Assignee: SONY EUROPE LIMITED

Inventors: Huw Hopkins, Timothy Edmunds
Methods and systems for automated generation of nativized multi-lingual lexicons

Patent number: 9263028

Abstract: An input signal that includes linguistic content in a first language may be received by a computing device. The linguistic content may include text or speech. The computing device may associate the linguistic content in the first language with one or more phonemes from a second language. The computing device may also determine a phonemic representation of the linguistic content in the first language based on use of the one or more phonemes from the second language. The phonemic representation may be indicative of a pronunciation of the linguistic content in the first language according to speech sounds of the second language.

Type: Grant

Filed: May 21, 2014

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventors: Javier Gonzalvo Fructuoso, Ioannis Agiomyrgiannakis
Instant communication voice recognition method and terminal

Patent number: 9263029

Abstract: The present disclosure discloses a speech recognition method and a terminal, which belong to the field of communications. The method comprises: receiving speech information inputted by a user; acquiring the current environment information, and judging whether the speech information needs to be played according to the current environment information; and recognizing the speech information as text information, when it is judged that the speech information needs not to be played. The terminal comprises an acquisition module, a judgment module and a recognition module. The present disclosure provides the speech receiver with a speech recognition function, when the speech information of the instant messaging is received by the terminal, it can help the receiver to normally acquire the content to be expressed by the speech sender under an inconvenient situation.

Type: Grant

Filed: March 1, 2013

Date of Patent: February 16, 2016

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor: Yisha Lu
Adaptive online feature normalization for speech recognition

Patent number: 9263030

Abstract: A speech recognition system adaptively estimates a warping factor used to reduce speaker variability. The warping factor is estimated using a small window (e.g. 100 ms) of speech. The warping factor is adaptively adjusted as more speech is obtained until the warping factor converges or a pre-defined maximum number of adaptation is reached. The speaker may be placed into a group selected from two or more groups based on characteristics that are associated with the speaker's window of speech. Different step sizes may be used within the different groups when estimating the warping factor. VTLN is applied to the speech input using the estimated warping factor. A linear transformation, including a bias term, may also be computed to assist in normalizing the speech along with the application of the VTLN.

Type: Grant

Filed: January 23, 2013

Date of Patent: February 16, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shizhen Wang, Yifan Gong, Fileno Alleva
System and method of spoken language understanding in human computer dialogs

Patent number: 9263031

Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

Type: Grant

Filed: November 15, 2013

Date of Patent: February 16, 2016

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Srinivas Bangalore, Narendra K. Gupta, Mazin G. Rahim
Voice-responsive building management system

Patent number: 9263032

Abstract: A voice-responsive building management system is described herein. One system includes an interface, a dynamic grammar builder, and a speech processing engine. The interface is configured to receive a speech card of a user, wherein the speech card of the user includes speech training data of the user and domain vocabulary for applications of the building management system for which the user is authorized. The dynamic grammar builder is configured to generate grammar from a building information model of the building management system. The speech processing engine is configured to receive a voice command or voice query from the user, and execute the voice command or voice query using the speech training data of the user, the domain vocabulary, and the grammar generated from the building information model.

Type: Grant

Filed: October 24, 2013

Date of Patent: February 16, 2016

Assignee: Honeywell International Inc.

Inventor: Jayaprakash Meruva
Utterance selection for automated speech recognizer training

Patent number: 9263033

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a set of training utterances. The methods, systems, and apparatus include actions of obtaining a target multi-dimensional distribution of characteristics in an initial set of candidate utterances and selecting a subset of the initial set of candidate utterances based on speech recognition confidence scores associated with the candidate utterances. Additional actions include selecting a particular candidate utterance from the subset of the initial set of utterances and determining that adding the particular candidate utterance to a set of training utterances reduces a divergence of a multi-dimensional distribution of the characteristics in the set of training utterances from the target multi-dimensional distribution. Further actions include adding the particular candidate utterance to the set of training utterances.

Type: Grant

Filed: June 25, 2014

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventors: Olivier Siohan, Pedro J. Mengibar
Adapting enhanced acoustic models

Patent number: 9263034

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving voice queries, obtaining, for one or more of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, generating, for the one or more voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query, selecting a subset of the one or more voice queries based on the posterior recognition confidence measures, and adapting an acoustic model using the subset of the voice queries.

Type: Grant

Filed: July 13, 2010

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventors: Brian Strope, Douglas H. Beeferman
Promoting voice actions to hotwords

Patent number: 9263035

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.

Type: Grant

Filed: March 21, 2014

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventor: Matthew Sharifi
System and method for speech recognition using deep recurrent neural networks

Patent number: 9263036

Abstract: Deep recurrent neural networks applied to speech recognition. The deep recurrent neural networks (RNNs) are preferably implemented by stacked long short-term memory bidirectional RNNs. The RNNs are trained using end-to-end training with suitable regularization.

Type: Grant

Filed: November 26, 2013

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventor: Alexander B. Graves
Interactive manual, system and method for vehicles and other complex equipment

Patent number: 9263037

Abstract: A method and system of providing an interactive manual, including a speech engine to receive and process speech from a user, convert the speech into a word sequence, and identify meaning structures from the word sequence, a structured manual including information related to an operation of a device, a visual model to relate visual representation of the information, a dialog management arrangement to interpret the meaning structures in a context and to extract pertinent information and the visual representation from the structured manual and the visual model, and an output arrangement to output the information and visual representation.

Type: Grant

Filed: April 15, 2010

Date of Patent: February 16, 2016

Assignee: Robert Bosch GmbH

Inventors: Fuliang Weng, Hauke Schmidt, Gengyan Bei
System and method for analyzing and classifying calls without transcription via keyword spotting

Patent number: 9263038

Abstract: A facility and method for analyzing and classifying calls without transcription via keyword spotting is disclosed. The facility uses a group of calls having known outcomes to generate one or more domain- or entity-specific grammars containing keywords and related information that are indicative of particular outcome. The facility monitors telephone calls by determining the domain or entity associated with the call, loading the appropriate grammar or grammars associated with the determined domain or entity, and tracking keywords contained in the loaded grammar or grammars that are spoken during the monitored call, along with additional information. The facility performs a statistical analysis on the tracked keywords and additional information to determine a classification for the monitored telephone call.

Type: Grant

Filed: October 3, 2013

Date of Patent: February 16, 2016

Assignee: Marchex, Inc.

Inventors: Jason Flaks, Ziad Ismail, Chris Kolbegger
Systems and methods for responding to natural language speech utterance

Patent number: 9263039

Abstract: Systems and methods are provided for receiving speech and non-speech communications of natural language questions and/or commands, transcribing the speech and non-speech communications to textual messages, and executing the questions and/or commands. The invention applies context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for one or more users presenting questions or commands across multiple domains. The systems and methods creates, stores and uses extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech and non-speech communications and presenting the expected results for a particular question or command.

Type: Grant

Filed: September 29, 2014

Date of Patent: February 16, 2016

Assignee: Nuance Communications, Inc.

Inventors: Philippe Di Cristo, Min Ke, Robert A. Kennewick, Lynn Elise Armstrong
Method and system for using sound related vehicle information to enhance speech recognition

Patent number: 9263040

Abstract: An audio signal may be received, in a processor associated with a vehicle. Sound related vehicle information representing one or more sounds may be received by the processor. The sound related vehicle information may or may not include an audio signal. A speech recognition process or system may be modified based on the sound related vehicle information.

Type: Grant

Filed: January 17, 2012

Date of Patent: February 16, 2016

Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Eli Tzirkel-Hancock, Omer Tsimhoni
Channel detection in noise using single channel data

Patent number: 9263041

Abstract: Methods related to Generalized Mutual Interdependence Analysis (GMIA), a low complexity statistical method for projecting data in a subspace that captures invariant properties of the data, are implemented on a processor based system. GMIA methods are applied to the signal processing problem of voice activity detection and classification. Real-world conversational speech data are modeled to fit the GMIA assumptions. Low complexity GMIA computations extract reliable features for classification of sound under noisy conditions and operate with small amounts of data. A speaker is characterized by a slow varying or invariant channel that is learned and is tracked from single channel data by GMIA methods.

Type: Grant

Filed: March 14, 2013

Date of Patent: February 16, 2016

Assignee: Siemens Aktiengesellschaft

Inventors: Heiko Claussen, Justinian Rosca
Providing pre-computed hotword models

Patent number: 9263042

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, for each of multiple words or sub-words, audio data corresponding to multiple users speaking the word or sub-word; training, for each of the multiple words or sub-words, a pre-computed hotword model for the word or sub-word based on the audio data for the word or sub-word; receiving a candidate hotword from a computing device; identifying one or more pre-computed hotword models that correspond to the candidate hotword; and providing the identified, pre-computed hotword models to the computing device.

Type: Grant

Filed: July 25, 2014

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventor: Matthew Sharifi
Stuttering inhibition method and device

Patent number: 9263043

Abstract: A method and device is disclosed for reducing and controlling stuttering. The method involves tactile feedback of the stutterer's own speech to reducing stuttering. In one embodiment, the device may detect speech by audible or mechanical means, and the feedback may be produced by vibration mechanisms.

Type: Grant

Filed: December 3, 2010

Date of Patent: February 16, 2016

Assignee: UNIVERSITY OF MISSISSIPPI

Inventors: Gregory John Snyder, Dwight E. Waddell, II, Paul Mallette Goggans
Noise reduction based on mouth area movement recognition

Patent number: 9263044

Abstract: A computing device can capture video data of at least a portion of a mouth area (e.g., mouth, lips, tongue, chin, jaw) of a user of the device. The computing device can also capture sound data including a voice of the user as well as noise (e.g. background noise). The video data can be processed to detect a movement of the portion of the mouth area. The movement of the portion of the mouth area can be analyzed and compared with mouth area movement models characteristic of oral communication (e.g., speech, song). If the movement of the portion of the mouth area corresponds to at least one model characteristic of oral communication, then the movement indicates that the user is likely engaging in oral communication. Noise reduction can be applied and/or increased on the captured sound data to reduce noise and in turn enhance the user's voice.

Type: Grant

Filed: June 27, 2012

Date of Patent: February 16, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Ryan H. Cassidy, Yuzo Watanabe, Isaac S. Noble
Multi-mode text input

Patent number: 9263045

Abstract: Concepts and technologies are described herein for multi-mode text input. In accordance with the concepts and technologies disclosed herein, content is received. The content can include one or more input indicators. The input indicators can indicate that user input can be used in conjunction with consumption or use of the content. The application is configured to analyze the content to determine context associated with the content and/or the client device executing the application. The application also is configured to determine, based upon the content and/or the contextual information, which input device to use to obtain input associated with use or consumption of the content. Input captured with the input device can be converted to text and used during use or consumption of the content.

Type: Grant

Filed: May 17, 2011

Date of Patent: February 16, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Mohan Varthakavi, Jayaram N M Nanduri, Nikhil Kothari
Distributed dictation/transcription system

Patent number: 9263046

Abstract: A distributed dictation/transcription system is provided. The system provides a dictation manager having a data port to receive and transmit audio signals. The dictation manager includes a dictation server selector to a dictation server to transcribe the audio based on whether the dictation server already has a user profile uploaded.

Type: Grant

Filed: April 1, 2013

Date of Patent: February 16, 2016

Assignee: NVOQ INCORPORATED

Inventors: Richard Beach, Christopher Butler, Jon Ford, Brian Marquette, Christopher Omland
Apparatus and method for providing messages in a social network

Patent number: 9263047

Abstract: A system that incorporates teachings of the present disclosure may include, for example, a server including a controller to receive audio signals and content identification information from a media processor, generate text representing a voice message based on the audio signals, determine an identity of media content based on the content identification information, generate an enhanced message having text and additional content where the additional content is obtained by the controller based on the identity of the media content, and transmit the enhanced message to the media processor for presentation on the display device, where the enhanced message is accessible by one or more communication devices that are associated with a social network and remote from the media processor. Other embodiments are disclosed.

Type: Grant

Filed: November 5, 2014

Date of Patent: February 16, 2016

Assignee: AT&T INTELLECTUAL PROPERTY I, LP

Inventors: Hisao Chang, Bernard S. Renger
Word-level correction of speech input

Patent number: 9263048

Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.

Type: Grant

Filed: June 23, 2015

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventors: Michael J. LeBeau, William J. Byrne, John Nicholas Jitkoff, Brandon M. Ballinger, Trausti T. Kristjansson
Artifact reduction in packet loss concealment

Patent number: 9263049

Abstract: Various techniques are disclosed for improving packet loss concealment to reduce artifacts by using audio character measures of the audio signal. These techniques include attenuation to a noise fill instead of attenuation to silence, varying how long to wait before attenuating the extrapolation, varying the rate of attenuation of the extrapolation, attenuating periodic extrapolation at a different rate than non-periodic extrapolation, and performing period extrapolation on successively longer fill data based on the audio character measures, adjusting weighting between periodic and non-periodic extrapolation based on the audio character measures, and adjusting weighting between periodic extrapolation and non-periodic extrapolation non-linearly.

Type: Grant

Filed: October 25, 2010

Date of Patent: February 16, 2016

Assignee: Polycom, Inc.

Inventor: Eric David Elias
Allocation, by sub-bands, of bits for quantifying spatial information parameters for parametric encoding

Patent number: 9263050

Abstract: A method is provided for allocating bits for quantifying spatial information parameters by frequency sub-band for parametric encoding/decoding of a multichannel audio stream representative of a soundstage consisting of a plurality of sound sources. The method includes a step of quantifying or inversely quantifying, by frequency sub-band, spatial information parameters for the sound sources of the soundscape. The method further includes: assessing a spatial resolution of the current sub-band on the basis of the spectral properties of the sub-band; and determining a number of bits to be allocated to the current sub-band, the number of bits to be allocated being inversely proportional to the estimated spatial resolution. Also provided is a device for allocating quantification bits implementing the above-described method.

Type: Grant

Filed: March 28, 2012

Date of Patent: February 16, 2016

Assignee: ORANGE

Inventors: Adrien Daniel, Rozenn Nicol
Speech coding by quantizing with random-noise signal

Patent number: 9263051

Abstract: A method, system and program for decoding a speech signal. In some embodiments, the method comprises: receiving an encoded speech signal having quantization values; transforming the quantization values by adding simulated random-noise samples; and from the encoded speech signal, determining a parameter of the transformation that is usable to control the transformation of the quantization values.

Type: Grant

Filed: February 17, 2014

Date of Patent: February 16, 2016

Assignee: Skype

Inventor: Koen Bernard Vos
Simultaneous estimation of fundamental frequency, voicing state, and glottal closure instant

Patent number: 9263052

Abstract: A method and system is disclosed for simultaneously determining glottal closure instants (GCIs), fundamental frequency (F0s), and voicing state of a speech signal. A speech signal may be processed to determine a sequence of candidate GCIs. For each candidate GCI, a set of candidate F0s may be determined. A lattice of hypotheses may be constructed, where each lattice point is a hypothesis of a concurrence of a candidate GCI, a candidate F0, and voicing state. Each given hypothesis may also include a score of the candidate GCI, F0, and voicing state for evaluating a cost of the given hypothesis and a cost of connections between the given hypothesis and other hypotheses of the lattice. Dynamic programming may be used to determine a least-cost path through the lattice, and backtracking across the path may be used to determine an optimal set of GCIs, F0s and voicing states of the speech signal.

Type: Grant

Filed: January 25, 2013

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventor: David Talkin
Method and apparatus for generating a candidate code-vector to code an informational signal

Patent number: 9263053

Abstract: A method (1100) and apparatus (100) generate a candidate code-vector to code an information signal. The method can include producing (1110) a weighted target vector from an input signal. The method can include processing (1120) the weighted target vector through an inverse weighting function to create a residual domain target vector. The method can include performing (1130) a first search process on the residual domain target vector to obtain an initial fixed codebook code-vector. The method can include performing (1140) a second search process over a subset of possible codebook code-vectors for a low weighted-domain error to produce a final fixed codebook code-vector. The subset of possible codebook code-vectors can be based on the initial fixed codebook code-vector. The method can include generating (1150) a codeword representative of the final fixed codebook code-vector. The codeword can be for use by a decoder to generate an approximation of the input signal.

Type: Grant

Filed: November 2, 2012

Date of Patent: February 16, 2016

Assignee: GOOGLE TECHNOLOGY HOLDINGS LLC

Inventors: James P Ashley, Udar Mittal
Systems and methods for controlling an average encoding rate for speech signal encoding

Patent number: 9263054

Abstract: A method for controlling an average encoding rate by an electronic device is described. The method includes obtaining a speech signal. The method also includes determining a first average rate. The method further includes determining a first threshold based on the first average rate. The method additionally includes controlling the average encoding rate by determining at least one other threshold based on the first threshold. The method also includes sending an encoded speech signal.

Type: Grant

Filed: August 30, 2013

Date of Patent: February 16, 2016

Assignee: QUALCOMM Incorporated

Inventors: Subasingha Shaminda Subasingha, Vivek Rajendran, Venkatesh Krishnan, Venkatraman Srinivasa Atti
Systems and methods for three-dimensional audio CAPTCHA

Patent number: 9263055

Abstract: Systems and methods for generating and performing a three-dimensional audio CAPTCHA are provided. One exemplary system can include a decoy signal database storing a plurality of decoy signals. The system also can include a three-dimensional audio simulation engine for simulating the sounding of a target signal and at least one decoy signal in an acoustic environment and outputting a stereophonic audio signal based on the simulation. One exemplary method includes providing an audio prompt to a resource requesting entity. The audio prompt can have been generated based on a three-dimensional audio simulation of the sounding of a target signal containing an authentication key and at least one decoy signal in an acoustic environment. The method can include receiving a response to the audio prompt from the resource requesting entity and comparing the response to the authentication key.

Type: Grant

Filed: April 10, 2013

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventors: Yannis Agiomyrgiannakis, Edison Tan, David John Abraham
Method of simultaneously transforming a plurality of voice signals input to a communications system

Patent number: 9263056

Abstract: A method of simultaneously transforming at least two input voice signals xi of a communications system (30), each input voice signal xi being received at a specific reception frequency Fi and corresponding to the voice of a remote party communicating with a user of the communications system (30). During an initialization stage, a transformation Ti is allocated to at least one reception frequency Fi of the input voice signals xi, and during a utilization stage, transformations Ti are applied simultaneously to the input voice signals xi as a function of the reception frequencies Fi, modifying at least one characteristic of each of the input voice signals xi. Thus, the voice of each remote party in communication with the user of the communications system (30) is modified artificially by a transformation Ti, thereby making it easier for the user to perceive and discriminate between simultaneous voices from the remote parties.

Type: Grant

Filed: May 7, 2015

Date of Patent: February 16, 2016

Assignee: Airbus Helicopters

Inventor: Jean-Pierre Baudry
Time warp activation signal provider, audio signal encoder, method for providing a time warp activation signal, method for encoding an audio signal and computer programs

Patent number: 9263057

Abstract: An audio encoder has a window function controller, a windower, a time warper with a final quality check functionality, a time/frequency converter, a TNS stage or a quantizer encoder, the window function controller, the time warper, the TNS stage or an additional noise filling analyzer are controlled by signal analysis results obtained by a time warp analyzer or a signal classifier. Furthermore, a decoder applies a noise filling operation using a manipulated noise filling estimate depending on a harmonic or speech characteristic of the audio signal.

Type: Grant

Filed: November 11, 2014

Date of Patent: February 16, 2016

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Stefan Bayer, Sascha Disch, Ralf Geiger, Guillaume Fuchs, Max Neuendorf, Gerald Schuller, Bernd Edler
Communication system and method between an on-vehicle voice recognition system and an off-vehicle voice recognition system

Patent number: 9263058

Abstract: A vehicle based system and method for receiving voice inputs and determining whether to perform a voice recognition analysis using in-vehicle resources or resources external to the vehicle.

Type: Grant

Filed: June 24, 2011

Date of Patent: February 16, 2016

Assignee: Honda Motor Co., Ltd.

Inventors: Ritchie Winson Huang, Pedram Vaghefinazari, Stuart Yamamoto
Deep tagging background noises

Patent number: 9263059

Abstract: In a method for deep tagging a recording, a computer records audio comprising speech from one or more people. The computer detects a non-speech sound within the audio. The computer determines that the non-speech sound corresponds to a type of sound, and in response, associates a descriptive term with a time of occurrence of the non-speech sound within the recorded audio to form a searchable tag. The computer stores the searchable tag as metadata of the recorded audio.

Type: Grant

Filed: September 28, 2012

Date of Patent: February 16, 2016

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Denise A. Bell, Lisa Seacat DeLuca, Jana H. Jenkins, Jeffrey A. Kusnitz
Artificial neural network based system for classification of the emotional content of digital music

Patent number: 9263060

Abstract: A system for classification of the emotional content of music is provided. An encoder receives a digital audio recording of a piece of music, and encodes it using musical notes and associated amplitudes. The artificial neural network is configured to take a plurality of encoded time slices and provide output indicative of the emotional content of the music.

Type: Grant

Filed: August 21, 2012

Date of Patent: February 16, 2016

Assignee: MARIAN MASON PUBLISHING COMPANY, LLC

Inventor: David A. Sharp
Detection of chopped speech

Patent number: 9263061

Abstract: Methods and systems are provided for detecting chop in an audio signal. A time-frequency representation, such as a spectrogram, is created for an audio signal and used to calculate a gradient of mean power per frame of the audio signal. Positive and negative gradients are defined for the signal based on the gradient of mean power, and a maximum overlap offset between the positive and negative gradients is determined by calculating a value that maximizes the cross-correlation of the positive and negative gradients. The negative gradient values may be combined (e.g., summed) with the overlap offset, and the combined values then compared with a threshold to estimate the amount of chop present in the audio signal. The chop detection model provided is low-complexity and is applicable to narrowband, wideband, and superwideband speech.

Type: Grant

Filed: May 21, 2013

Date of Patent: February 16, 2016

Assignee: GOOGLE INC.

Inventors: Andrew J. Hines, Jan Skoglund, Naomi Harte, Anil Kokaram
Vibration sensor and acoustic voice activity detection systems (VADS) for use with electronic systems

Patent number: 9263062

Abstract: A voice activity detector (VAD) combines the use of an acoustic VAD and a vibration sensor VAD as appropriate to the conditions a host device is operated. The VAD includes a first detector receiving a first signal and a second detector receiving a second signal. The VAD includes a first VAD component coupled to the first and second detectors. The first VAD component determines that the first signal corresponds to voiced speech when energy resulting from at least one operation on the first signal exceeds a first threshold. The VAD includes a second VAD component coupled to the second detector. The second VAD component determines that the second signal corresponds to voiced speech when a ratio of a second parameter corresponding to the second signal and a first parameter corresponding to the first signal exceeds a second threshold.

Type: Grant

Filed: August 5, 2013

Date of Patent: February 16, 2016

Assignee: AplihCom

Inventors: Zhinian Jing, Nicolas Jean Petit, Gregory C. Burnett
Switching off DTX for music

Patent number: 9263063

Abstract: The invention relates to a method for disabling a discontinuous transmission node DTX of a speech encoder if a music signal is detected in a call input signal. The music signal is detected by determining an activity factor corresponding to the relation of sound signal periods relative to scheme signal periods. If the activity factor is higher than a specified activity factor, the DTX is disabled.

Type: Grant

Filed: February 25, 2010

Date of Patent: February 16, 2016

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventors: Timo Suihko, Johan Gunnar Lundström, Arto Mahkonen
Reading order search method and program for recording groups on tape

Patent number: 9263064

Abstract: The present invention provides a search method used to search for the reading order of a plurality of recording groups when the plurality of recording groups written on tape are continuously read by a tape drive which manages data on tape in recording units having a fixed data length for each recording. This search method includes the steps of: receiving information on a plurality of tape groups to be read; and sorting the plurality of recording groups to be read so the reading time is shortened. In the sorting step, the time required to sort the plurality of reading groups is reduced by combining two or more recording groups into a single object to be sorted in the sorting step when at least two or more contiguous recording groups have been assigned to the same region or are assigned across adjacent regions among the plurality of regions.

Type: Grant

Filed: February 10, 2014

Date of Patent: February 16, 2016

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Takashi Katagiri, Mitsuhiro Nishida

prev … 89 90 91 92 93 94 95 96 97 … next