Patents Issued on February 16, 2016
  • Patent number: 9263015
    Abstract: An electronics module for an electric guitar is provided. The electronics module includes a processor, a plurality of controls, an antenna, and a computer-readable medium. The processor receives an audio signal generated by a vibration of a plurality of strings of the electric guitar. The plurality of controls are operably coupled to the processor and provide a mechanism for adjusting a sound created from the audio signal. The computer-readable medium is operably coupled to the processor and configured to cause the electric guitar to determine a control of the plurality of controls associated with the received effects parameter; adjust a state of the determined control based on the received effects parameter; modify the audio signal based on the plurality of controls and on the received effects parameter; and output the modified audio signal through the antenna to a second device.
    Type: Grant
    Filed: October 28, 2011
    Date of Patent: February 16, 2016
    Assignee: Gibson Brands, Inc.
    Inventor: Henry E. Juszkiewicz
  • Patent number: 9263016
    Abstract: Provided is an electronic musical instrument comprising: an input device for inputting sound generation instructions of tones at predetermined pitches; a tone generation device that generates tones with predetermined pitches based on sound generation instructions inputted by the input device; a specifying device that specifies a plurality of sound generation instructions inputted by the input device in a predetermined period as a sound generation instruction group; a sorting device that sorts the plurality of sound generation instructions composing the sound generation instruction group specified by the specifying device in a predetermined pitch order; and a control device that controls generation of tones by the tone generation device such that tones corresponding to the sound generation instruction group are generated in the order sorted by the sorting device.
    Type: Grant
    Filed: February 23, 2012
    Date of Patent: February 16, 2016
    Assignee: ROLAND CORPORATION
    Inventors: Mizuki Nakagawa, Shun Takai
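The grouping and sorting steps described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `(time, pitch)` event format, the 30 ms window, and the function name `group_and_sort` are assumptions made for illustration only.

```python
from typing import List, Tuple

# Hypothetical event format: (time in seconds, MIDI pitch number).
NoteOn = Tuple[float, int]


def group_and_sort(events: List[NoteOn], window: float = 0.03,
                   descending: bool = False) -> List[List[int]]:
    """Group note-on events that start within `window` seconds of the
    first event of a group, then sort each group by pitch."""
    groups: List[List[int]] = []
    current: List[int] = []
    start = None
    for t, pitch in sorted(events):  # process events in time order
        if start is None or t - start > window:
            # The window has closed: emit the finished group, open a new one.
            if current:
                groups.append(sorted(current, reverse=descending))
            current = [pitch]
            start = t
        else:
            current.append(pitch)
    if current:
        groups.append(sorted(current, reverse=descending))
    return groups


if __name__ == "__main__":
    chord = [(0.000, 64), (0.010, 60), (0.020, 67), (0.500, 72), (0.505, 65)]
    print(group_and_sort(chord))
```

Each inner list is the order in which a tone generator would sound the notes of one group; passing `descending=True` reverses the pitch order.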
  • Patent number: 9263017
    Abstract: In one aspect, the present invention is directed to a modular electronic musical keyboard instrument, comprising: an array of separate keyboard segments (30, 32), each comprising a piano keyboard (20) in a range of one octave; a control system, for converting a keystroke of each key in the array of keyboard segments to a note sound in a level corresponding to the order of the key in the array of keyboard segments; octave order setting means (22), for determining to the control system the order of each keyboard segment in the array of keyboard segments; communication means of each of the keyboard segments with the control system; and connection means (12, 14) with another keyboard segment.
    Type: Grant
    Filed: January 29, 2014
    Date of Patent: February 16, 2016
    Inventors: Ronen Lifshitz, Sharon Lerner
  • Patent number: 9263018
    Abstract: A computer-implemented method comprises receiving musical data including reference timing data, and a succession of musical notes arranged with respect to the reference timing data, receiving input corresponding to a selection of a groove template, and altering the arrangement of the notes (shifting in a positive or negative direction) with respect to the reference timing data based on the selected groove template. Altering the arrangement of notes can further include adding additional musical notes to the succession of musical notes to add stylistic embellishments particular to the selected groove template, where one of the selectable groove templates includes adding a shuffle dynamic to the succession of musical notes by determining a positive offset associated with each of the musical notes along the musical bar, and applying the positive offset to each of the musical notes, wherein the positive offset corresponds to the position of the musical note.
    Type: Grant
    Filed: July 13, 2013
    Date of Patent: February 16, 2016
    Assignee: Apple Inc.
    Inventors: Christoph Buskies, Matthias Gros, Thomas Sauer, Oliver Ludecke, Tobias Bade, Alexander H. Little
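One way to picture a position-dependent positive offset of the kind this abstract describes is a swing-style time warp within each beat. The sketch below is an assumption-laden illustration, not the method claimed in the patent: note positions are measured in beats, and the `ratio` parameter (where the off-beat eighth note lands) and both function names are hypothetical.

```python
from fractions import Fraction


def shuffle_offset(position, ratio=Fraction(2, 3)):
    """Return the non-negative delay (in beats) for a note at `position`.

    The first half of each beat is stretched to occupy `ratio` of the beat,
    and the second half is compressed into the remainder, so the offset
    depends only on where the note falls inside its beat.
    """
    beat = position // 1           # whole-beat part
    frac = position - beat         # position inside the beat, in [0, 1)
    half = Fraction(1, 2)
    if frac < half:
        warped = frac * ratio / half
    else:
        warped = ratio + (frac - half) * (1 - ratio) / half
    return warped - frac


def apply_shuffle(positions, ratio=Fraction(2, 3)):
    """Shift every note by its position-dependent offset."""
    return [p + shuffle_offset(p, ratio) for p in positions]


if __name__ == "__main__":
    straight_eighths = [Fraction(n, 2) for n in range(4)]
    print(apply_shuffle(straight_eighths))
```

With the default ratio, notes on the beat stay in place and off-beat eighth notes move from the midpoint of the beat to the two-thirds point, which gives a triplet feel. Exact fractions are used so that positions do not accumulate floating-point error.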
  • Patent number: 9263019
    Abstract: A percussion instrument including a first acoustic chamber housing having a tapered shape, a second acoustic chamber housing having the tapered shape, a sound board having the tapered shape arranged between the first acoustic chamber housing and the second acoustic chamber housing to form a first acoustic chamber defined by the first acoustic chamber housing and the sound board, and a second acoustic chamber defined by the second acoustic chamber housing and the sound board.
    Type: Grant
    Filed: July 10, 2013
    Date of Patent: February 16, 2016
    Inventor: Jon Greg Dahl
  • Patent number: 9263020
    Abstract: Provided is a sound source control information generating apparatus, adapted for performing slapping techniques. According to the present invention, information based on an output value of a first sensor that detects striking on the housing is stored in a memory means. If striking on the struck head of a percussion instrument is detected based on an output value of a second sensor that detects striking on the struck head, whether an output value equal to or greater than a predetermined value is obtained from the first sensor in a predetermined time interval before a timing of detecting the striking on the struck head is determined based on the information stored in the memory means.
    Type: Grant
    Filed: September 25, 2014
    Date of Patent: February 16, 2016
    Assignee: ROLAND CORPORATION
    Inventor: Ryo Takasaki
  • Patent number: 9263021
    Abstract: An apparatus for creating a musical composition comprising an audio interface, an audio converter module, and a multi-track compositor module is disclosed. The audio interface operably receives audio from an audio input device and outputs audio to an audio output device. The audio converter module is operably connected to the audio interface to convert audio received via the audio interface into an audio track having one or more partitions. The multi-track compositor module is configured to receive a first audio track and a second audio track and automatically score each partition of the first and second audio tracks based on one or more criteria. The multi-track compositor module is then configured to construct a third audio track from the partitions of the first and second audio tracks based on the scores for each partition. A method is also provided.
    Type: Grant
    Filed: April 5, 2013
    Date of Patent: February 16, 2016
    Assignee: ZYA, INC.
    Inventors: Travis Robert Savo, Francesco Gerald Capodieci, Reza Rassool, Michael Winter
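The score-and-select step in this abstract can be sketched as follows. The abstract does not name the scoring criteria, so the RMS-energy score used here is purely a placeholder assumption, as are the function names and the representation of a track as a list of sample lists.

```python
from typing import Callable, List, Sequence

Partition = Sequence[float]


def rms(partition: Partition) -> float:
    """Placeholder scoring criterion (assumed): root-mean-square level."""
    if not partition:
        return 0.0
    return (sum(x * x for x in partition) / len(partition)) ** 0.5


def composite(track_a: List[Partition], track_b: List[Partition],
              score: Callable[[Partition], float] = rms) -> List[Partition]:
    """Build a third track by taking, for each partition index, whichever
    of the two source partitions scores higher (ties go to track_a)."""
    if len(track_a) != len(track_b):
        raise ValueError("tracks must have the same number of partitions")
    return [a if score(a) >= score(b) else b
            for a, b in zip(track_a, track_b)]


if __name__ == "__main__":
    take_1 = [[0.1, 0.1], [0.9, -0.9]]
    take_2 = [[0.5, -0.5], [0.2, 0.2]]
    print(composite(take_1, take_2))
```

Any other callable that maps a partition to a number can be passed as `score`, which keeps the selection logic independent of the criterion.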
  • Patent number: 9263022
    Abstract: A method for transcoding music, according to various aspects of the present invention, includes in any practical order: (a) reading pitches and respective durations; (b) reading indicia of a quantity of beats per measure; (c) determining a word for each beat wherein: each word has one or more syllables, each syllable is associated with each pitch having duration that is within the duration of the beat; each syllable for a pitch, when preceded by a rest, comprises an initial consonant selected from the set consisting of ‘d’ and ‘t’; and each syllable comprises a vowel corresponding to an ordinal of the beat, wherein the vowel is selected from a set of vowels in accordance with the respective duration of the pitch associated with the syllable; and (d) outputting, for use by a music engraving engine, indicia of the pitches and words, in a manner that each syllable will be engraved in vertical alignment with the indicia of the associated pitch.
    Type: Grant
    Filed: July 8, 2015
    Date of Patent: February 16, 2016
    Inventor: William R Bachand
  • Patent number: 9263023
    Abstract: A system and method for reproducing audio sound. Audio sound, based on an audio signal, is emitted within a space comprising an audio beamwidth. Modulated ultrasonic sound energy based on a modulated ultrasonic sound signal is emitted in an ultrasonic sound direction within an ultrasonic beamwidth that is less than and within the audio beamwidth. The modulated ultrasonic sound signal is generated such that the emitted modulated ultrasonic sound energy creates an audible cancellation sound with an amplitude substantially equal to an amplitude of the audio sound at a point within the ultrasonic beamwidth so as to combine with and cancel the audio sound at the point by being substantially out of phase with the audio sound along the ultrasonic sound direction.
    Type: Grant
    Filed: October 25, 2013
    Date of Patent: February 16, 2016
    Assignee: BlackBerry Limited
    Inventor: Isao Ginn Anazawa
  • Patent number: 9263024
    Abstract: In an embodiment, a circuit arrangement for active noise cancellation, comprises a first input (E1) for supplying a playback signal (Spb), a second input (E2) for supplying a sensor signal (Sanc), a first and a second terminal (A1, A2) of an output that is designed for being connected to a loudspeaker (Lsp) and a compensating device for respectively generating a first and a second noise signal (Sanc1, Sanc2) as a function of the sensor signal (Sanc), wherein the first and the second input (E1, E2) are coupled to the first and the second terminal of the output (A1, A2) by means of the compensating device (Komp) in such a way that a virtual playback signal (Ssp1) is provided at the first terminal (A1) of the output (A1, A2) and a superposition signal (Ssp2) is provided at the second terminal (A2) of the output (A1, A2) such that a differential signal between the virtual playback signal (Ssp1) and the superposition signal (Ssp2) can be fed to the loudspeaker.
    Type: Grant
    Filed: November 9, 2011
    Date of Patent: February 16, 2016
    Assignee: ams AG
    Inventor: Helmut Theiler
  • Patent number: 9263025
    Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In a speech coding method according to code-excited linear prediction (CELP) speech coding, a noise level of a speech in a concerning coding period is evaluated by using a code or coding result of at least one of spectrum information, power information, and pitch information, and various excitation codebooks are used based on an evaluation result.
    Type: Grant
    Filed: February 25, 2014
    Date of Patent: February 16, 2016
    Assignee: BlackBerry Limited
    Inventor: Tadashi Yamaura
  • Patent number: 9263026
    Abstract: A screen reader software product for low-vision users, the software having a reader module collecting textual and non-textual display information generated by a web browser or word processor. Font styling, interface layout information and the like are communicated to the end user by sounds broadcast simultaneously rather than serially with the synthesized speech to improve the speed and efficiency in which information may be digested by the end user.
    Type: Grant
    Filed: July 11, 2014
    Date of Patent: February 16, 2016
    Assignee: Freedom Scientific, Inc.
    Inventors: Christian D. Hofstader, Glen Gordon, Eric Damery, Ralph Ocampo, David Baker, Joseph K. Stephen
  • Patent number: 9263027
    Abstract: A broadcast signal receiver comprises a text data receiver for receiving broadcast text data for display to a user in relation to a user interface; a text-to-speech (TTS) converter for converting received text data into an audio speech signal, the TTS converter being operable to detect whether a word for conversion is included in a stored list of words for conversion and, if so, to convert that word according to a conversion defined by the stored list; and if not, to convert that word according to a set of predetermined conversion rules; a conversion memory storing the list of words for conversion by the TTS converter; and an update receiver for receiving additional words and associated conversions for storage in the conversion memory.
    Type: Grant
    Filed: June 1, 2011
    Date of Patent: February 16, 2016
    Assignee: SONY EUROPE LIMITED
    Inventors: Huw Hopkins, Timothy Edmunds
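The lookup order in this abstract (stored word list first, predetermined rules otherwise, with the list updatable at run time) can be sketched as below. The class name, the string form of a "conversion", and the letter-by-letter fallback rule are all assumptions for illustration; a real converter would produce phonemes or audio.

```python
from typing import Dict, Optional


def rule_based(word: str) -> str:
    """Stand-in for the predetermined conversion rules (assumed):
    spell the word out letter by letter."""
    return "-".join(word)


class PronunciationConverter:
    """Converts words using a stored exception list before falling back
    to rule-based conversion."""

    def __init__(self, exceptions: Optional[Dict[str, str]] = None):
        # Conversion memory: lower-cased word -> stored conversion.
        self.exceptions = {w.lower(): p for w, p in (exceptions or {}).items()}

    def update(self, additions: Dict[str, str]) -> None:
        """Store additional words and conversions received later."""
        self.exceptions.update({w.lower(): p for w, p in additions.items()})

    def convert(self, word: str) -> str:
        key = word.lower()
        if key in self.exceptions:
            return self.exceptions[key]
        return rule_based(key)


if __name__ == "__main__":
    tts = PronunciationConverter({"BBC": "bee bee see"})
    print(tts.convert("bbc"), "|", tts.convert("cat"))
    tts.update({"cat": "kat"})
    print(tts.convert("Cat"))
```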
  • Patent number: 9263028
    Abstract: An input signal that includes linguistic content in a first language may be received by a computing device. The linguistic content may include text or speech. The computing device may associate the linguistic content in the first language with one or more phonemes from a second language. The computing device may also determine a phonemic representation of the linguistic content in the first language based on use of the one or more phonemes from the second language. The phonemic representation may be indicative of a pronunciation of the linguistic content in the first language according to speech sounds of the second language.
    Type: Grant
    Filed: May 21, 2014
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventors: Javier Gonzalvo Fructuoso, Ioannis Agiomyrgiannakis
  • Patent number: 9263029
    Abstract: The present disclosure discloses a speech recognition method and a terminal, which belong to the field of communications. The method comprises: receiving speech information inputted by a user; acquiring the current environment information, and judging whether the speech information needs to be played according to the current environment information; and recognizing the speech information as text information, when it is judged that the speech information does not need to be played. The terminal comprises an acquisition module, a judgment module and a recognition module. The present disclosure provides the speech receiver with a speech recognition function, so that when the speech information of the instant messaging is received by the terminal, it can help the receiver to normally acquire the content to be expressed by the speech sender under an inconvenient situation.
    Type: Grant
    Filed: March 1, 2013
    Date of Patent: February 16, 2016
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Yisha Lu
  • Patent number: 9263030
    Abstract: A speech recognition system adaptively estimates a warping factor used to reduce speaker variability. The warping factor is estimated using a small window (e.g. 100 ms) of speech. The warping factor is adaptively adjusted as more speech is obtained until the warping factor converges or a pre-defined maximum number of adaptations is reached. The speaker may be placed into a group selected from two or more groups based on characteristics that are associated with the speaker's window of speech. Different step sizes may be used within the different groups when estimating the warping factor. VTLN is applied to the speech input using the estimated warping factor. A linear transformation, including a bias term, may also be computed to assist in normalizing the speech along with the application of the VTLN.
    Type: Grant
    Filed: January 23, 2013
    Date of Patent: February 16, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Shizhen Wang, Yifan Gong, Fileno Alleva
  • Patent number: 9263031
    Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.
    Type: Grant
    Filed: November 15, 2013
    Date of Patent: February 16, 2016
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Srinivas Bangalore, Narendra K. Gupta, Mazin G. Rahim
  • Patent number: 9263032
    Abstract: A voice-responsive building management system is described herein. One system includes an interface, a dynamic grammar builder, and a speech processing engine. The interface is configured to receive a speech card of a user, wherein the speech card of the user includes speech training data of the user and domain vocabulary for applications of the building management system for which the user is authorized. The dynamic grammar builder is configured to generate grammar from a building information model of the building management system. The speech processing engine is configured to receive a voice command or voice query from the user, and execute the voice command or voice query using the speech training data of the user, the domain vocabulary, and the grammar generated from the building information model.
    Type: Grant
    Filed: October 24, 2013
    Date of Patent: February 16, 2016
    Assignee: Honeywell International Inc.
    Inventor: Jayaprakash Meruva
  • Patent number: 9263033
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a set of training utterances. The methods, systems, and apparatus include actions of obtaining a target multi-dimensional distribution of characteristics in an initial set of candidate utterances and selecting a subset of the initial set of candidate utterances based on speech recognition confidence scores associated with the candidate utterances. Additional actions include selecting a particular candidate utterance from the subset of the initial set of utterances and determining that adding the particular candidate utterance to a set of training utterances reduces a divergence of a multi-dimensional distribution of the characteristics in the set of training utterances from the target multi-dimensional distribution. Further actions include adding the particular candidate utterance to the set of training utterances.
    Type: Grant
    Filed: June 25, 2014
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventors: Olivier Siohan, Pedro J. Mengibar
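The selection loop in this abstract (filter by recognition confidence, then add an utterance only if it brings the training set closer to a target distribution) can be sketched as follows. This is a simplified illustration under stated assumptions: a single characteristic per utterance instead of a multi-dimensional one, an L1 distance as the divergence measure, a fixed confidence threshold, and hypothetical function names.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

# Hypothetical candidate format: (utterance text, characteristic, confidence).
Candidate = Tuple[str, str, float]


def l1_divergence(counts: Counter, target: Dict[str, float]) -> float:
    """Assumed divergence measure: L1 distance between the empirical
    distribution in `counts` and `target`. An empty set is treated as
    infinitely far away so that the first utterance is always accepted."""
    total = sum(counts.values())
    if total == 0:
        return float("inf")
    keys = set(counts) | set(target)
    return sum(abs(counts[k] / total - target.get(k, 0.0)) for k in keys)


def select_training_set(candidates: Iterable[Candidate],
                        target: Dict[str, float],
                        min_confidence: float = 0.8) -> List[str]:
    """Greedily keep confident utterances that reduce the divergence."""
    selected: List[str] = []
    counts: Counter = Counter()
    for text, label, confidence in candidates:
        if confidence < min_confidence:
            continue  # drop utterances the recognizer was unsure about
        before = l1_divergence(counts, target)
        counts[label] += 1
        if l1_divergence(counts, target) < before:
            selected.append(text)
        else:
            counts[label] -= 1  # undo: adding it would not help
    return selected


if __name__ == "__main__":
    target = {"male": 0.5, "female": 0.5}
    pool = [("a", "male", 0.9), ("b", "male", 0.95),
            ("c", "female", 0.9), ("d", "female", 0.5)]
    print(select_training_set(pool, target))
```

To approximate a multi-dimensional distribution, the characteristic label could be a tuple such as `(gender, accent)`, with the target keyed the same way.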
  • Patent number: 9263034
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving voice queries, obtaining, for one or more of the voice queries, feedback information that references an action taken by a user that submitted the voice query after reviewing a result of the voice query, generating, for the one or more voice queries, a posterior recognition confidence measure that reflects a probability that the voice query was correctly recognized, wherein the posterior recognition confidence measure is generated based at least on the feedback information for the voice query, selecting a subset of the one or more voice queries based on the posterior recognition confidence measures, and adapting an acoustic model using the subset of the voice queries.
    Type: Grant
    Filed: July 13, 2010
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventors: Brian Strope, Douglas H. Beeferman
  • Patent number: 9263035
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.
    Type: Grant
    Filed: March 21, 2014
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventor: Matthew Sharifi
  • Patent number: 9263036
    Abstract: Deep recurrent neural networks applied to speech recognition. The deep recurrent neural networks (RNNs) are preferably implemented by stacked long short-term memory bidirectional RNNs. The RNNs are trained using end-to-end training with suitable regularization.
    Type: Grant
    Filed: November 26, 2013
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventor: Alexander B. Graves
  • Patent number: 9263037
    Abstract: A method and system of providing an interactive manual, including a speech engine to receive and process speech from a user, convert the speech into a word sequence, and identify meaning structures from the word sequence, a structured manual including information related to an operation of a device, a visual model to relate visual representation of the information, a dialog management arrangement to interpret the meaning structures in a context and to extract pertinent information and the visual representation from the structured manual and the visual model, and an output arrangement to output the information and visual representation.
    Type: Grant
    Filed: April 15, 2010
    Date of Patent: February 16, 2016
    Assignee: Robert Bosch GmbH
    Inventors: Fuliang Weng, Hauke Schmidt, Gengyan Bei
  • Patent number: 9263038
    Abstract: A facility and method for analyzing and classifying calls without transcription via keyword spotting is disclosed. The facility uses a group of calls having known outcomes to generate one or more domain- or entity-specific grammars containing keywords and related information that are indicative of particular outcome. The facility monitors telephone calls by determining the domain or entity associated with the call, loading the appropriate grammar or grammars associated with the determined domain or entity, and tracking keywords contained in the loaded grammar or grammars that are spoken during the monitored call, along with additional information. The facility performs a statistical analysis on the tracked keywords and additional information to determine a classification for the monitored telephone call.
    Type: Grant
    Filed: October 3, 2013
    Date of Patent: February 16, 2016
    Assignee: Marchex, Inc.
    Inventors: Jason Flaks, Ziad Ismail, Chris Kolbegger
  • Patent number: 9263039
    Abstract: Systems and methods are provided for receiving speech and non-speech communications of natural language questions and/or commands, transcribing the speech and non-speech communications to textual messages, and executing the questions and/or commands. The invention applies context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for one or more users presenting questions or commands across multiple domains. The systems and methods create, store, and use extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech and non-speech communications and presenting the expected results for a particular question or command.
    Type: Grant
    Filed: September 29, 2014
    Date of Patent: February 16, 2016
    Assignee: Nuance Communications, Inc.
    Inventors: Philippe Di Cristo, Min Ke, Robert A. Kennewick, Lynn Elise Armstrong
  • Patent number: 9263040
    Abstract: An audio signal may be received, in a processor associated with a vehicle. Sound related vehicle information representing one or more sounds may be received by the processor. The sound related vehicle information may or may not include an audio signal. A speech recognition process or system may be modified based on the sound related vehicle information.
    Type: Grant
    Filed: January 17, 2012
    Date of Patent: February 16, 2016
    Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC
    Inventors: Eli Tzirkel-Hancock, Omer Tsimhoni
  • Patent number: 9263041
    Abstract: Methods related to Generalized Mutual Interdependence Analysis (GMIA), a low complexity statistical method for projecting data in a subspace that captures invariant properties of the data, are implemented on a processor based system. GMIA methods are applied to the signal processing problem of voice activity detection and classification. Real-world conversational speech data are modeled to fit the GMIA assumptions. Low complexity GMIA computations extract reliable features for classification of sound under noisy conditions and operate with small amounts of data. A speaker is characterized by a slow varying or invariant channel that is learned and is tracked from single channel data by GMIA methods.
    Type: Grant
    Filed: March 14, 2013
    Date of Patent: February 16, 2016
    Assignee: Siemens Aktiengesellschaft
    Inventors: Heiko Claussen, Justinian Rosca
  • Patent number: 9263042
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, for each of multiple words or sub-words, audio data corresponding to multiple users speaking the word or sub-word; training, for each of the multiple words or sub-words, a pre-computed hotword model for the word or sub-word based on the audio data for the word or sub-word; receiving a candidate hotword from a computing device; identifying one or more pre-computed hotword models that correspond to the candidate hotword; and providing the identified, pre-computed hotword models to the computing device.
    Type: Grant
    Filed: July 25, 2014
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventor: Matthew Sharifi
  • Patent number: 9263043
    Abstract: A method and device are disclosed for reducing and controlling stuttering. The method involves tactile feedback of the stutterer's own speech to reduce stuttering. In one embodiment, the device may detect speech by audible or mechanical means, and the feedback may be produced by vibration mechanisms.
    Type: Grant
    Filed: December 3, 2010
    Date of Patent: February 16, 2016
    Assignee: UNIVERSITY OF MISSISSIPPI
    Inventors: Gregory John Snyder, Dwight E. Waddell, II, Paul Mallette Goggans
  • Patent number: 9263044
    Abstract: A computing device can capture video data of at least a portion of a mouth area (e.g., mouth, lips, tongue, chin, jaw) of a user of the device. The computing device can also capture sound data including a voice of the user as well as noise (e.g. background noise). The video data can be processed to detect a movement of the portion of the mouth area. The movement of the portion of the mouth area can be analyzed and compared with mouth area movement models characteristic of oral communication (e.g., speech, song). If the movement of the portion of the mouth area corresponds to at least one model characteristic of oral communication, then the movement indicates that the user is likely engaging in oral communication. Noise reduction can be applied and/or increased on the captured sound data to reduce noise and in turn enhance the user's voice.
    Type: Grant
    Filed: June 27, 2012
    Date of Patent: February 16, 2016
    Assignee: Amazon Technologies, Inc.
    Inventors: Ryan H. Cassidy, Yuzo Watanabe, Isaac S. Noble
  • Patent number: 9263045
    Abstract: Concepts and technologies are described herein for multi-mode text input. In accordance with the concepts and technologies disclosed herein, content is received. The content can include one or more input indicators. The input indicators can indicate that user input can be used in conjunction with consumption or use of the content. The application is configured to analyze the content to determine context associated with the content and/or the client device executing the application. The application also is configured to determine, based upon the content and/or the contextual information, which input device to use to obtain input associated with use or consumption of the content. Input captured with the input device can be converted to text and used during use or consumption of the content.
    Type: Grant
    Filed: May 17, 2011
    Date of Patent: February 16, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Mohan Varthakavi, Jayaram N M Nanduri, Nikhil Kothari
  • Patent number: 9263046
    Abstract: A distributed dictation/transcription system is provided. The system provides a dictation manager having a data port to receive and transmit audio signals. The dictation manager includes a dictation server selector to select a dictation server to transcribe the audio based on whether the dictation server already has a user profile uploaded.
    Type: Grant
    Filed: April 1, 2013
    Date of Patent: February 16, 2016
    Assignee: NVOQ INCORPORATED
    Inventors: Richard Beach, Christopher Butler, Jon Ford, Brian Marquette, Christopher Omland
  • Patent number: 9263047
    Abstract: A system that incorporates teachings of the present disclosure may include, for example, a server including a controller to receive audio signals and content identification information from a media processor, generate text representing a voice message based on the audio signals, determine an identity of media content based on the content identification information, generate an enhanced message having text and additional content where the additional content is obtained by the controller based on the identity of the media content, and transmit the enhanced message to the media processor for presentation on the display device, where the enhanced message is accessible by one or more communication devices that are associated with a social network and remote from the media processor. Other embodiments are disclosed.
    Type: Grant
    Filed: November 5, 2014
    Date of Patent: February 16, 2016
    Assignee: AT&T INTELLECTUAL PROPERTY I, LP
    Inventors: Hisao Chang, Bernard S. Renger
  • Patent number: 9263048
    Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
    Type: Grant
    Filed: June 23, 2015
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventors: Michael J. LeBeau, William J. Byrne, John Nicholas Jitkoff, Brandon M. Ballinger, Trausti T. Kristjansson
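The present-select-replace flow in this abstract can be sketched with a deliberately simplified lattice. Here the word lattice is flattened into a list of slots, each holding ranked alternatives (closer to a confusion network than a true lattice); that representation and all function names are assumptions for illustration.

```python
from typing import List

# Simplified, assumed lattice: one list of ranked alternatives per word slot.
Lattice = List[List[str]]


def best_transcript(lattice: Lattice) -> List[str]:
    """Words initially presented: the top-ranked entry of each slot."""
    return [slot[0] for slot in lattice]


def alternates(lattice: Lattice, index: int) -> List[str]:
    """Alternate words offered when the user selects the word at `index`."""
    return lattice[index][1:]


def replace_word(words: List[str], lattice: Lattice,
                 index: int, choice: str) -> List[str]:
    """Return a copy of `words` with the selected word swapped for the
    chosen alternate, which must come from the same lattice slot."""
    if choice not in lattice[index]:
        raise ValueError(f"{choice!r} is not an alternate for slot {index}")
    corrected = list(words)
    corrected[index] = choice
    return corrected


if __name__ == "__main__":
    lattice = [["we're", "were"], ["coming"], ["about", "around"], ["11:30"]]
    words = best_transcript(lattice)
    print(words, alternates(lattice, 2))
    print(replace_word(words, lattice, 2, "around"))
```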
  • Patent number: 9263049
    Abstract: Various techniques are disclosed for improving packet loss concealment to reduce artifacts by using audio character measures of the audio signal. These techniques include attenuation to a noise fill instead of attenuation to silence, varying how long to wait before attenuating the extrapolation, varying the rate of attenuation of the extrapolation, attenuating periodic extrapolation at a different rate than non-periodic extrapolation, performing periodic extrapolation on successively longer fill data based on the audio character measures, adjusting weighting between periodic and non-periodic extrapolation based on the audio character measures, and adjusting weighting between periodic extrapolation and non-periodic extrapolation non-linearly.
    Type: Grant
    Filed: October 25, 2010
    Date of Patent: February 16, 2016
    Assignee: Polycom, Inc.
    Inventor: Eric David Elias
  • Patent number: 9263050
    Abstract: A method is provided for allocating bits for quantifying spatial information parameters by frequency sub-band for parametric encoding/decoding of a multichannel audio stream representative of a soundstage consisting of a plurality of sound sources. The method includes a step of quantifying or inversely quantifying, by frequency sub-band, spatial information parameters for the sound sources of the soundscape. The method further includes: assessing a spatial resolution of the current sub-band on the basis of the spectral properties of the sub-band; and determining a number of bits to be allocated to the current sub-band, the number of bits to be allocated being inversely proportional to the estimated spatial resolution. Also provided is a device for allocating quantification bits implementing the above-described method.
    Type: Grant
    Filed: March 28, 2012
    Date of Patent: February 16, 2016
    Assignee: ORANGE
    Inventors: Adrien Daniel, Rozenn Nicol
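    Example: a minimal sketch of the allocation rule stated in the abstract, in which the bit count for a sub-band is inversely proportional to its estimated spatial resolution. The function name, the `scale` constant, and the clamping limits are assumptions made for illustration; the abstract does not specify them.

```python
def allocate_bits(resolutions, scale=32.0, min_bits=1, max_bits=8):
    """Return one bit count per sub-band.

    resolutions: estimated spatial resolution per sub-band (larger = coarser).
    scale, min_bits, max_bits: hypothetical tuning values, not from the patent.
    """
    bits = []
    for r in resolutions:
        if r <= 0:
            raise ValueError("spatial resolution must be positive")
        # Inverse proportionality: a coarser resolution gets fewer bits.
        b = round(scale / r)
        # Clamp so every sub-band stays within the allowed bit range.
        bits.append(max(min_bits, min(max_bits, b)))
    return bits


if __name__ == "__main__":
    print(allocate_bits([4.0, 8.0, 16.0]))
```

    Doubling the estimated resolution value halves the bit budget for that sub-band until a clamp limit is reached.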
  • Patent number: 9263051
    Abstract: A method, system and program for decoding a speech signal. In some embodiments, the method comprises: receiving an encoded speech signal having quantization values; transforming the quantization values by adding simulated random-noise samples; and from the encoded speech signal, determining a parameter of the transformation that is usable to control the transformation of the quantization values.
    Type: Grant
    Filed: February 17, 2014
    Date of Patent: February 16, 2016
    Assignee: Skype
    Inventor: Koen Bernard Vos
  • Patent number: 9263052
    Abstract: A method and system is disclosed for simultaneously determining glottal closure instants (GCIs), fundamental frequency (F0s), and voicing state of a speech signal. A speech signal may be processed to determine a sequence of candidate GCIs. For each candidate GCI, a set of candidate F0s may be determined. A lattice of hypotheses may be constructed, where each lattice point is a hypothesis of a concurrence of a candidate GCI, a candidate F0, and voicing state. Each given hypothesis may also include a score of the candidate GCI, F0, and voicing state for evaluating a cost of the given hypothesis and a cost of connections between the given hypothesis and other hypotheses of the lattice. Dynamic programming may be used to determine a least-cost path through the lattice, and backtracking across the path may be used to determine an optimal set of GCIs, F0s and voicing states of the speech signal.
    Type: Grant
    Filed: January 25, 2013
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventor: David Talkin
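    Example: a minimal sketch of the dynamic-programming step described in the abstract, finding a least-cost path through a lattice and recovering it by backtracking. The lattice here is a generic list of per-step hypothesis costs plus a transition-cost callback; the names and the cost model are assumptions for illustration and do not reflect the patent's actual GCI/F0/voicing scoring.

```python
def least_cost_path(node_costs, transition_cost):
    """Return (total_cost, path) for the cheapest path through a lattice.

    node_costs[t][k]: cost of hypothesis k at step t.
    transition_cost(t, i, k): cost of moving from hypothesis i at step t-1
    to hypothesis k at step t (a hypothetical callback).
    """
    best = list(node_costs[0])   # cheapest cost to reach each hypothesis so far
    back = []                    # back[t-1][k]: best predecessor of k at step t
    for t in range(1, len(node_costs)):
        new_best, pointers = [], []
        for k, cost in enumerate(node_costs[t]):
            # Choose the predecessor that minimizes accumulated + transition cost.
            j = min(range(len(best)),
                    key=lambda i: best[i] + transition_cost(t, i, k))
            new_best.append(best[j] + transition_cost(t, j, k) + cost)
            pointers.append(j)
        best = new_best
        back.append(pointers)
    # Backtrack from the cheapest final hypothesis.
    end = min(range(len(best)), key=best.__getitem__)
    path = [end]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return best[end], path


if __name__ == "__main__":
    costs = [[1, 5], [4, 1], [1, 3]]
    switch_penalty = lambda t, i, k: 0 if i == k else 2
    print(least_cost_path(costs, switch_penalty))
```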
  • Patent number: 9263053
    Abstract: A method (1100) and apparatus (100) generate a candidate code-vector to code an information signal. The method can include producing (1110) a weighted target vector from an input signal. The method can include processing (1120) the weighted target vector through an inverse weighting function to create a residual domain target vector. The method can include performing (1130) a first search process on the residual domain target vector to obtain an initial fixed codebook code-vector. The method can include performing (1140) a second search process over a subset of possible codebook code-vectors for a low weighted-domain error to produce a final fixed codebook code-vector. The subset of possible codebook code-vectors can be based on the initial fixed codebook code-vector. The method can include generating (1150) a codeword representative of the final fixed codebook code-vector. The codeword can be for use by a decoder to generate an approximation of the input signal.
    Type: Grant
    Filed: November 2, 2012
    Date of Patent: February 16, 2016
    Assignee: GOOGLE TECHNOLOGY HOLDINGS LLC
    Inventors: James P Ashley, Udar Mittal
  • Patent number: 9263054
    Abstract: A method for controlling an average encoding rate by an electronic device is described. The method includes obtaining a speech signal. The method also includes determining a first average rate. The method further includes determining a first threshold based on the first average rate. The method additionally includes controlling the average encoding rate by determining at least one other threshold based on the first threshold. The method also includes sending an encoded speech signal.
    Type: Grant
    Filed: August 30, 2013
    Date of Patent: February 16, 2016
    Assignee: QUALCOMM Incorporated
    Inventors: Subasingha Shaminda Subasingha, Vivek Rajendran, Venkatesh Krishnan, Venkatraman Srinivasa Atti
  • Patent number: 9263055
    Abstract: Systems and methods for generating and performing a three-dimensional audio CAPTCHA are provided. One exemplary system can include a decoy signal database storing a plurality of decoy signals. The system also can include a three-dimensional audio simulation engine for simulating the sounding of a target signal and at least one decoy signal in an acoustic environment and outputting a stereophonic audio signal based on the simulation. One exemplary method includes providing an audio prompt to a resource requesting entity. The audio prompt can have been generated based on a three-dimensional audio simulation of the sounding of a target signal containing an authentication key and at least one decoy signal in an acoustic environment. The method can include receiving a response to the audio prompt from the resource requesting entity and comparing the response to the authentication key.
    Type: Grant
    Filed: April 10, 2013
    Date of Patent: February 16, 2016
    Assignee: Google Inc.
    Inventors: Yannis Agiomyrgiannakis, Edison Tan, David John Abraham
  • Patent number: 9263056
    Abstract: A method of simultaneously transforming at least two input voice signals xi of a communications system (30), each input voice signal xi being received at a specific reception frequency Fi and corresponding to the voice of a remote party communicating with a user of the communications system (30). During an initialization stage, a transformation Ti is allocated to at least one reception frequency Fi of the input voice signals xi, and during a utilization stage, transformations Ti are applied simultaneously to the input voice signals xi as a function of the reception frequencies Fi, modifying at least one characteristic of each of the input voice signals xi. Thus, the voice of each remote party in communication with the user of the communications system (30) is modified artificially by a transformation Ti, thereby making it easier for the user to perceive and discriminate between simultaneous voices from the remote parties.
    Type: Grant
    Filed: May 7, 2015
    Date of Patent: February 16, 2016
    Assignee: Airbus Helicopters
    Inventor: Jean-Pierre Baudry
  • Patent number: 9263057
    Abstract: An audio encoder has a window function controller, a windower, a time warper with a final quality check functionality, a time/frequency converter, a TNS stage or a quantizer encoder; the window function controller, the time warper, the TNS stage or an additional noise filling analyzer are controlled by signal analysis results obtained by a time warp analyzer or a signal classifier. Furthermore, a decoder applies a noise filling operation using a manipulated noise filling estimate depending on a harmonic or speech characteristic of the audio signal.
    Type: Grant
    Filed: November 11, 2014
    Date of Patent: February 16, 2016
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Stefan Bayer, Sascha Disch, Ralf Geiger, Guillaume Fuchs, Max Neuendorf, Gerald Schuller, Bernd Edler
  • Patent number: 9263058
    Abstract: A vehicle based system and method for receiving voice inputs and determining whether to perform a voice recognition analysis using in-vehicle resources or resources external to the vehicle.
    Type: Grant
    Filed: June 24, 2011
    Date of Patent: February 16, 2016
    Assignee: Honda Motor Co., Ltd.
    Inventors: Ritchie Winson Huang, Pedram Vaghefinazari, Stuart Yamamoto
  • Patent number: 9263059
    Abstract: In a method for deep tagging a recording, a computer records audio comprising speech from one or more people. The computer detects a non-speech sound within the audio. The computer determines that the non-speech sound corresponds to a type of sound, and in response, associates a descriptive term with a time of occurrence of the non-speech sound within the recorded audio to form a searchable tag. The computer stores the searchable tag as metadata of the recorded audio.
    Type: Grant
    Filed: September 28, 2012
    Date of Patent: February 16, 2016
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Denise A. Bell, Lisa Seacat DeLuca, Jana H. Jenkins, Jeffrey A. Kusnitz
  • Patent number: 9263060
    Abstract: A system for classification of the emotional content of music is provided. An encoder receives a digital audio recording of a piece of music, and encodes it using musical notes and associated amplitudes. An artificial neural network is configured to take a plurality of encoded time slices and provide output indicative of the emotional content of the music.
    Type: Grant
    Filed: August 21, 2012
    Date of Patent: February 16, 2016
    Assignee: MARIAN MASON PUBLISHING COMPANY, LLC
    Inventor: David A. Sharp
  • Patent number: 9263061
    Abstract: Methods and systems are provided for detecting chop in an audio signal. A time-frequency representation, such as a spectrogram, is created for an audio signal and used to calculate a gradient of mean power per frame of the audio signal. Positive and negative gradients are defined for the signal based on the gradient of mean power, and a maximum overlap offset between the positive and negative gradients is determined by calculating a value that maximizes the cross-correlation of the positive and negative gradients. The negative gradient values may be combined (e.g., summed) with the overlap offset, and the combined values then compared with a threshold to estimate the amount of chop present in the audio signal. The chop detection model provided is low-complexity and is applicable to narrowband, wideband, and superwideband speech.
    Type: Grant
    Filed: May 21, 2013
    Date of Patent: February 16, 2016
    Assignee: GOOGLE INC.
    Inventors: Andrew J. Hines, Jan Skoglund, Naomi Harte, Anil Kokaram
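    Example: a minimal sketch of the first stages described in the abstract: take the gradient of mean power per frame, split it into positive and negative parts, and find the lag that maximizes their cross-correlation. The function name, the `max_lag` search range, and the use of plain Python lists are assumptions for illustration; the later combining and thresholding steps are not modeled here.

```python
def overlap_offset(mean_power, max_lag):
    """Return the lag (in frames) that best aligns power rises with power drops.

    mean_power: mean power per frame, e.g. derived from a spectrogram.
    max_lag: hypothetical search range for the offset, in frames.
    """
    # Frame-to-frame gradient of mean power.
    grad = [b - a for a, b in zip(mean_power, mean_power[1:])]
    # Positive gradient keeps rises; negative gradient keeps drops (as magnitudes).
    pos = [max(g, 0.0) for g in grad]
    neg = [max(-g, 0.0) for g in grad]

    def xcorr(lag):
        # Cross-correlation of pos with neg shifted by `lag` frames.
        return sum(p * neg[i + lag]
                   for i, p in enumerate(pos)
                   if 0 <= i + lag < len(neg))

    return max(range(-max_lag, max_lag + 1), key=xcorr)


if __name__ == "__main__":
    # One burst of power lasting three frames.
    print(overlap_offset([0, 0, 1, 1, 1, 0, 0, 0], max_lag=4))
```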
  • Patent number: 9263062
    Abstract: A voice activity detector (VAD) combines the use of an acoustic VAD and a vibration sensor VAD as appropriate to the conditions a host device is operated. The VAD includes a first detector receiving a first signal and a second detector receiving a second signal. The VAD includes a first VAD component coupled to the first and second detectors. The first VAD component determines that the first signal corresponds to voiced speech when energy resulting from at least one operation on the first signal exceeds a first threshold. The VAD includes a second VAD component coupled to the second detector. The second VAD component determines that the second signal corresponds to voiced speech when a ratio of a second parameter corresponding to the second signal and a first parameter corresponding to the first signal exceeds a second threshold.
    Type: Grant
    Filed: August 5, 2013
    Date of Patent: February 16, 2016
    Assignee: AliphCom
    Inventors: Zhinian Jing, Nicolas Jean Petit, Gregory C. Burnett
  • Patent number: 9263063
    Abstract: The invention relates to a method for disabling a discontinuous transmission mode DTX of a speech encoder if a music signal is detected in a call input signal. The music signal is detected by determining an activity factor corresponding to the relation of sound signal periods relative to silence signal periods. If the activity factor is higher than a specified activity factor, the DTX is disabled.
    Type: Grant
    Filed: February 25, 2010
    Date of Patent: February 16, 2016
    Assignee: Telefonaktiebolaget L M Ericsson (publ)
    Inventors: Timo Suihko, Johan Gunnar Lundström, Arto Mahkonen
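    Example: a minimal sketch of the decision described in the abstract, under the assumption that the activity factor is the ratio of frames containing sound to frames without sound. The per-frame boolean flags, the function names, and the default threshold are hypothetical; the abstract does not give concrete values.

```python
def activity_factor(frame_is_sound):
    """Ratio of sound frames to non-sound frames in an observation window."""
    sound = sum(1 for flag in frame_is_sound if flag)
    silence = len(frame_is_sound) - sound
    if silence == 0:
        # No silent frames at all: treat continuous sound as unbounded activity.
        return float("inf") if sound else 0.0
    return sound / silence


def dtx_enabled(frame_is_sound, max_activity_factor=4.0):
    """Keep DTX on only while the activity factor stays at or below the limit."""
    return activity_factor(frame_is_sound) <= max_activity_factor


if __name__ == "__main__":
    speech_like = [True] * 6 + [False] * 4    # frequent pauses
    music_like = [True] * 19 + [False] * 1    # almost no pauses
    print(dtx_enabled(speech_like), dtx_enabled(music_like))
```

    Conversational speech contains regular pauses, so its activity factor stays low, while music is nearly continuous and pushes the factor above the limit.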
  • Patent number: 9263064
    Abstract: The present invention provides a search method used to search for the reading order of a plurality of recording groups when the plurality of recording groups written on tape are continuously read by a tape drive which manages data on tape in recording units having a fixed data length for each recording. This search method includes the steps of: receiving information on a plurality of recording groups to be read; and sorting the plurality of recording groups to be read so the reading time is shortened. In the sorting step, the time required to sort the plurality of recording groups is reduced by combining two or more recording groups into a single object to be sorted in the sorting step when at least two or more contiguous recording groups have been assigned to the same region or are assigned across adjacent regions among the plurality of regions.
    Type: Grant
    Filed: February 10, 2014
    Date of Patent: February 16, 2016
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Takashi Katagiri, Mitsuhiro Nishida
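    Example: a minimal sketch of the combining step described in the abstract, in which contiguous recording groups that lie in the same or adjacent regions are merged into a single object before sorting. The `Group` fields, the integer region numbering, and the contiguity test are assumptions for illustration; the actual reading-order search is not modeled.

```python
from dataclasses import dataclass


@dataclass
class Group:
    first_record: int   # first record number of the group (hypothetical field)
    last_record: int    # last record number of the group (hypothetical field)
    region: int         # index of the tape region holding the group


def combine_contiguous(groups):
    """Merge contiguous groups in the same or adjacent regions into runs.

    Each returned run is a list of Group objects that can be treated as one
    object by a later sorting step, so fewer objects need to be sorted.
    """
    runs = []
    for g in sorted(groups, key=lambda grp: grp.first_record):
        if runs:
            tail = runs[-1][-1]
            contiguous = g.first_record == tail.last_record + 1
            near = abs(g.region - tail.region) <= 1
            if contiguous and near:
                runs[-1].append(g)
                continue
        runs.append([g])
    return runs


if __name__ == "__main__":
    groups = [Group(0, 9, 0), Group(10, 19, 0), Group(20, 29, 1),
              Group(50, 59, 1), Group(60, 69, 3)]
    print([len(run) for run in combine_contiguous(groups)])
```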