Patents Examined by Abdelali Serrou

Real-time speech-to-text conversion in an audio conference session

Patent number: 9560206

Abstract: Various embodiments of systems, methods, and computer programs are disclosed for providing real-time resources to participants in an audio conference session. One embodiment is a method for providing real-time resources to participants in an audio conference session via a communication network. One such method comprises: a conferencing system establishing an audio conference session between a plurality of computing devices via a communication network, each computing device generating a corresponding audio stream comprising a speech signal; and in real-time during the audio conference session, a server: receiving and processing the audio streams to determine the speech signals; extracting words from the speech signals; analyzing the extracted words to determine a relevant keyword being discussed in the audio conference session; identifying a resource related to the relevant keyword; and providing the resource to one or more of the computing devices.

Type: Grant

Filed: April 30, 2010

Date of Patent: January 31, 2017

Assignee: American Teleconferencing Services, Ltd.

Inventors: Boland T. Jones, David Michael Guthrie, Laurence Schaefer, J Douglas Martin
System and method for automated adaptation and improvement of speaker authentication in a voice biometric system environment

Patent number: 9553977

Abstract: A system for automated adaptation and improvement of speaker authentication in a voice biometric system environment, comprising a speech sample collector, a target selector, a voice analyzer, a voice data modifier, and a call flow creator. The speech sample collector retrieves speech samples from a database of enrolled participants in a speaker authentication system. The target selector selects target users that will be used to test the speaker authentication system. The voice analyzer extracts a speech component data set from each of the speech samples. The call flow creator creates a plurality of call flows for testing the speaker authentication system, each call flow being either an impostor call flow or a legitimate call flow. The call flows created by the call flow creator are used to test the speaker authentication system.

Type: Grant

Filed: August 24, 2015

Date of Patent: January 24, 2017

Assignee: Cyara Solutions Pty Ltd

Inventor: Alok Kulkarni
Transform audio codec and methods for encoding and decoding a time segment of an audio signal

Patent number: 9546924

Abstract: Methods and devices for efficient encoding/decoding of a time segment of an audio signal. The methods comprise deriving an indicator, z, of the position in a frequency scale of a residual vector associated with the time segment of the audio signal, and deriving a measure, ?, related to the amount of structure of the residual vector. The methods further comprise determining whether a predefined criterion involving the measure ?, the indicator z and a predefined threshold ?, is fulfilled, which corresponds to estimating whether a change of sign of at least some of the non-zero coefficients of the residual vector would be audible after reconstruction of the audio signal time segment. The respective amplitude of the coefficients of the residual vector is encoded, and the signs of the coefficients of the residual vector are encoded only when it is determined that the criterion is fulfilled, and thus that a change of sign would be audible.

Type: Grant

Filed: June 30, 2011

Date of Patent: January 17, 2017

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Volodya Grancharov, Sigurdur Sverrisson
System and method of providing speech processing in user interface

Patent number: 9530415

Abstract: Disclosed are systems, methods and computer-readable media for enabling speech processing in a user interface of a device. The method includes receiving an indication of a field and a user interface of a device, the indication also signaling that speech will follow, receiving the speech from the user at the device, the speech being associated with the field, transmitting the speech as a request to public, common network node that receives and processes speech, processing the transmitted speech and returning text associated with the speech to the device and inserting the text into the field. Upon a second indication from the user, the system processes the text in the field as programmed by the user interface. The present disclosure provides a speech mash up application for a user interface of a mobile or desktop device that does not require expensive speech processing technologies.

Type: Grant

Filed: October 30, 2015

Date of Patent: December 27, 2016

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Jay Wilpon, Giuseppe Di Fabbrizio, Benjamin J. Stern
Sentence level analysis in a reading tutor

Patent number: 9520068

Abstract: A method and related system, computer program product and device for interactively tracking oral reading of text from a document includes recording audio for a sentence read by a user and determining when the user has reached the last word of the sentence. The method also includes providing visual feedback to the user reading on a sentence by sentence level to indicate a current location in the passage.

Type: Grant

Filed: September 10, 2004

Date of Patent: December 13, 2016

Assignee: JTT Holdings, Inc.

Inventors: Valerie L. Beattie, Marilyn Jager Adams
Audio encoding device, method and program, and audio decoding device, method and program

Patent number: 9508350

Abstract: An audio packet error concealment system includes an encoding unit for encoding an audio signal consisting of a plurality of frames, and an auxiliary information encoding unit for estimating and encoding auxiliary information about a temporal change of power of the audio signal. The auxiliary information is used in packet loss concealment in decoding of the audio signal. The auxiliary information about the temporal change of power may contain a parameter that functionally approximates a plurality of powers of subframes shorter than one frame, or may contain information about a vector obtained by vector quantization of a plurality of powers of subframes shorter than one frame.

Type: Grant

Filed: May 21, 2013

Date of Patent: November 29, 2016

Assignee: NTT DOCOMO, INC.

Inventors: Kimitaka Tsutsumi, Kei Kikuiri
Methods, apparatus and computer programs for automatic speech recognition

Patent number: 9502024

Abstract: An automatic speech recognition (ASR) system includes a speech-responsive application and a recognition engine. The ASR system generates user prompts to elicit certain spoken inputs, and the speech-responsive application performs operations when the spoken inputs are recognized. The recognition engine compares sounds within an input audio signal with phones within an acoustic model, to identify candidate matching phones. A recognition confidence score is calculated for each candidate matching phone, and the confidence scores are used to help identify one or more likely sequences of matching phones that appear to match a word within the grammar of the speech-responsive application. The per-phone confidence scores are evaluated against predefined confidence score criteria (for example, identifying scores below a ‘low confidence’ threshold) and the results of the evaluation are used to influence subsequent selection of user prompts.

Type: Grant

Filed: February 26, 2014

Date of Patent: November 22, 2016

Assignee: Nuance Communications, Inc.

Inventors: John Brian Pickering, Timothy David Poultney, Benjamin Terrick Staniford, Matthew Whitbourne
Method and system for handling locale and language in a cloud management system

Patent number: 9501295

Abstract: Provided are a method, system, and computer program product for handling locale and language in a cloud management system, in which a first composite values list of applicable locales and matching languages combinations is generated from at least one language installed on a service management system and at least one locale supported by said service management system. A second composite values list of applicable locales and matching languages combinations is generated as a fall back list based on at least one base language of said service management system and at least one matching locale formed from said at least one base language, if said first composite values list of applicable locales and matching languages is empty. A resulting composite values list of valid locales and languages combinations is provided for further processing.

Type: Grant

Filed: July 2, 2012

Date of Patent: November 22, 2016

Assignee: International Business Machines Corporation

Inventors: Stephane B. Rodet, Torsten Teich
Apparatus and methods to update a language model in a speech recognition system

Patent number: 9489940

Abstract: The technology of the present application provides a method and apparatus to allow for dynamically updating a language model across a large number of similarly situated users. The system identifies individual changes to user profiles and evaluates the change for a broader application, such as, a dialect correction for a speech recognition engine, as administrator for the system identifies similarly situated user profiles and downloads the profile change to effect a dynamic change to the language model of similarly situated users.

Type: Grant

Filed: June 11, 2012

Date of Patent: November 8, 2016

Assignee: NVOQ INCORPORATED

Inventor: Charles Corfield
Audio signal processing apparatus and audio signal processing method

Patent number: 9472197

Abstract: An audio signal processing apparatus that processes a bit stream generated by coding an audio signal on a frame-by-frame basis, the bit stream including, for each frame, coded data representing the audio signal, additional data and attribute information, the audio signal processing apparatus including a decoding unit configured to decode the coded data to generate a decoded signal, a processing unit configured to process the decoded signal, a detection unit configured to detect whether or not there has been a change in the attribute information, and a storage unit, wherein the processing unit is configured to, when the change is not detected, process the decoded signal by using at least two pieces of additional data stored, and when the change is detected, process the decoded signal by using only either additional data before detection of the change or additional data after detection of the change.

Type: Grant

Filed: February 6, 2013

Date of Patent: October 18, 2016

Assignee: SOCIONEXT INC.

Inventors: Shuji Miyasaka, Satoshi Shinzaki, Sin Akamatsu, Shuhei Yamada
Autocorrecting language input for virtual keyboards

Patent number: 9471560

Abstract: Various techniques for autocorrecting virtual keyboard input for various languages (e.g., Japanese, Chinese) are disclosed. In one aspect, a system or process receives a sequence of keyboard events representing keystrokes on a virtual keyboard. A hierarchical data structure is traversed according to the sequence of keyboard events to determine candidate words for the sequence of keyboard events. A word lattice is constructed using a language model, including deriving weights or paths in the word lattice based on candidate word statistics and data from a keyboard error model. The word lattice is searched to determine one or more candidate sentences comprising candidate words based on the path weights. Paths through the word lattice can be pruned (e.g., discarded) to reduce the size and search time of the word lattice.

Type: Grant

Filed: June 1, 2012

Date of Patent: October 18, 2016

Assignee: Apple Inc.

Inventors: Yasuo Kida, Leland Douglas Collins, Jr.
System and method for disambiguating multiple intents in a natural language dialog system

Patent number: 9454960

Abstract: The present invention addresses the deficiencies in the prior art by providing an improved dialog for disambiguating a user utterance containing more than one intent. The invention comprises methods, computer-readable media, and systems for engaging in a dialog. The method embodiment of the invention relates to a method of disambiguating a user utterance containing at least two user intents. The method comprises establishing a confidence threshold for spoken language understanding to encourage that multiple intents are returned, determining whether a received utterance comprises a first intent and a second intent and, if the received utterance contains the first intent and the second intent, disambiguating the first intent and the second intent by presenting a disambiguation sub-dialog wherein the user is offered a choice of which intent to process first, wherein the user is first presented with the intent of the first or second intents having the lowest confidence score.

Type: Grant

Filed: April 13, 2015

Date of Patent: September 27, 2016

Assignee: AT&T Intellectual Property II, L.P.

Inventor: Osamuyimen Thompson Stewart
Quality improvement techniques in an audio encoder

Patent number: 9443525

Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source.

Type: Grant

Filed: June 30, 2014

Date of Patent: September 13, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Methods for improving high frequency reconstruction

Patent number: 9431020

Abstract: The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems utilizing high frequency reconstruction (HFR). It utilizes a detection mechanism on the encoder side to assess what parts of the spectrum will not be correctly reproduced by the HFR method in the decoder. Information on this is efficiently coded and sent to the decoder, where it is combined with the output of the HFR unit.

Type: Grant

Filed: April 18, 2013

Date of Patent: August 30, 2016

Assignee: DOLBY INTERNATIONAL AB

Inventors: Kristofer Kjoerling, Per Ekstrand, Holger Hoerich
Devices and systems for remote control

Patent number: 9396728

Abstract: Remote controllers and systems thereof are disclosed. The remote controller remotely operates a receiving host, in which the receiving host provides voice input and speech recognition functions. The remote controller comprises a first input unit and a second input unit for generating a voice input request and a speech recognition request. The generated voice input and speech recognition requests are then sent to the receiving host, thereby forcing the receiving host to perform the voice input and speech recognition functions.

Type: Grant

Filed: July 22, 2015

Date of Patent: July 19, 2016

Assignee: ASUSTEK COMPUTER INC.

Inventors: Chia-Chen Liu, Yun-Jung Wu, Liang-Yi Huang, Yi-Hsiu Lee
Method and apparatus for detecting a voice activity in an input audio signal

Patent number: 9368112

Abstract: The disclosure provides a method and an apparatus for detecting a voice activity in an input audio signal composed of frames. A noise characteristic of the input signal is determined based on a received frame of the input audio signal. A voice activity detection (VAD) parameter is derived based on the noise characteristic of the input audio signal using an adaptive function. The derived VAD parameter is compared with a threshold value to provide a voice activity detection decision. The input audio signal is processed according to the voice activity detection decision.

Type: Grant

Filed: May 10, 2013

Date of Patent: June 14, 2016

Assignee: HUAWEI TECHNOLOGIES CO., LTD

Inventor: Zhe Wang
Data security system for natural language translation

Patent number: 9317501

Abstract: A method, computer system, and computer program product for translating information. The computer system receives the information for a translation. The computer system identifies portions of the information based on a set of rules for security for the information in response to receiving the information. The computer system sends the portions of the information to a plurality of translation systems. In response to receiving translation results from the plurality of translation systems for respective portions of the information, the computer system combines the translation results for the respective portions to form a consolidated translation of the information.

Type: Grant

Filed: March 12, 2015

Date of Patent: April 19, 2016

Assignee: International Business Machines Corporation

Inventors: Carl J. Kraenzel, David M. Lubensky, Baiju Dhirajlal Mandalia, Cheng Wu
Fast title/summary extraction from long descriptions

Patent number: 9317595

Abstract: Techniques are described herein for automatic generation of a title or summary from a long body of text. A grammatical tree representing one or more sentences of the long body of text is generated. One or more nodes from the grammatical tree are selected to be removed. According to one embodiment, a particular node is selected to be removed based on its position in the grammatical tree and its node-type, where the node type represents a grammatical element of the sentence. Once the particular node is selected, a branch of the tree is cut at the node. After branch has been cut, one or more sub-sentences are generated from the remaining nodes in the grammatical tree. The one or more sub-sentences may be returned as a title or summary.

Type: Grant

Filed: December 6, 2010

Date of Patent: April 19, 2016

Assignee: Yahoo! Inc.

Inventors: Xin Li, Hongjian Zhao
Dialogue detector and correction

Patent number: 9305550

Abstract: An apparatus and method for tracking dialogue and other sound signals in film, television or other systems with multiple channel sound is described. One or more audio channels which is expected to carry the speech of persons appearing in the program or other particular types of sounds is inspected to determine if that channel's audio includes particular sounds such as MUEVs, including phonemes corresponding to human speech patterns. If an improper number of particular sounds such as phonemes are found in the channel(s) an action such as a report, an alarm, a correction, or other action is taken. The inspection of the audio channel(s) may be made in conjunction with the appearance of corresponding images associated with the sound, such as visemes in the video signal, to improve the determination of types of sounds such as phonemes.

Type: Grant

Filed: December 7, 2010

Date of Patent: April 5, 2016

Inventors: J. Carl Cooper, Mirko Vojnovic, Christopher Smith
Translation of text into multiple languages

Patent number: 9304990

Abstract: Methods and systems for translating a text into multiple languages performed by at least one software component executed by at least one processor, comprise: maintaining a translation repository having a plurality of entries associating different types of content with user-specified languages; monitoring the text received by a program to identify one or more types of content and a source language of the text; retrieving the user-specified languages from the translation repository associated with the identified types of content; and for each of the identified types of content, translating the content thereof from the source language to the corresponding user-specified language when the source language is different from the corresponding user-specified language.

Type: Grant

Filed: August 20, 2012

Date of Patent: April 5, 2016

Assignee: International Business Machines Corporation

Inventors: Judith H. Bank, Liam Harpur, Ruthie D. Lyle, Patrick J. O'Sullivan, Lin Sun

prev … 9 10 11 12 13 14 15 16 17 … next