Patents Examined by Abdelali Serrou
  • Patent number: 10026404
    Abstract: A system and method for dynamically selecting speech recognition functionality on a client device for recognizing user speech inputs are disclosed. Such selection may be made based on speech recognition functionalities actually available on the client devices. The speech functionalities that may be dynamically selected may include, without limitation, speech recognition software and/or services, speech libraries, kernel drivers, speech recognition hardware, audio hardware, and/or any other speech functionality available on a client device. User speech inputs may be processed via the selected speech functionality for generating control commands in a virtual space. In some implementations, remote speech recognition support may be evoked when a client device does not have any speech recognition functionality on the client device.
    Type: Grant
    Filed: September 18, 2017
    Date of Patent: July 17, 2018
    Assignee: Electronic Arts Inc.
    Inventors: Kent Wakeford, Clifford J. Harrington
  • Patent number: 9996628
    Abstract: This disclosure includes, for example, methods and computer systems for providing audio-activated resource access for user devices. The computer systems may store instructions to cause the processor to perform operations, comprising capturing audio at a user device. The operations may also comprise using a speaker recognition system to identify a speaker in the transmitted audio and/or using a speech-to-text converter to identify text in the captured audio. The speaker identity or a condensed version of the speaker identity or other metadata along with the speaker identity may be transmitted to a server system to determine a corresponding speaker identity entry. The operations may also comprise receiving a resource corresponding to the identified speaker entry in the server system.
    Type: Grant
    Filed: June 29, 2012
    Date of Patent: June 12, 2018
    Assignee: VERISIGN, INC.
    Inventors: Harshini Ramnath Krishnan, Andrew Fregly
  • Patent number: 9997168
    Abstract: A method and an apparatus for signal extraction of audio signal are provided. An audio signal is converted into a plurality of frames, and the frames are arranged in a chronological order. Spectral data of each of the frames is obtained. The spectral data of each of N frames is extracted in the chronological order, and a spectral connectivity operation is executed for the N frames. Finally, the signal including the frames having the spectral connectivity between adjacent frames in each of the frames is determined as an ideal signal.
    Type: Grant
    Filed: July 14, 2015
    Date of Patent: June 12, 2018
    Assignee: Novatek Microelectronics Corp.
    Inventor: Chung-Chi Hsu
  • Patent number: 9959878
    Abstract: An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.
    Type: Grant
    Filed: June 22, 2016
    Date of Patent: May 1, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Jeffrey Riedmiller, Michael Ward
  • Patent number: 9953147
    Abstract: A computer-implemented system and method for correlating activity within a user interface with special information is provided. A user interface with data entry fields is provided. One or more of the data entry fields is designated for special information. A first screen shot of the user interface is captured, and a second screen shot of the user interface is captured at a later time. The first and second screen shots are compared. A change comprising at least a portion of an entry within one of the data entry fields for special information in the second screen shot is identified between the first and second screen shots. The entry is rendered unintelligible.
    Type: Grant
    Filed: May 19, 2014
    Date of Patent: April 24, 2018
    Assignee: Intellisist, Inc.
    Inventor: G. Kevin Doren
  • Patent number: 9953636
    Abstract: A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model, includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.
    Type: Grant
    Filed: October 9, 2015
    Date of Patent: April 24, 2018
    Inventors: Michael H. Cohen, Shumeet Baluja, Pedro J. Moreno Mengibar
  • Patent number: 9940923
    Abstract: The disclosure relates to systems, methods and apparatus to convert speech to text and vice versa. One apparatus comprises a vocoder, a speech to text conversion engine, a text to speech conversion engine, and a user interface. The vocoder is operable to convert speech signals into packets and convert packets into speech signals. The speech to text conversion engine is operable to convert speech to text. The text to speech conversion engine is operable to convert text to speech. The user interface is operable to receive a user selection of a mode from among a plurality of modes, wherein a first mode enables the speech to text conversion engine, a second mode enables the text to speech conversion engine, and a third mode enables the speech to text conversion engine and the text to speech conversion engine.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: April 10, 2018
    Assignee: QUALCOMM Incorporated
    Inventors: Stephen Molloy, Khaled Helmi El-Maleh
  • Patent number: 9936914
    Abstract: A system and a method for assessing a condition in a subject. Phones from speech of the subject are recognized, one or more prosodic or speech-excitation-source features of the phones are extracted, and an assessment of a condition of the subject, is generated based on a correlation between the features of the phones and the condition.
    Type: Grant
    Filed: August 7, 2017
    Date of Patent: April 10, 2018
    Assignee: Massachusetts Institute of Technology
    Inventors: Thomas F. Quatieri, Jr., Nicolas Malyska, Andrea Carolina Trevino
  • Patent number: 9923938
    Abstract: A computer-implemented method manages drop-ins on conversations near a focal point of proximal activity in a gathering place. One or more processors receive a first set of sensor data from one or more sensors in a gathering place, and then identify a focal point of proximal activity based on the first set of received sensor data received from the one or more sensors. One or more processors characterize a conversation near the focal point based on a second set of received sensor data from the one or more sensors, and then present a characterization of the conversation to an electronic device. One or more processors enable the electronic device to allow a user to drop-in on the conversation.
    Type: Grant
    Filed: July 13, 2015
    Date of Patent: March 20, 2018
    Assignee: International Business Machines Corporation
    Inventors: Rachel K. E. Bellamy, Jonathan H. Connell, II, Robert G. Farrell, Brian P. Gaucher, Jonathan Lenchner, David O. S. Melville, Valentina Salapura
  • Patent number: 9916845
    Abstract: This method for determining alcohol use comprises the steps of: detecting an effective frame in an input audio signal; detecting a difference in signal within the original signal of the effective frame; performing a fast Fourier conversion on the difference signal to be transformed into a frequency domain; detecting high-frequency components within the difference signal subjected to the fast Fourier transform; and determining the state of alcohol use on the basis of a gradient difference between the high-frequency components. Accordingly, the present invention can identify the state and extent of alcohol use by a driver or an operator from a long distance and thus can prevent accidents caused by driving or operating vehicles and machines under the influence of alcohol.
    Type: Grant
    Filed: April 2, 2014
    Date of Patent: March 13, 2018
    Assignee: FOUNDATION OF SOONGSIL UNIVERSITY—INDUSTRY COOPERATION
    Inventors: Myung Jin Bae, Sang Gil Lee, Seong Geon Bae
  • Patent number: 9898723
    Abstract: Embodiments of the invention provide for secure voice authentication through a communication device or access device. Certain embodiments allow for providing a word string to a communication device or authentication device. The communication or authentication device plays a supplemental signal that is unique to a transaction. The communication device or authentication device concurrently records an audio segment originating from the user and the supplemental signal. The audio segment is an attempt by the user to vocally reproduce the word string. The communication device or authentication device sends the concurrently recorded audio segment and supplemental signal, to a computer, where the computer authenticates the user.
    Type: Grant
    Filed: December 18, 2013
    Date of Patent: February 20, 2018
    Assignee: Visa International Service Association
    Inventors: Robert Rutherford, Julian Hua
  • Patent number: 9899021
    Abstract: Features are disclosed for modeling user interaction with a detection system using a stochastic dynamical model in order to determine or adjust detection thresholds. The model may incorporate numerous features, such as the probability of false rejection and false acceptance of a user utterance and the cost associated with each potential action. The model may determine or adjust detection thresholds so as to minimize the occurrence of false acceptances and false rejections while preserving other desirable characteristics. The model may further incorporate background and speaker statistics. Adjustments to the model or other operation parameters can be implemented based on the model, user statistics, and/or additional data.
    Type: Grant
    Filed: December 20, 2013
    Date of Patent: February 20, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Shiv Naga Prasad Vitaladevuni, Bjorn Hoffmeister, Rohit Prasad
  • Patent number: 9888110
    Abstract: A system for automated adaptation and improvement of speaker authentication in a voice biometric system environment, comprising a speech sample collector, a target selector, a voice analyzer, a voice data modifier, and a call flow creator. The speech sample collector retrieves speech samples from a database of enrolled participants in a speaker authentication system. The target selector selects target users that will be used to test the speaker authentication system. The voice analyzer extracts a speech component data set from each of the speech samples. The call flow creator creates a plurality of call flows for testing the speaker authentication system, each call flow being either an impostor call flow or a legitimate call flow. The call flows created by the call flow creator are used to test the speaker authentication system.
    Type: Grant
    Filed: January 24, 2017
    Date of Patent: February 6, 2018
    Assignee: Cyara Solutions Pty Ltd.
    Inventor: Alok Kulkarni
  • Patent number: 9881604
    Abstract: A system and method for identifying special information is provided. Endpoints are defined within a voice recording. One or more of the endpoints are identified within the voice recording and the voice recording is partitioned into segments based on the identified endpoints. Elements of text are identified by applying speech recognition to each of the segments and a list of prompt list candidates are applied to the text elements. The segments with text elements that match one or more prompt list candidates are identified. Portions of the voice recording following the prompt list candidates that include special information are identified and the special information is rendered unintelligible within the voice recording.
    Type: Grant
    Filed: February 9, 2015
    Date of Patent: January 30, 2018
    Assignee: Intellisist, Inc.
    Inventors: Howard M. Lee, Steven Lutz, Gilad Odinak
  • Patent number: 9865240
    Abstract: The personalized content system of the system combines a summary music identification value creation and identification algorithm that represents a mathematical summary music identification of a song, an audio file, or other relational music criteria and data (e.g title, artist, genre, style, beats per minute, etc.) or any combination thereof. The derived value represents the musical taste or style attributes of a song or style. The user can control the system by issuing one of a plurality of commands comprising artist command, song/title command, genre command, and album/filtered list command. These commands then lead to command trees that may be used in a voice controlled system, for example.
    Type: Grant
    Filed: December 19, 2007
    Date of Patent: January 9, 2018
    Assignee: Harman International Industries, Incorporated
    Inventor: Lee Bauer
  • Patent number: 9817808
    Abstract: A method includes translating a source to generate a translated source, extracting a set of terms from one of the source and the translated source comprising at least a first term and a second term related to the first term, comparing the extracted set of terms with at least one translation pair, and determining a correct translation based on the comparison.
    Type: Grant
    Filed: September 29, 2014
    Date of Patent: November 14, 2017
    Assignee: International Business Machines Corporation
    Inventors: Abraham P. Ittycheriah, Cezar Pendus
  • Patent number: 9818418
    Abstract: A method performed in an audio decoder for reconstructing an original audio signal having a lowband portion and a highband portion is disclosed. The method includes receiving an encoded audio signal and extracting reconstruction parameters from the encoded audio signal. The method further includes decoding the encoded audio signal with a core audio decoder to obtain a decoded lowband portion and regenerating the highband portion based at least in part on a cross over frequency and the decoded lowband portion to obtain a regenerated highband portion. The method also includes creating a synthetic sinusoid with a level based at least in part on a spectral envelope value for the particular subband and a noise floor value for the particular subband and adding the synthetic sinusoid to the regenerated highband portion in the particular frequency band specified by the location information.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: November 14, 2017
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Per Ekstrand, Holger Hoerich
  • Patent number: 9818417
    Abstract: A method performed in an audio decoder for reconstructing an original audio signal having a lowband portion and a highband portion is disclosed. The method includes receiving an encoded audio signal and extracting reconstruction parameters from the encoded audio signal. The method further includes decoding the encoded audio signal with a core audio decoder to obtain a decoded lowband portion and regenerating the highband portion based at least in part on a cross over frequency and the decoded lowband portion to obtain a regenerated highband portion. The method also includes creating a synthetic sinusoid with a level based at least in part on a spectral envelope value for the particular subband and a noise floor value for the particular subband and adding the synthetic sinusoid to the regenerated highband portion in the particular frequency band specified by the location information.
    Type: Grant
    Filed: April 20, 2016
    Date of Patent: November 14, 2017
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Per Ekstrand, Holger Hoerich
  • Patent number: 9812142
    Abstract: A method performed in an audio decoder for reconstructing an original audio signal having a lowband portion and a highband portion is disclosed. The method includes receiving an encoded audio signal and extracting reconstruction parameters from the encoded audio signal. The method further includes decoding the encoded audio signal with a core audio decoder to obtain a decoded lowband portion and regenerating the highband portion based at least in part on a cross over frequency and the decoded lowband portion to obtain a regenerated highband portion. The method also includes creating a synthetic sinusoid with a level based at least in part on a spectral envelope value for the particular subband and a noise floor value for the particular subband and adding the synthetic sinusoid to the regenerated highband portion in the particular frequency band specified by the location information.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: November 7, 2017
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Per Ekstrand, Holger Hoerich
  • Patent number: 9805735
    Abstract: An apparatus, method and computer program for generating a wideband signal using a lowband input signal includes a processor for performing a guided bandwidth extension operation using transmitted parameters and a blind bandwidth extension operation only using derived parameters rather than transmitted parameters. To this end, the processor includes a parameter generator for generating the parameters for the blind bandwidth extension operation.
    Type: Grant
    Filed: October 12, 2012
    Date of Patent: October 31, 2017
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Frederik Nagel, Max Neuendorf, Markus Schnell, Markus Multrus