Patents Examined by Daniel Abebe
  • Patent number: 9898536
    Abstract: Methods and systems to perform textual queries on voice communications. The system has an index service for storing a audio content data sets for voice communications. The audio content data sets include at least three audio content data sets for each voice communication. The three audio content data sets include a first audio content data set generated using a speech-to-text conversion technique, a second audio content data set generated using a phoneme lattice technique, and a third audio content data set generated using a keyword identification technique. The system includes a search engine configured to: receive search criteria from a user, the search criteria having at least one keyword; search each of the first, second and third audio content data sets for at least a portion of the plurality of voice communications to identify voice communications matching the search criteria; and combine the voice communications identified by each search to produce a combined list of identified voice communications.
    Type: Grant
    Filed: June 27, 2013
    Date of Patent: February 20, 2018
    Assignees: JAJAH LTD., Telefonica, S.A.
    Inventors: Diego Urdiales Delgado, John Eugene Neystadt
  • Patent number: 9894460
    Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for receiving a voice query at a mobile computing device and generating data that represents content of the voice query. The data is provided to a server system. A textual query that has been determined by a speech recognizer at the server system to be a textual form of at least part of the data is received at the mobile computing device. The textual query is determined to include a carrier phrase of one or more words that is reserved by a first third-party application program installed on the computing device. The first third-party application is selected, from a group of one or more third-party applications, to receive all or a part of the textual query. All or a part of the textual query is provided to the selected first application program.
    Type: Grant
    Filed: June 29, 2016
    Date of Patent: February 13, 2018
    Assignee: Google Inc.
    Inventors: Michael J. LeBeau, John Nicholas Jitkoff, William J. Byrne
  • Patent number: 9886959
    Abstract: A voice encoder/decoder (vocoder) may provide receiving a voice sample and generating zero crossings of the voice sample in response to voice excitation in a first formant and creating a corresponding output signal. Additional operations may include dividing the output signal by two, and sampling the output signal at a predefined frequency such that a resulting combination uses half of a bit rate for an excitation and a remainder for short term spectrum analysis.
    Type: Grant
    Filed: October 9, 2013
    Date of Patent: February 6, 2018
    Assignee: Open Invention Network LLC
    Inventor: Clyde Holmes
  • Patent number: 9886949
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for neural network adaptive beamforming for multichannel speech recognition are disclosed. In one aspect, a method includes the actions of receiving a first channel of audio data corresponding to an utterance and a second channel of audio data corresponding to the utterance. The actions further include generating a first set of filter parameters for a first filter based on the first channel of audio data and the second channel of audio data and a second set of filter parameters for a second filter based on the first channel of audio data and the second channel of audio data. The actions further include generating a single combined channel of audio data. The actions further include inputting the audio data to a neural network. The actions further include providing a transcription for the utterance.
    Type: Grant
    Filed: December 28, 2016
    Date of Patent: February 6, 2018
    Assignee: Google Inc.
    Inventors: Bo Li, Ron J. Weiss, Michiel A. U. Bacchiani, Tara N. Sainath, Kevin William Wilson
  • Patent number: 9881618
    Abstract: A method of controlling a terminal is provided. The method includes analyzing a sensed voice when a voice is sensed, recognizing a context based a result of the analysis, and performing a predetermined control operation based on the recognized context.
    Type: Grant
    Filed: June 19, 2013
    Date of Patent: January 30, 2018
    Assignee: Samsung Electronics Co., Ltd
    Inventors: Yoon Jung Choi, So Yeon Kim, Tae Hyung Kim
  • Patent number: 9875296
    Abstract: Methods, systems, and apparatus for obtaining a resource, identifying a first portion of text of the resource that is characterized as a question, and a second part of text of the resource that is characterized as an answer to the question, identifying an entity that is referenced by one or more terms of the text that is characterized as the question, a relationship type that is referenced by one or more other terms of the text that is characterized as the question, and an entity that is referenced by the text that is characterized as the answer to the question, and adjusting a score for a relationship of the relationship type for the entity that is referenced by the one or more terms of the text that is characterized as the question and the entity that is referenced by the text that is characterized as the answer to the question.
    Type: Grant
    Filed: March 25, 2015
    Date of Patent: January 23, 2018
    Assignee: Google LLC
    Inventors: Wei Lwun Lu, Denis Savenkov, Amarnag Subramanya, Jeffrey Dalton, Evgeniy Gabrilovich, Eugene Agichtein
  • Patent number: 9875077
    Abstract: A personalized car radio system comprising: a remote application server comprising: a collector, being a software that scans web sites continuously, for detecting content that corresponds to keywords expressing a driver's preferences; a client application, being a software that schedules displaying collected content in accordance with an alertness rank of the driver and a rhythm of the content; and a client device interacting with the application server by Unicast communication, the client device comprising: a safety module, being a software activated continuously or intermittently, for determining an alertness rank according to (a) metered movement of an organ of the driver, and (b) road condition; and a sounding device and a user interface thereof, for sounding the scheduled content; and a text-to-speech converter, being executed either on the server or the client device, for converting text files to audio files.
    Type: Grant
    Filed: August 10, 2016
    Date of Patent: January 23, 2018
    Inventors: Haim Nachum Markovitz, Alon Markovitz
  • Patent number: 9870772
    Abstract: A guiding device, a guiding method, a program, and an information storage medium are provided which can perform output control of a guidance related to a volume at which to input voice using the recognition ranking of a received voice. A voice receiving section (46) receives a voice. When given information is identified as a result of recognition of the voice, an output control section (58) performs control so as to output a guidance related to a volume at which to input voice in a mode corresponding to the recognition ranking of the information.
    Type: Grant
    Filed: May 1, 2015
    Date of Patent: January 16, 2018
    Assignee: Sony Interactive Entertainment Inc.
    Inventor: Kotaro Imamura
  • Patent number: 9864478
    Abstract: A computer readable medium containing a set of instructions that causes a computer to perform a process comprised of receiving one or more media files. The one or more media files having one or more scenes and each scene including a starting time point and ending time point. The set of instructions may include changing the starting time point and/or the ending time point of a scene from the one or more scenes in response to an input command. The set of instructions may create a new scene and save the new scene based on the new starting time point and/or ending time point of the scene.
    Type: Grant
    Filed: October 15, 2013
    Date of Patent: January 9, 2018
    Assignee: Thomas Majchrowski & Associates, Inc.
    Inventor: Keri DeWitt
  • Patent number: 9858347
    Abstract: Methods, systems, and computer readable medium for providing translated web content. A request is received from a user for content in a second language translated from content in a first language from a first Internet source. The content in the first language is obtained and divided into one or more translatable components. Whether the one or more translatable components have been previously translated, via at least one of machine translation, human translation, and a combination thereof, into the second language and stored as translated components in a storage is determined. If there are one or more translatable components previously translated and stored, the content is generated in the second language by modifying the content in the first language so that at least some translatable components are replaced with corresponding translated components and sent in the second language to the user as a response to the request.
    Type: Grant
    Filed: August 4, 2015
    Date of Patent: January 2, 2018
    Assignee: MOTIONPOINT CORPORATION
    Inventors: Enrique Travieso, Adam Rubenstein, William Fleming, Charles Whiteman, Eugenio Alvarez, Arcadio Andrade, Collin Birdsey
  • Patent number: 9858940
    Abstract: In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.
    Type: Grant
    Filed: March 31, 2016
    Date of Patent: January 2, 2018
    Assignee: Dolby International AB
    Inventors: Barbara Resch, Kristofer Kjörling, Lars Villemoes
  • Patent number: 9852736
    Abstract: Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.
    Type: Grant
    Filed: April 4, 2016
    Date of Patent: December 26, 2017
    Assignee: Digimarc Corporation
    Inventors: Ravi K. Sharma, Brett A. Bradley, Yang Bai, Shankar Thagadur Shivappa, Ajith Kamath, Aparna Gurijala, David A. Cushman
  • Patent number: 9836529
    Abstract: A system for performing semantic search receives an electronic text corpus and separates the text corpus into a plurality of sentences. The system parses and converts each sentence into a sentence tree. The system receives a search query and matches the search query with one or more of the sentence trees.
    Type: Grant
    Filed: March 10, 2015
    Date of Patent: December 5, 2017
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Vladimir Zelevinsky, Yevgeniy Dashevsky, Diana Ye
  • Patent number: 9837090
    Abstract: An apparatus and method for encoding and decoding a signal for high frequency bandwidth extension are provided. An encoding apparatus may down-sample a time domain input signal, may core-encode the down-sampled time domain input signal, may transform the core-encoded time domain input signal to a frequency domain input signal, and may perform bandwidth extension encoding using a basic signal of the frequency domain input signal.
    Type: Grant
    Filed: November 6, 2015
    Date of Patent: December 5, 2017
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki Hyun Choo, Eun Mi Oh, Ho Sang Sung
  • Patent number: 9830923
    Abstract: In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.
    Type: Grant
    Filed: November 20, 2015
    Date of Patent: November 28, 2017
    Assignee: Dolby International AB
    Inventors: Barbara Resch, Kristofer Kjörling, Lars Villemoes
  • Patent number: 9830913
    Abstract: A microphone assembly includes an acoustic sensor and a voice activity detector on an integrated circuit coupled to an external-device interface. The acoustic sensor produces an electrical signal representative of acoustic energy detected by the sensor. A filter bank separates data representative of the acoustic energy into a plurality of frequency bands. A power tracker obtains a power estimate for at least one band, including a first estimate based on relatively fast changes in a power metric of the data and a second estimate based on relatively slow changes in a power metric of the data. The presence of voice activity in the electrical signal is based upon the power estimate.
    Type: Grant
    Filed: September 22, 2015
    Date of Patent: November 28, 2017
    Assignee: Knowles Electronics, LLC
    Inventors: Henrik Thomsen, Dibyendu Nandy
  • Patent number: 9824685
    Abstract: A handsfree device, which is coupled to a data processing device, may be operable to monitor at least one audio stream for occurrence of at least one keyword. Upon recognition of the at least one keyword, the handsfree device may establish a first connection between the handsfree device and the data processing device for launching a voice interface in the data processing device. The handsfree device may send audio data received after the recognition of the at least one keyword to the data processing device, via the first connection for responding to the audio data via the voice interface. During a keyword configuration operation, the handsfree device may send at least one inputted keyword to the data processing device for recording. The handsfree device may receive, via a second connection, the recorded at least one keyword from the data processing device for keyword configuration of the handsfree device.
    Type: Grant
    Filed: November 13, 2015
    Date of Patent: November 21, 2017
    Assignee: Google Inc.
    Inventor: John Richard Stracke, Jr.
  • Patent number: 9818429
    Abstract: A method and apparatus to encoding or decoding an audio signal is provided. In the method and apparatus, a noise-floor level to use in encoding or decoding a high frequency signal is updated according to the degree of a voiced or unvoiced sound included in the signal.
    Type: Grant
    Filed: October 9, 2015
    Date of Patent: November 14, 2017
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki-hyun Choo, Eun-mi Oh, Ho-sang Sung, Jung-Hoe Kim, Mi-young Kim
  • Patent number: 9812109
    Abstract: System, apparatus and method for determining semantic information from audio, where incoming audio is sampled and processed to extract audio features, including temporal, spectral, harmonic and rhythmic features. The extracted audio features are compared to stored audio templates that include ranges and/or values for certain features and are tagged for specific ranges and/or values. Extracted audio features that are most similar to one or more templates from the comparison are identified according to the tagged information. The tags are used to determine the semantic audio data that includes genre, instrumentation, style, acoustical dynamics, and emotive descriptor for the audio signal.
    Type: Grant
    Filed: October 16, 2015
    Date of Patent: November 7, 2017
    Assignee: THE NIELSEN COMPANY (US), LLC
    Inventors: Alan Neuhauser, John Stavropoulos
  • Patent number: 9799333
    Abstract: A system and method are provided for performing speech processing. A system includes an audio detection system configured to receive a signal including speech and a memory having stored therein a database of keyword models forming an ensemble of filters associated with each keyword in the database. A processor is configured to receive the signal including speech from the audio detection system, decompose the signal including speech into a sparse set of phonetic impulses, and access the database of keywords and convolve the sparse set of phonetic impulses with the ensemble of filters. The processor is further configured to identify keywords within the signal including speech based a result of the convolution and control operation the electronic system based on the keywords identified.
    Type: Grant
    Filed: August 31, 2015
    Date of Patent: October 24, 2017
    Assignee: The Johns Hopkins University
    Inventors: Keith Kintzley, Aren Jansen, Hynek Hermansky, Kenneth Church