Patents Examined by Rodrigo A Chavez
  • Patent number: 10169335
    Abstract: Embodiments described herein provide approaches for validating synonyms in ontology driven natural language processing. Specifically, an approach is provided for receiving a user input containing a token, structuring the user input into a semantic model comprising a set of classes each containing a set of related permutations of the token, designating the token as a synonym of one of the set of related permutations, annotating the token with a class from the set of classes corresponding to the one of the set of related permutations, and validating the annotation of the token by determining an accuracy of the designation of the token as a synonym of the one of the set of related permutations. In one embodiment, the accuracy is determined by quantifying a linear distance between the token and a contextual token also within the user input, and comparing the linear distance to a pre-specified linear distance limit.
    Type: Grant
    Filed: April 5, 2016
    Date of Patent: January 1, 2019
    Assignee: International Business Machines Corporation
    Inventors: Stephen J. Edwards, Ahmed M. Nassar, Craig M. Trim, Albert T. Wong
  • Patent number: 10083155
    Abstract: A system for detecting an original language of a translated document retrieves the translated document, and identifies a language of the retrieved document. The system calculates a language model for the language of the retrieved document (LM(RD)). The system calculates a distinct vector as a difference between LM(RD) and a common language model for the language of the retrieved document (LMT(RD)). The system obtains pair vectors for language model pairs associated with the language of the retrieved document, and calculates a vector distance between the distinct vector and each pair vector (or between the (LM(RD)) and each pair vector). The system identifies a given pair vector within a threshold vector distance, and calculates the confidence score. The system then identifies the original language corresponding to the given pair vector as the original language of the retrieved document, and retrieves an original document in the original language of the retrieved document.
    Type: Grant
    Filed: May 17, 2016
    Date of Patent: September 25, 2018
    Assignee: International Business Machines Corporation
    Inventors: Nadiya Kochura, Fang Lu, Sneha Palarapu, Tejaswini K. Ranadive, Anupriya Ray
  • Patent number: 10049668
    Abstract: Systems and processes for converting speech-to-text are provided. In one example process, speech input can be received. A sequence of states and arcs of a weighted finite state transducer (WFST) can be traversed. A negating finite state transducer (FST) can be traversed. A virtual FST can be composed using a neural network language model and based on the sequence of states and arcs of the WFST. The one or more virtual states of the virtual FST can be traversed to determine a probability of a candidate word given one or more history candidate words. Text corresponding to the speech input can be determined based on the probability of the candidate word given the one or more history candidate words. An output can be provided based on the text corresponding to the speech input.
    Type: Grant
    Filed: May 16, 2016
    Date of Patent: August 14, 2018
    Assignee: Apple Inc.
    Inventors: Rongqing Huang, Ilya Oparin
  • Patent number: 10026405
    Abstract: Disclosed is a speaker diarization process for determining which speaker is speaking at what time during the course of a conversation. The entire process can be most easily described in five main parts: Segmentation where speech/non-speech decisions are made; frame feature extraction where useful information is obtained from the frames; segment modeling where the information from the frame feature extraction is combined with segment start and end time information to create segment specific features; speaker decisions when the segments are clustered to create speaker models; and corrections where frame level corrections are applied to the information extracted.
    Type: Grant
    Filed: May 3, 2016
    Date of Patent: July 17, 2018
    Assignee: SESTEK Ses velletisim Bilgisayar Tekn. San. Ve Tic A.S.
    Inventors: Mustafa Levent Arslan, Mustafa Erden, Sedat Demirba{hacek over (g)}, Gökçe Sarar
  • Patent number: 10014007
    Abstract: A method is presented for forming the excitation signal for a glottal pulse model based parametric speech synthesis system. In one embodiment, fundamental frequency values are used to form the excitation signal. The excitation is modeled using a voice source pulse selected from a database of a given speaker. The voice source signal is segmented into glottal segments, which are used in vector representation to identify the glottal pulse used for formation of the excitation signal. Use of a novel distance metric and preserving the original signals extracted from the speakers voice samples helps capture low frequency information of the excitation signal. In addition, segment edge artifacts are removed by applying a unique segment joining method to improve the quality of synthetic speech while creating a true representation of the voice quality of a speaker.
    Type: Grant
    Filed: May 28, 2014
    Date of Patent: July 3, 2018
    Inventors: Rajesh Dachiraju, Aravind Ganapathiraju
  • Patent number: 9984679
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.
    Type: Grant
    Filed: July 18, 2016
    Date of Patent: May 29, 2018
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra
  • Patent number: 9979723
    Abstract: Obtaining and/or validating user credentials at client devices is described. A phrase may be generated based on one or more index values determined according to a function of time and a credential identifier identifying a user credential. The phrase may be output by the client device for validating the user credential.
    Type: Grant
    Filed: February 4, 2016
    Date of Patent: May 22, 2018
    Assignee: MicroStrategy Incorporated
    Inventors: Michael J. Saylor, Gang Chen, Kirill Butin, Roman Zolin, Hector Vazquez
  • Patent number: 9980074
    Abstract: In general, techniques are described for determining quantization step sizes for compression of spatial components of a sound field. A device comprising one or more processors may be configured to perform the techniques. In other words, the one or more processors may be configured to determine a quantization step size to be used when compressing a spatial component of a sound field, where the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.
    Type: Grant
    Filed: May 28, 2014
    Date of Patent: May 22, 2018
    Assignee: QUALCOMM Incorporated
    Inventors: Dipanjan Sen, Sang-Uk Ryu
  • Patent number: 9905250
    Abstract: A voice detection method which makes it possible to detect the presence of voice signals in an noisy acoustic signal x(t) from a microphone, including the following consecutive steps: calculating a detection function FD(?) based on calculating a difference function D(?) varying in accordance with the shift ? on an integration window with length W starting at the time t0, with: a step of adapting the threshold in said current interval, in accordance with values calculated from the acoustic signal x(t) established in said current interval; searching for the minimum of the detection function FD(?) and comparing the minimum with a threshold, for (?) varying in a predetermined time interval referred to as current interval so as to detect the possible presence of a fundamental frequency F0 that is characteristic of a voice signal in said current interval.
    Type: Grant
    Filed: November 27, 2014
    Date of Patent: February 27, 2018
    Assignee: ADEUNIS R F
    Inventor: Karim Maouche
  • Patent number: 9881001
    Abstract: An image processing device, comprises: an input part for inputting image data; a word extracting part for extracting a word from texts contained in the image data; a synonym obtaining part for obtaining a synonym corresponds to the word, and for associating the obtained synonym with the word; a position identifying part for identifying a display position on the image data of the word with which the synonym is associated; a layer creating part for creating an accompanying layer to add to an original layer, which is the image data containing the word, and for embedding the synonym associated with the word within a position on the accompanying layer the same as the display position identified by the position identifying part; and an output image generating part for generating output image data including the original layer containing the word and the accompanying layer within which the synonym is embedded.
    Type: Grant
    Filed: June 13, 2013
    Date of Patent: January 30, 2018
    Assignee: KONICA MINOLTA, INC.
    Inventors: Katsuaki Wakui, Hideyuki Hashimoto, Takahiro Tsutsumi
  • Patent number: 9865250
    Abstract: A system and method for navigating secondary content. The system may monitor for gestures input to the system by an input device and may detect an arc gesture. The arc gesture may travel along both a horizontal axis and a vertical axis from a first point to a second point and may be delineated from a horizontal or a vertical motion. The system may identify secondary content corresponding to the arc gesture in response to the arc gesture and output data corresponding to the secondary content. The system may identify supplemental text associated with the secondary content and synthesize supplemental speech corresponding to the supplemental text. The output data may include audio including the synthesized supplemental speech.
    Type: Grant
    Filed: September 29, 2014
    Date of Patent: January 9, 2018
    Assignee: Amazon Technologies, Inc.
    Inventor: Peter Alex Korn
  • Patent number: 9858919
    Abstract: A method includes providing a deep neural network acoustic model, receiving audio data including one or more utterances of a speaker, extracting a plurality of speech recognition features from the one or more utterances of the speaker, creating a speaker identity vector for the speaker based on the extracted speech recognition features, and adapting the deep neural network acoustic model for automatic speech recognition using the extracted speech recognition features and the speaker identity vector.
    Type: Grant
    Filed: September 29, 2014
    Date of Patent: January 2, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: George A. Saon
  • Patent number: 9830314
    Abstract: Mechanisms are provided for performing tabular data correction in a document. The mechanisms receive a natural language document comprising a portion of content and analyze the portion of content within the natural language document to identify an erroneous sub-portion comprising an erroneous or missing item of information. The mechanisms generate a semantic signature for the erroneous sub-portion and generate a query based on the semantic signature. The mechanisms apply the query to a knowledge base to identify a candidate sub-portion of content. The mechanisms correct the erroneous sub-portion using the identified candidate sub-portion of content to generate a corrected natural language document.
    Type: Grant
    Filed: November 18, 2013
    Date of Patent: November 28, 2017
    Assignee: International Business Machines Corporation
    Inventors: Donna K. Byron, Alexander Pikovsky, Abhishek Shivkumar, Timothy P. Winkler
  • Patent number: 9747917
    Abstract: Methods and systems are provided for receiving desired sounds. The system includes a position sensor configured to determine an occupant position of an occupant engaging in speech within a defined space and transmit the speaking occupant position. A plurality of microphones are configured to receive sound from within the defined space and transmit audio signals corresponding to the received sound. A processor, in communication with the position sensor and the microphones, is configured to receive the speaking occupant position and the audio signals, apply a beamformer to the audio signals to direct a microphone beam toward the occupant position, and generate a beamformer output signal.
    Type: Grant
    Filed: June 14, 2013
    Date of Patent: August 29, 2017
    Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC
    Inventors: Eli Tzirkel-Hancock, Igal Bilik, Moshe Laifenfeld
  • Patent number: 9734820
    Abstract: A system, method and computer-readable storage device which balance latency and accuracy of machine translations by segmenting the speech upon locating a conjunction. The system, upon receiving speech, will buffer speech until a conjunction is detected. Upon detecting a conjunction, the speech received until that point is segmented. The system then continues performing speech recognition on the segment, searching for the next conjunction, while simultaneously initiating translation of the segment. Upon translating the segment, the system converts the translation to a speech output, allowing a user to hear an audible translation of the speech originally heard.
    Type: Grant
    Filed: November 14, 2013
    Date of Patent: August 15, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, John Chen
  • Patent number: 9679559
    Abstract: A method estimates source signals from a mixture of source signals by first training an analysis model and a reconstruction model using training data. The analysis model is applied to the mixture of source signals to obtain an analysis representation of the mixture of source signals, and the reconstruction model is applied to the analysis representation to obtain an estimate of the source signals, wherein the analysis model utilizes an analysis linear basis representation, and the reconstruction model utilizes a reconstruction linear basis representation.
    Type: Grant
    Filed: May 29, 2014
    Date of Patent: June 13, 2017
    Assignee: Mitsubishi Electric Research Laboratories, Inc.
    Inventors: Jonathan Le Roux, John R. Hershey, Felix Weninger, Shinji Watanabe
  • Patent number: 9666178
    Abstract: A device for aiding communication in the aeronautical domain, wherein the device includes a transceiver and data processor assembly that records audio messages corresponding to all the incoming and outgoing audio communications, transcribes the messages, in real time, into textual messages, displays the textual messages, and enables an audio play back of the audio messages.
    Type: Grant
    Filed: June 11, 2013
    Date of Patent: May 30, 2017
    Assignee: Airbus S.A.S.
    Inventors: Vincent Loubiere, Netra Gowda
  • Patent number: 9646626
    Abstract: Disclosed herein are systems, methods, and computer-readable storage devices for processing audio signals. An example system configured to practice the method receives audio at a device to be transmitted to a remote speech processing system. The system analyzes one of noise conditions, need for an enhanced speech quality, and network load to yield an analysis. Based on the analysis, the system determines to bypass user-defined options for enhancing audio for speech processing. Then, based on the analysis, the system can modify an audio transmission parameter used to transmit the audio from the device to the remote speech processing system. The audio transmission parameter can be one of an amount of coding, a chosen codec, an amount of coding, or a number of audio channels, for example.
    Type: Grant
    Filed: November 22, 2013
    Date of Patent: May 9, 2017
    Assignees: AT&T Intellectual Property I, L.P., AT&T Mobility II LLC
    Inventors: Dimitrios Dimitriadis, John Crockett, Horst Juergen Schroeter
  • Patent number: 9633003
    Abstract: A system and computer product for validating the consistency between quantitative and natural language textual evaluations. An example method involves computing a numeric score for a textual evaluation, comparing the numeric score to a quantitative evaluation, and producing a rating based on the similarity of the two evaluations.
    Type: Grant
    Filed: December 18, 2012
    Date of Patent: April 25, 2017
    Assignee: International Business Machines Corporation
    Inventors: Danny Soroker, Justin D. Weisz
  • Patent number: 9626356
    Abstract: A system and computer product for validating the consistency between quantitative and natural language textual evaluations. An example method involves computing a numeric score for a textual evaluation, comparing the numeric score to a quantitative evaluation, and producing a rating based on the similarity of the two evaluations.
    Type: Grant
    Filed: September 24, 2013
    Date of Patent: April 18, 2017
    Assignee: International Business Machines Corporation
    Inventors: Danny Soroker, Justin D. Weisz