Patents Examined by Rodrigo A Chavez

Contextual validation of synonyms in otology driven natural language processing

Patent number: 10169335

Abstract: Embodiments described herein provide approaches for validating synonyms in ontology driven natural language processing. Specifically, an approach is provided for receiving a user input containing a token, structuring the user input into a semantic model comprising a set of classes each containing a set of related permutations of the token, designating the token as a synonym of one of the set of related permutations, annotating the token with a class from the set of classes corresponding to the one of the set of related permutations, and validating the annotation of the token by determining an accuracy of the designation of the token as a synonym of the one of the set of related permutations. In one embodiment, the accuracy is determined by quantifying a linear distance between the token and a contextual token also within the user input, and comparing the linear distance to a pre-specified linear distance limit.

Type: Grant

Filed: April 5, 2016

Date of Patent: January 1, 2019

Assignee: International Business Machines Corporation

Inventors: Stephen J. Edwards, Ahmed M. Nassar, Craig M. Trim, Albert T. Wong
Method for detecting original language of translated document

Patent number: 10083155

Abstract: A system for detecting an original language of a translated document retrieves the translated document, and identifies a language of the retrieved document. The system calculates a language model for the language of the retrieved document (LM(RD)). The system calculates a distinct vector as a difference between LM(RD) and a common language model for the language of the retrieved document (LMT(RD)). The system obtains pair vectors for language model pairs associated with the language of the retrieved document, and calculates a vector distance between the distinct vector and each pair vector (or between the (LM(RD)) and each pair vector). The system identifies a given pair vector within a threshold vector distance, and calculates the confidence score. The system then identifies the original language corresponding to the given pair vector as the original language of the retrieved document, and retrieves an original document in the original language of the retrieved document.

Type: Grant

Filed: May 17, 2016

Date of Patent: September 25, 2018

Assignee: International Business Machines Corporation

Inventors: Nadiya Kochura, Fang Lu, Sneha Palarapu, Tejaswini K. Ranadive, Anupriya Ray
Applying neural network language models to weighted finite state transducers for automatic speech recognition

Patent number: 10049668

Abstract: Systems and processes for converting speech-to-text are provided. In one example process, speech input can be received. A sequence of states and arcs of a weighted finite state transducer (WFST) can be traversed. A negating finite state transducer (FST) can be traversed. A virtual FST can be composed using a neural network language model and based on the sequence of states and arcs of the WFST. The one or more virtual states of the virtual FST can be traversed to determine a probability of a candidate word given one or more history candidate words. Text corresponding to the speech input can be determined based on the probability of the candidate word given the one or more history candidate words. An output can be provided based on the text corresponding to the speech input.

Type: Grant

Filed: May 16, 2016

Date of Patent: August 14, 2018

Assignee: Apple Inc.

Inventors: Rongqing Huang, Ilya Oparin
Method for speaker diarization

Patent number: 10026405

Abstract: Disclosed is a speaker diarization process for determining which speaker is speaking at what time during the course of a conversation. The entire process can be most easily described in five main parts: Segmentation where speech/non-speech decisions are made; frame feature extraction where useful information is obtained from the frames; segment modeling where the information from the frame feature extraction is combined with segment start and end time information to create segment specific features; speaker decisions when the segments are clustered to create speaker models; and corrections where frame level corrections are applied to the information extracted.

Type: Grant

Filed: May 3, 2016

Date of Patent: July 17, 2018

Assignee: SESTEK Ses velletisim Bilgisayar Tekn. San. Ve Tic A.S.

Inventors: Mustafa Levent Arslan, Mustafa Erden, Sedat Demirba{hacek over (g)}, Gökçe Sarar
Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Patent number: 10014007

Abstract: A method is presented for forming the excitation signal for a glottal pulse model based parametric speech synthesis system. In one embodiment, fundamental frequency values are used to form the excitation signal. The excitation is modeled using a voice source pulse selected from a database of a given speaker. The voice source signal is segmented into glottal segments, which are used in vector representation to identify the glottal pulse used for formation of the excitation signal. Use of a novel distance metric and preserving the original signals extracted from the speakers voice samples helps capture low frequency information of the excitation signal. In addition, segment edge artifacts are removed by applying a unique segment joining method to improve the quality of synthetic speech while creating a true representation of the voice quality of a speaker.

Type: Grant

Filed: May 28, 2014

Date of Patent: July 3, 2018

Inventors: Rajesh Dachiraju, Aravind Ganapathiraju
System and method for optimizing speech recognition and natural language parameters with user feedback

Patent number: 9984679

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

Type: Grant

Filed: July 18, 2016

Date of Patent: May 29, 2018

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra
User credentials

Patent number: 9979723

Abstract: Obtaining and/or validating user credentials at client devices is described. A phrase may be generated based on one or more index values determined according to a function of time and a credential identifier identifying a user credential. The phrase may be output by the client device for validating the user credential.

Type: Grant

Filed: February 4, 2016

Date of Patent: May 22, 2018

Assignee: MicroStrategy Incorporated

Inventors: Michael J. Saylor, Gang Chen, Kirill Butin, Roman Zolin, Hector Vazquez
Quantization step sizes for compression of spatial components of a sound field

Patent number: 9980074

Abstract: In general, techniques are described for determining quantization step sizes for compression of spatial components of a sound field. A device comprising one or more processors may be configured to perform the techniques. In other words, the one or more processors may be configured to determine a quantization step size to be used when compressing a spatial component of a sound field, where the spatial component generated by performing a vector based synthesis with respect to a plurality of spherical harmonic coefficients.

Type: Grant

Filed: May 28, 2014

Date of Patent: May 22, 2018

Assignee: QUALCOMM Incorporated

Inventors: Dipanjan Sen, Sang-Uk Ryu
Voice detection method

Patent number: 9905250

Abstract: A voice detection method which makes it possible to detect the presence of voice signals in an noisy acoustic signal x(t) from a microphone, including the following consecutive steps: calculating a detection function FD(?) based on calculating a difference function D(?) varying in accordance with the shift ? on an integration window with length W starting at the time t0, with: a step of adapting the threshold in said current interval, in accordance with values calculated from the acoustic signal x(t) established in said current interval; searching for the minimum of the detection function FD(?) and comparing the minimum with a threshold, for (?) varying in a predetermined time interval referred to as current interval so as to detect the possible presence of a fundamental frequency F0 that is characteristic of a voice signal in said current interval.

Type: Grant

Filed: November 27, 2014

Date of Patent: February 27, 2018

Assignee: ADEUNIS R F

Inventor: Karim Maouche
Image processing device, image processing method and non-transitory computer readable recording medium

Patent number: 9881001

Abstract: An image processing device, comprises: an input part for inputting image data; a word extracting part for extracting a word from texts contained in the image data; a synonym obtaining part for obtaining a synonym corresponds to the word, and for associating the obtained synonym with the word; a position identifying part for identifying a display position on the image data of the word with which the synonym is associated; a layer creating part for creating an accompanying layer to add to an original layer, which is the image data containing the word, and for embedding the synonym associated with the word within a position on the accompanying layer the same as the display position identified by the position identifying part; and an output image generating part for generating output image data including the original layer containing the word and the accompanying layer within which the synonym is embedded.

Type: Grant

Filed: June 13, 2013

Date of Patent: January 30, 2018

Assignee: KONICA MINOLTA, INC.

Inventors: Katsuaki Wakui, Hideyuki Hashimoto, Takahiro Tsutsumi
Audibly indicating secondary content with spoken text

Patent number: 9865250

Abstract: A system and method for navigating secondary content. The system may monitor for gestures input to the system by an input device and may detect an arc gesture. The arc gesture may travel along both a horizontal axis and a vertical axis from a first point to a second point and may be delineated from a horizontal or a vertical motion. The system may identify secondary content corresponding to the arc gesture in response to the arc gesture and output data corresponding to the secondary content. The system may identify supplemental text associated with the secondary content and synthesize supplemental speech corresponding to the supplemental text. The output data may include audio including the synthesized supplemental speech.

Type: Grant

Filed: September 29, 2014

Date of Patent: January 9, 2018

Assignee: Amazon Technologies, Inc.

Inventor: Peter Alex Korn
Speaker adaptation of neural network acoustic models using I-vectors

Patent number: 9858919

Abstract: A method includes providing a deep neural network acoustic model, receiving audio data including one or more utterances of a speaker, extracting a plurality of speech recognition features from the one or more utterances of the speaker, creating a speaker identity vector for the speaker based on the extracted speech recognition features, and adapting the deep neural network acoustic model for automatic speech recognition using the extracted speech recognition features and the speaker identity vector.

Type: Grant

Filed: September 29, 2014

Date of Patent: January 2, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: George A. Saon
Error correction in tables using a question and answer system

Patent number: 9830314

Abstract: Mechanisms are provided for performing tabular data correction in a document. The mechanisms receive a natural language document comprising a portion of content and analyze the portion of content within the natural language document to identify an erroneous sub-portion comprising an erroneous or missing item of information. The mechanisms generate a semantic signature for the erroneous sub-portion and generate a query based on the semantic signature. The mechanisms apply the query to a knowledge base to identify a candidate sub-portion of content. The mechanisms correct the erroneous sub-portion using the identified candidate sub-portion of content to generate a corrected natural language document.

Type: Grant

Filed: November 18, 2013

Date of Patent: November 28, 2017

Assignee: International Business Machines Corporation

Inventors: Donna K. Byron, Alexander Pikovsky, Abhishek Shivkumar, Timothy P. Winkler
Position directed acoustic array and beamforming methods

Patent number: 9747917

Abstract: Methods and systems are provided for receiving desired sounds. The system includes a position sensor configured to determine an occupant position of an occupant engaging in speech within a defined space and transmit the speaking occupant position. A plurality of microphones are configured to receive sound from within the defined space and transmit audio signals corresponding to the received sound. A processor, in communication with the position sensor and the microphones, is configured to receive the speaking occupant position and the audio signals, apply a beamformer to the audio signals to direct a microphone beam toward the occupant position, and generate a beamformer output signal.

Type: Grant

Filed: June 14, 2013

Date of Patent: August 29, 2017

Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Eli Tzirkel-Hancock, Igal Bilik, Moshe Laifenfeld
System and method for translating real-time speech using segmentation based on conjunction locations

Patent number: 9734820

Abstract: A system, method and computer-readable storage device which balance latency and accuracy of machine translations by segmenting the speech upon locating a conjunction. The system, upon receiving speech, will buffer speech until a conjunction is detected. Upon detecting a conjunction, the speech received until that point is segmented. The system then continues performing speech recognition on the segment, searching for the next conjunction, while simultaneously initiating translation of the segment. Upon translating the segment, the system converts the translation to a speech output, allowing a user to hear an audible translation of the speech originally heard.

Type: Grant

Filed: November 14, 2013

Date of Patent: August 15, 2017

Assignee: Nuance Communications, Inc.

Inventors: Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, John Chen
Source signal separation by discriminatively-trained non-negative matrix factorization

Patent number: 9679559

Abstract: A method estimates source signals from a mixture of source signals by first training an analysis model and a reconstruction model using training data. The analysis model is applied to the mixture of source signals to obtain an analysis representation of the mixture of source signals, and the reconstruction model is applied to the analysis representation to obtain an estimate of the source signals, wherein the analysis model utilizes an analysis linear basis representation, and the reconstruction model utilizes a reconstruction linear basis representation.

Type: Grant

Filed: May 29, 2014

Date of Patent: June 13, 2017

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Jonathan Le Roux, John R. Hershey, Felix Weninger, Shinji Watanabe
Device for aiding communication in the aeronautical domain

Patent number: 9666178

Abstract: A device for aiding communication in the aeronautical domain, wherein the device includes a transceiver and data processor assembly that records audio messages corresponding to all the incoming and outgoing audio communications, transcribes the messages, in real time, into textual messages, displays the textual messages, and enables an audio play back of the audio messages.

Type: Grant

Filed: June 11, 2013

Date of Patent: May 30, 2017

Assignee: Airbus S.A.S.

Inventors: Vincent Loubiere, Netra Gowda
System and method for network bandwidth management for adjusting audio quality

Patent number: 9646626

Abstract: Disclosed herein are systems, methods, and computer-readable storage devices for processing audio signals. An example system configured to practice the method receives audio at a device to be transmitted to a remote speech processing system. The system analyzes one of noise conditions, need for an enhanced speech quality, and network load to yield an analysis. Based on the analysis, the system determines to bypass user-defined options for enhancing audio for speech processing. Then, based on the analysis, the system can modify an audio transmission parameter used to transmit the audio from the device to the remote speech processing system. The audio transmission parameter can be one of an amount of coding, a chosen codec, an amount of coding, or a number of audio channels, for example.

Type: Grant

Filed: November 22, 2013

Date of Patent: May 9, 2017

Assignees: AT&T Intellectual Property I, L.P., AT&T Mobility II LLC

Inventors: Dimitrios Dimitriadis, John Crockett, Horst Juergen Schroeter
System support for evaluation consistency

Patent number: 9633003

Abstract: A system and computer product for validating the consistency between quantitative and natural language textual evaluations. An example method involves computing a numeric score for a textual evaluation, comparing the numeric score to a quantitative evaluation, and producing a rating based on the similarity of the two evaluations.

Type: Grant

Filed: December 18, 2012

Date of Patent: April 25, 2017

Assignee: International Business Machines Corporation

Inventors: Danny Soroker, Justin D. Weisz
System support for evaluation consistency

Patent number: 9626356

Abstract: A system and computer product for validating the consistency between quantitative and natural language textual evaluations. An example method involves computing a numeric score for a textual evaluation, comparing the numeric score to a quantitative evaluation, and producing a rating based on the similarity of the two evaluations.

Type: Grant

Filed: September 24, 2013

Date of Patent: April 18, 2017

Assignee: International Business Machines Corporation

Inventors: Danny Soroker, Justin D. Weisz

prev 1 2 3 4 5 next