Patents by Inventor Bhuvana Ramabhadran

Bhuvana Ramabhadran has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Verbal description

Patent number: 8839086

Abstract: A verbal description method and system. A computing system broadcasts first audio data and video data associated with the first audio data. The computing system determines that the video data comprises a graphic without a description in the first audio data. The computing system receives audible description data associated with the graphic. The computing system generates second audio data comprising the first audio data and the audible description data. The computing system synchronizes portions of the second audio data with associated portions of the video data. The computing system generates synchronized audio/video data comprising the portions of said second audio data aligned with the associated portions of said video data. The computing system broadcasts the synchronized audio/video data.

Type: Grant

Filed: March 29, 2012

Date of Patent: September 16, 2014

Assignee: International Business Machines Corporation

Inventors: Sara H. Basson, Brian Reginald Heasman, Dimitri Kanevsky, Edward Emile Kelley, Bhuvana Ramabhadran
CONVERSION OF NON-BACK-OFF LANGUAGE MODELS FOR EFFICIENT SPEECH DECODING

Publication number: 20140244261

Abstract: Techniques for conversion of non-back-off language models for use in speech decoders. For example, an apparatus for conversion of non-back-off language models for use in speech decoders. For example, an apparatus is configured convert a non-back-off language model to a back-off language model. The converted back-off language model is pruned. The converted back-off language model is usable for decoding speech.

Type: Application

Filed: August 12, 2013

Publication date: August 28, 2014

Inventors: Ebru Arisoy, Bhuvana Ramabhadran, Abhinav Sethy, Stanley Chen
CONVERSION OF NON-BACK-OFF LANGUAGE MODELS FOR EFFICIENT SPEECH DECODING

Publication number: 20140244248

Abstract: Techniques for conversion of non-back-off language models for use in speech decoders. For example, a method comprises the following step. A non-back-off language model is converted to a back-off language model. The converted back-off language model is pruned. The converted back-off language model is usable for decoding speech.

Type: Application

Filed: February 22, 2013

Publication date: August 28, 2014

Applicant: International Business Machines Corporation

Inventors: Ebru Arisoy, Bhuvana Ramabhadran, Abhinav Sethy, Stanley Chen
Recovering the structure of sparse markov networks from high-dimensional data

Patent number: 8775345

Abstract: A method, information processing system, and computer readable article of manufacture model data. A first dataset is received that includes a first set of physical world data. At least one data model associated with the first dataset is generated based on the receiving. A second dataset is received that includes a second set of physical world data. The second dataset is compared to the at least one data model. A probability that the second dataset is modeled by the at least one data model is determined. A determination is made that the probability is above a given threshold. A decision associated with the second dataset based on the at least one data model is generated in response to the probability being above the given threshold. The probability and the decision are stored in memory. The probability and the decision are provided to user via a user interface.

Type: Grant

Filed: September 14, 2012

Date of Patent: July 8, 2014

Assignee: International Business Machines Corporation

Inventors: Narges Bani Asadi, Guillermo A. Cecchi, Dimitri Kanevsky, Bhuvana Ramabhadran, Irina Rish, Katya Scheinberg
Elimination of typing noise from conference calls

Patent number: 8767922

Abstract: A method for eliminating typing noise from a conference call in which a plurality of participants communicate via a plurality of client devices connected to a conference server via a corresponding plurality of channels, includes determining a probability value for each channel of the plurality of channels representing a likelihood of a typing noise being present on the corresponding channel. A channel of the plurality of channels having a highest determined probability value is temporarily muted. It is testing whether the temporary muting has successfully remove the typing noise from the conference call. A warning is generated for a client device of the plurality of client devices that corresponds to the channel having a highest determined probability value when it is determined that the temporary muting has successfully removed the typing noise from the conference call.

Type: Grant

Filed: October 4, 2012

Date of Patent: July 1, 2014

Assignee: International Business Machines Corporation

Inventors: Dimitri Kanevsky, Bhuvana Ramabhadran, Abhinav Sethy
Machine translation with side information

Patent number: 8768686

Abstract: A method of identifying and using side information available to statistical machine translation systems within an enterprise setting, the method including extracting user-specific interaction and non-interaction-based information from at least one corresponding database within the enterprise for each of a plurality of users, aggregating the user-specific interaction and non-interaction based information from a plurality of users, by using a processor on a computer, to tune and adapt background translation and language models, and updating all relevant models within the enterprise after user activity based on the tuned and adapted translation and language models.

Type: Grant

Filed: May 13, 2010

Date of Patent: July 1, 2014

Assignee: International Business Machines Corporation

Inventors: Ruhi Sarikaya, Jiri Navratil, Bhuvana Ramabhadran, David Eubensky, Salim Estephan Roukos
HYBRID PRE-TRAINING OF DEEP BELIEF NETWORKS

Publication number: 20140164299

Abstract: Pretraining for a DBN initializes weights of the DBN (Deep Belief Network) using a hybrid pre-training methodology. Hybrid pre-training employs generative component that allows the hybrid PT method to have better performance in WER (Word Error Rate) compared to the discriminative PT method. Hybrid pre-training learns weights which are more closely linked to the final objective function, allowing for a much larger batch size compared to generative PT, which allows for improvements in speed; and a larger batch size allows for parallelization of the gradient computation, speeding up training further.

Type: Application

Filed: December 6, 2012

Publication date: June 12, 2014

Inventors: Tara Sainath, Brian Kingsbury, Bhuvana Ramabhadran
Elimination of typing noise from conference calls

Patent number: 8750461

Abstract: A method for eliminating typing noise from a conference call in which a plurality of participants communicate via a plurality of client devices connected to a conference server via a corresponding plurality of channels, includes determining a probability value for each channel of the plurality of channels representing a likelihood of a typing noise being present on the corresponding channel. A channel of the plurality of channels having a highest determined probability value is temporarily muted. It is testing whether the temporary muting has successfully remove the typing noise from the conference call. A warning is generated for a client device of the plurality of client devices that corresponds to the channel having a highest determined probability value when it is determined that the temporary muting has successfully removed the typing noise from the conference call.

Type: Grant

Filed: September 28, 2012

Date of Patent: June 10, 2014

Assignee: International Business Machines Corporation

Inventors: Dimitri Kanevsky, Bhuvana Ramabhadran, Abihnav Sethy
Method and Apparatus of Processing Data Using Deep Belief Networks Employing Low-Rank Matrix Factorization

Publication number: 20140156575

Abstract: Deep belief networks are usually associated with a large number of parameters and high computational complexity. The large number of parameters results in a long and computationally consuming training phase. According to at least one example embodiment, low-rank matrix factorization is used to approximate at least a first set of parameters, associated with an output layer, with a second and a third set of parameters. The total number of parameters in the second and third sets of parameters is smaller than the number of sets of parameters in the first set. An architecture of a resulting artificial neural network, when employing low-rank matrix factorization, may be characterized with a low-rank layer, not employing activation function(s), and defined by a relatively small number of nodes and the second set of parameters. By using low rank matrix factorization, training is faster, leading to rapid deployment of the respective system.

Type: Application

Filed: November 30, 2012

Publication date: June 5, 2014

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Tara N. Sainath, Ebru Arisoy, Bhuvana Ramabhadran
ACCURACY IMPROVEMENT OF SPOKEN QUERIES TRANSCRIPTION USING CO-OCCURRENCE INFORMATION

Publication number: 20140136197

Abstract: Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.

Type: Application

Filed: January 16, 2014

Publication date: May 15, 2014

Inventors: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab
ELIMINATION OF TYPING NOISE FROM CONFERENCE CALLS

Publication number: 20140093059

Abstract: A method for eliminating typing noise from a conference call in which a plurality of participants communicate via a plurality of client devices connected to a conference server via a corresponding plurality of channels, includes determining a probability value for each channel of the plurality of channels representing a likelihood of a typing noise being present on the corresponding channel. A channel of the plurality of channels having a highest determined probability value is temporarily muted. It is testing whether the temporary muting has successfully remove the typing noise from the conference call. A warning is generated for a client device of the plurality of client devices that corresponds to the channel having a highest determined probability value when it is determined that the temporary muting has successfully removed the typing noise from the conference call.

Type: Application

Filed: October 4, 2012

Publication date: April 3, 2014

Applicant: International Business Machines Corporation

Inventors: Dimitri Kanevsky, Bhuvana Ramabhadran, Abhinav Sethy
Elimination of Typing Noise From Conference Calls

Publication number: 20140093053

Abstract: A method for eliminating typing noise from a conference call in which a plurality of participants communicate via a plurality of client devices connected to a conference server via a corresponding plurality of channels, includes determining a probability value for each channel of the plurality of channels representing a likelihood of a typing noise being present on the corresponding channel. A channel of the plurality of channels having a highest determined probability value is temporarily muted. It is testing whether the temporary muting has successfully remove the typing noise from the conference call. A warning is generated for a client device of the plurality of client devices that corresponds to the channel having a highest determined probability value when it is determined that the temporary muting has successfully removed the typing noise from the conference call.

Type: Application

Filed: September 28, 2012

Publication date: April 3, 2014

Applicant: International Business Machines Corporation

Inventors: Dimitri KANEVSKY, BHUVANA RAMABHADRAN, ABHNAV SETHY
LANGUAGE TRANSLATION IN AN ENVIRONMENT ASSOCIATED WITH A VIRTUAL APPLICATION

Publication number: 20140052435

Abstract: Methods and apparatus for language translation in a computing environment associated with a virtual application are presented. For example, a method for providing language translation includes determining languages of a user and a correspondent; determining one or more sequences of translators; determining a selected sequence of selected translators from the one or more sequences of the translators; requesting a change in virtual locations, within the computing environment associated with the virtual application, of one or more selected translator virtual representations of the selected translators to a virtual meeting location within the computing environment associated with the virtual application; and changing virtual locations of the one or more selected translator virtual representations to the virtual meeting location.

Type: Application

Filed: October 25, 2013

Publication date: February 20, 2014

Applicant: International Business Machines Corporation

Inventors: Dimitri Kanevsky, Clifford Alan Pickover, Bhuvana Ramabhadran, Irina Rish
Language translation in an environment associated with a virtual application

Patent number: 8655644

Abstract: Methods and apparatus for language translation in a computing environment associated with a virtual application are presented. For example, a method for providing language translation includes determining languages of a user and a correspondent; determining one or more sequences of translators; determining a selected sequence of selected translators from the one or more sequences of the translators; requesting a change in virtual locations, within the computing environment associated with the virtual application, of one or more selected translator virtual representations of the selected translators to a virtual meeting location within the computing environment associated with the virtual application; and changing virtual locations of the one or more selected translator virtual representations to the virtual meeting location.

Type: Grant

Filed: September 30, 2009

Date of Patent: February 18, 2014

Assignee: International Business Machines Corporation

Inventors: Dimitri Kanevsky, Clifford Alan Pickover, Bhuvana Ramabhadran, Irina Rish
Accuracy improvement of spoken queries transcription using co-occurrence information

Patent number: 8650031

Abstract: Techniques disclosed herein include systems and methods for voice-enabled searching. Techniques include a co-occurrence based approach to improve accuracy of the 1-best hypothesis for non-phrase voice queries, as well as for phrased voice queries. A co-occurrence model is used in addition to a statistical natural language model and acoustic model to recognize spoken queries, such as spoken queries for searching a search engine. Given an utterance and an associated list of automated speech recognition n-best hypotheses, the system rescores the different hypotheses using co-occurrence information. For each hypothesis, the system estimates a frequency of co-occurrence within web documents. Combined scores from a speech recognizer and a co-occurrence engine can be combined to select a best hypothesis with a lower word error rate.

Type: Grant

Filed: July 31, 2011

Date of Patent: February 11, 2014

Assignee: Nuance Communications, Inc.

Inventors: Jonathan Mamou, Abhinav Sethy, Bhuvana Ramabhadran, Ron Hoory, Paul Joseph Vozila, Nathan Bodenstab
Multiple audio/video data stream simulation

Patent number: 8644550

Abstract: A multiple audio/video data stream simulation method and system. A computing system receives first audio and/or video data streams. The first audio and/or video data streams include data associated with a first person and a second person. The computing system monitors the first audio and/or video data streams. The computing system identifies emotional attributes comprised by the first audio and/or video data streams. The computing system generates second audio and/or video data streams associated with the first audio and/or video data streams. The second audio and/or video data streams include the first audio and/or video data streams data without the emotional attributes. The computing system stores the second audio and/or video data streams.

Type: Grant

Filed: May 31, 2012

Date of Patent: February 4, 2014

Assignee: International Business Machines Corporation

Inventors: Sara H. Basson, Dimitri Kanevsky, Edward Emile Kelley, Bhuvana Ramabhadran
METHODS AND APPARATUS FOR PERFORMING TRANSFORMATION TECHNIQUES FOR DATA CLUSTERING AND/OR CLASSIFICATION

Publication number: 20130325472

Abstract: Some aspects include transforming data, at least a portion of which has been processed to determine frequency information associated with features in the data. Techniques include determining a first transformation based, at least in part, on the frequency information, applying at least the first transformation to the data to obtain transformed data, and fitting a plurality of clusters to the transformed data to obtain a plurality of established clusters. Some aspects include classifying input data by transforming the input data using at least the first transformation and comparing the transformed input data to the established clusters.

Type: Application

Filed: August 8, 2012

Publication date: December 5, 2013

Applicant: Nuance Communications, Inc.

Inventors: Leonid Rachevsky, Dimitri Kanevsky, Bhuvana Ramabhadran
METHODS AND APPARATUS FOR PERFORMING TRANSFORMATION TECHNIQUES FOR DATA CLUSTERING AND/OR CLASSIFICATION

Publication number: 20130325471

Abstract: Some aspects include transforming data, at least a portion of which has been processed to determine at least one representative vector associated with each of a plurality of classifications associated with the data to obtain a plurality of representative vectors. Techniques comprise determining a first transformation based, at least in part, on the plurality of representative vectors, applying at least the first transformation to the data to obtain transformed data, and fitting a plurality of clusters to the transformed data to obtain a plurality of established clusters. Some aspects include classifying input data by transforming the input data using at least the first transformation and comparing the transformed input data to the established clusters.

Type: Application

Filed: August 8, 2012

Publication date: December 5, 2013

Applicant: Nuance Communications, Inc.

Inventors: Leonid Rachevsky, Dimitri Kanevsky, Bhuvana Ramabhadran
METHODS AND APPARATUS FOR PERFORMING TRANSFORMATION TECHNIQUES FOR DATA CLUSTERING AND/OR CLASSIFICATION

Publication number: 20130325759

Abstract: Some aspects include transforming data for which at least one constraint has been specified on a portion of the data, the at least one constraint relating to a similarity and/or dissimilarity of at least some of the portion of the data. Techniques comprise determining a first transformation that approximates the at least one constraint using a cosine similarity as a measure of the similarity and/or dissimilarity of the at least a portion of the data, applying at least the first transformation to the data to obtain transformed data, and fitting a plurality of clusters to the transformed data to obtain a plurality of established clusters. Some aspects include classifying input data by transforming the input data using at least the first transformation and comparing the transformed input data to the established clusters.

Type: Application

Filed: August 8, 2012

Publication date: December 5, 2013

Applicant: Nuance Communications, Inc.

Inventors: Leonid Rachevsky, Dimitri Kanevsky, Bhuvana Ramabhadran
Sparse representations for text classification

Patent number: 8566270

Abstract: A sparse representation method of text classification is described. An input text document is represented as a document feature vector y. A category dictionary H provides possible examples [h1; h2; . . . ; hn] of the document feature vector y. The input text document is classified using a sparse representation text classification algorithm that solves for y=H? where a sparseness condition is enforced on ? to select a small number of examples from the dictionary H to describe the document feature vector y.

Type: Grant

Filed: September 23, 2011

Date of Patent: October 22, 2013

Assignee: Nuance Communications, Inc.

Inventors: Tara N. Sainath, Sameer R. Maskey, Bhuvana Ramabhadran, Dimitri Kanevsky

prev … 3 4 5 6 7 8 9 10 next