Patents by Inventor Srinivas Bangalore

Srinivas Bangalore has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9805026
    Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.
    Type: Grant
    Filed: August 12, 2014
    Date of Patent: October 31, 2017
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Srinivas Bangalore, David Crawford Gibbon, Mazin Gilbert, Patrick Guy Haffner, Zhu Liu, Behzad Shahraray
  • Publication number: 20170300487
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information.
    Type: Application
    Filed: June 29, 2017
    Publication date: October 19, 2017
    Inventors: Michael J. Johnston, Srinivas Bangalore, Junlan Feng, Taniya Mishra
  • Patent number: 9792904
    Abstract: Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a data-driven approach to natural language understanding which progresses seamlessly along the continuum of availability of annotated collected data, from when there is no available annotated collected data to when there is any amount of annotated collected data.
    Type: Grant
    Filed: July 23, 2014
    Date of Patent: October 17, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Srinivas Bangalore, Mazin Gilbert, Narendra K. Gupta
  • Patent number: 9764477
    Abstract: A system, method and computer-readable storage devices are for processing natural language commands, such as commands to a robotic arm, using a Tag & Parse approach to semantic parsing. The system first assigns semantic tags to each word in a sentence and then parses the tag sequence into a semantic tree. The system can use statistical approach for tagging, parsing, and reference resolution. Each stage can produce multiple hypotheses, which are re-ranked using spatial validation. Then the system selects a most likely hypothesis after spatial validation, and generates or outputs a command. In the case of a robotic arm, the command is output in Robot Control Language (RCL).
    Type: Grant
    Filed: December 1, 2014
    Date of Patent: September 19, 2017
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Svetlana Stoyanchev, Srinivas Bangalore, John Chen, Hyuckchul Jung
  • Patent number: 9767221
    Abstract: Delivering targeted content includes collecting, via at least one tangible processor, user activity data for users during a specified time period. questions asked by the users during the specified time period are extracted from the user activity data, via the at least one tangible processor, and stored in user profiles for the users. The user profiles are clustered, via the at least one tangible processor, based on the questions asked. Targeted content is delivered, via the at least one tangible processor, to a subset of the users based on the clustering.
    Type: Grant
    Filed: October 8, 2010
    Date of Patent: September 19, 2017
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Srinivas Bangalore, Junlan Feng, Michael James Robert Johnston, Taniya Mishra
  • Patent number: 9741338
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for processing speech. A system configured to practice the method monitors user utterances to generate a conversation context. Then the system receives a current user utterance independent of non-natural language input intended to trigger speech processing. The system compares the current user utterance to the conversation context to generate a context similarity score, and if the context similarity score is above a threshold, incorporates the current user utterance into the conversation context. If the context similarity score is below the threshold, the system discards the current user utterance. The system can compare the current user utterance to the conversation context based on an n-gram distribution, a perplexity score, and a perplexity threshold. Alternately, the system can use a task model to compare the current user utterance to the conversation context.
    Type: Grant
    Filed: December 9, 2015
    Date of Patent: August 22, 2017
    Assignee: Nuance Communications, Inc.
    Inventor: Srinivas Bangalore
  • Patent number: 9734820
    Abstract: A system, method and computer-readable storage device which balance latency and accuracy of machine translations by segmenting the speech upon locating a conjunction. The system, upon receiving speech, will buffer speech until a conjunction is detected. Upon detecting a conjunction, the speech received until that point is segmented. The system then continues performing speech recognition on the segment, searching for the next conjunction, while simultaneously initiating translation of the segment. Upon translating the segment, the system converts the translation to a speech output, allowing a user to hear an audible translation of the speech originally heard.
    Type: Grant
    Filed: November 14, 2013
    Date of Patent: August 15, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, John Chen
  • Publication number: 20170229112
    Abstract: Systems, methods, and computer-readable storage devices for receiving an utterance from a user and analyzing the utterance to identify the demographics of the user. The system then analyzes the utterance to determine the prosody of the utterance, and retrieves from the Internet data associated with the determined demographics. Using the retrieved data, the system retrieves, also from the Internet, recorded speech matching the identified prosody. The recorded speech, which is based on the demographic data of the utterance and has a prosody matching the utterance, is then saved to a database for future use in generating speech specific to the user.
    Type: Application
    Filed: April 24, 2017
    Publication date: August 10, 2017
    Inventors: Srinivas BANGALORE, Taniya MISHRA
  • Patent number: 9727557
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for collecting web data in order to create diverse language models. A system configured to practice the method first crawls, such as via a crawler operating on a computing device, a set of documents in a network of interconnected devices according to a visitation policy, wherein the visitation policy is configured to focus on novelty regions for a current language model built from previous crawling cycles by crawling documents whose vocabulary considered likely to fill gaps in the current language model. A language model from a previous cycle can be used to guide the creation of a language model in the following cycle. The novelty regions can include documents with high perplexity values over the current language model.
    Type: Grant
    Filed: July 18, 2016
    Date of Patent: August 8, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Luciano De Andrade Barbosa, Srinivas Bangalore
  • Patent number: 9720907
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for learning latent representations for natural language tasks. A system configured to practice the method analyzes, for a first natural language processing task, a first natural language corpus to generate a latent representation for words in the first corpus. Then the system analyzes, for a second natural language processing task, a second natural language corpus having a target word, and predicts a label for the target word based on the latent representation. In one variation, the target word is one or more word such as a rare word and/or a word not encountered in the first natural language corpus. The system can optionally assigning the label to the target word. The system can operate according to a connectionist model that includes a learnable linear mapping that maps each word in the first corpus to a low dimensional latent space.
    Type: Grant
    Filed: September 14, 2015
    Date of Patent: August 1, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Srinivas Bangalore, Sumit Chopra
  • Patent number: 9721558
    Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
    Type: Grant
    Filed: December 10, 2015
    Date of Patent: August 1, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Srinivas Bangalore, Junlan Feng, Mazin Gilbert, Juergen Schroeter, Ann K. Syrdal, David Schulz
  • Patent number: 9703769
    Abstract: A clausifier and method of extracting clauses for spoken language understanding are disclosed. The method relates to generating a set of clauses from speech utterance text and comprises inserting at least one boundary tag in speech utterance text related to sentence boundaries, inserting at least one edit tag indicating a portion of the speech utterance text to remove, and inserting at least one conjunction tag within the speech utterance text. The result is a set of clauses that may be identified within the speech utterance text according to the inserted at least one boundary tag, at least one edit tag and at least one conjunction tag. The disclosed clausifier comprises a sentence boundary classifier, an edit detector classifier, and a conjunction detector classifier. The clausifier may comprise a single classifier or a plurality of classifiers to perform the steps of identifying sentence boundaries, editing text, and identifying conjunctions within the text.
    Type: Grant
    Filed: October 7, 2015
    Date of Patent: July 11, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Srinivas Bangalore, Narendra K. Gupta, Mazin Gilbert
  • Patent number: 9697206
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information.
    Type: Grant
    Filed: October 7, 2015
    Date of Patent: July 4, 2017
    Assignee: Interactions LLC
    Inventors: Michael J. Johnston, Srinivas Bangalore, Junlan Feng, Taniya Mishra
  • Patent number: 9679561
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating domain-specific speech recognition models for a domain of interest by combining and tuning existing speech recognition models when a speech recognizer does not have access to a speech recognition model for that domain of interest and when available domain-specific data is below a minimum desired threshold to create a new domain-specific speech recognition model. A system configured to practice the method identifies a speech recognition domain and combines a set of speech recognition models, each speech recognition model of the set of speech recognition models being from a respective speech recognition domain. The system receives an amount of data specific to the speech recognition domain, wherein the amount of data is less than a minimum threshold to create a new domain-specific model, and tunes the combined speech recognition model for the speech recognition domain based on the data.
    Type: Grant
    Filed: March 28, 2011
    Date of Patent: June 13, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Srinivas Bangalore, Robert Bell, Diamantino Antonio Caseiro, Mazin Gilbert, Patrick Haffner
  • Publication number: 20170147561
    Abstract: In an embodiment of a messaging system, a method for presenting a commercial message to a user is provided. A target language in which the user is comfortable communicating may be determined based on at least one communication received by the user or at least one communication provided by the user. The commercial message may be presented to the user in the target language.
    Type: Application
    Filed: February 6, 2017
    Publication date: May 25, 2017
    Inventor: Srinivas BANGALORE
  • Publication number: 20170147563
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for presenting a machine translation and alternative translations to a user, where a selection of any particular alternative translation results in the re-ranking of the remaining alternatives. The system then presents these re-ranked alternatives to the user, who can continue proofing the machine translation using the re-ranked alternatives or by typing an improved translation. This process continues until the user indicates that the current portion of the translation is complete, at which point the system moves to the next portion.
    Type: Application
    Filed: February 2, 2017
    Publication date: May 25, 2017
    Inventors: Srinivas BANGALORE, Kateryna Kuksenok
  • Patent number: 9633649
    Abstract: Systems, methods, and computer-readable storage devices for receiving an utterance from a user and analyzing the utterance to identify the demographics of the user. The system then analyzes the utterance to determine the prosody of the utterance, and retrieves from the Internet data associated with the determined demographics. Using the retrieved data, the system retrieves, also from the Internet, recorded speech matching the identified prosody. The recorded speech, which is based on the demographic data of the utterance and has a prosody matching the utterance, is then saved to a database for future use in generating speech specific to the user.
    Type: Grant
    Filed: May 2, 2014
    Date of Patent: April 25, 2017
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Srinivas Bangalore, Taniya Mishra
  • Publication number: 20170091178
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for bootstrapping a language translation system. A system configured to practice the method performs a bidirectional web crawl to identify a bilingual website. The system analyzes data on the bilingual website to make a classification decision about whether the root of the bilingual website is an entry point for the bilingual website. The bilingual site can contain pairs of parallel pages. Each pair can include a first website in a first language and a second website in a second language, and a first portion of the first web page corresponds to a second portion of the second web page. Then the system analyzes the first and second web pages to identify corresponding information pairs in the first and second languages, and extracts the corresponding information pairs from the first and second web pages for use in a language translation model.
    Type: Application
    Filed: October 17, 2016
    Publication date: March 30, 2017
    Inventors: Luciano De Andrade BARBOSA, Srinivas BANGALORE, Vivek Kumar RANGARAJAN SRIDHAR
  • Patent number: 9571887
    Abstract: Aspects of the subject disclosure may include, for example, a process that determines information from a first media stream including a number of keywords associated with viewing habits of a user. A second media stream of a media program is scanned for one of a word or phrase corresponding to a keyword of the number of keywords. A second keyword is determined based on the one of the word or phrase, and additional information associated with the second keyword is identified. Results associated with the additional information are provided to a multimedia device. Other embodiments are disclosed.
    Type: Grant
    Filed: November 17, 2014
    Date of Patent: February 14, 2017
    Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Srinivas Bangalore, Mazin E. Gilbert, Michael Johnston
  • Patent number: 9563625
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for presenting a machine translation and alternative translations to a user, where a selection of any particular alternative translation results in the re-ranking of the remaining alternatives. The system then presents these re-ranked alternatives to the user, who can continue proofing the machine translation using the re-ranked alternatives or by typing an improved translation. This process continues until the user indicates that the current portion of the translation is complete, at which point the system moves to the next portion.
    Type: Grant
    Filed: March 21, 2016
    Date of Patent: February 7, 2017
    Assignee: AT&T Intellectual Property I. L.P.
    Inventors: Srinivas Bangalore, Kateryna Kuksenok