Punctuation Patents (Class 704/6)
  • Patent number: 11256879
    Abstract: Systems and methods for translating a source segment are disclosed. In embodiments, a computer-implemented method for translating a source segment comprises receiving, by a computing device, the source segment in a first language to be translated into a second language; identifying, by the computing device, linguistic markers within the source segment and associated noise values to produce a tagged source segment, wherein the linguistic markers are associated with one or more linguistic patterns likely to introduce noise into a translation channel; transforming, by the computing device, the tagged source segment into an amplified source segment; and sending, by the computing device, the amplified source segment to a machine translation module, wherein the machine translation module is configured to process the amplified source segment to produce a return amplified match in the second language.
    Type: Grant
    Filed: July 2, 2019
    Date of Patent: February 22, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alejandro Martinez Corria, Santiago Pont Nesta, Consuelo Rodríguez Magro, Francis X. Rojas, Linda F. Traudt, Saroj K. Vohra
  • Patent number: 11195507
    Abstract: Systems and methods are described herein for generating alternate audio for a media stream. The media system receives media that is requested by the user. The media comprises a video and audio. The audio includes words spoken in a first language. The media system stores the received media in a buffer as it is received. The media system separates the audio from the buffered media and determines an emotional state expressed by spoken words of the first language. The media system translates the words spoken in the first language into words spoken in a second language. Using the translated words of the second language, the media system synthesizes speech having the emotional state previously determined. The media system then retrieves the video of the received media from the buffer and synchronizes the synthesized speech with the video to generate the media content in a second language.
    Type: Grant
    Filed: October 4, 2018
    Date of Patent: December 7, 2021
    Assignee: Rovi Guides, Inc.
    Inventors: Vijay Kumar, Rajendran Pichaimurthy, Madhusudhan Seetharam
  • Patent number: 11062693
    Abstract: To provide a more natural sounding set of voice prompts of an interactive voice response (IVR) script, the voice recordings of the prompts may be modified to have a predetermined amount of silence at the end of the recording. The amount of silence required can be determined from the context in which the voice prompt appears in the IVR script. Different contexts may include mid-sentence, terminating in a comma, or a sentence ending context. These contexts may require silence periods of 100 ms, 250 ms and 500 ms respectively. Voice files may be trimmed to remove any existing silence and then the required silence period may be added.
    Type: Grant
    Filed: June 20, 2019
    Date of Patent: July 13, 2021
    Assignee: West Corporation
    Inventors: Terry Olson, Mark Sempek, Roger Wehrle
  • Patent number: 10949620
    Abstract: A probabilistic framework to detect controversy on the web. The prior kNN-WC algorithm is recast into a theoretical framework and a new language model introduced. Language models are constructed that are used to calculate probabilities. The probabilities are compared to determine whether or not a certain document is controversial.
    Type: Grant
    Filed: October 6, 2017
    Date of Patent: March 16, 2021
    Assignee: UNIVERSITY OF MASSACHUSETTS
    Inventors: Shiri Dori-Hacohen, James Allan, John Foley, Myung-ha Jang
  • Patent number: 10769386
    Abstract: An input terminology is received in a source language to determine a target language proposal. In a proposal engine when a configuration profile is available the following steps are performed. A set of target language equivalents are determined by applying a user-defined set of approaches to the input terminology. Statistics is computed corresponding to the set of target language equivalents. A target language proposal from the set of target language equivalents is determined based on the computed statistics. The target language proposal along with the computed statistics is displayed in a graphical user interface.
    Type: Grant
    Filed: December 5, 2017
    Date of Patent: September 8, 2020
    Assignee: SAP SE
    Inventors: Michail Vasiltschenko, Benjamin Schork, Mark Childress, Annika Berger, Yvonne Miklosch, Thomas Alcantara Kliche, Hanne Smaadahl, Ying Zou
  • Patent number: 10650195
    Abstract: A bilingual-corpus generating method includes: acquiring a first clause in the first language, a first translated clause obtained by translating the first clause into a second language, a second clause in the first language, a second translated clause obtained by translating the second clause into the second language; and generating, as a translated clause of a third clause obtained by constructing the first clause and the second clause as one sentence by using a first conjunction in the first language, a third translated clause obtained by constructing the first translated clause and the second translated clause as one sentence by using a second conjunction in the second language.
    Type: Grant
    Filed: April 18, 2018
    Date of Patent: May 12, 2020
    Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.
    Inventors: Nanami Fujiwara, Masaki Yamauchi, Masahiro Imade
  • Patent number: 10614168
    Abstract: A translation apparatus that can perform translation based on a correct syntax, a syntax evaluation apparatus employed in the translation apparatus, a syntax evaluation method, and a syntax evaluation program are provided. A syntax evaluation unit 13 includes a grammatical element guessing part 131 that guesses a grammatical element that should be connected before and after or after a word string indicated by first fragment data, and a grammatical element complementing part 132 that complements, based on the guessing result of the grammatical element guessing part 131, the first fragment data as necessary to generate second fragment data. When the grammatical element guessing part 131 guesses a grammatical element, the grammatical element complementing part 132 complements, at a location to which the grammatical element of the word string indicated by the first fragment data should be connected, a tag indicating the grammatical element in order to generate the second fragment data.
    Type: Grant
    Filed: February 23, 2016
    Date of Patent: April 7, 2020
    Assignee: National University Corporation Nara Institute of Science and Technology
    Inventors: Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura
  • Patent number: 10402499
    Abstract: A method includes performing, with at least one processing device, natural language understanding using both (i) a semantic word and clause representation generated from syntactically-labeled context and (ii) a syntax generated from common semantic relations between sequential words and clauses. The semantic word and clause representation and the syntax could be generated iteratively. The method could include generating the semantic word and clause representation, where a significance of particular items of context is modified by a presence of words or clauses from a given lexicon of amplifiers along a syntax tree. The method could also include generating the syntax, where multiple semantic relations are considered to be equivalent if the multiple semantic relations have identical representations in an auto-associative memory. The semantic word and clause representation could include content semantics constructed from closest nodes in a syntax tree to a clause.
    Type: Grant
    Filed: November 15, 2017
    Date of Patent: September 3, 2019
    Assignee: Goldman Sachs & Co. LLC
    Inventor: Paul Burchard
  • Patent number: 10346547
    Abstract: A device for automatic translation of patent claims automatically translates a first language claim into a second language claim. The device includes an element noun retrieving module, a complement determining module, an organizing module. and an automatic computer translating module. The element noun retrieving module retrieves an element noun from the first language claim. The complement determining module determines a complement definition from the first language claim, as a combination of an interrelationship definition and object element. The organizing module forms an organized structure by organizing the element noun, interrelationship definition, object element and an organization identifying word.
    Type: Grant
    Filed: September 5, 2017
    Date of Patent: July 9, 2019
    Assignee: INTEGRAL SEARCH INTERNATIONAL LIMITED
    Inventor: Hong-Shin Tsai
  • Patent number: 9721573
    Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.
    Type: Grant
    Filed: December 16, 2014
    Date of Patent: August 1, 2017
    Assignee: MModal IP LLC
    Inventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
  • Patent number: 9711147
    Abstract: Language dictation recognition systems and methods for using the same. In at least one exemplary system for analyzing verbal records, the system comprises a database capable of receiving a plurality of verbal records, the verbal record comprising at least one identifier and at least one verbal feature and a processor operably coupled to the database, where the processor has and executes a software program. The processor being operational to identify a subset of the plurality of verbal records from the database, extract at least one verbal feature from the identified records, analyze the at least one verbal feature of the subset of the plurality of verbal records, process the subset of the plurality of records using the analyzed feature according to at least one reasoning approach, generate a processed verbal record using the processed subset of the plurality of records, and deliver the processed verbal record to a recipient.
    Type: Grant
    Filed: May 25, 2016
    Date of Patent: July 18, 2017
    Assignee: InfraWare, Inc.
    Inventors: Nick Mahurin, Nathan Lindle, Markus Dickinson, Sandra Kuebler
  • Patent number: 9323854
    Abstract: A method, apparatus and system may enhance translation results from translation applications running on mobile devices. Specifically, according to one embodiment, location data may be automatically identified by a location awareness or location detection scheme (e.g., Global Positioning System or GPS, WiFi and/or 3G) on the device or coupled to the device. The location data may be utilized by the enhanced translation scheme according to an embodiment of the invention to pin point the specific physical geographical location of the device. Thereafter, if an enhanced local language dictionary exists that includes any local dialect details, the enhanced local language dictionary may be downloaded to the device. When the user desires to translate a word and/or phrase, the device may look up the meaning in the downloaded enhanced local language dictionary, thus ensuring that the translation is specific to that geographic location.
    Type: Grant
    Filed: December 19, 2008
    Date of Patent: April 26, 2016
    Assignee: Intel Corporation
    Inventor: Lucas A. Massuh
  • Patent number: 9031849
    Abstract: A system for providing multi-language conference is provided. The system includes conference terminals and a multipoint control unit. The conference terminals are adapted to process a speech of a conference site, transmitting the processed speech to the multipoint control unit, process an audio data received from the multipoint control unit and output it. At least one of the conference terminals is an interpreting terminal adapted to interpret the speech of the conference according to the audio data transmitted from the multipoint control unit, process the interpreted audio data and output the processed audio data. The multipoint control unit is adapted to perform a sound mixing process of the audio data from the conference terminals in different sound channels according to language types, and then sends mixed audio data after the sound mixing process to the conference terminals.
    Type: Grant
    Filed: March 27, 2009
    Date of Patent: May 12, 2015
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Zhihui Liu, Zhonghui Yue
  • Patent number: 9002699
    Abstract: Techniques and solutions are provided for adaptive input language switching. Automatic switching of a current input language can be performed by receiving an indication of a first language of displayed content (where the current input language is set to a second language different from the first language), receiving a selection of a text input area, and based on the selection, automatically switching the current input language from the second language to the first language. Switching of the current input language can also be performed by providing convenient user interface elements for selection by a user to automatically switch the current input language.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: April 7, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventor: Lijuan Qin
  • Patent number: 8996357
    Abstract: A computer analyzes a statement written in a natural language to determine the types and structure of phrases that constitute the statement. A memory stores a plurality of translation rules for different types of phrases, each rule describing how a phrase is to be translated into graphical symbol datasets and relationship link datasets. Each graphical symbol dataset specifies a graphical symbol having a content label representing its content. The content label is or includes a word contained in the pertinent phrase. Each relationship link dataset specifies a relationship link that represents a relationship between graphical symbols and has a particular end shape to indicate what type of relationship it is. The computer translates each phrase of the statement into two or more graphical symbol datasets and one or more relationship link datasets, according to the translation rules pertinent to the determined phrase types.
    Type: Grant
    Filed: June 18, 2012
    Date of Patent: March 31, 2015
    Assignee: Fujitsu Limited
    Inventor: Kuniharu Takayama
  • Patent number: 8959013
    Abstract: A method, including presenting, by a computer system executing a non-tactile three dimensional user interface, a virtual keyboard on a display, the virtual keyboard including multiple virtual keys, and capturing a sequence of depth maps over time of a body part of a human subject. On the display, a cursor is presented at positions indicated by the body part in the captured sequence of depth maps, and one of the multiple virtual keys is selected in response to an interruption of a motion of the presented cursor in proximity to the one of the multiple virtual keys.
    Type: Grant
    Filed: September 25, 2011
    Date of Patent: February 17, 2015
    Assignee: Apple Inc.
    Inventors: Micha Galor, Ofir Or, Shai Litvak, Erez Sali
  • Patent number: 8949112
    Abstract: One embodiment of the present invention is an XML application module that processes an XML character stream, which module includes an XML interface module, a parallel bit stream module, a lexical item stream module, a parser and a parsed data receiver. The XML interface module applies the XML character stream as input to the parallel bit stream module and the parser; the parallel bit stream module forms parallel bit streams and applies them as input to the lexical item stream module; the lexical stream module forms lexical item streams and applies them as input to the parser; the parser forms a stream of parsed XML data and applies it as input to the parsed data receiver; and the parsed data receiver processes the stream of parsed XML data. The parsed data receiver may be, for example, a communication module of a portable communication device.
    Type: Grant
    Filed: February 6, 2013
    Date of Patent: February 3, 2015
    Assignee: International Characters, Inc.
    Inventor: Robert D. Cameron
  • Patent number: 8949111
    Abstract: A method includes accessing text that includes a plurality of words, tagging each of the plurality of words with one of a plurality of parts of speech (POS) tags, and creating a plurality of tokens, each token comprising one of the plurality of words and its associated POS tag. The method further includes clustering one or more of the created tokens into a chunk of tokens, the one or more tokens clustered into the chunk of tokens based on the POS tags of the one or more tokens, and forming a phrase based on the chunk of tokens, the phrase comprising the words of the one or more tokens clustered into the chunk of tokens.
    Type: Grant
    Filed: December 14, 2011
    Date of Patent: February 3, 2015
    Assignee: Brainspace Corporation
    Inventor: Paul A. Jakubik
  • Patent number: 8918317
    Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.
    Type: Grant
    Filed: September 25, 2009
    Date of Patent: December 23, 2014
    Assignee: Multimodal Technologies, LLC
    Inventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
  • Patent number: 8909516
    Abstract: Computing functionality converts an input linguistic item into a normalized linguistic item, representing a normalized counterpart of the input linguistic item. In one environment, the input linguistic item corresponds to a complaint by a person receiving medical care, and the normalized linguistic item corresponds to a definitive and error-free version of that complaint. In operation, the computing functionality uses plural reference resources to expand the input linguistic item, creating an expanded linguistic item. The computing functionality then forms a graph based on candidate tokens that appear in the expanded linguistic item, and then finds a shortest path through the graph; that path corresponds to the normalized linguistic item. The computing functionality may use a statistical language model to assign weights to edges in the graph, and to determine whether the normalized linguistic incorporates two or more component linguistic items.
    Type: Grant
    Filed: December 7, 2011
    Date of Patent: December 9, 2014
    Assignee: Microsoft Corporation
    Inventors: Julie A. Medero, Daniel S. Morris, Lucretia H. Vanderwende, Michael Gamon
  • Patent number: 8886518
    Abstract: A system and method for capitalizing translated text is provided. A capitalized source text is automatically translated to a target text. The target text is capitalized according to information in the capitalized source text.
    Type: Grant
    Filed: August 7, 2006
    Date of Patent: November 11, 2014
    Assignee: Language Weaver, Inc.
    Inventors: Wei Wang, Kevin Knight, Daniel Marcu
  • Patent number: 8880389
    Abstract: A method, computer program product and system are disclosed for determining the semantic density of textualized digital media (a measure of how much information is conveyed in a sentence or clause relative to its length). The more semantically dense text is, the more information it conveys in a given space. Users input a topic, a timeline, and one or more target web media sources for analysis. Text in the target media sources is deconstructed to determine density, and a density rating assigned to the web media source. Over time, users can track trends in the density of text media relative to a given topic, and determine how much information is being conveyed in connection with the topic, such as a political campaign. Line graphs, pie charts, and other time-elapsed output graphic representations of the semantic density are generated and rendered for the user.
    Type: Grant
    Filed: December 9, 2011
    Date of Patent: November 4, 2014
    Inventor: Igor Iofinov
  • Patent number: 8855995
    Abstract: Systems, methods, and apparatuses including computer program products for machine translation. A method is provided that includes generating a plurality of machine translation systems using a single machine translation engine, and generating a consensus translation from a plurality of candidate translations for a source sentence, where each candidate translation of the plurality of candidate translations is an output of a respective machine translation system of the plurality of machine translation systems.
    Type: Grant
    Filed: December 3, 2012
    Date of Patent: October 7, 2014
    Assignee: Google Inc.
    Inventors: Wolfgang Macherey, Franz Josef Och
  • Patent number: 8843360
    Abstract: Disclosed are various embodiments for client-side internationalization of network pages. A network page and code that localizes the network page are obtained from a server. The code that localizes the network page is executed in a client and determines a locale associated with the client. One or more internationalized elements are identified in the network page. The internationalized elements are replaced with corresponding localized translations. The network page is rendered for display in the client after the network page has been localized.
    Type: Grant
    Filed: March 4, 2011
    Date of Patent: September 23, 2014
    Assignee: Amazon Technologies, Inc.
    Inventors: Simon K. Johnston, Margaux Eng, James K. Keiger, Gideon Shavit
  • Patent number: 8838434
    Abstract: Techniques disclosed herein include systems and methods for creating a bootstrap call router for other languages by using selected N-best translations. Techniques include using N-best translations from a machine translation system so as to increase a possibility that desired keywords in a target language are covered in the machine translation output. A 1-best translation is added to a new text corpus. This is followed by selecting a subset that provides a varied set of translations for a given source transcribed utterance for better translation coverage. Additional translations are added to the new text corpus based on a measure of possible translations having words not yet seen for the selected transcribed utterances, and also based on possible translation having words that are not associated with very many or semantic tags in the new text corpus. Candidate translations can be selected from a window of N-best translations calculated based on machine translation accuracy.
    Type: Grant
    Filed: July 29, 2011
    Date of Patent: September 16, 2014
    Assignee: Nuance Communications, Inc.
    Inventor: Ding Liu
  • Patent number: 8818792
    Abstract: An apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus. Each of the apparatus or method recognizes a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpus, extracts a translation pattern candidate and an occurrence frequency using the recognized predicate and argument, and then generates a basic verb phrase translation pattern by verifying the translation pattern candidate, and generalizes the generated basic verb phrase translation pattern to generate a general verb phrase pattern so as to be applied to various language pairs and minimize an error in the verb phrase translation pattern and determine an appropriate generalization level using a co-occurrence frequency of a predicate and an argument of the verb phrase translation pattern and a translation probability of the predicate.
    Type: Grant
    Filed: July 8, 2011
    Date of Patent: August 26, 2014
    Assignee: SK Planet Co., Ltd.
    Inventors: Young Sook Hwang, Sang-Bum Kim, Chang Hao Yin, Hae Chang Rim, Joo Young Lee
  • Patent number: 8805677
    Abstract: Creating and processing a natural language grammar set of data based on an input text string are disclosed. The method may include tagging the input text string, and examining, via a processor, the input text string for at least one first set of substitutions based on content of the input text string. The method may also include determining whether the input text string is a substring of a previously tagged input text string by comparing the input text string to a previously tagged input text string, such that the substring determination operation determines whether the input text string is wholly included in the previously tagged input text string.
    Type: Grant
    Filed: February 4, 2014
    Date of Patent: August 12, 2014
    Assignee: West Corporation
    Inventor: Steven John Schanbacher
  • Patent number: 8798986
    Abstract: Portable, real time voice translation systems, and associated methods of use, are provided. The systems include a translation system for use on a single unit, portable computing device and operable for accessing a multilanguage database, selecting a source language from a plurality of source languages and a destination language from a plurality of destination languages, inputting a source phrase, transmitting the source phrase to a speech recognition module, a translation engine, and a template look-up engine for finding the phrase template associated with the destination phrase from among the multiple languages. The spoken translation is then output in the selected destination language. The translation system has a total time between the input of the source phrase and output of the destination phrase that is no slower than 0.010 seconds, and a communications interface operable for communicating with a second computer system.
    Type: Grant
    Filed: December 26, 2012
    Date of Patent: August 5, 2014
    Assignee: NewTalk, Inc.
    Inventors: Robert H. Clemons, Bruce W. Nash, Martha P. Robinson, Craig A. Robinson
  • Patent number: 8788260
    Abstract: Systems, methods, and computer storage media having computer-executable instructions embodied thereon that facilitate generation of snippets. In embodiments, text features within a keyword-sentence window are identified. The text features are utilized to determine break features that indicate favorability of breaking at a particular location of the keyword-sentence window. The break features are used to recognize features of partial snippets such that a snippet score to indicate the strength of the partial snippet can be calculated. Snippet scores associated with partial snippets are compared to select an optimal snippet, that is, the snippet having the highest snippet score.
    Type: Grant
    Filed: May 11, 2010
    Date of Patent: July 22, 2014
    Assignee: Microsoft Corporation
    Inventors: Valerie Rose Nygaard, Riccardo Turchetto, Joanna Mun Yee Chan, Christian Biemann, David Dongjah Ahn, Andrea Ryerson Burbank, Feng Pan, Timothy McDonnell Converse, James Michael Reinhold, Tracy Holloway King
  • Patent number: 8738357
    Abstract: Systems, methods, and apparatus for accessing distributed models in automated machine processing, including using large language models in machine translation, speech recognition and other applications.
    Type: Grant
    Filed: October 22, 2012
    Date of Patent: May 27, 2014
    Assignee: Google Inc.
    Inventors: Franz Josef Och, Jeffrey Dean, Thorsten Brants, Alexander Mark Franz, Jay Ponte, Peng Xu, Sha-Mayn Teh, Jeffrey Chin, Ignacio E. Thayer, Anton Carver, Daniel Rosart, John S. Hawkins, Karel Driesen
  • Patent number: 8725491
    Abstract: An improved mobile electronic device and associated method enable the identification of previously-entered textual objects in one or more custom wordlists to identify possible transliterations of textual inputs. Such textual objects can be stored as data in a Pinyin custom wordlist if it is in the nature of Standard Mandarin characters or it can be stored in a BoPoMoFo custom wordlist if it is in the nature of Traditional Chinese characters, or it can be stored in both if it is in the nature of both Standard Mandarin characters and Traditional Chinese characters.
    Type: Grant
    Filed: May 9, 2013
    Date of Patent: May 13, 2014
    Assignee: BlackBerry Limited
    Inventors: Michael Elizarov, Xin Wang, Xiaoting Sun
  • Patent number: 8719003
    Abstract: A first test phrase can be rendered to a user in a first language. A first answer phrase in a second language can be received from the user. Based on the determination of that the first answer phrase is a translation of the first test phrase, a second test phrase in the first language can be rendered to the user. A second answer phrase in the second language can be received from the user. A set of parser data can be modified based upon the second answer phrase and the user can be permitted access to a resource.
    Type: Grant
    Filed: June 24, 2011
    Date of Patent: May 6, 2014
    Assignee: Google Inc.
    Inventors: Johnny Chen, Steve Langdon
  • Patent number: 8719004
    Abstract: A system, method and software product punctuates voicemail transcription text. A transcription text of the voicemail message is generated and the pauses between words of the transcribed text are determined. Ellipses are inserted into the transcription text at the position of “er” and “ahh” type words and pauses between words of the transcribed text.
    Type: Grant
    Filed: March 19, 2010
    Date of Patent: May 6, 2014
    Assignee: Ditech Networks, Inc.
    Inventor: James Siminoff
  • Patent number: 8666730
    Abstract: A question-answering system for searching exact answers in text documents provided in the electronic or digital form to questions formulated by user in the natural language is based on automatic semantic labeling of text documents and user questions. The system performs semantic labeling with the help of markers in terms of basic knowledge types, their components and attributes, in terms of question types from the predefined classifier for target words, and in terms of components of possible answers. A matching procedure makes use of mentioned types of semantic labels to determine exact answers to questions and present them to the user in the form of fragments of sentences or a newly synthesized phrase in the natural language. Users can independently add new types of questions to the system classifier and develop required linguistic patterns for the system linguistic knowledge base.
    Type: Grant
    Filed: March 12, 2010
    Date of Patent: March 4, 2014
    Assignee: Invention Machine Corporation
    Inventors: James Todhunter, Igor Sovpel, Dzianis Pastanohau
  • Patent number: 8666729
    Abstract: Creating and processing a natural language grammar set of data based on an input text string are disclosed. The method may include tagging the input text string, and examining, via a processor, the input text string for at least one first set of substitutions based on content of the input text string. The method may also include determining whether the input text string is a substring of a previously tagged input text string by comparing the input text string to a previously tagged input text string, such that the substring determination operation determines whether the input text string is wholly included in the previously tagged input text string.
    Type: Grant
    Filed: February 10, 2010
    Date of Patent: March 4, 2014
    Assignee: West Corporation
    Inventor: Steven John Schanbacher
  • Patent number: 8655642
    Abstract: An improved method of transliterating non-Latin input within an e-mail address field to the Latin equivalent. A routine in a handheld device is structured to detect a triggering event that indicates an e-mail address is being input into an e-mail address field. Following the triggering event, both prior and subsequent input is transliterated to Latin characters as these characters are required by Internet protocols. The transliteration routine may also be utilized to search an email address book wherein names are recorded using both Latin and non-Latin characters.
    Type: Grant
    Filed: May 4, 2013
    Date of Patent: February 18, 2014
    Assignee: BlackBerry Limited
    Inventors: Vadim Fux, Michael Elizarov, Dan Rubanovich
  • Patent number: 8655640
    Abstract: An unsupervised boosting strategy is applied to refining automatic word alignment. In some examples, the strategy improves the quality of automatic word alignment, for example for resource poor language pairs, thus improving Statistical Machine Translation (SMT) performance.
    Type: Grant
    Filed: March 2, 2011
    Date of Patent: February 18, 2014
    Assignee: Raytheon BBN Technologies Corp.
    Inventor: Sankaranarayanan Ananthakrishnan
  • Patent number: 8635060
    Abstract: A foreign language writing service method includes: recognizing, when a mixed text of foreign language portions and mother tongue portions is entered by a learner, the mother tongue portions from the mixed text; translating the mother tongue portions; combining a mother tongue translation result with the foreign language portions of the mixed text to generate a combined text; and providing the learner with the combined text of the mother tongue translation result and the foreign language portions of the mixed text.
    Type: Grant
    Filed: June 29, 2010
    Date of Patent: January 21, 2014
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Young Ae Seo, Chang Hyun Kim, Seong Il Yang, Jinxia Huang, Sung Kwon Choi, Ki Young Lee, Yoon Hyung Roh, Oh Woog Kwon, Yun Jin, Ying Shun Wu, Eun Jin Park, Young Kil Kim, Sang Kyu Park
  • Patent number: 8620793
    Abstract: A network clearinghouse may be provided that brings together organizations (subjects) requiring outsourcing of a service and service providers (operators). The clearinghouse manages the bidding and awarding of contracts, by collecting and authorizing requests for proposals (RFPs), sending bid invitations to operators that meet the requirements of the subject, sending a notification that the contract has been awarded, and collecting payment from the subject and paying the operator.
    Type: Grant
    Filed: June 1, 2010
    Date of Patent: December 31, 2013
    Assignee: SDL International America Incorporated
    Inventors: Iko Knyphausen, Jochen Hummel
  • Patent number: 8612207
    Abstract: Language analysis means 21 analyzes texts read from a text DB 11, and generates a sentence structure as the analysis result. Similar-structure generation adjustment means 25 generates, from an input of an input device, a determination item for determining whether or not the structures are identical every type of differences between the sentence structures. Similar-structure determination adjustment means 26 generates, from an input of the input device 6, a determination item for determining whether or not the difference between attribute values is ignored every type of attribute values. Similar-structure generating means 22 generates a similar structure of a partial structure forming the sentence structure obtained by language analysis means 21 in accordance with the determination item from the similar-structure generation adjustment means 25, and sets the generated similar structure as an equivalent class of the partial structure on the generation source.
    Type: Grant
    Filed: March 17, 2005
    Date of Patent: December 17, 2013
    Assignee: NEC Corporation
    Inventors: Yousuke Sakao, Kenji Satoh, Susumu Akamine
  • Patent number: 8589165
    Abstract: The present disclosure provides method and system for converting a free text expression of an identity to a phonetic equivalent code. The conversion follows a set of rules based on phonetic groupings and compresses the expression to a shorter series of characters than the expression. The phonetic equivalent code may be compared to one or more other phonetic equivalent code to establish a correlation between the codes. The phonetic equivalent code of the free text expression may be associated with the code of a known identity. The known identity may be provided to a user for confirmation of the identity. Further, a plurality of expressions stored in a database may be consolidated by converting the expressions to phonetic equivalent codes, comparing the codes to find correlations, and if appropriate reducing the number of expressions or mapping the expressions to a fewer number of expressions.
    Type: Grant
    Filed: January 24, 2012
    Date of Patent: November 19, 2013
    Assignee: United Services Automobile Association (USAA)
    Inventors: Gregory Brian Meyer, James Elden Nicholson
  • Patent number: 8566080
    Abstract: The present invention provides a method and system for text processing. The method comprises determining at least a part of characters in a text; dividing the text into a plurality of text segments by using the at least a part of characters as separators; and decoding the plurality of text segments respectively.
    Type: Grant
    Filed: April 29, 2010
    Date of Patent: October 22, 2013
    Assignee: International Business Machines Corporation
    Inventors: Bin Li, Li Qun Pang, Zhi Qiang Sha, Zhi Bo Zuo
  • Patent number: 8554542
    Abstract: A system and method are provided for processing an input document which enable assessment of the coherence of an abstract of the document. The method includes storing the document in memory and, for each sentence of the abstract, comparing the sentence with sentences of a main body of the document using textual entailment techniques to identify whether the sentence of the abstract entails a sentence in the main body of the document. Links can then be generated between the entailing sentences of the abstract and the corresponding entailed sentences of the document. The document and generated links are output. The links enable the coherence of the abstract to be evaluated, either manually or automatically, using an evaluation component of the system.
    Type: Grant
    Filed: May 5, 2010
    Date of Patent: October 8, 2013
    Assignee: Xerox Corporation
    Inventors: Ágnès Sandor, Guillaume Jacquet
  • Patent number: 8554558
    Abstract: An automated speech processing method, system and computer program product are disclosed. In one embodiment, a speech-to-text (STT) engine is used for converting an audio input to text data in a source language, and a machine translation (MT) engine is used for translating this text data to text data in a target language. In this embodiment, the text data in the target language is rendered on a display device, and different visualization schemes are applied to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine. In one embodiment, the defined characteristics include a defined confidence value representing the accuracy of the rendered text. For example, this confidence value may be based on both the accuracy of the conversion of the audio input and the accuracy of the translation of the text data to the target language.
    Type: Grant
    Filed: July 12, 2010
    Date of Patent: October 8, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Jeffrey S. McCarley, Leiming R. Qian
  • Patent number: 8543381
    Abstract: This invention is a method for “text morphing,” wherein text morphing involves integrating or blending together substantive content from two or more bodies of text into a single body of text based on locations of linguistic commonality among the two or more bodies of text. This method entails: identifying pairs of “Synonym-Different-Synonym” (SDS) text segments between an import body of text and an export body of text; and, for each selected pair of SDS text segments, substituting some or all of the SDS text segment from the export body of text for some or all of the SDS text segment in the import body of text. In some respects, this method is analogous to splicing and substituting gene segments with compatible starting and ending sequences, but different middle sequences. Text morphing as disclosed herein can be useful for creative ideation, product development, integrative search engines, and entertainment purposes.
    Type: Grant
    Filed: June 17, 2010
    Date of Patent: September 24, 2013
    Assignee: Holovisions LLC
    Inventor: Robert A. Connor
  • Patent number: 8538742
    Abstract: A system and method for translating a social feed is disclosed. The system comprises a communication module, a decoding engine and a re-encoding engine. The communication module receives social feed data and a request from a social network application. The social feed data is configured to cause a client to display a social feed in a first language. The request includes data indicating that the social feed should be displayed in a second language. The decoding engine decodes the social feed data to generate decoded social feed data. The re-encoding engine re-encodes the decoded social feed data to cause the client to display the social feed in the second language based at least in part on the request. The communication module sends the translated social feed data to the client.
    Type: Grant
    Filed: September 13, 2011
    Date of Patent: September 17, 2013
    Assignee: Google Inc.
    Inventors: Christopher R. Wren, Nadav Aharony
  • Patent number: 8527262
    Abstract: Systems and methods are provided for automated semantic role labeling for languages having complex morphology. In one aspect, a method for processing natural language text includes receiving as input a natural language text sentence comprising a sequence of white-space delimited words including inflicted words that are formed of morphemes including a stem and one or more affixes, identifying a target verb as a stem of an inflicted word in the text sentence, grouping morphemes from one or more inflicted words with the same syntactic role into constituents, and predicting a semantic role of a constituent for the target verb.
    Type: Grant
    Filed: June 22, 2007
    Date of Patent: September 3, 2013
    Assignee: International Business Machines Corporation
    Inventors: Nandakishore Kambhatla, Imed Zitouni
  • Patent number: 8521507
    Abstract: Training data in one language is leveraged to develop classifiers for multiple languages under circumstances where all of those classifiers will be performing the same kind of classification task, but relative to linguistically different sets of texts, thereby saving the cost of manually labeling a different set of training data for each language. Classification knowledge is learned for a source language in which training data are available. That knowledge is transferred to another target language's classifier through the integration of language transition knowledge. The transferred model is adjusted to better fit the target language.
    Type: Grant
    Filed: February 22, 2010
    Date of Patent: August 27, 2013
    Assignee: Yahoo! Inc.
    Inventors: Lei Shi, Mingjun Tian
  • Patent number: 8515729
    Abstract: The display language of a site may be changed to another alternate language by users of a site at any time. For example, a first user may access the same site in its default language (i.e. English) and a second user may access the site using their preferred language (i.e. French) even though the default language of the site is different from their preferred language. The language of the site may be changed from one language to another language at any time a user is accessing the site. Application content changes are identified by the site helping to ensure consistency between the default language and the alternate languages.
    Type: Grant
    Filed: March 31, 2008
    Date of Patent: August 20, 2013
    Assignee: Microsoft Corporation
    Inventors: Tomasz Grzegorz Tomko, Jay R. Rathi, Almon C. Dao
  • Patent number: 8510097
    Abstract: Computer methods, apparatus and articles of manufacture therefor, are disclosed for text-characterization using a finite state transducer that along each path accepts on a first side an n-gram of text-characterization (e.g., a language or a topic) and outputs on a second side a sequence of symbols identifying one or more text-characterizations from a set of text-characterizations. The finite state transducer is applied to input data. For each n-gram accepted by the finite state transducer, a frequency counter associated with the n-gram of the one or more text-characterizations in the set of text-characterizations is incremented. The input data is classified as one or more text-characterizations from the set of text-characterizations using the frequency counters associated therewith.
    Type: Grant
    Filed: December 18, 2008
    Date of Patent: August 13, 2013
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Lauri J Karttunen, Ji Fang