Punctuation Patents (Class 704/6)
-
Patent number: 12086559Abstract: A computer system extracts clauses using machine translation. An input sentence in a source language is translated into a translated sentence in a target language using a trained machine translation model, wherein the trained machine translation model inserts a grammatical indicator into a position of the translated sentence that identifies a dependent clause. The input sentence and the translated sentence are aligned to determine a position in the input sentence that corresponds to the position of the grammatical indicator in the translated sentence. The dependent clause is extracted, in the source language, from the input sentence based on the determined position in the input sentence. Embodiments of the present invention further include a method and program product for clause extraction using machine translation in substantially the same manner described above.Type: GrantFiled: March 31, 2021Date of Patent: September 10, 2024Assignee: International Business Machines CorporationInventors: Vadim Sheinin, Octavian Popescu, Ngoc Phuoc An Vo, Irene Lizeth Manotas Gutiérrez
-
Patent number: 11256879Abstract: Systems and methods for translating a source segment are disclosed. In embodiments, a computer-implemented method for translating a source segment comprises receiving, by a computing device, the source segment in a first language to be translated into a second language; identifying, by the computing device, linguistic markers within the source segment and associated noise values to produce a tagged source segment, wherein the linguistic markers are associated with one or more linguistic patterns likely to introduce noise into a translation channel; transforming, by the computing device, the tagged source segment into an amplified source segment; and sending, by the computing device, the amplified source segment to a machine translation module, wherein the machine translation module is configured to process the amplified source segment to produce a return amplified match in the second language.Type: GrantFiled: July 2, 2019Date of Patent: February 22, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alejandro Martinez Corria, Santiago Pont Nesta, Consuelo Rodríguez Magro, Francis X. Rojas, Linda F. Traudt, Saroj K. Vohra
-
Patent number: 11195507Abstract: Systems and methods are described herein for generating alternate audio for a media stream. The media system receives media that is requested by the user. The media comprises a video and audio. The audio includes words spoken in a first language. The media system stores the received media in a buffer as it is received. The media system separates the audio from the buffered media and determines an emotional state expressed by spoken words of the first language. The media system translates the words spoken in the first language into words spoken in a second language. Using the translated words of the second language, the media system synthesizes speech having the emotional state previously determined. The media system then retrieves the video of the received media from the buffer and synchronizes the synthesized speech with the video to generate the media content in a second language.Type: GrantFiled: October 4, 2018Date of Patent: December 7, 2021Assignee: Rovi Guides, Inc.Inventors: Vijay Kumar, Rajendran Pichaimurthy, Madhusudhan Seetharam
-
Patent number: 11062693Abstract: To provide a more natural sounding set of voice prompts of an interactive voice response (IVR) script, the voice recordings of the prompts may be modified to have a predetermined amount of silence at the end of the recording. The amount of silence required can be determined from the context in which the voice prompt appears in the IVR script. Different contexts may include mid-sentence, terminating in a comma, or a sentence ending context. These contexts may require silence periods of 100 ms, 250 ms and 500 ms respectively. Voice files may be trimmed to remove any existing silence and then the required silence period may be added.Type: GrantFiled: June 20, 2019Date of Patent: July 13, 2021Assignee: West CorporationInventors: Terry Olson, Mark Sempek, Roger Wehrle
-
Patent number: 10949620Abstract: A probabilistic framework to detect controversy on the web. The prior kNN-WC algorithm is recast into a theoretical framework and a new language model introduced. Language models are constructed that are used to calculate probabilities. The probabilities are compared to determine whether or not a certain document is controversial.Type: GrantFiled: October 6, 2017Date of Patent: March 16, 2021Assignee: UNIVERSITY OF MASSACHUSETTSInventors: Shiri Dori-Hacohen, James Allan, John Foley, Myung-ha Jang
-
Patent number: 10769386Abstract: An input terminology is received in a source language to determine a target language proposal. In a proposal engine when a configuration profile is available the following steps are performed. A set of target language equivalents are determined by applying a user-defined set of approaches to the input terminology. Statistics is computed corresponding to the set of target language equivalents. A target language proposal from the set of target language equivalents is determined based on the computed statistics. The target language proposal along with the computed statistics is displayed in a graphical user interface.Type: GrantFiled: December 5, 2017Date of Patent: September 8, 2020Assignee: SAP SEInventors: Michail Vasiltschenko, Benjamin Schork, Mark Childress, Annika Berger, Yvonne Miklosch, Thomas Alcantara Kliche, Hanne Smaadahl, Ying Zou
-
Patent number: 10650195Abstract: A bilingual-corpus generating method includes: acquiring a first clause in the first language, a first translated clause obtained by translating the first clause into a second language, a second clause in the first language, a second translated clause obtained by translating the second clause into the second language; and generating, as a translated clause of a third clause obtained by constructing the first clause and the second clause as one sentence by using a first conjunction in the first language, a third translated clause obtained by constructing the first translated clause and the second translated clause as one sentence by using a second conjunction in the second language.Type: GrantFiled: April 18, 2018Date of Patent: May 12, 2020Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.Inventors: Nanami Fujiwara, Masaki Yamauchi, Masahiro Imade
-
Patent number: 10614168Abstract: A translation apparatus that can perform translation based on a correct syntax, a syntax evaluation apparatus employed in the translation apparatus, a syntax evaluation method, and a syntax evaluation program are provided. A syntax evaluation unit 13 includes a grammatical element guessing part 131 that guesses a grammatical element that should be connected before and after or after a word string indicated by first fragment data, and a grammatical element complementing part 132 that complements, based on the guessing result of the grammatical element guessing part 131, the first fragment data as necessary to generate second fragment data. When the grammatical element guessing part 131 guesses a grammatical element, the grammatical element complementing part 132 complements, at a location to which the grammatical element of the word string indicated by the first fragment data should be connected, a tag indicating the grammatical element in order to generate the second fragment data.Type: GrantFiled: February 23, 2016Date of Patent: April 7, 2020Assignee: National University Corporation Nara Institute of Science and TechnologyInventors: Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura
-
Patent number: 10402499Abstract: A method includes performing, with at least one processing device, natural language understanding using both (i) a semantic word and clause representation generated from syntactically-labeled context and (ii) a syntax generated from common semantic relations between sequential words and clauses. The semantic word and clause representation and the syntax could be generated iteratively. The method could include generating the semantic word and clause representation, where a significance of particular items of context is modified by a presence of words or clauses from a given lexicon of amplifiers along a syntax tree. The method could also include generating the syntax, where multiple semantic relations are considered to be equivalent if the multiple semantic relations have identical representations in an auto-associative memory. The semantic word and clause representation could include content semantics constructed from closest nodes in a syntax tree to a clause.Type: GrantFiled: November 15, 2017Date of Patent: September 3, 2019Assignee: Goldman Sachs & Co. LLCInventor: Paul Burchard
-
Patent number: 10346547Abstract: A device for automatic translation of patent claims automatically translates a first language claim into a second language claim. The device includes an element noun retrieving module, a complement determining module, an organizing module. and an automatic computer translating module. The element noun retrieving module retrieves an element noun from the first language claim. The complement determining module determines a complement definition from the first language claim, as a combination of an interrelationship definition and object element. The organizing module forms an organized structure by organizing the element noun, interrelationship definition, object element and an organization identifying word.Type: GrantFiled: September 5, 2017Date of Patent: July 9, 2019Assignee: INTEGRAL SEARCH INTERNATIONAL LIMITEDInventor: Hong-Shin Tsai
-
Patent number: 9721573Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.Type: GrantFiled: December 16, 2014Date of Patent: August 1, 2017Assignee: MModal IP LLCInventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
-
Patent number: 9711147Abstract: Language dictation recognition systems and methods for using the same. In at least one exemplary system for analyzing verbal records, the system comprises a database capable of receiving a plurality of verbal records, the verbal record comprising at least one identifier and at least one verbal feature and a processor operably coupled to the database, where the processor has and executes a software program. The processor being operational to identify a subset of the plurality of verbal records from the database, extract at least one verbal feature from the identified records, analyze the at least one verbal feature of the subset of the plurality of verbal records, process the subset of the plurality of records using the analyzed feature according to at least one reasoning approach, generate a processed verbal record using the processed subset of the plurality of records, and deliver the processed verbal record to a recipient.Type: GrantFiled: May 25, 2016Date of Patent: July 18, 2017Assignee: InfraWare, Inc.Inventors: Nick Mahurin, Nathan Lindle, Markus Dickinson, Sandra Kuebler
-
Patent number: 9323854Abstract: A method, apparatus and system may enhance translation results from translation applications running on mobile devices. Specifically, according to one embodiment, location data may be automatically identified by a location awareness or location detection scheme (e.g., Global Positioning System or GPS, WiFi and/or 3G) on the device or coupled to the device. The location data may be utilized by the enhanced translation scheme according to an embodiment of the invention to pin point the specific physical geographical location of the device. Thereafter, if an enhanced local language dictionary exists that includes any local dialect details, the enhanced local language dictionary may be downloaded to the device. When the user desires to translate a word and/or phrase, the device may look up the meaning in the downloaded enhanced local language dictionary, thus ensuring that the translation is specific to that geographic location.Type: GrantFiled: December 19, 2008Date of Patent: April 26, 2016Assignee: Intel CorporationInventor: Lucas A. Massuh
-
Patent number: 9031849Abstract: A system for providing multi-language conference is provided. The system includes conference terminals and a multipoint control unit. The conference terminals are adapted to process a speech of a conference site, transmitting the processed speech to the multipoint control unit, process an audio data received from the multipoint control unit and output it. At least one of the conference terminals is an interpreting terminal adapted to interpret the speech of the conference according to the audio data transmitted from the multipoint control unit, process the interpreted audio data and output the processed audio data. The multipoint control unit is adapted to perform a sound mixing process of the audio data from the conference terminals in different sound channels according to language types, and then sends mixed audio data after the sound mixing process to the conference terminals.Type: GrantFiled: March 27, 2009Date of Patent: May 12, 2015Assignee: Huawei Technologies Co., Ltd.Inventors: Zhihui Liu, Zhonghui Yue
-
Patent number: 9002699Abstract: Techniques and solutions are provided for adaptive input language switching. Automatic switching of a current input language can be performed by receiving an indication of a first language of displayed content (where the current input language is set to a second language different from the first language), receiving a selection of a text input area, and based on the selection, automatically switching the current input language from the second language to the first language. Switching of the current input language can also be performed by providing convenient user interface elements for selection by a user to automatically switch the current input language.Type: GrantFiled: November 14, 2011Date of Patent: April 7, 2015Assignee: Microsoft Technology Licensing, LLCInventor: Lijuan Qin
-
Patent number: 8996357Abstract: A computer analyzes a statement written in a natural language to determine the types and structure of phrases that constitute the statement. A memory stores a plurality of translation rules for different types of phrases, each rule describing how a phrase is to be translated into graphical symbol datasets and relationship link datasets. Each graphical symbol dataset specifies a graphical symbol having a content label representing its content. The content label is or includes a word contained in the pertinent phrase. Each relationship link dataset specifies a relationship link that represents a relationship between graphical symbols and has a particular end shape to indicate what type of relationship it is. The computer translates each phrase of the statement into two or more graphical symbol datasets and one or more relationship link datasets, according to the translation rules pertinent to the determined phrase types.Type: GrantFiled: June 18, 2012Date of Patent: March 31, 2015Assignee: Fujitsu LimitedInventor: Kuniharu Takayama
-
Patent number: 8959013Abstract: A method, including presenting, by a computer system executing a non-tactile three dimensional user interface, a virtual keyboard on a display, the virtual keyboard including multiple virtual keys, and capturing a sequence of depth maps over time of a body part of a human subject. On the display, a cursor is presented at positions indicated by the body part in the captured sequence of depth maps, and one of the multiple virtual keys is selected in response to an interruption of a motion of the presented cursor in proximity to the one of the multiple virtual keys.Type: GrantFiled: September 25, 2011Date of Patent: February 17, 2015Assignee: Apple Inc.Inventors: Micha Galor, Ofir Or, Shai Litvak, Erez Sali
-
Patent number: 8949112Abstract: One embodiment of the present invention is an XML application module that processes an XML character stream, which module includes an XML interface module, a parallel bit stream module, a lexical item stream module, a parser and a parsed data receiver. The XML interface module applies the XML character stream as input to the parallel bit stream module and the parser; the parallel bit stream module forms parallel bit streams and applies them as input to the lexical item stream module; the lexical stream module forms lexical item streams and applies them as input to the parser; the parser forms a stream of parsed XML data and applies it as input to the parsed data receiver; and the parsed data receiver processes the stream of parsed XML data. The parsed data receiver may be, for example, a communication module of a portable communication device.Type: GrantFiled: February 6, 2013Date of Patent: February 3, 2015Assignee: International Characters, Inc.Inventor: Robert D. Cameron
-
Patent number: 8949111Abstract: A method includes accessing text that includes a plurality of words, tagging each of the plurality of words with one of a plurality of parts of speech (POS) tags, and creating a plurality of tokens, each token comprising one of the plurality of words and its associated POS tag. The method further includes clustering one or more of the created tokens into a chunk of tokens, the one or more tokens clustered into the chunk of tokens based on the POS tags of the one or more tokens, and forming a phrase based on the chunk of tokens, the phrase comprising the words of the one or more tokens clustered into the chunk of tokens.Type: GrantFiled: December 14, 2011Date of Patent: February 3, 2015Assignee: Brainspace CorporationInventor: Paul A. Jakubik
-
Patent number: 8918317Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.Type: GrantFiled: September 25, 2009Date of Patent: December 23, 2014Assignee: Multimodal Technologies, LLCInventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
-
Patent number: 8909516Abstract: Computing functionality converts an input linguistic item into a normalized linguistic item, representing a normalized counterpart of the input linguistic item. In one environment, the input linguistic item corresponds to a complaint by a person receiving medical care, and the normalized linguistic item corresponds to a definitive and error-free version of that complaint. In operation, the computing functionality uses plural reference resources to expand the input linguistic item, creating an expanded linguistic item. The computing functionality then forms a graph based on candidate tokens that appear in the expanded linguistic item, and then finds a shortest path through the graph; that path corresponds to the normalized linguistic item. The computing functionality may use a statistical language model to assign weights to edges in the graph, and to determine whether the normalized linguistic incorporates two or more component linguistic items.Type: GrantFiled: December 7, 2011Date of Patent: December 9, 2014Assignee: Microsoft CorporationInventors: Julie A. Medero, Daniel S. Morris, Lucretia H. Vanderwende, Michael Gamon
-
Patent number: 8886518Abstract: A system and method for capitalizing translated text is provided. A capitalized source text is automatically translated to a target text. The target text is capitalized according to information in the capitalized source text.Type: GrantFiled: August 7, 2006Date of Patent: November 11, 2014Assignee: Language Weaver, Inc.Inventors: Wei Wang, Kevin Knight, Daniel Marcu
-
Patent number: 8880389Abstract: A method, computer program product and system are disclosed for determining the semantic density of textualized digital media (a measure of how much information is conveyed in a sentence or clause relative to its length). The more semantically dense text is, the more information it conveys in a given space. Users input a topic, a timeline, and one or more target web media sources for analysis. Text in the target media sources is deconstructed to determine density, and a density rating assigned to the web media source. Over time, users can track trends in the density of text media relative to a given topic, and determine how much information is being conveyed in connection with the topic, such as a political campaign. Line graphs, pie charts, and other time-elapsed output graphic representations of the semantic density are generated and rendered for the user.Type: GrantFiled: December 9, 2011Date of Patent: November 4, 2014Inventor: Igor Iofinov
-
Patent number: 8855995Abstract: Systems, methods, and apparatuses including computer program products for machine translation. A method is provided that includes generating a plurality of machine translation systems using a single machine translation engine, and generating a consensus translation from a plurality of candidate translations for a source sentence, where each candidate translation of the plurality of candidate translations is an output of a respective machine translation system of the plurality of machine translation systems.Type: GrantFiled: December 3, 2012Date of Patent: October 7, 2014Assignee: Google Inc.Inventors: Wolfgang Macherey, Franz Josef Och
-
Patent number: 8843360Abstract: Disclosed are various embodiments for client-side internationalization of network pages. A network page and code that localizes the network page are obtained from a server. The code that localizes the network page is executed in a client and determines a locale associated with the client. One or more internationalized elements are identified in the network page. The internationalized elements are replaced with corresponding localized translations. The network page is rendered for display in the client after the network page has been localized.Type: GrantFiled: March 4, 2011Date of Patent: September 23, 2014Assignee: Amazon Technologies, Inc.Inventors: Simon K. Johnston, Margaux Eng, James K. Keiger, Gideon Shavit
-
Patent number: 8838434Abstract: Techniques disclosed herein include systems and methods for creating a bootstrap call router for other languages by using selected N-best translations. Techniques include using N-best translations from a machine translation system so as to increase a possibility that desired keywords in a target language are covered in the machine translation output. A 1-best translation is added to a new text corpus. This is followed by selecting a subset that provides a varied set of translations for a given source transcribed utterance for better translation coverage. Additional translations are added to the new text corpus based on a measure of possible translations having words not yet seen for the selected transcribed utterances, and also based on possible translation having words that are not associated with very many or semantic tags in the new text corpus. Candidate translations can be selected from a window of N-best translations calculated based on machine translation accuracy.Type: GrantFiled: July 29, 2011Date of Patent: September 16, 2014Assignee: Nuance Communications, Inc.Inventor: Ding Liu
-
Patent number: 8818792Abstract: An apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus. Each of the apparatus or method recognizes a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpus, extracts a translation pattern candidate and an occurrence frequency using the recognized predicate and argument, and then generates a basic verb phrase translation pattern by verifying the translation pattern candidate, and generalizes the generated basic verb phrase translation pattern to generate a general verb phrase pattern so as to be applied to various language pairs and minimize an error in the verb phrase translation pattern and determine an appropriate generalization level using a co-occurrence frequency of a predicate and an argument of the verb phrase translation pattern and a translation probability of the predicate.Type: GrantFiled: July 8, 2011Date of Patent: August 26, 2014Assignee: SK Planet Co., Ltd.Inventors: Young Sook Hwang, Sang-Bum Kim, Chang Hao Yin, Hae Chang Rim, Joo Young Lee
-
Patent number: 8805677Abstract: Creating and processing a natural language grammar set of data based on an input text string are disclosed. The method may include tagging the input text string, and examining, via a processor, the input text string for at least one first set of substitutions based on content of the input text string. The method may also include determining whether the input text string is a substring of a previously tagged input text string by comparing the input text string to a previously tagged input text string, such that the substring determination operation determines whether the input text string is wholly included in the previously tagged input text string.Type: GrantFiled: February 4, 2014Date of Patent: August 12, 2014Assignee: West CorporationInventor: Steven John Schanbacher
-
Patent number: 8798986Abstract: Portable, real time voice translation systems, and associated methods of use, are provided. The systems include a translation system for use on a single unit, portable computing device and operable for accessing a multilanguage database, selecting a source language from a plurality of source languages and a destination language from a plurality of destination languages, inputting a source phrase, transmitting the source phrase to a speech recognition module, a translation engine, and a template look-up engine for finding the phrase template associated with the destination phrase from among the multiple languages. The spoken translation is then output in the selected destination language. The translation system has a total time between the input of the source phrase and output of the destination phrase that is no slower than 0.010 seconds, and a communications interface operable for communicating with a second computer system.Type: GrantFiled: December 26, 2012Date of Patent: August 5, 2014Assignee: NewTalk, Inc.Inventors: Robert H. Clemons, Bruce W. Nash, Martha P. Robinson, Craig A. Robinson
-
Patent number: 8788260Abstract: Systems, methods, and computer storage media having computer-executable instructions embodied thereon that facilitate generation of snippets. In embodiments, text features within a keyword-sentence window are identified. The text features are utilized to determine break features that indicate favorability of breaking at a particular location of the keyword-sentence window. The break features are used to recognize features of partial snippets such that a snippet score to indicate the strength of the partial snippet can be calculated. Snippet scores associated with partial snippets are compared to select an optimal snippet, that is, the snippet having the highest snippet score.Type: GrantFiled: May 11, 2010Date of Patent: July 22, 2014Assignee: Microsoft CorporationInventors: Valerie Rose Nygaard, Riccardo Turchetto, Joanna Mun Yee Chan, Christian Biemann, David Dongjah Ahn, Andrea Ryerson Burbank, Feng Pan, Timothy McDonnell Converse, James Michael Reinhold, Tracy Holloway King
-
Patent number: 8738357Abstract: Systems, methods, and apparatus for accessing distributed models in automated machine processing, including using large language models in machine translation, speech recognition and other applications.Type: GrantFiled: October 22, 2012Date of Patent: May 27, 2014Assignee: Google Inc.Inventors: Franz Josef Och, Jeffrey Dean, Thorsten Brants, Alexander Mark Franz, Jay Ponte, Peng Xu, Sha-Mayn Teh, Jeffrey Chin, Ignacio E. Thayer, Anton Carver, Daniel Rosart, John S. Hawkins, Karel Driesen
-
Patent number: 8725491Abstract: An improved mobile electronic device and associated method enable the identification of previously-entered textual objects in one or more custom wordlists to identify possible transliterations of textual inputs. Such textual objects can be stored as data in a Pinyin custom wordlist if it is in the nature of Standard Mandarin characters or it can be stored in a BoPoMoFo custom wordlist if it is in the nature of Traditional Chinese characters, or it can be stored in both if it is in the nature of both Standard Mandarin characters and Traditional Chinese characters.Type: GrantFiled: May 9, 2013Date of Patent: May 13, 2014Assignee: BlackBerry LimitedInventors: Michael Elizarov, Xin Wang, Xiaoting Sun
-
Patent number: 8719004Abstract: A system, method and software product punctuates voicemail transcription text. A transcription text of the voicemail message is generated and the pauses between words of the transcribed text are determined. Ellipses are inserted into the transcription text at the position of “er” and “ahh” type words and pauses between words of the transcribed text.Type: GrantFiled: March 19, 2010Date of Patent: May 6, 2014Assignee: Ditech Networks, Inc.Inventor: James Siminoff
-
Patent number: 8719003Abstract: A first test phrase can be rendered to a user in a first language. A first answer phrase in a second language can be received from the user. Based on the determination of that the first answer phrase is a translation of the first test phrase, a second test phrase in the first language can be rendered to the user. A second answer phrase in the second language can be received from the user. A set of parser data can be modified based upon the second answer phrase and the user can be permitted access to a resource.Type: GrantFiled: June 24, 2011Date of Patent: May 6, 2014Assignee: Google Inc.Inventors: Johnny Chen, Steve Langdon
-
Patent number: 8666729Abstract: Creating and processing a natural language grammar set of data based on an input text string are disclosed. The method may include tagging the input text string, and examining, via a processor, the input text string for at least one first set of substitutions based on content of the input text string. The method may also include determining whether the input text string is a substring of a previously tagged input text string by comparing the input text string to a previously tagged input text string, such that the substring determination operation determines whether the input text string is wholly included in the previously tagged input text string.Type: GrantFiled: February 10, 2010Date of Patent: March 4, 2014Assignee: West CorporationInventor: Steven John Schanbacher
-
Question-answering system and method based on semantic labeling of text documents and user questions
Patent number: 8666730Abstract: A question-answering system for searching exact answers in text documents provided in the electronic or digital form to questions formulated by user in the natural language is based on automatic semantic labeling of text documents and user questions. The system performs semantic labeling with the help of markers in terms of basic knowledge types, their components and attributes, in terms of question types from the predefined classifier for target words, and in terms of components of possible answers. A matching procedure makes use of mentioned types of semantic labels to determine exact answers to questions and present them to the user in the form of fragments of sentences or a newly synthesized phrase in the natural language. Users can independently add new types of questions to the system classifier and develop required linguistic patterns for the system linguistic knowledge base.Type: GrantFiled: March 12, 2010Date of Patent: March 4, 2014Assignee: Invention Machine CorporationInventors: James Todhunter, Igor Sovpel, Dzianis Pastanohau -
Patent number: 8655640Abstract: An unsupervised boosting strategy is applied to refining automatic word alignment. In some examples, the strategy improves the quality of automatic word alignment, for example for resource poor language pairs, thus improving Statistical Machine Translation (SMT) performance.Type: GrantFiled: March 2, 2011Date of Patent: February 18, 2014Assignee: Raytheon BBN Technologies Corp.Inventor: Sankaranarayanan Ananthakrishnan
-
Patent number: 8655642Abstract: An improved method of transliterating non-Latin input within an e-mail address field to the Latin equivalent. A routine in a handheld device is structured to detect a triggering event that indicates an e-mail address is being input into an e-mail address field. Following the triggering event, both prior and subsequent input is transliterated to Latin characters as these characters are required by Internet protocols. The transliteration routine may also be utilized to search an email address book wherein names are recorded using both Latin and non-Latin characters.Type: GrantFiled: May 4, 2013Date of Patent: February 18, 2014Assignee: BlackBerry LimitedInventors: Vadim Fux, Michael Elizarov, Dan Rubanovich
-
Patent number: 8635060Abstract: A foreign language writing service method includes: recognizing, when a mixed text of foreign language portions and mother tongue portions is entered by a learner, the mother tongue portions from the mixed text; translating the mother tongue portions; combining a mother tongue translation result with the foreign language portions of the mixed text to generate a combined text; and providing the learner with the combined text of the mother tongue translation result and the foreign language portions of the mixed text.Type: GrantFiled: June 29, 2010Date of Patent: January 21, 2014Assignee: Electronics and Telecommunications Research InstituteInventors: Young Ae Seo, Chang Hyun Kim, Seong Il Yang, Jinxia Huang, Sung Kwon Choi, Ki Young Lee, Yoon Hyung Roh, Oh Woog Kwon, Yun Jin, Ying Shun Wu, Eun Jin Park, Young Kil Kim, Sang Kyu Park
-
Patent number: 8620793Abstract: A network clearinghouse may be provided that brings together organizations (subjects) requiring outsourcing of a service and service providers (operators). The clearinghouse manages the bidding and awarding of contracts, by collecting and authorizing requests for proposals (RFPs), sending bid invitations to operators that meet the requirements of the subject, sending a notification that the contract has been awarded, and collecting payment from the subject and paying the operator.Type: GrantFiled: June 1, 2010Date of Patent: December 31, 2013Assignee: SDL International America IncorporatedInventors: Iko Knyphausen, Jochen Hummel
-
Patent number: 8612207Abstract: Language analysis means 21 analyzes texts read from a text DB 11, and generates a sentence structure as the analysis result. Similar-structure generation adjustment means 25 generates, from an input of an input device, a determination item for determining whether or not the structures are identical every type of differences between the sentence structures. Similar-structure determination adjustment means 26 generates, from an input of the input device 6, a determination item for determining whether or not the difference between attribute values is ignored every type of attribute values. Similar-structure generating means 22 generates a similar structure of a partial structure forming the sentence structure obtained by language analysis means 21 in accordance with the determination item from the similar-structure generation adjustment means 25, and sets the generated similar structure as an equivalent class of the partial structure on the generation source.Type: GrantFiled: March 17, 2005Date of Patent: December 17, 2013Assignee: NEC CorporationInventors: Yousuke Sakao, Kenji Satoh, Susumu Akamine
-
Patent number: 8589165Abstract: The present disclosure provides method and system for converting a free text expression of an identity to a phonetic equivalent code. The conversion follows a set of rules based on phonetic groupings and compresses the expression to a shorter series of characters than the expression. The phonetic equivalent code may be compared to one or more other phonetic equivalent code to establish a correlation between the codes. The phonetic equivalent code of the free text expression may be associated with the code of a known identity. The known identity may be provided to a user for confirmation of the identity. Further, a plurality of expressions stored in a database may be consolidated by converting the expressions to phonetic equivalent codes, comparing the codes to find correlations, and if appropriate reducing the number of expressions or mapping the expressions to a fewer number of expressions.Type: GrantFiled: January 24, 2012Date of Patent: November 19, 2013Assignee: United Services Automobile Association (USAA)Inventors: Gregory Brian Meyer, James Elden Nicholson
-
Patent number: 8566080Abstract: The present invention provides a method and system for text processing. The method comprises determining at least a part of characters in a text; dividing the text into a plurality of text segments by using the at least a part of characters as separators; and decoding the plurality of text segments respectively.Type: GrantFiled: April 29, 2010Date of Patent: October 22, 2013Assignee: International Business Machines CorporationInventors: Bin Li, Li Qun Pang, Zhi Qiang Sha, Zhi Bo Zuo
-
Patent number: 8554558Abstract: An automated speech processing method, system and computer program product are disclosed. In one embodiment, a speech-to-text (STT) engine is used for converting an audio input to text data in a source language, and a machine translation (MT) engine is used for translating this text data to text data in a target language. In this embodiment, the text data in the target language is rendered on a display device, and different visualization schemes are applied to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine. In one embodiment, the defined characteristics include a defined confidence value representing the accuracy of the rendered text. For example, this confidence value may be based on both the accuracy of the conversion of the audio input and the accuracy of the translation of the text data to the target language.Type: GrantFiled: July 12, 2010Date of Patent: October 8, 2013Assignee: Nuance Communications, Inc.Inventors: Jeffrey S. McCarley, Leiming R. Qian
-
Patent number: 8554542Abstract: A system and method are provided for processing an input document which enable assessment of the coherence of an abstract of the document. The method includes storing the document in memory and, for each sentence of the abstract, comparing the sentence with sentences of a main body of the document using textual entailment techniques to identify whether the sentence of the abstract entails a sentence in the main body of the document. Links can then be generated between the entailing sentences of the abstract and the corresponding entailed sentences of the document. The document and generated links are output. The links enable the coherence of the abstract to be evaluated, either manually or automatically, using an evaluation component of the system.Type: GrantFiled: May 5, 2010Date of Patent: October 8, 2013Assignee: Xerox CorporationInventors: Ágnès Sandor, Guillaume Jacquet
-
Patent number: 8543381Abstract: This invention is a method for “text morphing,” wherein text morphing involves integrating or blending together substantive content from two or more bodies of text into a single body of text based on locations of linguistic commonality among the two or more bodies of text. This method entails: identifying pairs of “Synonym-Different-Synonym” (SDS) text segments between an import body of text and an export body of text; and, for each selected pair of SDS text segments, substituting some or all of the SDS text segment from the export body of text for some or all of the SDS text segment in the import body of text. In some respects, this method is analogous to splicing and substituting gene segments with compatible starting and ending sequences, but different middle sequences. Text morphing as disclosed herein can be useful for creative ideation, product development, integrative search engines, and entertainment purposes.Type: GrantFiled: June 17, 2010Date of Patent: September 24, 2013Assignee: Holovisions LLCInventor: Robert A. Connor
-
Patent number: 8538742Abstract: A system and method for translating a social feed is disclosed. The system comprises a communication module, a decoding engine and a re-encoding engine. The communication module receives social feed data and a request from a social network application. The social feed data is configured to cause a client to display a social feed in a first language. The request includes data indicating that the social feed should be displayed in a second language. The decoding engine decodes the social feed data to generate decoded social feed data. The re-encoding engine re-encodes the decoded social feed data to cause the client to display the social feed in the second language based at least in part on the request. The communication module sends the translated social feed data to the client.Type: GrantFiled: September 13, 2011Date of Patent: September 17, 2013Assignee: Google Inc.Inventors: Christopher R. Wren, Nadav Aharony
-
Patent number: 8527262Abstract: Systems and methods are provided for automated semantic role labeling for languages having complex morphology. In one aspect, a method for processing natural language text includes receiving as input a natural language text sentence comprising a sequence of white-space delimited words including inflicted words that are formed of morphemes including a stem and one or more affixes, identifying a target verb as a stem of an inflicted word in the text sentence, grouping morphemes from one or more inflicted words with the same syntactic role into constituents, and predicting a semantic role of a constituent for the target verb.Type: GrantFiled: June 22, 2007Date of Patent: September 3, 2013Assignee: International Business Machines CorporationInventors: Nandakishore Kambhatla, Imed Zitouni
-
Patent number: 8521507Abstract: Training data in one language is leveraged to develop classifiers for multiple languages under circumstances where all of those classifiers will be performing the same kind of classification task, but relative to linguistically different sets of texts, thereby saving the cost of manually labeling a different set of training data for each language. Classification knowledge is learned for a source language in which training data are available. That knowledge is transferred to another target language's classifier through the integration of language transition knowledge. The transferred model is adjusted to better fit the target language.Type: GrantFiled: February 22, 2010Date of Patent: August 27, 2013Assignee: Yahoo! Inc.Inventors: Lei Shi, Mingjun Tian
-
Patent number: 8515729Abstract: The display language of a site may be changed to another alternate language by users of a site at any time. For example, a first user may access the same site in its default language (i.e. English) and a second user may access the site using their preferred language (i.e. French) even though the default language of the site is different from their preferred language. The language of the site may be changed from one language to another language at any time a user is accessing the site. Application content changes are identified by the site helping to ensure consistency between the default language and the alternate languages.Type: GrantFiled: March 31, 2008Date of Patent: August 20, 2013Assignee: Microsoft CorporationInventors: Tomasz Grzegorz Tomko, Jay R. Rathi, Almon C. Dao