For Partial Translation Patents (Class 704/5)
-
Patent number: 9002699Abstract: Techniques and solutions are provided for adaptive input language switching. Automatic switching of a current input language can be performed by receiving an indication of a first language of displayed content (where the current input language is set to a second language different from the first language), receiving a selection of a text input area, and based on the selection, automatically switching the current input language from the second language to the first language. Switching of the current input language can also be performed by providing convenient user interface elements for selection by a user to automatically switch the current input language.Type: GrantFiled: November 14, 2011Date of Patent: April 7, 2015Assignee: Microsoft Technology Licensing, LLCInventor: Lijuan Qin
-
Patent number: 9002695Abstract: Provided are two types of translation engines. One of the translation engines translates a text written in a source language X into an intermediate language C, and the other translates the intermediate language C into a target language Y. The X-C translation engine translates an original text written in the source language X into a text in a predetermined intermediate language C as well as acquires and adds annotations obtained by this translation to the translated text written in the intermediate language C. The C-Y translation engine references annotations, which are obtained by the X-C translation engine and added to the text in the intermediate language C, and translates the text in the intermediate language into a text in the target language Y sentence.Type: GrantFiled: May 12, 2004Date of Patent: April 7, 2015Assignee: International Business Machines CorporationInventors: Hideo Watanabe, Hiroshi Kanayama
-
Patent number: 9002697Abstract: Systems for translating text messages in an instant messaging system comprise a translation engine for translating text messages into a preferred language of a recipient of the text messages. The systems are preferably configured to send and receive the text messages and to determine whether the text messages that are received in a source language are in the preferred language of the recipients so that the text messages are displayed in the preferred language of the recipients of the text messages. Other systems and methods are also provided.Type: GrantFiled: February 12, 2014Date of Patent: April 7, 2015Assignee: AT&T Intellectual Property I, L.P.Inventors: Joel A. Davis, Larry G. Kent, William Todd Daniell, Brian K. Daigle
-
Patent number: 9002696Abstract: A method, computer system, and computer program product for translating information. The computer system receives the information for a translation. The computer system identifies portions of the information based on a set of rules for security for the information in response to receiving the information. The computer system sends the portions of the information to a plurality of translation systems. In response to receiving translation results from the plurality of translation systems for respective portions of the information, the computer system combines the translation results for the respective portions to form a consolidated translation of the information.Type: GrantFiled: November 30, 2010Date of Patent: April 7, 2015Assignee: International Business Machines CorporationInventors: Carl J. Kraenzel, David M. Lubensky, Baiju Dhirajlal Mandalia, Cheng Wu
-
Patent number: 8996352Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations are described.Type: GrantFiled: June 3, 2014Date of Patent: March 31, 2015Assignee: Machine Zone, Inc.Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
-
Patent number: 8983850Abstract: A method and system provides a graphical user interface for instant messaging on any of a plurality of instant messaging networks. The interface provides a roster of contacts in each instant messaging network. Instant messages entered through the interface are machine translated into a preferred language for each intended recipient contact. The translated message is sent over the respective instant messaging networks of the intended recipient contacts. Response messages are translated into the source language of the user of the graphical user interface.Type: GrantFiled: July 20, 2012Date of Patent: March 17, 2015Assignee: Ortsbo Inc.Inventors: Mark Charles Hale, Leemon Baird
-
Patent number: 8977536Abstract: A system with a nonstatistical translation component integrated with a statistical translation component engine. The same corpus may be used for training the statistical engine and also for determining when to use the statistical engine and when to use the translation component. This training may use probabilistic techniques. Both the statistical engine and the translation components may be capable of translating the same information, however the system determines which component to use based on the training. Retraining can be carried out to add additional components, or when after additional translator training.Type: GrantFiled: June 3, 2008Date of Patent: March 10, 2015Assignee: University of Southern CaliforniaInventor: Franz Josef Och
-
Patent number: 8965750Abstract: A method is disclosed for translating a sentence from a source language or input language into an output language. The method includes analyzing a source sentence using linguistic descriptions of the source language, constructing a language-independent semantic structure to represent the meaning of the source sentence, and generating an output sentence to represent the meaning of the source sentence using linguistic descriptions of the output language. To improve the accuracy of translation, the analysis or synthesis stage may include ratings or statistics obtained by analyzing a corpus of parallel texts. Disambiguation is remedied automatically or through user input such as through user interface elements.Type: GrantFiled: November 17, 2011Date of Patent: February 24, 2015Assignee: ABBYY InfoPoisk LLCInventor: Tatiana Danielyan
-
Patent number: 8959013Abstract: A method, including presenting, by a computer system executing a non-tactile three dimensional user interface, a virtual keyboard on a display, the virtual keyboard including multiple virtual keys, and capturing a sequence of depth maps over time of a body part of a human subject. On the display, a cursor is presented at positions indicated by the body part in the captured sequence of depth maps, and one of the multiple virtual keys is selected in response to an interruption of a motion of the presented cursor in proximity to the one of the multiple virtual keys.Type: GrantFiled: September 25, 2011Date of Patent: February 17, 2015Assignee: Apple Inc.Inventors: Micha Galor, Ofir Or, Shai Litvak, Erez Sali
-
Patent number: 8954317Abstract: A method and apparatus of processing communications with a mobile station are disclosed. One example method may include receiving a message from the mobile station and processing the message by parsing the message and performing a natural language interpretation of the message and processing the parsed message to determine the user's requested objective. In response, the method may further provide generating a response to the message based on the user's requested objective and sending the response to the mobile station acknowledging the user's requested objective. Live agents may be notified to check the status of a message and continually override automated message responses to ensure the integrity of the responses.Type: GrantFiled: July 1, 2011Date of Patent: February 10, 2015Assignee: West CorporationInventors: Aaron Scott Fisher, Bruce Pollock, Silke Witt-ehsani
-
Patent number: 8954314Abstract: Disclosed is subject matter that provides a technique and a device that may include an accelerometer, a display device, an input device and a processor. The input device may receive textual information in a first language. The processor may be configured to generate a plurality of probable translation alternatives for a translation result. Each probable translation alternative may be a translation of the textual information into a second language. The processor may present a first of the plurality of probable translation alternatives on the display device in an alternate translation result dialog screen. Based on an accelerometer signal, the processor may determine whether the device is being shaken. In response to a determination the device is being shaken, the processor may present a second of the plurality of probable translation alternatives on the display device in place of the first of the plurality of probable translation alternatives.Type: GrantFiled: September 24, 2012Date of Patent: February 10, 2015Assignee: Google Inc.Inventor: Piotr Powalowski
-
Patent number: 8949112Abstract: One embodiment of the present invention is an XML application module that processes an XML character stream, which module includes an XML interface module, a parallel bit stream module, a lexical item stream module, a parser and a parsed data receiver. The XML interface module applies the XML character stream as input to the parallel bit stream module and the parser; the parallel bit stream module forms parallel bit streams and applies them as input to the lexical item stream module; the lexical stream module forms lexical item streams and applies them as input to the parser; the parser forms a stream of parsed XML data and applies it as input to the parsed data receiver; and the parsed data receiver processes the stream of parsed XML data. The parsed data receiver may be, for example, a communication module of a portable communication device.Type: GrantFiled: February 6, 2013Date of Patent: February 3, 2015Assignee: International Characters, Inc.Inventor: Robert D. Cameron
-
Patent number: 8949111Abstract: A method includes accessing text that includes a plurality of words, tagging each of the plurality of words with one of a plurality of parts of speech (POS) tags, and creating a plurality of tokens, each token comprising one of the plurality of words and its associated POS tag. The method further includes clustering one or more of the created tokens into a chunk of tokens, the one or more tokens clustered into the chunk of tokens based on the POS tags of the one or more tokens, and forming a phrase based on the chunk of tokens, the phrase comprising the words of the one or more tokens clustered into the chunk of tokens.Type: GrantFiled: December 14, 2011Date of Patent: February 3, 2015Assignee: Brainspace CorporationInventor: Paul A. Jakubik
-
Patent number: 8942974Abstract: A device searches for an available network. The device automatically sends a request message to an identified available network. After the device receives a response message from the identified available network, the device selects a language and/or other device setting based on contents of the response message.Type: GrantFiled: March 4, 2011Date of Patent: January 27, 2015Assignee: Amazon Technologies, Inc.Inventor: Jonathan A. Leblang
-
Patent number: 8942973Abstract: The present technology may translate a content of a web page such as content locator (e.g., a uniform resource locator (URL)) from a source language to a target language. The content locator may be associated with a content page. The translation may involve dividing the content locator into segment tokens in a first language, followed by translating, transliterating or not changing a segment token. The processed tokens are then reassembled in a second language. The translation may be provided by a translation module through a content page provided by a network browser.Type: GrantFiled: March 9, 2012Date of Patent: January 27, 2015Assignee: Language Weaver, Inc.Inventor: Narayanaswamy Viswanathan
-
Patent number: 8935151Abstract: A source language sentence is tagged with non-lexical tags, such as part-of-speech tags and is parsed using a lexicalized parser trained in the source language. A target language sentence that is a translation of the source language sentence is tagged with non-lexical labels (e.g., part-of speech tags) and is parsed using a delexicalized parser that has been trained in the source language to produce k-best parses. The best parse is selected based on the parse's alignment with lexicalized parse of the source language sentence. The selected best parse can be used to update the parameter vector of a lexicalized parser for the target language.Type: GrantFiled: December 7, 2011Date of Patent: January 13, 2015Assignee: Google Inc.Inventors: Slav Petrov, Ryan McDonald, Keith Hall
-
Patent number: 8933827Abstract: A data processing apparatus that is capable of reducing the garbling of characters caused by the difference among the character codes when setting data are transferred to another apparatus by the import-export function. A storage unit stores setting data for the data processing apparatus. A receiving unit receives an instruction for exporting the setting data stored in the storage unit. A converting unit converts Unicode data included in the setting data into character code data of language, which is set to the data processing apparatus. An export unit exports the character code data converted by the converting unit and the Unicode data.Type: GrantFiled: May 24, 2013Date of Patent: January 13, 2015Assignee: Canon Kabushiki KaishaInventor: Noritsugu Okayama
-
Patent number: 8930176Abstract: Techniques for interactively presenting word-alignments of multilingual translations and automatically improving those translations based upon user feedback are described herein. With one or more implementations of the techniques described herein, a word-alignment user-interface (UI) concurrently displays a pair of bilingual sentences, where one is a translation of the other, and interactively highlights linked (i.e., “word-aligned”) words and phrases of the pair. Other implementations of the techniques described herein offer an option for a user to provide feedback about the existing word-alignments or realign the words or phrases. In still other described implementations, word-alignment is automatically improved based upon that user feedback.Type: GrantFiled: April 1, 2010Date of Patent: January 6, 2015Assignee: Microsoft CorporationInventors: Henry Li, Matthew Robert Scott, Xiaohua Liu, Hao Wei, Ming Zhou
-
Publication number: 20150006146Abstract: In a computing resource environment including at least two different resource name spaces, a method for generating a fully-qualified name for a resource based on a context-based name of that resource and a usage context is disclosed. Method steps include receiving the resource's name schema, its context-based name, and usage context; comparing an entry in the schema and an entry in the context-based name and identifying schema monikers missing from the context-based name. For a missing moniker, determining whether it is an attribute space (aspace) moniker. If it is an aspace moniker, adding it to a full name that includes the context-based name. If it is not an aspace moniker, determining the aspace associated with the missing moniker; searching the usage context for an association between the missing moniker, its aspace, and a value assigned to the missing moniker; and appending the searched-for items to the full name.Type: ApplicationFiled: June 28, 2013Publication date: January 1, 2015Inventors: John WILKES, Todd PU-TSE WANG, Walfredo CIRNE, Brian GRANT, Indranil GUPTA
-
Patent number: 8923618Abstract: An expression, for which complementary information can be outputted, is extracted from a document obtained by character recognition for an image. Complementary information related to the extracted expression is outputted when a character or a symbol adjacent to the beginning or the end of the extracted expression is not a predetermined character or symbol. Output of complementary information related to the extracted expression is skipped when the character or symbol adjacent to the beginning or the end of the extracted expression is the predetermined character or symbol. A problem that complementary information unrelated to an original text is outputted is prevented even when a false character recognition occurs.Type: GrantFiled: September 14, 2012Date of Patent: December 30, 2014Assignee: Sharp Kabushiki KaishaInventor: Takeshi Kutsumi
-
Patent number: 8924195Abstract: In a machine-translation apparatus, an example storage unit stores therein target language examples in a target language and source language examples in a source language, while keeping the target language examples and the source language examples in correspondence with one another. An input receiving unit receives an input sentence in the source language. A searching unit conducts a search in the example storage unit for one of the target language examples corresponding to one of the source language examples that either matches or is similar to the input sentence. A translating unit generates a reverse-translated sentence by translating the one of the target language examples found in the search into the source language. A detecting unit detects a difference portion between the reverse-translated sentence and the input sentence. An output unit outputs the difference portion.Type: GrantFiled: September 19, 2008Date of Patent: December 30, 2014Assignee: Kabushiki Kaisha ToshibaInventors: Satoshi Kamatani, Tetsuro Chino, Kentaro Furihata
-
Patent number: 8918309Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.Type: GrantFiled: December 20, 2012Date of Patent: December 23, 2014Assignee: ABBYY InfoPoisk LLCInventors: Diar Tuganbaev, Konstantin Anisimovich, Konstantin Zuev, Vladimir Selegey
-
Patent number: 8909516Abstract: Computing functionality converts an input linguistic item into a normalized linguistic item, representing a normalized counterpart of the input linguistic item. In one environment, the input linguistic item corresponds to a complaint by a person receiving medical care, and the normalized linguistic item corresponds to a definitive and error-free version of that complaint. In operation, the computing functionality uses plural reference resources to expand the input linguistic item, creating an expanded linguistic item. The computing functionality then forms a graph based on candidate tokens that appear in the expanded linguistic item, and then finds a shortest path through the graph; that path corresponds to the normalized linguistic item. The computing functionality may use a statistical language model to assign weights to edges in the graph, and to determine whether the normalized linguistic incorporates two or more component linguistic items.Type: GrantFiled: December 7, 2011Date of Patent: December 9, 2014Assignee: Microsoft CorporationInventors: Julie A. Medero, Daniel S. Morris, Lucretia H. Vanderwende, Michael Gamon
-
Patent number: 8909513Abstract: Various embodiments provide a method that comprises receiving a set of segments from a text field, analyzing the set of segments to determine at least one of a target subtext or a target meaning associated with the set of segments, and identifying a set of candidate emoticons where each candidate emoticon in the set of candidate emoticons has an association between the candidate emoticon and at least one of the target subtext or the target meaning. The method may further comprise presenting the set of candidate emoticons for entry selection at a current position of an input cursor, receiving an entry selection for a set of selected emoticons from the set of candidate emoticons, and inserting the set of selected emoticons into the text field at the current position of the input cursor.Type: GrantFiled: April 2, 2014Date of Patent: December 9, 2014Assignee: Machine Zone, Inc.Inventor: Gabriel Leydon
-
Patent number: 8904517Abstract: A system and method for contextually interpreting image sequences are provided. The method comprises receiving video from one or more video sources, and generating one or more questions associated with one or more portions of the video based on at least one user-defined objective. The method further comprises sending the one or more portions of the video and the one or more questions to one or more assistants, receiving one or more answers to the one or more questions from the one or more assistants, and determining a contextual interpretation of the video based on the one or more answers and the video.Type: GrantFiled: June 28, 2011Date of Patent: December 2, 2014Assignee: International Business Machines CorporationInventors: Rajaraman Hariharan, Sri Ramanathan, Karthik Subbian, Matthew B. Trevathan
-
Patent number: 8903710Abstract: Verifying text translation in a program including obtaining a first program integrated information (PII) file and a second PII file of text in the program. An original page is generated by using the first PII file. Document object model (DOM) tree information of the original page is obtained. For each text unit in the original page: identification information corresponding to a first text unit in the first PII file is extracted; a second text unit is identified in the second PII file using the extracted identification information and a relationship between the identification information and the second text unit; and the first text unit is replaced in the DOM tree with the second text unit. A new page is generated using the text units from the second PII file.Type: GrantFiled: October 28, 2011Date of Patent: December 2, 2014Assignee: International Business Machines CorporationInventors: Ling Luo, Li Qiang Qi, Yan Min Sheng, Wei Ji Shi
-
Patent number: 8886515Abstract: Systems and methods for enhancing machine translation post edit review processes are provided herein. According to some embodiments, methods for displaying confidence estimations for machine translated segments of a source document may include executing instructions stored in memory, the instructions being executed by a processor to calculate a confidence estimation for a machine translated segment of a source document, compare the confidence estimation for the machine translated segment to one or more benchmark values, associate the machine translated segment with a color based upon the confidence estimation for the machine translated segment relative to the one or more benchmark values, and provide the machine translated segment having the color in a graphical format, to a client device.Type: GrantFiled: October 19, 2011Date of Patent: November 11, 2014Assignee: Language Weaver, Inc.Inventor: Gert Van Assche
-
Patent number: 8886518Abstract: A system and method for capitalizing translated text is provided. A capitalized source text is automatically translated to a target text. The target text is capitalized according to information in the capitalized source text.Type: GrantFiled: August 7, 2006Date of Patent: November 11, 2014Assignee: Language Weaver, Inc.Inventors: Wei Wang, Kevin Knight, Daniel Marcu
-
Patent number: 8880389Abstract: A method, computer program product and system are disclosed for determining the semantic density of textualized digital media (a measure of how much information is conveyed in a sentence or clause relative to its length). The more semantically dense text is, the more information it conveys in a given space. Users input a topic, a timeline, and one or more target web media sources for analysis. Text in the target media sources is deconstructed to determine density, and a density rating assigned to the web media source. Over time, users can track trends in the density of text media relative to a given topic, and determine how much information is being conveyed in connection with the topic, such as a political campaign. Line graphs, pie charts, and other time-elapsed output graphic representations of the semantic density are generated and rendered for the user.Type: GrantFiled: December 9, 2011Date of Patent: November 4, 2014Inventor: Igor Iofinov
-
Patent number: 8874427Abstract: Methods, systems and program product are disclosed for determining matching level of a text lookup segment with a plurality of source texts in a translation memory in terms of context. The invention determines exact matches for the lookup segment in the plurality of source texts, and determines, in the case that at least one exact match is determined, that a respective exact match is an in-context exact match for the lookup segment in the case that a context of the lookup segment matches that of the respective exact match. Degree of context matching required can be predetermined, and results prioritized. The invention also includes methods, systems and program products for storing a translation pair of source text and target text in a translation memory including context, and the translation memory so formed. The invention ensures that content is translated the same as previously translated content and reduces translator intervention.Type: GrantFiled: July 1, 2011Date of Patent: October 28, 2014Assignee: SDL Enterprise Technologies, Inc.Inventors: Russ Ross, Kevin Gillespie
-
Patent number: 8855995Abstract: Systems, methods, and apparatuses including computer program products for machine translation. A method is provided that includes generating a plurality of machine translation systems using a single machine translation engine, and generating a consensus translation from a plurality of candidate translations for a source sentence, where each candidate translation of the plurality of candidate translations is an output of a respective machine translation system of the plurality of machine translation systems.Type: GrantFiled: December 3, 2012Date of Patent: October 7, 2014Assignee: Google Inc.Inventors: Wolfgang Macherey, Franz Josef Och
-
Patent number: 8849665Abstract: A machine translation method, system for using the method, and computer readable media are disclosed. The method includes the steps of receiving a source language sentence, selecting a set of target language n-grams using a lexical classifier and based on the source language sentence. When selecting the set of target language n-grams, in at least one n-gram, n is greater than 1. The method continues by combining the selected set of target language n-grams as a finite state acceptor (FSA), weighting the FSA with data from the lexical classifier, and generating an n-best list of target sentences from the FSA. As an alternate to using the FSA, N strings may be generated from the n-grams and ranked using a language model. The N strings may be represented by an FSA for efficiency but it is not necessary.Type: GrantFiled: January 30, 2008Date of Patent: September 30, 2014Assignee: AT&T Intellectual Property I, L.P.Inventors: Srinivas Bangalore, Emil Ettelaie
-
Patent number: 8843360Abstract: Disclosed are various embodiments for client-side internationalization of network pages. A network page and code that localizes the network page are obtained from a server. The code that localizes the network page is executed in a client and determines a locale associated with the client. One or more internationalized elements are identified in the network page. The internationalized elements are replaced with corresponding localized translations. The network page is rendered for display in the client after the network page has been localized.Type: GrantFiled: March 4, 2011Date of Patent: September 23, 2014Assignee: Amazon Technologies, Inc.Inventors: Simon K. Johnston, Margaux Eng, James K. Keiger, Gideon Shavit
-
Patent number: 8831945Abstract: A text in a corpus including a set of world wide web (web) pages is analyzed. At least one word appropriate for a document type set according to a voice recognition target is extracted based on an analysis result. A word set is generated from the extracted at least one word. A retrieval engine is caused to perform a retrieval process using the generated word set as a retrieval query of the retrieval engine on the Internet, and a link to a web page from the retrieval result is acquired. A language model for voice recognition is generated from the acquired web page.Type: GrantFiled: October 12, 2011Date of Patent: September 9, 2014Assignee: NEC Informatec Systems, Ltd.Inventors: Kazuhiro Arai, Tadashi Emori
-
Patent number: 8831929Abstract: Methods, systems, and apparatus, including computer program products, in which an input method editor receives composition inputs and determines language context values based on the composition inputs. Candidate selections based on the language context values and the composition inputs are identified.Type: GrantFiled: September 23, 2013Date of Patent: September 9, 2014Assignee: Google Inc.Inventor: Feng Hong
-
Patent number: 8825466Abstract: Systems and methods for automatically modifying an annotated bilingual segment pair are provided. An annotated bilingual segment pair (“Pair”) may be modified to generate improved translation rules used in machine translation of documents from a source language to a target language. Because a single Pair may be used to translate a phrase, many Pairs are used in a machine translation system and manual correction of each model is impractical. Each Pair may be modified by re-labeling syntactic categories within the Pair, re-structuring a tree within the Pair, and/or re-aligning source words to target words within the Pair. In exemplary embodiments, many alternate Pairs (or portions thereof) are generated automatically, rule sequences corresponding to each are derived, and one or more rule sequences are selected. Using the selected rule sequence, a modified Pair is distilled.Type: GrantFiled: June 8, 2007Date of Patent: September 2, 2014Assignees: Language Weaver, Inc., University of Southern CaliforniaInventors: Wei Wang, Jonathan May, Kevin Knight
-
Patent number: 8825470Abstract: A system and method of providing a response with different language options for a data communication protocol, such as Session Initiation Protocol, are disclosed. For example, data communication is controlled between at least two endpoints. A response code indicative of a condition of the data communication is transmitted to one of the at least two endpoints. The response code is associated with a reason phrase operable to be displayed at the one of the at least two endpoints in a language selected from an option of a plurality of languages.Type: GrantFiled: September 27, 2007Date of Patent: September 2, 2014Assignee: Siemens Enterprise Communications Inc.Inventors: Mallikarjuna Samayamantry Rao, Dennis Kucmerowski
-
Patent number: 8818790Abstract: A syntactic analysis and hierarchical phrase model based machine translation system and method are provided. The machine translation system includes a word alignment module, a phrase extraction module, a gender syntactic annotation module, a syntactic based noncontiguous phrase abstract module, a noncontiguous phrase based translation module and an evaluation module. The machine translation system and method performs syntactic analysis based on a common contiguous phrase based machine translation model, and extracts a syntactic based noncontiguous phrase rule set from a bilingual aligned text, so as to address and issue of noncontiguous fixed custom in context of a whole sentence, and to comply with syntactic features of a language, translation is performed based on the noncontiguous phrase rule set and the phrase aligned table, and the translation results are evaluated based on the evaluation model, thereby improving the translation result.Type: GrantFiled: April 4, 2011Date of Patent: August 26, 2014Assignees: Samsung Electronics Co., Ltd., Samsung Electronics (China) R&D CenterInventors: Liang He, Zhangliang Xiong, Lei Wan
-
Patent number: 8818792Abstract: An apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus. Each of the apparatus or method recognizes a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpus, extracts a translation pattern candidate and an occurrence frequency using the recognized predicate and argument, and then generates a basic verb phrase translation pattern by verifying the translation pattern candidate, and generalizes the generated basic verb phrase translation pattern to generate a general verb phrase pattern so as to be applied to various language pairs and minimize an error in the verb phrase translation pattern and determine an appropriate generalization level using a co-occurrence frequency of a predicate and an argument of the verb phrase translation pattern and a translation probability of the predicate.Type: GrantFiled: July 8, 2011Date of Patent: August 26, 2014Assignee: SK Planet Co., Ltd.Inventors: Young Sook Hwang, Sang-Bum Kim, Chang Hao Yin, Hae Chang Rim, Joo Young Lee
-
Patent number: 8812291Abstract: Systems, methods, and computer program products for machine translation are provided. In some implementations a system is provided. The system includes a language model including a collection of n-grams from a corpus, each n-gram having a corresponding relative frequency in the corpus and an order n corresponding to a number of tokens in the n-gram, each n-gram corresponding to a backoff n-gram having an order of n?1 and a collection of backoff scores, each backoff score associated with an n-gram, the backoff score determined as a function of a backoff factor and a relative frequency of a corresponding backoff n-gram in the corpus.Type: GrantFiled: December 10, 2012Date of Patent: August 19, 2014Assignee: Google Inc.Inventors: Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean
-
Patent number: 8812301Abstract: A system and method for natural language processing of queries are provided. A lexicon includes text elements that are recognized as being a proper noun when capitalized. A natural language query includes a sequence of text elements including words. The query is processed. The processing includes a preprocessing step, in which part of speech features are assigned to the text elements in the query. This includes identifying, from a lexicon, a text element in the query which starts with a lowercase letter and assigning recapitalization information to the text element in the query, based on the lexicon. This information includes a part of speech feature of the capitalized form of the text element. Then parts of speech for the text elements in the query are disambiguated, which includes applying rules for recapitalizing text elements based on the recapitalization information.Type: GrantFiled: September 26, 2011Date of Patent: August 19, 2014Assignee: Xerox CorporationInventors: Caroline Brun, Vassilina Nikoulina, Nikolaos Lagos
-
Patent number: 8805677Abstract: Creating and processing a natural language grammar set of data based on an input text string are disclosed. The method may include tagging the input text string, and examining, via a processor, the input text string for at least one first set of substitutions based on content of the input text string. The method may also include determining whether the input text string is a substring of a previously tagged input text string by comparing the input text string to a previously tagged input text string, such that the substring determination operation determines whether the input text string is wholly included in the previously tagged input text string.Type: GrantFiled: February 4, 2014Date of Patent: August 12, 2014Assignee: West CorporationInventor: Steven John Schanbacher
-
Patent number: 8806326Abstract: A method and apparatus for user selectable content linking in internet applications. The usability of downloaded content may be modified for specific users by allowing a user to select the content in which and how their content data is linked. Particular text may be selected, and when content is downloaded from an origin service including the particular text, the text may be automatically linked to a service or database. The service or database is independent of the origin service and may be provided by a third party.Type: GrantFiled: September 25, 2000Date of Patent: August 12, 2014Assignee: Nokia Inc.Inventor: Mika Leppinen
-
Patent number: 8798986Abstract: Portable, real time voice translation systems, and associated methods of use, are provided. The systems include a translation system for use on a single unit, portable computing device and operable for accessing a multilanguage database, selecting a source language from a plurality of source languages and a destination language from a plurality of destination languages, inputting a source phrase, transmitting the source phrase to a speech recognition module, a translation engine, and a template look-up engine for finding the phrase template associated with the destination phrase from among the multiple languages. The spoken translation is then output in the selected destination language. The translation system has a total time between the input of the source phrase and output of the destination phrase that is no slower than 0.010 seconds, and a communications interface operable for communicating with a second computer system.Type: GrantFiled: December 26, 2012Date of Patent: August 5, 2014Assignee: NewTalk, Inc.Inventors: Robert H. Clemons, Bruce W. Nash, Martha P. Robinson, Craig A. Robinson
-
Patent number: 8788262Abstract: Embodiments of the invention provide techniques for incorporating variable values into textual content. In one embodiment, an abstract phrase including a text phrase and a variable at a particular position in the text phrase is received. The abstract phrase may include multiple variables. A text value for the variable is received. The text phrase of the abstract phrase is combined with the text value according to the particular position of the variable. An integration rule is applied at a boundary of the text phrase of the abstract phrase and the text value, where the integration rule is based on a language rule. The integration rule modifies a portion of the text phrase of the abstract phrase or a portion of the text value to produce an integrated phrase.Type: GrantFiled: February 1, 2013Date of Patent: July 22, 2014Assignee: Facebook, Inc.Inventor: David C. Ellis
-
Patent number: 8788260Abstract: Systems, methods, and computer storage media having computer-executable instructions embodied thereon that facilitate generation of snippets. In embodiments, text features within a keyword-sentence window are identified. The text features are utilized to determine break features that indicate favorability of breaking at a particular location of the keyword-sentence window. The break features are used to recognize features of partial snippets such that a snippet score to indicate the strength of the partial snippet can be calculated. Snippet scores associated with partial snippets are compared to select an optimal snippet, that is, the snippet having the highest snippet score.Type: GrantFiled: May 11, 2010Date of Patent: July 22, 2014Assignee: Microsoft CorporationInventors: Valerie Rose Nygaard, Riccardo Turchetto, Joanna Mun Yee Chan, Christian Biemann, David Dongjah Ahn, Andrea Ryerson Burbank, Feng Pan, Timothy McDonnell Converse, James Michael Reinhold, Tracy Holloway King
-
Patent number: 8781810Abstract: A method and a system for making merging decisions for a translation are disclosed which are suited to use where the target language is a productive compounding one. The method includes outputting decisions on merging of pairs of words in a translated text string with a merging system. The merging system can include a set of stored heuristics and/or a merging model. In the case of heuristics, these can include a heuristic by which two consecutive words in the string are considered for merging if the first word of the two consecutive words is recognized as a compound modifier and their observed frequency f1 as a closed compound word is larger than an observed frequency f2 of the two consecutive words as a bigram. In the case of a merging model, it can be one that is trained on features associated with pairs of consecutive tokens of text strings in a training set and predetermined merging decisions for the pairs.Type: GrantFiled: July 25, 2011Date of Patent: July 15, 2014Assignee: Xerox CorporationInventors: Nicola Cancedda, Sara Stymne
-
Patent number: 8775155Abstract: A system and method for machine translation are disclosed. Source sentences are received. For each source sentence, a target sentence comprising target words is generated. A plurality of translation neighbors of the target sentence is generated. Phrase alignments are computed between the source sentence and the translation neighbor. Translation neighbors are scored with a translation scoring model, based on the phrase alignment. Translation neighbors are ranked, based on the scores. In training the model, parameters of the model are updated based on an external ranking of the ranked translation neighbors. The generating of translation neighbors, scoring, ranking, and, in the case of training, updating the parameters, are iterated with one of the translation neighbors as the target sentence. In the case of decoding, one of the translation neighbors is output as a translation. The system and method may be at least partially implemented with a computer processor.Type: GrantFiled: October 25, 2010Date of Patent: July 8, 2014Assignee: Xerox CorporationInventors: Benjamin Roth, Andrew R. McCallum, Marc Dymetman, Nicola Cancedda
-
Patent number: 8762128Abstract: A translation system receives a test pair that includes a source test phrase in a first language and a target test phrase in a second language. The test pair can be evaluated by comparing its components with phrases in primary pairs. The test source phrase can be compared to a primary source phrase that is the phrase most commonly translated by the machine translation system into the test target phrase. The test target phrase can be compared to a primary target phrase that is the phrase into which the target source phrase is most often translated. If one and/or both comparisons are sufficiently dissimilar, the machine translation system can be modified by deleting the test pair, by flagging it for human review, or in other ways.Type: GrantFiled: May 31, 2011Date of Patent: June 24, 2014Assignee: Google Inc.Inventors: Thorsten Brants, Jan Pfeifer
-
Patent number: 8756050Abstract: Techniques, including systems and methods, for providing translated content are described and suggested herein. In an embodiment, translations for instances of content are received and reviewers provide votes regarding the translations' quality. Votes received for translations are utilized in order to determine scores for the translations. Reviewers and translators may be scored as well. Scores for translations are used in various processes for providing appropriate content to users.Type: GrantFiled: September 14, 2010Date of Patent: June 17, 2014Assignee: Amazon Technologies, Inc.Inventors: Roy N. Harkness, Bryan K. Beatty, Michael G. Curtis, Alexander Slutsker, Kelly L. Duggar