For Partial Translation Patents (Class 704/5)
  • Patent number: 9002699
    Abstract: Techniques and solutions are provided for adaptive input language switching. Automatic switching of a current input language can be performed by receiving an indication of a first language of displayed content (where the current input language is set to a second language different from the first language), receiving a selection of a text input area, and based on the selection, automatically switching the current input language from the second language to the first language. Switching of the current input language can also be performed by providing convenient user interface elements for selection by a user to automatically switch the current input language.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: April 7, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventor: Lijuan Qin
  • Patent number: 9002695
    Abstract: Provided are two types of translation engines. One of the translation engines translates a text written in a source language X into an intermediate language C, and the other translates the intermediate language C into a target language Y. The X-C translation engine translates an original text written in the source language X into a text in a predetermined intermediate language C, and also acquires annotations obtained by this translation and adds them to the translated text written in the intermediate language C. The C-Y translation engine references the annotations, which are obtained by the X-C translation engine and added to the text in the intermediate language C, and translates the text in the intermediate language C into a text in the target language Y.
    Type: Grant
    Filed: May 12, 2004
    Date of Patent: April 7, 2015
    Assignee: International Business Machines Corporation
    Inventors: Hideo Watanabe, Hiroshi Kanayama
  • Patent number: 9002697
    Abstract: Systems for translating text messages in an instant messaging system comprise a translation engine for translating text messages into a preferred language of a recipient of the text messages. The systems are preferably configured to send and receive the text messages and to determine whether the text messages that are received in a source language are in the preferred language of the recipients so that the text messages are displayed in the preferred language of the recipients of the text messages. Other systems and methods are also provided.
    Type: Grant
    Filed: February 12, 2014
    Date of Patent: April 7, 2015
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Joel A. Davis, Larry G. Kent, William Todd Daniell, Brian K. Daigle
  • Patent number: 9002696
    Abstract: A method, computer system, and computer program product for translating information. The computer system receives the information for a translation. The computer system identifies portions of the information based on a set of rules for security for the information in response to receiving the information. The computer system sends the portions of the information to a plurality of translation systems. In response to receiving translation results from the plurality of translation systems for respective portions of the information, the computer system combines the translation results for the respective portions to form a consolidated translation of the information.
    Type: Grant
    Filed: November 30, 2010
    Date of Patent: April 7, 2015
    Assignee: International Business Machines Corporation
    Inventors: Carl J. Kraenzel, David M. Lubensky, Baiju Dhirajlal Mandalia, Cheng Wu
  • Patent number: 8996352
    Abstract: Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of communication including, for example, Internet-based chat, e-mail, text-based mobile phone communications, postings to online forums, postings to online social media services, and the like. Certain embodiments implement communication systems and methods that translate text between two or more languages. Users of the systems and methods may be incentivized to submit corrections for inaccurate or erroneous translations, and may receive a reward for these submissions. Systems and methods for assessing the accuracy of translations are described.
    Type: Grant
    Filed: June 3, 2014
    Date of Patent: March 31, 2015
    Assignee: Machine Zone, Inc.
    Inventors: Francois Orsini, Nikhil Bojja, Arun Nedunchezhian
  • Patent number: 8983850
    Abstract: A method and system provide a graphical user interface for instant messaging on any of a plurality of instant messaging networks. The interface provides a roster of contacts in each instant messaging network. Instant messages entered through the interface are machine translated into a preferred language for each intended recipient contact. The translated message is sent over the respective instant messaging networks of the intended recipient contacts. Response messages are translated into the source language of the user of the graphical user interface.
    Type: Grant
    Filed: July 20, 2012
    Date of Patent: March 17, 2015
    Assignee: Ortsbo Inc.
    Inventors: Mark Charles Hale, Leemon Baird
  • Patent number: 8977536
    Abstract: A system with a nonstatistical translation component integrated with a statistical translation component engine. The same corpus may be used for training the statistical engine and also for determining when to use the statistical engine and when to use the translation component. This training may use probabilistic techniques. Both the statistical engine and the translation components may be capable of translating the same information; however, the system determines which component to use based on the training. Retraining can be carried out to add additional components, or after additional translator training.
    Type: Grant
    Filed: June 3, 2008
    Date of Patent: March 10, 2015
    Assignee: University of Southern California
    Inventor: Franz Josef Och
  • Patent number: 8965750
    Abstract: A method is disclosed for translating a sentence from a source language or input language into an output language. The method includes analyzing a source sentence using linguistic descriptions of the source language, constructing a language-independent semantic structure to represent the meaning of the source sentence, and generating an output sentence to represent the meaning of the source sentence using linguistic descriptions of the output language. To improve the accuracy of translation, the analysis or synthesis stage may include ratings or statistics obtained by analyzing a corpus of parallel texts. Ambiguity is resolved automatically or through user input, such as through user interface elements.
    Type: Grant
    Filed: November 17, 2011
    Date of Patent: February 24, 2015
    Assignee: ABBYY InfoPoisk LLC
    Inventor: Tatiana Danielyan
  • Patent number: 8959013
    Abstract: A method, including presenting, by a computer system executing a non-tactile three dimensional user interface, a virtual keyboard on a display, the virtual keyboard including multiple virtual keys, and capturing a sequence of depth maps over time of a body part of a human subject. On the display, a cursor is presented at positions indicated by the body part in the captured sequence of depth maps, and one of the multiple virtual keys is selected in response to an interruption of a motion of the presented cursor in proximity to the one of the multiple virtual keys.
    Type: Grant
    Filed: September 25, 2011
    Date of Patent: February 17, 2015
    Assignee: Apple Inc.
    Inventors: Micha Galor, Ofir Or, Shai Litvak, Erez Sali
  • Patent number: 8954317
    Abstract: A method and apparatus of processing communications with a mobile station are disclosed. One example method may include receiving a message from the mobile station and processing the message by parsing the message and performing a natural language interpretation of the message and processing the parsed message to determine the user's requested objective. In response, the method may further provide generating a response to the message based on the user's requested objective and sending the response to the mobile station acknowledging the user's requested objective. Live agents may be notified to check the status of a message and continually override automated message responses to ensure the integrity of the responses.
    Type: Grant
    Filed: July 1, 2011
    Date of Patent: February 10, 2015
    Assignee: West Corporation
    Inventors: Aaron Scott Fisher, Bruce Pollock, Silke Witt-ehsani
  • Patent number: 8954314
    Abstract: Disclosed is subject matter that provides a technique and a device that may include an accelerometer, a display device, an input device and a processor. The input device may receive textual information in a first language. The processor may be configured to generate a plurality of probable translation alternatives for a translation result. Each probable translation alternative may be a translation of the textual information into a second language. The processor may present a first of the plurality of probable translation alternatives on the display device in an alternate translation result dialog screen. Based on an accelerometer signal, the processor may determine whether the device is being shaken. In response to a determination that the device is being shaken, the processor may present a second of the plurality of probable translation alternatives on the display device in place of the first of the plurality of probable translation alternatives.
    Type: Grant
    Filed: September 24, 2012
    Date of Patent: February 10, 2015
    Assignee: Google Inc.
    Inventor: Piotr Powalowski
  • Patent number: 8949112
    Abstract: One embodiment of the present invention is an XML application module that processes an XML character stream, which module includes an XML interface module, a parallel bit stream module, a lexical item stream module, a parser and a parsed data receiver. The XML interface module applies the XML character stream as input to the parallel bit stream module and the parser; the parallel bit stream module forms parallel bit streams and applies them as input to the lexical item stream module; the lexical item stream module forms lexical item streams and applies them as input to the parser; the parser forms a stream of parsed XML data and applies it as input to the parsed data receiver; and the parsed data receiver processes the stream of parsed XML data. The parsed data receiver may be, for example, a communication module of a portable communication device.
    Type: Grant
    Filed: February 6, 2013
    Date of Patent: February 3, 2015
    Assignee: International Characters, Inc.
    Inventor: Robert D. Cameron
  • Patent number: 8949111
    Abstract: A method includes accessing text that includes a plurality of words, tagging each of the plurality of words with one of a plurality of parts of speech (POS) tags, and creating a plurality of tokens, each token comprising one of the plurality of words and its associated POS tag. The method further includes clustering one or more of the created tokens into a chunk of tokens, the one or more tokens clustered into the chunk of tokens based on the POS tags of the one or more tokens, and forming a phrase based on the chunk of tokens, the phrase comprising the words of the one or more tokens clustered into the chunk of tokens.
    Type: Grant
    Filed: December 14, 2011
    Date of Patent: February 3, 2015
    Assignee: Brainspace Corporation
    Inventor: Paul A. Jakubik
  • Patent number: 8942974
    Abstract: A device searches for an available network. The device automatically sends a request message to an identified available network. After the device receives a response message from the identified available network, the device selects a language and/or other device setting based on contents of the response message.
    Type: Grant
    Filed: March 4, 2011
    Date of Patent: January 27, 2015
    Assignee: Amazon Technologies, Inc.
    Inventor: Jonathan A. Leblang
  • Patent number: 8942973
    Abstract: The present technology may translate content of a web page, such as a content locator (e.g., a uniform resource locator (URL)), from a source language to a target language. The content locator may be associated with a content page. The translation may involve dividing the content locator into segment tokens in a first language, followed by translating, transliterating or not changing a segment token. The processed tokens are then reassembled in a second language. The translation may be provided by a translation module through a content page provided by a network browser.
    Type: Grant
    Filed: March 9, 2012
    Date of Patent: January 27, 2015
    Assignee: Language Weaver, Inc.
    Inventor: Narayanaswamy Viswanathan
  • Patent number: 8935151
    Abstract: A source language sentence is tagged with non-lexical tags, such as part-of-speech tags, and is parsed using a lexicalized parser trained in the source language. A target language sentence that is a translation of the source language sentence is tagged with non-lexical labels (e.g., part-of-speech tags) and is parsed using a delexicalized parser that has been trained in the source language to produce k-best parses. The best parse is selected based on the parse's alignment with the lexicalized parse of the source language sentence. The selected best parse can be used to update the parameter vector of a lexicalized parser for the target language.
    Type: Grant
    Filed: December 7, 2011
    Date of Patent: January 13, 2015
    Assignee: Google Inc.
    Inventors: Slav Petrov, Ryan McDonald, Keith Hall
  • Patent number: 8933827
    Abstract: A data processing apparatus that is capable of reducing the garbling of characters caused by differences among character codes when setting data are transferred to another apparatus by the import-export function. A storage unit stores setting data for the data processing apparatus. A receiving unit receives an instruction for exporting the setting data stored in the storage unit. A converting unit converts Unicode data included in the setting data into character code data of the language that is set in the data processing apparatus. An export unit exports the character code data converted by the converting unit and the Unicode data.
    Type: Grant
    Filed: May 24, 2013
    Date of Patent: January 13, 2015
    Assignee: Canon Kabushiki Kaisha
    Inventor: Noritsugu Okayama
  • Patent number: 8930176
    Abstract: Techniques for interactively presenting word-alignments of multilingual translations and automatically improving those translations based upon user feedback are described herein. With one or more implementations of the techniques described herein, a word-alignment user-interface (UI) concurrently displays a pair of bilingual sentences, where one is a translation of the other, and interactively highlights linked (i.e., “word-aligned”) words and phrases of the pair. Other implementations of the techniques described herein offer an option for a user to provide feedback about the existing word-alignments or realign the words or phrases. In still other described implementations, word-alignment is automatically improved based upon that user feedback.
    Type: Grant
    Filed: April 1, 2010
    Date of Patent: January 6, 2015
    Assignee: Microsoft Corporation
    Inventors: Henry Li, Matthew Robert Scott, Xiaohua Liu, Hao Wei, Ming Zhou
  • Publication number: 20150006146
    Abstract: In a computing resource environment including at least two different resource name spaces, a method for generating a fully-qualified name for a resource based on a context-based name of that resource and a usage context is disclosed. Method steps include receiving the resource's name schema, its context-based name, and usage context; comparing an entry in the schema and an entry in the context-based name and identifying schema monikers missing from the context-based name. For a missing moniker, determining whether it is an attribute space (aspace) moniker. If it is an aspace moniker, adding it to a full name that includes the context-based name. If it is not an aspace moniker, determining the aspace associated with the missing moniker; searching the usage context for an association between the missing moniker, its aspace, and a value assigned to the missing moniker; and appending the searched-for items to the full name.
    Type: Application
    Filed: June 28, 2013
    Publication date: January 1, 2015
    Inventors: John WILKES, Todd PU-TSE WANG, Walfredo CIRNE, Brian GRANT, Indranil GUPTA
  • Patent number: 8923618
    Abstract: An expression, for which complementary information can be outputted, is extracted from a document obtained by character recognition for an image. Complementary information related to the extracted expression is outputted when a character or a symbol adjacent to the beginning or the end of the extracted expression is not a predetermined character or symbol. Output of complementary information related to the extracted expression is skipped when the character or symbol adjacent to the beginning or the end of the extracted expression is the predetermined character or symbol. A problem that complementary information unrelated to an original text is outputted is prevented even when a false character recognition occurs.
    Type: Grant
    Filed: September 14, 2012
    Date of Patent: December 30, 2014
    Assignee: Sharp Kabushiki Kaisha
    Inventor: Takeshi Kutsumi
  • Patent number: 8924195
    Abstract: In a machine-translation apparatus, an example storage unit stores therein target language examples in a target language and source language examples in a source language, while keeping the target language examples and the source language examples in correspondence with one another. An input receiving unit receives an input sentence in the source language. A searching unit conducts a search in the example storage unit for one of the target language examples corresponding to one of the source language examples that either matches or is similar to the input sentence. A translating unit generates a reverse-translated sentence by translating the one of the target language examples found in the search into the source language. A detecting unit detects a difference portion between the reverse-translated sentence and the input sentence. An output unit outputs the difference portion.
    Type: Grant
    Filed: September 19, 2008
    Date of Patent: December 30, 2014
    Assignee: Kabushiki Kaisha Toshiba
    Inventors: Satoshi Kamatani, Tetsuro Chino, Kentaro Furihata
  • Patent number: 8918309
    Abstract: In one embodiment, the invention provides a method for machine translation of a source document in an input language to a target document in an output language, comprising generating translation options corresponding to at least portions of each sentence in the input language; and selecting a translation option for the sentence based on statistics associated with the translation options.
    Type: Grant
    Filed: December 20, 2012
    Date of Patent: December 23, 2014
    Assignee: ABBYY InfoPoisk LLC
    Inventors: Diar Tuganbaev, Konstantin Anisimovich, Konstantin Zuev, Vladimir Selegey
  • Patent number: 8909516
    Abstract: Computing functionality converts an input linguistic item into a normalized linguistic item, representing a normalized counterpart of the input linguistic item. In one environment, the input linguistic item corresponds to a complaint by a person receiving medical care, and the normalized linguistic item corresponds to a definitive and error-free version of that complaint. In operation, the computing functionality uses plural reference resources to expand the input linguistic item, creating an expanded linguistic item. The computing functionality then forms a graph based on candidate tokens that appear in the expanded linguistic item, and then finds a shortest path through the graph; that path corresponds to the normalized linguistic item. The computing functionality may use a statistical language model to assign weights to edges in the graph, and to determine whether the normalized linguistic item incorporates two or more component linguistic items.
    Type: Grant
    Filed: December 7, 2011
    Date of Patent: December 9, 2014
    Assignee: Microsoft Corporation
    Inventors: Julie A. Medero, Daniel S. Morris, Lucretia H. Vanderwende, Michael Gamon
  • Patent number: 8909513
    Abstract: Various embodiments provide a method that comprises receiving a set of segments from a text field, analyzing the set of segments to determine at least one of a target subtext or a target meaning associated with the set of segments, and identifying a set of candidate emoticons where each candidate emoticon in the set of candidate emoticons has an association between the candidate emoticon and at least one of the target subtext or the target meaning. The method may further comprise presenting the set of candidate emoticons for entry selection at a current position of an input cursor, receiving an entry selection for a set of selected emoticons from the set of candidate emoticons, and inserting the set of selected emoticons into the text field at the current position of the input cursor.
    Type: Grant
    Filed: April 2, 2014
    Date of Patent: December 9, 2014
    Assignee: Machine Zone, Inc.
    Inventor: Gabriel Leydon
  • Patent number: 8904517
    Abstract: A system and method for contextually interpreting image sequences are provided. The method comprises receiving video from one or more video sources, and generating one or more questions associated with one or more portions of the video based on at least one user-defined objective. The method further comprises sending the one or more portions of the video and the one or more questions to one or more assistants, receiving one or more answers to the one or more questions from the one or more assistants, and determining a contextual interpretation of the video based on the one or more answers and the video.
    Type: Grant
    Filed: June 28, 2011
    Date of Patent: December 2, 2014
    Assignee: International Business Machines Corporation
    Inventors: Rajaraman Hariharan, Sri Ramanathan, Karthik Subbian, Matthew B. Trevathan
  • Patent number: 8903710
    Abstract: Verifying text translation in a program including obtaining a first program integrated information (PII) file and a second PII file of text in the program. An original page is generated by using the first PII file. Document object model (DOM) tree information of the original page is obtained. For each text unit in the original page: identification information corresponding to a first text unit in the first PII file is extracted; a second text unit is identified in the second PII file using the extracted identification information and a relationship between the identification information and the second text unit; and the first text unit is replaced in the DOM tree with the second text unit. A new page is generated using the text units from the second PII file.
    Type: Grant
    Filed: October 28, 2011
    Date of Patent: December 2, 2014
    Assignee: International Business Machines Corporation
    Inventors: Ling Luo, Li Qiang Qi, Yan Min Sheng, Wei Ji Shi
  • Patent number: 8886515
    Abstract: Systems and methods for enhancing machine translation post edit review processes are provided herein. According to some embodiments, methods for displaying confidence estimations for machine translated segments of a source document may include executing instructions stored in memory, the instructions being executed by a processor to calculate a confidence estimation for a machine translated segment of a source document, compare the confidence estimation for the machine translated segment to one or more benchmark values, associate the machine translated segment with a color based upon the confidence estimation for the machine translated segment relative to the one or more benchmark values, and provide the machine translated segment having the color in a graphical format, to a client device.
    Type: Grant
    Filed: October 19, 2011
    Date of Patent: November 11, 2014
    Assignee: Language Weaver, Inc.
    Inventor: Gert Van Assche
  • Patent number: 8886518
    Abstract: A system and method for capitalizing translated text is provided. A capitalized source text is automatically translated to a target text. The target text is capitalized according to information in the capitalized source text.
    Type: Grant
    Filed: August 7, 2006
    Date of Patent: November 11, 2014
    Assignee: Language Weaver, Inc.
    Inventors: Wei Wang, Kevin Knight, Daniel Marcu
  • Patent number: 8880389
    Abstract: A method, computer program product and system are disclosed for determining the semantic density of textualized digital media (a measure of how much information is conveyed in a sentence or clause relative to its length). The more semantically dense text is, the more information it conveys in a given space. Users input a topic, a timeline, and one or more target web media sources for analysis. Text in the target media sources is deconstructed to determine density, and a density rating assigned to the web media source. Over time, users can track trends in the density of text media relative to a given topic, and determine how much information is being conveyed in connection with the topic, such as a political campaign. Line graphs, pie charts, and other time-elapsed output graphic representations of the semantic density are generated and rendered for the user.
    Type: Grant
    Filed: December 9, 2011
    Date of Patent: November 4, 2014
    Inventor: Igor Iofinov
  • Patent number: 8874427
    Abstract: Methods, systems and program products are disclosed for determining the matching level of a text lookup segment with a plurality of source texts in a translation memory in terms of context. The invention determines exact matches for the lookup segment in the plurality of source texts, and determines, in the case that at least one exact match is determined, that a respective exact match is an in-context exact match for the lookup segment in the case that a context of the lookup segment matches that of the respective exact match. The degree of context matching required can be predetermined, and results prioritized. The invention also includes methods, systems and program products for storing a translation pair of source text and target text in a translation memory including context, and the translation memory so formed. The invention ensures that content is translated the same as previously translated content and reduces translator intervention.
    Type: Grant
    Filed: July 1, 2011
    Date of Patent: October 28, 2014
    Assignee: SDL Enterprise Technologies, Inc.
    Inventors: Russ Ross, Kevin Gillespie
  • Patent number: 8855995
    Abstract: Systems, methods, and apparatuses including computer program products for machine translation. A method is provided that includes generating a plurality of machine translation systems using a single machine translation engine, and generating a consensus translation from a plurality of candidate translations for a source sentence, where each candidate translation of the plurality of candidate translations is an output of a respective machine translation system of the plurality of machine translation systems.
    Type: Grant
    Filed: December 3, 2012
    Date of Patent: October 7, 2014
    Assignee: Google Inc.
    Inventors: Wolfgang Macherey, Franz Josef Och
  • Patent number: 8849665
    Abstract: A machine translation method, system for using the method, and computer readable media are disclosed. The method includes the steps of receiving a source language sentence and selecting a set of target language n-grams using a lexical classifier and based on the source language sentence. When selecting the set of target language n-grams, in at least one n-gram, n is greater than 1. The method continues by combining the selected set of target language n-grams as a finite state acceptor (FSA), weighting the FSA with data from the lexical classifier, and generating an n-best list of target sentences from the FSA. As an alternative to using the FSA, N strings may be generated from the n-grams and ranked using a language model. The N strings may be represented by an FSA for efficiency, but this is not necessary.
    Type: Grant
    Filed: January 30, 2008
    Date of Patent: September 30, 2014
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Srinivas Bangalore, Emil Ettelaie
  • Patent number: 8843360
    Abstract: Disclosed are various embodiments for client-side internationalization of network pages. A network page and code that localizes the network page are obtained from a server. The code that localizes the network page is executed in a client and determines a locale associated with the client. One or more internationalized elements are identified in the network page. The internationalized elements are replaced with corresponding localized translations. The network page is rendered for display in the client after the network page has been localized.
    Type: Grant
    Filed: March 4, 2011
    Date of Patent: September 23, 2014
    Assignee: Amazon Technologies, Inc.
    Inventors: Simon K. Johnston, Margaux Eng, James K. Keiger, Gideon Shavit
  • Patent number: 8831945
    Abstract: A text in a corpus including a set of world wide web (web) pages is analyzed. At least one word appropriate for a document type set according to a voice recognition target is extracted based on an analysis result. A word set is generated from the extracted at least one word. A retrieval engine is caused to perform a retrieval process using the generated word set as a retrieval query of the retrieval engine on the Internet, and a link to a web page from the retrieval result is acquired. A language model for voice recognition is generated from the acquired web page.
    Type: Grant
    Filed: October 12, 2011
    Date of Patent: September 9, 2014
    Assignee: NEC Informatec Systems, Ltd.
    Inventors: Kazuhiro Arai, Tadashi Emori
  • Patent number: 8831929
    Abstract: Methods, systems, and apparatus, including computer program products, in which an input method editor receives composition inputs and determines language context values based on the composition inputs. Candidate selections based on the language context values and the composition inputs are identified.
    Type: Grant
    Filed: September 23, 2013
    Date of Patent: September 9, 2014
    Assignee: Google Inc.
    Inventor: Feng Hong
  • Patent number: 8825466
    Abstract: Systems and methods for automatically modifying an annotated bilingual segment pair are provided. An annotated bilingual segment pair (“Pair”) may be modified to generate improved translation rules used in machine translation of documents from a source language to a target language. Because a single Pair may be used to translate a phrase, many Pairs are used in a machine translation system and manual correction of each model is impractical. Each Pair may be modified by re-labeling syntactic categories within the Pair, re-structuring a tree within the Pair, and/or re-aligning source words to target words within the Pair. In exemplary embodiments, many alternate Pairs (or portions thereof) are generated automatically, rule sequences corresponding to each are derived, and one or more rule sequences are selected. Using the selected rule sequence, a modified Pair is distilled.
    Type: Grant
    Filed: June 8, 2007
    Date of Patent: September 2, 2014
    Assignees: Language Weaver, Inc., University of Southern California
    Inventors: Wei Wang, Jonathan May, Kevin Knight
  • Patent number: 8825470
    Abstract: A system and method of providing a response with different language options for a data communication protocol, such as Session Initiation Protocol, are disclosed. For example, data communication is controlled between at least two endpoints. A response code indicative of a condition of the data communication is transmitted to one of the at least two endpoints. The response code is associated with a reason phrase operable to be displayed at the one of the at least two endpoints in a language selected from an option of a plurality of languages.
    Type: Grant
    Filed: September 27, 2007
    Date of Patent: September 2, 2014
    Assignee: Siemens Enterprise Communications Inc.
    Inventors: Mallikarjuna Samayamantry Rao, Dennis Kucmerowski
  • Patent number: 8818790
    Abstract: A syntactic analysis and hierarchical phrase model based machine translation system and method are provided. The machine translation system includes a word alignment module, a phrase extraction module, a gender syntactic annotation module, a syntactic based noncontiguous phrase abstract module, a noncontiguous phrase based translation module and an evaluation module. The machine translation system and method performs syntactic analysis based on a common contiguous phrase based machine translation model, and extracts a syntactic based noncontiguous phrase rule set from a bilingual aligned text, so as to address an issue of noncontiguous fixed custom in context of a whole sentence, and to comply with syntactic features of a language, translation is performed based on the noncontiguous phrase rule set and the phrase aligned table, and the translation results are evaluated based on the evaluation model, thereby improving the translation result.
    Type: Grant
    Filed: April 4, 2011
    Date of Patent: August 26, 2014
    Assignees: Samsung Electronics Co., Ltd., Samsung Electronics (China) R&D Center
    Inventors: Liang He, Zhangliang Xiong, Lei Wan
  • Patent number: 8818792
    Abstract: An apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus. Each of the apparatus or method recognizes a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpus, extracts a translation pattern candidate and an occurrence frequency using the recognized predicate and argument, and then generates a basic verb phrase translation pattern by verifying the translation pattern candidate, and generalizes the generated basic verb phrase translation pattern to generate a general verb phrase pattern so as to be applied to various language pairs and minimize an error in the verb phrase translation pattern and determine an appropriate generalization level using a co-occurrence frequency of a predicate and an argument of the verb phrase translation pattern and a translation probability of the predicate.
    Type: Grant
    Filed: July 8, 2011
    Date of Patent: August 26, 2014
    Assignee: SK Planet Co., Ltd.
    Inventors: Young Sook Hwang, Sang-Bum Kim, Chang Hao Yin, Hae Chang Rim, Joo Young Lee
  • Patent number: 8812291
    Abstract: Systems, methods, and computer program products for machine translation are provided. In some implementations a system is provided. The system includes a language model including a collection of n-grams from a corpus, each n-gram having a corresponding relative frequency in the corpus and an order n corresponding to a number of tokens in the n-gram, each n-gram corresponding to a backoff n-gram having an order of n−1 and a collection of backoff scores, each backoff score associated with an n-gram, the backoff score determined as a function of a backoff factor and a relative frequency of a corresponding backoff n-gram in the corpus.
    Type: Grant
    Filed: December 10, 2012
    Date of Patent: August 19, 2014
    Assignee: Google Inc.
    Inventors: Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean
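    The backoff scoring described in the abstract of 8812291 can be illustrated with a minimal sketch. This is one plausible reading, not the patented implementation: an n-gram seen in the corpus is scored by its relative frequency, and an unseen n-gram is scored by the relative frequency of its order n−1 backoff n-gram multiplied by a backoff factor. The function names, the in-memory Counter, and the default factor of 0.4 are assumptions made for illustration.

```python
from collections import Counter


def count_ngrams(tokens, max_order):
    """Count every n-gram of order 1..max_order in a token list."""
    counts = Counter()
    for n in range(1, max_order + 1):
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def backoff_score(ngram, counts, total_tokens, backoff_factor=0.4):
    """Score the last token of `ngram` given the tokens before it.

    Assumed scheme: use the relative frequency count(ngram) / count(context)
    when the n-gram was observed; otherwise drop the first token (the
    order n-1 backoff n-gram) and multiply by `backoff_factor`.
    The value 0.4 is a hypothetical default, not taken from the listing.
    """
    ngram = tuple(ngram)
    penalty = 1.0
    while len(ngram) > 1:
        if counts[ngram] > 0:
            # The context is a prefix of an observed n-gram, so its count is > 0.
            return penalty * counts[ngram] / counts[ngram[:-1]]
        ngram = ngram[1:]          # back off to the shorter n-gram
        penalty *= backoff_factor  # one factor per backoff step
    # Unigram case: relative frequency over the whole corpus.
    return penalty * counts[ngram] / total_tokens
```

    With the token list "the cat sat on the mat", the bigram ("the", "cat") is scored directly from counts, while the unseen bigram ("sat", "the") falls back to the unigram ("the",) scaled by the backoff factor. The resulting scores are not normalized probabilities.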
  • Patent number: 8812301
    Abstract: A system and method for natural language processing of queries are provided. A lexicon includes text elements that are recognized as being a proper noun when capitalized. A natural language query includes a sequence of text elements including words. The query is processed. The processing includes a preprocessing step, in which part of speech features are assigned to the text elements in the query. This includes identifying, from a lexicon, a text element in the query which starts with a lowercase letter and assigning recapitalization information to the text element in the query, based on the lexicon. This information includes a part of speech feature of the capitalized form of the text element. Then parts of speech for the text elements in the query are disambiguated, which includes applying rules for recapitalizing text elements based on the recapitalization information.
    Type: Grant
    Filed: September 26, 2011
    Date of Patent: August 19, 2014
    Assignee: Xerox Corporation
    Inventors: Caroline Brun, Vassilina Nikoulina, Nikolaos Lagos
  • Patent number: 8805677
    Abstract: Creating and processing a natural language grammar set of data based on an input text string are disclosed. The method may include tagging the input text string, and examining, via a processor, the input text string for at least one first set of substitutions based on content of the input text string. The method may also include determining whether the input text string is a substring of a previously tagged input text string by comparing the input text string to a previously tagged input text string, such that the substring determination operation determines whether the input text string is wholly included in the previously tagged input text string.
    Type: Grant
    Filed: February 4, 2014
    Date of Patent: August 12, 2014
    Assignee: West Corporation
    Inventor: Steven John Schanbacher
  • Patent number: 8806326
    Abstract: A method and apparatus for user selectable content linking in internet applications. The usability of downloaded content may be modified for specific users by allowing a user to select the content in which and how their content data is linked. Particular text may be selected, and when content is downloaded from an origin service including the particular text, the text may be automatically linked to a service or database. The service or database is independent of the origin service and may be provided by a third party.
    Type: Grant
    Filed: September 25, 2000
    Date of Patent: August 12, 2014
    Assignee: Nokia Inc.
    Inventor: Mika Leppinen
  • Patent number: 8798986
    Abstract: Portable, real time voice translation systems, and associated methods of use, are provided. The systems include a translation system for use on a single unit, portable computing device and operable for accessing a multilanguage database, selecting a source language from a plurality of source languages and a destination language from a plurality of destination languages, inputting a source phrase, transmitting the source phrase to a speech recognition module, a translation engine, and a template look-up engine for finding the phrase template associated with the destination phrase from among the multiple languages. The spoken translation is then output in the selected destination language. The translation system has a total time between the input of the source phrase and output of the destination phrase that is no slower than 0.010 seconds, and a communications interface operable for communicating with a second computer system.
    Type: Grant
    Filed: December 26, 2012
    Date of Patent: August 5, 2014
    Assignee: NewTalk, Inc.
    Inventors: Robert H. Clemons, Bruce W. Nash, Martha P. Robinson, Craig A. Robinson
  • Patent number: 8788262
    Abstract: Embodiments of the invention provide techniques for incorporating variable values into textual content. In one embodiment, an abstract phrase including a text phrase and a variable at a particular position in the text phrase is received. The abstract phrase may include multiple variables. A text value for the variable is received. The text phrase of the abstract phrase is combined with the text value according to the particular position of the variable. An integration rule is applied at a boundary of the text phrase of the abstract phrase and the text value, where the integration rule is based on a language rule. The integration rule modifies a portion of the text phrase of the abstract phrase or a portion of the text value to produce an integrated phrase.
    Type: Grant
    Filed: February 1, 2013
    Date of Patent: July 22, 2014
    Assignee: Facebook, Inc.
    Inventor: David C. Ellis
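    As a rough illustration of the integration step in 8788262, the sketch below combines a text phrase with a value at the variable's position and then applies one rule at the boundary. The rule chosen here (English "a"/"an" agreement), the `{name}` placeholder syntax, and the function name are all assumptions for the example; the abstract does not say which language rules are used.

```python
def integrate(text_phrase, variable, value):
    """Insert `value` at the `{variable}` position and fix the left boundary.

    Hypothetical integration rule: if the word just before the variable is
    the article "a" or "an", rewrite it to agree with the first letter of
    the value. This is a crude orthographic check (it gets "hour" and
    "university" wrong) and stands in for a real language rule.
    """
    placeholder = "{" + variable + "}"
    before, found, after = text_phrase.partition(placeholder)
    if not found:
        raise ValueError(f"variable {variable!r} not found in phrase")

    starts_with_vowel = value[:1].lower() in "aeiou"
    # Split off the last word of the text preceding the variable.
    head, space, article = before.rstrip(" ").rpartition(" ")
    if before.endswith(" ") and article.lower() in ("a", "an"):
        fixed = "an" if starts_with_vowel else "a"
        if article[0].isupper():
            fixed = fixed.capitalize()
        before = head + space + fixed + " "
    return before + value + after
```

    For example, integrating the value "apple" into "You received a {item}." modifies the text phrase at the boundary so that the article becomes "an", while the value "book" leaves the phrase unchanged.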
  • Patent number: 8788260
    Abstract: Systems, methods, and computer storage media having computer-executable instructions embodied thereon that facilitate generation of snippets. In embodiments, text features within a keyword-sentence window are identified. The text features are utilized to determine break features that indicate favorability of breaking at a particular location of the keyword-sentence window. The break features are used to recognize features of partial snippets such that a snippet score to indicate the strength of the partial snippet can be calculated. Snippet scores associated with partial snippets are compared to select an optimal snippet, that is, the snippet having the highest snippet score.
    Type: Grant
    Filed: May 11, 2010
    Date of Patent: July 22, 2014
    Assignee: Microsoft Corporation
    Inventors: Valerie Rose Nygaard, Riccardo Turchetto, Joanna Mun Yee Chan, Christian Biemann, David Dongjah Ahn, Andrea Ryerson Burbank, Feng Pan, Timothy McDonnell Converse, James Michael Reinhold, Tracy Holloway King
  • Patent number: 8781810
    Abstract: A method and a system for making merging decisions for a translation are disclosed which are suited to use where the target language is a productive compounding one. The method includes outputting decisions on merging of pairs of words in a translated text string with a merging system. The merging system can include a set of stored heuristics and/or a merging model. In the case of heuristics, these can include a heuristic by which two consecutive words in the string are considered for merging if the first word of the two consecutive words is recognized as a compound modifier and their observed frequency f1 as a closed compound word is larger than an observed frequency f2 of the two consecutive words as a bigram. In the case of a merging model, it can be one that is trained on features associated with pairs of consecutive tokens of text strings in a training set and predetermined merging decisions for the pairs.
    Type: Grant
    Filed: July 25, 2011
    Date of Patent: July 15, 2014
    Assignee: Xerox Corporation
    Inventors: Nicola Cancedda, Sara Stymne
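    The frequency heuristic named in the abstract of 8781810 can be sketched as follows: two consecutive words are merged when the first is a known compound modifier and the closed compound is observed more often (f1) than the two words as a bigram (f2). The function name, the set of modifiers, and the frequency dictionaries are hypothetical inputs for illustration, and the sketch only handles two-part compounds in a single left-to-right pass.

```python
def merge_compounds(tokens, modifiers, unigram_freq, bigram_freq):
    """Apply the f1 > f2 merging heuristic to a translated token list.

    modifiers:    set of words recognized as compound modifiers (assumed given)
    unigram_freq: observed frequency of single words, including closed compounds
    bigram_freq:  observed frequency of (word, next_word) pairs
    """
    merged_tokens = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens):
            first, second = tokens[i], tokens[i + 1]
            closed = first + second
            f1 = unigram_freq.get(closed, 0)           # frequency as a closed compound
            f2 = bigram_freq.get((first, second), 0)   # frequency as two separate words
            if first in modifiers and f1 > f2:
                merged_tokens.append(closed)
                i += 2
                continue
        merged_tokens.append(tokens[i])
        i += 1
    return merged_tokens
```

    With the Swedish tokens "jag spelar fot boll", the modifier "fot", and "fotboll" more frequent than the bigram ("fot", "boll"), the last two tokens are merged into "fotboll".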
  • Patent number: 8775155
    Abstract: A system and method for machine translation are disclosed. Source sentences are received. For each source sentence, a target sentence comprising target words is generated. A plurality of translation neighbors of the target sentence is generated. Phrase alignments are computed between the source sentence and the translation neighbor. Translation neighbors are scored with a translation scoring model, based on the phrase alignment. Translation neighbors are ranked, based on the scores. In training the model, parameters of the model are updated based on an external ranking of the ranked translation neighbors. The generating of translation neighbors, scoring, ranking, and, in the case of training, updating the parameters, are iterated with one of the translation neighbors as the target sentence. In the case of decoding, one of the translation neighbors is output as a translation. The system and method may be at least partially implemented with a computer processor.
    Type: Grant
    Filed: October 25, 2010
    Date of Patent: July 8, 2014
    Assignee: Xerox Corporation
    Inventors: Benjamin Roth, Andrew R. McCallum, Marc Dymetman, Nicola Cancedda
  • Patent number: 8762128
    Abstract: A translation system receives a test pair that includes a source test phrase in a first language and a target test phrase in a second language. The test pair can be evaluated by comparing its components with phrases in primary pairs. The test source phrase can be compared to a primary source phrase that is the phrase most commonly translated by the machine translation system into the test target phrase. The test target phrase can be compared to a primary target phrase that is the phrase into which the target source phrase is most often translated. If one and/or both comparisons are sufficiently dissimilar, the machine translation system can be modified by deleting the test pair, by flagging it for human review, or in other ways.
    Type: Grant
    Filed: May 31, 2011
    Date of Patent: June 24, 2014
    Assignee: Google Inc.
    Inventors: Thorsten Brants, Jan Pfeifer
  • Patent number: 8756050
    Abstract: Techniques, including systems and methods, for providing translated content are described and suggested herein. In an embodiment, translations for instances of content are received and reviewers provide votes regarding the translations' quality. Votes received for translations are utilized in order to determine scores for the translations. Reviewers and translators may be scored as well. Scores for translations are used in various processes for providing appropriate content to users.
    Type: Grant
    Filed: September 14, 2010
    Date of Patent: June 17, 2014
    Assignee: Amazon Technologies, Inc.
    Inventors: Roy N. Harkness, Bryan K. Beatty, Michael G. Curtis, Alexander Slutsker, Kelly L. Duggar