Storage Or Retrieval Of Data Patents (Class 704/7)
  • Patent number: 8626489
    Abstract: A data processing method and apparatus that may set emotion based on development of a story are provided. The method and apparatus may set emotion without inputting emotion for each sentence of text data. Emotion setting information is generated based on development of the story and the like, and may be applied to the text data.
    Type: Grant
    Filed: April 5, 2010
    Date of Patent: January 7, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Dong Yeol Lee, Seung Seop Park, Jae Hyun Ahn
  • Patent number: 8626486
    Abstract: Methods, systems, and apparatus, including computer program products, for correcting spelling in text. A text input is received for translation. One or more suspect words in the text input are identified. For each suspect word, one or more candidate words are identified. A score for the text input and scores for each of one or more candidate inputs are determined, where each candidate input is the text input with one or more of the suspect words each replaced by a respective candidate word. If any, a candidate input whose score is highest among the scores for the candidate inputs and is greater than the text input score by at least a threshold is selected. Otherwise, the text input is selected. A translation of a selected candidate input or the selected text input is provided as the translation of the text input.
    Type: Grant
    Filed: September 5, 2007
    Date of Patent: January 7, 2014
    Assignee: Google Inc.
    Inventors: Franz J. Och, Dmitriy Genzel
  • Patent number: 8626488
    Abstract: Systems, methods, and computer program products are provided for statistical machine translation. In some implementations a method is provided. The method includes receiving multi-lingual parallel text associating a source language, a target language, and one or more bridge languages, determining an alignment between the source language and the target language using a first bridge language that is distinct from the source language and the target language, and using the determined alignment to generate a candidate translation of an input text in the source language to the target language.
    Type: Grant
    Filed: April 6, 2012
    Date of Patent: January 7, 2014
    Assignee: Google Inc
    Inventors: Shankar Kumar, Franz Josef Och, Wolfgang Macherey
  • Patent number: 8620793
    Abstract: A network clearinghouse may be provided that brings together organizations (subjects) requiring outsourcing of a service and service providers (operators). The clearinghouse manages the bidding and awarding of contracts, by collecting and authorizing requests for proposals (RFPs), sending bid invitations to operators that meet the requirements of the subject, sending a notification that the contract has been awarded, and collecting payment from the subject and paying the operator.
    Type: Grant
    Filed: June 1, 2010
    Date of Patent: December 31, 2013
    Assignee: SDL International America Incorporated
    Inventors: Iko Knyphausen, Jochen Hummel
  • Patent number: 8612205
    Abstract: A system and method for generating word alignments from pairs of aligned text strings are provided. A corpus of text strings provides pairs of text strings, primarily sentences, in source and target languages. A first alignment between a text string pair creates links therebetween. Each link links a single token of the first text string to a single token of the second text string. A second alignment also creates links between the text string pair. In some cases, these links may correspond to bi-phrases. A modified first alignment is generated by selectively modifying links in the first alignment which include a word which is infrequent in the corpus, based on links generated in the second alignment. This results in removing at least some of the links for the infrequent words, allowing more compact and better quality bi-phrases, with higher vocabulary coverage, to be extracted for use in a machine translation system.
    Type: Grant
    Filed: June 14, 2010
    Date of Patent: December 17, 2013
    Assignee: Xerox Corporation
    Inventors: Gregory Alan Hanneman, Nicola Cancedda, Marc Dymetman
  • Patent number: 8612207
    Abstract: Language analysis means 21 analyzes texts read from a text DB 11, and generates a sentence structure as the analysis result. Similar-structure generation adjustment means 25 generates, from an input of an input device, a determination item for determining whether or not the structures are identical every type of differences between the sentence structures. Similar-structure determination adjustment means 26 generates, from an input of the input device 6, a determination item for determining whether or not the difference between attribute values is ignored every type of attribute values. Similar-structure generating means 22 generates a similar structure of a partial structure forming the sentence structure obtained by language analysis means 21 in accordance with the determination item from the similar-structure generation adjustment means 25, and sets the generated similar structure as an equivalent class of the partial structure on the generation source.
    Type: Grant
    Filed: March 17, 2005
    Date of Patent: December 17, 2013
    Assignee: NEC Corporation
    Inventors: Yousuke Sakao, Kenji Satoh, Susumu Akamine
  • Patent number: 8606559
    Abstract: A method for automatically detecting errors in machine translation using a parallel corpus includes analyzing morphemes of a target language sentence in the parallel corpus and a machine-translated target language sentence, corresponding to a source language sentence, to classify the morphemes into words; aligning by words and decoding, respectively, a group of the source language sentence and the machine-translated target language sentence, and a group of the source language sentence and the target language sentence in the parallel corpus; classifying by types errors in the machine-translated target language sentence by making a comparison, word by word, between the decoded target language sentence in the parallel corpus and the decoded machine-translated target language sentence; and computing error information in the machine-translated target language sentence by examining a frequency of occurrence of the classified error types.
    Type: Grant
    Filed: June 26, 2009
    Date of Patent: December 10, 2013
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Yun Jin, Oh Woog Kwon, Ying Shun Wu, Changhao Yin, Sung Kwon Choi, Chang Hyun Kim, Seong Il Yang, Ki Young Lee, Yoon Hyung Roh, Young Ao Seo, Eun Jin Park, Young Kii Kim, Sang Kyu Park
  • Patent number: 8600736
    Abstract: A method of operating a computer to perform linguistic analysis includes the steps of splitting an input text into words and sentences; for each sentence, comparing phrases in the sentence with known phrases stored in a database, as follows: for each word in the sentence, comparing its value and values of words following it with values of words of stored phrases, starting with the longest stored phrase that starts with that word, and working from longest to shortest; in the event a match is found for two or more consecutive words, and considering the words around the phrase, labelling the matched phrase with an overphrase that describes the grammar use of the matched phrase; after the penultimate word has been compared, recasting the sentence by replacing the matched phrases by their respective overphrases; and then repeating the comparison process with the recast sentence until there is no further recasting.
    Type: Grant
    Filed: December 21, 2007
    Date of Patent: December 3, 2013
    Assignee: Thinking Solutions Pty Ltd
    Inventor: John Ball
  • Patent number: 8600930
    Abstract: Provided is a conversation assistance device which enables a user to easily search for a desired content. The conversation assistance device includes an input unit, a memory unit, a processing unit, and an output unit. A template database stored in the memory unit includes a plurality of templates. Each of the templates associates category sentences in a plurality of languages with a keyword. A keyword is specified by one keyword expression and one or more character inputs (keyword reading). When the input unit receives any of the plurality of character inputs, an example sentence selection unit retrieves one of the templates having a keyword corresponding to the input character.
    Type: Grant
    Filed: July 23, 2009
    Date of Patent: December 3, 2013
    Assignee: Sharp Kabushiki Kaisha
    Inventors: Ichiko Sata, Norihide Iida
  • Patent number: 8594994
    Abstract: Systems and methods are disclosed for searching across multi-lingual information. A user makes a query in a first language, and a group of documents that were previously machine-translated into the first language are searched for information responsive to the query. Contextual information derived can be used to improve the accuracy of the machine translation. Responsive documents are returned to the user. Alternatively, a query provided in a user's language may be translated into one or more other languages. Documents written in these languages can then be searched for information responsive to the appropriate translated query. Responsive documents can be translated into the user's language prior to providing them to the user.
    Type: Grant
    Filed: December 12, 2011
    Date of Patent: November 26, 2013
    Assignee: Google Inc.
    Inventor: Jeffrey A. Dean
  • Patent number: 8589165
    Abstract: The present disclosure provides method and system for converting a free text expression of an identity to a phonetic equivalent code. The conversion follows a set of rules based on phonetic groupings and compresses the expression to a shorter series of characters than the expression. The phonetic equivalent code may be compared to one or more other phonetic equivalent code to establish a correlation between the codes. The phonetic equivalent code of the free text expression may be associated with the code of a known identity. The known identity may be provided to a user for confirmation of the identity. Further, a plurality of expressions stored in a database may be consolidated by converting the expressions to phonetic equivalent codes, comparing the codes to find correlations, and if appropriate reducing the number of expressions or mapping the expressions to a fewer number of expressions.
    Type: Grant
    Filed: January 24, 2012
    Date of Patent: November 19, 2013
    Assignee: United Services Automobile Association (USAA)
    Inventors: Gregory Brian Meyer, James Elden Nicholson
  • Patent number: 8589144
    Abstract: Provided herein is a character processing device that converts an input character formed of an input alphanumeric or symbol to an extended Latin character similar to the input character including: a display unit displaying as an editing character the input character with a cursor attached thereto; a conversion target distinction unit discerning whether or not the editing character is convertible to the extended Latin character; and a notification unit indicating that the editing character is convertible to the extended Latin character when the editing character is regarded as convertible.
    Type: Grant
    Filed: September 15, 2008
    Date of Patent: November 19, 2013
    Assignee: Seiko Epson Corporation
    Inventor: Hiroyasu Kurashina
  • Patent number: 8589145
    Abstract: A method for enabling input into a handheld electronic device having at least three selectable languages available thereon includes detecting a predetermined input a number of times and switching a selected language between one of the three selectable languages and another of the three selectable languages wherein the another language is an immediately preceding selected language.
    Type: Grant
    Filed: April 26, 2012
    Date of Patent: November 19, 2013
    Assignee: BlackBerry Limited
    Inventors: Vadim Fux, Carlo Chiarello, Andrew D. Bocking, Harry R. Major
  • Patent number: 8589146
    Abstract: A method, a system and a machine-readable medium are provided for an on demand translation service. A translation module including at least one language pair module for translating a source language to a target language may be made available for use by a subscriber. The subscriber may be charged a fee for use of the requested on demand translation service or may be provided use of the on demand translation service for free in exchange for displaying commercial messages to the subscriber. A video signal may be received including information in the source language, which may be obtained as text from the video signal and may be translated from the source language to the target language by use of the translation module. Translated information, based on the translated text, may be added into the received video signal. The video signal including the translated information in the target language may be sent to a display device.
    Type: Grant
    Filed: May 3, 2010
    Date of Patent: November 19, 2013
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Srinivas Bangalore, David Crawford Gibbon, Mazin Gilbert, Patrick Guy Haffner, Zhu Liu, Behzad Shahraray
  • Patent number: 8583440
    Abstract: An apparatus and method for providing visual indication of character ambiguity and ensuing reduction of such ambiguity during text entry are described. An application text entry field is presented in a display screen, into which the user enters text by means of a reduced keyboard and a disambiguating system. The default or most likely word construct for the current key sequence may be presented at the insertion point of the text entry field. An indication of ambiguity is presented in the display screen to communicate to the user the possible ambiguous characters associated with each key. A word choice list field may also be present to display at least one word construct matching the current key sequence.
    Type: Grant
    Filed: August 26, 2005
    Date of Patent: November 12, 2013
    Assignee: Tegic Communications, Inc.
    Inventors: James Stephanick, Ethan R. Bradford, Pim Van Meurs, Richard Eyraud, Michael R. Longé
  • Patent number: 8577669
    Abstract: Some embodiments of an efficient string search have been presented. In one embodiment, a string of bytes representing content written in a non-delimited language is received, wherein the content has been classified into a predetermined category. In a single pass through the string of bytes, a set of N-grams is searched for simultaneously. Statistical information on occurrences of the N-grams, if any, in the string of bytes is collected. In some embodiments, a model is generated based on the statistical information, where the model is usable by a content filter to classify content.
    Type: Grant
    Filed: December 22, 2011
    Date of Patent: November 5, 2013
    Assignee: SonicWALL, Inc.
    Inventors: Thomas E. Raffill, Shunhui Zhu, Roman Yanovsky, Boris Yanovsky, John Gmuender
  • Patent number: 8570558
    Abstract: An image processing apparatus includes: a data memory that stores in itself voice guide data pieces; a voice output portion that outputs the voice guide data pieces stored in the data memory; and a controller. The controller prohibits a first voice guide data piece from being outputted according to a job by the voice output portion, while a second voice guide data piece is being outputted according to another job thereby.
    Type: Grant
    Filed: April 2, 2009
    Date of Patent: October 29, 2013
    Assignee: Konica Minolta Business Technologies, Inc.
    Inventors: Toshimichi Iwai, Takeshi Morikawa, Kaitaku Ozawa, Kei Shigehisa
  • Patent number: 8572379
    Abstract: A server and a client mutually exclusively execute server-side and client-side commutative cryptographic processes and server-side and client-side commutative permutation processes. The server has access to a hash table, while the client does not. The server and client perform a method including: encrypting and reordering the hash table using the server; communicating the encrypted and reordered hash table to the client; further encrypting and further reordering the hash table using the client; communicating the further encrypted and further reordered hash table back to the server; and partially decrypting and partially undoing the reordering using the server to generate a double-blind hash table. To read an entry, the client hashes and permute an index key and communicates same to the server which retrieves an item from the double-blind hash table using the hashed and permuted index key and sends it back to the client which decrypts the retrieved item.
    Type: Grant
    Filed: August 8, 2011
    Date of Patent: October 29, 2013
    Assignee: Xerox Corporation
    Inventor: Nicola Cancedda
  • Publication number: 20130282360
    Abstract: A system providing a mobile foreign language database and associated equipment. The system enables real time or near real time searching for foreign language business and real time or near real time learning of foreign languages.
    Type: Application
    Filed: April 19, 2013
    Publication date: October 24, 2013
    Inventors: James A. Shimota, Lawrence Lien, Kenneth H. Bridges
  • Patent number: 8566077
    Abstract: A digital sign language translator has a case configured to be supported by a hand of a user, a touch screen display located on a face of the case, a microprocessor for selectively translating words, letters, and numbers into video clips of an actual person performing a sign language translation. The translator has an internal memory device for storing a standard database selected words, letters, and numbers and the corresponding video clip of an actual person performing a sign language translation of words, letters, and numbers. The translator further includes a memory card slot for receiving an external memory card, the external memory card having an expanded vocabulary to supplement the standard database contained on the internal memory. The translator further includes a battery for powering the translator and a keyboard selectively shown on the touch screen display.
    Type: Grant
    Filed: July 26, 2010
    Date of Patent: October 22, 2013
    Inventors: Barbara Ander, Sidney Ander
  • Patent number: 8560299
    Abstract: A first computer system sends a request to a second computer system. The second computer system determines that the first computer system utilizes a message catalog file that is not installed on the second computer system. As a result, the second computer system sends a catalog request that requests the message catalog file. The second computer system receives the message catalog file and sends a response message from the second computer system to the first computer system using the received message catalog file.
    Type: Grant
    Filed: April 29, 2010
    Date of Patent: October 15, 2013
    Assignee: International Business Machines Corporation
    Inventors: Su Liu, George F. Ramsay, III
  • Publication number: 20130268259
    Abstract: Disclosed herein are a translation apparatus and a translation method. The translation apparatus includes: a speech input unit that receives a speech of a first language from a user; a control unit that generates sentences to be translated of the first language from the speech of the first language input from the speech input unit; a communication unit that transmits the sentences to be translated of the first language to a translation server and receives the sentences to be translated of a second language from the translation server; a display unit that displays the translated sentences of the second language along with previously translated sentences; a memory that stores a translation history including the sentences to be translated of the first language and the translated sentences of the second language; and a user input unit that receives an operation input of the previously translated sentences from a user.
    Type: Application
    Filed: January 24, 2013
    Publication date: October 10, 2013
    Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventor: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
  • Patent number: 8554558
    Abstract: An automated speech processing method, system and computer program product are disclosed. In one embodiment, a speech-to-text (STT) engine is used for converting an audio input to text data in a source language, and a machine translation (MT) engine is used for translating this text data to text data in a target language. In this embodiment, the text data in the target language is rendered on a display device, and different visualization schemes are applied to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine. In one embodiment, the defined characteristics include a defined confidence value representing the accuracy of the rendered text. For example, this confidence value may be based on both the accuracy of the conversion of the audio input and the accuracy of the translation of the text data to the target language.
    Type: Grant
    Filed: July 12, 2010
    Date of Patent: October 8, 2013
    Assignee: Nuance Communications, Inc.
    Inventors: Jeffrey S. McCarley, Leiming R. Qian
  • Patent number: 8554542
    Abstract: A system and method are provided for processing an input document which enable assessment of the coherence of an abstract of the document. The method includes storing the document in memory and, for each sentence of the abstract, comparing the sentence with sentences of a main body of the document using textual entailment techniques to identify whether the sentence of the abstract entails a sentence in the main body of the document. Links can then be generated between the entailing sentences of the abstract and the corresponding entailed sentences of the document. The document and generated links are output. The links enable the coherence of the abstract to be evaluated, either manually or automatically, using an evaluation component of the system.
    Type: Grant
    Filed: May 5, 2010
    Date of Patent: October 8, 2013
    Assignee: Xerox Corporation
    Inventors: Ágnès Sandor, Guillaume Jacquet
  • Patent number: 8548792
    Abstract: Described are field of training devices and methods to simulate the use of interpreters and spontaneous verbal exchanges between participants who speak different languages. More particularly, the devices and methods simulate an environment wherein participants are communicating through a third-party interpreter, which simulates the use of an interpreter. Some embodiments of the devices include audio components that are configured in a manner that distorts direct verbal communication between two parties of a trilateral verbal exchange. Although the direct verbal communication is distorted, tonality and tempo of the communication can still be preserved. Further, visual information from the speaker is still conveyed.
    Type: Grant
    Filed: May 27, 2010
    Date of Patent: October 1, 2013
    Assignee: Dyncorp International LLC
    Inventor: James O. Pyle
  • Patent number: 8543383
    Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.
    Type: Grant
    Filed: October 28, 2011
    Date of Patent: September 24, 2013
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Mehryar Mohri, Mark-Jan Nederhof
  • Patent number: 8543373
    Abstract: A system for assisting a user who is learning a language to prioritize words to be learned in order of usage frequency is disclosed. A frequency determination program running on a computer determines the frequency of usage of each word at a list of locations provided by the user. Different algorithms to identify what constitutes a word are employed depending upon the language of the source data. The total number of words at each location and their usage frequency found during the user session, along with a total number of words and their usage frequency for all user sessions performed regardless of location, are calculated and made available to the user. The user can view usage frequencies for words from a single location, a group of locations, or all user sessions performed.
    Type: Grant
    Filed: November 13, 2008
    Date of Patent: September 24, 2013
    Assignee: International Business Machines Corporation
    Inventors: Yen-Fu Chen, John W. Dunsmoir
  • Patent number: 8543381
    Abstract: This invention is a method for “text morphing,” wherein text morphing involves integrating or blending together substantive content from two or more bodies of text into a single body of text based on locations of linguistic commonality among the two or more bodies of text. This method entails: identifying pairs of “Synonym-Different-Synonym” (SDS) text segments between an import body of text and an export body of text; and, for each selected pair of SDS text segments, substituting some or all of the SDS text segment from the export body of text for some or all of the SDS text segment in the import body of text. In some respects, this method is analogous to splicing and substituting gene segments with compatible starting and ending sequences, but different middle sequences. Text morphing as disclosed herein can be useful for creative ideation, product development, integrative search engines, and entertainment purposes.
    Type: Grant
    Filed: June 17, 2010
    Date of Patent: September 24, 2013
    Assignee: Holovisions LLC
    Inventor: Robert A. Connor
  • Patent number: 8538742
    Abstract: A system and method for translating a social feed is disclosed. The system comprises a communication module, a decoding engine and a re-encoding engine. The communication module receives social feed data and a request from a social network application. The social feed data is configured to cause a client to display a social feed in a first language. The request includes data indicating that the social feed should be displayed in a second language. The decoding engine decodes the social feed data to generate decoded social feed data. The re-encoding engine re-encodes the decoded social feed data to cause the client to display the social feed in the second language based at least in part on the request. The communication module sends the translated social feed data to the client.
    Type: Grant
    Filed: September 13, 2011
    Date of Patent: September 17, 2013
    Assignee: Google Inc.
    Inventors: Christopher R. Wren, Nadav Aharony
  • Patent number: 8532978
    Abstract: A method and system for enabling non-programmers to create certifiable Extensible Access Control Markup Language (XACML) policies. The graphical user interface (GUI) presents a form to the security policy author using a natural language such as English, as specified by a context-free grammar. The compiler software translates the GUI's filled-in form—representing a human-readable natural language policy—into XACML code. A reverse compiler or de-compiler provides a certification of the XACML code to render the original policy in a natural language format such as English. Optionally, a tokenized intermediate form, a set of policy-specific data sets and a graph theory-based intermediate representation can be configured. Logic checks and code validation checks can be also preferably configured. Apparatus and medium claims are also provided.
    Type: Grant
    Filed: October 31, 2008
    Date of Patent: September 10, 2013
    Assignee: AFRL/RIJ
    Inventor: Ronald C Turner
  • Patent number: 8532674
    Abstract: A method of operating a vehicle telematics unit includes determining the location of a vehicle equipped with a vehicle telematics unit; determining if telematics dialing software operated by the vehicle telematics unit includes a verbal dialing protocol used at the determined vehicle location; if not, identifying one or more verbal dialing protocols used at the determined location of the vehicle; requesting telematics dialing software that includes the one or more identified verbal dialing protocols; receiving the requested telematics dialing software from a central facility; and storing the received telematics dialing software at the vehicle.
    Type: Grant
    Filed: December 10, 2010
    Date of Patent: September 10, 2013
    Assignee: General Motors LLC
    Inventors: Uma Arun, Rathinavelu Chengalvarayan, Kevin R. Krause, Eray Yasan, Gaurav Talwar, Xufang Zhao, Michael A. Wuergler
  • Patent number: 8527518
    Abstract: A search query for a collection of electronic documents is parsed to identify one or more terms and such identified terms are associated with one or more languages (i.e., spoken languages such as English, German, Spanish, etc.). A terms inverted index and a language inverted index are accessed to identify documents responsive to the query. Related apparatus, systems, techniques and articles are also described.
    Type: Grant
    Filed: December 16, 2010
    Date of Patent: September 3, 2013
    Assignee: SAP AG
    Inventors: Frederik Transier, Holger Schwedes, Wolfgang Stephan, Thomas Peh
  • Patent number: 8527259
    Abstract: Methods and apparatus, including computer program products, implementing and using techniques for translating digital content from a source language to a target language. A message is displayed to a user. The message contains digital content to be translated from the source language to the target language, as well as the context of the digital content in the source language and/or a reference to a context in which the digital content occurs. A proposed translation of the digital content into the target language is received from the user. The proposed translation is submitted to a translation server.
    Type: Grant
    Filed: February 28, 2011
    Date of Patent: September 3, 2013
    Assignee: Google Inc.
    Inventors: Przemyslaw Broniek, Joanna Chwastowska, Brendan Clavin, Dawid Duda, Terence Haddock, Marcin Mikosik, Maciej Molerus, Michal Pociecha-Los, Jan Wrobel
  • Patent number: 8521506
    Abstract: A computer-implemented method for use in natural language translation comprises performing in software processes, the steps of: comparing source material with stored material in a first natural language, said stored material having previously been translated from said first natural language to at least a second natural language, identifying at least a part of said source material which has a relationship with at least a part of said stored material, outputting said identified part of source material and said identified part of stored material in a form suitable for review by a user, and replacing said identified part of source material with said identified part of stored material to assist full translation of said source material from said first natural language to at least said second natural language.
    Type: Grant
    Filed: September 21, 2006
    Date of Patent: August 27, 2013
    Assignee: SDL PLC
    Inventors: Mark Lancaster, Alastair Gordon, Keith Mills
  • Patent number: 8521516
    Abstract: Systems, methods, and apparatuses including computer program products are provided for training machine learning systems. In some implementations, a method is provided. The method includes receiving a collection of phrases, normalizing a plurality of phrases of the collection of phrases, the normalizing being based at least in part on lexicographic normalizing rules, and generating a normalized phrase table including a plurality of key-value pairs, each key value pair includes a key corresponding to a normalized phrase and a value corresponding to one or more un-normalized phrases associated with the normalized key, each un-normalized phrase having one or more parameters.
    Type: Grant
    Filed: March 25, 2009
    Date of Patent: August 27, 2013
    Assignee: Google Inc.
    Inventors: Franz Josef Och, Ignacio E Thayer, Ioannis Tsochandaridis, Dmitriy Genzel
  • Patent number: 8521509
    Abstract: A method for creating and using a cross-idea association database that includes a method for associating words and word strings in a language by analyzing word formations around a word or word string to identify other words or word strings that are equivalents or near equivalents semantically. One method for associating words and word strings includes querying a collection of documents with a user-supplied word or word string, determining a user-defined amount of words or word strings to the left and right of the query string, determining the frequency of occurrence of words or word strings located on the left and right of the query string, and ranking the located words.
    Type: Grant
    Filed: May 3, 2010
    Date of Patent: August 27, 2013
    Assignee: Meaningful Machines LLC
    Inventor: Eli Abir
  • Patent number: 8521507
    Abstract: Training data in one language is leveraged to develop classifiers for multiple languages under circumstances where all of those classifiers will be performing the same kind of classification task, but relative to linguistically different sets of texts, thereby saving the cost of manually labeling a different set of training data for each language. Classification knowledge is learned for a source language in which training data are available. That knowledge is transferred to another target language's classifier through the integration of language transition knowledge. The transferred model is adjusted to better fit the target language.
    Type: Grant
    Filed: February 22, 2010
    Date of Patent: August 27, 2013
    Assignee: Yahoo! Inc.
    Inventors: Lei Shi, Mingjun Tian
  • Patent number: 8515728
    Abstract: The present translation system translates visual input and/or audio input from one language into another language. Some implementations incorporate a context-based translation that uses information obtained from visual input or audio input to aid in the translation of the other input. Other implementations combine the visual and audio translation. The translation system includes visual components and/or audio components. The visual components analyze visual input to identify a textual element and translate the textual element into a translated textual element. The visual image represents a captured image of a target scene. The visual components may further substitute the translated textual element for the textual element in the captured image. The audio components convert audio input into translated audio.
    Type: Grant
    Filed: March 29, 2007
    Date of Patent: August 20, 2013
    Assignee: Microsoft Corporation
    Inventors: Jonathan J. Boyd, Binay K. Pathak
  • Patent number: 8515735
    Abstract: Articles, surfaces, media or educational material containing a universal script, comprised of glyphs derived almost entirely from the Roman script and with only a few new glyphs, for transcription of all the world's languages, with particular attention to a means for expression of the phonemic idiosyncrasies within and between languages and language families are provided.
    Type: Grant
    Filed: April 21, 2010
    Date of Patent: August 20, 2013
    Inventor: Prasanna Chandrasekhar
  • Patent number: 8515729
    Abstract: The display language of a site may be changed to another alternate language by users of a site at any time. For example, a first user may access the same site in its default language (i.e. English) and a second user may access the site using their preferred language (i.e. French) even though the default language of the site is different from their preferred language. The language of the site may be changed from one language to another language at any time a user is accessing the site. Application content changes are identified by the site helping to ensure consistency between the default language and the alternate languages.
    Type: Grant
    Filed: March 31, 2008
    Date of Patent: August 20, 2013
    Assignee: Microsoft Corporation
    Inventors: Tomasz Grzegorz Tomko, Jay R. Rathi, Almon C. Dao
  • Patent number: 8515732
    Abstract: A first computer system sends a request to a second computer system. The second computer system determines that the first computer system utilizes a message catalog file that is not installed on the second computer system. As a result, the second computer system sends a catalog request that requests the message catalog file. The second computer system receives the message catalog file and sends a response message from the second computer system to the first computer system using the received message catalog file.
    Type: Grant
    Filed: April 19, 2012
    Date of Patent: August 20, 2013
    Assignee: International Business Machines Corporation
    Inventors: Su Liu, George F. Ramsay, III
  • Patent number: 8515730
    Abstract: An improved method of transliterating non-Latin input within an e-mail address field to the Latin equivalent. A routine in a handheld device is structured to detect a triggering event that indicates an e-mail address is being input into an e-mail address field. Following the triggering event, both prior and subsequent input is transliterated to Latin characters as these characters are required by Internet protocols. The transliteration routine may also be utilized to search an e-mail address book wherein names are recorded using both Latin and non-Latin characters.
    Type: Grant
    Filed: May 8, 2009
    Date of Patent: August 20, 2013
    Assignee: Research In Motion Limited
    Inventors: Vadim Fux, Michael Elizarov, Dan Rubanovich
  • Patent number: 8515749
    Abstract: Systems and methods for facilitating communication including recognizing speech in a first language represented in a first audio signal; forming a first text representation of the speech; processing the first text representation to form data representing a second audio signal; and causing presentation of the second audio signal to a second user while responsive to an interrupt signal from a first user. In some embodiments, processing the first text representation includes translating the first text representation to a second text representation in a second language and processing the second text representation to form the data representing the second audio signal. In some embodiments include accepting an interrupt signal from the first user and interrupting the presentation of the second audio signal.
    Type: Grant
    Filed: May 20, 2009
    Date of Patent: August 20, 2013
    Assignee: Raytheon BBN Technologies Corp.
    Inventor: David G. Stallard
  • Patent number: 8510097
    Abstract: Computer methods, apparatus and articles of manufacture therefor, are disclosed for text-characterization using a finite state transducer that along each path accepts on a first side an n-gram of text-characterization (e.g., a language or a topic) and outputs on a second side a sequence of symbols identifying one or more text-characterizations from a set of text-characterizations. The finite state transducer is applied to input data. For each n-gram accepted by the finite state transducer, a frequency counter associated with the n-gram of the one or more text-characterizations in the set of text-characterizations is incremented. The input data is classified as one or more text-characterizations from the set of text-characterizations using the frequency counters associated therewith.
    Type: Grant
    Filed: December 18, 2008
    Date of Patent: August 13, 2013
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Lauri J Karttunen, Ji Fang
  • Patent number: 8510093
    Abstract: An image processing apparatus includes a region dividing section, an character recognizing section, a classifying section, a translating section, a calculation section and a correcting section. The region dividing section divides a document image into sentence regions. The character recognizing section recognizes characters in the respective sentence regions. The classifying section classifies the sentence regions into groups in accordance with sizes of the characters. The translating section translates a sentence into a given language for each of the sentence regions. The calculation section calculates a character size of a sentence, which has been translated for each of the sentence regions by the translating section. And The correcting section corrects a size of a translated character of each character region for every sentence region classified into the same group such that the character sizes calculated by the calculating section become equal.
    Type: Grant
    Filed: March 27, 2008
    Date of Patent: August 13, 2013
    Assignee: Fuji Xerox Co., Ltd.
    Inventor: Yuya Konno
  • Patent number: 8498857
    Abstract: A system and method for porting of existing speech recognition solutions in a source language to a target language has been disclosed. The system envisaged by the present invention enables porting of a working speech recognition solution in the source language to a working system in the target language, thus minimizing the development process and reusing existing speech recognition solution components to recognize multiple languages.
    Type: Grant
    Filed: May 18, 2010
    Date of Patent: July 30, 2013
    Assignee: Tata Consultancy Services Limited
    Inventors: Sunil Kumar Kopparapu, Imran Ahmed Sheikh, Amol Sitaram Pharande
  • Patent number: 8498972
    Abstract: Inverted indexes for terms and for term separators are separately provided to minimize data redundancy. Search queries are parsed to identify terms and term separators, if any, and the corresponding inverted indexes are searched for responsive documents. Related apparatus, systems, techniques and articles are also described.
    Type: Grant
    Filed: December 16, 2010
    Date of Patent: July 30, 2013
    Assignee: SAP AG
    Inventors: Frederik Transier, Franz Faerber
  • Patent number: 8495491
    Abstract: A method, system and apparatus for locale and operating platform independent font selection. In an operating platform having an operating platform configuration and an associated locale, a locale and operating platform independent font selection method can include parsing a pre-established font properties file to identify whether a desired font referenced in the font properties file supports at least one of the operating platform configuration and the associated locale. Consequently, if it is indicated within the font properties file that the desired font supports either or both of the operating platform configuration and the associated locale, the desired font can be utilized in the operating platform.
    Type: Grant
    Filed: April 20, 2005
    Date of Patent: July 23, 2013
    Assignee: International Business Machines Corporation
    Inventor: Emad Muhanna
  • Patent number: 8494837
    Abstract: Systems and methods for active learning of statistical machine translation systems through dynamic creation and updating of parallel corpus. The systems and methods provided create accurate parallel corpus entries from a test set of sentences, words, phrases, etc. by calculating confidence scores for particular translations. Translations with high confidence scores are added directly to the corpus and the translations with low confidence scores are presented to human translations for corrections.
    Type: Grant
    Filed: August 14, 2012
    Date of Patent: July 23, 2013
    Assignee: International Business Machines Corporation
    Inventors: Yuqing Gao, Bing Xiang, Bowen Zhou
  • Patent number: 8494834
    Abstract: A method for local, computer-aided translation using remotely-generated translation predictions includes the step of determining that a translation stored in a remote translation memory is useful in translating a first portion of a local document. A local machine receives the translation. Prior to receiving a request form a translator for the translation, a determination is made that the remote translation memory stores an updated version of the translation. The updated version of the translation is identified as useful in translating a second portion of the document. The local machine generates a translation of the second portion of the document through reuse of the updated version of the translation, responsive to the identification of the utility of the updated version of the translation in translating the second portion of the document.
    Type: Grant
    Filed: November 21, 2006
    Date of Patent: July 23, 2013
    Assignee: Lionbridge Technologies, Inc.
    Inventor: Joachim Schurig