Determining Word Hyphenation Location Patents (Class 715/258)
  • Patent number: 9021025
    Abstract: A server-based system is provided for analyzing and valuating social media campaign performance. The system collects online traffic data from member users of a social media channel, and tracks engagement activity with respect to a social media campaign operating on the channel according to parameters defined with respect to specific activities including: user interactions with media content of the campaign; user transactions as non-commercial data exchanges or commercial purchases conducted via the campaign; and user sharing of the campaign with other member users of the social media channel. The system establishes baseline performance thresholds and aggregates the tracked engagement activity with respect to the defined parameters. A graphical user interface is generated to display client feedback related to a comparison of the aggregated engagement activity with the established baseline performance thresholds for each of the defined parameters.
    Type: Grant
    Filed: March 28, 2014
    Date of Patent: April 28, 2015
    Assignee: Moontoast, LLC
    Inventors: Marcus Whitney, John Baker
  • Patent number: 9009591
    Abstract: Architecture that enables a user to perform manual word-breaking by phrase input. Phrase input is where the user inserts a phrase-key (or separator) as a delimiter that indicates to an editor application such as an IME (input method editor) the composition of a specific phrase when entering characters (e.g., Asian). The word-breaking is controlled by the user. The conversion quality is improved as the user knows the desired input and ambiguous cases are reduced. A phrase can be specified while the user is composing the characters. By selecting a phrase-key separator, the user can specify the composing characters before the characters are presented as a phrase. Moreover, the architecture includes a phrase prioritization mechanism wherein each phrase can be treated as a single entity and assigned a character identifier (ID), which is related to the sequence of a candidate list.
    Type: Grant
    Filed: December 11, 2008
    Date of Patent: April 14, 2015
    Assignee: Microsoft Corporation
    Inventors: Yun-Chiang Kung, Sheng-Yao Shih, Chin-Nan Lee, Yu-Li Huang, Hsiang-Fu Liu
  • Patent number: 8996994
    Abstract: Tools and techniques are described for providing multi-lingual word hyphenation using inductive machine learning on training data. Methods provided by these techniques may receive training data that includes hyphenated words, and may inductively generate hyphenation patterns that represent substrings of these words. The hyphenation patterns may include the substrings and hyphenation codes associated with characters occurring in the substrings. The methods may receive induction parameters applicable to generating the hyphenation patterns, and may store the hyphenation patterns into a language-specific lexicon file. These methods may also receive requests to hyphenate input words that occur in a human language, and may evaluate how to process the request based on the language. The methods may search for hyphenation patterns occurring in the input words, with the hyphenation patterns being stored in the lexicon file.
    Type: Grant
    Filed: January 16, 2008
    Date of Patent: March 31, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Siarhei Alonichau, Ravi Shahani, Kevin Powell
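    Illustrative sketch: the abstract above describes patterns made of substrings plus per-character hyphenation codes, looked up in the input word. The Python below shows one way the lookup step could work. It borrows the pattern convention familiar from TeX (digits between letters, odd values permit a break, even values forbid one), which is an assumption and not something the abstract specifies. The function names, the `min_left`/`min_right` parameters, and the sample patterns are hypothetical, and the inductive training step is not shown.

```python
import re

def parse_pattern(pattern):
    """Split a pattern such as 'hy3ph' into its letters and per-gap codes."""
    letters = re.sub(r"\d", "", pattern)
    codes = [0] * (len(letters) + 1)
    pos = 0
    for ch in pattern:
        if ch.isdigit():
            codes[pos] = int(ch)   # code for the gap before letter `pos`
        else:
            pos += 1
    return letters, codes

def hyphenate(word, patterns, min_left=2, min_right=2):
    """Return the pieces of `word` between permitted hyphenation points."""
    lexicon = dict(parse_pattern(p) for p in patterns)
    padded = "." + word.lower() + "."      # dots mark the word boundaries
    scores = [0] * (len(padded) + 1)       # scores[i]: gap before padded[i]
    for start in range(len(padded)):
        for end in range(start + 1, len(padded) + 1):
            codes = lexicon.get(padded[start:end])
            if codes:
                for offset, code in enumerate(codes):
                    # The highest code seen for a gap wins.
                    scores[start + offset] = max(scores[start + offset], code)
    pieces, last = [], 0
    for k in range(min_left, len(word) - min_right + 1):
        if scores[k + 1] % 2 == 1:         # odd code: a break is allowed here
            pieces.append(word[last:k])
            last = k
    pieces.append(word[last:])
    return pieces

# Hypothetical sample patterns for demonstration only.
SAMPLE_PATTERNS = ["hy3ph", "he2n", "hena4", "hen5at", "1na", "n2at", "1tio", "2io", "o2n"]
print("-".join(hyphenate("hyphenation", SAMPLE_PATTERNS)))  # hy-phen-ation
```

    In a per-language setup like the one the abstract mentions, each language would load its own pattern list before calling `hyphenate`.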
  • Patent number: 8933827
    Abstract: A data processing apparatus that is capable of reducing the garbling of characters caused by the difference among the character codes when setting data are transferred to another apparatus by the import-export function. A storage unit stores setting data for the data processing apparatus. A receiving unit receives an instruction for exporting the setting data stored in the storage unit. A converting unit converts Unicode data included in the setting data into character code data of language, which is set to the data processing apparatus. An export unit exports the character code data converted by the converting unit and the Unicode data.
    Type: Grant
    Filed: May 24, 2013
    Date of Patent: January 13, 2015
    Assignee: Canon Kabushiki Kaisha
    Inventor: Noritsugu Okayama
  • Patent number: 8904273
    Abstract: A value formatting system for determining format specifications in reports is provided. The value formatting system comprises a format collection unit for collecting data value attributes and patterns for a data item in a report, and a format merging unit for merging the collected data value attributes and patterns into a format specification for the data item.
    Type: Grant
    Filed: July 2, 2004
    Date of Patent: December 2, 2014
    Assignee: International Business Machines Corporation
    Inventors: Kevin M. Ferguson, Xu Wu
  • Patent number: 8849653
    Abstract: Methods, computer program products and systems for updating a dictionary on a computer are provided including identifying a set of words included with an application program being installed on the computer and identifying one or more installed dictionaries within the computer. The dictionaries that are identified, typically by searching the computer for installed dictionaries, may then be updated by adding one or more words of the set to at least one of the installed dictionaries. Optionally, one or more installed dictionaries may be identified by the user responding to a query asking the user to identify dictionaries or applications that contain dictionaries installed on the computer. An application programming interface may add the words to the dictionaries by associating the installed dictionaries with one or more installed application programs and identifying an API that is specific for the application and using the API to update the dictionaries.
    Type: Grant
    Filed: May 9, 2006
    Date of Patent: September 30, 2014
    Assignee: International Business Machines Corporation
    Inventors: Brian Bauman, Amanda J. Burton, Michael P. Carlson, Herman Rodriguez
  • Patent number: 8762370
    Abstract: One embodiment of the present invention provides a system that automatically generates synonyms for words from documents. During operation, this system determines co-occurrence frequencies for pairs of words in the documents. The system also determines closeness scores for pairs of words in the documents, wherein a closeness score indicates whether a pair of words are located so close to each other that the words are likely to occur in the same sentence or phrase. Finally, the system determines whether pairs of words are synonyms based on the determined co-occurrence frequencies and the determined closeness scores. While making this determination, the system can additionally consider correlations between words in a title or an anchor of a document and words in the document as well as word-form scores for pairs of words in the documents.
    Type: Grant
    Filed: February 8, 2013
    Date of Patent: June 24, 2014
    Assignee: Google Inc.
    Inventors: Oleksandr Grushetskyy, Steven D. Baker
  • Patent number: 8510101
    Abstract: Described are computer-based methods and apparatuses, including computer program products, for automation of auditing claims. Data indicative of an insurance company name is received, the data comprising one or more words. The data is processed through one or more processing steps to generate processed data comprising one or more processed words. One or more candidate word strings are selected based on the one or more processed words. Matching information is associated with each of the one or more candidate word strings. Analysis information is generated for each of the one or more candidate word strings based on the associated matching information. An insurance company identifier is associated with received data based on the analysis information and one or more matching rules.
    Type: Grant
    Filed: August 13, 2012
    Date of Patent: August 13, 2013
    Assignee: HyperQuest, Inc.
    Inventors: Dennis M. Hogan, Jeffrey J. Hogan
  • Patent number: 8478583
    Abstract: Described are computer-based methods and apparatuses, including computer program products, for automation of auditing claims. Data indicative of an insurance company name is received, the data comprising one or more words. The data is processed through one or more processing steps to generate processed data comprising one or more processed words. One or more candidate word strings are selected based on the one or more processed words. Matching information is associated with each of the one or more candidate word strings. Analysis information is generated for each of the one or more candidate word strings based on the associated matching information. An insurance company identifier is associated with received data based on the analysis information and one or more matching rules.
    Type: Grant
    Filed: August 17, 2012
    Date of Patent: July 2, 2013
    Assignee: HyperQuest, Inc.
    Inventors: Dennis M. Hogan, Jeffrey J. Hogan
  • Patent number: 8473280
    Abstract: A method for facilitating texting in a non-Latin-based language includes storing mapping between a non-Latin-based language and a Latin-based language in a computer storage. The characters in the non-Latin-based language are mapped to a first group of characters in the Latin-based language. The diacritics in the non-Latin-based language are mapped to a second group of characters in the Latin-based language. The characters in the non-Latin-based language in a texting message are converted into coded text comprising characters in the Latin-based language according to the mapping. The texting message having the coded text is transmitted from a first communication device to a second communication device. The characters of the Latin-based language in the coded text are converted back into the characters in the non-Latin-based language according to the mapping to allow a user to read the text message in the non-Latin-based language on the second communication device.
    Type: Grant
    Filed: August 6, 2010
    Date of Patent: June 25, 2013
    Assignee: King Abdulaziz City for Science & Technology
    Inventors: Hussein K. Al-Omari, Mohammad S. Khorsheed
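    Illustrative sketch: the round trip described in the abstract above (map characters to one Latin group, diacritics to a second group, then reverse the mapping on the receiving device) can be shown with two small tables. The tables below are hypothetical and cover only three Arabic letters and three diacritics; the abstract does not give the actual mapping. Lowercase and uppercase Latin letters stand in for the two groups.

```python
# Hypothetical mapping tables; the real tables are not given in the abstract.
LETTERS = {"\u0643": "k", "\u062A": "t", "\u0628": "b"}     # base characters -> first group
DIACRITICS = {"\u064E": "A", "\u0650": "I", "\u064F": "U"}  # diacritics -> second group

ENCODE = {**LETTERS, **DIACRITICS}
DECODE = {latin: native for native, latin in ENCODE.items()}

def encode(message):
    """Convert a message to coded Latin text; unmapped characters pass through."""
    return "".join(ENCODE.get(ch, ch) for ch in message)

def decode(coded):
    """Convert coded Latin text back to the original script."""
    return "".join(DECODE.get(ch, ch) for ch in coded)

original = "\u0643\u064E\u062A\u064E\u0628\u064E"
coded = encode(original)
print(coded)                      # kAtAbA
print(decode(coded) == original)  # True
```

    Because unmapped characters pass through unchanged, a message that already contains one of the Latin code letters would not survive the round trip; a fuller scheme would need an escape convention for that case.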
  • Patent number: 8412516
    Abstract: A document analysis, commenting, and reporting system provides tools that automate quality assurance analysis tailored to specific document types. As one example, the specific document type may be a requirements specification and the system may tag different parts of requirements, including actors, entities, modes, and a remainder. However, the flexibility of the system permits analysis of any other document type, such as instruction manuals and best practices guides. The system helps avoid confusion over the document when it is delivered because of non-standard terms, ambiguous language, conflicts between document sections, incomplete or inaccurate descriptions, size and complexity of the document, and other issues.
    Type: Grant
    Filed: November 27, 2007
    Date of Patent: April 2, 2013
    Assignee: Accenture Global Services Limited
    Inventors: Kunal Verma, Alex Kass
  • Patent number: 8346764
    Abstract: The present inventors devised, among other things, systems, methods, and software for enhancing the relevancy of content presented to users in response to queries in an online information retrieval system. One exemplary system refines a user input query by making suggestions of alternative queries. A switchboard module converts the refined query, administers one or more searches, and collects search results from one or more search engines based on the refined query. And, a post-processor module refines the collected search results by, for example, modifying the order of the results, removing inappropriate or undesirable content from the results, and/or applying historical performance analysis, based for example on social feedback.
    Type: Grant
    Filed: September 11, 2009
    Date of Patent: January 1, 2013
    Assignee: Thomson Reuters Global Resources
    Inventors: Douglas T. Rosenoff, Christopher Crowhurst
  • Patent number: 8271873
    Abstract: Automatically detecting the layout of a bidirectional (BIDI) text. In one embodiment, this comprises: determining a shaping attribute of the text based on scanning the text to detect at least one character of a specific shape, while preferably ignoring initial, final, and stand-alone shaped characters; determining a text type attribute of the text from the shaping attribute; and determining an orientation attribute of the text from the determined text type. A numeric shaping attribute and a symmetric swapping attribute may also be determined. Embodiments of the invention may be used, by way of illustration but not of limitation, for bidirectional layout transformation to allow conversion of a BIDI text received from a source computer system in a source BIDI layout format into a layout format of a target computer system.
    Type: Grant
    Filed: July 27, 2010
    Date of Patent: September 18, 2012
    Assignee: International Business Machines Corporation
    Inventors: Hisham E. El-Shishiny, Waleed M. Oransa
  • Patent number: 8255205
    Abstract: Described are computer-based methods and apparatuses, including computer program products, for automation of auditing claims. Data indicative of an insurance company name is received, the data comprising one or more words. The data is processed through one or more processing steps to generate processed data comprising one or more processed words. One or more candidate word strings are selected based on the one or more processed words. Matching information is associated with each of the one or more candidate word strings. Analysis information is generated for each of the one or more candidate word strings based on the associated matching information. An insurance company identifier is associated with received data based on the analysis information and one or more matching rules.
    Type: Grant
    Filed: May 29, 2009
    Date of Patent: August 28, 2012
    Assignee: HyperQuest, Inc.
    Inventors: Dennis M. Hogan, Jeffrey J. Hogan
  • Patent number: 8234567
    Abstract: A method and device for displaying a document written in a Western language in a readable form without decreasing the utilization efficiency of a display screen. When a tail of a word to be displayed goes beyond an end of a line on the screen and the number of characters contained in the word does not exceed a reference value, a line feed is inserted immediately before the word to perform word-wrapping, and when the number of characters contained in the word exceeds the reference value, a line feed is inserted halfway within the word to display divided parts of the word on different lines.
    Type: Grant
    Filed: April 16, 2008
    Date of Patent: July 31, 2012
    Assignee: Sharp Kabushiki Kaisha
    Inventor: Hisashi Tamaru
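    Illustrative sketch: the rule in the abstract above (wrap a short word whole, split a long word at the line end) might look like the Python below. The function name `wrap_lines` and its parameters are hypothetical. The abstract does not say whether a hyphen is shown at the split, so none is added here.

```python
def wrap_lines(text, width, reference):
    """Wrap `text` to `width` columns.

    A word that overflows the line is moved whole to the next line when it
    has at most `reference` characters; a longer word is split at the line end.
    """
    if width < 1:
        raise ValueError("width must be at least 1")
    lines, current = [], ""
    for word in text.split():
        while word:
            sep = " " if current else ""
            room = width - len(current) - len(sep)
            if len(word) <= room:
                current += sep + word          # the word fits on this line
                word = ""
            elif len(word) <= reference and current:
                lines.append(current)          # short word: wrap it whole
                current = ""
            else:
                if room > 0:                   # long word: fill the line, carry the rest
                    current += sep + word[:room]
                    word = word[room:]
                lines.append(current)
                current = ""
    if current:
        lines.append(current)
    return lines

for line in wrap_lines("the internationalization effort", width=10, reference=8):
    print(line)
```

    With a small `reference`, more words are split and less screen space is left empty; with a large one, the behavior approaches ordinary word-wrapping.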
  • Patent number: 8140966
    Abstract: A computer system, method and program is generated for editing text data, which have been created on a basis of a voice or a video image. In one embodiment, a first computer for outputting voice or video image(s), and for identifying a part of text data, which needs to be edited, in response to a received instruction is provided. A second computer for outputting voice or video image, and for editing the part, which has been identified by the first computer, in response to said received instruction is also provided. A mouse and keyboard can also be incorporated into the invention.
    Type: Grant
    Filed: October 25, 2005
    Date of Patent: March 20, 2012
    Assignee: International Business Machines Corporation
    Inventors: Kohtaroh Miyamoto, Midori Shoji
  • Patent number: 8095878
    Abstract: The exemplary embodiment of the present invention relates to the dynamic activation and deactivation of a spell check feature option for instant message chat session features within a virtual environment. As such, the spell check feature can be activated and deactivated within the virtual environment based upon the identity of an engaged party to a chat session or the identities of other parties that are in proximity to the chat session parties within the virtual environment.
    Type: Grant
    Filed: June 23, 2008
    Date of Patent: January 10, 2012
    Assignee: International Business Machines Corporation
    Inventors: Cary Lee Bates, Zachary A. Garbow
  • Patent number: 7979413
    Abstract: In accordance with an aspect of the invention, a method and system are disclosed for constructing an embedded signature in order to facilitate post-facto detection of leakage of sensitive data. The leakage detection mechanism involves: 1) identifying at least one set of words in an electronic document containing sensitive data, the set of words having a low frequency of occurrence in a first collection of electronic documents; and, 2) transmitting a query to search a second collection of electronic documents for any electronic document that contains the set of words having a low frequency of occurrence.
    Type: Grant
    Filed: May 30, 2008
    Date of Patent: July 12, 2011
    Assignees: AT&T Intellectual Property I, L.P., New York University
    Inventors: Balachander Krishnamurthy, Saurabh Kumar, Lakshminarayanan Subramanian
  • Patent number: 7890521
    Abstract: One embodiment of the present invention provides a system that automatically generates synonyms for words from documents. During operation, this system determines co-occurrence frequencies for pairs of words in the documents. The system also determines closeness scores for pairs of words in the documents, wherein a closeness score indicates whether a pair of words are located so close to each other that the words are likely to occur in the same sentence or phrase. Finally, the system determines whether pairs of words are synonyms based on the determined co-occurrence frequencies and the determined closeness scores. While making this determination, the system can additionally consider correlations between words in a title or an anchor of a document and words in the document as well as word-form scores for pairs of words in the documents.
    Type: Grant
    Filed: February 7, 2008
    Date of Patent: February 15, 2011
    Assignee: Google Inc.
    Inventors: Oleksandr Grushetskyy, Steven D. Baker
  • Patent number: 7805464
    Abstract: A setup dialog for a web viewer dynamically changes to reflect parameters needed to generate a web address. The setup dialog can include a list box containing websites for which website templates are available for assisting the user in generating web addresses. The list box can also include an option to generate a custom web address without using a website template. An extensible grammar is provided for generating custom website templates.
    Type: Grant
    Filed: September 18, 2006
    Date of Patent: September 28, 2010
    Assignee: Apple Inc.
    Inventors: David McKee, Tom Lloyd, John Lorin Welshofer, Christopher Crim
  • Patent number: 7770111
    Abstract: A method and computer-readable medium are provided for optimized paragraph layout. According to the method, a line-by-line paragraph layout is generated to determine a maximum penalty for a paragraph. One or more layout nodes may be generated that include data representing a possible layout of the paragraph to a point within a backing store and identifying any changes to a geometry of the page caused by the layout of the paragraph defined by the layout node. A penalty is also calculated for each of the layout nodes. Each layout node that has a penalty less than the maximum penalty calculated during the line-by-line formatting processing is added to the node pool. This process repeats until only a single layout node remains in the node pool that represents the entire paragraph of text.
    Type: Grant
    Filed: December 20, 2004
    Date of Patent: August 3, 2010
    Assignee: Microsoft Corporation
    Inventors: Andrei Burago, Christoph E. Ammann, Sergey Genkin, Eliyezer Kohen, Victor Kozyrev, Anton A. Sukhanov, Igor Zverev
  • Patent number: 7698127
    Abstract: In a computer, a user interface allows users to input requests to various applications or software modules executing on the computer. During the input process, the invention presents to the user a grammar-based automatic completion of the user input so far, and/or a grammar-based suggestion list of one or more possible options for completing the user input.
    Type: Grant
    Filed: October 1, 2004
    Date of Patent: April 13, 2010
    Assignee: Microsoft Corporation
    Inventors: Tandy Woodson Trower, II, Mark J. Weinberg
  • Patent number: 7516418
    Abstract: A unique system and method that facilitate automatic tracking of user data and providing on-demand site or merchant reputation information is provided. The system and method involve monitoring user input such as by keystrokes over a period of time and forming a long continuous string of characters. The string of characters can be monitored for repeating substrings or non-repeating substrings that indicate sensitive user information such as SSN or credit card number (CCN). The substrings can be displayed to the user in a digest form and order according to frequency of occurrence, for example. Thus, the user can readily view where and when their sensitive information has been entered. In addition, the typing of certain substrings such as CCNs or other sensitive field data can trigger a reputation service to retrieve the reputation data for the current site or merchant to mitigate dealing with fraudulent sites or merchants.
    Type: Grant
    Filed: June 1, 2006
    Date of Patent: April 7, 2009
    Assignee: Microsoft Corporation
    Inventors: Cormac E. Herley, Dinei A. Florencio
  • Patent number: 7516404
    Abstract: Systems and methods of correcting grammatical and spelling errors that involve improper positioning of a whitespace character and/or an extra whitespace character. Removal of an extra whitespace character or repositioning of an improperly positioned whitespace character may result in correction of two misspelled words in a single correction step.
    Type: Grant
    Filed: June 2, 2004
    Date of Patent: April 7, 2009
    Inventor: Steven M. Colby
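    Illustrative sketch: the idea in the abstract above, that fixing one whitespace character can repair two misspelled words at once, can be shown with a small dictionary check. The function name `fix_whitespace`, the toy dictionary, and the rule of only examining pairs where both tokens are misspelled are assumptions made for this example.

```python
def fix_whitespace(tokens, dictionary):
    """Repair adjacent misspelled tokens by removing or moving the space between them."""
    result, i = [], 0
    while i < len(tokens):
        pair_misspelled = (
            i + 1 < len(tokens)
            and tokens[i] not in dictionary
            and tokens[i + 1] not in dictionary
        )
        if pair_misspelled:
            joined = tokens[i] + tokens[i + 1]
            fix = None
            if joined in dictionary:
                fix = [joined]                     # the space was an extra one
            else:
                for cut in range(1, len(joined)):  # try every new position for the space
                    left, right = joined[:cut], joined[cut:]
                    if left in dictionary and right in dictionary:
                        fix = [left, right]
                        break
            if fix:
                result.extend(fix)
                i += 2
                continue
        result.append(tokens[i])
        i += 1
    return result

# Toy dictionary for demonstration only.
WORDS = {"the", "cat", "sat", "on", "mat", "another"}
print(" ".join(fix_whitespace("th ecat sat on the mat".split(), WORDS)))
print(" ".join(fix_whitespace("ano ther cat".split(), WORDS)))
```

    The first call moves the space ("th ecat" becomes "the cat"); the second removes an extra space ("ano ther" becomes "another"). Tokens with no valid repair are left unchanged.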
  • Patent number: 7451398
    Abstract: Providing capitalization correction for unstructured excerpts is described. An excerpt of unstructured content is tokenized into a set of words. The set of words is analyzed for correct capitalization. Individual characters constituting at least one such word in the set of words are evaluated. The at least one such word is skipped if determined to be of a predefined type.
    Type: Grant
    Filed: November 18, 2003
    Date of Patent: November 11, 2008
    Assignee: Google, Inc.
    Inventor: Christopher Rohrs
  • Patent number: 7447627
    Abstract: A method of determining the component words of a compound word is disclosed. The method identifies the component words, by comparing the word with a list of words found in a lexicon. If the word is not found in the lexicon the method proceeds to analyze the word on a character-by-character basis. After each character the method identifies any potential matches to the selected characters in the lexicon. If a match is found, it is added to a hypothesis trace in a lattice. Next, the method checks to see whether the remaining characters form a valid entry in the lexicon, and whether the entry is allowed to be a final segment.
    Type: Grant
    Filed: March 19, 2004
    Date of Patent: November 4, 2008
    Assignee: Microsoft Corporation
    Inventors: Andrea Maria Jessee, Miriam R. Eckert, Kevin R. Powell
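    Illustrative sketch: the compound-splitting procedure in the abstract above (look the word up, otherwise scan character by character, record lexicon matches as hypotheses, and require the last piece to be allowed as a final segment) is sketched below in Python. A plain recursive search stands in for the lattice of hypothesis traces, and the lexicon layout (word mapped to a "may be final" flag) is a hypothetical simplification.

```python
def split_compound(word, lexicon):
    """Return every segmentation of `word` into lexicon entries.

    `lexicon` maps each known word to True if it may end a compound,
    False otherwise. A word found directly in the lexicon is returned as is.
    """
    if word in lexicon:
        return [[word]]
    results = []

    def extend(start, trace):
        # Grow the candidate one character at a time from `start`.
        for end in range(start + 1, len(word) + 1):
            piece = word[start:end]
            if piece not in lexicon:
                continue
            if end == len(word):
                if lexicon[piece]:         # the last piece must be allowed as final
                    results.append(trace + [piece])
            else:
                extend(end, trace + [piece])

    extend(0, [])
    return results

# Toy lexicon for demonstration only; "er" is marked as not allowed in final position.
LEXICON = {"sun": True, "flower": True, "flow": True, "er": False, "seed": True}
print(split_compound("sunflower", LEXICON))
```

    Here the hypothesis "sun" + "flow" + "er" is rejected because "er" is not permitted as a final segment, leaving "sun" + "flower" as the only result.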