Determining Word Hyphenation Location Patents (Class 715/258)
-
Patent number: 9021025Abstract: A server-based system is provided for analyzing and valuating social media campaign performance. The system collects online traffic data from member users of a social media channel, and tracks engagement activity with respect to a social media campaign operating on the channel according to parameters defined with respect to specific activities including: user interactions with media content of the campaign; user transactions as non-commercial data exchanges or commercial purchases conducted via the campaign; and user sharing of the campaign with other member users of the social media channel. The system establishes baseline performance thresholds and aggregates the tracked engagement activity with respect to the defined parameters. A graphical user interface is generated to display client feedback related to a comparison of the aggregated engagement activity with the established baseline performance thresholds for each of the defined parameters.Type: GrantFiled: March 28, 2014Date of Patent: April 28, 2015Assignee: Moontoast, LLCInventors: Marcus Whitney, John Baker
-
Patent number: 9009591Abstract: Architecture that enables a user to perform manual word-breaking by phrase input. Phrase input is where the user inserts a phrase-key (or separator) as a delimiter that indicates to an editor application such as an IME (input method editor) the composition of a specific phrase when entering characters (e.g., Asian). The word-breaking is controlled by the user. The conversion quality is improved as the user knows the desired input and ambiguous cases are reduced. A phrase can be specified while the user is composing the characters. By selecting a phrase-key separator, the user can specify the composing characters before the characters are presented as a phrase. Moreover, the architecture includes a phrase prioritization mechanism wherein each phrase can be treated as a single entity and assigned a character identifier (ID), which is related to the sequence of a candidate list.Type: GrantFiled: December 11, 2008Date of Patent: April 14, 2015Assignee: Microsoft CorporationInventors: Yun-Chiang Kung, Sheng-Yao Shih, Chin-Nan Lee, Yu-Li Huang, Hsiang-Fu Liu
-
Patent number: 8996994Abstract: Tools and techniques are described for providing multi-lingual word hyphenation using inductive machine learning on training data. Methods provided by these techniques may receive training data that includes hyphenated words, and may inductively generate hyphenation patterns that represent substrings of these words. The hyphenation patterns may include the substrings and hyphenation codes associated with characters occurring in the substrings. The methods may receive induction parameters applicable to generating the hyphenation patterns, and may store the hyphenation patterns into a language-specific lexicon file. These methods may also receive requests to hyphenate input words that occur in a human language, and may evaluate how to process the request based on the language. The methods may search for hyphenation patterns occurring in the input words, with the hyphenation patterns being stored in the lexicon file.Type: GrantFiled: January 16, 2008Date of Patent: March 31, 2015Assignee: Microsoft Technology Licensing, LLCInventors: Siarhei Alonichau, Ravi Shahani, Kevin Powell
-
Patent number: 8933827Abstract: A data processing apparatus that is capable of reducing the garbling of characters caused by the difference among the character codes when setting data are transferred to another apparatus by the import-export function. A storage unit stores setting data for the data processing apparatus. A receiving unit receives an instruction for exporting the setting data stored in the storage unit. A converting unit converts Unicode data included in the setting data into character code data of language, which is set to the data processing apparatus. An export unit exports the character code data converted by the converting unit and the Unicode data.Type: GrantFiled: May 24, 2013Date of Patent: January 13, 2015Assignee: Canon Kabushiki KaishaInventor: Noritsugu Okayama
-
Patent number: 8904273Abstract: A value formatting system for determining format specifications in reports is provided. The value formatting system comprises a format collection unit for collecting data value attributes and patterns for a data item in a report, and a format merging unit for merging the collected data value attributes and patterns into a format specification for the data item.Type: GrantFiled: July 2, 2004Date of Patent: December 2, 2014Assignee: International Business Machines CorporationInventors: Kevin M. Ferguson, Xu Wu
-
Patent number: 8849653Abstract: Methods, computer program products and systems for updating a dictionary on a computer are provided including identifying a set of words included with an application program being installed on the computer and identifying one or more installed dictionaries within the computer. The dictionaries that are identified, typically by searching the computer for installed dictionaries, may then be updated by adding one or more words of the set to at least one of the installed dictionaries. Optionally, one or more installed dictionaries may be identified by the user responding to a query asking the user to identify dictionaries or applications that contain dictionaries installed on the computer. An application programming interface may add the words to the dictionaries by associating the installed dictionaries with one or more installed application programs and identifying an API that is specific for the application and using the API to update the dictionaries.Type: GrantFiled: May 9, 2006Date of Patent: September 30, 2014Assignee: International Business Machines CorporationInventors: Brian Bauman, Amanda J. Burton, Michael P. Carlson, Herman Rodriguez
-
Patent number: 8762370Abstract: One embodiment of the present invention provides a system that automatically generates synonyms for words from documents. During operation, this system determines co-occurrence frequencies for pairs of words in the documents. The system also determines closeness scores for pairs of words in the documents, wherein a closeness score indicates whether a pair of words are located so close to each other that the words are likely to occur in the same sentence or phrase. Finally, the system determines whether pairs of words are synonyms based on the determined co-occurrence frequencies and the determined closeness scores. While making this determination, the system can additionally consider correlations between words in a title or an anchor of a document and words in the document as well as word-form scores for pairs of words in the documents.Type: GrantFiled: February 8, 2013Date of Patent: June 24, 2014Assignee: Google Inc.Inventors: Oleksandr Grushetskyy, Steven D. Baker
-
Patent number: 8510101Abstract: Described are computer-based methods and apparatuses, including computer program products, for automation of auditing claims. Data indicative of an insurance company name is received, the data comprising one or more words. The data is processed through one or more processing steps to generate processed data comprising one or more processed words. One or more candidate word strings are selected based on the one or more processed words. Matching information is associated with each of the one or more candidate word strings. Analysis information is generated for each of the one or more candidate word strings based on the associated matching information. An insurance company identifier is associated with received data based on the analysis information and one or more matching rules.Type: GrantFiled: August 13, 2012Date of Patent: August 13, 2013Assignee: HyperQuest, Inc.Inventors: Dennis M. Hogan, Jeffrey J. Hogan
-
Patent number: 8478583Abstract: Described are computer-based methods and apparatuses, including computer program products, for automation of auditing claims. Data indicative of an insurance company name is received, the data comprising one or more words. The data is processed through one or more processing steps to generate processed data comprising one or more processed words. One or more candidate word strings are selected based on the one or more processed words. Matching information is associated with each of the one or more candidate word strings. Analysis information is generated for each of the one or more candidate word strings based on the associated matching information. An insurance company identifier is associated with received data based on the analysis information and one or more matching rules.Type: GrantFiled: August 17, 2012Date of Patent: July 2, 2013Assignee: HyperQuest, Inc.Inventors: Dennis M. Hogan, Jeffrey J. Hogan
-
Patent number: 8473280Abstract: A method for facilitating texting in a non-Latin-based language includes storing mapping between a non-Latin-based language and a Latin-based language in a computer storage. The characters in the non-Latin-based language are mapped to a first group of characters in the Latin-based language. The diacritics in the non-Latin-based language are mapped to a second group of characters in the Latin-based language. The characters in the non-Latin-based language in a texting message are converted into in the coded text comprising characters in the Latin-based language according to the mapping. The texting message having the coded text is transmitted from a first communication device to a second communication device. The characters of the Latin-based language in the coded text are converted back into the characters in the non-Latin-based language according to the mapping to allow a user to read the text message in the non-Latin-based language on the second communication device.Type: GrantFiled: August 6, 2010Date of Patent: June 25, 2013Assignee: King Abdulaziz City for Science & TechnologyInventors: Hussein K. Al-Omari, Mohammad S. Khorsheed
-
Patent number: 8412516Abstract: A document analysis, commenting, and reporting system provides tools that automate quality assurance analysis tailored to specific document types. As one example, the specific document type may be a requirements specification and the system may tag different parts of requirements, including actors, entities, modes, and a remainder. However, the flexibility of the system permits analysis of any other document type, such as instruction manuals and best practices guides. The system helps avoid confusion over the document when it is delivered because of non-standard terms, ambiguous language, conflicts between document sections, incomplete or inaccurate descriptions, size and complexity of the document, and other issues.Type: GrantFiled: November 27, 2007Date of Patent: April 2, 2013Assignee: Accenture Global Services LimitedInventors: Kunal Verma, Alex Kass
-
Patent number: 8346764Abstract: The present inventors devised, among other things, systems, methods, and software for enhancing the relevancy of content presented to users in response to queries in an online information retrieval system. One exemplary system refines a user input query by making suggestions of alternatives queries. A switchboard module converts the refined query, administers one or more searches, and collects search results from one or more search engines based on the refined query. And, a post-processor module refines the collected search results by, for example, modifying the order of the results, removing inappropriate or undesirable content from the results, and/or applying historical performance analysis, based for example on social feedback.Type: GrantFiled: September 11, 2009Date of Patent: January 1, 2013Assignee: Thomson Reuters Global ResourcesInventors: Douglas T. Rosenoff, Christopher Crowhurst
-
Patent number: 8271873Abstract: Automatically detecting the layout of a bidirectional (BIDI) text. In one embodiment, this comprises: determining a shaping attribute of the text based on scanning the text to detect at least one character of a specific shape, while preferably ignoring initial, final, and stand-alone shaped characters; determining a text type attribute of the text from the shaping attribute; and determining an orientation attribute of the text from the determined text type. A numeric shaping attribute and a symmetric swapping attribute may also be determined. Embodiments of the invention may be used, by way of illustration but not of limitation, for bidirectional layout transformation to allow conversion of a BIDI text received from a source computer system in a source BIDI layout format into a layout format of a target computer system.Type: GrantFiled: July 27, 2010Date of Patent: September 18, 2012Assignee: International Business Machines CorporationInventors: Hisham E. El-Shishiny, Waleed M. Oransa
-
Patent number: 8255205Abstract: Described are computer-based methods and apparatuses, including computer program products, for automation of auditing claims. Data indicative of an insurance company name is received, the data comprising one or more words. The data is processed through one or more processing steps to generate processed data comprising one or more processed words. One or more candidate word strings are selected based on the one or more processed words. Matching information is associated with each of the one or more candidate word strings. Analysis information is generated for each of the one or more candidate word strings based on the associated matching information. An insurance company identifier is associated with received data based on the analysis information and one or more matching rules.Type: GrantFiled: May 29, 2009Date of Patent: August 28, 2012Assignee: HyperQuest, Inc.Inventors: Dennis M. Hogan, Jeffrey J. Hogan
-
Patent number: 8234567Abstract: A method and device for displaying a document written in a Western language in a readable form without decreasing the utilization efficiency of a display screen. When a tail of a word to be displayed goes beyond an end of a line on the screen and the number of characters contained in the word does not exceeds a reference value, a line feed is inserted immediately before the word to perform word-wrapping, and when the number of characters contained in the word exceeds the reference value, a line feed is inserted halfway within the word to display divided parts of the word on different lines.Type: GrantFiled: April 16, 2008Date of Patent: July 31, 2012Assignee: Sharp Kabushiki KaishaInventor: Hisashi Tamaru
-
Patent number: 8140966Abstract: A computer system, method and program is generated for editing text data, which have been created on a basis of a voice or a video image. In one embodiment, a first computer for outputting voice or video image(s), and for identifying a part of text data, which needs to be edited, in response to a received instruction is provided. A second computer for outputting voice or video image, and for editing the part, which has been identified by the first computer, in response to said received instruction is also provided. A mouse and keyboard can also be incorporated into the invention.Type: GrantFiled: October 25, 2005Date of Patent: March 20, 2012Assignee: International Business Machines CorporationInventors: Kohtaroh Miyamoto, Midori Shoji
-
Patent number: 8095878Abstract: The exemplary embodiment of the present invention relates to the dynamic activation and deactivation of a spell check feature option for instant message chat session features within a virtual environment. As such, the spell check feature can be activated and deactivated within the virtual environment based upon the identity of an engaged party to chat session or the identities of other parties that are in proximity to the chat session parties within the virtual environment.Type: GrantFiled: June 23, 2008Date of Patent: January 10, 2012Assignee: International Business Machines CorporationInventors: Cary Lee Bates, Zachary A. Garbow
-
Patent number: 7979413Abstract: In accordance with an aspect of the invention, a method and system are disclosed for constructing an embedded signature in order to facilitate post-facto detection of leakage of sensitive data. The leakage detection mechanism involves: 1) identifying at least one set of words in an electronic document containing sensitive data, the set of words having a low frequency of occurrence in a first collection of electronic documents; and, 2) transmitting a query to search a second collection of electronic documents for any electronic document that contains the set of words having a low frequency of occurrence.Type: GrantFiled: May 30, 2008Date of Patent: July 12, 2011Assignees: AT&T Intellectual Property l, L.P., New York UniversityInventors: Balachander Krishnamurthy, Saurabh Kumar, Lakshminarayanan Subramanian
-
Patent number: 7890521Abstract: One embodiment of the present invention provides a system that automatically generates synonyms for words from documents. During operation, this system determines co-occurrence frequencies for pairs of words in the documents. The system also determines closeness scores for pairs of words in the documents, wherein a closeness score indicates whether a pair of words are located so close to each other that the words are likely to occur in the same sentence or phrase. Finally, the system determines whether pairs of words are synonyms based on the determined co-occurrence frequencies and the determined closeness scores. While making this determination, the system can additionally consider correlations between words in a title or an anchor of a document and words in the document as well as word-form scores for pairs of words in the documents.Type: GrantFiled: February 7, 2008Date of Patent: February 15, 2011Assignee: Google Inc.Inventors: Oleksandr Grushetskyy, Steven D. Baker
-
Patent number: 7805464Abstract: A setup dialog for a web viewer dynamically changes to reflect parameters needed to generate a web address. The setup dialog can include a list box containing websites for which website templates are available for assisting the user in generating web addresses. The list box can also include an option to generate a custom web address without using a website template. An extensible grammar is provided for generating custom website templates.Type: GrantFiled: September 18, 2006Date of Patent: September 28, 2010Assignee: Apple Inc.Inventors: David McKee, Tom Lloyd, John Lorin Welshofer, Christopher Crim
-
Patent number: 7770111Abstract: A method and computer-readable medium are provided for optimized paragraph layout. According to the method, a line-by-line paragraph layout is generated determine a maximum penalty for a paragraph. One or more layout nodes may be generated that include data representing a possible layout of the paragraph to a point within a backing store and identifying any changes to a geometry of the page caused by the layout of the paragraph defined by the layout node. A penalty is also calculated for each of the layout nodes. Each layout node that has a penalty less than the maximum penalty calculated during the line-by-line formatting processing is added to the node pool. This process repeats until only a single layout node remains in the node pool that represents the entire paragraph of text.Type: GrantFiled: December 20, 2004Date of Patent: August 3, 2010Assignee: Microsoft CorporationInventors: Andrei Burago, Christoph E. Ammann, Sergey Genkin, Eliyezer Kohen, Victor Kozyrev, Anton A. Sukhanov, Igor Zverev
-
Patent number: 7698127Abstract: In a computer, a user interface allows users to input requests to various applications or software modules executing on the computer. During the input process, the invention presents to the user a grammar-based automatic completion of the user input so far, and/or a grammar-based suggestion list of one or more possible options for completing the user input.Type: GrantFiled: October 1, 2004Date of Patent: April 13, 2010Assignee: Microsoft CorporationInventors: Tandy Woodson Trower, II, Mark J. Weinberg
-
Patent number: 7516404Abstract: Systems and methods of correcting grammatical and spelling errors that involve improper positioning of a whitespace character and/or an extra whitespace character. Removal of an extra whitespace character or repositioning of an improperly positioned whitespace character may result in correction of two misspelled words in a single correction step.Type: GrantFiled: June 2, 2004Date of Patent: April 7, 2009Inventor: Steven M. Colby
-
Patent number: 7516418Abstract: A unique system and method that facilitate automatic tracking of user data and providing on-demand site or merchant reputation information is provided. The system and method involve monitoring user input such as by keystrokes over a period of time and forming a long continuous string of characters. The string of characters can be monitored for repeating substrings or non-repeating substrings that indicate sensitive user information such as SSN or credit card number (CCN). The substrings can be displayed to the user in a digest form and order according to frequency of occurrence, for example. Thus, the user can readily view where and when their sensitive information has been entered. In addition, the typing of certain substrings such as CCNs or other sensitive field data can trigger a reputation service to retrieve the reputation data for the current site or merchant to mitigate dealing with fraudulent sites or merchants.Type: GrantFiled: June 1, 2006Date of Patent: April 7, 2009Assignee: Microsoft CorporationInventors: Cormac E. Herley, Dinei A. Florencio
-
Patent number: 7451398Abstract: Providing capitalization correction for unstructured excerpts is described. An excerpt of unstructured content is tokenized into a set of words. The set of words is analyzed for correct capitalization. Individual characters constituting at least one such word in the set of words are evaluated. The at least one such word is skipped if determined to be of a predefined type.Type: GrantFiled: November 18, 2003Date of Patent: November 11, 2008Assignee: Google, Inc.Inventor: Christopher Rohrs
-
Patent number: 7447627Abstract: A method of determining the component words of a compound word is disclosed. The method identifies the component words, by comparing the word with a list of words found in a lexicon. If the word is not found in the lexicon the method proceeds to analyze the word on a character-by-character basis. After each character the method identifies any potential matches to the selected characters in the lexicon. If a match is found, it is added to a hypothesis trace in a lattice. Next, the method checks to see whether the remaining characters form a valid entry in the lexicon, and whether the entry is allowed to be a final segment.Type: GrantFiled: March 19, 2004Date of Patent: November 4, 2008Assignee: Microsoft CorporationInventors: Andrea Maria Jessee, Miriam R. Eckert, Kevin R. Powell