Using Extracted Text (EPO) Patents (Class 707/E17.022)
  • Patent number: 11507770
    Abstract: Described is a system and method that provides a data protection risk assessment for the overall functioning of a backup and recovery system. Accordingly, the system may provide a single overall risk assessment score that provides an operator with an “at-a-glance” overview of the entire system. Moreover, the system may account for changes that occur over time based on leveraging statistical methods to automatically generate assessment scores for various components (e.g. application, server, network, load, etc.). In order to determine a risk assessment score, the system may utilize a predictive model based on historical data. Accordingly, residual values for newly observed data may be determined using the predictive model and the system may identify potentially anomalous or high risk indicators.
    Type: Grant
    Filed: May 1, 2020
    Date of Patent: November 22, 2022
    Assignee: EMC IP HOLDING COMPANY LLC
    Inventors: Qiang Chen, Jing Yu, Pengfei Wu, Naveen Rastogi
  • Patent number: 11481554
    Abstract: Techniques are described herein for training and evaluating machine learning (ML) models for document processing computing applications using generalized vocabulary tokens. In some embodiments, an ML system determines a set of tokens for non-textual content in a plurality of documents. The ML system generates a fixed-length vocabulary that includes the set of tokens for the non-textual content. The ML system further generates, for each respective document in a training dataset of documents, a respective feature vector based at least in part on which tokens in the fixed-length vocabulary occur in the respective document. The ML system trains an ML model based at least in part on the respective feature vector for each respective document in the training dataset.
    Type: Grant
    Filed: November 8, 2019
    Date of Patent: October 25, 2022
    Assignee: Oracle International Corporation
    Inventor: Sudhakar Kalluri
  • Patent number: 11423052
    Abstract: User information categorization using consent-based class rules is described. Consent is received from a user regarding at least one functional area where user information is shareable. Based on the consent, at least one data class that is permitted to be shared is determined. A user information designation is associated with the at least one data class and class rules are applied to user information associated with the user information designation based on the association between the user information designation and the at least one data class.
    Type: Grant
    Filed: December 14, 2017
    Date of Patent: August 23, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sushain Pandit, Martin Oberhofer, Steven Lockwood
  • Patent number: 8639707
    Abstract: Retrieval is completed in a short time for presenting a retrieval result of a document file, which satisfies a retrieval condition, to a user having the authority to perform predetermined processing.
    Type: Grant
    Filed: December 16, 2010
    Date of Patent: January 28, 2014
    Assignee: International Business Machines Corporation
    Inventors: Masaki Komedani, Hirofumi Nishikawa, Fumihiko Terui
  • Patent number: 8639714
    Abstract: A variety of computer-based services that permit users to edit, compose, upload, or otherwise generate content also provide for the integration of sponsored media into presentations along with user-generated content. An exemplary service generates text based on user input, provides tags based on the text to a sponsored media repository, receives a sponsored media data structure in return, and formats sponsored media from the data structure for display to the user.
    Type: Grant
    Filed: August 29, 2007
    Date of Patent: January 28, 2014
    Assignee: Yahoo! Inc.
    Inventor: Roelof van Zwol
  • Patent number: 8626704
    Abstract: A map update data supply device and method includes an update map database of per section versions of an update data file, and a request update data extraction unit for extracting a request update section and an update data file. A safeguard update data extraction unit extracts a safeguard update section to safeguard a road network connection between adjacent sections. An integrated data generation unit integrates all versions of the update data file for each extracted request update section and generates a request update integrated data file. The integrated data generation unit integrates, per safeguard update section, versions of the update data file up to the update safeguard version for each extracted safeguard update section, and generates a safeguard update integrated data file. An integrated data supply unit supplies the generated request update integrated data file and the safeguard update integrated data file to a navigation device.
    Type: Grant
    Filed: January 13, 2011
    Date of Patent: January 7, 2014
    Assignee: Aisin Aw Co., Ltd.
    Inventor: Kimiyoshi Sawai
  • Publication number: 20130311489
    Abstract: A method for automatically extracting names that is implemented by a computer having a computer memory includes the steps of storing a list of first names in the computer memory; receiving a document in the computer memory, where at least some of the characters of the document are represented in a machine readable format; identifying a grouping of words in the document as a name candidate based on capitalization of a leading character of at least two of the words; selecting a subject word of the name candidate; comparing the subject word to the list of first names; and determining that the name candidate includes a personal name if the subject word is present in the list of first names, using the computer.
    Type: Application
    Filed: September 30, 2011
    Publication date: November 21, 2013
    Applicant: GOOGLE INC.
    Inventor: Alex Kerschhofer
  • Publication number: 20130144907
    Abstract: The present discussion relates to patient image data workflows. One example can temporarily serially arrange a set of semantic labeling modules in a patient image data workflow pipeline responsive to receiving an event trigger. The example can also remove the set of modules from the patient image data workflow pipeline responsive to receiving an event completion trigger.
    Type: Application
    Filed: December 6, 2011
    Publication date: June 6, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Steven J. White, Sayan D. Pathak, Bryan Dove, Duncan P. Robertson, Khan M. Siddiqui, Prabhu KrishnaMoorthy
  • Publication number: 20130080475
    Abstract: A system for generating statistics relating to recorded employee behavior, the system including: a first database of tasks performed by employees, the first database being stored on a computer-readable storage medium; a second database of actions taken by the employees while performing the tasks, the second database being stored on a computer-readable storage medium; and a software program, stored on a computer-readable storage medium, configured to extract information from the databases regarding the tasks performed by the employees as well as the actions performed by the employees while carrying out the tasks. The software program then calculates performance statistics relating to success or failure regarding a particular task. The software program furthermore sorts the employees into subgroups based on their status in the company and then calculates performance statistics for the subgroup to compare against individual performance within the subgroup.
    Type: Application
    Filed: September 25, 2011
    Publication date: March 28, 2013
    Inventor: Jonathon Gillen
  • Publication number: 20130073514
    Abstract: This document describes techniques that label text nodes of a seed site for each of a plurality of verticals. Once a seed site is labeled for a given vertical, the techniques extract features from the labeled text nodes of the seed site. The techniques learn vertical knowledge for the seed site based on the human labels and the extracted features, and adapt the learned vertical knowledge to a new web site to automatically and accurately identify attributes and extract attribute values targeted within a given vertical for structured web data extraction.
    Type: Application
    Filed: September 20, 2011
    Publication date: March 21, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Rui Cai, Lei Zhang, Qiang Hao
  • Publication number: 20130024476
    Abstract: A computer implemented method and system provide for automatic selection and extraction of metadata and media content from projects in a craft tool. Automated identification, classification and management of such metadata and content is provided using techniques such as pattern recognition for audio and visual content. The automatic tracking and centralised storage of metadata and content for compliance purposes can be facilitated, and can enable querying of organised metadata stored in a central database. In an example, metadata and media content are extracted automatically from a project in a craft tool at a client system and are forwarded to a host system for the creation of a cue sheet including timings for media files from timing metadata in a project file to create the timings on the cue sheet.
    Type: Application
    Filed: October 7, 2010
    Publication date: January 24, 2013
    Inventors: Charles Hodgkinson, Kirk Zavieh
  • Publication number: 20130013553
    Abstract: Some embodiments provide a verification system for automated verification of entities. The verification system automatedly verifies entities using a two-part verification campaign. One part verifies that the entity is the true owner of the entity account to be verified. This verification step involves (1) the entity receiving a verification code at the entity account and returning the verification code to the verification system, (2) the entity associating an account that it has registered at a service provider to an account that the verification system has registered at the service provider, or (3) both. Another part verifies the entity can respond to communications that are sent to methods of contact that have been previously verified as belonging to the entity. The verification system submits a first communication with a code using a verified method of contact. The verification system then monitors for a second communication to be returned with the code.
    Type: Application
    Filed: November 7, 2011
    Publication date: January 10, 2013
    Inventors: Aaron B. Stibel, Peter Delgrosso, Jeffrey M. Stibel, Shailen Misltry, Bryan Mierke, Paul Servino, Charles Chi Thoi Le, David Lo, David Allen Lyon
  • Patent number: 8346620
    Abstract: A system for interactive paper is described. Data fragments are captured at locations in a rendered document. A digital version of the document is optionally located. Markup data applied to the capture creates a rich set of interactions for the user. New models for publishing documents and new document-related services are described.
    Type: Grant
    Filed: September 28, 2010
    Date of Patent: January 1, 2013
    Assignee: Google Inc.
    Inventors: Martin T. King, Dale L. Grover, Clifford A. Kushler, James Q. Stafford-Fraser
  • Publication number: 20120303661
    Abstract: Described herein are methods, systems, apparatuses and products for automatically discovering patterns in a text corpus. An aspect provides extracting at least one context string related to at least one annotator from the at least one text corpus; analyzing the at least one context string for at least one sequence, the at least one sequence comprised of at least one subsequence; determining at least one sequence signature for each at least one sequence by applying applicable rules to the at least one sequence; and grouping the at least one sequence signature into at least one group.
    Type: Application
    Filed: May 27, 2011
    Publication date: November 29, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sebastian Johannes Blohm, Vivian Yaw-Wen Chu, Ching-Tien Ho, Yunyao Li, Huaiyu Zhu
  • Publication number: 20120264480
    Abstract: Generally described, the present disclosure relates to an electronic device having limited memory. More specifically, the disclosure relates to intelligent data sharing for advanced features on mobile platforms. In one illustrative embodiment, a mobile device provides a platform having native services that use shared data. The data can be received from a central server. In turn, the data can be separated on the mobile device into categories. For a number of contacts, these categories can include, but are not limited to, usage, total count, grouping, location and organization. After the data is placed within the categories, the data can be shared between the services for applications. These applications can include, but are not limited to, voice dialing, Bluetooth™ dialing, searching and dialing. The data can be prioritized depending on the categories. Through prioritization, data can be removed when memory is low and new data is received.
    Type: Application
    Filed: April 18, 2011
    Publication date: October 18, 2012
    Inventors: Suriyaprakash Soundrapandian, James Dean Midtun
  • Publication number: 20120239668
    Abstract: Various embodiments of systems and methods for extraction and grouping of feature words are described herein. Feature words are obtained from a first corpus of text bodies comprising a plurality of reviews. A second corpus is created using a combination of the obtained feature words, verbs and adjectives from the first corpus. The second corpus comprises filtered reviews and each of the filtered reviews pertains to a review. Topics are preliminarily assigned for words in the filtered reviews of the second corpus. For each of the feature words in the second corpus, a topic count is determined for every preliminarily assigned topic. After determining the topic count, one or more of the topics are finally assigned to the feature words based on a topic count value. At least one topic is presented as a group of the feature words for which the at least one topic is assigned based on the topic count value.
    Type: Application
    Filed: March 17, 2011
    Publication date: September 20, 2012
    Inventors: Chiranjib Bhattacharyya, Himabindu Lakkaraju, Kaushik Nath, Sunil Arvindam
  • Patent number: 8261200
    Abstract: An interactive system provides for increasing retrieval performance of images depicting text by allowing users to provide relevance feedback on words contained in the images. The system includes a user interface through which the user queries the system with query terms for images contained in the system. Word image suggestions are displayed to the user through the user interface, where each word image suggestion contains text, as recognized from the word image by the system, that is the same as or a slight variant of the particular query terms. Word image suggestions can be included in the system by the user to increase system recall of images for the one or more query terms and can be excluded from the system by the user to increase precision of image retrieval results for particular query terms.
    Type: Grant
    Filed: April 26, 2007
    Date of Patent: September 4, 2012
    Assignee: Fuji Xerox Co., Ltd.
    Inventors: Laurent Denoue, John E. Adcock, David M. Hilbert, Daniel Billsus
  • Publication number: 20120203764
    Abstract: A method of identifying one or more particular images from an image collection includes indexing the image collection to provide image descriptors for each image in the image collection such that each image is described by one or more of the image descriptors; receiving a query from a user specifying at least one keyword for an image search; and using the keyword(s) to search a second collection of tagged images to identify co-occurrence keywords. The method further includes using the identified co-occurrence keywords to provide an expanded list of keywords; using the expanded list of keywords to search the image descriptors to identify a set of candidate images satisfying the keywords; grouping the set of candidate images according to at least one of the image descriptors, and selecting one or more representative images from each grouping; and displaying the representative images to the user.
    Type: Application
    Filed: February 4, 2011
    Publication date: August 9, 2012
    Inventors: Mark D. Wood, Alexander C. Loui
  • Publication number: 20120150792
    Abstract: The present disclosure involves systems, software, and computer implemented methods for providing a data extraction framework for extracting data and metadata from an application to provide additional functionality for the extracted data and metadata. One process includes operations for identifying a first application for data extraction and determining a set of data suitable for extraction from the first application using a software development kit associated with the first application. The set of data is stored in a repository without storing visualization components of the first application in the repository. The set of data is sent to a second application for further processing of the set of data. The second application is configured to bind different visualization components to the set of data for display of data elements in the set of data to a user.
    Type: Application
    Filed: December 9, 2010
    Publication date: June 14, 2012
    Applicant: SAP PORTALS ISRAEL LTD.
    Inventors: Ohad Yassin, Pavel Kravets, Nisim Hafzadi, Ram Alon
  • Publication number: 20120136812
    Abstract: One embodiment of the present invention provides a system for optimizing and customizing document-similarity calculation. During operation, the system presents a collection of similar documents to a user, collects feedback on the similarity of the documents from the user, generates generic rules for calculating document similarity, and filters documents with customized similarity calculation based on the feedback provided by the user.
    Type: Application
    Filed: November 29, 2010
    Publication date: May 31, 2012
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventor: Oliver Brdiczka
  • Publication number: 20120089642
    Abstract: The system and methods described herein provide results previewing for an interactive text mining system in order to feed back partial query results to users before all results that are responsive to a query have been found. These partial results allow the user to see the progress of their text mining query much sooner.
    Type: Application
    Filed: October 6, 2010
    Publication date: April 12, 2012
    Inventors: David R. Milward, Roger W. Hale, Malcolm R. Parsons, Sylvia F. Knight, Christopher I. Sullivan, Jason Trenouth, James R. Thomas
  • Publication number: 20120089643
    Abstract: A computer implemented method and system provide for automatic selection and extraction of metadata and media content from projects in a craft tool. Automated identification, classification and management of such metadata and content is provided using techniques such as pattern recognition for audio and visual content. The automatic tracking and centralised storage of metadata and content for compliance purposes can be facilitated, and can enable querying of organised metadata stored in a central database. In an example, metadata and media content are extracted automatically from a project in a craft tool at a client system and are forwarded to a host system for the creation of a cue sheet including timings for media files from timing metadata in a project file to create the timings on the cue sheet.
    Type: Application
    Filed: October 7, 2010
    Publication date: April 12, 2012
    Inventors: Charles Hodgkinson, Kirk Zavieh
  • Publication number: 20120047167
    Abstract: A portable terminal includes a word extracting unit that extracts a word contained in data of a Web page being viewed; a Web search request unit that transmits a search request to a search site with the word extracted by the word extracting unit as a search word and that receives a list of Web pages that contain the search word from the search site as a search result; and a display unit that displays the search result received by the Web search request unit.
    Type: Application
    Filed: November 2, 2011
    Publication date: February 23, 2012
    Applicant: FUJITSU TOSHIBA MOBILE COMMUNICATIONS LIMITED
    Inventors: Masaki SAKAI, Natsuko OUCHI
  • Publication number: 20120047172
    Abstract: A technique includes providing a collection of documents in multiple languages, identifying, from the collection of documents, a group of candidate documents, where each candidate document in the group shares multiple corresponding rare features, evaluating pairs of candidate documents in the group using multiple common features present in the collection of documents, and determining, based on evaluating the pairs of candidate documents, whether each pair of candidate documents corresponds to a translated pair of documents.
    Type: Application
    Filed: August 22, 2011
    Publication date: February 23, 2012
    Applicant: Google Inc.
    Inventors: Jay M. Ponte, Jakob Uszkoreit, Ashok C. Popat, Moshe Dubiner
  • Publication number: 20120047176
    Abstract: A system and methodology for real-time content aggregation and syndication is described. In one embodiment, for example, a method is described for assisting a user with extracting items relevant to search queries from documents including items of various types, the method comprises steps of: receiving a search query specifying a search phrase and a particular item type; identifying documents matching the search phrase; for each matching document, determining whether the document includes an item having the particular item type; and extracting items having the particular item type from the matching documents for display to the user. The solution enables a user to aggregate and syndicate content without a professional content manager or complicated content management software tools.
    Type: Application
    Filed: November 2, 2011
    Publication date: February 23, 2012
    Applicant: SYBASE, INC.
    Inventor: Michael Timmons
  • Publication number: 20120036144
    Abstract: According to one embodiment, an information recommendation device includes the following units. The input unit is configured to input a first document and a second document which has been browsed before the first document. The subject-keyword extraction unit is configured to extract first and second subject keywords from the first and second documents, respectively. The interest-keyword extraction unit is configured to extract first interest keywords from the first and second subject keywords, and to extract second interest keywords based on information specifying the first and second documents, the first interest keywords, and the first and second subject keywords. The second interest keywords are estimated to be keywords in which the user is next interested. The acquiring unit is configured to acquire, based on the second interest keywords, recommendation information on third documents which are candidates to be browsed after the first document. The presentation unit presents the recommendation information.
    Type: Application
    Filed: August 25, 2011
    Publication date: February 9, 2012
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Masayuki Okamoto, Nayuko Watanabe, Masaaki Kikuchi, Takayuki Iida, Mika Fukui
  • Publication number: 20110307497
    Abstract: “Synthewiser”™ is a search method and system that synthesizes a single non-template, text-based document that is organized by topic and integrates and consolidates information from multiple sources. This is accomplished by: having a user provide a search phrase; creating seed phrases; identifying seed locations in multiple sources; creating expanded text segments; grouping expanded text segments; consolidating content; and synthesizing a single document. Synthewiser has advantages over today's dominant search engine. Its results are organized by topic and are integrated across multiple sources.
    Type: Application
    Filed: June 14, 2010
    Publication date: December 15, 2011
    Inventor: Robert A. Connor
  • Publication number: 20110302179
    Abstract: Described is using context information obtained from entity mentions in likely relevant documents to extract entity mentions from documents that are ambiguous with respect to their relevance to a domain. A list of entities is input into an entity extraction mechanism, which processes a large collection of documents to determine data (counts) corresponding to frequency of entity mentions. Infrequently mentioned entities are specific entities, while frequently mentioned entities are non-specific (generic or ambiguous) entities. The context surrounding mentions of the specific entities is processed to obtain interesting context terms (words, phrases or both) for the domain. The interesting context terms are then compared against the contexts of non-specific entity mentions to determine whether each non-specific entity mention is relevant to the domain. A result set containing only relevant documents, or a collection of relevant mentions, is output.
    Type: Application
    Filed: June 7, 2010
    Publication date: December 8, 2011
    Applicant: Microsoft Corporation
    Inventor: Sanjay Agrawal
  • Publication number: 20110295893
    Abstract: A method of searching an expected image in an electronic apparatus comprises the steps of inputting a hand drawing of the expected image into the electronic apparatus; determining whether or not a text description for partially characterizing the expected image is inputted; identifying and searching the expected image in the electronic apparatus according to the hand drawing if the text description is not inputted, or selecting a text label from the text description and interpreting the selected text label by the electronic apparatus if the text description is inputted; and searching a database in the electronic apparatus according to the text label, and fetching the expected image from the database if the value of the image item matches the text label. The hand drawing and/or text label inputted from a mobile phone screen are provided for arranging and searching pictures or images in the database efficiently.
    Type: Application
    Filed: April 21, 2011
    Publication date: December 1, 2011
    Applicants: INVENTEC APPLIANCES (SHANGHAI) CO. LTD., INVENTEC APPLIANCES (NANCHANG) CO. LTD., INVENTEC APPLIANCES CORP.
    Inventor: PENG-FEI WU
  • Publication number: 20110295775
    Abstract: Techniques for identifying near-duplicates of a media object and associating metadata of the near-duplicates with the media object are described herein. One or more devices implementing the techniques are configured to identify the near-duplicates based at least on similarity attributes included in the media object. Metadata is then extracted from the near-duplicates and is associated with the media object as descriptors of the media object to enable discovery of the media object based on the descriptors.
    Type: Application
    Filed: May 28, 2010
    Publication date: December 1, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Xin-Jing Wang, Lei Zhang, Ming Liu, Yi Li, Wei-Ying Ma
  • Publication number: 20110270819
    Abstract: Query classification techniques attempt to classify user search queries in order to better understand user search intent. Understanding a user's search intent allows search engines to provide relevant content tailored to the user's interest. Unfortunately, current classification techniques do not take into account contextual information. Accordingly, as provided herein, a target query may be classified based upon contextual information. In particular, features may be extracted from contextual information and/or other sources. For example, features may be extracted from the target query, related queries, and/or invoked search results of the related queries. In this way, the target query may be classified based upon other queries performed by the user and/or search results of the queries the user found interesting. In addition, a CRF model may be utilized in classifying the target query by providing generalized parameters learned from labeled query sessions.
    Type: Application
    Filed: April 30, 2010
    Publication date: November 3, 2011
    Applicant: Microsoft Corporation
    Inventors: Dou Shen, Daxin Jiang, Jian-Tao Sun
  • Publication number: 20110264675
    Abstract: A searching apparatus includes a memory unit which stores transposed indexes representing appearing positions of all n-grams in plural pieces of document data subjected to searching and appearing frequencies, an n-gram extracting unit that extracts all n-grams extractable from a searching character string, a smallest-frequency deriving unit which refers to the appearing frequency of the n-gram represented by the transposed index, and derives an n-gram with the smallest appearing frequency among all of the extracted n-grams, a searching n-gram selecting unit that selects, from all extracted n-grams, a plurality of searching n-grams which form the searching character string and include the n-gram with the smallest appearing frequency, and a document specifying unit that specifies, based on the plurality of selected searching n-grams and the appearing position of the searching n-gram represented by the transposed index, document data including the searching character string among the plural pieces of document data.
    Type: Application
    Filed: April 26, 2011
    Publication date: October 27, 2011
    Applicant: CASIO COMPUTER CO., LTD.
    Inventor: Katsuhiko SATOH
  • Publication number: 20110246027
    Abstract: An image processing system inputs a captured image of a scene viewed from a vehicle in a predetermined road section and an image-capturing position at which the image is captured. The system uses a given position in the predetermined road section as a specific position, and sets a target vehicle movement amount at the specific position, for passing through the predetermined road section. The system generates reference image data from the captured image obtained at the specific position. The system generates reference data that is used when scenic image recognition is performed, by associating the reference image data with the specific position and the target vehicle movement amount at the specific position, and generates a reference data database that is a database of the reference data.
    Type: Application
    Filed: January 25, 2011
    Publication date: October 6, 2011
    Applicant: AISIN AW CO., LTD.
    Inventor: Takayuki MIYAJIMA
  • Patent number: 8019648
    Abstract: Embodiments of the disclosed innovations provide systems and methods for locating data associated with rendered documents. Some embodiments support the use of a handheld document data capture device.
    Type: Grant
    Filed: April 1, 2005
    Date of Patent: September 13, 2011
    Assignee: Google Inc.
    Inventors: Martin T. King, Dale L. Grover, Clifford A. Kushler, James Q. Stafford-Fraser
  • Publication number: 20110213804
    Abstract: Disclosed herein is a system structure for extracting relations between technical terms within a large amount of literature information using verb-based patterns. The present invention provides a system that is capable of extracting relations based on verb-based patterns from abstract and bibliography databases in all fields of science and technology using a Tech Association Mining Appliance (TAMA) capable of detecting the technical terms of text and relations therebetween in academic literature databases in the fields of science and technology. The present invention has an advantage of providing a practical relation extraction system structure using a number of academic databases.
    Type: Application
    Filed: December 15, 2008
    Publication date: September 1, 2011
    Applicant: KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION
    Inventors: Min Ho Lee, Yun Soo Choi, Sung Pil Choi, Nam Gyu Kang, Kwang Young Kim, Han Gee Kim, Chang Hoo Jeong, Min Hee Cho, Hwa Mook Yoon
  • Publication number: 20110191381
    Abstract: Described is a technology for efficiently labeling a webpage. A wrapper tool labels records of a webpage at the record level. If an existing wrapper is appropriate for labeling a record, the wrapper tool automatically labels that record. For unlabeled records, the tool provides a user interface to label those records, and updates the set of existing wrappers with a new wrapper that is generated based upon the labeling operation; the new wrapper is then applied to any unlabeled records if appropriate for those records. As a result, a user typically needs to label only relatively few records, with the wrappers generated for those records automatically used to label the other unlabeled records of the webpage.
    Type: Application
    Filed: January 29, 2010
    Publication date: August 4, 2011
    Applicant: Microsoft Corporation
    Inventors: Shuyi Zheng, Ruihua Song, Matthew Robert Scott, Ji-Rong Wen
  • Publication number: 20110191285
    Abstract: A map update data supply device and method includes an update map database of per section versions of an update data file, and a request update data extraction unit for extracting a request update section and an update data file. A safeguard update data extraction unit extracts a safeguard update section to safeguard a road network connection between adjacent sections. An integrated data generation unit integrates all versions of the update data file for each extracted request update section and generates a request update integrated data file. The integrated data generation unit integrates, per safeguard update section, versions of the update data file up to the update safeguard version for each extracted safeguard update section, and generates a safeguard update integrated data file. An integrated data supply unit supplies the generated request update integrated data file and the safeguard update integrated data file to a navigation device.
    Type: Application
    Filed: January 13, 2011
    Publication date: August 4, 2011
    Applicant: AISIN AW CO., LTD.
    Inventor: Kimiyoshi SAWAI
  • Publication number: 20110153625
    Abstract: Provided are a retrieval device, a retrieval system, a retrieval method, and a computer program capable of completing, in a short time, a retrieval that presents a retrieval result of a document file satisfying a retrieval condition to a user having the authority to perform predetermined processing.
    Type: Application
    Filed: December 16, 2010
    Publication date: June 23, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Masaki KOMEDANI, Hirofumi NISHIKAWA, Fumihiko TERUI
  • Publication number: 20110137943
    Abstract: A word for which a keyword is desired to be decided is input, and a web page related to the input word is found by a search. Keywords (“programming language”, “object-oriented”, “education”, “seminar”), which are described in a meta tag of the found web page, are extracted. The extracted keywords are transmitted to a dictionary server where a specialized dictionary containing the input word has been registered. If any of these transmitted keywords has been registered at this dictionary server, then this keyword is decided upon as a keyword related to the input word.
    Type: Application
    Filed: November 23, 2010
    Publication date: June 9, 2011
    Inventor: Motoshige ASANO
  • Publication number: 20110125705
    Abstract: A mapping is received and stored that maps elements of a data warehouse to types of a type system implemented by a data source. Program code is generated that performs a transform of data retrieved from a data source based on the mapping. Generation of the program code may include generating program code for performing a dimension transform based on the mapping, generating program code for performing a fact transform based on the mapping, and generating program code for performing an outrigger transform based on the mapping. The generated program code may then be executed to transform the data retrieved from the data source prior to loading into the data warehouse.
    Type: Application
    Filed: November 25, 2009
    Publication date: May 26, 2011
    Inventors: Vijaykumar K. Aski, Danny Chen
  • Patent number: 7941749
    Abstract: Resolution and composition of electronic document layout are provided. An intermediate text data structure may be generated to hold a “resolved” rich text state for a given document. Properties contained in the “resolved” rich text state are a composite of all relevant properties including user defined and entered properties and including properties associated with the document according to a pre-built document context. This text body resolution process then may be utilized for generating a composite text layout for the text streams associated with a plurality of document components for generating a single rich text stream for presentation to and editing by a user.
    Type: Grant
    Filed: May 15, 2007
    Date of Patent: May 10, 2011
    Assignee: Microsoft Corporation
    Inventors: Siddharth Agrawal, Robert Parker, Dachuan Zhang
  • Publication number: 20110106752
    Abstract: Extensible observer system with report management, visual observer, and event management functionality. Uses a component loader for loading new functionality. Leverages data provider, data extractor, and state display components. Includes report manager, report generator, and report propagator. Displays visual observers that include states generated using state display components. Visual observers selectable through hierarchical selector. Manages data-driven events, generating event responses, such as message distribution, based on triggering criteria. Data extractor components used to extract state information. Component generator produces skeleton component code.
    Type: Application
    Filed: April 28, 2008
    Publication date: May 5, 2011
    Inventor: Andrew Blencowe
  • Publication number: 20110055804
    Abstract: Methods and systems for automating technology integrations are presented. A source application system that connects to external technologies, such as plug-ins, is ported from one computing environment or ecosystem to another and thereby integrated on the other ecosystem. The porting is facilitated by the extraction of information and code from the source environment, creating an XML “ecoprint” payload file, copying the ecoprint file to the target system, and applying an integration defined by the XML ecoprint payload file to connect and otherwise integrate the application system with external technologies in the target environment.
    Type: Application
    Filed: August 31, 2009
    Publication date: March 3, 2011
    Applicant: Oracle International Corporation
    Inventor: Ivo Dujmovic
  • Publication number: 20110046975
    Abstract: Methods, systems, and computer storage media are provided for determining and communicating dynamic threshold values to a particular site system. Upon initiation of one or more rules, a request is received for one or more dynamic threshold values, which are dynamically changeable values associated with healthcare data. The dynamic threshold values are determined and are communicated. One or more rules are updated to reflect the received dynamic threshold values. The rules are used in association with patient care or healthcare data extraction.
    Type: Application
    Filed: August 20, 2010
    Publication date: February 24, 2011
    Applicant: CERNER INNOVATION, INC.
    Inventor: MARK A. HOFFMAN
  • Publication number: 20110040801
    Abstract: Systems and methods consistent with the invention may include generating, using a processor of the computer system, a definition file of a first format for the data object, generating a database table, generating a mapping between the definition file and the database table, linking the definition file to a data source by including a path of the data source in the definition file, the data source including an attribute, executing, using the processor, a query to extract the attribute from the data source, importing the extracted attribute into the database table using the mapping between the definition file and the database table, and storing, in the memory device, the definition file, the database table, and the attribute for generation of the data object with the attribute.
    Type: Application
    Filed: August 11, 2009
    Publication date: February 17, 2011
    Inventors: Stephen Macaleer, John Schaefer
  • Publication number: 20110040782
    Abstract: A context sensitive searching front-end is disclosed for use in a deposition or trial proceeding wherein a computer aided transcription terminal provides real-time transcribed text down-line to attorney terminals. The terminals may thereafter use the transcribed text and any other text currently being displayed to formulate searches with little or no typing interaction required. Other text which may be used as a basis for searching includes communications from other attorney terminals, from artificial intelligence objection messages, and personal notes. Searching may be conducted on natural language or boolean front-ends which provide virtually instant feed-back as to the value of a search formulation before and after any “searching” actually occurs. Graphing of search results, including individual search word contribution, is provided for modification and selection of the documents to be reviewed.
    Type: Application
    Filed: October 12, 2010
    Publication date: February 17, 2011
    Inventors: James D. Bennett, Lawrence M. Jarvis
  • Publication number: 20110022598
    Abstract: The disclosed embodiments of computer systems and techniques utilize an ensemble semantics framework to combine knowledge acquisition systems that yield significantly higher quality resources than each system in isolation. Gains in entity extraction are achieved by combining state-of-the-art distributional and pattern-based systems with a large set of features from, for example, a webcrawl, query logs, and wisdom of the crowd sources. This results in improved query interpretation and greater relevancy in providing search results and advertising, for example.
    Type: Application
    Filed: July 24, 2009
    Publication date: January 27, 2011
    Applicant: YAHOO! INC.
    Inventors: Marco Pennacchiotti, Patrick Pantel
  • Publication number: 20100332364
    Abstract: Provided is a charging method capable of offering a user an incentive to use a particle generation factor determining system. In the particle generation factor determining system including a user interface device 11 through which a user inputs a particle map and a server 13, the server 13 calculates accuracy of each of multiple particle generation factors based on the particle map; the user interface device 11 displays the calculated accuracy or a title of generation-factor-relevant information 27 on each particle generation factor corresponding to this accuracy; the server 13 provides the generation-factor-relevant information 27 to the user interface device 11; and a charged fee for providing the generation-factor-relevant information 27 is determined based on accuracy of a particle generation factor corresponding to the provided generation-factor-relevant information 27.
    Type: Application
    Filed: June 22, 2010
    Publication date: December 30, 2010
    Applicant: TOKYO ELECTRON LIMITED
    Inventor: Tsuyoshi Moriya
  • Publication number: 20100312767
    Abstract: Provided is an information process apparatus including: an extraction unit which is configured to extract words in a predetermined word class from comments which predetermined users write about a predetermined item; a grouping unit which is configured to group the predetermined users by performing a multivariate analysis using the words extracted by the extraction unit; a storage unit which is configured to store the groups, the predetermined item, and the words in association with each other; a determination unit which is configured to determine which group a user who is to write a comment belongs to when the user is to write the comment about the predetermined item; and a reading unit which is configured to read from the storage unit words which are associated with the group determined by the determination unit and the predetermined item which the comment is to be written about.
    Type: Application
    Filed: May 14, 2010
    Publication date: December 9, 2010
    Inventor: Mari SAITO
  • Publication number: 20100306094
    Abstract: Systems, methods, and apparatus for identifying payees from cleared items posted to a financial account are provided. Information associated with one or more cleared items posted to a financial account of a consumer may be obtained. Based at least in part on the obtained information, at least one payee may be identified. A suggestion to add the identified at least one payee as one of an electronic biller of the consumer or a payee of the consumer for online payment functionality may be generated, wherein receiving an acceptance of the suggestion facilitates activation of an associated service for the consumer by a service provider. The generated suggestion may be transmitted to a network entity for presentation to the consumer.
    Type: Application
    Filed: May 28, 2009
    Publication date: December 2, 2010
    Applicant: FISERV, INC.
    Inventors: Robert T. Homer, Mary Elizabeth Lawson, Donald Kenneth Hobday, Jr., Hans Daniel Dreyer