Of Unstructured Textual Data (epo) Patents (Class 707/E17.058)
  • Publication number: 20130269543
    Abstract: A nutritional substance information system collects, stores, tracks, and transmits information regarding the creation, preservation, transformation, conditioning and consumption of nutritional substances, and importantly, in nutritional values, and correlates such information with various organizations, entities, industries, and governments outside the nutritional substance supply systems, so as to optimize the production of nutritional substances, as well as optimize the consumption of nutritional substances.
    Type: Application
    Filed: May 31, 2012
    Publication date: October 17, 2013
    Inventor: Eugenio Minvielle
  • Publication number: 20130268462
    Abstract: A method and related apparatus for producing a compilation of works, including the steps of generating an excerpt from a work that has an associated original copyright metadata, creating an additional copyright metadata with per object pricing for objects of the excerpt, and associating the original copyright metadata and the additional copyright metadata with the excerpt.
    Type: Application
    Filed: April 5, 2012
    Publication date: October 10, 2013
    Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
    Inventors: David ASAO, Daniel BARBER, Philip WU, Toshiro FUJIMORI
  • Publication number: 20130262483
    Abstract: An approach is provided for providing intelligent processing of contextual information. An context platform determines at least one feature based, at least in part, on one or more contextual parameters The context platform further processes one or more contextual records to determine whether the at least one feature is a feature anchor based, at least in part, on whether the at least feature is represented above at least one threshold level. The context platform also processes the one or more contextual records to determine at least one profile for the at least one feature anchor.
    Type: Application
    Filed: March 30, 2012
    Publication date: October 3, 2013
    Applicant: Nokia Corporation
    Inventors: Jan Otto Blom, Juha Kalevi Laurila, Julian Charles Nolan, Nikolai Nefedov
  • Publication number: 20130232160
    Abstract: A novel system and computer-implemented method for quickly and efficiently finding and reporting all clones with a large corpus of text. This is achieved by tokenizing the corpus, computing a rolling hash, filtering for hashes that occur more than once, and constructing an equivalence relation over these hashes in which hashes are equated if they are part of the same instance of duplication. The equivalence relation is then used to report all detected clones.
    Type: Application
    Filed: March 2, 2012
    Publication date: September 5, 2013
    Applicant: SEMMLE LIMITED
    Inventor: Julian TIBBLE
  • Publication number: 20130218914
    Abstract: A recommendation method includes receiving a user's review of an item that includes a textual comment. Deficient features of the reviewed item are identified from the text by applying a set of extraction patterns. Each pattern is satisfied when a term in the text, which is associated in a structured terminology with one of a predefined set of features, is in a syntactic relation with another term in the text, such as a polar adjective or expression of a wish or a lack. When such a pattern is satisfied, the corresponding feature is considered a deficient feature. Feature attributes of the reviewed item are compared with corresponding feature attributes of a set of items to identify any improved items whose attribute for the deficient feature is better than that for the reviewed item. The improved item or items can be recommended to the user or to others reading the review.
    Type: Application
    Filed: February 20, 2012
    Publication date: August 22, 2013
    Applicant: Xerox Corporation
    Inventors: Anna Stavrianou, Caroline Brun
  • Publication number: 20130218888
    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning tags in a calendar. These tags can be hashtags, metadata, or other markings used to identify or classify an item. These tags are received and associated with an event, which can be a previous, present, or future event. A calendar is identified based on the tag and the event. In certain instances this identification lends to a work calendar, whereas in other instances the identification lends to a personal calendar, a community calendar, a shared calendar, or a public calendar. Upon identifying the calendar, the tag is inserted into the identified calendar, and associated with a calendar entry in the calendar, the calendar entry being associated with the event.
    Type: Application
    Filed: February 21, 2012
    Publication date: August 22, 2013
    Applicant: Avaya Inc.
    Inventor: Doree Duncan SELIGMANN
  • Publication number: 20130204884
    Abstract: Profiles associated with two applications are received. Each profile identifies a set of data fields identified by a corresponding full path name. Associations between data fields of the profiles are identified based on mapping pairs included in a full path mapping database, mapping pairs included in a shortest unique path mapping database, and mapping pairs included in a leaf mapping database. A prioritized list of mapping suggestions is provided based on the identified associations. A mapping suggestion can include a data manipulation operation according to information associated with a corresponding mapping pair.
    Type: Application
    Filed: February 6, 2012
    Publication date: August 8, 2013
    Applicant: DELL PRODUCTS, LP
    Inventors: Mitchell J. Stewart, James T. Ahlborn
  • Publication number: 20130204883
    Abstract: Various technologies described herein pertain to computing top-K pairwise co-occurrence statistics using an upper bounding heuristic. Upper bound values of a co-occurrence statistic for items in a set can be computed based on a query item, and items can be sorted into an order. The items and the query item are represented by respective portions of a tensor. An item from the order associated with a highest upper bound value can be selected, an actual value of the co-occurrence statistic can be computed for the selected item, the upper bound value for the selected item can be replaced with the actual value for the selected item, and the selected item can be repositioned in the order. When the top-K items in the order lack an item associated with an upper bound value, the top-K items and actual values of the co-occurrence statistic for the top-K items can be outputted.
    Type: Application
    Filed: February 2, 2012
    Publication date: August 8, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Alice Xiao-Zhou Zheng, Yucheng Low
  • Publication number: 20130198181
    Abstract: A computer-implemented method for summarising a set of articles relating to a topic, comprises using metadata of respective articles in the set to generate multiple subsets of articles, each article within a subset linked by a common article parameter, summarising content of the articles in a subset by extracting key phrases from constituent articles, editing extracted summaries for respective ones of the subsets of articles according to a predetermined optimisation goal to generate an article review for the topic.
    Type: Application
    Filed: March 29, 2012
    Publication date: August 1, 2013
    Applicant: Qatar Foundation
    Inventors: Sihem AMER-YAHIA, Paul Coyne, Arend Kuster
  • Publication number: 20130185314
    Abstract: Data sources, such as web pages or databases, store or output entities that include data or other information. To compare entities generated by different data sources, and to identify duplicate entities, a scoring function is generated for each pair of data sources that can generate a similarity score that represents the similarity of two entities from the data sources in the pair. To generate the scoring functions, training data is generated for each pair of data sources and reviewed by a judge. The training data is used to generate the scoring functions using machine learning. In order to reduce the amount of training data that is used, transfer learning techniques are applied to use information learned from generating one scoring function for a pair of sources when generating a scoring function for a subsequent pair of sources.
    Type: Application
    Filed: January 16, 2012
    Publication date: July 18, 2013
    Applicant: Microsoft Corporation
    Inventors: Benjamin Rubinstein, Olivier Dabrowski, Sahand Negahban-Hagh, David James Gemmell
  • Publication number: 20130166563
    Abstract: Example systems and methods of integrating text analysis and search functionality are presented. In one example, a plurality of documents, as well as search information comprising search terms for a search category, are accessed. Each of the documents that include at least one of the search terms is identified. The identified documents are analyzed to determine those of the identified documents that are logically associated with the search category. Each of the documents determined to be logically associated with the search category are tagged with the search category.
    Type: Application
    Filed: December 21, 2011
    Publication date: June 27, 2013
    Applicant: SAP AG
    Inventors: Thomas Mueller, Florian Kresser, Daniel Buchmann, Hans-Martin Ludwig, Thomas Finke, Karl Fuerst
  • Publication number: 20130166550
    Abstract: Example systems and methods of integrating data tags with their associated object data are presented. In one implementation, a data object employed in a first computer application is accessed. Examples of the data object include, but are not limited to, structured data and unstructured data. Tagging data that is descriptive of the first data object is also accessed. The tagging data is stored in at least one of the first data object and a separate data object linked with the first data object. The tagging data and the first data object are processed using a second computer application.
    Type: Application
    Filed: December 21, 2011
    Publication date: June 27, 2013
    Applicant: SAP AG
    Inventors: Daniel Buchmann, Thomas Mueller, Hans-Martin Ludwig, Florian Kresser, Thomas Finke, Karl Fuerst
  • Publication number: 20130155118
    Abstract: Methods of generating heatmaps including receiving, at a first electronic device, first information associated with a first zone of a plurality of zones of a content item, determining at least one first concept related to the first information, receiving at least one target content characteristic, determining at least one second concept related to the at least one target content characteristic, and determining a first heat of the first zone based on the first and second concepts, the first heat representing a measure of similarity between the first and second concepts.
    Type: Application
    Filed: December 20, 2011
    Publication date: June 20, 2013
    Applicants: INSTITUT TELECOM, ALCATEL-LUCENT
    Inventors: Julien Robinson, Myriam Ribière, Mathias Baglioni, Eric Lecolinet, Johann Daigremont
  • Publication number: 20130144878
    Abstract: The subject disclosure relates to one or more computer-implemented processes for collecting, analyzing, and employing annotations of data sources. In particular, an annotation component is configured to receive annotations of data for a data source, wherein the respective annotations comprise different associations of a global terms with the data of the data source, a data store configured to store the annotations, and an interface component configured to render the data based on the annotations in response to a request for the data. In an aspect, storing information, the data also stores descriptions of the data sources and definitions of the global terms, and the interface component determines a subset of the information in the data store based on the annotations. A method is further provided comprising receiving a global term and determining data sources that have the global term associated with the data thereof based on the information in the data store.
    Type: Application
    Filed: December 2, 2011
    Publication date: June 6, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Alex James, Michael Pizzo, Pablo Castro, Michael Justin Flasko, Lance Olson, Jason Clark, Siddharth Jayadevan
  • Publication number: 20130132395
    Abstract: Expanding a user's social connections to include mobile contacts includes: receiving a user's call logs for call/text transactions from a mobile device; deriving usage information from the call logs to determine social pairs; ranking the social pairs based on strength of connectedness; and merging the ranked social pairs with the user's contacts from on-line social networks.
    Type: Application
    Filed: November 23, 2011
    Publication date: May 23, 2013
    Applicant: Yahoo! Inc.
    Inventor: Arpit Gupta
  • Publication number: 20130132410
    Abstract: In accordance with the teachings described herein, systems and methods are provided for identifying potential duplicate entries in a database. Matchcodes are generated for a plurality of records, wherein a matchcode for a record may be generated by: receiving a character string from the record; determining whether the character string includes a non-essential character substring; if the non-essential character substring is missing from the character string, then generating the matchcode from the character string and adding a wildcard character to the matchcode in place of the missing non-essential character substring. The matchcodes for the plurality of records may be compared to identify matching pairs of matchcodes, wherein for the purpose of identifying a matching pair of matchcodes, two characters are considered the same if they are equal or if one or both are wildcard characters.
    Type: Application
    Filed: November 18, 2011
    Publication date: May 23, 2013
    Inventor: Brian Carl Rineer
  • Publication number: 20130117289
    Abstract: The present disclosure involves computer-implemented methods, software, and systems for supporting migration of unstructured data stored in enterprise content management systems. A computer-implemented method includes generating a search for content matching at least one content search rule, receiving a list of matched documents, wherein each document in the list of matched documents is associated with at least a source repository identifier and a unique document identifier, calculating a target repository identifier and at least one metadata change instruction for each unique document identifier using at least one migration rule, and modifying metadata for the document associated with each unique document identifier using the calculated at least one metadata change instruction.
    Type: Application
    Filed: November 9, 2011
    Publication date: May 9, 2013
    Applicant: SAP AG
    Inventors: Martin P. Fischer, Heiko Kiessling, Dieter Guendisch, Alexander Rieder, Achim Weigel, Paul Goetz, Martin Hermes, Stephan Klevenz, Martin Kreyscher, Corneliu D. Mitu, Juergen Schneider, Johannes Weber
  • Publication number: 20130103698
    Abstract: Items are sorted in an order. Each item has a relevance score. The items are displayed in the order in which the items are sorted. Each item is displayed in a manner corresponding to or based on the relevance score of the item.
    Type: Application
    Filed: October 21, 2011
    Publication date: April 25, 2013
    Inventor: Carsten Schlipf
  • Publication number: 20130097129
    Abstract: A method of dynamically performing data transformations on information that is transmitted between a user device and a web service may include receiving interface code from the web service, receiving an input from the user device that identifies a data type, and a data transformation to be applied to data instances matching the data type. The method may also include causing a definition file to be stored with the data type, the data transformation, and a resource locator. The method may additionally include, in a second communication session, intercepting a transmission, accessing the definition file using the resource locator, determining whether the data instance matches the data type, causing the data transformation to be performed on the data instance to generate transformed data, and inserting the transformed data into the transmission if the data instance matches the data type.
    Type: Application
    Filed: October 17, 2012
    Publication date: April 18, 2013
    Applicant: CIPHERPOINT SOFTWARE, INC.
    Inventor: CIPHERPOINT SOFTWARE, INC.
  • Publication number: 20130086474
    Abstract: Exemplary media content management and presentation systems and methods are described herein. An exemplary method includes a media content presentation system linking together multiple media content instances, playing back or managing a playlist that includes the linked media content instances, and processing the linked media content instances as a block of linked media content instances within the playing back or managing of the playlist. Another exemplary method includes a media content presentation system playing back a media content instance for experiencing by a user, presenting a playback user interface in conjunction with the playing back of the media content instance, and providing, within the playback user interface, one or more media management tools configured for use by the user to manage the media content instance during the playing back of the media content instance. Corresponding systems and methods are also disclosed.
    Type: Application
    Filed: September 30, 2011
    Publication date: April 4, 2013
    Applicant: VERIZON PATENT AND LICENSING INC.
    Inventors: Michael R. Oliver, Brian F. Roberts
  • Publication number: 20130086077
    Abstract: An approach is provided for presenting a user interface and associating one or more commenting information with on one or more content items detected in one or more media items. Further, a user may associate one or more commenting information related to a point of interest/object wherein one or more content items associated with the point of interest/object may be retrieved and aggregated with the one or more commenting information.
    Type: Application
    Filed: September 30, 2011
    Publication date: April 4, 2013
    Applicant: NOKIA CORPORATION
    Inventors: Petri Matti Olavi Piippo, Jan Peter Erik Eskolin, Jussi Severi Uusitalo, Tero Juhani Hakala
  • Publication number: 20130086059
    Abstract: A method of automatically processing text data is described. An initial set of data tags is developed that characterize text data in a text database. Higher order entities are determined which are characteristic of patterns in the data tags. Then the text data is automatically tagged based on the higher order entities.
    Type: Application
    Filed: October 3, 2011
    Publication date: April 4, 2013
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Rajesh Balchandran, Leonid Rachevsky, Bhuvana Ramabhadran
  • Patent number: 8412536
    Abstract: A system that facilitates publishing and consuming information that is of time sensitivity, for example, price information. Methods are employed to achieve completeness and freshness in information for a given domain. A preferred embodiment is a shopping site that is capable of comparing prices, purchasing bundled products and dealing with coupons.
    Type: Grant
    Filed: August 4, 2009
    Date of Patent: April 2, 2013
    Assignee: Namul Applications LLC
    Inventors: Yu Cao, Leonard Kleinrock
  • Publication number: 20130060795
    Abstract: Performance of database systems may be improved by reducing the processing performed with each database query. For example, when a database query, such as a SQL statement, is executed with a first set of values, the query may be stored as a prepared statement and parsed and optimized as a section. When a similar database query is executed with a new set of values the section may be re-executed with the new set of values without re-parsing or re-optimizing the prepared statement. A similar database query may continue to be executed with new sets of values until the section is invalid because of an alteration to the table definitions of the database schema.
    Type: Application
    Filed: September 7, 2011
    Publication date: March 7, 2013
    Applicant: Unisys Corp.
    Inventors: James M. Plasek, Michael S. Jende, Ronald H. Menzhuber, Jennifer J. Smith
  • Publication number: 20130060772
    Abstract: A computerized project management analytical system and method that develops and manages an ontology that links objects and is capable of being mined. The ontology is comprised of a project ontology framework, a matching engine and a project status matrix that illustrates a multi-relational view of the project status, of confidence levels, or interdiction points and/or positions on project timelines.
    Type: Application
    Filed: October 5, 2012
    Publication date: March 7, 2013
    Applicant: Metier, Ltd.
    Inventor: Metier, Ltd.
  • Publication number: 20130054622
    Abstract: In one exemplary embodiment, a set of attributes derived from an element of a first digital document is obtained. The element is identified from eye-tracking data of a user viewing the digital document. A search query of a database comprising at least one query term is received. A set of documents in the database is identified according to the search query. An attribute score is determined for each document. The set of documents are sorted according to the attribute score. Optionally, a commonality between the query term and at least one member of the set of attributes may be determined. The search query may be generated by the user. The database may be a hypermedia database.
    Type: Application
    Filed: September 15, 2011
    Publication date: February 28, 2013
    Inventors: Amit V. Karmarkar, Sharada Karmarkar, Richard R. Peters
  • Publication number: 20130054596
    Abstract: Methods and apparatus consistent with the invention provide the ability to organize and build understandings of machine data generated by a variety of information-processing environments. Machine data is a product of information-processing systems (e.g., activity logs, configuration files, messages, database records) and represents the evidence of particular events that have taken place and been recorded in raw data format. In one embodiment, machine data is turned into a machine data web by organizing machine data into events and then linking events together.
    Type: Application
    Filed: October 30, 2012
    Publication date: February 28, 2013
    Applicant: SPLUNK INC.
    Inventor: SPLUNK INC.
  • Publication number: 20130024463
    Abstract: Systems, methods, and computer-readable code stored on a non-transitory media for assessing an entity's innovation level by one or more computing devices include gathering information relating to an entity's performance in plural disciplines; capturing strengths and opportunities of the entity based on the gathered information; generating an innovation score of the entity; analyzing the innovation score to generate an innovation report; and returning the innovation report to the entity
    Type: Application
    Filed: January 3, 2012
    Publication date: January 24, 2013
    Applicant: INFOSYS LIMITED
    Inventor: Rajaram Venkataraman
  • Publication number: 20130013627
    Abstract: An embodiment relates to a novel apparatus and method for changing modes of notification in an electronic device. An electronic device includes a calendar application and a variety of other applications such as the message reader application or the daily alarm application. The device is configured to use the calendar application to track whether and how the user is notified of the receipt of an electronic message. In one embodiment, the user specifically associates a profile behavior to the calendar entry when the calendar entry is first created.
    Type: Application
    Filed: September 14, 2012
    Publication date: January 10, 2013
    Applicant: RESEARCH IN MOTION LIMITED
    Inventors: David Yach, David Castell, Neil Adams, Michael K. Brown, Ian Patterson
  • Publication number: 20130007026
    Abstract: In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold.
    Type: Application
    Filed: September 13, 2012
    Publication date: January 3, 2013
    Inventors: Joshua ALSPECTOR, Aleksander KOLCZ, Abdur R. CHOWDHURY
  • Publication number: 20130007020
    Abstract: An exemplary embodiment of the present techniques extracts concepts and relationships from a text. Concepts may be generated from the text using singular value decomposition, and ranked based on a term weight and a distance metric. The concepts that are ranked above a particular threshold may be iteratively extracted, and the concepts may be merged to form larger concepts until the generation of concepts has stabilized. Relationships may be generated based on the concepts using singular value decomposition, then ranked based on various metrics. The relationships that are ranked above a particular threshold may be extracted.
    Type: Application
    Filed: June 30, 2011
    Publication date: January 3, 2013
    Inventors: Sujoy Basu, Sharad Singhal
  • Publication number: 20130007013
    Abstract: Various embodiments are disclosed that relate to negatively matching users over a network. For example, one disclosed embodiment provides a method including storing a plurality of user profiles corresponding to a plurality of users, each user profile in the plurality of user profiles including one or more user attributes, and receiving a request from a user for a list of one or more suggested negatively matched other users. In response to the request, the method further includes ranking each of a plurality of other users based on a magnitude of a difference between one or more user attributes of the user and corresponding one or more user attributes of the other user, and sending a list of one or more negatively matched users to the exclusion of more positively matched users based on the ranking.
    Type: Application
    Filed: June 30, 2011
    Publication date: January 3, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Kevin Geisner, Relja Markovic, Stephen Latta
  • Publication number: 20130006995
    Abstract: A method for configuring a computer system to provide access to stored electronic resources may be described. The method can include determining a topic framework between stored electronic resources and topic names by determining topic names for topic framework by generating topic names based on names assigned to storage sets and generating topic names based on attributes of resources. Further forming associations between resources and topic names by associating resources with topic names generated based a storage set and associating resources having attributes with topic names generated based on attributes of the resources. Also, storing the framework to provide structure so resources can be accessed using the topic names and using the topic framework to present a group of stored resources associated with topic names so that the group of resources can be selected for access.
    Type: Application
    Filed: December 10, 2010
    Publication date: January 3, 2013
    Applicant: Chesterdeal Limited
    Inventor: Sebastian Toke-Nichols
  • Publication number: 20120330906
    Abstract: A system and method comprising: receiving itinerary data from at least two sources; identifying a traveler associated with the itinerary data; and adding information about the identified traveler to the itinerary data.
    Type: Application
    Filed: September 4, 2012
    Publication date: December 27, 2012
    Applicant: CONCUR TECHNOLOGIES, INC.
    Inventors: Michael FREDERICKS, Keith MOFFATT, Brian Jeffrey OLLENBERGER, Lisa Anne SILVERIA, Richard Thor DENMARK, Michael LORE
  • Publication number: 20120330978
    Abstract: Two methods for measuring keyword-document relevance are described. The methods receive a keyword and a document as input and output a probability value for the keyword. The first method is a similarity-based approach which uses techniques for measuring similarity between two short-text segments to measure relevance between the keyword and the document. The second method is a regression-based approach based on an assumption that if an out-of-document phrase (the keyword) is semantically similar to an in-document phrase, then relevance scores of the in and out-of document phrases should be close to each other.
    Type: Application
    Filed: September 11, 2012
    Publication date: December 27, 2012
    Applicant: Microsoft Corporation
    Inventors: Wen-tau Yih, Christopher A. Meek
  • Publication number: 20120330975
    Abstract: Profiling systems and methods of creating and using user interest profiles are described. In some example embodiments, the method includes: creating a topic set which includes topics which are organized in a hierarchical structure which includes a plurality of topic levels including an upper topic level and a lower topic level, each topic in the lower topic level being a subtopic of at least one of the topics in the upper topic level; monitoring interest in a plurality of documents for a user to identify one or more documents-of-interest to the user; and based on the monitored interest for the user, creating an interest profile for the user by determining a measure of topical interest for the user for at least one of the topics at the upper topic level and for a subtopic of that topic, the subtopic being at the lower topic level.
    Type: Application
    Filed: December 2, 2011
    Publication date: December 27, 2012
    Applicant: ROGERS COMMUNICATIONS INC.
    Inventors: Hyun Chul LEE, Yingbo MIAO, Liqin XU
  • Publication number: 20120323936
    Abstract: A computer-implemented method of and a device, such as a base station for a headset, for arranging text items in a predefined order, comprising storing, in the memory of a peripheral device, a collection of multiple text items arranged in multiple sets of text items and in multiple groups of text items; storing a respective code item with a respective group of text items; and storing a sort key that has values that designate a predefined order of the text items in each set. The sort key is appended to the text items and comprises at least one character with a value within the Private Use range of the Unicode format.
    Type: Application
    Filed: June 14, 2012
    Publication date: December 20, 2012
    Applicant: GN Netcom A/S
    Inventor: Christian Paulsen
  • Publication number: 20120322401
    Abstract: A wireless, mobile device application for a first-responder to create incident reports for events such as an automobile accident or natural disaster, including digital photographs, audio recordings, notes, ordinance numbers, number of individuals affected, type of incident, real time global positioning satellite coordinates (GPS), and any other relevant information that a first-responder deems necessary. Such a report may then be transmitted to a server for access by emergency personnel, E-911 dispatchers, or any others with access to a secure server hosted on the World Wide Web.
    Type: Application
    Filed: June 20, 2011
    Publication date: December 20, 2012
    Inventor: Lee Collins
  • Publication number: 20120323932
    Abstract: A set expansion system is described herein that uses general-purpose web data to expand a set of seed entities. The system includes a simple yet effective quality metric to measure the expanded set, and includes two iterative thresholding processes to rank candidate entities. The system models web data sources and integrates relevance and coherence measurements to evaluate potential set candidates using an iterative process. The system uses general-purpose web data that is not specific to the given seeds. The system defines quality of the result set as the sum of two component scores: the relevance of a set of entities that measures their similarity with the given seeds, and the coherence of the set of entities produced which is how closely the entities in the set are related to each other. Based on this quality measure, the system develops a class of iterative set expansion processes.
    Type: Application
    Filed: June 20, 2011
    Publication date: December 20, 2012
    Applicant: Microsoft Corporation
    Inventors: Dong Xin, Yeye He, Tao Cheng
  • Publication number: 20120323928
    Abstract: A system and method for automatic generating suggestions for personalized reactions or messages. A suggestion generation module includes a plurality of collector modules, a credentials module, a suggestion analyzer module, a user interface module and a decision tree. The plurality of collector modules are coupled to respective systems to collect information accessible by the user and important to the user from other systems such as e-mail systems, SMS/MMS systems, micro blogging systems, social networks or other systems. The information from these collector modules is provided to the suggestion analyzer module. The suggestion analyzer module cooperates with the user interface module and the decision tree to generate suggested reactions or messages for the user to send. The suggested reactions or messages are presented by the user interface module to the user.
    Type: Application
    Filed: June 17, 2011
    Publication date: December 20, 2012
    Applicant: Google Inc.
    Inventor: Ashish Bhatia
  • Publication number: 20120317100
    Abstract: A reference string set including a group of strings is set. At least two specific tuples of substring triples is found inside the reference string set of strings. Each tuple is considered as a candidate for representing a related concept. Each concatenation of the substrings triples is an explicit member of the reference string set. Each middle substring of the substring triples is unequal to another middle substring within the substring triples found inside the reference string set. Each prefix substring is equal to all other prefix substrings within the substring triples found inside the reference string set. Each suffix substring is equal to all other suffix substrings within the substring triples found inside the reference string set. Either the prefix substring or the suffix substring is not empty.
    Type: Application
    Filed: August 24, 2012
    Publication date: December 13, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Andreas Arning, Roland Seiffert
  • Publication number: 20120317085
    Abstract: Systems and methods are provided for cataloging content metadata from a variety of sources and providing metadata to client devices. A processing device receives inconsistent data records representative of a common content element, with different values for a metadata field descriptive of a common attribute of the content element. The processor assign confidence scores metadata fields from each data record, and use these confidence scores to select the metadata that is transmitted to the client device.
    Type: Application
    Filed: February 24, 2012
    Publication date: December 13, 2012
    Applicant: United Video Properties, Inc.
    Inventors: Benjamin Green, Alex Helsinger, Michael Papish
  • Publication number: 20120310950
    Abstract: In case plural pieces of data are analyzed, parts of these pieces of data including a difference which should be compared and analyzed with priority are analyzed exhaustively, with suppressing a cost of analyzing.
    Type: Application
    Filed: December 15, 2010
    Publication date: December 6, 2012
    Applicant: NEC CORPORATION
    Inventors: Kai Ishikawa, Shinichi Ando, Akihiro Tamura
  • Publication number: 20120310957
    Abstract: In certain embodiments, a parser parses a formula to yield one or more functions, at least one function comprising a dependent value of a dependent object. One or more macro handlers configured to execute the functions are determined. At least one macro handler is instructed to register with one or more dominant objects on behalf of the dependent object, where the dominant objects are used to evaluate the dependent value.
    Type: Application
    Filed: May 31, 2011
    Publication date: December 6, 2012
    Applicant: Computer Associates Think, Inc.
    Inventor: Tad A. Deffler
  • Publication number: 20120290597
    Abstract: Near-duplicate documents may be identified by (a) accepting a set of documents, (b) processing the set of documents to determine a first set of near-duplicate documents using a first document similarity technique, and (c) processing the first set of near duplicate documents to determine a second set of near-duplicate documents using a second document similarity technique. The first document similarity technique might be token order dependent, and the second document similarity technique might be order independent. The first document similarity technique might be token frequency independent, and the second document similarity technique might be frequency dependent.
    Type: Application
    Filed: September 2, 2011
    Publication date: November 15, 2012
    Applicant: Google Inc.
    Inventor: Monika H. Henzinger
  • Publication number: 20120290594
    Abstract: A computer-implemented system and methods are provided to allow for the remote/wireless event/performance data aggregation, monitoring, and feedback to generate real time performance metric data for participating individuals of an event/performance that can be used to provide various feedback to the participating individuals regarding the participating individuals' efforts during an event/performance. In an illustrative implementation, exemplary server computer environment is operable to electronically/wireless communicate with one or more sensor/data aggregator mechanisms capable of aggregating one or more desired data inputs and to a display device operable to display exemplary generated event performance metric data.
    Type: Application
    Filed: May 11, 2012
    Publication date: November 15, 2012
    Applicant: Ciright Systems, Inc.
    Inventor: Joseph M. Callahan
  • Publication number: 20120265759
    Abstract: A computer-implemented method for processing electronic documents having different native file formats is provided. The method is implemented in a computer system comprising one or more processors configured to execute one or more computer program modules. The method includes (a) receiving electronic documents in different native file formats; (b) identifying the native file format for each received electronic document; (c) retrieving a stored configuration data for the identified native file format, the configuration data includes a mapping of regions of interest in the electronic document with the identified native file format and their associations with output fields; and (d) processing the electronic documents using their retrieved configuration data to extract data from the electronic documents.
    Type: Application
    Filed: April 15, 2011
    Publication date: October 18, 2012
    Applicant: XEROX CORPORATION
    Inventors: John E. BERGERON, John Allott Moore
  • Publication number: 20120254203
    Abstract: System and method for conducting a forensic analysis of electronic data having files and information indicative of a location of each of the files. The system has processors and a controller. The controller is configured to characterize the electronic data based, at least in part, on the files and the information indicative of the location of each of the files to obtain a characterization and distribute segments of the electronic data to the processors based, at least in part, on the characterization, each of the processors corresponding to at least one of the segments and each of the segments corresponding to at least one of the processors. Each one of the processors is configured to process each corresponding one of the segments to identify at least one characteristic of each corresponding one of the segments.
    Type: Application
    Filed: March 31, 2011
    Publication date: October 4, 2012
    Inventors: Jon Stewart, Geoffrey N. Black
  • Publication number: 20120254211
    Abstract: The present invention provides a method and an apparatus for mode matching. The method for mode matching includes: reading the mode matching string, where the mode matching string includes at least one logical matching field that is used to match the logical relationship; reading the target character string to be matched, where the target character string to be matched includes logical relationship information; using the logical matching field in the mode matching string to match the logical relationship in the target character string to be matched. The technical solution of the present invention improves the capability for matching the text information that carries the logical relationship information.
    Type: Application
    Filed: March 30, 2012
    Publication date: October 4, 2012
    Applicant: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Erez BUCHNIK, Jingzhong QIU
  • Publication number: 20120239665
    Abstract: Described are a reputation analysis device, reputation analysis method, and reputation analysis-use program capable of suitably analyzing temporal changes in reputation for an object indicated by a keyword. The disclosed reputation analysis device is provided with a voluntary activity description extraction means for extracting descriptions representing voluntary activity related to an object indicated by a keyword that has been input from within a plurality of documents; and a reputation chronological data estimation means for counting the number of occurrences of voluntary activity at each time point wherein the voluntary activity expressed by a description representing the voluntary activity related to the object has been performed, and estimating reputation chronological data for chronologically representing evaluations for the object by the agents of the voluntary activity.
    Type: Application
    Filed: November 15, 2010
    Publication date: September 20, 2012
    Applicant: NEC CORPORATION
    Inventors: Yuzuru Okajima, Shinichi Ando, Satoshi Nakazawa