Data Mining Patents (Class 707/776)
  • Publication number: 20110202562
    Abstract: Systems and methods are provided for data mining in the context of an interactive video. During the presentation of an interactive video, a user may interact with the interactive video by, e.g., making selections, choosing options, etc. related to one or more aspects of the interactive video. Such events and details regarding the events may be recorded, stored, and analyzed in the context of one or more campaigns associated with the interactive video, such as marketing campaigns, advertising campaigns, interactive examinations, etc. Once the details regarding the events have been stored, reports may be extracted based upon the details detailing any desired information relevant to the one or more campaigns.
    Type: Application
    Filed: February 24, 2011
    Publication date: August 18, 2011
    Inventors: Jonothan Bloch, Barak Feldman, Tal Zubalsky, Kfir Y. Rotbard
  • Publication number: 20110202561
    Abstract: The invention discloses a system for adjusting an automated valuation model (AVM) value. The system includes a property data source for receiving property data for a property, a data mining module for searching the property data for keywords with corresponding values, and a data matching module for recognizing the keywords, for determining an adjustment value based on the corresponding values, for receiving an AVM value representing an estimated value of the property, and for obtaining an adjusted AVM value based on the AVM value and the adjustment value.
    Type: Application
    Filed: November 12, 2010
    Publication date: August 18, 2011
    Inventors: Benjamin C. Graboske, Robert L. Walker
  • Patent number: 8001329
    Abstract: A system and method for partitioning a data stream into tokens includes steps or acts of: receiving the data stream; setting a partition scanner to a beginning point in the data stream; identifying likely token boundaries in the data stream using the partition scanner; partitioning the data stream according to the likely token boundaries as determined by the partition scanner, wherein each partition of the partitioned data stream bounded by the likely token boundaries comprises a chunk; and passing the chunk to a next available token scanner, one chunk per token scanner, for identifying at least one actual token within each chunk.
    Type: Grant
    Filed: May 19, 2008
    Date of Patent: August 16, 2011
    Assignee: International Business Machines Corporation
    Inventor: Christoph von Praun
  • Patent number: 8001144
    Abstract: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern.
    Type: Grant
    Filed: March 26, 2008
    Date of Patent: August 16, 2011
    Assignee: International Business Machines Corporation
    Inventor: Jasmine Novak
  • Publication number: 20110196895
    Abstract: Behavior-based associations, such as item-to-item or query-to-item associations, are extrapolated to other items to create new associations. The items to which the associations are extrapolated may be “behavior deficient” items, or items for which the quantity of collected user activity data is insufficient to create meaningful or reliable behavior-based associations. The behavior-based associations are extrapolated based on content-based associations, or another type of “substitutability” association, between items. The items can be any type of item (e.g., products, web sites, documents, etc.) for which user behaviors (e.g., purchases, accesses, downloads, etc.) can be monitored and analyzed to detect behavior-based associations, and for which item content or other available information can be used to assess item substitutability.
    Type: Application
    Filed: April 22, 2011
    Publication date: August 11, 2011
    Inventor: Jin Y. Yi
  • Publication number: 20110196880
    Abstract: A system has a processing pipeline with a plurality of processing stages, where each of the processing stages has one or plural processors, and where the processing stages are individually and independently scalable. A first of the processing stages of the processing pipeline provides a received date update into an update data structure, where the update data structure is accessible to process a query received by the system. One or more additional of the processing stages transforms the update data structure to allow for merging of the transformed update data structure into a database, where the transformed update data structure is accessible to process the query. Content of the transformed update data structure is stored into the database.
    Type: Application
    Filed: February 11, 2010
    Publication date: August 11, 2011
    Inventors: CRAIG A.N. SOULES, Kimberly Keeton, Charles B. Morrey, III, Alistair Veitch
  • Patent number: 7996424
    Abstract: Embodiments of the present invention relate to systems and methods for optimizing and reducing the memory requirements of state machine algorithms in pattern matching applications. Memory requirements of an Aho-Corasick algorithm are reduced in an intrusion detection system by representing the state table as three separate data structures. Memory requirements of an Aho-Corasick algorithm are also reduced by applying a banded-row sparse matrix technique to the state transition table of the state table. The pattern matching performance of the intrusion detection system is improved by performing a case insensitive search, where the characters of the test sequence are converted to uppercase as the characters are read. Testing reveals that state transition tables with sixteen bit elements outperform state transition tables with thirty-two bit elements and do not reduce the functionality of intrusion detection system using the Aho-Corasick algorithm.
    Type: Grant
    Filed: January 31, 2008
    Date of Patent: August 9, 2011
    Assignee: Sourcefire, Inc.
    Inventors: Marc A. Norton, Daniel J. Roelker
  • Patent number: 7996403
    Abstract: A method and system for performing a search request for a name among a database including a plurality of names. In one implementation, the method includes receiving the search request on the name, determining a geographic location associated with the name, assigning a cultural classification to the name based on the geographic location associated with the name, and completing the search request by searching for the name among the plurality of names within the database based on the cultural classification assigned to the name.
    Type: Grant
    Filed: September 27, 2007
    Date of Patent: August 9, 2011
    Assignee: International Business Machines Corporation
    Inventors: Anna Khasin, Frankie Elizabeth Patman Maguire, Leonard Arthur Shaefer, Jr., Stephen John Watjen, Charles Kinston Williams
  • Patent number: 7996356
    Abstract: A disclosed process accesses text data that is to be mined. The text data includes text snippets. Rules are encoded in a rule base. A search request is submitted to a search request handler. A search request handler applies the rules from the rule base to the text and associates different labels to respective text snippets in the text data in accordance with the rule base.
    Type: Grant
    Filed: March 24, 2005
    Date of Patent: August 9, 2011
    Assignee: Xerox Corporation
    Inventor: Nathaniel G. Martin
  • Publication number: 20110191372
    Abstract: A computer-based method for generating intelligence from social media data, such as blog data, that is publicly available on the Internet. A server is provided that runs a tribe analysis tool, and the method includes accessing a set of the social media data with the tribe analysis tool. The social media data is associated with a plurality of network users or authors. The method continues with operating the tribe analysis tool to identify members of a tribe from the authors by processing the set of social media data to determine the authors having associated portions of the social media data that satisfies tribe membership criteria. Common interests for the identified members of the tribe are determined by processing the social media data associated with the tribe authors. A report is generated for the tribe that includes information related to the set of common interests and additional generated tribe-based intelligence.
    Type: Application
    Filed: January 26, 2011
    Publication date: August 4, 2011
    Inventors: Howard KAUSHANSKY, Ted V. Kremer, Nicolas Nicolov, William A. Tuohig, Richard Hansen Wolniewicz
  • Publication number: 20110191373
    Abstract: Event data (e.g., log messages) are represented as sets of attribute/value pairs. An index maps each attribute/value pair or attribute/value tuple to a pointer that points to event data which contains the attribute/value pair or attribute/value tuple. An attribute co-occurrence map or matrix can be generated that includes attribute names that co-occur together. Queries and custom reports can be generated by projecting event data into one or more attributes or attribute/value pairs, and then determining statistics on other attributes using a combination of the inverted index, the attribute co-occurrence map or matrix, operations on sets and/or math and statistical functions.
    Type: Application
    Filed: April 11, 2011
    Publication date: August 4, 2011
    Applicant: LOGLOGIC, INC.
    Inventors: Sherif Botros, Jian L. Zhen, Minjun Liu, Boris Galitsky
  • Patent number: 7991754
    Abstract: Computer configurations, search processors (2), software, and methods of viewing and analyzing information regarding agriculture or land use automatically located relationally-linked agronomic entities with both real (18) and virtual (8) displays. Relational linking exist through broad assessment of commonality information with fuzzy logic heuristics. Dynamic link presentation (6) can exist with congregated and hierarchical information displays (29) such as at the farm level, at a location level, at a physically aggregated parcel level with hierarchical display of farms or agronomic entity ownership, management, organization, and crop usages that afford users an unprecedented series of views into the businesses of land use, food production, and resource conservation. A meta-syntactic agronomic information generator (31) can facilitate imputed information through the integration of multiple databases (32).
    Type: Grant
    Filed: December 5, 2006
    Date of Patent: August 2, 2011
    Assignee: OneImage, LLC
    Inventors: Margaret Stewart Maizel, William L. Thoen
  • Publication number: 20110184983
    Abstract: Methods and devices for use in gathering and analyzing data from a corpus of documents. A corpus of documents is initially scanned for words that qualify as entities according to user defined criteria. Multiple counters track the number of documents which mention specific entities. A database of entities mentioned in the documents is maintained and an entry for each entity in the corpus is placed in the entity database. The results are then presented to a user in a spiral form with the most important entity at the center of the spiral. The importance of an entity may be determined by either how many entities it is connected to or how many documents mention that entity. A connection exists between two entities if they are both mentioned in at least one document and the more documents mention two specific entities at the same time, the stronger the connection between those two specific entities.
    Type: Application
    Filed: January 27, 2011
    Publication date: July 28, 2011
    Applicants: OF THE DEPARTMENT OF NATIONAL DEFENCE
    Inventors: Peter J. KWANTES, Philip G. TER HAAR
  • Publication number: 20110184982
    Abstract: The present invention discloses a computer system for reporting online sessions and a computer enabled method utilizing the same. The computer system is made up of an icon that preferably appears on a user screen. The icon is capable of capturing a screen session on the user screen and saving it within a recording. The recording may then be communicated to a database server that is capable of extracting a plurality of target components from said recording, and is capable of storing them in a database. The database may contain a benchmark content of the plurality of target components. Target components may then be compared against the benchmark content in a variety of ways to determine whether the level of target components is above or below reasonable and socially accepted levels.
    Type: Application
    Filed: January 25, 2010
    Publication date: July 28, 2011
    Inventors: Glenn Adamousky, Dennis Nagy
  • Publication number: 20110172874
    Abstract: A vehicle fault diagnosis and prognosis system includes a computing platform configured to receive a classifier from a remote server, the computing platform tangibly embodying computer-executable instructions for evaluating data sequences received from a vehicle control network and applying the classifier to the data sequences, wherein the classifier is configured to determine if the data sequences define a pattern that is associated with a particular fault.
    Type: Application
    Filed: January 13, 2010
    Publication date: July 14, 2011
    Applicant: GM GLOBAL TECHNOLOGY OPERATIONS, INV.
    Inventors: Debprakash Patnaik, Pulak Bandyopadhyay, Steven W. Holland, Kootaala P. Unnikrishnan, George Paul Montgomery, JR.
  • Publication number: 20110173232
    Abstract: Methods and apparatuses for searching network data for one or more predetermined strings are disclosed. In one embodiment, the string search is a multi-stage search where the stages of the search are performed by different hardware components. In one embodiment in a first search stage, a first processor performs a comparison of blocks of incoming data to determine whether the blocks potentially represent the beginning of one of the predetermined strings. If a potential predetermined string is identified, a second processor performs a further search to determine whether the string matches one of the predetermined strings. Because the first processor searches only for the beginning of the predetermined strings, the first stage comparison can be performed quickly, which improves network performance as compared to more detailed searching. The second stage is performed by second processor, which allows the first processor to search for potential matching strings.
    Type: Application
    Filed: March 7, 2011
    Publication date: July 14, 2011
    Applicant: INTEL CORPORATION
    Inventor: Boris Beylin
  • Patent number: 7979446
    Abstract: A method of: submitting reference sequences to a taxonomic database to produce taxonomic results; and reporting a taxonomic identification based on the taxonomic results. The reference sequences are the output of genetic database queries that return a score for each reference sequence. A method for processing a biological sequence obtained from an assay by: converting base calls located in a predetermined list of positions within the biological sequence to N; and determining the ratio of single nucleotide polymorphisms in the biological sequence relative to a reference sequence. Each entry in the predetermined list of positions represents the capability of a substance hybridizing to a microarray used to generate the biological sequence. The substance is not the nucleic acid of a target pathogen.
    Type: Grant
    Filed: November 12, 2009
    Date of Patent: July 12, 2011
    Assignee: The United States of America as represented by the Secretary of the Navy
    Inventors: Anthony P. Malanoski, Baochuan Lin, Joel M Schnur, David A Stenger
  • Patent number: 7979373
    Abstract: A system for analyzing the risks of adverse effect resulting from the use of a drug comprises a selector for identifying at least one drug, a profiler for selecting from multiple profiles related to the safety of the drug, using at least one filter; at least one data mining engine; and an output device for displaying the analytic results from the data mining engine. Preferably, the at least one data mining engine is selected from (1) a proportional analysis engine to assess deviations in a set of reactions to the drug; (2) a comparator to measure the reactions to the drug against a user-defined backdrop, and (3) a correlator to look for correlated signal characteristics in drug/reaction/demographic information; and an output device whereby a user can receive analytic.
    Type: Grant
    Filed: May 22, 2009
    Date of Patent: July 12, 2011
    Assignee: DrugLogic, Inc.
    Inventor: Victor V. Gogolak
  • Patent number: 7979362
    Abstract: An interactive data mining system (100, 3000) that is suitable for data mining large high dimensional (e.g., 200 dimension) data sets is provided. The system graphically presents rules in a context allowing users to readily gain an intuitive appreciation of the significance of important attributes (data fields) in the data. The system (100, 3000) uses metrics to quantify the importance of the various data attributes, data values, attribute/value pairs, ranks them according to the metrics and displays histograms and lists of attributes and values in order according to the metric, thereby allowing the user to rapidly find the most interesting aspects of the data. The system explores the impact of user defined constraints and presents histograms and rule cubes including superposed and interleaved rule cubes showing the effect of the constraints.
    Type: Grant
    Filed: August 10, 2007
    Date of Patent: July 12, 2011
    Assignee: Motorola Solutions, Inc.
    Inventors: Kaidi Zhao, Jeffrey G. Benkler, Weimin Xiao, Bing Liu
  • Publication number: 20110167077
    Abstract: User locality information can be used to improve various aspects of search results pages. Queries can be suggested based on the user location while excluding common query suggestions that involve an unrelated geographic entity. Deeplinks can also be modified to include location based suggestions. Additionally, results for specialized searches such as travel searches can be improved by employing user locality information.
    Type: Application
    Filed: January 7, 2010
    Publication date: July 7, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Tabreez Govani, Nikhil Dandekar, Gheorghe Muresan
  • Patent number: 7974984
    Abstract: A system and method may include retrieving a first taxonomy comprising at least one first category and one or more second taxonomies, at least one second category being associated with at least one of the one or more second taxonomies. The system and method may further include creating a new taxonomy by merging the first taxonomy with the second taxonomy based on a comparison of a first category profile of the at least one first category with a second category profile of the at least one second category.
    Type: Grant
    Filed: April 19, 2007
    Date of Patent: July 5, 2011
    Assignee: Mobile Content Networks, Inc.
    Inventor: Phyllis Reuther
  • Publication number: 20110161116
    Abstract: Pursuant to some embodiments, insurance systems, methods and devices are provided which include a data storage device for storing, updating and providing access to loss risk score data, a computer processor for executing program instructions and for retrieving the loss risk score data from the data storage device, a memory, coupled to the computer processor, for storing program instructions for execution by the computer processor, a geocoding engine comprising program instructions stored in the memory for geocoding historical loss data and a plurality of loss risk factors, a scoring engine comprising program instructions stored in the memory for calculating a loss risk score for each of a plurality of geographical locations based on said historical loss data and said plurality of loss risk factors, and a communication device, coupled to the computer processor, to output loss risk score data based on geographical location.
    Type: Application
    Filed: April 5, 2010
    Publication date: June 30, 2011
    Inventors: David F. Peak, Andrew J. Amigo, Richard M. Borden, Keven J. Busque, Eugene J. Walters
  • Publication number: 20110161368
    Abstract: A text mining apparatus, a text mining method, and a program are provided that accurately discriminate inherent portions of each of a plurality of text data pieces including a text data piece generated by computer processing. A text mining apparatus 1 to be used performs text mining using, as targets, a plurality of text data pieces including a text data piece generated by computer processing. Confidence is set for each of the text data pieces. The text mining apparatus 1 includes an inherent portion extraction unit 6 that extracts an inherent portion of each text data piece relative to another of the text data pieces, using the confidence set for each of the text data pieces.
    Type: Application
    Filed: August 28, 2009
    Publication date: June 30, 2011
    Inventors: Kai Ishikawa, Akihiro Tamura, Shinichi Ando
  • Publication number: 20110161367
    Abstract: A text mining apparatus, a text mining method, and a program are provided that enable the influence that computer processing errors have on mining results to be reduced during text mining performed on a plurality of text data pieces including a text data piece generated by computer processing. A text mining apparatus 1 to be used includes an inherent portion extraction unit 6 that, for each of a plurality of text data pieces including a text data piece generated by computer processing, extracts an inherent portion of the text data piece relative to another of the text data pieces, an inherent confidence setting unit 7 that, for each inherent portion of each of the text data pieces, sets inherent confidence indicating confidence of the inherent portion, using the confidence that has been set for each of the text data pieces, and a mining processing unit 8 that performs text mining on each inherent portion of each of the text data pieces, using the inherent confidence.
    Type: Application
    Filed: August 28, 2009
    Publication date: June 30, 2011
    Applicant: NEC CORPORATION
    Inventors: Kai Ishikawa, Akihiro Tamura, Shinichi Ando
  • Patent number: 7970785
    Abstract: Sources of operational problems in business transactions often show themselves in relatively small pockets of data, which are called trouble hot spots. Identifying these hot spots from internal company transaction data is generally a fundamental step in the problem's resolution, but this analysis process is greatly complicated by huge numbers of transactions and large numbers of transaction variables to analyze. A suite of practical modifications are provided to data mining techniques and logistic regressions to tailor them for finding trouble hot spots. This approach thus allows the use of efficient automated data mining tools to quickly screen large numbers of candidate variables for their ability to characterize hot spots. One application is the screening of variables which distinguish a suspected hot spot from a reference set.
    Type: Grant
    Filed: October 15, 2008
    Date of Patent: June 28, 2011
    Assignee: Verizon Services Corp.
    Inventor: James Howard Drew
  • Publication number: 20110153665
    Abstract: Provided are an apparatus for providing a social network service using the relationship of ontology and a method thereof. The apparatus includes: an ontology storage unit storing social ontology defining relationship information between a user and a social network subscriber, service ontology defining position and relationship information of services, and tag ontology defining tag information related to information included in the social ontology and the service ontology; when a service request is inputted from the user, an ontology analysis unit retrieving a tag corresponding to the user's current position and the service request factor by using the relationship of the ontologies stored in the ontology storage unit; a service processing unit extracting the corresponding service on the basis of the retrieved tag information; and a service providing unit providing the user with the extracted service.
    Type: Application
    Filed: December 17, 2010
    Publication date: June 23, 2011
    Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventor: Jung-Tae KIM
  • Publication number: 20110154054
    Abstract: The invention relates to a computer implemented method for generating a pseudonym for a user comprising entering a user-selected secret, storing the user-selected secret in memory, computing a private key by applying an embedding and randomizing function onto the secret, storing the private key in the memory, computing a public key using the private key, the public key and the private key forming an asymmetric cryptographic key, erasing the secret and the private key from the memory, and outputting the public key for providing the pseudonym
    Type: Application
    Filed: January 20, 2010
    Publication date: June 23, 2011
    Applicant: CompuGROUP Holding AG
    Inventors: Adrian Spalka, Jan Lehnhardt
  • Publication number: 20110153663
    Abstract: Systems and methods to provide a recommendation engine that uses implicit feedback observations are provided. A particular method includes receiving accessing data comprising a plurality of implicit feedback observations for a plurality of users. The plurality of users includes a first user that requested a recommendation. Each implicit feedback observation is associated with a particular user and a particular item of a plurality of items. The method includes determining a plurality of preference ratings and a plurality of confidence ratings for each user of the plurality of users for each item based on the plurality of implicit feedback observations. The method includes generating a recommendation list of one or more of the plurality of items for the first user based on the plurality of preference ratings and the plurality of confidence ratings.
    Type: Application
    Filed: December 21, 2009
    Publication date: June 23, 2011
    Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Yehuda Koren, Yifan Hu, Christopher T. Volinsky
  • Publication number: 20110153664
    Abstract: Computerized methods, data processing systems, and computer program products for storing of data mining models (DMMs) are provided. A new DMM is created having at least one of the following characteristics: quality and complexity. The new DMM is handled as a candidate for storing in a storage device if a predefined criterion for the characteristics is met. The sum of the sizes of the new DMM and already stored DMMs is determined In response to the sum falling below a storage limit, the new DMM is stored in the storage device. In response to the sum exceeding the storage limit, a decision is taken based on priorities of the DMMs which DMMs to store in the storage device. The priorities depend at least on access frequencies of the DMMs. Upon a data mining request, a corresponding DMM is determined and a user is requested to confirm that data mining is to proceed if quality of the determined DMM does not fulfill a further predefined criterion.
    Type: Application
    Filed: November 22, 2010
    Publication date: June 23, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alexander Lang, Bernhard Mitschang, Ruben Pulido de los Reyes, Christoph Sieb, Michael Wurst
  • Publication number: 20110145285
    Abstract: A method for intent mining is provided. The method includes performing a preliminary search of a constrained source using one or more seed phrases to generate multiple preliminary search results representing different ways of expressing a desired intent. The method also includes identifying each of the plurality of preliminary search results that have expressed the desired intent to generate a plurality of intent results. The method also includes producing multiple action search strings around one or more action verbs in each of the multiple intent results. The method further includes applying each of the multiple action search strings on one or more non-constrained sources to generate multiple action search results.
    Type: Application
    Filed: December 15, 2009
    Publication date: June 16, 2011
    Applicant: GENERAL ELECTRIC COMPANY
    Inventors: Steven Matt Gustafson, David Brian Bracewell
  • Publication number: 20110144795
    Abstract: Systems and methods for managing machine tools are provided. When a current abnormality occurs in one of at least one machine tool, a specific failure category is determined according to the current abnormality, and at least one suggested combination of parameters is generated according to the specific failure category and a transaction database, wherein the specific failure category is one of a plurality of predefined failure categories, and each suggested combination of parameters includes a plurality of associated parameters, which are commonly retrieved for the specific failure category. Each transaction data in the transaction database records a plurality of parameters corresponding to a failure category, wherein the parameters are the parameters whose parameter values are retrieved from the at least one machine tool, having the abnormality according to the failure category.
    Type: Application
    Filed: June 30, 2010
    Publication date: June 16, 2011
    Inventors: Shin Yen LIU, ChunTai Yen, HsiaoWei Chen
  • Publication number: 20110144853
    Abstract: A system and method for reducing or eliminating built-in tests and diagnostic trouble codes that are set as a result of improper parameter values. The method includes collecting field failure data that identifies diagnostic trouble codes and parameters of the system that are used to set diagnostic trouble codes. The method transforms the collected data into a format more appropriate for human analysis and pre-processes the transferred data to identify and remove information that could bias the human analysis. The method includes plotting linear and nonlinear combinations of operation parameters, performing data mining and analysis for detecting inappropriate settings of fault codes in the pre-processed data and providing the mined data to a subject matter expert for review to determine whether a diagnostic trouble code has been issued because of improper parameters.
    Type: Application
    Filed: December 15, 2009
    Publication date: June 16, 2011
    Applicant: GM GLOBAL TECHNOLOGY OPERATIONS, INC.
    Inventors: Halasya Siva Subramania, Satnam Singh, Steven W. Holland, Jason T. Davis, Tim Felke, Ravindra Patankar, Aru Narla
  • Patent number: 7962511
    Abstract: A statistical patent rating method and system is provided for independently assessing the relative breadth (“B”), defensibility (“D”) and commercial relevance (“R”) of individual patent assets and other intangible intellectual property assets. The invention provides new and valuable information that can be used by patent valuation experts, investment advisors, economists and others to help guide future patent investment decisions, licensing programs, patent appraisals, tax valuations, transfer pricing, economic forecasting and planning, and even mediation and/or settlement of patent litigation lawsuits. In one embodiment the invention provides a statistically-based patent rating method and system whereby relative ratings or rankings are generated using a database of patent information by identifying and comparing various characteristics of each individual patent to a statistically determined distribution of the same characteristics within a given patent population.
    Type: Grant
    Filed: April 29, 2003
    Date of Patent: June 14, 2011
    Assignee: PatentRatings, LLC
    Inventor: Jonathan A. Barney
  • Publication number: 20110131244
    Abstract: Certain types of entities may be extracted from a document. In one example, the entities to be recognized are cultural entities, such as the names of movies, video games, books, etc. For each such entity, a concept graph may be built that shows the relationship between the entity itself and other entities, such as the relationship between a movie and the actor(s) who act in the movie. When a candidate entity name is detected in the document, the concept graph may be used to look for other entities that appear in the context of the candidate entity. The presence of related entities in the context of the candidate may be used to disambiguate the meaning of the candidate. For example, a common word like “up” might be recognized as the name of a movie if the names of actors or characters in that movie appear near the word “up”.
    Type: Application
    Filed: November 29, 2009
    Publication date: June 2, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Amir J. Padovitz, Matthew F. Hurst
  • Publication number: 20110131206
    Abstract: Methods and apparatus are provided for presenting search results with indication of relative position of search terms. According to one aspect of the invention, search results are displayed for a search query comprising a plurality of search terms. A search query is received, for example, from a user and at least one document satisfying the search query is obtained. The disclosed method determines a relative position of at least two of the search terms in the document and at least a portion of the document is presented with an indication of the relative position of the at least two search terms in the document. The relative position is indicated using a predefined character to indicate one or more intervening elements between the at least two search terms. A relevance ranking can optionally be presented that is based on the relative position of the at least two search terms.
    Type: Application
    Filed: November 30, 2009
    Publication date: June 2, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Peter K. Malkin, John C. Thomas, JR.
  • Publication number: 20110131207
    Abstract: A device includes a memory to store instructions; and a processor to execute the instructions to implement a data collector to collect text messages, a keyword extractor to extract keywords or key phrases from the collected text messages, and a user interface to present one or more of the extracted keywords or key phrases to a particular user. The device further includes a filter to filter the collected text messages based on one or more criteria, and a keyword ranker to rank the extracted keywords or key phrases based on one or more criteria.
    Type: Application
    Filed: December 1, 2009
    Publication date: June 2, 2011
    Applicant: SONY ERICSSON MOBILE COMMUNICATIONS AB
    Inventor: Hakan Lars Emanuel Jonsson
  • Publication number: 20110125794
    Abstract: The invention relates to a method which receives location information of a mobile terminal of a single user. One or more journeys are extracted from the location information of the single user. The corresponding journey data is stored in a journey database. From the journey data in the journey database, journey patterns for the single user are extracted. A journey pattern indicates at least the regularity of a particular journey in time, i.e. over a number of days. The journey patterns are stored in the pattern database. The journey patterns of the single user are matched with patterns of other users. If a match is found, at least one match based on the journey patterns is sent to the single user. These features enable the carpool service to find a match which takes into account the regularity across a period of days. By identifying the regularity, a better match can be made with users which travel the same route, as also the days on which the users travel are taken into account.
    Type: Application
    Filed: June 5, 2008
    Publication date: May 26, 2011
    Applicant: Telefonaktiebolaget LM Ericsson
    Inventor: Mathias H. M. HUTSCHEMAEKERS
  • Publication number: 20110125793
    Abstract: Some social networks provide message histories that record information about previous posts that users make to the social media network. From this information, a contact center determines trends in the usage of a social media network by a user. The contact center can mine the message history database for times, frequency of posts, location of the user during posts, and other information provided in the message histories. From this information or metadata about the messages, the contact center develops trends about the user's postings of messages on social media networks. The contact center can further receive subsequent posts and read metadata related to the subsequent posts. The new metadata can be used to modify the trends over time.
    Type: Application
    Filed: February 17, 2010
    Publication date: May 26, 2011
    Applicant: AVAYA INC.
    Inventors: George ERHART, David SKIBA, Valentine C. MATULA
  • Patent number: 7949676
    Abstract: Using an ontology to perform an information search utilizing a meaning given to information on a network without being required to perform complicated operations for forming an inquiry sentence in conformity with the ontology. A pre-stage before a search engine provides an information search supporting system having a morpheme analysis section, a syntactic and semantic analysis section, and a conversion execution section which converts a natural language sentence on which syntactic analysis and semantic analysis have been performed into an inquiry sentence described in an ontology description language by referring to a case frame ontology dictionary in which are stored information indicating to which property in an ontology does the relationship among a predicate, a subject and an object in the natural language sentence correspond and the case frame of the natural language sentence in the property.
    Type: Grant
    Filed: April 29, 2008
    Date of Patent: May 24, 2011
    Assignee: International Business Machines Corporation
    Inventors: Aya Mori, Hirobumi Toyoshima, Masami Tada
  • Patent number: 7945572
    Abstract: The present invention provides systems and methods for automatically mining massive intelligence databases to discover sequential patterns therein using a novel combination of forward and reverse temporal processing techniques as an enhancement to well known pattern discovery algorithms.
    Type: Grant
    Filed: March 21, 2008
    Date of Patent: May 17, 2011
    Assignee: The Johns Hopkins University
    Inventors: Brett D. Lapin, David W. Porter
  • Patent number: 7945583
    Abstract: A technique for the deployment of data mining algorithms on a web service, such as IBM's WebSphere Application Server, is disclosed. Rather than having to deploy the data mining models with the data, the data can be transported to the web server as part of a message. Models can be cached on the web server and easily changed by operations executed by the client. This allows for efficient administration of the operational environment. Because a web services environment is inherently scalable, servers can be transparently enabled based on demand. Further, with web services communication is via data objects in memory which allows for ease of implementation and operational efficiency.
    Type: Grant
    Filed: June 15, 2007
    Date of Patent: May 17, 2011
    Assignee: International Business Machines Corporation
    Inventors: Yan Moyaux, Charles J Schott, David A Selby, Vince P Thomas
  • Publication number: 20110113028
    Abstract: A targeted advertising system performs context-based association mining using a publicly available corpus to identify a product or brand name that, under a given context, is associated with a product or brand being marketed. The system analyzes documents within the publicly available corpus that are associated with the given context, and identifies products or brand names that have a high association to the product or brand being marketed. The system can also analyze the publicly available corpus to determine contextual information which is correlated to two or more products or brand names. This contextual information includes a set of terms that facilitates filtering the publicly available corpus into an optimal set of documents that has a high association to a desired market category or demographic.
    Type: Application
    Filed: November 12, 2009
    Publication date: May 12, 2011
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Jessica N. Staddon, Richard Chow, Philippe J.P. Golle, Lisa S. Purvis
  • Patent number: 7941440
    Abstract: A computer automated method of aggregating data includes the steps of inputting a set of user-defined instructions into a computer database system, inputting a user query into the computer database system, mining the computer database system for data relevant to the user query, creating a data set comprising said data relevant to the user query, and aggregating data in the data set using domain metrics selected based on any of predefined and configurable rules and past user usage.
    Type: Grant
    Filed: September 28, 2010
    Date of Patent: May 10, 2011
    Assignee: Semantifi, Inc.
    Inventor: Sreenivasa R Pragada
  • Publication number: 20110106849
    Abstract: A new case whose type is the same as that of a case about information desired to be extracted can be generated with high accuracy.
    Type: Application
    Filed: March 9, 2009
    Publication date: May 5, 2011
    Applicant: NEC Corporation
    Inventors: Takao Kawai, Shinichi Ando
  • Publication number: 20110106927
    Abstract: System and method for implementing cloud mitigation and operations controllers are described. One embodiment is a system for controlling operation of a cloud computing environment, wherein the system comprises a repository for storing data regarding characteristics of the cloud computing environment, wherein the stored data includes policy notations designating compliance or noncompliance of the data with policy; an analyst module for analyzing the stored data in combination with external report information regarding the cloud computing environment and for providing results of the analysis; and a controller for evaluating the analysis results and issuing instructions for controlling operation of the cloud computing environment based on the evaluating.
    Type: Application
    Filed: November 5, 2009
    Publication date: May 5, 2011
    Applicant: Novell, Inc.
    Inventors: Stephen R. Carter, Lloyd Leon Burch, Carolyn Bennion McClain, Dale Robert Olds
  • Publication number: 20110093501
    Abstract: A computer automated method of presenting data. The method includes the steps of inputting a set of user-defined instructions into a computer database system, inputting a user query into said computer database system, mining the computer database system for data relevant to said user query, creating a data set comprising the data relevant to the user query, and selecting at least one presentation report for compiling the data, wherein the selection is based on any of predefined and configurable rules and past user usage. At least one presentation report is then displayed to the user, wherein the displaying process further includes the step of graphically arranging the at least one presentation report based on an available viewing area of a device accessing the at least one presentation report.
    Type: Application
    Filed: December 22, 2010
    Publication date: April 21, 2011
    Applicant: EXECUE, INC.
    Inventors: Sreenivasa R. Pragada, Viswanath Dasari
  • Publication number: 20110093293
    Abstract: The invention provides a method and clinical data mining system for enabling a user to derive knowledge from data corresponding to a plurality of electronic health records stored in a repository. One or more data elements are provided as an input. The data elements may include textual reports, images, and one or more criteria specified by the user. Information is extracted from one or more images associated with one or more electronic health records stored in the repository, based on the data elements. Further, information is extracted from one or more textual reports and structured data associated with the one or more electronic health records. Thereafter, one or more reports are generated based on the extracted information to enable the user to analyze the information. Subsequently, the user may derive knowledge from the data based on the analysis.
    Type: Application
    Filed: June 14, 2010
    Publication date: April 21, 2011
    Applicant: INFOSYS TECHNOLOGIES LIMITED
    Inventors: Harikrishna Rai G. N., Ashish Sureka, Sivaram V. Thangam, Pranav Prabhakar Mirajkar, K. Sai Deepak
  • Patent number: 7930277
    Abstract: Cost-based optimizer functionality for an XML database repository provides means for optimizing the execution of database queries that access XML resources in the database repository. Statistics about XML resources that are stored in the database repository are gathered, stored and utilized by a query optimizer to compute computational costs associated with each of multiple methods of accessing particular XML resources requested in a database query. Hence, the optimizer is able to select the most efficient query execution plan based on the costs of possible access paths. In one embodiment, specific statistics about the hierarchical structure of XML resources stored in the XML database repository are gathered, stored in a relational table in the database management system, and used to compute the selectivity of query predicates and the index cost associated with traversing one or more indexes to access requested XML resources.
    Type: Grant
    Filed: April 21, 2004
    Date of Patent: April 19, 2011
    Assignee: Oracle International Corporation
    Inventors: Fei Ge, Sivasankaran Chandrasekar, Nipun Agarwal, Ravi Murthy, Eric Sedlar
  • Patent number: 7930197
    Abstract: Personal data mining mechanisms and methods are employed to identify relevant information that otherwise would likely remain undiscovered. Users supply personal data that can be analyzed in conjunction with data associated with a plurality of other users to provide useful information that can improve business operations and/or quality of life. Personal data can be mined alone or in conjunction with third party data to identify correlations amongst the data and associated users. Applications or services can interact with such data and present it to users in a myriad of manners, for instance as notifications of opportunities.
    Type: Grant
    Filed: September 28, 2006
    Date of Patent: April 19, 2011
    Assignee: Microsoft Corporation
    Inventors: Raymond E. Ozzie, William H. Gates, III, Gary W. Flake, Thomas F. Bergstraesser, Arnold N. Blinn, Christopher W. Brumme, Lili Cheng, Michael Connolly, Nishant V. Dani, Dane A. Glasgow, Daniel S. Glasser, Alexander G. Gounares, James R. Larus, Matthew B. MacLaurin, Henricus Johannes Maria Meijer, Debi P. Mishra, Amit Mital, Ira L. Snyder, Jr., Chandramohan A. Thekkath, David R. Treadwell, III, Melora Zaner-Godsey
  • Publication number: 20110087700
    Abstract: An event is described herein as being representable by a quantified abstraction of the event. The event includes at least one predicate, and the at least one predicate has at least one constant symbol corresponding thereto. An instance of the constant symbol corresponding to the event is identified, and the instance of the constant symbol is replaced by a free variable to obtain an abstracted predicate. Thus, a quantified abstraction of the event is composed as a pair: the abstracted predicate and a mapping between the free variable and an instance of the constant symbol that corresponds to the predicate. A data mining algorithm is executed over abstracted, quantified events to ascertain a correlation between the event and another event.
    Type: Application
    Filed: October 14, 2009
    Publication date: April 14, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: David Lo, Ganesan Ramalingam, Venkatesh-Prasad Ranganath, Kapil Vaswani