Data Mining Patents (Class 707/776)
  • Patent number: 8103646
    Abstract: An automated mechanism of automatically tagging media files such as podcasts, blog entries, and videos, for example, with meaningful taxonomy tags. The mechanism provides active (or automated) assistance in assigning appropriate tags to a particular piece of content (or media). Included is a system for automatic tagging of audio streams on the Internet, whether from audio files, or from the audio tracks of audio/video files, using the folksonomy of the Internet. The audio streams may be provided by the media author. For example, the author can make a recording to be posted on a website, and use the system to automatically suggest (via prompted author interaction) folksonomically appropriate tags for the media recording. Alternatively, the system can be used in an automated fashion to develop and assign without any intervention by the author.
    Type: Grant
    Filed: March 13, 2007
    Date of Patent: January 24, 2012
    Assignee: Microsoft Corporation
    Inventor: Robert I. Brown
  • Patent number: 8099429
    Abstract: Systems and methods that integrate user assigned association among a plurality of resources or entities. The subject innovation employs an association component that relates such resources or entities, based on aggregate of user notions that are assigned for relationships; and/or based on how users perceive existence of relationships among such resources. Accordingly, resources can be related (e.g., linked, matched, tagged and the like) based on relevance of collective user behavior during tagging.
    Type: Grant
    Filed: December 11, 2006
    Date of Patent: January 17, 2012
    Assignee: Microsoft Corporation
    Inventors: Roderic C. Deyo, Sandeep Sahasrabudhe, Sunil Swami, Brian D. Price
  • Publication number: 20120011156
    Abstract: Methods, systems, devices and/or apparatuses are provided for computationally deriving molecular association connectivity maps for the study of inter-class molecular associations in toxicogenomics and drug discovery applications. The inter-class molecular associations can be between at least one bio-molecular entity and at least one therapeutic agent. The methods, systems, devices and/or apparatuses apply integrated molecular interaction network mining and text mining techniques.
    Type: Application
    Filed: June 29, 2011
    Publication date: January 12, 2012
    Applicant: Indiana University Research and Technology Corporation
    Inventor: Jake Yue Chen
  • Publication number: 20120011155
    Abstract: Embodiments of the invention related to a method and system for finding a distance between a plurality of time series, wherein each individual time series in the plurality of time series including a data, wherein the data is uncertain, and using such distance computed in business applications.
    Type: Application
    Filed: July 9, 2010
    Publication date: January 12, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Karin Murthy, Smruti R. Sarangi
  • Patent number: 8095489
    Abstract: A Real-Time Group Intelligence Creation System 100 comprising of Group Intelligence Creation Controller 200, Facilitator Expert System 300, Adaptive Group Intelligence Mining Engine 400, Intelligent Web Communicator 500, Idea and Solution Source Data Server 600, and Classification, Extraction, Thinking Pattern and Hint Data Server 700. The System extends traditional computational grid to include idea creations and problem solving generations to form a collaborative thinking grid that is made up of mass volume of participants using either mobile device or stationary device, and is without the need of face-to-face interaction. The System uses both shallow knowledge and deep knowledge mining agents to mine unstructured ideas and solutions in real-time for unifying multiple topics and generating classifications, extractions, thinking patterns and hints. This information are provided to participants during creation processes in order to simulate and accelerate participants' thinking further.
    Type: Grant
    Filed: June 5, 2009
    Date of Patent: January 10, 2012
    Inventors: Thomas C. H. Chen, Jenny P. Chen
  • Publication number: 20120005195
    Abstract: A method for generating an ontology may include selecting, by a processing device, a tag in a tag cloud. The method may also include searching, by the processing device, an online encyclopedia for content corresponding to the selected tag and determining, by the processing device, at least one category to which the content belongs in the online encyclopedia in response to finding the content corresponding to the selected tag in the online encyclopedia. The method may additionally include adding, by the processing device, a class to the ontology corresponding to the at least one category of the content in the online encyclopedia.
    Type: Application
    Filed: June 30, 2010
    Publication date: January 5, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: CLYDE LEE CAIN, JR., FENG-WEI CHEN, JOO Y. LEE, MARGARET H. MAGO, NIKHIL PAREKH, WILLIAM D. REED
  • Publication number: 20110320019
    Abstract: A data mining system and method retrieve data related to an item from a database. A survey is generated for presentation in a game. The survey includes the retrieved item data and solicits from a user input data pertaining to the retrieved item data. The input data is received from the survey and stored in a database with the item data. The input data is transmitted to the game and incorporated into the game such that the user interacts with the input data as part of playing the game.
    Type: Application
    Filed: April 21, 2011
    Publication date: December 29, 2011
    Applicant: EBAY INC.
    Inventors: Kirk Lanciani, Nicole Stewart, Steve Washington, Neelakantan Sundaresan
  • Publication number: 20110320491
    Abstract: A module and method for determining a named entity of a terminology using a named entity dictionary and a mining rule combined with an ontology schema is provided. The module includes a named entity dictionary and mining rule database storing the named entity dictionary and a mining rule database; a mining pattern generation unit recognizing a terminology from a text and converting the terminology into a mining pattern; a named entity and mining rule search unit searching for a corresponding named entity and a mining rule respectively from the named entity dictionary and the mining rule database using the recognized terminology and the mining pattern; and a names entity selection unit selecting, if two or more named entities corresponding to the recognized terminology are searched, a named entity matching to the concept configuring the RDF triple of the searched mining rule as a named entity of the terminology among the searched named entities.
    Type: Application
    Filed: June 4, 2011
    Publication date: December 29, 2011
    Applicant: KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION
    Inventors: Han Min JUNG, Pyung KIM, Seung Woo LEE, Mi Kyung LEE, Dong Min SEO, Won Kyung SUNG
  • Publication number: 20110320490
    Abstract: An apparatus and method for updating a named entity dictionary or a mining rule database using the named entity dictionary and a mining rule combined with an ontology schema is provided. The apparatus includes a named entity dictionary and mining rule database storage module storing the named entity dictionary and a mining rule database; a mining pattern generation module recognizing a terminology from a text and converting the terminology into the mining pattern; a named entity and mining rule search module searching for a corresponding named entity and a mining rule from the named entity dictionary and the mining rule database using the recognized terminology and the mining pattern; and a named entity dictionary update module estimating a named entity of the terminology using the mining rule and storing the estimated named entity of the terminology in the named entity dictionary depending on a user's selection.
    Type: Application
    Filed: June 3, 2011
    Publication date: December 29, 2011
    Applicant: KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION
    Inventors: Han Min JUNG, Pyung KIM, Seung Woo LEE, Mi Kyung LEE, Dong Min SEO, Won Kyung SUNG
  • Publication number: 20110320492
    Abstract: A method and system that allows anyone to flag good or bad driving incidents by fellow motorists. Driving behavior data is captured as user-generated content, and the system includes the necessary provision to verify the authenticity and accuracy of all records submitted. The database of driving records is subsequently used to calculate a driver risk-score, allowing companies to make informed decisions related to their business when impacted by driver behavior (e.g. calculating a car insurance premium).
    Type: Application
    Filed: June 24, 2011
    Publication date: December 29, 2011
    Applicant: DriveMeCrazy, Inc.
    Inventor: Philip INGHELBRECHT
  • Publication number: 20110320493
    Abstract: Method for extracting information from a data file comprising a first step wherein the data are transmitted to a device (3.1) or “tokenizer” adapted to convert them in the course of a first step into elementary units or “tokens”, the elementary units being transmitted to a second step of searching in the dictionaries (3.2) and a third step (3.3) of searching in grammars, wherein, for each conversion step, a sliding window of given size is used, the data are converted into “tokens” as and when they arrive in the tokenizer and the tokens are transmitted as and when they are formed to the step of searching in dictionaries (3.2), then to the step of searching in the grammars (3.3).
    Type: Application
    Filed: September 6, 2011
    Publication date: December 29, 2011
    Applicant: THALES
    Inventor: Julien LEMOINE
  • Publication number: 20110307303
    Abstract: A computer-implemented method for predicting a future characteristic of a worker is provided. The method includes collecting a plurality of attributes associated with each of a plurality of workers, applying a data mining tool to the attributes to identify a pattern between the attributes and a future characteristic of the workers, and using the identified pattern to predict the future characteristic of a worker. In one example, the future characteristic is the future performance of the employee and/or the likelihood that the worker leaves at some point in the future.
    Type: Application
    Filed: June 14, 2010
    Publication date: December 15, 2011
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Debasis DUTTA, Brian GASPAR, Julian CHALLENGER, Dinesh ARORA
  • Patent number: 8078407
    Abstract: The present invention describes a system and method of using individuals' behavioral and physiologic information to identify disease-influencing genes.
    Type: Grant
    Filed: February 2, 2000
    Date of Patent: December 13, 2011
    Assignee: Health Hero Network, Inc.
    Inventor: Stephen J. Brown
  • Patent number: 8078634
    Abstract: A computer automated method and system of presenting data. The method may include the steps of inputting a set of user-defined instructions into a remotely located computer database system via a local network connection, inputting a user query into the computer database system via the local network connection, mining the computer database system for data relevant to the user query, creating a data set comprising the data relevant to the user query, and aggregating data in the data set using domain metrics selected based on any of predefined and configurable rules and past user usage. The aggregation may further include tagging all data attributes in the data set based on database metadata and inputs from a user, wherein the data attributes comprise any of data identifications (IDs), data grouping attributes, and data measure attributes.
    Type: Grant
    Filed: December 22, 2010
    Date of Patent: December 13, 2011
    Assignee: Semantifi, Inc.
    Inventors: Sreenivasa R Pragada, Viswanath Dasari
  • Publication number: 20110302124
    Abstract: Described herein is a technology that facilitates efficient automated mining of topic-related aspects of user generated content based on automated analysis of the user generated content. Locations are automatically learned based on dividing documents into document segments, and decomposing the segments into local topics and global topics. Techniques described herein include, for example, computer annotating travelogues with learned tags, performing topic learning to obtain an interest model, and performing location matching based on the interest model.
    Type: Application
    Filed: June 8, 2010
    Publication date: December 8, 2011
    Applicant: Microsoft Corporation
    Inventors: Rui Cai, Qiang Hao, Changhu Wang, Rong Xiao, Lei Zhang
  • Publication number: 20110295851
    Abstract: An annotation suggestion platform is described herein. The annotation suggestion platform may comprise a client and a server, where the client captures a media object and sends the captured object to the server, and the server provides a list of suggested annotations for a user to associate with the captured media object. The user may then select which of the suggested metadata is to be associated or stored with the captured media. In this way, a user may more easily associate metadata with a media object, facilitating the media object's search and retrieval. The server may also provide web page links related to the captured media object. A user interface for the annotation suggestion platform is also described herein, as are optimizations including indexing and tag propagation.
    Type: Application
    Filed: May 28, 2010
    Publication date: December 1, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Motaz Ahmed El-Saban, Xin-Jing Wang, May Abdelreheem Sayed
  • Publication number: 20110295894
    Abstract: System and method for matching a pattern are provided. The pattern matching method includes performing a sub pattern matching operation to match at least one sub data of a plurality of sub data of a target data with a pre-stored pattern data, and performing a full pattern matching operation to determine whether the target data is identical to at least the pre-stored pattern data by referring to a result of the sub pattern matching operation, and wherein the full pattern matching operation is performed or not performed according to a type of the pre-stored pattern data. Accordingly, an accurate matching operation is performed with respect to the target data of various patterns.
    Type: Application
    Filed: May 26, 2011
    Publication date: December 1, 2011
    Applicant: SAMSUNG SDS CO., LTD.
    Inventor: InSeon YOO
  • Publication number: 20110295892
    Abstract: A method and system for web mining and clustering is described. The method includes receiving and dividing input data into a plurality of primitive datasets. Additionally, one or more combinations of the plurality of primitive datasets may be created. Further, a model for each primitive dataset in the plurality of primitive datasets and each of the one or more combinations of the plurality of primitive datasets may be generated. Subsequently, a cost associated with a model corresponding to each primitive dataset in the plurality of primitive datasets, and each of the one or more combinations of the plurality of primitive datasets may be computed. Further, a sum of the costs associated with the models corresponding to each primitive dataset in the plurality of primitive datasets may be compared with the cost associated with each model corresponding to each of the one or more combinations of the plurality of primitive datasets.
    Type: Application
    Filed: May 25, 2010
    Publication date: December 1, 2011
    Applicant: GENERAL ELECTRIC COMPANY
    Inventors: Scott Charles Evans, Abha Moitra, Thomas Stephen Markham, Steven Matt Gustafson
  • Publication number: 20110295893
    Abstract: A method of searching an expected image in an electronic apparatus comprises the steps of inputting a hand drawing of the expected image into the electronic apparatus; determining whether or not a text description for partially characterizing the expected image is inputted; identifying and searching the expected image in the electronic apparatus according to the hand drawing if the text description is not inputted, or selecting a text label from the text description and interpreting the selected text label by the electronic apparatus if the text description is inputted; and searching a database in the electronic apparatus according to the text label, and fetching the expected image from the database if the value of the image item matches the text label. The hand drawing and/or text label inputted from a mobile phone screen are provided for arranging and searching pictures or images in the database efficiently.
    Type: Application
    Filed: April 21, 2011
    Publication date: December 1, 2011
    Applicants: INVENTEC APPLIANCES (SHANGHAI) CO. LTD., INVENTEC APPLIANCES (NANCHANG) CO. LTD., INVENTEC APPLIANCES CORP.
    Inventor: PENG-FEI WU
  • Publication number: 20110295832
    Abstract: Techniques for identifying one or more communities in an information network are provided. The techniques include collecting one or more nodes and one or more edges from an information network, performing a random walk on the one or more nodes to produce a sequence of one or more nodes, creating a sequence database from one or more sequences produced via random walk, and mining the sequence database to determine one or more patterns in the network, wherein the one or more patterns identify one or more communities in the information network.
    Type: Application
    Filed: May 28, 2010
    Publication date: December 1, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Charu C. Aggarwal, Rajesh R. Bordawekar
  • Patent number: 8065326
    Abstract: Decision trees are efficiently represented in a relational database. A computer-implemented method of representing a decision tree model in relational form comprises providing a directed acyclic graph comprising a plurality of nodes and a plurality of links, each link connecting a plurality of nodes, encoding a tree structure by including in each node a parent-child relationship of the node with other nodes, encoding in each node information relating to a split represented by the node, the split information including a splitting predictor and a split value, and encoding in each node a target histogram.
    Type: Grant
    Filed: February 1, 2006
    Date of Patent: November 22, 2011
    Assignee: Oracle International Corporation
    Inventors: Wei Li, Shiby Thomas, Joseph Yarmus, Ari W. Mozes, Mahesh Jagannath
  • Patent number: 8055677
    Abstract: An analyzer/classifier/synthesizer/prioritizing tool for data comprises use of an admissible geometrization process with data transformed and partitioned by an input process into one or more input matrices and one or more partition classes and one or more scale groups. The data to be analyzed/classified/synthesized/prioritized is processed by an admissible geometrization technique such as 2-partition modified individual differences multidimensional scaling (2p-IDMDS) to produce at least a measure of geometric fit. Using the measure of geometric fit and possibly other 2p-IDMDS output, a back end process analyzes, synthesizes, classifies, and prioritizes data through patterns, structure, and relations within the data.
    Type: Grant
    Filed: December 28, 2010
    Date of Patent: November 8, 2011
    Inventor: Abel Gordon Wolman
  • Publication number: 20110270882
    Abstract: RDF network construction device and method using an ontology schema having class dictionaries and mining rules are provided. The RDF network construction device includes an ontology schema storing module, a class managing module, a mining rule managing module, a mining pattern creating module, and an RDF triple creating module.
    Type: Application
    Filed: October 5, 2010
    Publication date: November 3, 2011
    Applicant: KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION
    Inventors: Han Min JUNG, Pyung KIM, Seung Woo LEE, Mi Kyung LEE, Dong Min SEO, Won Kyung SUNG
  • Patent number: 8051095
    Abstract: Systems and methods for data classification to facilitate and improve data management within an enterprise are described. The disclosed systems and methods evaluate and define data management operations based on data characteristics rather than data location, among other things. Also provided are methods for generating a data structure of metadata that describes system data and storage operations. This data structure may be consulted to determine changes in system data rather than scanning the data files themselves.
    Type: Grant
    Filed: January 28, 2010
    Date of Patent: November 1, 2011
    Assignee: CommVault Systems, Inc.
    Inventors: Anand Prahlad, Jeremy A. Schwartz, David Ngo, Brian Brockway, Marcus S. Muller
  • Patent number: 8051066
    Abstract: Configurably storing data in a plurality of files based on expressions and conditions associated with the data. Logging software enables tracking of the navigation pattern of users for selected network properties under specified conditions. The logging software is configurable such that most current and future logging specifications may be fulfilled without any code changes to the logging software.
    Type: Grant
    Filed: July 7, 2009
    Date of Patent: November 1, 2011
    Assignee: Microsoft Corporation
    Inventors: Rajeev Prasad, Kevin Paul Kornelson
  • Publication number: 20110264663
    Abstract: A server arrangement for managing observation data of wireless devices, including data input logic for obtaining observation data from wireless devices, the obtained data including behavioral and contextual raw data relative to the wireless devices, data mining logic for establishing a number of derived data elements, on the basis of processing and analyzing the obtained observation and optional supplementary data, the processing and analyzing incorporating aggregation procedures. At least one derived data element includes usage metrics with contextual dimension relative to applications or other features of wireless devices and users, data storage for storing the obtained data and the number of derived information elements, and a data distribution logic providing derived data. The distribution logic may serve a data query constructed by an external entity through provision of derived information from derived data elements according to the query parameters.
    Type: Application
    Filed: May 8, 2009
    Publication date: October 27, 2011
    Applicant: ZOKEM OY
    Inventor: Hannu Verkasalo
  • Patent number: 8046322
    Abstract: A method of mining data to discover activity patterns within the data is described. The method includes receiving data to be mined from at least one data source, determining which of a number of specified interests and constraints are associated with the mining process, selecting corresponding mining agents that combine search algorithms with propagators from the specified constraints, and finding any activity patterns that meet the specified interests and constraints.
    Type: Grant
    Filed: August 7, 2007
    Date of Patent: October 25, 2011
    Assignee: The Boeing Company
    Inventors: Changzhou Wang, Anne Kao, Jai J. Choi, Rodney A. Tjoelker
  • Publication number: 20110258229
    Abstract: Techniques for utilizing data mining technology to extract universal topics with multilingual representations from a multilingual database, and to organize existing or new documents in different languages by analyzing their respective topic distributions.
    Type: Application
    Filed: April 15, 2010
    Publication date: October 20, 2011
    Applicant: Microsoft Corporation
    Inventors: Xiaochuan Ni, Jian-Tao Sun, Zheng Chen, Jian Hu
  • Publication number: 20110258374
    Abstract: A method and system of optimizing the memory usage and performance of data deduplication storage systems includes organizing the metadata of data blocks needed by deduplicating storage systems. A three level hierarchy is used. Level 1 stores the metadata on disk along with the user data. Level 2 uses low latency storage (e.g. RAM and Solid State Disks) to cache the on-disk meta data for faster direct access. Level 3 organizes the fingerprints using a Trie and is entirely resident in RAM. Thus, the search, to determine whether a data block is unique or not and a candidate for transfer, can be more efficiency executed and to ensure that the meta data is transactionally secure.
    Type: Application
    Filed: April 18, 2011
    Publication date: October 20, 2011
    Applicant: GREENBYTES, INC.
    Inventor: Robert Pertocelli
  • Patent number: 8041619
    Abstract: A hybrid model for new account acquisition is disclosed. A software tool can be provided to implement a statistical model that produces a hybrid score and corresponding decile segregation. The statistical model is a hybrid of a net conversion rate (NCR) model and an approval model, and may thus be referred to herein as a hybrid model. In example embodiments, a set of hybrid scores is calculated for each of a plurality of values of alpha to produce a decile level lift table for each set of hybrid scores. Potential values for alpha tend to lower the hybrid scores for declined prospects. The decile level lift tables can be used to facilitate selection of a value for alpha to optimize a performance metric associated with the new account acquisition. In some embodiments, a cost-benefit curve can be created from the lift tables.
    Type: Grant
    Filed: October 24, 2008
    Date of Patent: October 18, 2011
    Assignee: Bank of America Corporation
    Inventors: Xiahou Liu, Richard W. Cole, Shaohui Jia
  • Patent number: 8037016
    Abstract: A system and method are disclosed for the transcoding of data from a first format to a second format. A data format transcoder receives a request for data from a requester. The desired format of the requested data is determined. A descriptor file containing formatting information describing the requested format is loaded into the data format transcoder. The requested data is retrieved in its native format along with its corresponding common descriptor. The requested data is read in its native format, using the formatting information from its associated common descriptor. The data format transcoder then uses the formatting information in the common descriptor of the requested format to perform transcoding operations to convert the requested data from its native format into the requested format.
    Type: Grant
    Filed: July 9, 2008
    Date of Patent: October 11, 2011
    Assignee: Dell Products L.P.
    Inventors: Bogdan Odulinski, James C. Lowery, Jimmy D. Pike, Drue Reeves, Brent Schroeder
  • Patent number: 8037009
    Abstract: An embodiment relates generally to a method of linking. The method includes receiving a message associated with at least one technical issue being resolved in a first system and containing non-confidential information and searching a knowledgebase in a second system based on the message to obtain at least one related entry. The method also includes associating at least one related entry with the non-confidential information of the message, updating at least one related entry with the non-confidential information, or creating a new entry with the non-confidential information, in the knowledgebase.
    Type: Grant
    Filed: August 27, 2007
    Date of Patent: October 11, 2011
    Assignee: Red Hat, Inc.
    Inventor: Jason S. Hibbets
  • Publication number: 20110246521
    Abstract: A system for discovering information related to diagnostic imaging performance at a medical imaging site. The system includes at least one database of stored digital diagnostic images; and a user instruction interface for obtaining an operator request for information related to image quality of the stored digital diagnostic images. A data processor is in communication with the at least one database, the data processor being programmed with instructions to use only information found within the stored digital diagnostic images themselves. A data mining engine is in communication with the data processor, the data mining engine being programmed with instructions to use only information found within the retrieved digital diagnostic images themselves.
    Type: Application
    Filed: May 10, 2011
    Publication date: October 6, 2011
    Inventors: Hui Luo, Jacquelyn S. Whaley, David H. Foos
  • Publication number: 20110238674
    Abstract: A system for creation of term taxonomies by mining web based user generated content according. The system includes a network interface enabling access to one or more data sources; a mining unit for collecting textual content from the one or more sources and generating phrases, the generated phrases include sentiment phrases and non-sentiment phrases; an analysis unit for generating at least associations between a non-sentiment phrase and a sentiment phrase based on the generated phrases, wherein an association between a non-sentiment phrase and at least one corresponding sentiment phrase is a taxonomy; and storing the taxonomies in a data warehouse storage connected to the network wherein responsive to a query the system provides a sentiment to a non-sentiment phrase provided in the query.
    Type: Application
    Filed: March 17, 2011
    Publication date: September 29, 2011
    Applicant: TAYKEY LTD.
    Inventors: Amit Avner, Omer Dror, Itay Birnboim
  • Patent number: 8027981
    Abstract: A method, system and program product for classifying data elements into different levels of a business hierarchy. The method includes identifying data elements to be classified into one or more levels of a business hierarchy, selecting a first logic decision tree for evaluating the data elements identified for classification into the hierarchy and executing the first tree for recursively evaluating each data element identified until the first tree has been traversed. Further, the method includes dynamically creating configurable anchor point classifications for the data elements evaluated through the first tree and assigning a respective anchor point classification to each data element evaluated, such that, a respective anchor point classification assigned to a data element evaluated links the data element to a lowest level of the hierarchy, and where the anchor point classification conveys classification information as to each higher level of the hierarchy that the data element belongs to.
    Type: Grant
    Filed: December 10, 2008
    Date of Patent: September 27, 2011
    Assignee: International Business Machines Corporation
    Inventors: James D. Episale, Mark A. Musa, David G. Ruest
  • Publication number: 20110231217
    Abstract: A method and system for integration of real-time field data in chemical delivery vehicle operations is disclosed. Initially, information for a plurality of regions designated for chemical delivery is received at a command center, each of the plurality of regions having a defined boundary. Real-time and forecast environmental conditions are utilized in conjunction with prior and real-time field data to rank each of the plurality of regions. One or more chemical delivery vehicles are automatically selected and dispatched to one or more of the plurality of regions. When the one or more chemical delivery vehicles are within the defined boundary of one of the plurality of regions for chemical delivery, region specific chemical delivery procedures incorporating real-time environmental conditions are automatically calculated and initiated.
    Type: Application
    Filed: March 18, 2010
    Publication date: September 22, 2011
    Inventor: Lynn HAND
  • Publication number: 20110231444
    Abstract: Sources of operational problems in business transactions often show themselves in relatively small pockets of data, which are called trouble hot spots. Identifying these hot spots from internal company transaction data is generally a fundamental step in the problem's resolution, but this analysis process is greatly complicated by huge numbers of transactions and large numbers of transaction variables to analyze. A suite of practical modifications are provided to data mining techniques and logistic regressions to tailor them for finding trouble hot spots. This approach thus allows the use of efficient automated data mining tools to quickly screen large numbers of candidate variables for their ability to characterize hot spots. One application is the screening of variables which distinguish a suspected hot spot from a reference set.
    Type: Application
    Filed: May 27, 2011
    Publication date: September 22, 2011
    Applicant: Verizon Patent and Licensing Inc.
    Inventor: James Howard Drew
  • Publication number: 20110231443
    Abstract: A scalable access filter that is used together with others like it in a virtual private network to control access by users at clients in the network to information resources provided by servers in the network. Each access filter uses a local copy of an access control data base to determine whether an access request is made by a user. Each user belongs to one or more user groups and each information resource belongs to one or more information sets. Access is permitted or denied according to access policies which define access in terms of the user groups and information sets. The first access filter in the path performs the access check, encrypts and authenticates the request; the other access filters in the path do not repeat the access check. The interface used by applications to determine whether a user has access to an entity is now an SQL entity. The policy server assembles the information needed for the response to the query from various information sources, including source external to the policy server.
    Type: Application
    Filed: March 21, 2011
    Publication date: September 22, 2011
    Inventors: Clifford Lee Hannel, Anthony May
  • Publication number: 20110231261
    Abstract: Methods are provided for determining a customized voice to be used for an audio advertisement. Text of an advertisement is received, and it is determined that audio is to be generated from the text of the advertisement, and that the audio is to comprise a customized voice. The customized voice is selected by the advertiser. The audio is generated from the text of the advertisement, and the text, audio, and indications that the advertisement is voice-enabled and that a customized voice is to be used in association with the audio are stored in an advertisement storage. When a request is received for advertisements, at least one advertisement is communicated for presentation to the user, such that the text is visually presented to the user and audio comprising the customized voice is audibly presented to the user.
    Type: Application
    Filed: March 17, 2010
    Publication date: September 22, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: PRAVEEN CHAKRAVARTHY SATTARU, TCHAVDAR DANGALTCHEV
  • Publication number: 20110225193
    Abstract: A method for retrieving data in a data source is provided. The method includes receiving a search term; identifying an active tag associated with the search term; correlating the active tag to dynamic data that is operative to adapt to a mining context in which data is stored; and retrieving the data using the dynamic data.
    Type: Application
    Filed: March 9, 2010
    Publication date: September 15, 2011
    Inventors: Cullen F. Jennings, Joseph Brian Burton, Thomas M. Wesselman, Shantanu Sarkar
  • Publication number: 20110225194
    Abstract: Disclosed herein is an apparatus and method for analyzing information about floating population. The apparatus includes an information collection unit, a data integration unit, a data mining analysis unit, and an interface unit. The information collection unit collects information about locations provided by mobile communication terminals of moving objects, information about attributes of the moving objects, and information about locations and attributes related to stationary objects. The data integration unit creates integrated data by integrating the information collected by the information collection unit, national statistical information, and map data registered previously. The data mining analysis unit extracts data, consistent with conditions input by a system user, from the integrated data, and searches the map data for based moving patterns of the moving objects using data mining analysis. The interface unit provides a map service in which search results have been applied to the map data.
    Type: Application
    Filed: March 8, 2011
    Publication date: September 15, 2011
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: In-Sung JANG, Jin-Hyoung PARK, Moon-Soo LEE, Chung-Ho LEE, In-Hak JOO, Min-Soo KIM, Ju-Wan KIM
  • Publication number: 20110225195
    Abstract: The techniques introduced here provide a method of gathering ecommerce data. The techniques described here allow a system to return information about a product from several non-related ecommerce sites in response to a single search query. Using the techniques described here, a data mining system determines from the search query a product ID and retrieves from a database one or more product links that correspond to the product ID. Using the product links retrieved from the database, the data mining system traverses the links and parses the web-pages corresponding each of the links to determine up to date product information. The product information can then be returned to the application that initiated the request.
    Type: Application
    Filed: March 11, 2011
    Publication date: September 15, 2011
    Inventors: Kristopher Kubicki, Lawrence Hsieh
  • Patent number: 8019752
    Abstract: A data-driven information navigation system and method enable search and analysis of a set of objects or other materials by certain common attributes that characterize the materials, as well as by relationships among the materials. The invention includes several aspects of a data-driven information navigation system that employs this navigation mode. The navigation system of the present invention includes features of a knowledge base, a navigation model that defines and enables computation of a collection of navigation states, a process for computing navigation states that represent incremental refinements relative to a given navigation state, and methods of implementing the preceding features.
    Type: Grant
    Filed: November 10, 2005
    Date of Patent: September 13, 2011
    Assignee: Endeca Technologies, Inc.
    Inventors: Adam J. Ferrari, Frederick C. Knabe, Vinay Seth Mohta, Jason Paul Myatt, Benjamin S. Scarlet, Daniel Tunkelang, John S. Walter, Joyce Wang, Michael Tucker
  • Patent number: 8019769
    Abstract: A system and method are provided for comparing portions of document text with potential citation components, determining if individual portions correspond to a citation component, and determining if a set of portions correspond to a valid citation pattern. A set of valid citation patterns is provided. Each citation pattern may include a specified combination of citation components. The invention further relates to identifying potential citation components from text in a document, analyzing a pattern of the identified citation components by comparing the pattern to a set of stored citation patterns to determine if the potential citation is a type of citation, and if so, is it a valid (and/or invalid) citation pattern. Once citation patterns have been determined in the document, annotations may be inserted into the document, and subsequent action may be taken, for example, generating a list of citations, providing research services, error-handling, and/or providing other options related to the citations.
    Type: Grant
    Filed: January 18, 2008
    Date of Patent: September 13, 2011
    Assignee: Litera Corp.
    Inventor: Tony Rollé
  • Patent number: 8019762
    Abstract: An information processing apparatus 100 for realizing a binary data classification method of the present invention includes a CPU for computing a column vector a that has at least a quarter of its components equal to zero, which satisfies diag(y)Dna>0, where a represents a column vector having a coefficient of each term of the set polynomial function as an element, Dn represents a matrix determined on the basis of a combination of the values taken by the respective terms, and y represents a row vector having as an element the value of a class to which binary data in which a value of each element is 1 or ?1 should be classified when the binary data is given, and thus classifies the data of an object of classification, which is inputted through a keyboard, in accordance with a set polynomial function.
    Type: Grant
    Filed: February 2, 2007
    Date of Patent: September 13, 2011
    Assignee: Japan Science & Technology Agency
    Inventor: Erhan Oztop
  • Publication number: 20110219013
    Abstract: Methods and systems supporting curation of items in a searchable knowledge base are provided. The methods and systems include mining one or more search queries of the searchable knowledge base, where each of the search queries includes a plurality of the items. The method further includes determining one or more pairs of items using a processor, where each of the pairs of items includes a correlation value exceeding a threshold. The correlation values for the pairs of items are based upon the frequency the items of the pairs of items co-occur within the search queries. The method further includes providing the pairs of items to a curator, where the curator reviews the pairs of items.
    Type: Application
    Filed: March 5, 2010
    Publication date: September 8, 2011
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventor: John T. Maxwell, III
  • Publication number: 20110213804
    Abstract: Disclosed herein is a system structure for extracting relations between technical terms within a large amount of literature information using verb-based patterns. The present invention provides a system that is capable of extracting relations based on verb-based patterns from abstract and bibliography databases in all fields of science and technology using a Tech Association Mining Appliance (TAMA) capable of detecting the technical terms of text and relations therebetween in academic literature databases in the fields of science and technology. The present invention has an advantage of providing a practical relation extraction system structure using a number of academic databases.
    Type: Application
    Filed: December 15, 2008
    Publication date: September 1, 2011
    Applicant: KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION
    Inventors: Min Ho Lee, Yun Soo Choi, Sung Pil Choi, Nam Gyu Kang, Kwang Young Kim, Han Gee Kim, Chang Hoo Jeong, Min Hee Cho, Hwa Mook Yoon
  • Patent number: 8010551
    Abstract: A computer server system may include a plurality of database modules for storing user data for a plurality of users, and at least one processing module comprising a plurality of processing threads for processing jobs for users based upon respective user data. The computer server system may further include a database pool module connected between the plurality of database modules and the at least one processing module. The database pool module may be for selectively connecting the processing threads to corresponding database modules including respective user data for jobs to be processed, and determining when a database module becomes unresponsive and terminating processing thread connections to the unresponsive database module based thereon. The database pool module may also be for determining when the unresponsive database module becomes responsive and restoring processing thread connectivity thereto based thereon.
    Type: Grant
    Filed: May 3, 2010
    Date of Patent: August 30, 2011
    Assignee: Research in Motion Limited
    Inventors: Nathan Provo, Harshad N. Kamat
  • Publication number: 20110208725
    Abstract: A computer system and method is disclosed for mining current and archived address data in order to identify a preferred address for each service point in a territory. The data mining system may start in response to the presentation of a candidate address for matching The set of mined data may be prioritized by clustering like characteristics, building similarity matrices, and by constructing dendrograms with nodes joined according to common characteristics. A computer system and method for maintaining a central database of preferred addresses is also disclosed. Selected address data gathered in a queue may be scored by characteristic, grouped by consignee location, and staged for processing. The scored queue of data may be prioritized by clustering like characteristics, building similarity matrices, and by constructing dendrograms.
    Type: Application
    Filed: March 14, 2011
    Publication date: August 25, 2011
    Inventors: Timothy C. Owens, Duane Anderson
  • Patent number: RE42870
    Abstract: A text mining system for collecting business intelligence about a client, as well as for identifying prospective customers of the client, for use in a lead generation system accessible by the client via the Internet. The text mining system has various components, including a data acquisition process that extracts textual data from Internet web sites, including their logs, content, processes, and transactions. The system compares log data to content and process data, and relates the results of the comparison to transaction data. This permits the system to provide aggregate cluster data representing statistics useful for customer lead generation.
    Type: Grant
    Filed: December 1, 2008
    Date of Patent: October 25, 2011
    Assignee: Dafineais Protocol Data B.V., LLC
    Inventors: John C. Seibel, Yu Feng, Robert L. Foster