Data Mining Patents (Class 707/776)
  • Patent number: 8935277
    Abstract: A question is received to be answered by a question answering (QA) system. The question may be a business intelligence question that is expressed in a natural language. The question is parsed. The parsed question is matched to a pattern from a number of patterns. A technical query associated with the matched pattern is processed to retrieve data relevant to the question from a number of data sources. The QA system generates an answer to the question based on retrieved data. In one aspect, the QA system generates answers based contextual information.
    Type: Grant
    Filed: March 30, 2012
    Date of Patent: January 13, 2015
    Assignee: SAP SE
    Inventors: Nicolas Kuchmann-Beauger, Marie-Aude Aufaure, Raphael Thollot
  • Patent number: 8935285
    Abstract: A method and system for tracking visitors' access to web content using a searchable and size-constrained local log repository is disclosed. A repository indexer receives visitor logs from a remote source and indexes the logs to include a usage field indicating when or how frequently an associated visitor log was accessed from a local log repository by a report request manager. The local log repository stores the logs and is size constrained. A repository manager removes a subset of the logs in the local log repository based on parameters within the subset of the logs' usage field. The report request manager receives a request from a requestor for a report indicating visitors' access to a content object, searches for logs within the local log repository pertinent to the request, aggregates data across the logs responsive to the search, and generates a report presenting the aggregated data.
    Type: Grant
    Filed: July 25, 2013
    Date of Patent: January 13, 2015
    Assignee: Limelight Networks, Inc.
    Inventors: Soam Acharya, Paul Cho, Jonathan Cornwell, Chris Kwok
  • Publication number: 20150012563
    Abstract: A method of mining frequent items in data is described. Categorical associations between elements of data are the core of information contained in the data and are all that is needed to perform data mining. These associations are extracted from data and held in optimized associative matrices whose structure is independent of the nature and structure of the data. All data mining operations and discoveries can be performed using only these associative matrices which provides many advantages over present methods. It allows real-time interactive navigation through the information in the data, enables efficient automatic and user guided determination of the most highly correlated data components, and a winnowing navigation through a large number of automatically determined associations, as for example frequent item sets, amongst which the needle-in-the-haystack may be more easily found.
    Type: Application
    Filed: July 7, 2014
    Publication date: January 8, 2015
    Applicant: SpeedTrack, Inc.
    Inventor: Jerzy Josef Lewak
  • Patent number: 8930377
    Abstract: A system for creation of term taxonomies by mining web based user generated content according. The system includes a network interface enabling access to one or more data sources; a mining unit for collecting textual content from the one or more sources and generating phrases, the generated phrases include sentiment phrases and non-sentiment phrases; an analysis unit for generating at least associations between a non-sentiment phrase and a sentiment phrase based on the generated phrases, wherein an association between a non-sentiment phrase and at least one corresponding sentiment phrase is a taxonomy; and storing the taxonomies in a data warehouse storage connected to the network wherein responsive to a query the system provides a sentiment to a non-sentiment phrase provided in the query.
    Type: Grant
    Filed: March 17, 2011
    Date of Patent: January 6, 2015
    Assignee: Taykey Ltd.
    Inventors: Amit Avner, Omer Dror, Itay Birnboim
  • Publication number: 20150006519
    Abstract: Aspects of the subject disclosure are directed towards automatically inferring the significant parts of bug reports, including by querying a knowledge base built by mining information from a large number of earlier bug reports. Common phrases in the earlier bug reports are filtered to provide a subset of phrases relevant to a bug domain, which are stored in the knowledge base and mapped to an ontology model of the knowledge base. When a new bug report is received for inference/analysis, the phrases therein are tagged based upon the ontology model, and the knowledge base queried with the tagged phrases to determine problems, activities and actions that are likely related to the new bug report.
    Type: Application
    Filed: June 28, 2013
    Publication date: January 1, 2015
    Inventors: Navendu Jain, Rahul Potharaju
  • Patent number: 8924419
    Abstract: Methods and systems for automatically determining, from a body of emails, blogs, and other documents, authors of the documents who are authorities on certain subjects, and what those subjects are. An intersection of the semantic footprints of documents by an author are deemed to be the derived skills footprint of the author. The derived skills footprints of many authors are compared with a user's query to determine who is the best person that could respond to the user.
    Type: Grant
    Filed: January 10, 2011
    Date of Patent: December 30, 2014
    Assignee: salesforce.com, inc.
    Inventors: Jari Koister, Mike Micucci
  • Publication number: 20140379697
    Abstract: Cross tabulation operation is performed within a columnar database management system. The columnar database management system receives a request to perform a cross-tabulation operation on a set of database tables. The columnar database management system determines values of cross tabulation operation for each row of the result. The columnar database management system determines a domain for each value of the row dimension corresponding to a row combination. The columnar database management system determines an intersection set of the domains corresponding to values of the row dimensions for the row combination. The columnar database management system determines a value for the result column for the row combination as an aggregate value based on the records of the intersection set.
    Type: Application
    Filed: June 18, 2014
    Publication date: December 25, 2014
    Inventors: Carles Bayés Martín, Jesús Malo Poyatos, Marc Rodríguez Sierra, Alejandro Sualdea Pérez
  • Patent number: 8918396
    Abstract: An information processing apparatus determines a weight of each physical feature for hierarchical clustering by acquiring training data of multiple pieces of content in triplets with label information indicating a pair specified by a user as having a highest degree of similarity among three contents of the triplet and executing hierarchical clustering using a feature vector of each piece of content of the training data and the weight of each feature to determine the hierarchical structure of the training data. The information processing apparatus updates the weight of each feature so that the degree of agreement between a pair combined first as being the same clusters among three contents of the triplet in a determined hierarchical structure and a pair indicated by label information corresponding to the triplet increases.
    Type: Grant
    Filed: June 28, 2012
    Date of Patent: December 23, 2014
    Assignee: International Business Machines Corporation
    Inventors: Toru Nagano, Masafumi Nishimura, Takashima Ryoichi, Ryuki Tachibana
  • Patent number: 8918422
    Abstract: Improvement of the quality of name and address matching processes using e-mail domains is provided. A distinction is made between e-mail domains designed to be used by employees of an entity and domains designed to be used by individuals or organizations who aren't employees of the domain owner entity. By analyzing domain names in conjunction with known relationships between e-mail addresses and names of companies, it is possible to differentiate between employee-use domains and public-use domains and maintain a collection of employee-use domains that are associated with the domain owner's business name. When performing a name and address matching process, the e-mail domains of the input records can be checked against the collection of employee-use domains and the records for the input name and address can be supplemented to include the domain owner's name and address as alternative information.
    Type: Grant
    Filed: September 5, 2012
    Date of Patent: December 23, 2014
    Assignee: Pitney Bowes Inc.
    Inventors: Vadim L Stelman, Alla Tsipenyuk
  • Publication number: 20140372483
    Abstract: A multi-user system for text mining a large population of research documents in an efficient and cost-effective fashion includes a content repository, a text mining processor, and a derived data repository that are linked via a user-accessible, central project manager. The content repository includes a data storage device for storing the research documents and a content selection facility for receiving a user-defined query that is able to support cost-related search parameters. The query is utilized by the content selection facility to select an initial collection of documents from the data storage device. Content spread metrics are then displayed through user-intuitive reports to allow for subsequent modification of the search query to yield an optimized document collection. The optimized document collection is then parsed, tagged and clustered by the text mining processor to produce search results that are stored as a data set in the derived data repository.
    Type: Application
    Filed: June 18, 2014
    Publication date: December 18, 2014
    Inventors: Babis Marmanis, Skott Klebe, John Billington
  • Publication number: 20140372482
    Abstract: Data mining operations are performed within a columnar database management system. The columnar database management system stores input sets of data for a data mining operation. An input set of data is represented as a column of data in the columnar database management system. The columnar database management system stores instructions to perform one or more data mining operations for processing the input sets of data. The columnar database management system receives requests for performing data mining operations and performs the processing of the data mining operation within the columnar database management system. As a result, the processing of data mining operations is performed without requiring multiple data transfers between an application implementing the data mining operations and the columnar database management system.
    Type: Application
    Filed: June 12, 2014
    Publication date: December 18, 2014
    Inventors: Carles Bayés Martín, Jesús Malo Poyatos, Marc Rodríguez Sierra, Alejandro Sualdea Pérez
  • Patent number: 8914410
    Abstract: A scalable access filter that is used together with others like it in a virtual private network to control access by users at clients in the network to information resources provided by servers in the network. Each access filter uses a local copy of an access control data base to determine whether an access request is made by a user. Each user belongs to one or more user groups and each information resource belongs to one or more information sets. Access is permitted or denied according to access policies which define access in terms of the user groups and information sets. The first access filter in the path performs the access check, encrypts and authenticates the request; the other access filters in the path do not repeat the access check. The interface used by applications to determine whether a user has access to an entity is now an SQL entity. The policy server assembles the information needed for the response to the query from various information sources, including source external to the policy server.
    Type: Grant
    Filed: March 21, 2011
    Date of Patent: December 16, 2014
    Assignee: SonicWALL, Inc.
    Inventors: Clifford Lee Hannel, Anthony May
  • Patent number: 8914318
    Abstract: Provided are architectures, system, methods, and computer program products that provide a user with the ability to define an association of data and/or information from known reference sets perceived by the user as relevant to a subject matter domain, thereby imparting and formalizing some of the user's knowledge about the domain. An associative relevancy knowledge profiler may also allow a user to create a profile by modifying or restricting the known reference sets and windowing the results from the association as a user might refine any other analysis algorithms. An associative relevancy knowledge profiler may also be used to define a user profile used by the user and others. A user profile may be usable in various manners depending upon, for example, rights management permissions and restrictions for a user.
    Type: Grant
    Filed: April 21, 2008
    Date of Patent: December 16, 2014
    Assignee: Araicom Research LLC
    Inventor: Anthony Prestigiacomo
  • Publication number: 20140365524
    Abstract: Aspects of the present invention provide a solution for recognizing a pattern in a set of data, such as data streaming over a data communication system. In an embodiment, a set of data events is retrieved in the data stream. The retrieved objects each have a plurality of characteristics that can be matched to a predetermined desired characteristic, such as a key value. The retrieved data events can be evaluated with respect to a pattern, with a characteristic of data events being evaluated with respect to an aggregate value related to the pattern. This aggregate value can be updated incrementally based on the data in the characteristic. Based on the evaluation, a determination as to whether the set of data events received subsequent to the first object satisfies the pattern.
    Type: Application
    Filed: June 10, 2013
    Publication date: December 11, 2014
    Inventor: Martin J. Hirzel
  • Patent number: 8909672
    Abstract: Disclosed is a method and system of matching a string of symbols to a ruleset. The ruleset comprise a set of rules. The method includes ignoring begin anchor requirements when constructing a DFA from all the rules of the ruleset, annotating the accepting states of the DFA with the begin anchor information, executing the DFA, and checking begin anchor annotations to determine if begin anchor requirement are satisfied if an accepting state is reached. Embodiments also include rulesets with begin anchors on matches, rulesets with early exit information on non-accepting states, and rulesets with accept begin anchors in accepting states.
    Type: Grant
    Filed: January 18, 2012
    Date of Patent: December 9, 2014
    Assignee: LSI Corporation
    Inventor: Michael Ruehle
  • Patent number: 8909623
    Abstract: Systems and methods are provided to select potential titles for online content using search query logs from web search service providers. A plurality of search queries are collected from one or more web search service providers. A lifetime value is determined for each of the search queries. Potential titles are then selected from the plurality of search queries using selection criteria including the lifetime value of the search queries. The potential titles can then be provided to content developers who develop online content based on the potential titles.
    Type: Grant
    Filed: June 29, 2010
    Date of Patent: December 9, 2014
    Assignee: Demand Media, Inc.
    Inventors: David M. Yehaskel, Henrik M. Kjallbring
  • Patent number: 8909646
    Abstract: Aspects and implementations of the present disclosure are directed to methods and systems of pre-processing a social network structure for fast discovery of cohesive groups. In general, in some implementations, a data processing system identifies a cohesive user group in a social network for delivery of a tailored content item. Generally, the data processing system identifies an affinity criteria; generates a set of user identifiers having characteristics that satisfy the affinity criteria; and generates graphs of users with at least one direct or indirect social network user connection with other user identifiers in the graph. The data processing system returns or stores the graph on computer readable media for later use. A graph may be generated with edges representing connections between user identifiers; edges may be weighted for the number and lengths of connection paths for indirect connections, and for similarities between users.
    Type: Grant
    Filed: December 31, 2012
    Date of Patent: December 9, 2014
    Assignee: Google Inc.
    Inventors: Alexander Fabrikant, Atish Das Sarma
  • Publication number: 20140358926
    Abstract: The present invention relates to a system, method and computer program product that is a multi-dimensional data mining environment and that operable to apply a series of temporal and relative rules (i.e., STDMn0) and is further operable in at least one of the following ways: to incorporate a framework to support temporal abstractions and relative alignments to data (i.e., STDMn0); and to derive characteristics within the data (STDMn0). The present invention may incorporate data from multiple sources, and potentially multiple centres. The analysis and alignment of the data may involve both temporal dimensions and other dimensions (or relative aspects) of the data. The present invention may further be a data mining environment that is flexible enough to permit relatively open ended queries thereby enabling, for example, the detection of trends, including trends with new dimensions, or trends based on relatively small data sets.
    Type: Application
    Filed: December 12, 2012
    Publication date: December 4, 2014
    Inventors: Carolyn Patricia McGregor, Kathleen Patricia Smith, Agam Dhanoa
  • Patent number: 8903859
    Abstract: Systems, methods, and media for generating fused risk scores for determining fraud in call data are provided herein. Some exemplary methods include generating a fused risk score used to determine fraud from call data by generating a fused risk score for a leg of call data, via a fuser module of an analysis system, the fused risk score being generated by fusing together two or more uniquely calculated fraud risk scores, each of the uniquely calculated fraud risk scores being generated by a sub-module of the analysis system; and storing the fused risk score in a storage device that is communicatively couplable with the fuser module.
    Type: Grant
    Filed: March 8, 2012
    Date of Patent: December 2, 2014
    Assignee: Verint Americas Inc.
    Inventors: Torsten Zeppenfeld, N. Nikki Mirghafori, Lisa Guerra, Richard Gutierrez, Anthony Rajakumar
  • Patent number: 8903843
    Abstract: A media recommendation system for recommending media content that is historically related to seed media content is provided. The recommended media content may be songs, television programs, movies, or a combination thereof, and the seed media content may be a song, television program, or movie.
    Type: Grant
    Filed: June 21, 2006
    Date of Patent: December 2, 2014
    Assignee: Napo Enterprises, LLC
    Inventor: Eugene M. Farrelly
  • Patent number: 8898190
    Abstract: An apparatus, method and article of manufacture of the present invention detects the presence of references to the same concept in separate sections of text, and, with no input required from the reader, presents the reader with information concerning the detected references to the concept. The information provided may comprise information related to the location of the reference to the concept in other sections of text, and the reader also is provided the ability to move from one reference to a concept directly to another reference to the same concept.
    Type: Grant
    Filed: September 24, 2012
    Date of Patent: November 25, 2014
    Inventor: Philip R. Krause
  • Patent number: 8898714
    Abstract: Systems and methods for identifying which video segment is being displayed on a screen of a television system. The video segment is identified by deriving data from the television signals, the derived data being indicative of the video segment being displayed on the screen. This feature can be used to extract a viewer's reaction (such as changing the channel) to a specific video segment (such as an advertisement) and reporting the extracted information as metrics. The systems and methods may further provide contextually targeted content to the television system. The contextual targeting is based on not only identification of the video segment being displayed, but also a determination concerning the playing time or offset time of the particular portion of the video segment being currently displayed.
    Type: Grant
    Filed: November 25, 2013
    Date of Patent: November 25, 2014
    Assignee: Cognitive Media Networks, Inc.
    Inventors: Zeev Neumeier, Edo Liberty
  • Patent number: 8886764
    Abstract: Method for retrieving data from an object database stored in a server (220) as a network (501) of objects (502) connected via links (503), said method including the steps of: storing (301) said object database in a server (220); forming (302) at a client (210) a request for a requested object (903) of a predetermined identity in said database and objects (904) connected to said requested object (903); transmitting (303) said request from said client (210) to said server (220) over a computer network (230); creating a data package containing said requested object (903) and objects connected to said requested object; and transmitting (306) said data package to said client. The invention furthermore relates to a system for retrieving data from an object database, a server, a computer program, a computer program product and a computer system.
    Type: Grant
    Filed: September 29, 2010
    Date of Patent: November 11, 2014
    Assignee: Systemite AB
    Inventors: Jan Ok Söderberg, Claes Andersson, Mikael Strömberg
  • Patent number: 8886589
    Abstract: Systems, methods, and computer-storage media for generating and providing knowledge content to users utilizing a web architecture that integrates information across data silos through a common, flexible data storage schema, such as a star or snowflake schema, are provided. Data from a content graph, a user activity graph, a social graph, and temporal data as it relates to each of the content graph, the social graph and the user activity graph, is stored in a knowledge content database utilizing the star schema. In this way, data from each of these formerly disparate sources may be accessed from a common, extensible application platform utilizing ontologies and pivot table functionality, thus providing smarter, more comprehensive knowledge in response to received user queries.
    Type: Grant
    Filed: May 16, 2013
    Date of Patent: November 11, 2014
    Assignee: Microsoft Corporation
    Inventors: Arungunram Chandrasekaran Surendran, Tarek Najm, Phani Vaddadi, Rajeev Prasad, Siva Mohan
  • Patent number: 8887076
    Abstract: Methods and systems for providing a user interface for building a graphical flowchart that represents a database query. One method includes presenting a plurality of flowchart step types, wherein each of the flowchart step types is associated with a different logical expression format. The method also includes receiving a selection of one of the plurality of flowchart step types, presenting at least one expression option for the logical expression format of the selected flowchart step type, and receiving at least one input for the expression option. The method further includes generating a graphical flowchart step associated with a logical expression, wherein the logical expression is based on the at least one input and the logical expression format of the selected flowchart step type. The method also includes displaying the graphical flowchart step and automatically generating a database query corresponding to the logical expression associated with the displayed graphical flowchart step.
    Type: Grant
    Filed: November 1, 2011
    Date of Patent: November 11, 2014
    Assignee: Aver Informatics Inc.
    Inventor: Matthew Scott Frohliger
  • Publication number: 20140324908
    Abstract: The present disclosure relates to the use of both semantic analysis and statistical text mining to process data records, improving the completeness and accuracy of records so processed. By way of example, a data record may be iteratively processed by text mining using seeds derived from a semantic template and by validating the results based on semantic reasoning based on the semantic template.
    Type: Application
    Filed: April 29, 2013
    Publication date: October 30, 2014
    Applicant: GENERAL ELECTRIC COMPANY
    Inventors: Michael Evans Graham, Andrew Walter Crapo, Abha Moitra, Gerald Bowden Wise, Steven Matt Gustafson, Victor Manuel Perez-Zarate, Luis Babaji Ng Tari
  • Patent number: 8874502
    Abstract: A method and apparatus for real time datamining. In one embodiment, the method includes receiving a user request for datamining with respect to a value from a report associated with a specific pyramid level, identifying a datamining function to be performed for statistical analysis of lower level data pertaining to the value from the report, identifying dimensions to be used as variables for the statistical analysis, and determining criteria for selecting the lower level data associated with the value from the report. The method may further include submitting a request to one or more source databases, the request reflecting the identified dimensions and the determined criteria, performing the datamining function on a data set received from the source databases, and creating a datamining report based on a result of the performed datamining function.
    Type: Grant
    Filed: August 29, 2008
    Date of Patent: October 28, 2014
    Assignee: Red Hat, Inc.
    Inventor: Eric J. Williamson
  • Patent number: 8874610
    Abstract: Methods and systems for identifying stability exceptions in a data log are disclosed. In one method, at least one key that is present in the data log is determined. The data log is comprised of at least one data set, at least one of which includes a plurality of iterations indicating states of the corresponding data set at different points in time. For each data set and for each key, a map is generated. The map indicates, for each iteration of the corresponding data set, whether the corresponding key is present in the corresponding iteration. Moreover, at least one expression pattern rule that models data item stability characteristics over data set iterations is compared to each of the maps to determine whether the corresponding map satisfies the one or more expression pattern rules. Further, at least one unstable data item is identified in the data log based on the comparison.
    Type: Grant
    Filed: December 6, 2011
    Date of Patent: October 28, 2014
    Assignee: International Business Machines Corporation
    Inventor: Michael T. Geroulo
  • Patent number: 8874611
    Abstract: An apparatus, method and article of manufacture of the present invention detects the presence of references to the same concept in separate sections of text, and, with no input required from the reader, presents the reader with information concerning the detected references to the concept. The information provided may comprise information related to the location of the reference to the concept in other sections of text, and the reader also is provided the ability to move from one reference to a concept directly to another reference to the same concept.
    Type: Grant
    Filed: September 24, 2012
    Date of Patent: October 28, 2014
    Inventor: Philip R Krause
  • Patent number: 8874432
    Abstract: Systems and methods are disclosed to perform relation extraction in text by applying a convolution strategy to determine a kernel between sentences; applying one or more semi-supervised strategies to the kernel to encode syntactic and semantic information to recover a relational pattern of interest; and applying a classifier to the kernel to identify the relational pattern of interest in the text in response to a query.
    Type: Grant
    Filed: April 3, 2011
    Date of Patent: October 28, 2014
    Assignee: NEC Laboratories America, Inc.
    Inventors: Yanjun Qi, Bing Bai, Xia Ning, Pavel Kuksa
  • Patent number: 8874500
    Abstract: The present invention generally relates to a computerized system and method for creating, optimizing, and using a rules processing system that evaluates multiple rules against facts and events and detects, identifies, reacts to, and reports on events of interest. Events of interest may pertain to any subject matter, and in an embodiment, relate to securities (e.g., stocks, bonds, etc.) transactions. The system and method of the present invention also identifies patterns in large data sets using dynamically changing rules, and as a result, makes the processing and use of rules more efficient.
    Type: Grant
    Filed: December 16, 2009
    Date of Patent: October 28, 2014
    Assignee: Barclays Capital Inc.
    Inventors: Daniel Sandholdt, Erick Berkowitz
  • Publication number: 20140317140
    Abstract: Disclosed here are methods, systems, paradigms and structures for predicting queries, creating tables to store data for the predicted queries, and selecting a particular table to obtain the data from in response to a query. The methods include determining various combinations of a finite set of columns users may query on, based on (i) a list of columns users are interested in obtaining data for, and (ii) cardinality information of a column or combinations of columns in the list of columns. The methods further includes creating various tables based on the determined combinations of the columns using a meta query language. A query is responded to by selecting a table that has least number of rows, among the tables that satisfy query parameters. The methods include selecting a table that has a longest sequence of columns matching with a portion of the query parameters.
    Type: Application
    Filed: April 18, 2013
    Publication date: October 23, 2014
    Inventors: SAMUEL RASH, TIMOTHY WILLIAMSON, MARTIN TRAVERSO
  • Publication number: 20140310313
    Abstract: A processor-implemented method, system, and/or computer program product generates and utilizes synthetic context-based objects. A non-contextual data object is associated with a context object, which comports with a predetermined set of constraints, to define a synthetic context-based object, where the non-contextual data object ambiguously relates to multiple subject-matters, and where the context object provides a context that identifies a specific subject-matter, from the multiple subject-matters, of the non-contextual data object. The synthetic context-based object is then associated with at least one specific data store, which includes data that is associated with data contained in the non-contextual data object and the context object. A request for a data store that is associated with the synthetic context-based object results in the return of at least one data store that is associated with the synthetic context-based object.
    Type: Application
    Filed: April 11, 2013
    Publication date: October 16, 2014
    Applicant: International Business Machines Corporation
    Inventors: SAMUEL S. ADAMS, ROBERT R. FRIEDLANDER, JOHN K. GERKEN, III, JAMES R. KRAEMER, PHILIP R. VARKER
  • Patent number: 8862579
    Abstract: Systems and methods for search and search optimization using a pattern in a location identifier is disclosed. In one aspect, embodiments of the present disclosure include a method, which may be implemented on a system, of search and search optimization. The method includes, detecting a set of location identifiers that have a pattern that matches a specified pattern and identifying a set of search results as having content related to the semantic type. The specified pattern can be stored in a computer-readable storage medium and corresponds to a semantic type. The set of search results can include objects associated with the set of location identifiers having the specified pattern.
    Type: Grant
    Filed: April 14, 2010
    Date of Patent: October 14, 2014
    Assignee: VCVC III LLC
    Inventors: James M. Wissner, Nova Spivack
  • Patent number: 8862621
    Abstract: A relational database is used to determine a possibility of events, such as terrorist threats. A database is populated or updated in an automated fashion by using appropriate sensor sources. Whenever a field is augmented or updated, an event is defined. Events trigger intelligent data collection agents using a push technology. A list of events is defined over a relative time interval. A selection of lists of events is made in response to events. The defined database is updated according to an iterative architecture for the defined database.
    Type: Grant
    Filed: November 26, 2008
    Date of Patent: October 14, 2014
    Assignee: The United States of America as represented by the Secretary of the Navy
    Inventor: Stuart H Rubin
  • Patent number: 8856174
    Abstract: A search extend setting unit that identifies a layer made to correspond to an asset specified by referencing a first database for recording assets made to correspond to each of users by relating each of the assets to a first layer that is a layer related to a virtual system individually used by each of the users, or to a second layer that is a layer related to hardware and software, and to set an extent for extracting information about other assets having a relationship with the specified asset according to a layer of the specified asset, and an extracting unit that extract other assets that have a relationship with the specified asset and are present in the extent set by referencing the first database and a second database for recording information indicating a relationship among the assets, and the first database based on the first asset.
    Type: Grant
    Filed: September 7, 2012
    Date of Patent: October 7, 2014
    Assignee: Fujitsu Limited
    Inventors: Shigeki Fueta, Hiroyuki Tamon, Masayuki Iguchi, Naoki Matsushita
  • Publication number: 20140270407
    Abstract: Various technologies pertaining to assigning metadata to images in a personal image collection of a user based upon images and associated metadata assigned thereto that are accessible to the user by way of a social network application are described. An account of the user in a social network application is accessed to retrieve images and metadata that is accessible to the user. A face recognition algorithm is trained based upon the retrieved images and metadata, and the trained face recognition algorithm is executed over the personal image collection of the user, where the personal image collection of the user is external to the social network application.
    Type: Application
    Filed: March 14, 2013
    Publication date: September 18, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Shobana Balakrishnan, Surajit Chaudhuri
  • Publication number: 20140280151
    Abstract: Methods, systems and articles of manufacture for discovering relationships among data elements within a dataset are disclosed. A first relationship is identified between a first data element and a second data element by identifying a correlation between a first attribute of the first data element and the first attribute of a second data element. A second relationship indicator is generated that is indicative of a relationship between the first data element and the second data element based on the correlation between the first attribute of the first and second data elements. Various embodiments can identify implicit relationships across one or more levels of explicit relationships where the explicit relationships can be across different attributes. Such techniques can be employed in various types of application programs.
    Type: Application
    Filed: March 16, 2013
    Publication date: September 18, 2014
    Inventor: Fadi Victor Micaelian
  • Publication number: 20140280341
    Abstract: An apparatus, computer-readable medium, and computer-implemented method for contextual data mining using a relational data set includes monitoring one or more data sources for information relating to the relational data set, the relational data set comprising one or more data objects in one or more classes, detecting activity corresponding to a first data object in the one or more data objects based at least in part on information gathered from at least one data source, determining whether the activity exceeds a predefined threshold, identifying a second data object in the one or more data objects which is connected to the first data object based at least in part on an analysis of relationships between the one or more data objects, and transmitting information relating to the second data object based at least in part on a determination that the activity exceeds the predefined threshold.
    Type: Application
    Filed: March 13, 2014
    Publication date: September 18, 2014
    Applicant: Geographic Services, Inc.
    Inventors: Keyvan Rafei, Alex Taranenko
  • Patent number: 8838606
    Abstract: Systems and methods for classifying electronic information or documents into a number of classes and subclasses are provided through an active learning algorithm. In certain embodiments, seed sets may be eliminated by merging relevance feedback and machine learning phases. Such document classification systems are easily scalable for large document collections, require less manpower and can be employed on a single computer, thus requiring fewer resources. Furthermore, the classification systems and methods described can be used for any pattern recognition or classification effort in a wide variety of fields, including electronic discovery in legal proceedings.
    Type: Grant
    Filed: June 18, 2013
    Date of Patent: September 16, 2014
    Inventors: Gordon Villy Cormack, Maura Robin Grossman
  • Publication number: 20140258333
    Abstract: A mechanism is provided for computing the frequency packets in network devices. Respective packets are associated with entities in a vector, where each of the entities is mapped to corresponding ones of the respective packets, and the entities correspond to computers. Upon a network device receiving the respective packets, a count is individually increased for the respective packets in the vector respectively mapped to the entities, and computing a matrix vector product of a matrix A and the vector. The matrix A is a product of at least a first matrix and a second matrix. The first matrix includes rows and columns where each of the rows has a single random location with a one value and remaining locations with zero values. The matrix vector product is transmitted to a centralized computer for aggregating with other matrix vector products.
    Type: Application
    Filed: September 10, 2013
    Publication date: September 11, 2014
    Applicant: International Business Machines Corporation
    Inventor: David P. Woodruff
  • Publication number: 20140258332
    Abstract: A mechanism is provided for computing the frequency packets in network devices. Respective packets are associated with entities in a vector, where each of the entities is mapped to corresponding ones of the respective packets, and the entities correspond to computers. Upon a network device receiving the respective packets, a count is individually increased for the respective packets in the vector respectively mapped to the entities, and computing a matrix vector product of a matrix A and the vector. The matrix A is a product of at least a first matrix and a second matrix. The first matrix includes rows and columns where each of the rows has a single random location with a one value and remaining locations with zero values. The matrix vector product is transmitted to a centralized computer for aggregating with other matrix vector products.
    Type: Application
    Filed: March 8, 2013
    Publication date: September 11, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: David P. Woodruff
  • Publication number: 20140250128
    Abstract: Systems and methods of returning location and/or event results using information social media content are provided. One or more social networking servers are accessed to retrieve social media content. One or more items within the social media content are then identified. These items may then be categorized. Information about the categories of the one or more items are stored in a database storing information about locations or events. A search query for a location or event may be received, and results for the search query may be selected by accessing the database and utilizing the information about locations or events as well as the information about the categories of the one or more items. The results may then be returned to a user device for display.
    Type: Application
    Filed: March 1, 2013
    Publication date: September 4, 2014
    Applicant: eBay
    Inventors: Jeremiah Joseph Akin, Jayasree Mekala, Praveen Nuthulapati, Joseph Vernon Paulson, IV, Kamal Zamer
  • Publication number: 20140250150
    Abstract: A method of searching a pattern of sequence data, includes setting an interest pattern model comprising a length of an interest pattern, a value of an allowed mismatch, and a minimum support, obtaining supports of similar patterns of a child pattern, each of the similar patterns having a mismatch value with the child pattern that is greater than the value of the allowed mismatch, based on mismatch values of similar patterns of a parent pattern, and determining whether a support of the child pattern fulfills a condition of the minimum support based on the supports of the similar patterns of the child pattern, and a support of the parent pattern.
    Type: Application
    Filed: March 4, 2014
    Publication date: September 4, 2014
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Yo-Han ROH, Hyoung-Min PARK, Kyoung-Gu WOO, Joo-Hyuk JEON, Seok-Jin HONG
  • Publication number: 20140250151
    Abstract: Architecture introduces a new pattern operator referred to as called an augmented transition network (ATN), which is a streaming adaptation of non-reentrant, fixed-state ATNs for dynamic patterns. Additional user-defined information is associated with automaton states and is accessible to transitions during execution. ATNs are created that directly model complex pattern continuous queries with arbitrary cycles in a transition graph. The architecture can express the desire to ignore some events during pattern detection, and can also detect the absence of data as part of a pattern. The architecture facilitates efficient support for negation, ignorable events, and state cleanup based on predicate punctuations.
    Type: Application
    Filed: May 13, 2014
    Publication date: September 4, 2014
    Applicant: Microsoft Corporation
    Inventors: Badrish Chandramouli, Jonathan D. Goldstein, David Maier, Mohamed H. Ali, Roman Schindlauer
  • Publication number: 20140250149
    Abstract: An information processing apparatus includes a text mining section configured to perform text mining on text data acquired from the outside and to output extracted information; an identification section configured to search a development database storing elements constituting a product and the relationship among the elements by using the information extracted by text mining to identify an element related to the information; and a notification section configured to notify the identified element to a user, a program for use in the information processing apparatus.
    Type: Application
    Filed: February 19, 2014
    Publication date: September 4, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: SEIJI HAMADA, YUTAKA MORIYA, TADAHIKO NAKAMURA, MASAKI WAKAO, TAKESHI WATANABE
  • Patent number: 8819064
    Abstract: Method, system, and programs for heterogeneous data management. Information from multiple data sources is first obtained. Data/metadata from each of the data sources is modeled based on the source and/or granularity information of the data/metadata to generate data/metadata models. The data/metadata from multiple data sources are integrated, by applying one or more processes to the data/metadata from different data sources based on the data/metadata models, to generate integrated data/metadata. A provenance representation for the integrated data/metadata is created tracing sources, granularities, and/or processes applied and archived for enabling an query associated with the integrated data/metadata.
    Type: Grant
    Filed: February 7, 2011
    Date of Patent: August 26, 2014
    Assignee: Yahoo! Inc.
    Inventors: Chris Olston, Anish Das Sarma
  • Publication number: 20140236996
    Abstract: A search device (101) obtains a specified length to be specified in a search query based on a position of an object having such a position set in accordance with the intent of a user. A detector (102) detects respective positions of multiple objects changing the respective positions in accordance with the intent of the user in a real space. A calculator (103) calculates a specified length on the basis of the intent of the user based on the detected positions of the multiple objects. A searcher (104) searches for product records having a product size satisfying a search condition based on the calculated specified length from a product database managing product records each containing at least a product size and a product image. A display (105) displays on a screen the product image of the searched product record.
    Type: Application
    Filed: April 2, 2012
    Publication date: August 21, 2014
    Applicant: RAKUTEN, INC.
    Inventors: Soh Masuko, Jiro Tanaka, Shigaku Iwabuchi, Kenzo Nirasawa, Tatsuhito Oe
  • Patent number: 8812553
    Abstract: A method for populating a data system is provided. The method includes the step of mapping at least one application path of the data system to at least one conceptual path of an ontology system. The application path addresses parts of the structure of the data system, and the conceptual path addresses parts of the structure of the ontology system. The method further includes the step of automatically populating the data system at a location addressed by the application path with data values contained in the conceptual path.
    Type: Grant
    Filed: April 29, 2010
    Date of Patent: August 19, 2014
    Assignees: Collibra NV/SA, Vrije Universiteit Brussel
    Inventors: Damien Trog, Stijn Christiaens, Pieter De Leenheer, Felix Urbain Yolande Van De Maele, Robert Alfons Meersman
  • Patent number: 8812543
    Abstract: Systems, methods, and computer-readable code stored on a non-transitory media for mining association rules include determining a minimum support threshold and a minimum confidence threshold for association rule mining; determining a sampling model; sampling transactions from a transaction dataset; mining association rules from the sampled transactions; and transmitting mined association rules.
    Type: Grant
    Filed: May 19, 2011
    Date of Patent: August 19, 2014
    Assignee: Infosys Limited
    Inventors: Balasubramanian Kanagasabapathi, K Antony Arokia Durai Raj