Clustering Or Classification (epo) Patents (Class 707/E17.046)
  • Publication number: 20140101155
    Abstract: Deterministic Finite Automatons (DFAs) and Nondeterministic Finite Automatons (NFAs) are two typical automatons used in the Network Intrusion Detection System (NIDS). Although they both perform regular expression matching, they have quite different performance and memory usage properties. DFAs provide fast and deterministic matching performance but suffer from the well-known state explosion problem. NFAs are compact, but their matching performance is unpredictable and with no worst case guarantee. A new automaton representation of regular expressions, called Tunable Finite Automaton (TFA), is described. TFAs resolve the DFAs' state explosion problem and the NFAs' unpredictable performance problem. Different from a DFA, which has only one active state, a TFA allows multiple concurrent active states. Thus, the total number of states required by the TFA to track the matching status is much smaller than that required by the DFA.
    Type: Application
    Filed: October 10, 2012
    Publication date: April 10, 2014
    Inventors: H. Jonathan CHAO, Yang Xu
  • Publication number: 20140101154
    Abstract: An aspect of the present invention simplifies grouping of data items previously stored in a database, the data items being stored in the form of rows and columns in respective tables (in the database). In one embodiment, a system displays a cross product of values from two or more columns in the form of multiple lines, where each line contains a respective value from each of the two or more columns to specify a corresponding criterion (combination of values). In response to receiving inputs indicating the respective groups for each of the lines, the system determines a group for each data item (stored in the database) based on the received inputs. A user is accordingly required to only specify the desired groups corresponding to various combinations of values of the columns to cause grouping of data items in the database.
    Type: Application
    Filed: October 10, 2012
    Publication date: April 10, 2014
    Applicant: Oracle Financial Services Software Limited
    Inventors: Gangadhar Nagulakonda, Rajaram Narasimha Vadapandeshwara, Subramanian Ramakrishnan
  • Publication number: 20140095502
    Abstract: Techniques are provided that address the problems associated with prior approaches for clustering a fact table in a relational database management system. According to one aspect of the invention, a database server clusters a fact table in a database based on one or more dimension tables. More specifically, rows are stored in the fact table in a sorted order and the order in which the rows are sorted is based on values in one or more columns of one or more of the dimension tables. A user specifies the columns of the dimension tables on which the sorted order is based in “clustering criteria”. The database server uses the clustering criteria to automatically store the rows in the fact table in the sorted order in response to certain user-initiated database operations on the fact-table.
    Type: Application
    Filed: September 28, 2012
    Publication date: April 3, 2014
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Mohamed Ziauddin, Andrew Witkowski
  • Publication number: 20140089311
    Abstract: A system, method, and computer-readable medium that facilitate classification of database requests as problematic based on estimated processing characteristics of the request are provided. Estimated processing characteristics may include estimated skew including central processing unit skew and input/output operation skew, central processing unit duration per input/output operation, and estimated memory usage. The estimated processing characteristics are made on a request step basis. The request is classified as problematic responsive to determining one or more of the estimated characteristics of a request step exceed a corresponding threshold. In this manner, mechanisms for predicting bad query behavior are provided. Workload management of those requests may then be more successfully provided through workload throttles, filters, or even a more confident exception detection that correlates with the estimated bad behavior.
    Type: Application
    Filed: September 26, 2012
    Publication date: March 27, 2014
    Inventors: Anita Richards, Douglas Brown, Bruce Britton, Todd Walter
  • Publication number: 20140089090
    Abstract: The invention teaches systems, methods and devices for searching data storage systems and devices by a topical category known as a theme. It is emphasized that this abstract is provided to comply with the rules requiring an abstract that will allow a searcher or other reader to quickly ascertain the subject matter of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. 37 CFR 1.72(b).
    Type: Application
    Filed: September 21, 2012
    Publication date: March 27, 2014
    Inventor: Steven Thrasher
  • Publication number: 20140074839
    Abstract: A user of a network-based system may correspond to a user profile that describes the user. The user profile may describe the user using one or more descriptors of items that correspond to the user (e.g., items owned by the user, items liked by the user, or items rated by the user). In some situations, such a user profile may be characterized as a “taste profile” that describes an array or distribution of one or more tastes, preferences, or habits of the user. Accordingly, the user profile machine within the network-based system may generate the user profile by accessing descriptors of items that correspond to the user, clustering one or more of the descriptors, and generating the user profile based on one or more clusters of the descriptors.
    Type: Application
    Filed: September 12, 2012
    Publication date: March 13, 2014
    Applicant: GRACENOTE, INC.
    Inventors: Phillip Popp, Ching-Wei Chen, Peter C. DiMaria, Markus K. Cremer
  • Publication number: 20140067817
    Abstract: Methods and systems for recommending social networking connections are disclosed. Information is received from a mobile device relating to software applications that are installed on a first user's mobile device. A weight for each software application is calculated based on usage information and each software application is designated to at least one category. A priority is calculated for each category based at least in part on respective weights of software applications designated to the category. A second user, who is not connected to the first user, is detected, wherein the second user has a predetermined number of categories that are the same as the first user's categories, and wherein priorities corresponding to the second user's categories are within a predetermined range of priorities corresponding to the first user's categories. A recommendation to connect with the second user is provided to the first user.
    Type: Application
    Filed: August 30, 2012
    Publication date: March 6, 2014
    Applicant: Yahoo! Inc.
    Inventors: Anupam SETH, Allie K. Watfa, Dale Nussel, Jonathan Kilroy
  • Publication number: 20140067808
    Abstract: Techniques, an apparatus and an article of manufacture for distributed scalable clustering and community detection. A method includes generating a label for each node in a graph, wherein said label identifies a community in which a node participates, propagating each label locally within two or more segments of the graph based on a participation percentage of each node in at least one identified community within the graph, and deriving at least one cluster of nodes in the graph that corresponds to the at least one identified community based on said propagating.
    Type: Application
    Filed: September 6, 2012
    Publication date: March 6, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ankur Narang, Jyothish Soman
  • Publication number: 20140059047
    Abstract: According to one embodiment, an apparatus stores a plurality of datapoints. A datapoint comprises a first value and a second value that depends upon the value of the first value. The apparatus associates the datapoint with a group from a plurality of groups. The group is associated with an identifying range and the datapoint is associated with the group based at least in part upon the first value of the datapoint and the identifying range of the group. The apparatus calculates a median of the second values of the datapoints associated with the group and a performance value by performing a regression based at least in part upon the identifying range and the calculated median of the group. The apparatus determines that the performance value exceeds a baseline value and in response, presents, on a display, an illustration depicting the identifying range and the associated median of the group.
    Type: Application
    Filed: August 27, 2012
    Publication date: February 27, 2014
    Applicant: Bank of America Corporation
    Inventors: Kasilingam B. Laxmanan, Yudong Chen, Julea K. Duke, Ming Xue
  • Publication number: 20140052726
    Abstract: Techniques are described for performing grouping and aggregation operations. In one embodiment, a request is received to aggregate data grouped by a first column. In response to receiving the request, a group value in a row of a first column is mapped to an address. A pointer is stored for a first group at a first location identified by the address. The pointer identifies a second location of a set of aggregation data for the first group. An aggregate value included in the set of aggregation data is updated based on a value in the row of a second column.
    Type: Application
    Filed: August 20, 2012
    Publication date: February 20, 2014
    Inventors: Philip Amberg, Justin Schauer, Robert David Hopkins
  • Publication number: 20140047045
    Abstract: A user creates an event in a social networking system specifying a location, a time, and a guest list of other users invited to the event. The social networking system generates a page associated with the event that provides information about the event and identifies whether users have responded to invitations to the event. The content of the page may be customized for the user viewing the page to encourage the viewing user to attend the event. For example, the viewing user's relationship to and/or similar characteristics with other users on the guest list is determined and used by the social networking system to identify the users whose responses to invitations are shown to the viewing user via the page. Additionally, a notification method more prominently distributes acceptances of invitations to other users to encourage attendance.
    Type: Application
    Filed: August 13, 2012
    Publication date: February 13, 2014
    Inventors: Robert Michael Baldwin, Henry Bridge, Robyn David Morris
  • Publication number: 20140046942
    Abstract: A method for computerized batching of huge populations of electronic documents, including computerized assignment of electronic documents into at least one sequence of electronic document batches such that each document is assigned to a batch in the sequence of batches and such that there is no conflict between batching requirements, the following batching requirements being maintained by a suitably programmed processor: a. pre-defined subsets of documents are always kept together in the same batch, b. batches are equal in size, c. the population is partitioned into clusters, and all documents in any given batch belong to a single cluster rather than to two or more clusters.
    Type: Application
    Filed: August 8, 2012
    Publication date: February 13, 2014
    Applicant: EQUIVIO LTD.
    Inventor: Yiftach RAVID
  • Publication number: 20140040262
    Abstract: Techniques for facilitating a similarity search of digital assets (e.g., audio files, image files, video files, etc.) are described. Consistent with some embodiments, a cloud-based search service manages one or more search tree data structures for use in organizing digital assets to make the digital assets searchable. Each digital asset is associated with a feature vector based on the various attributes and/or characteristics of the digital asset. The digital assets are then assigned to leaf nodes in one or more search tree data structures based on a measure of the distance between the feature vector of the digital asset and a virtual feature vector associated with a leaf node. When a search for similar digital assets is invoked, a prioritized breadth first search of a search tree is performed to identify the digital assets having the feature vectors closest in distance to the reference digital asset.
    Type: Application
    Filed: August 3, 2012
    Publication date: February 6, 2014
    Applicant: Adobe Systems Incorporated
    Inventors: Sven Winter, Jonathan Brandt
  • Publication number: 20140032552
    Abstract: Defining relationships are described. Defining relationships can include retrieving a number of event notifications that correspond to a number of nodes. Defining relationships can include defining a number of group patterns that correspond to the number of event notifications. Defining relationships can also include grouping the number of nodes into a number of groups that correlate with the number of group patterns, the number of groups defining a number of relationships between the number of nodes. Defining relationships can include assigning a number of weights to the number of relationships between the number of nodes, wherein the number of weights are based on a strength of the number of relationships between the number of nodes.
    Type: Application
    Filed: July 30, 2012
    Publication date: January 30, 2014
    Inventors: Ira Cohen, Ruth Bernstein, Yonatan Ben Simhon
  • Publication number: 20140019239
    Abstract: Embodiments for a method for ranking social quality of content published on a plurality of web pages are provided. In an embodiment, the method includes receiving at least one log record from a tracking component on at least one web page. The one log record is indicative of at least one user activity on the at least one web page. The method further includes aggregating the at least one log record corresponding to preferably each of the plurality of web pages based on one or more parameters. The method also includes assigning a first score for preferably each of the plurality of web pages based on the aggregating. The first score is indicative of a social quality of content published in the at least one web page. The method includes ranking the plurality of web pages based on the first score.
    Type: Application
    Filed: July 12, 2012
    Publication date: January 16, 2014
    Inventors: Yan Qu, Nanda Kishore, Timothy Schigel, Juan Valencia, Andrew Stevens, Ishika Paul, Ping Zhu
  • Publication number: 20140019453
    Abstract: Methods and apparatuses for assessing user interest scores of users of a mobile network are provided. A method includes for each of a plurality of users (A) determining initial interest scores corresponding to user's interests and interest scores of friends of the user for the user's interests, based on browsing information, and (B) assessing user's interest scores based on the initial interest scores, the interest scores of the friends and friends' influence. The method further includes outputting a list including a subset of the users selected based on the user's interest scores.
    Type: Application
    Filed: July 13, 2012
    Publication date: January 16, 2014
    Applicant: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)
    Inventors: Saravanan MOHAN, Divya SUNDAR
  • Patent number: 8630890
    Abstract: A method and system for mining a database for product migration analysis includes querying product usage data for a legacy product and a new product from the database as time series data. The product usage data is representative for a large number of consumers of the legacy and new products. A mathematical model may be used to determine a relationship between the two time series data. Product migration values and other features related to product migration, such as a transition period of product usage, may be estimated, determined or predicted.
    Type: Grant
    Filed: December 3, 2008
    Date of Patent: January 14, 2014
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Siu-Tong Au, Rong Duan
  • Publication number: 20140012854
    Abstract: Methods and/or systems are provided that may be utilized to rank categories of an entity based at least in part on relevance.
    Type: Application
    Filed: July 3, 2012
    Publication date: January 9, 2014
    Applicant: Yahoo! Inc.
    Inventor: Syama Prasad Suprasadachandranpilliai
  • Publication number: 20140012849
    Abstract: A technique of extracting hierarchies for multilabel classification. The technique can process a plurality of labels related to a plurality of documents, using a clustering process, to cluster the labels into plurality of clusterings representing a plurality of classes. The technique classifies the documents and predicts a plurality of performance characteristics, respectively, for the plurality of clusterings. The technique selects at least one of the clusterings using information from the performance characteristics and adds the selected clustering into a resulting hierarchy.
    Type: Application
    Filed: July 6, 2012
    Publication date: January 9, 2014
    Inventors: Alexander Ulanov, German Sapozhnikov, Georgy Shevlyakov
  • Publication number: 20140012852
    Abstract: Disclosed are methods and apparatus for correlating metadata from a plurality of different sources. The methods and apparatus may use an order for the data sources. The metadata from each of the data sources may be divided or split into one or more chunks. The metadata from each of the chunks may be filtered and sorted, e.g., to ensure that the metadata relate to the same multimedia content. The metadata chunks from the first data source in the order and the second data source in the order may then be aligned to produce currently aligned metadata. The metadata data chunks from the next data source in the order may then be aligned with the currently aligned metadata to produce new currently aligned metadata. This process may be repeated until the metadata from all of the sources are aligned, thereby providing a set of correlated metadata.
    Type: Application
    Filed: July 3, 2012
    Publication date: January 9, 2014
    Applicant: SETJAM, INC.
    Inventors: Grzegorz Kapkowski, Marcin Kaszynski, Marek M. Stepniowski
  • Publication number: 20140012847
    Abstract: Embodiments of an inspection system and method for a collection of information objects, for example, a collection of executable software applications may be inspected for computer viruses, or a collection of genomes may be inspected for common or unique gene sequences. Information objects may contain identified sequences of instructions, each of which may be labeled with a symbol. In the software context, programming languages may include symbols that indicate functionality. In some embodiments, an inspection of the statistical properties of the information objects and their included symbols may allow for the symbols (and thus instruction sequences) to be grouped into logical components. In some embodiments, objects that include individual logical components may be grouped together. These groupings and their dependencies may be used to determine the structure of each object by detailing its constituent components, how they relate or depend on one another, and how the information object may function.
    Type: Application
    Filed: July 5, 2012
    Publication date: January 9, 2014
    Applicant: Raytheon BBN Technologies Corp.
    Inventor: Richard Lee Barnes, II
  • Publication number: 20140012818
    Abstract: Disclosed are methods and apparatus for processing correlated metadata (e.g., programmatic metadata relating to one or more episodes of a television show). Mappings, or correlations, between chunks of the metadata that originated from a particular data source and the metadata clusters may be determined and displayed, e.g., on a graphical user interface. Using this display, a user (i.e., a human operator) may detect inconsistencies in the correlated metadata. An inconsistency may be an incorrect mapping, the mapping of more than one of the metadata chunks that originated from the same data source to the same metadata cluster, or that one or more of the metadata chunks have not been mapped to a metadata cluster. The mappings may then be edited so as to remove detected inconsistencies.
    Type: Application
    Filed: July 3, 2012
    Publication date: January 9, 2014
    Applicant: SETJAM, INC.
    Inventors: Marcin Kaszynski, Grzegorz Kapkowski, Marek M. Stepniowski
  • Publication number: 20140006399
    Abstract: Method, apparatus, and programs for recommending websites. Information related to a user's browsing history of a plurality of websites is obtained. A browsing co-occurrence of at least some of the plurality of websites in one or more time periods is determined based on the obtained information related to the user's browsing history. The plurality of websites are assigned to a plurality of website groups based on the determined browsing co-occurrence. Each of the plurality of website groups is associated with one of the one or more time periods. At least one of the plurality of website groups is presented to the user based on their associated time periods.
    Type: Application
    Filed: June 29, 2012
    Publication date: January 2, 2014
    Applicant: Yahoo! Inc.
    Inventors: Sudharsan Vasudevan, Eugene Kouichi Kashida, Ethan Batraski
  • Publication number: 20140006402
    Abstract: A contents distribution server using an identification code of contents is disclosed. The apparatus includes an interface providing unit configured to provide an interface for registration of the contents to a device, if a request for the registration of the contents is received from the device; an code information extraction unit configured to extract code information from input information through the interface; an identification code generation unit configured to generate the identification code by combining codes corresponding to the extracted code information, and a contents distribution unit configured to match the contents with the generated identification code, register the matched contents in a database and transmit the registered contents to a contents managing server with reference to the identification code.
    Type: Application
    Filed: July 2, 2012
    Publication date: January 2, 2014
    Applicant: KT CORPORATION
    Inventors: Sang-Bum LEE, Chang-Seuk OK, Hye-Mi KIM, Se-Cheol PARK, Joo-Young YOON
  • Publication number: 20140006400
    Abstract: A system and method of managing online social networking which includes identifying a plurality of users related to a primary user on a social networking tool using a computer. The method and system identifies a plurality of activities performed by the plurality of users on the social networking tool, and assigning a score to each of the activities. A threshold cumulative score for users to enter a group is defined. The system and method evaluates the activities of each of the users, and calculates a cumulative score for each of the users based on their respective activities, and evaluates the cumulative score of each of users in relation to the group. One or more of the plurality of users who meet the threshold cumulative score are assigned to the group. A status for each user in the group based on their cumulative score is determined.
    Type: Application
    Filed: June 29, 2012
    Publication date: January 2, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Paul R. Bastide, Matthew E. Broomhall, Robert E. Loredo
  • Publication number: 20130339357
    Abstract: Embodiments of the invention include methods for identifying one or more clusters in a streaming graph, the method includes receiving a stream of edges and sampling the stream of edges to create a structural reservoir and support reservoir. The method also includes creating a sampled graph from the structural reservoir and identifying the one or more clusters in the sampled graph by grouping one or more connected vertices in the sampled graph.
    Type: Application
    Filed: June 26, 2012
    Publication date: December 19, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ahmed S. Eldawy, Rohit M. Khandekar, Kun-Lung Wu
  • Publication number: 20130339355
    Abstract: A system for clustering vertices in a streaming graph includes a structural sampler configured to receive a stream of edges. The structural sampler includes a reservoir manager configured to receive the stream of edges and create a structural reservoir and a support reservoir and a graph manager configured to receive the structural reservoir from the reservoir manager and to create a sampled graph from the structural reservoir, wherein the sampled graph includes one or more clusters that each include one or more connected vertices.
    Type: Application
    Filed: June 14, 2012
    Publication date: December 19, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ahmed S. Eldawy, Rohit M. Khandekar, Kun-Lung Wu
  • Publication number: 20130332450
    Abstract: A method for automatically extracting and organizing information by a processing device from a plurality of data sources is provided. A natural language processing information extraction pipeline that includes an automatic detection of entities is applied to the data sources. Information about detected entities is identified by analyzing products of the natural language processing pipeline. Identified information is grouped into equivalence classes containing equivalent information. At least one displayable representation of the equivalence classes is created. An order in which the at least one displayable representation is displayed is computed. A combined representation of the equivalence classes that respects the order in which the displayable representation is displayed is produced.
    Type: Application
    Filed: June 11, 2012
    Publication date: December 12, 2013
    Applicant: International Business Machines Corporation
    Inventors: Vittorio Castelli, Radu Florian, Xiaoqiang Luo, Hema Raghavan
  • Publication number: 20130325863
    Abstract: Embodiments of the invention relate to a modeling activity area associated with groups of data items. Tools are provided to profile activity area involvement, both from the data item and from associated participants. The data items are placed into clusters and one or more activity areas are derived from the formed clusters. Each activity area is defined from the perspective of a single user. Participants in an activity area are connected to a user, but not necessarily to each other. The combination of formations of clusters and activity areas provides a multi-facetted organization of connections between data items and associated participants.
    Type: Application
    Filed: August 28, 2012
    Publication date: December 5, 2013
    Applicant: International Business Machines Corporation
    Inventor: Hongxia Jin
  • Publication number: 20130325866
    Abstract: Embodiments of the invention relate to modeling communities associated with groups of data items. Tools are provided to iteratively assign data items to communities and to update topic and participant distribution in the assigned communities. As the distributions are updated, the characteristics of the communities are updated. Each activity area is defined from the perspective of a single user. Participants in a community are connected to a user, but not necessarily to each other. The combination of formations of communities and the statistical aspect of evaluating characteristics of the communities provides a multi-facetted organization of connections between data items and associated participants.
    Type: Application
    Filed: May 31, 2012
    Publication date: December 5, 2013
    Applicant: International Business Machines Corporation
    Inventors: Hongxia Jin, Yan Liu, Wenjun Zhou
  • Publication number: 20130325867
    Abstract: The disclosure generally describes computer-implemented methods, software, and systems for providing a homogeneous data model based on in-memory database views. One computer-implemented method includes creating an application view field associated with an application view, indicating a base database field in a base database table for the created application view field, collecting additional information associated with the indicated base database field, determining at least a data element and a domain associated with the indicated base database field using the collected additional information, determining, by operation of a computer using the collected additional information, that multiple determined catalog entries associated with the indicated base database field exist in a catalog, and proposing names for the application view field, wherein the proposed names are presented from most specific to least specific.
    Type: Application
    Filed: June 4, 2012
    Publication date: December 5, 2013
    Applicant: SAP AG
    Inventors: 69190 Kemmler, Torsten Kamenz
  • Publication number: 20130311437
    Abstract: A system and method obtain a database stored on a storage device containing information on multiple assets, the information including measurements taken from devices monitoring each asset, and context information corresponding to the environment the items are subjected to. The system and method groups assets via a computer system into a homogenous group as a function of selected context information and performs analytics via the computer system on the grouped assets to manage the assets.
    Type: Application
    Filed: May 16, 2012
    Publication date: November 21, 2013
    Applicant: Honeywell Internatioanl Inc.
    Inventors: Petr Stluka, Eva Jerhotova, Karel Marik, Ondrej Holub, Wendy Foslien, Rylan Clark
  • Publication number: 20130311467
    Abstract: A method and a system for coreference resolution are provided. The method includes receiving a set of document clusters, each cluster in the set of document clusters including a set of text documents. Instances of each of a set of candidate named entities are identified in the document clusters. For a pairs of the candidate named entities, at least one socio-temporal feature is computed that is based on the similarity of the distributions of identified instances of the respective candidate name entities among the document clusters. A decision for merging for the candidate named entities into a common real named entity is based on the socio-temporal features.
    Type: Application
    Filed: May 18, 2012
    Publication date: November 21, 2013
    Applicant: Xerox Corporation
    Inventors: Matthias Gallé, Jean-Michel Renders, Guillaume Jacquet
  • Publication number: 20130305058
    Abstract: A method, system and computer program product for controlling enterprise data on mobile devices. Data on a mobile device is tagged as being associated with either enterprise data or with personal data. Upon identifying the storage location of the tagged data and the identifier of the application that generated the tagged data, the tag, the storage location of the tagged data and the identifier of the application are stored in an index. A mobile agent residing on the mobile device may be directed by a mobile device management server of the enterprise to perform various actions (e.g., deleting, encrypting, backing-up) on the enterprise data using the index. In this manner, the enterprise has the ability to control their applications and data that resides on employees' mobile devices to ensure that such data is not lost or used in a manner that is contrary to the wishes of the employer.
    Type: Application
    Filed: May 15, 2012
    Publication date: November 14, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Shalini Kapoor, Palanivel A. Kodeswaran, Sridhar R. Muppidi, Nataraj Nagaratnam, Vikrant Nandakumar
  • Publication number: 20130304741
    Abstract: Method, system, and programs for providing identifiers to objects. Input data representing a plurality of objects is received and categorized into a plurality of entity categories. A first graph of entities is generated using the plurality of entity categories. The first graph of entities are matched with a second graph of entities. A comparison of object pairs is then made, in which each object pair includes a first object from the first graph of entities and a corresponding second object from the second graph of entities. Identifiers are assigned to each object based on comparing the object pairs.
    Type: Application
    Filed: May 10, 2012
    Publication date: November 14, 2013
    Applicant: YAHOO! INC.
    Inventors: Balaji Kannan, Aamod Sane, Zhiwei Gu
  • Publication number: 20130304738
    Abstract: Systems, methods and computer program products manage collections of information using latent semantic analysis. The collections of information may be text based such as collections of documents or non-text data such as audio, image, video or multimedia data. Semantic information groups are created by grouping collections of information according to a degree of relatedness. A system allocates discontiguous node locations of one or more distributed databases to the semantic information groups. The system manages a dynamic semantic table that maps the discontiguous node locations to a semantic virtual table having a contiguous memory space.
    Type: Application
    Filed: May 11, 2012
    Publication date: November 14, 2013
    Applicant: International Business Machines Corporation
    Inventors: Sandra K. Johnson, Grant D. Miller
  • Publication number: 20130304737
    Abstract: A classification system executing on one or more computer systems includes a processor and a memory coupled to the processor. The memory includes a discovery engine configured to navigate through non-volatile memory storage to discover an identity and location of one or more files in one or more computer storage systems by tracing the one or more files from file system mount points through file system objects and to disk objects. A classifier is configured to classify the one or more the files into a classification category. The one or more files are associated with the classification category and stored in at least one data structure. Methods are also provided.
    Type: Application
    Filed: May 10, 2012
    Publication date: November 14, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: NIKOLAI JOUKOV, AMITKUMAR M. PARADKAR, BIRGIT M. PFITZMANN, WILLIAM R. REOHR, PETER URBANETZ
  • Publication number: 20130297603
    Abstract: A monitoring system includes a database storing configuration information about a plurality of objects in the data center; a first inventory instance that adds a first object to the database, where the first inventory instance classifies the first object based on a set of classification rules to select a set of monitoring rules for the first object based on its classification and add configuration information about the first object to the configuration database; and a first monitoring instance to monitor the first object, the monitoring instance monitoring status of the first object based on respective configuration information in the database; at least one of the first inventory instance and the first monitoring instance identifying a further object functionally connected to the first object, the further objects added to the database by the first or a second inventory instance and monitored by the first or a second monitoring instance.
    Type: Application
    Filed: May 1, 2012
    Publication date: November 7, 2013
    Applicant: Fujitsu Technology Solutions Intellectual Property GmbH
    Inventors: Fritz Brenker, Michael Burnicki, Patrick Kaspari, Oliver Niehörster, Ulrich Recker
  • Publication number: 20130290333
    Abstract: A system for extracting customer feedback from a microblog site includes a retrieval unit coupled to the microblog site to capture microblog updates. A filter unit coupled to the retrieval unit filters the captured microblog updates according to filter criteria that remove non-actionable items from the captured microblog updates. A learning unit coupled to the filter unit prioritizes the filtered microblog updates, and a classification unit coupled to the learning unit classifies the filtered and prioritized microblog updates. An action unit coupled to the classification unit performs appropriate actions based on the classified, filtered and prioritized microblog updates.
    Type: Application
    Filed: April 27, 2012
    Publication date: October 31, 2013
    Applicant: Benbria Corporation
    Inventors: Wojciech Fraczak, Ying Du
  • Publication number: 20130282725
    Abstract: A system is disclosed for automation-assisted curation of technical information from technical support tickets into a technical information knowledge base. In one example, a method includes mapping information from a plurality of fields of a support ticket in a technical support reporting tool to a plurality of corresponding fields of a structured information file. The method further includes rendering the structured information file in a user-editable format in a user interface; saving user inputs to the structured information file, thereby generating a curated structured information file that incorporates the mapped information and the user inputs; and saving the curated structured information file to a searchable technical support information data store.
    Type: Application
    Filed: April 24, 2012
    Publication date: October 24, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Benjamin I. Rubinger
  • Publication number: 20130276089
    Abstract: A security application manages security and reliability of networked applications executing collection of interacting computing elements within a distributed computing architecture. The security application monitors various classes of resources utilized by the collection of nodes within the distributed computing architecture and determine whether utilization of a class of resources is approaching a pre-determined maximum limit. The security application performs a vulnerability scan of a networked application to determine whether the networked application is prone to a risk of intentional or inadvertent breach by an external application. The security application scans a distributed computing architecture for the existence of access control lists (ACLs), and stores ACL configurations and configuration changes in a database.
    Type: Application
    Filed: April 12, 2012
    Publication date: October 17, 2013
    Inventors: Ariel Tseitlin, Roy Rapoport, Jason Chan
  • Publication number: 20130246435
    Abstract: A knowledge extraction framework may iteratively enrich an ontology that is used to classify structured knowledge obtained from web pages based on structured knowledge previously acquired from other web pages. The framework may enable a user to define the ontology for extracting structured knowledge from a plurality of web pages. The framework applies the ontology using a supervised extraction algorithm to extract seed information from a set of web pages. The framework further applies an unsupervised extraction algorithm to extract the structured knowledge from an additional set of web pages. The framework subsequently maps the structured knowledge to the ontology based on the seed information to enrich the ontology.
    Type: Application
    Filed: March 14, 2012
    Publication date: September 19, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Jun Yan, Lei Ji, Edward W. Wild, Yi Li, Ning Liu, Zheng Chen
  • Publication number: 20130246429
    Abstract: A canopy clustering process merges at least one set of multiple single-center canopies together into a merged multi-center canopy. Multi-center canopies, as well as the single-center canopies, can then be used to partition data objects in a dataset. The multi-center canopies allow a canopy assignment condition constraint to be relaxed without risk of leaving any data objects in a dataset outside of all canopies. Approximate distance calculations can be used as similarity metrics to define and merge canopies and to assign data objects to canopies. In one implementation, a distance between a data object and a canopy is represented as the minimum of the distances between the data object and each center of a canopy (whether merged or unmerged), and the distance between two canopies is represented as the minimum of the distances for each pairing of the center(s) in one canopy and the center(s) in the other canopy.
    Type: Application
    Filed: March 19, 2012
    Publication date: September 19, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Xiong Zhang, Danny Lange, Hung-Chih Yang
  • Publication number: 20130246424
    Abstract: A method is provided in one example embodiment and it includes receiving a state request and determining whether a state exists in a translation dictionary for the state request. The method further includes reproducing the state if it is not in the dictionary and adding a new state to the dictionary. In more specific embodiments, the method includes compiling a rule, based on the state, into a given state table. The rule affects data management for one or more documents that satisfy the rule. In yet other embodiments, the method includes determining that the state represents a final state such that a descriptor is added to the state. In one example, if the state is not referenced in the algorithm, then the state is released. If the state is referenced in the algorithm, then the state is replaced with the new state.
    Type: Application
    Filed: March 30, 2012
    Publication date: September 19, 2013
    Inventors: William Deninger, Ratinder Paul Singh Ahuja, Lee C. Cheung
  • Publication number: 20130238610
    Abstract: Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.
    Type: Application
    Filed: March 7, 2012
    Publication date: September 12, 2013
    Applicant: International Business Machines Corporation
    Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
  • Publication number: 20130238611
    Abstract: Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.
    Type: Application
    Filed: March 8, 2012
    Publication date: September 12, 2013
    Applicant: International Business Machines Corporation
    Inventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
  • Patent number: 8527515
    Abstract: Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.
    Type: Grant
    Filed: November 7, 2011
    Date of Patent: September 3, 2013
    Assignee: Oracle OTC Subsidiary LLC
    Inventors: Vladimir V. Zelevinsky, Omri Traub, Vladimir Gluzman Peregrine, Daniel Tunkelang, Joyce Jeanpin Wang
  • Publication number: 20130226920
    Abstract: The invention provides in some aspects methods of digital data processor-based analysis of digital data that represent interactions to identify distinct individuals and/or the entities with which they are affiliated (e.g., households, businesses, social or other groups) involved in those interactions. The methods can be employed, for example, to analyze digital data representing retail purchase, marketing and visitor interactions for tracking and/or reporting purposes.
    Type: Application
    Filed: February 28, 2012
    Publication date: August 29, 2013
    Applicant: CQuotient, Inc.
    Inventors: Bharath K. Krishnan, Vishwamitra S. Ramakrishnan
  • Publication number: 20130226926
    Abstract: An approach for enabling mobile device users to acquire information regarding events in their proximity on demand is described. An event determination platform processes and/or facilitates a processing of captured data (e.g., images, audio, video, etc.) that depict, at least in part, one or more events to determine one or more characteristics of the one or more events, the captured data, or a combination thereof. The event determination platform further causes, at least in part, an identification of one or more events based, at least in part, on a comparison of the one or more characteristics against one or more other characteristics associated with one or more registered events.
    Type: Application
    Filed: February 29, 2012
    Publication date: August 29, 2013
    Applicant: Nokia Corporation
    Inventor: Jerome Beaurepaire
  • Publication number: 20130226921
    Abstract: A method for identifying an auto-complete communication pattern within a sequence of request entities includes grouping the request entities into a plurality of clusters according to a criterion. Clusters are removed from the plurality according to at least one of pattern analysis, a cluster size, and a cluster timing. Remaining clusters are identified as having an auto-complete communication pattern.
    Type: Application
    Filed: February 29, 2012
    Publication date: August 29, 2013
    Inventor: Ofer Eliassaf