Clustering Or Classification (epo) Patents (Class 707/E17.046)
-
Publication number: 20140101155Abstract: Deterministic Finite Automatons (DFAs) and Nondeterministic Finite Automatons (NFAs) are two typical automatons used in the Network Intrusion Detection System (NIDS). Although they both perform regular expression matching, they have quite different performance and memory usage properties. DFAs provide fast and deterministic matching performance but suffer from the well-known state explosion problem. NFAs are compact, but their matching performance is unpredictable and with no worst case guarantee. A new automaton representation of regular expressions, called Tunable Finite Automaton (TFA), is described. TFAs resolve the DFAs' state explosion problem and the NFAs' unpredictable performance problem. Different from a DFA, which has only one active state, a TFA allows multiple concurrent active states. Thus, the total number of states required by the TFA to track the matching status is much smaller than that required by the DFA.Type: ApplicationFiled: October 10, 2012Publication date: April 10, 2014Inventors: H. Jonathan CHAO, Yang Xu
-
Publication number: 20140101154Abstract: An aspect of the present invention simplifies grouping of data items previously stored in a database, the data items being stored in the form of rows and columns in respective tables (in the database). In one embodiment, a system displays a cross product of values from two or more columns in the form of multiple lines, where each line contains a respective value from each of the two or more columns to specify a corresponding criterion (combination of values). In response to receiving inputs indicating the respective groups for each of the lines, the system determines a group for each data item (stored in the database) based on the received inputs. A user is accordingly required to only specify the desired groups corresponding to various combinations of values of the columns to cause grouping of data items in the database.Type: ApplicationFiled: October 10, 2012Publication date: April 10, 2014Applicant: Oracle Financial Services Software LimitedInventors: Gangadhar Nagulakonda, Rajaram Narasimha Vadapandeshwara, Subramanian Ramakrishnan
-
Publication number: 20140095502Abstract: Techniques are provided that address the problems associated with prior approaches for clustering a fact table in a relational database management system. According to one aspect of the invention, a database server clusters a fact table in a database based on one or more dimension tables. More specifically, rows are stored in the fact table in a sorted order and the order in which the rows are sorted is based on values in one or more columns of one or more of the dimension tables. A user specifies the columns of the dimension tables on which the sorted order is based in “clustering criteria”. The database server uses the clustering criteria to automatically store the rows in the fact table in the sorted order in response to certain user-initiated database operations on the fact-table.Type: ApplicationFiled: September 28, 2012Publication date: April 3, 2014Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: Mohamed Ziauddin, Andrew Witkowski
-
Publication number: 20140089311Abstract: A system, method, and computer-readable medium that facilitate classification of database requests as problematic based on estimated processing characteristics of the request are provided. Estimated processing characteristics may include estimated skew including central processing unit skew and input/output operation skew, central processing unit duration per input/output operation, and estimated memory usage. The estimated processing characteristics are made on a request step basis. The request is classified as problematic responsive to determining one or more of the estimated characteristics of a request step exceed a corresponding threshold. In this manner, mechanisms for predicting bad query behavior are provided. Workload management of those requests may then be more successfully provided through workload throttles, filters, or even a more confident exception detection that correlates with the estimated bad behavior.Type: ApplicationFiled: September 26, 2012Publication date: March 27, 2014Inventors: Anita Richards, Douglas Brown, Bruce Britton, Todd Walter
-
Publication number: 20140089090Abstract: The invention teaches systems, methods and devices for searching data storage systems and devices by a topical category known as a theme. It is emphasized that this abstract is provided to comply with the rules requiring an abstract that will allow a searcher or other reader to quickly ascertain the subject matter of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. 37 CFR 1.72(b).Type: ApplicationFiled: September 21, 2012Publication date: March 27, 2014Inventor: Steven Thrasher
-
Publication number: 20140074839Abstract: A user of a network-based system may correspond to a user profile that describes the user. The user profile may describe the user using one or more descriptors of items that correspond to the user (e.g., items owned by the user, items liked by the user, or items rated by the user). In some situations, such a user profile may be characterized as a “taste profile” that describes an array or distribution of one or more tastes, preferences, or habits of the user. Accordingly, the user profile machine within the network-based system may generate the user profile by accessing descriptors of items that correspond to the user, clustering one or more of the descriptors, and generating the user profile based on one or more clusters of the descriptors.Type: ApplicationFiled: September 12, 2012Publication date: March 13, 2014Applicant: GRACENOTE, INC.Inventors: Phillip Popp, Ching-Wei Chen, Peter C. DiMaria, Markus K. Cremer
-
Publication number: 20140067817Abstract: Methods and systems for recommending social networking connections are disclosed. Information is received from a mobile device relating to software applications that are installed on a first user's mobile device. A weight for each software application is calculated based on usage information and each software application is designated to at least one category. A priority is calculated for each category based at least in part on respective weights of software applications designated to the category. A second user, who is not connected to the first user, is detected, wherein the second user has a predetermined number of categories that are the same as the first user's categories, and wherein priorities corresponding to the second user's categories are within a predetermined range of priorities corresponding to the first user's categories. A recommendation to connect with the second user is provided to the first user.Type: ApplicationFiled: August 30, 2012Publication date: March 6, 2014Applicant: Yahoo! Inc.Inventors: Anupam SETH, Allie K. Watfa, Dale Nussel, Jonathan Kilroy
-
Publication number: 20140067808Abstract: Techniques, an apparatus and an article of manufacture for distributed scalable clustering and community detection. A method includes generating a label for each node in a graph, wherein said label identifies a community in which a node participates, propagating each label locally within two or more segments of the graph based on a participation percentage of each node in at least one identified community within the graph, and deriving at least one cluster of nodes in the graph that corresponds to the at least one identified community based on said propagating.Type: ApplicationFiled: September 6, 2012Publication date: March 6, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ankur Narang, Jyothish Soman
-
Publication number: 20140059047Abstract: According to one embodiment, an apparatus stores a plurality of datapoints. A datapoint comprises a first value and a second value that depends upon the value of the first value. The apparatus associates the datapoint with a group from a plurality of groups. The group is associated with an identifying range and the datapoint is associated with the group based at least in part upon the first value of the datapoint and the identifying range of the group. The apparatus calculates a median of the second values of the datapoints associated with the group and a performance value by performing a regression based at least in part upon the identifying range and the calculated median of the group. The apparatus determines that the performance value exceeds a baseline value and in response, presents, on a display, an illustration depicting the identifying range and the associated median of the group.Type: ApplicationFiled: August 27, 2012Publication date: February 27, 2014Applicant: Bank of America CorporationInventors: Kasilingam B. Laxmanan, Yudong Chen, Julea K. Duke, Ming Xue
-
Publication number: 20140052726Abstract: Techniques are described for performing grouping and aggregation operations. In one embodiment, a request is received to aggregate data grouped by a first column. In response to receiving the request, a group value in a row of a first column is mapped to an address. A pointer is stored for a first group at a first location identified by the address. The pointer identifies a second location of a set of aggregation data for the first group. An aggregate value included in the set of aggregation data is updated based on a value in the row of a second column.Type: ApplicationFiled: August 20, 2012Publication date: February 20, 2014Inventors: Philip Amberg, Justin Schauer, Robert David Hopkins
-
Publication number: 20140047045Abstract: A user creates an event in a social networking system specifying a location, a time, and a guest list of other users invited to the event. The social networking system generates a page associated with the event that provides information about the event and identifies whether users have responded to invitations to the event. The content of the page may be customized for the user viewing the page to encourage the viewing user to attend the event. For example, the viewing user's relationship to and/or similar characteristics with other users on the guest list is determined and used by the social networking system to identify the users whose responses to invitations are shown to the viewing user via the page. Additionally, a notification method more prominently distributes acceptances of invitations to other users to encourage attendance.Type: ApplicationFiled: August 13, 2012Publication date: February 13, 2014Inventors: Robert Michael Baldwin, Henry Bridge, Robyn David Morris
-
Publication number: 20140046942Abstract: A method for computerized batching of huge populations of electronic documents, including computerized assignment of electronic documents into at least one sequence of electronic document batches such that each document is assigned to a batch in the sequence of batches and such that there is no conflict between batching requirements, the following batching requirements being maintained by a suitably programmed processor: a. pre-defined subsets of documents are always kept together in the same batch, b. batches are equal in size, c. the population is partitioned into clusters, and all documents in any given batch belong to a single cluster rather than to two or more clusters.Type: ApplicationFiled: August 8, 2012Publication date: February 13, 2014Applicant: EQUIVIO LTD.Inventor: Yiftach RAVID
-
Publication number: 20140040262Abstract: Techniques for facilitating a similarity search of digital assets (e.g., audio files, image files, video files, etc.) are described. Consistent with some embodiments, a cloud-based search service manages one or more search tree data structures for use in organizing digital assets to make the digital assets searchable. Each digital asset is associated with a feature vector based on the various attributes and/or characteristics of the digital asset. The digital assets are then assigned to leaf nodes in one or more search tree data structures based on a measure of the distance between the feature vector of the digital asset and a virtual feature vector associated with a leaf node. When a search for similar digital assets is invoked, a prioritized breadth first search of a search tree is performed to identify the digital assets having the feature vectors closest in distance to the reference digital asset.Type: ApplicationFiled: August 3, 2012Publication date: February 6, 2014Applicant: Adobe Systems IncorporatedInventors: Sven Winter, Jonathan Brandt
-
Publication number: 20140032552Abstract: Defining relationships are described. Defining relationships can include retrieving a number of event notifications that correspond to a number of nodes. Defining relationships can include defining a number of group patterns that correspond to the number of event notifications. Defining relationships can also include grouping the number of nodes into a number of groups that correlate with the number of group patterns, the number of groups defining a number of relationships between the number of nodes. Defining relationships can include assigning a number of weights to the number of relationships between the number of nodes, wherein the number of weights are based on a strength of the number of relationships between the number of nodes.Type: ApplicationFiled: July 30, 2012Publication date: January 30, 2014Inventors: Ira Cohen, Ruth Bernstein, Yonatan Ben Simhon
-
Publication number: 20140019239Abstract: Embodiments for a method for ranking social quality of content published on a plurality of web pages are provided. In an embodiment, the method includes receiving at least one log record from a tracking component on at least one web page. The one log record is indicative of at least one user activity on the at least one web page. The method further includes aggregating the at least one log record corresponding to preferably each of the plurality of web pages based on one or more parameters. The method also includes assigning a first score for preferably each of the plurality of web pages based on the aggregating. The first score is indicative of a social quality of content published in the at least one web page. The method includes ranking the plurality of web pages based on the first score.Type: ApplicationFiled: July 12, 2012Publication date: January 16, 2014Inventors: Yan Qu, Nanda Kishore, Timothy Schigel, Juan Valencia, Andrew Stevens, Ishika Paul, Ping Zhu
-
Publication number: 20140019453Abstract: Methods and apparatuses for assessing user interest scores of users of a mobile network are provided. A method includes for each of a plurality of users (A) determining initial interest scores corresponding to user's interests and interest scores of friends of the user for the user's interests, based on browsing information, and (B) assessing user's interest scores based on the initial interest scores, the interest scores of the friends and friends' influence. The method further includes outputting a list including a subset of the users selected based on the user's interest scores.Type: ApplicationFiled: July 13, 2012Publication date: January 16, 2014Applicant: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)Inventors: Saravanan MOHAN, Divya SUNDAR
-
Patent number: 8630890Abstract: A method and system for mining a database for product migration analysis includes querying product usage data for a legacy product and a new product from the database as time series data. The product usage data is representative for a large number of consumers of the legacy and new products. A mathematical model may be used to determine a relationship between the two time series data. Product migration values and other features related to product migration, such as a transition period of product usage, may be estimated, determined or predicted.Type: GrantFiled: December 3, 2008Date of Patent: January 14, 2014Assignee: AT&T Intellectual Property I, L.P.Inventors: Siu-Tong Au, Rong Duan
-
Publication number: 20140012854Abstract: Methods and/or systems are provided that may be utilized to rank categories of an entity based at least in part on relevance.Type: ApplicationFiled: July 3, 2012Publication date: January 9, 2014Applicant: Yahoo! Inc.Inventor: Syama Prasad Suprasadachandranpilliai
-
Publication number: 20140012849Abstract: A technique of extracting hierarchies for multilabel classification. The technique can process a plurality of labels related to a plurality of documents, using a clustering process, to cluster the labels into plurality of clusterings representing a plurality of classes. The technique classifies the documents and predicts a plurality of performance characteristics, respectively, for the plurality of clusterings. The technique selects at least one of the clusterings using information from the performance characteristics and adds the selected clustering into a resulting hierarchy.Type: ApplicationFiled: July 6, 2012Publication date: January 9, 2014Inventors: Alexander Ulanov, German Sapozhnikov, Georgy Shevlyakov
-
Publication number: 20140012852Abstract: Disclosed are methods and apparatus for correlating metadata from a plurality of different sources. The methods and apparatus may use an order for the data sources. The metadata from each of the data sources may be divided or split into one or more chunks. The metadata from each of the chunks may be filtered and sorted, e.g., to ensure that the metadata relate to the same multimedia content. The metadata chunks from the first data source in the order and the second data source in the order may then be aligned to produce currently aligned metadata. The metadata data chunks from the next data source in the order may then be aligned with the currently aligned metadata to produce new currently aligned metadata. This process may be repeated until the metadata from all of the sources are aligned, thereby providing a set of correlated metadata.Type: ApplicationFiled: July 3, 2012Publication date: January 9, 2014Applicant: SETJAM, INC.Inventors: Grzegorz Kapkowski, Marcin Kaszynski, Marek M. Stepniowski
-
Publication number: 20140012847Abstract: Embodiments of an inspection system and method for a collection of information objects, for example, a collection of executable software applications may be inspected for computer viruses, or a collection of genomes may be inspected for common or unique gene sequences. Information objects may contain identified sequences of instructions, each of which may be labeled with a symbol. In the software context, programming languages may include symbols that indicate functionality. In some embodiments, an inspection of the statistical properties of the information objects and their included symbols may allow for the symbols (and thus instruction sequences) to be grouped into logical components. In some embodiments, objects that include individual logical components may be grouped together. These groupings and their dependencies may be used to determine the structure of each object by detailing its constituent components, how they relate or depend on one another, and how the information object may function.Type: ApplicationFiled: July 5, 2012Publication date: January 9, 2014Applicant: Raytheon BBN Technologies Corp.Inventor: Richard Lee Barnes, II
-
Publication number: 20140012818Abstract: Disclosed are methods and apparatus for processing correlated metadata (e.g., programmatic metadata relating to one or more episodes of a television show). Mappings, or correlations, between chunks of the metadata that originated from a particular data source and the metadata clusters may be determined and displayed, e.g., on a graphical user interface. Using this display, a user (i.e., a human operator) may detect inconsistencies in the correlated metadata. An inconsistency may be an incorrect mapping, the mapping of more than one of the metadata chunks that originated from the same data source to the same metadata cluster, or that one or more of the metadata chunks have not been mapped to a metadata cluster. The mappings may then be edited so as to remove detected inconsistencies.Type: ApplicationFiled: July 3, 2012Publication date: January 9, 2014Applicant: SETJAM, INC.Inventors: Marcin Kaszynski, Grzegorz Kapkowski, Marek M. Stepniowski
-
Publication number: 20140006399Abstract: Method, apparatus, and programs for recommending websites. Information related to a user's browsing history of a plurality of websites is obtained. A browsing co-occurrence of at least some of the plurality of websites in one or more time periods is determined based on the obtained information related to the user's browsing history. The plurality of websites are assigned to a plurality of website groups based on the determined browsing co-occurrence. Each of the plurality of website groups is associated with one of the one or more time periods. At least one of the plurality of website groups is presented to the user based on their associated time periods.Type: ApplicationFiled: June 29, 2012Publication date: January 2, 2014Applicant: Yahoo! Inc.Inventors: Sudharsan Vasudevan, Eugene Kouichi Kashida, Ethan Batraski
-
Publication number: 20140006402Abstract: A contents distribution server using an identification code of contents is disclosed. The apparatus includes an interface providing unit configured to provide an interface for registration of the contents to a device, if a request for the registration of the contents is received from the device; an code information extraction unit configured to extract code information from input information through the interface; an identification code generation unit configured to generate the identification code by combining codes corresponding to the extracted code information, and a contents distribution unit configured to match the contents with the generated identification code, register the matched contents in a database and transmit the registered contents to a contents managing server with reference to the identification code.Type: ApplicationFiled: July 2, 2012Publication date: January 2, 2014Applicant: KT CORPORATIONInventors: Sang-Bum LEE, Chang-Seuk OK, Hye-Mi KIM, Se-Cheol PARK, Joo-Young YOON
-
Publication number: 20140006400Abstract: A system and method of managing online social networking which includes identifying a plurality of users related to a primary user on a social networking tool using a computer. The method and system identifies a plurality of activities performed by the plurality of users on the social networking tool, and assigning a score to each of the activities. A threshold cumulative score for users to enter a group is defined. The system and method evaluates the activities of each of the users, and calculates a cumulative score for each of the users based on their respective activities, and evaluates the cumulative score of each of users in relation to the group. One or more of the plurality of users who meet the threshold cumulative score are assigned to the group. A status for each user in the group based on their cumulative score is determined.Type: ApplicationFiled: June 29, 2012Publication date: January 2, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Paul R. Bastide, Matthew E. Broomhall, Robert E. Loredo
-
Publication number: 20130339357Abstract: Embodiments of the invention include methods for identifying one or more clusters in a streaming graph, the method includes receiving a stream of edges and sampling the stream of edges to create a structural reservoir and support reservoir. The method also includes creating a sampled graph from the structural reservoir and identifying the one or more clusters in the sampled graph by grouping one or more connected vertices in the sampled graph.Type: ApplicationFiled: June 26, 2012Publication date: December 19, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ahmed S. Eldawy, Rohit M. Khandekar, Kun-Lung Wu
-
Publication number: 20130339355Abstract: A system for clustering vertices in a streaming graph includes a structural sampler configured to receive a stream of edges. The structural sampler includes a reservoir manager configured to receive the stream of edges and create a structural reservoir and a support reservoir and a graph manager configured to receive the structural reservoir from the reservoir manager and to create a sampled graph from the structural reservoir, wherein the sampled graph includes one or more clusters that each include one or more connected vertices.Type: ApplicationFiled: June 14, 2012Publication date: December 19, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ahmed S. Eldawy, Rohit M. Khandekar, Kun-Lung Wu
-
Publication number: 20130332450Abstract: A method for automatically extracting and organizing information by a processing device from a plurality of data sources is provided. A natural language processing information extraction pipeline that includes an automatic detection of entities is applied to the data sources. Information about detected entities is identified by analyzing products of the natural language processing pipeline. Identified information is grouped into equivalence classes containing equivalent information. At least one displayable representation of the equivalence classes is created. An order in which the at least one displayable representation is displayed is computed. A combined representation of the equivalence classes that respects the order in which the displayable representation is displayed is produced.Type: ApplicationFiled: June 11, 2012Publication date: December 12, 2013Applicant: International Business Machines CorporationInventors: Vittorio Castelli, Radu Florian, Xiaoqiang Luo, Hema Raghavan
-
Publication number: 20130325863Abstract: Embodiments of the invention relate to a modeling activity area associated with groups of data items. Tools are provided to profile activity area involvement, both from the data item and from associated participants. The data items are placed into clusters and one or more activity areas are derived from the formed clusters. Each activity area is defined from the perspective of a single user. Participants in an activity area are connected to a user, but not necessarily to each other. The combination of formations of clusters and activity areas provides a multi-facetted organization of connections between data items and associated participants.Type: ApplicationFiled: August 28, 2012Publication date: December 5, 2013Applicant: International Business Machines CorporationInventor: Hongxia Jin
-
Publication number: 20130325866Abstract: Embodiments of the invention relate to modeling communities associated with groups of data items. Tools are provided to iteratively assign data items to communities and to update topic and participant distribution in the assigned communities. As the distributions are updated, the characteristics of the communities are updated. Each activity area is defined from the perspective of a single user. Participants in a community are connected to a user, but not necessarily to each other. The combination of formations of communities and the statistical aspect of evaluating characteristics of the communities provides a multi-facetted organization of connections between data items and associated participants.Type: ApplicationFiled: May 31, 2012Publication date: December 5, 2013Applicant: International Business Machines CorporationInventors: Hongxia Jin, Yan Liu, Wenjun Zhou
-
Publication number: 20130325867Abstract: The disclosure generally describes computer-implemented methods, software, and systems for providing a homogeneous data model based on in-memory database views. One computer-implemented method includes creating an application view field associated with an application view, indicating a base database field in a base database table for the created application view field, collecting additional information associated with the indicated base database field, determining at least a data element and a domain associated with the indicated base database field using the collected additional information, determining, by operation of a computer using the collected additional information, that multiple determined catalog entries associated with the indicated base database field exist in a catalog, and proposing names for the application view field, wherein the proposed names are presented from most specific to least specific.Type: ApplicationFiled: June 4, 2012Publication date: December 5, 2013Applicant: SAP AGInventors: 69190 Kemmler, Torsten Kamenz
-
Publication number: 20130311437Abstract: A system and method obtain a database stored on a storage device containing information on multiple assets, the information including measurements taken from devices monitoring each asset, and context information corresponding to the environment the items are subjected to. The system and method groups assets via a computer system into a homogenous group as a function of selected context information and performs analytics via the computer system on the grouped assets to manage the assets.Type: ApplicationFiled: May 16, 2012Publication date: November 21, 2013Applicant: Honeywell Internatioanl Inc.Inventors: Petr Stluka, Eva Jerhotova, Karel Marik, Ondrej Holub, Wendy Foslien, Rylan Clark
-
Publication number: 20130311467Abstract: A method and a system for coreference resolution are provided. The method includes receiving a set of document clusters, each cluster in the set of document clusters including a set of text documents. Instances of each of a set of candidate named entities are identified in the document clusters. For a pairs of the candidate named entities, at least one socio-temporal feature is computed that is based on the similarity of the distributions of identified instances of the respective candidate name entities among the document clusters. A decision for merging for the candidate named entities into a common real named entity is based on the socio-temporal features.Type: ApplicationFiled: May 18, 2012Publication date: November 21, 2013Applicant: Xerox CorporationInventors: Matthias Gallé, Jean-Michel Renders, Guillaume Jacquet
-
Publication number: 20130305058Abstract: A method, system and computer program product for controlling enterprise data on mobile devices. Data on a mobile device is tagged as being associated with either enterprise data or with personal data. Upon identifying the storage location of the tagged data and the identifier of the application that generated the tagged data, the tag, the storage location of the tagged data and the identifier of the application are stored in an index. A mobile agent residing on the mobile device may be directed by a mobile device management server of the enterprise to perform various actions (e.g., deleting, encrypting, backing-up) on the enterprise data using the index. In this manner, the enterprise has the ability to control their applications and data that resides on employees' mobile devices to ensure that such data is not lost or used in a manner that is contrary to the wishes of the employer.Type: ApplicationFiled: May 15, 2012Publication date: November 14, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Shalini Kapoor, Palanivel A. Kodeswaran, Sridhar R. Muppidi, Nataraj Nagaratnam, Vikrant Nandakumar
-
Publication number: 20130304741Abstract: Method, system, and programs for providing identifiers to objects. Input data representing a plurality of objects is received and categorized into a plurality of entity categories. A first graph of entities is generated using the plurality of entity categories. The first graph of entities are matched with a second graph of entities. A comparison of object pairs is then made, in which each object pair includes a first object from the first graph of entities and a corresponding second object from the second graph of entities. Identifiers are assigned to each object based on comparing the object pairs.Type: ApplicationFiled: May 10, 2012Publication date: November 14, 2013Applicant: YAHOO! INC.Inventors: Balaji Kannan, Aamod Sane, Zhiwei Gu
-
Publication number: 20130304738Abstract: Systems, methods and computer program products manage collections of information using latent semantic analysis. The collections of information may be text based such as collections of documents or non-text data such as audio, image, video or multimedia data. Semantic information groups are created by grouping collections of information according to a degree of relatedness. A system allocates discontiguous node locations of one or more distributed databases to the semantic information groups. The system manages a dynamic semantic table that maps the discontiguous node locations to a semantic virtual table having a contiguous memory space.Type: ApplicationFiled: May 11, 2012Publication date: November 14, 2013Applicant: International Business Machines CorporationInventors: Sandra K. Johnson, Grant D. Miller
-
Publication number: 20130304737Abstract: A classification system executing on one or more computer systems includes a processor and a memory coupled to the processor. The memory includes a discovery engine configured to navigate through non-volatile memory storage to discover an identity and location of one or more files in one or more computer storage systems by tracing the one or more files from file system mount points through file system objects and to disk objects. A classifier is configured to classify the one or more the files into a classification category. The one or more files are associated with the classification category and stored in at least one data structure. Methods are also provided.Type: ApplicationFiled: May 10, 2012Publication date: November 14, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: NIKOLAI JOUKOV, AMITKUMAR M. PARADKAR, BIRGIT M. PFITZMANN, WILLIAM R. REOHR, PETER URBANETZ
-
Publication number: 20130297603Abstract: A monitoring system includes a database storing configuration information about a plurality of objects in the data center; a first inventory instance that adds a first object to the database, where the first inventory instance classifies the first object based on a set of classification rules to select a set of monitoring rules for the first object based on its classification and add configuration information about the first object to the configuration database; and a first monitoring instance to monitor the first object, the monitoring instance monitoring status of the first object based on respective configuration information in the database; at least one of the first inventory instance and the first monitoring instance identifying a further object functionally connected to the first object, the further objects added to the database by the first or a second inventory instance and monitored by the first or a second monitoring instance.Type: ApplicationFiled: May 1, 2012Publication date: November 7, 2013Applicant: Fujitsu Technology Solutions Intellectual Property GmbHInventors: Fritz Brenker, Michael Burnicki, Patrick Kaspari, Oliver Niehörster, Ulrich Recker
-
Publication number: 20130290333Abstract: A system for extracting customer feedback from a microblog site includes a retrieval unit coupled to the microblog site to capture microblog updates. A filter unit coupled to the retrieval unit filters the captured microblog updates according to filter criteria that remove non-actionable items from the captured microblog updates. A learning unit coupled to the filter unit prioritizes the filtered microblog updates, and a classification unit coupled to the learning unit classifies the filtered and prioritized microblog updates. An action unit coupled to the classification unit performs appropriate actions based on the classified, filtered and prioritized microblog updates.Type: ApplicationFiled: April 27, 2012Publication date: October 31, 2013Applicant: Benbria CorporationInventors: Wojciech Fraczak, Ying Du
-
Publication number: 20130282725Abstract: A system is disclosed for automation-assisted curation of technical information from technical support tickets into a technical information knowledge base. In one example, a method includes mapping information from a plurality of fields of a support ticket in a technical support reporting tool to a plurality of corresponding fields of a structured information file. The method further includes rendering the structured information file in a user-editable format in a user interface; saving user inputs to the structured information file, thereby generating a curated structured information file that incorporates the mapped information and the user inputs; and saving the curated structured information file to a searchable technical support information data store.Type: ApplicationFiled: April 24, 2012Publication date: October 24, 2013Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventor: Benjamin I. Rubinger
-
Publication number: 20130276089Abstract: A security application manages security and reliability of networked applications executing collection of interacting computing elements within a distributed computing architecture. The security application monitors various classes of resources utilized by the collection of nodes within the distributed computing architecture and determine whether utilization of a class of resources is approaching a pre-determined maximum limit. The security application performs a vulnerability scan of a networked application to determine whether the networked application is prone to a risk of intentional or inadvertent breach by an external application. The security application scans a distributed computing architecture for the existence of access control lists (ACLs), and stores ACL configurations and configuration changes in a database.Type: ApplicationFiled: April 12, 2012Publication date: October 17, 2013Inventors: Ariel Tseitlin, Roy Rapoport, Jason Chan
-
Publication number: 20130246435Abstract: A knowledge extraction framework may iteratively enrich an ontology that is used to classify structured knowledge obtained from web pages based on structured knowledge previously acquired from other web pages. The framework may enable a user to define the ontology for extracting structured knowledge from a plurality of web pages. The framework applies the ontology using a supervised extraction algorithm to extract seed information from a set of web pages. The framework further applies an unsupervised extraction algorithm to extract the structured knowledge from an additional set of web pages. The framework subsequently maps the structured knowledge to the ontology based on the seed information to enrich the ontology.Type: ApplicationFiled: March 14, 2012Publication date: September 19, 2013Applicant: MICROSOFT CORPORATIONInventors: Jun Yan, Lei Ji, Edward W. Wild, Yi Li, Ning Liu, Zheng Chen
-
Publication number: 20130246429Abstract: A canopy clustering process merges at least one set of multiple single-center canopies together into a merged multi-center canopy. Multi-center canopies, as well as the single-center canopies, can then be used to partition data objects in a dataset. The multi-center canopies allow a canopy assignment condition constraint to be relaxed without risk of leaving any data objects in a dataset outside of all canopies. Approximate distance calculations can be used as similarity metrics to define and merge canopies and to assign data objects to canopies. In one implementation, a distance between a data object and a canopy is represented as the minimum of the distances between the data object and each center of a canopy (whether merged or unmerged), and the distance between two canopies is represented as the minimum of the distances for each pairing of the center(s) in one canopy and the center(s) in the other canopy.Type: ApplicationFiled: March 19, 2012Publication date: September 19, 2013Applicant: MICROSOFT CORPORATIONInventors: Xiong Zhang, Danny Lange, Hung-Chih Yang
-
Publication number: 20130246424Abstract: A method is provided in one example embodiment and it includes receiving a state request and determining whether a state exists in a translation dictionary for the state request. The method further includes reproducing the state if it is not in the dictionary and adding a new state to the dictionary. In more specific embodiments, the method includes compiling a rule, based on the state, into a given state table. The rule affects data management for one or more documents that satisfy the rule. In yet other embodiments, the method includes determining that the state represents a final state such that a descriptor is added to the state. In one example, if the state is not referenced in the algorithm, then the state is released. If the state is referenced in the algorithm, then the state is replaced with the new state.Type: ApplicationFiled: March 30, 2012Publication date: September 19, 2013Inventors: William Deninger, Ratinder Paul Singh Ahuja, Lee C. Cheung
-
Publication number: 20130238610Abstract: Computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.Type: ApplicationFiled: March 7, 2012Publication date: September 12, 2013Applicant: International Business Machines CorporationInventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
-
Publication number: 20130238611Abstract: Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.Type: ApplicationFiled: March 8, 2012Publication date: September 12, 2013Applicant: International Business Machines CorporationInventors: Snigdha Chaturvedi, Tanveer A. Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam
-
Patent number: 8527515Abstract: Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.Type: GrantFiled: November 7, 2011Date of Patent: September 3, 2013Assignee: Oracle OTC Subsidiary LLCInventors: Vladimir V. Zelevinsky, Omri Traub, Vladimir Gluzman Peregrine, Daniel Tunkelang, Joyce Jeanpin Wang
-
Publication number: 20130226920Abstract: The invention provides in some aspects methods of digital data processor-based analysis of digital data that represent interactions to identify distinct individuals and/or the entities with which they are affiliated (e.g., households, businesses, social or other groups) involved in those interactions. The methods can be employed, for example, to analyze digital data representing retail purchase, marketing and visitor interactions for tracking and/or reporting purposes.Type: ApplicationFiled: February 28, 2012Publication date: August 29, 2013Applicant: CQuotient, Inc.Inventors: Bharath K. Krishnan, Vishwamitra S. Ramakrishnan
-
Publication number: 20130226926Abstract: An approach for enabling mobile device users to acquire information regarding events in their proximity on demand is described. An event determination platform processes and/or facilitates a processing of captured data (e.g., images, audio, video, etc.) that depict, at least in part, one or more events to determine one or more characteristics of the one or more events, the captured data, or a combination thereof. The event determination platform further causes, at least in part, an identification of one or more events based, at least in part, on a comparison of the one or more characteristics against one or more other characteristics associated with one or more registered events.Type: ApplicationFiled: February 29, 2012Publication date: August 29, 2013Applicant: Nokia CorporationInventor: Jerome Beaurepaire
-
Publication number: 20130226921Abstract: A method for identifying an auto-complete communication pattern within a sequence of request entities includes grouping the request entities into a plurality of clusters according to a criterion. Clusters are removed from the plurality according to at least one of pattern analysis, a cluster size, and a cluster timing. Remaining clusters are identified as having an auto-complete communication pattern.Type: ApplicationFiled: February 29, 2012Publication date: August 29, 2013Inventor: Ofer Eliassaf