Clustering Or Classification (epo) Patents (Class 707/E17.089)
  • Publication number: 20130060778
    Abstract: Provided are a device, a method, and a program for displaying a document list with which a desired document can be effectively specified. The present invention groups documents in accordance with a displaying method of a document list, dynamically gives a group a name with which a range of the grouped documents can be seen, and organizes the document list.
    Type: Application
    Filed: August 31, 2012
    Publication date: March 7, 2013
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Kazunari Yamanakajima
  • Publication number: 20130060777
    Abstract: A method begins by a dispersed storage (DS) processing module receiving a first coded matrix that includes a first plurality of pairs of coded values corresponding to first data segments of a first data stream and a second data stream. The method continues with the DS processing module receiving a second coded matrix that includes a second plurality of pairs of coded values corresponding to first data segments of a third data stream and a fourth data stream. The method continues with the DS processing module generating a new coded matrix to include a plurality of groups of selected coded values. The method continues with the DS processing module outputting the plurality of groups of selected coded values to a requesting entity in a manner to maintain time alignment of the first data segments of the first, second, third, and fourth data streams.
    Type: Application
    Filed: August 2, 2012
    Publication date: March 7, 2013
    Applicant: CLEVERSAFE, INC.
    Inventors: Gary W. Grube, Timothy W. Markison
  • Publication number: 20130060774
    Abstract: A system and method for semantically classifying numerical data includes using semantic classification techniques on ‘nearby’ non-numerical data to identify a context whereby opaque data sets of numbers can be semantically classified inside of that context. An Electronic Knowledge Base is used to query against the context and determine the semantics of the opaque numeric data sets.
    Type: Application
    Filed: September 7, 2011
    Publication date: March 7, 2013
    Applicant: XEROX CORPORATION
    Inventors: Michael David Shepherd, Dale Ellen Gaucas, Kirk J. Ocke
  • Publication number: 20130060794
    Abstract: An approach for building management, energy management and facility management systems and particularly to data models representing building and operational configurations of the systems. More particularly, the disclosure pertains to standard data models for representing these configurations and their transformation from non-standard form into a standard form defined by domain ontologies. The transformation is of ad hoc and disparate technical reference information into an ontologically correct and validated complex hierarchy with an associated set of integrated digital information.
    Type: Application
    Filed: September 6, 2011
    Publication date: March 7, 2013
    Applicant: Honeywell International Inc.
    Inventors: Ramesha Nellikere Puttabasappa, Conrad Bruce Beaulieu
  • Publication number: 20130060776
    Abstract: A disjoint partial-area taxonomy abstraction network and methods of producing same for a hierarchy, which partitions overlapping concepts into singly-rooted disjoint groups that are more manageable to work with and comprehend. This provides abstract models for summarizing overlapping concepts which permit enhanced, high-level display for users at a user interface.
    Type: Application
    Filed: August 2, 2012
    Publication date: March 7, 2013
    Inventors: Yehoshua Perl, James Geller, Michael Howard Halper, Joyce Wang
  • Patent number: 8392266
    Abstract: A method for providing certified feedback information on a transaction entity (e.g., seller, purchaser, and/or object (e.g., good, service)) involved in a transaction between a seller and a purchaser.
    Type: Grant
    Filed: November 13, 2009
    Date of Patent: March 5, 2013
    Assignee: Omnione USA, Inc.
    Inventor: Davide Lombardi
  • Publication number: 20130054553
    Abstract: A method for automatically extracting information of products, includes searching documents based on product names; and extracting sentences including advantages and disadvantages for products having the product names from the searched documents. Further, the method for automatically extracting the information of the products includes classifying the sentences by similar contents among the extracted sentences; selecting representative sentences among the classified sentences; and calculating each weight of the selected representative sentences.
    Type: Application
    Filed: July 26, 2012
    Publication date: February 28, 2013
    Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Yeo Chan YOON, HyunKi Kim, Hyo-Jung Oh, Changki Lee, Chung Hee Lee, Myung Gil Jang, Yohan Jo, Miran Choi, Yoonjae Choi, Jeong Heo, Pum Mo Ryu, Hyeon Jin Kim
  • Publication number: 20130054559
    Abstract: An online marketing research measurement that allows a user to derive and/or monitor knowledge metrics, such as awareness metrics, recommendation metrics, advocacy metrics, etc. about a target subject, such as the user's brands and/or products using existing data on the Internet. Rather than requiring responses solicited from active participants in a survey (as in traditional surveys), unsolicited opinion data residing on the Internet can be gathered and processed for deriving various types of knowledge metrics. A recommendation metric can be derived from opinion data gathered from the Internet, which reflects a measure of recommendation opinions about the target subject. Users may identify the specific brand in which they are interested. After an Internet crawler is sent out to select data, the engine cleans the results of poor quality data, codes the data according to the appropriate constructs or variables, and then scores the sentiment using the system's sentiment engine.
    Type: Application
    Filed: August 30, 2012
    Publication date: February 28, 2013
    Applicant: e-Rewards, Inc.
    Inventor: Frances Annie Pettit
  • Publication number: 20130054600
    Abstract: A clustered database environment (e.g. Oracle Real Application Cluster (RAC)) includes multiple database instances that appear as one server. An application server (e.g. WebLogic Server (WLS)) can use a data source (e.g. an Oracle GridLink data source) and connection pools to connect with the clustered database. In accordance with an embodiment, a data source configuration allows for specification of a preferred affinity policy, such as a data affinity, temporal affinity, and/or session or session-based affinity policy. In accordance with an embodiment, the system includes a number of features that improve application connectivity in the clustered database environment, including a select-only case for application continuity, wherein an application-independent infrastructure, e.g. implemented within a Java Database Connectivity (JDBC) driver, enables recovery of work from an application perspective and masks system communications, hardware failures and hangs.
    Type: Application
    Filed: February 16, 2012
    Publication date: February 28, 2013
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Alexander Somogyi, Naresh Revanuru, Stephen Felts, Tong Zhou
  • Publication number: 20130046763
    Abstract: Systems and methods are disclosed for identifying associations between binary samples, such as e-mail files and their attachments or a document and an executable program associated with the document. In one implementation, the method includes receiving a plurality of binary samples, and extracting metadata from the plurality of binary samples. The metadata for a binary sample from the plurality of binary samples includes a set of attributes of the binary sample. The method further includes identifying a set of associations between the plurality of binary samples based on the extracted metadata. Each association is characterized by at least one attribute the associated binary samples have in common, and each association has a confidence level indicative of a strength of the association. The method also includes identifying associations with a confidence level that exceeds a predefined threshold.
    Type: Application
    Filed: December 28, 2011
    Publication date: February 21, 2013
    Inventors: Gregory SINCLAIR, Ryan Olson, Robert Falcone
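The association step in the abstract above can be pictured with a minimal sketch. It is not the claimed method: the abstract does not say how the confidence level is computed, so the ratio of shared attributes to all attributes used here is an assumption, as are the function name `find_associations`, the sample names, and the threshold value.

```python
from itertools import combinations


def find_associations(samples, threshold=0.5):
    """Pair up samples that share metadata attributes.

    samples: dict mapping a sample name to a dict of attribute -> value.
    Returns (name_a, name_b, shared_attributes, confidence) tuples whose
    confidence exceeds the threshold. The confidence formula (shared
    attributes divided by all attributes seen on either sample) is an
    illustrative assumption, not taken from the abstract.
    """
    results = []
    for name_a, name_b in combinations(sorted(samples), 2):
        meta_a, meta_b = samples[name_a], samples[name_b]
        # Attributes present on both samples with identical values.
        shared = sorted(
            key for key in meta_a.keys() & meta_b.keys()
            if meta_a[key] == meta_b[key]
        )
        if not shared:
            continue  # an association needs at least one common attribute
        confidence = len(shared) / len(meta_a.keys() | meta_b.keys())
        if confidence > threshold:
            results.append((name_a, name_b, shared, confidence))
    return results


# Hypothetical metadata for an e-mail, its attachment, and an unrelated file.
samples = {
    "email.eml": {"sender": "a@x.test", "author": "bob", "size": 10},
    "attach.doc": {"sender": "a@x.test", "author": "bob", "size": 99},
    "tool.exe": {"author": "eve", "size": 5},
}
associations = find_associations(samples)
```

With this hypothetical data only the e-mail and its attachment are associated, because they agree on two of their three attributes.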
  • Publication number: 20130041901
    Abstract: Disclosed are electronic systems and techniques for filtering data comprising content requested from a variety of sources, and tracking which of the sources supplied the data and a category that classifies the content. The filtering may comprise implementing a variety of filtering techniques to acquire the content. In this regard, the content can be stored and grouped with other content related to the content in data storage, while discarding a subset of the data not containing the content.
    Type: Application
    Filed: August 12, 2011
    Publication date: February 14, 2013
    Inventor: Andrey N. Nikankin
  • Publication number: 20130041723
    Abstract: Generating an analytical tool for use in assessing a state of an entity. Source data relating to a state of a community of which the entity forms a part is retrieved. The source data relates to at least one variable and the variable comprises drivers. Cluster analysis is performed on the source data to produce an array of reference data. The array of reference data is organized into a form to be used in analysing data collected from the community. An analytical tool is useful in assessing the state of an entity. The analytical tool comprises an array of cells, each cell containing a subset of reference data which provides a measure of each driver related to the state of a community. The positioning of the cells relative to one another is governed by the inter-relationship of the reference data in the cells.
    Type: Application
    Filed: July 12, 2012
    Publication date: February 14, 2013
    Inventor: Warren John Parry
  • Publication number: 20130031093
    Abstract: A difference in tendency of times associated with combinations of content clusters and user clusters among the user clusters is reflected in a result of correspondence between the content cluster and the user cluster. A data acquisition unit acquires association data indicating a combination of a content belonging to the content cluster, a user belonging to one of a plurality of user clusters, and a time relating to a combination of the content and the user. A dividing unit divides, under a condition that the tendency of the times associated with the users in the association data differs among the plurality of user clusters to which the users belong, the content cluster into a plurality of clusters each corresponding to at least one of the plurality of user clusters.
    Type: Application
    Filed: July 12, 2012
    Publication date: January 31, 2013
    Applicant: SONY COMPUTER ENTERTAINMENT INC.
    Inventor: Takayuki ISHIDA
  • Publication number: 20130031095
    Abstract: A computer-readable recording medium has an entry support program embodied therein for causing a computer to perform detecting text being entered, extracting text examples corresponding to the detected text from a storage unit, the storage unit storing text examples and frequencies of use of the text examples such that the frequencies of use are associated with the respective text examples, classifying the extracted text examples into text-example groups each containing one or more text examples based on comparison of letters included in the extracted text examples, determining display order of the text-example groups, based on the frequencies of use that are associated in the storage unit with text examples belonging to the text-example groups, and displaying the extracted text examples in the determined display order.
    Type: Application
    Filed: July 24, 2012
    Publication date: January 31, 2013
    Applicant: FUJITSU LIMITED
    Inventor: Kiyoshi Takeuchi
  • Publication number: 20130031092
    Abstract: A method of compressing sequence data in a text-based format, the method involving parsing text of the sequence data into a plurality of fields, identifying encoding algorithms that achieve greatest compression gains with respect to the plurality of fields based on collected statistics, and generating a bitstream, compressed from the sequence data, by encoding the sequence data using the identified encoding algorithms.
    Type: Application
    Filed: June 8, 2012
    Publication date: January 31, 2013
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Vishal BHOLA, Shyamsunder Ajit BOPARDIKAR, Rangavittal NARAYANAN, Kyu-Sang LEE, Tae-Jin AHN
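As a rough illustration of the per-field idea in the abstract above, the sketch below splits text records into fields and keeps, for each field, whichever general-purpose codec yields the smallest output. Several things are assumptions not stated in the abstract: the FASTQ-style four-line record layout, the use of Python's standard `zlib`, `bz2`, and `lzma` codecs in place of whatever encoders the application contemplates, and choosing by direct trial rather than by collected statistics. The result is kept as a dictionary rather than packed into a single bitstream.

```python
import bz2
import lzma
import zlib

# Candidate codecs: name -> (compress, decompress). Stand-ins for illustration.
CODECS = {
    "zlib": (zlib.compress, zlib.decompress),
    "bz2": (bz2.compress, bz2.decompress),
    "lzma": (lzma.compress, lzma.decompress),
}


def split_fields(text):
    """Split four-line records (id, sequence, '+', quality) into field streams.

    The '+' separator line is dropped, so this sketch does not preserve it.
    """
    lines = text.strip().split("\n")
    fields = {"id": [], "seq": [], "qual": []}
    for i in range(0, len(lines) - 3, 4):
        fields["id"].append(lines[i])
        fields["seq"].append(lines[i + 1])
        fields["qual"].append(lines[i + 3])
    return {name: "\n".join(vals).encode("ascii") for name, vals in fields.items()}


def encode_fields(fields):
    """For each field, try every codec and keep the smallest result."""
    encoded = {}
    for name, data in fields.items():
        best = min(CODECS, key=lambda codec: len(CODECS[codec][0](data)))
        encoded[name] = (best, CODECS[best][0](data))
    return encoded


def decode_fields(encoded):
    """Invert encode_fields using the codec name stored with each field."""
    return {name: CODECS[codec][1](blob) for name, (codec, blob) in encoded.items()}


# Hypothetical two-record input.
text = "@r1\nACGTACGT\n+\nIIIIHHHH\n@r2\nACGTTTGA\n+\nIIHHGGFF\n"
fields = split_fields(text)
encoded = encode_fields(fields)
```

Storing the chosen codec name next to each compressed field is what lets `decode_fields` reverse the process without any extra bookkeeping.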
  • Publication number: 20130031075
    Abstract: Action-based deeplinks are provided with search results to allow users to access and perform actions that are common to web pages within a given category. To identify action-based deeplinks for web pages in a category, hyperlinks within the web pages are identified and clustered. Each cluster may correspond with an action that may be commonly accessed when visiting the web pages. When a web page that contains a hyperlink to such an action is returned as a search result, an action-based deeplink is provided as part of the search result to allow a user to directly access the location to perform the action.
    Type: Application
    Filed: July 26, 2011
    Publication date: January 31, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: ANTOINE EL DAHER, FARID HOSSEINI
  • Publication number: 20130031079
    Abstract: Search results are provided with personalized deeplinks for an end user. User behavior information is gathered regarding web pages visited by the end user. When the end user submits a search query, the website category of a search result is identified and user behavior information regarding web pages visited at other websites within the website category is identified. At least one deeplink is selected for the search result based on that user behavior information. In some instances, user behavior information may be tracked for a group of end users. The user behavior information for the group of end users may be used in conjunction with the user behavior information for the end user to facilitate deeplink selections for search results returned in response to search queries from the end user.
    Type: Application
    Filed: February 27, 2012
    Publication date: January 31, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: ANTOINE EL DAHER, DEEPAK VIJAYWARGI, YOGESH KANT ROY
  • Publication number: 20130031101
    Abstract: A method of determining which users are experts and which tags are appropriate without some of the disadvantages of the prior art is described. The level of a user's expertise is determined based on previous tags, the categorization of one or more tags, and the rating of the tags previously left by the user. The appropriateness of a tag is based on previous tagging of information by the user, by the number of times a user has tagged information with the same categorization, and the rating of a user.
    Type: Application
    Filed: July 31, 2012
    Publication date: January 31, 2013
    Applicant: Avaya Inc.
    Inventors: Doree Duncan Seligmann, Ajita John, Shreeharsh Kelkar
  • Publication number: 20130031094
    Abstract: A computer-implemented method comprising: receiving a first report related to an application configured to run on one or more computing devices; identifying one or more terms included in the first report; identifying one or more second reports including at least one of the one or more terms; retrieving a term relevance value for a term included in at least one of the one or more second reports; determining that the term relevance value is less than a term relevance threshold value; identifying at least one of the one or more second reports for a clustering process, wherein at least one of the one or more second reports that include the term is excluded from the clustering process; implementing the clustering process using the identified at least one of the one or more second reports; and assigning the first report to a cluster.
    Type: Application
    Filed: July 19, 2012
    Publication date: January 31, 2013
    Applicant: GOOGLE INC.
    Inventor: Michal M. Kozak
  • Publication number: 20130031102
    Abstract: The invention discloses a method and system for content categorization, which aims at reducing the processing burden of the content categorization as well as the network transmission traffic. The method comprises: transmitting, by a content categorization requester, a content digest of a content to be categorized to a content categorization provider; and performing, by the content categorization provider, content categorization according to the content digest.
    Type: Application
    Filed: September 28, 2012
    Publication date: January 31, 2013
    Applicant: Huawei Technologies Co., Ltd.
    Inventor: Huawei Technologies Co., Ltd.
  • Publication number: 20130031098
    Abstract: A mismatch detection system includes: a statement unit extracting portion that extracts a set of statement units by dividing a given document, which is written in a natural language, into pieces; a statement constructing portion that constructs each statement as a combination of a context and specifics by sorting each of the statement units into the context, which indicates additional information of statements, and the specifics, which indicate information of the statements; and a data generating portion that generates a data set obtained by merging a set of predetermined check specifics and a set of the statements generated by the statement constructing portion.
    Type: Application
    Filed: March 25, 2011
    Publication date: January 31, 2013
    Applicant: NEC CORPORATION
    Inventor: Yukiko Kuroiwa
  • Publication number: 20130024452
    Abstract: A method and system for managing a project. The method and system comprise accepting at least two project templates from a database, wherein the project database contains personal project templates and work project templates categorized by type of project. A start date and/or an end date for each project template may be accepted. Information related to each project template may be automatically generated. The information related to all project templates may be aggregated and a user may access the information related to all project templates from one user interface.
    Type: Application
    Filed: June 25, 2012
    Publication date: January 24, 2013
    Inventors: Scott A. DEFUSCO, Jeffrey David ECKERLE, Lisa Anne RABIDEAU, Benjamin Joe ROSSI, Mark J. NUTTER
  • Publication number: 20130024456
    Abstract: Embodiments of the invention relate to a category based navigation system obtaining user data related to a plurality of users relevant to the primary user. The method further comprises obtaining entity data associated with an entity in a plurality of entities. The category based navigation system then determines one or more entities relevant to the primary user, and determines an initial order of relevance of a set of relevant entities. The method further comprises categorizing and displaying the set of relevant entities with an initial categorization on a user device to the primary user. The category based navigation system may then obtain, via the user device, user feedback, adjust the initial categorization and initial order of relevance based on the user feedback; and display the adjusted categorization and adjusted order of relevance of the set of relevant entities to the primary user on the user device.
    Type: Application
    Filed: July 19, 2012
    Publication date: January 24, 2013
    Applicant: Ness Computing, Inc.
    Inventors: Scott Paul Goodson, Sourav Chatterji, Jeremy Ryan Schiff, Corey Layne Reese, Paul Kenneth Twohey
  • Publication number: 20130018884
    Abstract: Systems and methods are provided for identifying unsolicited or unwanted electronic communications, such as spam. The disclosed embodiments also encompass systems and methods for selecting content items from a content item database. Consistent with certain embodiments, computer-implemented systems and methods may use a clustering based statistical content matching anti-spam algorithm to identify and filter spam. Such an anti-spam algorithm may be implemented to determine a degree of similarity between an incoming e-mail and a collection of one or more spam e-mails stored in a database. If the degree of similarity exceeds a predetermined threshold, the incoming e-mail may be classified as spam. Further, in accordance with other embodiments, systems and methods may be provided to determine a degree of similarity between a query or search string from a user and content items stored in a database.
    Type: Application
    Filed: July 11, 2011
    Publication date: January 17, 2013
    Inventors: Santhosh Baramasagara Chandrasekharappa, Sivakumar Ekambaram, Saurabh Sohoney, Rakesh Nigam
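The threshold test described in the abstract above can be sketched in a few lines. The abstract does not name the similarity measure or describe the clustering step, so this sketch substitutes Jaccard similarity over three-word shingles and compares against each stored message directly; the function names, the shingle size, and the threshold of 0.5 are all assumptions made for illustration.

```python
def shingles(text, k=3):
    """Return the set of k-word shingles in text, lowercased."""
    words = text.lower().split()
    if len(words) < k:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def is_spam(incoming, spam_collection, threshold=0.5):
    """Return (flag, best_score) for an incoming message.

    The message is flagged when its best similarity to any stored spam
    message exceeds the threshold.
    """
    incoming_set = shingles(incoming)
    best = max(
        (jaccard(incoming_set, shingles(stored)) for stored in spam_collection),
        default=0.0,
    )
    return best > threshold, best


# Hypothetical stored spam and two incoming messages.
known_spam = ["win a free prize now click here"]
flag, score = is_spam("win a free prize now click here today", known_spam)
ham_flag, ham_score = is_spam("meeting moved to three pm tomorrow", known_spam)
```

Here the first message shares five of six shingles with the stored spam and is flagged, while the second shares none.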
  • Publication number: 20130018885
    Abstract: A status management system includes a computer-implemented method for delivering status information to a requester, comprising providing status codes, clustering the status codes in a number of status codes clusters, hierarchically sorting the status codes clusters and transmitting at least one of the status codes to the requester depending on the hierarchy of the sorted status codes clusters.
    Type: Application
    Filed: July 13, 2012
    Publication date: January 17, 2013
    Applicant: HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH
    Inventor: Teodora Guenkova-Luy
  • Publication number: 20130013613
    Abstract: A requirements management tool, where each requirement is defined by one or more design element values. Each of the design element values is a unique value and is a member of a group of design element values defined for the project. As each requirement is created, the design element values for the requirement are selected from the group of design element values, or alternatively, a new design element value may be entered by a user, and the new design element value will be added to the group of element values. Each design element value corresponds to a category that each of the requirements are broken down into. Design element values in the created requirement are compared to design element values in existing requirements, and results of this duplication check are presented to a user of the requirements management tool.
    Type: Application
    Filed: July 5, 2011
    Publication date: January 10, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Maneesh Kumar Sharma
  • Publication number: 20130013600
    Abstract: Embodiments of the invention provide for applying multiple attribute changes to components of a dataset. According to one embodiment, coalescing changes can comprise reading a definition of the dataset. For example, the definition can comprise an identity and a context for each of the plurality of components. A component tree can be generated representing the data set and based on the context and identity. An indication of one or more changes to the components of the data set can be received and the changes can be classified based on a type of each of the changes. For example, the type of the changes can comprise one or more of a single component change, a cross-component change, and a cross-component change that affects the identity of at least one of the components. The changes can be coalesced based on the type of the changes.
    Type: Application
    Filed: July 7, 2011
    Publication date: January 10, 2013
    Applicant: Oracle International Corporation
    Inventor: Blake Sullivan
  • Publication number: 20130013614
    Abstract: Software installed on a computer network is often inconsistently, or even incorrectly, identified. The same software may be identified in different ways. A catalogue of standardised identifiers is provided. The actual identifiers of software installed on the network are accessed and they are mapped to the standardised identifiers of the catalogue. The standardised identifiers are used to manage the installed software, monitor license compliance and/or monitor maintenance agreements, amongst other uses. Data relating to the use of the software may also be obtained and associated with the identification data. The usage data together with the standardised identifiers allows managers to more reliably manage software on the network. For example, un-used software may be un-installed and licenses cancelled or reallocated.
    Type: Application
    Filed: July 7, 2011
    Publication date: January 10, 2013
    Applicant: 1E LIMITED
    Inventor: Andrew Mayo
  • Publication number: 20130013604
    Abstract: A document module can be automatically extracted from a plurality of documents and a document module database can be created. A method of making a document module is performed in a computer system including a computer, having a program for realizing a document module making module for making the document module, and a document module database, the document module making module including an analysis module and a similarity calculation module. The method includes: a step of comparing a plurality of subject documents, which are read from the document module database, with each other to calculate the similarity in the arrangement of the characters of the strings between the plurality of subject documents, and extracting first similar strings based on the calculated similarity; and a step of registering each of the first similar strings as the document module in the document module database.
    Type: Application
    Filed: July 2, 2012
    Publication date: January 10, 2013
    Applicant: Hitachi, Ltd.
    Inventor: Yosiyuki KOBAYASI
  • Publication number: 20130013601
    Abstract: Members of a social network user's social graph are automatically segregated into overlapping clusters according to patterns of their past communications. Each cluster within the social graph represents a group of members having a high degree of intra-cluster communication or other connection with one another. The clustering is performed according to a sorting or ranking in accordance with non-principal eigenvectors of connectivity matrices describing the intra-cluster communications/connections. The overlapping clusters exhibit maximum internal density and minimum external sparsity.
    Type: Application
    Filed: October 7, 2011
    Publication date: January 10, 2013
    Inventors: Igor Kabiljo, Borislav Agapiev, Aleksandar Ilic
  • Publication number: 20130006979
    Abstract: A search query including search criteria can be received. The search criteria can be a text string. An enhanced search against an enhanced index can be executed. The enhanced index can be metadata associated with an enhanced cluster. The enhanced cluster can be a document cluster associated with the metadata. The enhanced cluster can be aggregated into a merged document. The merged document can be a document including the enhanced cluster contents. The ranking algorithm can be executed on the merged document to obtain a final ranking of content within the single document.
    Type: Application
    Filed: June 29, 2011
    Publication date: January 3, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: CRISTINA BONANNI, FRANCESCO DAURI, ARCANGELO DI BALSAMO, ALESSANDRO DONATELLI, GIOVANNI FALCHETTI, LUCA LAZZARO
  • Publication number: 20130006990
    Abstract: A search query including search criteria can be received. The search criteria can be a text string. An enhanced search against an enhanced index can be executed. The enhanced index can be metadata associated with an enhanced cluster. The enhanced cluster can be a document cluster associated with the metadata. The enhanced cluster can be aggregated into a merged document. The merged document can be a document including the enhanced cluster contents. The ranking algorithm can be executed on the merged document to obtain a final ranking of content within the single document.
    Type: Application
    Filed: March 2, 2012
    Publication date: January 3, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: CRISTINA BONANNI, FRANCESCO DAURI, ARCANGELO DI BALSAMO, ALESSANDRO DONATELLI, GIOVANNI FALCHETTI, LUCA LAZZARO
  • Publication number: 20130006997
    Abstract: According to one embodiment, an information processing apparatus includes a storage, a log receiver and a merge module. The storage stores a plurality of log data, and first index data corresponding to the plurality of log data. The log receiver receives first log data and second index data from a client apparatus connected via a network, the second index data corresponding to the first log data. The merge module generates third index data by merging the first index data and the second index data. The storage stores the plurality of log data, the first log data and the third index data.
    Type: Application
    Filed: March 1, 2012
    Publication date: January 3, 2013
    Inventor: Atsushi Asayama
  • Publication number: 20120330955
    Abstract: A document similarity calculation device, configured to calculate a similarity indicating a degree of how much a plurality of documents are similar, includes: an associative word group storage portion for storing an associative word group composed of words associated with one another, a word-in-document frequency matrix generation portion for generating a matrix of word frequency in document which is a matrix each element of which is the frequency of a word present in a document with respect to each combination of the word and the document, a word-in-document frequency matrix transformation portion for transforming the generated matrix of word frequency in document based on the stored associative word group so as to reduce the number of dimensions of the matrix of word frequency in document, and a similarity calculation portion for calculating the similarity based on the transformed matrix of word frequency in document.
    Type: Application
    Filed: May 15, 2012
    Publication date: December 27, 2012
    Applicant: NEC Corporation
    Inventor: Mitsugu MIURA
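The effect of the matrix transformation described in the abstract above can be seen in a small sketch. Words that belong to the same associative word group are counted under one shared label, which reduces the number of dimensions before similarity is computed. The abstract does not specify the similarity measure, so cosine similarity is an assumption here, as are the function names and the example group label `VEHICLE`.

```python
import math
from collections import Counter


def reduced_frequencies(doc, word_to_group):
    """Count words in doc, folding each word into its associative group.

    word_to_group maps a word to a group label; words without an entry
    keep their own dimension.
    """
    counts = Counter()
    for word in doc.lower().split():
        counts[word_to_group.get(word, word)] += 1
    return counts


def cosine(a, b):
    """Cosine similarity of two sparse frequency vectors."""
    dot = sum(a[key] * b[key] for key in a if key in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical associative word group: "car" and "automobile" are associated.
groups = {"car": "VEHICLE", "automobile": "VEHICLE"}
doc1, doc2 = "red car", "red automobile"

raw = cosine(reduced_frequencies(doc1, {}), reduced_frequencies(doc2, {}))
merged = cosine(reduced_frequencies(doc1, groups), reduced_frequencies(doc2, groups))
```

Without the group mapping the two documents share only the word "red"; with it, both reduce to the same two dimensions and the similarity rises accordingly.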
  • Publication number: 20120330954
    Abstract: A system that implements a scalable data storage service may maintain tables in a non-relational data store on behalf of clients. The system may provide a Web services interface through which service requests are received, and an API usable to request that a table be created, deleted, or described; that an item be stored, retrieved, deleted, or its attributes modified; or that a table be queried (or scanned) with filtered items and/or their attributes returned. An asynchronous workflow may be invoked to create or delete a table. Items stored in tables may be partitioned and indexed using a simple or composite primary key. The system may not impose pre-defined limits on table size, and may employ a flexible schema. The service may provide a best-effort or committed throughput model. The system may automatically scale and/or re-partition tables in response to detecting workload changes, node failures, or other conditions or anomalies.
    Type: Application
    Filed: June 27, 2011
    Publication date: December 27, 2012
    Inventors: Swaminathan Sivasubramanian, Stefano Stefani, Chiranjeeb Buragohain, Rande A. Blackman, Timothy Andrew Rath, Raymond S. Bradford, Grant A.M. McAlister, Jakub Kulesza, James Hamilton, Luis Felipe Cabrera
  • Publication number: 20120330960
    Abstract: Source values are mapped to new user-defined categories. The new user-defined categories are stored in a new user-defined field. In an embodiment, a user-selection of an existing field is received. Based on a data type of the existing field, a specific mapping interface is displayed. The interface guides the user through specification of the new field and categories, and identification of the source values to map to the new categories.
    Type: Application
    Filed: January 27, 2012
    Publication date: December 27, 2012
    Applicant: salesforce.com, inc.
    Inventors: Marko Koosel, Donovan Schneider, Michael Tang, David Park
  • Publication number: 20120330957
    Abstract: An information processing apparatus determines a weight of each physical feature for hierarchical clustering by acquiring training data of multiple pieces of content in triplets with label information indicating a pair specified by a user as having a highest degree of similarity among three contents of the triplet and executing hierarchical clustering using a feature vector of each piece of content of the training data and the weight of each feature to determine the hierarchical structure of the training data. The information processing apparatus updates the weight of each feature so that the degree of agreement between a pair combined first as being the same clusters among three contents of the triplet in a determined hierarchical structure and a pair indicated by label information corresponding to the triplet increases.
    Type: Application
    Filed: September 6, 2012
    Publication date: December 27, 2012
    Applicant: International Business Machines Corporation
    Inventors: Toru Nagano, Masafumi Nishimura, Takashima Ryoichi, Ryuki Tachibana
  • Publication number: 20120323916
    Abstract: A method and system for document clustering. The method includes: extracting text feature information of the documents, establishing a social network based on information related to the documents, performing graph clustering based on the social network to obtain a structural sub-set, extracting structural feature information of the structural sub-set, and performing clustering on the documents based on the text feature information and the structural feature information.
    Type: Application
    Filed: June 14, 2012
    Publication date: December 20, 2012
    Applicant: International Business Machines Corporation
    Inventors: Ju Wei Shi, Wen Jie Wang, Wei Xue, Bo Yang
  • Publication number: 20120323917
    Abstract: Grouping media files via playlists on a computer-readable medium. One or more media files are selected according to a grouping criterion to define one or more playlists from the media files. A container group is associated with the playlists and stores values identifying each of the playlists associated with the container group along with references to each of the playlists.
    Type: Application
    Filed: August 29, 2012
    Publication date: December 20, 2012
    Applicants: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD., MICROSOFT CORPORATION
    Inventors: Ian Cameron Mercer, Kevin Leigh LaChapelle, Harutoshi Miyamoto, Yoshifumi Yanagawa, Nobuyasu Takeguchi, Chiyoko Matsumi
  • Publication number: 20120323918
    Abstract: A method and system for document clustering. The method includes: extracting text feature information of the documents, establishing a social network based on information related to the documents, performing graph clustering based on the social network to obtain a structural sub-set, extracting structural feature information of the structural sub-set, and performing clustering on the documents based on the text feature information and the structural feature information.
    Type: Application
    Filed: August 30, 2012
    Publication date: December 20, 2012
    Applicant: International Business Machines Corporation
    Inventors: Ju Wei Shi, Wen Jie Wang, Wei Xue, Bo Yang
  • Publication number: 20120317119
    Abstract: Product line type development is supported by analyzing inter-feature dependency relations based on the use state of features in existing products and using the analysis result. Inter-feature linking means 15 analyzes the inter-feature dependency relations in the existing products based on a product-feature related information database 14. A related information management unit 16 compares the analysis result of the inter-feature dependency relations to existing information stored in an inter-feature related information database 17, and outputs the result to an interface unit 18 and the inter-feature related information database. A developer can easily consult the analysis result.
    Type: Application
    Filed: February 10, 2010
    Publication date: December 13, 2012
    Applicant: Hitachi, Ltd.
    Inventors: Takeshi Fukuda, Kentaro Yoshimura, Yoshitaka Atarashi
  • Publication number: 20120317091
    Abstract: Most valuable newly updated information in current social network systems and web search engines becomes useless before users can search it out in time. The inventive system uses a communication mechanism and a preset query mechanism to help users get new updates promptly. System components include: a web crawler, a filter, and a classifier to mine public newly updated web pages from the web; a database to store the various new updates; an integrator to collect new updates from other sources; a controller to send new updates to the corresponding users; a query box to store each user's preset queries and manage users' preferences; and a communication platform to help users easily get preset queries. The method can help current social network systems and web search engines make better use of their newly updated data. Web search engines can better cooperate with social networks by using the method.
    Type: Application
    Filed: June 9, 2011
    Publication date: December 13, 2012
    Inventor: Luping Li
  • Publication number: 20120317116
    Abstract: An apparatus generates configuration group information by classifying, based on first log information storing messages outputted by a first plurality of configuration items of a first system, the first plurality of configuration items into first configuration groups each including one or more configuration items that have outputted messages having a commonality. The apparatus generates relation class information that defines, in association with the first configuration groups, first one or more message propagation relations.
    Type: Application
    Filed: May 29, 2012
    Publication date: December 13, 2012
    Applicant: FUJITSU LIMITED
    Inventors: Masataka SONODA, Yasuhide Matsumoto, Yukihiro Watanabe
  • Publication number: 20120310936
    Abstract: A processing method for duplicated data includes the following steps. A stored file is partitioned into a plurality of raw tanks and a plurality of meta tanks, in which the raw tanks correspond to the meta tanks in a one to one manner, and each meta tank has a stored fingerprint value of the corresponding raw tank. A duplicated data determination request is received, in which the duplicated data determination request includes a requested fingerprint value. At least one of the meta tanks is read, and the requested fingerprint value is compared with the stored fingerprint value of the read meta tank. A referred counter value of the read meta tank is modified, and the modified meta tank is stored back, when the requested fingerprint value is the same as the stored fingerprint value of the read meta tank.
    Type: Application
    Filed: September 22, 2011
    Publication date: December 6, 2012
    Applicant: INVENTEC CORPORATION
    Inventors: Ming-Sheng Zhu, Chih-Feng Chen
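A minimal sketch of the duplicate check described in the abstract above, assuming in-memory lists stand in for the stored file. The class and method names are hypothetical, and SHA-256 is an assumed fingerprint function; the abstract names neither.

```python
import hashlib
from dataclasses import dataclass

@dataclass
class MetaTank:
    fingerprint: str    # stored fingerprint of the paired raw tank
    ref_count: int = 1  # referred counter for that raw tank

def fingerprint(data: bytes) -> str:
    # SHA-256 is an assumption; the abstract does not name a hash.
    return hashlib.sha256(data).hexdigest()

class TankStore:
    def __init__(self):
        self.raw_tanks = []   # raw data blocks
        self.meta_tanks = []  # one MetaTank per raw tank, same index

    def is_duplicate(self, requested_fp: str) -> bool:
        """Handle a duplicated data determination request."""
        for meta in self.meta_tanks:
            if meta.fingerprint == requested_fp:
                meta.ref_count += 1  # modify the counter of the matching tank
                return True
        return False

    def write(self, data: bytes) -> int:
        """Store data only if no tank holds the same fingerprint."""
        fp = fingerprint(data)
        if not self.is_duplicate(fp):
            self.raw_tanks.append(data)
            self.meta_tanks.append(MetaTank(fp))
        return len(self.raw_tanks)

store = TankStore()
store.write(b"block A")
store.write(b"block B")
store.write(b"block A")  # duplicate: only the counter changes
```

Keeping the fingerprints in the meta tanks means a request can be answered without reading the raw data itself, which is the point of the one-to-one pairing.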
  • Publication number: 20120310935
    Abstract: Methods and systems of integrated batching and random sampling of documents for enhanced functionality and quality control, such as validation, within a document review process are provided herein. According to various embodiments, a batching request may be received and may include a population size that corresponds to a total amount of documents available for sampling. The batching request may also include an acceptable margin of error. A random sample size may be calculated based on the batching request, and then a subset of documents corresponding to the random sample size may be selected from the total amount of documents available for sampling. The subset of documents may be grouped into one or more batches, and the one or more batches may be assigned to one or more review nodes.
    Type: Application
    Filed: June 4, 2011
    Publication date: December 6, 2012
    Inventor: Jan Puzicha
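One way to picture the sample-size step in the abstract above is the standard sample-size formula for a proportion with a finite population correction. This is an assumed formula, not one stated in the abstract; the 95% confidence level (z = 1.96), the worst-case proportion p = 0.5, and all function names are likewise illustrative.

```python
import math
import random

def sample_size(population, margin_of_error, z=1.96, p=0.5):
    """Sample size for a proportion, corrected for a finite population."""
    n0 = (z ** 2) * p * (1 - p) / margin_of_error ** 2  # infinite-population size
    n = n0 / (1 + (n0 - 1) / population)                # finite population correction
    return min(population, math.ceil(n))

def make_batches(doc_ids, margin_of_error, batch_size, seed=None):
    """Draw a random sample of documents and split it into review batches."""
    rng = random.Random(seed)
    n = sample_size(len(doc_ids), margin_of_error)
    chosen = rng.sample(doc_ids, n)  # sampling without replacement
    return [chosen[i:i + batch_size] for i in range(0, n, batch_size)]

# 10,000 documents, 5% margin of error, batches of up to 100 documents
batches = make_batches(list(range(10_000)), 0.05, batch_size=100, seed=0)
```

Each resulting batch could then be assigned to a review node; how that assignment is made is outside this sketch.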
  • Publication number: 20120310943
    Abstract: A system and method are provided that give an early indication of consensus of opinion among a number of users regarding an event or observation indicated by a user. Such an opinion may be interesting to an information consumer, who may be interested in determining the outcome of the consensus relating to the event or observation, or may otherwise desire to perform surveillance or survey of a particular issue or subject. Such recognition of early events or observations may be useful in different areas, such as healthcare, finance, etc., where initial observations, if provided early, allow resulting decisions to be made much earlier. The opinion may, for instance, be used as an early indicator of a problem with a product, company, etc. that would permit an information consumer to perform an action at a much earlier point than if he/she relied on traditional sources of information. Opinion information may be invaluable as a tool for monitoring events.
    Type: Application
    Filed: April 16, 2012
    Publication date: December 6, 2012
    Inventors: Daniel Palestrant, Graham Gardner
  • Publication number: 20120310942
    Abstract: A method for queuing conference participants by category includes, with a physical computing system, receiving requests from a number of conference registrants to attend a conference, with the physical computing system, placing each of the registrants into a number of queues based on a category assigned to the registrants, and with the physical computing system, allowing a number of the registrants from the queues to attend the conference such that the conference comprises a number of participants from each of the queues so as to meet predefined criteria.
    Type: Application
    Filed: June 3, 2011
    Publication date: December 6, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Thomas Richard Haynes, Elizabeth Vera Woodward
  • Publication number: 20120310944
    Abstract: A boundary word identification unit (103) identifies a boundary word belonging to a plurality of categories among words gathered in dictionary growth processing. Then, a category membership degree calculation unit (104) calculates, for each category to which the boundary word belongs, a category membership degree indicating a degree to which the boundary word belongs to the category on the basis of information recorded in a gathering process memory unit (108). Next, a category update unit (105) determines the category to which the boundary word belongs on the basis of the category membership degree calculated by the category membership degree calculation unit (104) and updates information stored in a gathered-by-category word memory unit (109) so that the determination result is reflected.
    Type: Application
    Filed: December 3, 2010
    Publication date: December 6, 2012
    Applicant: NEC CORPORATION
    Inventors: Hironori Mizuguchi, Yukitaka Kusumura, Dai Kusui
  • Publication number: 20120303624
    Abstract: Embodiments are directed to generating a customized classification rule execution order and to identifying optimal ordering rules for previously processed data. In an embodiment, a computer system fingerprints a message received via a computer network. The fingerprinting identifies specific characteristics of the message. The computer system compares the message's fingerprint to various stored message fingerprints generated from previously received messages. The comparison determines that the fingerprint does not match the stored fingerprints. The computer system applies classification rules to the message according to a predetermined rule execution order to determine a classification for the message. The computer system then generates a customized classification rule execution order to order those classification rules that optimally identified the message's class at the top of the customized classification rule execution order.
    Type: Application
    Filed: May 25, 2011
    Publication date: November 29, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Mauktik H. Gandhi, Shashank Kavishwar, Charles W. Lamanna
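The flow in the abstract above can be sketched as a classifier that remembers, per message fingerprint, which rule matched and tries that rule first next time. The fingerprint fields, the rule tuples, and the class name are hypothetical; the abstract does not say which message characteristics are fingerprinted or how rules are represented.

```python
def fingerprint(message):
    # Hypothetical fingerprint: a few coarse traits of the message.
    return (message["sender_domain"], message["has_attachment"])

class Classifier:
    def __init__(self, rules):
        self.default_order = list(rules)  # [(name, predicate, label), ...]
        self.custom_orders = {}           # fingerprint -> reordered rules

    def classify(self, message):
        fp = fingerprint(message)
        # Fall back to the predetermined order when the fingerprint is new.
        order = self.custom_orders.get(fp, self.default_order)
        for idx, (name, predicate, label) in enumerate(order):
            if predicate(message):
                # Put the rule that matched at the top for this fingerprint.
                self.custom_orders[fp] = (
                    [order[idx]] + order[:idx] + order[idx + 1:]
                )
                return label, idx + 1  # label and number of rules evaluated
        return "unclassified", len(order)

rules = [
    ("spam_words", lambda m: "lottery" in m["body"], "spam"),
    ("attachment", lambda m: m["has_attachment"], "review"),
    ("newsletter", lambda m: m["sender_domain"] == "news.example", "bulk"),
]
msg = {"sender_domain": "news.example", "has_attachment": False,
       "body": "weekly digest"}

clf = Classifier(rules)
first = clf.classify(msg)   # no stored fingerprint: default order is used
second = clf.classify(msg)  # stored fingerprint: matching rule is tried first
```

The second call reaches the same label after evaluating fewer rules, which is the saving the customized order is meant to provide.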
  • Publication number: 20120303618
    Abstract: Data representing capabilities of devices in a data center is aggregated on a cluster basis. Information representing capability attributes of devices in the data center is received. The information representing the capability attributes is analyzed to generate data that groups devices based on similarity of at least one capability attribute. Aggregation data is stored that represents the grouping of the devices based on similarity of the at least one capability attribute and identifies the devices in corresponding groups.
    Type: Application
    Filed: May 23, 2011
    Publication date: November 29, 2012
    Applicant: CISCO TECHNOLOGY, INC.
    Inventors: Debojyoti Dutta, Subrata Banerjee, Ethan M. Spiegel, Arpan K. Ghosh
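As a simplified illustration of the grouping in the abstract above, the sketch below treats two devices as similar when they report the same value for one capability attribute. Exact equality is an assumption standing in for whatever similarity measure the application uses, and the device records and field names are hypothetical.

```python
from collections import defaultdict

def aggregate_by_capability(devices, attribute):
    """Group device ids by the value of one capability attribute."""
    groups = defaultdict(list)
    for device in devices:
        groups[device[attribute]].append(device["id"])
    return dict(groups)  # aggregation data: attribute value -> device ids

# Hypothetical capability records reported by data center devices
devices = [
    {"id": "sw-1", "type": "switch", "bandwidth_gbps": 10},
    {"id": "sw-2", "type": "switch", "bandwidth_gbps": 40},
    {"id": "srv-1", "type": "server", "bandwidth_gbps": 10},
]
by_bandwidth = aggregate_by_capability(devices, "bandwidth_gbps")
by_type = aggregate_by_capability(devices, "type")
```

The stored result both represents the grouping and identifies the devices in each group, so a query for a capability can be answered from the aggregate without polling each device.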