Clustering Or Classification (epo) Patents (Class 707/E17.089)
  • Publication number: 20110208730
    Abstract: A model generated from search log data predicts a hidden state based on a query to determine a context of the query, such as for providing re-ranked search results, query suggestions and/or URL recommendations.
    Type: Application
    Filed: February 23, 2010
    Publication date: August 25, 2011
    Applicant: Microsoft Corporation
    Inventors: Daxin Jiang, Hang Li
  • Publication number: 20110208429
    Abstract: Techniques for providing a route based on route-oriented vehicle trajectories are described. This disclosure describes receiving GPS logs and extracting route-oriented vehicle trajectory content from the GPS log data to pertain to a single trip. Next, the process maps each route-oriented vehicle trajectory to a corresponding road segment to construct a landmark graph. A landmark is a road segment frequently visited by route-oriented vehicles. The process includes receiving a user query with a starting point and a destination point; searching the landmark graph for a sequence of landmarks with corresponding transition times and a least amount of travel time. Then the process identifies and connects sets of road segments between each pair of consecutive landmarks, and displays a route to a user with a nearest landmark to the starting point, other landmarks along the route, and another nearest landmark to the destination point.
    Type: Application
    Filed: February 24, 2010
    Publication date: August 25, 2011
    Applicant: Microsoft Corporation
    Inventors: Yu Zheng, Yin Lou, Chengyang Zhang, Xing Xie
  • Publication number: 20110208740
    Abstract: Techniques for organizing information and/or contacts using emotient properties are provided. A method can include associating respective emotient attributes with contacts utilizing respective index values, and grouping at least two contacts of the contacts into a group based on at least one index value of the index values. The method can further include defining an event by at least one of a time interval or a scenario related to the time interval; receiving, during the event, a message from a contact of the contacts including information associated with the contact; and determining, based on the information, a membership of the contact in the group. Another method can include receiving an emotient property of a contact associated with information, and associating the emotient property with the information utilizing an index value. Further, the method can include providing at least a portion of the information based on the index value.
    Type: Application
    Filed: May 4, 2011
    Publication date: August 25, 2011
    Applicant: LIANG HOLDINGS, LLC
    Inventor: Thomas W. Lynch
  • Publication number: 20110208741
    Abstract: A continuous, emergent, anytime process clusters input documents according to a similarity function within a node-based, distributed computing environment, for example, within a client/server environment. An agent (DAg) assigned to each document determines whether its document should remain at a node or be moved to another node to increase similarity clustering. An agent (SAg) assigned to each node may be operative to manage storage requirements within its node, and/or manage communications between the nodes of the environment as the DAgs operate. Typically a move request is issued to another node if it is determined that clustering would increase by moving a document to that node. In such an instance, the SAg assigned to that other node would probabilistically consider the move request in view of other such requests in sequence to avoid overloading. To enhance performance, documents may be preprocessed and given values representative of similarity.
    Type: Application
    Filed: January 10, 2011
    Publication date: August 25, 2011
    Inventor: SVEN BRUECKNER
  • Publication number: 20110208739
    Abstract: In accordance with embodiments, there are provided mechanisms and methods for conditionally performing a query including an aggregate function. These mechanisms and methods for conditionally performing a query including an aggregate function can limit performance of queries including aggregate functions based on a number of records associated with such performance of such aggregate functions. The ability to limit performance of queries including aggregate functions can enable performance quality of a computer system to be maintained.
    Type: Application
    Filed: February 25, 2011
    Publication date: August 25, 2011
    Applicant: salesforce.com, inc.
    Inventor: Craig Weissman
  • Publication number: 20110208738
    Abstract: A method for associating sparse keywords with non-sparse keywords. The method comprises determining from metrics of a plurality of keywords a list of sparse keywords and non-sparse keywords; generating a similarity score for each sparse keyword with respect of each non-sparse keyword; associating a sparse keyword with a non-sparse keyword; and storing the association between the non-sparse keyword and the sparse keyword in a database.
    Type: Application
    Filed: February 22, 2011
    Publication date: August 25, 2011
    Applicant: KENSHOO LTD.
    Inventors: Amir Bar, Michael Aronowich, Nir Cohen, Gilad Armon-Kest, Shahar Siegman
  • Publication number: 20110202532
    Abstract: A problem in the related art is that, because information on a certain topic is scattered, the information cannot be shared efficiently. An information sharing system includes a specified section linguistic analysis element that performs a linguistic analysis on a specified section text and outputs linguistic analysis information, a specified section topic generation element that generates topic information from the linguistic analysis information, where the topic information is a topic of the specified section text, and a bulletin board management element that refers to a bulletin board information storage unit and, if address information of a bulletin board corresponding to the topic information is obtained, outputs the address information or a set of the topic information and the address information as corresponding bulletin board information.
    Type: Application
    Filed: August 8, 2008
    Publication date: August 18, 2011
    Applicant: NEC CORPORATION
    Inventors: Satoshi Nakazawa, Takahiro Ikeda, Yoshihiro Ikeda, Kunihiko Sadamasa, Takao Kawai
  • Publication number: 20110202530
    Abstract: An information processing device includes an obtaining unit that obtains a plurality of contents to which labels indicating users' subjective evaluation of the contents are assigned as metadata, a selection unit that selects labels having a high reliability in regards to evaluation of the contents among the labels assigned to the plurality of contents obtained by the obtaining unit, a calculation unit that calculates a degree of similarity between the labels selected by the selection unit, a clustering unit that clusters the labels based on the degree of similarity calculated by the calculation unit, and a storage unit that stores a cluster obtained as a result of the clustering in the clustering unit, as one label.
    Type: Application
    Filed: February 4, 2011
    Publication date: August 18, 2011
    Applicant: Sony Corporation
    Inventor: Mari Saito
  • Publication number: 20110202529
    Abstract: This document discusses, among other things, a method for reconciliation of a configuration item with a configuration management database. Properties of the configuration item are divided into a plurality of classes. Different classes correspond to properties having a different relationship with a corresponding configuration item. At least one property of the configuration item is compared to properties of configuration items in a configuration management database. Different actions are taken with respect to the configuration item based on the class of the property being compared.
    Type: Application
    Filed: February 16, 2010
    Publication date: August 18, 2011
    Applicant: Computer Associates Think, Inc.
    Inventor: Marvin Garold Waschke
  • Publication number: 20110202535
    Abstract: A method of identifying a provenance of a document is provided. The method may include obtaining a query document that is included in a document set comprising a plurality of documents. The method may also include grouping the plurality of documents into a plurality of fine clusters based on a textual similarity between the plurality of documents. The method may also include identifying a target fine cluster within the plurality of fine clusters, the target fine cluster including the query document. The method may also include ordering the documents included in the target fine cluster based, at least in part, on metadata associated with each of the documents to identify a source document. The method may also include generating a query response that includes the source document.
    Type: Application
    Filed: February 13, 2010
    Publication date: August 18, 2011
    Inventors: Vinay Deolalikar, Hernan Laffitte
  • Publication number: 20110196869
    Abstract: Storage of data segments is disclosed. For each segment, a similar segment to the segment is identified, wherein the similar segment is already managed by a cluster node. In the event the similar segment is identified, a reference to the similar segment and a delta between the similar segment and the segment are caused to be stored instead of the segment.
    Type: Application
    Filed: April 19, 2011
    Publication date: August 11, 2011
    Applicant: EMC CORPORATION
    Inventors: R. Hugo Patterson, Kai Li, Ming Benjamin Zhu, Sazzala Venkata Reddy, Umesh Maheshwari, Edward K. Lee
  • Publication number: 20110196857
    Abstract: Techniques for generating a set of one or more materialized query table (MQT) candidates for a workload are provided. The techniques include receiving a workload, wherein the workload comprises a set of one or more queries, generating one or more best matching MQTs (BMQTs) based on one or more query blocks of the one or more queries by removing syntax that is not qualified for a MQT re-write, determining one or more frequently used multi-joins in the workload, using the one or more BMQTs and the one or more frequently used multi-joins to generate a set of one or more workload MQTs (WMQTs), and grouping one or more WMQTs and one or more BMQTs into one or more groups to merge into a set of a smaller number of MQTs and to cover the workload.
    Type: Application
    Filed: February 9, 2010
    Publication date: August 11, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Dong Sheng Chen, Hong Min, Terence P. Purcell, Yefim Shuf, Xiao Bo Wang, Zhong Liang Zhang
  • Publication number: 20110196866
    Abstract: Methods and apparatus are described for partitioning native tables in a database cluster into logical tables. Each logical table is mapped into a unique portion of the native table by an intermediary server. Clients access a logical table as an ordinary, full-fledged database table through the intermediary server, which translates queries on the logical table into queries on the corresponding portion of the native table. The mapping may use the application name, logical table name, and a version number to create a native table key for each key in the logical table. A data structure storing these mappings may be stored at the intermediary server or in a native table in the database. This approach affords clients quick and flexible access to the database with better data integrity and security than native tables allow.
    Type: Application
    Filed: February 9, 2010
    Publication date: August 11, 2011
    Applicant: YAHOO! INC.
    Inventor: Brian Frank Cooper
  • Publication number: 20110196879
    Abstract: A system and method for propagating classification decisions is provided. Text marked within one or more unclassified documents that is determined to be responsive to a predetermined issue is received from a user. The unclassified documents are selected from a corpus. A search query is generated from the responsive text. Same result documents are identified by applying inclusive search parameters to the query, applying the search query to the corpus, and identifying the documents that satisfy the query. Similar result documents are identified by adjusting a breadth of the query by applying less inclusive search parameters and identifying documents from the corpus that satisfy the query. A responsive classification code is automatically assigned to each same result document for classification as responsive documents. The similar documents are provided to the user. A responsive classification decision is received from the user for classification as the responsive documents.
    Type: Application
    Filed: February 4, 2011
    Publication date: August 11, 2011
    Inventors: Eric Michael Robinson, Manfred J. Gabriel
  • Publication number: 20110196868
    Abstract: Methods and apparatus for the convenient arrangement of a user's address book according to intelligent algorithms. These intelligent algorithms, in one embodiment, take advantage of one or more of: (i) stored contact information associated with one or more users, (ii) stored geographic location information associated with the users and one or more contact entries in the user's address book, and/or (iii) stored voice and data communication information associated with the user. This algorithm arranges the entries in the user's address book, using the stored information as an input, in an intelligent manner. In other embodiments, additional information is used as an input to the contact entry arranging algorithms such as, for example, entries in a user's digital calendar. Business methods utilizing the aforementioned methods and apparatus are also disclosed.
    Type: Application
    Filed: February 11, 2010
    Publication date: August 11, 2011
    Inventors: Martin Hans, Andreas Schmidt
  • Publication number: 20110196872
    Abstract: A computational method and system for the comparison and analysis of different objects of information within a database or collection. All objects are compared in a pair-wise fashion so the relative similarity between each object and every other object in the collection is known. A generalized alignment-free method for comparing whole genome (coding and non-coding) DNA sequences is described and is used to investigate the relationship among placental mammalian genomes. Differences in word feature frequency profiles (FFP) are used to derive distance and infer evolutionary relationships.
    Type: Application
    Filed: October 9, 2009
    Publication date: August 11, 2011
    Applicant: THE REGENTS OF THE UNIVERSITY OF CALIFORNIA
    Inventors: Gregory E. Sims, Sung-Hou Kim
  • Publication number: 20110191020
    Abstract: In map data, a road corresponds to a multilink defined as links connected consecutively with an identical attribute. The map data contains a road management information list, link information list, and coordinate information list of a real data list. In the road management information list, fixed-length road management information elements, each of which indicates the number of links in each multilink, are arrayed in an order. In the link information list, fixed-length link information elements, each of which indicates the number of coordinate points in each link, are arrayed in an order in which corresponding road management information elements are arrayed in the road management information list. In the coordinate information list, fixed-length coordinates information elements, each of which indicates coordinate points arranged in one link to illustrate a shape of the link, are arrayed in an order in which the coordinate points are arranged in the link.
    Type: Application
    Filed: January 26, 2011
    Publication date: August 4, 2011
    Applicant: DENSO CORPORATION
    Inventor: Takayuki MATSUNAGA
  • Publication number: 20110191342
    Abstract: A URL reputation system may have a reputation server and a client device with a cache of reputation information. A URL reputation query from the client to the server may return reputation data along with probabilistic set membership information for several variants of the requested URL. The client may use the probabilistic set membership information to determine if the reputation server has additional information for another related URL as well as whether the classifications are inheritable from one of the variants. If the probabilistic set membership determines that the reputation server may have additional information, a query may be made to the reputation server, otherwise the reputation may be inferred from the data stored in the cache.
    Type: Application
    Filed: February 1, 2010
    Publication date: August 4, 2011
    Applicant: Microsoft Corporation
    Inventors: Jason COHEN, Benjamin Arai, Craig Boucher, Nicholas Waggoner, Jose Marcos de Oliveira, Yun Lin
  • Publication number: 20110184950
    Abstract: A system and method for assisting a user in navigation of an image dataset are disclosed. The method includes receiving a user's text query, retrieving images responsive to the query from an image dataset, providing for receiving the user's selection of a first feature selected from a set of available first features via a graphical user interface, providing for receiving the user's selection of a second feature selected from a set of available second features different from the first features via the graphical user interface, and displaying at least some of the retrieved images on the graphical user interface. The displayed images are arranged, e.g., grouped, according to levels and/or combinations of levels of the user-selected first and second features.
    Type: Application
    Filed: January 26, 2010
    Publication date: July 28, 2011
    Applicant: Xerox Corporation
    Inventors: Sandra SKAFF, Luca Marchesotti, Tommaso Colombino, Ana Fucs, Gabriela Csurka, Yanal Wazaefi, Marco Bressan
  • Publication number: 20110184948
    Abstract: A music recommendation method and a computer readable recording medium storing a computer program performing the method are provided. In the music recommendation method, music items and a rating data matrix comprising ratings and user IDs are first provided. Then, the ratings of each music item are classified into positive ratings and negative ratings. Thereafter, a pre-processing phase comprising a frame-based clustering step and a sequence-based clustering step is performed to transform the music items into perceptual patterns. Then, a prediction phase is performed to determine an interest value of a plurality of target music items for an active user. Thereafter, the target music items are arranged into a music recommendation list in accordance with the first interest value and the second interest values, wherein the music recommendation list is a reference for the active user to select one of the target items.
    Type: Application
    Filed: January 22, 2010
    Publication date: July 28, 2011
    Applicant: NATIONAL CHENG KUNG UNIVERSITY
    Inventors: Shin-Mu TSENG, Ja-Hwung SU, Hsin-Ho YEH
  • Publication number: 20110184951
    Abstract: Methods and computer-readable media are provided for determining suggested queries. A user enters a search website, and the user is identified based on a user identification. Suggested queries are determined based on a group associated with the user. This association is created by extracting queries from data logs, categorizing the queries into groups based on their respective subject matter, associating the user with one or more groups, and determining suggested queries for each group. The suggested queries are communicated for display.
    Type: Application
    Filed: January 28, 2010
    Publication date: July 28, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: STELIOS PAPARIZOS, CHRIS ANDERSON, JANINE CRUMB, David James GEMMELL, AJAY NAIR, GENNADII TERTYCHNYI, AN YAN
  • Publication number: 20110184952
    Abstract: According to embodiments of the subject matter disclosed in this application, a large audio database in a multiprocessor system may be searched for a target audio clip using a robust and parallel search method. The large audio database may be partitioned into a number of smaller groups, which are dynamically scheduled to available processors in the system. Processors may process the scheduled groups in parallel by partitioning each group into smaller segments, extracting acoustic features from the segments; and modeling the segments using a common component Gaussian Mixture model (“CCGMM”). One processor may also extract acoustic features from the target audio clip and model it using the CCGMM. Kullback-Leibler (KL) distance may be further computed between the target audio clip and each segment. Based on the KL distance, a segment may be determined to match the target audio clip; and/or a number of following segments may be skipped.
    Type: Application
    Filed: February 1, 2011
    Publication date: July 28, 2011
    Inventor: Yurong Chen
  • Publication number: 20110178846
    Abstract: The present invention improves upon existing systems and methods by providing a passive profile creation method. The data accessible to a financial processor, such as spend level data, is leveraged using sophisticated data clustering and/or data appending techniques. Associations are established among entities (e.g., consumers), among merchants, and between entities and merchants. In one embodiment, a system and method for passively collecting spend level data for a transaction of a first entity, aggregating the collected spend level data for a plurality of entities; and clustering the first entity with a subset of the plurality of entities, based on aggregated spend level data of the first entity is provided.
    Type: Application
    Filed: January 20, 2010
    Publication date: July 21, 2011
    Applicant: American Express Travel Related Services Company, Inc.
    Inventors: Rajendra R. Rane, Melissa Schwartz
  • Publication number: 20110178848
    Abstract: The present invention improves upon existing systems and methods by providing a passive profile creation method. The data accessible to a financial processor, such as spend level data, is leveraged using sophisticated data clustering and/or data appending techniques. Associations are established among entities (e.g., consumers), among merchants, and between entities and merchants. In one embodiment, a system and method for passively collecting spend level data for a transaction of a first entity, aggregating the collected spend level data for a plurality of entities; and clustering the first entity with a subset of the plurality of entities, based on aggregated spend level data of the first entity is provided.
    Type: Application
    Filed: January 20, 2010
    Publication date: July 21, 2011
    Applicant: American Express Travel Related Services Company, Inc.
    Inventors: Rajendra R. Rane, Melissa Schwartz
  • Publication number: 20110179036
    Abstract: Systems and methods are provided for creating abstracted, normalized, and reuseable and combinable representations of information contained in multiple documents and information of any supported format, and allowing for exporting of information in any other desired and supported format. Further the system and methods provide for uploading documents based on a known template, where the data members can be automatically recognized and the document stored in normalized format without end-user or developer intervention. Normalization of data is achieved transparently on upload and denormalization performed transparently on download. Further, embodiments provide for the reuse and recombination of data members to create entirely new representations.
    Type: Application
    Filed: December 16, 2010
    Publication date: July 21, 2011
    Inventors: Jason Townes French, Auston John Stewart
  • Publication number: 20110179034
    Abstract: An information processor carrying out statistical natural language processing for a document, the information processor includes a characteristic amount extraction unit configured to detect context information including a proper noun pair from the document and extract a characteristic amount of the detected context information; a characteristic amount analysis unit configured to, by analyzing the characteristic amount of the extracted context information using a probability model in which a document topic meaning an entire topic of the document and a context topic meaning a local topic of the document are considered, estimate a potential variable and a context topic ratio in the probability model; and a clustering unit configured to cluster the proper noun pair included in the context information based on the context topic ratio estimated regarding the characteristic amount of the respective context information.
    Type: Application
    Filed: January 13, 2011
    Publication date: July 21, 2011
    Applicant: Sony Corporation
    Inventor: Shingo Takamatsu
  • Publication number: 20110179032
    Abstract: A Natural Language Understanding system is provided for indexing of free text documents. The system according to the invention utilizes typographical and functional segmentation of text to identify those portions of free text that carry meaning. The system then uses words and multi-word terms and phrases identified in the free text to identify concepts in the free text. The system uses a lexicon of terms linked to a formal ontology that is independent of a specific language to extract concepts from the free text based on the words and multi-word terms in the free text. The formal ontology contains both language independent domain knowledge concepts and language dependent linguistic concepts that govern the relationships between concepts and contain the rules about how language works. The system according to the current invention may preferably be used to index medical documents and assign codes from independent coding systems, such as SNOMED, ICD-9 and ICD-10.
    Type: Application
    Filed: March 1, 2011
    Publication date: July 21, 2011
    Applicant: Nuance Communications, Inc.
    Inventors: Werner Ceusters, Mick O'Donnell, Frank Montyne, Frederik Coppens, Maarten Van Mol
  • Publication number: 20110178842
    Abstract: The present invention improves upon existing systems and methods by providing a passive profile creation method. The data accessible to a financial processor, such as spend level data, is leveraged using sophisticated data clustering and/or data appending techniques. Associations are established among entities (e.g., consumers), among merchants, and between entities and merchants. In one embodiment, a system and method for passively collecting spend level data for a transaction of a first entity, aggregating the collected spend level data for a plurality of entities; and clustering the first entity with a subset of the plurality of entities, based on aggregated spend level data of the first entity is provided.
    Type: Application
    Filed: January 20, 2010
    Publication date: July 21, 2011
    Applicant: American Express Travel Related Services Company, Inc.
    Inventors: Rajendra R. Rane, Melissa Schwartz
  • Publication number: 20110179030
    Abstract: A method for indexing a suffix tree in a social network includes: scanning an input string and dividing the string into partitions each having a common prefix; performing no-merge suffix tree indexing on the divided partitions; storing information on the partitions on which no-merge suffix tree indexing is performed; storing suffix nodes of the no-merge suffix tree; and establishing a prefix tree. The performing no-merge suffix tree indexing includes: generating a set of suffixes having the common prefix in the input string; generating a suffix set from the set of suffixes and storing the suffix set; and building the suffix set as a sub-tree.
    Type: Application
    Filed: December 2, 2010
    Publication date: July 21, 2011
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Jong-Hoon LEE, Young Ho PARK, Hoo Young AHN, Jung Tae KIM, Hoon Ki LEE, EUIHYUN PAIK
  • Publication number: 20110179029
    Abstract: An experience information processing apparatus for a social networking service includes an ontology unit for providing a social ontology including social connection information and location information of a user and a service ontology including web service information, service location information and tag information. Further, the experience information processing apparatus includes an experience information management unit for extracting experience information content having location information from a plurality of mobile devices, classifying the extracted experience information content using the ontology unit to establish an experience information database, and searching the established experience information database based on the location information in response to a request from the mobile device to provide a social media service by linking the social connection information, the location information, and the tag information.
    Type: Application
    Filed: November 23, 2010
    Publication date: July 21, 2011
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Jung Tae KIM, Jong-Hoon LEE, Hoon Ki LEE, Euihyun PAIK
  • Publication number: 20110178855
    Abstract: The present invention improves upon existing systems and methods by providing a passive profile creation method. The data accessible to a financial processor, such as spend level data, is leveraged using sophisticated data clustering and/or data appending techniques. Associations are established among entities (e.g., consumers), among merchants, and between entities and merchants. In one embodiment, a system and method for passively collecting spend level data for a transaction of a first entity, aggregating the collected spend level data for a plurality of entities; and clustering the first entity with a subset of the plurality of entities, based on aggregated spend level data of the first entity is provided.
    Type: Application
    Filed: January 20, 2010
    Publication date: July 21, 2011
    Applicant: American Express Travel Related Services Company, Inc.
    Inventors: Rajendra R. Rane, Melissa Schwartz
  • Publication number: 20110179031
    Abstract: A configuration information management device includes a configuration information storage unit for storing a configuration item indicative of information about a target of management, and an item relationship indicative of information about a connection between configuration items independently of a different configuration information management device. When a request to enter a cluster is accepted that is a group of a configuration item and an item relationship connected together, the configuration information management device determines a destination to store the cluster, and controls to cause the configuration information storage unit or the different configuration information management device to store the cluster.
    Type: Application
    Filed: January 6, 2011
    Publication date: July 21, 2011
    Applicant: FUJITSU LIMITED
    Inventors: Atsuji Sekiguchi, Yuji Wada, Masazumi Matsubara
  • Publication number: 20110178845
    Abstract: The present invention improves upon existing systems and methods by providing a passive profile creation method. The data accessible to a financial processor, such as spend level data, is leveraged using sophisticated data clustering and/or data appending techniques. Associations are established among entities (e.g., consumers), among merchants, and between entities and merchants. In one embodiment, a system and method for passively collecting spend level data for a transaction of a first entity, aggregating the collected spend level data for a plurality of entities; and clustering the first entity with a subset of the plurality of entities, based on aggregated spend level data of the first entity is provided.
    Type: Application
    Filed: January 20, 2010
    Publication date: July 21, 2011
    Applicant: American Express Travel Related Services Company, Inc.
    Inventors: Rajendra R. Rane, Melissa Schwartz
  • Publication number: 20110172874
    Abstract: A vehicle fault diagnosis and prognosis system includes a computing platform configured to receive a classifier from a remote server, the computing platform tangibly embodying computer-executable instructions for evaluating data sequences received from a vehicle control network and applying the classifier to the data sequences, wherein the classifier is configured to determine if the data sequences define a pattern that is associated with a particular fault.
    Type: Application
    Filed: January 13, 2010
    Publication date: July 14, 2011
    Applicant: GM GLOBAL TECHNOLOGY OPERATIONS, INC.
    Inventors: Debprakash Patnaik, Pulak Bandyopadhyay, Steven W. Holland, Kootaala P. Unnikrishnan, George Paul Montgomery, JR.
  • Publication number: 20110173198
    Abstract: Embodiments are directed towards determining dependent interest affinity values between users to identify users that may mirror interests and thereby have an increased probability of becoming friends. A plurality of tracked online activities are classified into a plurality of interests categories, and used to determine weighted scores for each interest based on a quantity and quality of related activities for the interest. A proportional score for each interest is also determined and used with the weighted scores to generate dependent interest affinities between pairs of users. Interest indices are obtained and rank ordered for a given user and another user based on relevant dependent interest affinities. The resulting interest indices may be filtered based on a variety of criteria. At least some information about the related other users may be displayed to the given user based on the rank ordering, as possible mirrored friends.
    Type: Application
    Filed: January 12, 2010
    Publication date: July 14, 2011
    Applicant: Yahoo! Inc.
    Inventors: Prasannakumar Jobigenahally Malleshaiah, Supreeth Hosur Nagesh Rao
  • Publication number: 20110167067
    Abstract: Methods for classification of application commands are described. An application command associated with a classification parameter is generated by an application in a first device. A classification value is determined for the application command based on the classification parameter. The classification value is associated with the application command and is sent to a second device for processing.
    Type: Application
    Filed: August 6, 2010
    Publication date: July 7, 2011
    Inventor: Kishore Kumar MUPPIRALA
  • Publication number: 20110167063
    Abstract: Web pages are efficiently categorized in a data processor without analyzing the content of the web pages. According to at least one embodiment, data is maintained that represents sample URLs grouped into a plurality of clusters. The sample URLs of a cluster are used to produce a URL regular expression pattern (“URL-regex”) that differentiates the sample URLs of the cluster from the sample URLs of other clusters and that covers at least a specified percentage of the sample URLs in the cluster. The process of producing a URL-regex is repeated for each of the clusters producing a URL-regex for each cluster. Web pages are then categorized into one of the clusters by determining which of the URL-regex patterns produced for the clusters match URLs that refer to the web pages. Thus, a web page may be categorized based on a URL that refers to the web page without having to obtain and analyze the content of the web page.
    Type: Application
    Filed: January 5, 2010
    Publication date: July 7, 2011
    Inventors: Ashwin Tengli, Rajeev Rastogi, Jeyashankher Ramamirtham, Srinivasan H. Sengamedu, Sandeepkumar Bhuramal Satpal
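    The categorization step in the abstract above (matching a page's URL against one pattern per cluster, without fetching the page) can be sketched roughly as follows. This is a minimal illustration, not the applicants' method: the cluster names, the two regular expressions, and the helper names `categorize_url` and `coverage` are all hypothetical, and the step that learns a pattern from sample URLs is not shown.

```python
import re

# Hypothetical per-cluster URL patterns; the abstract gives no concrete ones.
CLUSTER_PATTERNS = {
    "product": re.compile(r"^https?://[^/]+/products/\d+$"),
    "news": re.compile(r"^https?://[^/]+/news/\d{4}/\d{2}/[\w-]+$"),
}


def categorize_url(url, patterns=CLUSTER_PATTERNS):
    """Return the first cluster whose pattern matches the URL, else None."""
    for cluster, pattern in patterns.items():
        if pattern.match(url):
            return cluster
    return None


def coverage(pattern, sample_urls):
    """Fraction of a cluster's sample URLs matched by the pattern."""
    if not sample_urls:
        return 0.0
    matched = sum(1 for url in sample_urls if pattern.match(url))
    return matched / len(sample_urls)
```

    A helper such as `coverage` could be used to check that a candidate pattern covers at least the specified percentage of a cluster's sample URLs before it is accepted.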
  • Publication number: 20110167064
    Abstract: A system and associated method for evaluating cross-domain clusterability upon a target domain and a source domain. The cross-domain clusterability is calculated as a linear combination of a target clusterability and a source-target pair matchability, by use of a trade-off parameter that determines relative contribution of the target clusterability and the source-target pair matchability. The target clusterability quantifies how clusterable the target domain is. The source-target pair matchability is calculated as an average of a target-side matchability and a source-side matchability, which quantifies how well target centroids of the target domain are aligned with the source centroids and how well source centroids of the source domain are aligned with the target centroids, respectively.
    Type: Application
    Filed: January 6, 2010
    Publication date: July 7, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: JEFFREY M. ACHTERMANN, INDRAJIT BHATTACHARYA, KEVIN W. ENGLISH, Jr., SHANTANU R. GODBOLE, SACHINDRA JOSHI, ASHWIN SRINIVASAN, ASHISH VERMA
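    The score described in the abstract above can be illustrated with a short sketch. The abstract says only that the score is a linear combination governed by a trade-off parameter; the convex form used here (weight `tradeoff` on target clusterability, `1 - tradeoff` on matchability) is an assumption, as are the function and parameter names. How the component scores themselves are computed from centroids is not shown.

```python
def pair_matchability(target_side, source_side):
    """Average of the target-side and source-side matchability scores."""
    return (target_side + source_side) / 2.0


def cross_domain_clusterability(target_clusterability, target_side,
                                source_side, tradeoff=0.5):
    """Combine target clusterability and pair matchability.

    Assumed form: tradeoff * clusterability + (1 - tradeoff) * matchability.
    """
    if not 0.0 <= tradeoff <= 1.0:
        raise ValueError("tradeoff must lie in [0, 1]")
    matchability = pair_matchability(target_side, source_side)
    return tradeoff * target_clusterability + (1.0 - tradeoff) * matchability
```

    Under this assumed form, `tradeoff=1.0` ignores the source domain entirely and `tradeoff=0.0` scores only how well the two sets of centroids align.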
  • Publication number: 20110167068
    Abstract: A method of managing information comprises generating a categorized document base. Generating the document base comprises providing a pre-existing classification of things other than documents, providing a source collection of documents, and automatically assessing the documents using Information Retrieval techniques to assign at least some of the documents to one or more taxa of the classification. For each taxon in the classification one or more numerical scores are assigned, based at least in part on a composition, makeup or constitution of the documents assigned to the taxon of the categorized document base.
    Type: Application
    Filed: March 18, 2011
    Publication date: July 7, 2011
    Applicant: Sizatola, LLC
    Inventors: Steven O. Kimbrough, Ian C. MacMillan, John P. Ranieri, James D. Thompson
  • Publication number: 20110161323
    Abstract: There is provided an information processing device including: a storage unit that stores information element data defining a plurality of information elements; an information acquisition unit that acquires an information set having a referential relationship with each other from an information source accessible through a communication network; a classification unit that classifies information included in the information set acquired by the information acquisition unit into information of a first class corresponding to an information element defined by the information element data and information of a second class other than the information of the first class; and an evaluation unit that evaluates a degree of association between information elements respectively corresponding to two or more information of the first class based on a referential relationship between the information of the first class and the information of the second class in the information set.
    Type: Application
    Filed: December 3, 2010
    Publication date: June 30, 2011
    Inventor: Takehiro HAGIWARA
  • Publication number: 20110153611
    Abstract: Disclosed are systems and methods for extracting data from a report document for analysis. A report document is retrieved from a group of report documents. Data present in the report document may include fields and associated metadata. The fields and the associated metadata present in the report are categorized as corresponding data source parameters. The data source parameters are rendered on a user interface, to receive a user definition of a scope for analyzing the data present in the report document. The data source parameters associated with the user definition are qualified to render result objects for each associated data source parameter. Based upon the result objects, a query is generated to define the data for analyzing the report document. Based upon a user input to the query, the data present in the report document associated with the query is extracted to generate multi-dimensional result data.
    Type: Application
    Filed: December 22, 2009
    Publication date: June 23, 2011
    Inventors: ANIL BABU ANKISETTIPALLI, Prashanth Pai, Amrita Prabhakaran, Sumitesh Ranjan Srivastava
  • Publication number: 20110153613
    Abstract: Disclosed herein is an information search method using automatic category generation, which automatically generates a category related to a keyword entered by a user, constructs an ontology for processing menu information based on the searched results, and provides a customized menu suitable for the use of the keyword. More particularly, the present invention relates to an apparatus and method for automatically generating keyword-related categories, which automatically classifies menu information related to the results of search of information using a keyword entered by a user on the basis of locational/societal relations.
    Type: Application
    Filed: December 20, 2010
    Publication date: June 23, 2011
    Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventor: Hoon-Ki LEE
  • Publication number: 20110153593
    Abstract: An optimizer uses comprehensive reasoning regarding partitioning, sorting, and grouping properties for query optimization. When optimizing an input query expression, logical exploration generates alternative logical expressions. Physical optimization explores physical operator alternatives for logical operators. Required partitioning, sorting, and grouping properties of inputs to physical operators are determined. Additionally, delivered partitioning, sorting, and grouping properties of outputs from physical operators are determined. In some embodiments, enforcer rules are employed to modify structural property requirements to introduce alternatives for consideration. Property matching identifies valid execution plans in which the delivered partitioning, sorting, and grouping properties satisfy corresponding required partitioning, sorting, and grouping properties. An execution plan having the lowest cost is selected as the optimized execution plan.
    Type: Application
    Filed: December 17, 2009
    Publication date: June 23, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: JINGREN ZHOU, PER-AKE LARSON, RONNIE IRA CHAIKEN
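    The final two steps of the abstract above (property matching, then picking the cheapest valid plan) can be sketched as below. This is a deliberately simplified illustration: properties are modeled as plain string labels and "satisfies" is modeled as set containment, whereas a real optimizer reasons about partitioning, sorting, and grouping in a richer way. The `Plan` class, `pick_plan`, and the property labels are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plan:
    name: str
    delivered: frozenset  # properties the plan's output provides
    cost: float


def pick_plan(plans, required):
    """Return the cheapest plan whose delivered properties cover `required`.

    Returns None when no candidate plan is valid.
    """
    valid = [plan for plan in plans if required <= plan.delivered]
    if not valid:
        return None
    return min(valid, key=lambda plan: plan.cost)


# Hypothetical candidate plans for one operator.
CANDIDATES = [
    Plan("hash_agg", frozenset({"partitioned:a"}), 10.0),
    Plan("sort_agg", frozenset({"partitioned:a", "sorted:a"}), 14.0),
    Plan("scan", frozenset(), 3.0),
]
```

    For example, `pick_plan(CANDIDATES, frozenset({"partitioned:a"}))` selects `hash_agg`: the cheaper `scan` is rejected because it does not deliver the required partitioning.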
  • Publication number: 20110153606
    Abstract: Provided are an apparatus and a method which can be easily implemented with flexibility enabling distributing all metadata of trees and files in an asymmetric distributed file system. The apparatus includes: a metadata storage unit storing metadata corresponding to a part of partitions of a virtual metadata address space storing metadata for directories and/or files for each of the partitions; and a metadata storage management unit controlling the metadata so that the metadata are stored in the metadata storage unit and managing a master map including information on the part of the partitions. Since all directories and files can be distributed to a plurality of metadata servers without a limitation, it is possible to prevent a load from being concentrated on a predetermined metadata server. Metadata roles of the metadata servers are very simply readjusted and as a result, the load can be easily distributed in a partition level.
    Type: Application
    Filed: December 16, 2010
    Publication date: June 23, 2011
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Hong-Yeon KIM, Young-Kyun Kim, Han Namgoong
  • Publication number: 20110153612
    Abstract: A method for providing a customized application on different requesting device types of a user is provided. The method enables, firstly, receiving requests made by the user using the different device types over multiple communication channels. Secondly, the method enables assigning a rank to the user based on requests received and one or more rules. Further the method enables determining personalization information based on the ranking. Finally, the method enables rendering a customized application on the different device types based on the personalization information and configuration information stored in a central data repository. The configuration information is related to the application and features thereof based on the user's subscription profile.
    Type: Application
    Filed: May 24, 2010
    Publication date: June 23, 2011
    Applicant: INFOSYS TECHNOLOGIES LIMITED
    Inventors: Sanjoy PAUL, Manish JAIN
  • Publication number: 20110153515
    Abstract: A distributed capture system is disclosed which enables digital content to be captured in various formats and interfaced with a plurality of enterprise content management (ECM) platforms, which enables the distributed capture system to be seamlessly integrated with a customer's legacy ECM system. The system is configured to receive various financial records that are normally created at a financial institution, such as loan applications and customer signature cards, in various formats, such as Microsoft Word, PDF, and Printer Control Language (PCL). The financial records are directed to a virtual printer and converted to a TIFF format. The print stream associated with the text embedded in the TIFF image of the financial record is captured and compared with a document classification template. The document classification template allows the document to be automatically classified and indexed. Documents are then sent to the ECM interface.
    Type: Application
    Filed: December 17, 2009
    Publication date: June 23, 2011
    Inventors: Joseph J. Pitzo, Kurt L. Brzezinski, Thomas A. Oberholtzer
  • Publication number: 20110145242
    Abstract: Embodiments of the present invention include methods, systems and computer program products. The embodiments of the present invention intelligently distribute data files within a database based upon predetermined conditions. In one embodiment, the present invention includes a computer-implemented method including classifying a data set in response to metadata corresponding to one or more data files located on a single database; and creating a data file topology comprising a data file identifier, a data file location and a data file type. The method may also include receiving a predetermined rule directory comprising a set of features corresponding to one or more file systems; and in response to the data file topology and the predetermined rule directory, reorganizing the data set such that at least a portion of the data set is moved to one of a set of new file systems having a predetermined optimized characteristic.
    Type: Application
    Filed: December 16, 2009
    Publication date: June 16, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gaurav Mehrotra, Abhinay R. Nagpal, Yan Wang Stein
  • Publication number: 20110145239
    Abstract: A computer information database system manages computer profile data for a set of computers. A profile group managing server coupled to the database manages the database such that there is a multiple node tree structure of groups for the set of computers in which each node is a group level and a top level is a root, based upon primary grouping criteria that correspond to selected computer profile data. Included in a database mapping table are fields that correspond to ranges of values for computer profile data of interest corresponding to primary grouping criteria including ranges that extend between a selected high and a selected low value. The ranges for any or all of the grouping criteria may be altered. The data in the database can be manipulated to produce summaries and reports of attributes of the computers in a given group.
    Type: Application
    Filed: December 15, 2009
    Publication date: June 16, 2011
    Inventors: Gary H. Newman, James W. Franklin
  • Publication number: 20110145252
    Abstract: A system receives context data associated with a context and a user. The system then associates the context data to a user identifier and retrieves data associated with the context. The system then filters the data according to the context data to create result data. In another embodiment, the system also receives context data from a plurality of users, where the context data pertains to one or more attributes of a context. The system then uses the context data to rank the one or more attributes of the context, creating ranked data, and generates a user interface based on the ranked data. In yet another embodiment, the system communicates context data associated with a context and a user to a server, and receives result data created by the server filtering data retrieved based on the context data. The system then generates a user interface based on the result data.
    Type: Application
    Filed: December 13, 2010
    Publication date: June 16, 2011
    Inventors: Neelakantan Sundaresan, Alec Reitter
  • Publication number: 20110145237
    Abstract: System, method and computer program product for adjusting a representation of a merchandise hierarchy associated with an entity such as a retailer or wholesaler of products. Product correlation information discovered in that entity's customers' shopping records is obtained and incorporated into an existing merchandise hierarchy with a constraint on the consistency with the existing hierarchy.
    Type: Application
    Filed: December 11, 2009
    Publication date: June 16, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Xin Xin Bai, Jin Dong, Ta-Hsin Li, Zhong Lin Lin, Hai Rong Lv, Wen Jun Yin