Clustering Or Classification (epo) Patents (Class 707/E17.089)
  • Publication number: 20120047139
    Abstract: In an aspect there is provided a method. The method may include providing, at a repository, storage for a plurality of tenants, providing a plurality of layers, and providing a plurality of versions; and separating, based on the plurality of layers and the plurality of versions, data for each of the plurality of tenants, wherein during runtime one of the plurality of tenants corresponds to the plurality of layers and one of the plurality of versions. Related apparatus, systems, techniques and articles are also described.
    Type: Application
    Filed: August 23, 2010
    Publication date: February 23, 2012
    Inventors: Joachim Fitzer, Juergen Specht, Ye Jin, Juergen Sattler
  • Publication number: 20120041955
    Abstract: A method for document management includes automatically extracting respective features from each of a set of documents. The features are processed in a computer so as to generate respective vectors for the documents, each vector including elements having respective values that represent properties of a respective document. A similarity between the documents is assessed by computing a measure of distance between the respective vectors. The documents are automatically clustered responsively to the similarity so as to identify a cluster of the documents belonging to a common document type. Similar methods may be used in supervised categorization, wherein documents are compared and categorized based on a training set that is prepared for each document type.
    Type: Application
    Filed: August 10, 2010
    Publication date: February 16, 2012
    Applicant: NOGACOM LTD.
    Inventors: Yizhar Regev, Gilad Weiss
  • Publication number: 20120041952
    Abstract: In the display of a search result using a virtual space, the display and operation in the space is associated with an addition and change of search instruction to facilitate to grasp the content and to operate a search. A plurality of contents each of which having a keyword is arranged in the virtual space and displayed on a display screen. When one key is set, a content to be a search target is selected from among the plurality of contents based on a position at which the key is set, a search is performed on the selected search target with the set key, and an arrangement of the contents is changed and displayed based on a relationship with the key and the plurality of contents.
    Type: Application
    Filed: August 12, 2011
    Publication date: February 16, 2012
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Hidetomo Sohma
  • Publication number: 20120036085
    Abstract: A system is configured to determine aggregated social media variables that may be used for modeling. The system includes an information identifier module determining keywords and phrases. The system also includes an aggregator receiving information collected from social media applications using the keywords and phrases and determining values for social media variables from the collected information. The aggregator aggregates the social media variables based on the values and weightings of the social media variables.
    Type: Application
    Filed: August 5, 2010
    Publication date: February 9, 2012
    Applicant: ACCENTURE GLOBAL SERVICES GMBH
    Inventors: Janmesh Dev SRIVASTAVA, Andris UMBLIJS, Chao WANG, Stephen Denis KIRKBY, Peter Charles KELLETT, Thoai Duy Khang TRAN, Dharmendra K. DUBEY
  • Publication number: 20120030203
    Abstract: A method for establishing multiple look-up tables and a data acquisition method using multiple look-up tables are provided. In the present method, a plurality of input data is classified into a plurality of groups, and a plurality of input data and a plurality of output data corresponding to the input data are respectively provided to the groups to establish a plurality of corresponding look-up tables. At least one bit is selectively removed from the input data in each of the look-up tables corresponding to at least one of the groups, and the result input data and the corresponding output data are recorded in the look-up table corresponding to the group.
    Type: Application
    Filed: October 12, 2010
    Publication date: February 2, 2012
    Applicant: Industrial Technology Research Institute
    Inventors: Hsiao-Lan Chiang, Pang-An Ting, Kuo-Tai Chiu
  • Publication number: 20120030204
    Abstract: Provided are categorizing unit (2) which analyzes whether or not a character string contained in text data matches any of member character strings of a category provided as parameters, and acquires a category character string provided as a parameter for a name scale of the matched member character string; and an integration unit (3) which integrates the acquired category character string to a relational database. A category character string converted from the text data can be arbitrarily changed by a parameter, and a member character string used for assessing the match at the time of converting the text data to the category character string can be also changed arbitrarily by a parameter, and thereby, the text data can be flexibly categorized interactively.
    Type: Application
    Filed: February 17, 2010
    Publication date: February 2, 2012
    Applicant: Digital Works, Inc.
    Inventors: Koji Ito, Koji Kawahara
  • Publication number: 20120030477
    Abstract: A system in accordance with exemplary embodiments may provide a scalable segment-based data de-duplication for incremental backups. In the system, a master device on a secondary-storage node side may receive at least incremental changes, fingerprints, mapping entities, and distribute de-duplication functionality to at least a slave device, and performs data de-duplication on said plurality of segments via a way to cluster a plurality of fingerprints in a data locality unit called container for the incremental changes, varied sampling rates of a plurality of segments by having a fixed sampling rate for stable segments and by assigning a lower sampling rate for a plurality of unstable target files of de-duplication, and a per-segment summary structure to avoid unnecessary I/Os involved in de-duplication.
    Type: Application
    Filed: July 29, 2010
    Publication date: February 2, 2012
    Inventors: Maohua Lu, Tzi-Cker Chiueh
  • Publication number: 20120023102
    Abstract: Methods of and systems for dynamically rearranging search results into hierarchically organized concept clusters are provided. A method of searching for and presenting content items as an arrangement of conceptual clusters to facilitate further search and navigation on a display-constrained device includes providing a set of content items and receiving incremental input to incrementally identify search terms for content items. Content items are selected and grouped into sets based on how the incremental input matches various metadata associated with the content items. The selected content items are grouped into explicit conceptual clusters and user-implied conceptual clusters based on metadata in common to the selected content items. The clustered content items are presented according to the conceptual clusters into which they are grouped.
    Type: Application
    Filed: August 30, 2011
    Publication date: January 26, 2012
    Applicant: Veveo, Inc.
    Inventors: Sashikumar Venkataraman, Pankaj Garg, Pranav Rajanala
  • Publication number: 20120023104
    Abstract: The construction of a semantically associated computer search database is disclosed. A text catalog containing the text particulars and contents of a plurality of electronic documents, a term association database which joins text search terms with their semantic meanings or definitions, and a content association database which contains records associating text search terms and their contextual definitions in respect of individual electronic documents are used to in conjunction provide a text index in which it is possible to search based upon the actual definitions or meanings of certain text or terms within the electronic documents, reducing or refining query search result sets. A method of operating a search engine with the index in question, as well as other e-commerce methods employing the index as a refining tool are also disclosed.
    Type: Application
    Filed: September 8, 2009
    Publication date: January 26, 2012
    Inventors: Bruce Johnson, Chad Frederick, Scott Montgomerie, Allen Yee
  • Patent number: 8103650
    Abstract: In particular embodiments, advertising groups may be formed based on keywords. Access a plurality of keywords, each keyword comprising one or more words and describing one or more characteristics or features of particular advertising subject matter. Annotate each of the keywords with one or more labels selected from a plurality of labels, each label comprising one or more words and describing one or more aspects of or one or more categories or concepts represented by the keyword. Group the keywords into one or more keyword groups based on similarities between the labels of the keywords, each keyword group comprising one or more of the keywords. Form one or more advertising groups from the keyword groups, comprising from each of the keyword groups, form one or more of the advertising groups, each of the advertising groups comprising one or more of the keywords in the keyword group.
    Type: Grant
    Filed: June 29, 2009
    Date of Patent: January 24, 2012
    Assignee: Adchemy, Inc.
    Inventors: David Robert Kauchak, Monifa Akanke Porter, Shreehari Manikarnika
  • Publication number: 20120016878
    Abstract: Methods and systems for clustering information items using nonnegative tensor factorization are disclosed. A processing device receives one or more class labels, each corresponding to an information item, a selection for a nonnegative tensor factorization model having an associated objective function and one or more parameter values, each corresponding to one of one or more penalty constraints. The processing device determines a constrained objective function based on the objective function associated with the selected nonnegative tensor factorization model, the one or more parameter values and the one or more class labels and including the one or more penalty constraints. The processing device determines clusters for the plurality of information items by evaluating the constrained objective function. Pairwise constraints may be received in addition to or instead of the class labels.
    Type: Application
    Filed: July 15, 2010
    Publication date: January 19, 2012
    Applicant: XEROX CORPORATION
    Inventor: Wei Peng
  • Publication number: 20120016877
    Abstract: One particular embodiment clusters a plurality of documents using one or more clustering algorithms to obtain one or more first sets of clusters, wherein: each first set of clusters results from clustering the documents using one of the clustering algorithms; and with respect to each first set of clusters, each of the documents belongs to one of the clusters from the first set of clusters; accesses a search query; identifies a search result in response to the search query, wherein the search result comprises two or more of the documents; and clusters the search result to obtain a second set of clusters, wherein each document of the search result belongs to one of the clusters from the second set of clusters.
    Type: Application
    Filed: July 14, 2010
    Publication date: January 19, 2012
    Applicant: YAHOO! INC.
    Inventors: Srinivas Vadrevu, Yi Chang, Zhaohui Zheng, Bo Long
  • Publication number: 20120016886
    Abstract: To determine a seasonal effect in temporal data, for each of a plurality of candidate seasons, the following is performed. An interval is defined for the corresponding candidate season. The interval is divided (108) into plural buckets containing respective sets of the temporal data, and statistical measures for corresponding buckets are computed. The computed statistical measures are used to compute (112) a score for the corresponding candidate season. Scores computed for corresponding candidate seasons are used for identifying which of the candidate seasons represents the seasonal effect of the temporal data.
    Type: Application
    Filed: July 14, 2009
    Publication date: January 19, 2012
    Inventors: Ira Cohen, Kobi Eisenberg, Liat Ashkenazi, Avishai Krepel, Marina Lyan, Michael Rozman
  • Publication number: 20120011124
    Abstract: According to one embodiment, a latent semantic mapping (LSM) space is generated from a collection of a plurality of documents, where the LSM space includes a plurality of document vectors, each representing one of the documents in the collection. For each of the document vectors considered as a centroid document vector, a group of document vectors is identified in the LSM space that are within a predetermined hypersphere diameter from the centroid document vector. As a result, multiple groups of document vectors are formed. The predetermined hypersphere diameter represents a predetermined closeness measure among the document vectors in the LSM space. Thereafter, a group from the plurality of groups is designated as a cluster of document vectors, where the designated group contains a maximum number of document vectors among the plurality of groups.
    Type: Application
    Filed: July 7, 2010
    Publication date: January 12, 2012
    Applicant: APPLE INC.
    Inventor: Jerome R. Bellegarda
  • Publication number: 20120011119
    Abstract: A database for object recognition is generated by performing at least one of intra-object pruning and inter-object pruning, as well as keypoint clustering and selection. Intra-object pruning removes similar and redundant keypoints within an object and different views of the same object, and may be used to generate and associate a significance value, such as a weight, with respect to remaining keypoint descriptors. Inter-object pruning retains the most informative set of descriptors across different objects, by characterizing the discriminability of the keypoint descriptors for all of the objects and removing keypoint descriptors with a discriminability that is less than a threshold. Additionally, a mobile platform may download a geographically relevant portion of the database and perform object recognition by extracting features from the query image and using determined confidence levels for each query feature during outlier removal.
    Type: Application
    Filed: July 8, 2010
    Publication date: January 12, 2012
    Applicant: QUALCOMM Incorporated
    Inventors: Pawan K. Baheti, Ashwin Swaminathan, Serafin Diaz Spindola, Xia Ning
  • Publication number: 20120011121
    Abstract: Data analysis is disclosed, including: receiving data to be analyzed, wherein the data includes one or more data identifiers (IDs) and one or more preset key-value pairs, wherein each preset key-value pair includes a preset key and a preset value; acquiring data to be analyzed based at least in part on the data IDs; segmenting the acquired data into one or more data elements; classifying the one or more data elements based at least in part on one preset key of the one or more preset key-value pairs; and analyzing the classified one or more data elements based at least in part on one preset value of the one or more preset key-value pairs.
    Type: Application
    Filed: July 5, 2011
    Publication date: January 12, 2012
    Inventor: Wenchu Cen
  • Publication number: 20120005212
    Abstract: A system and method for updating a system that controls files executed on a workstation. The workstation includes a workstation management module configured to detect the launch of an application. A workstation application server receives data associated with the application from the workstation. This data can include a hash value. The application server module can determine one or more categories to associate with the application by referencing an application inventory database or requesting the category from an application database factory. The application database factory can receive applications from multiple application server modules. The application database factory determines whether the application was previously categorized by the application database factory and provides the category to the application server module. Once the application server module has the category, it forwards a hash/policy table to the workstation management module.
    Type: Application
    Filed: September 12, 2011
    Publication date: January 5, 2012
    Applicant: Websense, Inc.
    Inventors: Harold M. Kester, Ronald B. Hegli, John Ross Dimm, Mark Richard Anderson
  • Publication number: 20120005207
    Abstract: A method includes generating, a plurality of sets of pairs of records from a set of records, for each attribute-position pair in the set of records. Each attribute-position pair being indicative of a position of an attribute in a record. Further, the method includes forming, electronically, a plurality of groups, each group comprising two attribute-position pairs having different attributes. Further, the method also includes determining, electronically for each group, number of pairs of records that are common in the two attribute-position pairs of that group. Furthermore, the method includes extracting results based on a first group of the plurality of groups if the number of pairs of records that are common in the two attribute-position pairs of the first group is greater than a second threshold, is highest among the plurality of groups, and no group having three or more attribute-position pairs with different attributes is possible.
    Type: Application
    Filed: July 1, 2010
    Publication date: January 5, 2012
    Applicant: Yahoo! Inc.
    Inventors: Pankaj Gulhane, Srinivasan Hanumantha Rao Sengamedu, Ashwin Tengli, Rajeev Rastogi
  • Publication number: 20120005021
    Abstract: Techniques are described herein for selecting an advertisement using user search history segmentation. Instances of queries that are provided by a user are referred to collectively as the search history of the user. The search history is segmented into sessions that correspond to respective tasks of the user. Each of the sessions includes a respective subset of the query instances that are included in the user's search history. A weight is assigned to each session based on attribute(s) of the respective session. A session that includes a first subset of the query instances is selected based on the weight that is assigned to that session. Features are extracted from query instances that are included in the first subset. Weights are assigned to the extracted features based on attribute(s) of the first subset. An advertisement is selected to be provided to the user based on the extracted features and corresponding weights.
    Type: Application
    Filed: July 2, 2010
    Publication date: January 5, 2012
    Applicant: YAHOO! INC.
    Inventor: Rajiv Ratnam
  • Publication number: 20120004893
    Abstract: The present invention relates to a method for the automatic identification of at least one informative data filter from a data set that can be used to identify at least one relevant data subset against a target feature for subsequent hypothesis generation, model building and model testing. The present invention describes methods, and an initial implementation, for efficiently linking relevant data both within and across multiple domains and identifying informative statistical relationships across this data that can be integrated into agent-based models. The relationships, encoded by the agents, can then drive emergent behavior across the global system that is described in the integrated data environment.
    Type: Application
    Filed: September 10, 2009
    Publication date: January 5, 2012
    Applicant: QUANTUM LEAP RESEARCH, INC.
    Inventors: Akhileswar Ganesh VAIDYANATHAN, Stephen D. PRIOR, Jijun Wang, Bin Yu
  • Publication number: 20110319053
    Abstract: In one embodiment, data relating to usage patterns of the user is stored, wherein the data includes information as to items which were used and the context in which they were used. The data is then clustered into clusters of data points. Then a centroid is determined for each of the clusters. A cluster similar to a current context of the user is selected by comparing a data point representing the current context of the user to one or more of the centroids. For each of one or more items, a threshold based on values for a plurality of the centroids with respect to the corresponding item, wherein a threshold is used to compare with centroid value of an item in a selected cluster to determine whether to recommend the item.
    Type: Application
    Filed: September 6, 2011
    Publication date: December 29, 2011
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Sangoh Jeong, Doreen Cheng, Yu Song, Swaroop Kalasapur
  • Publication number: 20110320449
    Abstract: A method includes receiving a list of sequential telephone numbers, and iteratively: identifying a first number and a last number of the list, selecting a group size, creating a group of sequential telephone numbers from the list of sequential telephone numbers based on the selected group size, and removing the created group from the list of sequential telephone numbers. Creating the group may include determining whether a first number in the list of sequential telephone numbers ends with a predetermined digit, assigning the first number as a start number of the group, and identifying an end number in the group based on the start number and the selected group size.
    Type: Application
    Filed: June 29, 2010
    Publication date: December 29, 2011
    Applicant: Verizon Patent and Licensing Inc.
    Inventor: Bhaskar R Gudlavenkatasiva
  • Publication number: 20110320452
    Abstract: An information estimation apparatus 1 for estimating a transmission point in time of a document whose transmission point in time is not specified in a document set to be analyzed includes a structure analysis unit 3 configured to specify, from the document set, a document having a document structure in which a link relationship with another document is indicated in a table-of-contents manner, and extract the link relationship of documents included in the document set from the document structure of the specified document, a grouping unit 4 configured to set a group of documents using the specified document and the extracted link relationship, and an estimation unit 5 configured to estimate, based on the set group and a transmission point in time of a document that is included in the group and whose transmission point in time is specified, a transmission point in time of a document that is included in the group and whose transmission point in time is not specified.
    Type: Application
    Filed: December 21, 2009
    Publication date: December 29, 2011
    Applicant: Nec Corpration
    Inventors: Takao Kawai, Satoshi Nakazawa, Shinichi Ando
  • Publication number: 20110320446
    Abstract: This patent application relates to interval-based information retrieval (IR) search techniques for efficiently and correctly answering keyword search queries. In some embodiments, a range of information-containing blocks for a search query can be identified. Each of these blocks, and thus the range, can include document identifiers that identify individual corresponding documents that contain a term found in the search query. From the range, a subrange(s) having a smaller number of blocks than the range can be selected. This can be accomplished without decompressing the blocks by partitioning the range into intervals and evaluating the intervals. The smaller number of blocks in the subranges(s) can then be decompressed and processed to identify a doc ID(s) and thus document(s) that satisfies the query.
    Type: Application
    Filed: June 25, 2010
    Publication date: December 29, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Kaushik Chakrabarti, Surajit Chaudhuri, Venkatesh Ganti
  • Publication number: 20110320455
    Abstract: Systems, methods and articles of manufacture are disclosed for extracting a feature of an entity. A data record that includes the feature may be received. The feature may be characterized. Data related to the feature may be retrieved from a data source. The feature may be recharacterized based on the retrieved data. The data source may be updated with an association between the recharacterized feature and the retrieved data.
    Type: Application
    Filed: June 28, 2010
    Publication date: December 29, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: JEFFREY J. JONAS
  • Publication number: 20110320445
    Abstract: Inferences acquired by applying clustering analysis cannot be reliably assessed before data-originated errors are quantified, an exacting task that is often not performed. This invention presents a clustering method suited for this purpose. Designed for systems with normally distributed error, a common trait to many data systems, and built on a framework of agglomerative hierarchical clustering, this invention treats each observation as a Gaussian distribution function, uses an exact mathematical relation to track error, and gives results from which quantitative statistics are easily extracted.
    Type: Application
    Filed: June 23, 2010
    Publication date: December 29, 2011
    Applicant: NATIONAL CENTRAL UNIVERSITY
    Inventors: Hong-Chien Lee, Chih-Hao Chen
  • Publication number: 20110320453
    Abstract: A system and method for grouping similar documents is provided. Frequencies of occurrences are determined for terms and noun phrases within a set of documents. A subset of the documents is selected by removing those documents having terms and noun phrases that fall outside a bounded range of upper and lower conditions for frequency of occurrence. Each of the documents in the subset is mapped to a cluster of documents based on a similarity of the documents to the cluster documents.
    Type: Application
    Filed: September 2, 2011
    Publication date: December 29, 2011
    Inventors: Dan Gallivan, Kenji Kawai
  • Publication number: 20110313984
    Abstract: A method and system for performing a graph search, includes constructing an abstract representation of the graph using state-space abstraction. The abstract representation of the graph includes one or more abstract nodes having duplicate detection scopes and one or more abstract edges having operator groups. The duplicate detection scopes of the abstract nodes are partitioned into smaller duplicate detection scopes using edge partitioning. The abstract edges include the smaller duplicate detection scopes. Nodes in the current search layer are expanded using the operator groups of outgoing abstract edges of the abstract nodes the nodes map to. The operator groups associated with abstract edges having disjoint duplicate detection scopes are used to expand the nodes in parallel. Once all the operator groups in the current search layer have been used for node expansion the method progresses to the next search layer.
    Type: Application
    Filed: July 23, 2010
    Publication date: December 22, 2011
    Applicant: Palo Alto Research Center Incorporated
    Inventors: Rong Zhou, Tim Schmidt, Minh Binh Do, Serdar Uckun
  • Publication number: 20110313844
    Abstract: A “Real-Time-Ready Analyzer” combines a data stream management system (DSMS) with a map-reduce (M-R) framework to construct a streaming map-reduce framework that is suitable for real-time Behavioral Targeting (BT) (or other temporal queries). The Real-Time-Ready Analyzer allows users to write “dual-intent” temporal analysis queries for BT. These queries are succinct and easy to express, scale well on large-scale offline data, and can also work over real-time data. Further, the Real-Time-Ready Analyzer uses the aforementioned streaming map-reduce framework to provide dual-intent algorithms for end-to-end BT phases. Experiments using real data from an advertisement system show that the Real-Time-Ready Analyzer is very efficient and incurs orders-of-magnitude lower development effort than conventional systems.
    Type: Application
    Filed: June 17, 2010
    Publication date: December 22, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Badrish Chandramouli, Jonathan Goldstein, Xin Jin, Balan Sethu Raman, Songyun Duan
  • Publication number: 20110314020
    Abstract: Disclosed are methods for developing authorized chemical palettes for formulating products with reduced adverse environmental and/or health concerns, and advising the public to a greater extent regarding the ingredients of products formulated using these palettes. Also disclosed are computer systems to implement such methods.
    Type: Application
    Filed: June 7, 2011
    Publication date: December 22, 2011
    Inventors: Christopher Beard, Kylee E. Eblin, Fred J. Joachim, H. Fisk Johnson, III, Frank A. Jones, Jennifer C. Perkins, Robert J. Seifert, Kelly M. Semrau, George C. Daher, Usha Vedula
  • Publication number: 20110314024
    Abstract: One or more techniques and/or systems are disclosed that provide for document retrieval where a user can identify key attributes of potential target documents that are desirable (e.g., have a particular semantic content for the user). Further, relevant documents that comprise the desired semantic content can be retrieved. Additionally, the user can provide feedback on the retrieved documents, for example, based on key semantic concepts found in the documents, and the input can be used to update the classification. For example, this process can be iterated to improve the retrieval and precision of documents found through machine learning techniques.
    Type: Application
    Filed: June 18, 2010
    Publication date: December 22, 2011
    Applicant: Microsoft Corporation
    Inventors: Eric I-Chao Chang, Michael Todd Gillam, Yan Xu, Craig Feied, Jonathan Handler
  • Publication number: 20110314017
    Abstract: Techniques to manage social connections are described. An apparatus may comprise a processor communicatively coupled to a memory. The memory may be arranged to store a social analysis component that when executed by the processor is operative to receive a list of members in a social network, receive at least one relationship indicator derived from multiple member attributes of a member, and generate a social identifier based on the relationship indicator, the social identifier representing a social connection type for a social connection or potential social connection between two or more members of the list of members in the social network. Other embodiments are described and claimed.
    Type: Application
    Filed: June 18, 2010
    Publication date: December 22, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Eran Yariv, Kfir Karmon
  • Publication number: 20110307486
    Abstract: The illustrative embodiments provide a method, computer program product, and apparatus for managing collectively sensitive data. Collectively sensitive data is divided into a first partition for reassembly data, a second partition of the collectively sensitive data, and a third partition of the collectively sensitive data. Each of the second partition and the third partition are collectively nonsensitive in isolation. The first partition is stored in a translation table in a secure database. The translation table is configured for use in assembling collectively sensitive data from the second partition and the third partition. The second partition of the collectively sensitive data is stored in a first database associated with a first cloud computing environment. The third partition of collectively sensitive data is stored in a second database associated with a second cloud computing environment.
    Type: Application
    Filed: June 15, 2010
    Publication date: December 15, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Frank C. Breslau, Benjamin Briggs, Ori Pomerantz
  • Publication number: 20110307356
    Abstract: Systems and methods to extract and utilize textual semantics are described. The system receives item information that describes an item for sale on a network-based marketplace and analyzes the item information to generate application information that identifies a plurality of applications. The plurality of applications includes a first application that further includes the item as a first component of the first application. The system stores a listing in a database that includes the application information and the item information and publishes the listing on the network-based marketplace to sell the item via the network-based marketplace.
    Type: Application
    Filed: November 3, 2010
    Publication date: December 15, 2011
    Applicant: eBay Inc.
    Inventors: Guenter Wiesinger, Shilpa Jois Muguru Ramachandra
  • Publication number: 20110307485
    Abstract: Keyword extraction technique embodiments are presented which extract topically related keywords from a set of topically related documents. In one general embodiment, this keyword extraction involves first accessing a set of topically related documents. A number of candidate keywords are then identified from the set of related documents. A weighted keyword candidate-document matrix is formed using these candidate keywords, and it is partitioned into multiple groups of keyword candidates. Dense clusters of keyword candidates whose density exceeds a prescribed density threshold are then identified in each of the groups of keyword candidates. Finally, the keyword candidates associated with each dense cluster are designated as topically related keywords.
    Type: Application
    Filed: June 10, 2010
    Publication date: December 15, 2011
    Applicant: Microsoft Corporation
    Inventors: Uppinakuduru Raghavendra Udupa, Santosh Raju Vysyaraju
  • Publication number: 20110307450
    Abstract: A system includes a database instance, a data storage system, and a database management system. The data storage system stores a first data partition and a second data partition. The first data partition includes a first plurality of database pages storing tenant-independent data and a first converter associated with the first partition. The second data partition includes a second plurality of database pages storing tenant-independent data and a second converter associated with the second partition.
    Type: Application
    Filed: June 14, 2010
    Publication date: December 15, 2011
    Inventors: Uwe H.O. Hahn, Torsten Strahl, Henrik Hempelmann
  • Publication number: 20110307513
    Abstract: A system and method for updating a filtering system which controls access to a website/page between a local area network (LAN) and an Internet. The LAN includes an Internet gateway system coupled to a workstation and configured to receive a URL request. The system controls access to the website/page associated with the URL based on one or more categories that are associated with the URL. The Internet gateway system can determine the category that is associated with the URL by referencing a master database or requesting the category from a database factory. The database factory can receive URLs from multiple Internet gateway systems. The database factory determines whether the identifier was previously categorized by the database factory and provides the category to the Internet gateway system. Once the Internet gateway system has the category, it applies rules associated with the category and user to filter access to the requested website/page.
    Type: Application
    Filed: August 26, 2011
    Publication date: December 15, 2011
    Applicant: Websense, Inc.
    Inventors: Harold Kester, Dan Ruskin, Chris Lee, Mark Anderson
  • Publication number: 20110307488
    Abstract: A behavior index indicates a development of occurrence of events included in log data by a connection of a plurality of nodes, and includes a branching node connecting to two or more subsequent nodes and an integrating node connected to two or more preceding nodes. When a regular expression converting part inputs a search condition indicating an event appearance order from a client, a search automaton retaining part generates a state transition table in accordance with the search condition. While a node type determining part determines the node types of the respective nodes of the behavior index, a state evaluating part analyzes the respective nodes to determine the states of the respective nodes, and determines whether or not a state transition pattern matches the state transition table. Regarding a branching node, the respective nodes of each of branching routes branching from the branching node are analyzed.
    Type: Application
    Filed: February 27, 2009
    Publication date: December 15, 2011
    Applicant: MITSUBISHI ELECTRIC CORPORATION
    Inventors: Norio Hirai, Mitsunori Kori
  • Publication number: 20110302164
    Abstract: In a system and method for order-independent stream query processing, one or more input streams of data are received, and the one or more input streams are analyzed to determine data which is older than an already emitted progress indicator. The data which is older than the already emitted progress indicator is partitioned into one or more partitions, and each of the one or more partitions are independently processed using out-of-order processing techniques. A query is received, rewritten and decomposed into one or more sub-queries that produce partial results for each of the one or more partitions, where each of the one or more sub-queries correspond to a partition. A view is also produced that consolidates the partial results for each partition. The partial results are consolidated at a consolidation time specified by the query to produce final results, and the final results are provided.
    Type: Application
    Filed: May 5, 2011
    Publication date: December 8, 2011
    Inventors: Saileshwar Krishnamurthy, Michael R. Lewis, Wing Li, Neil Thombre, Jeffrey A. Davis, Daniel Robert Farina
  • Publication number: 20110302167
    Abstract: A computer-implemented method according to one embodiment includes, for each of a plurality of accessories: determining a compatibility of an accessory; determining a type of the accessory; and determining features of the accessory. The accessories are associated into logical groups based on the compatibility, type and features thereof. A computer-implemented method according to one embodiment includes obtaining information about accessories; parsing out individual offers corresponding to the accessories; extracting meaningful phrases from the offers; classifying new offers based on the phrases; and outputting a result of the classification. Additional systems, methods and computer program products are also presented.
    Type: Application
    Filed: June 2, 2011
    Publication date: December 8, 2011
    Applicant: RETREVO INC.
    Inventors: Aditya Vailaya, Jiang Wu, Jeffrey Ronne
  • Publication number: 20110302169
    Abstract: In a user-activity identification technique, a user's actions are monitored while the user is using a computer. While these user actions are associated with user activities, the user activities are initially unspecified, so the tracked user actions constitute unsupervised data. Then, the tracked user actions are aggregated into subsets (for example, using clustering analysis), and user-activity classifications for the subsets (such as activity labels) are provided by the user, so the subsets constitute supervised data. Subsequently, when additional user actions (which are associated with one or more initially unspecified current user activities) are tracked, they can be associated with one or more of the classified subsets. For example, information about the additional user actions can be mapped in real time (or near real time) to one or more of the subsets using a supervised learning technique. In this way, the one or more current user activities can be identified.
    Type: Application
    Filed: June 3, 2010
    Publication date: December 8, 2011
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Oliver Brdiczka, Shane P. Ahern, Victoria M.E. Bellotti
  • Publication number: 20110302165
    Abstract: A content recommendation device deciding content to be recommended to a user among a plurality of content items includes: a clustering section creating a cluster set including clusters by clustering use statuses of content of users on the basis of a predetermined index; an effectiveness determining section determining effectiveness of the clustering by evaluating a correlation between the content and the cluster in the cluster set; a popular content deciding section selecting the cluster to which the user who becomes a recommendation partner belongs from the cluster set and deciding the popularity degree of each content item in accordance with the use status of each content item by the users in the cluster; and a recommended content deciding section evaluating the popularity degree of each content item in the cluster to which the user who becomes the recommendation partner belongs by taking into account and estimating the effectiveness of the cluster set therein and deciding the relatively popular content ite
    Type: Application
    Filed: May 26, 2011
    Publication date: December 8, 2011
    Inventors: Kazuo ISHII, Nozomu Ikeda, Yoshikazu Takahashi
  • Publication number: 20110302171
    Abstract: A method and system for discovering a control event from electronically published documents and received data streams is provided, in which a computer control program identifies electronically published documents and data stored in a plurality of network servers which potentially contain control events relevant to the control of goods and/or services, the control events identified by reference to user interest identifiers. Identified material is analyzed by a classification program to determine whether control events are present. A control event classification is assigned to documents and received data determined to contain at least one discovered control event, the assigned control event classification and information identifying the associated document and data is stored in a classification database, and a report of discovery of documents and data containing control events is be provided to a user. The report may includes a link to the control event classification and/or its associated document or data.
    Type: Application
    Filed: June 14, 2011
    Publication date: December 8, 2011
    Applicant: Decernis, LLC
    Inventors: Patrick Blackmon Waldo, Andrew B. Waldo
  • Publication number: 20110295977
    Abstract: Provided is a base sequence cluster generating system, method, and program product for performing cluster generation. The base sequence cluster generating system utilizes a computer system having a database containing base sequences receives a query sequence over. The computer uses spliced base sequences as a query sequence to generate a first cluster including base sequences that are likely to constitute a spliced pair with the query sequence. Spliced alignment is applied to the generated first cluster to generate a second cluster including spliced pairs. The generated second cluster is returned to the requester.
    Type: Application
    Filed: August 12, 2011
    Publication date: December 1, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Tetsuo Shibuya
  • Publication number: 20110295773
    Abstract: Assisted clustering systems and methods are described herein that provide a user interface by which a user can easily create clusters and selectively associate data items with such clusters. Information regarding data item-cluster associations made by the user is processed by a recommendation engine to learn a clustering model. The clustering model is then be used to generate recommendations for the user regarding which unassociated data items should be associated with which clusters. In certain embodiments, after the user has determined that the clustering model is performing at a satisfactory level based on the quality of the recommendations, the user can cause the system to automatically cluster a large quantity of remaining unassociated data items. In accordance with further embodiments, a user can specify arbitrary data item types for clustering as well as features of such data types that should be considered in generating the clustering model.
    Type: Application
    Filed: May 26, 2010
    Publication date: December 1, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Danyel A. Fisher, Sumit Basu, Steven Drucker, Gonzalo A. Ramos, Hao Lu
  • Publication number: 20110295846
    Abstract: Systems and methods are provided for selecting contexts for new invitational content and invitational content for new contexts. In particular, a performance history of delivered invitational content in known contexts is combined with similarity measures for the delivered invitational content, with respect to a new invitational content, to generate a list of potential contexts for the new invitational content. Similarly, a performance history of in known contexts with delivered invitational content can combined with similarity measures for known contexts, with respect to a new context, to generate a list of potential content for the new context. Further, a combination of these methods can be used to pair new invitational content with new contexts.
    Type: Application
    Filed: May 28, 2010
    Publication date: December 1, 2011
    Applicant: Apple Inc.
    Inventors: Eswar Priyadarshan, Dan Marius Grigorovici, Ravikiran Chittari, Prasenjit Mukherjee, Omar Abdala
  • Publication number: 20110295857
    Abstract: A system and method for aligning multilingual content and indexing multilingual documents, to a computer readable data storage medium having stored thereon computer code means for indexing multilingual documents, to a system for presenting multilingual content. The method for aligning multilingual content and indexing multilingual documents comprises the steps of generating multiple bilingual terminology databases, wherein each bilingual terminology database associates respective terms in a pivot language with one or more terms in another language; and combining the multiple bilingual terminology databases to form a multilingual terminology database, wherein the multilingual terminology database associates terms in different languages via the pivot language terms.
    Type: Application
    Filed: June 20, 2008
    Publication date: December 1, 2011
    Inventors: Ai Ti Aw, Min Zhang, Lian Hau Lee, Thuy Vu, Fon Lin Lai
  • Publication number: 20110295854
    Abstract: A method and system for automatically refining information extraction (IE) rules. A provenance graph for IE rules on a set of test documents is determined. The provenance graph indicates a sequence of evaluations of the IE rules that generates an output of each operator of the IE rules. Based on the provenance graph, high-level rule changes (HLCs) of the IE rules are determined. Low-level rule changes (LLCs) of the IE rules are determined to specify how to implement the HLCs. Each LLC specifies changing an operator's structure or inserting a new operator in between two operators. Based on how the LLCs affect the IE rules and previously received correct results of applying the rules on the test documents, a ranked list of the LLCs is determined. The IE rules are refined based on the ranked list.
    Type: Application
    Filed: May 27, 2010
    Publication date: December 1, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Laura Chiticariu, Bin Liu, Frederick R. Reiss
  • Publication number: 20110289049
    Abstract: Metadata may be stored in, and retrieved from, a scalable, fault-tolerant metadata service. In one example, metadata is divided into partitions, and each partition is served by one or more nodes. For each partition, a first one of the nodes may handle read and write requests, and the other nodes may handle read requests in the event that the first node is down or is experiencing high load. When a request is made with respect to metadata, a metadata server may identify a node, in the partition to which the metadata is assigned, to which the request is to be made. The entity that is making the request then contacts that node, and requests the read or write on the metadata. In a partition, metadata may be replicated between the first node and the other nodes using a log-based replication protocol.
    Type: Application
    Filed: May 19, 2010
    Publication date: November 24, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Nanshan Zeng, Meng Ye, Honghua Feng, Junwei Xu, Yu-chao Cao, Yingjun Yu, Lin Song
  • Publication number: 20110289090
    Abstract: This specification describes technologies relating to content presentation. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving data from a plurality of users within a geographic region associated with a location; analyzing the received data to derive a category for the location; annotating the location with the category; and using the category to boost candidate content items for delivery to users in the location in response to future content item requests.
    Type: Application
    Filed: May 19, 2011
    Publication date: November 24, 2011
    Applicant: GOOGLE INC.
    Inventors: Jian Zhou, Anshul Kothari, Subhadip Sarkar, Theodore B. Van Belle, Surojit Chatterjee