Clustering Or Classification (epo) Patents (Class 707/E17.089)

E Subclasses

Into predefined classes (epo) (Class 707/E17.09)

Including class or cluster creation or modification (epo) (Class 707/E17.091)

Including cluster or class visualization or browsing (epo) (Class 707/E17.092)

PLAYLIST-BASED DETECTION OF SIMILAR DIGITAL WORKS AND WORK CREATORS

Publication number: 20120303713

Abstract: A computer-implemented service recommends digital works (and/or creators of works) to a user based on works currently or previously played or downloaded by the user on a player device or based on playlists stored on the player device. The works may be, for example, music files, video files, electronic books, or other digital content for playing by users. A user may thus obtain personalized recommendations that are based on works obtained from sources (web sites, physical CDs, etc.) that are independent of the recommendations system. In one embodiment, the service identifies pairs of works (and/or work creators) that are similar to each other by virtue of the relatively high frequency with which they co-occur on playlists or within play histories of users. The resulting mappings are used to provide recommendations to users.

Type: Application

Filed: August 2, 2012

Publication date: November 29, 2012

Inventors: Andrew V. Harbick, Ryan J. Snodgrass, Joel R. Spiegel
SYSTEM FOR INCREMENTALLY CLUSTERING NEWS STORIES

Publication number: 20120303623

Abstract: Disclosed are methods and apparatus for clustering news stories, which are to be presented over a computer network. In general, an incremental clustering system is configured to update a current set of news clusters with newly arrived news articles without having to recompute the clusters for the entire corpus, as well as form new clusters for recently generated news topics. In one embodiment, a plurality of news articles are initially obtained via the computer network, and the news articles are clustered into a plurality of initial clusters. For only news articles, including any unclustered news articles, that are less than a predetermined age limit, it is determined in an incremental clustering process whether to form one or more new clusters or assign to the initial clusters.

Type: Application

Filed: May 26, 2011

Publication date: November 29, 2012

Applicant: YAHOO! INC.

Inventors: Kunal Punera, Suju Rajan, Choon Hui Teo, Srinivas Vadrevu
MANAGING HETEROGENEOUS DATA

Publication number: 20120303625

Abstract: Methods of managing data. A master catalog of properties may be generated. An object model catalog containing a plurality of object models may be generated, each object model including at least one property listed in the master catalog. A data set including a plurality of data objects may be defined, each data object an instantiation of a respective object model from the object model catalog. Data may be collected in accordance with the data set definition. Data collection may be performed, at least in part, by an automatic data collection system.

Type: Application

Filed: January 18, 2012

Publication date: November 29, 2012

Applicant: Ixia

Inventors: Florin Ciodaru, Flaviu Matan
INFORMATION PROCESSING APPARATUS AND METHOD OF PROCESSING DATA FOR AN INFORMATION PROCESSING APPARATUS

Publication number: 20120296911

Abstract: According to one embodiment, an information processing apparatus includes a keyword display module, a selection module and an information-retrieval module. The keyword display module is configured to display at least two keywords. The selection module is configured to select a keyword from the at least two keywords displayed by the keyword display module. The information-retrieval module is configured to retrieve information by using the keyword selected by the selection module. The keyword display module is further configured to display one or more keywords belonging to a preset category, as at least one of the at least two keywords.

Type: Application

Filed: March 26, 2012

Publication date: November 22, 2012

Applicant: KABUSHIKI KAISHA TOSHIBA

Inventors: Sumi Omura, Kentaro Nagahama, Kensuke Horiuchi, Takayuki Iida
APPAPATUS AND METHOD FOR GENERATING A COLLECTION PROFILE AND FOR COMMUNICATING BASED ON THE COLLECTION PROFILE

Publication number: 20120296908

Abstract: An apparatus for generating a collection profile of a collection of different media data items has a feature extractor for extracting at least two different features describing a content of a media data item for a plurality of media data items of the collection, and a profile creator for creating the collection profile by combining the extracted features or weighted extracted features so that the collection profile represents a quantitative fingerprint of a content of the media data collection. This collection profile or music DNA can be used for transmitting information, which is based on this collection profile, to the entity itself or to a remote entity.

Type: Application

Filed: August 8, 2012

Publication date: November 22, 2012

Applicant: BACH TECHNOLOGY AS

Inventors: Dagfinn BACH, Sebastian SCHMIDT
SYSTEM AND METHOD FOR IDENTIFYING THE PRINCIPAL DOCUMENTS IN A DOCUMENT SET

Publication number: 20120296902

Abstract: A method (200) of identifying a principal document in a document set is provided. An exemplary method includes obtaining a document set comprising a plurality of documents (202) and grouping the plurality of documents into a plurality of clusters based, at least in part, on a textual similarity between each of the plurality of documents (204). The method also includes obtaining one or more descriptive terms corresponding to the plurality of documents, wherein the descriptive terms are terms within the plurality of documents that have been identified as being useful for discriminating between the clusters (206). The method also includes, for each cluster, identifying a subset of descriptive terms based, at least in part, on a prevalence of the descriptive terms within the documents of the cluster (208) and identifying the principal documents in the cluster based, at least in part, on a prevalence of the subset of descriptive terms within each of the documents in the cluster (210).

Type: Application

Filed: February 13, 2010

Publication date: November 22, 2012

Inventors: Vinay Deolalikar, Hernan Laffitte
DENSITY-BASED DATA CLUSTERING METHOD

Publication number: 20120296905

Abstract: A density-based data clustering method executed by a computer system is disclosed. The method includes a setup step, a clustering step, an expansion step and a termination step. The setup step sets a radius and a threshold value. The clustering step defines a single cluster on a plurality of data points of a data set, and provides and adds a plurality of first boundary marks to a seed list as seeds. The expansion step expands the cluster from each seed of the seed list, and provides and adds at least one second boundary mark to the seed list as seeds. The termination step determines whether each of the data points is clustered, wherein the clustering step is re-performed if the determination is negative.

Type: Application

Filed: May 2, 2012

Publication date: November 22, 2012

Inventors: Cheng-Fa TSAI, Tang-Wei Huang
CLUSTERING CUSTOMERS

Publication number: 20120290580

Abstract: A computer implemented method for clustering customers includes receiving a source set of customer records, wherein each customer record represents one customer, and each customer record includes at least one data attribute, and each data attribute has an attribute value; pre-processing the source set of customer records to generate a pre-processed set of customer records; executing a clustering algorithm on the pre-processed set of customer records to group the pre-processed set of customer records into clusters of a pre-defined number. The pre-processing comprises: determining the type of a customer in the source set of customer records; using a type attribute value to indicate the type of the customer in its customer record; normalizing data attribute values and type attribute values; weighting to the data attribute values and the type attribute values respectively to obtain weighted attribute values of the data attribute and weighted attribute values of the type attribute.

Type: Application

Filed: July 30, 2012

Publication date: November 15, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Heng Cao, Jin Dong, Jacqueline Giang Huong Morris, Ming Xie, Wen Jun Yin, Bin Zhang
SERVER COMPUTER, COMPUTER SYSTEM, AND FILE MANAGEMENT METHOD

Publication number: 20120290579

Abstract: A server computer which determines the configuration of a file for configuring a plurality of virtual computers respectively is configured to comprise: an OS/AP file evaluation criteria table which stores evaluation criteria for judging whether to split and manage a file required for the configuration of the virtual computers; a user data evaluation criteria TBL; and a verification and splitting unit which judges whether the file conforms to the evaluation criteria, and determines a part of a file judged to conform to the evaluation criteria as a first file stored as an entity and determines the remaining part of the file as a second file for referencing an entity of a predetermined destination storage.

Type: Application

Filed: July 25, 2012

Publication date: November 15, 2012

Applicant: Hitachi, Ltd.

Inventor: Toyohiro NOMOTO
DYNAMICALLY DETERMINING THE RELATEDNESS OF WEB OBJECTS

Publication number: 20120284266

Abstract: A first cluster of web objects is identified from a click-through data structure. The click-through data structure can organize web objects into clusters based on query results of web objects selected by a user. Also, a second cluster of web objects can be identified from a metadata data structure. The metadata data structure can organize web objects into clusters based on metadata associated with the web objects. An output set of web objects is selected, in real time, from the identifier clusters.

Type: Application

Filed: May 4, 2011

Publication date: November 8, 2012

Applicant: Yahoo! Inc.

Inventors: Prateeksha Uday CHANDRAGHATGI, Subhajit Sanyal, Sriram J. Sathish
REQUIREMENT EXTRACTION SYSTEM, REQUIREMENT EXTRACTION METHOD AND REQUIREMENT EXTRACTION PROGRAM

Publication number: 20120284271

Abstract: Included are a candidate extraction unit 61 that extracts, from a document formed by a group of character strings, a longest consecutive partial string common to one character string and the other character string as a candidate for an important word related to the one character string; a candidate integration unit 62 that selects a longest partial string of the candidate for the important word related to the one character string and extracted by the candidate extraction unit 61; and a group integration unit 63 that integrates a group of the longest partial string of each character string selected by the candidate integration unit 62, this group not forming a subset of a group of the other character string, thereby forming a group of the important word.

Type: Application

Filed: December 13, 2010

Publication date: November 8, 2012

Applicant: NEC CORPORATION

Inventor: Yukiko Kuroiwa
HIERARCHICAL ANT CLUSTERING AND FORAGING

Publication number: 20120284269

Abstract: A clustering method yields a searchable hierarchy to speed retrieval, and can function dynamically with a changing document population. Nodes of the hierarchy climb up and down the emerging hierarchy based on locally sensed information. Like previous ant clustering algorithms, the inventive process is dynamic, decentralized, and anytime. Unlike them, it yields a hierarchical structure. For simplicity, and reflecting our initial application in the domain of textual information, the items being clustered are documents, but the principles may be applied to any collection of data items.

Type: Application

Filed: February 7, 2012

Publication date: November 8, 2012

Inventors: Henry Van Dyke Parunak, Theodore C. Belding, Sven Brueckner, Paul Chiusano, Peter Weinstein
Career Criminal and Habitual Violator (CCHV) Intelligence Tool

Publication number: 20120278325

Abstract: A computer implemented method, apparatus, and computer usable program product for ranking and categorizing criminal offenders in a jurisdiction. In one embodiment, external data associated with the offenders is processed in a set of data models to generate a ranking index of criminal offenders. The external data comprises offender data elements related to prior arrests. The computer software and web application enables officers, detectives, and supervisors to research the offenders in their jurisdiction. They can intentionally track and monitor the status of the offenders that are not currently incarcerated. They can deliberately increase lawful contacts with these high-rate and treacherous offenders.

Type: Application

Filed: April 27, 2012

Publication date: November 1, 2012

Inventors: Daniel Scott Jenkins, Brandon Matthew Rana
Joining Tables in a Mapreduce Procedure

Publication number: 20120278323

Abstract: Systems and techniques by which tables can be joined in a mapreduce procedure. In some implementations, when a large table of business data (e.g., having one billion transaction records or more) is to be joined with a large table of customer data (e.g., having hundreds of millions of customer records), then these two tables can be organized before the mapreduce procedure to speed up the table join. For example, the business data and the customer data can both be hash partitioned, based on the same key, into shards of business data and shards of customer data, respectively. The number of shards in these two groups has an integer relationship with each other: for example such that there are two business data shards for every customer data shard, or vice versa.

Type: Application

Filed: August 15, 2011

Publication date: November 1, 2012

Inventors: Biswapesh Chattopadhyay, Liang Lin
DATA CLASSIFICATION METHODS AND APPARATUS FOR USE WITH DATA FUSION

Publication number: 20120278328

Abstract: Methods and apparatus for classifying data for use in data fusion processes are disclosed. An example method of classifying data selectively groups nodes of a classification tree so that each node is assigned to only one of a plurality of groups and so that at least one of the groups includes at least two of the nodes. Data is classified based on the classification tree and the selective grouping of the nodes, and the results displayed.

Type: Application

Filed: June 29, 2012

Publication date: November 1, 2012

Inventors: Jerome Samson, Francis Gavin McMillan
MULTILINGUAL SEARCH FOR TRANSLITERATED CONTENT

Publication number: 20120278302

Abstract: The multilingual search for transliterated content technique described herein enables a user to submit a search query in both a native script and its foreign script (e.g., Roman script) transliteration and return relevant results in both the scripts while taking care of the spelling variations in transliterated forms. The technique crawls the World Wide Web for data in both the native script and foreign script transliterated forms of the data. It uses a transliteration engine to generate native script equivalents of the foreign script transliterated data and disambiguates the data in native script (whenever possible). The unique native script word forms are then used to jointly index the data in both the scripts. If the query is in native script, it is directly searched for in the index, otherwise the transliterated query is first converted into native script form(s) and then searched in the indexed database to retrieve and rank results in both the scripts.

Type: Application

Filed: April 29, 2011

Publication date: November 1, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Monojit Choudhury, Kalika Bali, Kanika Gupta, Narendranath Datha
DATA COLLECTING METHOD FOR DETECTION AND ON-TIME WARNING SYSTEM OF INDUSTRIAL PROCESS

Publication number: 20120271826

Abstract: A data collection method for a process margin monitoring system of industrial equipment includes preparing a learning data set based on data determined to be normal in an operation history of the industrial equipment so that the learning data set is sorted for each operation mode, in a case in which the industrial equipment includes equipment units performing the same functions, receiving data for each of the equipment units and processing the received data as data for the equipment units, sorting and grouping associated ones of the data in the learning data set, and sampling the collected data to reduce the amount of data.

Type: Application

Filed: April 18, 2011

Publication date: October 25, 2012

Applicant: BNF Technology Inc.

Inventor: Su Young Kim
DATA PROCESSING DEVICE

Publication number: 20120271830

Abstract: Disclosed is a data processing device including a data acquisition unit for acquiring data from a medium, an integrated database for integrating the data acquired by the data acquisition unit thereinto, a data analysis determination unit for analyzing the data integrated into the integrated database, a display control unit for creating an image of a side view of and an image of a bottom view of a solid expressing the data integrated into the integrated database on the basis of an analysis result acquired by the data analysis determination unit and the data, and a display unit for displaying the images created by the display control unit.

Type: Application

Filed: December 16, 2009

Publication date: October 25, 2012

Inventors: Tomohiro Shiino, Yoko Sano, Tsuyoshi Sempuku, Hideto Miyazaki, Kuniyo Ieda, Takashi Sadahiro, Shoji Tanaka
METHODS AND SYSTEMS FOR IMPLEMENTING APPROXIMATE STRING MATCHING WITHIN A DATABASE

Publication number: 20120271827

Abstract: A computer-based method for character string matching of a candidate character string with a plurality of character string records stored in a database is described. The method includes performing a clustering operation on at least a portion of the plurality of character string records, the clustering operation generating a plurality of clusters, each cluster comprising a plurality of character strings from the plurality of character string records, the plurality of character strings in each cluster are determined to be similar with respect to each other based on at least one characteristic of the plurality of character strings. The method also includes generating a set of reference character strings that are selected from the plurality of character strings in each cluster, generating an n-gram representation for one of the reference character strings in the set of reference character strings, and generating an n-gram representation for the candidate character string.

Type: Application

Filed: June 26, 2012

Publication date: October 25, 2012

Inventor: Christopher J. Merz
Localized Translation of Keywords

Publication number: 20120271828

Abstract: In one implementation, a method includes receiving a request for translation of one or more first keywords from a source language to a target language; and translating, using a machine translation process, the first keywords from the source language into a plurality of second keywords in the target language. The method can also include determining, by a computer system, frequencies with which each of the second keywords occur in a corpus associated with the target language. The method can further include selecting, by the computer system, a subset of the second keywords to use in the target language based on the determined frequencies of occurrence.

Type: Application

Filed: April 21, 2011

Publication date: October 25, 2012

Applicant: Google Inc.

Inventor: Mandayam Thondanur Raghunath
Random Walk on Query Pattern Graph for Query Task Classification

Publication number: 20120265760

Abstract: A classification process may reduce the computational resources and time required to collect and classify training data utilized to enable a user to effectively access online information. According to some implementations, training data is established by defining one or more seed queries and query patterns. A bi-partite graph may be constructed using the seed query and query pattern information. A traversal of the bi-partite graph can be performed to expand the training data to encompass sufficient data to perform classification of the present search task.

Type: Application

Filed: April 18, 2011

Publication date: October 18, 2012

Applicant: Microsoft Corporation

Inventors: Jun Yan, Ning Liu, Lei Ji, Zheng Chen
FILE PROCESSING OF NATIVE FILE FORMATS

Publication number: 20120265759

Abstract: A computer-implemented method for processing electronic documents having different native file formats is provided. The method is implemented in a computer system comprising one or more processors configured to execute one or more computer program modules. The method includes (a) receiving electronic documents in different native file formats; (b) identifying the native file format for each received electronic document; (c) retrieving a stored configuration data for the identified native file format, the configuration data includes a mapping of regions of interest in the electronic document with the identified native file format and their associations with output fields; and (d) processing the electronic documents using their retrieved configuration data to extract data from the electronic documents.

Type: Application

Filed: April 15, 2011

Publication date: October 18, 2012

Applicant: XEROX CORPORATION

Inventors: John E. BERGERON, John Allott Moore
SYSTEM AND METHOD FOR GATHERING, FILTERING, AND DISPLAYING CONTENT CAPTURED AT AN EVENT

Publication number: 20120265758

Abstract: A method for generating event compilations during an event comprising: providing an event client designated to display event content captured at the event; identifying an event moderator to review event content captured by attendees of the event; receiving event content captured by one or more event attendees; transmitting the event content to the event moderator for review; receiving a response from the event moderator, the response indicating whether the event content is allowed or blocked; and displaying the event content from the event client at the event if the response from the moderator indicates that the event content is allowed.

Type: Application

Filed: April 14, 2011

Publication date: October 18, 2012

Inventors: Edward Han, Kelly Berger
METHOD FOR RECOMMENDING BEST INFORMATION IN REAL TIME BY APPROPRIATELY OBTAINING GIST OF WEB PAGE AND USER'S PREFERENCE

Publication number: 20120259859

Abstract: Disclosed is an information recommendation method for providing a construction method for a classified word database capable of rapidly accommodating changes in associations between words. The disclosed information recommendation method basically is based on the finding that, by analyzing occurrence frequency information of an arbitrary word in a Web site having an arbitrary classified word in real-time and obtaining the real-time degree of similarity between the classified word and the arbitrary word, it is possible to construct a database that is capable of being sensitive in responding to changes in associations between words.

Type: Application

Filed: December 24, 2010

Publication date: October 11, 2012

Applicant: TAGGY, Inc.

Inventor: Yutaka Ishigami
PARTITIONING A DIRECTORY WHILE ACCESSING THE DIRECTORY

Publication number: 20120259823

Abstract: A process for reading entries in a directory is initiated. A first index is maintained to indicate how far the read has progressed in the directory. If, during execution of the process, the directory is partitioned into subdirectories, then a second index is maintained for each of the subdirectories to indicate how far the read has progressed in each of the subdirectories. A third index that indicates how far the read has progressed in the partitioned directory is also maintained.

Type: Application

Filed: April 8, 2011

Publication date: October 11, 2012

Applicant: SYMANTEC CORPORATION

Inventors: Anindya Banerjee, Maneesh Pusalkar
AGGREGATION OF CONVERSION PATHS UTILIZING USER INTERACTION GROUPING

Publication number: 20120259851

Abstract: Methods, systems, and apparatuses, including computer programs encoded on computer-readable media, for aggregating conversion paths utilizing user interaction grouping. In one aspect, information regarding a plurality of conversion paths is received. Each conversion path includes one or more user interactions that include a plurality of dimensional data. A sorted list of grouping definitions that includes one or more group rules is received and the conversion paths are converted into group paths based upon the one or more group rules. Each group path includes one or more group elements corresponding to each user interaction of a corresponding conversion path. The plurality of group paths are aggregated based upon the number and order of group elements within each group path. Information regarding the aggregated group paths can then be provided, for example, through a report.

Type: Application

Filed: April 11, 2011

Publication date: October 11, 2012

Inventors: Ying Hua JIA, Sissie Ling-Ie Hsiao, Theodore Nicholas Choc, Hongxu Cai, Nicholas Seckar
METHOD AND APPARATUS PROVIDING OMNIBUS VIEW OF ONLINE AND OFFLINE CONTENT OF VARIOUS FILE TYPES AND SOURCES

Publication number: 20120259858

Abstract: An online service provider (OSP) operates online data centers to store members' data objects relating to various online member services of the OSP. An aggregated catalog lists members' data objects residing in the online data centers and also those residing in member computers' local storage. An aggregator monitors contents of the online storage facilities to detect new storage of prescribed types of data objects owned by the members, and also communicates with member computers to identify prescribed types of data objects newly stored in the respective local storage. The aggregator updates the aggregated catalog to list the newly stored data objects. Responsive to a request by a member, a finder searches the aggregated catalog and utilizes results of the search to provide, for display at the requesting member's computer, a consolidated listing of online data objects and locally stored data objects owned by the requesting member.

Type: Application

Filed: June 1, 2012

Publication date: October 11, 2012

Inventors: Grainville R. Fairchild, Bill Frischling, John Keeling, Dan Pacheco, Myron Rosmarin
Real Time Association of Related Breaking News Stories Across Different Content Providers

Publication number: 20120259853

Abstract: Methods and systems for relating breaking news stories across content providers include receiving a breaking news headline for a breaking news from a content provider. The breaking news headline is tokenized in substantial real time by identifying a plurality of headline tokens. A plurality of news stories is received from a plurality of content providers. Each of the plurality of news stories is tokenized to identify a plurality of story tokens. The plurality of headline tokens and story tokens are analyzed to determine if one or more of the news stories are related to the breaking news headline. Based on the analysis, one or more of the news stories are mapped to the breaking news headline. The mapping enables presentation of the one or more news stories from one or more of the content providers while rendering the breaking news headline.

Type: Application

Filed: April 11, 2011

Publication date: October 11, 2012

Applicant: Yahoo!, Inc.

Inventors: Abhijit Khasnis, Subramanian Narayanan
CLUSTERING CUSTOMERS

Publication number: 20120254179

Abstract: A computer implemented method for clustering customers includes receiving a source set of customer records, wherein each customer record represents one customer, and each customer record includes at least one data attribute, and each data attribute has an attribute value; pre-processing the source set of customer records to generate a pre-processed set of customer records; executing a clustering algorithm on the pre-processed set of customer records to group the pre-processed set of customer records into clusters of a pre-defined number. The pre-processing comprises: determining the type of a customer in the source set of customer records; using a type attribute value to indicate the type of the customer in its customer record; normalizing data attribute values and type attribute values; weighting to the data attribute values and the type attribute values respectively to obtain weighted attribute values of the data attribute and weighted attribute values of the tune attribute.

Type: Application

Filed: March 28, 2012

Publication date: October 4, 2012

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Heng Cao, Jin Dong, Jacqueline Giang Huong Morris, Ming Xie, Wen Jun Yin, Bin Zhang
SYSTEM AND METHOD FOR STREAK DISCOVERY AND PREDICTION

Publication number: 20120254176

Abstract: The disclosed embodiment relates to identifying performance regions in time-series data. An exemplary method comprises identifying, with a computing device, one or more streaks in the time-series data based on at least one streak parameter, ranking, with a computing device, the identified streaks based on at least one characteristic of the identified streaks, and predicting, with a computing device, a future occurrence of at least one streak based on the characteristics of the identified streaks. The steps of identifying and ranking may be carried out using at least one of a linear graph method, a statistical based approach, a curve-line intersection method, and a hypothesis-based method, and the step of predicting the future occurrence of at least one streak may comprise predicting at least one of how long a current streak will continue, when a current streak will end, and when a new streak will begin.

Type: Application

Filed: May 19, 2011

Publication date: October 4, 2012

Applicant: INFOSYS TECHNOLOGIES LIMITED

Inventors: Satyabrata Pradhan, Radha Krishna Pisipati, Syed Mohammed
CLUSTER-BASED IDENTIFICATION OF NEWS STORIES

Publication number: 20120254188

Abstract: Methods, systems, and techniques for cluster-based content recommendation are described. Some embodiments provide a content recommendation system (“CRS”) configured to recommend news stories about events or occurrences. In some embodiments, a news story about an event includes multiple related content items that each include an account of the event and that each reference one or more entities or categories that are represented by the CRS. In one embodiment, the CRS identifies news stories by generating clusters of related content items. Then, in response to a received query that indicates a keyterm, entity, or category, the CRS determines and provides indications of one or more news stories that are relevant to the received query. In some embodiments, at least some of these techniques are employed to implement a news story recommendation facility in an online news service.

Type: Application

Filed: March 29, 2012

Publication date: October 4, 2012

Inventors: Krzysztof Koperski, Satish Bhatti, Jisheng Liang, Adrian Klein
GROUPING DATA

Publication number: 20120254173

Abstract: A computer-executed method for grouping data comprising, with a processor, generating a number of sorted runs from an unsorted input, storing the sorted runs in temporary storage, placing pages of data from the sorted runs, one at a time, into a portion of a buffer allocated to receive that page, and from the allocated portion of the buffer, merging each page of data, one at a time, into a number of aggregated records, the number of aggregated records also being stored in the buffer.

Type: Application

Filed: March 31, 2011

Publication date: October 4, 2012

Inventor: Goetz Graefe
METHOD OF CATEGORIZING AN INVENTION WITHIN AN INVENTION LANDSCAPE

Publication number: 20120254187

Abstract: A computer-based method is described for categorizing inventions within the context of an invention landscape. A set of key phases and/or semantic properties is employed based upon the likelihood that the description of the invention to be categorized will share these key phrases and/or semantic properties with the descriptions of similar inventions from within the invention landscape. The results are ranked in such a way as to enable a tentative assignment of the target invention to one or more categories, and to optionally estimate the value of the invention.

Type: Application

Filed: June 28, 2011

Publication date: October 4, 2012

Inventors: N. Edward White, G. Edward Powell, JR.
Exploitation of Correlation Between Original and Desired Data Sequences During Run Generation

Publication number: 20120254171

Abstract: A computer executed method of exploiting correlations between original and desired data sequences during run generation comprises, with a processor, adding a number of data values from a data source to a first memory device, the first memory device defining a workspace, determining whether the data values within the workspace should be output in ascending or descending order for a number of runs, and writing a number of the data values as a run to a second memory device in the determined order.

Type: Application

Filed: March 30, 2011

Publication date: October 4, 2012

Inventors: Goetz Graefe, Harumi Kuno
SYSTEM AND METHOD FOR PROCESSING AN SQL QUERY MADE AGAINST A RELATIONAL DATABASE

Publication number: 20120254178

Abstract: A system and method for processing an SQL query made against a relational database is disclosed. In one example embodiment, the method includes receiving the SQL query made against the relational database. Further, the received SQL query is parsed to obtain each operator and associated one or more operands and sequence of execution of the operators. Furthermore, a closure-friendly operator is dynamically generated for each operator and the associated one or more operands in the received SQL query. In addition, the dynamically generated closure-friendly operators are executed based on the obtained sequence of execution of the operators.

Type: Application

Filed: February 16, 2012

Publication date: October 4, 2012

Inventor: Sudarshan Srinivasa Murthy
APPARATUS, METHOD AND COMPUTER-READABLE STORAGE MEDIUMS FOR CLUSTERING AND RANKING A LIST OF MULTIMEDIA OBJECTS

Publication number: 20120254172

Abstract: An apparatus is provided that includes a processor and a memory storing executable instructions that in response to execution by the processor cause the apparatus to at least perform a number of functions. The apparatus is caused to direct presentation of a list for a plurality of patients and that is clustered by patient. The apparatus is caused to apply a keyword filter to identify a subset of the patient exams that match the keyword filter, and rank the respective exams by relevance to the keyword filter. The apparatus is caused to direct presentation of a filtered list of patient exams that is clustered by patient in the filtered list of patient exams. And for each patient having patient exams in the subset of the patient exams, the respective patient exams are in ranked order in the filtered list of patient exams according to the keyword filter.

Type: Application

Filed: March 30, 2011

Publication date: October 4, 2012

Inventor: Radu Catalin Bocirnea
FACET SUPPORT, CLUSTERING FOR CODE QUERY RESULTS

Publication number: 20120254162

Abstract: Techniques and tools are described for refining source-code query results. For example, source-code query results for a query can be generated, semantic clusters of the source-code query results can be generated, and based on a selection of a semantic cluster option, refined source-code query results can be sent. Also, for example, source-code query results can be received, selections of facet values associated with groups of the source-code query results can be sent, and based on selected facet values, a subset of the source-code query results can be received.

Type: Application

Filed: May 19, 2011

Publication date: October 4, 2012

Applicant: Infosys Technologies Ltd.

Inventors: Allahbaksh Mohammedali Asadullah, Susan George, Basava Raju Muddu
METHOD OF CATEGORIZING AN INVENTION WITHIN AN INVENTION LANDSCAPE

Publication number: 20120254185

Abstract: A computer-based method is described for categorizing inventions within the context of an invention landscape. A set of key phases is employed based upon the likelihood that the description of the invention to be categorized will share these key phrases with the descriptions of similar inventions from within the invention landscape. The results are ranked in such a way as to enable a tentative assignment of the target invention to one or more categories, and to optionally estimate the value of the invention.

Type: Application

Filed: April 4, 2011

Publication date: October 4, 2012

Inventors: N. Edward White, G. Edward Powell, JR.
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM

Publication number: 20120246176

Abstract: An information processing apparatus includes: a document analyzing unit that extracts phrases including a pair of entities, to which a relevance label is granted, from document data; and a label granting unit that grants the relevance label. The label granting unit acquires vocabulary syntax patterns included in the phrases including the pair of entities, acquires the appearing number of times the vocabulary syntax pattern appears in the document data from the document data, counts the number of pairs of entities, sets a probability model created from a probability density distribution, a parameter Z indicating validity of the granting of the relevance label, and a parameter a indicating a probability of rightly granting the relevance label, calculates the parameters Z and a for which a likelihood is maximum in the probability model, evaluates the validity of the granting of the relevance label, and grants the relevance label on the evaluation result.

Type: Application

Filed: March 8, 2012

Publication date: September 27, 2012

Applicant: SONY CORPORATION

Inventor: Shingo Takamatsu
VARIABLE PAGE SIZING FOR IMPROVED PHYSICAL CLUSTERING

Publication number: 20120246160

Abstract: A data size characteristic of contents of a related unit of data to be written to a storage by an input/output module of a data storage application can be determined, and a storage page size consistent with the data size can be selected from a plurality of storage page sizes. The related unit of data can be assigned to a storage page having the selected storage page size, and the storage page can be passed to the input/output module so that the input/output module physically clusters the contents of the related unit of data when the input/output module writes the contents of the related unit of data to the storage. Related methods, systems, and articles of manufacture are also disclosed.

Type: Application

Filed: March 25, 2011

Publication date: September 27, 2012

Inventors: Dirk Thomsen, Axel Schroeder, Ivan Schreter
METHOD AND DEVICE FOR GENERATING A SIMILAR MEANING TERM LIST AND SEARCH METHOD AND DEVICE USING THE SIMILAR MEANING TERM LIST

Publication number: 20120246162

Abstract: In a generation device, a term determiner, for reference terms and a similar meaning term that has similar meaning to any of the reference terms, determines if each of the reference terms and the similar meaning term are both included in a document data group. An extractor extracts a reference term and the similar meaning term of the reference term that were both determined to be included in the document data group. A priority determiner determines an output priority to the extracted similar meaning term on the basis of appearance of at least either of the similar meaning term and the reference term in the document data group. And a list generator generates a the similar meaning term list in such a way that the extracted reference term, the similar meaning term of the extracted reference term, and the output priority are associated with one another.

Type: Application

Filed: March 20, 2012

Publication date: September 27, 2012

Applicant: CASIO COMPUTER CO., LTD.

Inventor: Tomoharu Yamaguchi
Method and system for automatic objects classification

Patent number: 8275765

Abstract: The present invention provides a method and system for automatic objects classification. The method comprises: acquiring a set of objects; classifying the objects based on query log to generate a first classification result; classifying the objects based on ontological information to generate a second classification result; and semantically fusing the first and second classification results to generate a final classification result. According to the present invention, compared with the prior arts, by semantically fusing the query log-based classification result and the ontology-based classification result, the accuracy and user-friendness of the object classification can be improved.

Type: Grant

Filed: October 28, 2009

Date of Patent: September 25, 2012

Assignee: NEC (China) Co., Ltd.

Inventors: Jianqiang Li, Xin Meng, Yu Zhao, Jingwei Shi
UNSUPERVISED MESSAGE CLUSTERING

Publication number: 20120239650

Abstract: Unsupervised clustering can be used for organization of micro-blog or other short length messages into message clusters. Messages can be compared with existing clusters to determine a similarity score. If at least one similarity score is greater than a threshold value, a message can be added to an existing message cluster. If a message is not similar to an existing cluster, the message can be compared against criteria for starting a new message cluster.

Type: Application

Filed: March 18, 2011

Publication date: September 20, 2012

Applicant: MICROSOFT CORPORATION

Inventors: KI YEUN KIM, LEI DUAN, SEOKKYUNG CHUNG
INFORMATION PROCESSING APPARATUS, MESSAGE CLASSIFYING METHOD AND NON-TRANSITORY MEDIUM

Publication number: 20120239656

Abstract: A tied server includes a first storage unit that stores appearance patterns of messages having a transaction identifier to identify a transaction. The tied server also includes a second storage unit that stores messages executed on the transaction DB server having the transaction ID by the application server and communicated between an application server and a DB server. The tied server classifies the messages stored in the second storage unit with respect to each transaction based on the appearance patterns of the messages stored in the first storage unit.

Type: Application

Filed: January 23, 2012

Publication date: September 20, 2012

Applicant: FUJITSU LIMITED

Inventors: Yuuji HOTTA, Motoyuki KAWABA
CATEGORY CLASSIFICATION PROCESSING DEVICE AND METHOD

Publication number: 20120239657

Abstract: A category classification processing device includes a search unit that stores, as a search keyword log assembly, Q&A examples, which are actually referred to by a client, together with keywords; and a category extracting unit that obtains keyword storage frequencies expressing a number of times each of the keywords, which are recorded together with the Q&A examples in the search keyword logs, is stored for each of the Q&A examples, extracts, as category candidates of each of the Q&A examples, an m number of top keywords (m being a positive integer) in a descending order of the keyword storage frequency, uses the extracted category candidates as categories, and associates the categories with the Q&A examples.

Type: Application

Filed: February 3, 2012

Publication date: September 20, 2012

Applicant: FUJITSU LIMITED

Inventors: Reiko NAGANO, Hajime INOUE
EXTENT VIRTUALIZATION

Publication number: 20120239649

Abstract: Files can be segmented into distinct groups and allocated storage units such as blocks. Files associated with parent and child files can be segmented into separate groups, for instance. Further, a group associated with parent files can be extended to include additional blocks reserved for subsequent update. Additionally, metadata can be merged across groups to provide a unified view of the distinct groups.

Type: Application

Filed: March 15, 2011

Publication date: September 20, 2012

Applicant: MICROSOFT CORPORATION

Inventor: Galen C. Hunt
Machine Assisted Query Formulation

Publication number: 20120239653

Abstract: Architecture for completing search queries by using artificial intelligence based schemes to infer search intentions of users. Partial queries are completed dynamically in real time. Additionally, search aliasing can also be employed. Custom tuning can be performed based on at least query inputs in the form of text, graffiti, images, handwriting, voice, audio, and video signals. Natural language processing occurs, along with handwriting recognition and slang recognition. The system includes a classifier that receives a partial query as input, accesses a query database based on contents of the query input, and infers an intended search goal from query information stored on the query database. A query formulation engine receives search information associated with the intended search goal and generates a completed formal query for execution.

Type: Application

Filed: May 25, 2012

Publication date: September 20, 2012

Applicant: Microsoft Corporation

Inventors: John C. Platt, Gary W. Flake, Ramez Naam, Anoop Gupta, Oliver Hurst-Hiller, Trenholme J. Griffin, Joshua T. Goodman
Hardware Accelerated Application-Based Pattern Matching for Real Time Classification and Recording of Network Traffic

Publication number: 20120239652

Abstract: An indexing database utilizes a non-transitory storage medium. A pattern matching processing unit generates preclassification data for the network data packets utilizing pattern matching analysis. At least one processing unit implements a storage process that receives the network data packets, stores the network data packets in at least one of the slots, and transfers the network data packets to a packet capture repository when slots in a shared memory are full. A preclassification process requests from the pattern matching processing unit the preclassification data. An indexing process determines, based upon the preclassification data, whether to invoke or omit additional analysis of the network data packets, and performs at least one of aggregation, classification, or annotation of the network data packets in the shared memory to maintain one or more indices in the indexing database.

Type: Application

Filed: March 15, 2012

Publication date: September 20, 2012

Applicant: SOLERA NETWORKS, INC.

Inventors: Matthew S. Wood, Joseph H. Levy, McKay Marston
TAG INFORMATION MANAGEMENT APPARATUS, TAG INFORMATION MANAGEMENT SYSTEM,CONTENT DATA MANAGEMENT PROGRAM, AND TAG INFORMATION MANAGEMENT METHOD

Publication number: 20120233166

Abstract: A content data management apparatus that manages tag data indicating attributes relating to content data, comprising: an extraction section that extracts positional information indicating geographic positions associated with the content data and time information indicating time points associated with the content data, the positional information and the time information being attached to the content data; a speed computation section that computes speeds associated with the content data, based on the positional information and the time information extracted by the extraction section; and a grouping section that groups the content data, based on the speeds computed by the speed computation section.

Type: Application

Filed: November 10, 2011

Publication date: September 13, 2012

Applicant: Buffalo Inc.

Inventors: Hayato Kato, Hiroaki Kawasaki, Yutaka Maruyama, Kenji Takahashi
DETERMINING PREFERRED CATEGORIES BASED ON USER ACCESS ATTRIBUTE VALUES

Publication number: 20120233173

Abstract: Determining one or more preferred categories for a user is disclosed, including: determining a plurality of access attribute values corresponding to a plurality of types of access attributes associated with an access of the website by the current user; determining a plurality of categories corresponding to the plurality of access attribute values based at least in part on stored corresponding relationships between categories and access attribute values, wherein at least a portion of the determined plurality of categories comprises one or more preferred categories from which one or more products are configured to be recommended to the current user; and presenting product information associated with the one or more preferred categories.

Type: Application

Filed: March 7, 2012

Publication date: September 13, 2012

Applicant: ALIBABA GROUP HOLDING LIMITED

Inventors: Zhixiong Yang, Ningjun Su, Rongshen Long, Xu Zhang

prev 1 2 3 4 5 6 7 8 9 … next