Clustering Or Classification (epo) Patents (Class 707/E17.089)
  • Publication number: 20100293164
    Abstract: The invention relates to a system for accessing a database comprising a plurality of image data sets. The system comprises an acquisition unit for acquiring a query for searching the database for an image data set or an image data subset comprised in an image data set, the query comprising at least one medically relevant term defining search criteria; a determining unit for determining the image data set or the image data subset comprised in the image data set, based on the strength of semantic matches between the at least one medically relevant term and (a) corresponding medical annotation(s) describing the image data set; and a retrieving unit for retrieving the determined image data set or image data subset from the database.
    Type: Application
    Filed: July 25, 2008
    Publication date: November 18, 2010
    Applicant: KONINKLIJKE PHILIPS ELECTRONICS N.V.
    Inventors: Juergen Weese, Helko Lehmann, Yuechen Qian, Warner Rudolph Theophile Ten Kate
  • Publication number: 20100293165
    Abstract: A subscriber identification system 100 is presented in which subscriber selection data 250 including channel changes 134, volume changes 132, and time-of-day viewing information is used to identify a subscriber (user) 130 from a group of subscribers. In one instance, the subscriber selection data 250 is recorded and a signal processing algorithm such as a Fourier transform is used to produce a processed version of the subscriber selection data. The processed version of the subscriber selection data can be correlated with stored common identifiers of subscriber profiles to determine which subscriber 130 from the group is presently viewing the programming. A neural network or fuzzy logic can be used as the mechanism for identifying the subscriber 130 from clusters of information which are associated with individual subscribers.
    Type: Application
    Filed: July 26, 2010
    Publication date: November 18, 2010
    Inventors: Charles Eldering, M. Lamine Sylla
  • Publication number: 20100287163
    Abstract: A collaborative online content editing and approval system for creating an automated workflow process for editing and approving content before displaying in public domain. Content and data document including information of the content is received. Content and information of the content is then compared with editor information for selecting an editor. Further, a selected editor is automatically alerted to participate in the online content editing.
    Type: Application
    Filed: January 3, 2008
    Publication date: November 11, 2010
    Inventors: G. S. Sridhar, Vishwanath Ramdas, Pradeep Bennur Teregowda
  • Publication number: 20100287161
    Abstract: An information processing system and method for gathering and interpreting information includes capturing information from at least one of a plurality of information streams/sensors wherein the information includes video, audio, seismic, radio frequency (RF), and/or text then applying a standardized tag to an event at a predetermined time or over a predetermined period of time and storing the standardized tag in a repository which can be interrogated rapidly for situation/scene understanding. The information processing system and method include providing a plurality of segmentation algorithms, determining the type of information to be processed and selecting one or more of the segmentation algorithms to process the information based upon the type of information to be processed.
    Type: Application
    Filed: April 4, 2008
    Publication date: November 11, 2010
    Inventor: Waseem Naqvi
  • Publication number: 20100287160
    Abstract: A method and system for clustering a plurality of data elements is provided. According to embodiments of the present invention, a bit vector is generated based on each of the data elements. Bit operations are used to group each data element into a cluster. Clustering may be performed by partition clustering or hierarchical clustering. Embodiments of the present invention cluster data elements such as text documents, audio files, video files, photos, or other data files.
    Type: Application
    Filed: May 4, 2010
    Publication date: November 11, 2010
    Inventor: Nick Pendar
  • Publication number: 20100287508
    Abstract: The present embodiment provides an apparatus for providing a search screen comprising: a data communication unit that performs a data communication with a storage medium in which at least one data is stored; a control unit that analyzes types of data for the at least one data provided through the data communication unit and sorts the provided at least one data based on the analyzed types of data; and a search screen generation unit that generates a search screen for the at least one data sorted based on the types of data, wherein the search screen generation unit generates virtual folders corresponding to the types of data and disposes the data corresponding thereto in the virtual folders for generated each type of data.
    Type: Application
    Filed: March 18, 2009
    Publication date: November 11, 2010
    Inventor: Won-Sik Kim
  • Publication number: 20100281027
    Abstract: The present invention provides a flexible, dynamic database partition method and system. The method includes the steps of acquiring a data partition rule, where the data partition rule is used to identify a first relationship between a data partition condition and a database partition; establishing a second relationship between the data partition condition and a data partition key based on the data partition rule and a third relationship between the database partition and the data partition key; adding the data partition key to a data item where the data item is stored in the database based on the second relationship between the data partition condition and the data partition key; and storing the data item in the database partition based on the data partition key of the data item.
    Type: Application
    Filed: April 29, 2010
    Publication date: November 4, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Ning Duan, Bo Gao, Chang Jie Guo, Jian Ming Zhang
  • Publication number: 20100274787
    Abstract: A method and a system for summarization of short comments are provided. The system comprises a memory to store a comments collection. The comments collection stores a plurality of comments for later access. The comments respectively include an overall rating and at least one phrase. The system also includes one or more processors to implement an aspect module to identify a first head term and a second head term based on a first portion of the comments and to map the first head term and the second head term into an aspect cluster. The one or more processor also implement a rating module to predict an aspect rating corresponding to the aspect cluster based on the respective overall ratings of the portion of the comments.
    Type: Application
    Filed: April 23, 2010
    Publication date: October 28, 2010
    Inventors: Yue Lu, Neelakantan Sundaresan
  • Publication number: 20100274786
    Abstract: A method and system a method for compressing and searching a plurality of strings. The method includes inputting a plurality of strings into a compression engine. The method also includes converting each of the plurality of strings into a new, prefix-preserving compressed string, using the compression engine. For every string P that is a strict prefix of a string S, P's resulting compressed string is a strict prefix of S's resulting compressed string.
    Type: Application
    Filed: April 9, 2010
    Publication date: October 28, 2010
    Applicant: BRIGHTCLOUD INC.
    Inventors: Christopher K. Harris, Hal Lonas
  • Publication number: 20100274788
    Abstract: In a method of encapsulating information in a database, a message is partitioned into a plurality of object class entries within the database. An object class pointer is generated for each of a first subset of the plurality of object class entries, the generating further including executing a pointer key algorithm, the algorithm additionally generating a random number for each object class entry and concatenating the randomly generated numbers to form a single parameter string adapted to obfuscate a path between a pointer and its corresponding object class entry. The plurality of object class entries are stored in non-adjacent storage locations within the database, with each of a second subset of the plurality of object class entries stored in association with one of the generated pointers.
    Type: Application
    Filed: July 6, 2010
    Publication date: October 28, 2010
    Inventor: Christopher B. A. Coker
  • Publication number: 20100268714
    Abstract: The present invention relates to an information analysis system comprising: a summary table creation unit for analyzing an input file if the file is inputted, extracting a field list corresponding to the field list information stored in a provided database, and creating a summary table including the extracted field list; a preprocessing module for performing a preprocess including at least one of field refinement, group creation, and sub-data set creation, for fields of the summary table created by the summary table creation unit; a matrix creation unit for creating a matrix based on matrix setting information inputted by a user, for the fields created by the summary table creation unit or the preprocessing module; a cluster analysis unit for analyzing a cluster of corresponding fields according to a cluster analysis method inputted by the user, for fields selected by the user among the fields created by the summary table creation unit or the preprocessing module; and a visualization data creation unit for cr
    Type: Application
    Filed: December 16, 2008
    Publication date: October 21, 2010
    Applicant: KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFOMATION
    Inventors: Yeong Ho Moon, Sang Pil Lee, Chang Hoan Lee, Sang Jin Bae, June Young Lee, Oh Jin Kwon, Bang Rae Lee, Eui Seob Jeong, Woon Dong Yeo
  • Publication number: 20100262599
    Abstract: A content processing system may include any of a number of content processing techniques such as condensed content management, multi-content compilation management, associated content compilation management, recommended content management, and content cluster management.
    Type: Application
    Filed: April 14, 2010
    Publication date: October 14, 2010
    Applicant: SRI INTERNATIONAL
    Inventor: Kenneth C. Nitz
  • Publication number: 20100262604
    Abstract: A method includes: collecting message sequences including a series of messages issued in response to one processing request; classifying the collected message sequences into groups of the message sequences whose simplified message sequences generated by excluding words other than reserved words from a database message that is a message including a SQL sentence are identical, wherein the database message is included in the series of messages; generating, for each group, a normalized expression including the reserved words in the database message as fixed character strings and arbitrary character strings replaced with portions other than the fixed character strings in the database message, for the database message included in the message sequence belonging to the group; and generating a rule for converting the database message considered to be identical with the normalized expression into a series of fixed character strings included in the normalized expression.
    Type: Application
    Filed: March 3, 2010
    Publication date: October 14, 2010
    Applicant: FUJITSU LIMITED
    Inventor: Naoki AKABOSHI
  • Publication number: 20100257169
    Abstract: A method for generating search collection of query is disclosed, which can provide a search result list displayed by an optimized method of automatically generating a specific collection by each query, the method comprising obtaining a first query and search results selected by a user from a search result list generated in response to the first query; classifying the search results into one or more groups; and generating a search collection for the first query by each group.
    Type: Application
    Filed: August 4, 2008
    Publication date: October 7, 2010
    Applicant: NHN Corporation
    Inventors: Byounghak Kim, Tae Yeong Kwak
  • Publication number: 20100257173
    Abstract: Recurring components found in Chinese-type characters can be identified and classified by stroke count and free-endpoint count according to embodiments of the present invention. The bidirectional many-to-many relationships between characters and their components can be identified and recorded in electronic or non-electronic format and the recurring components can be ordered and retrieved according to stroke-endpoint value pair. In accordance with an embodiment, bidirectional many-to-many relationships between simple and composite components can be identified and recorded in an electronic or non-electronic format. An embodiment can provide a classification/retrieval method and apparatus for rapid search and retrieval of Chinese-type characters and their components based on stroke-endpoint value pairs and relationships between components and characters.
    Type: Application
    Filed: November 25, 2008
    Publication date: October 7, 2010
    Inventor: Warren Daniel Child
  • Publication number: 20100257527
    Abstract: A computer implemented method comprising: monitoring a plurality of processes relating to a plurality of applications and using a plurality of computing resources, at least some of the processes relating to a user activity; analyzing the use of the computing resources by each of the processes; analyzing the user activity in respect to the processes; and classifying the processes in respect to the analyzed user activity, wherein the monitoring; the processing; the analyzing use; the analyzing user activity; and the classifying, are carried out during running the processes, and wherein at least one of: the monitoring; the processing; the analyzing use; the analyzing user activity; and the classifying, is performed by at least one computer.
    Type: Application
    Filed: March 31, 2010
    Publication date: October 7, 2010
    Applicant: SOLUTO LTD
    Inventors: Tomer DVIR, Ishay GREEN, Omer BAKI, Amit LAVIAN
  • Publication number: 20100257170
    Abstract: Disclosed is a naming service system and method of an SCA-based application component. The naming service system includes a naming service server having an arrangement information analyzing module, a database module, and a naming service deploying module, and a plurality of clients each having a module for selecting a naming service server for registration and a module for selecting an optimum naming service server for search. The naming service method includes: obtaining arrangement information of components forming an SCA-based application from hostcollocation elements of an SAD file; deploying local naming services for each node; arranging the naming services according to information of a naming service allocation table; and providing a function of registering a naming service of the component. When searching the IOR of a component, the naming service server selects an optimum naming service from a local naming service and a remote naming service, thereby providing a high-speed naming service.
    Type: Application
    Filed: September 3, 2008
    Publication date: October 7, 2010
    Applicant: Electronics and Telecomunications Research Institute
    Inventors: Hongsoog Kim, Namhoon Park
  • Publication number: 20100257144
    Abstract: Systems and methods for data aggregation, targeting and acquisition are described. A method may receive data and storing the data in one or more source data tables and select one of the one or more source data tables. The selected source data table includes updated data fields. The method may also identify a plurality of destination data tables that need to be updated, in which each destination data table is linked to and contains an aggregation of a subset of data from the selected source data table, identify one or more data fields in the identified destination data tables that need to be updated with data from the updated data fields in the selected source data table, and determine using the processor, for each identified destination data table, a best aggregation source data table.
    Type: Application
    Filed: April 1, 2010
    Publication date: October 7, 2010
    Applicant: Touchstone Systems, Inc.
    Inventors: Jerry Lambert, Shiraz Khalid
  • Publication number: 20100257137
    Abstract: The invention addresses the access to subscriber data in a telecommunication system, and provides for a database system with a master database and with a plurality of slave databases acting as memory caches closely located with the requester applications and for a method of handling such database system.
    Type: Application
    Filed: July 23, 2007
    Publication date: October 7, 2010
    Inventors: Berta Isabel Escribano Bullon, Alfredo Gonzalez Plaza
  • Publication number: 20100250543
    Abstract: A query having multiple parts may be processed to form an intermediate results set. This intermediate results set may be partitioned into a plurality of groups. Thereafter, the groups may be sorted into a plurality of containers so that each container contains data sufficient to calculate one requested result in the multipart query. Related techniques, apparatuses, systems, and computer program products are also described.
    Type: Application
    Filed: June 10, 2010
    Publication date: September 30, 2010
    Inventors: Franz X. Faerber, Christian M. Bartholomae, Erich Marschall, Stefan Dipper, Guenter Radestock
  • Publication number: 20100250546
    Abstract: A method for operating a server to improve bandwidth efficiency in a computer network is disclosed. The server is operable to transmit files between a memory of the server and destinations on the computer network through a communication link having a finite bandwidth. The files are distinguishable by type and the server is provided with a rule set for prioritizing transmission of files by type. The method comprises monitoring a bandwidth usage of the communication link, and triggering application of the rule set when the bandwidth usage exceeds a threshold amount. The threshold amount is determined relative to the finite bandwidth. The method further comprises distinguishing between the files according to type, and prioritizing transmission of the files according to type and according to the rule set.
    Type: Application
    Filed: June 7, 2010
    Publication date: September 30, 2010
    Inventor: Gary Stephen Shuster
  • Publication number: 20100250539
    Abstract: The present application relates to a method for implementing picture search and a website server thereof.
    Type: Application
    Filed: March 22, 2010
    Publication date: September 30, 2010
    Inventors: Chunyi Zhou, Weiwei Wang, Xinfeng Zhou, Yu Dong, Xiaoying Weng, Jialong Huang
  • Publication number: 20100250542
    Abstract: A separation surface set storage part stores information defining a plurality of separation surfaces which separate a feature space into at least one known class region respectively corresponding to at least one known class and an unknown class region. Each of the at least one known class region is separated from outside region by more than one of the plurality of separation surfaces which do not intersect to each other. A data classification apparatus determine a classification of a classification target data whose inner product in the feature space is calculable by calculating to which region of the at least one known class region and the unknown class region determined by the information stored in the separation surface set storage part the classification target data belongs. A method and apparatus for data classification which can simultaneously perform identification and outlying value classification with high reliability in a same procedure are provided.
    Type: Application
    Filed: April 21, 2008
    Publication date: September 30, 2010
    Inventor: Ryohei Fujimaki
  • Publication number: 20100250540
    Abstract: A method is provided for managing a relational database of the SQL type for information technology and network infrastructure service information, including a method in which the following are created, in a system for managing a database of the MySQL type, a read-only data storage engine, and unmodifiable tables, for example of WORM, defined as “Write Once Read Many” type managed by the storage engine; each table includes a column of digital counting data called a “timestamp”; each table is partitioned by time intervals; partition files are grouped in subdirectories of a file system, these directories forming a tree structure, each node of which is uniquely identified from a timestamp.
    Type: Application
    Filed: March 22, 2010
    Publication date: September 30, 2010
    Inventors: Serge ADDA, Olivier CHEDRU
  • Publication number: 20100250547
    Abstract: A method, system and article of manufacture therefor, are disclosed for automatically generating a query from document content.
    Type: Application
    Filed: June 10, 2010
    Publication date: September 30, 2010
    Inventors: Gregory T. Grefenstette, James G. Shanahan
  • Publication number: 20100241519
    Abstract: Systems and methods for capturing and managing information from a plurality of individual items sold in a commercial transaction between a consumer and a merchant can include capturing, at a point-of-sale, transactional data for the a plurality of individual items sold during the transaction. The transactional data can include item identification information for each of the a plurality of individual items sold. The transactional data can further be used to authorize the transaction.
    Type: Application
    Filed: February 18, 2010
    Publication date: September 23, 2010
    Applicant: GreenReceipts, LLC
    Inventors: Adam N. Lindahl, Michael D. Madden, Mel A. Shaftel
  • Publication number: 20100235357
    Abstract: A method of processing capacity information is disclosed. The capacity information relates to data capacity in a data network in which a consumer circuit is carried on, and consumes bandwidth made available by, a bearer circuit. The method comprises storing, in a network information database, an entity representing the bearer circuit, and associating capacity information with the bearer circuit entity specifying a first bandwidth quantity defining a quantity of bandwidth made available by the bearer circuit. Also stored is an entity representing the consumer circuit, and capacity information is associated with the consumer circuit entity specifying a second bandwidth quantity defining a quantity of bandwidth allocated to the consumer circuit.
    Type: Application
    Filed: May 26, 2010
    Publication date: September 16, 2010
    Inventors: Peter John Briscoe, Elizabeth Graves Tector
  • Publication number: 20100228733
    Abstract: A system and method for performing classification using semantic distance measurements. Items of electronic content accessed by individuals over a global communications network are identified. A set of content that includes the plurality of identified items of electronic content are stored. The set of content is normalized. Each of the keywords contained the set of content is identified and a semantic distance between each of the identified keywords is measured.
    Type: Application
    Filed: November 11, 2009
    Publication date: September 9, 2010
    Applicant: COLLECTIVE MEDIA, INC.
    Inventors: Paul Harrison, James Oliphant, Hal Fulton, Armin Roehrl
  • Publication number: 20100228715
    Abstract: A system and method for creating a user profile and for using the user profile to order search results returned by a search engine. The user profile is based on search queries submitted by a user, the user's specific interaction with the documents identified by the search engine and personal information provided by the user. Terms for the user profile may be selected from the documents accessed by the user by performing paragraph sampling or context analysis. Generic scores associated with the search results are modulated by the user profile to measure their relevance to a user's preference and interest. The search results are re-ordered accordingly so that the most relevant results appear on the top of the list. User profiles can be created and/or stored on the client side or server side of a client-server network environment.
    Type: Application
    Filed: May 12, 2010
    Publication date: September 9, 2010
    Inventor: Stephen R. Lawrence
  • Publication number: 20100228732
    Abstract: An information offering apparatus and an information offering method are provided. The information offering apparatus is configured to arrange information, which is generated as a user uses services, according to a time period, group the arranged information, and then display the arranged information together with the time period.
    Type: Application
    Filed: February 18, 2010
    Publication date: September 9, 2010
    Inventors: Young-ho Rhee, Hyun-joo Kang, Ju-youn Lee
  • Publication number: 20100223122
    Abstract: An exemplary embodiment of the invention relates to a method, system, and storage medium for providing variable consumer information at a retail display location. The system comprises a host system further including a server; a commercial display services application including a user interface executing on the server; and a data storage device coupled to the server. The data storage device stores databases of diverse media formats including: a linked advertisement database operably configured to store audio-video advertising content and audio-video advertisement records; a static advertisement database operably configured to store static advertising content and static advertisement records; an audio clip database operably configured to store audio clip content and audio clip records; and a file database storing registration information; and a link to at least one retail entity.
    Type: Application
    Filed: March 1, 2010
    Publication date: September 2, 2010
    Applicant: CBS Intractive, Inc.
    Inventors: George Burling Prince, III, Jeffrey Allen Martin
  • Publication number: 20100217763
    Abstract: An automatic clustering method using an Average-linkage algorithm and a KPower Means algorithm, and a method and apparatus for multi-path clustering required for a spatial channel modeling (SCM) in a wireless communication environment are provided. The automatic clustering method, including: a first step of obtaining an initial cluster centroid using a hierarchical clustering algorithm; a second step of moving the initial cluster centroid using a two dimensional clustering algorithm; a third step of clustering a data set according to the moved initial cluster centroid; and a fourth step of calculating a validation index with respect to the clustered data set and determining an optimal number of clusters.
    Type: Application
    Filed: May 19, 2008
    Publication date: August 26, 2010
    Applicants: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, SEOUL NATIONAL UNIVERSITY INDUSTRY FOUNDATION
    Inventors: Jae Joon Park, Won Sop Kim, Myung Don Kim, Hyun Kyu Chung, Seong-Cheol Kim, Ja-ho Koo, Namkoo Kang
  • Publication number: 20100217764
    Abstract: According to one embodiment, generating a dictionary and determining a co-occurrence context includes accessing a domain corpus comprising articles. Each article corresponds to a particular topic and comprises one or more terms that link to one or more other articles corresponding to one or more other topics. Each topic is designated as a term to yield a dictionary. A co-occurrence context is defined for the domain corpus. At least two terms appearing in the co-occurrence context are considered co-occurring. Co-occurrences among the terms are calculated according to the co-occurrence context.
    Type: Application
    Filed: February 24, 2010
    Publication date: August 26, 2010
    Applicant: Fujitsu Limited
    Inventors: Yannis Labrou, Stergios Stergiou
  • Publication number: 20100211917
    Abstract: A method and an apparatus for reminding and browsing related information of contacts, and a recording medium using the same are provided. In the present method, a communication device displays a contact list comprising at least one contact on a screen thereof. Meanwhile, the communication device checks whether each contact has unread related information and displays an indicating icon on the corresponding contact in the contact list when it is determined that the contact has related information unread, so as to remind a user of the communication device that the contact has unread related information.
    Type: Application
    Filed: February 11, 2010
    Publication date: August 19, 2010
    Applicant: HTC CORPORATION
    Inventor: Yuan-Mao Tsuei
  • Publication number: 20100211515
    Abstract: The method, system and a computer program and a computer product for managing workers and documents is provided. The method includes storing industry representations and a list of workers with data related to the workers, linking the industry representations to the workers and selecting at least one worker from based on the industry representations for that worker. The method also includes scheduling workers to a job based on whether they have all of the required industry representations. In addition, the method includes uploading industry representations and bar code scanning industry representations into the database. The method further includes dispatching said industry representations to other users and automatically mapping fields of the dispatched data in the destination database. In addition, the method includes selectively encrypting only sensitive fields in data transmission between two entities.
    Type: Application
    Filed: September 17, 2009
    Publication date: August 19, 2010
    Inventors: Lewis E. Woodings, James D. Bosse, Mathew P. Hodgson, Alexander Smorodintsev, Alexei Klimantov, Viatcheslav Karassik
  • Publication number: 20100211570
    Abstract: The present invention relates to distributed systems in which resource utilisation decisions depend upon the semi-automatic categorisation of resource descriptions stored in the distributed system. In the principal embodiment, the resource descriptions are web service descriptions which are augmented with tags (i.e. descriptive words or phrases) entered by users and/or by web service administrators. The initial use of automatic categorisation of these descriptions, followed by a user-driven fine-tuning of the automatically-generated categories enables the rapid creation of reliable categorisation of the resource descriptions, which in turns results in better resource utilisation decisions and hence a more efficient use of the resources of the distributed system.
    Type: Application
    Filed: September 3, 2008
    Publication date: August 19, 2010
    Inventors: Robert Ghanea-Hercock, Hakan Duman, Alexander L. Healing
  • Publication number: 20100205179
    Abstract: A social networking system allows users to upload information about themselves to a remote database, preferably over the Internet. Users are able to connect to other users thus establishing links. These links may be categorized based on the relationship between the users, e.g., family, friends, co-workers, etc. The uploaded information may also be categorized using the same categories of relationships. The information of each user may then only be sent to users having a connection category that matches the information category.
    Type: Application
    Filed: October 26, 2007
    Publication date: August 12, 2010
    Inventors: Anthony R. Carson, Bryan L. Noland, Darren M. Ford, James D. Cunningham
  • Publication number: 20100205177
    Abstract: An object identification apparatus includes an image data input unit configured to input captured image data including an object, an object identification data generation unit configured to generate data for identifying the object by extracting a feature vector from a partial area of the input image data to convert the feature vector according to the partial area, an object dictionary data storage unit configured to store object dictionary data generated from previously recorded image data, and an object identification unit configured to identify a class to which the object belongs, which is included in the image data input by the image data input unit, based on the data for identifying the object and the object dictionary data.
    Type: Application
    Filed: January 11, 2010
    Publication date: August 12, 2010
    Applicant: CANON KABUSHIKI KAISHA
    Inventors: Hiroshi Sato, Katsuhiko Mori, Yoshinori Ito
  • Publication number: 20100198827
    Abstract: A method for finding text reading order in a document such as a scanned newspaper or magazine includes the steps of pruning unnecessary text zones using semantic analysis (40), using text correlation measures to cluster zones (41), and then finding a reading order within each of the clusters (42).
    Type: Application
    Filed: July 27, 2005
    Publication date: August 5, 2010
    Inventors: Sherif Yacoub, Daniel Ortega, Paolo Faraboschi, Jose Abad Peiro
  • Publication number: 20100191731
    Abstract: One embodiment of the invention provides a method of grouping defects. The method includes the steps of obtaining a plurality of defect reports, preprocessing the defect reports, and applying a clustering algorithm, thereby grouping the defect reports. Another embodiment of the invention provides a computer-readable medium whose contents cause a computer to perform a method comprising: obtaining a plurality of defect reports; preprocessing the defect reports; and applying a clustering algorithm, thereby grouping the defect reports. Another aspect of the invention provides a system for grouping defect reports. The system includes: a preprocessing module, a representation module in communication with the preprocessing module, and a clustering module in communication with representation module.
    Type: Application
    Filed: January 23, 2009
    Publication date: July 29, 2010
    Inventors: Vasile Rus, Sajjan Shiva
  • Publication number: 20100191733
    Abstract: A music linked photocasting service system and method are provided. The music linked photocasting service method includes reproducing music at the request of a user, analyzing a mood of the reproduced music at prescribed times, until music reproduction is completed, searching photographs suitable for a analyzed mood of the music, and displaying the searched photographs.
    Type: Application
    Filed: January 29, 2010
    Publication date: July 29, 2010
    Inventors: Sung-Jin PARK, Won-Sang KWON, Won-Suk YANG, Chan-Seok YANG
  • Publication number: 20100185607
    Abstract: Embodiments of the present invention provide a method and system for sorting Internet music files, a searching method and a searching engine. The method for sorting Internet music file includes: calculating text correlation and value correlation of each of music files; calculating comprehensive correlation of each of the music files according to the text correlation and the value correlation; and sorting the music files according to the comprehensive correlation. Embodiments of the present invention are to show the user music files possessing relatively good correlation with the searching request of the user.
    Type: Application
    Filed: September 4, 2008
    Publication date: July 22, 2010
    Applicant: Tencent Technology (Shenzhen) Company Limited
    Inventors: Rongfang Shao, Zhiping Wang, Ying Xiong, Yang Guo
  • Publication number: 20100185618
    Abstract: Data records containing one or more fields, which can be considered keys and/or values, are received, and processed such that data values of records that contain key values of interest are aggregated together. The keys of the resultant aggregations or “resultant keys” are created under the control of simple parameters to an aggregation framework. Similarly, the particular aggregations performed are also under the control of a similar set of simple parameters to the aggregation framework. Mapping of keys to reduce originality is one of the important features of resultant key creation. Finally, the structure of the parameters used to control aggregation is simple, flexible, and powerful.
    Type: Application
    Filed: March 29, 2010
    Publication date: July 22, 2010
    Applicant: Microsoft Corporation
    Inventor: Glenn R. Peterson
  • Publication number: 20100185620
    Abstract: A computer implemented method of generating an ordered list of geographical locations having similarities in preselected categories relative to a first geographical location.
    Type: Application
    Filed: March 10, 2010
    Publication date: July 22, 2010
    Applicant: LOCATION INC. GROUP CORPORATION
    Inventor: Andrew Schiller
  • Publication number: 20100185619
    Abstract: Sampling analysis includes classifying a plurality of query keywords into a plurality of query keyword subsets according to page view (PV) values associated with the plurality of query keywords, the plurality of query keywords being submitted by a plurality of users; determining a respective plurality of sample rates of a respective plurality of query keywords in a respective one of the plurality of query keyword subsets; and sampling query data in the respective one of the plurality of query keyword subsets according to the respective plurality of sample rates.
    Type: Application
    Filed: January 20, 2010
    Publication date: July 22, 2010
    Inventors: Junlin Zhang, Jian Sun, Lei Hou, Qin Zhang
  • Publication number: 20100185695
    Abstract: A method for data clustering may comprise entering data into a computer network comprising a master processor, an array of slave processors, and two cluster seats associated with each slave processor; executing a master process comprising dividing the data into clusters, sending the clusters to the cluster seats, initializing an optimization cycle, and computing an objective function. The optimization cycle includes the parallel execution by the slave processors of a slave process, which includes exchanging data between paired clusters so as to increase the objective function based on two modalities, and then resorting the cluster pairs for a subsequent iteration of the process.
    Type: Application
    Filed: January 22, 2009
    Publication date: July 22, 2010
    Inventors: Ron Bekkerman, Martin B. Scholz
  • Publication number: 20100179951
    Abstract: Systems and methods for mapping enterprise data are described. Information associated with an enterprise is obtained from a plurality of sources and transformed to obtain formatted data. The formatted data is orchestrated and relationships between portions of the formatted data are determined from which business intelligence related to the enterprise is obtained. A plurality of analytics is performed on the formatted data and business intelligence.
    Type: Application
    Filed: March 3, 2009
    Publication date: July 15, 2010
    Inventor: Lon Daniel McPhail
  • Publication number: 20100179950
    Abstract: A system and method of profile matching using a multi-media survey is described. The method is capable to capturing the emotional reflex of a user. The method is generalized to categorizing an entity (a user or an object) to specific segment with similar emotional profiles. Each entity can be assigned to an emotional code. Such code can be used as a universal vocabulary in the emotional space for both commerce and consumers to adopt in facilitating communication among different parties.
    Type: Application
    Filed: March 27, 2007
    Publication date: July 15, 2010
    Applicant: Imagini Holdings Limited
    Inventor: Alex Willcock
  • Publication number: 20100174716
    Abstract: Methods and systems for improving text segmentation are disclosed. In one embodiment, at least a first segmented result and a second segmented result are determined from a string of characters, a first frequency of occurrence for the first segmented result and a second frequency of occurrence for the second segmented result are determined, and an operable segmented result is identified from the first segmented result and the second segmented result based at least in part on the first frequency of occurrence and the second frequency of occurrence.
    Type: Application
    Filed: March 15, 2010
    Publication date: July 8, 2010
    Applicant: Google Inc.
    Inventors: Gilad Israel Elbaz, Jacob Leon Mandelson
  • Publication number: 20100174707
    Abstract: An apparatus has a feature value extracting section 106 for extracting a feature value from a search key image; a distance calculating section 107 for calculating distances between the feature values of search target images and the feature value of the search key image; a distance analyzing section 108 for analyzing the distances calculated by the distance calculating section 107, and for selecting a feature value having a feature similar to the search key image as a feature value effective for the image search; and a search executing section 109 for arranging the search target images in a feature value space with coordinate axes to which a principal component analysis result of the feature value effective for the image search is assigned, and for searching for an image similar to the search key image from within the feature value space.
    Type: Application
    Filed: September 3, 2008
    Publication date: July 8, 2010
    Inventors: Daiki Kudo, Yoshiaki Kato, Hirofumi Nishikawa