Data Mining Patents (Class 707/776)
  • Patent number: 8195684
    Abstract: In an information/call center where calls are received, requesting information concerning entities, goods and services, directions to a given destination, etc., data is collected in processing such calls. In accordance with the invention, the collected data is analyzed to generate dynamic data to supplement and/or improve the traditional databases, typically searched by an operator for responses to the information requests. In providing a public information assistance service, such dynamic data may concern, e.g., the most popular movies, restaurants, requested categories, etc. In providing a personalized information assistance service, such dynamic data may concern, e.g., previous telephone connections made for a subscriber, the most popular telephone connections requested by a subscriber, etc. In addition, based on the past search behavior, “fuzzy” logic is developed for correlating between search terms.
    Type: Grant
    Filed: July 26, 2004
    Date of Patent: June 5, 2012
    Assignee: Grape Technology Group, Inc.
    Inventors: Nicholas J. Elsey, Karen L. Johnson, Timothy A. Timmins
  • Publication number: 20120136895
    Abstract: A location point determination apparatus comprises a geographic feature harvesting module (202) arranged to access and collect, when in use, geographic feature information associated with a predetermined named area datum. The apparatus also comprises a data assessment module (208) arranged to receive the geographic feature information collected by the geographic feature harvesting module and to evaluate from the geographic feature information collected in respect of at least one attribute of each geographic feature associated with the geographic feature information. The apparatus further comprises a selection module (210) arranged to select a geographic feature from the geographic features evaluated in accordance with a predetermined criterion associated with the evaluation of the geographic feature information.
    Type: Application
    Filed: April 29, 2010
    Publication date: May 31, 2012
    Inventor: Terry William Johnson
  • Patent number: 8190625
    Abstract: A method includes analyzing a plurality of electronic documents available via a network service, selecting content of the documents encountered during the analysis to generate signatures for the documents based on the content of the documents, generating an index comprising the signatures, and updating the index by performing additional analyses. The index is updated to include documents having the same signatures.
    Type: Grant
    Filed: March 29, 2006
    Date of Patent: May 29, 2012
    Assignee: A9.com, Inc.
    Inventor: James E. Beach
  • Publication number: 20120131056
    Abstract: According to one embodiment, a plurality of test drop recipes are first created based on design data on a semiconductor integrated circuit. Based on a defect inspection result of a pattern of a hardening resin material, which is formed by pressing a template on which patterns of the semiconductor integrated circuit are formed onto the hardening resin material applied to a substrate to be processed by use of the test drop recipes, a drop recipe with least defects is selected per press position on the substrate to be processed from the test drop recipes. The selected drop recipes for respective press positions are collected per functional circuit block configuring the semiconductor integrated circuit, thereby to generate a drop recipe creation assistant database.
    Type: Application
    Filed: September 21, 2011
    Publication date: May 24, 2012
    Inventors: Yasuo Matsuoka, Takumi Ota, Ryoichi Inanami
  • Publication number: 20120131055
    Abstract: Be TT.p the “technique teaching” of a patent or venture, RS a “reference set” of prior art “technique teachings TT.i”, any “element” of any TT described by its properties, and all this information be presented as meaningful items. Then the FSTP Expert System supports managing an analysis of TT.p over RS such that it is able to reply automatically and instantly to any query for any item in this information. These answers may describe any interrelation between any items or properties/facts or comment on such interrelations or on some insights into them achieved while generating these items by or interactively with the FSTP Expert System. By formalization of these properties it also supports determining the value of q dependably indicating TT.p as trivial/obvious over RS iff q=0 and for q>0 showing the “creative height of TT.p over RS” and quantifying the “power” of this indication.
    Type: Application
    Filed: September 9, 2011
    Publication date: May 24, 2012
    Applicant: Sigram Schindler Beteiligungsgesellschaft mbH
    Inventor: Sigram Schindler
  • Patent number: 8185515
    Abstract: Information regarding the structure of information in a content database is maintained in a structure database. The structure database is used to correlate the data structure of a query to the structure of the content database, in order to determine that information in the content database which needs to be provided to a searcher in response to the query. In one embodiment, this search method is used in an online forum, and the forum maintains a reputation score for users with respect to given subject matter. The reputation score is dependent upon the quality of a user's participation in the forum. A user's reputation score depends upon the evaluation by others of information he posts and upon the user evaluating information posted by others.
    Type: Grant
    Filed: December 1, 2008
    Date of Patent: May 22, 2012
    Assignee: Transparensee Systems, Inc.
    Inventor: Steven David Lavine
  • Publication number: 20120124089
    Abstract: A user interest pattern modeling server includes a history collection unit, a keyword extraction unit, a time pattern extraction unit, a keyword extension unit, a time pattern analysis unit and a pattern modeling unit. The history collection unit collects a user's use history of a content. The keyword extraction unit extracts a keyword from the use history of the content. The time pattern extraction unit extracts a first time pattern of the keyword. The keyword extension unit extracts an extended keyword through searching related words of the keyword. The time pattern analysis unit analyzes a second time pattern of the extended keyword based on the first time pattern. The pattern modeling unit models a user interest pattern for the keyword and the extended keyword based on the first and second time patterns.
    Type: Application
    Filed: November 11, 2011
    Publication date: May 17, 2012
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Jae-Cheol SIM, Kang-Yong Lee, Hwa-Shin Moon
  • Publication number: 20120117114
    Abstract: A system for collaborative analysis from different processes on different data sources. The system uses a unique approach to lightweight temporary data structures in order to allow communication of interim results among processes, and construction of semantically appropriate reports. The data structures are generated in near real time and their lightweight nature supports massive scaling, including many diverse streaming inputs.
    Type: Application
    Filed: November 7, 2011
    Publication date: May 10, 2012
    Inventor: Harold Theodore Goranson
  • Publication number: 20120109880
    Abstract: An illustrative embodiment of a computer-implemented method for using organizational awareness in locating business intelligence receives an identity of an individual in an organizational hierarchy of users to form an identified individual and identifies people related to the identified individual in the organizational hierarchy of users using a people information database and relationship criteria to form related people. The computer-implemented method further identifies documents associated with the related people to form identified documents, inspects gathered information of the identified documents using a subset of relationship criteria to form inspected information and creates a list of suggested documents based at least on the inspected information.
    Type: Application
    Filed: July 18, 2011
    Publication date: May 3, 2012
    Applicant: International Business Machines Corporation
    Inventors: David Dewar, Jason Hiltz-Laforge, Matthew J. Postle-Hacon
  • Publication number: 20120109965
    Abstract: The present invention relates generally to a system for automatic semantic-based mining that enables web mining for populating semantic artifacts data to be carried out with minimal user interaction.
    Type: Application
    Filed: March 23, 2010
    Publication date: May 3, 2012
    Applicant: Mimos Berhad
    Inventors: A/L Perumal Nagendran, Yuan Kai Chow, Yusrin Amruddin Amru
  • Publication number: 20120110014
    Abstract: Embodiments of the present invention provide a detector apparatus for detecting a physical resource employed in providing a particular virtual resource in a computer network, the computer network including a plurality of physical resources each being operable to be employed in providing virtual resources and having an environment sensor outputting sensor data representing changes in an operating property of the physical resource. A detector apparatus embodying the present invention comprises a sensor data receptor operable to receive sensor data output by the environment sensors, a pattern extractor operable to extract a pattern from the received sensor data from a physical resource, and a pattern matcher, wherein the pattern matcher is operable to compare the extracted pattern with a unique pattern known to be generated by a particular virtual resource, and to detect that the physical resource is employed in providing the particular virtual resource when a match is found.
    Type: Application
    Filed: September 22, 2011
    Publication date: May 3, 2012
    Applicant: FUJITSU LIMITED
    Inventor: David Snelling
  • Patent number: 8171033
    Abstract: Methods and systems for determination of thresholds for time-series data. Data is transformed by reducing outliers, dividing the time series data into discrete time intervals, and taking parts of the data corresponding to the range that the thresholds will bound. If data cycles are known, they may be applied to the data and the resulting sets are weighted. Thresholds are then derived from the weighted means and variances of the sets of weighted data.
    Type: Grant
    Filed: September 30, 2008
    Date of Patent: May 1, 2012
    Assignee: VMware, Inc.
    Inventor: Mazda A. Marvasti
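The final step of the abstract above (thresholds derived from weighted means and variances) can be illustrated with a minimal sketch. This is not the patented method: the function name `weighted_thresholds`, the band width `k`, and the "mean plus or minus k standard deviations" form are all assumptions made for illustration, and the outlier reduction and interval splitting steps are omitted.

```python
def weighted_thresholds(values, weights, k=3.0):
    """Return (lower, upper) thresholds from a weighted mean and variance.

    Hypothetical helper: `k` (band width in standard deviations) is an
    assumption, not a parameter named in the abstract.
    """
    total = sum(weights)
    # Weighted mean of the observations.
    mean = sum(w * v for v, w in zip(values, weights)) / total
    # Weighted (population) variance around that mean.
    var = sum(w * (v - mean) ** 2 for v, w in zip(values, weights)) / total
    std = var ** 0.5
    return mean - k * std, mean + k * std
```

For example, `weighted_thresholds([0.0, 2.0], [1.0, 1.0], k=1.0)` gives a band centered on the mean of 1.0 with a width of one standard deviation on each side.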
  • Publication number: 20120102068
    Abstract: A system and method for identifying high order associations between variables in complex systems that is particularly useful where there is no correlation or weak correlation between variables due to the influence of a third variable, a ternary relationship. The ternary relationship describes how the variation in the pattern of association between a pair of variables, including its sign and strength, is mediated by a third variable. In one embodiment applied to gene expression data, the activity of pairs of correlated genes due to the activity of one or more third genes is shown.
    Type: Application
    Filed: October 24, 2011
    Publication date: April 26, 2012
    Applicant: THE REGENTS OF THE UNIVERSITY OF CALIFORNIA
    Inventor: Ker-Chau Li
  • Patent number: 8166035
    Abstract: A grid-based data clustering method comprises: a parameter setting step, a partition step, a searching step, a seed-classifying step, an extension step, and a termination step. Through the above-mentioned steps, data in a data set are disposed in a plurality of grids, and the grids are classified into dense grids and uncrowded grids for a cluster to extend from one of the dense grids to gradually combine data in other dense grids nearby. Consequently, convenience in parameter setting, efficiency and accuracy in data clustering, and performance in noise filtering are achieved.
    Type: Grant
    Filed: January 6, 2010
    Date of Patent: April 24, 2012
    Assignee: National Pingtung University of Science & Technology
    Inventors: Cheng-Fa Tsai, Chien-Sheng Chiu
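The general idea in the abstract above (place data in grids, separate dense grids from uncrowded ones, and grow a cluster from a dense grid into neighboring dense grids) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented procedure: the function name `grid_cluster`, the 2-D square cells, the `min_points` density rule, and the 8-neighbor extension are all hypothetical choices.

```python
from collections import defaultdict, deque

def grid_cluster(points, cell_size, min_points):
    """Cluster 2-D points by merging adjacent dense grid cells."""
    # Partition step: assign each point to a square grid cell.
    cells = defaultdict(list)
    for x, y in points:
        cells[(int(x // cell_size), int(y // cell_size))].append((x, y))

    # Classify cells: dense cells hold at least `min_points` points;
    # points in uncrowded cells are treated as noise and dropped.
    dense = {c for c, pts in cells.items() if len(pts) >= min_points}

    clusters, seen = [], set()
    for seed in sorted(dense):
        if seed in seen:
            continue
        seen.add(seed)
        queue, members = deque([seed]), []
        # Extension step: breadth-first growth into neighboring dense cells.
        while queue:
            cx, cy = queue.popleft()
            members.extend(cells[(cx, cy)])
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    neighbor = (cx + dx, cy + dy)
                    if neighbor in dense and neighbor not in seen:
                        seen.add(neighbor)
                        queue.append(neighbor)
        clusters.append(members)
    return clusters
```

With `cell_size=1` and `min_points=2`, two adjacent cells holding two points each merge into one cluster, while an isolated single point is filtered out as noise.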
  • Patent number: 8166064
    Abstract: Disclosed is a computer method and system for identifying significance of patterns across a plurality of data patterns, which involves identifying pattern types of the plurality of data patterns and determining a relative pattern significance factor to compare the pattern types. Determining the relative pattern significance factor further involves calculating a percentage change of an identified outlier from a median for an outlier pattern, calculating a value of a step change as a percentage of a last value of a step preceding the step change for a step change pattern and calculating a percentage change from a start value on the fitted curve to an end value on the fitted curve for a trend pattern. A ranked list of the pattern types is returned based on their corresponding relative pattern significance factors.
    Type: Grant
    Filed: May 6, 2009
    Date of Patent: April 24, 2012
    Assignee: Business Objects Software Limited
    Inventor: John MacGregor
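The three percentage calculations named in the abstract above can be written out directly. The sketch below is only an illustration: the function names, the use of absolute values, and the dictionary passed to `rank_patterns` are assumptions, and each denominator is assumed to be non-zero.

```python
from statistics import median

def outlier_significance(values, outlier):
    """Percentage change of an outlier from the median of the series."""
    med = median(values)
    return 100.0 * abs(outlier - med) / abs(med)

def step_significance(last_before, step_change):
    """Step change as a percentage of the last value before the step."""
    return 100.0 * abs(step_change) / abs(last_before)

def trend_significance(start, end):
    """Percentage change from the start to the end of a fitted curve."""
    return 100.0 * abs(end - start) / abs(start)

def rank_patterns(factors):
    """Return pattern names ordered from most to least significant."""
    return sorted(factors, key=factors.get, reverse=True)
```

Because all three factors are percentages, patterns of different types become directly comparable, which is what allows a single ranked list to be returned.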
  • Publication number: 20120095770
    Abstract: A mechanism, in a data processing system, is provided for defining marketing strategies. The mechanism dynamically obtains information related to customer interactions associated with a plurality of customers, analyzes the information to identify patterns, selects patterns to define a marketing strategy for a marketer, and defines a marketing strategy based on the selected patterns.
    Type: Application
    Filed: October 19, 2010
    Publication date: April 19, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: George T. Jacob Sushil, Kalapriya Kannan
  • Publication number: 20120096031
    Abstract: A system and method for enabling efficient extraction of only meaningful frequent itemsets. The system includes a decision unit that decides a new itemset that becomes an investigation target in the same sequence as that of searching an itemset tree in a depth-first manner and in descending order, a frequent occurrence determining unit that registers the frequency of occurrence of the new itemset in a table if the frequency of occurrence is equal to or more than a predetermined threshold, a correlation determining unit that determines whether there is a correlation between each item in the new itemset and a subset of remaining items that were removed from the new itemset, and a registration unit that registers the new itemset in a set of meaningful frequent itemsets if the determination is positive for all items of the new itemset.
    Type: Application
    Filed: October 5, 2011
    Publication date: April 19, 2012
    Applicant: International Business Machines Corporation
    Inventor: Issei Yoshida
  • Patent number: 8161062
    Abstract: A method of analyzing customer behavior, where customers are engaged in customer-to-customer transactions in the third-party network, includes the transformation of data representing the customer-to-customer transactions from a data representation to a network representation, and then analyzing the network representation. The network representation includes a set of nodes and a set of links where each node represents a customer and each link represents a transaction between two of the customers.
    Type: Grant
    Filed: May 11, 2010
    Date of Patent: April 17, 2012
    Assignee: Mantas, Inc.
    Inventors: Tao Zhang, Steven Kirk Donoho
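The transformation described in the abstract above, from transaction records to a network of customer nodes and transaction links, can be sketched briefly. The record layout `(sender, receiver, amount)` and the function names are assumptions for illustration; the abstract does not specify a data format or which analyses are run on the network.

```python
from collections import defaultdict

def build_network(transactions):
    """Turn (sender, receiver, amount) records into nodes and links."""
    nodes = set()
    links = []
    for sender, receiver, amount in transactions:
        # Each customer becomes a node.
        nodes.update((sender, receiver))
        # Each transaction becomes one link between two customers.
        links.append((sender, receiver, amount))
    return nodes, links

def counterparties(links):
    """Map each customer to the set of customers they transacted with."""
    neighbors = defaultdict(set)
    for sender, receiver, _amount in links:
        neighbors[sender].add(receiver)
        neighbors[receiver].add(sender)
    return dict(neighbors)
```

Once the data is in this form, simple graph measures such as the number of distinct counterparties per customer become one-line lookups.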
  • Publication number: 20120089642
    Abstract: The system and methods described herein provide results previewing for an interactive text mining system in order to feed back partial query results to users before all results that are responsive to a query have been found. These partial results allow the user to see the progress of their text mining query much sooner.
    Type: Application
    Filed: October 6, 2010
    Publication date: April 12, 2012
    Inventors: David R. Milward, Roger W. Hale, Malcolm R. Parsons, Sylvia F. Knight, Christopher I. Sullivan, Jason Trenouth, James R. Thomas
  • Publication number: 20120089643
    Abstract: A computer implemented method and system provide for automatic selection and extraction of metadata and media content from projects in a craft tool. Automated identification, classification and management of such metadata and content is provided using techniques such as pattern recognition for audio and visual content. The automatic tracking and centralised storage of metadata and content for compliance purposes can be facilitated, and can enable querying of organised metadata stored in a central database. In an example, metadata and media content are extracted automatically from a project in a craft tool at a client system and are forwarded to a host system for the creation of a cue sheet including timings for media files from timing metadata in a project file to create the timings on the cue sheet.
    Type: Application
    Filed: October 7, 2010
    Publication date: April 12, 2012
    Inventors: Charles Hodgkinson, Kirk Zavieh
  • Publication number: 20120084373
    Abstract: A device, server, method, and computer program product for reading an e-book are provided. The e-book may include at least a content identifier corresponding to a content in the e-book. The device may include a content navigator configured to present the content according to a command from a user and a processing unit configured to acquire the content identifier corresponding to the content presented by the content navigator, send the content identifier to a server, and receive from the server a message associated with the content. An output unit configured to output the message to the user may be provided.
    Type: Application
    Filed: September 30, 2011
    Publication date: April 5, 2012
    Applicant: International Business Machines Corporation
    Inventors: LI-JU CHEN, GARY CHIH-YUAN LIN, CHIEN-CHIAO TU, SHIH-YEH Wang, MJ XIAO
  • Publication number: 20120084323
    Abstract: Textual information may be harvested from photos that are associated with a geographic location, and the text may be used to respond to searches. In one example, photos are taken from a vehicle that has a camera and a GPS receiver. Each of the photos is marked with the geographic location at which it was taken, and text is extracted from the photos. Thus, each piece of text is associated with a particular geographic location, and the association between text and location is stored in a database. At some point in time, a query is received from a user, where the query specifies or implies a geographic criterion. The database is then examined to determine what items in the database meet the textual and geographic constraints of the query, and those pieces of information may be provided as search results.
    Type: Application
    Filed: October 2, 2010
    Publication date: April 5, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Boris Epshtein, Eyal Ofek
  • Publication number: 20120084262
    Abstract: Software, firmware, and systems are described herein that permit an organization to dock previously-utilized, limited-feature data management modules with a full-featured data management system. By docking limited-feature data management modules to a full-featured data management system, metadata and data from the various limited-feature data management modules can be integrated and utilized more efficiently and effectively. Moreover, additional data management features can be provided to users after a more seamless transition.
    Type: Application
    Filed: September 30, 2011
    Publication date: April 5, 2012
    Inventors: Rama Naga Bheemeswara Reddy Dwarampudi, Rajiv Kottomtharayil, Rahul S. Pawar, Parag Gokhale
  • Patent number: 8150873
    Abstract: A method and apparatus to find maximal frequent itemsets over data streams. A prefix tree manages itemsets and appearance frequencies of the itemsets, and each of nodes of the prefix tree has information about an appearance frequency, a maximum lifetime, and a mark indicating whether the corresponding itemset is a maximal frequent itemset. The method includes: receiving transaction Tk generated at a current point in time; updating the information owned by each node corresponding to the itemset of the transaction Tk among the nodes of the prefix tree; adding each node that is not managed in the prefix tree among nodes corresponding to the itemset of the transaction Tk, to the prefix tree and setting the information on the added nodes; and finding maximal frequent itemsets by visiting each node of the prefix tree that has the mark indicating the maximal frequent itemset and checking whether the corresponding itemset is frequent.
    Type: Grant
    Filed: October 27, 2008
    Date of Patent: April 3, 2012
    Assignee: Industry-Academic Cooperation Foundation, Yonsei University
    Inventor: Wong Suk Lee
  • Publication number: 20120078883
    Abstract: Methods and systems for accessing documents in document collections using predictive word sequences are disclosed. A method for accessing documents using predictive word sequences includes creating a candidate list of word sequences where respective ones of the word sequences comprise one or more elements derived from the document corpus; expanding the candidate list by adding one or more new word sequences, where each new pattern is created by combining one or more elements derived from the document corpus with one of the word sequences currently in the candidate list; determining a predictive power with respect to the subject for respective ones of entries of the candidate list, where the entries include the word sequences and the new word sequences; pruning from the candidate list ones of said entries with the determined predictive power less than a predetermined threshold; and accessing documents from the document corpus based on the pruned candidate list.
    Type: Application
    Filed: September 28, 2010
    Publication date: March 29, 2012
    Applicant: The MITRE Corporation
    Inventor: Paul Christian MELBY
  • Publication number: 20120078876
    Abstract: Although recording of usage data is common in scholarly information services, its exploitation for the creation of value-added services remains limited due to concerns regarding, among others, user privacy, data validity, and the lack of accepted standards for the representation, sharing and aggregation of usage data. A technical, standards-based architecture for sharing usage information is presented. In this architecture, OpenURL-compliant linking servers aggregate usage information of a specific user community as it navigates the distributed information environment that it has access to. This usage information is made OAI-PMH harvestable so that usage information exposed by many linking servers can be aggregated to facilitate the creation of value-added services with a reach beyond that of a single community or a single information service.
    Type: Application
    Filed: December 9, 2011
    Publication date: March 29, 2012
    Applicant: LOS ALAMOS NATIONAL SECURITY, LLC
    Inventors: Johan Bollen, Herbert Van De Sompel
  • Publication number: 20120072429
    Abstract: A method, information processing system, and computer readable storage medium manage public relations queries using semantic analysis. A public relations query expressed in natural language is received from a user. A set of relevant topics and subjects associated with the query are identified. A set of information sources are identified that comprise data associated with the set of topics and subjects based on the set of topics and subjects. A set of data from the set of information sources is identified that satisfies the query. A set of weights are assigned to the set of data that has been identified. A set of data elements within the set of data that comprises a set of weights above a given threshold are identified. A response to the query is generated using the set of data elements that has been identified.
    Type: Application
    Filed: September 20, 2010
    Publication date: March 22, 2012
    Applicant: International Business Machines Corporation
    Inventors: YOUSSEF DRISSI, Tyrone W. Grandison, Colin G. Harrison, Kaan K. Katircioglu, Jurij R. Paraszczak
  • Publication number: 20120072454
    Abstract: A method for determining the authorship of a picture, wherein the method comprises at least the following steps: —transferring the picture to be examined or parts of the picture to be examined with the aid of a digitizing means, in particular a scanner, into at least one data set, —analyzing the data set(s) and determining characteristic features or parts of characteristic features, in particular dots or lines or dot or line groups or patterns, contained in the data set in digitized form, wherein the characteristic features to be determined are stored in a database, —and wherein the database includes an additional associated data set for each of the stored characteristic features.
    Type: Application
    Filed: May 17, 2010
    Publication date: March 22, 2012
    Inventor: Werner Schlozen
  • Publication number: 20120072453
    Abstract: Systems, methods, and media for analyzing fraud patterns and creating fraud behavioral models are provided herein. In some embodiments, methods for analyzing call data associated with fraudsters may include executing instructions stored in memory to compare the call data to a corpus of fraud data to determine one or more unique fraudsters associated with the call data, associate the call data with one or more unique fraudsters based upon the comparison, generate one or more voiceprints for each of the one or more identified unique fraudsters from the call data, and store the one or more voiceprints in a database.
    Type: Application
    Filed: November 4, 2011
    Publication date: March 22, 2012
    Inventors: Lisa Guerra, Richard Gutierrez, David Hartig, Anthony Rajakumar, Vipul Vyas
  • Patent number: 8140572
    Abstract: In accordance with embodiments, there are provided mechanisms and methods for aggregating on-demand database service data. These mechanisms and methods for aggregating on-demand database service data can enable embodiments to more flexibly summarize data. The ability of embodiments to provide such feature may lead to enhanced aggregation features which may be used for providing more effective ways of summarizing data.
    Type: Grant
    Filed: July 18, 2008
    Date of Patent: March 20, 2012
    Assignee: salesforce.com, inc.
    Inventors: Alan Ballard, Eric Bezar, Lars Hofhansl, Mary Scotton, Eric Wilson, Simon Wong
  • Publication number: 20120066260
    Abstract: A computer-implemented method of creating a data mining model in a database management system comprises accepting a database language statement at the database management system, the database language statement indicating a dataset and a data mining model to be created from the dataset, and creating, in the database management system, the indicated data mining model using the indicated dataset, wherein creation and application of the data mining model does not require moving data to a separate data mining engine.
    Type: Application
    Filed: November 18, 2011
    Publication date: March 15, 2012
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Wei LI, Shiby THOMAS, Joseph YARMUS, Ari W. MOZES, Mahesh JAGANNATH
  • Publication number: 20120066259
    Abstract: System(s) and method(s) provide access management to femto cell service through access control list(s) (e.g., white list(s), or black list(s)). White list(s) includes a set of subscriber station(s) identifier numbers, codes, or tokens, and also can include additional fields for femto cell access management based on desired complexity. White list(s) can have associated white list profile(s) therewith to establish logic of femto coverage access based on the white list(s). A mechanism for reciprocal addition of access field attributes in access control lists and white list profiles also is provided. The mechanism allows at least in part for a first subscriber to be added to a configured white list of a second subscriber, when the first subscriber configures a new white list, the second subscriber is reciprocally incorporated in the new white list. Such mechanism can be driven and facilitates generation of associations among groups of subscribers that share specific commonalities.
    Type: Application
    Filed: November 17, 2011
    Publication date: March 15, 2012
    Applicant: AT&T MOBILITY II LLC
    Inventors: Kurt Donald Huber, Judson John Flynn, William Gordon Mansfield
  • Patent number: 8135738
    Abstract: A predicate over a single column of a table is converted into at least one IN-list, wherein the IN-list is generated for a set of tuples of the column, and the generation is done over a data structure representing a set of distinct values of the column where the predicate applies and having a smaller cardinality than the table. The generated IN-list is evaluated over the set of tuples and the results of the evaluation are outputted as an evaluation of the predicate.
    Type: Grant
    Filed: August 20, 2008
    Date of Patent: March 13, 2012
    Assignee: International Business Machines Corporation
    Inventors: Lin Qiao, Vijayshankar Raman, Frederick Ralph Reiss, Richard S. Sidle, Garret Frederick Swart, F. Ryan Johnson
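The idea in the abstract above, evaluating a predicate once per distinct column value and then testing tuples by membership in the resulting IN-list, can be sketched as follows. This is a toy illustration: the function names are hypothetical, and a Python `set` of distinct values stands in for whatever smaller-cardinality data structure the patent contemplates.

```python
def predicate_to_in_list(distinct_values, predicate):
    """Evaluate the predicate once per distinct value to build an IN-list."""
    return {v for v in distinct_values if predicate(v)}

def evaluate_column(column, predicate):
    """Evaluate a single-column predicate over every tuple via an IN-list."""
    # Assumption: the distinct values come from a set built here; a real
    # system would more likely reuse an existing dictionary or index.
    in_list = predicate_to_in_list(set(column), predicate)
    # Each tuple is then tested with a cheap membership check.
    return [value in in_list for value in column]
```

The benefit appears when the predicate is expensive and the column has few distinct values: the predicate runs once per distinct value rather than once per row.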
  • Publication number: 20120059850
    Abstract: A computer vision dating system analyzes combinations of face features of the system's user's photographs and recommends potential dating partners. A user selects preferred and not-preferred faces from a sample of other user's pictures. The system analyzes the features of the preferred and not-preferred faces comparing the combinations of features in both categories with the features of other users in the database to find the users that most match the collective features preferred by the user. These pictures are presented to the user. Data from the user's profile input are analyzed to automatically generate the sample pictures from which the user selects his/her preferences. As the users are presented pictures after their sample selection, they can continue to select and reject pictures allowing the system to learn and refine the combinations of features and better locate those that most conform to a user's most preferred photo images.
    Type: Application
    Filed: September 6, 2010
    Publication date: March 8, 2012
    Inventors: Jonathan Binnings Bent, Aaron Liu, Kenneth Zhou
  • Patent number: 8131756
    Abstract: The disclosed invention includes an apparatus, system and method for developing tools to explore, organize, structure, extract, and mine natural language text. The system contains three sub-systems: a run-time engine, a development environment, and a feedback system. The invention also includes a system and method for improving the quality of information extraction applications consisting of an ensemble of per-user, adaptive, on-line machine-learning classifiers that adapt to document content and judgments of users by continuously incorporating feedback from information extraction results and corrections that users apply to these results. At least one of the machine-learning classifiers also provides explanations or justifications for classification decisions in the form of rules; other machine-learning classifiers may provide feedback in the form of supporting instances or patterns.
    Type: Grant
    Filed: June 5, 2007
    Date of Patent: March 6, 2012
    Inventors: Alwin B. Carus, Thomas J. DePlonty
  • Publication number: 20120054239
    Abstract: A method of providing a search service and a display device applying the same are provided. The method of providing a search service selects a partial region of a screen if a user's specified operation is input, extracts a plurality of keywords using the selected region, displays a list of the plurality of keywords, and searches for information based on one of the plurality of keywords. Accordingly, the display device can receive the keyword through the selection of the region, and a user can input a desired search keyword merely by selecting the region.
    Type: Application
    Filed: August 24, 2011
    Publication date: March 1, 2012
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ji-hye CHUNG, Hye-jeong LEE, Eun-young LIM, Ji-sun YANG, Sin-oug YEO
  • Patent number: 8126911
    Abstract: Methods and systems are provided for partitioning data of a database or data store into several independent parts as part of a data mining process. The methods and systems use a mining application having content-based partitioning logic to partition the data. Once the data is partitioned, the partitioned data may be grouped and distributed to an associated processor for further processing. The mining application and content-based partitioning logic may be used in a computing system, including shared memory and distributed memory multi-processor computing systems. Other embodiments are described and claimed.
    Type: Grant
    Filed: April 27, 2006
    Date of Patent: February 28, 2012
    Assignee: Intel Corporation
    Inventors: Wei Hu, Chunrong Lai
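    Illustration (not from the patent): the abstract for 8126911 describes partitioning data by content into independent parts that can each be handed to a processor. A minimal sketch of that idea follows, assuming records are dictionaries and that the partition is chosen by hashing one field. The names `partition_by_content` and `count_items`, the `item` field, and the use of CRC32 are all hypothetical choices made for this example.

```python
from collections import defaultdict
import zlib


def partition_by_content(records, key, num_parts):
    """Assign each record to a partition based on the value of `key`."""
    parts = defaultdict(list)
    for rec in records:
        # crc32 is stable across runs, unlike Python's salted hash() for str.
        part_id = zlib.crc32(str(rec[key]).encode("utf-8")) % num_parts
        parts[part_id].append(rec)
    return dict(parts)


def count_items(partition):
    """Example mining step run independently on one partition."""
    counts = defaultdict(int)
    for rec in partition:
        counts[rec["item"]] += 1
    return dict(counts)


records = [
    {"item": "milk"}, {"item": "bread"}, {"item": "milk"},
    {"item": "eggs"}, {"item": "bread"}, {"item": "milk"},
]
parts = partition_by_content(records, key="item", num_parts=2)

# Each partition could be sent to a separate worker; here they run in sequence.
partial_counts = [count_items(p) for p in parts.values()]

# Records with equal content share a partition, so merging needs no reconciliation.
merged = {}
for counts in partial_counts:
    merged.update(counts)
```

    Because equal values always hash to the same partition, no item appears in two partitions, which is what makes the parts independent for this counting step.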
  • Publication number: 20120047172
    Abstract: A technique includes providing a collection of documents in multiple languages, identifying, from the collection of documents, a group of candidate documents, where each candidate document in the group shares multiple corresponding rare features, evaluating pairs of candidate documents in the group using multiple common features present in the collection of documents, and determining, based on evaluating the pairs of candidate documents, whether each pair of candidate documents corresponds to a translated pair of documents.
    Type: Application
    Filed: August 22, 2011
    Publication date: February 23, 2012
    Applicant: Google Inc.
    Inventors: Jay M. Ponte, Jakob Uszkoreit, Ashok C. Popat, Moshe Dubiner
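    Illustration (not from the publication): the two-stage process in the abstract for 20120047172 can be sketched as follows, assuming each document has already been reduced to a set of language-independent features such as numbers or URLs. The function name, the thresholds, the document-frequency definition of "rare", and the Jaccard score over the remaining features are assumptions made for this example only.

```python
from collections import defaultdict
from itertools import combinations


def find_translation_pairs(docs, max_rare_df=2, min_shared_rare=2, min_score=0.5):
    """docs maps doc_id -> set of features. Returns [(id_a, id_b, score), ...]."""
    # Document frequency: which documents contain each feature.
    containing = defaultdict(set)
    for doc_id, feats in docs.items():
        for f in feats:
            containing[f].add(doc_id)

    # Assumption: a feature is "rare" if at most max_rare_df documents contain it.
    rare = {f for f, ids in containing.items() if len(ids) <= max_rare_df}

    # Stage 1: candidate pairs share several rare features.
    shared_rare = defaultdict(int)
    for f in rare:
        for a, b in combinations(sorted(containing[f]), 2):
            shared_rare[(a, b)] += 1

    # Stage 2: score candidates on the remaining (common) features.
    pairs = []
    for (a, b), n in shared_rare.items():
        if n < min_shared_rare:
            continue
        common_a, common_b = docs[a] - rare, docs[b] - rare
        union = common_a | common_b
        score = len(common_a & common_b) / len(union) if union else 0.0
        if score >= min_score:
            pairs.append((a, b, score))
    return sorted(pairs)


docs = {
    "en1": {"2011", "ISO-9001", "x42", "http://a", "10%"},
    "de1": {"2011", "ISO-9001", "x42", "http://a", "10%"},
    "fr2": {"2011", "10%", "q7"},
}
pairs = find_translation_pairs(docs)
```

    Restricting the pairwise comparison to documents that share rare features avoids scoring every pair in the collection.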
  • Patent number: 8122047
    Abstract: A search technology generates recommendations with minimal user data and participation, and provides better interpretation of user data, such as popularity, thus obtaining breadth and quality in recommendations. It is sensitive to the semantic content of natural language terms and lets users briefly describe the intended recipient (i.e., interests, eccentricities, previously successful gifts). Based on that input, the recommendation software system and method determines the meaning of the entered terms and creatively discovers connections to gift recommendations from the vast array of possibilities. The user may then make a selection from these recommendations. The search/recommendation engine allows the user to find gifts through connections that are not limited to previously available information on the Internet. Thus, interests can be connected to buying behavior by relating terms to respective items.
    Type: Grant
    Filed: May 17, 2010
    Date of Patent: February 21, 2012
    Assignee: Kit Digital Inc.
    Inventors: Issar Amit Kanigsberg, Daniel Marc Veidlinger, Tamer El Shazli, Myer Joshua Mozersky
  • Patent number: 8122429
    Abstract: A database table of predefined data transformations is provided. Each predefined data transformation is associated in the table with a unique identifier, a corresponding description and a validity period. When a data modeler wishes to develop a data model for a desired prediction, he/she will first determine a set of variables that will be used therefor. The set of variables can include any of the predefined data transformations from the database table. The data model will then be developed by applying raw data to the set of variables and determining a mathematical relationship therebetween. Once the data model has been developed, the data modeler will write a reusable specification for applying the data model operationally. Thereafter, IT personnel or the like can code and deploy the data model using the specification.
    Type: Grant
    Filed: April 17, 2008
    Date of Patent: February 21, 2012
    Assignee: International Business Machines Corporation
    Inventors: Mark S. Ramsey, David A. Selby
  • Publication number: 20120041953
    Abstract: A latent topic labels text mining system and method to mine and analyze the content of textual data. Embodiments of the system and method are particularly well suited for use on microblog data to help people identify posts they want to read and to find people that they want to follow. Embodiments of the system and method use a modified Labeled LDA technique (called an L+LDA technique) that analyzes content using a combination of labeled and latent topics. The resultant data is assigned one of four labels to generate a lower-dimensional representation of the data than the individual words in a microblog post. This learned topic representation is used to characterize, summarize, filter, find, suggest, and compare the content of microblog posts. Embodiments of the system and method also include visualization techniques such as a tag cloud visualization that is used to visualize microblogging data.
    Type: Application
    Filed: August 16, 2010
    Publication date: February 16, 2012
    Applicant: Microsoft Corporation
    Inventors: Susan Theresa Dumais, Daniel Ramage, Daniel John Liebling, Steven Mark Drucker
  • Publication number: 20120041979
    Abstract: The present disclosure relates to a method for generating a context hierarchy and a system for generating a context hierarchy, and more particularly, to a method for generating a context hierarchy from data streams consisting of an infinite set of continuously generated transactions and a system for generating a context hierarchy from the data streams.
    Type: Application
    Filed: March 18, 2011
    Publication date: February 16, 2012
    Applicant: Industry-Academic Cooperation Foundation, YONSEI University
    Inventor: Won Suk LEE
  • Publication number: 20120036157
    Abstract: A system and method are provided for comparing portions of document text with potential citation components, determining if individual portions correspond to a citation component, and determining if a set of portions corresponds to a valid citation pattern. A set of valid citation patterns is provided. Each citation pattern may include a specified combination of citation components. The invention further relates to identifying potential citation components from text in a document and analyzing a pattern of the identified citation components by comparing the pattern to a set of stored citation patterns to determine if the potential citation is a type of citation and, if so, whether it is a valid (and/or invalid) citation pattern. Once citation patterns have been determined in the document, annotations may be inserted into the document, and subsequent action may be taken, for example, generating a list of citations, providing research services, error-handling, and/or providing other options related to the citations.
    Type: Application
    Filed: August 30, 2011
    Publication date: February 9, 2012
    Inventor: Tony Rolle
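    Illustration (not from the publication): the component-then-pattern matching described in the abstract for 20120036157 might look like the sketch below. The component names, the short reporter list, and the single stored pattern (number, reporter, number) are hypothetical and far simpler than any real citation grammar.

```python
import re

# Hypothetical citation components, each recognised by a regular expression.
COMPONENTS = {
    "NUMBER": re.compile(r"^\d+\.?$"),
    "REPORTER": re.compile(r"^(U\.S\.|F\.2d|F\.3d)$"),
}

# Hypothetical stored patterns: volume, reporter, first page.
VALID_PATTERNS = [("NUMBER", "REPORTER", "NUMBER")]


def classify(token):
    """Return the component name a token corresponds to, or None."""
    for name, rx in COMPONENTS.items():
        if rx.match(token):
            return name
    return None


def find_citations(text):
    """Return text spans whose component sequence matches a stored pattern."""
    tokens = [t.strip(",;()") for t in text.split()]
    labels = [classify(t) for t in tokens]
    found = []
    for pattern in VALID_PATTERNS:
        n = len(pattern)
        for i in range(len(tokens) - n + 1):
            if tuple(labels[i:i + n]) == pattern:
                found.append(" ".join(tokens[i:i + n]).rstrip("."))
    return found


citations = find_citations(
    "See Brown v. Board of Education, 347 U.S. 483 (1954), and 5 apples."
)
```

    The stray numbers "1954" and "5" are classified as components but are not reported, because no stored pattern covers the sequence they appear in.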
  • Patent number: 8112440
    Abstract: A system and method of identifying relational patterns across a plurality of databases using a data structure, and the data structure itself. The data structure includes one or more data node branches, each of the one or more data node branches including one or more data nodes, each of the one or more data nodes representing a data item of interest and corresponding data item support values for the data item across the plurality of databases in relation to other data items represented in the data node branch. The data structure can be used to mine one or more relational patterns considering pattern support data across the plurality of databases at the same time.
    Type: Grant
    Filed: April 14, 2008
    Date of Patent: February 7, 2012
    Assignee: The University of Vermont and State Agricultural College
    Inventors: Xindong Wu, Xingquan Zhu
  • Patent number: 8108409
    Abstract: Embodiments of the present invention pertain to determining top combinations of items to present to a user. According to one embodiment, data that includes information describing a plurality of combinations of records is accessed. Each record describes a plurality of items. The data is analyzed using a branch and bound search procedure to determine top combinations of items based on a specified metric and a specified number. According to one embodiment, the metric is value enabled and the specified number determines how many combinations of items are associated with the top combinations of items.
    Type: Grant
    Filed: July 19, 2007
    Date of Patent: January 31, 2012
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Julie W. Drew, Juan Antonio R. Garay, Krishna Venkatraman
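    Illustration (not from the patent): a branch and bound search for top combinations, as named in the abstract for 8108409, can be sketched under strong assumptions. Here each record is a set of items with a non-negative value, and the metric for a combination is the total value of records containing every item in it. Both the metric and the function name `top_combinations` are hypothetical. Adding an item can only shrink the set of matching records, so a partial combination's score is an upper bound on any extension.

```python
import heapq


def top_combinations(records, size, top_n):
    """records: list of (item_set, value >= 0). Returns [(score, combo), ...]."""
    items = sorted({i for item_set, _ in records for i in item_set})
    best = []  # min-heap holding the current top_n (score, combo) entries

    def score(combo):
        wanted = set(combo)
        return sum(v for item_set, v in records if wanted <= item_set)

    def search(start, combo):
        bound = score(combo)  # upper bound for every extension of combo
        if len(best) == top_n and bound <= best[0][0]:
            return  # bound: this branch cannot beat the current top_n
        if len(combo) == size:
            heapq.heappush(best, (bound, combo))
            if len(best) > top_n:
                heapq.heappop(best)
            return
        for idx in range(start, len(items)):  # branch on the next item
            search(idx + 1, combo + (items[idx],))

    search(0, ())
    return sorted(best, key=lambda entry: (-entry[0], entry[1]))


records = [
    ({"A", "B", "C"}, 10),
    ({"A", "B"}, 5),
    ({"B", "C"}, 4),
    ({"A", "C"}, 1),
]
top = top_combinations(records, size=2, top_n=2)
```

    The pruning test is only valid because values are non-negative; with negative values the score of a partial combination would no longer bound its extensions.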
  • Patent number: 8108395
    Abstract: The present invention relates to the field of network computing, and in particular to a method and system for designing a Web Portal comprising a hierarchical structure of portal pages and portlets for accessing Web contents accessible via the Portal. A typical larger enterprise's portal contains large numbers, e.g., thousands, of pages and portlets. Due to the complexity of an enterprise portal, manual administration is inefficient as it is time-consuming, error-prone and thus expensive. In order to overcome these disadvantages, it is proposed that a Portal according to the invention performs some mining of the portlet markup and/or that of the portlet description in order to autonomously compute and propose an enhanced portal content structure. This helps to provide a user-friendly content structure that reflects well the relationships between portlets.
    Type: Grant
    Filed: November 22, 2009
    Date of Patent: January 31, 2012
    Assignee: International Business Machines Corporation
    Inventors: Timo Kussmaul, Andreas Arning
  • Publication number: 20120023135
    Abstract: The method is for using a virtual face. The virtual face is provided on a screen associated with a computer system having a cursor. A user manipulates the virtual face with the cursor to show a facial expression. The computer system determines coordinates of the facial expression. The computer system searches for facial expression coordinates in a database to match the coordinates. A word or phrase is identified that is associated with the identified facial expression coordinates. The screen displays the word to the user. The user may also feed a word to the computer system that displays the facial expression associated with the word.
    Type: Application
    Filed: October 29, 2010
    Publication date: January 26, 2012
    Inventors: Erik Dahlkvist, Martin Gumpert, Johan Van Der Schoot
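    Illustration (not from the publication): the two-way lookup in the abstract for 20120023135 (coordinates to word, and word to coordinates) can be sketched as a nearest-neighbour search. The two-dimensional coordinates, the sample words, and the use of Euclidean distance are assumptions made for this example.

```python
import math

# Hypothetical database: word -> facial expression coordinates.
EXPRESSIONS = {
    "happy": (0.8, 0.6),
    "sad": (-0.7, -0.5),
    "surprised": (0.1, 0.9),
}


def word_for(coords):
    """Return the word whose stored coordinates are closest to `coords`."""
    return min(EXPRESSIONS, key=lambda w: math.dist(coords, EXPRESSIONS[w]))


def coords_for(word):
    """Return the stored coordinates used to display the expression for `word`."""
    return EXPRESSIONS[word]


matched_word = word_for((0.7, 0.5))
sad_coords = coords_for("sad")
```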
  • Publication number: 20120023134
    Abstract: An optimal subspace for a distance or a similarity cannot be obtained by a pattern matching device which obtains a subspace independent from the distance or the similarity used for matching. A pattern matching device includes a feature extraction unit for extracting a feature value by lowering the dimension of data using a feature extraction parameter; a calculation unit for calculating a distance or a similarity of the data to be matched using the feature value; and a parameter updating unit for comparing the distance or the similarity, and updating the feature extraction parameter so that the value of the distance or the similarity becomes closer to a matching result regarding whether or not the values of the distance or the similarity are in the same category.
    Type: Application
    Filed: March 15, 2010
    Publication date: January 26, 2012
    Applicant: NEC Corporation
    Inventor: Hiroyoshi Miyano
  • Patent number: 8103693
    Abstract: Various embodiments disclosed herein are directed to managing and sharing data between web accessed calculators. The systems include a data store to persist calculator inputs and outputs and share them with other calculators and with customer service representatives.
    Type: Grant
    Filed: February 17, 2011
    Date of Patent: January 24, 2012
    Assignee: United Services Automobile Association (USAA)
    Inventors: Mason Eubank, Nikolay Eshkenazi, Neff Karl Hudson, Michael Wayne Lester
  • Patent number: 8103646
    Abstract: An automated mechanism for tagging media files such as podcasts, blog entries, and videos, for example, with meaningful taxonomy tags. The mechanism provides active (or automated) assistance in assigning appropriate tags to a particular piece of content (or media). Included is a system for automatic tagging of audio streams on the Internet, whether from audio files, or from the audio tracks of audio/video files, using the folksonomy of the Internet. The audio streams may be provided by the media author. For example, the author can make a recording to be posted on a website, and use the system to automatically suggest (via prompted author interaction) folksonomically appropriate tags for the media recording. Alternatively, the system can be used in an automated fashion to develop and assign tags without any intervention by the author.
    Type: Grant
    Filed: March 13, 2007
    Date of Patent: January 24, 2012
    Assignee: Microsoft Corporation
    Inventor: Robert I. Brown