Data Mining Patents (Class 707/776)
  • Patent number: 9864954
    Abstract: A method and associated systems for using wreath products and invariance groups to test a partially symmetric quantum-logic circuits. A test system receives information that describes the architecture of a quantum-logic circuit to be tested. The system uses this information to hierarchically organize the circuit's inputs into non-overlapping blocks. The system creates set of groups associated with the blocks, and then generates an invariance group that contains one or more invariant permutations of the inputs by computing a wreath product of the set of groups. These invariant permutations identify a minimal number of tests required to verify the circuit for all possible input vectors. The system then directs a test apparatus to perform the resulting optimized test sequence upon the circuit.
    Type: Grant
    Filed: February 16, 2017
    Date of Patent: January 9, 2018
    Assignee: International Business Machines Corporation
    Inventor: Pawel Jasionowski
  • Patent number: 9860298
    Abstract: An aspect of the present disclosure provides access via HTTP verbs to services implemented by stateless objects. In one embodiment, the list of services implemented by a stateless object deployed on an application server is displayed to a user/administrator. Upon receiving (from the user/administrator) an input data indicating selection of some of the services (from the displayed list), only the selected service are provided access via a corresponding HTTP verb. In other words, a first service that is included in the selection is provided access via a HTTP verb, while a second service not included in the selection is not made accessible via HTTP verbs. Thus, a user/administrator is facilitated to provide access via HTTP verbs to only services of interest among those implemented by a stateless object at or after the deployment of the stateless object.
    Type: Grant
    Filed: March 17, 2015
    Date of Patent: January 2, 2018
    Assignee: Oracle International Corporation
    Inventors: Rajesh Ghosh, Vikas Soolapani, Rekha Ayothi
  • Patent number: 9792289
    Abstract: A system and method for file clustering, multi-drive forensic analysis and protection of sensitive data. Multiple memory devices can store files. A module can extract characteristics from the stored files, identify similarities between the files based on the extracted characteristics and generate file clusters based on the identified similarities. A visual representation of the file clusters, which can be generated to show the identified similarities among the files, can be displayed by a user interface module.
    Type: Grant
    Filed: November 7, 2014
    Date of Patent: October 17, 2017
    Assignee: Semandex Networks Inc.
    Inventors: Daniel J. Reininger, Dhananjay D. Makwana, Raymond William Kulberda, Eric Heath Larson, Timothy P. Hearn
  • Patent number: 9720983
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for content presentation. In one aspect, a method includes obtaining information associated with a mobile application of interest; determining a plurality of similar applications to the application of interest; determining keywords from the similar applications; and extracting new keywords for the application of interest using a model trained using statistical information for keywords of the plurality of similar applications that overlap with keywords of the application of interest.
    Type: Grant
    Filed: July 7, 2014
    Date of Patent: August 1, 2017
    Assignee: Google Inc.
    Inventors: Zhou Yu, Yudong Gao
  • Patent number: 9716767
    Abstract: The embodiments of the invention provide a method for pushing input resources, comprising: obtaining user characteristic information in a client terminal, the user characteristic information including user interest information or user position information; obtaining at least one input resource based on the user characteristic information; the input resources including at least one of input method skin, input method font, and input method font size; pushing the at least one input method resource to the client terminal. The embodiments of the invention further provide a system for pushing input resources. The technical solutions of the embodiments of the invention can easily and conveniently achieve replacement of input resources.
    Type: Grant
    Filed: August 29, 2014
    Date of Patent: July 25, 2017
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventor: Long Chen
  • Patent number: 9715533
    Abstract: Example methods and systems are directed to providing multi-dimensional search results. A source (e.g., a closed captioning stream) may provide a series of keywords. The series of keywords may be used to generate a series of searches. The results from the searches may be presented as part of a user interface in a grid. For example, one row may be presented for each keyword, with the row for the keyword containing the results from searching using that keyword. Alternatively, one column may be presented for each keyword, with the column for the keyword containing the results from searching using that keyword. The series of keywords may be generated once. Alternatively, new keywords may be periodically added to the series of keywords, causing the grid to be updated. Old keywords and their corresponding search results may periodically be removed from the grid.
    Type: Grant
    Filed: December 11, 2013
    Date of Patent: July 25, 2017
    Assignee: eBay Inc.
    Inventors: Selina Lam, Marc Peter Hosein
  • Patent number: 9665825
    Abstract: A system and computer-usable medium for using cognitive graph vectors to refine cognitive insights.
    Type: Grant
    Filed: February 24, 2015
    Date of Patent: May 30, 2017
    Assignee: Cognitive Scale, Inc.
    Inventor: Matthew Sanchez
  • Patent number: 9529867
    Abstract: Systems and method for providing a list of search categories from which to perform a user action are provided. An initiation command is received from a client device, and a set of information corresponding to the client device is retrieved in response to receiving the initiation command. A subset of search categories is selected from a plurality of search categories based on the retrieved set of information corresponding to the client device. The subset of search categories is provided for display to the client device.
    Type: Grant
    Filed: September 19, 2013
    Date of Patent: December 27, 2016
    Assignee: Google Inc.
    Inventor: Gregory Michael Blevins
  • Patent number: 9436660
    Abstract: Methods and arrangements for managing development of information extraction rules. One or more documents are opened for extraction. An interface is provided to create a label and thereupon label a portion of the document. The created label is stored, and an extractor is developed based on the labeling. A test interface is provided for the extractor, and results of a test conducted through the test interface are displayed. The extractor is exported. In accordance with at least one embodiment, developers are presented with eased automated guidance to write extractors, which thereby reduces an overall manual effort involved in extractor development. Generally, a focused, tutorial-type environment serves as a guide based on previously developed best practices.
    Type: Grant
    Filed: November 16, 2012
    Date of Patent: September 6, 2016
    Assignee: International Business Machines Corporation
    Inventors: Arnaldo Carreno-Fuentes, Laura Chiticariu, Eser Kandogan, Yunyao Li, Huahai Yang
  • Patent number: 9372589
    Abstract: Structured information about nodes may be generated and shared using sub-nodes. A node in a social networking system may be associated with sub-nodes that are definable by the node owner, such as menu items for a restaurant or songs in playlists for an artist. Users of the system may interact with the sub-nodes, and the interactions may be presented back on the page to a user, aggregated according to the user's connections in the social networking system (e.g., which songs your friends listened to the most by the artist, which menu items were consumed the most). Users may associate other sub-nodes to the node, such as identifying other menu items served by a restaurant, and the node owner may confirm these associations. Location awareness functionalities may be used to inform a user of highly recommended sub-nodes nearby as indicated by other users of the social networking system.
    Type: Grant
    Filed: April 18, 2012
    Date of Patent: June 21, 2016
    Assignee: Facebook, Inc.
    Inventors: Bruno Rahle, Blaise DiPersia, Rousseau Kazi
  • Patent number: 9361343
    Abstract: Disclosed herein is a method for parallel mining of temporal relations in a large event file using a MapReduce model. In the method for parallel mining of temporal relations in a large even file according to the present invention, an event file is sorted based on customer identification (ID) and event time at which each event has occurred. A set of large event types satisfying a preset support or more is generated from the event file. The event file is converted into a large event sequence including the large event type set. The large event sequence is summarized and then a time interval data file is created. Candidate temporal relations are generated from the time interval data file, and frequent temporal relations satisfying a preset support or more are derived from the candidate temporal relations. A temporal relation rule is generated from the derived frequent temporal relations.
    Type: Grant
    Filed: October 9, 2013
    Date of Patent: June 7, 2016
    Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventor: Yong-Joon Lee
  • Patent number: 9355165
    Abstract: A method and apparatus for accessing, processing and manipulating data in an OLAP database. According to one aspect, the present invention comprises a user interface configured for accessing, processing and manipulating data in an OLAP cube. According to another aspect, the present invention comprises a calculation engine for manipulating and managing data in the OLAP cube.
    Type: Grant
    Filed: March 31, 2008
    Date of Patent: May 31, 2016
    Inventors: Paul Grant Barber, Robert John Walker
  • Patent number: 9317609
    Abstract: A semantic vector is generated for a search term based upon a global frequency of other, closely related terms within a corpus that is used to compute the semantic vector relative to the search term. The semantic vector is used in connection with a textual search engine, responsive to a user query comprising a search term, to promote any of documents and sites within results returned to the query by the search engine that contain other, closely related terms that strongly correlate with the search term.
    Type: Grant
    Filed: October 1, 2013
    Date of Patent: April 19, 2016
    Assignee: FortyTwo, Inc.
    Inventors: Danny Blumenfeld, Yasuhiro Matsuda, Eishay Smith
  • Patent number: 9262255
    Abstract: A hierarchical multi-stage model of asset failure risk for complex heterogeneously distributed physical assets is built. The hierarchical multi-stage model considers heterogeneity of failure patterns for the assets. At least one data stream is analyzed to determine whether the hierarchical multi-stage model needs to be updated due to a change in the failure patterns. If the analysis indicates that the hierarchical multi-stage model needs to be updated, the hierarchical multi-stage model is dynamically updated to obtain an updated hierarchical multi-stage model.
    Type: Grant
    Filed: March 14, 2013
    Date of Patent: February 16, 2016
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Arun Hampapur, Hongfei Li, Zhiguo Li, Yada Zhu
  • Patent number: 9245257
    Abstract: Disclosed are systems, apparatus, and methods for generating a user profile interface based on skill information associated with a user. Skill information associated with the user may be received. The skill information may include data values that identify at least one skill associated with the user, and that further identify a skill level associated with the at least one skill. A plurality of user interface components may be generated based on the received skill information. The plurality of user interface components may be configured to display a graphical representation generated based on at least some of the skill information. An input may be received. The input may identify a configuration of the plurality of user interface components and may further identify a representation of the skill information within the plurality of user interface components. The plurality of user interface components may be rendered and displayed on a display device.
    Type: Grant
    Filed: April 30, 2013
    Date of Patent: January 26, 2016
    Assignee: salesforce.com, inc.
    Inventor: Jager McConnell
  • Patent number: 9235607
    Abstract: A method and a test system for specifying a predetermined degree of inconsistency for test data are disclosed. The test system obtains a test policy, which specifies a predetermined degree of inconsistency between write operations and subsequent read operations on a set of data and subsequently receives a request to provide test data to an application. In response to the request to provide test data to the application the test system generates a set of test data including a plurality of entities retrieved from the set of data, based at least in part on the test policy. The test data includes a respective entity that is not consistent with a previous write operation. The test system further provides the set of test data to the application. The application optionally processes the set of test data to produce results, which are used to determine performance of the application.
    Type: Grant
    Filed: March 29, 2013
    Date of Patent: January 12, 2016
    Assignee: GOOGLE INC.
    Inventors: Max C. Ross, Alfred R. K. Fuller
  • Patent number: 9196244
    Abstract: Arrangements are described for reducing response latency in intelligent personal assistant applications. While receiving a user request, preemptive responses are automatically prepared for a received portion of the user request. Partial classification word candidates are generated for words in the received portion of the user request, and then a predictive component is applied to generate extended classification word candidates that include the partial classification word candidates and additional classification word candidates. A preliminary search is performed of the extended classification word candidates to prepare the preemptive responses. While the input request continues, the preemptive responses are updated, and when the input request ends, the prepared preemptive responses are used to respond to the user request.
    Type: Grant
    Filed: January 8, 2014
    Date of Patent: November 24, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Alfred K. Wong, Leor Doron
  • Patent number: 9135249
    Abstract: Numbered sequences detection includes (i) extracting one or more numbered item token patterns from a document comprising an ordered sequence of text units, each numbered item token pattern including an incremental portion and a fixed portion that matches at least one text unit of the document and (ii) identifying at least one numbered sequence in the document conforming with a matching numbered item token pattern of the extracted one or more numbered item token patterns. The identified at least one numbered sequence comprises an ordered sub-sequence of text units of the document that match the matching numbered item token pattern. The detection may further comprise determining that a second type of numbered sequence nests in the document between consecutive text units belonging to a numbered sequence of a first type, and optimizing one or more numbered sequences of the second type based on information provided by the determining.
    Type: Grant
    Filed: May 29, 2009
    Date of Patent: September 15, 2015
    Assignee: XEROX Corporation
    Inventor: Herve Dejean
  • Patent number: 9135571
    Abstract: Techniques for entity detection include matching a token from at least a portion of a text string with a matching concept in an ontology, wherein the at least a portion of the text string has been labeled as corresponding to a particular entity type. A first concept may be identified as being hierarchically related to the matching concept within the ontology, and a second concept may be identified as being hierarchically related to the first concept within the ontology. Based at least in part on the labeling of the at least a portion of the text string as corresponding to the particular entity type, a statistical model may be trained to associate the first concept with a first probability of corresponding to the particular entity type and the second concept with a second probability of corresponding to the particular entity type.
    Type: Grant
    Filed: March 12, 2013
    Date of Patent: September 15, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Brian W. Delaney, Girija Yegnanarayanan
  • Patent number: 9129013
    Abstract: Techniques for entity detection include matching a token from at least a portion of a text string with a matching concept in an ontology. A first concept may be identified as being hierarchically related to the matching concept within the ontology, and a second concept may be identified as being hierarchically related to the first concept within the ontology. The first and second concepts may be included in a set of features of the token. Based at least in part on the set of features of the token, a measure related to a likelihood that the at least a portion of the text string corresponds to a particular entity type may be determined.
    Type: Grant
    Filed: March 12, 2013
    Date of Patent: September 8, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Brian W. Delaney, Girija Yegnanarayanan
  • Patent number: 9043360
    Abstract: Method, system, and programs for providing one or more explanations. An inquiry is received via a communication platform where the inquiry is about how a set of entities are related. Information is retrieved from a knowledge storage in accordance with the set of entities and such information describes a plurality of entities and relationships existing among the plurality of entities. Based on such retrieved information, one or more explanations with respect to each relationship by which the set of entities are connected are generated. The one or more explanations are then transmitted as a response to the inquiry.
    Type: Grant
    Filed: December 17, 2010
    Date of Patent: May 26, 2015
    Assignee: Yahoo! Inc.
    Inventors: Lujun Fang, Anish Das Sarma, Cong Yu, Philip Bohannon
  • Patent number: 9037607
    Abstract: Disclosed is a method generally applicable to any financial dataset for the purposes of: (1) determining the most important patterns in the given dataset, in order of importance; (2) determining any trends in those patterns; (3) determining relationships between patterns and trends; and (4) allowing quick visual identification of anomalies for closer audit investigation. These purposes generally fall within the scope of what in financial auditing is known as ‘analytical review’. The current method's advantages over existing methods are that is fully independent of the financial data subject to analysis, requires no background knowledge of the target business or industry, and is both scalable (to large datasets) and fully scale-invariant, requiring no a priori notion of financial materiality.
    Type: Grant
    Filed: March 14, 2013
    Date of Patent: May 19, 2015
    Assignee: GALISTEO CONSULTING GROUP INC.
    Inventor: Peter Alexander Chew
  • Patent number: 9026551
    Abstract: A system for evaluating text data to support multiple insurance applications is disclosed. In some embodiments, text input data is received from multiple sources. The text input data may then be aggregated and mapped to create composite text input data. A semantic event in the composite text input data may be automatically detected, such as by being triggered by a semantic rule and associated semantic tag. A text mining result database may be updated by adding an entry to the database identifying the detected semantic event and the triggering semantic rule. An indication associated with the text mining result database may then be transmitted to a plurality of insurance applications.
    Type: Grant
    Filed: June 25, 2013
    Date of Patent: May 5, 2015
    Assignee: Hartford Fire Insurance Company
    Inventor: Arthur Paul Drennan, III
  • Patent number: 9026553
    Abstract: Systems and methods for obtaining access to a database file managed by an operating system in a computing system are disclosed. One method includes transmitting a call to an operating system from a database management system, the call requesting access to a database file. The method also includes receiving an address from the operating system at the database management system. The address represents a general address of the database file managed by the operating system. The method further includes transmitting a call to the operating system from the database management system, which includes an address and a size of a view of the database file to be created. The method also includes receiving an address of the view of the database file from the operating system.
    Type: Grant
    Filed: November 29, 2012
    Date of Patent: May 5, 2015
    Assignee: Unisys Corporation
    Inventors: Michael Rieschl, James Merten, Matthew Trautman, John Loberg
  • Publication number: 20150120777
    Abstract: Data from at least one outside data source containing Big Data is translated into a virtual three-dimensional object that identifies data of interest. In an embodiment, the data is translated into a tactile three-dimensional object that can be felt, for example, with a haptic controller. Embodiments allow for navigation, mining, and structuring of the data, as well as facilitating real time analysis of the data.
    Type: Application
    Filed: October 24, 2014
    Publication date: April 30, 2015
    Inventor: Olivia Ramos
  • Patent number: 9021095
    Abstract: Disclosed is an improved approach for implementing an on-demand scheduler in a mobile device and the structures to support realtime on-demand schedulers. A lightweight word-based structure is disclosed for storing scheduling-related data on the mobile device. Using this lightweight word-based structure enables on-demand and real-time scheduling. This type of lightweight structure also permits scheduling activities to be performed in a disconnected mode, which can then be later synchronized with the server to confirm the booking In addition to appointment scheduling, this technique can also be implemented for scheduling of any type of resource.
    Type: Grant
    Filed: May 27, 2011
    Date of Patent: April 28, 2015
    Assignee: Oracle International Corporation
    Inventors: Hari Krishna Gutlapalli, Suhas R. Mehta
  • Publication number: 20150113018
    Abstract: An adaptive system processes social media streams in real time. The adaptive system included a data management engine that generates combined data sets by detecting and mining a plurality of text-based messages from a social networking service on the Internet. An analytics engine in communication with the data management engine monitors topics in the text-based messages and tracks topic evolution contained in the text-based messages. A visualization engine in communication with the analytics engine renders historical and current activity associated with the plurality of text-based messages.
    Type: Application
    Filed: September 3, 2014
    Publication date: April 23, 2015
    Inventors: Chad A. Steed, Robert M. Patton, Paul L. Bogen, Thomas E. Potok, Christopher T. Symons
  • Publication number: 20150113003
    Abstract: An information retrieval device includes a degree-of-association information storage unit capable of storing an item(s) of degree-of-association information indicating a degree of association between each of an item(s) of first information and each of an item(s) of second information; an accepting unit that accepts a query including an item(s) of query information which is/are an item(s) of information used for retrieval of content; a query converter that obtains, by using an item(s) of first information corresponding to each of the item(s) of query information, and the item(s) of degree-of-association information, an item(s) of second information whose degree of association with the item(s) of first information is greater as a predetermined condition is better satisfied; and a retrieval unit that retrieves content by using the item(s) of second information obtained by the query converter. Accordingly, content necessary for a user can be retrieved.
    Type: Application
    Filed: June 30, 2014
    Publication date: April 23, 2015
    Inventors: Toru HOTTA, Yukihiro TAGAMI, Shingo HOSHINO, Yusuke TANAKA
  • Patent number: 9009104
    Abstract: Techniques for replicating data between database systems without taking checkpoints are provided. In an embodiment, a capture process restarts. Upon restarting, the capture process reestablishes an association with an apply process. A particular logical time maintained by the apply process is then communicated to the capture process. Upon receiving the particular logical time, the capture process restarts mining from this particular logical time.
    Type: Grant
    Filed: August 10, 2010
    Date of Patent: April 14, 2015
    Assignee: Oracle International Corporation
    Inventors: Lik Wong, Nimar S. Arora, Cristina Schmidt, Lei Gao, Thuyan Hoang
  • Patent number: 9009193
    Abstract: Techniques are presented for providing a software fitting assessment. The techniques may be performed by methods, apparatus, and/or computer program products. The techniques include automatically matching on a computer system one or more specified requirements for a project with one or more software functions stored in a repository. The automatically matching includes mining the repository in order to match requirements. The repository includes software functions, requirements accumulated from previous projects, and results of stored matches between the software functions and the requirements accumulated from previous projects. The techniques include outputting by the computer system one or more results of the matching.
    Type: Grant
    Filed: September 12, 2012
    Date of Patent: April 14, 2015
    Assignee: International Business Machines Corporation
    Inventors: Matthew J. Callery, Michael Desmond, Sophia Krasikov, Harold L. Ossher, Edith Schonberg, Harini Srinivasan
  • Patent number: 9002888
    Abstract: A method, computer program product and system of minimizing epigenetic surprisal data either by comparing epigenetic surprisal data to a fixed baseline epigenetic data, so that all of the comparisons were made to the same baseline epigenetic data or by comparing epigenetic surprisal data to a rolling baseline of epigenetic surprisal data—that is, after each comparison the baseline is changed to the data from the time point which had been compared previously.
    Type: Grant
    Filed: June 29, 2012
    Date of Patent: April 7, 2015
    Assignee: International Business Machines Corporation
    Inventors: Robert R. Friedlander, James R. Kraemer
  • Patent number: 9002887
    Abstract: An external traffic advertisement system is provided that generates advertisement sets based on analysis of visits to a web site that were referred by an external source. The advertisement system aggregates the referral information for each referral type. A referral type may be defined by one or more of keyword text derived from the query text of the referrals, landing page type, external source, product identifier, and so on. The advertisement system may, for each referral type, aggregate the total revenue from the visits of that referral type and may generate a count of the number of converting visits for that referral type. The advertisement system then identifies those referral types whose aggregated information satisfies an advertisement criterion and generates an advertisement set for each identified referral type with a keyword derived from keyword text and with a link based on the landing page type of the referral type.
    Type: Grant
    Filed: March 30, 2007
    Date of Patent: April 7, 2015
    Assignee: Amazon Technologies, Inc.
    Inventors: Eric Alfred Herrmann, Stephan G. Betz, Joel Andrew Shapiro
  • Patent number: 8996523
    Abstract: Feature information, such as street address data, is provided by multiple sources of varying levels of trust. The street address data provided by these sources may include various representations of an address for a map feature. Thus, overlapping street address data for the map feature exists. In one embodiment, a feature selection server merges the street address data for the map feature to create a representative street address for the address data provided from multiple sources.
    Type: Grant
    Filed: May 24, 2011
    Date of Patent: March 31, 2015
    Assignee: Google Inc.
    Inventor: Yechezkia Fisher
  • Publication number: 20150081735
    Abstract: Systems and methods are provided for identifying data variable roles during initial data exploration. A variable type, unique data value count values, and an overflow count value are determined for a variable. The unique data value count values include a number of occurrences of each of a plurality of unique data values for the variable in a data set. The overflow count value is a number of occurrences of data values other than the plurality of unique data values for the variable in the data set. When a number of the plurality of unique data values is greater than a value for a high cardinality threshold, the variable is determined to be a high cardinality variable. When the variable is not determined to be the high cardinality variable, a class variable role is assigned to the variable. When the variable is determined to be the high cardinality variable, Whether or not the variable is a numeric variable type is determined based on the determined variable type.
    Type: Application
    Filed: November 10, 2014
    Publication date: March 19, 2015
    Inventors: Georges H. Guirguis, Scott Pope
  • Patent number: 8983980
    Abstract: Embodiments for a Mining Data Records based on Anchor Trees (MiBAT) process are disclosed. In accordance with at least one embodiment, the MiBAT process extracts data records containing user-generated content from web documents. The web document is processed into a Document Object Model (DOM) tree in which sub-trees of the DOM tree represent the data records of the web document. Domain constraints are used to locate structured portions of the DOM tree. Anchor trees are then located as being sets of sibling sub-trees which contain the domain constraints. The anchor trees are then used to determine a record boundary (i.e. the start offset and length) of the data records. Finally, the data records are extracted based on the anchor trees and the record boundaries.
    Type: Grant
    Filed: November 12, 2010
    Date of Patent: March 17, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Xinying Song, Yunbo Cao, Chin-Yew Lin
  • Patent number: 8983962
    Abstract: The question and answer data editing device for editing dialog content to generate question and answer data, includes a detecting unit that detects a part of the dialog content similar to existing question and answer data stored, and a extracting unit that extracts a context in which the dialog content is made from dialog content in the proximity of the similar part detected and registers the context extracted as new question and answer data or as index information of the question and answer data.
    Type: Grant
    Filed: February 8, 2006
    Date of Patent: March 17, 2015
    Assignee: NEC Corporation
    Inventors: Satoshi Nakazawa, Kenji Satoh, Yoshihiro Ikeda
  • Patent number: 8972406
    Abstract: A method, computer program product, and system generating epigenetic cohorts for a specific time period through clustering of epigenetic surprisal data at a specific time comprising. receiving a phenotypic and/or demographic parameter and a cluster characteristics input from a user; searching the epigenetic surprisal data at a specific time for the parameter and storing matches in a repository; generating a cluster comprising a centroid for each parameter by populating the cluster based on the matches of the parameter with the epigenetic surprisal data at a specific time period; determining at least two epigenetic cohorts for a specific time period from the cluster for each parameter and based on the input from the user; and if the cohorts do not match the input of the user, reporting the cohorts determined to the user and returning to the step of receiving a parameter and characteristic input from a user.
    Type: Grant
    Filed: July 31, 2012
    Date of Patent: March 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: Robert R. Friedlander, James R. Kraemer
  • Patent number: 8972262
    Abstract: In one embodiment, indexing content in streamed data includes receiving streams of audio data encoding a recording of a live ongoing group communication, where each stream of audio data encodes a different one of multiple voices. Each of the streams of audio data is provided to a recognizer to cause separate recognition of words in each of the streams. The recognized words are indexed to corresponding locations in each of the streams, and the streams are combined into a combined stream of audio data by synchronizing at least one common location in the streams. Embodiments allow accurate recognition of speech in group communications in which multiple speakers have simultaneously spoken, and accurate search of content encoded and processed from such speech.
    Type: Grant
    Filed: January 18, 2012
    Date of Patent: March 3, 2015
    Assignee: Google Inc.
    Inventor: Kirill Buryak
  • Patent number: 8972407
    Abstract: An information processing apparatus determines a weight of each physical feature for hierarchical clustering by acquiring training data of multiple pieces of content in triplets with label information indicating a pair specified by a user as having a highest degree of similarity among three contents of the triplet and executing hierarchical clustering using a feature vector of each piece of content of the training data and the weight of each feature to determine the hierarchical structure of the training data. The information processing apparatus updates the weight of each feature so that the degree of agreement between a pair combined first as being the same clusters among three contents of the triplet in a determined hierarchical structure and a pair indicated by label information corresponding to the triplet increases.
    Type: Grant
    Filed: September 6, 2012
    Date of Patent: March 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: Toru Nagano, Masafumi Nishimura, Takashima Ryoichi, Ryuki Tachibana
  • Patent number: 8954468
    Abstract: An efficient extraction of a meaningful frequent itemsets. The present invention discloses a system that includes a decision unit that decides a new itemset that becomes an investigation target in the same sequence as searching an itemset tree in a depth-first manner and in descending order. The present invention further discloses a frequent occurrence determining unit that registers the frequency of occurrence of the new itemset in a table if the frequency of occurrence is equal to or more than a predetermined threshold. The present invention includes a correlation determining unit that determines whether there is a correlation between each item in the new itemset and a subset of remaining items that were removed from the new itemset. The present invention discloses a registration unit that registers the new itemset in a set of meaningful frequent itemsets if the determination is positive for all items of the new itemset.
    Type: Grant
    Filed: October 5, 2011
    Date of Patent: February 10, 2015
    Assignee: International Business Machines Corporation
    Inventor: Issei Yoshida
  • Patent number: 8954474
    Abstract: A method of maintaining data described in a plurality of data models. An ontology is used to describe the data models. The data models are managed using the ontology and using a validation schema to validate object(s) governed by the ontology and derived from data-centric component(s) of content that has a semantically independent structure. Management of the data models is neutral relative to implementation of the content.
    Type: Grant
    Filed: April 21, 2008
    Date of Patent: February 10, 2015
    Assignee: The Boeing Company
    Inventors: Mark A. Dahl, Edward J. Levinskas, Patrick L. Walsh, Russell G. Gianni, James G. Tanner, Roberto Aaron Vergaray
  • Patent number: 8954438
    Abstract: Structured metadata extraction may include accessing one or more documents from which to extract the structured metadata from each of a plurality of hosts. A plurality of entity names can be extracted from the one or more documents from one of the plurality of hosts using an entity name pattern. A first element list can be extracted from the one or more documents based at least in part on the plurality of entity names and based at least in part on one or more heuristic rules. An element list pattern may be generated based at least in part on the first element list, and a second element list may be extracted from the one or more documents based at least in part on the element list pattern.
    Type: Grant
    Filed: May 31, 2012
    Date of Patent: February 10, 2015
    Assignee: Google Inc.
    Inventors: Yiqiang Mao, Alvin Tang, Nitin Khandelwal
  • Patent number: 8949271
    Abstract: The present disclosure is related to a method for monitoring at least one event data generating machine, including a data logging device for providing event data. The method comprises transferring logged event data from at least one of the event data generating machines to a central processor, mining a multi-dimensional sequential pattern within said transferred event data wherein at least one dimensional attribute holds information indicating said event data generating machine or the at least one event data generating machine property, and matching said mined multi-dimensional sequential pattern with patterns stored in a central pattern database.
    Type: Grant
    Filed: October 23, 2012
    Date of Patent: February 3, 2015
    Assignee: Liebherr-Werk Nenzing GmbH
    Inventors: Michael Kocher, Martin Rajek
  • Patent number: 8949176
    Abstract: A system and method may include monitoring communications between a user device, a website, and a behavioral tracking provider, capturing user information transmitted during the monitored communications, analyze the user information to determine one or more relationships between the user device, the websites, and the behavioral tracking provider, and outputting the one or more relationships to the user device.
    Type: Grant
    Filed: July 29, 2008
    Date of Patent: February 3, 2015
    Assignee: Verizon Patent and Licensing Inc.
    Inventors: Jeffrey Swinton, Steven Whitehead, Kay Bechtel
  • Publication number: 20150032746
    Abstract: A method for determining a cause of events detected in a plurality of interactions includes: identifying, on a processor, a plurality of elements in the interactions; detecting, on the processor, a plurality of sequences of elements in the interactions; mining, on the processor, the plurality of sequences for generating a set of supported patterns; computing, on the processor, association rules from the set of supported patterns; and returning the computed association rules.
    Type: Application
    Filed: July 26, 2013
    Publication date: January 29, 2015
    Applicant: GENESYS TELECOMMUNICATIONS LABORATORIES, INC.
    Inventors: Amir Lev-Tov, Avraham Faizakof, David Ollinger, Yochai Konig
  • Patent number: 8943100
    Abstract: In a method for storing data in a relational database system using a processor, a collection of values is assigned to a structure dictionary, each of the values represents the value of a row for an attribute and has a unique ordinal number within the collection. and the structure dictionary contains structures defined based on at least one of interaction with a user of the system via an interface, automatic detection of structures occurring in data, automatic detection of frequencies of values occurring in data, analysis of a history of queries, and predetermined information about structures relevant to data content that is stored in the system. For each structure, forming a structure match list from ordinal numbers of values matching the structure, and a structure sub-collection from values matching the structure, using the processor.
    Type: Grant
    Filed: March 13, 2013
    Date of Patent: January 27, 2015
    Assignee: Infobright Inc.
    Inventors: Dominik Slezak, Graham Toppin, Marcin Kowalski, Arkadiusz Wojna
  • Patent number: 8938386
    Abstract: When redacting natural language text, a classifier is used to provide a sensitive concept model according to features in natural language text and in which the various classes employed are sensitive concepts reflected in the natural language text. Similarly, the classifier is used to provide an utility concepts model based on utility concepts. Based on these models, and for one or more identified sensitive concept and identified utility concept, at least one feature in the natural language text is identified that implicates the at least one identified sensitive topic more than the at least one identified utility concept. At least some of the features thus identified may be perturbed such that the modified natural language text may be provided as at least one redacted document. In this manner, features are perturbed to maximize classification error for sensitive concepts while simultaneously minimizing classification error in the utility concepts.
    Type: Grant
    Filed: March 15, 2011
    Date of Patent: January 20, 2015
    Assignee: Accenture Global Services Limited
    Inventors: Chad Cumby, Rayid Ghani
  • Publication number: 20150019588
    Abstract: Assuming that an initial social network is unavailable because explicit connections between users are missing or incomplete, temporal analysis may be used to identify the implicit relationship between social media users. Temporal data may be used to extract implicit relationship regardless of their specific activities such as visiting the same web pages or commenting on the same web objects.
    Type: Application
    Filed: July 10, 2014
    Publication date: January 15, 2015
    Applicant: DREXEL UNIVERSITY
    Inventors: Christopher Yang, Xuning Tang
  • Patent number: 8935284
    Abstract: A computer-implemented method for associating website browsing behavior with a spam mailing list is described. A history of website browsing behavior is collected for a plurality of users. At least one spam mailing list is identified that includes an e-mail address for at least two users of the plurality of users. A determination is made as to whether a common website exists between the histories of website browsing behavior for the at least two users. Reputation information for the common website is updated.
    Type: Grant
    Filed: July 15, 2010
    Date of Patent: January 13, 2015
    Assignee: Symantec Corporation
    Inventor: Shaun Cooley
  • Patent number: 8935230
    Abstract: A method, machine readable storage medium, and system for providing a self learning semantic search engine. A semantic network may be set up with initial configuration. A search engine coupled to the semantic network may build indexes and semantic indexes. A user request for business data may be received. The search engine may be accessed via a semantic dispatcher. And based on the access, search engine may update the indexes and semantic indexes.
    Type: Grant
    Filed: August 25, 2011
    Date of Patent: January 13, 2015
    Assignees: SAP SE, intelligent views GmbH
    Inventors: Robert Heidasch, Stefan Scheidl, Klaus Reichenberger, Steffen Moldaner, Archim Heimann, Stephan Brand, Nico Licht, Michael Neumann, Christoph Meinel