Patents by Inventor Maria G. Castellanos

Maria G. Castellanos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20120076416
    Abstract: A collection of documents are correlated with information items in a fast stream of information using categorical hierarchical neighborhood trees (C-HNTs). First data entities extracted from the documents are inserted into corresponding C-HNTs. The first data entities that are neighbors in the C-HNTs of second data entities extracted from the fast stream items are identified. Similarities between the documents and the fast stream items are determined based on the location at which the neighbors are located.
    Type: Application
    Filed: September 24, 2010
    Publication date: March 29, 2012
    Inventors: Maria G. Castellanos, Chetan Kumar Gupta, Song Wang, Umeshwar Dayal
  • Publication number: 20120072413
    Abstract: A method and device may accept a first database query plan including a plurality of rows and a second database query plan including a plurality of rows, compare each row in the first database query plan with a corresponding row in the second database query plan, and based on each comparison, calculate a difference between the compared rows. Each difference may be recorded in an output file.
    Type: Application
    Filed: September 22, 2010
    Publication date: March 22, 2012
    Inventors: Maria G. CASTELLANOS, Perla Ruiz
  • Patent number: 8122066
    Abstract: A query profiling system and associated operating methods implement a database query profiler. The illustrative database system comprises a parser, a query model, a catalog manager, a binder and view rewriter, and a query analyzer. The parser parses statements by checking syntax of a statement according to a structured query language (SQL) grammar and representing the statements in a syntax-oriented internal format. The catalog manager processes data definition SQL statements (DDL) that are transformed to the internal representation; transform the representations of the DDL to database objects abstractions; groups the database objects into a Catalog. The binder and view rewriter take the internal representations obtained by parsing workload queries (data manipulation SQL statements or DML) and rewrite table references by their corresponding View definitions; bind database object references to their actual database objects contained in the catalog.
    Type: Grant
    Filed: October 14, 2008
    Date of Patent: February 21, 2012
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Maria G. Castellanos, Ivo Jose Jimenez Ramos
  • Publication number: 20110209149
    Abstract: Computer-based methods, computer-readable storage media and computer systems are provided for optimizing integration flow plans. An initial integration flow plan, one or more objectives and/or an objective function related to the one or more objectives may be received as input. A computing cost of the initial integration flow plan may be compared with the objective function. Using one or more heuristics, a set of close-to-optimal integration flow plans may be identified from all possible integration flow plans that are functionally equivalent to the initial integration flow plan. A close-to-optimal integration flow plan with a lowest computing cost may be selected from the set as a replacement for the initial integration flow plan.
    Type: Application
    Filed: February 25, 2010
    Publication date: August 25, 2011
    Inventors: Alkiviadis SIMITSIS, William K. WILKINSON, Umeshwar DAYAL, Maria G. CASTELLANOS
  • Patent number: 7933814
    Abstract: A method and system for determining whether a composite service level agreement (SLA) may be met. An exemplary method comprises calculating a baseline metric value for a plurality component service level agreements (SLA) and comparing the baseline metric value to historical metric values for each of the plurality of component services.
    Type: Grant
    Filed: September 26, 2003
    Date of Patent: April 26, 2011
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Maria G. Castellanos, Mehmet Sayal
  • Publication number: 20110047525
    Abstract: A method for quality objective-based ETL pipeline optimization is provided. An improvement objective is obtained from user input into a computing system. The improvement objective represents a priority optimization desired by a user for improved ETL flows for an application designed to run in memory of the computing system. An ETL flow is created in the memory of the computing system. The ETL flow is restructured for flow optimization with a processor of the computing system. The flow restructuring is based on the improvement objective. Flow restructuring can include application of flow rewriting optimization or application of an algebraic rewriting optimization. The optimized ETL flow is stored as executable code on a computer readable storage medium.
    Type: Application
    Filed: August 18, 2009
    Publication date: February 24, 2011
    Inventors: Maria G. Castellanos, Umeshwar Dayal, Alkiviadis Simitsis, William K. Wilkinson
  • Patent number: 7840556
    Abstract: Provided are, among other things, systems, methods and techniques for processing database queries, such as the following. A current database query to be executed is obtained, and a plan for executing the current database query is generated. A cluster that includes at least one previously executed database query is identified, based on similarity of the plan for executing the current database query to execution-plan information representing the cluster. Then, a query-execution characteristic is assigned to the current database query based on corresponding characteristic information representing the cluster, and performance of the current database query is managed based on the query-execution characteristic.
    Type: Grant
    Filed: July 31, 2007
    Date of Patent: November 23, 2010
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Umeshwar Dayal, Harumi Kuno, Stefan Krompass, Maria G. Castellanos
  • Publication number: 20100280990
    Abstract: One embodiment is a method extract information technology (IT) events that indicate start and completion times of a business process. The method transforms the IT events into business data changes that are agnostic to multiple different ETL implementation languages and transforms the business data changes into execution data. Execution data is stored in a data warehouse.
    Type: Application
    Filed: April 30, 2009
    Publication date: November 4, 2010
    Inventors: Maria G. Castellanos, Umeshwar Dayal
  • Patent number: 7797341
    Abstract: In a method of preserving characteristics of desensitized database information for use in database management system benchmarking, at least one column of sensitive data from a database is accessed. The at least one column of sensitive data is sorted into a sorted data representation. Desensitized data is generated by sequentially mapping ascending different values of the sorted data representation to ascending desensitized values generated according to a strictly monotone increasing randomly generated function. The mapping results in a plurality of mapped desensitized values which are associated with the sorted data representation. The mapped desensitized values are unsorted into a desensitized database according to sorting information related to the sorted data representation. As a result of the unsorting, cardinalities of the at least one column of sensitive data are maintained within the desensitized database.
    Type: Grant
    Filed: April 30, 2007
    Date of Patent: September 14, 2010
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Maria G. Castellanos, Bin Zhang, Umeshwar Dayal, Ivo Jimenez
  • Publication number: 20100228783
    Abstract: A method for desensitizing character strings comprises ordering an original set of character strings and generating a tree structure that has nodes corresponding to the ordered set of sensitive strings. The sensitive characters on the nodes of the tree are then desensitized in depth-first order such that a set of desensitized strings is generated that preserves the ordering of the original sensitive set.
    Type: Application
    Filed: March 6, 2009
    Publication date: September 9, 2010
    Inventors: Maria G. Castellanos, Umeshwar Dayal, Bin Zhang
  • Publication number: 20100114976
    Abstract: Embodiments in accordance with the present invention include methods and systems for database design. A method includes analyzing tables to generate recommendations for an initial design of a database before the database is constructed.
    Type: Application
    Filed: October 21, 2008
    Publication date: May 6, 2010
    Inventors: Maria G. Castellanos, Neal A. Coddington
  • Publication number: 20100094829
    Abstract: A query profiling system and associated operating methods implement a database query profiler. The illustrative database system comprises a parser, a query model, a catalog manager, a binder and view rewriter, and a query analyzer. The parser parses statements by checking syntax of a statement according to a structured query language (SQL) grammar and representing the statements in a syntax-oriented internal format. The catalog manager processes data definition SQL statements (DDL) that are transformed to the internal representation; transform the representations of the DDL to database objects abstractions; groups the database objects into a Catalog. The binder and view rewriter take the internal representations obtained by parsing workload queries (data manipulation SQL statements or DML) and rewrite table references by their corresponding View definitions; bind database object references to their actual database objects contained in the catalog.
    Type: Application
    Filed: October 14, 2008
    Publication date: April 15, 2010
    Inventors: Maria G. Castellanos, Ivo Jose Jimenez Ramos
  • Publication number: 20080270370
    Abstract: In a method of preserving characteristics of desensitized database information for use in database management system benchmarking, at least one column of sensitive data from a database is accessed. The at least one column of sensitive data is sorted into a sorted data representation. Desensitized data is generated by sequentially mapping ascending different values of the sorted data representation to ascending desensitized values generated according to a strictly monotone increasing randomly generated function. The mapping results in a plurality of mapped desensitized values which are associated with the sorted data representation. The mapped desensitized values are unsorted into a desensitized database according to sorting information related to the sorted data representation. As a result of the unsorting, cardinalities of the at least one column of sensitive data are maintained within the desensitized database.
    Type: Application
    Filed: April 30, 2007
    Publication date: October 30, 2008
    Inventors: Maria G. Castellanos, Bin Zhang, Umeshwar Dayal, Ivo Jimenez
  • Patent number: 7146359
    Abstract: A method of filtering content in a discovered topic. In one embodiment, a method for filtering content in a discovered topic is comprised of preprocessing querying data. The querying data has caused retrieval of a collection of documents. The collection of documents includes documents containing subject matter related to said querying data. The collection of documents also includes documents containing subject matter extraneous to the querying data. The querying data is clustered. Clustering of the querying data enables the discovered topic to be identified. The collection of documents are postfiltered. The postfiltering of the collection of documents generates a collection of documents having the related subject matter, and extraneous subject matter is excluded.
    Type: Grant
    Filed: May 3, 2002
    Date of Patent: December 5, 2006
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Maria G. Castellanos
  • Publication number: 20030208485
    Abstract: A method of filtering content in a discovered topic. In one embodiment, a method for filtering content in a discovered topic is comprised of preprocessing querying data. The querying data has caused retrieval of a collection of documents. The collection of documents includes documents containing subject matter related to said querying data. The collection of documents also includes documents containing subject matter extraneous to said querying data. The querying data is clustered. Clustering of the querying data enables the discovered topic to be identified. The collection of documents are postfiltered. The postfiltering of the collection of documents generates a collection of documents having the related subject matter, and extraneous subject matter is excluded.
    Type: Application
    Filed: May 3, 2002
    Publication date: November 6, 2003
    Inventor: Maria G. Castellanos