Based On Joins Patents (Class 707/714)

Method and system for combining data objects

Patent number: 8682885

Abstract: Systems and methods for combining data objects are provided. A computer system may analyze each of the data objects, identify at least one possible combination operation between the data objects, and present the identified combination operation to a user interface. The identified combination operation may include a union or a join operation. The computer system may further determine a combination grade associated with the identified combination operation and present the combination grade to the user interface. Subsequently, the computer system may receive a selection of a particular presented combination operation via the user interface and execute the particular combination operation between the data objects.

Type: Grant

Filed: May 15, 2012

Date of Patent: March 25, 2014

Assignee: SAP AG

Inventors: David Lotan Bolotnikoff, Netanel Slomianski, Natan Teren, Dmitry Gorenchteine, Tal Sapan, Udi Cohen, Kobi Sasson, Shachar Soel
MULTIPLICATION-BASED METHOD FOR STITCHING RESULTS OF PREDICATE EVALUATION IN COLUMN STORES

Publication number: 20140074818

Abstract: A system joins predicate evaluated column bitmaps having varying lengths. The system includes a column unifier for querying column values with a predicate and generating an indicator bit for each of the column values that is then joined with the respective column value. The system also includes a bitmap generator for creating a column-major linear bitmap from the column values and indicator bits. The column unifier also determines an offset between adjacent indicator bits. The system also includes a converter for multiplying the column-major linear bitmap with a multiplier to shift the indicator bits into consecutive positions in the linear bitmap.

Type: Application

Filed: September 13, 2012

Publication date: March 13, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Ronald J. Barber, Min-Soo Kim, Jae Gil Lee, Sam S. Lightstone, Guy M. Lohman, Lin Qiao, Vijayshankar Raman, Richard S. Sidle
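
The multiplication step described in the abstract of publication 20140074818 resembles a well-known bit-gathering trick, which the following Python sketch illustrates. It is not taken from the publication: the function name `gather_indicator_bits`, the fixed `stride` between adjacent indicator bits, and the restriction `count <= stride` (which keeps partial products from colliding) are all assumptions made for illustration.

```python
def gather_indicator_bits(word: int, stride: int, count: int) -> int:
    """Gather `count` indicator bits, spaced `stride` bits apart, into
    consecutive positions using a single multiplication.

    Indicator k is assumed to sit at bit position k * stride of `word`.
    """
    if count > stride:
        raise ValueError("count must not exceed stride, or partial products collide")

    # Keep only the indicator bits; payload bits are masked away.
    mask = sum(1 << (k * stride) for k in range(count))
    # Multiplier with one bit every (stride - 1) positions. The partial
    # product for j = count - 1 - k moves indicator k to position
    # (count - 1) * (stride - 1) + k, so the indicators become adjacent.
    multiplier = sum(1 << (j * (stride - 1)) for j in range(count))

    product = (word & mask) * multiplier
    shift = (count - 1) * (stride - 1)
    return (product >> shift) & ((1 << count) - 1)


# Hypothetical layout: four 4-bit fields, indicator in the lowest bit of each.
indicators = [1, 0, 1, 1]
word = 0
for k, bit in enumerate(indicators):
    word |= bit << (k * 4)        # indicator bit of field k
    word |= 0b011 << (k * 4 + 1)  # unrelated payload bits in the same field

packed = gather_indicator_bits(word, stride=4, count=4)
print(bin(packed))  # 0b1101: bit k of the result is indicator k
```

Python integers are unbounded, so the sketch ignores the word-width limits that a real column store would have to respect.
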
Optimizing snowflake schema queries

Patent number: 8671091

Abstract: For a database query that defines a plurality of separate snowflake schemas, a query optimizer computes separately for each of the snowflake schemas a logical access plan for obtaining from that schema's tables a respective record set that includes the data requested from those tables by that query. The query optimizer also computes a logical access plan for obtaining the query's results from the record sets in which execution of the logical access plans thus computed will result.

Type: Grant

Filed: August 2, 2006

Date of Patent: March 11, 2014

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Mitch Cherniack, Shilpa Lawande, Nga Tran
OPTIMIZATION TECHNIQUE OF GENERALIZED DISJUNCTIVE SEMI/ANTI JOIN

Publication number: 20140067789

Abstract: A method, apparatus, and stored instructions are provided for transforming a query representation by unnesting a predicate condition that is based on whether or not a result exists for a subquery of the predicate condition. An initial query representation is received. The initial query representation represents an initial query that includes an EXISTS-equivalent predicate or a NOT-EXISTS-equivalent predicate and at least one other predicate in a disjunction. The initial query representation is transformed into a semantically equivalent transformed query representation that represents a transformed query. The transformed query includes, instead of the EXISTS-equivalent predicate or a NOT-EXISTS-equivalent predicate, a join operator that references the data object.

Type: Application

Filed: September 4, 2012

Publication date: March 6, 2014

Applicant: ORACLE INTERNATIONAL CORPORATION

Inventors: Rafi Ahmed, Chun-Chieh Lin, Mohamed Zait
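
To make the idea of unnesting an EXISTS predicate inside a disjunction concrete, the following Python sketch uses the standard `sqlite3` module to compare a nested query with one hand-written join form. The tables `orders` and `flagged`, their columns, and the particular rewrite (a left join against the distinct subquery keys) are assumptions chosen for illustration; the publication's actual transformation may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE orders (id INTEGER PRIMARY KEY, cust INTEGER, amount INTEGER);
CREATE TABLE flagged (cust INTEGER);
INSERT INTO orders VALUES (1, 10, 50), (2, 11, 500), (3, 12, 20), (4, NULL, 5);
INSERT INTO flagged VALUES (10), (10), (13);
""")

# Initial query: an EXISTS predicate in a disjunction with another predicate.
nested = """
SELECT o.id FROM orders o
WHERE o.amount > 100
   OR EXISTS (SELECT 1 FROM flagged f WHERE f.cust = o.cust)
ORDER BY o.id
"""

# Illustrative rewrite: the subquery becomes a join. DISTINCT keeps duplicate
# rows in `flagged` from duplicating orders, and the IS NOT NULL test plays
# the role of the EXISTS check.
unnested = """
SELECT o.id FROM orders o
LEFT JOIN (SELECT DISTINCT cust FROM flagged) f ON f.cust = o.cust
WHERE o.amount > 100 OR f.cust IS NOT NULL
ORDER BY o.id
"""

nested_ids = [row[0] for row in conn.execute(nested)]
unnested_ids = [row[0] for row in conn.execute(unnested)]
print(nested_ids, unnested_ids)  # [1, 2] [1, 2]
```

Both forms return the same rows on this data; the join form simply gives an optimizer a join operator to work with instead of a per-row subquery.
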
TECHNIQUES FOR CONSTRUCTING MINIMUM SUPERSETS OF TEST DATA FROM RELATIONAL DATABASES

Publication number: 20140067790

Abstract: A computer-implemented method is provided for constructing a minimum superset of data that complies with referential completeness as well as other specified data relationships. Given extraction criteria for the data, an iterative process analyzes the database for records to be included in the minimal superset. The iterative process includes: selecting a given table from a listing of tables implicated by the extraction criteria; identifying tables in the relational database having either a parent or a child relationship with the given table; evaluating concurrently and asynchronously each of the identified tables; and repeating these steps for each of the tables in the listing of tables.

Type: Application

Filed: September 5, 2012

Publication date: March 6, 2014

Applicant: Compuware Corporation

Inventor: Andrew Lipin
Automatic Denormalization For Analytic Query Processing In Large-Scale Clusters

Publication number: 20140067791

Abstract: A system and method for improving performance of join operations is disclosed. The method in one embodiment includes building a graph of vertices and edges, which represent the tables and join conditions for a particular database schema. The method then traverses the graph to find a set of candidate join operations. The method further includes selecting from the set a subset of join operations that makes the best use of a given amount of memory, which means finding a collection of denormalizations which when residing in memory provide the most benefit and the least cost for the given amount of memory. The method then deploys the denormalizations so that current queries in a query processing system can use them instead of actually performing the joins.

Type: Application

Filed: September 6, 2012

Publication date: March 6, 2014

Applicant: ORACLE INTERNATIONAL CORPORATION

Inventors: Sam Idicula, Sabina Petride, Nipun Agarwal
Join algorithms over full text indexes

Patent number: 8661019

Abstract: According to one embodiment of the present invention, a method for processing join predicates in full-text indexes is provided. The method includes evaluating local predicates of an outer full text index to generate a first posting list of documents. For each document in the first posting list, the value of a join attribute is determined and an inner full text index is probed to obtain a second posting list of documents containing one of the join attributes determined for each document. Local predicates of an inner full text index are evaluated to generate a third posting list of documents, and the second posting list is merged with the third posting list to generate a merge list of documents. Documents in the first posting list may be paired up with documents in the merge list.

Type: Grant

Filed: January 28, 2010

Date of Patent: February 25, 2014

Assignee: International Business Machines Corporation

Inventors: Latha Sankar Colby, Quanzhong Li, Fatma Ozcan, Mir Hamid Pirahesh, Eugene J. Shekita, Zografoula Vagena
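
The posting-list steps in the abstract of patent 8661019 can be sketched with small in-memory inverted indexes. This Python sketch is a simplified reading, not the patented method: the helper names `build_index` and `index_join`, the modeling of each local predicate as a single `(field, value)` term, and the sample author and paper documents are all hypothetical.

```python
from collections import defaultdict


def build_index(docs):
    """Map each (field, value) term to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, fields in docs.items():
        for field, value in fields.items():
            index[(field, value)].add(doc_id)
    return index


def index_join(outer_docs, outer_index, outer_pred, inner_index, inner_pred, join_field):
    # First posting list: outer documents that satisfy the outer local predicate.
    first = sorted(outer_index.get(outer_pred, set()))
    # Third posting list: inner documents that satisfy the inner local predicate.
    third = inner_index.get(inner_pred, set())

    pairs = []
    for outer_id in first:
        join_value = outer_docs[outer_id][join_field]
        # Second posting list: probe the inner index with the join attribute value.
        second = inner_index.get((join_field, join_value), set())
        # Merge the second and third lists, then pair with the outer document.
        for inner_id in sorted(second & third):
            pairs.append((outer_id, inner_id))
    return pairs


authors = {
    "a1": {"country": "US", "author_id": "7"},
    "a2": {"country": "FR", "author_id": "8"},
    "a3": {"country": "US", "author_id": "9"},
}
papers = {
    "p1": {"topic": "joins", "author_id": "7"},
    "p2": {"topic": "joins", "author_id": "8"},
    "p3": {"topic": "xml", "author_id": "9"},
    "p4": {"topic": "joins", "author_id": "9"},
}

result = index_join(
    authors, build_index(authors), ("country", "US"),
    build_index(papers), ("topic", "joins"), "author_id",
)
print(result)  # [('a1', 'p1'), ('a3', 'p4')]
```
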
OLAP execution model using relational operations

Patent number: 8650181

Abstract: In one embodiment the present invention includes an OLAP execution model using relational operations. In one embodiment, the present invention includes a method comprising receiving a first query in an online analytic processor (OLAP) executing on one or more computers, the OLAP generating and comprising a model specifying a graph defining a plurality of nodes and a plurality of tiers, each node corresponding to a different operation on data. A second query is generated by the OLAP. The second query includes a plurality of layered subqueries each corresponding to one of the nodes in the graph for specifying the different operations on data. The second query is received in a relational engine coupled to the datastore. The relational engine executes the second query, and in accordance therewith, retrieves data.

Type: Grant

Filed: November 22, 2010

Date of Patent: February 11, 2014

Assignee: SAP AG

Inventors: Stefan Dipper, Erich Marschall, Tobias Mindnich, Daniel Baeumges, Christoph Weyerhaeuser
SYSTEMS AND METHODS FOR RELINKING DATA ITEMS

Publication number: 20140032526

Abstract: In one example embodiment, a method is implemented to relink a data item. The method includes identifying that a data item, referenced by a referencing document, is absent from a data item location for the data item indicated by the referencing document. User preference data, indicating a designated relink location to search for absent data items, is accessed. The relink location is searched for the absent data items. Another example embodiment may include a method to relink a current data item to a replacement data item location by accessing user preference data and searching a relink location for a replacement data item in accordance with the relink location indicated by the user preference data.

Type: Application

Filed: September 18, 2008

Publication date: January 30, 2014

Applicant: Adobe Systems Incorporated

Inventors: Lance D. Bushore, Michael Easter, Thomas M. McDonald
Concepts for viewing and accessing claim versions

Patent number: 8639664

Abstract: Systems, methods, apparatus, and computer program products are provided for accessing, navigating, and displaying multiple versions of claims in an efficient and customer-friendly manner. In various embodiments, claims can be processed and stored such that they can be displayed via an interface in a graphical format (e.g., textual, circular, hierarchical, etc.).

Type: Grant

Filed: March 29, 2012

Date of Patent: January 28, 2014

Inventors: Tracy Ann Weber, Mark Galik, Christine Belanger
GENERATING STATISTICAL VIEWS IN A DATABASE SYSTEM

Publication number: 20140012835

Abstract: Techniques are disclosed for generating statistical views in a database system. In one embodiment, a request is received to execute a database workload. One or more constraints pertaining to executing the database workload are retrieved. The database workload is evaluated to generate multiple statistical view candidates. The statistical view candidates are refined based on the one or more constraints. One or more statistical views are then generated based on the refined statistical view candidates.

Type: Application

Filed: May 3, 2013

Publication date: January 9, 2014

Applicant: International Business Machines Corporation

Inventor: International Business Machines Corporation
Executing CASE expressions containing subqueries

Patent number: 8626744

Abstract: Methods and apparatus, including computer program products, implementing and using techniques for executing a query in a computer to retrieve data from a relational database stored on a data storage device. A query is received that contains a CASE expression having one or more subqueries. A query execution plan is generated. The query execution plan uses a parameterized union all operator to keep the semantics of the CASE expression, and a nested loop join operator to drive data across the section of the query execution plan that executes the CASE expression without performing any join operations. The generated query execution plan is executed in the computer to retrieve data from the relational database.

Type: Grant

Filed: January 26, 2012

Date of Patent: January 7, 2014

Assignee: International Business Machines Corporation

Inventors: Qi Cheng, John F. Hornibrook, Jose A. Saborit, David C. Sharpe, Calisto P. Zuzarte
EFFICIENT PARTITIONED JOINS IN A DATABASE WITH COLUMN-MAJOR LAYOUT

Publication number: 20140006379

Abstract: Embodiments of the present invention provide a database processing system for efficient partitioning of a database table with column-major layout for executing one or more join operations. One embodiment comprises a method for partitioning a database table with column-major layout, partitioning only the join-columns by limiting the partitions by size and number, executing one or more join operations for joining the partitioned columns, and optionally de-partitioning the join result to the original order by sequentially writing and randomly reading table values using P cursors.

Type: Application

Filed: June 29, 2012

Publication date: January 2, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Stefan Arndt, Gopi K. Attaluri, Ronald J. Barber, Guy M. Lohman, Lin Qiao, Vijayshankar Raman, Eugene J. Shekita, Richard S. Sidle
EFFICIENT PARTITIONED JOINS IN A DATABASE WITH COLUMN-MAJOR LAYOUT

Publication number: 20140006380

Abstract: Embodiments of the present invention provide a database processing system for efficient partitioning of a database table with column-major layout for executing one or more join operations. One embodiment comprises a method for partitioning a database table with column-major layout, partitioning only the join-columns by limiting the partitions by size and number, executing one or more join operations for joining the partitioned columns, and optionally de-partitioning the join result to the original order by sequentially writing and randomly reading table values using P cursors.

Type: Application

Filed: August 24, 2012

Publication date: January 2, 2014

Applicant: International Business Machines Corporation

Inventors: Stefan ARNDT, Gopi K. Attaluri, Ronald J. Barber, Guy M. Lohman, Lin Qiao, Vijayshankar Raman, Eugene J. Shekita, Richard S. Sidle
Generating materialized query table candidates

Patent number: 8620899

Abstract: Techniques for generating a set of one or more materialized query table (MQT) candidates for a workload are provided. The techniques include receiving a workload, wherein the workload comprises a set of one or more queries, generating one or more best matching MQTs (BMQTs) based on one or more query blocks of the one or more queries by removing syntax that is not qualified for a MQT re-write, determining one or more frequently used multi-joins in the workload, using the one or more BMQTs and the one or more frequently used multi-joins to generate a set of one or more workload MQTs (WMQTs), and grouping one or more WMQTs and one or more BMQTs into one or more groups to merge into a set of a smaller number of MQTs and to cover the workload.

Type: Grant

Filed: February 9, 2010

Date of Patent: December 31, 2013

Assignee: International Business Machines Corporation

Inventors: Dongsheng Chen, Hong Min, Terence P. Purcell, Yefim Shuf, Xiaobo Wang, Zhongliang Zhang
TECHNIQUES FOR PROCESSING GROUP MEMBERSHIP DATA IN A MULTI-TENANT DATABASE SYSTEM

Publication number: 20130346389

Abstract: In accordance with embodiments, there are provided techniques for processing group membership data in a multi-tenant database system. These techniques for processing group membership data in a multi-tenant database system may enable embodiments to provide great flexibility to a tenant of the architecture to select the content that may be perceived by the tenant users while allowing the owner of the architecture control over the content.

Type: Application

Filed: June 12, 2013

Publication date: December 26, 2013

Applicant: salesforce.com, inc.

Inventors: Scott Yancey, Kedar Doshi, Yongsheng Wu
Method, system and program product for rewriting structured query language (SQL) statements

Patent number: 8612422

Abstract: A method and system for rewriting Structured Query Language (SQL) statements. A SQL statement having search criteria including conditions pertaining to fields in database tables is received. A table from the SQL statement listing all combinations of the search criteria is created, every combination reflecting a subset of information used to determine combined results. A set of patterns among the possible combinations of the SQL statement based on the fields referenced in the combinations is identified. The table based on the set of patterns is sorted. Based on the sorting, the table is divided into temporary tables which are joined with the SQL statement. New temporary tables are generated. Each new temporary table includes results of a search conducted using a separate portion of the SQL statement as joined with the set of temporary tables. All results from the new temporary tables are combined into a single result table.

Type: Grant

Filed: March 21, 2012

Date of Patent: December 17, 2013

Assignee: International Business Machines Corporation

Inventors: Howard S. Bloom, Roy Froehlich, Thomas A. Jobson, Jr., Edith A. Kanyock, Charles F. Matula, Arnold M. Rosenberg
GENERATING DATA PROCESSING CODE FROM A DIRECTED ACYCLIC GRAPH

Publication number: 20130332449

Abstract: The present invention provides a computer-implemented code generation system that generates data processing code from a directed acyclic graph (DAG). The generated code is both declarative and procedural, and can be run in a relational database or in a Map Reduce implementation using Apache Pig. Each node of the DAG specifies operations performed on tabular data that can be stored in a delimited plain text file, a spreadsheet, or a relational database.

Type: Application

Filed: June 6, 2013

Publication date: December 12, 2013

Inventors: John David Amos, Oleg Merlugov
Adaptive processing of top-k queries in nested-structure arbitrary markup language such as XML

Patent number: 8606794

Abstract: A method of adaptively evaluating a top-k query involves (1204) forming servers having respective server queues storing candidate answers, processing (1322) the candidate answers, and (1232) providing a top-k set as a query evaluation. Processing includes (1402) adaptively choosing a winning server to whose queue a current candidate answer should be sent; (1404) sending the current candidate answer to the winning server's queue; (1334) adaptively choosing a next candidate answer to process from the winning server's queue; (1336) computing a join between the current candidate answer and next candidate answers at the winning server, so as to produce a new current candidate answer; and (1338) updating the top-k set with the new current candidate answer only if a score of the new current candidate answer exceeds a score of a top-k answer in a top-k set. A method of calculating scores for candidate answers is also provided.

Type: Grant

Filed: March 4, 2013

Date of Patent: December 10, 2013

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Sihem Amer-Yahia, Nikolaos Koudas, Amelie Marian-Guerrier, Divesh Srivastava
HARDWARE-ACCELERATED RELATIONAL JOINS

Publication number: 20130318067

Abstract: Techniques are provided for hardware-accelerated relational joins. A first table comprising one or more rows is processed through a hardware accelerator. At least one join column in at least one of the one or more rows of the first table is hashed to set at least one bit in at least one bit vector. A second table comprising one or more rows is processed through a hardware accelerator. At least one join column in at least one of the one or more rows of the second table is hashed to generate at least one hash value. At least one bit vector is probed using the at least one hash value. A joined row is constructed responsive to the probing step. The row-construction step is performed in the hardware accelerator.

Type: Application

Filed: May 23, 2012

Publication date: November 28, 2013

Applicant: International Business Machines Corporation

Inventors: Bharat Sukhwani, Sameh W. Asaad, Hong Min, Matthew S. Thoennes, Gong Su
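
The build and probe phases in the abstract of publication 20130318067 can be mimicked in software. The Python sketch below only illustrates the bit-vector idea and makes no claim about the hardware design: the names `hash_slot` and `bitvector_join`, the 64-bit vector size, and the dictionary used to confirm matches after a positive probe are assumptions.

```python
def hash_slot(value, num_bits):
    """Map a join-column value to a bit position (assumed hash function)."""
    return hash(value) % num_bits


def bitvector_join(build_rows, probe_rows, key, num_bits=64):
    bit_vector = 0
    buckets = {}

    # Build phase: hash each join-column value of the first table to set a bit.
    for row in build_rows:
        bit_vector |= 1 << hash_slot(row[key], num_bits)
        buckets.setdefault(row[key], []).append(row)

    joined = []
    # Probe phase: hash each join-column value of the second table and test the bit.
    for row in probe_rows:
        if ((bit_vector >> hash_slot(row[key], num_bits)) & 1) == 0:
            continue  # bit not set: this row cannot have a join partner
        # A set bit may be a false positive, so confirm against the stored rows
        # before constructing the joined row.
        for match in buckets.get(row[key], []):
            joined.append({**match, **row})
    return joined


customers = [{"cust_id": 1, "name": "Ada"}, {"cust_id": 2, "name": "Lin"}]
orders = [
    {"order_id": 100, "cust_id": 2},
    {"order_id": 101, "cust_id": 3},   # bit not set, skipped early
    {"order_id": 102, "cust_id": 66},  # same bit as cust_id 2, rejected on confirmation
]

rows = bitvector_join(customers, orders, key="cust_id")
print(rows)  # [{'cust_id': 2, 'name': 'Lin', 'order_id': 100}]
```

The bit vector lets most non-matching probe rows be discarded with one bit test, which is the kind of cheap, regular operation that suits an accelerator.
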
METHOD AND SYSTEM FOR COMBINING DATA OBJECTS

Publication number: 20130311443

Abstract: Systems and methods for combining data objects are provided. A computer system may analyze each of the data objects, identify at least one possible combination operation between the data objects, and present the identified combination operation to a user interface. The identified combination operation may include a union or a join operation. The computer system may further determine a combination grade associated with the identified combination operation and present the combination grade to the user interface. Subsequently, the computer system may receive a selection of a particular presented combination operation via the user interface and execute the particular combination operation between the data objects.

Type: Application

Filed: May 15, 2012

Publication date: November 21, 2013

Applicant: SAP AG

Inventors: David Lotan Bolotnikoff, Netanel Slomianski, Natan Teren, Dmitry Gorenchteine, Tal Sapan, Udi Cohen, Kobi Sasson, Shachar Soel
System, Method, and Data Structure for Automatically Generating Database Queries which are Data Model Independent and Cardinality Independent

Publication number: 20130311442

Abstract: Systems and methods are provided for performing a database query. After receiving an identification of data item components and associations between the data item components, wherein the data item components include a measure and a category, a query generation system creates and executes a database query that comprises one or more Uqueries, wherein each Uquery comprises a Mx segment subquery and a U0 segment subquery. The Mx segment subquery is associated with a table that contains the measure, a table that contains columns for a calculated measure, or a correlated table, and the U0 segment subquery is associated with a table that contains the category, a table that contains columns for a calculated category, a dependent measure table, or a dependent filter table. The query generation system can generate a virtual results table by aggregating Uquery results from one Uquery with Uquery results from another Uquery.

Type: Application

Filed: May 15, 2012

Publication date: November 21, 2013

Inventors: Thomas P. Weber, Frederick J. Levine, James P. Kelley
Efficient Top-Down Hierarchical Join On A Hierarchically Clustered Data Stream

Publication number: 20130311444

Abstract: An apparatus for conducting a hierarchical join on a hierarchically clustered data stream is disclosed. The apparatus may include a memory storing query conditions that each determine whether a Structured Query Language query is suitable for performing a non-buffering read operation on a hierarchical data storage. A processor in communication with the memory may be configured to receive a Structured Query Language (“SQL”) query from a client device and evaluate the SQL query for one or more of the query conditions stored in the memory. When the processor determines that the SQL query satisfies one or more of the query conditions, the processor may then perform a non-buffering read operation on the hierarchical data storage to obtain a hierarchically clustered data stream. The hierarchically clustered data stream may be evaluated to obtain a plurality of record results, which may be transmitted to the client device.

Type: Application

Filed: March 13, 2013

Publication date: November 21, 2013

Applicant: Google Inc.

Inventors: Bart Samwel, Chad Whipkey
JOIN PROCESSING DEVICE, DATA MANAGEMENT DEVICE, AND STRING SIMILARITY JOIN SYSTEM

Publication number: 20130311445

Abstract: Provided is a join processing device that performs a similarity join process to plural tuples using an edit distance threshold value τ (positive integer). The join processing device includes a join processing unit that excludes, from a target of edit distance calculation, a pair of tuples that do not have any common character in an end portion ranging from a head character or a tail character to a (τ+1)th character in a join key string in each of the tuples.

Type: Application

Filed: November 7, 2011

Publication date: November 21, 2013

Applicant: NEC CORPORATION

Inventor: Kazuyo Narita
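
The pruning rule in the abstract of publication 20130311445 can be sketched as a cheap filter that runs before the edit distance calculation. In this Python sketch the function names, the choice to test both the head and the tail portions, and the guard that skips the filter when both strings are no longer than the threshold are assumptions added for illustration; the threshold is written `tau`.

```python
def edit_distance(a: str, b: str) -> int:
    """Classic dynamic-programming Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def may_be_similar(a: str, b: str, tau: int) -> bool:
    """Return False only when the pair can be excluded without computing the distance."""
    if max(len(a), len(b)) <= tau:
        return True  # assumed guard: such short strings are always within tau edits
    n = tau + 1
    heads_share = bool(set(a[:n]) & set(b[:n]))
    tails_share = bool(set(a[-n:]) & set(b[-n:]))
    return heads_share and tails_share


def similarity_join(left, right, tau):
    pairs = []
    for a in left:
        for b in right:
            # Exclude pairs with no common character in an end portion.
            if may_be_similar(a, b, tau) and edit_distance(a, b) <= tau:
                pairs.append((a, b))
    return pairs


matches = similarity_join(["kitten", "apple"], ["sitten", "maple", "zzzzzz"], tau=1)
print(matches)  # [('kitten', 'sitten')]
```

The filter is safe because, within `tau` edits, the first aligned pair of equal characters must fall in the first `tau + 1` positions of both strings, and the same holds symmetrically at the tail.
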
Method to detect reference data tables in ETL processes

Patent number: 8583626

Abstract: A method, system and computer program product for identifying reference data tables in an Extract-Transform-Load (ETL) process, by identifying, by operation of one or more computer processors, at least a first reference data operator in the process, wherein the first reference data operator references one or more tables and evaluating at least a first table referenced by the reference data operator to determine whether the first table is a reference data table by assigning a score to the first table, wherein the score is indicative of the likelihood that the first table is a reference data table and wherein a reference data table contains a set of values that describes other data.

Type: Grant

Filed: March 8, 2012

Date of Patent: November 12, 2013

Assignee: International Business Machines Corporation

Inventors: Dan J. Mandelstein, Ivan M. Milman, Martin Oberhofer, Sushain Pandit
EVALUATION BY NESTED QUERIES

Publication number: 20130290294

Abstract: A system, method, and non-transitory computer readable medium are disclosed herein to optimize nested queries. At least one field is associated with each row of data evaluated by at least one nested query. A value is assigned to the at least one field so as to indicate whether each row of data satisfies the at least one nested query. The at least one field associated with each row of data is evaluated to determine satisfaction of the at least one nested query.

Type: Application

Filed: April 27, 2012

Publication date: October 31, 2013

Inventors: Matthew Steven Fuller, Charles Edward Bear
Calculating Count Distinct Using Vertical Unions

Publication number: 20130290293

Abstract: A query statement is received that specifies a count distinct. Thereafter, a data flow graph that comprises a plurality of nodes for executing the query is generated. The nodes provide aggregation operations, sorting of results on join attributes and vertically appending columns of count distinct results with intermediate results from at least one of the aggregation operations. Thereafter, execution of the query is initiated using the data flow graph. Related apparatus, systems, techniques and articles are also described.

Type: Application

Filed: April 26, 2012

Publication date: October 31, 2013

Applicant: SAP AG

Inventors: Christoph Weyerhaeuser, Tobias Mindnich, Daniel Baeumges, Gerrit Simon Kazmaier
MAINTAINING FAULT DOMAINS IN A DISTRIBUTED DATABASE

Publication number: 20130290295

Abstract: In at least some examples, a system includes a distributed database and control logic to enable updates and queries to the distributed database. The control logic applies a plurality of identifiers to the updates and queries to maintain distinct fault domains in the distributed database.

Type: Application

Filed: April 30, 2012

Publication date: October 31, 2013

Inventors: Craig A. Soules, Alistair Veitch, Charles B. Morrey, III, Kimberly Keeton
Method to estimate the number of distinct value combinations for a set of attributes in a database system

Patent number: 8572067

Abstract: A method to estimate the number of distinct value combinations for a set of attributes in a database system is disclosed. The method comprises utilizing frequency information within the set of attributes to provide a best estimate for the number of distinct value combinations. In a preferred embodiment, the utilizing step comprises estimating the number of distinct value combinations utilizing frequency information of the set of attributes based upon probability theory and further includes providing bounds on the distinct value information combinations utilizing the frequency information of the set of attributes. In so doing, an estimate for the number of distinct value combinations is provided.

Type: Grant

Filed: March 14, 2006

Date of Patent: October 29, 2013

Assignee: International Business Machines Corporation

Inventors: Calisto Paul Zuzarte, Xiaohui Yu
METHOD AND COMPUTER PROGRAM FOR EVALUATING DATABASE QUERIES INVOLVING RELATIONAL AND HIERARCHICAL DATA

Publication number: 20130282694

Abstract: A method and apparatus for resolving conjunctive queries, comprising relational predicates and external predicates, such as XML predicates. The method and apparatus obtain results from a relational engine or from an external engine, and then use the obtained results as input for the other engine, so as to optimize the activity of the other engine. The process may repeat further times, with the results obtained by the other engine being input to the first engine. The method and apparatus further disclose handling value, being isomorphic hierarchical sub-trees.

Type: Application

Filed: June 18, 2013

Publication date: October 24, 2013

Applicant: TECHNION RESEARCH AND DEVELOPMENT FOUNDATION LTD.

Inventors: Oded SHMUELI, Royi Ronen
ADAPTIVE PROCESSING OF TOP-K QUERIES IN NESTED-STRUCTURE ARBITRARY MARKUP LANGUAGE SUCH AS XML

Publication number: 20130275407

Abstract: A method of adaptively evaluating a top-k query involves (1204) forming servers having respective server queues storing candidate answers, processing (1322) the candidate answers, and (1232) providing a top-k set as a query evaluation. Processing includes (1402) adaptively choosing a winning server to whose queue a current candidate answer should be sent; (1404) sending the current candidate answer to the winning server's queue; (1334) adaptively choosing a next candidate answer to process from the winning server's queue; (1336) computing a join between the current candidate answer and next candidate answers at the winning server, so as to produce a new current candidate answer; and (1338) updating the top-k set with the new current candidate answer only if a score of the new current candidate answer exceeds a score of a top-k answer in a top-k set. A method of calculating scores for candidate answers is also provided.

Type: Application

Filed: March 4, 2013

Publication date: October 17, 2013

Inventors: Sihem Amer-Yahia, Nikolaos Koudas, Amelie Marian-Guerrier, Divesh Srivastava
Data replication framework

Patent number: 8554762

Abstract: Generally described, the present disclosure is directed to an eventually consistent replicated data store that uses, for its underlying storage, a computer software library that provides a high-performance embedded database for data. The replicated data store employs a plurality of hosts interconnected to one another, allowing for writes to any host and full awareness of membership across all hosts. With the data replication framework disclosed herein, various modes are allowed to be built up on top of the core system.

Type: Grant

Filed: December 28, 2010

Date of Patent: October 8, 2013

Assignee: Amazon Technologies, Inc.

Inventors: Brian S. O'Neill, Jason Shijia Long, Archit Shivaprakash, Olga I. Kuznetsova
System and method for optimizing queries

Patent number: 8554760

Abstract: There is provided a computer-implemented method of optimizing a query. An exemplary method comprises receiving a first query configured to generate a result set. The query may comprise an operation for an outer table, a first predicate, and a second predicate. The second predicate may comprise a subquery correlated to the operation. Additionally, the first predicate and the second predicate may be disjunctive. The exemplary method also comprises generating a first query tree representing the first query, and generating a second query tree representing a second query. The second query may be configured to generate the result set, and may not comprise the subquery. The exemplary method additionally comprises generating a query plan based on the second query tree.

Type: Grant

Filed: October 7, 2009

Date of Patent: October 8, 2013

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Suresh Subbiah, Lars Fredriksen, Murali Mallela Krishna, Hansjorg Zeller
Transforming a single-table join predicate into a pseudo-join predicate

Patent number: 8554761

Abstract: Methods, computing devices, and computer-readable media are provided for transforming an initial query with a single-table outer join predicate to a second, semantically equivalent query that includes a multi-table predicate instead of the single-table outer join predicate. Because the second query does not include the single-table outer join predicate, the second query may be available for further transformations or optimization techniques, may produce a more efficient or otherwise different execution plan, or may be executed more efficiently or differently. The single-table outer join predicate may be in a full outer join ON clause of the initial query or nested in a left outer-joined LATERAL VIEW of the initial query. The multi-table predicate may be in a WHERE clause of the second query. Unlike the initial query, the second query allows use of a temporary organization of data from multiple tables for evaluation of multiple rows of the left outer join.

Type: Grant

Filed: May 25, 2012

Date of Patent: October 8, 2013

Assignee: Oracle International Corporation

Inventor: Rafi Ahmed
Filtering a Distributed Join

Publication number: 20130262433

Abstract: Methods, program products, and systems for filtering a distributed join are described. A computing device can receive a database join request to join a first table stored on a first system and a second table stored on a second system to create a related data set. The join request can include a filtering condition for filtering the related data set. The system can divide the filtering condition into a first sub-condition and a second sub-condition, and send the first sub-condition and the second sub-condition to the first system and second system, respectively. The first system and second system can then perform filtering locally, and send filtered intermediate results to the computing device.

Type: Application

Filed: April 2, 2012

Publication date: October 3, 2013

Inventors: James Qiu, Murali Vemulapati
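
The condition-splitting step described in the "Filtering a Distributed Join" abstract above can be illustrated with a minimal Python sketch. It assumes the filtering condition is a conjunction of single-table terms, and it models the two remote systems as in-memory lists of rows. The names `Term`, `split_condition`, `filter_locally`, and `distributed_filtered_join` are hypothetical and do not come from the application.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Term:
    """One conjunct of the filtering condition (illustrative structure)."""
    table: str                      # which table the term refers to
    column: str                     # column tested by the term
    test: Callable[[object], bool]  # predicate applied to the column value


def split_condition(terms, first, second):
    """Divide a conjunctive condition into one sub-condition per table."""
    first_sub = [t for t in terms if t.table == first]
    second_sub = [t for t in terms if t.table == second]
    return first_sub, second_sub


def filter_locally(rows, sub_condition):
    """Stand-in for the filtering each remote system performs on its own rows."""
    return [r for r in rows if all(t.test(r[t.column]) for t in sub_condition)]


def join(left, right, key):
    """Plain hash join on a shared key, run by the coordinating device."""
    index = {}
    for r in right:
        index.setdefault(r[key], []).append(r)
    return [{**l, **r} for l in left for r in index.get(l[key], [])]


def distributed_filtered_join(first_rows, second_rows, terms, first, second, key):
    first_sub, second_sub = split_condition(terms, first, second)
    # Each system filters locally, so only the reduced intermediate
    # results travel back to the coordinating device.
    first_filtered = filter_locally(first_rows, first_sub)
    second_filtered = filter_locally(second_rows, second_sub)
    return join(first_filtered, second_filtered, key)
```

In this sketch only rows that already satisfy their own table's sub-condition reach the join, which is the point of pushing the filter down. Conditions that mix columns from both tables are not handled here.
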
Technique of decorrelation of lateral views

Patent number: 8548983

Abstract: Methods, computer-readable media, and machines are provided for transforming a query by removing a correlated predicate from a lateral view in an initial query. A query processor may transform the lateral view into a regular view such that a transformed query with the regular view is semantically equivalent to the initial query. The transformed query may support a larger set of access paths, join methods, and join permutations, may qualify for other transformations or optimizations, or may otherwise be executed or optimized differently from the initial query. The lateral view may reference an aggregation function, and the query processor may add a GROUP BY operator to the transformed query to preserve, after the correlated predicate has been pulled out of the lateral view, an interaction that was present between the aggregation function and the correlated predicate when the correlated predicate was in the lateral view.

Type: Grant

Filed: June 7, 2012

Date of Patent: October 1, 2013

Assignee: Oracle International Corporation

Inventor: Rafi Ahmed
SEMANTIC DISCOVERY AND MAPPING BETWEEN DATA SOURCES

Publication number: 20130254183

Abstract: An apparatus and method are described for the discovery of semantics, relationships and mappings between data in different software applications, databases, files, reports, messages, or systems. In one aspect, semantics and relationships and mappings are identified between a first and a second data source. A binding condition is discovered between portions of data in the first and the second data source. The binding condition is used to discover correlations between portions of data in the first and the second data source. The binding condition and the correlations are used to discover a transformation function between portions of data in the first and the second data source.

Type: Application

Filed: May 9, 2013

Publication date: September 26, 2013

Applicant: International Business Machines Corporation

Inventors: Alexander Gorelik, Lingling Yan
On-demand database service system, method and computer program product for generating a custom report

Patent number: 8543567

Abstract: In accordance with embodiments, there are provided mechanisms and methods for generating a custom report using an on-demand database service. These mechanisms and methods for generating an on-demand database service custom report can enable embodiments to generate reports that reflect a relationship between at least two different objects. The ability of embodiments to provide such additional insight into database contents may lead to more efficient and effective reporting.

Type: Grant

Filed: August 8, 2012

Date of Patent: September 24, 2013

Assignee: salesforce.com, inc.

Inventors: Jesse Collins, Mark A. Fischer, Thomas Kim, Thomas Joseph Tobin, Simon Wong
Two phase method for processing multi-way join query over data streams

Patent number: 8538953

Abstract: Provided is a method for processing a multi-way join query comprising: filling matrix-based synopses according to input source stream and generating a result vector evaluating a query equation defined by the matrix multiplication operation of the matrix-based synopses corresponding to nodes of a query walk; and generating a result tuple for the multi-way join query from the matrix-based synopses based on the result vector.

Type: Grant

Filed: March 16, 2012

Date of Patent: September 17, 2013

Assignee: Industry-Academic Cooperation Foundation, Yonsei University

Inventor: Won Suk Lee
METHOD TO DETECT REFERENCE DATA TABLES IN ETL PROCESSES

Publication number: 20130238596

Abstract: A method, system and computer program product for identifying reference data tables in an Extract-Transform-Load (ETL) process, by identifying, by operation of one or more computer processors, at least a first reference data operator in the process, wherein the first reference data operator references one or more tables and evaluating at least a first table referenced by the reference data operator to determine whether the first table is a reference data table by assigning a score to the first table, wherein the score is indicative of the likelihood that the first table is a reference data table and wherein a reference data table contains a set of values that describes other data.

Type: Application

Filed: March 8, 2012

Publication date: September 12, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Dan J. Mandelstein, Ivan M. Milman, Martin A. Oberhofer, Sushain Pandit
Multi-entity-centric integrated search system and method

Patent number: 8533174

Abstract: Disclosed herein is a multi-entity-centric integrated search system and method. The multi-entity-centric integrated search system includes an entity information acquisition server for receiving and analyzing a query term and determining entities and types of entities, and an integrated search result provision server for configuring an integrated search results page using results acquired from unit service calling units, such as an external API calling unit, a search engine calling unit, and an inference engine calling unit, and presenting integrated search results. The present invention is advantageous in that precise search results can be shown more rapidly, satisfaction with search results for a query term in which multiple entities coexist can be improved, the ambiguity of query terms is overcome, and an open platform capable of operating in conjunction with various types of web services is provided.

Type: Grant

Filed: July 17, 2008

Date of Patent: September 10, 2013

Assignee: Korea Institute of Science and Technology Information

Inventors: Han Min Jung, Mi Kyung Lee, Pyung Kim, Seung Woo Lee, Du Seok Jin, Jung Sun Yoon, Dong In Park, Won Kyung Sung, Sun Hwa Hahn
SYSTEMS AND METHODS FOR PERFORMING A NESTED JOIN OPERATION

Publication number: 20130232133

Abstract: Systems and methods are provided for performing a nested join operation. Partitioning key values are computed for an outer data source using a partitioning key function used to partition an inner table. A join process is established for each of a plurality of partitions of the inner table (at 204), with a given partition of the inner table representing a plurality of partitioning key values. Each row from the outer data source is routed to a join process according to its associated partitioning key value (at 206). The inner table is probed to return a row from the inner table having the partitioning key value associated with the row from the outer data source (at 208). The row from the outer data source and the row returned from the inner table are joined to form a row in a combined table (at 210).

Type: Application

Filed: December 3, 2010

Publication date: September 5, 2013

Inventors: Awny K. Al-Omari, Qifan Chen, Michael J. Hanlon
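
The routing idea in the nested join abstract above can be sketched in Python as follows. This is only an illustration under stated assumptions: the partitioning key function is taken to be `hash(key) % n`, each "join process" is simulated by one loop iteration rather than a real process, and the names `partition_of`, `partition_inner`, and `nested_join` are hypothetical.

```python
from collections import defaultdict


def partition_of(key, num_partitions):
    """Hypothetical partitioning key function shared by both sides."""
    return hash(key) % num_partitions


def partition_inner(inner_rows, key, num_partitions):
    """Partition the inner table; each partition indexes its rows by key value."""
    partitions = [defaultdict(list) for _ in range(num_partitions)]
    for row in inner_rows:
        partitions[partition_of(row[key], num_partitions)][row[key]].append(row)
    return partitions


def nested_join(outer_rows, inner_rows, key, num_partitions=4):
    partitions = partition_inner(inner_rows, key, num_partitions)

    # Route every outer row to the join process that owns its partition,
    # using the same function that partitioned the inner table.
    routed = defaultdict(list)
    for row in outer_rows:
        routed[partition_of(row[key], num_partitions)].append(row)

    joined = []
    for p in range(num_partitions):  # one iteration stands in for one join process
        for outer in routed.get(p, []):
            # Probe only this partition of the inner table.
            for inner in partitions[p].get(outer[key], []):
                joined.append({**outer, **inner})
    return joined
```

Because both sides use the same partitioning function, an outer row can only match inner rows held by the process it was routed to, so no process needs to look at another partition.
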
Database consistent sample data extraction

Patent number: 8527552

Abstract: Various embodiments of systems and methods for database consistent sample data extraction are described herein. The technique uses production data as input and outputs sample data in the same relational schema while preserving the integrity of joins of the different tables in the schema. For a given relational schema, the master tables are found. Then a subset is created by placing selection criteria in a query defining how to sample the data for these master tables. Following the joins, the dependent tables are added to the query automatically.

Type: Grant

Filed: July 7, 2011

Date of Patent: September 3, 2013

Assignee: Business Objects Software Limited

Inventor: John O'Byrne
Method, system and computer-readable media for software object relationship traversal for object-relational query binding

Patent number: 8527502

Abstract: A system, method, and computer-readable media for searching an object-oriented database are provided. The present invention may be employed to detect software bugs, deficiencies and/or errors in, affecting, or related to, an object database management system. The invented technique may use an SQL table join query to search for incomplete, damaged or erroneous software objects.

Type: Grant

Filed: March 29, 2007

Date of Patent: September 3, 2013

Inventor: Blade Makai Doyle
Preprocessing text to enhance statistical features

Patent number: 8527500

Abstract: A document preprocessor preprocesses a document to enhance the statistical features of the document. The system preprocesses the document by matching a prefix and a trailing context in the document with one or more matching prefixes in a transformation database, where the prefix is a first string of one or more tokens in the first document and the trailing context is a second string of one or more tokens in the first document that trail the prefix. Alternatively, the system preprocesses the document by computing cyclic permutations of the document, sorting these permutations and taking the last token from each of the sorted permutations.

Type: Grant

Filed: February 27, 2009

Date of Patent: September 3, 2013

Assignee: Red Hat, Inc.

Inventor: James Paul Schneider
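
The alternative described at the end of the "Preprocessing text to enhance statistical features" abstract above (compute cyclic permutations, sort them, take the last token of each) resembles the Burrows-Wheeler transform applied to tokens. A minimal Python sketch follows. The function names are hypothetical, and the patent may define tokenization, ordering, and end-of-document handling differently.

```python
def cyclic_rotations(tokens):
    """Return every cyclic rotation of the token sequence."""
    tokens = list(tokens)
    return [tokens[i:] + tokens[:i] for i in range(len(tokens))]


def last_token_transform(tokens):
    """Sort the rotations lexicographically and keep each one's last token."""
    rotations = sorted(cyclic_rotations(tokens))
    return [rotation[-1] for rotation in rotations]
```

For example, `last_token_transform("banana")` treats each character as a token and returns the characters of `"nnbaaa"`. Equal tokens end up clustered together, which is the kind of statistical regularity a later compression stage can exploit. This naive version materializes all rotations and therefore uses quadratic memory.
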
PROCESSING ABSTRACT DERIVED ENTITIES DEFINED IN A DATA ABSTRACTION MODEL

Publication number: 20130226901

Abstract: A method, apparatus and article of manufacture for utilizing an abstract derived entity are disclosed. An abstract derived entity is a data object present in an abstract data model that may be referenced by other entities in the abstract data model as though it were a relational table present in a physical data source. In one embodiment, an abstract data model provides data access independent of the manner in which the data is physically represented in a set of physical data sources. In addition, an abstract derived entity may be used to provide aggregate data joined with other non-aggregate data. Aggregate data values are calculated from the column of a relational table, or other sets of input data. In one embodiment, an abstract derived entity may describe a relational table comprising a set of aggregate data used to form a column of the relational table joined to other columns formed from the sets of input data specified by the abstract derived entity.

Type: Application

Filed: March 15, 2013

Publication date: August 29, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: International Business Machines Corporation
SYSTEM AND PROGRAM FOR JOINING SOURCE TABLE ROWS WITH TARGET TABLE ROWS

Publication number: 20130226902

Abstract: A data processing system, and an article of manufacturing, join rows associated with a source table column with rows associated with a target table column. A source node and a target node contain the source and target tables, respectively. A reduced representation of selected rows associated with the source table column is generated, as is a representation of the target table column. A filtering module filters the generated reduced representation of selected rows associated with the source table column through the generated representation of the target table column, the filtered generated reduced representation of selected rows identifying source table rows that do not have to be joined with the target table. The rows associated with the source table column minus the filtered generated reduced representation of selected rows are joined to the rows associated with the target table column.

Type: Application

Filed: March 18, 2013

Publication date: August 29, 2013

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: INTERNATIONAL BUSINESS MACHINES CORPORATION
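
One way to picture the filtering step in the source/target join abstract above is a bit-vector pre-filter. The Python sketch below is an assumption-laden illustration, not the method claimed in the application. The target column is represented as a fixed-size bit array, the reduced source representation is a list of (row index, bucket) pairs, and all names (`bucket`, `target_representation`, `reduced_source_representation`, `rows_to_skip`, `filtered_join`) are hypothetical.

```python
NUM_BITS = 64  # assumed size of the target-side bit array


def bucket(value, num_bits=NUM_BITS):
    """Map a join-column value to a bit position (single hash, assumed)."""
    return hash(value) % num_bits


def target_representation(target_rows, key, num_bits=NUM_BITS):
    """Bit array with a bit set for every join value present in the target."""
    bits = [False] * num_bits
    for row in target_rows:
        bits[bucket(row[key], num_bits)] = True
    return bits


def reduced_source_representation(source_rows, key, num_bits=NUM_BITS):
    """Compact stand-in for the source rows: (row index, bucket) pairs."""
    return [(i, bucket(row[key], num_bits)) for i, row in enumerate(source_rows)]


def rows_to_skip(reduced, bits):
    """Source rows whose bucket is unset cannot match any target row."""
    return {i for i, b in reduced if not bits[b]}


def filtered_join(source_rows, target_rows, key):
    bits = target_representation(target_rows, key)
    reduced = reduced_source_representation(source_rows, key)
    skip = rows_to_skip(reduced, bits)
    # Only the remaining rows would need to be shipped to the target node.
    candidates = [row for i, row in enumerate(source_rows) if i not in skip]
    index = {}
    for t in target_rows:
        index.setdefault(t[key], []).append(t)
    return [{**s, **t} for s in candidates for t in index.get(s[key], [])]
```

The bit array can produce false positives (two values sharing a bucket) but never false negatives, so the exact join that follows still returns the correct result while fewer source rows take part in it.
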
Transforming correlated scalar subqueries

Patent number: 8521723

Abstract: A method, apparatus, and stored instructions are provided for transforming an initial query having a scalar subquery with correlated predicates into a transformed query having transformed predicates that are not in the scalar subquery. The transformed predicates correspond to the correlated predicates, but the transformed predicates are no longer correlated. The transformed predicates reference an outer join operation to preserve null values that may have been generated by the correlated predicates that were in the scalar subquery of the initial query. The transformed query is semantically equivalent to the initial query. When creating an execution plan for the transformed query, the query processor may choose from a much larger set of access paths, join methods, and join permutations than those that were available for the initial query. The transformed query may qualify for other transformations or optimizations, or may otherwise be executed or optimized differently from the initial query.

Type: Grant

Filed: July 18, 2012

Date of Patent: August 27, 2013

Assignee: Oracle International Corporation

Inventor: Rafi Ahmed
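
The kind of rewrite described in the "Transforming correlated scalar subqueries" abstract above can be illustrated with Python's standard `sqlite3` module. The schema, data, and both queries are hypothetical examples, and the transformed query is one well-known decorrelation pattern (aggregate view plus outer join), not necessarily the exact form produced by the patented method.

```python
import sqlite3

# Initial form: a scalar subquery correlated on d.id.
INITIAL = """
    SELECT d.name,
           (SELECT MAX(e.salary) FROM emp e WHERE e.dept_id = d.id) AS max_salary
    FROM dept d
    ORDER BY d.name
"""

# Transformed form: the correlation becomes an ordinary join predicate, and
# the outer join preserves the NULL for departments without employees.
TRANSFORMED = """
    SELECT d.name, v.max_salary
    FROM dept d
    LEFT OUTER JOIN (
        SELECT dept_id, MAX(salary) AS max_salary
        FROM emp
        GROUP BY dept_id
    ) v ON v.dept_id = d.id
    ORDER BY d.name
"""


def run_both():
    """Run both queries on a small in-memory database and return their rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE dept (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE emp (id INTEGER PRIMARY KEY, dept_id INTEGER, salary INTEGER);
        INSERT INTO dept VALUES (1, 'eng'), (2, 'ops'), (3, 'sales');
        INSERT INTO emp VALUES (1, 1, 100), (2, 1, 150), (3, 2, 90);
    """)
    initial = conn.execute(INITIAL).fetchall()
    transformed = conn.execute(TRANSFORMED).fetchall()
    conn.close()
    return initial, transformed
```

Both queries return the same rows on this data, including a NULL maximum for the department with no employees. An inner join would silently drop that department, which is why the outer join matters in the rewrite.
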
MULTIMODAL GRAPH MODELING AND COMPUTATION FOR SEARCH PROCESSES

Publication number: 20130218866

Abstract: Architecture that includes a multimodal graph modeling and computation system employed in a search framework. The framework utilizes entities to diversify and explore the results page. The multimodal graph modeling paradigm can include web modeling by way of a click graph, a web graph, a social graph, a geospatial graph, and an entity graph, for example. These graphs are then joined based on common properties such as links, clicks, and document entities. Computation can then be performed over the joined graphs to generate a related entity list and a related page list. These lists are then processed by a recommendation engine to provide recommendations to the user.

Type: Application

Filed: February 20, 2012

Publication date: August 22, 2013

Applicant: Microsoft Corporation

Inventors: Richard J. Qian, Xiaodong Fan
Parallel partitioning index scan

Patent number: 8515945

Abstract: Systems, methods and articles of manufacture for joining data in database tables, comprising: performing an index scan on a global index of a first database table, determining rows in the first database table that may be joined with a second database table based on a needed partitioning, wherein the needed partitioning is determined using an index scan, determining a number of partitions in the second database table, and joining each of the corresponding partitions in the first database table with a corresponding partition in the second database table.

Type: Grant

Filed: November 16, 2010

Date of Patent: August 20, 2013

Assignee: Sybase, Inc.

Inventors: Santosh Pendap, Sudipto Chowdhuri, Peter J. Schneider
