Query Optimization (epo) Patents (Class 707/E17.017)
-
Patent number: 11822546Abstract: The invention is a method, an apparatus and a computer program for the automatic processing of data identifiers (20, 22), wherein data (12) associated with data identifiers (20, 22) is transmitted to a cloud platform (16) and to an analysis unit (32) there having at least one functional unit (45-50), wherein each functional unit (45-50) performs an analysis of the data (12) and/or associated data identifier (20, 22) and outputs a confidence value as an analysis result, wherein the or each confidence value is mapped to an overall confidence value (60), wherein the overall confidence value encodes a correlation between two data identifiers (20, 22), wherein data identifiers (20, 22) determined as correlated are combined under a higher-order variable name (66), and wherein the higher-order variable name (66) can be used for a database query, which supplies data relating to all data identifiers (20, 22) combined under the variable name (66), wherein data identifiers (20, 22) having an overall confidence value (60Type: GrantFiled: October 15, 2019Date of Patent: November 21, 2023Assignee: Siemens AktiengesellschaftInventors: Parag Mogre, Philipp Pott
-
Patent number: 11797539Abstract: The present invention relates to hash join acceleration. In an embodiment, a computer receives a request to join build data rows with probe data rows. Each data row is associated with a respective join key value. From multiple buckets of a hash table, a bucket is selected that contains entries of multiple respective join key values in respective locations within the bucket. Whether or not the bucket contains an entry of the join key value of a build data row is detected. While building, multiple locations within the bucket are concurrently inspected to find an empty location in which to store the entry of the join key value of the build data row. The bucket is concurrently probed to detect whether or not the entries in the bucket include an entry of a join key value of a probe data row. For the join request, a response is sent that is based on the concurrent probing of the bucket.Type: GrantFiled: September 11, 2020Date of Patent: October 24, 2023Assignee: Oracle International CorporationInventors: Shasank Kisan Chavan, James Kearney, Weiwei Gong
-
Patent number: 11734324Abstract: The embodiments described herein enable the efficient storage and searching of large amounts of data to retrieve desired data with high accuracy and fast response time. The embodiments include for example parsing an input document containing input data into multiple sub-documents based on the structure of the input document and generating for each sub-document, a unique ID, pathway, and metadata. The embodiments also include storage of the ID and pathway for each sub-document in an index database and storage of the input document itself in a separate database. When a query for information from the input document is received, the ID and pathway of the sub document containing the queried information are identified and used to locate the queried information from the input document.Type: GrantFiled: April 16, 2021Date of Patent: August 22, 2023Assignee: Walmart Apollo, LLCInventors: Shiva K. Potu, Madhavan Kandhadai Vasantham, Srinivasan Venkatesan, Yashwanth Thumallapalle
-
Patent number: 11734274Abstract: the present disclosure relates to systems, methods, and computer-readable media for optimizing and implementing operator trees based on a received query. For example, systems disclosed herein may generate an operator tree based on a received query. The systems described herein may systematically analyze the impact of bitvector filters in optimizing a join order of the operator tree to generate an optimized operator tree. The systems described herein may further implement the bit-vector aware operator tree by providing the optimized operator tree to an execution engine for further processing.Type: GrantFiled: June 30, 2020Date of Patent: August 22, 2023Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Bailu Ding, Vivek Ravindranath Narasayya, Surajit Chaudhuri
-
Patent number: 11727698Abstract: An example method of detecting an element using an autonomous vehicle includes the following operations: using a sensor on the autonomous vehicle to capture image data in a region of interest containing the element, where the image data represents components of the element; filtering the image data to produce filtered data having less of an amount of data than the image data; identifying the components of the element by analyzing the filtered data using a deterministic process; and detecting the element based on the components.Type: GrantFiled: May 11, 2020Date of Patent: August 15, 2023Assignee: AUTOGUIDE, LLCInventors: Syed Mohammed Jerrar Bukhari, Tyler Barron, Justin Holwell
-
Patent number: 11704100Abstract: Embodiments described herein provide techniques for in-line compiling of UDFs in other programming languages. These techniques enable faster programming iterations because it allows users to compile directly in the cloud processing system. Moreover, it allows the UDFs to tie into existing libraries. The compiled results are treated as sensitive and handled with appropriate security policies, as with any other user data in the system.Type: GrantFiled: August 31, 2022Date of Patent: July 18, 2023Assignee: Snowflake Inc.Inventors: Elliott Brossard, Siddharth Kulkarni, Isaac Kunen
-
Patent number: 11663179Abstract: A simulated dataset is queried for regression by validating a structured query language (SQL) statement, determining a pattern type of the SQL statement, reconstructing the SQL statement according to a predetermined process for the pattern type, creating a mutated SQL statement for querying a simulated dataset, and validating the mutated SQL statement. The simulated dataset is based on a confidential dataset having the confidential elements removed or replaced.Type: GrantFiled: December 21, 2020Date of Patent: May 30, 2023Assignee: International Business Machines CorporationInventors: Shuo Li, Xiaobo Wang, Sheng Yan Sun, Hong Mei Zhang
-
Patent number: 11651050Abstract: Methods and systems predict parameters in a dataset of an identified piece of (“information technology”) IT equipment. An automated method identifies datasets IT equipment in a same category of IT equipment as a piece of IT equipment identified as having incomplete dataset information. Each dataset of IT equipment parameters is used to construct generalized linear models of different classes of IT equipment within the category of IT equipment. The class of the identified IT equipment is determined. A predicted equipment parameter of incomplete information of the identified piece of IT equipment is computed using the generalized linear model associated with the class. The predicted equipment parameter can be used to complete the dataset of the identified piece of IT equipment.Type: GrantFiled: May 5, 2020Date of Patent: May 16, 2023Assignee: VMware, Inc.Inventors: Chandrashekhar Jha, Jobin George, Prateek Sahu, Kumar Gaurav, Jusvinder Singh
-
Patent number: 11599556Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization/Furthermore, the device processes the set of queries using the updated set of processors.Type: GrantFiled: February 4, 2022Date of Patent: March 7, 2023Assignee: Snowflake Inc.Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
-
Patent number: 11580103Abstract: Joining data using a disjunctive operator using a lookup table is described. An example computer-implemented method can include receiving a query with a set of conjunctive predicates and a set of disjunctive predicates. The method may also include generating a lookup table for each predicate in the sets of conjunctive predicates and disjunctive predicates. The method, for each row in a probe-side table, may also further include looking up a value associated with that row in each of the lookup tables and adding the row to a results set when there is a match. Additionally, the method may also include returning the results set.Type: GrantFiled: April 20, 2021Date of Patent: February 14, 2023Assignee: Snowflake Inc.Inventors: Thierry Cruanes, Florian Andreas Funke, Guangyan Hu, Jiaqi Yan
-
Patent number: 11500900Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization. Furthermore, the device processes the set of queries using the updated set of processors.Type: GrantFiled: January 4, 2022Date of Patent: November 15, 2022Assignee: Snowflake Inc.Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
-
Patent number: 11436230Abstract: In a multi-tenant computing system, a cross-tenant user search system receives a user search input from a user using a client of a first tenant. The cross-tenant user search system accesses a policy that identifies one or more related tenants that are related to the first tenant. The policy also identifies directory records of users in the related tenant that can be searched by users of the first tenant. The cross-tenant user search system executes a search against a directory for the first tenant and executes a separate search against the directory records in the related tenants, identified in the policy. The search results from searching the directory for the first tenant and the related search results from searching the directory records in the tenant are aggregated into an aggregated set of search results which are then returned to the client.Type: GrantFiled: April 29, 2021Date of Patent: September 6, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Shankaranand Arunachalam, Robert Allen Lowe, Jason Craig Nelson, Paul Baecke, Xiaoqin Zhu, Ganesh Pandey, Parul Manek
-
Patent number: 10929446Abstract: According to one embodiment, a document search apparatus includes the following elements. The query word extraction unit extracts, from an input sentence, a word for use in a search as a query word. The intention estimation unit estimates a search intention based on the input sentence. The query generator acquires, from a search expression list, a search expression corresponding to the estimated search intention, acquires, from an additional query word list, an additional query word corresponding to a set of the acquired search expression and the extracted query word, and generates a query based on the extracted query word, the acquired additional query word, and the acquired search expression. The search unit searches for a document that matches the query.Type: GrantFiled: January 31, 2017Date of Patent: February 23, 2021Assignee: KABUSHIKI KAISHA TOSHIBAInventors: Yuka Kobayashi, Tomomi Tsubota
-
Patent number: 10824677Abstract: Methods, systems and programming for providing query suggestions. In one example, a search query is received from a user. At least one rewritten search query is generated by replacing at least some text in the search query with a synonym of the at least some text. The synonym of the at least some text is obtained based on a plurality of previously submitted search queries. A plurality of query suggestions is obtained based on the search query and the at least one rewritten search query. At least a portion of the plurality of query suggestions is provided to the user.Type: GrantFiled: August 29, 2016Date of Patent: November 3, 2020Assignee: Oath Inc.Inventors: Zhongqiang Chen, Yuan Zhang, Shenhong Zhu
-
Patent number: 10725800Abstract: Aspects of the present disclosure relate to systems and methods that help a user manage a functional product. Every functional product, be it a service, device, or combination thereof, has one or more documents associated with it. These documents may include such documentation as: (1) Release Notes; (2) Configuration Guides; (3) command line interfaces (CLIs)/application program interfaces (APIs); (4) Data Sheets; (5) Installation Guides; (6) User Manuals; (7) Errata notices; and (8) other documentation. In embodiments, command extracted from the documents along with features available in the functional product and user history (if any) may be used to help suggest one or more commands for a user managing or using the functional product.Type: GrantFiled: September 16, 2016Date of Patent: July 28, 2020Assignee: DELL PRODUCTS L.P.Inventors: Vinay Sawal, Sachinrao Chittaranjan Panemangalore, Vivek Dharmadhikari, Kuntal Atulbhai Patel, Gavin Cato, Krishnamurthy Subramanian, Joseph White
-
Patent number: 10635727Abstract: Embodiments of the present disclosure relate generally to semantic indexing to improve search results of a large corpus. Some embodiments identify one or more closest matches between (i) a search semantic vector that corresponds to a search query, the search semantic vector based on a first machine-learned model that projects the search query into a semantic vector space, and (ii) a plurality of publication vectors corresponding to respective publications in the publication corpus, the plurality of publication vectors based on a second machine-learned model that projects the plurality of publication vectors into the semantic vector space.Type: GrantFiled: February 22, 2017Date of Patent: April 28, 2020Assignee: eBay Inc.Inventors: Mingkuan Liu, Hao Zhang, Xianjing Liu, Alan Qing Lu
-
Patent number: 10613735Abstract: Systems and methods for preloading an amount of content based on user scrolling are disclosed. A body of content may be presented that takes up a certain amount of display space within a graphical user interface. Scroll information characterizing user scrolling within the graphical user interface may be obtained. A portion of the body of content outside a field of view of the graphical user interface may be determined based on the scroll information.Type: GrantFiled: April 4, 2018Date of Patent: April 7, 2020Assignee: Asana, Inc.Inventors: Ummon Karpe, Louis Lafair
-
Patent number: 10372736Abstract: A computing system receives as inputs data records stored in a database. The computing parses the data records into file pairs that each include a keyword file and record ID file and merge file pairs into a keyword file and record ID file, where the keyword file includes keywords in sorted order, and where the record ID file includes a list of record IDs for keywords in the keyword file. The computing system further creates an offset file which stores offset values for starting addresses of record ID lists in the record ID file, and generates an index of keywords by assigning unique identifiers to keywords in the keyword file. The computing system also provides a query interface that allows the database's data records to be searched using the generated index of keywords.Type: GrantFiled: May 16, 2014Date of Patent: August 6, 2019Assignee: University of Utah Research FoundationInventors: Feifei Li, Cody Hansen
-
Patent number: 10216859Abstract: In one embodiment, a method includes by a client computing device, sending a request for two or more categories of content for display in a user interface, wherein each of the categories of content are received from a different content source, and wherein each of the categories of content are displayed in a corresponding region of the user interface. The method further includes determining a space requirement for each of the categories of content, wherein the determining is based at least in part on information in the request and information stored on the client computing device. The method further includes allocating, based on the space requirements, space for each of the regions of the user interface, and displaying each of the categories of content in its corresponding region as the content is received.Type: GrantFiled: March 22, 2016Date of Patent: February 26, 2019Assignee: Facebook, Inc.Inventor: Yen-Ting Tung
-
Patent number: 10061767Abstract: Methods and apparatus are described herein for classifying user reviews or portions thereof as being related to various entities, and for associating extracted descriptive segments of text contained in those user reviews or portions thereof with entities based on the classifications. In various implementations, one or more categories of observed user interest may be identified based on a corpus of user queries. One or more segments of text related to the one or more categories of observed user interest may be detected in one or more user reviews associated with a product. Based on the detecting, the product may be indexed on the one or more categories of observed user interest in a searchable database. In some implementations, the searchable database may be accessible to one or more remote client devices, and may be searchable by the one or more categories of observed user interest to provide search results to be rendered by the one or more remote client devices.Type: GrantFiled: June 16, 2017Date of Patent: August 28, 2018Assignee: GOOGLE LLCInventors: Advay Mengle, Jindong Chen, Charmaine Cynthia Rose D'Silva, Anna Patterson
-
Patent number: 9672537Abstract: A dynamic content controller is configured for communication with one or more data sources. The dynamic content controller comprises an analytics engine and a personalization engine coupled to the analytics engine. The analytics engine is configured to analyze cultural data collected from the one or more data sources during a current content browsing session. The personalization engine is configured to adapt content to be presented based at least in part on the analysis of the cultural data. The adaptation of the content to be presented is performed during the current content browsing session.Type: GrantFiled: December 11, 2014Date of Patent: June 6, 2017Assignee: EMC IP Holding Company LLCInventors: David Dietrich, Ronald Wilfred Reidy, Beibei Yang, Barry William Heller
-
Patent number: 9443016Abstract: A system and method for a collection of digital items that includes generating a collection of result items, wherein each result item corresponds to result data of at least one query input, and wherein at least one result item includes a context property; storing the collection of result items as a first instance; and augmenting the collection in a second instance through at least the context property.Type: GrantFiled: February 10, 2014Date of Patent: September 13, 2016Assignee: Verbify Inc.Inventor: Bobby Lo
-
Patent number: 8954419Abstract: Techniques are provided for improving performance of parallel execution plans that include serial operations. A database server generates a parallel execution plan and includes one or more serial data flow objects in the parallel execution plan, each data flow object including one or more serial row sources. A parallel selector row source is associated with each serial data flow object included in the parallel execution plan. Each parallel selector row source coordinates with a query coordinator process the execution of the associated serial data flow object. In the event that that a plurality of serial data flow objects are included in a parallel execution plan, serial row sources included in the serial data flow objects may be executed concurrently by a separate parallel slave process, instead of sequential execution of each serial row source by the query coordinator process.Type: GrantFiled: May 22, 2012Date of Patent: February 10, 2015Assignee: Oracle International CorporationInventors: Himani Apte, Thierry Cruanes
-
Patent number: 8903801Abstract: Techniques are provided for a fully-automated process for tuning database query language statements that selects database query language statements for tuning, tunes the database query language statements and generates tuning recommendations, tests the tuning recommendations, and determines whether to implement the tuning recommendations based on the test results. The fully-automated tuning process may also automatically implement certain tuning recommendations and monitor the performance of the database query language statements for which tuning recommendations have been implemented.Type: GrantFiled: August 8, 2008Date of Patent: December 2, 2014Assignee: Oracle International CorporationInventors: Peter Belknap, Benoit Dageville, Karl Dias, Khaled Yagoub
-
Patent number: 8898146Abstract: A method and device may accept a first database query plan including a plurality of rows and a second database query plan including a plurality of rows, compare each row in the first database query plan with a corresponding row in the second database query plan, and based on each comparison, calculate a difference between the compared rows. Each difference may be recorded in an output file.Type: GrantFiled: September 22, 2010Date of Patent: November 25, 2014Assignee: Hewlett-Packard Development Company, L.P.Inventors: Maria G. Castellanos, Perla Ruiz
-
Patent number: 8862625Abstract: Embodiments of the present invention provide hardware-friendly indexing of databases. In particular, forward and reverse indexing are utilized to allow for easy traversal of primary key to foreign key relationships. A novel structure known as a hit list also allows for easy scanning of various indexes in hardware. Group indexing is provided for flexible support of complex group key definition, such as for date range indexing and text indexing. A Replicated Reordered Column (RRC) may also be added to the group index to convert random I/O pattern into sequential I/O of only needed column elements.Type: GrantFiled: April 7, 2008Date of Patent: October 14, 2014Assignee: Teradata US, Inc.Inventors: Krishnan Meiyyappan, Liuxi Yang, Jeremy Branscome, Michael Corwin, Ravi Krishnamurthy, Kapil Surlaker, James Shau, Joseph I. Chamdani
-
Patent number: 8788484Abstract: Certain example embodiments described herein relate to techniques for user feedback driven dynamic query rewriting in complex event processing (CEP) environments. In certain example embodiments, the adjustment of already-initiated queries may be permitted during runtime in order to help continuously or substantially continuously meet a user's requirements. In certain example embodiments, parameters of a query may be adapted dynamically, e.g., without the need to redeploy it. Adaptable parameters in certain example embodiments may be identified and/or based on, for instance, an automatic determination from original query creation, user specifications, the way in which a user interacts with a user interface for visualizing the results of queries, etc.Type: GrantFiled: December 27, 2010Date of Patent: July 22, 2014Assignee: Software AGInventors: Michael Cammert, Christoph Heinz, Jürgen Krämer, Tobias Riemenschneider
-
Patent number: 8756120Abstract: A hybrid context information matching approach may produce a customized product catalogue based on the user's context and the mobile device the user is using. A Knowledge Base (KB) and a KB manager, along with various processes perform specific collaborative tasks in order to achieve the overall goal of producing a customized product catalogue. In addition, the effort builds and/or updates the KB. Various contextual inputs are provided from both the user environment and data repositories. Hybrid matching is performed in order to determine optimal search results based on the contextual input provided.Type: GrantFiled: November 1, 2011Date of Patent: June 17, 2014Assignee: SAP AGInventors: Felix Ntawanga, Ernest Ketcha Ngassam, Jan Harm Petrus Eloff, Marek Piotr Zielinski
-
Patent number: 8738651Abstract: A technique for cataloging documents based on user activity includes assigning documents to a relevant document list based on activity of a user of a device. In this case, at least two of the documents are associated with different applications. The technique then provides the relevant document list to the user.Type: GrantFiled: March 6, 2008Date of Patent: May 27, 2014Assignee: Lenovo (Singapore) Pte LtdInventors: Jennifer G. Zawacki, David C. Challener, Justin T. Dubs, James J. Thrasher
-
Publication number: 20140136513Abstract: A database system maintains a feature set of a modern database system while operating directly on raw data files. Systems may use an adaptive indexing mechanism that maintains positional information to provide efficient access to raw data files, a flexible caching structure, and techniques for selective parsing and selective tokenizing. In doing so, possible performance bottlenecks associated with repeated parsing, tokenizing, and expensive data type conversion costs can be overcome.Type: ApplicationFiled: November 15, 2012Publication date: May 15, 2014Applicant: Ecole Polytechnique Fédérale de Lausanne (EPFL)Inventors: Anastasia Ailamaki, Stratos Idreos, Ioannis Alagiannis, Renata Borovica, Miguel Sergio De Oliveira Branco
-
Publication number: 20140136511Abstract: Detection and use of relationships in tabular data is enhanced. A foreign-key-based relationship between two tables in memory is automatically discovered by a module. A view over table data is computationally augmented by adding an expandable hierarchical column. The expandable hierarchical column includes second table data that is related to rows of a first table by the discovered foreign-key-based relationship. A naming scheme for the added column is described. Sometimes the augmentation places an AddJoinColumn operator in a query expression tree; sometimes the tree is rewritten by moving a query operator or replacing a specified pattern of operators by a flat Join operation between two tables. Column expansion may be followed by reshaping data into a flat table, by getting a homogeneous query, by performing a join, or other operations.Type: ApplicationFiled: November 15, 2012Publication date: May 15, 2014Applicant: MICROSOFT CORPORATIONInventors: Gregory Lawrence Hughes, Clemens Alden Szyperski, Michael James Coulson
-
Publication number: 20140136512Abstract: Embodiments of the present invention provide a method, system and computer program product for pre-migration performance prediction of a database management system (DBMS). In an embodiment of the invention, a method for pre-migration performance prediction of a DBMS can include executing a calibration workload in a target DBMS to produce a conversion factor of cost of executing the calibration workload to temporal performance of executing the calibration workload. The method also can include subsequently submitting a sample workload from a database of a source DBMS for evaluation of cost of execution on an empty replica of the database in the target DBMS. Finally, the method can include predicting a temporal performance of the sample workload in the target DBMS as a product of the conversion factor and the cost of execution of the sample workload on the empty replica of the database in the target DBMS.Type: ApplicationFiled: November 9, 2012Publication date: May 15, 2014Inventors: Austin Clifford, Enda McCallig
-
Publication number: 20140114952Abstract: The present invention extends to methods, systems, and computer program products for optimizing queries of parallel databases. Queries can be partially optimized at an optimizer that is unaware of its use to optimize queries for parallel processing. The optimizer can produce a data structure (e.g., a SQL Server MEMO) that encapsulates a logical serial plan search space. The logical serial plan search space may not incorporate any notion of parallelism into the plan space itself. A parallel-aware optimizer can parallelize the logical serial plan search space by augmenting the data structure (e.g., transforming the SQL Server MEMO into a parallel MEMO). Augmentation can be with data movement operations that move data associated one or more compute nodes in a distributed architecture. Cost estimates can be calculated for the operations contained in the parallelized data structure. The parallel plan with the lowest estimated cost can be selected for the query.Type: ApplicationFiled: October 23, 2012Publication date: April 24, 2014Applicant: MICROSOFT CORPORATIONInventors: Eric R. Robinson, Alan Dale Halverson, Rimma V. Nehme, Srinath Shankar
-
Publication number: 20140114942Abstract: A search index for a collection of documents includes a plurality of keywords associated with the documents. Access to individual documents is detected based on searches employing the search index and keywords are recorded that are utilized in the searches and resulted in document access. The search index is modified to maintain the recorded keywords and remove keywords absent from the searches resulting in the document access.Type: ApplicationFiled: October 23, 2012Publication date: April 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Igor L. Belakovskiy, Matthew E. Broomhall, Itzhack Goldberg, Boaz Mizrachi, Neil Sondhi
-
Publication number: 20140108378Abstract: A technique for factoring uncertainty into cost-based query optimization includes: determining the degree of uncertainty involved in the cost estimates for the query, determining the degree of sensitivity the query has to that uncertainty, and determining if there is an access path that performs well across the range of possible conditions that could occur at execution time, reducing the risk of performance spikes and performance volatility. If such an access path exists, select that access path; if not, perform parametric query optimization or query re-optimization.Type: ApplicationFiled: October 17, 2012Publication date: April 17, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Jason L. Alpers, Thomas A. Beavin, Patrick D. Bossman, Shuanglin Guo, Terence P. Purcell
-
Publication number: 20140108369Abstract: Systems and method of the present invention provide for one or more server computers configured to receive one or more keywords topically relevant to a content of a web page, request from a search engine a first metric comprising a quantity of times the keywords have appeared in a search query with one or more question keywords during a time period and a second metric comprising a probability of receiving a high rank associated with the one or more keywords and the one or more question keywords, receive, from the search engine, the first metric and the second metric, calculate a keyword effectiveness index from the first metric and the second metric, and generate and transmit to a client computer one or more recommendations to include a high ranked suggested content on the web page according to the keyword effectiveness index.Type: ApplicationFiled: October 11, 2012Publication date: April 17, 2014Applicant: Go Daddy Operating Company, LLCInventor: Rajinder Nijjer
-
Publication number: 20140101128Abstract: In an embodiment, a recursive query is received that comprises a first select statement with a seed select statement and a second select statement with a recursive reference, wherein the recursive query further identifies at least two columns in at least one table, wherein the at least two columns have parent-child relationships represented by nodes in a graph, wherein the graph represents the organization of values in rows in the at least one table. A target recursion depth is calculated for the graph based on a fanout of the graph. In an embodiment, the target recursion depth is calculated by summing the fanout at each recursion depth of the graph multiplied by a number of nodes at each recursion depth of the graph. An estimated number of rows that the recursive query will return is estimated based on the target recursion depth.Type: ApplicationFiled: October 5, 2012Publication date: April 10, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Lynnette E. Carston, Brian R. Muras, Andrew P. Passe
-
Publication number: 20140101131Abstract: In an embodiment, a hit percentage of an expected affinity for a first query is calculated, wherein the expected affinity comprises a first address range in a query plan cache, a hit percentage of a candidate affinity for the first query is calculated, wherein the candidate affinity comprises a second address range in a query plan cache, and if the hit percentage of the candidate affinity is greater than the hit percentage of the expected affinity by more than a threshold amount, query plans in the candidate affinity are swapped with query plans in the expected affinity.Type: ApplicationFiled: October 8, 2012Publication date: April 10, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Rafal P. Konik, Roger A. Mittelstadt, Brian R. Muras, Mark W. Theuer
-
Publication number: 20140089294Abstract: Presented is a system and method for evaluating relational database queries in a distributed system. An optimized query plan is received by a control node. The query plan is decoded to a SQL statement that is semantically equivalent to the query plan, even though the query plan contains elements that have no direct analogue in SQL. The decoded SQL is transmitted to a compute node for execution.Type: ApplicationFiled: September 26, 2012Publication date: March 27, 2014Applicant: MICROSOFT CORPORATIONInventors: Srinath Shankar, Rimma Nehme
-
Publication number: 20140074819Abstract: A method for providing optimized data representation of relations for in-memory database query processing is disclosed. The method seeks to optimize the use of the available memory by encoding relations on which the in-memory database query processing is performed and by employing auxiliary structures to maintain performance. Relations are encoded based on data patterns in one or more attribute-columns of the relation and the encoding that is selected is suited to a particular type of data in the column. Members of a set of auxiliary structures are selected based on the benefit the structure can provide and the cost of the structure in terms of the amount of memory used. Encoding of the relations is performed in real-time while query processing occurs, using locks to eliminate conflicts between the query processing and encoding.Type: ApplicationFiled: September 12, 2012Publication date: March 13, 2014Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: Sam Idicula, Kevin Moore, Brian Gold, Nipun Agarwal, Eric Sedlar
-
Publication number: 20140074818Abstract: A system joins predicate evaluated column bitmaps having varying lengths. The system includes a column unifier for querying column values with a predicate and generating an indicator bit for each of the column values that is then joined with the respective column value. The system also includes a bitmap generator for creating a column-major linear bitmap from the column values and indicator bits. The column unifier also determines an offset between adjacent indicator bits. The system also includes a converter for multiplying the column-major linear bitmap with a multiplier to shift the indicator bits into consecutive positions in the linear bitmap.Type: ApplicationFiled: September 13, 2012Publication date: March 13, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ronald J. Barber, Min-Soo Kim, Jae Gil Lee, Sam S. Lightstone, Guy M. Lohman, Lin Qiao, Vijayshankar Raman, Richard S. Sidle
-
Publication number: 20140067791Abstract: A system and method for improving performance of join operations is disclosed. The method in one embodiment includes building a graph of vertices and edges, which represent the tables and join conditions for a particular database schema. The method then traverses the graph to find a set of candidate join operations. The method further includes selecting from the set of subset of join operations that makes the best use of a given amount of memory, which means finding a collection of denormalizations which when residing in memory provide the most benefit and the least cost for the given amount of memory. The method then deploys the denormalizations so that current queries in a query processing system can use them instead of actually performing the joins.Type: ApplicationFiled: September 6, 2012Publication date: March 6, 2014Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: Sam Idicula, Sabina Petride, Nipun Agarwal
-
Publication number: 20140067789Abstract: A method, apparatus, and stored instructions are provided for transforming a query representation by unnesting a predicate condition that is based on whether or not a result exists for a subquery of the predicate condition. An initial query representation is received. The initial query representation represents an initial query that includes an EXISTS-equivalent predicate or a NOT-EXISTS-equivalent predicate and at least one other predicate in a disjunction. The initial query representation is transformed into a semantically equivalent transformed query representation that represents a transformed query. The transformed query includes, instead of the EXISTS-equivalent predicate or a NOT-EXISTS-equivalent predicate, a join operator that references the data object.Type: ApplicationFiled: September 4, 2012Publication date: March 6, 2014Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: Rafi Ahmed, Chun-Chieh Lin, Mohamed Zait
-
Publication number: 20140067792Abstract: A system includes a coordinator node and worker nodes in communication with the coordinator node. Each worker node stores data blocks. Each data block has data in a semi-structured format and each data block has an associated foreign table declaration specifying conversion of the data in the semi-structured format into a tabular format interpretable by a query language. A query processor executed by the coordinator node produces a distributed query plan in response to a query language query. The distributed query plan includes sub-queries. The sub-queries are executed by selected worker nodes of the worker nodes. The selected worker nodes use foreign table declarations to convert data in semi-structured formats into tabular formats of a distributed database to provide tabular data in response to the query language query.Type: ApplicationFiled: August 30, 2012Publication date: March 6, 2014Applicant: CITUS DATA BILGI ISLEMLERI TICARET A.S.Inventors: Ozgun Ali Erdogan, Sumedh Suhas Pathak, Hadi Moshayedi
-
Publication number: 20140059033Abstract: Systems, methods, and apparatuses, including computer program products, are provided for machine translation using information retrieval techniques. In general, in one implementation, a method is provided. The method includes providing a received input segment as a query to a search engine, the search engine searching an index of one or more collections of documents, receiving one or more candidate segments in response to the query, determining a similarity of each candidate segment to the received input segment, and for one or more candidate segments having a determined similarity that exceeds a threshold similarity, providing a translated target segment corresponding to the respective candidate segment.Type: ApplicationFiled: April 23, 2008Publication date: February 27, 2014Applicant: Google Inc.Inventors: Hayden Shaw, Thorsten Brants
-
Publication number: 20140046928Abstract: In an embodiment, a first query is received that specifies a first object identifier. If a first query plan exists that implements the first query, except that the first query plan does not comprise the first object identifier and instead comprises a parameter marker in place of the first object identifier, a first query execution plan is created from the first query plan, substituting the first object identifier in the first query execution plan for the parameter marker, and the first query execution plan is executed to read a first object identified by the first object identifier.Type: ApplicationFiled: August 9, 2012Publication date: February 13, 2014Applicant: International Business Machines CorporationInventors: Rafal P. Konik, Roger A. Mittelstadt, Brian R. Muras, Mark W. Theuer
-
Publication number: 20140025658Abstract: A method, system, and computer program product for validating database table partitioning from partition advisors. The method commences by receiving a workload comprising a plurality of queries, then analyzes the queries to determine stratification buckets based on the usage of tables in the queries. Further analysis of the queries results in assigning the queries into one or more of the stratification buckets from which buckets a number n of queries (n being smaller than the total number of queries in the received workload) are drawn from the stratification buckets to form a representative workload having a confidence interval C and a margin of error M. Now, having a representative workload that is smaller, yet statistically representative of the received workload, a computer evaluates each of a plurality of partition schemes using the representative workload to determine an optimal partitioning scheme. The confidence interval C can be increased or decreased.Type: ApplicationFiled: July 17, 2012Publication date: January 23, 2014Applicant: Oracle International CorporationInventors: Murali THIYAGARAJAN, Praveen T.J. Kumar
-
Patent number: 8635207Abstract: A networked data processor maintains a database of information concerning potential evidence from litigants in legal proceedings supporting stipulated discovery agreements, and requiring meet-and-confer sessions prior to seeking court supervision. Individual or adverse parties can participate. Sources include data custodians, file repositories, electronic data, witnesses, etc. Managers and employees are polled to populate a database defining the litigant's organization and evidence. The evidence may be sequestered, copied and processed, e.g., filtered for confidentiality or privilege, analyzed as to format, and queried to assess the volume of data that would be responsive under alternative discovery specifications. Cost and time are assessed under alternative specifications and reports are provided for use in negotiating a discovery plan. A stipulated discovery plan may result, or if not, a meet-and-confer session is electronically managed in a multi-user teleconference.Type: GrantFiled: January 21, 2011Date of Patent: January 21, 2014Assignee: 26-F, LLCInventor: Michael Marlin
-
Patent number: 8626789Abstract: Geocoding architecture based on information retrieval. The geocoding information retrieval system operates on locations and geographic entities. The architecture uses parameters in addition to the query string, if available. The parameters provide contextual information that enables filtering and sorting the results. The additional parameters include market, culture, map view, and user location. A geocoding triage process uses the query and context information provided and output of a machine-learning parser to select the data-lookup geocoders for processing the query. A results processing component collects the results from the underlying data-lookup geocoders, merges the results into a single list after normalizing the scores, and then re-sorts the results list.Type: GrantFiled: June 1, 2007Date of Patent: January 7, 2014Assignee: Microsoft CorporationInventors: Harish Jayanti, Eugene Shinn
-
Publication number: 20140006381Abstract: Embodiments of the present invention provide query processing for column stores by accumulating table record attributes during application of query plan operators on a table. The attributes and associated attribute values are compacted when said attribute values are to be consumed for an operation in the query plan, during the execution of the query plan. Table column record values are materialized late in query plan execution.Type: ApplicationFiled: June 29, 2012Publication date: January 2, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ronald J. Barber, Lin Qiao, Vijayshankar Raman, Richard S. Sidle