Patents by Inventor Caleb E. Welton
Caleb E. Welton has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11436224Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, and the master node may distribute this metadata with query plans to worker nodes. The worker nodes may request additional metadata from the master node when necessary.Type: GrantFiled: May 30, 2018Date of Patent: September 6, 2022Assignee: EMC IP Holding Company LLCInventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
-
Patent number: 11281669Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.Type: GrantFiled: January 9, 2020Date of Patent: March 22, 2022Assignee: EMC IP Holding Company LLCInventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
-
Patent number: 11164092Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel time series analysis. One of the methods includes receiving a plurality of data elements. The plurality of data elements are partitioned into a plurality of chunks, wherein the plurality of chunks, including a first chunk and a plurality of additional chunks, have an ordering according to the data elements included in each chunk. Each chunk is assigned to a particular segment of a plurality of segments. A first iteration of an autoregressive integrated moving average is computed for each chunk assigned to each segment. A second iteration of the autoregressive integrated moving average is computed for each chunk assigned to each segment, wherein computing uses the result data for a corresponding preceding chunk in the first iteration. One or more additional iterations of the autoregressive integrated moving average are computed until stopping criteria has been satisfied.Type: GrantFiled: January 13, 2020Date of Patent: November 2, 2021Assignee: Pivotal Software, Inc.Inventors: Hai Qian, Caleb E. Welton, Rahul Iyer, Shengwen Yang, Xixuan Feng
-
Patent number: 10922053Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.Type: GrantFiled: October 29, 2019Date of Patent: February 16, 2021Assignee: Pivotal Software, Inc.Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
-
Patent number: 10838960Abstract: Performing data analytics processing in the context of a large scale distributed system that includes a massively parallel processing (MPP) database and a distributed storage layer is disclosed. In various embodiments, a data analytics request is received. A plan is created to generate a response to the request. A corresponding portion of the plan is assigned to each of a plurality of distributed processing segments, including by invoking as indicated in the assignment one or more data analytical functions embedded in the processing segment.Type: GrantFiled: November 22, 2017Date of Patent: November 17, 2020Assignee: EMC IP Holding Company LLCInventors: Caleb E. Welton, Shengwen Yang
-
Publication number: 20200285973Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel time series analysis. One of the methods includes receiving a plurality of data elements. The plurality of data elements are partitioned into a plurality of chunks, wherein the plurality of chunks, including a first chunk and a plurality of additional chunks, have an ordering according to the data elements included in each chunk. Each chunk is assigned to a particular segment of a plurality of segments. A first iteration of an autoregressive integrated moving average is computed for each chunk assigned to each segment. A second iteration of the autoregressive integrated moving average is computed for each chunk assigned to each segment, wherein computing uses the result data for a corresponding preceding chunk in the first iteration. One or more additional iterations of the autoregressive integrated moving average are computed until stopping criteria has been satisfied.Type: ApplicationFiled: January 13, 2020Publication date: September 10, 2020Inventors: Hai Qian, Caleb E. Welton, Rahul Iyer, Shengwen Yang, Xixuan Feng
-
Publication number: 20200151179Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.Type: ApplicationFiled: January 9, 2020Publication date: May 14, 2020Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
-
Publication number: 20200065067Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.Type: ApplicationFiled: October 29, 2019Publication date: February 27, 2020Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
-
Patent number: 10572479Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.Type: GrantFiled: September 25, 2017Date of Patent: February 25, 2020Assignee: EMC IP Holding Company LLCInventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
-
Patent number: 10535008Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel time series analysis. One of the methods includes receiving a plurality of data elements. The plurality of data elements are partitioned into a plurality of chunks, wherein the plurality of chunks, including a first chunk and a plurality of additional chunks, have an ordering according to the data elements included in each chunk. Each chunk is assigned to a particular segment of a plurality of segments. A first iteration of an autoregressive integrated moving average is computed for each chunk assigned to each segment. A second iteration of the autoregressive integrated moving average is computed for each chunk assigned to each segment, wherein computing uses the result data for a corresponding preceding chunk in the first iteration. One or more additional iterations of the autoregressive integrated moving average are computed until stopping criteria has been satisfied.Type: GrantFiled: May 15, 2014Date of Patent: January 14, 2020Assignee: Pivotal Software, Inc.Inventors: Hai Qian, Caleb E. Welton, Rahul Iyer, Shengwen Yang, Xixuan Feng
-
Patent number: 10496375Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.Type: GrantFiled: August 27, 2018Date of Patent: December 3, 2019Assignee: Pivotal Software, Inc.Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
-
Patent number: 10445657Abstract: A general framework for cross-validation of any supervised learning algorithm on a distributed database comprises a multi-layer software architecture that implements training, prediction and metric functions in a C++ layer and iterates processing of different subsets of a data set with a plurality of different models in a Python layer. The best model is determined to be the one with the smallest average prediction error across all database segments.Type: GrantFiled: December 8, 2015Date of Patent: October 15, 2019Assignee: EMC IP Holding Company, LLCInventors: Hai Qian, Rahul Iyer, Shengwen Yang, Caleb E. Welton
-
Publication number: 20190012144Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.Type: ApplicationFiled: August 27, 2018Publication date: January 10, 2019Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
-
Publication number: 20180373755Abstract: Performing data analytics processing in the context of a large scale distributed system that includes a massively parallel processing (MPP) database and a distributed storage layer is disclosed. In various embodiments, a data analytics request is received. A plan is created to generate a response to the request. A corresponding portion of the plan is assigned to each of a plurality of distributed processing segments, including by invoking as indicated in the assignment one or more data analytical functions embedded in the processing segment.Type: ApplicationFiled: November 22, 2017Publication date: December 27, 2018Inventors: Caleb E. Welton, Shengwen Yang
-
Publication number: 20180276274Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, and the master node may distribute this metadata with query plans to worker nodes. The worker nodes may request additional metadata from the master node when necessary.Type: ApplicationFiled: May 30, 2018Publication date: September 27, 2018Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
-
Patent number: 10061562Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments, during a normal query plan dispatch process.Type: GrantFiled: September 28, 2013Date of Patent: August 28, 2018Assignee: Pivotal Software, Inc.Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
-
Patent number: 10013456Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, and the master node may distribute this metadata with query plans to worker nodes. The worker nodes may request additional metadata from the master node when necessary.Type: GrantFiled: August 3, 2016Date of Patent: July 3, 2018Assignee: EMC IP Holding Company LLCInventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
-
Patent number: 10002145Abstract: A specified data type and pre-storage operation are receive. In response, an association between the two is created in storage metadata. After the association is created, data to be stored is received where the data has a data type. In response to receiving the data, the storage metadata, including the association, is accessed and it is determined if the data type of the data is the same as the specified one. If so, the specified pre-storage operation is automatically performed on the data in order to obtain an output and the output is stored.Type: GrantFiled: April 12, 2016Date of Patent: June 19, 2018Assignee: EMC IP Holding Company LLCInventors: Gavin John Sherry, Caleb E. Welton
-
Publication number: 20180011902Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.Type: ApplicationFiled: September 25, 2017Publication date: January 11, 2018Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
-
Patent number: 9858315Abstract: Performing data analytics processing in the context of a large scale distributed system that includes a massively parallel processing (MPP) database and a distributed storage layer is disclosed. In various embodiments, a data analytics request is received. A plan is created to generate a response to the request. A corresponding portion of the plan is assigned to each of a plurality of distributed processing segments, including by invoking as indicated in the assignment one or more data analytical functions embedded in the processing segment.Type: GrantFiled: December 22, 2016Date of Patent: January 2, 2018Assignee: EMC IP Holding Company LLCInventors: Caleb E. Welton, Shengwen Yang