Patents by Inventor Caleb E. Welton

Caleb E. Welton has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11436224
    Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, and the master node may distribute this metadata with query plans to worker nodes. The worker nodes may request additional metadata from the master node when necessary.
    Type: Grant
    Filed: May 30, 2018
    Date of Patent: September 6, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
  • Patent number: 11281669
    Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.
    Type: Grant
    Filed: January 9, 2020
    Date of Patent: March 22, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
  • Patent number: 11164092
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel time series analysis. One of the methods includes receiving a plurality of data elements. The plurality of data elements are partitioned into a plurality of chunks, wherein the plurality of chunks, including a first chunk and a plurality of additional chunks, have an ordering according to the data elements included in each chunk. Each chunk is assigned to a particular segment of a plurality of segments. A first iteration of an autoregressive integrated moving average is computed for each chunk assigned to each segment. A second iteration of the autoregressive integrated moving average is computed for each chunk assigned to each segment, wherein computing uses the result data for a corresponding preceding chunk in the first iteration. One or more additional iterations of the autoregressive integrated moving average are computed until stopping criteria has been satisfied.
    Type: Grant
    Filed: January 13, 2020
    Date of Patent: November 2, 2021
    Assignee: Pivotal Software, Inc.
    Inventors: Hai Qian, Caleb E. Welton, Rahul Iyer, Shengwen Yang, Xixuan Feng
  • Patent number: 10922053
    Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.
    Type: Grant
    Filed: October 29, 2019
    Date of Patent: February 16, 2021
    Assignee: Pivotal Software, Inc.
    Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
  • Patent number: 10838960
    Abstract: Performing data analytics processing in the context of a large scale distributed system that includes a massively parallel processing (MPP) database and a distributed storage layer is disclosed. In various embodiments, a data analytics request is received. A plan is created to generate a response to the request. A corresponding portion of the plan is assigned to each of a plurality of distributed processing segments, including by invoking as indicated in the assignment one or more data analytical functions embedded in the processing segment.
    Type: Grant
    Filed: November 22, 2017
    Date of Patent: November 17, 2020
    Assignee: EMC IP Holding Company LLC
    Inventors: Caleb E. Welton, Shengwen Yang
  • Publication number: 20200285973
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel time series analysis. One of the methods includes receiving a plurality of data elements. The plurality of data elements are partitioned into a plurality of chunks, wherein the plurality of chunks, including a first chunk and a plurality of additional chunks, have an ordering according to the data elements included in each chunk. Each chunk is assigned to a particular segment of a plurality of segments. A first iteration of an autoregressive integrated moving average is computed for each chunk assigned to each segment. A second iteration of the autoregressive integrated moving average is computed for each chunk assigned to each segment, wherein computing uses the result data for a corresponding preceding chunk in the first iteration. One or more additional iterations of the autoregressive integrated moving average are computed until stopping criteria has been satisfied.
    Type: Application
    Filed: January 13, 2020
    Publication date: September 10, 2020
    Inventors: Hai Qian, Caleb E. Welton, Rahul Iyer, Shengwen Yang, Xixuan Feng
  • Publication number: 20200151179
    Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.
    Type: Application
    Filed: January 9, 2020
    Publication date: May 14, 2020
    Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
  • Publication number: 20200065067
    Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.
    Type: Application
    Filed: October 29, 2019
    Publication date: February 27, 2020
    Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
  • Patent number: 10572479
    Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.
    Type: Grant
    Filed: September 25, 2017
    Date of Patent: February 25, 2020
    Assignee: EMC IP Holding Company LLC
    Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
  • Patent number: 10535008
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel time series analysis. One of the methods includes receiving a plurality of data elements. The plurality of data elements are partitioned into a plurality of chunks, wherein the plurality of chunks, including a first chunk and a plurality of additional chunks, have an ordering according to the data elements included in each chunk. Each chunk is assigned to a particular segment of a plurality of segments. A first iteration of an autoregressive integrated moving average is computed for each chunk assigned to each segment. A second iteration of the autoregressive integrated moving average is computed for each chunk assigned to each segment, wherein computing uses the result data for a corresponding preceding chunk in the first iteration. One or more additional iterations of the autoregressive integrated moving average are computed until stopping criteria has been satisfied.
    Type: Grant
    Filed: May 15, 2014
    Date of Patent: January 14, 2020
    Assignee: Pivotal Software, Inc.
    Inventors: Hai Qian, Caleb E. Welton, Rahul Iyer, Shengwen Yang, Xixuan Feng
  • Patent number: 10496375
    Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.
    Type: Grant
    Filed: August 27, 2018
    Date of Patent: December 3, 2019
    Assignee: Pivotal Software, Inc.
    Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
  • Patent number: 10445657
    Abstract: A general framework for cross-validation of any supervised learning algorithm on a distributed database comprises a multi-layer software architecture that implements training, prediction and metric functions in a C++ layer and iterates processing of different subsets of a data set with a plurality of different models in a Python layer. The best model is determined to be the one with the smallest average prediction error across all database segments.
    Type: Grant
    Filed: December 8, 2015
    Date of Patent: October 15, 2019
    Assignee: EMC IP Holding Company, LLC
    Inventors: Hai Qian, Rahul Iyer, Shengwen Yang, Caleb E. Welton
  • Publication number: 20190012144
    Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments during a normal query plan dispatch process.
    Type: Application
    Filed: August 27, 2018
    Publication date: January 10, 2019
    Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
  • Publication number: 20180373755
    Abstract: Performing data analytics processing in the context of a large scale distributed system that includes a massively parallel processing (MPP) database and a distributed storage layer is disclosed. In various embodiments, a data analytics request is received. A plan is created to generate a response to the request. A corresponding portion of the plan is assigned to each of a plurality of distributed processing segments, including by invoking as indicated in the assignment one or more data analytical functions embedded in the processing segment.
    Type: Application
    Filed: November 22, 2017
    Publication date: December 27, 2018
    Inventors: Caleb E. Welton, Shengwen Yang
  • Publication number: 20180276274
    Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, and the master node may distribute this metadata with query plans to worker nodes. The worker nodes may request additional metadata from the master node when necessary.
    Type: Application
    Filed: May 30, 2018
    Publication date: September 27, 2018
    Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
  • Patent number: 10061562
    Abstract: A random number generation process generated uncorrelated random numbers from identical random number sequences on parallel processing database segments of an MPP database without communications between the segments by establishing a different starting position in the sequence on each segment using an identifier that is unique to each segment, query slice information and the number of segments. A master node dispatches a seed value to initialize the random number sequence generation on all segments, and dispatches the query slice information and information as to the number of segments, during a normal query plan dispatch process.
    Type: Grant
    Filed: September 28, 2013
    Date of Patent: August 28, 2018
    Assignee: Pivotal Software, Inc.
    Inventors: Hitoshi Harada, Caleb E. Welton, Florian Schoppmann
  • Patent number: 10013456
    Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, and the master node may distribute this metadata with query plans to worker nodes. The worker nodes may request additional metadata from the master node when necessary.
    Type: Grant
    Filed: August 3, 2016
    Date of Patent: July 3, 2018
    Assignee: EMC IP Holding Company LLC
    Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
  • Patent number: 10002145
    Abstract: A specified data type and pre-storage operation are receive. In response, an association between the two is created in storage metadata. After the association is created, data to be stored is received where the data has a data type. In response to receiving the data, the storage metadata, including the association, is accessed and it is determined if the data type of the data is the same as the specified one. If so, the specified pre-storage operation is automatically performed on the data in order to obtain an output and the output is stored.
    Type: Grant
    Filed: April 12, 2016
    Date of Patent: June 19, 2018
    Assignee: EMC IP Holding Company LLC
    Inventors: Gavin John Sherry, Caleb E. Welton
  • Publication number: 20180011902
    Abstract: A method and system for executing database queries in parallel using a shared metadata store. The metadata store may reside on a master node, where the master node is the root node in a tree. The master node may distribute query plans and query metadata to other nodes in the cluster. These additional nodes may request additional metadata from each other or the master nodes as necessary.
    Type: Application
    Filed: September 25, 2017
    Publication date: January 11, 2018
    Inventors: Caleb E. Welton, Hitoshi Harada, Jeffrey Ira Cohen, Lei Chang, Radhika Reddy, Tao Ma, Zhanwei Wang
  • Patent number: 9858315
    Abstract: Performing data analytics processing in the context of a large scale distributed system that includes a massively parallel processing (MPP) database and a distributed storage layer is disclosed. In various embodiments, a data analytics request is received. A plan is created to generate a response to the request. A corresponding portion of the plan is assigned to each of a plurality of distributed processing segments, including by invoking as indicated in the assignment one or more data analytical functions embedded in the processing segment.
    Type: Grant
    Filed: December 22, 2016
    Date of Patent: January 2, 2018
    Assignee: EMC IP Holding Company LLC
    Inventors: Caleb E. Welton, Shengwen Yang