Patents by Inventor Anna Veselova

Anna Veselova has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11983172
    Abstract: A method includes generating a predictive model mapping batch size to estimated memory utilization based on a plurality of prior batch sizes applied for performing a corresponding plurality of prior data format conversions, and further based on a plurality of prior actual memory utilization occurring in performing the corresponding plurality of prior data format conversions. A next batch size is automatically selected for performance of a subsequent data format conversion based on the predictive model and a current memory availability. A batch of data for performance of the subsequent data format conversion is identified based on the batch of data having the next batch size. The subsequent data format conversion is performed the batch of data, and an actual amount of memory utilized in performing the subsequent data format conversion is determined. The predictive model is updated based on the next batch size and the actual amount of memory utilized.
    Type: Grant
    Filed: December 7, 2021
    Date of Patent: May 14, 2024
    Assignee: Ocient Holdings LLC
    Inventors: Natalia Marie Jenuwine, George Kondiles, Anna Veselova, Ravi V. Khadiwala
  • Publication number: 20240134858
    Abstract: A database system is operable to determine a query for execution. A disjunction of a plurality of conjunctive normal form (CNF) filtering predicates is determined based on filtering predicates of the query. An IO pipeline is generated based on the disjunction of the plurality of CNF filtering predicates that includes, based on applying at least one type of optimization, at least one shared IO element common to multiple ones of plurality of the CNF filtering predicates. The IO pipeline is executed in accordance with execution of the query to generate a filtered subset of a plurality of rows that each meet at least one of the plurality of CNF filtering predicates.
    Type: Application
    Filed: October 11, 2023
    Publication date: April 25, 2024
    Applicant: Ocient Holdings LLC
    Inventors: Sarah Kate Schieferstein, Jason Arnold, Austen Thomas Darre, Darius Russell Kish, Matthew Ashbeck, Greg R. Dhuse, Anna Veselova, Benjamin Daniel Rabe, Pieter Charles Jas Svenson
  • Publication number: 20240118905
    Abstract: A node of a database system is operable to determine, at a first time, to prepare for a shutdown. The shutdown of the node is performed at a second time that is a period of time after the first time based on the node delaying the shutdown until a plurality of currently running processes being run by the node that initiated prior to the first time are determined to be complete. During the period of time after the first time and prior to performing the shutdown, a set of new processing requests are rejected by the node.
    Type: Application
    Filed: October 9, 2023
    Publication date: April 11, 2024
    Applicant: Ocient Holdings LLC
    Inventors: George Kondiles, Alyssa Catherine Wagenmaker, Ellis Mihalko Saupe, Jason Arnold, Natalia Marie Jenuwine, Finley Jordan Lau, David Charles Boutcher, Anna Veselova, Susmita Saha
  • Publication number: 20240111745
    Abstract: A method includes performing a search of an inverted index structure indexing values of a column to generate an in-range indexed value set by identifying all indexed values of the inverted index structure falling within a range corresponding to a range-based filter. A set of characteristics of the in-range indexed value set are identified based on performing the search of an inverted index structure. When the set of characteristics compare favorably to the set of index-usage requirements, output is generated based on performing a plurality of searches to the inverted index structure based on the in-range indexed value set. When the set of characteristics compare unfavorably to the set of index-usage requirements, the output is generated without performing any searches to the inverted index structure.
    Type: Application
    Filed: September 15, 2023
    Publication date: April 4, 2024
    Applicant: Ocient Holdings LLC
    Inventors: Richard George Wendel, III, Greg R. Dhuse, Hassan Farahani, Matthew Ashbeck, Anna Veselova, Benjamin Daniel Rabe
  • Publication number: 20240004848
    Abstract: A method includes generating a plurality of pages from a plurality of records received from a plurality of data sources. Deduplication of the plurality of pages is facilitated based on a plurality of page metadata of the plurality of pages based on, for the each page of the plurality of pages. A filtered set of potentially-intersecting pages is identified for each given page as a proper subset of the plurality of pages stored in the page storage system based on first comparison parameters, and an intersecting set of pages that include a row number intersection with the given page is identified as a proper subset of the filtered set of potentially-intersecting pages based on second comparison parameters. Records with records with row numbers included in row number intersections with other pages in the intersecting set of pages are removed from the each page.
    Type: Application
    Filed: September 15, 2023
    Publication date: January 4, 2024
    Applicant: Ocient Holdings LLC
    Inventors: George Kondiles, Ravi V. Khadiwala, Donald Scott Clark, Anna Veselova
  • Publication number: 20230418820
    Abstract: A database system is operable to store a plurality of column values for a first column of a plurality of rows. A query having a query predicate indicating a negation of a condition for the first column based on a first value is executed. A first index element is applied to identify a first proper subset of the plurality of rows having values for the first column meeting the condition based on accessing index data. At least one second index element is applied to identify a second proper subset of the plurality of rows having values for the first column meeting at least one missing data-based condition based on accessing the index data. A set difference between the plurality of rows and a union of the first proper subset and the second proper subset is identified. A query resultant for the query is generated based on the set difference.
    Type: Application
    Filed: September 7, 2023
    Publication date: December 28, 2023
    Applicant: Ocient Holdings LLC
    Inventors: Anna Veselova, Greg R. Dhuse, Matthew Ashbeck
  • Patent number: 11803526
    Abstract: A record processing and storage system is operable to receive a plurality of row data that each indicate a batch number. Each of the plurality of row data is added to a pending row data pool. A plurality of pages is generated from the plurality of row data via a plurality of processing core resources. Each processing core resource in the plurality of processing core resources processing a corresponding subset of the plurality of row data by retrieving, in each time slice of a plurality of time slices, one row data from the pending row data pool with a most favorably ordered batch number of row data in the pending row data pool. Each processing core resource further processes the one row data in the in each time slice to participate in generation of at least one of the plurality of pages.
    Type: Grant
    Filed: March 15, 2022
    Date of Patent: October 31, 2023
    Assignee: Ocient Holdings LLC
    Inventors: George Kondiles, Ravi V. Khadiwala, Donald Scott Clark, Anna Veselova
  • Patent number: 11803544
    Abstract: A method for execution by at least one processor of a database system includes storing a plurality of column values for a first column of a plurality of rows. Each of a set of missing data-based conditions for the first column are indexed via an indexing scheme. A subset of the set of missing data-based conditions that satisfy a query predicate of a query are identified. An IO pipeline for access of the first column based on the query predicate and further based on the subset of the set of missing data-based conditions is generated. The IO pipeline is applied in conjunction with execution of the query by applying at least one index element to identify a proper subset of the plurality of rows based on index data of the indexing scheme, and by generating a query resultant based on the proper subset.
    Type: Grant
    Filed: October 6, 2021
    Date of Patent: October 31, 2023
    Assignee: Ocient Holdings LLC
    Inventors: Anna Veselova, Greg R. Dhuse, Matthew Ashbeck
  • Patent number: 11734239
    Abstract: A record processing and storage system is operable to receive a plurality of labeled row data from a data source. Each labeled row data of the plurality of labeled row data includes at least one record and a corresponding row number of a plurality of row numbers. A plurality of pages are generated from records included in the labeled row data. The plurality of pages are stored via a page storage system. A plurality of page metadata corresponding to the plurality of pages is generated, where each of the plurality of page metadata is generated based on at least corresponding one row number of at least one labeled row data with records included in a corresponding one of the plurality of pages. Deduplication of duplicated records included the plurality of pages is facilitated based on the plurality of page metadata.
    Type: Grant
    Filed: March 15, 2022
    Date of Patent: August 22, 2023
    Assignee: Ocient Holdings LLC
    Inventors: George Kondiles, Ravi V. Khadiwala, Donald Scott Clark, Anna Veselova
  • Publication number: 20230237031
    Abstract: A record processing and storage system operates by: generating a set of pages from a plurality of row data via a plurality of processing core resources, wherein each processing core resource in the plurality of processing core resources generate a corresponding subset of the set of pages, independently from and in parallel with processing of other subsets of the set of pages via other ones of the plurality of processing core resources; facilitating performance of a single storage transaction to store the set of pages; identifying a page set interval based on a plurality of row number intervals of the set of pages; generating, based on completing the single storage transaction, row durability data indicating a least favorably ordered row number of a plurality of row numbers corresponding to the plurality of row data; and transmitting the row durability data to a computing device associated with the plurality of row data.
    Type: Application
    Filed: March 30, 2023
    Publication date: July 27, 2023
    Applicant: Ocient Holdings LLC
    Inventors: Anna Veselova, Ravi V. Khadiwala, George Kondiles
  • Patent number: 11675757
    Abstract: A database system operates by: receiving a plurality of row data associated with a first data source; identifying a subset of row data from the plurality of row data that includes only ones of the plurality of row data that compare favorably to maintained row durability data; generating at least one page from ones of the plurality of row data included in the subset of row data; storing the at least one page in long term storage; generating updated row durability data indicating a least favorably ordered row number of a plurality of row numbers corresponding to the subset of row data based on storing the at least one page in long term storage; and updating the maintained row durability data to indicate the least favorably ordered row number of the updated row durability data.
    Type: Grant
    Filed: March 29, 2021
    Date of Patent: June 13, 2023
    Assignee: Ocient Holdings LLC
    Inventors: Anna Veselova, Ravi V. Khadiwala, George Kondiles
  • Publication number: 20230177043
    Abstract: A method includes generating a predictive model mapping batch size to estimated memory utilization based on a plurality of prior batch sizes applied for performing a corresponding plurality of prior data format conversions, and further based on a plurality of prior actual memory utilization occurring in performing the corresponding plurality of prior data format conversions. A next batch size is automatically selected for performance of a subsequent data format conversion based on the predictive model and a current memory availability. A batch of data for performance of the subsequent data format conversion is identified based on the batch of data having the next batch size. The subsequent data format conversion is performed the batch of data, and an actual amount of memory utilized in performing the subsequent data format conversion is determined. The predictive model is updated based on the next batch size and the actual amount of memory utilized.
    Type: Application
    Filed: December 7, 2021
    Publication date: June 8, 2023
    Applicant: Ocient Holdings LLC
    Inventors: Natalia Marie Jenuwine, George Kondiles, Anna Veselova, Ravi V. Khadiwala
  • Publication number: 20230107652
    Abstract: A method for execution by at least one processor of a database system includes storing a plurality of column values for a first column of a plurality of rows. Each of a set of missing data-based conditions for the first column are indexed via an indexing scheme. A subset of the set of missing data-based conditions that satisfy a query predicate of a query are identified. An IO pipeline for access of the first column based on the query predicate and further based on the subset of the set of missing data-based conditions is generated. The IO pipeline is applied in conjunction with execution of the query by applying at least one index element to identify a proper subset of the plurality of rows based on index data of the indexing scheme, and by generating a query resultant based on the proper subset.
    Type: Application
    Filed: October 6, 2021
    Publication date: April 6, 2023
    Applicant: Ocient Holdings LLC
    Inventors: Anna Veselova, Greg R. Dhuse, Matthew Ashbeck
  • Publication number: 20220207008
    Abstract: A record processing and storage system is operable to receive a plurality of labeled row data from a data source. Each labeled row data of the plurality of labeled row data includes at least one record and a corresponding row number of a plurality of row numbers. A plurality of pages are generated from records included in the labeled row data. The plurality of pages are stored via a page storage system. A plurality of page metadata corresponding to the plurality of pages is generated, where each of the plurality of page metadata is generated based on at least corresponding one row number of at least one labeled row data with records included in a corresponding one of the plurality of pages. Deduplication of duplicated records included the plurality of pages is facilitated based on the plurality of page metadata.
    Type: Application
    Filed: March 15, 2022
    Publication date: June 30, 2022
    Applicant: Ocient Holdings LLC
    Inventors: George Kondiles, Ravi V. Khadiwala, Donald Scott Clark, Anna Veselova
  • Publication number: 20220207009
    Abstract: A record processing and storage system is operable to receive a plurality of row data that each indicate a batch number. Each of the plurality of row data is added to a pending row data pool. A plurality of pages is generated from the plurality of row data via a plurality of processing core resources. Each processing core resource in the plurality of processing core resources processing a corresponding subset of the plurality of row data by retrieving, in each time slice of a plurality of time slices, one row data from the pending row data pool with a most favorably ordered batch number of row data in the pending row data pool. Each processing core resource further processes the one row data in the in each time slice to participate in generation of at least one of the plurality of pages.
    Type: Application
    Filed: March 15, 2022
    Publication date: June 30, 2022
    Applicant: Ocient Holdings LLC
    Inventors: George KONDILES, Ravi V. KHADIWALA, Donald Scott CLARK, Anna VESELOVA
  • Publication number: 20220138168
    Abstract: A database system operates by: receiving a plurality of row data associated with a first data source; identifying a subset of row data from the plurality of row data that includes only ones of the plurality of row data that compare favorably to maintained row durability data; generating at least one page from ones of the plurality of row data included in the subset of row data; storing the at least one page in long term storage; generating updated row durability data indicating a least favorably ordered row number of a plurality of row numbers corresponding to the subset of row data based on storing the at least one page in long term storage; and updating the maintained row durability data to indicate the least favorably ordered row number of the updated row durability data.
    Type: Application
    Filed: March 29, 2021
    Publication date: May 5, 2022
    Applicant: Ocient Holdings LLC
    Inventors: Anna Veselova, Ravi V. Khadiwala, George Kondiles
  • Patent number: 11321288
    Abstract: A method for execution by a record processing and storage system includes receiving a plurality of records and corresponding row numbers. Pages are generated from the received records. Page metadata is generated for each page that includes row number span data based on row numbers of the records included in each page. Pairs of pages are identified in the plurality of pages based on having row number span data in their page metadata that include a row number span overlap. For each pair of pages, row number span data is updated for a first page in the pair by removing the row number span overlap with a second page in the pair. Reads of pages are performed based on their row number span data. Only records of each first page of each pair of pages having row numbers that are within the updated row number span data are read.
    Type: Grant
    Filed: August 5, 2020
    Date of Patent: May 3, 2022
    Assignee: Ocient Holdings LLC
    Inventors: George Kondiles, Ravi V. Khadiwala, Donald Scott Clark, Anna Veselova
  • Publication number: 20220043787
    Abstract: A method for execution by a record processing and storage system includes receiving a plurality of records and corresponding row numbers. Pages are generated from the received records. Page metadata is generated for each page that includes row number span data based on row numbers of the records included in each page. Pairs of pages are identified in the plurality of pages based on having row number span data in their page metadata that include a row number span overlap. For each pair of pages, row number span data is updated for a first page in the pair by removing the row number span overlap with a second page in the pair. Reads of pages are performed based on their row number span data. Only records of each first page of each pair of pages having row numbers that are within the updated row number span data are read.
    Type: Application
    Filed: August 5, 2020
    Publication date: February 10, 2022
    Applicant: Ocient Holdings LLC
    Inventors: George Kondiles, Ravi V. Khadiwala, Donald Scott Clark, Anna Veselova