Patents by Inventor Niall F. McCarroll
Niall F. McCarroll has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11556496Abstract: Method and system are provided for writing output from map-reduce jobs to an archive file. The method may include providing an archive manager and exposing an interface to be called from map-reduce jobs to output to an archive file in a map-reduce distributed file system. The method may also include using a buffering database as a temporary cache to buffer updates to the archive file. Handling by the archive manager calls from map-reduce jobs may allow: reading directly from an archive file or from a job index at the buffering database; writing to a job index at the buffering database used as a temporary cache to buffer updates; and serializing updates from the buffering database to the archive file.Type: GrantFiled: November 2, 2018Date of Patent: January 17, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Curtis N. Browning, Niall F. McCarroll
-
Patent number: 10902005Abstract: Method and systems for parallel scoring an ensemble model are provided. Aspects include loading data into a first distributed data structure having a plurality of partitions, each partition having loaded data in the form of a set of pairs of data formed of a record to be scored and a partial score for that record. A component model in the ensemble model is selected and processing of the records carried out in parallel across the partitions including updating the partial score for each record. In response to a partial score for a record not meeting an accuracy threshold, the method retains the record in the first distributed data structure to be scored by a subsequent component model. In response to the partial score for a record meeting the accuracy threshold, the method moves the record and updated partial score to an output result data structure to provide a final score.Type: GrantFiled: October 26, 2017Date of Patent: January 26, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Julian J. Clinton, Niall F. McCarroll, Lei Tian
-
Patent number: 10650008Abstract: Method and systems for parallel scoring an ensemble model are provided. Aspects include loading data into a first distributed data structure having a plurality of partitions, each partition having loaded data in the form of a set of pairs of data formed of a record to be scored and a partial score for that record. A component model in the ensemble model is selected and processing of the records carried out in parallel across the partitions including updating the partial score for each record. In response to a partial score for a record not meeting an accuracy threshold, the method retains the record in the first distributed data structure to be scored by a subsequent component model. In response to the partial score for a record meeting the accuracy threshold, the method moves the record and updated partial score to an output result data structure to provide a final score.Type: GrantFiled: August 26, 2016Date of Patent: May 12, 2020Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Julian J. Clinton, Niall F. McCarroll, Lei Tian
-
Publication number: 20190079939Abstract: Method and system are provided for writing output from map-reduce jobs to an archive file. The method may include providing an archive manager and exposing an interface to be called from map-reduce jobs to output to an archive file in a map-reduce distributed file system. The method may also include using a buffering database as a temporary cache to buffer updates to the archive file. Handling by the archive manager calls from map-reduce jobs may allow: reading directly from an archive file or from a job index at the buffering database; writing to a job index at the buffering database used as a temporary cache to buffer updates; and serializing updates from the buffering database to the archive file.Type: ApplicationFiled: November 2, 2018Publication date: March 14, 2019Inventors: Curtis N. Browning, Niall F. McCarroll
-
Patent number: 10171311Abstract: A method of generating synthetic data from a model of a dataset comprises the steps of receiving a model of a dataset, extracting information from the received model, constructing a database view from the extracted information, receiving a query to the constructed database view, and generating synthetic data from the constructed database view according to the received query.Type: GrantFiled: October 17, 2013Date of Patent: January 1, 2019Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Matthew C. Harvey, Niall F. McCarroll, Yefim Shuf
-
Patent number: 10146779Abstract: Method and system are provided for writing output from map-reduce jobs to an archive file. The method may include providing an archive manager and exposing an interface to be called from map-reduce jobs to output to an archive file in a map-reduce distributed file system. The method may also include using a buffering database as a temporary cache to buffer updates to the archive file. Handling by the archive manager calls from map-reduce jobs may allow: reading directly from an archive file or from a job index at the buffering database; writing to a job index at the buffering database used as a temporary cache to buffer updates; and serializing updates from the buffering database to the archive file.Type: GrantFiled: June 26, 2015Date of Patent: December 4, 2018Assignee: International Business Machines CorporationInventors: Curtis N. Browning, Niall F. McCarroll
-
Publication number: 20180060330Abstract: Method and systems for parallel scoring an ensemble model are provided. Aspects include loading data into a first distributed data structure having a plurality of partitions, each partition having loaded data in the form of a set of pairs of data formed of a record to be scored and a partial score for that record. A component model in the ensemble model is selected and processing of the records carried out in parallel across the partitions including updating the partial score for each record. In response to a partial score for a record not meeting an accuracy threshold, the method retains the record in the first distributed data structure to be scored by a subsequent component model. In response to the partial score for a record meeting the accuracy threshold, the method moves the record and updated partial score to an output result data structure to provide a final score.Type: ApplicationFiled: October 26, 2017Publication date: March 1, 2018Inventors: JULIAN J. CLINTON, NIALL F. McCARROLL, LEI TIAN
-
Publication number: 20180060324Abstract: Method and systems for parallel scoring an ensemble model are provided. Aspects include loading data into a first distributed data structure having a plurality of partitions, each partition having loaded data in the form of a set of pairs of data formed of a record to be scored and a partial score for that record. A component model in the ensemble model is selected and processing of the records carried out in parallel across the partitions including updating the partial score for each record. In response to a partial score for a record not meeting an accuracy threshold, the method retains the record in the first distributed data structure to be scored by a subsequent component model. In response to the partial score for a record meeting the accuracy threshold, the method moves the record and updated partial score to an output result data structure to provide a final score.Type: ApplicationFiled: August 26, 2016Publication date: March 1, 2018Inventors: JULIAN J. CLINTON, NIALL F. McCARROLL, LEI TIAN
-
Publication number: 20160070711Abstract: Method and system are provided for writing output from map-reduce jobs to an archive file. The method may include providing an archive manager and exposing an interface to be called from map-reduce jobs to output to an archive file in a map-reduce distributed file system. The method may also include using a buffering database as a temporary cache to buffer updates to the archive file. Handling by the archive manager calls from map-reduce jobs may allow: reading directly from an archive file or from a job index at the buffering database; writing to a job index at the buffering database used as a temporary cache to buffer updates; and serializing updates from the buffering database to the archive file.Type: ApplicationFiled: June 26, 2015Publication date: March 10, 2016Inventors: Curtis N. Browning, Niall F. McCarroll
-
Publication number: 20140115007Abstract: A method of generating synthetic data from a model of a dataset comprises the steps of receiving a model of a dataset, extracting information from the received model, constructing a database view from the extracted information, receiving a query to the constructed database view, and generating synthetic data from the constructed database view according to the received query.Type: ApplicationFiled: October 17, 2013Publication date: April 24, 2014Applicant: International Business Machines CorporationInventors: Matthew C. Harvey, Niall F. McCarroll, Yefim Shuf