Patents by Inventor Paul O'Hara

Paul O'Hara has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11928562
    Abstract: A system and method include input of data records to a first trained predictive model to obtain a predicted value associated with each input data record. A model region is then associated with each of the input data records based on the first trained predictive model, the input data records and the predicted values. Enhanced input data records are generated by, for each model region, adding derived values of engineered features associated with the model region to input data records associated with the model region and default values of the engineered features associated with the model region to input training records not associated with the model region. The enhanced input data records are input to a second trained predictive model to obtain an enhanced predicted value associated with each input data record.
    Type: Grant
    Filed: September 16, 2020
    Date of Patent: March 12, 2024
    Assignee: BUSINESS OBJECTS SOFTWARE LIMITED
    Inventors: Paul O'Hara, Ying Wu
  • Publication number: 20240062101
    Abstract: A historical feature contribution score dataset comprising a number of sets of scores generated by machine learning model may be obtained. Additional feature contribution score sets may be materialized such that the size of each additional feature contribution score set is based on a corresponding randomly selected values within a set-size range. A training dataset may be produced that includes feature contribution scores and corresponding classification labels extracted from the historical feature contribution score dataset and the additional feature contribution score sets. The classification labels may indicate an amount that the corresponding feature contribution scores contribute to a prediction of a target feature. A machine learning model may be trained to predict the classification labels using the training dataset. An input feature contribution score set may be applied to the machine learning model to obtain predicted classification labels.
    Type: Application
    Filed: August 17, 2022
    Publication date: February 22, 2024
    Inventor: Paul O'Hara
  • Publication number: 20240020896
    Abstract: Using approximated bin intervals to label the histograms provides clarity and allows for the histogram to be more intuitively understood. A dataset may comprise a plurality of records having a plurality of features including one or more continuous features. A selection of a continuous feature may be obtained. A bin width based on a number of bins and feature statistics of the continuous feature may be determined. An approximated bin interval range is determined by applying a bin mask based on the bin width to the feature statistics. An approximated bin width is determined based on the number of bins and the approximated bin interval range. Approximated bin intervals for the histogram are determined based on the approximated bin width. A histogram is generated having bins with intervals based the approximated bin intervals.
    Type: Application
    Filed: July 12, 2023
    Publication date: January 18, 2024
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Conor White
  • Patent number: 11734864
    Abstract: Using approximated bin intervals to label the histograms provides clarity and allows for the histogram to be more intuitively understood. A dataset may comprise a plurality of records having a plurality of features including one or more continuous features. A selection of a continuous feature may be obtained. A bin width based on a number of bins and feature statistics of the continuous feature may be determined. An approximated bin interval range is determined by applying a bin mask based on the bin width to the feature statistics. An approximated bin width is determined based on the number of bins and the approximated bin interval range. Approximated bin intervals for the histogram are determined based on the approximated bin width. A histogram is generated having bins with intervals based the approximated bin intervals.
    Type: Grant
    Filed: October 29, 2021
    Date of Patent: August 22, 2023
    Assignee: BUSINESS OBJECTS SOFTWARE LTD
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Conor White
  • Patent number: 11727030
    Abstract: The present disclosure involves systems, software, and computer implemented methods for automatically detecting hot areas in heat map visualizations. One example method includes identifying a two-dimensional heat map. The identified two-dimensional heat map is converted to a one-dimensional heat map. Cells of the one-dimensional heat map are clustered using a density-based clustering algorithm to generate at least one dense region of cells. A mean value of cells in each dense region is calculated and the dense regions are sorted by mean value in descending order. An approach for identifying hot areas is selected and the selected approach is used to identify at least one dense region as a hot area of the one-dimensional heat map.
    Type: Grant
    Filed: May 5, 2020
    Date of Patent: August 15, 2023
    Assignee: Business Objects Software Ltd.
    Inventors: Ben Murphy, Ying Wu, Paul O'Hara, Emmet Norton, Malte Christian Kaufmann, Orla Cullen
  • Patent number: 11720579
    Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics based on a number of occurrences of each discrete value of the discrete feature in the data, determination of first summary statistics based on the determined statistics, determine of a dissimilarity for each discrete feature based on the first summary statistics and on the statistics determined for the discrete feature, determination of candidate discrete features based on the determined dissimilarities, determination, for each of the candidate discrete features, of second summary statistics based on values of a continuous feature associated with each discrete value of the candidate discrete feature, determination of a deviation score for each of the candidate discrete features based on the second summary statistics, and transmission of the candidate discrete features for display in association with the continuous feature based on the determined deviation scores.
    Type: Grant
    Filed: July 6, 2021
    Date of Patent: August 8, 2023
    Assignee: BUSINESS OBJECTS SOFTWARE LTD
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Alan McShane, Anirban Banerjee, Mark Ahern
  • Publication number: 20230216752
    Abstract: Techniques for enabling secure access to data using data blocks is described. Computing device(s) can provide instruction(s) to a component associated with an entity, wherein the instruction(s) are associated with an identifier corresponding to a data block of a plurality of data blocks. The computing device(s) can receive, from the component, data associated with the component, wherein the data is associated with the identifier and is indicative of a state of the component. The computing device(s) can store the data in the data block and monitor, using rule(s), changes to the state of the component based at least partly on the data in the data block. As a result, techniques described herein enable near real-time—and in some examples, automatic—reporting and/or remediation for correcting changes to the state of the component using data that is securely accessed by use of data blocks.
    Type: Application
    Filed: March 10, 2023
    Publication date: July 6, 2023
    Inventors: Chad Campbell, Carroll Wayne Moon, Christopher James Carlson, Jeremy David Sublett, Paul O'Hara, David Ray Garza, David James Weatherford, Jason Aaron Graham, Jon Matthew Loflin, Kyle J. Wagner
  • Patent number: 11693879
    Abstract: Systems and methods include reception of a set of data including continuous features and a discrete feature, each continuous feature associated with a plurality of values and the discrete feature associated with a plurality of discrete values, determine, for each continuous feature, a relationship factor representing a relationship between the discrete feature and the continuous feature based on the plurality of values associated with the continuous feature and the plurality of discrete values, identify one of the continuous features associated with a largest one of the determined relationship factors, generate, for each of the other features, a correlation factor representing a correlation between the continuous feature and the identified continuous feature, determine, for each of the continuous features other than the identified continuous feature, a composite relationship score based on the relationship factor and the correlation factor associated with the feature, and present a visualization associated wi
    Type: Grant
    Filed: May 19, 2021
    Date of Patent: July 4, 2023
    Assignee: BUSINESS OBJECTS SOFTWARE LTD.
    Inventors: Paul O'Hara, Ying Wu, Jiazheng Li, Cathal McGovern, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Kerry O'Connor, Michael Golden, Satinder Singh, Vlad Zat
  • Patent number: 11681715
    Abstract: Systems and methods include determination, determine, for each of a plurality of discrete features, of statistics for each discrete value of the discrete feature based on values of a continuous feature associated with the discrete value, determination, for each discrete feature, of first summary statistics based on the statistics determined for each discrete value of the discrete feature, determination, for each discrete feature, of a dissimilarity based on the first summary statistics determined for the discrete feature and on the statistics determined for each discrete value of the discrete feature, determination of candidate discrete features of the discrete features based on the determined dissimilarities, the candidate discrete features comprising less than all of the discrete features, determination, for each of the candidate discrete features, of second summary statistics based on values of the continuous feature associated with each discrete value of the candidate discrete feature, determine of a devi
    Type: Grant
    Filed: June 9, 2021
    Date of Patent: June 20, 2023
    Assignee: BUSINESS OBJECTS SOFTWARE LTD.
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Anirban Banerjee, Ian Denver, Alan McShane
  • Publication number: 20230133856
    Abstract: Using approximated bin intervals to label the histograms provides clarity and allows for the histogram to be more intuitively understood. A dataset may comprise a plurality of records having a plurality of features including one or more continuous features. A selection of a continuous feature may be obtained. A bin width based on a number of bins and feature statistics of the continuous feature may be determined. An approximated bin interval range is determined by applying a bin mask based on the bin width to the feature statistics. An approximated bin width is determined based on the number of bins and the approximated bin interval range. Approximated bin intervals for the histogram are determined based on the approximated bin width. A histogram is generated having bins with intervals based the approximated bin intervals.
    Type: Application
    Filed: October 29, 2021
    Publication date: May 4, 2023
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Conor White
  • Publication number: 20230113850
    Abstract: The present disclosure provides for accurate and efficient identification of candidate features for an input dataset comprising one or more continuous features and one or more categorical features is obtained. A number of categorical feature categories based on the one or more categorical features is determined. Record counts for each of the categorical feature categories are determined. Skew statistics for each category are determined based on the record counts for each of the categorical feature categories. Cardinality skew factors for each of the one or more categorical features are then determined based on the record counts and the skew statistics. A number of the one or more categorical features having the highest cardinality skew factors are selected from among the cardinality skew factors. Then, a top contributor deviation analysis is performed using the selected number of the categorical features having the highest cardinality skew factors.
    Type: Application
    Filed: October 8, 2021
    Publication date: April 13, 2023
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Alan McShane
  • Patent number: 11606270
    Abstract: Techniques for enabling secure access to data using data blocks is described. Computing device(s) can provide instruction(s) to a component associated with an entity, wherein the instruction(s) are associated with an identifier corresponding to a data block of a plurality of data blocks. The computing device(s) can receive, from the component, data associated with the component, wherein the data is associated with the identifier and is indicative of a state of the component. The computing device(s) can store the data in the data block and monitor, using rule(s), changes to the state of the component based at least partly on the data in the data block. As a result, techniques described herein enable near real-time—and in some examples, automatic—reporting and/or remediation for correcting changes to the state of the component using data that is securely accessed by use of data blocks.
    Type: Grant
    Filed: April 19, 2021
    Date of Patent: March 14, 2023
    Assignee: CloudFit Software, LLC
    Inventors: Chad Campbell, Carroll Wayne Moon, Christopher James Carlson, Jeremy David Sublett, Paul O'Hara, David Ray Garza, David James Weatherford, Jason Aaron Graham, Jon Matthew Loflin, Kyle Wagner
  • Publication number: 20230010992
    Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics based on a number of occurrences of each discrete value of the discrete feature in the data, determination of first summary statistics based on the determined statistics, determine of a dissimilarity for each discrete feature based on the first summary statistics and on the statistics determined for the discrete feature, determination of candidate discrete features based on the determined dissimilarities, determination, for each of the candidate discrete features, of second summary statistics based on values of a continuous feature associated with each discrete value of the candidate discrete feature, determination of a deviation score for each of the candidate discrete features based on the second summary statistics, and transmission of the candidate discrete features for display in association with the continuous feature based on the determined deviation scores.
    Type: Application
    Filed: July 6, 2021
    Publication date: January 12, 2023
    Inventors: Paul O'HARA, Malte Christian KAUFMANN, Alan McSHANE, Anirban BANERJEE, Mark AHERN
  • Publication number: 20220398246
    Abstract: Systems and methods include determination, determine, for each of a plurality of discrete features, of statistics for each discrete value of the discrete feature based on values of a continuous feature associated with the discrete value, determination, for each discrete feature, of first summary statistics based on the statistics determined for each discrete value of the discrete feature, determination, for each discrete feature, of a dissimilarity based on the first summary statistics determined for the discrete feature and on the statistics determined for each discrete value of the discrete feature, determination of candidate discrete features of the discrete features based on the determined dissimilarities, the candidate discrete features comprising less than all of the discrete features, determination, for each of the candidate discrete features, of second summary statistics based on values of the continuous feature associated with each discrete value of the candidate discrete feature, determine of a devi
    Type: Application
    Filed: June 9, 2021
    Publication date: December 15, 2022
    Inventors: Paul O'HARA, Malte Christian KAUFMANN, Anirban BANERJEE, Ian DENVER, Alan McSHANE
  • Publication number: 20220374450
    Abstract: Systems and methods include reception of a set of data including continuous features and a discrete feature, each continuous feature associated with a plurality of values and the discrete feature associated with a plurality of discrete values, determine, for each continuous feature, a relationship factor representing a relationship between the discrete feature and the continuous feature based on the plurality of values associated with the continuous feature and the plurality of discrete values, identify one of the continuous features associated with a largest one of the determined relationship factors, generate, for each of the other features, a correlation factor representing a correlation between the continuous feature and the identified continuous feature, determine, for each of the continuous features other than the identified continuous feature, a composite relationship score based on the relationship factor and the correlation factor associated with the feature, and present a visualization associated wi
    Type: Application
    Filed: May 19, 2021
    Publication date: November 24, 2022
    Inventors: Paul O'HARA, Ying WU, Jiazheng LI, Cathal McGOVERN, Malte Christian KAUFMANN, Esther Rodrigo ORTIZ, Kerry O'CONNOR, Michael GOLDEN, Satinder SINGH
  • Publication number: 20220374765
    Abstract: Systems and methods include reception of a set of data, the set of data comprising a plurality of features, building, for each of a plurality of subsets of the plurality of features, a dimension reduction model based on the subset of features and associated values of the set of data, and, for each dimension reduction model, determination of a weight associated with each of subset of features based on the dimension model, identification of a predetermined number of features associated with the highest weights, and generation, for each dimension reduction model, of a data structure comprising the predetermined number of features and the weight associated with each of the predetermined number of features. A plurality of top features are determined based on the plurality of data structures, and a supervised learning model is trained based on the plurality of top features of the set of data.
    Type: Application
    Filed: May 24, 2021
    Publication date: November 24, 2022
    Inventors: Ying WU, Jiazheng LI, Paul O'HARA, Malte Christian KAUFMANN
  • Publication number: 20220357920
    Abstract: Systems and methods include reception of data including a plurality of continuous features and a first discrete feature, each of the plurality of continuous features associated with a plurality of values and the first discrete feature associated with a plurality of discrete values, determination of an overall output value of a function based on the plurality of values associated with each of the plurality of continuous features, determination, for each discrete value of the plurality of discrete values, of an output value of the function based on ones of the plurality of values associated with the discrete value, scaling of the output value determined for each discrete value based on the determined output values and the overall output value, and presentation of the scaled output values.
    Type: Application
    Filed: May 6, 2021
    Publication date: November 10, 2022
    Inventors: Paul O'HARA, Malte Christian KAUFMANN
  • Patent number: 11475021
    Abstract: The present disclosure involves systems, software, and computer implemented methods for ranking time dimensions. One example method includes receiving a request for an insight analysis for a dataset that includes a value dimension and a set of multiple date dimensions. Each date dimension is converted into a time series and a value quality factor is determined for each time series that represents a level of data quality for the time series. A time series informative factor is determined for each time series that represents how informative the time series is within a specified time window. An insight score is determined, for each time dimension, based on the determined value quality factors and the determined time series informative factors. The insight score for the time dimension is provided, for at least some of the time dimensions.
    Type: Grant
    Filed: May 18, 2020
    Date of Patent: October 18, 2022
    Assignee: Business Objects Software Ltd.
    Inventors: Ying Wu, Paul O'Connor, Esther Rodrigo Ortiz, Artur Stulka, Mateusz Lewandowski, Paul Sheedy, Mairtin Keane, Paul O'Hara, Malte Christian Kaufmann, Robert McGrath
  • Publication number: 20220172130
    Abstract: A method includes receiving training data including sequential data, determining a plurality of future time points, generating a first prediction by applying a first forecasting algorithm to the training data, generating a second prediction by applying a second forecasting algorithm to the training data, extracting predicted values from the first prediction and the second prediction that corresponds to a future time point of the plurality of future time points, applying a regression model in sequence on each of the plurality of future time points to generate a final predicted value of each of the plurality of future time points, and outputting the final predicted values of the plurality of future time points.
    Type: Application
    Filed: February 16, 2022
    Publication date: June 2, 2022
    Inventors: Ying Wu, Paul Pallath, Paul O'hara
  • Patent number: 11321332
    Abstract: The present disclosure involves systems, software, and computer implemented methods for automatically recommending one or more frequencies for time series data. One example method includes receiving a request for an insight analysis for an input time series included in a dataset. For each of multiple frequencies to analyze, the input time series is transformed into a frequency time series. An absolute percentage change impact factor and an absolute trend impact factor are determined for each frequency time series. A frequency interest score is determined based on the determined absolute percentage change factors and the determined absolute trend impact factors, for each time frequency time series. The frequency interest score is provided for at least some of the frequency time series.
    Type: Grant
    Filed: May 18, 2020
    Date of Patent: May 3, 2022
    Assignee: Business Objects Software Ltd.
    Inventors: Paul O'Hara, Ying Wu, Esther Rodrigo Ortiz, Paul O'Connor, Gabor Szabo, Artur Stulka