Patents by Inventor Malte Christian Kaufmann

Malte Christian Kaufmann has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240020896
    Abstract: Using approximated bin intervals to label the histograms provides clarity and allows for the histogram to be more intuitively understood. A dataset may comprise a plurality of records having a plurality of features including one or more continuous features. A selection of a continuous feature may be obtained. A bin width based on a number of bins and feature statistics of the continuous feature may be determined. An approximated bin interval range is determined by applying a bin mask based on the bin width to the feature statistics. An approximated bin width is determined based on the number of bins and the approximated bin interval range. Approximated bin intervals for the histogram are determined based on the approximated bin width. A histogram is generated having bins with intervals based on the approximated bin intervals.
    Type: Application
    Filed: July 12, 2023
    Publication date: January 18, 2024
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Conor White
  • Patent number: 11734864
    Abstract: Using approximated bin intervals to label the histograms provides clarity and allows for the histogram to be more intuitively understood. A dataset may comprise a plurality of records having a plurality of features including one or more continuous features. A selection of a continuous feature may be obtained. A bin width based on a number of bins and feature statistics of the continuous feature may be determined. An approximated bin interval range is determined by applying a bin mask based on the bin width to the feature statistics. An approximated bin width is determined based on the number of bins and the approximated bin interval range. Approximated bin intervals for the histogram are determined based on the approximated bin width. A histogram is generated having bins with intervals based on the approximated bin intervals.
    Type: Grant
    Filed: October 29, 2021
    Date of Patent: August 22, 2023
    Assignee: Business Objects Software Ltd.
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Conor White
  • Patent number: 11727030
    Abstract: The present disclosure involves systems, software, and computer implemented methods for automatically detecting hot areas in heat map visualizations. One example method includes identifying a two-dimensional heat map. The identified two-dimensional heat map is converted to a one-dimensional heat map. Cells of the one-dimensional heat map are clustered using a density-based clustering algorithm to generate at least one dense region of cells. A mean value of cells in each dense region is calculated and the dense regions are sorted by mean value in descending order. An approach for identifying hot areas is selected and the selected approach is used to identify at least one dense region as a hot area of the one-dimensional heat map.
    Type: Grant
    Filed: May 5, 2020
    Date of Patent: August 15, 2023
    Assignee: Business Objects Software Ltd.
    Inventors: Ben Murphy, Ying Wu, Paul O'Hara, Emmet Norton, Malte Christian Kaufmann, Orla Cullen
  • Patent number: 11720579
    Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics based on a number of occurrences of each discrete value of the discrete feature in the data, determination of first summary statistics based on the determined statistics, determination of a dissimilarity for each discrete feature based on the first summary statistics and on the statistics determined for the discrete feature, determination of candidate discrete features based on the determined dissimilarities, determination, for each of the candidate discrete features, of second summary statistics based on values of a continuous feature associated with each discrete value of the candidate discrete feature, determination of a deviation score for each of the candidate discrete features based on the second summary statistics, and transmission of the candidate discrete features for display in association with the continuous feature based on the determined deviation scores.
    Type: Grant
    Filed: July 6, 2021
    Date of Patent: August 8, 2023
    Assignee: Business Objects Software Ltd.
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Alan McShane, Anirban Banerjee, Mark Ahern
  • Patent number: 11693879
    Abstract: Systems and methods include reception of a set of data including continuous features and a discrete feature, each continuous feature associated with a plurality of values and the discrete feature associated with a plurality of discrete values, determination, for each continuous feature, of a relationship factor representing a relationship between the discrete feature and the continuous feature based on the plurality of values associated with the continuous feature and the plurality of discrete values, identification of one of the continuous features associated with a largest one of the determined relationship factors, generation, for each of the other features, of a correlation factor representing a correlation between the continuous feature and the identified continuous feature, determination, for each of the continuous features other than the identified continuous feature, of a composite relationship score based on the relationship factor and the correlation factor associated with the feature, and presentation of a visualization associated wi
    Type: Grant
    Filed: May 19, 2021
    Date of Patent: July 4, 2023
    Assignee: Business Objects Software Ltd.
    Inventors: Paul O'Hara, Ying Wu, Jiazheng Li, Cathal McGovern, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Kerry O'Connor, Michael Golden, Satinder Singh, Vlad Zat
  • Patent number: 11681715
    Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics for each discrete value of the discrete feature based on values of a continuous feature associated with the discrete value, determination, for each discrete feature, of first summary statistics based on the statistics determined for each discrete value of the discrete feature, determination, for each discrete feature, of a dissimilarity based on the first summary statistics determined for the discrete feature and on the statistics determined for each discrete value of the discrete feature, determination of candidate discrete features of the discrete features based on the determined dissimilarities, the candidate discrete features comprising less than all of the discrete features, determination, for each of the candidate discrete features, of second summary statistics based on values of the continuous feature associated with each discrete value of the candidate discrete feature, determination of a devi
    Type: Grant
    Filed: June 9, 2021
    Date of Patent: June 20, 2023
    Assignee: Business Objects Software Ltd.
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Anirban Banerjee, Ian Denver, Alan McShane
  • Patent number: 11675765
    Abstract: A system and method including determining, for a specified target measure column of a first dataset including a plurality of records, the metadata of the first dataset, including a probability distribution for the specified target column and dimension scores for the dimensions for the first dataset conditioned on the specified target measure column, where the first dataset comprises a plurality of columns including the at least one target measure column and a plurality of non-numeric, dimension columns for the records of the first dataset; determining, for a subset of data of the first dataset based on one or more specified variables, dimension scores for the dimensions of the subset of data approximately derived from the determined metadata of the first dataset; and providing recommendations of top contributors based on the approximated dimension scores of dimensions of the subset of data.
    Type: Grant
    Filed: May 25, 2021
    Date of Patent: June 13, 2023
    Assignee: Business Objects Software Ltd.
    Inventors: Ying Wu, Malte Christian Kaufmann, Alan McShane, Anirban Banerjee, Gareth Maguire
  • Publication number: 20230133856
    Abstract: Using approximated bin intervals to label the histograms provides clarity and allows for the histogram to be more intuitively understood. A dataset may comprise a plurality of records having a plurality of features including one or more continuous features. A selection of a continuous feature may be obtained. A bin width based on a number of bins and feature statistics of the continuous feature may be determined. An approximated bin interval range is determined by applying a bin mask based on the bin width to the feature statistics. An approximated bin width is determined based on the number of bins and the approximated bin interval range. Approximated bin intervals for the histogram are determined based on the approximated bin width. A histogram is generated having bins with intervals based on the approximated bin intervals.
    Type: Application
    Filed: October 29, 2021
    Publication date: May 4, 2023
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Conor White
  • Publication number: 20230113850
    Abstract: The present disclosure provides for accurate and efficient identification of candidate features. An input dataset comprising one or more continuous features and one or more categorical features is obtained. A number of categorical feature categories based on the one or more categorical features is determined. Record counts for each of the categorical feature categories are determined. Skew statistics for each category are determined based on the record counts for each of the categorical feature categories. Cardinality skew factors for each of the one or more categorical features are then determined based on the record counts and the skew statistics. A number of the one or more categorical features having the highest cardinality skew factors are selected from among the cardinality skew factors. Then, a top contributor deviation analysis is performed using the selected number of the categorical features having the highest cardinality skew factors.
    Type: Application
    Filed: October 8, 2021
    Publication date: April 13, 2023
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Alan McShane
  • Publication number: 20230010992
    Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics based on a number of occurrences of each discrete value of the discrete feature in the data, determination of first summary statistics based on the determined statistics, determination of a dissimilarity for each discrete feature based on the first summary statistics and on the statistics determined for the discrete feature, determination of candidate discrete features based on the determined dissimilarities, determination, for each of the candidate discrete features, of second summary statistics based on values of a continuous feature associated with each discrete value of the candidate discrete feature, determination of a deviation score for each of the candidate discrete features based on the second summary statistics, and transmission of the candidate discrete features for display in association with the continuous feature based on the determined deviation scores.
    Type: Application
    Filed: July 6, 2021
    Publication date: January 12, 2023
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Alan McShane, Anirban Banerjee, Mark Ahern
  • Publication number: 20220398246
    Abstract: Systems and methods include determination, for each of a plurality of discrete features, of statistics for each discrete value of the discrete feature based on values of a continuous feature associated with the discrete value, determination, for each discrete feature, of first summary statistics based on the statistics determined for each discrete value of the discrete feature, determination, for each discrete feature, of a dissimilarity based on the first summary statistics determined for the discrete feature and on the statistics determined for each discrete value of the discrete feature, determination of candidate discrete features of the discrete features based on the determined dissimilarities, the candidate discrete features comprising less than all of the discrete features, determination, for each of the candidate discrete features, of second summary statistics based on values of the continuous feature associated with each discrete value of the candidate discrete feature, determination of a devi
    Type: Application
    Filed: June 9, 2021
    Publication date: December 15, 2022
    Inventors: Paul O'Hara, Malte Christian Kaufmann, Anirban Banerjee, Ian Denver, Alan McShane
  • Publication number: 20220382906
    Abstract: A system and method including receiving numeric data of a first dataset including a plurality of columns having numeric values with one of the plurality of columns specified as a target column; generating a trained generative model based on numeric values in non-target columns of the plurality of columns; generating a trained predictive model based on numeric values in non-target columns of the plurality of columns being input variables and the target column being a target variable; generating, by the trained generative model, a new set of numeric data for the non-target columns; generating predicted target values for the non-target columns by the trained predictive model using the new set of numeric data as an input to the predictive model; and generating anonymized numeric data for the first dataset by combining the new set of numeric data and the target column populated with the generated predicted target values.
    Type: Application
    Filed: May 26, 2021
    Publication date: December 1, 2022
    Inventors: Ying Wu, Malte Christian Kaufmann
  • Publication number: 20220382729
    Abstract: A system and method including determining, for a specified target measure column of a first dataset including a plurality of records, the metadata of the first dataset, including a probability distribution for the specified target column and dimension scores for the dimensions for the first dataset conditioned on the specified target measure column, where the first dataset comprises a plurality of columns including the at least one target measure column and a plurality of non-numeric, dimension columns for the records of the first dataset; determining, for a subset of data of the first dataset based on one or more specified variables, dimension scores for the dimensions of the subset of data approximately derived from the determined metadata of the first dataset; and providing recommendations of top contributors based on the approximated dimension scores of dimensions of the subset of data.
    Type: Application
    Filed: May 25, 2021
    Publication date: December 1, 2022
    Inventors: Ying Wu, Malte Christian Kaufmann, Alan McShane, Anirban Banerjee, Gareth Maguire
  • Publication number: 20220374450
    Abstract: Systems and methods include reception of a set of data including continuous features and a discrete feature, each continuous feature associated with a plurality of values and the discrete feature associated with a plurality of discrete values, determination, for each continuous feature, of a relationship factor representing a relationship between the discrete feature and the continuous feature based on the plurality of values associated with the continuous feature and the plurality of discrete values, identification of one of the continuous features associated with a largest one of the determined relationship factors, generation, for each of the other features, of a correlation factor representing a correlation between the continuous feature and the identified continuous feature, determination, for each of the continuous features other than the identified continuous feature, of a composite relationship score based on the relationship factor and the correlation factor associated with the feature, and presentation of a visualization associated wi
    Type: Application
    Filed: May 19, 2021
    Publication date: November 24, 2022
    Inventors: Paul O'Hara, Ying Wu, Jiazheng Li, Cathal McGovern, Malte Christian Kaufmann, Esther Rodrigo Ortiz, Kerry O'Connor, Michael Golden, Satinder Singh
  • Publication number: 20220374765
    Abstract: Systems and methods include reception of a set of data, the set of data comprising a plurality of features, building, for each of a plurality of subsets of the plurality of features, a dimension reduction model based on the subset of features and associated values of the set of data, and, for each dimension reduction model, determination of a weight associated with each of the subset of features based on the dimension reduction model, identification of a predetermined number of features associated with the highest weights, and generation, for each dimension reduction model, of a data structure comprising the predetermined number of features and the weight associated with each of the predetermined number of features. A plurality of top features are determined based on the plurality of data structures, and a supervised learning model is trained based on the plurality of top features of the set of data.
    Type: Application
    Filed: May 24, 2021
    Publication date: November 24, 2022
    Inventors: Ying Wu, Jiazheng Li, Paul O'Hara, Malte Christian Kaufmann
  • Publication number: 20220357920
    Abstract: Systems and methods include reception of data including a plurality of continuous features and a first discrete feature, each of the plurality of continuous features associated with a plurality of values and the first discrete feature associated with a plurality of discrete values, determination of an overall output value of a function based on the plurality of values associated with each of the plurality of continuous features, determination, for each discrete value of the plurality of discrete values, of an output value of the function based on ones of the plurality of values associated with the discrete value, scaling of the output value determined for each discrete value based on the determined output values and the overall output value, and presentation of the scaled output values.
    Type: Application
    Filed: May 6, 2021
    Publication date: November 10, 2022
    Inventors: Paul O'Hara, Malte Christian Kaufmann
  • Patent number: 11475021
    Abstract: The present disclosure involves systems, software, and computer implemented methods for ranking time dimensions. One example method includes receiving a request for an insight analysis for a dataset that includes a value dimension and a set of multiple date dimensions. Each date dimension is converted into a time series and a value quality factor is determined for each time series that represents a level of data quality for the time series. A time series informative factor is determined for each time series that represents how informative the time series is within a specified time window. An insight score is determined, for each time dimension, based on the determined value quality factors and the determined time series informative factors. The insight score for the time dimension is provided, for at least some of the time dimensions.
    Type: Grant
    Filed: May 18, 2020
    Date of Patent: October 18, 2022
    Assignee: Business Objects Software Ltd.
    Inventors: Ying Wu, Paul O'Connor, Esther Rodrigo Ortiz, Artur Stulka, Mateusz Lewandowski, Paul Sheedy, Mairtin Keane, Paul O'Hara, Malte Christian Kaufmann, Robert McGrath
  • Publication number: 20210374770
    Abstract: The present disclosure involves systems, software, and computer implemented methods for measuring successful interactions with an insight tool. One example method includes receiving a request for insights for a data point of a data visualization. Insights for the data point are identified and presented in an insights interface in a user session. User interactions with the insights interface are tracked during the user session. A determination is made that the user session has completed. At least one insights success rule is identified for determining whether user sessions with the insights interface are successful. The one or more insights success rules are evaluated to determine whether the user session was successful. In response to determining that the user session was successful, a measure of success for the user session is recorded. In response to determining that the user session was unsuccessful, a measure of failure is recorded for the user session.
    Type: Application
    Filed: June 2, 2020
    Publication date: December 2, 2021
    Inventors: Anirban Banerjee, Robert McGrath, Malte Christian Kaufmann, Eoin Goslin, Esther Rodrigo Ortiz
  • Publication number: 20210365471
    Abstract: The present disclosure involves systems, software, and computer implemented methods for generating insights based on numeric and categorical data. One example method includes receiving a request for an insight analysis for a dataset that includes at least one continuous feature and at least one categorical feature. Continuous features can have any value within a range of numerical values and categorical features are enumerated features that can have a value from a predefined set of values. A selection of a first continuous feature for analysis is received, and at least one categorical feature is identified for analysis. A deviation factor and a relationship factor are determined for each identified categorical feature. An insight score is determined for each identified categorical feature that combines the deviation factor and the relationship factor for the categorical feature. The insight score is provided for at least some of the identified categorical features.
    Type: Application
    Filed: May 19, 2020
    Publication date: November 25, 2021
    Inventors: Paul O'Hara, Robert McGrath, Ying Wu, Shekhar Chhabra, Eoin Goslin, Pat Connaughton, John Bowden, Alan Maher, David Hutchinson, Leanne Long, Malte Christian Kaufmann, Pukhraj Saxena, Priti Mulchandani, Anirban Banerjee
  • Publication number: 20210357417
    Abstract: The present disclosure involves systems, software, and computer implemented methods for ranking time dimensions. One example method includes receiving a request for an insight analysis for a dataset that includes a value dimension and a set of multiple date dimensions. Each date dimension is converted into a time series and a value quality factor is determined for each time series that represents a level of data quality for the time series. A time series informative factor is determined for each time series that represents how informative the time series is within a specified time window. An insight score is determined, for each time dimension, based on the determined value quality factors and the determined time series informative factors. The insight score for the time dimension is provided, for at least some of the time dimensions.
    Type: Application
    Filed: May 18, 2020
    Publication date: November 18, 2021
    Inventors: Ying Wu, Paul O'Connor, Esther Rodrigo Ortiz, Artur Stulka, Mateusz Lewandowski, Paul Sheedy, Mairtin Keane, Paul O'Hara, Malte Christian Kaufmann, Robert McGrath
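
The approximated bin interval abstract listed above (patent number 11734864 and its related publications) describes a sequence of steps: a raw bin width from the feature statistics, a bin mask applied to those statistics, an approximated range, an approximated bin width, and finally the bin intervals. The sketch below is one possible reading of those steps, not the patented method. The abstract does not define the bin mask, so the code assumes it is the power of ten at or below the raw bin width, and it assumes the feature statistics are the minimum and maximum. The function names `approximated_bin_edges` and `histogram_counts` are hypothetical.

```python
import math


def approximated_bin_edges(values, n_bins):
    """Return n_bins + 1 rounded bin edges covering all values.

    Assumption: the "bin mask" is modeled as a power-of-ten rounding
    granularity; the abstract does not say how the mask is built.
    """
    lo, hi = min(values), max(values)
    if n_bins < 1 or hi == lo:
        raise ValueError("need n_bins >= 1 and a non-constant feature")

    # Step 1: raw bin width from the number of bins and the min/max.
    raw_width = (hi - lo) / n_bins

    # Step 2 (assumed): mask = largest power of ten <= raw bin width.
    mask = 10.0 ** math.floor(math.log10(raw_width))

    # Step 3: approximated range, snapping min down and max up to the mask.
    approx_lo = math.floor(lo / mask) * mask
    approx_hi = math.ceil(hi / mask) * mask

    # Step 4: approximated bin width over the approximated range.
    approx_width = (approx_hi - approx_lo) / n_bins

    # Step 5: approximated bin intervals, expressed as edges.
    return [approx_lo + i * approx_width for i in range(n_bins + 1)]


def histogram_counts(values, edges):
    """Count values per bin; the last bin includes its right edge."""
    width = edges[1] - edges[0]
    counts = [0] * (len(edges) - 1)
    for v in values:
        idx = min(int((v - edges[0]) / width), len(counts) - 1)
        counts[idx] += 1
    return counts


sample = [3.7, 12.2, 48.9, 96.4]
edges = approximated_bin_edges(sample, 5)
counts = histogram_counts(sample, edges)
```

With the sample above, the raw range 3.7 to 96.4 is widened to 0 to 100, so the five bins are labeled in steps of 20 rather than 18.54, which is the readability benefit the abstract points to.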