Patents by Inventor Gary Shon Katzenberger
Gary Shon Katzenberger has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12153900Abstract: Sparse data handling and/or buffer sharing are implemented. Data may be buffered in reusable buffer arrays. Data may comprise fixed or variable length vectors, which may be represented as sparse or dense vectors in a values array and indices array. Data may be materialized from a dataview comprising a non-materialized view of data in a machine-learning pipeline by cursoring over rows of the dataview and calling delegate functions to compute data for rows in an active column. A buffer and/or its set of arrays storing a first vector may be reused for a second and additional vectors, for example, when the length of buffer arrays is equal to or greater than the length of the second and additional vectors, which may be selectively stored as sparse or dense vectors to fit the array set. Shared buffers may be passed as references between delegate functions for reuse.Type: GrantFiled: October 31, 2019Date of Patent: November 26, 2024Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Gary Shon Katzenberger, Petro Luferenko, Costin I. Eseanu, Eric Anthony Erhardt, Ivan Matantsev
-
Patent number: 11609746Abstract: Methods, systems, and computer products are herein provided for lazy evaluation of input data by a machine learning (ML) framework. An ML pipeline receives input data and compiles a chain of operators into a chain of dataviews configured for lazy evaluation of the input data. Each dataview in the chain represents a computation over data as a non-materialized view of the data. The ML pipeline receives a request for column data and selects a chain of delegates comprising one or more delegates for one or more dataviews in the chain to fulfill the request. The ML pipeline processes the input data with the selected chain of delegates. The ML pipeline performs delegate chaining on a dataview. A feature value for a feature column of the dataview is determined based on the delegate chaining and provided to an ML algorithm to predict column data.Type: GrantFiled: October 23, 2019Date of Patent: March 21, 2023Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Gary Shon Katzenberger, Thomas William Finley, Pete Luferenko, Mohammad Zeeshan Siddiqui, Costin Eseanu, Eric Anthony Erhardt, Yael Dekel, Ivan Matantsev
-
Patent number: 10977006Abstract: Embodiments provide a machine learning framework that enables developers to author and deploy machine learning pipelines into their applications regardless of the programming language in which the applications are structured. The framework may provide a programming language-specific API that enables the application to call a plurality of operators provided by the framework. The framework provides any number of APIs, each for a different programming language. The pipeline generated via the application is represented as an execution graph comprising node(s), where each node represents a particular operator. When a pipeline is submitted for execution, calls to the operators are detected, and nodes corresponding to the operators are generated for the execution graph.Type: GrantFiled: October 10, 2019Date of Patent: April 13, 2021Assignee: Microsoft Technology Licensing, LLCInventors: Gary Shon Katzenberger, Thomas William Finley, Petro Luferenko, Mohammad Zeeshan Siddiqui, Costin I. Eseanu, Eric Anthony Erhardt, Yael Dekel, Ivan Matantsev
-
Publication number: 20200349469Abstract: An efficient, streaming-based, lazily-evaluated machine learning (ML) framework is provided. An ML pipeline of operators produce and consume a chain of dataviews representing a computation over data. Non-materialized (e.g., virtual) views of data in dataviews permit efficient, lazy evaluation of data on demand regardless of size (e.g., in excess of main memory). Data may be materialized by DataView cursors (e.g., movable windows over rows of an input dataset or DataView). Computation and data movement may be limited to rows for active columns without processing or materializing unnecessary data. A chain of dataviews may comprise a chain of delegates that reference a chain of functions. Assembled pipelines of schematized compositions of operators may be validated and optimized with efficient execution plans. A compiled chain of functions may be optimized and executed in a single call. Dataview based ML pipelines may be developed, trained, evaluated and integrated into applications.Type: ApplicationFiled: October 23, 2019Publication date: November 5, 2020Inventors: Gary Shon Katzenberger, Thomas William Finley, Pete Luferenko, Mohammad Zeeshan Siddiqui, Costin Eseanu, Eric Anthony Erhardt, Yael Dekel, Ivan Matantsev
-
Publication number: 20200348912Abstract: Embodiments provide a machine learning framework that enables developers to author and deploy machine learning pipelines into their applications regardless of the programming language in which the applications are structured. The framework may provide a programming language-specific API that enables the application to call a plurality of operators provided by the framework. The framework provides any number of APIs, each for a different programming language. The pipeline generated via the application is represented as an execution graph comprising node(s), where each node represents a particular operator. When a pipeline is submitted for execution, calls to the operators are detected, and nodes corresponding to the operators are generated for the execution graph.Type: ApplicationFiled: October 10, 2019Publication date: November 5, 2020Inventors: Gary Shon Katzenberger, Thomas William Finley, Petro Luferenko, Mohammad Zeeshan Siddiqui, Costin I. Eseanu, Eric Anthony Erhardt, Yael Dekel, Ivan Matantsev
-
Publication number: 20200348990Abstract: Sparse data handling and/or buffer sharing are implemented. Data may be buffered in reusable buffer arrays. Data may comprise fixed or variable length vectors, which may be represented as sparse or dense vectors in a values array and indices array. Data may be materialized from a dataview comprising a non-materialized view of data in a machine-learning pipeline by cursoring over rows of the dataview and calling delegate functions to compute data for rows in an active column. A buffer and/or its set of arrays storing a first vector may be reused for a second and additional vectors, for example, when the length of buffer arrays is equal to or greater than the length of the second and additional vectors, which may be selectively stored as sparse or dense vectors to fit the array set. Shared buffers may be passed as references between delegate functions for reuse.Type: ApplicationFiled: October 31, 2019Publication date: November 5, 2020Inventors: Gary Shon Katzenberger, Petro Luferenko, Costin I. Eseanu, Eric Anthony Erhardt, Ivan Matantsev
-
Patent number: 9953069Abstract: A business intelligence (BI) document preserves references to identities and formats of remote data sources and allows a local computing device to offload analytical operations to remote data sources. The BI document specifies a graph of entities connected by directed edges from the output of one entity to an input of another entity. An entity, for example, can represent without limitation a data structure, an external data source, a control element, an external event source, a visualization, or an update service. The entities of a BI document at a local computing device can reference data at an original data source—rather than extracting data from the original data source to a preferred local datastore. An entity of the BI document can direct a remote data source to execute transformations on the remote data before returning a solution to the local computing device.Type: GrantFiled: May 26, 2015Date of Patent: April 24, 2018Assignee: Microsoft Technology Licensing, LLCInventors: Vijay Mital, Darryl Rubin, David George Green, Gary Shon Katzenberger, Olivier Colle, Suraj Poozhiyil
-
Patent number: 9864966Abstract: A business intelligence document provides functionality for testing a hypothesis on aggregated data in a business intelligence document (e.g., a spreadsheet-like document), wherein one or more of the input data values and transformation properties are designated as constrained (e.g., invariant or constrained within a range, set, enumeration, or domain). The hypothesis, which is articulated as a data mining assertion, is input through the user interface of the business intelligence document (e.g., via an expression interface or properties of a row, column, or cell) and solved over the aggregated data. The solution is then presented through the user interface of the spreadsheet-like document, such as in a table, graph, histogram, etc.Type: GrantFiled: July 8, 2015Date of Patent: January 9, 2018Assignee: Microsoft Technology Licensing, LLCInventors: Vijay Mital, Gary Shon Katzenberger, Darryl Rubin, David George Green
-
Publication number: 20170300461Abstract: Techniques for representing and publishing an interactive document useful for analyzing data. The document may be represented as a directed acyclic graph of entities interconnected by edges. The entities may be of multiple types. Yet, a broad range of interactive documents may be represented by a limited number of types of entities and the capabilities to interconnect entities of different types and to share a data schema across entities of different types. A tool may enable a user to author such documents. The tool may also facilitate publishing of the document. For publishing, the document may be converted to an executable form. Prior to such a conversion, the graph may be modified for more efficient processing. The graph may also be partitioned such that portions of the graph, when distributed across tiers of a computing system, such as a cloud-based platform, execute on computing devices that provide efficient operation.Type: ApplicationFiled: April 13, 2016Publication date: October 19, 2017Applicant: Microsoft Technology Licensing, LLCInventors: Vijay Mital, Darryl Ellis Rubin, David G. Green, Suraj T. Poozhiyil, Gary Shon Katzenberger, Olivier Colle, Brian C. Beckman, Krasimir A. Aleksandrov, Andrew D. Reddish
-
Patent number: 9336184Abstract: Techniques for representing and publishing an interactive document useful for analyzing data. The document may be represented as a directed acyclic graph of entities interconnected by edges. The entities may be of multiple types. Yet, a broad range of interactive documents may be represented by a limited number of types of entities and the capabilities to interconnect entities of different types and to share a data schema across entities of different types. A tool may enable a user to author such documents. The tool may also facilitate publishing of the document. For publishing, the document may be converted to an executable form. Prior to such a conversion, the graph may be modified for more efficient processing. The graph may also be partitioned such that portions of the graph, when distributed across tiers of a computing system, such as a cloud-based platform, execute on computing devices that provide efficient operation.Type: GrantFiled: December 17, 2010Date of Patent: May 10, 2016Assignee: Microsoft Technology Licensing, LLCInventors: Vijay Mital, Darryl Ellis Rubin, David G. Green, Suraj T. Poozhiyil, Gary Shon Katzenberger, Olivier Colle, Brian C. Beckman, Krasimir A. Aleksandrov, Andrew D. Reddish
-
Patent number: 9304672Abstract: Techniques for representing and publishing an interactive document useful for analyzing data. The document may be represented as a directed acyclic graph of entities interconnected by edges. The entities may be of multiple types. Yet, a broad range of interactive documents may be represented by a limited number of types of entities and the capabilities to interconnect entities of different types and to share a data schema across entities of different types. A tool may enable a user to author such documents. The tool may also facilitate publishing of the document. For publishing, the document may be converted to an executable form. Prior to such a conversion, the graph may be modified for more efficient processing. The graph may also be partitioned such that portions of the graph, when distributed across tiers of a computing system, such as a cloud-based platform, execute on computing devices that provide efficient operation.Type: GrantFiled: December 17, 2010Date of Patent: April 5, 2016Assignee: Microsoft Technology Licensing, LLCInventors: Vijay Mital, Gary Shon Katzenberger, Darryl Ellis Rubin, Andrew D. Reddish, Brian C. Beckman, Olivier Colle
-
Publication number: 20150379108Abstract: A business intelligence document provides functionality for testing a hypothesis on aggregated data in a business intelligence document (e.g., a spreadsheet-like document), wherein one or more of the input data values and transformation properties are designated as constrained (e.g., invariant or constrained within a range, set, enumeration, or domain). The hypothesis, which is articulated as a data mining assertion, is input through the user interface of the business intelligence document (e.g., via an expression interface or properties of a row, column, or cell) and solved over the aggregated data. The solution is then presented through the user interface of the spreadsheet-like document, such as in a table, graph, histogram, etc.Type: ApplicationFiled: July 8, 2015Publication date: December 31, 2015Applicant: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Vijay Mital, Gary Shon Katzenberger, Darryl Rubin, David George Green
-
Publication number: 20150331918Abstract: A business intelligence (BI) document preserves references to identities and formats of remote data sources and allows a local computing device to offload analytical operations to remote data sources. The BI document specifies a graph of entities connected by directed edges from the output of one entity to an input of another entity. An entity, for example, can represent without limitation a data structure, an external data source, a control element, an external event source, a visualization, or an update service. The entities of a BI document at a local computing device can reference data at an original data source—rather than extracting data from the original data source to a preferred local datastore. An entity of the BI document can direct a remote data source to execute transformations on the remote data before returning a solution to the local computing device.Type: ApplicationFiled: May 26, 2015Publication date: November 19, 2015Applicant: MICROSOFT TECHNOLOGY LICENSING, LLPInventors: Vijay Mital, Darryl Rubin, David George Green, Gary Shon Katzenberger, Olivier Colle, Suraj Poozhiyil
-
Patent number: 9110957Abstract: A business intelligence document provides functionality for testing a hypothesis on aggregated data in a business intelligence document (e.g., a spreadsheet-like document), wherein one or more of the input data values and transformation properties are designated as constrained (e.g., invariant or constrained within a range, set, enumeration, or domain). The hypothesis, which is articulated as a data mining assertion, is input through the user interface of the business intelligence document (e.g., via an expression interface or properties of a row, column, or cell) and solved over the aggregated data. The solution is then presented through the user interface of the spreadsheet-like document, such as in a table, graph, histogram, etc.Type: GrantFiled: December 17, 2010Date of Patent: August 18, 2015Assignee: Microsoft Technology Licensing, LLCInventors: Vijay Mital, Gary Shon Katzenberger, Darryl Rubin, David George Green
-
Patent number: 9069557Abstract: A business intelligence (BI) document preserves references to identities and formats of remote data sources and allows a local computing device to offload analytical operations to remote data sources. The BI document specifies a graph of entities connected by directed edges from the output of one entity to an input of another entity. An entity, for example, can represent without limitation a data structure, an external data source, a control element, an external event source, a visualization, or an update service. The entities of a BI document at a local computing device can reference data at an original data source—rather than extracting data from the original data source to a preferred local datastore. An entity of the BI document can direct a remote data source to execute transformations on the remote data before returning a solution to the local computing device.Type: GrantFiled: December 17, 2010Date of Patent: June 30, 2015Assignee: Microsoft Technology Licensing, LLPInventors: Vijay Mital, Darryl Rubin, David George Green, Gary Shon Katzenberger, Olivier Colle, Suraj Poozhiyil
-
Patent number: 8566261Abstract: An interactive recommendation system generates one or more recommendations (e.g., recommended products, travel destinations, etc.) for a user based on a recommendation model. The recommendation model includes one or more criteria that are used to analyze a datastore of user characteristics (e.g., a user's age, location, past online behavior, etc.) and generate one or more recommendations based thereon. The interactive recommendation system further presents a user interface that allows the user to interactively modify the criteria of the recommendation model and to apply the modified recommendation model to the datastore in order to generate one or more modified recommendations. In this manner, for example, the user can customize the recommendations he or she receives by interacting with the recommendation system to modify the recommendation model used to generate such recommendations.Type: GrantFiled: December 17, 2010Date of Patent: October 22, 2013Assignee: Microsoft CorporationInventors: Vijay Mital, R. Donald Thompson, III, Robert Povey, Gary Shon Katzenberger
-
Patent number: 8458678Abstract: A compiler supporting a language in which selected semantic objects are represented as data objects. The data objects may be used in multiple ways to expand the capabilities of the programming language. Data objects may be passed to applications and used to create executable instructions for that application. In this way, instructions written in the native language of the compiler may be used to control applications that accept programs in a language inconsistent with the native language of the compiler. The syntax checking and variable binding capabilities of the compiler may be used for those instructions that will be executed by an application separate from the object code generated by the compiler. The semantic objects represented as data objects may be selected based on express operations included in the source code or may be based on implicit type conversion.Type: GrantFiled: June 16, 2011Date of Patent: June 4, 2013Assignee: Microsoft CorporationInventors: Henricus Johannes Maria Meijer, Anders Hejlsberg, Matthew Warren, Dinesh Chandrakant Kulkarni, Luca Bolognese, Peter A. Hallam, Gary Shon Katzenberger, Donald F. Box
-
Publication number: 20120159465Abstract: A business intelligence (BI) document preserves references to identities and formats of remote data sources and allows a local computing device to offload analytical operations to remote data sources. The BI document specifies a graph of entities connected by directed edges from the output of one entity to an input of another entity. An entity, for example, can represent without limitation a data structure, an external data source, a control element, an external event source, a visualization, or an update service. The entities of a BI document at a local computing device can reference data at an original data source—rather than extracting data from the original data source to a preferred local datastore. An entity of the BI document can direct a remote data source to execute transformations on the remote data before returning a solution to the local computing device.Type: ApplicationFiled: December 17, 2010Publication date: June 21, 2012Applicant: MICROSOFT CORPORATIONInventors: Vijay Mital, Darryl Rubin, David George Green, Gary Shon Katzenberger, Olivier Colle, Suraj Poozhiyil
-
Publication number: 20120159312Abstract: Techniques for representing and publishing an interactive document useful for analyzing data. The document may be represented as a directed acyclic graph of entities interconnected by edges. The entities may be of multiple types. Yet, a broad range of interactive documents may be represented by a limited number of types of entities and the capabilities to interconnect entities of different types and to share a data schema across entities of different types. A tool may enable a user to author such documents. The tool may also facilitate publishing of the document. For publishing, the document may be converted to an executable form. Prior to such a conversion, the graph may be modified for more efficient processing. The graph may also be partitioned such that portions of the graph, when distributed across tiers of a computing system, such as a cloud-based platform, execute on computing devices that provide efficient operation.Type: ApplicationFiled: December 17, 2010Publication date: June 21, 2012Applicant: Microsoft CorporationInventors: Vijay Mital, Darryl Ellis Rubin, David G. Green, Suraj T. Poozhiyil, Gary Shon Katzenberger, Olivier Colle, Brian C. Beckman, Krasimir A. Aleksandrov, Andrew D. Reddish
-
Publication number: 20120158643Abstract: A business intelligence document provides functionality for testing a hypothesis on aggregated data in a business intelligence document (e.g., a spreadsheet-like document), wherein one or more of the input data values and transformation properties are designated as constrained (e.g., invariant or constrained within a range, set, enumeration, or domain). The hypothesis, which is articulated as a data mining assertion, is input through the user interface of the business intelligence document (e.g., via an expression interface or properties of a row, column, or cell) and solved over the aggregated data. The solution is then presented through the user interface of the spreadsheet-like document, such as in a table, graph, histogram, etc.Type: ApplicationFiled: December 17, 2010Publication date: June 21, 2012Applicant: MICROSOFT CORPORATIONInventors: Vijay Mital, Gary Shon Katzenberger, Darryl Rubin, David George Green