Patents by Inventor C. James MacLennan
C. James MacLennan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10387126Abstract: An intermediate representation of a workflow that comprises software functions may be generated to efficiently perform data marshalling. The workflow is analyzed, including identifying that a first software function is implemented in a first language, a second software function is implemented in a second language, and a third software function is not explicitly implemented in an implementation language. Factors associated with the software functions are analyzed, including implementation languages of the software functions. Based on the analysis of the factors, an implementation language is assigned to the third software function that comprises either the first or second language. Based on the analysis of the workflow, an intermediate representation of the workflow is generated that represents each of the plurality of software functions using declarative language.Type: GrantFiled: June 30, 2017Date of Patent: August 20, 2019Assignee: Microsoft Technology Licensing, LLCInventors: C. James MacLennan, Andy J. Linfoot
-
Publication number: 20190004776Abstract: An intermediate representation of a workflow that comprises software functions may be generated to efficiently perform data marshalling. The workflow is analyzed, including identifying that a first software function is implemented in a first language, a second software function is implemented in a second language, and a third software function is not explicitly implemented in an implementation language. Factors associated with the software functions are analyzed, including implementation languages of the software functions. Based on the analysis of the factors, an implementation language is assigned to the third software function that comprises either the first or second language. Based on the analysis of the workflow, an intermediate representation of the workflow is generated that represents each of the plurality of software functions using declarative language.Type: ApplicationFiled: June 30, 2017Publication date: January 3, 2019Inventors: C. James MACLENNAN, Andy J. LINFOOT
-
Publication number: 20140046879Abstract: The subject technology discloses configurations for creating reusable predictive models for applying to one or more data sources. The subject technology specifies a business problem to determine a probability of an event occurring. The business problem may include a constraint. A data source is selected for a predictive model associated with a predictive algorithm in which the predictive model includes one or more queries and parameters. A set of transformations are then determined based on the queries and parameters for at least a subset of data from the data source to be processed by the predictive algorithm. The subject technology identifies a set of patterns based on the set of transformations for at least the subset of data from the data source. A trained predictive model is then provided including the determined set of patterns, the set of transformations, and the associated predictive algorithm for solving the specified business problem.Type: ApplicationFiled: August 13, 2013Publication date: February 13, 2014Applicant: Predixion Software, Inc.Inventors: C. James MACLENNAN, Ioan Bogdan CRIVAT
-
Publication number: 20130117650Abstract: A method and system that generate reproducible reports describing one or more analytical functions are disclosed. The reports describe a sequence of analytical functions and allow subsequent executions of the sequence of analytical functions. The matrix space that is inherent in worksheets is used to record a sequence of operations as a tabular report that can be interpreted by a computer program.Type: ApplicationFiled: March 27, 2012Publication date: May 9, 2013Inventors: C. James MacLennan, Ioan Bogdan Crivat
-
Patent number: 8260738Abstract: Described is time-weighted blending of the results of time series algorithms in a manner that changes their relative weights based on the prediction time. The prediction values from each algorithm are mathematically blended into a forecast result corresponding to the desired time of prediction. In this manner, an ARTXP algorithm that provides accurate near term predictions is given more weight than an ARIMA for near term predictions, and less relative weight for long term predictions. An example exponential function to compute the relative weights is described; the function corresponds to a curve having a control variable for the slope and the start of the curve, and constant coefficients, with the weights based (in part) on the prediction time. A user-provided parameter may also affect the relative weights used in the blending result.Type: GrantFiled: June 27, 2008Date of Patent: September 4, 2012Assignee: Microsoft CorporationInventors: C. James MacLennan, Shuvro Mitra
-
Patent number: 7930322Abstract: Various technologies and techniques are disclosed for text based schema discovery and information extraction. Documents are analyzed to identify sections of the documents and a relationship between the sections. Statistics are stored regarding occurrences of items in the documents. A probabilistic model is generated based on the stored statistics. A database schema is generated with a plurality of tables based upon the probabilistic model. The documents are analyzed against the probabilistic model to determine how the documents map to the tables generated from the database schema. The tables are populated from the documents based on a result of the analysis against the probabilistic model.Type: GrantFiled: May 27, 2008Date of Patent: April 19, 2011Assignee: Microsoft CorporationInventor: C. James MacLennan
-
Patent number: 7886289Abstract: Systems and methods that supply extensibility mechanisms for analysis services, via a plug-in component that enables additional functionalities. The plug-in component provide additional custom logic for the analysis services unified dimensional model (UDM). Accordingly, server functionalities can be extended in an agile manner, and without a requirement for a new release, for example.Type: GrantFiled: March 20, 2007Date of Patent: February 8, 2011Assignee: Microsoft CorporationInventors: Thulusalamatom K. Anand, Paul J. Sanders, Richard R. Tkachuk, Cristian Petculescu, Chu Xu, Akshai M. Mirchandani, Valeri Kim, Andriy Garbuzov, C. James MacLennan, Marius Dumitru, Ioan Bogdan Crivat
-
Patent number: 7797264Abstract: Data expressed as tabular data having columns and rows can be analyzed and data determined to be an exception can be flagged. In addition, reasons for flagging such data as exceptions can be presented to a user to facilitate further analysis and action on the data. A predictive analysis component can utilize a clustering algorithm with predictive capabilities to autonomously analyze the data. Periodic re-analysis of the data can be performed to determine if exceptions have changed based on new or modified data.Type: GrantFiled: February 2, 2007Date of Patent: September 14, 2010Assignee: Microsoft CorporationInventors: Raman S. Iyer, C. James MacLennan, Ioan Bogdan Crivat
-
Patent number: 7797356Abstract: Fields contained in data expressed as tabular data having columns and rows can initially be marked as exceptions, wherein a column within a row can be the potential cause of the exception. A user configurable parameter can be utilized to change the sensitivity or allowable exceptions for each row and/or column, to increase or decrease the number of exceptions detected. As data within each field are modified, added or deleted, or when the configurable parameter is changed, the exceptions marked can be automatically updated. Such updated exceptions can be the same or different from the initially marked exceptions. As such, a user can evaluate data and determine whether various changes within the data will change various outcomes.Type: GrantFiled: February 2, 2007Date of Patent: September 14, 2010Assignee: Microsoft CorporationInventors: Raman S. Iyer, C. James MacLennan, Ioan Bogdan Crivat
-
Patent number: 7788200Abstract: Seeking goals in data that can be expressed as rows and columns is provided through predictive analytics. If a desired goal is achievable, the changes to the rows and/or columns that can achieve the goal are presented to a user. If the desired goal is not achievable, an error message or other indicator can be presented to the user. Predictive analytics can include a predictive algorithm, various data mining techniques, or other predictive techniques. A confidence metric of a goal-seek result can be normalized to estimate the degree of confidence that a particular change will yield the desired outcome.Type: GrantFiled: February 2, 2007Date of Patent: August 31, 2010Assignee: Microsoft CorporationInventors: Ioan Bogdan Crivat, C. James MacLennan, Raman S. Iyer
-
Patent number: 7756881Abstract: A system that effectuates fetching a complete set of relational data into a mining services server and subsequently defining desired partitions upon the fetched data is provided. In accordance with the innovation, the data can be locally cached and partitioned therefrom. Accordingly, upon the same mining structure (e.g., cache) that has been partitioned, the novel innovation can build mining models for each partition. In other words, the innovation can employ the concept of mining structure as a data cache while manipulating only partitions of this cache in certain operations. The innovation can be employed in scenarios where a user wants to train a mining model using only data points that satisfy a particular Boolean condition, a user wants to split the training set into multiple partitions (e.g., training/testing) and/or a user wants to perform a data mining procedure known as “N-fold cross validation.Type: GrantFiled: March 9, 2006Date of Patent: July 13, 2010Assignee: Microsoft CorporationInventors: Ioan Bogdan Crivat, Raman S. Iyer, C. James MacLennan
-
Patent number: 7747641Abstract: The subject invention relates to systems and methods to extend the capabilities of declarative data modeling languages. In one aspect, a declarative data modeling language system is provided. The system includes a data modeling language component that generates one or more data mining models to extract predictive information from local or remote databases. A language extension component facilitates modeling capability in the data modeling language by providing a data sequence model or a time series model within the data modeling language to support various data mining applications.Type: GrantFiled: April 28, 2005Date of Patent: June 29, 2010Assignee: Microsoft CorporationInventors: Pyungchul Kim, C. James MacLennan, ZhaoHui Tang
-
Patent number: 7689703Abstract: The subject invention relates to systems and methods that extend the network data access capabilities of mark-up language protocols. In one aspect, a network data transfer system is provided. The system includes a protocol component that employs a computerized mark-up language to facilitate data interactions between network components, whereby the data interactions were previously limited or based on a statement command associated with the markup language. An extension component operates with the protocol component to support the data transactions, where the extension component supplies at least one other command from the statement command to facilitate the data interactions.Type: GrantFiled: March 1, 2005Date of Patent: March 30, 2010Assignee: Microsoft CorporationInventors: Mosha Pasumansky, Marius Dumitru, Adrian Dumitrascu, Cristian Petculescu, Akshai M. Mirchandani, Paul J. Sanders, Thulusalamatom Krishnamurthi Anand, Richard R. Tkachuk, Raman S. Iyer, Thomas P. Conlon, Alexander Berger, Sergei Gringauze, Ioan Bogdan Crivat, C. James MacLennan, Rong J. Guan
-
Publication number: 20090327206Abstract: Described is time-weighted blending of the results of time series algorithms in a manner that changes their relative weights based on the prediction time. The prediction values from each algorithm are mathematically blended into a forecast result corresponding to the desired time of prediction. In this manner, an ARTXP algorithm that provides accurate near term predictions is given more weight than an ARIMA for near term predictions, and less relative weight for long term predictions. An example exponential function to compute the relative weights is described; the function corresponds to a curve having a control variable for the slope and the start of the curve, and constant coefficients, with the weights based (in part) on the prediction time. A user-provided parameter may also affect the relative weights used in the blending result.Type: ApplicationFiled: June 27, 2008Publication date: December 31, 2009Applicant: MICROSOFT CORPORATIONInventors: C. James MacLennan, Shuvro Mitra
-
Publication number: 20090319330Abstract: Various technologies and techniques are disclosed for calculating and evaluating the behavior of recommendation systems. Accuracy measures are computed for a plurality of items in a real recommendation system, an ideal recommendation system, and a popularity-based baseline recommendation system. The accuracy measures for the plurality of items are presented to a user so the user can evaluate a performance of the real recommendation system in comparison to the ideal recommendation system and the popularity-based baseline recommendation system. The accuracy measures can be presented in an interactive graph.Type: ApplicationFiled: June 18, 2008Publication date: December 24, 2009Applicant: MICROSOFT CORPORATIONInventors: Ioan Bogdan Crivat, C. James MacLennan, Yue Liu, Michael D. Moore
-
Patent number: 7636698Abstract: Architecture for analyzing pattern shifts in data patterns of data mining models and outputting the results. This allows comparing and describing differences between two semantically similar sets of patterns (or mining models), and for analyzing historical changes in versions of the same model or differences in patterns found by two or more different algorithms applied to the same data. The architecture can also facilitate explaining data patterns that shift over time and over different data populations, and between versions of the same model that use different algorithms. A model component is employed for storing data mining models have respective sets of data patterns obtained from a dataset, and an analysis component analyzes the sets of the data patterns for difference data therebetween. The dataset can be a subsample of a larger set of data and can be analyzed by the analysis component over a time period.Type: GrantFiled: March 16, 2006Date of Patent: December 22, 2009Assignee: Microsoft CorporationInventors: Ioan Bogdan Crivat, Elena D. Cristofor, C. James MacLennan
-
Publication number: 20090300043Abstract: Various technologies and techniques are disclosed for text based schema discovery and information extraction. Documents are analyzed to identify sections of the documents and a relationship between the sections. Statistics are stored regarding occurrences of items in the documents. A probabilistic model is generated based on the stored statistics. A database schema is generated with a plurality of tables based upon the probabilistic model. The documents are analyzed against the probabilistic model to determine how the documents map to the tables generated from the database schema. The tables are populated from the documents based on a result of the analysis against the probabilistic model.Type: ApplicationFiled: May 27, 2008Publication date: December 3, 2009Applicant: MICROSOFT CORPORATIONInventor: C. James MacLennan
-
Patent number: 7627555Abstract: A language schema that integrates multidimensional extensions (e.g., MDX) and data mining extensions (e.g., DMX) for performing data mining operations on data residing in OLAP cubes. The schema provides that the <source-data-query> can not only be a relational query, rather a multidimensional query formed using MDX, for example. The operations of model creation, training and prediction are described.Type: GrantFiled: June 22, 2004Date of Patent: December 1, 2009Assignee: Microsoft CorporationInventors: C. James MacLennan, Pyungchul Kim, ZhaoHui Tang
-
Patent number: 7593927Abstract: A standard mechanism for directly accessing unstructured data types (e.g., image, audio, video, gene sequencing and text data) in accordance with data mining operations is provided. The subject innovation can enable access to unstructured data directly from within the data mining engine or tool. Accordingly, the innovation enables multiple vendors to provide algorithms for mining unstructured data on a data mining platform (e.g., an SQL-brand server), thereby increasing adoption. As well, the subject innovation allows users to directly mine unstructured data that is not fixed-length, without pre-processing and tokenizing the data external to the data mining engine. In accordance therewith, the innovation can provide a mechanism to expand declarative language content types to include an “unstructured” data type thereby enabling a user and/or application to affirmatively designate mining data as an unstructured type.Type: GrantFiled: March 10, 2006Date of Patent: September 22, 2009Assignee: Microsoft CorporationInventors: C. James MacLennan, Ioan Bogdan Crivat, ZhaoHui Tang, Raman S. Iyer
-
Patent number: 7565335Abstract: Systems and methods that cleanse data in Extract, Transform, Load environments (ETL), via employing an outlier detect component that is positioned in data pipeline to data warehouse(s). Such outlier detect component employs a cluster mining model to split data into normal and outlier data. Different predictive models can be employed to detect outliers in different data slices to enhance the accuracy of the predictions. In addition, a graphical user interface (GUI) enables a user to interact with cluster groups that are created and/or analyzed by the outlier detect component.Type: GrantFiled: March 15, 2006Date of Patent: July 21, 2009Assignee: Microsoft CorporationInventors: ZhaoHui Tang, Donald M. Farmer, C. James MacLennan