Patents by Inventor Alexander Sasha Stojanovic

Alexander Sasha Stojanovic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Cost-optimal cluster configuration analytics package

Patent number: 10671445

Abstract: Systems, methods, and computer-readable media for identifying an optimal cluster configuration for performing a job in a remote cluster computing system. In some examples, one or more applications and a sample of a production load as part of a job for a remote cluster computing system is received. Different clusters of nodes are instantiated in the remote cluster computing system to form different cluster configurations. Multi-Linear regression models segmented into different load regions are trained by running at least a portion of the sample on the instantiated different clusters of nodes. Expected completion times of the production load across varying cluster configurations are identified using the multi-linear regression models. An optimal cluster configuration of the varying cluster configurations is determined for the job based on the identified expected completion times.

Type: Grant

Filed: December 4, 2017

Date of Patent: June 2, 2020

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Antonio Nucci, Dragan Milosavljevic, Ping Pamela Tang, Athena Wong, Alex V. Truong, Alexander Sasha Stojanovic, John Oberon, Prasad Potipireddi, Ahmed Khattab, Samudra Harapan Bekti
System and method for dynamic, incremental recommendations within real-time visual simulation

Patent number: 10620923

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can include a software development component and graphical user interface, referred to herein in some embodiments as a pipeline editor, or Lambda Studio IDE, that provides a visual environment for use with the system, including providing real-time recommendations for performing semantic actions on data accessed from an input HUB, based on an understanding of the meaning or semantics associated with the data.

Type: Grant

Filed: August 22, 2017

Date of Patent: April 14, 2020

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: David Allan, Alexander Sasha Stojanovic, Hassan Heidari Namarvar, Ganesh Seetharaman
System and method for ontology induction through statistical profiling and reference schema matching

Patent number: 10620924

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can perform an ontology analysis of a schema definition, to determine the types of data, and datasets or entities, associated with that schema; and generate, or update, a model from a reference schema that includes an ontology defined based on relationships between datasets or entities, and their attributes. A reference HUB including one or more schemas can be used to analyze data flows, and further classify or make recommendations such as, for example, transformations enrichments, filtering, or cross-entity data fusion of an input data.

Type: Grant

Filed: August 22, 2017

Date of Patent: April 14, 2020

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: Alexander Sasha Stojanovic, Hassan Heidari Namarvar, David Allan, Ganesh Seetharaman
COST-OPTIMAL CLUSTER CONFIGURATION ANALYTICS PACKAGE

Publication number: 20190171494

Abstract: Systems, methods, and computer-readable media for identifying an optimal cluster configuration for performing a job in a remote cluster computing system. In some examples, one or more applications and a sample of a production load as part of a job for a remote cluster computing system is received. Different clusters of nodes are instantiated in the remote cluster computing system to form different cluster configurations. Multi-Linear regression models segmented into different load regions are trained by running at least a portion of the sample on the instantiated different clusters of nodes. Expected completion times of the production load across varying cluster configurations are identified using the multi-linear regression models. An optimal cluster configuration of the varying cluster configurations is determined for the job based on the identified expected completion times.

Type: Application

Filed: December 4, 2017

Publication date: June 6, 2019

Inventors: Antonio Nucci, Dragan Milosavljevic, Ping Pamela Tang, Athena Wong, Alex V. Truong, Alexander Sasha Stojanovic, John Oberon, Prasad Potipireddi, Ahmed Khattab, Samudra Harapan Bekti
Dynamic visual profiling and visualization of high volume datasets and real-time smart sampling and statistical profiling of extremely large datasets

Patent number: 10296192

Abstract: The present disclosure relates generally to a data enrichment service that automatically profiles data sets and provides visualizations of the profiles using a visual-interactive model within a client application (such as a web browser or mobile app). The visual profiling can be refined through end user interaction with the visualization objects and guide exploratory data visualization and discovery. Additionally, data sampling of heterogeneous data streams can be performed during ingestion to extract statistical attributes from multi-columnar data (e.g., standard deviation, median, mode, correlation coefficient, histogram, etc.). Data sampling can continue in real-time as data sources are updated.

Type: Grant

Filed: September 24, 2015

Date of Patent: May 21, 2019

Assignee: Oracle International Corporation

Inventors: Alexander Sasha Stojanovic, Luis E. Rivas, Kevin L. Markey, Christopher F. Bidwell
TECHNIQUES FOR SIMILARITY ANALYSIS AND DATA ENRICHMENT USING KNOWLEDGE SOURCES

Publication number: 20190138538

Abstract: The present disclosure relates to performing similarity metric analysis and data enrichment using knowledge sources. A data enrichment service can compare an input data set to reference data sets stored in a knowledge source to identify similarly related data. A similarity metric can be calculated corresponding to the semantic similarity of two or more datasets. The similarity metric can be used to identify datasets based on their metadata attributes and data values enabling easier indexing and high performance retrieval of data values. A input data set can labeled with a category based on the data set having the best match with the input data set. The similarity of an input data set with a data set provided by a knowledge source can be used to query a knowledge source to obtain additional information about the data set. The additional information can be used to provide recommendations to the user.

Type: Application

Filed: December 31, 2018

Publication date: May 9, 2019

Applicant: Oracle International Corporation

Inventors: Alexander Sasha Stojanovic, Mark Kreider, Michael Malak, Glenn Allen Murray
Techniques for similarity analysis and data enrichment using knowledge sources

Patent number: 10210246

Abstract: The present disclosure relates to performing similarity metric analysis and data enrichment using knowledge sources. A data enrichment service can compare an input data set to reference data sets stored in a knowledge source to identify similarly related data. A similarity metric can be calculated corresponding to the semantic similarity of two or more datasets. The similarity metric can be used to identify datasets based on their metadata attributes and data values enabling easier indexing and high performance retrieval of data values. A input data set can labeled with a category based on the data set having the best match with the input data set. The similarity of an input data set with a data set provided by a knowledge source can be used to query a knowledge source to obtain additional information about the data set. The additional information can be used to provide recommendations to the user.

Type: Grant

Filed: September 24, 2015

Date of Patent: February 19, 2019

Assignee: Oracle International Corporation

Inventors: Alexander Sasha Stojanovic, Mark Kreider, Michael Malak, Glenn Allen Murray
SYSTEM AND METHOD FOR INFERENCING OF DATA TRANSFORMATIONS THROUGH PATTERN DECOMPOSITION

Publication number: 20180067732

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can provide a service to recommend actions and transformations, on an input data, based on patterns identified from the functional decomposition of a data flow for a software application, including determining possible transformations of the data flow in subsequent applications. Data flows can be decomposed into a model describing transformations of data, predicates, and business rules applied to the data, and attributes used in the data flows.

Type: Application

Filed: August 22, 2017

Publication date: March 8, 2018

Inventors: Ganesh Seetharaman, Alexander Sasha Stojanovic, Hassan Heidari Namarvar, David Allan
SYSTEM AND METHOD FOR DYNAMIC, INCREMENTAL RECOMMENDATIONS WITHIN REAL-TIME VISUAL SIMULATION

Publication number: 20180052898

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can include a software development component and graphical user interface, referred to herein in some embodiments as a pipeline editor, or Lambda Studio IDE, that provides a visual environment for use with the system, including providing real-time recommendations for performing semantic actions on data accessed from an input HUB, based on an understanding of the meaning or semantics associated with the data.

Type: Application

Filed: August 22, 2017

Publication date: February 22, 2018

Inventors: David Allan, Alexander Sasha Stojanovic, Hassan Heidari Namarvar, Ganesh Seetharaman
SYSTEM AND METHOD FOR DYNAMIC LINEAGE TRACKING, RECONSTRUCTION, AND LIFECYCLE MANAGEMENT

Publication number: 20180052878

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can provide data governance functionality such as, for example, provenance (where a particular data came from), lineage (how the data was acquired/processed), security (who was responsible for the data), classification (what is the data about), impact (how impactful is the data to a business), retention (how long should the data live), and validity (whether the data should be excluded/included for analysis/processing), for each slice of data pertinent to a particular snapshot in time; which can then be used in making lifecycle decisions and dataflow recommendations.

Type: Application

Filed: August 22, 2017

Publication date: February 22, 2018

Inventors: Ganesh Seetharaman, Alexander Sasha Stojanovic, Hassan Heidari Namarvar, David Allan
SYSTEM AND METHOD FOR METADATA-DRIVEN EXTERNAL INTERFACE GENERATION OF APPLICATION PROGRAMMING INTERFACES

Publication number: 20180052861

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system provides a programmatic interface, referred to herein in some embodiments as a foreign function interface, by which a user or third-party can define a service, functional and business types, semantic actions, and patterns or predefined complex data flows based on functional and business types, in a declarative manner, to extend the functionality of the system.

Type: Application

Filed: August 22, 2017

Publication date: February 22, 2018

Inventors: Ganesh Seetharaman, Alexander Sasha Stojanovic, Hassan Heidari Namarvar, David Allan
SYSTEM AND METHOD FOR AUTOMATED MAPPING OF DATA TYPES FOR USE WITH DATAFLOW ENVIRONMENTS

Publication number: 20180052897

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can provide support for auto-mapping of complex data structures, datasets or entities, between one or more sources or targets of data, referred to herein in some embodiments as HUBs. The auto-mapping can be driven by a metadata, schema, and statistical profiling of a dataset; and used to map a source dataset or entity associated with an input HUB, to a target dataset or entity or vice versa, to produce an output data prepared in a format or organization (projection) for use with one or more output HUBs.

Type: Application

Filed: August 22, 2017

Publication date: February 22, 2018

Inventors: Hassan Heidari Namarvar, Alexander Sasha Stojanovic, David Allan, Ganesh Seetharaman
SYSTEM AND METHOD FOR ONTOLOGY INDUCTION THROUGH STATISTICAL PROFILING AND REFERENCE SCHEMA MATCHING

Publication number: 20180052870

Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can perform an ontology analysis of a schema definition, to determine the types of data, and datasets or entities, associated with that schema; and generate, or update, a model from a reference schema that includes an ontology defined based on relationships between datasets or entities, and their attributes. A reference HUB including one or more schemas can be used to analyze data flows, and further classify or make recommendations such as, for example, transformations enrichments, filtering, or cross-entity data fusion of an input data.

Type: Application

Filed: August 22, 2017

Publication date: February 22, 2018

Inventors: Alexander Sasha Stojanovic, Hassan Heidari Namarvar, David Allan, Ganesh Seetharaman
DECLARATIVE LANGUAGE AND VISUALIZATION SYSTEM FOR RECOMMENDED DATA TRANSFORMATIONS AND REPAIRS

Publication number: 20160092474

Abstract: The present disclosure relates generally to a data enrichment service that extracts, repairs, and enriches datasets, resulting in more precise entity resolution and correlation for purposes of subsequent indexing and clustering. As the data enrichment service can include a visual recommendation engine and language for performing large-scale data preparation, repair, and enrichment of heterogeneous datasets. This enables the user to select and see how the recommended enrichments (e.g., transformations and repairs) will affect the user's data and make adjustments as needed. The data enrichment service can receive feedback from users through a user interface and can filter recommendations based on the user feedback.

Type: Application

Filed: September 24, 2015

Publication date: March 31, 2016

Inventors: Alexander Sasha Stojanovic, Luis E. Rivas, Philip Ogren, Glenn Allen Murray
DYNAMIC VISUAL PROFILING AND VISUALIZATION OF HIGH VOLUME DATASETS AND REAL-TIME SMART SAMPLING AND STATISTICAL PROFILING OF EXTREMELY LARGE DATASETS

Publication number: 20160092090

Abstract: The present disclosure relates generally to a data enrichment service that automatically profiles data sets and provides visualizations of the profiles using a visual-interactive model within a client application (such as a web browser or mobile app). The visual profiling can be refined through end user interaction with the visualization objects and guide exploratory data visualization and discovery. Additionally, data sampling of heterogeneous data streams can be performed during ingestion to extract statistical attributes from multi-columnar data (e.g., standard deviation, median, mode, correlation coefficient, histogram, etc.). Data sampling can continue in real-time as data sources are updated.

Type: Application

Filed: September 24, 2015

Publication date: March 31, 2016

Inventors: Alexander Sasha Stojanovic, Luis E. Rivas, Kevin L. Markey, Christopher F. Bidwell
DECLARATIVE EXTERNAL DATA SOURCE IMPORTATION, EXPORTATION, AND METADATA REFLECTION UTILIZING HTTP AND HDFS PROTOCOLS

Publication number: 20160092476

Abstract: Techniques are disclosure for a data enrichment system that enables declarative external data source importation and exportation. A user can specify via a user interface input for identifying different data sources from which to obtain input data. The data enrichment system is configured to import and export various types of sources storing resources such as URL-based resources and HDFS-based resources for high-speed bi-directional metadata and data interchange. Connection metadata (e.g., credentials, access paths, etc.) can be managed by the data enrichment system in a declarative format for managing and visualizing the connection metadata.

Type: Application

Filed: September 24, 2015

Publication date: March 31, 2016

Inventors: Alexander Sasha Stojanovic, Douglas C. Savolainen, Mark Kreider
AUTOMATED ENTITY CORRELATION AND CLASSIFICATION ACROSS HETEROGENEOUS DATASETS

Publication number: 20160092475

Abstract: The present disclosure describes techniques for entity classification and data enrichment of data sets. A data enrichment system is disclosed that can extract, repair, and enrich datasets, resulting in more precise entity resolution and classification for purposes of subsequent indexing and clustering. Disclosed techniques may include performing entity recognition to identify segments of interest that relate to an entity. Related data may be analyzed for classification, which can be used to transform the data for enrichment to its users.

Type: Application

Filed: September 24, 2015

Publication date: March 31, 2016

Inventors: Alexander Sasha Stojanovic, Philip Ogren, Kevin L. Markey, Mark Kreider
TECHNIQUES FOR SIMILARITY ANALYSIS AND DATA ENRICHMENT USING KNOWLEDGE SOURCES

Publication number: 20160092557

Abstract: The present disclosure relates to performing similarity metric analysis and data enrichment using knowledge sources. A data enrichment service can compare an input data set to reference data sets stored in a knowledge source to identify similarly related data. A similarity metric can be calculated corresponding to the semantic similarity of two or more datasets. The similarity metric can be used to identify datasets based on their metadata attributes and data values enabling easier indexing and high performance retrieval of data values. A input data set can labeled with a category based on the data set having the best match with the input data set. The similarity of an input data set with a data set provided by a knowledge source can be used to query a knowledge source to obtain additional information about the data set. The additional information can be used to provide recommendations to the user.

Type: Application

Filed: September 24, 2015

Publication date: March 31, 2016

Inventors: Alexander Sasha Stojanovic, Mark Kreider, Michael Malak, Glenn Allen Murray
DYNAMIC USER EXPERIENCE ADAPTATION AND SERVICES PROVISIONING

Publication number: 20130159228

Abstract: The subject disclosure generally relates to dynamic user experience adaptation and services provisioning. A user experience component can provide a user experience (UX) to a user. The UX can include, but is not limited to, an operating system, an application (e.g., word processor, electronic mail, computer aided drafting, video game, etc.), a user interface, and so forth. A monitoring component can monitor feedback generated in association with interaction with the user experience by the user. An update component can analyze the feedback, and update a user model associated with the user based at least in part on the analysis, and an adaptation component can modify the user experience based at least in part the user model.

Type: Application

Filed: December 16, 2011

Publication date: June 20, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Henricus Johannes Maria Meijer, Roger Barga, Carl Carter-Schwendler, Alexander Sasha Stojanovic
De-focusing over big data for extraction of unknown value

Patent number: 8452792

Abstract: Techniques for defocusing queries over big datasets and dynamic datasets are provided to broaden search results and incorporate all potentially relevant data and avoid overly narrowing queries. An analytic component can receive queries directed at one region of a dataset and analyze the queries to generate inferences about the queries. The queries can then be defocused by a defocusing component and incorporate a larger dataset than originally searched to broaden the queries. The larger dataset can incorporate all, or a part of the original dataset and can also be disparate from the original dataset. Clusters of queries can also be merged and unified to deal with ‘local minima’ issues and broaden the understanding of the dataset. In other embodiments, dynamic data can be monitored and changes tracked, to ensure that all portions of the dataset are being searched by the queries.

Type: Grant

Filed: October 28, 2011

Date of Patent: May 28, 2013

Assignee: Microsoft Corporation

Inventors: Roger Barga, Alexander Sasha Stojanovic, Henricus Johannes Maria Meijer, Carl Carter-Schwendler, Michael Isard, Savas Parastatidis

prev 1 2 3 next