Patents by Inventor Orna Raz

Orna Raz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for test modeling

Patent number: 11003567

Abstract: A method, system and computer program product, the method comprising: receiving a user document describing at least a part of a system to be tested; computing a similarity measurement between the user document and documents in a document corpus, each describing at least part of a system and associated with a combinatorial model representing the at least part of the system, wherein the combinatorial model comprises a set of attributes, a respective domain for each attribute defining possible values for the attribute, and restrictions, wherein each restriction comprises at least one attribute, a respective value and a logical operator, based upon the similarity measurement, selecting selected documents from the document corpus; obtaining one or more combinatorial models corresponding to the selected documents; selecting elements from the combinatorial models; generating an initial combinatorial model for the user document, wherein said generating comprises adding the elements to the initial combinatorial model.

Type: Grant

Filed: December 6, 2017

Date of Patent: May 11, 2021

Assignee: International Business Machines Corporation

Inventors: Howard Hess, Eitan Farchi, Orna Raz, Rachel Tzoref-Brill, Aviad Zlotnick
METHOD AND APPARATUS FOR ENHANCING EFFECTIVITY OF MACHINE LEARNING SOLUTIONS

Publication number: 20210125080

Abstract: A method, system and computer program product, the method comprising: creating a model representing underperforming cases; from a case collection having a total performance, and which comprises for each of a multiplicity of records: a value for each feature from a collection of features, a ground truth label and a prediction of a machine learning (ML) engine, obtaining one or more features; dividing the records into groups, based on values of the features in each record; for one group of the groups, calculating a performance parameter of the ML engine over the portion of the records associated with the group; subject to the performance parameter of the group being below the total performance in at least a predetermined threshold: determining a characteristic for the group; adding the characteristic of the group to the model; and providing the model to a user, thus indicating under-performing parts of the test collection.

Type: Application

Filed: October 24, 2019

Publication date: April 29, 2021

Inventors: ORNA RAZ, Marcel Zalmanovici, Aviad Zlotnick
METHOD AND APPARATUS FOR EMPLOYING MACHINE LEARNING SOLUTIONS

Publication number: 20210117848

Abstract: A method, system and computer program product, the method comprising: obtaining computer code of an employed system comprising a plurality of components; obtaining data related to operating the plurality of components; based on the computer code and the data, identifying: a first component from the plurality of components, to be maintained; and a second component from the plurality of components, to be at least partly replaced by a machine learning component; and providing to a user an identification of the first component and the second component.

Type: Application

Filed: October 17, 2019

Publication date: April 22, 2021

Inventors: Eitan Daniel Farchi, Howard Michael Hess, Orna Raz
Dynamic Automation of DevOps Pipeline Vulnerability Detecting and Testing

Publication number: 20210042217

Abstract: An artificial intelligence (AI) platform to support a continuous integration and deployment pipeline for software development and operations (DevOps). One or more running processes are subject to monitoring to identify presence of vulnerabilities. An automated rebuild of the monitored processes is initiated, which includes constructing a map representing a relationship of test code elements corresponding to different portions of source code. The identified vulnerable source code reflected in a new container image is subject to an automatic verification to ascertain if the source code is covered by at least one of the represented test code elements. A risk assessment is employed as part of the verification. A new container image is selectively deployed responsive to the risk assessment.

Type: Application

Filed: August 9, 2019

Publication date: February 11, 2021

Applicant: International Business Machines Corporation

Inventors: Jinho Hwang, Shripad Nadgowda, Hai Huang, Orna Raz
ESTIMATING FEASIBILITY AND EFFORT FOR A MACHINE LEARNING SOLUTION

Publication number: 20210012221

Abstract: A method, computer system, and a computer program product for assessing a likelihood of success associated with developing at least one machine learning (ML) solution is provided. The present invention may include generating a set of questions based on a set of raw training data. The present invention may also include computing a feasibility score based on an answer corresponding with each question from the generated set of questions. The present invention may then include, in response to determining that the computed feasibility score satisfies a threshold, computing a level of effort associated with developing the at least one ML solution to address a problem. The present invention may further include presenting, to a user, a plurality of results associated with assessing the likelihood of success of the at least one ML solution.

Type: Application

Filed: July 11, 2019

Publication date: January 14, 2021

Inventors: Pathirage Dinindu Sujan Udayanga Perera, Orna Raz, Ramani Routray, Eitan Daniel Farchi
AUTOMATED DATA SLICING BASED ON AN ARTIFICIAL NEURAL NETWORK

Publication number: 20210004671

Abstract: A method, apparatus and a computer program product for automated data slicing based on an Artificial Neural Network (ANN). The method comprising: obtaining an ANN, wherein the ANN is configured to provide a prediction for a data instance, wherein the ANN comprises a set of nodes having interconnections therebetween; determining an attribute vector based on a subset of the nodes of the ANN; determining, based on the attribute vector, a plurality of data slices; obtaining a testing dataset comprising testing data instances; computing, for each data slice, a performance measurement of the ANN over the data slice, wherein said computing is based on an application of the ANN on each testing data instance that is mapped to the data slice; and performing an action based on at least a portion of the performance measurements of the data slices.

Type: Application

Filed: July 3, 2019

Publication date: January 7, 2021

Inventors: RACHEL BRILL, Eitan Farchi, Orna Raz, Aviad Zlotnick
MITIGATING GOVERNANCE IMPACT ON MACHINE LEARNING

Publication number: 20200410129

Abstract: A method, a computerized apparatus and a computer program product for mitigating governance and regulation implications on machine learning. A governance impact assessment is generated for a partial data set generated by applying a data governance enforcement on a data set of instances comprising valuations of a feature vector. The partial data set comprises partial instances each comprising partial feature vectors. The governance impact assessment comprises information about data excluded from the data set. A machine learning model trained based on the partial data set and configured to provide an estimated prediction for a partial instance is obtained. A set of core features is determined. A bias introduced by the data governance is identified based on a core feature being affected by the data governance. In response to identifying a bias, an anti-bias procedure is applied on the machine learning model, whereby mitigating the bias introduced by the data governance.

Type: Application

Filed: June 26, 2019

Publication date: December 31, 2020

Inventors: Sima Nadler, Orna Raz, Marcel Zalmanovici
GENERATING TRAINING SETS TO TRAIN MACHINE LEARNING MODELS

Publication number: 20200394461

Abstract: A computer system trains a machine learning model. A vector representation is generated for each document in a collection of documents. The documents are clustered based on the vector representations of the documents to produce a plurality of clusters. A training set is produced by selecting one or more documents from each cluster, wherein the selected documents represent a sample of the collection of documents to train the machine learning model. The machine learning model is trained by applying the training set to the machine learning model. Embodiments of the present invention further include a method and program product for training a machine learning model in substantially the same manner described above.

Type: Application

Filed: June 12, 2019

Publication date: December 17, 2020

Inventors: Pathirage D. S. U. Perera, Eitan D. Farchi, Orna Raz, Ramani Routray, Sheng Hua Bao, Marcel Zalmanovici
DATA SLICING FOR MACHINE LEARNING PERFORMANCE TESTING AND IMPROVEMENT

Publication number: 20200387753

Abstract: A method and a computer program product for identification and improvement of machine learning (ML) under-performance The method comprises slicing data of ML model based on a functional model representing requirements of a system utilizing the ML model. The functional model comprises a set of attributes and respective domain of values. Each data slice is associated with a different valuation of one or more attributes of the functional model. Each data instance of the ML model is mapped to one or more data slices, based on valuation of the attributes. A performance measurement of the ML model over is computed for each data slice, based on an application of the ML model on each data instance that is mapped to the data slice. A Determination whether ML model adheres to a target performance requirement may be performed based on the performance measurements of the data slices.

Type: Application

Filed: June 10, 2019

Publication date: December 10, 2020

Inventors: Rachel Brill, Eitan Farchi, Orna Raz, Aviad Zlotnick
IDENTIFYING DATA DRIFTS THAT HAVE AN ADVERSE EFFECT ON PREDICTORS

Publication number: 20200342260

Abstract: A method, apparatus and product for identifying data drifts.

Type: Application

Filed: April 28, 2019

Publication date: October 29, 2020

Inventors: Eitan Farchi, Orna Raz, Marcel Zalmanovici
IDENTIFYING DATA DRIFTS

Publication number: 20200342310

Abstract: A method, apparatus and product for identifying data drifts. The method comprising: obtaining a seen dataset, wherein the seen dataset comprises seen instances, each of which comprising feature values in a feature space; determining a first measurement of a statistical metric of the seen dataset; obtaining an unseen dataset, wherein the unseen dataset comprises unseen instances, each of which comprising features values in the feature space; determining a second measurement of the statistical metric of the unseen dataset; identifying a data drift in the unseen dataset with respect to the seen dataset based on the first and second measurements of the statistical metric; and performing a responsive action based on the identification of the data drift.

Type: Application

Filed: April 28, 2019

Publication date: October 29, 2020

Inventors: Eitan Farchi, Orna Raz, Marcel Zalmanovici, Aviad Zlotnick
Optimizing Hierarchical Classification with Adaptive Node Collapses

Publication number: 20200285943

Abstract: A mechanism is provided in a data processing system having a processor and a memory. The memory comprises instructions which are executed by the processor to cause the processor to implement a training system for finding an optimal surface for hierarchical classification task on an ontology. The training system receives a training data set and a hierarchical classification ontology data structure. The training system generates a neural network architecture based on the training data set and the hierarchical classification ontology data structure. The neural network architecture comprises an indicative layer, a parent tier (PT) output and a lower leaf tier (LLT) output. The training system trains the neural network architecture to classify the training data set to leaf nodes at the LLT output and parent nodes at the PT output. The indicative layer in the neural network architecture determines a surface that passes through each path from a root to a leaf node in the hierarchical ontology data structure.

Type: Application

Filed: March 4, 2019

Publication date: September 10, 2020

Inventors: Pathirage Dinindu Sujan Udayanga Perera, Orna Raz, Ramani Routray, Vivek Krishnamurthy, Sheng Hua Bao, Eitan D. Farchi
ENCODING-ASSISTED ANNOTATION OF NARRATIVE TEXT

Publication number: 20200250209

Abstract: From metadata corresponding to a narrative text, a first encoding is constructed, the first encoding comprising a standardized text string, the first encoding formed according to an encoding scheme. A specified portion of the standardized text string of the first encoding is marked as an anchor term. A correspondence between the first encoding and a second encoding is tested using the encoding scheme and a Natural Language Processing engine, responsive to finding the anchor term within the narrative text. The second encoding corresponds to a text window. The text window comprises a portion of the narrative text comprising an instance of the anchor term and a word within a predetermined distance from the instance. Responsive to the second encoding being identical to the first encoding, the narrative text is annotated, the annotating creating new data linking the narrative text with the second encoding.

Type: Application

Filed: February 4, 2019

Publication date: August 6, 2020

Applicant: International Business Machines Corporation

Inventors: Nakul Chakrapani, Ramani Routray, Pathirage Perera, Sheng Hua Bao, Orna Raz, Eitan Farchi
CLASSIFIER CONFIDENCE AS A MEANS FOR IDENTIFYING DATA DRIFT

Publication number: 20200242505

Abstract: Embodiments of the present systems and methods may provide improved machine learning performance even though data drift has occurred. For example, a method may comprise providing a machine learning model in a computer system, operating the machine learning model using a first dataset to obtain results of the first dataset, operating the machine learning model using a second dataset to obtain results of the second dataset, performing statistical testing on a confidence distribution of results of the first dataset and of results of the second dataset to determine a difference in a result confidence distribution between the first dataset and of the second dataset, and determining whether data included in the second dataset has data drift relative to the first dataset based on the difference in a result confidence distribution between the first dataset and of the second dataset.

Type: Application

Filed: January 24, 2019

Publication date: July 30, 2020

Inventors: Orna Raz, Marcel Zalmanovici, Aviad Zlotnick
Identifying security risks in code using security metric comparison

Patent number: 10678926

Abstract: An example system includes a processor to receive a source code and history information, wherein the history information includes a version control history or a defect history, or a combination of the version control history and the defect history. The processor is to also divide the source code into security-related components and security-non-related components. The processor is to further calculate security metrics for each of the security-related components and each of the security-non-related components based on the history information. The processor is also to compare the security metrics of the security-related components with the security metrics of the security-non-related components. The processor is to further generate a visual representation comprising a highlighted area of concern based on the comparison.

Type: Grant

Filed: January 9, 2017

Date of Patent: June 9, 2020

Assignee: International Business Machines Corporation

Inventors: Aharon Brodie, Christopher V Derobertis, Orna Raz
Method and apparatus for test modeling

Patent number: 10489280

Abstract: A method, system and computer program product, the method comprising: receiving a partial description of a combinatorial model defining a test space and comprising a set of attributes, a respective domain for each attribute defining possible values for the attribute, and restrictions, each comprising an attribute, a respective value and a logical operator, the partial description comprising an attribute and a value thereof; computing a similarity measurement between the partial description and each model in a corpus, based on similarity of attributes and domains, and not on a test space defined by compared models, whereby similarity between models is indifferent to a size of the test space of the models; selecting models from the corpus according to the computed similarity measurements; determining elements from the selected models to be suggested for modifying the partial description; and outputting the suggestions to modify the partial description useful for completing the combinatorial model.

Type: Grant

Filed: November 21, 2017

Date of Patent: November 26, 2019

Assignee: International Business Machines Corporation

Inventors: Howard Hess, Eitan Farchi, Orna Raz, Rachel Tzoref-Brill, Aviad Zlotnick
Finding Optimal Surface for Hierarchical Classification Task on an Ontology

Publication number: 20190354899

Abstract: A mechanism is provided in a data processing system having a processor and a memory. The memory comprises instructions which are executed by the processor to cause the processor to implement a training system for finding an optimal surface for hierarchical classification task on an ontology. The training system receives a training data set and a hierarchical ontology data structure. A surface finding component executing within the training system selects a surface that passes through each path from a root to a leaf node in the hierarchical ontology data structure. The surface finding component determines a plurality of adjacent surfaces that differ from the selected component by one node. The surface finding component selects an optimal surface, based on the selected surface and the plurality of adjacent surfaces, that maximizes accuracy and coverage. The training system trains a classifier model for a cognitive system using the optimal surface and the training data set.

Type: Application

Filed: November 14, 2018

Publication date: November 21, 2019

Inventors: Eitan D. Farchi, Pathirage Perera, Orna Raz
Finding Optimal Surface for Hierarchical Classification Task on an Ontology

Publication number: 20190354898

Abstract: A mechanism is provided in a data processing system having a processor and a memory. The memory comprises instructions which are executed by the processor to cause the processor to implement a training system for finding an optimal surface for hierarchical classification task on an ontology. The training system receives a training data set and a hierarchical ontology data structure. A surface finding component executing within the training system selects a surface that passes through each path from a root to a leaf node in the hierarchical ontology data structure. The surface finding component determines a plurality of adjacent surfaces that differ from the selected component by one node. The surface finding component selects an optimal surface, based on the selected surface and the plurality of adjacent surfaces, that maximizes accuracy and coverage. The training system trains a classifier model for a cognitive system using the optimal surface and the training data set.

Type: Application

Filed: May 21, 2018

Publication date: November 21, 2019

Inventors: Eitan D. Farchi, Pathirage Perera, Orna Raz
METHOD AND APPARATUS FOR TEST MODELING

Publication number: 20190171545

Abstract: A method, system and computer program product, the method comprising: receiving a user document describing at least a part of a system to be tested; computing a similarity measurement between the user document and documents in a document corpus, each describing at least part of a system and associated with a combinatorial model representing the at least part of the system, wherein the combinatorial model comprises a set of attributes, a respective domain for each attribute defining possible values for the attribute, and restrictions, wherein each restriction comprises at least one attribute, a respective value and a logical operator, based upon the similarity measurement, selecting selected documents from the document corpus; obtaining one or more combinatorial models corresponding to the selected documents; selecting elements from the combinatorial models; generating an initial combinatorial model for the user document, wherein said generating comprises adding the elements to the initial combinatorial model.

Type: Application

Filed: December 6, 2017

Publication date: June 6, 2019

Inventors: Howard Hess, Eitan Farchi, Orna Raz, Rachel Tzoref-Brill, Aviad Zlotnick
METHOD AND APPARATUS FOR TEST MODELING

Publication number: 20190155717

Abstract: A method, system and computer program product, the method comprising: receiving a partial description of a combinatorial model defining a test space and comprising a set of attributes, a respective domain for each attribute defining possible values for the attribute, and restrictions, each comprising an attribute, a respective value and a logical operator, the partial description comprising an attribute and a value thereof; computing a similarity measurement between the partial description and each model in a corpus, based on similarity of attributes and domains, and not on a test space defined by compared models, whereby similarity between models is indifferent to a size of the test space of the models; selecting models from the corpus according to the computed similarity measurements; determining elements from the selected models to be suggested for modifying the partial description; and outputting the suggestions to modify the partial description useful for completing the combinatorial model.

Type: Application

Filed: November 21, 2017

Publication date: May 23, 2019

Inventors: Howard Hess, Eitan Farchi, Orna Raz, Rachel Tzoref-Brill, Aviad Zlotnick

prev 1 2 3 4 next