Patents by Inventor Alexander Sobran
Alexander Sobran has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11736300Abstract: Software for producing and verifying computational determinations using a distributed ledger, by: (i) receiving an indication of a first artificial intelligence (AI) inferencing event, the first AI inferencing event including a first AI inferencing result produced by a first machine learning model based, at least in part, on a first input from a user; (ii) computing a hash of the first machine learning model using a cryptographic hash function; (iii) sending a record of the first AI inferencing event to a verification system, the record of the first AI inferencing event including the hash of the first machine learning model; and (iv) receiving a verification from the verification system indicating that the hash of the first machine learning model matches a hash of a second machine learning model and that the record of the first AI inferencing event has been stored in a first distributed ledger.Type: GrantFiled: March 21, 2022Date of Patent: August 22, 2023Assignee: International Business Machines CorporationInventors: Bradley C. Herrin, Xianjun Zhu, Bo Zhang, Alexander Sobran
-
Patent number: 11455566Abstract: Provided are a computer program product, system, and method for classifying code as introducing a bug or not introducing a bug to train a bug detection algorithm. For each commit in a commit history of code changes to the code base, a determination is made of lines of code changed by the commit. For each line of code changed by the commit, a determination is made as to whether the commit is for a bug fix. A determination is made as to whether a previous commit changing the line of code changed by the commit for the bug fix in response to determining that the commit is for the bug fix. Indication is made that the previous commit introduced a bug. The algorithm is trained to classify changes to lines of code by commits indicated as having introduced a bug as bug introducing commits.Type: GrantFiled: March 16, 2018Date of Patent: September 27, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alexander Sobran, Bo Zhang
-
Publication number: 20220216998Abstract: Software for producing and verifying computational determinations using a distributed ledger, by: (i) receiving an indication of a first artificial intelligence (AI) inferencing event, the first AI inferencing event including a first AI inferencing result produced by a first machine learning model based, at least in part, on a first input from a user; (ii) computing a hash of the first machine learning model using a cryptographic hash function; (iii) sending a record of the first AI inferencing event to a verification system, the record of the first AI inferencing event including the hash of the first machine learning model; and (iv) receiving a verification from the verification system indicating that the hash of the first machine learning model matches a hash of a second machine learning model and that the record of the first AI inferencing event has been stored in a first distributed ledger.Type: ApplicationFiled: March 21, 2022Publication date: July 7, 2022Inventors: Bradley C. Herrin, Xianjun Zhu, Bo Zhang, Alexander Sobran
-
Patent number: 11379220Abstract: In an approach, one or more computer processors create a dictionary for each source code commit in a set of historical source code commits associated with a software deployment; create a similarity model based on the created dictionary for each source code commit in the set of historical source code commits; generate a vector embedding for a source code commit pair based on a set of log differences between source code commit pairs utilizing the created similarity model; generate, responsive to a new source code commit, a new vector embedding based on a set of log differences between the new source code commit and a preceding source code commit utilizing the created similarity model; generate a defect likelihood utilizing the generated new vector embedding; determine, responsive to the generated defect likelihood exceeding a defect likelihood threshold, that the new source code commit contains defects.Type: GrantFiled: November 25, 2019Date of Patent: July 5, 2022Assignee: International Business Machines CorporationInventors: Alexander Sobran, Bo Zhang, Bradley C. Herrin, Xianjun Zhu
-
Patent number: 11334467Abstract: A computer-implemented method, system and computer program product for representing source code in vector space. The source code is parsed into an abstract syntax tree, which is then traversed to produce a sequence of tokens. Token embeddings may then be constructed for a subset of the sequence of tokens, which are inputted into an encoder artificial neural network (“encoder”) for encoding the token embeddings. A decoder artificial neural network (“decoder”) is initialized with a final internal cell state of the encoder. The decoder is run the same number of steps as the encoding performed by the encoder. After running the decoder and completing the training of the decoder to learn the inputted token embeddings, the final internal cell state of the encoder is used as the code representation vector which may be used to detect errors in the source code.Type: GrantFiled: May 3, 2019Date of Patent: May 17, 2022Assignee: International Business Machines CorporationInventors: David Wehr, Eleanor Pence, Halley Fede, Isabella Yamin, Alexander Sobran, Bo Zhang
-
Patent number: 11303454Abstract: Software for producing and verifying computational determinations using a distributed ledger, by: (i) receiving a first input from a user; (ii) producing a first computational determination utilizing a first computational model, wherein the first computational determination includes a first computational output that is based, at least in part, on the first input; (iii) computing a hash of the first computational model; (iv) sending a record of the first computational determination to a verification system, wherein the record of the first computational determination includes the hash of the first computational model; (v) receiving a verification from the verification system indicating that the hash of the first computational model matches a hash of a second computational model and that the record of the first computational determination has been stored in a first distributed ledger; and (vi) in response to receiving the verification, providing the first computational output to the user.Type: GrantFiled: November 28, 2018Date of Patent: April 12, 2022Assignee: International Business Machines CorporationInventors: Bradley C. Herrin, Xianjun Zhu, Bo Zhang, Alexander Sobran
-
Patent number: 11288065Abstract: Approaches presented herein enable domain knowledge-based software defect prediction. More specifically, a cost function configured to train a machine learning model to predict a defect in a software version is obtained. A domain knowledge penalty metric is then determined for a software development team associated with the software version. A linear combination of the determined domain knowledge penalty metric is then applied to the obtained cost function to create a domain knowledge-modified cost function. Using this domain knowledge-modified cost function, the machine learning model is then trained based on a minimization of the domain knowledge-modified cost function. Once trained, the machine learning model is used to report a set of predicted values based on the domain knowledge-modified cost function trained machine learning model.Type: GrantFiled: July 2, 2018Date of Patent: March 29, 2022Assignee: International Business Machines CorporationInventors: Alexander Sobran, Yogesh Rane, Bo Zhang, Guilherme Ferreira
-
Patent number: 11238306Abstract: A method, system and computer program product for obtaining vector representations of code snippets capturing semantic similarity. A first and second training set of code snippets are collected, where the first training set of code snippets implements the same function representing semantic similarity and the second training set of code snippets implements a different function representing semantic dissimilarity. A vector representation of a first and second code snippet from either the first or second training set of code snippets is generated using a machine learning model. A loss value is generated utilizing a loss function that is proportional or inverse to the distance between the first and second vectors in response to receiving the first and second code snippets from the first or second training set of code snippets, respectively. The machine learning model is trained to capture the semantic similarity in the code snippets by minimizing the loss value.Type: GrantFiled: September 27, 2018Date of Patent: February 1, 2022Assignee: International Business Machines CorporationInventors: Bo Zhang, Alexander Sobran, David Wehr, Halley Fede, Eleanor Pence, Joseph Hughes, John H. Walczyk, III, Guilherme Ferreira
-
Patent number: 11170108Abstract: An example operation may include one or more of seeking consensus among users of a source tree to approve execution of a blocked command to a source control system, creating a child ledger for a user issuing the blocked command and initiating a blockchain transaction to link the child ledger to a master ledger, merging the child ledger into the master ledger when consensus is reached, and executing the blocked command.Type: GrantFiled: November 19, 2018Date of Patent: November 9, 2021Assignee: International Business Machines CorporationInventors: Bradley C. Herrin, Xianjun Zhu, Bo Zhang, Alexander Sobran
-
Patent number: 11144645Abstract: An example operation may include one or more of intercepting a command from a user to modify a source tree in a source control system, creating a child ledger associated with a master ledger when the intercepted command is destructive, seeking consensus among users of the source tree to approve execution of the intercepted command, merging the child ledger into the master ledger with a transaction describing the intercepted command, a commit tree history, and status of the consensus, and a tree hash, and merging execution results of the intercepted command into a history of the source tree.Type: GrantFiled: November 19, 2018Date of Patent: October 12, 2021Assignee: International Business Machines CorporationInventors: Bradley C. Herrin, Xianjun Zhu, Bo Zhang, Alexander Sobran
-
Patent number: 11139980Abstract: Software for immutably storing computational determinations using distributed ledgers. The software performs the following operations: (i) receiving an indication that a first computational model is ready to be deployed; (ii) storing a copy of the first computational model in a first distributed ledger; (iii) computing a hash of the first computational model using a cryptographic hash function; (iv) receiving an indication that a second computational model has been used to produce a first computational determination; (vi) receiving a hash of the second computational model; and (vii) in response to determining that the hash of the second computational model matches the hash of the first computational model, storing a record of the first computational determination in a second distributed ledger, wherein the record of the first computational determination identifies the second computational model as being the first computational model and includes the hash of the first computational model.Type: GrantFiled: November 28, 2018Date of Patent: October 5, 2021Assignee: International Business Machines CorporationInventors: Bradley C. Herrin, Xianjun Zhu, Bo Zhang, Alexander Sobran
-
Patent number: 11061790Abstract: A method, system and computer program product for detecting potential failures in completing a continuous delivery (CD) pipeline using machine learning. A CD pipeline is defined to include stages, where each stage includes a binary event(s). A model is created by applying an Apriori algorithm and a sequential pattern mining algorithm to a set of previous patterns of sequences of binary events to calculate confidence scores for completing a set of binary events in a particular order. After identifying an ongoing CD sequence (ordered set of binary events) for a software application, the model is used to predict a likelihood of the ongoing CD sequence for the software application completing the CD pipeline by generating confidence score(s) for the ongoing CD sequence. A notification is issued regarding a potential failure in completing the CD pipeline for the software application if a confidence score is below a threshold value.Type: GrantFiled: January 7, 2019Date of Patent: July 13, 2021Assignee: International Business Machines CorporationInventors: Bo Zhang, Alexander Sobran, Bradley C. Herrin, Xianjun Zhu
-
Patent number: 11061791Abstract: A method, system and computer program product for detecting potential failures in completing a continuous delivery (CD) pipeline using machine learning. A CD pipeline is defined to include stages, where each stage includes a binary event(s). A model is created by applying an Apriori algorithm and a sequential pattern mining algorithm to a set of previous patterns of sequences of binary events to calculate confidence scores for completing a set of binary events in a particular order. After identifying an ongoing CD sequence (ordered set of binary events) for a software application, the model is used to predict a likelihood of the ongoing CD sequence for the software application completing the CD pipeline by generating confidence score(s) for the ongoing CD sequence. A notification is issued regarding a potential failure in completing the CD pipeline for the software application if a confidence score is below a threshold value.Type: GrantFiled: July 8, 2019Date of Patent: July 13, 2021Assignee: International Business Machines CorporationInventors: Bo Zhang, Alexander Sobran, Bradley C. Herrin, Xianjun Zhu
-
Patent number: 11055091Abstract: The present invention relates to a method, system, and computer program product for project adoption documentation generation using machine learning. A method includes receiving a set of project parameters for a set of projects. In an embodiment, a method includes receiving a set of input project parameters for an input project. In an embodiment, a method includes comparing each set of project parameters to the set of input project parameters. In an embodiment, a method includes selecting, responsive to the comparison, a project from the set of projects. In an embodiment, a method includes generating, responsive to the selection, documentation for the input project.Type: GrantFiled: May 10, 2019Date of Patent: July 6, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alexander Sobran, Bo Zhang, Joseph Hughes, John H. Walczyk, III, Guilherme Ferreira
-
Patent number: 11055081Abstract: A system and method for recommending whether to upgrade dependencies of a software project includes mining historical software data artifacts associated with a plurality of software projects to extract a plurality of metrics and dependency version lineages from the historical software data artifacts, clustering the software projects based on the metrics and the dependency version lineages, extracting target metrics and a target version lineage of a target software project selecting a software project cluster from the plurality of clusters that best matches the target software project, analyzing the metrics for each of the software projects included in the at least one software project cluster to determine that a measurable change to the metrics occurred as a result of upgrading dependencies of the software projects, and recommending which software dependencies of the target software project should be upgraded based on the measurable change to the metrics of the software projects.Type: GrantFiled: September 17, 2018Date of Patent: July 6, 2021Assignee: International Business Machines CorporationInventors: Alexander Sobran, Joseph Hughes, John H. Walczyk, III, Bo Zhang, Darrough West
-
Publication number: 20210157577Abstract: In an approach, one or more computer processors create a dictionary for each source code commit in a set of historical source code commits associated with a software deployment; create a similarity model based on the created dictionary for each source code commit in the set of historical source code commits; generate a vector embedding for a source code commit pair based on a set of log differences between source code commit pairs utilizing the created similarity model; generate, responsive to a new source code commit, a new vector embedding based on a set of log differences between the new source code commit and a preceding source code commit utilizing the created similarity model; generate a defect likelihood utilizing the generated new vector embedding; determine, responsive to the generated defect likelihood exceeding a defect likelihood threshold, that the new source code commit contains defects.Type: ApplicationFiled: November 25, 2019Publication date: May 27, 2021Inventors: Alexander Sobran, Bo Zhang, Bradley C. Herrin, Xianjun ZHU
-
Patent number: 10936810Abstract: Token embedding based on target-context pairs includes obtaining a structural representation of data, the structural representation including nodes and indicating relationships between the nodes, obtaining a context template that identifies relationship(s) to use in identifying a respective context for different nodes of the structural representation, applying the context template to the structural representation to obtain a set of target-context pairs, each of which includes a respective target node of the structural representation and a respective set of context node(s), of the structural representation, for that target node, as identified based on the context template, and using the target-context pairs in a model and obtaining, as output of the model, representations of target nodes of the target-context pairs as vectors in a vector space.Type: GrantFiled: December 4, 2018Date of Patent: March 2, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Alexander Sobran, Bo Zhang
-
Patent number: 10901876Abstract: A method, system and computer program product for detecting potential failures in a continuous delivery pipeline. A machine learning model is created to predict whether changed portion of codes under development at various stages of the continuous delivery pipeline will result in a pipeline failure. After creating the machine learning model, log file(s) may be received that were generated by development tool(s) concerning a changed portion of code under development at a particular stage of the continuous delivery pipeline. The machine learning model provides relationship information between the log file(s) and the changed portion of code. A message is then generated and displayed based on this relationship information, where the message may provide a prediction or a recommendation concerning potential failures in the continuous delivery pipeline. In this manner, the potential failures in the continuous delivery pipeline may be prevented without requiring context switching.Type: GrantFiled: November 20, 2019Date of Patent: January 26, 2021Assignee: International Business Machines CorporationInventors: Bradley C. Herrin, Alexander Sobran, Bo Zhang, Xianjun Zhu
-
Patent number: 10884893Abstract: A method, system and computer program product for detecting software build errors. A classification system is created that identifies users' questions in crowdsource data pertaining to errors in computer programs that are associated with a log report. A model is built to classify log data as bug-related or not bug-related based on the classification system. Log reports from log data obtained from crowdsource data are identified as being bug-related based on the model. After vectorizing such log reports and storing the vectorized log reports, the language of a new build log report for a software product is vectorized upon completion of the build of the software product. If the vectorized log report is within a threshold amount of distance to a stored vectorized log report, then a copy of the log report (bug-related) and a source of the log report associated with the stored vectorized log report is provided.Type: GrantFiled: August 24, 2018Date of Patent: January 5, 2021Assignee: International Business Machines CorporationInventors: Alexander Sobran, Bo Zhang, Bradley C. Herrin
-
Publication number: 20200356364Abstract: The present invention relates to a method, system, and computer program product for project adoption documentation generation using machine learning. A method includes receiving a set of project parameters for a set of projects. In an embodiment, a method includes receiving a set of input project parameters for an input project. In an embodiment, a method includes comparing each set of project parameters to the set of input project parameters. In an embodiment, a method includes selecting, responsive to the comparison, a project from the set of projects. In an embodiment, a method includes generating, responsive to the selection, documentation for the input project.Type: ApplicationFiled: May 10, 2019Publication date: November 12, 2020Inventors: Alexander Sobran, Bo Zhang, Joseph Hughes, John H. Walczyk, III, Guilherme Ferreira