Patents by Inventor Igor Balabine

Igor Balabine has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11886467
    Abstract: An apparatus, computer-readable medium, and computer-implemented method for efficiently classifying a data object, including representing the data object as a data object vector in a vector space, each dimension of the data object vector corresponding to a different feature of the data object, determining a distance between the data object vector and centroids of data domain clusters in the vector space, each data domain cluster comprising data domain vectors representing data domains, sorting the data domain clusters according to their respective distances to the data object vector, and iteratively applying data domain classifiers corresponding to data domains represented in a closest data domain cluster in the sorted data domain clusters to the data object.
    Type: Grant
    Filed: November 3, 2021
    Date of Patent: January 30, 2024
    Assignee: Informatica LLC
    Inventor: Igor Balabine
  • Publication number: 20240028650
    Abstract: A system, method and computer-readable medium for determining a data domain associated with data, including parsing a document to generate one or more document indexes corresponding to the document, the one or more document indexes comprising a plurality of index terms and location information, determining a syntactic confidence score corresponding to a non-dictionary term in the plurality of index terms based on a syntactic analysis of the non-dictionary term, determining a proximity confidence score corresponding to the non-dictionary term based on the location information and at least one proximity query associated with the non-dictionary term and one or more other terms in the document index, determining a semantic confidence score based on a plurality of dictionary terms in the plurality of index terms, and determining an overall confidence score corresponding to the non-dictionary term based on the syntactic confidence score, the proximity confidence score, and the semantic confidence score.
    Type: Application
    Filed: April 25, 2023
    Publication date: January 25, 2024
    Inventor: Igor Balabine
  • Publication number: 20230412357
    Abstract: A method, apparatus, and computer-readable medium for searching polymorphically encrypted data includes storing one or more pseudonymous tokens in a data store, the one or more pseudonymous tokens being generated by encrypting a ciphertext using a first algorithm and an encryption key, the first algorithm comprising a polymorphic algorithm configured to generate a distinct pseudonymous token for each application of the polymorphic algorithm to the same plaintext, and identifying data in the data store that corresponds to the ciphertext by querying the data store using a search token generated by encrypting the plaintext using a second algorithm and the encryption key, the search token being distinct from the one or more pseudonymous tokens.
    Type: Application
    Filed: June 15, 2023
    Publication date: December 21, 2023
    Inventors: Igor Balabine, Richard Grondin
  • Patent number: 11805143
    Abstract: The present invention relates to systems and methods for detecting anomalies in computer network traffic with fewer false positives and without the need for time-consuming and unreliable historical baselines. Upon detection, traffic anomalies can be processed to determine valuable network insights, including health of interfaces, devices and network services, as well as to provide timely alerts in the event of attack.
    Type: Grant
    Filed: August 3, 2021
    Date of Patent: October 31, 2023
    Assignee: NETFLOW LOGIC CORPORATION
    Inventors: Igor Balabine, Alexander Velednitsky
  • Patent number: 11792218
    Abstract: A system, method, and computer-readable medium for determining risk associated with anomalous behavior of a user on a computer network including receiving anomalous behavioral data corresponding to anomalous activity of the user on the computer network, determining surprisal values corresponding to one or more behavioral factors based on one or more of: one or more probabilities corresponding to one or more current values or one or more characteristics of the one or more behavioral factors, determining one or more dynamic weights corresponding to the one or more behavioral factors based at least in part on the one or more current values and historically expected values of the one or more behavioral factors for the user, and determining a risk metric corresponding to the anomalous activity of the user based on the surprisal values, the one or more dynamic weights, and static weights assigned to the one or more behavioral factors.
    Type: Grant
    Filed: March 21, 2022
    Date of Patent: October 17, 2023
    Assignee: Informatica LLC
    Inventor: Igor Balabine
  • Patent number: 11716191
    Abstract: A method, apparatus, and computer-readable medium for searching polymorphically encrypted data includes storing one or more pseudonymous tokens in a data store, the one or more pseudonymous tokens being generated by encrypting a ciphertext using a first algorithm and an encryption key, the first algorithm comprising a polymorphic algorithm configured to generate a distinct pseudonymous token for each application of the polymorphic algorithm to the same plaintext, and identifying data in the data store that corresponds to the ciphertext by querying the data store using a search token generated by encrypting the plaintext using a second algorithm and the encryption key, the search token being distinct from the one or more pseudonymous tokens.
    Type: Grant
    Filed: February 10, 2021
    Date of Patent: August 1, 2023
    Assignee: Informatica LLC
    Inventors: Igor Balabine, Richard Grondin
  • Patent number: 11669574
    Abstract: A system, method and computer-readable medium for determining a data domain associated with data, including parsing a document to generate one or more document indexes corresponding to the document, the one or more document indexes comprising a plurality of index terms and location information, determining a syntactic confidence score corresponding to a non-dictionary term in the plurality of index terms based on a syntactic analysis of the non-dictionary term, determining a proximity confidence score corresponding to the non-dictionary term based on the location information and at least one proximity query associated with the non-dictionary term and one or more other terms in the document index, determining a semantic confidence score based on a plurality of dictionary terms in the plurality of index terms, and determining an overall confidence score corresponding to the non-dictionary term based on the syntactic confidence score, the proximity confidence score, and the semantic confidence score.
    Type: Grant
    Filed: August 1, 2017
    Date of Patent: June 6, 2023
    Assignee: Informatica LLC
    Inventor: Igor Balabine
  • Publication number: 20230133127
    Abstract: An apparatus, computer-readable medium, and computer-implemented method for efficiently classifying a data object, including representing the data object as a data object vector in a vector space, each dimension of the data object vector corresponding to a different feature of the data object, determining a distance between the data object vector and centroids of data domain clusters in the vector space, each data domain cluster comprising data domain vectors representing data domains, sorting the data domain clusters according to their respective distances to the data object vector, and iteratively applying data domain classifiers corresponding to data domains represented in a closest data domain cluster in the sorted data domain clusters to the data object.
    Type: Application
    Filed: November 3, 2021
    Publication date: May 4, 2023
    Inventor: Igor BALABINE
  • Publication number: 20220327403
    Abstract: An apparatus, computer-readable medium, and computer-implemented method for postal address identification, including receiving one or more sequences of tokens corresponding to candidate postal address data objects, evaluating the sequences of tokens with the statistical postal address model to identify candidate postal address data objects, computing candidate vectors corresponding to the identified candidate postal address data objects in the vector space, and determining whether the identified candidate postal address data objects correspond to a postal address based on applying outlier detection methods to the candidate vectors and one or more clusters of the clusters.
    Type: Application
    Filed: April 13, 2021
    Publication date: October 13, 2022
    Inventors: Igor Balabine, Dina Laevsky
  • Publication number: 20220217171
    Abstract: A system, method, and computer-readable medium for determining risk associated with anomalous behavior of a user on a computer network including receiving anomalous behavioral data corresponding to anomalous activity of the user on the computer network, determining surprisal values corresponding to one or more behavioral factors based on one or more of: one or more probabilities corresponding to one or more current values or one or more characteristics of the one or more behavioral factors, determining one or more dynamic weights corresponding to the one or more behavioral factors based at least in part on the one or more current values and historically expected values of the one or more behavioral factors for the user, and determining a risk metric corresponding to the anomalous activity of the user based on the surprisal values, the one or more dynamic weights, and static weights assigned to the one or more behavioral factors.
    Type: Application
    Filed: March 21, 2022
    Publication date: July 7, 2022
    Inventor: Igor BALABINE
  • Publication number: 20220130282
    Abstract: A system, method and computer-readable medium format-preserving encryption of a numerical value, including storing a binary numerical value, the binary numerical value comprising a plurality of binary bits, dividing the plurality of binary bits into a plurality of bit groups and storing the plurality of bit groups in a plurality of bytes, encrypting each byte in the plurality of bytes using a radix value corresponding to a quantity of binary bits in a bit group corresponding to that byte to generate a plurality of ciphertext bytes, and combining a quantity of least-significant bits from each ciphertext byte in the plurality of ciphertext bytes to generate a binary ciphertext value, the quantity of least-significant bits combined from each ciphertext byte corresponding to the radix value used to generate that ciphertext byte.
    Type: Application
    Filed: January 10, 2022
    Publication date: April 28, 2022
    Inventors: Igor BALABINE, Rajagopal GUDURU, Ramesh NALLAMOTHU
  • Patent number: 11283821
    Abstract: A system, method, and computer-readable medium for determining risk associated with anomalous behavior of a user on a computer network including receiving anomalous behavioral data corresponding to anomalous activity of the user on the computer network, determining surprisal values corresponding to one or more behavioral factors based on one or more of: one or more probabilities corresponding to one or more current values or one or more characteristics of the one or more behavioral factors, determining one or more dynamic weights corresponding to the one or more behavioral factors based at least in part on the one or more current values and historically expected values of the one or more behavioral factors for the user, and determining a risk metric corresponding to the anomalous activity of the user based on the surprisal values, the one or more dynamic weights, and static weights assigned to the one or more behavioral factors.
    Type: Grant
    Filed: September 13, 2017
    Date of Patent: March 22, 2022
    Assignee: INFORMATICA LLC
    Inventor: Igor Balabine
  • Publication number: 20220030021
    Abstract: The present invention relates to systems and methods for detecting anomalies in computer network traffic with fewer false positives and without the need for time-consuming and unreliable historical baselines. Upon detection, traffic anomalies can be processed to determine valuable network insights, including health of interfaces, devices and network services, as well as to provide timely alerts in the event of attack.
    Type: Application
    Filed: August 3, 2021
    Publication date: January 27, 2022
    Inventors: Igor Balabine, Alexander Velednitsky
  • Publication number: 20220019678
    Abstract: A system, method and computer-readable medium for generating a data masking syntactic definition for a data element of an unknown data type, including generating one or more alphabets corresponding to one or more element member positions of the data element based at least in part on element members occurring at each element member position in a plurality of data elements of the unknown type, each alphabet comprising a set of one or more sequential element members that have occurred in the plurality of data elements at an element member position and generating a positional map describing a syntactic structure of the data element by mapping at least one of the one or more alphabets to each element member position of the data element.
    Type: Application
    Filed: February 24, 2021
    Publication date: January 20, 2022
    Inventors: Bala Kumaresan, Igor Balabine
  • Patent number: 11222554
    Abstract: A system, method and computer-readable medium format-preserving encryption of a numerical value, including storing a binary numerical value, the binary numerical value comprising a plurality of binary bits, dividing the plurality of binary bits into a plurality of bit groups and storing the plurality of bit groups in a plurality of bytes, encrypting each byte in the plurality of bytes using a radix value corresponding to a quantity of binary bits in a bit group corresponding to that byte to generate a plurality of ciphertext bytes, and combining a quantity of least-significant bits from each ciphertext byte in the plurality of ciphertext bytes to generate a binary ciphertext value, the quantity of least-significant bits combined from each ciphertext byte corresponding to the radix value used to generate that ciphertext byte.
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: January 11, 2022
    Assignee: INFORMATICA LLC
    Inventors: Igor Balabine, Rajagopal Guduru, Ramesh Nallamothu
  • Patent number: 11089041
    Abstract: The present invention relates to systems and methods for detecting anomalies in computer network traffic with fewer false positives and without the need for time-consuming and unreliable historical baselines. Upon detection, traffic anomalies can be processed to determine valuable network insights, including health of interfaces, devices and network services, as well as to provide timely alerts in the event of attack.
    Type: Grant
    Filed: January 16, 2020
    Date of Patent: August 10, 2021
    Assignee: NETFLOW LOGIC CORPORATION
    Inventors: Igor Balabine, Alexander Velednitsky
  • Publication number: 20210167943
    Abstract: A method, apparatus, and computer-readable medium for searching polymorphically encrypted data includes storing one or more pseudonymous tokens in a data store, the one or more pseudonymous tokens being generated by encrypting a ciphertext using a first algorithm and an encryption key, the first algorithm comprising a polymorphic algorithm configured to generate a distinct pseudonymous token for each application of the polymorphic algorithm to the same plaintext, and identifying data in the data store that corresponds to the ciphertext by querying the data store using a search token generated by encrypting the plaintext using a second algorithm and the encryption key, the search token being distinct from the one or more pseudonymous tokens.
    Type: Application
    Filed: February 10, 2021
    Publication date: June 3, 2021
    Inventors: Igor Balabine, Richard Grondin
  • Patent number: 10970404
    Abstract: A system, method and computer-readable medium for generating a data masking syntactic definition for a data element of an unknown data type, including generating one or more alphabets corresponding to one or more element member positions of the data element based at least in part on element members occurring at each element member position in a plurality of data elements of the unknown type, each alphabet comprising a set of one or more sequential element members that have occurred in the plurality of data elements at an element member position and generating a positional map describing a syntactic structure of the data element by mapping at least one of the one or more alphabets to each element member position of the data element.
    Type: Grant
    Filed: May 10, 2017
    Date of Patent: April 6, 2021
    Assignee: Informatica LLC
    Inventors: Bala Kumaresan, Igor Balabine
  • Patent number: 10958415
    Abstract: A method, apparatus, and computer-readable medium for searching polymorphically encrypted data includes generating one or more pseudonymous tokens by encrypting a ciphertext using a first algorithm and an encryption key, the first algorithm comprising a polymorphic algorithm configured to generate a distinct pseudonymous token for each application of the polymorphic algorithm to the same plaintext, storing, the one or more pseudonymous tokens in one or more data stores, and identifying data in the one or more data stores that corresponds to the ciphertext by querying the data store using a search token generated by encrypting the plaintext using a second algorithm and the encryption key, the search token being distinct from the one or more pseudonymous tokens.
    Type: Grant
    Filed: July 11, 2018
    Date of Patent: March 23, 2021
    Assignee: Informatica LLC
    Inventors: Igor Balabine, Richard Grondin
  • Publication number: 20210049935
    Abstract: A system, method and computer-readable medium format-preserving encryption of a numerical value, including storing a binary numerical value, the binary numerical value comprising a plurality of binary bits, dividing the plurality of binary bits into a plurality of bit groups and storing the plurality of bit groups in a plurality of bytes, encrypting each byte in the plurality of bytes using a radix value corresponding to a quantity of binary bits in a bit group corresponding to that byte to generate a plurality of ciphertext bytes, and combining a quantity of least-significant bits from each ciphertext byte in the plurality of ciphertext bytes to generate a binary ciphertext value, the quantity of least-significant bits combined from each ciphertext byte corresponding to the radix value used to generate that ciphertext byte.
    Type: Application
    Filed: August 16, 2019
    Publication date: February 18, 2021
    Inventors: Igor Balabine, Rajagopal Guduru, Ramesh Nallamothu