Patents by Inventor Konstantin Anisimovich

Konstantin Anisimovich has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11568140
    Abstract: Embodiments of the present disclosure describe a system and method for optical character recognition. In one embodiment, a system receives an image depicting text. The system extracts features from the image using a feature extractor. The system applies a first decoder to the features to generate a first intermediary output. The system applies a second decoder to the features to generate a second intermediary output, wherein the feature extractor is common to the first decoder and the second decoder. The system determines a first quality metric value for the first intermediary output and a second quality metric value for the second intermediary output based on a language model. Responsive to determining that the first quality metric value is greater than the second quality metric value, the system selects the first intermediary output to represent the text.
    Type: Grant
    Filed: November 25, 2020
    Date of Patent: January 31, 2023
    Assignee: ABBYY DEVELOPMENT INC.
    Inventors: Konstantin Anisimovich, Aleksei Zhuravlev
  • Patent number: 11379656
    Abstract: Systems and methods for automatic generation of templates for information extraction rules to extract information objects from natural language texts.
    Type: Grant
    Filed: October 26, 2018
    Date of Patent: July 5, 2022
    Assignee: ABBYY Development Inc.
    Inventors: Konstantin Anisimovich, Ruslan Garashchuk, Stepan Matskevich
  • Publication number: 20220164533
    Abstract: Embodiments of the present disclosure describe a system and method for optical character recognition. In one embodiment, a system receives an image depicting text. The system extracts features from the image using a feature extractor. The system applies a first decoder to the features to generate a first intermediary output. The system applies a second decoder to the features to generate a second intermediary output, wherein the feature extractor is common to the first decoder and the second decoder. The system determines a first quality metric value for the first intermediary output and a second quality metric value for the second intermediary output based on a language model. Responsive to determining that the first quality metric value is greater than the second quality metric value, the system selects the first intermediary output to represent the text.
    Type: Application
    Filed: November 25, 2020
    Publication date: May 26, 2022
    Inventors: Konstantin Anisimovich, Aleksei Zhuravlev
  • Publication number: 20200104354
    Abstract: Systems and methods for automatic generation of templates for information extraction rules to extract information objects from natural language texts.
    Type: Application
    Filed: October 26, 2018
    Publication date: April 2, 2020
    Inventors: Konstantin Anisimovich, Ruslan Garashchuk, Stepan Matskevich
  • Publication number: 20190294874
    Abstract: Systems and methods for automatic definition of natural language document classes. An example method comprises: producing, by a computer system, a plurality of image features by processing images of a plurality of documents; producing a plurality of text features by processing texts of a plurality of documents; producing a plurality of feature vectors, wherein each feature vector of the plurality of feature vectors comprises at least one of: a subset of the plurality of image features and a subset of the plurality of text features; clusterizing the plurality feature vectors to produce a plurality of clusters; defining a plurality of document categories, such that each document category of the plurality of document categories is defined by a respective feature cluster of the plurality of feature clusters; and training a classifier to produce a value reflecting a degree of association of an input document with one or more document categories of the plurality of document categories.
    Type: Application
    Filed: March 28, 2018
    Publication date: September 26, 2019
    Inventors: Nikita Orlov, Konstantin Anisimovich
  • Publication number: 20190180154
    Abstract: A method includes obtaining an image of text. The text in the image includes one or more words in one or more sentences. The method also includes providing the image of the text as first input to a set of trained machine learning models, obtaining one or more final outputs from the set of trained machine learning models, and extracting, from the one or more final outputs, one or more predicted sentences from the text in the image. Each of the one or more predicted sentences includes a probable sequence of words.
    Type: Application
    Filed: December 20, 2017
    Publication date: June 13, 2019
    Inventors: Nikita Orlov, Vladimir Rybkin, Konstantin Anisimovich, Azat Davletshin
  • Patent number: 10209859
    Abstract: An illustrative method according to a set of instructions stored on a memory of a computing device includes receiving, by a processor of the computing device, a search input. The method further includes searching, by the processor, a plurality of electronic storage locations for electronic files related to the search input. The method further includes determining, by the processor, the presence of an electronic file on a first electronic storage location of the plurality of electronic storage locations that is related to the search input. The method further includes displaying, by the processor, on a graphical user interface (GUI), representative information of the electronic file. The representative information includes descriptive information relating to the electronic file.
    Type: Grant
    Filed: December 22, 2014
    Date of Patent: February 19, 2019
    Assignee: Findo, Inc.
    Inventors: David Yan, Konstantin Anisimovich
  • Patent number: 10078688
    Abstract: Systems and methods for evaluating text classifier parameters based on semantic features. An example method comprises: performing a semantico-syntactic analysis of a natural language text of a corpus of natural language texts to produce a semantic structure representing a set of semantic classes; identifying a natural language text feature to be extracted using a set of values of a plurality of feature extraction parameters; partitioning the corpus of natural language texts into a training data set comprising a first plurality of natural language texts and a validation data set comprising a second plurality of natural language texts; determining, in view of the category of the training data set, the set of values of the feature extraction parameters; validating the set of values of the feature extraction parameters using the validation data set.
    Type: Grant
    Filed: May 18, 2016
    Date of Patent: September 18, 2018
    Assignee: ABBYY Production LLC
    Inventors: Sergey Kolotienko, Konstantin Anisimovich
  • Patent number: 9928234
    Abstract: An example method for natural language text classification based on semantic features comprises: performing semantico-syntactic analysis of a natural language text to produce a semantic structure representing a set of semantic classes; associating a first semantic class of the set of semantic classes with a first value reflecting a specified semantic class attribute; identifying a second semantic class associated with the first semantic class by a pre-defined semantic relationship; associating the second semantic class with a second value reflecting the specified semantic class attribute, wherein the second value is determined by applying a pre-defined transformation to the first value; evaluating a feature of the natural language text based on the first value and the second value; and determining, by a classifier model using the evaluated feature of the natural language text, a degree of association of the natural language text with a category of a pre-defined set of categories.
    Type: Grant
    Filed: May 18, 2016
    Date of Patent: March 27, 2018
    Assignee: ABBYY PRODUCTION LLC
    Inventors: Sergey Kolotienko, Konstantin Anisimovich, Andrey Valerievich Myakutin, Evgeny Mikhaylovich Indenbom
  • Patent number: 9817818
    Abstract: A method and computer system for translating sentences between languages from an intermediate language-independent semantic representation is provided. On the basis of comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, to build syntactic structures and language independent semantic structures and representations, and to synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and to perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language.
    Type: Grant
    Filed: May 21, 2012
    Date of Patent: November 14, 2017
    Assignee: ABBYY PRODUCTION LLC
    Inventors: Konstantin Anisimovich, Vladimir Selegey, Konstantin Zuev
  • Publication number: 20170293607
    Abstract: An example method for natural language text classification based on semantic features comprises: performing semantico-syntactic analysis of a natural language text to produce a semantic structure representing a set of semantic classes; associating a first semantic class of the set of semantic classes with a first value reflecting a specified semantic class attribute; identifying a second semantic class associated with the first semantic class by a pre-defined semantic relationship; associating the second semantic class with a second value reflecting the specified semantic class attribute, wherein the second value is determined by applying a pre-defined transformation to the first value; evaluating a feature of the natural language text based on the first value and the second value; and determining, by a classifier model using the evaluated feature of the natural language text, a degree of association of the natural language text with a particular category of a pre-defined set of categories.
    Type: Application
    Filed: May 18, 2016
    Publication date: October 12, 2017
    Inventors: Sergey Kolotienko, Konstantin Anisimovich, Andrey Valerievich Myakutin, Evgeny Mikhaylovich Indenbom
  • Publication number: 20170293687
    Abstract: Systems and methods for evaluating text classifier parameters based on semantic features. An example method comprises: performing a semantico-syntactic analysis of a natural language text of a corpus of natural language texts to produce a semantic structure representing a set of semantic classes; identifying a natural language text feature to be extracted using a set of values of a plurality of feature extraction parameters; partitioning the corpus of natural language texts into a training data set comprising a first plurality of natural language texts and a validation data set comprising a second plurality of natural language texts; determining, in view of the category of the training data set, the set of values of the feature extraction parameters; validating the set of values of the feature extraction parameters using the validation data set.
    Type: Application
    Filed: May 18, 2016
    Publication date: October 12, 2017
    Inventors: Sergey Kolotienko, Konstantin Anisimovich
  • Patent number: 9778817
    Abstract: Various embodiments are provided for enabling tagging of image files based on tagging or commenting on images on a social networking site. The tagging or commenting on an image on the social networking site is detected by the system. The social network tag or comment is analyzed to determine a textual tag to be assigned to image files corresponding to the social network images that have been tagged or commented on. In some implementations semantic analysis of text component of the social network tags or comments is performed. In some implementations the textual tags are then propagated to other image files associated with the user.
    Type: Grant
    Filed: June 23, 2014
    Date of Patent: October 3, 2017
    Assignee: FINDO, INC.
    Inventors: David Yan, Konstantin Anisimovich
  • Patent number: 9772998
    Abstract: The preferred embodiments provide an automated machine translation from one language to another. The source language may contain expressions or words that are not readily handled by the translation system. Such problematic words or word combinations may, for example, include the words not found in the dictionary of the translation system, as well as text fragments corresponding to structures with low ratings. To improve translation quality, such potentially erroneous words or questionable word combinations are identified by the translation system and displayed to a user by distinctive display styles in the display of a document in the source language and in its translation to a target language. A user is provided with a capability to correct erroneous or questionable words so as to improve the quality of translation.
    Type: Grant
    Filed: May 22, 2014
    Date of Patent: September 26, 2017
    Assignee: ABBYY PRODUCTION LLC
    Inventors: Konstantin Anisimovich, Tatiana Danielyan, Vladimir Selegey, Konstantin Zuev
  • Patent number: 9703776
    Abstract: Disclosed are methods, systems, and computer-readable mediums for automatic training of a syntactic and semantic parser using a genetic algorithm. An initial population is created, where the initial population comprises a vector of parameters for elements of syntactic and semantic descriptions of a source sentence. A natural language compiler (NLC) system is used to translate the sentence from the source language into a target language based on the syntactic and semantic descriptions of the source sentence. A vector of quality ratings is generated where each quality rating in the vector of quality ratings is of a corresponding parameter in the vector of parameters. Quality ratings are evaluated according to specific criterion, which comprise parameters such as a BLEU score and a number of emergency sentences. A number of parameters in the vector of parameters are replaced with adjusted parameters.
    Type: Grant
    Filed: October 17, 2016
    Date of Patent: July 11, 2017
    Assignee: ABBYY PRODUCTION LLC
    Inventor: Konstantin Anisimovich
  • Patent number: 9633005
    Abstract: A system for natural language processing is provided. A first natural language processing program may be constructed using language-independent semantic descriptions, and language-dependent morphological descriptions, lexical descriptions, and syntactic descriptions of one or more target languages. The natural language processing program may include any of machine translation, fact extraction, semantic indexing, semantic search, sentiment analysis, document classification, summarization, big data analysis, or another program. Additional sets of natural language processing programs may be constructed.
    Type: Grant
    Filed: October 8, 2014
    Date of Patent: April 25, 2017
    Assignee: ABBYY InfoPoisk LLC
    Inventors: Tatiana Danielyan, Anatoly Starostin, Konstantin Zuev, Konstantin Anisimovich, Vladimir Selegey
  • Patent number: 9626353
    Abstract: The present disclosure provides methods and systems for performing syntactic analysis of a text. In some implementations the method includes performing rough syntactic analysis of the text, generating a graph of generalized constituents of the text and filtering arcs of the graph of generalized constituents with a combination classifier which includes a tree classifier and one or more linear classifiers. The combination classifier is trained using parallel analysis of an untagged two-language text corpus.
    Type: Grant
    Filed: January 2, 2015
    Date of Patent: April 18, 2017
    Assignee: ABBYY INFOPOISK LLC
    Inventors: Konstantin Anisimovich, Konstantin Alekseevich Zuev
  • Patent number: 9588958
    Abstract: Methods are described for performing classification (categorization) of text documents written in various languages. Language-independent semantic structures are constructed before classifying documents. These structures reflect lexical, morphological, syntactic, and semantic properties of documents. The methods suggested are able to perform cross-language text classification which is based on document properties reflecting their meaning. The methods are applicable to genre classification, topic detection, news analysis, authorship analysis, etc.
    Type: Grant
    Filed: June 28, 2012
    Date of Patent: March 7, 2017
    Assignee: ABBYY InfoPoisk LLC
    Inventors: Tatiana Danielyan, Konstantin Zuev, Konstantin Anisimovich, Vladimir Selegey
  • Publication number: 20170031900
    Abstract: Disclosed are methods, systems, and computer-readable mediums for automatic training of a syntactic and semantic parser using a genetic algorithm. An initial population is created, where the initial population comprises a vector of parameters for elements of syntactic and semantic descriptions of a source sentence. A natural language compiler (NLC) system is used to translate the sentence from the source language into a target language based on the syntactic and semantic descriptions of the source sentence. A vector of quality ratings is generated where each quality rating in the vector of quality ratings is of a corresponding parameter in the vector of parameters. Quality ratings are evaluated according to specific criterion, which comprise parameters such as a BLEU score and a number of emergency sentences. A number of parameters in the vector of parameters are replaced with adjusted parameters.
    Type: Application
    Filed: October 17, 2016
    Publication date: February 2, 2017
    Inventor: Konstantin Anisimovich
  • Patent number: 9542381
    Abstract: Disclosed are methods, systems, and computer-readable mediums for automatic training of a syntactic and semantic parser using a genetic algorithm. An initial population is created, where the initial population comprises a vector of parameters for elements of syntactic and semantic descriptions of a source sentence. A natural language compiler (NLC) system is used to translate the sentence from the source language into a target language based on the syntactic and semantic descriptions of the source sentence. A vector of quality ratings is generated where each quality rating in the vector of quality ratings is of a corresponding parameter in the vector of parameters. Quality ratings are evaluated according to specific criterion, which comprise parameters such as a BLEU score and a number of emergency sentences. A number of parameters in the vector of parameters are replaced with adjusted parameters.
    Type: Grant
    Filed: January 2, 2015
    Date of Patent: January 10, 2017
    Assignee: ABBYY InfoPoisk LLC
    Inventor: Konstantin Anisimovich