Patents Examined by Hau H Hoang
  • Patent number: 11410052
    Abstract: A method, system and computer-usable medium are disclosed for minimizing reevaluation of a ground truth corpus in response to concept drift. Certain embodiments are directed to a computer implemented comprising: generating a knowledge graph using a ground truth corpus, where the knowledge graph includes concept nodes, context definition nodes, and document nodes, where each concept node has one or more edges to a context definition node and to a document node; updating a context definition node in the knowledge graph based on context drift; identifying edges between the updated context definition node and concept nodes affected by the updated context definition; and identifying edges between the affected concept nodes and corresponding document nodes to identify document nodes affected by the context drift; and reevaluating documents in the ground truth corpus corresponding to the affected document nodes pursuant to updating the ground truth corpus to compensate for the context drift.
    Type: Grant
    Filed: January 24, 2019
    Date of Patent: August 9, 2022
    Assignee: International Business Machines Corporation
    Inventors: Tristan A. TeNyenhuis, Andrew R. Freed, Jocelyn Kong, Allegra Larche, Christopher R. Weber
  • Patent number: 11409724
    Abstract: Aspects create a tree data structure that indexes a collection of documents present in a data repository at a point in time. The tree data structure includes a plurality of nodes. For each such node, a respective root hash value of that node is determined. The root hash value of a leaf node is determined from hash value(s) for element(s) of that node that are keyed to documents in the collection. The root hash value of a parent node is determined from a root hash value for each of its child nodes. For a given document that is purported to be a target document present in the data repository at the point in time, processing is performed that uses the tree data structure in facilitating verification that the given document is the target document. This includes providing a cryptographic proof to demonstrate whether the given document is the target document.
    Type: Grant
    Filed: March 10, 2020
    Date of Patent: August 9, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Jeronimo Irazabal
  • Patent number: 11403304
    Abstract: According to one or more embodiments, operations may include gathering a set of machine learning (ML) projects from one or more repositories of ML projects based on a filtering criteria. The operations may also include ensuring executability of ML pipelines in the set of ML projects. In addition, the operations may include identifying irrelevant portions of the ML pipelines in the set of ML projects. Moreover, the operations may include generating quality features for the set of ML projects. In addition, the operations may include generating diversity features for the set of ML projects. Moreover, the operations may include selecting a subset of ML projects from the set of ML projects based on the quality features and the diversity features. In addition, the operations may include storing the subset of ML projects in a corpus of ML projects that may be adapted for use in new ML projects.
    Type: Grant
    Filed: September 2, 2020
    Date of Patent: August 2, 2022
    Assignee: FUJITSU LIMITED
    Inventors: Ripon K. Saha, Mukul R. Prasad, Chenguang Zhu
  • Patent number: 11386104
    Abstract: Disclosed is a system and method for improving database memory consumption and performance using compression of time stamp columns. A number of time stamps of a time series is received. The time stamps have a start time, and are separated by an equal increment of time that defines an interval. The start time and interval are stored in a dictionary of a column store of a database. An index is generated in the column store of the database, the index having a number of index vectors. Using the index vectors, each time stamp of the number of time stamps can be calculated from the start time stored in the dictionary and the position in the time series based on the interval stored in the dictionary.
    Type: Grant
    Filed: October 23, 2019
    Date of Patent: July 12, 2022
    Assignee: SAP SE
    Inventors: Gordon Gaumnitz, Lars Dannecker, Robert Schulze, Ivan T. Bowman, Daniel James Farrar
  • Patent number: 11386161
    Abstract: Systems and methods are disclosed herein for machine-learned vehicle desking operations. A vehicle recommendation system receives a request to determine similarities between vehicles. The request can indicate an identifier of a user-specified vehicle associated with vehicle attribute values (e.g., white color, sedan body style, 2020 manufacturing year, etc.). A machine learning model can determine respective embeddings for the vehicle attribute values and the respective embeddings can be concatenated, where the concatenated embeddings represent the user-specified vehicle in one embedding. The system can determine similarity metrics of the concatenated embeddings against reference embeddings. For example, a cosine similarity value can be determined for the concatenated embedding of the user-specified vehicle and the respective reference embeddings. Each similarity metric can represent a measure of similarity between the user-specified vehicle and a given vehicle.
    Type: Grant
    Filed: September 10, 2021
    Date of Patent: July 12, 2022
    Assignee: Tekion Corp
    Inventors: Nitika Gupta, Ved Surtani
  • Patent number: 11379421
    Abstract: Method and apparatus for compressing raw event logs into smaller readable formats are described. An example includes receiving an uncompressed log file including traces of events executed on a computing system. In the uncompressed log file, a number of consecutive events are identified referencing an action performed with different parameters, and the uncompressed log file is modified by replacing the identified consecutive events with a record indicating that an event has been repeated the number of times. In the modified log file, repeated sequences of events are identified, a compressed log file is generated by replacing, in the modified log file, repeated sequences of events with a record referencing an initial repetition of events and a difference between parameters included in the initial repetition of events and a respective repeated sequence, and the generated compressed log file is output.
    Type: Grant
    Filed: June 25, 2019
    Date of Patent: July 5, 2022
    Assignee: Amazon Technologies, Inc.
    Inventor: Mircea Ciubotariu
  • Patent number: 11366819
    Abstract: A method for obtaining an answer to a question is provided. The method may include: acquiring a question; determining at least a part of articles in a preset article database as candidate articles, and determining first scores of the candidate articles respectively, the first score of any of the candidate articles representing a matching degree between the candidate article and the question; determining at least a part of texts in each of the candidate articles as candidate texts, and determining second scores of the candidate texts respectively, the second score of any of the candidate texts representing a matching degree between the candidate text and the question; and determining at least a part of the candidate texts as the answer based on a score set of each of the candidate texts, the score set of any of the candidate texts including the second score and the first score.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: June 21, 2022
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Songtai Dai, Xinwei Feng, Miao Yu, Huanyu Zhou, Xunchao Song, Pengcheng Yuan
  • Patent number: 11361044
    Abstract: As an example, a server hosting a search engine may receive a search query and determine a searched time interval, a searched object, and a searched event. The server may select, based on the searched time interval, a portion of an object-event bipartite graph that was created using information gathered from social media sites. The server may compare attributes of individual events in the portion with attributes of the searched event to identify a set of relevant events. The server may determine objects associated with the relevant events and compare attributes of individual objects with the attributes of the searched object to identify a set of relevant objects. The search engine may provide search results that include the set of relevant objects ordered according to their similarity to the searched object.
    Type: Grant
    Filed: January 27, 2020
    Date of Patent: June 14, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Falaah Arif Khan, Tousif Mohammed, Shubham Gupta, Hung The Dinh, Ramu Kannappan
  • Patent number: 11361034
    Abstract: Embodiments are directed to representing documents using document keys. Documents that include one or more clauses may be provided. Each clause type for the one or more clauses in documents may be determined based on one or more classification models. One or more clause identifiers may be associated with the one or more clauses based on one or more clause types of each clause. A document key may be generated for each document based on an ordered collection of the one or more clauses included in each document such that each clause identifier may be positioned in the document key based on an order of its location in a corresponding clause of a document. The documents may be analyzed based on comparisons of one or more document keys corresponding to the documents. One or more reports may be generated based on one or more results of the analysis.
    Type: Grant
    Filed: November 30, 2021
    Date of Patent: June 14, 2022
    Assignee: Icertis, Inc.
    Inventors: Yogesh Haribhau Kulkarni, Sunu Engineer, Amitabh Jain, Ravi Kothari, Monish Mangalkumar Darda
  • Patent number: 11354361
    Abstract: Document discrepancy determination and mitigation can include marking a fragment of a first document and a corresponding fragment of a second document in response to determining a dependency between the first document and the second document. A discrepancy probability with respect to the first document and the second document can be identified based on a discrepancy measure, which can be determined by comparing the marking of the fragment of the first document and the marking of the corresponding fragment of the second document. One or more discrepancy mitigation procedures can be initiated in response to the discrepancy measure exceeding a predetermined threshold.
    Type: Grant
    Filed: July 11, 2019
    Date of Patent: June 7, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Paul R. Bastide, Matthew E. Broomhall, Robert E. Loredo
  • Patent number: 11347759
    Abstract: A document acquisition unit (15c) collects documents, a feature amount calculation unit (15d) calculates feature amounts of words included in the collected documents, a relevance calculation unit (15f) calculates relevances between the documents and words included in operation logs in a window operated by a user, using the calculated feature amounts of the words included in the documents, and a presentation unit (15g) presents, to the user, a predetermined number of the documents in an order of descending relevance, as related documents. In this manner, it is possible to present documents related to a user's operation to the user.
    Type: Grant
    Filed: February 1, 2019
    Date of Patent: May 31, 2022
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Yuki Urabe, Shiro Ogasawara, Kentaro Hotta
  • Patent number: 11341193
    Abstract: A computer-implemented method for template generation may include: receiving a first plurality of variables, each of the first plurality of variables specifying a feature to be implemented in a new template for document generation; for each of a plurality of existing templates for document generation, determining a degree of similarity between the first plurality of variables and a plurality of variables included in the respective existing template; upon determining that none of the degrees of similarity respectively determined for the plurality of existing templates satisfies a similarity threshold, determining that a combination of variables from two or more of the plurality of existing templates has a degree of similarity with the first plurality of variables satisfying the similarity threshold; and generating the new template based on the two or more of the plurality of existing templates.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: May 24, 2022
    Assignee: Capital One Services, LLC
    Inventor: Curtis Holt
  • Patent number: 11341192
    Abstract: A method may include receiving, from a first document management system, a first request to store a document in a blockchain enabled data store including multiple blockchain platforms. In response to the first request, the document may be converted from a first format associated with the document management system to a portable binary code format (e.g., WebAssembly format) before being sent to one of the blockchain platforms. A second request to access the document may be received from a second document management system. In response to the second request, the document in the portable binary code format may be retrieved from the blockchain platform, converted to a second to a second format associated with the second document management system, and sent to the second document management system. Related systems and articles of manufacture are also provided.
    Type: Grant
    Filed: January 28, 2020
    Date of Patent: May 24, 2022
    Assignee: SAP SE
    Inventor: Meenakshi Sundaram P
  • Patent number: 11341091
    Abstract: Customers in regulated industries face demanding compliance regulations, including content immutability. While broadened to allow software-based solutions, the regulations for immutability require content preservation to prevent overwriting, erasure or alteration of the content, where the preservation must be implemented through irrevocable features. Embodiments are directed to provision of an administrative user experience to enable customers to create a preservation policy that defines item(s) to be preserved. After detecting enablement of the policy, the item(s) may be preserved, a preservation lock on the policy may be initiated by disabling controls associated with the policy, and an attribute may be set to the policy to identify the policy as locked.
    Type: Grant
    Filed: April 17, 2019
    Date of Patent: May 24, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Julian Zbogar-Smith, Kamal Janardhan, Sanjay Ramaswamy, Le-Wu Tung
  • Patent number: 11329666
    Abstract: A method of transforming a data file, the method executed by a processor. The method includes segmenting the data file into data segments and creating a bit index for each data segment having a size that is based on a configurable or preset data group unit. The method then involves indexing each data segment into its corresponding bit index by reading all data group unit values within the data segment and updating the bit index based on the read values, and generating an output data file or files comprising the bit indexes that represent the original data file.
    Type: Grant
    Filed: October 3, 2016
    Date of Patent: May 10, 2022
    Assignee: Pacbyte Solutions Pty Ltd
    Inventor: Bruce Parker
  • Patent number: 11328025
    Abstract: A device that includes an enterprise data indexing engine (EDIE) configured to determine a first set of similarity scores between a first set of sentences from a first document and a plurality of classification descriptions. The EDIE is further configured to identify one or more classification descriptions that have a similarity score that exceeds a predetermined threshold value. The EDIE is further configured to determine a second set of similarity scores between a second set of sentences from a second document and the plurality of classification descriptions. The EDIE is further configured to identify one or more classification descriptions that have a similarity score that exceeds the predetermined threshold value. The EDIE is further configured to populate a data structure that identifies the tokens within the first set of tokens and the second set of tokens and the number of times each token appears.
    Type: Grant
    Filed: August 30, 2019
    Date of Patent: May 10, 2022
    Assignee: Bank of America Corporation
    Inventors: Matthew I. Cobb, Melissa A. Fraser, Arjun Thimmareddy
  • Patent number: 11321360
    Abstract: A method and system for receiving data relating to one or more activities performed by a user on a document within a specific time period, the one or more activities being performed by using an application, analyzing the data to identify a category of user activity based at least on the type of activity performed on the document, and transmitting a signal to a device for storage in association with the document, the signal including the identified category.
    Type: Grant
    Filed: January 17, 2020
    Date of Patent: May 3, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bernhard Kohlmeier, Madeline Schuster Kleiner, Jon Meling, Jan Heier Johansen, Vegar Skjaerven Wang
  • Patent number: 11314819
    Abstract: Techniques for intaking one or more documents are described. An exemplary method includes receiving an ingestion request to ingest a document; extracting text from the document; pre-processing the extracted text to generate pre-processed text that is predictable and analyzable; generating an index entry for the extracted text, the index entry to map the extracted text to a reserved field of a plurality of reserved fields; and storing the extracted text, index entry, and pre-processed text in at least one data storage location.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: April 26, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Jared Lee Katzman, Nithin Kunala, Bing Xiang, Krishnakumar Rajagopalan, Andrew M. Grant
  • Patent number: 11308091
    Abstract: An information collection system for efficiently collecting target information from an enormous amount of contents in a variety of formats is provided. The information collection system 100 includes a learning unit 110 and an extraction unit 130. The learning unit 110 generates, by using learning data, a parser rule for extracting a target character string from data, the target character string being a character string including specific information. The extraction unit 130 extracts the target character string from data by using the parser rule.
    Type: Grant
    Filed: September 20, 2017
    Date of Patent: April 19, 2022
    Assignee: NEC CORPORATION
    Inventors: Tatsuya Ito, Yuki Ashino, Masato Yamane
  • Patent number: 11308099
    Abstract: A method and system for ranking digital object based on an objective characteristic associated therewith are provided. The method comprising: generating a set of digital objects based on a user request, the set of digital objects being rankable according to an objective characteristic thereof; receiving a filter request from the user, the filter request being based on a secondary characteristic of digital objects in the set of digital objects; determining object parameters for the digital objects in the set of digital objects, a given object parameter being indicative of a likelihood that an inclusion of a respective digital object in a re-ranked set of digital objects will increase a quality metric of the re-ranked set of digital objects; selecting digital objects based on object parameters; ranking digital objects based on respective values of the secondary characteristic, thereby generating the re-ranked set of digital objects.
    Type: Grant
    Filed: June 10, 2020
    Date of Patent: April 19, 2022
    Assignee: YANDEX EUROPE AG
    Inventors: Aleksey Ivanovich Ustimenko, Aleksandr Leonidovich Vorobyev, Gleb Gennadevich Gusev, Pavel Viktorovich Serdyukov