Patents Examined by Merilyn P Nguyen
  • Patent number: 11734282
    Abstract: Example aspects include techniques for performing vectorized delete operations in a distributed database system including clustering multiple files stored in the distributed database system, and generating, for each of the multiple files and based on a deletion signal, a delta array including multiple bits representing the data items in the file and indicating, based on bit value, the target data items to be deleted from the file. Generating, for each of the multiple files, the delta array can include reading at least one second file shard of multiple second file shards before performing a join operation on at least one first file shard of multiple first file shards is completed.
    Type: Grant
    Filed: March 30, 2022
    Date of Patent: August 22, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Wangchao Le, Marc Todd Friedman, Hiren Patel
  • Patent number: 11734280
    Abstract: A source table organized into a set of micro-partitions is accessed by a network-based data warehouse. A pruning index is generated based on the source table. The pruning index comprises a set of filters that indicate locations of distinct values in each column of the source table. A query directed at the source table is received at the network-based data warehouse. The query is processed using the pruning index. The processing of the query comprises pruning the set of micro-partitions of the source table to scan for data matching the query, the pruning of the plurality of micro-partitions comprising identifying, using the pruning index, a sub-set of micro-partitions to scan for the data matching the query.
    Type: Grant
    Filed: March 15, 2022
    Date of Patent: August 22, 2023
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Benoit Dageville, Ismail Oukid, Stefan Richter
  • Patent number: 11720452
    Abstract: Disclosed herein are systems and methods for determining data storage insurance policies. In an exemplary implementation, a method comprises receiving a request to add a data storage insurance policy to a plurality of data files. The method comprises extracting data file attributes and determining a data recovery score for each respective data file based on a uniqueness, criticality, and/or importance of the respective data file. The method comprises determining a hardware score for each of a plurality of performance tiers comprising at least one storage server, based on an available capacity, a performance cost, and/or data recovery scores of data files currently stored at each of the plurality of performance tiers. The method comprises selecting and executing a data storage insurance policy for the respective data file based on a plurality of data recovery rules and/or the comparison of the data recovery score and the hardware score.
    Type: Grant
    Filed: June 29, 2021
    Date of Patent: August 8, 2023
    Assignee: Acronis International GmbH
    Inventors: Alexander Tormasov, Serguei Beloussov, Stanislav Protasov
  • Patent number: 11720583
    Abstract: In a first aspect, a method includes, at a node of a Hadoop cluster, the node storing a first portion of data in HDFS data storage, executing a first instance of a data processing engine capable of receiving data from a data source external to the Hadoop cluster, receiving a computer-executable program by the data processing engine, executing at least part of the program by the first instance of the data processing engine, receiving, by the data processing engine, a second portion of data from the external data source, storing the second portion of data other than in HDFS storage, and performing, by the data processing engine, a data processing operation identified by the program using at least the first portion of data and the second portion of data.
    Type: Grant
    Filed: August 1, 2022
    Date of Patent: August 8, 2023
    Assignee: Ab Initio Technology LLC
    Inventors: Ian Schechter, Tim Wakeling, Ann M. Wollrath
  • Patent number: 11714835
    Abstract: Embodiments of the present disclosure relate generally to organizing electronic text documents. In particular, one or more embodiments comprise a content management system that improves the organization of electronic text documents by intelligently and accurately categorizing electronic text documents by topic. The content management system organizes electronic text documents based on one or more topics, without the need for a human reviewer to manually classify each electronic text document, and without the need for training a classification algorithm based on a set of manually classified electronic text documents. Further, the content management system identifies and suggests topics for electronic text documents that relate to new or emerging topics.
    Type: Grant
    Filed: February 28, 2022
    Date of Patent: August 1, 2023
    Assignee: Qualtrics, LLC
    Inventors: Jeffrey Wade Starr, Robert David Norton
  • Patent number: 11704320
    Abstract: A table organized into a set of batch units is accessed. A set of N-grams are generated for a data value in the source table. The set of N-grams include a first N-gram of a first length and a second N-gram of a second length where the first N-gram corresponds to a prefix of the second N-gram. A set of fingerprints are generated for the data value based on the set of N-grams. The set of fingerprints include a first fingerprint generated based on the first N-gram and a second fingerprint generated based on the second N-gram and the first fingerprint. A pruning index that indexes distinct values in each column of the source table is generated based on the set of fingerprints and stored in a database with an association with the source table.
    Type: Grant
    Filed: September 29, 2022
    Date of Patent: July 18, 2023
    Assignee: Snowflake Inc.
    Inventors: Ismail Oukid, Stefan Richter
  • Patent number: 11704203
    Abstract: Replication of a filesystem or a mount point or share may replicate all data that it consists of irrespective of where the data is stored. Replication protects data irrespective of location. One method is to replicate the filesystem namespace as is while skipping the data outside of the appliance/machine so that replication cost and time are reasonable. The data outside of the machine, like cloud/tape data, is protected differently. One example method includes a data protection operation configured to replicate a namespace associated with multiple data tiers. During replication, data from one of the tiers is skipped while all of the namespace metadata is replicated. The recovery restores the namespace metadata and the data that was replicated from the other tier. This may be performed in connection with cyber security, for example when replicating multi-tier data to a vault.
    Type: Grant
    Filed: September 2, 2022
    Date of Patent: July 18, 2023
    Assignee: EMC IP HOLDING COMPANY LLC
    Inventors: Nitin Madan, Kalyan C. Gunda, Bhimsen Bhanjois
  • Patent number: 11704586
    Abstract: Computational systems implement problem solving using hybrid digital/quantum computing approaches. A problem may be represented as a problem graph which is larger and/or has higher connectivity than a working and/or hardware graph of a quantum processor. A quantum processor may be used to determine approximate solutions, which solutions are provided as initial states to one or more digital processors which may implement classical post-processing to generate improved solutions. Techniques for solving problems on extended, more-connected, and/or “virtual full yield” variations of the processor's actual working and/or hardware graphs are provided. A method of operation in a computational system comprising a quantum processor includes partitioning a problem graph into sub-problem graphs, and embedding a sub-problem graph onto the working graph of the quantum processor. The quantum processor and a non-quantum processor-based device generate partial samples.
    Type: Grant
    Filed: May 9, 2022
    Date of Patent: July 18, 2023
    Assignee: D-WAVE SYSTEMS INC.
    Inventors: Murray C. Thom, Aidan P. Roy, Fabian A. Chudak, Zhengbing Bian, William G. Macready, Robert B. Israel, Kelly T. R. Boothby, Sheir Yarkoni, Yanbo Xue, Dmytro Korenkevych
  • Patent number: 11688159
    Abstract: In one embodiment, a method includes receiving an indication of a trigger action by a first user at a client system, wherein the trigger action is associated with a priming content object, identifying related content objects associated with the priming content object, selecting recommended content objects based on the priming content object, the related content objects, and profile information of the first user, wherein each of the selected recommended content objects comprises entity information of entities associated with the priming content object, and presenting content suggestions at the client system, wherein each content suggestion comprises one of the selected recommended content objects.
    Type: Grant
    Filed: April 7, 2021
    Date of Patent: June 27, 2023
    Assignee: Meta Platforms, Inc.
    Inventors: Fuchun Peng, Kun Han, Wenhai Yang, Cheng Zhang, Vivek Narayanan, Emmanouil Koukoumidis
  • Patent number: 11681708
    Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.
    Type: Grant
    Filed: September 23, 2022
    Date of Patent: June 20, 2023
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Ismail Oukid, Stefan Richter, Alejandro Salinger
  • Patent number: 11681871
    Abstract: Systems and methods are provided for receiving a query created by a user, receiving output data of at least one function to retrieve data related to the query and analyzing the output data of the at least one function to retrieve data related to the query. The systems and methods further provide for generating at least one dynamic knowledge graph associated with the output data of the at least one function, wherein the at least one dynamic knowledge graph comprises data from the output data of the at least one function and indicates relationships between the data, analyzing the at least one dynamic knowledge graph to determine data relevant to the query generated by the user, and generating a response to the query based on the data relevant in the at least one dynamic knowledge graph.
    Type: Grant
    Filed: March 4, 2022
    Date of Patent: June 20, 2023
    Assignee: SAP SE
    Inventors: Anil Babu Ankisettipalli, Karthik Uppuluri, Frank Blechschmidt
  • Patent number: 11675926
    Abstract: Methods and systems for data management of documents in one or more data repositories in a computer network or cloud infrastructure are provided. The method includes sampling the documents in the one or more data repositories and formulating representative subsets of the sampled documents. The method further includes generating sampled data sets of the sampled documents and balancing the sampled data sets for further processing of the sampled documents. The formulation of the representative subsets is performed for identification of some of the representative subsets for initial processing.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: June 13, 2023
    Assignee: DATHENA SCIENCE PTE LTD
    Inventors: Christopher Muffat, Tetiana Kodliuk
  • Patent number: 11669567
    Abstract: Methods and systems are disclosed in which audio broadcasts are converted into audio segments, for example, based on segment content. These audio segments are indexed, so as to be searchable, as computer searchable segments, for example, by network search engines and other computerized search tools.
    Type: Grant
    Filed: May 24, 2021
    Date of Patent: June 6, 2023
    Assignee: SubPLY Solutions Ltd.
    Inventors: Gal Klein, Rachel Ludmer
  • Patent number: 11663090
    Abstract: A method for recovery of missing or extra data using a bloom filter includes: storing a plurality of transaction messages, each including a transaction value; generating a bloom filter of the transaction messages, the bloom filter being generated using a number of hash rounds and with a size at least double the number of transaction messages; generating a recover message including the number of transaction messages, the number of hash rounds, the size, and the generated bloom filter; transmitting the recover message to a consensus node; receiving a response message from the consensus node, the response message including at least one additional transaction message; and inserting the at least one additional transaction message into the plurality of transaction messages.
    Type: Grant
    Filed: September 30, 2021
    Date of Patent: May 30, 2023
    Assignee: MASTERCARD INTERNATIONAL INCORPORATED
    Inventor: Steven C. Davis
  • Patent number: 11657046
    Abstract: Computer-readable media, methods, and systems are disclosed for improving performance when dropping database snapshots in an in-memory database by pruning tree branches in one or more converters associated with a snapshot of an in-memory database. A request to compare at least two database snapshots is received. Physical pages associated with at least one of the two database snapshots are loaded. First and second converter streams are iterated. Physical block numbers of intermediate converter pages to which the first and second converter streams currently point are compared. If the physical page numbers are all non-equal, the physical pages referenced by the second converter stream are set to free. If any of the physical page numbers are equal, subsequent traversal of a corresponding converter subtree in the first and second converter streams may be skipped to improve performance.
    Type: Grant
    Filed: December 17, 2021
    Date of Patent: May 23, 2023
    Assignee: SAP SE
    Inventor: Dirk Thomsen
  • Patent number: 11657024
    Abstract: A map-reduce compatible distributed file system that consists of successive component layers that each provide the basis on which the next layer is built provides transactional read-write-update semantics with file chunk replication and huge file-create rates. Containers provide the fundamental basis for data replication, relocation, and transactional updates. A container location database allows containers to be found among all file servers, as well as defining precedence among replicas of containers to organize transactional updates of container contents. Volumes facilitate control of data placement, creation of snapshots and mirrors, and retention of a variety of control and policy information. Also addressed is the use of distributed transactions in a map-reduce system; the use of local and distributed snapshots; replication, including techniques for reconciling the divergence of replicated data after a crash; and mirroring.
    Type: Grant
    Filed: August 23, 2021
    Date of Patent: May 23, 2023
    Assignee: Hewlett Packard Enterprise Development LP
    Inventors: Mandayam C. Srivas, Pindikura Ravindra, Uppaluri Vijaya Saradhi, Arvind Arun Pande, Chandra Guru Kiran Babu Sanapala, Lohit Vijaya Renu, Vivekanand Vellanki, Sathya Kavacheri, Amit Ashoke Hadke
  • Patent number: 11650988
    Abstract: The database management system (DBMS) receives a first instruction specifying anonymization rule information corresponding to a column of the relation table, among anonymization rule information that is present for each column included in the relation table and shows a plurality of generalization rules. The DBMS reads the column from the relation table in response to the first instruction, and generates a temporary result obtained by generalizing each attribute value of the column based on any of a plurality of generalization rules. The DBMS generates an aggregate result obtained by aggregating the temporary result. The DBMS generates an anonymization method including generalization information indicating a correspondence relationship of each attribute value of the column and any of the plurality of generalization rules when the aggregate result satisfies a disclosure rule. The DBMS generates anonymization information as a result of processing the relation table based on the first anonymization method.
    Type: Grant
    Filed: September 23, 2020
    Date of Patent: May 16, 2023
    Assignees: HITACHI, LTD., THE UNIVERSITY OF TOKYO
    Inventors: Yuya Isoda, Kazuhiko Mogi, Norifumi Nishikawa, Kazuo Goda, Yuto Hayamizu, Masaru Kitsuregawa
  • Patent number: 11650984
    Abstract: A technique for using a caching layer for key-value storage in a database is described. In one example of the present disclosure, a system can receive, at an unsorted data structure of a caching layer, a key-value pair associated with a data object. The unsorted data structure can store a first plurality of key-value pairs. The system can receive one or more operations for updating the key-value pair in the caching layer. The system can determine the key-value pair is to be migrated to a sorted memory table based on a caching algorithm. The system can migrate the key-value pair to a sorted memory table configured to store a second plurality of key-value pairs that is larger than the first plurality of key-value pairs and sort the key-value pair with the second plurality of key-value pairs prior to storing the key-value pair in the sorted memory table.
    Type: Grant
    Filed: July 28, 2021
    Date of Patent: May 16, 2023
    Assignee: RED HAT, INC.
    Inventors: Mark Nelson, Gabriel Zvi BenHanokh
  • Patent number: 11651586
    Abstract: In one embodiment, a method includes receiving an indication of a trigger action by a first user at a client system, wherein the trigger action is associated with a priming content object, identifying related content objects associated with the priming content object, selecting recommended content objects based on the priming content object, the related content objects, and profile information of the first user, wherein each of the selected recommended content objects comprises entity information of entities associated with the priming content object, and presenting content suggestions at the client system, wherein each content suggestion comprises one of the selected recommended content objects.
    Type: Grant
    Filed: April 7, 2021
    Date of Patent: May 16, 2023
    Assignee: Meta Platforms, Inc.
    Inventors: Fuchun Peng, Kun Han, Wenhai Yang, Cheng Zhang, Vivek Narayanan, Emmanouil Koukoumidis
  • Patent number: 11645253
    Abstract: A method includes: receiving an indication of a new integration flow document that updates a current integration flow document of an integration flow; determining a last switchable node that is included in both the current integration flow document and the new integration flow document; determining, for each running instance of the integration flow using the current integration flow document, whether a currently executing node of the instance is less than or equal to the last switchable node; in response to determining the currently executing node of the instance is less than or equal to the last switchable node, switching the instance to the new integration flow document; and in response to determining the currently executing node of the instance is greater than the last switchable node, waiting for completion of the instance using the current integration flow document.
    Type: Grant
    Filed: December 1, 2021
    Date of Patent: May 9, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Chengxuan Xing, Ashley Donald Harrison, Matthew Paul Chapman, Vlad Balanescu
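
The switch-or-wait rule in the abstract of patent 11645253 above can be illustrated with a minimal Python sketch. It is not the patented method: it assumes each integration flow document is an ordered list of node names, that the "last switchable node" is the final node of the longest common prefix of the two documents, and that an instance's position is the index of the node it is currently executing. The function names and the sample flows are hypothetical.

```python
from typing import Dict, List, Optional


def last_switchable_index(current: List[str], new: List[str]) -> Optional[int]:
    """Return the index of the last node shared by both documents.

    Assumption: "shared" means the longest common prefix of the two
    ordered node lists. Returns None when even the first node differs.
    """
    index = None
    for i, (old_node, new_node) in enumerate(zip(current, new)):
        if old_node != new_node:
            break
        index = i
    return index


def plan_instances(
    current: List[str], new: List[str], executing: Dict[str, int]
) -> Dict[str, str]:
    """Decide, per running instance, whether to switch documents or wait.

    `executing` maps a (hypothetical) instance id to the index of the node
    that instance is currently executing in the current document.
    """
    switchable = last_switchable_index(current, new)
    plan = {}
    for instance_id, node_index in executing.items():
        if switchable is not None and node_index <= switchable:
            plan[instance_id] = "switch"  # not yet past the shared part
        else:
            plan[instance_id] = "wait"    # finish on the current document
    return plan


# Hypothetical flows: the documents diverge at the third node.
current_flow = ["receive", "validate", "transform", "send"]
new_flow = ["receive", "validate", "enrich", "send"]
result = plan_instances(current_flow, new_flow, {"a": 0, "b": 1, "c": 2})
print(result)
```

Under these assumptions, instances "a" and "b" are still at or before the last shared node and are switched to the new document, while instance "c" has moved past it and runs to completion on the current document.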