Patents Examined by Lin Lin M Htay
  • Patent number: 10684996
    Abstract: A distributed storage system maintains multiple logically independent file systems. Each file system includes a data set stored by a storage device of the distributed storage system. During operation, access pattern levels for the multiple logically independent file systems are determined. Thereafter, the data sets included in the multiple logically independent file systems are redistributed across multiple storage devices of the distributed storage. Redistribution of a particular data set is based at least in part on the particular file system including the particular data set and on the determined access pattern levels for the multiple logically independent file systems. In addition, each disk of a plurality of disks in the distributed storage includes a physically separated partition dedicated to storing the data of the file system that is most frequently accessed. The distribution of data is based at least in part on the presence of the physically separated partition.
    Type: Grant
    Filed: September 12, 2017
    Date of Patent: June 16, 2020
    Assignee: Quantcast Corporation
    Inventor: Silvius V. Rus
  • Patent number: 10650020
    Abstract: A system analyzes transformations for processing datasets. The transformations may be used for build a transformation script for preprocessing data for analysis by big data analysis systems. The system receives a new transformation for analysis. The system determines a measure of an impact of the new transformation operation on a dataset. The system determines statistical information describing rows of the transformed dataset that are impacted by the new transformation. The system receives request to add the new transformation to the transformation script responsive to presenting the statistical information.
    Type: Grant
    Filed: September 16, 2016
    Date of Patent: May 12, 2020
    Assignee: Trifacta Inc.
    Inventors: Vihang Jitendra Mehta, Seshadri Sadasivan Mahalingam, Philip John Vander Broek
  • Patent number: 10628490
    Abstract: Techniques for using digital entity correlation to generate a composite knowledge graph from constituent graphs. In an aspect, digital attribute values associated with primary entities may be encoded into primitives, e.g., using a multi-resolution encoding scheme. A pairs graph may be constructed, based on seed pairs calculated from correlating encoded primitives, and further expanded to include subjects and objects of the seed pairs, as well as pairs connected to relationship entities. A similarity metric is computed for each candidate pair to determine whether a match exists. The similarity metric may be based on summing a weighted landing probability over all primitives associated directly or indirectly with each candidate pair. By incorporating primitive matches from not only the candidate pair but also from pairs surrounding the candidate pair, entity matching may be efficiently implemented on a holistic basis.
    Type: Grant
    Filed: November 5, 2015
    Date of Patent: April 21, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Mohamed Yakout, Kaushik Chakrabarti, Maria Pershina
  • Patent number: 10613759
    Abstract: Described are computer-implemented methods and computing systems for automatically deduplicating a target dataset relative to a baseline dataset by providing distributed analysis of a first dataset to automatically generate a baseline dataset of the most common blocks of the first dataset, wherein the analysis is conducted in a distributed computing environment comprising a master computer system connected via a computer network to a plurality of computer systems.
    Type: Grant
    Filed: October 22, 2018
    Date of Patent: April 7, 2020
    Assignee: CODER TECHNOLOGIES, INC.
    Inventor: Ammar Hussain Bandukwala
  • Patent number: 10585888
    Abstract: A computing device is provided, comprising a processor configured to select at least one pair of tuples of columns including a source tuple from a first table and a target tuple from a second table. For each pair, the processor may select one or more rows from the source tuple and elements of the target tuple. For each selected row, the processor may programmatically generate a script that, when performed on the source tuple, produces a value consistent with the target tuple. The processor may apply each script to other rows of the source tuple and determine that an output is in the target tuple. For each column of the target tuple, for the script with output that meets a matching criterion, the processor may convey the output and, in response to a signal accepting the script, join the tables at least in part by performing each accepted script.
    Type: Grant
    Filed: October 20, 2016
    Date of Patent: March 10, 2020
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Sumit Gulwani, Daniel Adam Perelman, Ranvijay Kumar, Euan Peter Garden, Chairy Chiu Ying Cheung
  • Patent number: 10552416
    Abstract: A Processing method for processing SQL statements of different flavors by a database management system may be provided. The method comprises receiving SQL statements of different SQL flavors, parsing each received SQL statement for identifying an SQL flavor characteristic, tracking an SQL statement identifier of a related SQL PREPARE operation for determining the SQL flavor characteristic at subsequent DMBS operations, adapting a DBMS mode of the DBMS dynamically to the identified SQL flavor characteristic, adapting another received SQL statement to a DBMS mode of the database management system, and sending each of the adapted SQL statements to the DBMS for the SQL PREPARE operation under the adapted database management system mode.
    Type: Grant
    Filed: August 12, 2016
    Date of Patent: February 4, 2020
    Assignee: International Business Machines Corporation
    Inventor: Eric Kass
  • Patent number: 10552454
    Abstract: Examples of partitioning a group of related database tables are provided herein. A database table in a group of related database tables can be designated as a lead database table. A partitioning field can also be determined by which database tables in the group are partitioned. A data load, with respect to the partitioning field, of the lead database table can be calculated. The data load can include a data distribution across different values of the partitioning field. A group partitioning scheme can be determined based on the data load of the lead database table. The database tables of the group can then be partitioned according to the group partitioning scheme.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: February 4, 2020
    Assignee: SAP SE
    Inventors: Ajalesh P. Gopi, Veit Bolik, Santosh Vijayan, Naveen Kolli
  • Patent number: 10536545
    Abstract: Techniques are described herein for routing requests for database cloud services. In an embodiment, a different Uniform Resource Identifier (URI) is assigned to each respective database schema of a plurality of database schemas in one or more databases. For each respective database schema, mapping data is stored that maps at least a portion of the URI to the respective database schema. In response to receiving a first request that includes a URI and a set of one or more database commands, a target database schema in a target database is identified based on the mapping data and the URI. A connection is established for accessing the target database schema in the target database. The set of one or more database commands is sent over the connection to access the target database schema in the target database.
    Type: Grant
    Filed: September 12, 2014
    Date of Patent: January 14, 2020
    Assignee: Oracle International Corporation
    Inventors: Colm Divilly, Kristopher Leland Rice
  • Patent number: 10534790
    Abstract: Systems, methods, and computer program products to perform an operation comprising receiving, by a database management system (DBMS) from an application, a request specifying to generate a global temporary table (GTT) of a first format, identifying a first GTT, of a plurality of GTTs stored by the DBMS, that has a second format, different than the first format, modifying the first GTT such that at least a portion of the modified first GTT is of the first format, and providing the modified first GTT to the application.
    Type: Grant
    Filed: October 20, 2016
    Date of Patent: January 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Shawn J. Baranczyk, Rafal P. Konik, Roger A. Mittelstadt, Brian R. Muras
  • Patent number: 10467005
    Abstract: In a distributed processing system having multiple processing nodes including alive nodes and dead nodes, a method is provided for collecting an object from the alive nodes. The method includes maintaining a separate count value for each of remote nodes at which the object is remotely-referenced. The method further includes suppressing a collection of the object when the separate count value for any of the remote nodes is non-zero. The method also includes clearing the separate count value for a given one of the remote nodes when the given one of the remote nodes is dead.
    Type: Grant
    Filed: December 15, 2015
    Date of Patent: November 5, 2019
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Kiyokuni Kawachiya
  • Patent number: 10459648
    Abstract: File measurements are computed and stored in persistent memory of a deduplicated storage system as files are written or on demand, where the file measurements are used to estimate storage requirements for storing a subset of files. The file measurements are accumulated into an initial measurement at a first point in time and a final measurement at a second point in time to obtain an estimate of any change in a quantity of unique segments required to store the subset of files in the deduplicated storage system between the first and second points in time. Future storage requirements can be estimated based on a computed rate of change in the amount of storage required to store the subset of files between the first and second points in time.
    Type: Grant
    Filed: December 14, 2015
    Date of Patent: October 29, 2019
    Assignee: EMC IP Holding Company LLC
    Inventors: Guilherme Menezes, Abdullah Reza
  • Patent number: 10452677
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset having a data format into a dataset ingestion controller configured to form a collaborative dataset, interpreting data of the dataset against data classifications at an inference engine to derive at least an inferred attribute, associating the data with annotative data identifying the inferred attribute, and converting the dataset at a format converter to form an atomized dataset.
    Type: Grant
    Filed: June 19, 2016
    Date of Patent: October 22, 2019
    Assignee: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Patent number: 10445326
    Abstract: A method includes retrieving event analytics data for a software application installed on a plurality of user devices. The event analytics data indicates a number of times each of the application states of the software application were accessed by the user devices. The method further includes mapping function names to the application states accessed by the user devices, each of the function names indicating a functionality associated with the software application. The method further includes determining a usage fraction value for each of the function names, the usage fraction value for each function name indicating the number of times the function name was mapped relative to the number of times each of the function names was mapped. Additionally, the method includes receiving a search query, generating search results based on at least one of the usage fraction values, and transmitting search results.
    Type: Grant
    Filed: December 31, 2015
    Date of Patent: October 15, 2019
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Eric Glover
  • Patent number: 10430445
    Abstract: Methods and systems for indexing document passages are presented. In some embodiments, a computing device may identify a plurality of documents that comprise a plurality of passages. A passage index comprising a plurality of entries may be generated. Each entry may comprise keywords from a passage of the plurality of passages in one of the plurality of documents. Each entry may further comprise at least one annotation associated with the passage. A search query comprising at least one search keyword may be received. The passage index for each document of the plurality of documents may be analyzed using the at least one search keyword to identify at least one passage from the plurality of documents that matches the search query. In response to the query, the at least one passage may be presented.
    Type: Grant
    Filed: September 12, 2014
    Date of Patent: October 1, 2019
    Assignee: Nuance Communications, Inc.
    Inventors: Richard S. Crouch, Marisa F. Boston, Ali Erdem Ozcan, Peter R. Stubley
  • Patent number: 10417257
    Abstract: Systems for managing large database systems. Storage devices hold a set of one or more subject database tables that hold subject data accessed by one or more users. A data definition statement to perform a table alteration operation (e.g., a table partition operation) over a particular table in the subject database is received, parsed, and the table alteration operation over the particular table is initiated. During the processing of the data definition statement to complete the table alteration operation, a data manipulation statement to perform a table access operation over the same table is received and parsed. The data manipulation statement is not blocked, even while the table alteration operations are ongoing. Rather, during the time that the table alteration operations are ongoing, the data manipulation statement is processed using hidden tables. After the table alteration operations have completed, incoming data manipulation statements are performed over the altered database tables.
    Type: Grant
    Filed: September 16, 2016
    Date of Patent: September 17, 2019
    Assignee: Oracle International Corporation
    Inventors: Ramesh Kumar, Ananth Raghavan, George Eadon
  • Patent number: 10394613
    Abstract: A method includes encoding a data object using an encoding matrix having a unity matrix portion to produce a plurality of sets of encoded data slices that includes data encoded slices and redundancy encoded slices. The method further includes dividing a task into a set of partial tasks. The method further includes determining processing speeds of a set of distributed storage and task (DST) execution units allocated for storing the plurality of sets of encoded data slices. The method further includes mapping storage and partial task assignments regarding the data encoded slices of the plurality of sets of encoded data slices to the set of DST execution units based on the processing speeds. The method further includes outputting the data encoded slices of the plurality of sets of encoded data slices and the set of partial tasks to the set of DST execution units in accordance with the storage-task mapping.
    Type: Grant
    Filed: January 10, 2017
    Date of Patent: August 27, 2019
    Assignee: PURE STORAGE, INC.
    Inventors: Gary W. Grube, Timothy W. Markison
  • Patent number: 10372763
    Abstract: Methods, systems, and computer program products for generating probabilistic annotations for entities and relations using reasoning and corpus-level evidence are provided herein. A method includes determining candidate entities and candidate relationships between entities from text contained within textual documents associated with a given domain; generating questions to support or refute legitimacy of a given one of the candidate entities and/or a given one of the candidate relationships; automatically determining an answer to each of the questions based on analysis of one or more additional items of knowledge pertaining to the given domain; computing a confidence score for each answer; and assigning a confidence value in support of the legitimacy of the given one of the candidate entities and/or the given one of the candidate relationships, wherein said confidence value is based on an aggregation of the confidence scores for the answers to the one or more questions.
    Type: Grant
    Filed: July 13, 2015
    Date of Patent: August 6, 2019
    Assignee: International Business Machines Corporation
    Inventors: Sachindra Joshi, Raghuram Krishnapuram
  • Patent number: 10346387
    Abstract: A method includes a primary storage unit receiving a first write request including a first key and a first value; persisting the first value in a first non-volatile memory in association with the first key; broadcasting the first write request and a first set of globally-durable keys to secondary storage units; receiving, from the secondary storage units, an acknowledgement of the first write request and a first set of locally-durable keys, each of the first sets of locally-durable keys including the first key; the primary storage unit receiving a second write request including a second key and a second value; persisting the second value in the first non-volatile memory in association with the second key; and broadcasting the second write request and a second set of globally-durable keys to the secondary storage units, the second set of locally-durable keys including the first key. A system is also disclosed.
    Type: Grant
    Filed: June 13, 2016
    Date of Patent: July 9, 2019
    Assignee: SAP SE
    Inventors: Peter Bumbulis, Jeffrey Pound, Nathan Auch, Anil Kumar Goel, Matthias Ringwald, Thomas Bodner, Scott MacLean
  • Patent number: 10339337
    Abstract: An apparatus has a processor and a memory connected to the processor. The memory stores instructions executed by the processor to receive a relational query at a semi-structured document database. An access permissions index is looked-up for available documents for a user submitting the relational query. The relational query is resolved against relational schema within a relational schema index for the available documents to produce secure relational query results. The secure relational query results are supplied.
    Type: Grant
    Filed: January 12, 2017
    Date of Patent: July 2, 2019
    Assignee: MarkLogic Corporation
    Inventor: John Snelson
  • Patent number: 10331680
    Abstract: Methods for ranking search results are disclosed. The methods include obtaining an unranked set of functionally similar search that all correspond to an action. The methods further include retrieving a ranking vector corresponding to the action. The ranking vector identifies a plurality of software applications that perform a function corresponding to the action and a ranking range for each of the plurality of software applications. The methods further include iteratively ranking the functionally similar search results of the unranked set to obtain a ranked set of functionally similar search results. The ranking is based on the retrieved ranking vector and a random number generator that generates random numbers between the minimum value and the maximum value of the ranking vector.
    Type: Grant
    Filed: December 28, 2015
    Date of Patent: June 25, 2019
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Eric Glover, Patrick La Victoire