Patents Examined by Dara J Glasser
  • Patent number: 11921672
    Abstract: Systems and methods are described for executing a query of raw machine data that is stored at a remote data store that may store heterogeneous data. The system can determine the directories or file types that may store event data and may instruct one or more worker nodes to access files that may store events based on the determined directories of file types. Further, the system may exclude files at the remote data store that may not be identified as potentially storing events enabling a query that implicates a heterogeneous data store to be efficiently executed.
    Type: Grant
    Filed: October 18, 2019
    Date of Patent: March 5, 2024
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Arindam Bhattacharjee, Timothy Tully
  • Patent number: 11860834
    Abstract: Techniques are provided for reporting space savings due to pattern matching in storage systems. For example, in one embodiment, an exemplary method comprises, when a given allocation unit in a storage system matches one or more predefined patterns, (i) setting a corresponding pattern flag for the given allocation unit, and (ii) incrementing at least one pattern counter; generating at least one snapshot of at least a portion of a file comprising the given allocation unit; and determining a range of data reduction attributed to pattern matching based on said at least one pattern counter, wherein one extreme of said range of data reduction attributed to pattern matching excludes said one or more predefined patterns in said at least one snapshot.
    Type: Grant
    Filed: November 1, 2017
    Date of Patent: January 2, 2024
    Assignee: EMC IP Holding Company LLC
    Inventors: Ivan Bassov, Ahsan Rashid, Philippe Armangau
  • Patent number: 11836122
    Abstract: A system, method, and computer-readable medium for generating synthetic data are described. Improved data models for databases may be achieved by improving the quality of synthetic data upon for modeling those databases. According to some aspects, these and other benefits may be achieved by using numeric distribution information in a schema describing one or more numeric fields and, based on that schema, distribution-appropriate numerical data may be generated. The schema may be compared against actual data and the schema adjusted to more closely match the actual data. In implementation, this may be effected by storing a schema with distribution information and/or one or more parameters, generating synthetic numerical data based on the schema, and, based on a comparison with actual data, modify the schema until the synthetic data is statistically similar to the actual data. A benefit may include improved database performance and indexing based on repeatable, statistically appropriate, synthetic data.
    Type: Grant
    Filed: December 19, 2019
    Date of Patent: December 5, 2023
    Assignee: Capital One Services, LLC
    Inventor: Steven Lott
  • Patent number: 11663084
    Abstract: Methods and systems for automatically upgrading or synchronizing a remote data management agent running on a remote host machine (e.g., a hardware server) to a particular version that is in-sync with a corresponding version used by a cluster of data storage nodes controlling the remote data management agent are described. The remote agent may be initially installed on the remote host and subsequent updates to the remote agent may be performed using the remote agent itself without requiring intervention by the remote host. The remote agent may comprise a backup agent and a bootstrap agent that are each exposed in different network ports or associated with different port numbers or networking addresses. The backup agent may perform data backup related tasks for backing up files stored on the remote host and the bootstrap agent may perform upgrade related tasks for upgrading the backup agent.
    Type: Grant
    Filed: August 8, 2017
    Date of Patent: May 30, 2023
    Assignee: Rubrik, Inc.
    Inventors: Jiangbin Luo, Guilherme Vale Ferreira Menezes
  • Patent number: 11663246
    Abstract: Systems, methods, and non-transitory computer readable media are configured to apply a spectral clustering technique to at least a portion of a similarity graph to generate clusters of geographic sub-regions constituting geographic regions. A tf-idf technique is performed to determine pages of a social networking system associated with a geographic region as potential local suggestions for a user associated with a geographic sub-region in the geographic region. References to at least a portion of the pages are presented as local suggestions to the user.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: May 30, 2023
    Assignee: Meta Platforms, Inc.
    Inventors: Apaorn Tanglertsampan, Jason Eric Brewer, Bradley Ray Green
  • Patent number: 11586674
    Abstract: This invention relates to methods and systems for searching. It is particularly applicable to methods of searching which enable efficient identification of compatible portfolios. Embodiments of the invention propose methods of searching which address the huge search space issue associated with identifying compatible portfolios. In particular, embodiments of the invention start their search operations simultaneously from both sides by both trying to form valid portfolios from candidate products until a valid solution is found and trying to find conflicts from the defined compatibility rules until a conflict is found which leads to the conclusion that no valid solution exists. A conclusion from either process will stop the whole searching process which can significantly reduce blind and unnecessary searching in the whole search space.
    Type: Grant
    Filed: December 28, 2016
    Date of Patent: February 21, 2023
    Assignees: Khalifa University of Science and Technology, British Telecommunications plc, Emirates Telecommunications Corporation
    Inventors: Di Wang, Ivan Boyd, Ahmad Al-Rubaie, Jason Ng
  • Patent number: 11556548
    Abstract: Systems and methods are provided that automatically process a message input, construct an intelligent query based on the processing of the message input, and provide at least one attachable entity according to the processing results and the intelligent query. In some example aspects, a message is received. A natural language processor to determine if the message is requesting content may then process the message. If the message is determined to be requesting content, then candidate sub-queries may be generated to serve as a training set for a query that will be sent to an external search engine to retrieve the attachable entity. The sub-queries may be ranked in order of relevance and performance score. The highest ranked sub-queries may then be used in the actual query that is fired against the external search engine. The external search engine may search local and remote repositories for the top K most relevant attachable entities and present them to a user for attachment in a reply message.
    Type: Grant
    Filed: August 8, 2017
    Date of Patent: January 17, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Amy Huyen Phuoc Nguyen, Bhaskar Mitra, Christophe Jacky Henri Van Gysel, Grzegorz Stanislaw Kukla, Lynn Carter Ayres, Mark Rolland Knight, Matteo Venanzi, Nicola Cancedda, Rachel Elizabeth Sirkin, Robin Michael Thomas, Roy Rosemarin, Shobana Balakrishnan, Sri Ramya Mallipudi, Tariq Sharif, Yamin Wang
  • Patent number: 11556496
    Abstract: Method and system are provided for writing output from map-reduce jobs to an archive file. The method may include providing an archive manager and exposing an interface to be called from map-reduce jobs to output to an archive file in a map-reduce distributed file system. The method may also include using a buffering database as a temporary cache to buffer updates to the archive file. Handling by the archive manager calls from map-reduce jobs may allow: reading directly from an archive file or from a job index at the buffering database; writing to a job index at the buffering database used as a temporary cache to buffer updates; and serializing updates from the buffering database to the archive file.
    Type: Grant
    Filed: November 2, 2018
    Date of Patent: January 17, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Curtis N. Browning, Niall F. McCarroll
  • Patent number: 11403295
    Abstract: Techniques for processing queries are provided. In one approach, an execution plan for a query includes multiple sub-plans, one or more of which are selected at runtime while one or more other sub-plans are not executed during execution of the execution plan. In another approach, data about misestimate is generated and stored persistently for subsequent queries. In another approach, statistics for a database object are generated automatically and efficiently while the database object is created or data items are added thereto. In another approach, a hybrid histogram is created that includes a feature of frequency histograms and a feature of height-balanced histograms. In another approach, computer jobs are executed in such a way to avoid deadlock. In another approach, changes to a database object trigger a hard parse of a query even though an execution plan already exists for the query.
    Type: Grant
    Filed: October 17, 2016
    Date of Patent: August 2, 2022
    Assignee: Oracle International Corporation
    Inventors: Sunil P. Chakkappen, Mohamed Zait, Allison W. Lee, Ali Cakmak
  • Patent number: 11321317
    Abstract: Techniques for processing queries are provided. In one approach, an execution plan for a query includes multiple sub-plans, one or more of which are selected at runtime while one or more other sub-plans are not executed during execution of the execution plan. In another approach, data about misestimate is generated and stored persistently for subsequent queries. In another approach, statistics for a database object are generated automatically and efficiently while the database object is created or data items are added thereto. In another approach, a hybrid histogram is created that includes a feature of frequency histograms and a feature of height-balanced histograms. In another approach, computer jobs are executed in such a way to avoid deadlock. In another approach, changes to a database object trigger a hard parse of a query even though an execution plan already exists for the query.
    Type: Grant
    Filed: August 1, 2017
    Date of Patent: May 3, 2022
    Assignee: Oracle International Corporation
    Inventors: Hong Su, Sunil P. Chakkappen, Mohamed Zait
  • Patent number: 11314690
    Abstract: A regenerated container file is detected, and a file in the regenerated container file is determined that is different from any file in an existing container file related to the regenerated container file. To store the regenerated container file, the different file is sent to the data storage for storing.
    Type: Grant
    Filed: January 29, 2015
    Date of Patent: April 26, 2022
    Assignee: LONGSAND LIMITED
    Inventor: Randy S. Darling
  • Patent number: 11238036
    Abstract: Described are methods, systems and computer readable media for performance logging of complex query operations.
    Type: Grant
    Filed: September 19, 2017
    Date of Patent: February 1, 2022
    Assignee: Deephaven Data Labs, LLC
    Inventors: David R. Kent, IV, Ryan Caudy, Charles Wright, Radu Teodorescu
  • Patent number: 10997037
    Abstract: A request is received from a client for accessing a first backup instance of first backup data associated with the client, wherein the request includes a first filename and a first savetime. A first directory record database is identified from a plurality of directory record databases based on a client identifier (ID) identifying the client. A first node identifier associated with a first directory comprising the first filename from the first directory record is identified. A first node record database is identified based on the first savetime from the node record databases, the first node record database including node records, each corresponding to a file or a directory of files associated with the first backup instance. A first node record is obtained from the first node record database based on the first node identifier. Node record information associated with the first node record is returned to the client.
    Type: Grant
    Filed: August 31, 2016
    Date of Patent: May 4, 2021
    Assignee: EMC IP HOLDING COMPANY LLC
    Inventors: Manjunatha Hebbar, Vaibhav Khanduja, Kiran Bachu, Srinivasan Ramachandran
  • Patent number: 10956443
    Abstract: The subject-matter described herein relates to a computer-implemented method of enabling interoperability between a first knowledge base and a second knowledge base. Each knowledge base is graphically represented and includes a plurality of nodes each defining a concept and a plurality of relations linking the plurality of nodes. The first knowledge base and the second knowledge base are encoded using different coding standards. The method comprises: identifying an entity from the plurality of entities in the second knowledge base; obtaining a mapping between the identified entity from the second knowledge base and a matching entity from the first knowledge base; and creating and storing a link between the identified entity from the second knowledge base and the matching entity from the first knowledge base.
    Type: Grant
    Filed: June 6, 2019
    Date of Patent: March 23, 2021
    Assignee: Babylon Partners Limited
    Inventors: Georgios Stoilos, David Geleta, Damir Juric, Gregory McKay, Jonathan Moore, Jessica Tanon, Claudia Schulz, Mohammad Khodadadi
  • Patent number: 10853370
    Abstract: Systems, apparatuses and methods may provide a query response. A value of a time-to-live (TTL) for data associated with a computation unit may be determined, wherein the computation unit may disallow access to the data when the TTL is expired. Additionally, a determination may be made whether the data associated with the computation unit satisfies a query. Also, a query response may be generated including the data associated with the computation unit that satisfies the query. In one example, a context for the data associated with the computation unit may be determined, wherein the computation unit may disallow access to the data based on the context. Data may be represented as a network of computation units which may provide data with a TTL that is contextual, which may be handled in a massively distributed fashion.
    Type: Grant
    Filed: June 26, 2015
    Date of Patent: December 1, 2020
    Assignee: Intel Corporation
    Inventor: Gabriel G. Infante-Lopez
  • Patent number: 10776319
    Abstract: A query of time series data stored in a database is received that specifies at least one value. The database includes (i) an index table specifying groups of segments of compressed time series data with corresponding ranges each having a lowest value and a highest value, and (ii) a segments table specifying individual segments of compressed time series data. Thereafter, using the index table, at least one group for which the specified at least one value falls within the corresponding range is identified. The segments table is then queried for the segments corresponding to the identified at least one group to generate a new segments table specifying at least one segment. Next, the at least one segment specified by the new segments table is decompressed. Data responsive to the query within the decompressed at least one segment is then identified using the specified at least one value.
    Type: Grant
    Filed: August 23, 2016
    Date of Patent: September 15, 2020
    Assignee: SAP SE
    Inventors: Frank Eichinger, Dennis Kurfiss
  • Patent number: 10754859
    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains a group of two or more integer values. Next, the system sets, for each integer value in the two or more integer values, a delta encoding tag that represents a delta encoding of the integer value with respect to at least one other integer value in the group and uses a delta encoding state represented by the delta encoding tag to calculate an encoded value from the integer value. The system then encodes a length of the encoded value in a length tag for the integer value. Finally, the system replaces the integer value in the group with an encoded version containing the delta encoding tag, the length tag, and the encoded value.
    Type: Grant
    Filed: October 28, 2016
    Date of Patent: August 25, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: SungJu Cho, Yinyi Wang, Qingpeng Niu, Andrew Rodriguez
  • Patent number: 10503799
    Abstract: A system and method for building a hyperdata hub to access an enriched data model is presented. One or more data models are built based on user input to a user interface, and one or more query definitions are built based on the user input to the user interface. Data is collected from external data sources and internal data sources, and contextual data is extracted based on the collected data according to the one or more data models and the one or more query definitions. The metadata associated with the one or more data models and one or more query definitions are stored, and data is matched with the contextual data associated with the hyperdata metadata repository.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: December 10, 2019
    Assignee: SAP SE
    Inventor: Olaf Schmidt
  • Patent number: 10437825
    Abstract: A data condenser and method provides lossless condensation of numbers, letters, words, phrases, and other indicia to data object values which results in reduction of file size. The data condenser and method classifies data as individual data objects or groups of data objects and distinguishes terms which repeat (e.g. recur). A reference library is optimized according to the quantity of classified data to minimize storage requirements. The classified data is assigned a unique value which populates the reference file. An output file is created by the data condenser using the reference library to achieve optimal lossless condensation. A data reverter and method provides for reversion of condensed data objects such as numbers, letters, words, phrases and other indicia to uncondensed data objects for efficient and accurate use without loss of data objects.
    Type: Grant
    Filed: January 29, 2015
    Date of Patent: October 8, 2019
    Assignee: Relican Analytics, Inc.
    Inventors: Andrew J. Goodwin, Matthew P. Fisher
  • Patent number: 10331688
    Abstract: A method for searching data on multiple domains includes transmitting a search query to an enterprise domain; receiving, from the enterprise domain, enterprise content matching the search query; receiving, from the enterprise domain, a search query normalized for searching a social site domain, the normalized search query being the search query in a format recognizable by the social site domain; transmitting the normalized search query to the social site domain; and receiving, from the social site domain, social site content matching the normalized search query.
    Type: Grant
    Filed: January 29, 2015
    Date of Patent: June 25, 2019
    Assignee: HYLAND SWITZERLAND SÀRL
    Inventors: Thomas David Montgomery, Derek Cunninghame Murphy, Benjamin Stephen Truscott