Patents Examined by David T Brooks
  • Patent number: 9928250
    Abstract: A deduplication system and method involves interaction between a software-based system that manages the overall background deduplication process and a hardware-based system that includes a hardware-based filesystem manager and a hash generator. A filesystem checkpoint mechanism is leveraged to manage the background deduplication process and also to reduce processing complexity with respect to identifying chunks that are candidates for deduplication and processing such chunks.
    Type: Grant
    Filed: February 17, 2016
    Date of Patent: March 27, 2018
    Assignee: HITACHI DATA SYSTEMS ENGINEERING UK LIMITED
    Inventors: Neil Berrington, Namita Lal, Gaurav Sharma
  • Patent number: 9928277
    Abstract: A computer system can generate a materialized query table (MQT) for data from a base table that has a plurality of rows and columns that each contain stored data. The computer system can receive a query to data stored in a row from the base table. The system can identify a column in the row and determine that data from a cell in the row and the column is above a threshold memory size. The system can generate, in response to determining that the data being above the threshold memory size, an MQT that includes data for the row and that includes a reference pointer that points to a location shared with another table.
    Type: Grant
    Filed: November 10, 2014
    Date of Patent: March 27, 2018
    Assignee: International Business Machines Corporation
    Inventors: Rafal P. Konik, Roger A. Mittelstadt, Brian R. Muras, Mark W. Theuer
  • Patent number: 9824142
    Abstract: The information processing device 1 processes document collections having tags permitting semantic class identification appended to each document and comprises a search unit 2, which creates multiple semantic class units containing one, two, or more semantic classes based on a taxonomy that identifies relationships between semantic classes, and a frequency calculation unit 3 which, for each of the semantic class units, identifies documents that match that semantic class unit in the document collections and, for these matching documents, calculates a first frequency that represents the frequency of occurrence in a designated document collection and a second frequency that represents the frequency of occurrence in non-designated document collections. Once the calculations have been performed, the search unit 2 identifies any of the semantic class units based on the first frequency and the second frequency of the matching documents.
    Type: Grant
    Filed: December 21, 2010
    Date of Patent: November 21, 2017
    Assignee: NEC CORPORATION
    Inventors: Yukitaka Kusumura, Hironori Mizuguchi, Dai Kusui
  • Patent number: 9824132
    Abstract: Disclosed are a method and system for recovering a distributed system from a failure of a data storage unit. The distributed system includes a plurality of computer systems, each having a read-write computer and a data storage unit. Data is replicated from a particular data storage unit to other data storage units using publish-subscribe model. A read-write computer receives the replicated data, processes the data for any conflicts and stores it in the data storage unit. If a data storage unit fails, another data storage unit that has latest data corresponding to the failed data storage unit is determined and the latest data is replicated to other data storage units. Accordingly, the distributed system continues to have the data of the failed data storage unit. The failed data storage unit may be reconstructed using data from one of the other data storage units in the distributed system.
    Type: Grant
    Filed: January 8, 2013
    Date of Patent: November 21, 2017
    Assignee: Facebook, Inc.
    Inventors: Sachin Kulkarni, Sanjeev Kumar, Harry Li, Laurent Demailly, Liat Atsmon Guz
  • Patent number: 9734607
    Abstract: A mutable multilevel data structure representing a graph structure may include multiple read-only levels and a single writable level. Each read-only level may include a vertex table (with references to edge tables on the same level or a different level containing elements of adjacency lists for some vertices) and an edge table (with elements of adjacency lists that changed since the previous read-only level). A hybrid variant may switch between a performance-optimized variant (whose edge tables include complete adjacency lists for vertices whose edge sets were modified) and a space-optimized variant (whose edge tables include only newly added adjacency list elements). The vertex tables and/or the writable level may be implemented using copy-on-write arrays, each including an indirection table and multiple fixed-sized data pages. Computations may be run on the read-only levels or on the writable level and read-only levels.
    Type: Grant
    Filed: September 10, 2014
    Date of Patent: August 15, 2017
    Assignee: Oracle International Corporation
    Inventors: Peter Macko, Virendra J. Marathe, Margo I. Seltzer
  • Patent number: 9720927
    Abstract: Embodiments of the present invention relate to run-length encoded sequences and supporting efficient offset-based updates of values while allowing fast lookups. In an embodiment of the present invention, an indexing scheme is disclosed, herein called count indexes, that supports O(log n) offset-based updates and lookups on a run-length sequence with n runs. In an embodiment, count indexes of the present invention support O(log n) updates on bitmapped sequences of size n. Embodiments of the present invention can be generalize to be applied to block-oriented storage systems.
    Type: Grant
    Filed: July 12, 2012
    Date of Patent: August 1, 2017
    Assignee: The Board of Trustees of the Leland Stanford Junior University
    Inventors: Abhijeet Mohapatra, Michael Genesereth
  • Patent number: 9646108
    Abstract: Methods and systems for providing a search engine capability for large datasets are disclosed. These methods and systems employ a Partition-by-Query index containing key-values pairs corresponding to keys reflecting concept-ordered search phrases and values reflecting ordered lists of document references that are responsive to the concept-ordered search phrase in a corresponding key. A large Partition-by-Query index may be partitioned across multiple servers depending on the size of the index, or the size of the index may be reduced by compressing query-references pairs into clusters. The methods and systems described herein may to provide suggestions and spelling corrections to the user, thereby improving the user's search engine experience while meeting user expectations for search quality and responsiveness.
    Type: Grant
    Filed: May 10, 2012
    Date of Patent: May 9, 2017
    Assignee: UBER TECHNOLOGIES, INC.
    Inventor: Geoffrey Hendrey
  • Patent number: 9589016
    Abstract: A computer system can generate a materialized query table (MQT) for data from a base table that has a plurality of rows and columns that each contain stored data. The computer system can receive a query to data stored in a row from the base table. The system can identify a column in the row and determine that data from a cell in the row and the column is above a threshold memory size. The system can generate, in response to determining that the data being above the threshold memory size, an MQT that includes data for the row and that includes a reference pointer that points to a location shared with another table.
    Type: Grant
    Filed: April 14, 2016
    Date of Patent: March 7, 2017
    Assignee: International Business Machines Corporation
    Inventors: Rafal P. Konik, Roger A. Mittelstadt, Brian R. Muras, Mark W. Theuer
  • Patent number: 9588858
    Abstract: Systems and methods for replicating data from a production server to a backup server include recording at least one operation on one or more data items stored in a volume of a production server. The operation may be recorded as at least one journal event in a memory. A determination may then be made regarding whether a system malfunction incident has occurred in the production server and if so, a first set of journal events may be transferred from the memory to an auxiliary storage at a first time instant. At a second time instant, a second set of journal events recorded in the memory between the first and second time instants may be transferred to the auxiliary storage. At one journal event stored in the auxiliary storage unit may then be sent for replication to a backup server.
    Type: Grant
    Filed: November 29, 2010
    Date of Patent: March 7, 2017
    Assignee: CA, Inc.
    Inventors: Xiaopin Wang, Haiyang Zhang, Shaorong Li
  • Patent number: 9582559
    Abstract: One or more processing platforms are configured to implement at least a first site of a multi-site storage system. The first site comprises a first file system resident at the first site, and a replicated version of a second file system. The second file system is resident at a second site of the multi-site storage system, and the replicated version of the second file system is resident at the first site. Virtual block storage appliances running on respective virtual machines of the first site interact with one another and with similar appliances at the second site to allow the replicated version of the second file system to be synchronized at a cache level with the second file system, and to allow a replicated version of the first file system at the second site to be synchronized at a cache level with the first file system at the first site.
    Type: Grant
    Filed: June 29, 2012
    Date of Patent: February 28, 2017
    Assignee: EMC IP Holding Company LLC
    Inventors: Sorin Faibish, John Bent
  • Patent number: 9514213
    Abstract: Systems, methods, and other embodiments associated with clustering using tri-point arbitration are described. In one embodiment, a method includes selecting a data point pair and a set of arbiter points. A tri-point arbitration similarity is calculated for data point pairs based, at least in part, on a distance between the first and second data points and the arbiter points. In one embodiment, similar data points are clustered.
    Type: Grant
    Filed: March 15, 2013
    Date of Patent: December 6, 2016
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Alan Paul Wood, Aleksey M. Urmanov, Anton A. Bougaev
  • Patent number: 9471630
    Abstract: The efficient query processing on ordered views includes: obtaining a view definition for a view for database data; obtaining a query to be executed on the database data; identifying any columns in the view that are unnecessary for the query; after processing any predicates of the query, identifying any sorts in the view definition that are unnecessary or redundant in the query; rewriting the query to eliminate the unnecessary columns and unnecessary and redundant sorts; and executing the rewritten query without materializing the view. Thus, the query is rewritten in a way that avoids materializing the view and eliminates duplicate or unnecessary sorts. This reduces the number of rows to be sorted and eliminates unnecessary columns from materialization. These reductions lead to resource savings, including memory, temporary table space required for sorting, and CPU resources.
    Type: Grant
    Filed: March 6, 2012
    Date of Patent: October 18, 2016
    Assignee: International Business Machines Corporation
    Inventors: Ajaykumar B. Gupte, Raghupathi K. Murthy, Joaquim A. Zuzarte
  • Patent number: 9449049
    Abstract: Techniques for returning estimated value of search keywords of an entire account include, for the entire account, obtaining one or more selected search keywords and their respective forecast periods and parameter settings. An estimated value of a respective search keyword in the respective forecast period is forecasted. Based on stored historical data and parameter settings of the respective search keyword, the estimated value of the respective search keyword is modified to obtain a modified estimated value. The modified estimated value of each search keyword is added up to generate an estimated value of the entire account. The estimated value of the entire account is returned to a client terminal from which the entire account is logged in. The present disclosure modifies the respective search keyword's estimated value so that the estimated value of the entire account satisfies the expected value of the client.
    Type: Grant
    Filed: August 4, 2011
    Date of Patent: September 20, 2016
    Assignee: Alibaba Group Holding Limited
    Inventors: Tao Zhang, Jiaqing Guo, Ning Guo
  • Patent number: 9442928
    Abstract: A system, method, and/or computer program product for automatic topic identification using a hypertext corpus may include a) receiving a content document(s); b) identifying or lexically scoring candidate topic(s) in the received content document based on label(s) used in a corpus to link to or relate to the candidate topics; c) evaluating or semantically scoring the candidate topic(s) of the received document based on a relationship between two or more candidate topics in the corpus; and d) weighting candidate topics for relevance based on algorithmic or statistical analysis of links or relationships in the corpus.
    Type: Grant
    Filed: September 7, 2012
    Date of Patent: September 13, 2016
    Assignee: Venio Inc.
    Inventors: John J. Szucs, Kurtis L. Warner, Thomas C. Paris, Charles D. Moye
  • Patent number: 9442930
    Abstract: A system, method, and/or computer program product for automatic topic identification using a hypertext corpus may include a) receiving a content document(s); b) identifying or lexically scoring candidate topic(s) in the received content document based on label(s) used in a corpus to link to or relate to the candidate topics; c) evaluating or semantically scoring the candidate topic(s) of the received document based on a relationship between two or more candidate topics in the corpus; and d) weighting candidate topics for relevance based on algorithmic or statistical analysis of links or relationships in the corpus.
    Type: Grant
    Filed: March 14, 2013
    Date of Patent: September 13, 2016
    Assignee: Venio Inc.
    Inventors: John Joseph Szucs, Kurtis Lee Warner, Thomas Carl Paris, Charles David Moye
  • Patent number: 9436726
    Abstract: A system, method, or computer program product for translating non-quantitative, text-based data into a quantitative risk management tool(s) including: receiving, by a computer processor(s), non-quantitative data relating to cross-border public policy; receiving, by the processor(s), at least one tag relating to said non-quantitative data; storing, by the processor(s), said non-quantitative data and said at least one tag in a database; and providing, by the processor(s), quantitative risk management tools designed to provide customized, automatic daily graphical illustrations of policy activity levels on a cross-border basis using concepts and other meta-tagging tools to generate graphs. Tools may mining data to extract quantitative and graphical information from stored, tagged non-quantitative data and may semantically search those documents as well as assess correlations and covariances of cross-border policy processes, and deliver quantitative and/or graphical output results.
    Type: Grant
    Filed: June 22, 2012
    Date of Patent: September 6, 2016
    Assignee: BCM International Regulatory Analytics LLC
    Inventor: Barbara C. Matthews
  • Patent number: 9324112
    Abstract: The author ranking technique described herein is a technique to rank authors in social media systems along various dimensions, using a variety of statistical methods for utilizing those dimensions. More particularly, the technique ranks authors in social media systems through a combination of statistical techniques that leverage usage metrics, and social and topical graph characteristics. In various exemplary embodiments, the technique can rank author authority by the following: 1) temporal analysis of link sharing in which authority is computed based on a user's propensity to provide early links to web pages that subsequently become popular; 2) topical authority based on the author's links and content updates in specific topic areas; and 3) popularity and influence based on nodal properties of authors.
    Type: Grant
    Filed: November 9, 2010
    Date of Patent: April 26, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Peter Richard Bailey, Chad Carson, Scott Joseph Counts, Nikhil Bharat Dandekar, Ho John Lee, Shubha Umesh Nabar, Aditya Pal, Michael Ching, Paul Alexander Dow, Shuang Guo, Hyun-Ju Seo
  • Patent number: 9244887
    Abstract: Systems and methods are provided for analyzing through one-pass of unstructured time stamped data of a physical process. A distribution of time-stamped unstructured data is analyzed to identify a plurality of potential hierarchical structures for the unstructured data. A hierarchical analysis of the potential hierarchical structures is performed to determine an optimal frequency and a data sufficiency metric for the potential hierarchical structures. One of the potential hierarchical structures is selected as a selected hierarchical structure based on the data sufficiency metrics. The unstructured data is structured according to the selected hierarchical structure and the optimal frequency associated with the selected hierarchical structure, where said structuring of the unstructured data is performed via a single pass though the unstructured data. The identified statistical analysis of the physical process is performed using the structured data.
    Type: Grant
    Filed: July 13, 2012
    Date of Patent: January 26, 2016
    Assignee: SAS Institute Inc.
    Inventors: Michael James Leonard, Keith Eugene Crowe, Stacey M. Christian, Jennifer Leigh Sloan Beeman, David Bruce Elsheimer, Edward Tilden Blair
  • Patent number: 9218414
    Abstract: A method for searching multiple documents on a computer system includes steps for sending a query to a system core where the query is passed to a search component for searching the documents. The system core in turn receives results from the search component indicating related documents to the query and passes to a summarization component a specified number of the results. The summarization component processes related documents corresponding to the specified number of results to produce a multi-document summary. The system core receives the summary from the summarization component. The multi-document summary is received from the system core.
    Type: Grant
    Filed: June 29, 2012
    Date of Patent: December 22, 2015
    Inventor: Dmitri Soubbotin
  • Patent number: 9208590
    Abstract: Techniques are disclosed for effectively reflecting a manipulation of an object in graph data. In one example, a method enables a manipulation performed on an object as an image of a mapping of at least a part of graph data including nodes and edges to be reflected in the graph data. The method includes the step of specifying, from among nodes in the graph data that are related to the object being modified by the manipulation, a shared node that may be related to another object as well, and the step of reflecting the manipulation in the graph data while preventing the shared node from being modified.
    Type: Grant
    Filed: June 15, 2012
    Date of Patent: December 8, 2015
    Assignee: International Business Machines Corporation
    Inventors: Futoshi Iwama, Hisashi Miyashita, Hideki Tai