Optimizing Replication Patents (Class 707/637)
-
Publication number: 20110161294Abstract: The disclosed embodiments provide a system that determines whether to dynamically replicate data segments on a node in a computing cluster that stores a collection of data segments. During operation, the system identifies a data segment from the collection that is predicted to be frequently accessed by future tasks executing in the cluster. The system then determines a slowdown that would result for the current workload of the node if the data segment were to be replicated to the node. The system also determines a predicted future benefit that would be associated with replicating the data segment to the node. If the predicted slowdown is less than the predicted future benefit, the replication system replicates the data segment to the node.Type: ApplicationFiled: December 30, 2009Publication date: June 30, 2011Applicant: SUN MICROSYSTEMS, INC.Inventors: David Vengerov, George Porter
-
Patent number: 7962448Abstract: A three tiered data synchronization system including a client, an enterprise data server, a synchronization server, and a pre-fetch data store. The client can have an associated client data store and the enterprise data server can have an associated server data store. The synchronization sever can intermittently and bidirectionally synchronize data within the client data store with data within the server data store. The pre-fetch data store can maintains data specific to synchronization requests that is used by the synchronization server when performing synchronization operations responsive to detected synchronization requests. The data specific to synchronization request used by the synchronization server is stored within the pre-fetch data store before the corresponding synchronization requests are detected by the synchronization server.Type: GrantFiled: December 17, 2004Date of Patent: June 14, 2011Assignee: International Business Machines CorporationInventors: Thomas E. Creamer, Curtis E. Hrischuk
-
Patent number: 7945533Abstract: Systems, methodologies, media, and other embodiments associated with index replication using crawl modification information are described. One exemplary system embodiment includes an enterprise search system comprising a target search system comprising an index logic that uses modified crawl information related to items associated with sources to maintain an index that supports searching of the items; and, a crawl search system comprising a pipeline processor configured to receive modified crawl information related to the items and to propagate the modified crawl information to the target system.Type: GrantFiled: February 23, 2007Date of Patent: May 17, 2011Assignee: Oracle International Corp.Inventors: Muralidhar Krishnaprasad, Chi-Ming Yang, Joaquin A. Delgado, Meeten Bhavsar, Thomas H. Chang
-
Patent number: 7933894Abstract: Systems and methods that generate specialized plans for compiling SQL queries. A plan generator component scans the query representation for parameter sensitive predicates and evaluates each predicate individually based on the parameter values. Accordingly, queries can be identified not only based on their structures, but also based on their parameter conditions. The specialized plans are more efficient for particular values, wherein queries that employ such values are optimally executed.Type: GrantFiled: June 15, 2007Date of Patent: April 26, 2011Assignee: Microsoft CorporationInventors: Marc T. Friedman, Cesar A. Galindo-Legaria, Erik Ismert, Milind Madhukar Joshi, Naveen Prakash
-
Patent number: 7933869Abstract: Exemplary methods and systems may clone a tenant database and send updates to the clone. By doing so, methods and systems may determine a difference between the clone and the tenant, making it possible to test upgrades for a tenant based on the difference. With this capability, a provider-tenant system may upgrade tenant business applications and customize tenant databases.Type: GrantFiled: December 29, 2006Date of Patent: April 26, 2011Assignee: SAP AGInventors: Wolfgang A. Becker, Alex Gruener
-
Patent number: 7904422Abstract: An identifier generation unit generates first identifiers corresponding to a plurality of first fragmentary data, into which first data that is stored in a database is divided. A database stores the first data and the first identifiers. The identifier generation unit generates, when the first data is updated to second data, second identifiers corresponding to a plurality of second fragmentary data into which the second data is divided. A deployment control unit determines whether the first identifiers are identical to the second identifiers which correspond to the first identifiers. The deployment control unit writes in the database the second fragmentary data corresponding to the second identifiers and the second identifiers when it is determined that the first identifiers are not identical to the second identifiers which correspond to the first identifiers.Type: GrantFiled: March 14, 2008Date of Patent: March 8, 2011Assignees: Kabushiki Kaisha Toshiba, Toshiba Solutions CorporationInventor: Makoto Kobara
-
Patent number: 7895160Abstract: One embodiment includes a system for application-layer monitoring of communication between one or more database clients and one or more database servers. The system includes one or more decoders residing at a decoding layer above a network layer. The decoders reside at a first network location between one or more database clients residing at one or more second network locations and one or more database servers residing at one or more third network locations. The decoders receive database messages communicated from the database clients and intended for the database servers and database messages communicated from the database servers and intended for the database clients, decode the database messages, and extract query-language statements from the database messages. The system also includes a monitoring application residing at an application layer above the decoding layer. The monitoring application resides at the first network location.Type: GrantFiled: January 26, 2009Date of Patent: February 22, 2011Assignee: Crossroads Systems, Inc.Inventors: David B. Ewing, Rick A. Martindale
-
Patent number: 7890456Abstract: A system may include reading a provider database mapping of file identifiers to page numbers of the provider database, determining one or more shared database objects of the provider database based on the provider database mapping, and registering the one or more shared database objects in a local namespace of a reader database.Type: GrantFiled: March 8, 2007Date of Patent: February 15, 2011Assignee: SAP AGInventors: Henrik Hempelmann, Martin Kittel, Torsten Strahl, Uwe H. O. Hahn
-
Patent number: 7885928Abstract: Methods and systems are provided relating to a decentralized algorithm for managing replicas of a resource in a peer-to-peer network to satisfy quality of service requirements. Individual nodes within the network storing a resource replica continually monitor the node performance against quality of service (QoS) parameters and when necessary, take appropriate action to either add or delete the resource, all the while informing the other nodes within the network that maintain a replica of the resource of the particular action for dynamic fluctuation of the count of resource replicas therein.Type: GrantFiled: March 8, 2007Date of Patent: February 8, 2011Assignee: Xerox CorporationInventors: Steven J. Harrington, Hua Liu
-
Patent number: 7882068Abstract: In one embodiment, a user or client device communicates with a distributed file system comprised of one or more physical nodes. The data on each of the physical nodes store metadata about files and directories within the file system. Some of the embodiments permit a user to take a snapshot of data stored on the file system. The snapshot may include a single file, a single directory, a plurality of files within a directory, a plurality of directories, a path on the file system that includes nested files and subdirectories, or more than one path on the file system that each includes nested files and directories. In some embodiments, systems and methods intelligently choose whether to use copy-on-write or point-in-time copy when saving data in a snapshot version of a file whose current version is being overwritten. In some embodiments, systems and methods allow snapshot users to return from a snapshot directory to the immediate parent directory from which the user entered into the snapshot.Type: GrantFiled: August 21, 2007Date of Patent: February 1, 2011Assignee: Isilon Systems, Inc.Inventors: Darren P. Schack, Eric M. Lemar, Neal T. Fachan
-
Patent number: 7877355Abstract: Techniques are described for sharing multidimensional data between software systems, such as enterprise software systems. Specifically, the techniques provide mechanisms for defining inter-application “links” for automatically moving data among different databases associated with the enterprise software systems. For example, a system may include a first database storing multidimensional data for a first software application, and a second database storing multidimensional data for a second software application. The system further includes an interface by which an administrator defines a link that specifies a source area of the first database and a target area of the second database. A link control module automatically moves multidimensional data from the source area to the target area in accordance with the link. The link control module may automatically move the multidimensional data directly between live portions of the respective databases.Type: GrantFiled: September 13, 2007Date of Patent: January 25, 2011Assignee: International Business Machines CorporationInventors: Rasmus Borgsmidt, David S. Bowen
-
Publication number: 20110016088Abstract: In accordance with certain aspects of the present invention, an anticipatory integrated system and method for performance and capacity monitoring and management of a data redundancy backup system is disclosed. In one embodiment, capacity and system performance benchmark parameters set in backup appliances prior to customer shipment are integrated into the backup appliance shipped to the customer to perform real-time field monitoring and analysis of system performance and capacity requirements. In one embodiment, these parameters are updated over time on the basis of local measurements and remotely loaded data. In one embodiment, the capacity and performance component may be usable as a standalone simulation tool to provided system modeling, monitoring and prediction of the performance and capacity requirements as the system is used by the customer.Type: ApplicationFiled: July 20, 2009Publication date: January 20, 2011Inventor: Stephen Philip SPACKMAN
-
Publication number: 20110010343Abstract: An optimization method and system. The method includes receiving by a computing system a data footprint associated with data and a human resource model. The data footprint comprises a primary data section, a secondary data section, and an archive data section. A plurality of data storage strategies are associated with the primary data section, said secondary data section, and said archive data section. The plurality of data storage strategies are compared to each other. A data staging orchestrator software module is executed. The computing system determines based on results of executing the data staging orchestrator software module, an optimal migration time, an optimal migration speed, and an optimal migration cost for managing storage for portions of the data. The computing system executes a risk modulation software module and determines a risk associated with the managing.Type: ApplicationFiled: July 13, 2009Publication date: January 13, 2011Applicant: International Business Machines CorporationInventors: Kavita Chavda, Mickey Iqbal, Seshashayee Sankarshana Murthy, Sandeep Madhav Uttamchandani
-
Publication number: 20110010342Abstract: A computer-implemented method for selecting a portion of a content repository for replication including monitoring a usage pattern of a content repository, determining one or more preferred relationship patterns for replication in response to the usage pattern, identifying content items of the content repository matching the preferred relationship patterns, and selecting the identified content items of the content repository. Monitoring the usage pattern is performed by a pattern recognition engine operating on a computer. The content repository includes two or more content items. The relationship patterns are associated with an ontology describing relationships between content items in the content repository.Type: ApplicationFiled: July 8, 2009Publication date: January 13, 2011Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Feng-Wei Chen, Michael T. Donaghy, William T. Pack, Nikhil Parekh, Jacques-Marie Thrasybule
-
Publication number: 20110010341Abstract: A computer-implemented method for selecting a portion of a content repository for replication including receiving an input describing one or more desired relationship patterns for replication, generating a query for the desired relationship patterns, identifying content items of a content repository matching the desired relationship patterns, and selecting the identified content items of the content repository. The input is received at a graphical user interface (GUI). The GUI operates on a computer. The relationship patterns are associated with an ontology describing relationships between content items in a content repository. The content repository includes a plurality of content items.Type: ApplicationFiled: July 8, 2009Publication date: January 13, 2011Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Feng-Wei Chen, Michael T. Donaghy, William T. Pack, Nikhil Parekh, Jacques-Marie Thrasybule
-
Patent number: 7860833Abstract: Statistics for temporary tables are used in optimizing the execution of a database request. A first temporary table is created using data from a data source as the result of performing an operation while executing the request. A subset of the demographics of the data source is saved in a session level memory as the demographics of the first temporary table. The saved demographics are used in optimizing the execution of the portion of the request involving the first temporary table.Type: GrantFiled: November 28, 2007Date of Patent: December 28, 2010Assignee: Teradata US, Inc.Inventor: Ramakrishna Korlapati
-
Patent number: 7861229Abstract: A schema code complexity analysis architecture that computes complexity data for output to a code designer. The complexity data can be in the form of a number and/or complexity level. A schema component is provided for processing schema code, and a metrics component, for outputting a complexity value based on complexity of the schema code. In one implementation, the schema code is based on an XML schema, and the metrics component employs a newly-defined variation of a McCabe cyclomatic complexity metric to determine the complexity value. The decisions employed for computing schema complexity can include choices, occurrence constraints, element references to substitution groups, type references to types that are extended or restricted, a multiplicity of root element declarations, and nillable elements.Type: GrantFiled: March 16, 2006Date of Patent: December 28, 2010Assignee: Microsoft CorporationInventors: Ralf Lammel, Stanislav Kitsis, David L. Remy, Priyamvadha P. Lakshminarayanan
-
Patent number: 7853561Abstract: A target database is synchronized with a source database and the source database is replicated to the target database in a database replication system. The source database is on-line during the synchronization. The source database is located at a first node of the system, and the target database is located at a second node of the system. Changes in the source database are replicated to the target database by sending a replication stream from the first node to the second node. Successive blocks of data are sent from the first node to the target database to synchronize or resynchronize the target database with the source database. Each block of data includes one or more rows of data. The successively sent blocks of data are inserted into the replication stream at a point when the respective blocks of data are current or consistent in the source database, and the replication stream is posted to the target database. Synchronization or resynchronization thereby occurs simultaneously with replication.Type: GrantFiled: January 8, 2008Date of Patent: December 14, 2010Assignee: Gravic, Inc.Inventors: Paul J. Holenstein, Bruce D. Holenstein, Gary E. Strickler
-
Patent number: 7849057Abstract: An on-disk structure of a file system has the capability to efficiently manage and organize data containers, such as snapshots, stored on a storage system. A multi-bit, monotonically increasing, snapshot identifier (“snapid”) is provided that represents a snapshot and that increases every time a snapshot is generated for a volume of the storage system. The snapid facilitates organization of snapshot metadata within, e.g., a data structure used to organize metadata associated with snapshot data. In the illustrative embodiment, the data structure is a balanced tree structure configured to index the copy-out snapshot data blocks. The snapid is also used to determine which blocks belong to which snapshots. To that end, every block that is used in a snapshot has an associated “valid-to” snapid denoting the newest snapshot for which the block is valid. The oldest snapshot for which the block is valid is one greater than the valid-to field of the next older block at the same file block number.Type: GrantFiled: March 30, 2007Date of Patent: December 7, 2010Assignee: NetApp, Inc.Inventors: Michael Kazar, Peter F. Corbett
-
Publication number: 20100293140Abstract: It is a purpose of this invention to achieve Scale-Out of the distributed database system that assumes a real-time update to be a requirement and which is achieved by dividing the database system into two or more database domains. This is to achieve handling of even larger scale databases while providing even higher performance. Assuming that the large-scale database system has been distributed to two or more of above-mentioned data base domains, in multi transaction processing with real-time update of the database object across two or more of above-mentioned database domain, this invention is achieved by executing the above-mentioned multi transaction processing to the database meta information storage management part in the database meta information management repository device by applying partition topology technology or replication topology technology for exchange and synchronization of meta information such as status information etc. at even higher speeds.Type: ApplicationFiled: May 12, 2010Publication date: November 18, 2010Inventor: Shuhei Nishiyama
-
Patent number: 7831859Abstract: A method for providing fault tolerance to multiple computer servers is disclosed. Basically, t backup computer servers are utilized to back up data from multiple active computer servers such that up to t faults can be tolerated. Data from the active computer servers are categorized under their respective data structure accordingly. In response to any access to data within one or more of the active computer servers, backup operations are performed on the accessed data in the t backup computer servers according to their data structures such that data with similar data structures are grouped under their respective fusible data structure within the t backup computer servers.Type: GrantFiled: June 17, 2008Date of Patent: November 9, 2010Assignee: The Board of Regents, University of Texas SystemInventors: Vijay K. Garg, Vinit A. Ogale
-
Patent number: 7831569Abstract: A method, apparatus, and program product are provided for preserving a query plan cache on a backup system having a dataspace containing a copy of data and a copy of a query plan cache from a production system. An update is initiated of at least a portion of the copy of the data on the backup system with a portion of the data on the production system. A merge is initiated of updated query plans in the query plan cache from the production system with corresponding query plans in the copy of the query plan cache on the backup system. Objects are correlated in the updated query plans in the copy of the query plan cache with the updated copy of the data on the backup system.Type: GrantFiled: October 10, 2007Date of Patent: November 9, 2010Assignee: International Business Machines CorporationInventors: Robert Joseph Bestgen, Shantan Kethireddy, John Matthew Santosuosso
-
Publication number: 20100280993Abstract: Systems, methods, computer program product embodiments for an active/push notification facade for a passive database are described herein. An embodiment includes an event listener configured to poll (or query) a passive database and selectively retrieve data objects based on the poll results. Such a selective database read operation may be based on the priority of an event occurring in the passive database. If events of a low priority occur, the event listener does not open the passive database immediately to retrieve additional information related to the low priority event but defers reading the passive database by a pre-determined amount of time. If a higher priority event occurs, the passive database is immediately read to retrieve details related to the high priority event from the passive database. In another embodiment, event listener reads the passive database based on event timestamp information in the passive database and stored timestamp information.Type: ApplicationFiled: April 29, 2009Publication date: November 4, 2010Inventor: Ponnusamy Jesudoss DURAI
-
Patent number: 7805533Abstract: A method for communicating with remote devices wherein the method includes sending a request through an application programming interface (API), formatting the request and routing the request to a first connection associated with a first protocol. The method further includes mapping the request into a first command, wherein the first command conforms to the first protocol and sending the first command to a remote device.Type: GrantFiled: May 28, 2008Date of Patent: September 28, 2010Assignee: Dell Products, L.P.Inventors: Ross Kevin Burns, Thomas T. O'Connell, Sudhir Shetty
-
Patent number: 7801856Abstract: Instances of complex types are logically replicated. In general, the logical replication of complex types involves converting a complex type instance from its storage format into a logical representation written in a markup language, like XML. The logical representation is then propagated to a destination (or destinations), which converts the logical representation to a storage format used at the destination for that complex type.Type: GrantFiled: August 9, 2006Date of Patent: September 21, 2010Assignee: Oracle International CorporationInventors: James W. Warner, Zhen Hua Liu, Sundeep Abraham, Muralidhar Krishnaprasad, Geeta Arora, Ravi Murthy, Nimar Arora, Edwina Lu
-
Publication number: 20100235891Abstract: A user uploads content such as files containing audio, video, graphical, data, points of interest, and other information from a user device such as a personal computer to a central server over the interne. Upon determining that a trigger event has occurred, communication and processing circuitry in a vehicle device automatically download the previously uploaded content over the interne and a short-range wireless network and store the content for use by a device such as an audio/visual/navigation unit. Examples of a trigger event include proximity to a short-range wireless communication network coupled to the internet, presence within a geofence, turning off a vehicle's engine, or detecting an SMS wake-up message while the vehicle device's main processor, transceivers not used for SMS, and auxiliary circuitry are in sleep mode. The short-range wireless network can be a vehicle owner's home network, or a commercial wi-fi hot spot, or subscription wireless service.Type: ApplicationFiled: March 15, 2010Publication date: September 16, 2010Inventors: Robert J. Oglesbee, Dane Dickie, Charles M. Link, II
-
Patent number: 7774309Abstract: A method and apparatus for obtaining data from a cluster of cache nodes in a tree-based structure is disclosed. A querying cache node outside a partition group queries data of a cache node of the partition group. The partition group is made of one or more cache nodes of the cluster, where an instance of a cache node in the partition group is copied to each cache node in the partition group. The instance of the data of the cache nodes of the partition group is copied to the querying cache node. The partition group is adjusted to include the querying cache node as a member of the partition group.Type: GrantFiled: November 19, 2007Date of Patent: August 10, 2010Assignee: Red Hat, Inc.Inventors: Manik Ram Surtanl, Jason Thomas Greene
-
Patent number: 7774312Abstract: A self-managing workload repository (AWR) infrastructure useful for a database server to collect and manage selected sets of important system performance statistics. Based on a schedule, the AWR runs automatically to collect data about the operation of the database system, and stores the data that it captures into the database. The AWR is advantageously designed to be lightweight and to self manage its use of storage space so as to avoid ending up with a repository of performance data that is larger than the database that it is capturing data about. The AWR is configured to automatically capture snapshots of statistics data on a periodic basis as well as purge stale data on a periodic basis. Both the frequency of the statistics data capture and length of time for which data is kept is adjustable. Manual snapshots and purging may also be performed. The AWR captured data allows for both system level and user level analysis to be automatically performed without unduly impacting system performance, e.g.Type: GrantFiled: September 3, 2004Date of Patent: August 10, 2010Assignee: Oracle International CorporationInventors: Gary Ngai, Michael Feng, Alex Tsukerman, Benoit Dageville, Mark Ramacher, Karl Dias, Graham S. Wood, Leng Leng Tan, Richard Sarwal, Sushil Kumar
-
Patent number: 7761421Abstract: Read, write and recovery operations for replicated data are provided. In one aspect, a system for redundant storage of data included a plurality of storage devices and a communication medium for interconnecting the storage devices. At least two of the storage devices are designated devices for storing a block of data. Each designated device has a version of the data and a first timestamp that is indicative of when the version of data was last updated and a second timestamp that is indicative of any pending update to the block of data. The read, write and recovery operations are performed to the data using the first and second timestamps to coordinate the operations among the designated devices.Type: GrantFiled: May 16, 2003Date of Patent: July 20, 2010Assignee: Hewlett-Packard Development Company, L.P.Inventors: Svend Frolund, Arif Merchant, Yasusuhi Saito, Susan Spence, Alistair Veitch
-
Patent number: 7761423Abstract: The system provides a technique for finding relevant content and content sources based on the aggregation and analysis of individual indications of relevance. The system identifies and provides selections of relevant content. It may comprise a selection acquisition subsystem, a selection network repository subsystem, a scoring engine, and a recommendation engine, and is used to generate sources of content comprising sets of prioritized links directed to a topic or community of interest.Type: GrantFiled: October 11, 2006Date of Patent: July 20, 2010Assignee: OneSpot, Inc.Inventor: Nathan Matthew Cohen
-
Patent number: 7761422Abstract: A method and apparatus for reproducing data from a recording medium using a local storage is disclosed. A method for reproducing data from a recording medium using a local storage includes the steps of: a) downloading an additional data file and a binding unit manifest file, and storing the additional data file and the binding unit manifest file in the local storage; b) determining a binding unit from among the downloaded additional data file according to the binding unit manifest file; c) performing a binding operation for binding the binding unit with an original data file contained in the recording medium according to the binding unit manifest file, and forming a virtual package; and d) reproducing additional data and original data using the formed virtual package.Type: GrantFiled: January 5, 2006Date of Patent: July 20, 2010Assignee: LG Electronics, Inc.Inventors: Kang Soo Seo, Jea Yong Yoo, Byung Jin Kim
-
Patent number: 7752164Abstract: A database system that incorporates numerous features that reduce the total cost of maintaining the database system is provided. The database system includes a database appliance that executes a database server on a platform that includes a special purpose operating system specifically tailored to the services required by the database server. The hardware may also be specially tailored to the services required by the database server. The database sever may be generated from another database server by modifying the code of the other database server optimize the code for execution on said database appliance. According to another aspect, the database appliance includes an operating system and a database server generated from another database server by modifying the code of the other database server optimize the code for execution on said database appliance.Type: GrantFiled: January 21, 2004Date of Patent: July 6, 2010Assignee: Oracle International CorporationInventors: Jay Rossiter, David Stowell, Pari Bhaduri, Jane Chen, Ramkumar Venkataraman, Yuanjiang Ou, Muthu Olagappan, Thivakaram Prakash Sivakumar
-
Patent number: 7739239Abstract: A distributed storage system including support for distinct storage classes. A system may implement a web services interface, a number of storage nodes and a coordinator. The web services interface may receive, according to a web services protocol, client requests to store data objects. A request for a given object may include an indication of a particular storage class to be associated with the given object, and where each storage class respectively specifies one or more service level agreement (SLA) characteristics. The storage nodes may store replicas of the data objects. The coordinator may receive the client requests to store the data objects from the web services interface, where in response to the given client request, the coordinator may store one or more replicas of the given data object among the storage nodes according to the SLA characteristics specified by the particular storage class.Type: GrantFiled: March 8, 2006Date of Patent: June 15, 2010Assignee: Amazon Technologies, Inc.Inventors: John David Cormie, James Christopher Sorenson, III, Alan B. Atlas, David M. Barth
-
Patent number: 7725435Abstract: A system for providing a dynamic portal page in a content distributed network responds to a client request by generating the dynamic portal page at a content engine. The content engine has a portal template with at least one dynamic portion. The content engine writes into the at least one dynamic portion of the portal template links to content cached at the content engine. The content engine also writes into the at least one dynamic portion of the portal template information about availability of content not yet cached at the content engine.Type: GrantFiled: December 15, 2003Date of Patent: May 25, 2010Assignee: Cisco Technology, Inc.Inventors: Dan Li, Ivy K. Lui
-
Publication number: 20100125552Abstract: Methods, systems, and articles of manufacture for updating map viewers include associating a first map viewer update cache with a first map viewer, the first map viewer update cache comprising a first map viewer data update, associating a second map viewer update cache with a second map viewer, the second map viewer update cache comprising a second map viewer data update, and sending one of the first and second map viewer data updates from one of the first and second map viewer update caches to the associated one of the first and second map viewers.Type: ApplicationFiled: October 30, 2008Publication date: May 20, 2010Inventors: Barry L. Peterson, John S. Bryan, Demron Ignace
-
Patent number: 7716181Abstract: Methods, apparatus and computer programs are provided for data replication, such as for replicating a database or updating a cache. Source database change transactions are captured and batched together for efficient data transfer. On receipt at a target replica system, the batch of transactions within a message is separated and separate apply transactions corresponding to the source transactions are performed in parallel. A ‘transaction processed’ table is updated to reflect apply processing for each apply transaction. This table provides a confirmation of the status of each transaction that can be used during recovery processing to prevent re-apply of an apply transaction. The table can also be used to determine when it is safe to delete an incoming message from persistent storage on the target system, and to facilitate one-phase commit processing.Type: GrantFiled: June 15, 2005Date of Patent: May 11, 2010Assignee: International Business Machines CorporationInventor: Stephen J Todd
-
Publication number: 20100114824Abstract: Systems and methods that supply a replication layer/agent that is generic to supporting a plurality of storage configuration as part of a distributed store. Such distributed store employs a Common Availability Substrate (CAS) for data transport and consistency, to render the distributed store scalable and available. Such an arrangement enables continuous operation of the store, while the replication subsystem creates new replicas (e.g., for load balancing, failover, and the like).Type: ApplicationFiled: April 8, 2009Publication date: May 6, 2010Applicant: MICROSOFT CORPORATIONInventors: Muralidhar Krishnaprasad, Anil K. Nori, Lu Xun, Anurag Gupta, Manoj Valiyaparambil
-
Publication number: 20100094806Abstract: A file storage system for storing data of a file received from a client includes a back-end file server in which the data is stored. The system includes a cache appliance in communication with the file server, such that the appliance stores portions of the data or attributes of the file, and uses the stored data or attributes to process file system requests received from the client, and which reads and writes data and attributes to the back-end file server independently. A system for responding to a file system protocol request in regard to a back-end server includes a token server.Type: ApplicationFiled: September 18, 2008Publication date: April 15, 2010Inventors: Vasilis J. Apostolides, Jason Bertschi, John R. Boyles, Joseph Nevi Hosteny, IV, Michael L. Kazar
-
Patent number: 7698318Abstract: A backup administrator can backup files from a production server on any of a plurality of different bases. In particular, some files can be replicated on a changed-byte basis. In other cases, files can be backed up by replicating updated copies of the entire file, or even byte blocks of the file. Determinations as to how a replication agent will back up a certain file or set of files can be made by a backup administrator, automatically through a predefined logic, or dynamically based on defined criteria. Corresponding agents at the production server can then flag these files as indicated. Thus, at a later point, when the DPM server requests the updates of each file, the production server can either send over copies of the changed file bytes, entire copies of the changed file itself, or even changed blocks of a file, as appropriate.Type: GrantFiled: February 10, 2006Date of Patent: April 13, 2010Assignee: Microsoft CorporationInventors: Robert M. Fries, Vinay S. Badami, Michael L. Michael, Shiraz M. Somji
-
Patent number: 7698317Abstract: Techniques are disclosed for detecting web pages with duplicate content. In one embodiment, a set of shingles is computed for each page of a group of pages. An aggregate set of shingles is determined based on the sets of shingles computed for the group of pages. A first subset from the aggregate set of shingles is determined by selecting, from the aggregate set, shingles whose frequencies in the aggregate set exceed a specified threshold. A modified set of shingles is generated for each page of the group of pages by removing, from the set of shingles for that page, any shingle included in the first subset. One or more duplicate pages in the group of pages are determined based at least in part on the modified sets of shingles generated for the group of pages.Type: GrantFiled: April 20, 2007Date of Patent: April 13, 2010Assignee: Yahoo! Inc.Inventors: Amit Sasturkar, Rajat Ahuja, Shanmugasundaram Ravikumar, Vladimir Ofitserov
-
Publication number: 20100088277Abstract: Embodiments deploy delayering techniques, and the relationships between successive versions of a rich-media file become apparent. With this, modified rich-media files suddenly present far smaller storage overhead as compared to traditional application-unaware snapshot and versioning implementations. Optimized file data is stored in suitcases. As a file is versioned, each new version of the file is placed in the same suitcase as the previous version, allowing embodiments to employ correlation techniques to enhance optimization savings.Type: ApplicationFiled: October 7, 2009Publication date: April 8, 2010Applicant: OCARINA NETWORKSInventors: Goutham Rao, Eric Brueggemann, Carter George
-
Patent number: 7685176Abstract: The present invention relates to systems and methods of asynchronous data replication. In an embodiment, the system includes a primary host that asynchronously transmits write and sync commands where each write and sync command includes a data set ID and a sequence number. A secondary host coupled to the primary host receives the write and sync commands out of order, tracks receipt of write and sync commands, and after receipt of the write and sync commands of the data set writes blocks associated with the write commands to data storage coupled to the secondary host.Type: GrantFiled: September 15, 2006Date of Patent: March 23, 2010Assignee: Pillar Data Systems, Inc.Inventors: David Alan Burton, Noel Simen Otterness, Kurt Alan Shoens