Safe Storing Data for Disaster Recovery
A computer-implemented method and system for safe storing data is disclosed. A sending queue manager associated with a client computer transmits an asynchronous write of a transaction message containing data to a receiving queue manager associated with a transaction processing computer not colocated with the client computer. The receiving queue manager transmits a synchronous write of the transaction message to a remote queue manager associated with a remote disaster recovery computer to safe store the transaction message before it can be operated upon by the transaction processing computer.
The present invention relates generally to data management and, more particularly, to a method for safe storing data in a disaster recovery scenario.
BACKGROUND OF THE INVENTIONTransaction processing systems, particularly those employed in financial institutions, receive and process thousands of transactions a day. Each of these transactions may require operations to be performed on large amounts of data. As such, data management between communicating systems must be highly reliable. If the processing of a transaction fails due to an outage (e.g., a power loss or server failure), then it may be necessary to return a system to a known state of operation. The process of returning to a known state may be referred to as reconciliation.
It is desirable for transaction processing systems to be enabled with mechanisms for protecting against, as well as for recovering from, loss of data due to unexpected outages. Common mechanisms employed for data protection include, but are not limited to: (1) backup of data made to electronic storage media stored at regular intervals; (2) replication of data to an off-site location, which overcomes the need to restore the data (the corresponding systems then need only be restored or synchronized); and (3) high availability systems configured to keep both the data and system replicated off-site, enabling continuous access to systems and data.
In a disaster recovery context, replication of data may also be referred to as data mirroring. Depending on the technologies used, data mirroring may be performed synchronously, asynchronously, semi-synchronously, or point-in-time. As used herein, the term “asynchronous process” refers to a process that executes in the background and occurs as soon as it can in the background. As used herein, the term “asynchronous process” refers to a process that executes directly in line with other processes and does not allow other processes to continue until one or more executable steps (e.g., a put or wire) is completed.
Prior art data mirroring executed synchronously (i.e., using one or more synchronous processes) achieves a recovery point objective (RPO) of zero lost data, but may require unacceptably long execution time of a few minutes to perhaps several hours. Prior art data mirroring executed asynchronously (i.e., using one or more asynchronous processes) may achieve an RPO of just a few seconds, but does not guarantee zero data lost.
In high volume transaction processing systems, such as those used by financial institutions, an RPO of even just a few seconds is not acceptable, and may result in the loss of millions of dollars to clients and/or the transaction system provider. In addition, any remedial steps taken, depending upon the volume of data being received, should not add more than about a 50 to 100 milliseconds additional delay to complete a single message process. A person skilled in the art will recognize that a business can tolerate this time increase in the complete message cycle, since the time to transfer a single message is on the order of 250 milliseconds, mostly as a result of long distances between client and server. Also, a 50 to 100 milliseconds additional delay will not have any noticeable effect unless new messages arrive while the current message is still being processed on a specific channel.
Due to the smaller delays introduced by asynchronous mirroring methods, they are more frequently implemented. Unfortunately, prior art disaster recovery systems that employ asynchronous mirroring methods over long distances run the risk of data loss in the event of an outage. A disaster recovery declaration will result in a systems recovery to a point-in-time preceding the actual outage event. This results in a potential loss of data, which can be several seconds or minutes in duration and account for a plurality of transactions. In such circumstances, a receiving transaction processing system may complete a number of transactions and acknowledge their completion back to a requesting system before a disaster recovery system has safe stored all of the transactions. As used herein, the term “safe storing” refers to a transaction message that is received and stored in its original state prior to being processed.
Solutions are needed to account for and reconcile lost transaction messages, as well as to retrieve and process the same. Unfortunately, the widespread use of MQ network messaging technology with its “destructive” read of message traffic creates an environment whereby lost data cannot be re-sent by the sending systems or cannot be retrieved from message queues associated with the transaction processing system. Thus, a disaster recovery system may have no record of the most recent messages processed by the transaction processing system, thereby necessitating a difficult reconciliation process. This presents an unacceptable financial risk to businesses and requires a solution.
Accordingly, there exists a need for a method and system for safe storing transaction messages, data, and acknowledgements over long distances that permits minimal or no loss of data in a disaster recovery scenario.
SUMMARY OF THE INVENTIONThe above-described problems are addressed and a technical solution is achieved in the art by providing a computer-implemented method and system for safe storing transaction messages in a disaster recovery scenario. A client computer sends transaction messages to a remotely located transaction processing computer. At substantially the same time, these transaction messages are also routed to a remotely located disaster recovery computer that is neither colocated with the sending computer or the transaction processing computer.
In a preferred embodiment, a sending queue manager associated with the client computer transmits an asynchronous write of a transaction message to a receiving queue manager associated with the transaction processing computer. Upon receipt, the receiving queue manager transmits a synchronous write of the transaction message to a remote disaster recovery queue manager associated with a remote disaster recovery computer to safe store the transaction message. Thereafter, the receiving queue manager transmits the same transaction message to a transaction processing queue manager where the transaction message is then processed by the transaction processing system, thereby preventing any possible loss of the message/data.
When the transaction message is received b, the remote disaster recovery computer, the corresponding data is written to at least one memory component communicatively coupled to the remote disaster recovery computer at a remote recovery site. In a preferred embodiment, writing the information to the at least one memory component communicatively coupled to the remote disaster recovery computer may further comprise executing a synchronous PUT command. Additionally, when the transaction message is received by the transaction processing computer the corresponding data may also be written to at least one memory component communicatively coupled to the transaction processing computer at a primary site. In a preferred embodiment, writing the information to the at least one memory component communicatively coupled to the transaction processing computer at the primary site may further comprise executing an asynchronous PUT command.
The present invention will be more readily understood from the detailed description of exemplary embodiments presented below considered in conjunction with the attached drawings in which like reference numerals refer to similar elements and in which:
A client computer may not be colocated with a transaction processing computer. As a result, transaction messages may be transmitted between computers over a network. A computer may be any data processing device, such as a desktop computer, a laptop computer, a tablet computer, a mainframe computer, a server, a handheld device, a digital signal processor (DSP), an embedded processor, or any other device able to process data. Computers may be configured with one or more processors and may be communicatively connected to one or more non-transitory computer-readable media and one or more networks. The term “communicatively connected” is intended to include any type of communication may be conducted over a wireless or wired medium and includes, but is not limited to, OTA (over-the-air transmission, ATSC, DVB-T), packet-switched networks (TCP/IP, e.g., the Internet), satellite (microwave, MPEG transport stream or IP), direct broadcast satellite, analog cable transmission systems (RF), and digital video transmission systems (ATSC, HD-SDI, HDMI, DVI, VGA), or any other applicable communication-enabled mechanism.
The one or more computer-readable media may be used for storing the instructions to be executed by the one or more processors, including an operating system, such as the Windows™, UNJX™, OSX™, or the Linux operating system. The computer readable media may further be used for the storing and retrieval of data in one or more databases. The computer readable media may include a combination of volatile memory, such as RAM memory, and non-volatile memory, such as flash memory, optical disk(s), and/or hard disk(s).
In
In a preferred embodiment, the transmitted messages may be transaction messages. The transaction messages may include, but are not limited to, MQ messages. Each of the transaction messages, in turn, may include, but are not limited to, one or more executable methods or commands, data elements or structures associated with the commands, acknowledgement messages, negative acknowledgement messages, function calls, or any other applicable arrangement of data.
Client computer 16 may be a plurality of servers or, alternatively, a single server. Memory 18 may be a plurality of volatile and non-volatile memory devices. As illustrated in
Transaction messages 24a-24n are received at primary site 26 and processed by at least one programmed computer 28 and associated memory 30, as illustrated in
Computer 28 is also configured to transmit the Transaction messages 24a-24n received by computer 28 at primary site 26 may also be transmitted to at least one programmed computer 50 and associated memory 52, over network 14, located at remote recovery site 36. In a preferred embodiment, each of client sites 12, primary site 26, and remote recovery site 36 are not colocated.
Similar to computer 28 at primary site 26, computer 50 may be a plurality of servers or a single server. Similarly, memory 52 may be a plurality of volatile and non-volatile memory devices. In a preferred embodiment, computer 50 may comprise a message routing server 54, as illustrated in
In
When transaction message 24a is received, at step S2a, by receiving queue manager 38, transaction message 24a may be temporarily stored, at step S2b, in processing queue 40a. Receiving queue manager 38 may then retrieve, at step S3a, transaction message 24a from processing queue 40a and, because messages may be destructively read from queues, receiving queue manager 38 may further make a copy of transaction message 24a (hereinafter referred to as transaction message 24a′). Then receiving queue manager 38 synchronously writes, at step S3b transaction message 24a′ to processing queue 60a via remote recovery queue manager 58 at remote recovery site 36 over network 14. For example, transaction message 24a′ is synchronously written to message processing queue 60a using a synchronous PUT command.
After remote queue manager 58 receives, at step S3c, transaction message 24a′, it may then temporarily store the message in processing queue 60a. Remote queue manager 58 retrieves, at step S3d, transaction message 24a′ from processing queue 60a and, similar to processing queue 40a at primary site 26, remote queue manager 58 may make a copy of transaction message 24a′ (herein after referred to as the “transaction message 24a”).
Remote queue manager 58 may then transmit, at step S3e, transaction messages 24a″ to production queue manager 62 in disaster recovery server 56. Production queue manager 62 may then operate (i.e., completes the transaction), at step S3f, via recovery system application 63 on transaction message 24a″. The processed transaction message 24e may have associated data. As a result, production queue manager 62 may store, ° step S3g, transaction message 24″, along with associated original/processed data, in database 64a.
Once the synchronous write of transaction message 24a′ to remote recovery site 36 has been completed, transaction message 24a′ may be further processed by message routing server 32 at primary site 26. Receiving queue manager 38 of message routing server 32 transmits, at step S4a, transaction message 24a′ to production queue manager 42 in production server 34. Production queue manager 42 operates (i.e., completes the transaction), at step S4b, on transaction message 24a′ via production system application 43. The processed transaction message 24a′ may have associated data. As a result, production queue manager 42 may store, at step S4c, transaction message 24′, along with associated original/processed data and acknowledgement message 44a, in database 48a.
Once data has been written to database 48a, receiving queue manager 38 may then send, at step S5, acknowledgement message 44a back to sending queue manager 20 associated with client computer 16 at remote client site 12 over network 14.
It is to be understood that the exemplary embodiments are merely illustrative of the invention and that many variations of the above-described embodiments may be devised by one skilled in the art without departing from the scope of the invention. It is therefore intended that all such variations be included within the scope of the following claims and their equivalents.
Claims
1-14. (canceled)
15. A computer-implemented method for safe storing one or more transaction messages on a handheld mobile device, the method comprising the steps of:
- receiving the transaction messages from the handheld mobile device, over a network, via an asynchronous write, on a programmed computer at a primary site, wherein the one or more transaction messages comprises data;
- creating a copy of the one or more transaction message received on the programmed computer at the primary site;
- transmitting the copy of the one or more transaction messages, over the network, received on the programmed computer at the primary site, via a synchronous write, to a programmed computer at a remote recovery site, wherein the one or more transaction messages received is not operated on at the primary site prior to the copy of the one or more transaction messages being transmitted synchronously to the remote recovery site and wherein the remote recovery site is not colocated with the primary site and the programmed computer at the primary site;
- storing the transmitted copy of the one or more transaction messages received in a database at the remote recovery site, wherein the database is communicatively coupled to the programmed computer at the remote recovery site;
- operating on the one or more transaction messages received at the primary site after the copy of the one or more transaction messages has been stored at the remote recovery site; and
- transmitting an electronic acknowledgement from the primary site to a user interface on the hand-held mobile device after storing the copy of the one or more transaction messages at the remote recovery site, said electronic acknowledgement confirming the asynchronous write.
16. The method of claim 15, wherein the at least one memory component is configured as at least one message queue.
17. The method of claim 15, wherein saving the one or more transaction messages to the at least one memory component at the remote recovery site further comprises executing a synchronous PUT command.
18. The method of claim 15, wherein receiving the one or more transaction messages asynchronously at the primary site further comprises saving the one or more transaction messages to at least one memory component communicatively coupled to the programmed computer at the primary site.
19. The method of claim 18, wherein the at least one memory component is configured as at least one message queue
20. The method of claim 18, wherein saving the one or more transaction messages to the at least one memory component at the primary site further comprises executing an asynchronous PUT command.
21. The method of claim 15, further comprising transmitting an electronic acknowledgement to the remote client computer after operating on the one or more transaction messages at the primary site.
22. The method of claim 15, wherein the one or more transaction messages comprises at least one message.
23. The method of claim 15, wherein the programmed computer at the primary site and the programmed computer at the remote recovery site are not colocated.
24. The method of claim 15, wherein the programmed computer at the primary site is communicatively coupled to the programmed computer at the remote recovery site by a data network.
25. A computer-based system for safe storing one or more transaction messages on a handheld mobile device, the system:
- a programmed computer at a primary site for receiving the one or more transaction messages from the handheld mobile device over a network via an asynchronous write wherein the programmed computer creates a copy of the one or more transaction messages received on the programmed computer at the primary site and wherein the one or more transaction messages comprises data;
- a transmission engine for transmitting the copy of the one or more transaction messages received on the programmed computer at the primary site, via a synchronous write, to a programmed computer at a remote recovery site, wherein the one or more transaction messages received is not operated on at the primary site prior to the copy of the one or more transaction messages being transmitted synchronously to the remote recovery site and wherein the remote recovery site is not colocated with the primary site and the programmed computer at the primary site;
- a storage engine for storing the transmitted copy of the one or more transaction messages received in a database at the remote recovery site, wherein the database is communicatively coupled to the programmed computer at the remote recovery site;
- an operation engine for operating on the data received at the primary site after the copy of the one or more transaction messages has been stored at the remote recovery site; and
- an acknowledgement transmission engine for transmitting an electronic acknowledgement from the primary site to a user interface on the hand-held mobile device after storing the copy of the one or more transaction messages at the remote recovery site, said electronic acknowledgement confirming the asynchronous write.
26. The system of claim 25, wherein the at least one memory component is configured as at least one message queue.
27. The system of claim 25, wherein saving the one or more transaction messages to the at least one memory component at the remote recovery site further comprises executing a synchronous PUT command.
28. The system of claim 25, wherein receiving the one or more transaction messages asynchronously at the primary site further comprises saving the one or more transaction messages to at least one memory component communicatively coupled to the programmed computer at the primary site.
29. The system of claim 28, wherein the at least one memory component is configured as at least one message queue.
30. The system of claim 28, wherein saving the one or more transaction messages to the at least one memory component at the primary site further comprises executing an asynchronous PUT command.
31. The system of claim 25, wherein the acknowledgement transmission engine transmits an electronic acknowledgement to the remote client computer after operating on the one or more transaction messages at the primary site.
32. The system of claim 25, wherein the one or more transaction messages comprises at least one message.
33. The system of claim 25, wherein the programmed computer at the primary site and the programmed computer at the remote recovery site are not colocated.
34. The system of claim 25, wherein the programmed computer at the primary site is communicatively coupled to the programmed computer at the remote recovery site by a data network.
Type: Application
Filed: Feb 9, 2016
Publication Date: Jun 2, 2016
Patent Grant number: 9971654
Inventors: Mitchell Layne (Lutz, FL), Herb Stranzl (New York, NY), Thomas G. Allaire (Seaford, NY), Erol Raymond (New Providence, NJ)
Application Number: 15/018,964