SYSTEM AND METHOD FOR PROVIDING AN ACTIVELY INVALIDATED CLIENT-SIDE NETWORK RESOURCE CACHE
A system and method for providing an actively invalidated client-side network resource cache are disclosed. A particular embodiment includes: a client configured to request, for a client application, data associated with an identifier from a server; the server configured to provide the data associated with the identifier, the data being subject to subsequent change, the server being further configured to establish a queue associated with the identifier at a scalable message queuing system, the scalable message queuing system including a plurality of gateway nodes configured to receive connections from client systems over a network, a plurality of queue nodes containing subscription information about queue subscribers, and a consistent hash table mapping a queue identifier requested on a gateway node to a corresponding queue node for the requested queue identifier; the client being further configured to subscribe to the queue at the scalable message queuing system to receive invalidation information associated with the data; the server being further configured to signal the queue of an invalidation event associated with the data; the scalable message queuing system being configured to convey information indicative of the invalidation event to the client; and the client being further configured to re-request the data associated with the identifier from the server upon receipt of the information indicative of the invalidation event from the scalable message queuing system.
This is a continuation U.S. patent application claiming priority to U.S. patent application Ser. No. 14/292,926, filed Jun. 1, 2014; which is a continuation-in-part patent application claiming priority to U.S. patent application Ser. No. 13/019,505; filed Feb. 2, 2011 by the same assignee as the present application. This present patent application draws priority from the referenced patent applications. The entire disclosure of the referenced patent applications is considered part of the disclosure of the present application and is hereby incorporated by reference herein in its entirety.
TECHNICAL FIELDThis application relates to a system and method for use with networked entities, according to one embodiment, and more specifically, to a system and method for providing an actively invalidated client-side network resource cache.
COPYRIGHTA portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever. The following notice applies to the software and data as described below and in the drawings that form a part of this document: Copyright 2009-2017 IMVU Corporation, All Rights Reserved.
BACKGROUNDThe use of chat or instant messaging communications in a networked computer environment is well known. The America Online™ Instant Messaging system (AIM) is one well known system for controlling instant messaging in a networked environment. In these prior art systems, two computer users can communicate non-persistent text messages in real time using an instant message (IM) client on their computers in concert with an IM server.
Most messaging services are subscription-based or user-identity-based and may generate large numbers of content followers or users of particular message or content sources (denoted herein as subscribers). These content followers or subscribers can form communities or social networks around a particular content source or content distribution system. Social networks have gained in popularity as people have used messaging as a basis for connecting with each other.
As the numbers and size of the user pool, subscribers, and social networks expand, it becomes more difficult to track and manage the subscribers, the listening users, and the degree to which the users are involved with the message content. Similarly, it becomes more difficult to identify and rank the most popular content items being consumed across a variety of content sources and social networks.
In current practice, content is typically delivered from web servers to web clients using HTTP (Hypertext Transfer protocol). The delivered content stored on a permanent medium of the web server can be transferred via a network to the web client through a sequence of caches. Data caching at multiple layers is necessary to reduce the network resources needed to transfer the content from the web server to the web client. The layered data caches work well for static content. However, current multi-layer data caching systems are not efficient when the data content is subject to frequent changes or updates.
The various embodiments is illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings in which:
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the various embodiments. It will be evident, however, to one of ordinary skill in the art that the various embodiments may be practiced without these specific details.
Description of Various Example EmbodimentsThe Message Queue system of a particular embodiment is a stand-alone software stack, which implements a scalable, efficient, secure, flexible and easily understood message and state update switching and routing fabric. The Message Queue system is a robust, scalable message queue system to which clients connect that routes messages from the site to clients. The Message Queue system can also be considered a light-weight message queue system and state management protocol optimized for real-time interactive multi-player systems.
Topologically, the Client Gateways 12 of various embodiments are the interface to the users at client systems 20.
Gateways 12 can perform a muxing/demuxing function to vector user requests onto a pool of message queue nodes 14, where actual message queues are processed. Clients 20 can get subscribed to inspect (e.g., obtain access to) the messages coming across specific, named message queues. Additionally, clients 20 can get subscribed to send messages to specific, named message queues.
A supervisor 16 is the authoritative source of the node map, as well as the central point where system-wide statistics are aggregated for easy monitoring. However, the supervisor 16 is not involved in any real-time message flow, and thus the entire system can serve in a fully operational state even if the supervisor node is temporarily down. However, the node map cannot be re-configured without participation by the supervisor.
In a particular embodiment, all machines run the same code source tree; but, the command line to start up each node determines what kind of node it is. Each node will register itself with the supervisor 16; thus, the supervisor 16 is a convenient place to determine which nodes are effectively part of the Message Queue system 10. In a particular embodiment, each “node” can be a typical 8-core, 8 GB RAM, compute-only (not database) server. In a particular system, one supervisor node (“boss”) 16 is provided. If this supervisor node 16 goes down, instant replacement is not necessary; but, the ability to monitor the system will be degraded. In a particular embodiment, between five and ten client gateway nodes can be provided. A load balancer 22 can be provided to spread client requests across the available client gateway nodes. In a particular system, between five and ten message queue nodes can be provided. These message queue nodes or queue nodes can mostly message between themselves. In a typical system, internal traffic will be fairly low. In a typical embodiment, the entire queue will likely use less than a gigabit of bandwidth in aggregate at full load, until features and user counts swell beyond this point. A system-level metric for network packets and bandwidth in/out can also be provided.
The message queue nodes 14 can also do some messaging into web machines, mainly for authorization purposes. In a particular embodiment, this traffic uses Hypertext Transport Protocol (HTTP) with JavaScript Object Notation (JSON). The message queue nodes can either use an existing web pool or create a new pool specifically for the Message Queue system as described herein. In general, a “pool” of servers is a set of servers that respond to a specific Domain Name Server (DNS) address, using front-end load balancing, in the terminology used herein. The server can get to the set of gateways to send messages to users, in a load-balanced way. In a particular embodiment, this traffic can use HTTP with JSON. In a particular embodiment, the software can be written on top of Erlang/OTP R13B04, which can be downloaded and built from source at http://www.erlang.org/
Each node has a Uniform Resource Locator (URL) that can output statistics in Nagios-compatible “key=value” text format. Thus, application-level monitoring scripts can be written by simply hitting that URL on each node. Node software can be set up to start in/etc/init.d, and can have the correct working directory and command line options specified.
The following sections describe a functionality level implementation of the Message Queue system processes of an example embodiment. The description includes an explanation of how the Message Queue system of an example embodiment is decomposed into software executables and processes. The Message Queue system of an example embodiment includes three kinds of processes: 1) a plurality of Gateway 12 processes, 2) a plurality of Queue Node 14 processes, and 3) a singular Supervisor 16 process. Additionally as shown in
Glossary of Terms
-
- Message Queue System
- The entire system—consisting of client libraries, gateways, queue nodes and supervisor, as well as ancillary systems like translators and web services called by the core Message Queue System.
- Message Queue (there can be on the order of 1,000,000 of these)
- A single unit of subscription and access control in the Message Queue System, containing a list of subscribers (clients interested in listening on traffic through the queue) and a list of mounts. Message Queues are not typically disk persistent.
- Message Stream (there can be on the order of 1,000,000 of these)
- A single addressable endpoint in the Message Queue System, through which messages are broadcast to all subscribers of the queue. A message stream is identified as a named Mount within a message queue.
- State Stream (there can be on the order of 1,000,000 of these)
- A single addressable endpoint in the Message Queue System, through which state updates flow. A State Stream is typically not disk persistent, but supports eventual consistency of key/value state, as well as verification of state change requests through call-out to state managers. Addressed as a named mount within a Message Queue. May have zero or more state managers.
- Mount
- A specific named stream within a message queue—a Message Stream or a State Stream. As used herein, the term “mount” can be considered a synonym of the term “stream”.
- Client (there can be on the order of 1,000,000 of these)
- A participant in the game/simulation. The client is not trusted to make any authoritative decisions about creation/deletion of shared state, unless it has special authorization.
- Gateway (there can be on the order of 1,000,000 of these)
- A process that accepts connections from clients, and routes them to appropriate endpoint message queue nodes. Additionally, the gateway performs authentication and Message Queue SystemT->Erlang serialization.
- Queue Node (there can be on the order of 10 of these)
- A machine hosting a process that routes messages and state updates through named message queues to subscribed clients (via gateways).
- Gateway Node (there can be on the order of 10 of these)
- A machine hosting Gateway and Persist processes.
- Supervisor (there is 1 of these)
- A machine and process that owns authoritative state about the configuration of the message queue infrastructure.
- Persist (there can be on the order of 1,000,000 of these)
- A process that acts as a “new” client from the point of view of a Gateway, but acts as a REST-style (Representational State Transfer-style) server from the point of view of a Translator. Associated with a specific client identifier (Cid) for a specific logged-in client. May in fact be a Gateway with specific Filter plug-ins.
- Translator (there can be on the order of 1,000,000 of these)
- A process that acts as a HTTP server on one end (for perlbal to talk to) and serializes to Erlang on the other. The Translator dispatches to the appropriate Persist for the incoming request based on Cid. Used by old clients to talk to the new system.
- Translator node (there can be on the order of 10 of these)
- A machine hosting Translator processes.
- State Manager
- A Plug-in that can check and evolve state update requests at the behest of queue nodes. Receives “old state” and “requested changes” as input, and outputs “new state.”
- Call-out
- A process which responds to specific requests from Message Queue System according to a well-defined protocol, typically HTTP JSON services. One form of State Manager uses a Call-out. Gateway uses Call-out for authentication verification and connection drop notification.
- Services (there can be on the order of 100 of these)
- Processes that, through means outside this system, have authority to request creation, re-configuration and deletion of message queues.
- State Plug-in
- An Erlang module used by a State Stream mount to enforce state update consistency. One example is the PHP call-out plug-in.
- Filter Plug-in
- An Erlang module used by Gateways to filter messages going to and from clients. Can be used to translate messages using different formats into a uniform format for use by the rest of the message system or surrounding integrated system.
- Message Queue System
The gateway 12 processes maintain persistent connections with clients as well as with persistent server processes. Creating the connection establishes authentication, which persists for the duration of the connection. Other than authentication and binary protocol decode, the gateway process is relatively simple. The gateway process can map a queue name (id) to a target queue node and forward messages to the target queue node, as well as return responses to the appropriate connected clients.
In a particular embodiment, queue nodes 14 operate message queues.
The supervisor 16 process can manage all nodes in the Message Queue system. The supervisor process also manages the queue name to message queue node mapping table. In a particular embodiment, this mapping is maintained as a traditional “circle hash” (or, alternatively, as buckets mapped to nodes, allowing a 1-to-N re-mapping of a new node added to a set of N existing nodes. Additionally, the supervisor process collects statistics from the gateway nodes and queue nodes on performance and load of the system, and provides an aggregate view of system performance through a management interface. The supervisor process is typically not visible from outside the cluster network, but instead is fully accessed through web management services. The Supervisor can tell gateways about updates to the node mapping table.
In a particular embodiment, there is only one supervisor process. However, the supervisor process is not a single point of failure, in that the configured network of gateways and queue nodes can continue operating even if the supervisor process is temporarily inactive. In this case, the Message Queue system just won't be able to report statistics or re-configure the node mapping table. The supervisor process can, in a particular embodiment, simply serve as collector of statistics. The main functionality lost from this simplification would be the ability to add capacity or redistribute load from downed nodes without interruption of existing client connections.
Translator ProcessesOlder clients, not updated to communicate with the Message Queue gateways directly, or unable to do so because they are behind restrictive HTTP-only proxy firewalls, can continue to make XMLRPC calls to existing chat scripts. XMLRPC is a conventional remote procedure call (RPC) protocol, which uses Extensible Mark-up Language (XML) to encode its calls and HTTP as a transport mechanism. Those scripts may need to translate and forward messages to the Message Queue gateways. Because the Message Queue gateways use a persistent connection model, and the XMLRPC web servers typically use a Representational State Transfer (REST)-like polling model, something needs to translate between the two models.
The translator 17 process of a particular embodiment is a persistent, stateful process. The translator process establishes connections to the message queue gateways, one per client using the XMLRPC Application Programming Interface (API). The translator process then receives JSON requests from XMLRPC and translates them to Message Queue system messages. Additionally, the translator process buffers messages and state updates from Message Queue system queues and makes them available as JSON data to poll at the convenience of the XMLRPC system. If no messages have been polled for a pre-determined time (e.g., five minutes), the client is assumed to have disconnected, and the persistent connection is torn down.
A slight implementation variation would be to use XML instead of JSON for the translator inputs, making the work for the existing scripts easier. A more substantial implementation variation would be to entirely emulate the behavior of chat scripts to significantly reduce or eliminate the need for chat web servers. Mapping from an XMLRPC (chatweb) server to a translator process can be done using a simple mod operation on the customer id (cid). Alternatively, the mapping can be done statically based on the chatweb instance or using any other consistent mapping function, such as a circle hash or other consistent hashing mechanism.
State Manager ServicesA queue state stream that receives a request to introduce, update, or delete a specific state (property value) can call aside to a state manager service 18 to enforce rules for state updates. The queue state stream can bundle up current state into a JSON formatted data block, as well as the requested state change, and post a request to a state manager web service. The state manager web service 18 then returns one of three results: a) ok: apply change as requested, b) denied: make no change, or c) altered: apply changes as returned by the state manager. State managers can run a stateless web service on a main web server. If load from state managers needs to be managed separately, this can easily be shifted to a separate pool of web front-ends as needed. The state manager in effect for a queue state stream is configured as a parameter when the queue is created. There can be any number of state managers in effect for a given queue. A given room, object, or service may use multiple queue state streams to implement different sets of state as needed. A state manager can listen to multiple mounted streams to make determinations based on multiple sets of state. For example, a rule that requires information about per-room avatar state (who is “king”) and information about per-room state (which node is “the throne”) can be mounted to see both kinds of information to be able to enforce the rule that only kings can sit on thrones.
Gateway NodesGateway nodes can immediately start serving client requests when they have connected to the supervisor and received the queue namespace map. All gateway nodes need is to have the hardware load balancer start pointing clients at them. In a particular embodiment, a design goal is for 20,000 users to connect to a single server (8 cores, 24 GB/s memory bus). For clients that drop and re-establish TCP connections (e.g., because of intermittent WiFi hotspots), it is likely that a new gateway will be chosen for the new connection. To mitigate this, subscription information for a client is kept in a state stream specific to the client and managed by the gateway. When the client re-connects to the gateway, the client will have re-delivered all the subscriptions of which the client is currently partaking. This queue will be automatically removed after some amount of time-out (e.g., 60 seconds). If no client subscribes to the queue within the time-out period, the queue “hard” disconnects to not keep system resources allocated needlessly. To provide immediate sign-off notification to buddies, a controlled (user-initiated) sign-off will result in a message through the gateway, and a state update in the buddy state channel, before the gateway is disconnected.
Internally, the Gateway of a particular embodiment is based on gen_server (specifically, a TCP subclass for binary, and a HTTP server subclass for HTTP interface). Each of the acceptors starts a new Erlang process for each incoming request. For long-running connections (binary protocol for clients), this process builds up routing state specific to the user. Messages within the system are sent as Erlang tuples, typically constructed as Erlang structs.
Additionally, there is one “node dispatcher” process or “mapped forwarder” process per gateway node. This process translates a message queue name to a corresponding queue node and forwards outgoing packets to the queue node and receives packets from the queue node to dispatch to subscribing connected users. In a particular embodiment, client connections come in typically through port 443. Each accepted connection spawns a client framing handler, which decodes the protocol buffer protocol and turns requests into Erlang tuples (structs). Most messages go on to the queue map, which translates destination queue name to a given physical queue node and dispatches to the communication process for that queue node. There is at least one such process per node in each gateway node and used by gateway processes on that node. In a particular embodiment, responses from queues to subscribers can bypass this “node dispatcher” process; these responses can go straight to each subscribing gateway process. If the message “node dispatcher” process turns out to be a bottleneck process, we can parallelize this process, and round-robin between N message queue node dispatcher processes when we configure connection handling processes. The point of making this a separate process is that it makes dynamic re-mapping of the namespace much easier to implement.
The supervisor management interface is initially limited to pushing statistics to the supervisor. In a particular embodiment, these statistics can include:
-
- # messages routed per second<--so we can track workloads
- # clients connected<--so we can verify load balancing and detect outages
- # connected queue nodes<--so we can find out if there are connectivity problems
- # queues subscribed per client<--for statistical and planning purposes
- minimum, average and maximum latency for turn-around requests (requests with op_id codes)<--to track performance against a particular target (e.g., a 100 ms target)
- the slowest message round-trips from the last time period (e.g., hour)<--to track troublesome requests
- # web requests per second<--to track cluster load on the Message Queue system
- minimum, average and maximum latency for web requests<--to track performance through the system
- the longest-running web requests from the last time period (e.g., hour)<--to track troublesome requests
- total Erlang process memory sizes
- largest Erlang processes, size and type
- Erlang exception counts per process kind
- software versions
As part of the implementation of online queue node insertion/movement/removal as described in more detail below, the management interface can also partake in the queue map update commit cycle. The HTTP interface accepts JSON requests and returns JSON data. The HTTP interface spawns one Erlang process per HTTP connection. In an alternative embodiment, we can choose to move to a pool of request handlers. The interface translates JSON to Erlang structs, and back again from response structs to JSON. We extend JSON to support \xXX for hexadecimal, and use that encoding for characters less than 32 or greater than 126. An alternative is to send the characters as-is (expected by Unicode-style JSON) or as \uXXXX (expected by ASCII JSON)
Queue NodesMessage queue nodes cannot start serving queue requests until they have been assigned a namespace area from the queue node map, which is managed by the supervisor. In a particular embodiment, each queue is represented by a single Erlang process. This means that all mounted name spaces in the queue (e.g., message streams and state streams denoted “property bags”) are serialized within the queue. Call-outs to web services are serialized as well for streams that require call-out; messages posted on message streams may not need call-outs and thus will re-order compared to state change requests needing external verification. Queues can also include in-process filters or state managers, loaded as plug-ins into the Erlang process, and run as part of the queue process itself (again, for serialization purposes).
In a particular embodiment, the design goal is for up to 100,000 queues to live on a single server (8 GB, 8 cores, 24 GB/s memory bus). This allows up to 80 kB of state per queue. If memory restrictions become a problem because of per-queue overhead, we can split onto more queue nodes, rather than increase RAM in a single box, so as to keep system-wide latency low. In a particular embodiment, the design goal is for a single message to flow through the system (from incoming gateway via queue nodes out to listening clients) within 100 milliseconds. Note that a server with 24 GB/s memory bus and 8 GB of RAM would have to spend 350 milliseconds to work through an 8 GB working set.
Mapping of a named queue to a queue node is done through an MD5 hash of the queue name, followed by a map of the top N bits (e.g., 10 bits) of the MD5 hash to a sequence of buckets allocated to node servers. MD5 hashing is well known to those of ordinary skill in the art. Initially, many buckets will be allocated to each participating queue node. As more queue nodes come online, buckets will be fairly removed from allocated nodes and added to the new node to retain stochastically balanced 1/N load mapping. The supervisor holds the master map of hash-to-node mappings. In a particular embodiment, we can enforce a minimum number of buckets per node, to even the load. For example, if we allowed one node to be mapped by two buckets, and one node to be mapped by three buckets, the difference in load assuming otherwise homogenous load distribution would be 50% more on the second node than on the first node. In a particular embodiment, a policy of a minimum number of buckets per node can be enforced (e.g., eight buckets minimum per node). Once all nodes have eight buckets, for example, we can double the number of buckets per node without changing the mapped address space. In this manner, each bucket turns into two new buckets for a total of 16 buckets per node. Then, we can start fairly redistributing again without fear of load imbalance. The minimum number of buckets used per node determines the maximum load imbalance assuming otherwise homogenous loading (no “hot keys”). For a minimum of eight buckets per node, the worst-case ratio is 8:9, or 12.5% more load on the second host. There is a cost in increasing the minimum number of buckets; because, the node map is “hot data” for the queue node mapper processes. Thus, the node map should ideally fit well into an L1 cache.
Internally, each message queue is an Erlang process. Additionally, there is a receiving process for incoming messages from each gateway.
The message queue nodes receive requests from gateways. These requests go to the “central input” process for the physical node, which in turn dispatches to the appropriate message queue process. Message queues are Erlang processes—one per message queue. Within each message queue, different named “handlers” are “mounted.” These implement specific behaviors, such as message forwarding or state storage. Handlers take the form of Erlang modules, referenced by name. The state storage handler additionally supports plug-in state managers, again written as Erlang modules. It will be apparent to those of ordinary skill in the art that modules of a type other than Erlang modules can be similarly used. One state manager handler plug-in is the PHP call-out handler. This means that plug-ins must be allowed configuration data, such as what URL to call out to, in this case. Each queue contains the list of subscribed users, sorted as a list of subscribed gateway nodes, with a list of users per gateway entry, allowing reference counting for the output nodes, as well as generating presence information.
We design for hot code re-load in most cases. This is illustrated as the “loop” function calling itself tail-recursively explicitly through the module name. Certain data structure updates will require rolling restarts instead. Tests can be used to find most of these cases, and we can detect them through exception counters similar to the web push monitor on staged deployment. The supervisor currently is the recipient of runtime metrics. In a particular embodiment, these metrics can include:
-
- # routed messages per time period
- # message queues
- # mounts per message queue
- average state storage memory per queue
- total memory used by queue processes
- largest processes, sizes and types
- Erlang exceptions, counts and process types
- software versions
The supervisor 16 can also make the node receiver process partake in queue migration. Queue migration means that the node will direct message queues to serialize and move to a new host node, after which messages intended for the moved message queues will be forwarded to the target node. This will continue until all gateways have committed the new message dispatch map, at which time any knowledge about the migrated-out queues can be purged. This process can be denoted a hot add node processing.
The message queue is responsible for the re-sending of missed messages if a client disconnects and re-connects. To support this, the message queue can number each outgoing message generated by mounts. When a client connects, the client can provide a serial number for the “last received message.” If this serial number matches a message that's still remembered by the queue (or the serial number prior to the oldest remembered message), then messages after that will be re-delivered, and the client will be assumed to be up to date. If the serial number is 0, or if the serial number falls before the remembered range of messages, then the connection will be treated as “new” and each mount will be called upon to deliver “new state,” which for message streams does nothing, and for state streams delivers a snapshot of all the state.
Supervisor NodeThe supervisor will be addressed using a system-wide registered name (e.g., “supervisor”) in the Erlang runtime. The supervisor is started as supervisor using special command-line arguments. Message queue nodes and gateway nodes that come online will register themselves with the supervisor, using the type described by command-line parameters to the node's executing process. The supervisor will aggregate statistics from the different nodes, and provide a comprehensive management overview of metrics within the system. When inserting a new message queue node into the system, the supervisor will first tell all queue nodes about the new map, and have them forward queue state (as well as incoming traffic for the target queues) to the new node. Then, the supervisor will distribute the new map to all gateways, so that gateways will know to send incoming traffic to the appropriate new node. Finally, all nodes will be told that the new node has “committed,” and the old nodes can remove any state related to the now moved message queues.
ClientThe client is updated to connect to the Message Queue system (e.g., by a Domain Name Server—DNS name) for chat message based communications. XMLRPC calls, such as checkForMessages( ) are re-vectored to be driven by traffic from the Message Queue gateway. There needs to be only a single connection between the gateway and the client. A user is identified to the gateway through a hash-signed cookie, containing an expiry time, a user id and a hash signature. This cookie is issued by the web system when the client first logs in (until and unless login happens entirely through the gateway). To avoid cookie theft, there is a three-way handshake, where the gateway issues a cryptographically random challenge to the client, and the client signs this with the user's password and returns to the gateway. The gateway then verifies that the signature correlates with the signature obtained through signing the challenge locally. This requires the user's password to be held server-side. To counter this, in one embodiment, the user signs the challenge with a hash of the password, and a password is stored server-side, which means that the hash of the password is the new “password,” but avoids plaintext password leakage should we have a system intrusion event.
SecurityThe system of various embodiments is designed to avoid user impersonation attacks. The system is also designed to mitigate identity theft attacks, and to reduce the cost of authentication checking to the set-up handshake phase. As long as services use the established identity (e.g., customer id) for any source-dependent operations, the system will be secure. Mal-formed services that pass plaintext ids to services cannot be guarded against at this level; but instead have to be mitigated by proper API design and separate service auditing.
Client/Server Integration in an Example EmbodimentAll creation, subscription, and un-subscription to queues happen on the server side, as a side effect of some XMLRPC or other API call. For instance, as part of user login processing, a login process can create a user's system chat and buddy-state queues and subscribe the user to these chat and state queues. When subscribing to a queue, three flags can be specified:
-
- Whether you should create the queue if it doesn't already exist (i.e. true when subscribing to your buddystate, false when subscribing to your friend's buddystate)
- Whether you are interested in knowing of the other subscribers for the queue
- Whether you are a “keep-alive” participant; whether your presence should control the queue's existence (again, true for your buddystate, false for a friend's)
Successful subscription generates a “queue subscribed” message on the network to the client, which is how a client session learns of a queue subscription that happened out in an XMLRPC call. If no direct TCP connection has been established for that client, the gateway remembers the queue subscription so that when the client does connect, all subscription messages can be then sent down. This handles the case of queue subscriptions that occur during user login, which happens before the client has connected to a gateway via TCP. Similarly, un-subscriptions will send a “queue unsubscribed” message to the client.
Client Processing in an Example EmbodimentInitially, a service is registered at the client to handle the client-side processing associated with the Message Queue system described herein. In a particular embodiment, a ServiceProvider module on the client can register a new service, MQManager, for handling the client-side processing associated with the Message Queue system. After client login, the MQManager service can have the necessary data to start an authentication and connection process on the client. See below for more detail on the login process in a particular embodiment.
Objects interested in queues can register as a listener for that queue, by name, with MQManager, and provide a message handler callback. Only one object is responsible for authoritative handling and consumption of a given queue, so only one object is allowed to listen. If MQManager already has a listener waiting for queue named X and another listener attempts to register to listen, the MQManager will raise an exception. If more objects need to know of messages for that queue, the listening object's message handler can communicate events to other listeners.
MQManager can store callbacks for each queue name and call those callbacks as the MQManager receives messages. The MQManager decodes the messages first, so the listener's message handler receives a message object and not a bit string. The queue subscribed message itself will be sent along to the listener's message handler, as the message may contain initial state or initial participant data. Also, the listening object may be interested in the point of subscription. Similarly, unsubscribe messages can be sent to the message handler. Note that when an unsubscribe message is received, we do not immediately remove the listener from the list. Instead, we want the listener to automatically detect if another subscribe request for the same queue happens, without the listening object having to subscribe again. When MQManager receives a message and calls the listener's callback with the decoded message object, MQManager can also pass the queue name, for objects using one callback to listen to multiple queues. MQManager can filter out messages that are marked as originating from this user, so chat sessions don't have to deal with echo messages.
If a subscription message received by MQManager doesn't have an object listening to that queue, MQManager backlogs the subscription message by queue name. Subsequent messages that come in for that queue will be backlogged in the same spot. If an object attempts to listen to that queue name at some later point, the object can immediately have this backlogged batch of messages sent to the object. This allows objects to listen for subscriptions if they don't immediately know of the queue name. For example, a user creates a chat, but doesn't know that the new chat's queue name is “chat/49723942/messages” until a call returns with the newly-created chatId 49723942. When backlogging a message, if the oldest message in the backlog for that queue is older than a time period (e.g., 5 minutes), we can log an error, discard that queue's backlog, and stop backlogging subsequent messages for that queue.
The message sending component of the MQManager (e.g., sendMessage) can take an optional “expectAsyncResult” flag. Messages sent with this flag can have an op_id generated by MQManager to be appended to the message and returned by sendMessage. Messages sent with “expectAsyncResult” are state messages that expect a pass/fail response; a message sent with an op_id can asynchronously receive a response from the network specifying to which op_id the message relates, and a pass/fail result. What the calling object does with the op_id created by sendMessage, and the subsequent result message sent to its handler, is entirely the responsibility of the calling object. See the chat disclosure below for an example of usage.
MQManager can also handle the sending of a keep-alive ping over TCP every time period (e.g., 20 seconds), if there is no other outbound traffic. MQManager can also handle the receiving of a keep-alive ping every time period (e.g., 20 seconds) from the gateway if there is no other inbound traffic. If the expected keep-alive isn't present or the connection is otherwise unexpectedly lost, MQManager can reconnect transparently, queuing up messages to be sent during the non-connected state, including the message that may have been pending during a socket error. These queued up messages can be sent out once connection is reestablished. The gateway can maintain the subscriptions for the client automatically, if the reconnect happens quickly enough. If a connection cannot be reestablished within a reasonable time frame (as the gateway will consider this client's extended absence of connection a timeout and will kill existing subscriptions), the client should behave like it normally does for an extended network outage, requiring the user to sign in again. Connection lapse is the only thing that will cause an outgoing message to be queued up for later delivery; any failure in sendMessage in the gateway or beyond (e.g., sending to non-existent queue, sending to queue that one isn't subscribed to or can't write to) will silently fail, unless an op_id was specified (e.g., a “expectAsyncResult” was passed to MQManager.sendMessage). In this case, an error result will come back.
Login Processing in an Example EmbodimentThough the MQManager is created when it is registered with serviceProvider, MQManager does not begin to establish a TCP connection until after user login is finished. After connection occurs, the authentication process is initiated. In a particular embodiment, authentication is a multi-step process including the following steps:
-
- Client sends a cookie, provided at login, to identify itself
- Gateway responds with random challenge
- Client signs challenge with hash of password and sends signed challenge
- Gateway signs its challenge locally with the hash of the password and verifies this matches what the client just served, and indicates pass/fail to the client
If authentication fails, the user is asked to log in again, in a manner similar to a login failure. As part of the login process, a login component creates and subscribes the user to a system-chat queue and a buddy-state queue. For both the system-chat queue and the buddy-state queue, the subscriptions are marked as not interested in other subscribers, and that the user is a keep-alive participant. Additionally, the login component loops over all of the user's friends, fans, recent chats, anyone who would appear in the user's friends mode, to subscribe the user to their queues and all of them to the user's queue. Because the subscriptions are marked as non-creating, nothing will happen for offline friends with no queues of their own or no way to subscribe to the user's queue. The subscriptions are also all marked as non-keep-alive.
System Chat Processing in an Example EmbodimentIn a particular embodiment, there is a manager object, which listens for systemchat subscription messages. Upon receiving systemchat messages, the manager object echoes the systemchat message content on an event bus, for all clients to consume. The manager object does not echo the subscription and un-subscription messages. In one embodiment, system chat events come from the server as a string token with a JSON blob of arbitrary payload.
Friends Processing in an Example EmbodimentShortly after login on the client, a BuddyState object is activated. The BuddyState object is the main buddy manager object that manages the user's buddies. The BuddyState object can immediately fetch a list of the user's friends (buddies) shortly after login. For each buddy in the list, the BuddyState object can listen to the buddyState queue for that friend (e.g., using one single callback for all of the buddies). A subscription message for a particular queue means that a user's friend is online, and an un-subscription message for that particular queue means that a user's friend is offline. Therefore, at BuddyState object initialization time, all friends are offline until proven otherwise. Other messages over those queues can notify the user of their actual buddy state (e.g., Do Not Disturb-DND, adults only, away, etc.). Updates to online status and actual buddy state can be delivered to the various parts of the client using the BuddyState object's event system. The BuddyState object is also responsible for handling the user's buddyState queue, and sending messages to the user's buddyState queue (e.g., I went DND, I went available, I am signing off, etc.). When a user signs off, a “signing off” message is sent on their buddyState queue, so the BuddyState object can interpret either a signoff message or an un-subscription for some friend as the user going offline. When a user vanishes without the signing off message, their gateway will eventually time them out and unsubscribe them from all of their queues. Because that user was the keep-alive participant for their own buddy queue, that buddy queue will be torn down and all other participants will be unsubscribed, which is how other clients will learn of the user's timeout. Note that because listeners are kept around in MQManager even after an un-subscription occurs, the BuddyState object doesn't have to listen again when a friend goes offline and an un-subscription is received from the network. A user learns of new friends coming online; because, part of that friend's login processing is to subscribe to all of their friends' buddyState queue. For buddy list changes (e.g., adding friends, removing friends, etc.), the BuddyState object can listen to subscriptions for the new buddy, and subscribe or un-subscribe the users from each others' buddyState queues. Similarly, for new recent chats, the BuddyState object can listen to the subscription for the new recent chat, and can create the appropriate subscriptions between the recent chatting users.
Logout Processing in an Example EmbodimentA TCP connection that is cleanly disconnected will cause the gateway to unsubscribe the logged out user from all queues, which, depending on the user's keep-alive status for those queues, will tear down some queues entirely. In the event of an unclean shutdown, the user's gateway will time out the user and perform the same steps. Additionally, in case of a gateway crashing, the existence of a queue stops if there have been no “strong” subscribed users for a period of time-out.
Chat Creation Processing in an Example EmbodimentWhen a user initiates a chat session at a client, a client chat module or the MQManager can create a chat session identifier (chatId). The MQManager can then create and subscribe the user to a message queue (messageQueue) and a state queue (stateQueue) for the chat. The message queue is marked as interested in participants, and for both the message queue and the state queue, the user is not marked as a keep-alive user. Initial and ongoing participant information is communicated in either direction to the client's session objects through the messageQueue, as well as actual messages and other chatstream messages, such as animations. Initial and ongoing seat state is communicated in either direction to the chat session through the stateQueue. Seat assignment, being a state queue message, can expect an asynchronous response. Therefore, the chat session, when sending seat messages, can set the “expectAsyncResponse” flag to true, store the op_id returned from sendMessage, and expect a result message to come to its message listener function at some point in the future that has the result for that op_id. For example, we can move the seat locally, send the message and record the seat move and op_id in the chat session somewhere (e.g., an object containing op_id: (cid, old_seat, new_seat). When a result message is received from the network for that given op_id, we can either bump our entry out of the object containing op_id (if the seat assignment passed) or undo the seat move locally (if the seat assignment failed). Note that chatIds may only exist as a means to name and uniquely identify queues for a particular chat. The existence of a queue for some chat is what really represents the existence of that chat.
Chat Join Processing in an Example EmbodimentWhen a user joins a chat session at a client, a client chat module or the MQManager can subscribe the user to the messageQueue and stateQueue for the chatId the user is attempting to join, marking the subscriptions as non-keep-alive and non-creating. In the event that the queues don't exist, we can increment an error counter. The client chat module or the MQManager can be responsible for unsubscribing the user from the messageQueue and stateQueue for the chatId.
Invite Processing in an Example EmbodimentIn a particular embodiment, a chatgateway.attemptInvite XMLRPC call can be used to create an invite in the database and to send a systemChat notification to the invitee instructing their client to call chatGateway.checkForInvite, retrieve the invite and either accept or give a decline reason. The accept or decline reason can be returned to the invitor as the synchronous return value to their attemptInvite call. In an alternative embodiment, an invitor can send a systemchat invite to the invitee, whose reply is delivered back asynchronously as another systemChat message to the invitor.
Details of a Particular Example EmbodimentReferring to
Networks 120 and 114 are configured to couple one computing device with another computing device. Networks 120 and 114 may be enabled to employ any form of computer readable media for communicating information from one electronic device to another. Network 120 can include the Internet in addition to LAN 114, wide area networks (WANs), direct connections, such as through a universal serial bus (USB) port, other forms of computer-readable media, or any combination thereof. On an interconnected set of LANs, including those based on differing architectures and protocols, a router acts as a link between LANs, enabling messages to be sent between computing devices. Also, communication links within LANs typically include twisted wire pair or coaxial cable, while communication links between networks may utilize analog telephone lines, full or fractional dedicated digital lines including T1, T2, T3, and T4, Integrated Services Digital Networks (ISDNs), Digital User Lines (DSLs), wireless links including satellite links, or other communication links known to those of ordinary skill in the art. Furthermore, remote computers and other related electronic devices can be remotely connected to either LANs or WANs via a modem and temporary telephone link.
Networks 120 and 114 may further include any of a variety of wireless sub-networks that may further overlay stand-alone ad-hoc networks, and the like, to provide an infrastructure-oriented connection. Such sub-networks may include mesh networks, Wireless LAN (WLAN) networks, cellular networks, and the like. Networks 120 and 114 may also include an autonomous system of terminals, gateways, routers, and the like connected by wireless radio links or wireless transceivers. These connectors may be configured to move freely and randomly and organize themselves arbitrarily, such that the topology of networks 120 and 114 may change rapidly.
Networks 120 and 114 may further employ a plurality of access technologies including 2nd (2G), 2.5, 3rd (3G), 4th (4G) generation radio access for cellular systems, WLAN, Wireless Router (WR) mesh, and the like. Access technologies such as 2G, 3G, 4G, and future access networks may enable wide area coverage for mobile devices, such as one or more of client devices 141, with various degrees of mobility. For example, networks 120 and 114 may enable a radio connection through a radio network access such as Global System for Mobile communication (GSM), General Packet Radio Services (GPRS), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (WCDMA), CDMA2000, and the like. Networks 120 and 114 may also be constructed for use with various other wired and wireless communication protocols, including TCP/IP, UDP, SIP, SMS, RTP, WAP, CDMA, TDMA, EDGE, UMTS, GPRS, GSM, LTE, UWB, WiMax, IEEE 802.11x, and the like. In essence, networks 120 and 114 may include virtually any wired and/or wireless communication mechanisms by which information may travel between one computing device and another computing device, network, and the like. In one embodiment, network 114 may represent a LAN that is configured behind a firewall (not shown), within a business data center, for example.
The user platforms 140 may include any of a variety of providers of network transportable digital content. Typically, the file format that is employed is XML, however, the various embodiments are not so limited, and other file formats may be used. For example, feed formats other than HTML/XML or formats other than open/standard feed formats can be supported by various embodiments. Any electronic file format, such as Portable Document Format (PDF), audio (e.g., Motion Picture Experts Group Audio Layer 3-MP3, and the like), video (e.g., MP4, and the like), and any proprietary interchange format defined by specific content sites can be supported by the various embodiments described herein. Syndicated content includes, but is not limited to such content as news feeds, events listings, news stories, blog content, headlines, project updates, excerpts from discussion forums, business or government information, and the like. As used throughout this application, including the claims, the term “feed,” sometimes called a channel, refers to any mechanism that enables content access from a user platform 140.
In a particular embodiment, a user platform 140 with one or more client devices 141 enables a user to access content from other user platforms 140 via the message queue system site 110 and network 120. Client devices 141 may include virtually any computing device that is configured to send and receive information over a network, such as network 120. Such client devices 141 may include portable devices 144 or 146 such as, cellular telephones, smart phones, display pagers, radio frequency (RF) devices, infrared (IR) devices, global positioning devices (GPS), Personal Digital Assistants (PDAs), handheld computers, wearable computers, tablet computers, integrated devices combining one or more of the preceding devices, and the like. Client devices 141 may also include other computing devices, such as personal computers 142, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PC's, and the like. As such, client devices 141 may range widely in terms of capabilities and features. For example, a client device configured as a cell phone may have a numeric keypad and a few lines of monochrome LCD display on which only text may be displayed. In another example, a web-enabled client device may have a touch sensitive screen, a stylus, and several lines of color LCD display in which both text and graphics may be displayed. Moreover, the web-enabled client device may include a browser application enabled to receive and to send wireless application protocol messages (WAP), and/or wired application messages, and the like. In one embodiment, the browser application is enabled to employ HyperText Markup Language (HTML), Dynamic HTML, Handheld Device Markup Language (HDML), Wireless Markup Language (WML), WMLScript, JavaScript, EXtensible HTML (xHTML), Compact HTML (CHTML), and the like, to display and send a message.
Client devices 141 may also include at least one client application that is configured to receive content or messages from another computing device via a network transmission. The client application may include a capability to provide and receive textual content, graphical content, video content, audio content, alerts, messages, notifications, and the like. Moreover, client devices 141 may be further configured to communicate and/or receive a message, such as through a Short Message Service (SMS), direct messaging (e.g., Twitter), email, Multimedia Message Service (MMS), instant messaging (IM), internet relay chat (IRC), mIRC, Jabber, Enhanced Messaging Service (EMS), text messaging, Smart Messaging, Over the Air (OTA) messaging, or the like, between another computing device, and the like.
Client devices 141 may also include a wireless application device 148 on which a client application is configured to enable a user of the device to subscribe to at least one message source. Such subscription enables the user at user platform 140 to receive through the client device 141 at least a portion of the message content. Such content may include, but is not limited to, instant messages, Twitter tweets, posts, stock feeds, news articles, personal advertisements, shopping list prices, images, search results, blogs, sports, weather reports, or the like. Moreover, the content may be provided to client devices 141 using any of a variety of delivery mechanisms, including IM, SMS, Twitter, Facebook, MMS, IRC, EMS, audio messages, HTML, email, or another messaging application. In a particular embodiment, the application executable code used for content subscription as described herein can itself be downloaded to the wireless application device 148 via network 120.
In some cases, a user at user platform 140 can subscribe to certain content and/or content channels provided by all mechanisms available on the client device(s) 141. In various embodiments described herein, the host site 110 can employ processed information to deliver content channel information to the user using a variety of delivery mechanisms. For example, content channel information can be delivered to a user via email, Short Message Service (SMS), wireless applications, and direct messaging (e.g., Twitter) to name a few. Additionally, content channel information can be provided to a user in response to a request from the user.
Referring still to
Referring now to
In current practice, content is delivered from web servers to web clients using the HTTP protocol. This protocol is an important part in REST-ful application design (e.g., see Fielding, Roy Thomas, Architectural Styles and the Design of Network-based Software Architectures, Doctoral dissertation, University of California, Irvine, 2000; which can be found at the following web location: http://www.ics.uci.edu/˜fielding/pubs/dissertation/rest_arch_style.htm). To scale a data transfer system to Internet scale, data needs to be cached at multiple layers.
Referring to
To allow content to change over time, data that is mutable can be given an expiry date. For example, if the data consists of “today's news,” the expiry date can be set to midnight, after which new content may be available. All caching nodes on the network along this path, using the HTTP protocol, will be aware of this expiration header and will discard the cached data at the point at which the data becomes stale (e.g., when the current time/date passes the expiry time/date corresponding to the data). For example, see RFC 2616, the Expires: header definition of the HTTP protocol: http://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html.
When the expiration date of content is well known, this method works great. When the expiration date of content is unpredictable, some amount of caching will still help reduce network load and/or request load on each of the pieces of data in the above-described data pipeline. For example, a cache time-to-live of fifteen minutes will help reduce perhaps thousands of requests per second to a small fraction, yet allow new content to be delivered to consumers with at most a fifteen minute latency.
Caching is conceptually implemented as a key-value look-up store, where the key is the Uniform Resource Identifier (URI) of the content (also known as the “endpoint address” or “endpoint” for short) and various metadata about the request, known as the “varying headers.” For more information, see RFC 2616 cited above.
When the latency involved in caching is not acceptable, two additional methods exist to improve latency from the time of new content being available at a given endpoint, to the time at which clients will see this new content.
The first such method is the “ETags” header, which tells the client a version number or checksum of the content. When the client wants to receive content again, the client requests the content again, providing the Etag value the client previously received. The server can then compare this version number (or other Etag value) with the version number (or other Etag value) of the content currently on the server. If the Etag values match, the server can return an empty response that says “no change.” The client then knows that the content in the cache is current and can be re-used. This process saves network transfer time and resources over the network, especially for large pieces of content, such as images, sounds, and movies. However, the process requires a network round-trip to verify whether or not the content has changed. Depending on implementation details, this process may also invalidate the use of caching proxies in the middle of the chain, such as content delivery networks.
The method of providing cached data as described above is effective, as long as the client doesn't need to be told about new data availability pre-emptively. In other implementations, the client needs to be aware of when to request new data, should new data be available. As a result, the method of providing cached data as described above is useful for non-dynamic data, such as common background assets used to build a web site (e.g., images, scripts, etc.); but, the described caching method is not as useful or efficient for quickly changing, dynamic main content.
A second method for improving latency, typically used when content changes often, is called “long polling” (Long polling is sometimes referred to as “comet style” resource fetching). In long polling, a client makes a request similar to the Etags request described above. But, instead of immediately returning data, the server stalls the request for some amount of time (e.g., 15-30 seconds is typical). During this stall time, no data flows back to the client. If the data at the endpoint changes within this time interval, the new data is immediately returned to the client requesting this data. If the data at the endpoint does not change within this time interval, the request is finally returned as a “no change” result. The client will then re-request the same data, and get stalled again, in a process that repeats for as long as the client is interested in up-to-date data. An example of HTTP long polling is shown in
Referring now to
The various example embodiments described herein significantly improve on the responsiveness and machine resources needed to implement web applications that rely on frequently-changing data. In the disclosed example embodiments, the web server 310 shown in
Referring now to
As a result of the data transfer architecture as described herein and the inclusion of the message queue system 410 as part of the architecture, the standard HTTP protocol is now augmented with additional metadata. This additional metadata can indicate a message queue to which the client can connect, and addressing data within that message queue on which the client can listen to hear about updates to particular resources/endpoints. This additional metadata is transparently added to the HTTP headers of the response, such that the content can still be served to clients who do not connect to the message queue. Additionally, for resources on which data is expected to infrequently change, the web server does not need to add the queue headers, and the client will not attempt to subscribe to the queue.
When the server, or some system upon which the server relies, determines that the data for an endpoint has changed, a message can be sent to a message queue in the message queue system 410. Clients connected to the message queue can then receive this message and become aware that it is time to re-request the changed data from the given resource. Such a re-request can use the Etags feature described above to bypass content caching. In this manner, the data transfer architecture as described herein can provide a queue invalidation mechanism to alert a client of changed data.
As an alternative to adding queue connection and/or subscription information to the HTTP headers, the content can be modified to include this information. If the content is in a structured format that allows upwards/downwards compatibility, such as XML, or JSON, this modification can be done without affecting the ability of other clients to understand the response without knowing about the queue invalidation mechanism.
In an example embodiment, content can be structured into a URI of four segments within the web server. These four URI segments are detailed below in the example embodiment:
-
- /datatype
- /datatype/:id
- /datatype/:id/relation
- /datatype/:id/relation/:id
In the example of the four URI segments as detailed above, “datatype” identifies a particular kind of data, such as “user” or “picture” or “message.” In the example embodiment, the kind of data maps to a particular type and schema of the resulting endpoint, which makes it possible for the client web browser to properly format and display the given data. Specifically, each data type conforms to some JSON schema (e.g., see http://json-schema.org/).
At the endpoint, “/datatype” (or “/datatype/”) can be a collection (list) of URIs for all instances of that type that is visible to the current user. For very large collections, this list may be paginated (only partially shown, with links available for getting more data,) or the list may be omitted entirely. Additionally, query parameters may be used to modify the list, such as finding all users of a particular age. The example URI for such a query would be “/user?age=45” and the resulting entity would be the list (possibly paginated) of all user entities matching the query.
The URIs of each “user” entity will have the form “/user/:id” where “:id” is a unique identifier for each user. For example, the unique user identifier could be a customer ID or an email address. Thus, the URI of a particular user entity would be “/user/12345”. The data at this endpoint conforms to a defined JSON schema for a user entity, and would be data specific to the user with customer ID 12345 within the namespace of this web application.
Part of the user entity is a set of relations. For example, “friends on the friends list of this user” may be a relation named “friends” in the “user” entity. In JSON, this would be expressed as a link to a “friends” relation, such as the example below:
Rather than requiring the client application to explicitly know about “friends” relation URLs, the relative URL can be explicitly specified in the user entity. For more information on this state of the art, see “Hypertext As The Engine Of Application State” (HATEOAS) in the Fielding dissertation cited above.
The content of the “/user/:id/friends” endpoint is another list (possibly paginated, possibly queried/filtered) of URIs to entities that fulfill the relation in question. For example, if user 12345 has made friends with users 654 and 98765432, the content might look like the following example:
Each data element is individually addressable. Thus, a particular friend relation is addressable individually. This is important for allowing the application to add, modify, and delete specific friend relations. For example, removing the friend relation to user 654 might be expressed in HTTP as a DELETE verb on the URI “/user/12345/friends/1” given the entity data above.
A particular example embodiment of cache invalidation implemented with the message queue as described herein can contain one queue per unique URI, and can allow the client subscribe to each URI-specific queue to learn about the availability of new data (or modified data) at this URI endpoint. As memory resources are required in the transient message queue whenever a subscription or queue is created, the amount of RAM used can be fine-tuned by merging multiple URI updates into a single queue. This implementation reduces the number of queues to which a client can subscribe, but potentially increases the number of clients subscribed to each such queue. Two methods can be used by an example embodiment to manage the allocation of URIs to message queues. These methods are each described below:
-
- 1) Use consistent hashing from the name of the URI to the name of the queue, with some maximum size of the name space. For example, a hashing function such as CRC32 or MD5 can be used to calculate a (potentially large) integer value, that is then divided or masked down to a suitable size. For example, all queue names can be hashed and the lowest 16 bits can be masked off to create 65536 possible queue names on which to listen for updates. A straightforward hash will generate an even coverage over all the generated queue names, which is not actually optimal. A better approach is to use a structure-aware hash function. In this example, the name of the data type is left alone, and only the ID part is hashed. As a result, the sample URI “/user/12345” hashes to the queue name “/user/345”, if the hash function “id modulo 1000” is used.
- 2) URIs that are often accessed together can be queued together. For example, anyone interested in updates to the entity “/user/12345” will likely also be interested in updates to the relations of that entity. Thus, the queue used for entities that are relations off a main (two-component URI) entity can be shared with that main entity.
If a URI is very long, a shortened version of the URI can be used to avoid unnecessarily sending large amounts of data. Available methods of referring to long URIs using short methods are available in practice, including:
-
- 1) Using a hash of the URI that is unlikely to collide with other hashed URIs, encoded in a textual form. For example, the base-64 encoded MD5 hash of the URI, or the hex-encoded SHA256 hash of the URI. Other check-summing or hashing methods and other encoding methods may be used to similar effect.
- 2) Using a link shortening scheme, such as provided by the services goo.gl or bit.ly, that shorten a long URI to a short URI, and provides for later resolution of that shorter URI into the original long URI.
- 3) Using a custom dictionary of URI shortening tokens, communicated as header information in the HTTP response, where the server introduces a key/value look-up table that is retained by the client. Each time the server refers to a URI in an invalidation message, the server uses a short key previously introduced in a response to the client, and the client looks up this key in its table kept of previously introduced URIs.
-
- 1) Client requests the data for a particular endpoint, addressed by URI, using the HTTP protocol, from the server. The client includes a header that signals the server that the client is interested in real-time cache invalidation of the returned data. This information lets the server also serve data to clients that do not want or do not understand real-time invalidation, without allocating invalidation queue resources for them (see processing operation 501 shown in
FIG. 14 ). - 2) The server establishes a queue related to the endpoint URI in the queuing system, the queue name of which is derived as described above (see processing operation 502 shown in
FIG. 14 ). - 3) The server returns the data at the given URI to the client, together with information about the queuing system address and queue name. Because real-time invalidation is used, the server marks the data as “do not cache” or “cache for a brief time” for intermediate caches using HTTP cache control headers (see processing operation 503 shown in
FIG. 14 ). - 4) The client establishes a connection to the queuing system if not already present (see processing operation 504 shown in
FIG. 14 ). - 5) Over the established queuing connection, the client then requests a subscription to the named queue corresponding to the URI endpoint, if the client doesn't already have such a subscription (see processing operation 505 shown in
FIG. 14 ). - 6) When and if the data at the URI endpoint changes, the server sends a message on the named queue, the message includes information that it is a data invalidation event, and the specific URI endpoint that was invalidated (see processing operations 506 shown in
FIG. 14 ). - 7) The client re-requests the data from the server (see processing operation 507 shown in
FIG. 14 ). - 8) The server provides the new, updated data to the client (see processing operation 508 shown in
FIG. 14 ). - 9) The client updates the display of the data to the user in whatever means make sense. For example, update a text box on the screen, change the behavior of some animated game character, or just keep the new data for further computation.
- 1) Client requests the data for a particular endpoint, addressed by URI, using the HTTP protocol, from the server. The client includes a header that signals the server that the client is interested in real-time cache invalidation of the returned data. This information lets the server also serve data to clients that do not want or do not understand real-time invalidation, without allocating invalidation queue resources for them (see processing operation 501 shown in
The client, additionally, implements a local cache of URI endpoint data to re-use data that has not been invalidated. This is necessary to avoid unnecessary re-requests to the server when data is not stale and already received.
In practice, the most popular data request and delivery protocol in use today is HTTP (RFC 2616,) and other protocols closely related or evolved from HTTP, such as HTTPS (secure layer) and SPDY (a Google-specific extension, precursor to version 2.0 of the HTTP protocol.)
In a particular embodiment, a suitable protocol to use for connecting to the message queue is the WebSocket (RFC 6455) protocol. WebSocket is a protocol providing full-duplex communications channels over a single TCP connection. The WebSocket protocol was standardized by the Internet Engineering Task Force (IETF) as RFC 6455 in 2011. Additionally, other protocols, such as direct TCP connections or UDP connections can also be used, assuming the semantics of the message queue can be retained through additional intermediate framing mechanisms.
In an alternative embodiment, a system can be implemented where the updated data is sent through the invalidation mechanism (queuing system) to participating clients, to avoid the second fetch of the resource over HTTP. However, doing so complicates both the server and the client. The server is more complex, because the server has to know how to format and transmit the data over two separate protocols: HTTP, and the queuing system. The client is more complex, because the client needs to be able to receive and decode data over two separate protocols as well.
The example embodiment of cache invalidation implemented with the message queue as described herein lends itself very well to a Reactive Programming implementation approach in the client. In computing, reactive programming is a programming paradigm oriented around data flows and the propagation of change. Reactive Programming is a best practice well known to those of ordinary skill in the art. One embodiment using reactive programming to implement an end-user visible application on top of this near-real-time data delivery system is shown in
Referring now to
-
- 1) Referring to
FIG. 15 , the client application contains at least a local caching component, a user interface component, and a program logic component. - 2) The program logic component decides what data items to display to the user.
- 3) The program logic component creates a user interface component to display the data (see processing operation 603 shown in
FIG. 15 ). - 4) The program logic component requests the data in question from the local caching component be delivered to the user interface component (see processing operation 604 shown in
FIG. 15 ). - 5) The local caching component requests the data from the server (see processing operation 605 shown in
FIG. 15 ). - 6) The server returns the requested data together with real-time invalidation information (see processing operation 606 shown in
FIG. 15 ). - 7) The local caching component subscribes to real-time cache invalidation if the server response indicates that such invalidation is available (see processing operation 607 shown in
FIG. 15 ). - 8) The local caching component pushes the data into the user interface component when the data is received (see processing operation 608 shown in
FIG. 15 ). - 9) The user interface component displays (or re-displays) the data as it is received from the local caching component.
- 10) When a cache invalidation event is received from the queuing system, the local caching component re-fetches the data, and pushes the new data into the user interface component, which re-formats and re-displays the data (see processing operation 610 shown in
FIG. 15 ). - 11) When the user dismisses the information display of the given URI through whatever means, the user interface component removes itself from the set of listeners on the local cache for the given URI component (see processing operation 611 shown in
FIG. 15 ). - 12) The local cache can, when the set of listeners for a particular URI is empty, drop subscriptions to the given endpoint from the messaging system, to conserve system resources, because no display of the data is currently needed on the client system (see processing operation 612 shown in
FIG. 15 ). - 13) As described above with respect to the message queue system, when no clients exist that subscribe to a particular queue, that queue can be removed from the server system.
- 14) For data where time locality of use is likely, the client local cache can optimistically hang on to cached data and subscriptions for some amount of time after the last active listener has removed itself. If another user interface component is configured as a listener for the data, the data in cache can immediately be delivered.
- 1) Referring to
In the various embodiments described above, a main feature is that the program logic component as shown in
In the example embodiment described above, there can be a race condition in some circumstances. Specifically, if the entity (e.g., the data object) changes after the data has been sent from the server to the client, but before the client manages to connect to the queue and establish the connection, the client may not get the invalidation event. There are two solutions to this problem:
-
- 1) The simpler approach is to keep a backlog of the last X messages in the queue. When the client subscribes, the client will get invalidation messages that are retained in the queue. As long as the backlog of messages in the queue is longer than the time between the server returning data and the client establishing the subscription, the invalidation message in question will be received. The benefit of this mechanism is that it is simple. The draw-back is that it may unnecessarily invalidate client data by delivering older invalidation messages to the client when first connecting to the queue.
- 2) A second approach is to keep a version number for each entity in the queue. This can be kept as a state mount as described above in regard to the message queue system. Each time an entity is returned to the client, the entity is stamped with the version number. For example, the entity can be stamped using a custom HTTP header or an Etag. When the entity is updated, the version number of the entity stored in the queue is updated. When the client receives information about the latest available version of the data, the client compares the latest available version of the data to the version of the data the client currently has, and re-requests the entity only if the version available is later than the version the client already has.
Bootstrapping: As an alternative to providing address information about the queuing system to use for invalidation messages in HTTP headers or other metadata attached to the response data, one embodiment of the system disclosed herein uses a “bootstrap” URI, that returns information about other endpoints for clients to use when interacting with the system. Those endpoints are in turn identified using known key names. An example response to the bootstrap URI might look like the following example:
This structure saves space in responses by providing the update address once, rather than with each response. Additionally, this structure lets the server re-structure the URLs/services used without breaking compatibility with existing client code by simply changing the URLs exposed for the well-defined keys in the bootstrap response.
The example computer system 700 includes a data processor 702 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), or both), a main memory 704 and a static memory 706, which communicate with each other via a bus 708. The computer system 700 may further include a video display unit 710 (e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)). The computer system 700 also includes an input device 712 (e.g., a keyboard), a cursor control device 714 (e.g., a mouse), a disk drive unit 716, a signal generation device 718 (e.g., a speaker) and a network interface device 720.
The disk drive unit 716 includes a non-transitory machine-readable medium 722 on which is stored one or more sets of instructions (e.g., software 724) embodying any one or more of the methodologies or functions described herein. The instructions 724 may also reside, completely or at least partially, within the main memory 704, the static memory 706, and/or within the processor 702 during execution thereof by the computer system 700. The main memory 704 and the processor 702 also may constitute machine-readable media. The instructions 724 may further be transmitted or received over a network 726 via the network interface device 720. While the machine-readable medium 722 is shown in an example embodiment to be a single medium, the term “machine-readable medium” should be taken to include a single non-transitory medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more sets of instructions. The term “machine-readable medium” can also be taken to include any non-transitory medium, or combination of transitory media collaborating to create a non-transitory or semi-non-transitory medium, that is capable of storing, encoding or carrying a set of instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the various embodiments, or that is capable of storing, encoding or carrying data structures utilized by or associated with such a set of instructions. The term “machine-readable medium” can accordingly be taken to include, but not be limited to, solid-state memories, optical media, and magnetic media.
The Abstract of the Disclosure is provided to comply with 37 C.F.R. §1.72(b), requiring an abstract that will allow the reader to quickly ascertain the nature of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment.
Claims
1. A system comprising:
- a data processor;
- a network connection, in data communication with the processor, for access to a network; and
- a message queue system module, executable by the processor, to: receive queue subscription requests from a plurality of clients via the network connection, the queue subscription requests including a request for data from an identified queue and a request to receive invalidation information associated with the requested data; use a consistent hash table to map the identified queue to a corresponding queue node; convey information indicative of an invalidation event associated with the requested data to any of the plurality of clients from which a request to receive invalidation information was received; and receive a request for an update to the requested data in response to the conveyance of the information indicative of the invalidation event associated with the requested data.
2. The system of claim 1 wherein the queue subscription request is sent over a hypertext transfer protocol (HTTP) based protocol and the queue is identified using a uniform resource identifier (URI).
3. The system of claim 1 wherein the information indicative of the invalidation event is conveyed to any of the plurality of clients via WebSocket protocol.
4. The system of claim 1 wherein the information indicative of the invalidation event includes an identifier of the data that was invalidated.
5. The system of claim 1 wherein the information indicative of the invalidation event includes a shortened version of an identifier of the data that was invalidated.
6. The system of claim 1 wherein the information indicative of the invalidation event is sent through the corresponding subscribed queue.
7. The system of claim 1 wherein the information indicative of the invalidation event is sent as a state update in a state mount that maps an identifier of the data to a version number.
8. The system of claim 1 wherein the message queue system module being further configured to use a supervisor process to manage a mapping of the identified queue to a corresponding queue node.
9. The system of claim 1 wherein the message queue system module being further configured to translate the request from a protocol to a system message.
10. A method comprising:
- receiving queue subscription requests from a plurality of clients via a network connection, the queue subscription requests including a request for data from an identified queue and a request to receive invalidation information associated with the requested data;
- using a consistent hash table to map the identified queue to a corresponding queue node;
- conveying information indicative of an invalidation event associated with the requested data to any of the plurality of clients from which a request to receive invalidation information was received; and
- receiving a request for an update to the requested data in response to the conveyance of the information indicative of the invalidation event associated with the requested data.
11. The method of claim 10 wherein the queue subscription request is sent over a hypertext transfer protocol (HTTP) based protocol and the queue is identified using a uniform resource identifier (URI).
12. The method of claim 10 wherein the information indicative of the invalidation event is conveyed to any of the plurality of clients via WebSocket protocol.
13. The method of claim 10 wherein the information indicative of the invalidation event includes an identifier of the data that was invalidated.
14. The method of claim 10 wherein the information indicative of the invalidation event includes a shortened version of an identifier of the data that was invalidated.
15. The method of claim 10 wherein the information indicative of the invalidation event is sent through the corresponding subscribed queue.
16. The method of claim 10 wherein the information indicative of the invalidation event is sent as a state update in a state mount that maps an identifier of the data to a version number.
17. The method of claim 10 including using a supervisor process to manage a mapping of the identified queue to a corresponding queue node.
18. The method of claim 10 including translating the request from a protocol to a system message.
19. A non-transitory machine-useable storage medium embodying instructions which, when executed by a machine, cause the machine to:
- receive queue subscription requests from a plurality of clients via the network connection, the queue subscription requests including a request for data from an identified queue and a request to receive invalidation information associated with the requested data;
- use a consistent hash table to map the identified queue to a corresponding queue node;
- convey information indicative of an invalidation event associated with the requested data to any of the plurality of clients from which a request to receive invalidation information was received; and
- receive a request for an update to the requested data in response to the conveyance of the information indicative of the invalidation event associated with the requested data.
20. The machine-useable storage medium of claim 19 wherein the instructions being further configured to translate the request from a protocol to a system message.
Type: Application
Filed: Feb 6, 2017
Publication Date: May 25, 2017
Inventor: Jon Watte (Redwood City, CA)
Application Number: 15/425,395