SYSTEM AND METHOD FOR REAL-TIME ANALYSIS OF NETWORK TRAFFIC
A mirrored live-data flow of the live-data flow passing through a selected point within a network is monitored at a first processing node. The live-data flow comprises data that is in active transmission between endpoints in the network and prior to exit from the network and onward storage of the data in a database. Each packet within the mirrored data flow is decoded at the first processing node according to each protocol associated with a packet. Packets having a plurality of protocols associated therewith are decoded in parallel with each other. Each of the decoded packets are compared at the first processing node to a set of predetermined or deduced conditions. A predetermined or deduced response is executed based upon detection of a predetermined or deduced condition within the decoded packets. At least a portion of the decoded packets of the live-data flow causing execution of the predetermined or deduced response are processed at a second processing node to determine a manner for controlling an operation of the network at a same time the live-data flow is in active transmission between the endpoints in the network. The operation of the network is controlled in response to the processing step.
This application claims benefit of U.S. Provisional Application No. 61/877,810, filed Sep. 13, 2013, entitled REAL TIME ANALYSIS OF NETWORK TRAFFIC, the specification of which is incorporated by reference herein in its entirety.
TECHNICAL FIELDThe invention relates to voice and data networks, and more particularly to the real-time analysis of a live-data stream resulting in a situational deduction simultaneous to the live-data transmission over the voice and data networks, and as a result providing an opportunity to make effective an alert or action that now affects a set of probable outcomes before that data in transmission exits the network, or becomes at rest as a stored event, log record, or application record of what has already happened as the only outcome.
BACKGROUNDThe proliferation of internet- and mobile-connected devices—the ‘Internet of Everything’—has increased data traffic volume, transmission speeds and usage on communications networks. The ubiquity of device types and connections (cellular, wireless, multi-SIM, machine-to-machine) and the expansion of usage types (voice, high-definition video, music, data) have also made it more complex to monitor and secure these networks and to conduct analysis on the traffic and content.
To accomplish this, the traffic must be instrumented (what data is moving across the network), analyzed (what is the content of the traffic), and understood (what are the implications of this) so a relevant decision can be made or action taken within the available window of opportunity. This is especially so in the case of time-critical revenue, customer, operational, or security impacting events. Examples of such events include fraud occurring on mobile carrier networks, cellular zones dropping calls above an acceptable threshold, malfunctioning mobile applications, or malicious content or agents compromising a network.
This network data is captured by a variety of network probes sitting ‘inline’ (intrusively) inside the network. Network events must first ‘complete’ (example: after a voice call is completed and goes through ‘call teardown’) before they are translated into offline database records (example: Call Detail Records, Event Detail Records). These records are extracted at regular time intervals and provided to applications in offline enterprise data centers for post-event processing and analysis.
These systems can suffer from latency delays of up to 15 minutes for event data to be extracted and delivered to databases. In many cases, multiple terabytes of data are written into databases, posing ‘Big Data’ analytical challenges when time-critical results are needed. The inline hardware represents significant capital expenditures. These types of systems also provide a limited ability to respond flexibly to live conditions, as the application layer is not integrated contextually within the data collection layer. Database records are not generated for some network events that may provide indications of fraud or other critical issues that must be detected.
A use case is mobile carrier fraud detection that utilizes call detail records that have been delivered to a data warehouse after the relevant network traffic or calls have been completed. Detection of fraud in this case occurs after the actual fraudulent event has occurred, and in many cases, the carrier has already incurred a financial loss. Any actions taken to remediate (example: block the caller) can only be applied to the next time a relevant event appears in the network.
For a more complete understanding, reference is now made to the following description taken in conjunction with the accompanying drawings in which:
Referring now to the drawings, wherein like reference numbers are used herein to designate like elements throughout, the various views and embodiments of a system and method for real-time live-data analysis of network traffic are illustrated and described, and other possible embodiments are described. The figures are not necessarily drawn to scale, and in some instances the drawings have been exaggerated and/or simplified in places for illustrative purposes only. One of ordinary skill in the art will appreciate the many possible applications and variations based on the following examples of possible embodiments.
Referring now to the drawings, and more particularly to
The System 102 uses relational processing languages and techniques to enable detection of a situation in real-time and in parallel to its occurrence within a network; and not at a later point in time after the data has left the network for analysis based upon post-event data processing, which does not allow an opportunity to affect a change in outcome on that present event. The network traffic 104 is comprised of continuous transmissions of signaling and related data content (live-data) as can be found within voice communications or data networks such as those provided by mobile, broadband, or data communications network service providers. The System 102 provides any network provider (wireless carrier, fixed wire/line carrier, cable operator, etc.) an opportunity to detect and identify target events or patterns of data flow or relationships (“Events”) occurring within its network traffic 104 as they occur and to automatically deduce and take predictively relevant actions or control responsive to the detection in a concurrent manner to those transmissions. The network live-data, real-time analysis, and deduction system 102 provides the automated action in any number of fashions, including, but not limited to providing information to a dashboard, web based or mobile device display 106 that responds to a detected Event in parallel to the Event occurring and remaining open within the network traffic 104, or the generation of automated alerts 108 that may then be responded to manually or by the network.
Live-data is data that is in transmission between endpoints, not at rest within a database. Live-data is transient data in that it exists only for that period of time it is in transmission. The term “real-time” typically refers to the immediacy of a process or response to a query being made available in time for its usefulness. The term real time has nothing to do with the age or relevancy of the data, but instead has everything to do with the timeliness of response relevant to a time period. The term real time is therefore an omni-available description that introduces a time period and that needs to be qualified as, “real time to what?” Data that is time-critical relates to the period of urgency or usefulness applied to it. Real time live-data analysis is the time-critical processing of network traffic in parallel with its transmission and before such network traffic completes its transmission and exits the network to become an “already-happened” data event at rest.
The System 102 provides a non-intrusive process that enables data center logic to operate concurrently with the transmission before the transmission terminates and exits the network to become a data center application event, and additionally provides the ability for the data warehouse system to interact in a time-critical manner with the same network traffic 104 to provide contextualization of conditions based on trends or other data. The System 102 enables concurrent analysis and deduction of relationships and probabilities as Events occur and are transmitted as network traffic 104, thus allowing deductive parallel operations with the concurrently occurring network traffic and its operations. The System 102 does not reside within a data center that operates on a sequence of post event analytical functions; rather it is architected as a larger network topology operating non-intrusively and in parallel to the network traffic 104.
Within a network topology, the system is able to use one or more virtual machines as data collection devices (“ingestor node(s)”) connected non-intrusively to network elements that provide a port mirror to non-intrusively ingest network traffic (“live-data source”) to dynamically and continuously decode signaling, packet or data content (“network traffic”), and action such identifiable selected network traffic to trap and generate immediate alerts, and additionally pass through all or such selected subject matter for further processing simultaneously with and live to the network traffic event remaining open or in transit, or before the transmission exits the network and becomes a data center log record or application event. The system 102 is in two parts, consisting of one or more ingestor nodes 110 and one or more semantic nodes 112. The ingestor node 110 enables a non-intrusive, direct mirroring of network traffic 104 and its content, and provides protocol decoding, data extraction, and prescribed Event alert capabilities. The ingestor node also feeds an assigned semantic node 112 with such prescribed traffic as required. The ingestor node 110 non-intrusively undertakes its analysis and alerts while a particular Event is occurring or in transmission.
The various rules in control that dynamically instruct ingestor nodes 110 as to what particular protocol and information is being sought to be alerted by the System 102 are provided by the semantic node 112. The semantic node provides one or more virtual machines for the purpose of collecting all or selective network traffic from the ingestor node(s) 110 and enabling access to relational language processing in combination with their application use cases and variable windows of time to provide analysis and reasoned deduction of outcomes of time-critical live-data situations for the generation of further alerts, intercept and interdiction actions (“semantic node(s)”), being able to affect a more desirable or predictable outcome of the network traffic, before the transmission exits the network and becomes a data center log record or application event. The primary functions of the semantic node 112 are to attach to the ingestor node 110 for the receipt of all ingestor node packets 1104. Functions include to receive selected ingestor node packets 1104; the preparation and management of time critical processes required by use case applications 1102 to process the described use cases; to provide fast in-memory storage for statistical models required by a use case application; to provide application visualization and system administration visualization through the visualization VM 1110; and to provide integrity check of packets mirrored to packets that exit the network.
The System 102 has the ability to process data from the network traffic 104 at gigabit speeds. The ingestor node 110 filters, decodes, undertakes prescribed alerts and feeds selective or all network traffic into the semantic node 112. The semantic node 112 undertakes application specific use case tasks including situational analysis, contextual reasoning and deductive processing according to rules, statistical models and, if any, subject matter databases attached to the semantic node 112.
Referring now to
The semantic node 112 provides rules engine functionalities 210, visualization functionality 212, and command and control framework 214 to provide for an application use case execution. The rules engine 210, visualization 212 and command and control 214 provide a manner for analyzing the received data according to a particular use case. Specific use cases are provided within this framework using an open application programming interface (API) application blade architecture 216 that enables a user to develop and add multiple application use cases to the System 102. The semantic node 112 can be expanded to incorporate SSD and hard drive databases 218 provided they are able to perform at the time-critical speeds of the live-data processing. In direct relation to an embedded use case, the semantic node 112 has the ability for internal contextual evolution of the application specific statistical models by way of contextual table update and dynamically allocated stored procedures. This provides a certain amount of internally biased (situational learning) based on the correctness of the recommended decisions and execution of each application use case. Multiple applications can coexist and be implemented within the same semantic node 112 and processed from the same live-data input.
Referring now to
If no prescribed conditions are detected, control passes back to step 302 and the process repeats. Once a particular prescribed condition is detected, the ingestor node 110 sends an alert to the semantic node 112 or undertakes a preset action at step 312. This action could be to send a prescribed alert to network elements to truncate or trap and redirect that particular network traffic to other systems, including the semantic node, for processing. Such processing may include change of content, copy of content or to create interdiction schemes for further network traffic of a like nature. All decoded network traffic is sent at step 314 to the semantic node 112 wherein such particular use case rules associated with any detected conditions is applied to the data.
Referring now to
Referring now to
This information may be monitored for using particular statistical models implemented within the semantic node and in-memory database 412 and may additionally use additional contextual data from outside databases 414. The information within the semantic node and in-memory database 412 controls the operation of a rules engine 416 that generates the appropriate responses to information detected by the packet sniper 410 and generates various responses thereto such as email alerts 418, visualization outputs 420, configuration parameters 422 and framework queries 424. Information within the semantic node and in-memory database 412 may also be updated through a machine learning feedback loop 426.
Referring now to
In a method similar to that of the live-data network traffic ingest, the file-based information is also ingested, monitored and analyzed using particular statistical models implemented within the semantic node and in-memory database 512 and may additionally use contextual data from outside databases 514. The information within the semantic node and in-memory database 512 controls the operation of a rules engine 516 that generates the appropriate responses to information detected by the packet sniper 510 and generates various responses thereto such as email alerts 518, visualization outputs 520, configuration parameters 522 and framework queries 524. Information within the semantic node and in-memory database 512 may also be updated through a machine learning feedback loop 526.
The systems of
Referring now to
The semantic node 112 in layer 610 contains the application decision matrices, self-learning cognitive decision support, and action logic to enable execution of the desired use case outcome. Each semantic node 112 contains the use case or pattern recognition logic to identify with instances and situations that are of interest in accordance with their use case. The semantic node 112 provides a contextual learning loop through an independent process 614 connecting to legacy storage 616 and providing updates to the semantic node 112 in parallel to the system 102.
Referring now to
The live-data source provides network traffic (structured or unstructured) to the ingestor node 110 for decoding and identification. Upon ingestion by the ingestor node 110, the network traffic is sent to the protocol decoder 708 that decodes and identifies each wanted protocol packet and discretizes such wanted decoded network traffic as packets into a time dependent buffer (“TDB”) as allocated by the time dependent buffer VM (“TDB VM”) 908. The TDB VM 908 is a semaphore-based internal memory allocation manager for the ingestor node 110 that assists in the integrity of memory allocation and release to ensure that both locked and lockless operations can occur in parallel, in real-time as needed and without clash. This memory is allocated and distributed at arbitrary lengths, based on need (via a variable length bitmap). The address of each newly loaded TDB is passed to a process whereby prescribed or deduced events are looked for in packet sniper 718.
The packet sniper 718 compares the decoded data to certain conditions of interest as indicated by the prescribed rules provided by the semantic node 112 or by deduced conditions determined by the contextual data and feedback loop/learning loop undertaken by the semantic node 112. The packet sniper 718 provides positive indications 720 upon detection of these conditions. On completion of its search, each packet sniper 718 releases its previously allocated TDB to the ingestor node memory manager for use by other parallel current tasks or future operations that could be requested or introduced to the ingestor node 110. The TDB allows a no-lock, variable time latency multiprocessing of each packet by the ingestor node 110, and, the capability for locked operation in the eventuality of write functions being required to change the contents of packets. The packet sniper 718 further counts the number of packets that are received from the decoder 708 and provides this as a packet count indication 722. The packet count 722 is used to verify live event network traffic flow with post event network traffic records, providing a network transmission integrity check for network operations. The packets of interest detected by the packet sniper 718 are referenced against an action table by the ingestor node 110 and such prescribed action is executed. Network traffic of interest is flagged and sent to the semantic node 112 for application based processing. Selected or all network traffic flows to the application relevancy filter 724 within the semantic node 112; these are provided for longer term storage or transferred to legacy data or discard 726. Relevant network traffic is passed to the application rules engine 728 for further analysis to determine the actions required based upon the detected data.
The application rules engine 728 initiates particular actions and interventions 730 in accord with each application use case deduction and initiates the desired analytic outcome(s). The application rules engine 728 may also provide information to enable contextual updates with live-data events and actions at 732, in addition to the ability to enable manual input/output as part of the learning loop at step 734. The determined actions and interventions at 730 drive contextual updates with live-data events and actions that occur at 732. The actions and interventions 730 are used to execute particular actions at 736 or to provide information to the grid manager 712 within the ingestor node 110. The contextual update with live-data events and actions at 732 enable the creation of visualization and notifications of live-data alerts and other metrics to provide necessary notifications at step 738. The contextual update with live-data events and actions 732 also provides information for storage and application specific static and dynamic statistical model 740 and provides information to the activity and packet count journal 742. They also enable adjustment to the conditions, rules and actions which are passed back to ingestor node 110 and packet sniper 718 to provide dynamic and deducted additions to those prescribed by the use case. The visualization and notification of live-data alerts and other metrics execute an action at 736, or alternatively or additionally, enact live output to dashboards or data integration with other systems such as email, SMS, etc., at 744. After the executed actions at 736 are caused to occur, unwanted packets are discarded at 746. Information generated responsive to the activities are stored within the packet count journal 742.
Each use case provides the control information that controls the operation of its respective processes within the semantic node 112 and ingestor node 110. Each blade 750 may be associated with a particular use case such that a particular condition or operation may be monitored and detected by the ingestor node 110 and semantic node 112. Multiple blades 750 may be utilized such that different use cases may be implemented by the system 102 on the same network traffic 104 in parallel in a multithreaded fashion.
Referring now to
The data handler 810 generates various sources of semantic data 814. This data is provided to a semantic data writer 816 so that it may be written to a semantic data application program interface 818. The API 818 provides the data to the semantic node and in-memory database 820 that contains application specific parameters, traps and alerts that are generated responsive to various statistical models relating to received Events within the semantic node 112. Various alerts and reports are generated responsive to the semantic node and in-memory database 820 operations.
Referring now to
The ingestor node 110 consists of four agents able to operate independently and in parallel: 1) the ingest VM 902, 2) the governor VM 906, 3) the time dependent buffer (TDB) VM 908 and 4) the grid VM 910. The ingest VM 902 ingests the mirrored network traffic, undertakes protocol decoding, acquires a TDB, and discretizes and writes the required packetized data to the assigned TDB. The protocol decoder process within the ingest VM 902 uses an informational map that the ingestor node 110 uses for the dynamic allocation of threads and cores to decode one or potentially more protocol packets in parallel.
A network packet may contain multiple protocols. For example, an internet protocol (IP) packet may include web traffic (HTTP), mail (SMTP), internet phone (VOIP), file transfer (FTP) and network monitor (SNMP), amongst others. When the protocol decoder tells the ingestor node 110 to decode HTTPs, SMTP, FTP protocols, the protocol decoder collects information on both the sender and the target servers. The ingestor node 110 allocates three threads each operating on its assigned protocol and all three threads run in parallel to more readily operate on the packet. The design of the protocol decoder is lockless and a read-only operation. As an example, a decoded packet within a TDB VM 908 could be analyzed by three or more protocol decoders independently in parallel and with no fixed ordering. Thus, the HTTP decoder would perform a bit-comparison to determine if there were an HTTP page request within the packet, retrieve the target server name, and place the information within the semantic data queue. The SMTP decoder would perform a bit comparison to determine if there were an SMTP send mail within a packet, retrieve the mail server name and sender, and place the information within the semantic data queue. The FTP decoder would perform a bit comparison to determine if there were an SMTP PASV within the packet, retrieve the mail server name, and place the information within the semantic data queue. Each protocol decoder would independently release its use of its allocated TDB VM 908.
The ingest VM 902 also includes one or more packet sniper 718 process(es) for providing multi threaded parallel comparisons for prescribed or deduced conditions. The packet sniper process also includes the information that the ingestor node 110 uses for allocation of threads and/or cores to analyze per data type along with where and/or how to generate alerts to the semantic node 112. Similar to the protocol decoders, multiple packet sniper processes can be enacted on any assigned TDB, each process releasing its interest in the TDB when finished. The conditions being sought by packet sniper processes are set up by the semantic node 112 or may optionally be established by direct input to the ingest VM 902. The ingest VM 902 is also able to simultaneously transmit selected or all data to the semantic node 112.
In one example, a decoded SS7 packet contains the phone number of a caller and the phone number of a call recipient. To address the requirement of alerting when caller (1234567890) makes calls to any number, and to alert when called number (1900PREMIUM) receives calls from any number, the packet sniper configuration tells the ingestor node 110 of these two separate operations with respect to an outgoing sniper and an incoming sniper. The ingestor node 110 allocates two packet snipers, each operating on its assigned task and within its own in-memory database or assigned TDB VM 908. Each thread runs in parallel and independently with no fixed ordering and will operate on a decoded packet. When the outgoing sniper matches the caller number to a caller blacklist in its in-memory database, an alert will be generated. Similarly, if the incoming sniper matches a called number to a called blacklist within its memory database, the packet sniper generates an alert. Packet sniper will independently release use of its TDB VM 908.
The governor VM 906 acts as a performance watchdog with the ability to organize core and/or memory availability of the ingest VM 902 responsive to its detected conditions. The dynamic allocation and release of multiple TDB VM 908 allows multiple functions of disparate timing to be scheduled by the ingest VM 902 so that optimum memory availability is provided to those functions. The TDB VM 908 provides the ingestor node 110 with the ability to use memory efficiently in concert with the speed of ingest and any disparate ingestor node 110 processing. The TDB VM 908 uses a combination of semaphores and arbitrary memory mapping dynamically responding to allocation of memory requests. The TDB VM 908 allows for the efficient use and tuning of memory based upon time required and size needed. Multiple ingestor node tasks and VMs are able to request workspace of varying need and time. TDB VM 908 flags the required memory blocks. These can be flagged as a lock or no lock status. The flagged memory can then be used in parallel by multiple tasks in read only mode, and dynamically locked if in write mode. Each task releases its need for the memory block on completion of its task. The final release will release that memory block back to the TDB VM 908 for further use. TDB VM 908 is able to allocate as a single block of memory non-contiguous blocks grouped as a virtual contiguous allocations of memory.
This memory management is illustrated for three simultaneously operating processes in
Referring now to
Thus, from the port mirror the network traffic can be copied (in parallel to its transmission) into one or more of the allocated TDBs 1013 and made available to one or more of assigned scheduled cores of the ingest VM 902 and, by using variable bitmap searching, the required protocols are decoded and recognized, or the required patterns are recognized at step 1010. The address of TDBs 1013 containing wanted protocols/packets/patterns are passed to packet sniper 1016 and other such tasks for further processing or inspection. The TDB VM 908 process monitors the availability of memory blocks and presents the available status to the ingest VM 902. The ingest VM 902 schedules the sending of the ingested data to the semantic node 112 in parallel scheduling routines through the packet sniper 1016 that compares data for preselected alerts or actions at inquiry step 1018. Once a TDB 1013 is fully released and its contents transmitted at step 1020 to the semantic node 112, the now available TDB addresses are returned at step 1022 to the TDB VM 908 memory map as being available. Control will then pass back to step 1002.
If the packet sniper 1016 does not detect a comparison match at inquiry step 1018, control passes to step 1024 to determine if different content exists. If so, additional comparisons are performed at step 1018. If no further comparison data is available, control passes to steps 1026 and 1028 wherein the packet sniper journal is updated at step 1026, and the memory associated with the compared data is released and the TDB VM 908 memory map updated at step 1028. The TDB VM 908 does not clear buffers for use until every task has issued a clear status on that TDB 1013.
Packet sniper 1016 is engaged when each ingest VM 902 has completed its loading of live-data from the allocating core. The packet sniper 1016 is responsive to dynamic or deduced updates received from the semantic node at 1017. This update information 1017 enables the packet sniper 1016 to target particular content and/or situations. This information is stored within a target content and/or situation file 1019 that controls the operation of the packet sniper 1016. Packet sniper 1016 analyses the contents of the TDB 1013 for content or conditions that have already been determined as being of interest at inquiry step 1018, as well as updated deduced conditions from step 1019. If found, packet sniper 1016 performs predetermined action triggers at 1030 that can either execute within the ingestor node 110 or defer to the semantic node 112. If inquiry step 1018 determines that a match does exist, the action associated with the match is executed at step 1030 and an alert is generated to the semantic node 112 at step 1032. Packet sniper 1016 will then continue its searches at step 1024.
The role of the governor VM 906 is to monitor and maintain the preset performance levels of core usage and memory space available to all virtual machines and tasks within their host ingestor node 110. Assigned cores that operate at a higher percent busy value or excessive memory usage cause an alarm to be sent to the semantic node 112 for diagnostic records and alerts.
The governor VM 906 measures the time periods of the ingestor node 110. This comprises measuring the time taken for the TDB VM 908, the packet sniper(s) and other tasks to complete their operations, and additionally, ensuring that memory usage is not growing beyond a certain threshhold. The governor VM 906 operates in parallel to all of the other virtual machines in the ingestor node 110 and engages dynamic performance balancing of available cores and memory should processes start to encroach on preset or dynamically set hurdles. The performance gathering data of the governor VM 906 is logged and sent at regular intervals to the semantic node 112 for journal entry at 1036. The governor VM 906 also acts as the entry point for executing messaging from the grid VM 910 and command and control functions from the assigned semantic node 112. The governor VM 906 determines at inquiry steps 1038-1042 whether there has been a grid VM 910 condition set or an internal performance breach. When a grid VM 910 condition or performance breach is detected, the governor VM 906 undertakes reallocation of priorities and resources as provided by the resident operating system and utilities at step 1044 and at step 1046. Governor VM 906 undertakes similar actions when receiving command, control, update, or diagnostic instructions by the assigned semantic node 112.
As a result of a threshold alarm, the governor VM 906 commences working with the operating system and TDB VM 908 to reassign other cores and memory of a lower priority and to allocate the newly-available resources to assist in reducing the workload of other cores. Thus, in a situation where cores running ingest or decode or packet sniper tasks approached a set threshold level of, for example, 70% and, or, the amount of available memory for allocation to those tasks in the TDB 1013 also reached a threshold level of, for example, not less than 20%, the governor VM 906 would a) attempt to reassign or cease lower priority work, b) attempt to increase available memory in the TDB 1013, and c) inform the assigned semantic node 112 of the condition.
The role of the grid VM 910 is to manage for its host ingestor node 110 the intercommunications between peer ingestor nodes 110, and thereby the intercommunications between multiple semantic nodes 112. Based on use case performance requirements it is possible to configure any number of ingestor nodes 110 and semantic nodes 112 into an analytical grid architecture. Thus, the grid VM 910 receives inter-ingestor node notification at 1050 and makes notes of these indications at 1052. The grid VM 910 is also able to send notifications to other ingestor nodes 110 at 1054. The data within the grid VM 910 is referred to as map of operations and contains a role both within the grid and within the node. The grid VM 910 enables notification of dynamic conditions and required action among various ingestor nodes 110 within a set of Systems 102.
Referring now to
The semantic node 112 provides a framework for time-critical situational analysis, decision support deduction and action processing of multiple use applications 1102 with regard to the live-data packets 1104 sent by the ingestor node 110. In some cases this may require the use case application to access various other data such as legacy data center records 1106 or to send alerts or to seek action that may require the servicing of the use case application's needs to include non live-data access to data storage outside the System 102.
The decision accuracy and situational relevancy of semantic node 112 is continually updated through the recording of actions and alerts within the action and alerts database 1108. The actions and alerts are deemed to be correct/non-correct through programmatic access to data center records 1106 and the subsequent reformulation of statistical subject matter used in decision support situational analysis. The semantic node 112 consists of three processes that operate dynamically and independently to form the rules engine 1112. These include the application blade manager 1114, visualization VM 1110 and self-learning loop 1116. A semantic node 112 further includes two virtual machines (agents) including a grid VM 1118 and governor VM 1120. The grid VM 1118 and governor VM 1120 operate in the same fashion discussed herein above with respect to the ingestor node 110 and provide the same functionalities. Queries to the semantic node 112 can be dynamically and programmatically executed responsive to use case application 1102 control or may also be learned through matrices input and defined or external machine (big data) input, including statistical models and pattern recognition.
The visualization VM 1110 provides the framework to drive dashboards (visual analysis tool or data presentation media) reporting in real-time to the activities being undertaken or their results, and provides an operational command and control entry point to the System 102.
Referring now to
This known unlabeled data may be used to determine the statistical accuracy of customer algorithm results at step 1222 or provide customer analyst label outcomes at step 1224. The customer analyst label outcomes may provide known data with labeled responses at step 1226 which may be used to derive an action list for the ingestor node 110 at step 1228. Inquiry step 1230 determines if there is a learned classification algorithm based upon the labeled responses. If not, the machine learning algorithm builds a classification model using the labeled responses as a training data set at step 1232. If so, the machine-learned algorithm is run against data for validation to calculate the statistical accuracy at step 1236. At step 1238, a comparison of the accuracy and speed of the machine learned algorithm against the statistical accuracy of the customer algorithm may be based upon the result from step 1236, and the statistical accuracy of customer algorithm results at step 1240. All this information is used to generate a report outcome to the graphical user interface as a customer inquiry at step 1242. Additionally, this outcome is used to calculate the deduced conditions which are provided back to the ingestor node 110 and packet sniper 1016.
Referring now to
The system described herein above with respect to
This methodology may also be utilized in a number of applications for controlling and managing customer experience. These include things such as bill shock management, social network analysis for churn avoidance, identification of non-optimal network conditions and immediately notifying or offloading subscribers for amelioration, high-value subscribers and the provision of granularized service to them for things such as dropped calls, wireless offloading for congestion, dynamic notifications for network outages, All-You-Can-App (customized tariff plans based on personalized application usage), and social network analysis for individualized experiences.
With respect to network operations applications, the system methodology can provide an intelligent network planning to prioritize/plan/optimize investments ahead of a demand curve, provide subscriber-centric wireless offload based on contextual intelligence, provide congestion control at the granular level, provide core instrumentation and alerting, provide traffic management, provide instrumentation for circuit measurements, detect silent/dropped calls, calculate answer ratios, real-time control and alerts and to provide for data session quality-of-service monitoring and control. In one example, the System 102 receives outage plans for cell towers and commences monitoring in conjunction with a live-data source the presence, movement and activities of such mobile devices or devices within that nominated cell tower transmission area. A file is built in real-time to that monitoring and a usage map is dynamically built. The map is used to selectively alert through SMS, email, or other such contact methods such dynamic situations or planned outages creating a just in time dynamic alert system based in real time to the live-data deductions.
Finally, with respect to network security applications, the system methodology enables analysis of live-data network traffic for the purpose of identifying malicious content or agents as they enter the network at any determined location or between two or more points, in applications, packets, on devices or network elements. This identification and detection in concert with the packet sniper capabilities of automated alert and prescribed or dynamic/deduced actions can isolate, trap, or reject the passage of such threats from further movement through or into the network (or out of the network into further onwards data centers or enterprise systems). While each of these various applications of the described methodology are only examples thereof, it would be appreciated by one skilled in the art that various other implementations of the methodology in accordance with the general process described herein may also be implemented.
Referring now to
The System 102 can detect the number of outgoing calls from a single roaming subscriber to one or more international numbers at step 1402. Next, a determination is made at inquiry step 1404 as to whether the number of outgoing calls from a single roaming subscriber to one or more international numbers has exceeded a user configurable threshold and, if so, whether this has occurred within a user configurable period of time at inquiry step 1406. If the number of outgoing calls has exceeded the threshold within the configured time period, alarms with associated reports may be generated at step 1408. The alarm may be used to indicate to the network provider that an outgoing call threshold from the specified roaming subscriber number has been exceeded and further scrutiny is necessary. A drill down report generated along with the alarm is made available for the network provider that will list the international numbers that are being called. If inquiry steps 1404 and 1406 determine that the configurable call numbers or time periods have not been exceeded, control passes back to step 1410 to continue monitoring the roaming data at step 1402. Outcomes from 1408 are integrated with external contextual data at 1412, and this information is utilized by the semantic node 112 to calculate dynamic changes to any parameters relevant to the use case.
Referring now to
The methodology uses data sources consulted by the semantic node 112 that include known revenue share fraud databases or threat lists that have been built based on past calling behavior, carrier fraud and threat databases. In using the methodology of
The reports generated in response to detection of this condition would include updates of all current fraud events updated with all victims who have received SMS or phone calls. The reports would show common numbers any victims are calling back in order to identify the callback numbers of the SMS attacks. The reports would further provide real-time calculations of KPIs and savings in the dashboard to show cost/call of each return call so analysts can track savings from the time the callback number is barred to customers. This will calculate how much it would have cost the customer had the Wangiri fraud not been identified and stopped. Thus, a particular savings benefit can be numerically defined for customers and the network provider.
Another type of fraud which may be detected by the system 102 is International Revenue Share fraud. This type of fraud involves perpetrators making calls to international premium numbers on stolen or purchased SIM cards from within the carrier network. This type of fraud has two subtypes. Within the “number callout” scenario, subscribers call international premium numbers as evidenced by a sudden high number of outbound calls to a small range of destinations. This could indicate the usage of stolen SIMs or SIMs purchased with no intention to pay the full contract/bill. In this case, there is no correlated inbound trigger of calls from an international number as in Wangiri Fraud, and the calls are placed from within the carrier network, unlike the international roaming fraud. In the “country callout” scenario a high number of calls are suddenly placed to a specific country. These calls exceed the normal baseline call rates and the calls are placed from within the carrier network. External data sources may be consulted by the semantic node 112 in order to access known revenue sharing databases, threat lists that have been built on past calling behavior, carrier fraud and threat databases.
The calling patterns are detected in the manner illustrated in
Referring now to
The drilldown reports provided at 1608 and 1708 respectively can provide updates on the configurable time period of each fraud event. Reports may also provide a summary of each fraud alert for immediate scanning by an analyst, enabling them to determine how many A numbers/B numbers, cumulative duration, etc. The reports may also provide risk scoring of each alert based upon a configurable set of questions (e.g. are 90% of calls being answered, are majority of calls 2 minutes plus). The report may also provide risk scoring of each alert based upon an external big data contextualization (do any A numbers in this alert have a current balance owing greater than $X). The alert generation may comprise the provision of an application program interface to customer billing and customer profile information, as at 1612 and 1712. Finally, calling maps may be generated to show the relationship between anyone involved in the fraud event, showing all activity for the past 48 hours. These external data sources can be linked to semantic node 112 for ongoing, automatic adjustment or feedback to the use case rules and can inform packet sniper in ingestor node 110 to be aware of specific subscribers, phone numbers, relationships, patterns, thresholds, or other factors, that, when encountered in the network traffic 104, will be automatically alerted on or actions/instructions sent to other systems. Examples include communications to network operations to terminate a call, bar a specific subscriber, prevent outbound calls to a specific phone number—all of these are actions to alter the specific activity as it is detected. This enables the carrier to prevent the losses from being incurred by intercepting the fraudulent activity before or while it happens.
Referring now to
Referring now to
Referring now to
Referring now to
Referring now to
Simultaneously, the planned cell tower outage schedules act as event triggers 2308, and manual updates and changes to these schedules 2306 are ingested by the ingest VM 2302. These are integrated at 2304 and sent onwards to the network topology bitmap 2303. The network topology bitmap 2303 represents a live-data mirror of device locations, the cell tower locations or planned or dynamically required outages for service improvements of those towers, as well as accessing a historical record of the presence of the device locations within the targeted cell tower locations. This historical record allows for a deductive process to occur as to the multiple locations over a period of time with regard to both individual devices as well as multiple cell towers. In this fashion, outage notifications can be based on both real-time (immediately-occurring), or historically based device presence in each cell tower location.
The role of the semantic node is shown in
The ability for the system to provide real-time sentiment analysis to the carrier is illustrated in
A further example of the use of the real-time data monitoring system is with respect to network/core instrumentation and alerting. Examples of this include the ability to monitor, measure and alert on any network operation or function with the option to set configurable parameters for threshold, limits, alarms and performance optimums. In all cases, visualizations and queries can be drilled down to show innumerable combinations of data (e.g. calls by time, country, circuit, partner, device, etc.), and time periods (real-time, immediate performance and drill down to show how immediate conditions compare against any desired time period of minutes, hours, days, weeks, months, etc.). In all cases, thresholds or performance norms can be set or changed in real-time by the customer and any deviation or desired alerting/alarming can be sent to a variety of destinations including dashboards, email, mobile devices or other applications, solutions or systems.
The system can measure the performance of network circuits (CICs) in real-time and provide visualization of all monitored CICs over a selectable time period to show trends and performance norms. When any single CIC or group of fellow CICs fall below the threshold which are configurable and changeable in real-time from the dashboard, alerts can be sent to the dashboard and/or to email, SMS or other connected systems.
Measurements of total network traffic can be as granular as the customer desires. Measurements can include total calls in/out, total SMS in/out and any combination of drill down on these analyses including querying the data by circuit, by cell tower, by interconnected partner, by inbound or outbound traffic, by destination or origin country, by device type, by conversation length, etc. Anything that can be measured can be queried and displayed on the dashboard.
The system may be used to measure the ratio of answered to unanswered calls against a customer-configured threshold. Real-time data can be drilled down by any of the categories mentioned in the previous use case and thresholds can be changed in real-time. Alerts can be sent to a dashboard, email, SMS or other system. This system may also detect average conversation times and interconnect traffic data and provide alerts, reports, etc., based upon this information. Thus, using the above described system and method, real-time data flow within a network, via a connection to a particular network element, switch, etc., may be achieved in order to analyze the real-time data flow in order to generate analysis and reports of the data while the data is actively being generated before it exits the network for onward storage. This enables network providers to provide much more up to date and real-time responses to the analyzed data and achieve improvements to system performance and issues as these events are occurring rather than at a later date based upon post-data analysis.
It will be appreciated by those skilled in the art having the benefit of this disclosure that this system and method for real-time live-data analysis of network traffic provides a manner for monitoring and analyzing network content as the data is moving through the network and provides an ability to affect the outcome that ordinarily in the absence of such a system and method would be not-affected in relationship to its normal course of business.
It should be understood that the drawings and detailed description herein are to be regarded in an illustrative rather than a restrictive manner, and are not intended to be limiting to the particular forms and examples disclosed. On the contrary, included are any further modifications, changes, rearrangements, substitutions, alternatives, design choices, and embodiments apparent to those of ordinary skill in the art, without departing from the spirit and scope hereof, as defined by the following claims. Thus, it is intended that the following claims be interpreted to embrace all such further modifications, changes, rearrangements, substitutions, alternatives, design choices, and embodiments.
Claims
1. A method for monitoring live-data flow through a network, comprising:
- monitoring, at a first processing node, a mirrored live-data flow of the live-data flow passing through a selected point within the network in a non-intrusive manner that does not affect the live-data flow passing through the selected point, wherein the live-data flow comprises data that is in active transmission between endpoints in the network and prior to onward storage of the data in a database;
- decoding, at the first processing node, each packet within the mirrored data flow according to each protocol associated with a packet, wherein packets have a plurality of protocols associated therewith are decoded in parallel with each other;
- comparing, at the first processing node, each of the decoded packets to at least one of a set of predetermined or deduced conditions received from a second processing node;
- executing at least one of a predetermined or deduced response including an indication of occurrence of the at least one predetermined or deduced condition based upon detection of the at least one predetermined or deduced condition within the decoded packets;
- processing, at the second processing node, at least a portion of the decoded packets of the live-data flow causing execution of the at least one predetermined or deduced response to determine a manner for controlling an operation of the network at a same time the live-data flow is in active transmission between the endpoints in the network; and
- controlling the operation of the network in response to the processing step while events associated with the live-data flow are occurring within the network.
2. The method of claim 1, wherein the step of decoding further includes the steps of:
- determining whether the decoded packets are wanted for further processing;
- discarding unwanted decoded packets;
- assigning wanted decoded packets to at least one time dependent buffer for comparison to the at least one predetermined or deduced conditions, the time dependent buffer comprising a variable length bit map;
- forwarding wanted packets to the assigned at least one time dependent buffer for comparison to the set of at least one predetermined or deduced conditions.
3. The method of claim 2 further including the step of transmitting the wanted data packets to the second processing node for the step of processing.
4. The method of claim 1, wherein the step of processing further comprises the steps of:
- receiving the indication of detection of one of the at least one predetermined or deduced conditions;
- assigning at least one predefined application associated with the indication;
- processing the decoded data flow using the assigned at least one predefined application in simultaneous multithreaded parallel fashion to generate at least one control action for controlling the operation in the network at the same time the live-data flow is in active transmission between the endpoints in the network, wherein controlling comprises alerting or instructing a network provider to affect an event outcome before the event is finalized; and
- controlling the operation of the network using the at least one control action.
5. The method of claim 1, wherein the step of processing further comprises processing the portion of the decoded packets decoded data flow using relational language processing.
6. The method of claim 1, wherein the step of executing the at least one predetermined or deduced response further comprises generating an alert to the network provider in response to the at least one predetermined or deduced condition.
7. The method of claim 1, wherein the step of processing further comprises the step of processing, at the second node at least a portion of the decoded packets of the live-data flow, the processing causing execution of the predetermined or deduced response and the deducing of dynamically created conditions responsive to historical data, the dynamically deduced conditions established as part of a learning process including statistical models and pattern recognition, operating within the second node with the option to provide the dynamically deduced conditions to the first processing node to update the deduced conditions.
8. The method of claim 1, wherein the step of controlling further comprises maintaining at least one network operating parameter at a predetermined level and generating an alarm when the live-data of the data flow indicates that the at least one network parameter is moving from the predetermined level.
9. The method of claim 1, wherein the step of controlling further comprises detecting an occurrence of a fraudulent activity on the network based on the live-data within the data flow and generating a warning to notify a network provider of the fraudulent activity as the fraudulent activity is being transmitted in the data flow over the network.
10. A system for monitoring live-data flow through a network, comprising:
- a server communicating with the network;
- a network interface card associated with the server for providing access to the data flow through the network;
- a processor within the server the processor implementing a first processing node for: monitoring, at a first processing node, a mirrored live-data flow of the live-data flow passing through a selected point within the network in a non-intrusive manner that does not affect the live-data flow passing through the selected point, wherein the live-data flow comprises data that is in active transmission between endpoints in the network and prior to storage of the data in a database; decoding, at the first processing node, each packet within the mirrored data flow according to each protocol associated with a packet, wherein packets have a plurality of protocols associated therewith are decoded in parallel with each other; comparing, at the first processing node, each of the decoded packets to at least one of a set of predetermined or deduced conditions; executing at least one of a predetermined or deduced response including an indication of occurrence of the at least one predetermined or deduced condition based upon detection of at least one predetermined or deduced condition within the decoded packets;
- the processor within the server the processor further implementing a second processing node for: processing, at a second processing node, at least a portion of the decoded packets of the live-data flow causing execution of the at least one predetermined or deduced response to determine a manner for controlling an operation of the network at a same time the live-data flow is in active transmission between the endpoints in the network, wherein controlling comprises alerting or instructing a network provider to affect an event outcome before the event is finalized; and controlling the operation of the network in response to the processing step while events associated with the live-data flow are occurring within the network.
11. The system of claim 10, further including a memory associated with the second processing node, the memory including application processing rule sets each defining a different manner for controlling operations at the same time the live-data flow is in active transmission between the endpoints in the network in response to the at least one detected predetermined or deduced condition, wherein controlling comprises alerting or instructing a network provider to affect an event outcome before the event is finalized.
12. The system of claim 11, wherein the memory further defines the at least one predetermined or deduced conditions for the comparison at the first node.
13. The system of claim 10, further including:
- at least one time dependent buffer comprising a variable length bit map;
- wherein the processor further implements the first processing node for: determining whether the decoded packets are wanted for further processing; discarding unwanted decoded packets; assigning wanted decoded packets to the at least one time dependent buffer for comparison to the at least one of the set of predetermined or deduced conditions; forwarding wanted packets to the assigned at least one time dependent buffer for comparison to the at least one set of predetermined or deduced conditions.
14. The system of claim 11, wherein the processor further implements the first processing node for transmitting the wanted data packets to the second processing node for the step of processing.
15. The system of claim 10, wherein the processor further implements the second processing node for:
- receiving the indication of detection of one of the at least one predetermined or deduced conditions;
- assigning at least one predefined application associated with the indication;
- processing the decoded data flow using the assigned at least one predefined application in simultaneous multithreaded parallel fashion to generate at least one control action for controlling the operation in the network at the same time the live-data flow is in active transmission between the endpoints in the network, wherein controlling comprises alerting or instructing a network provider to affect an event outcome before the event is finalized; and
- controlling the operation of the network using the at least one control action.
16. The system of claim 10, wherein the processor further implements the second processing node for processing the portion of the decoded packets decoded data flow using relational language processing.
17. The system of claim 10, wherein the processor further implements the second processing node for generating an alert to another processor node in response to the at least one predetermined or deduced condition.
18. The system of claim 10, wherein the processor further implements the second processing node for processing at least a portion of the decoded packets of the live-data flow causing execution of the predetermined response and the deducing of dynamically created conditions responsive to historical data, the dynamically deduced conditions established as part of a learning process including statistical models and pattern recognition, operating within the second node with the option to provide the dynamically deduced conditions to the first processing node to update the deduced conditions
19. The system of claim 10, wherein the processor further implements the second processing node for controlling the network by monitoring for an occurrence of network security threats within the live-data of the data flow and isolating any detected security threats within the live-data from the data flow of the network.
20. The system of claim 10, wherein the processor further implements the second processing node for monitoring the live-data of the data flow to create a model of subscriber movement and frequency of presence within each cell tower location, wherein each cell tower has planned and unplanned outages, further wherein as these outages occur, the second processor generating a roster of all subscribers currently within or approaching the cell tower, and generating a notification to be broadcast at least one of generally or selectively based on each subscribers communications preferences.
21. A system for monitoring live-data flow through a network, comprising:
- a network interface for connecting to the network;
- a processor coupled to the network interface;
- a memory coupled to the processor, the memory storing a plurality of instructions for execution by the processor, the plurality of instructions including: instructions for monitoring, at a first processing node, a mirrored live-data flow of the live-data flow passing through a selected point within the network in a non-intrusive manner that does not affect the live-data flow passing through the selected point, wherein the live-data flow comprises data that is in active transmission between endpoints in the network and prior to storage of the data in a database; instructions for decoding, at the first processing node, each packet within the mirrored data flow according to each protocol associated with a packet, wherein packets have a plurality of protocols associated therewith are decoded in parallel with each other; instructions for comparing, at the first processing node, each of the decoded packets to at least one of a set of predetermined or deduced conditions; instructions for executing at least one of a predetermined or deduced response including an indication of occurrence of the at least one predetermined or deduced condition based upon detection of at least one of a predetermined or deduced condition within the decoded packets; instructions for processing, at a second processing node, at least a portion of the decoded packets of the live-data flow causing execution of the at least one predetermined or deduced response to determine a manner for controlling an operation of the network at a same time the live-data flow is in active transmission between the endpoints in the network; and instructions for controlling the operation of the network in response to the processing step, while events associated with the live-data flow are occurring within the network.
22. The system of claim 21, wherein the instructions for decoding further include:
- instructions for determining whether the decoded packets are wanted for further processing;
- instructions for discarding unwanted decoded packets;
- instructions for assigning wanted decoded packets to at least one time dependent buffer for comparison to the at least one predetermined or deduced conditions, the time dependent buffer comprising a variable length bit map;
- instructions for forwarding wanted packets to the assigned at least one time dependent buffer for comparison to the at least one of the set of predetermined or deduced conditions.
23. The system of claim 22 further including instructions for transmitting the wanted data packets to the second processing node for the processing of the portion of the decoded packets.
24. The system of claim 21, wherein the instructions for processing further comprises:
- instructions for receiving the indication of detection of one of the predetermined or deduced conditions;
- instructions for assigning at least one predefined application associated with the indication;
- instructions for processing the decoded data flow using the assigned at least one predefined application in simultaneous multithreaded parallel fashion to generate at least one control action for controlling the operation in the network at the same time the live-data flow is in active transmission between the endpoints in the network, wherein controlling comprises alerting or instructing a network provider to affect an event outcome before the event is finalized; and
- instructions for controlling the operation of the network using the at least one control action.
25. The system of claim 21, wherein the instructions for processing further comprises instructions for processing the portion of the decoded packets decoded data flow using relational language processing.
26. The system of claim 21, wherein the instructions for executing at least one predetermined or deduced response further comprises instructions for generating a notification to the network provider in response to the at least one predetermined or deduced condition.
27. The system of claim 21, wherein the instructions for executing the at least one predetermined or deduced response further comprises instructions for processing at least a portion of the decoded packets of the live-data flow causing execution of the predetermined response and the deducing of dynamically created conditions responsive to historical data, the dynamically deduced conditions established as part of a learning process including statistical models and pattern recognition, operating within the second node with the option to provide the dynamically deduced conditions to the first processing node to update the deduced conditions
28. The system of claim 21, wherein the instructions for controlling further comprises instructions for controlling the network by monitoring for an occurrence of network security threats within the live-data of the data flow and isolating any detected security threats within the live-data from the data flow of the network, wherein controlling comprises alerting or instructing a network provider to affect an event outcome before the event is finalized.
29. The system of claim 21, wherein the instructions for controlling further comprises instructions for monitoring the live-data of the data flow to create a model of subscriber movement and frequency of presence within each cell tower location, wherein each cell tower has planned and unplanned outages, further wherein as these outages occur, the second processing generating a roster of all subscribers currently within or approaching the cell tower, and generating a notification to be broadcast at least one of generally or selectively based on each subscribers communications preferences.
30. A method for monitoring live-data flow through a network, comprising:
- monitoring, at a first processing node, a mirrored live-data flow of the live-data flow passing through a selected point within the network, wherein the live-data flow comprises data that is in active transmission between endpoints in the network and prior to storage of the data in a database;
- decoding, at the first processing node, each packet within the mirrored data flow according to each protocol associated with a packet, wherein packets have a plurality of protocols associated therewith are decoded in parallel with each other;
- comparing, at the first processing node, each of the decoded packets to at least one of the set of predetermined or deduced conditions;
- executing at least one predetermined or deduced response based upon detection of at least one predetermined or deduced condition within the decoded packets;
- processing, at a second processing node, at least a portion of the decoded packets of the live-data flow causing execution of the at least one predetermined or deduced response using a relational language processing to determine a manner for controlling an operation of the network at a same time the live-data flow is in active transmission between the endpoints in the network, wherein the step of processing further comprises the steps of: receiving the indication of detection of one of the at least one predetermined or deduced conditions; assigning at least one predefined application associated with the indication; processing the decoded data flow using the assigned at least one predefined application to generate at least one control action for controlling the operation in the network at the same time the live-data flow is in active transmission between the endpoints in the network, the step of processing further including processing at least a portion of the decoded packets of the live-data flow causing execution of the predetermined response and the deducing of dynamically created conditions responsive to historical data, the dynamically deduced conditions established as part of a learning process including statistical models and pattern recognition, operating within the second node with the option to provide the dynamically deduced conditions to the first processing node to update the deduced conditions; and
- controlling the operation of the network using the at least one control action.
Type: Application
Filed: Sep 12, 2014
Publication Date: Mar 19, 2015
Inventors: CARISSA RICHARDS (GEORGETOWN, TX), MYVAN QUOC (FREMONT, CA), NEAL CODDINGTON (NOVATO, CA), GEORGE MCCARTHY (MERIDA), HARIHARAN RAMACHANDRAN (SYDNEY)
Application Number: 14/485,172
International Classification: H04L 12/26 (20060101); H04L 12/801 (20060101);