System and method for data source flattening

Info

Publication number: 20030097364
Type: Application
Filed: Nov 13, 2001
Publication Date: May 22, 2003
Patent Grant number: 6799182
Inventor: Anthony P. Bata (Superior, CO)
Application Number: 10007971

Abstract

A system and method for enabling data exchanges between various data processing systems, including disparate systems, is described. In one exemplary embodiment, the present invention can receive a hierarchical data structure containing a plurality of data categories; identify a plurality of conversion targets included in the plurality of data categories; generate a plurality of tabular data structures, each of the plurality of tabular data structures corresponding to one of the plurality of conversion targets; and join each of the generated plurality of tabular data structures.

Description

Description

RELATED APPLICATIONS

[0001] The present application is related to commonly owned and assigned application nos.

[0002] Attorney Docket No. 004608. P001, entitled Presenting and Structurally Representing General-Purpose Data Sources, filed on Mar. 2, 2001;

[0003] Attorney Docket No. RIOL-001/00US, entitled System and Method for Heterogeneous Data Source Integration, filed Jul. 30, 2001;

[0004] Attorney Docket No. 004608.P002, entitled Manipulating General Purpose Data Sources Responsive to File System Operations, filed on Sep. 14, 2001; and

[0005] Attorney Docket No. 004608.P003, entitled XML Presentation of General-Purpose Data Sources, filed on Sep. 14, 2001; all of which are incorporated herein by reference.

FIELD OF THE INVENTION

[0006] The present invention relates to data processing systems. In particular, but not by way of limitation, the present invention relates to systems and methods for enabling data exchanges between various data processing systems, including disparate data processing systems.

BACKGROUND OF THE INVENTION

[0007] Companies are leveraging powerful new business-to-business (B2B) applications and network-based services to collaborate with customers and trading partners throughout the value chain. These companies are redefining existing relationships; creating compelling new relationships; and reaching new markets through a variety of online business models such as public e-marketplaces, private exchanges/supply chains, and direct trading partner links.

[0008] To fully capitalize on these new business applications and relationships, individual companies and e-marketplaces need to provide an efficient means of integrating trading partners into the various trading systems that exist today and that will exist tomorrow. Although most companies understand the benefit of such integration, these companies have also seen other companies undergo expensive, time-consuming projects to integrate their business systems, their trading partners' business systems, and the new technology. Rather than embarking on these expensive, time-consuming integration projects, many companies have avoided the process and instead, have opted to use electronic catalogs and to manually process B2B transactions. In other words, many companies have opted to manually reenter data into their backend systems even though the data was previously transmitted to them electronically through, for example, a web site or email.

[0009] Presently, a company's option for integrating disparate internal and external business systems centers on traditional middleware and custom-coded adapter technology—many adapters are built on a prepackaged adapter that requires significant customization. Middleware and adapters, however, typically cost hundreds of thousands of dollars and take many months to implement. Although these systems are functional, they are not satisfactory. Accordingly, a system and method are needed to address the shortfalls of present technology and to provide other new and innovative features.

SUMMARY OF THE INVENTION

[0010] Exemplary embodiments of the present invention that are shown in the drawings are summarized below. These and other embodiments are more fully described in the Detailed Description section. It is to be understood, however, that there is no intention to limit the invention to the forms described in this Summary of the Invention or in the Detailed Description. One skilled in the art can recognize that there are numerous modifications, equivalents and alternative constructions that fall within the spirit and scope of the invention as expressed in the claims.

[0011] The present invention can provide a system and method for enabling data exchanges between various data processing systems, including disparate systems, without the typical custom computer code generation generally required to accomplish such integration. In one exemplary embodiment, the present invention can include a message bus disposed between a set of data source recognizers and a set of content recognizers. In this embodiment, the data source recognizers are configured (1) to monitor for incoming/received data originating from an associated data provider such as a trading partner; (2) to extract the received data from a data handler, such as a web server; and (3) post the received data to the message bus. Once the data has been posted to the message bus, the individual content recognizers inspect the data and, if appropriate, pull the data from the message bus for subsequent processing.

[0012] An individual content recognizer in this embodiment can be configured to pull, for example, a specific transaction type—such as a purchase order received from a particular trading partner—from the message bus. The content recognizer can then perform a series of predetermined transactions using the data pulled from the message bus. These steps can include making calls to an application program interface (API), making inserts/deletes to a database table, retrieving data from a database table, sending responses to trading partners, etc. Thus, the content recognizers cannot only pull data from the message bus, they can also post data to the message bus. Notably, this embodiment of the present invention can initiate these transactions across disparate systems, thereby providing integration both between trading partners and an enterprise and between an enterprise's own backend systems.

[0013] In one embodiment, the present invention achieves integration across disparate systems by joining data from disparate systems rather than only data from related database tables. For example, embodiments of the present invention can join data from data systems such as APIs, web services, messaging systems, databases of different types, event messengers, ERP systems and the like. In the structured query language (SQL) of relational databases, a join is a combination of tables in a database using a common element (key). In this context, the concept of a join can be extended to include a combination of data elements from disparate systems and different data structures (e.g., hierarchical and relational.) Even though data from disparate systems is being joined, the join behaves the same as if two tables from the same database type were being joined. One implementation of the join process is described in commonly owned and assigned U.S. patent application Ser. No. ______, Attorney Docket No. 004608.P001, entitled Presenting and Structurally Representing General-Purpose Data Sources.

[0014] The join of data from these disparate systems can be represented as a node in a hierarchical structure, e.g., tree. This node can then be mapped to the fields of an incoming document so that data fields in the incoming document can be recognized and easily used by the content recognizers, or other modules, in subsequent transactions. By joining disparate data sources, embodiments of the present invention, with a minimal amount of custom-coding, can enable the creation of a transaction process that uses several disparate systems. Joining of data from disparate data sources, however, is not necessarily required for the operation of the present invention.

[0015] In one embodiment, data from disparate data systems can be integrated through a conversion process called flattening. Flattening, in one implementation, can convert hierarchical data into tabular data. For example, flattening can convert an XML document into a table and/or tabular data into an XML document. Notably, flattening can be used in any system that needs to convert between hierarchical data and tabular data.

[0016] As previously stated, the above-described embodiments and implementations are for illustration purposes only. Numerous other embodiments, implementations, and details of the invention are easily recognized by those of skill in the art from the following descriptions and claims.

BRIEF DESCRIPTION OF THE DRAWINGS

[0017] Various objects and advantages and a more complete understanding of the present invention are apparent and more readily appreciated by reference to the following Detailed Description and to the appended claims when taken in conjunction with the accompanying Drawings wherein:

[0018] FIG. 1 is a high level diagram of a system incorporating one embodiment of the present invention;

[0019] FIG. 2 is a block diagram of one embodiment of an information manager constructed in accordance with the principles of the present invention;

[0020] FIG. 3 is a block diagram of an alternate embodiment of an information manager constructed in accordance with the principles of the present invention;

[0021] FIG. 4 is a flowchart of one method in accordance with the present invention for configuring an information manager;

[0022] FIG. 5 is a flowchart of one method in accordance with the present invention for processing a purchase order;

[0023] FIG. 6 is an exemplary XML document;

[0024] FIG. 7 is exemplary tabular data produced be flattening the XML document of FIG. 6;

[0025] FIG. 8 is a flowchart of one method for flattening hierarchical data to produce tabular data;

[0026] FIG. 9 is a flowchart of another method for flattening hierarchical data to produce tabular data;

[0027] FIG. 10 is a flowchart of one method for flattening tabular data to produce hierarchical data; and

[0028] FIG. 11 is a flowchart of another method for flattening tabular data to produce hierarchical data.

DETAILED DESCRIPTION

[0029] Referring now to the drawings, where like or similar elements are designated with identical reference numerals throughout the several views, and referring in particular to FIG. 1, it illustrates a high level diagram of a system 100 incorporating one embodiment of the present invention. This system 100 includes a plurality of trading partners 105 connected with a transaction-conducting enterprise 110 through either the Internet 115 or a value-added network (VAN) 120. As can be appreciated by those of skill in the art, other trading partners could be connected with the enterprise 110 through other means.

[0030] In operation, a trading partner 105 transfers data to the enterprise 110. For example, a trading partner 105 could transfer a purchase order to the enterprise's server 125. The information manager 130, via a local network 132, could then recognize that a purchase order was received from a trading partner 105, parse the purchase order, map the purchase order data fields to a join of the appropriate data structures, and provide the appropriate portions of the data to the relevant backend systems 140. In one embodiment, the purchase order could be added to the database 140A and an inventory management system 140C (shown as a custom application) could be updated to reflect the quantities ordered in the received purchase order. These steps are described in more detail below. Notably, however, the information manger 130 can process data received in most any format and communicate that data to various, disparate backend systems 140 with a minimal amount of custom coding.

[0031] Referring now to FIG. 2, it is a block diagram of one embodiment of an information manager 130 constructed in accordance with the principles of the present invention. In this embodiment, a plurality of data providers 145 are in communication with the information manager 130. The term “data providers” is used to refer generally to any system that exchanges data with the information manager 130. For example, the data providers 145 include trading partners 105 and/or the enterprise's backend systems 140 (both shown in FIG. 1.) Moreover, data providers 145 are assumed to provide documents of data to the enterprise 110. Thus, the data received by the data sources is referred to herein as documents. However, the data sources are by no means limited to receiving only documents and the data providers 145 are not limited to exchanging documents with the information manager.

[0032] The data source recognizers 150 included in the information manager 130 are configured to enable an exchange and manipulation of data between the data providers 145 even though the data providers 145 can include disparate systems. In one embodiment, the data source recognizers 150 are configured to monitor specific communications channels for documents originating from certain data providers. For example, data source recognizer 150A could monitor a web server 125 (shown in FIG. 1) to identify files sent to the enterprise 110 via HTTP. Another data source recognizer 150B could monitor for documents originating from an Oracle™ database 145B (also 140A in FIG. 1) attached to the local network 132 or for documents originating from a web service provider (a SAP system, a Java API, etc.). In addition to monitoring and identifying the source of incoming documents, the data source recognizer 150 can be further configured (1) to recognize data items placed on the message bus 155 by other data source recognizers 150 and/or content recognizers 160, (2) to extract any recognized data items, and (3) to provide, e.g., push, extracted data items from a document to the relevant data providers 145.

[0033] Still referring to FIG. 2, when a data source recognizer 150 recognizes a received document as originating from a monitored source, the data source recognizer 150 can parse the document and post the data contained therein to the message bus 155 or, if necessary, push the document to some other system. For example, if data source recognizer 150A is configured to monitor for documents originating from data provider 145A, and data source recognizer 150A identifies a document received at the web server 125 (shown in FIG. 1) as originating from data provider 145A, data source recognizer 150A can parse the received document and place the resulting data on the message bus 155. Data source recognizer 150A, however, may ignore all other externally received documents.

[0034] Once the document, or the parse results, are placed on the message bus 155, the individual content recognizers 160 can inspect the contents of the data and determine whether to pull the data from the message bus 155 for processing or whether to ignore the data. In one embodiment, data placed on the message bus 155 is subject to flattening by the flattener 162. The flattener 162 can present the content recognizers 150 with data in a familiar format regardless of the original format of the data. Similarly, the flattener 162 can flatten data originating from the content recognizers 160 and bound for the data source recognizers 150. Flattening is discussed in detail with regard to FIGS. 6-11.

[0035] The content recognizers 160 can inspect the document's header data as well as its contents. Thus, a content recognizer 160 can be configured to identify a document based on its contents rather than merely its header information. For example, when content recognizer 160A is configured to monitor for purchase orders from data provider 145A, the content recognizer 160A can search a received document for data provider 145A's name and typical purchase order characteristics. When the content recognizer 160A detects a match with a document on the message bus 155, it can pull the document. Otherwise, it can ignore all other documents on the message bus 155.

[0036] After a content recognizer 160 recognizes and pulls data from the message bus 155, the content recognizer 160 can initiate a transaction process, for example, by calling the transaction processor 165 or by initiating procedures internal to the content recognizer 160. For example, when the content recognizer 160 is configured to monitor for purchase orders from a particular trading partner, the associated transaction process could include a step for storing the purchase order in a database and a step for changing an inventory count in an inventory management system. Generally, the systems that maintain these different data items, i.e., purchase order data and inventory data, are disparate. The purchase order data system, for example, could be an Oracle database system interfaced through a standard Oracle interface and the inventory management system could be a custom system interfaced through an associated API. Despite the differences in the two systems, embodiments of the present invention can access these systems and perform the necessary transactions without necessarily using the expensive adapters and middleware of present technology. The process for performing these transactions is discussed in detail herein.

[0037] Referring now to FIG. 3, it is a block diagram of an alternate embodiment of an information manager 130 constructed in accordance with the principles of the present invention. As with FIG. 3, this embodiment includes a plurality of content recognizers 160 and a plurality of data source recognizers 150. The generic components of FIG. 2, however, have been replaced with detailed components to further illustrate the versatility and adaptability of the information manager 130.

[0038] In particular, this embodiment includes an Oracle data source recognizer 150D, a packaged system data source recognizer (packaged systems include ERP systems), an API data source recognizer 150F, an EDI data source recognizer 150G, a web service data source recognizer 150H, an HTTP data source recognizer 150I, and a trading partner data source recognizer 150J. Each of these data source recognizers 150 is configured to exchange data with an appropriate data provider 145D-145J. Additionally, the generic content recognizers 160A-160B have been replaced with a series of specific content recognizers: a purchase order recognizer 160D, a purchase order acknowledgement recognizer 160E, a price check recognizer 160F, an inventory check recognizer 160G, and an order status recognizer 160H. As can be appreciated, other recognizers, represented by the custom recognizer 160C, can be configured to meet an enterprise's 110 specific needs. Moreover, multiple copies of these content recognizers 160 can be created and custom configured. For example, different purchase order recognizers 160D can be created for different trading partners 145I/105 and 145J/105.

[0039] To further illustrate the operation of the information manger 130, a method for configuring an information manager to process a purchase order is illustrated in FIG. 4, and a run-time operation of the information manager using that a configuration is shown in FIG. 5. Both of these methods are described with relation to FIG. 3 and are exemplary only.

[0040] Referring now to FIG. 4, it illustrates one method for configuring the information manager to process an incoming purchase order originated from a trading partner. To configure the information manager to process a purchase order from a trading partner, such as trading partner 145I/105, an administrator for the information manager 130 generates a data source recognizer, such as data source recognizer 150J (step 170). The data source recognizer can be specifically pointed at the trading partner, i.e., it only listens for documents originating from the trading partner (step 170 ), or the data source recognizer can be pointed to an HTTP connection such that it listens for any documents received via the HTTP connection.

[0041] Once the data source recognizer for the trading partner has been generated, a custom module for the trading partner's purchase order should be created (step 175). In the preferred embodiment, the custom module is represented as a file folder called “trading partner PO” in a Windows™ file structure. Moreover, in the presently preferred embodiment, the data source recognizer can also be logically represented as a file folder in a Windows file structure. As those of skill in the art can understand, the data source recognizer and the custom module could be represented in various other ways, including other file structures such as those in a Solaris™ environment. For the description of FIGS. 4 and 5, however, the method of operation will be described with regard to file folders as commonly seen in the Windows environment. This description is exemplary only.

[0042] The trading partner PO recognizer, represented as the trading partner PO folder, is an empty content recognizer until configured. In other words, the trading partner PO recognizer should be pointed to an input source, pointed to output locations, and given a defined transaction process (step 180). Such configuration could involve configuring the trading partner PO recognizer to look for unique information that will be found in the purchase order. Such unique information can be found in the header and/or in the body of the purchase order received from trading partner 1. Physically, this configuration can be done through changing the properties of the trading partner PO folder rather than through custom coding, as is generally done in traditional integration technologies.

[0043] Once the trading partner PO recognizer has been pointed to an input source and configured as a recognizer, the data source recognizers for any backend systems—or files/tables/programs within a backend system—that will use purchase orders from trading partner should be associated with the trading partner PO recognizer (steps 185 and 190). This association can be completed by copying folders associated with the particular backend systems into the trading partner PO folder (step 195). These folders typically contain information about methods for communicating with the associated backend systems. Moreover, the information manager can “discover“ the interfaces. As previously described, the data providers in FIG. 2 can include these backend systems.

[0044] The information manager next performs a join of the data structures associated with the relevant backend systems (step 200). This join can be completed even if the relevant backend systems are disparate. The join operation can be completed using “drag and drop” mouse operations, without writing any computer code. Diagram A shows such a join where the first data set represents the fields required by an API associated with the inventory management system, the second data set represents the columns in the purchase order database table, and the third data set represents the node resulting from a join of the first data set and the second data sets. Notably, the join shown in Diagram A involves data from disparate systems rather than data merely from different tables within a database. 1 Diagram A: Join Results API Database Attribute 1 Column 1 Attribute 2 Column 2 Attribute 3 Colunm 3 Join Node - Attribute 1 - Attribute 2 - Attribute 3 - Column 1 - Column 2 - Column 3

[0045] The join shown in Diagram A creates a single document, which in the preferred embodiment is an XML document, representing the data needed to process a purchase order received from the trading partner. This document can be used to map the data in an incoming purchase order to the appropriate fields in other systems such as the inventory management system and the purchase order database table. This mapping can be done in a visual interface without writing any custom code, as is generally necessary.

[0046] Finally, the trading partner PO recognizer can be configured to transform, if necessary, and to pass the appropriate data to the backend systems and/or data providers (step 205). For example, the PO recognizer can be configured to condition the purchase order database table for insertion of a new record. Alternatively, the purchase order database table can be configured to accept an insert from the trading partner PO recognizer. Similarly, the trading partner PO recognizer can be configured to pass particular parameters to the API associated with the inventory management system. For example, the trading partner PO recognizer can be configured to pass an item number to the API such that the inventory management system returns the current inventory for that item. If the current inventory is less than the requested number of items in the purchase order, a message can be generated and placed on the message bus.

[0047] Referring now to FIG. 5, there is illustrated the run-time operation of the information manager 130 as configured in FIG. 4. Initially, the trading partner PO recognizer identifies a purchase order received from the trading partner and posts that purchase order to the message bus 155 where it is parsed and broadcast to all of the content recognizers 160 (steps 300 and 315). The preconfigured trading partner PO recognizer inspects the data placed on the message bus 155 and extracts the purchase order (steps 320 and 325). Finally, the trading partner recognizer initiates the defined transaction process (step 330). For example, the appropriate fields can be passed to the inventory management system API, API 145F for example, and the purchase order can be inserted into the purchase order database table, database 145D for example.

[0048] As those skilled in the art can understand, FIGS. 4 and 5 are merely illustrations of the operation of one simple embodiment of the present invention. Accordingly, the present invention should in no way be limited to the implementations described in relation thereto. The present invention can be adapted to meet almost any business demand—whether it be transactional integration or other demand.

[0049] FIGS. 6-11 illustrate portions of the flattening process that can be used to integrate disparate data types. This flattening process has applicability both with the information manger 130 and with other devices known in the art but not necessarily described above. In particular, the flattening process and the flattener 162 can be used independently of the elements previously described.

[0050] Referring now to FIG. 6, it is an exemplary XML document representing a purchase order 335. This purchase order 335, for example, could be received at the HTTP Data Source 150I shown in FIG. 3. The order 335 includes five basic data categories: number, buyer, supplier, line item, and payment instruction. Data categories “number,” “buyer,” “supplier” and “line item” also contain data entries. For example, data category “number” contains the data entry “111-111.” Further, data category “buyer” contains data entry “contact 1,” “contact 2,” and “contact 3.” Data category “supplier” contains data entries “contact 1” and “contact 2,” and data category “line item” contains data entries “item 1,” “item 2,” “item 3,” and “item 4.”

[0051] Referring now to FIG. 7, it is exemplary tabular data structure 340 produced by flattening the purchase order 335 of FIG. 6. In one embodiment, this table 340 could be presented to a data source recognizer 150 or a content recognizer 160 (shown in FIG. 2). The table 340 includes three columns corresponding to the targeted data categories “number,” buyer,” and “item” from the XML document 335. Notably, the targeted data categories can be selected by a user, designated by an API, or otherwise indicated. The data used to populate the rows of the table 340 corresponds to data in the XML document 335. For example, the data in the “number” column corresponds to the data category “number” and the associated entry in the XML document 335. Similarly, the data in the “buyer” column and the “item” column corresponds to the data with the associated data categories in the XML document 335.

[0052] The data in table 340 can be considered as the cross-product of selected data from the XML document 335. In this implementation, the data categories “number,” “buyer,” and “line item” are identified as the target conversion fields for generating the table. Other data categories, such as “supplier” and “payment information,” can be ignored. Because three data categories are targeted, the table should have at least three corresponding columns. The rows in the table represent various combinations of the data found in the XML document 335. Note that the arrangement of the rows and columns could be changed according to need or preference.

[0053] The generation of a tabular data structure from a hierarchical data structure can be better illustrated with reference to the flowchart of FIG. 8. In this embodiment, an XML document, or any type of hierarchical data, is initially received at, for example, the information manger 130 (Step 345), and the document type identified (Step 350). For example, if the XML document received corresponds to the XML document 335 shown in FIG. 6, the document type could be indicated by the data category “Order.”

[0054] Once an XML document has been received and typed, a list of target conversion fields can be identified (Step 355). This list can correspond to the document type and/or other data whether or not that data is included in the XML document. With regard to the XML document 335 in FIG. 6 and the table in FIG. 7, the list of target conversion fields could be the data categories “number,” buyer,” and “item.” Data within the XML document 335 not related to these data categories can be ignored.

[0055] Once the list of target conversion fields have been identified, the number of columns needed for the table being generated can be dynamically determined and the initial portion of the table generated (Steps 360 and 365). Generally, the number of columns matches the number of target conversion fields. For example, the table of FIG. 7 has three columns, each of which matches one target conversion field.

[0056] After the columns of the table have been generated, the individual rows can be populated with the appropriate data. To populate the rows, data is retrieved from the received XML document (Step 370). For example, a first pass over the XML document could result in the data item “111-111” being retrieved from the “number” data category, “contact 1” being retrieved from the “buyer” data category, and “item 1” being retrieved from the “line item” data category. These data items could then be used to populate a row of the table (Step 375). Subsequent rows could be populated such that all possible combinations of the target conversion fields are placed in the table.

[0057] As described above, table rows can be generated and populated dynamically as passes are made over the XML document. In other embodiments, however, the number of rows needed in the table can be determined before any rows are populated. For example, in the XML document 335 of FIG. 6, there are three target conversion fields. The first field has one entry; the second field has three entries, and the third field has four entries. Accordingly, the number of rows required could be determined to be tweleve (1*3*4). Thus, twelve rows could be initially generated, and the fields populated accordingly. The determined number of rows could also be used to estimate a memory requirement for storing the generated table.

[0058] Referring now to FIG. 9, it is a flowchart of another method of generating tabular data from a hierarchical data structure. In this embodiment, an XML document is received and its target conversion fields identified (Steps 380 and 385). A table is then generated for each target conversion field (Step 390). For example, the “Number” field in the XML document of FIG. 6 would result in a 1×1 table containing the data “111-111.” Similarly, the “Buyer” field would result in a 1×3 table containing the data “contact 1,” “contact 2,” and contact 3.”

[0059] After the tables, or at least some of the tables, have been generated, the individual tables can be joined to form a flattened data structure (Step 395). For example, the 1×1 “Number” table can be joined with the 1×3 “Buyer” table. This joined table could then be joined with a 1×4 “Line Item Table” to generate a flattened data structure, e.g., a joined table, representing the target conversion fields “Number,” “Buyer,” and “Line Item.”

[0060] Referring now to FIG. 10, it is a flowchart of one method for flattening tabular data to produce hierarchical data. For example, FIG. 10 shows one method for flattening the table 340 of FIG. 7 to produce portions of the XML document 335 of FIG. 6. In this embodiment, the tabular data is received from some source and the data categories, e.g., column headers, are identified (Steps 400 and 405). With regard to the table in FIG. 7, the data categories include “number,” “buyer,” and “line item.” These data categories, or some indication thereof, are then used as data categories in an XML document or in some other hierarchical data structure.

[0061] Once the data categories are established in the XML document, data from the table can be added to the XML document (Steps 410, 415 and 420). For example, data entries can be identified and added in association with the proper data category identified. Repeat entries in the table would not necessarily need to be repeated in the XML document.

[0062] In another embodiment, data from the table could be added to an XML document template. FIG. 11 illustrates such an embodiment. In this implementation, tabular data is initially received (Step 425). This data is then associated with a corresponding document type (Step 430). Once the document type has been identified, an appropriate template can be retrieved (Step 435). Data categories, e.g., column headers, can be associated with data categories in the template, and data from the table can be used to populate the data entries associated with the data categories (Steps 440 and 445).

[0063] In conclusion, the present invention provides, among other things, a system and method for enabling data exchanges between various data processing systems. Those skilled in the art can readily recognize that numerous variations and substitutions may be made in the invention, its use and its configuration to achieve substantially the same results as achieved by the embodiments described herein. Accordingly, there is no intention to limit the invention to the disclosed exemplary forms. Many variations, modifications and alternative constructions fall within the scope and spirit of the disclosed invention as expressed in the claims.

Claims

1. A method for managing a hierarchical data set, the method comprising:

receiving the hierarchical data set containing a first, second and third data category, wherein the first data category is associated with a first data item included in the hierarchical data set, the second data category is associated with a second data item included in the hierarchical data set, and the third data category is associated with a third data item included in the hierarchical data set;

generating a first tabular data structure corresponding to the first data category and the first data item;

generating a second tabular data structure corresponding to the second data category and the second data item;

generating a third tabular data structure corresponding to the third data category and the third data item;

joining the first tabular data structure with the second tabular data structure to generate a first joined tabular data structure; and

joining the third tabular data structure with the first joined tabular data structure, thereby generating a second joined tabular data structure comprising the first data item, the second data item, and the third data item.

2. The method of claim 1, wherein receiving the hierarchical data set comprises:

receiving an XML document.

3. The method of claim 1, further comprising:

providing the second joined tabular data structure to a content recognizer.

4. The method of claim 1, further comprising:

providing the second joined tabular data structure to a data source recognizer.

5. The method of claim 1, further comprising:

providing the second joined tabular data structure to an API.

6. The method of claim 1, further comprising:

providing the second joined tabular data structure to a database system.

7. A method for managing a hierarchical data structure, the method comprising:

receiving the hierarchical data structure containing a plurality of data categories;

identifying a plurality of conversion targets included in the plurality of data categories;

generating a plurality of tabular data structures, each of the plurality of tabular data structures corresponding to at least one of the plurality of conversion targets; and

joining each of the generated plurality of tabular data structures.

8. The method of claim 7, wherein each the plurality of conversion targets is associated with at least one of a plurality of data items and wherein joining each of the generated plurality of tabular data structures results in a combined tabular data structure comprising each of the plurality of data items.

9. The method of claim 7, further comprising:

identifying a document type associated with the received hierarchical data structure;

wherein the plurality of conversion targets are identified based upon the identified document type.

10. The method of claim 7, further comprising:

providing the join of each of the generated plurality of tabular data structures to one of a content recognizer and a data source recognizer.

11. The method of claim 7, further comprising:

providing the join of each of the generated plurality of tabular data structures to an API.

12. The method of claim 7, further comprising:

providing the join of each of the generated plurality of tabular data structures to a database system.

13. A system for generating a combined tabular data structure from a hierarchical data structure containing a plurality of data categories, the system comprising:

a processor;

a storage device; and

a plurality of instructions stored upon the storage device, the plurality of instructions configured to cause the processor to

receive a hierarchical data structure containing a plurality of data categories;

identify a plurality of conversion targets included in the plurality of data categories;

generate a plurality of tabular data structures, each of the plurality of tabular data structures corresponding to at least one of the plurality of conversion targets; and

join each of the generated plurality of tabular data structures.

14. The system of claim 13, wherein the hierarchical data structure comprises:

an XML document.

15. The system of claim 13, wherein the plurality of instructions are further configured to cause the processor to:

identify a document type associated with the received hierarchical data structure;

wherein the plurality of conversion targets are identified based upon the identified document type.

16. The system of claim 13, wherein each the plurality of conversion targets is associated with at least one of a plurality of data items and wherein joining each of the generated plurality of tabular data structures results in a combined tabular data structure comprising each of the plurality of data items.

17. The system of claim 13, wherein the plurality of instructions are further configured to cause the processor to:

identify a document type associated with the received hierarchical data structure;

wherein the plurality of conversion targets are identified based upon the identified document type.

18. The system of claim 13, wherein the plurality of instructions are further configured to cause the processor to:

provide the join of each of the generated plurality of tabular data structures to one of a content recognizer and a data source recognizer.

19. The system of claim 13, wherein the plurality of instructions are further configured to cause the processor to:

provide the join of each of the generated plurality of tabular data structures to an API.

20. The system of claim 13, wherein the plurality of instructions are further configured to cause the processor to:

provide the join of each of the generated plurality of tabular data structures to a database system.

21. An article of manufacture for generating a combined tabular data structure from a received hierarchical data structure containing a plurality of data categories, the article of manufacture comprising:

a storage medium; and

a plurality of instructions stored upon the storage medium, the plurality of instructions configured to cause a processor to:

receive a hierarchical data structure containing a plurality of data categories;

identify a plurality of conversion targets included in the plurality of data categories;

generate a plurality of tabular data structures, each of the plurality of tabular data structures corresponding to at least one of the plurality of conversion targets; and

join each of the generated plurality of tabular data structures.

22. The system of claim 21, wherein the hierarchical data structure comprises:

an XML document.

23. The system of claim 21, wherein the plurality of instructions are further configured to cause the processor to:

identify a document type associated with the received hierarchical data structure;

wherein the plurality of conversion targets are identified based upon the identified document type.

24. A system for managing a hierarchical data structure, the system comprising:

means for receiving the hierarchical data structure containing a first, second and third data category, wherein the first data category is associated with a first data item, the second data category is associated with a second data item, and the third data category is associated with a third data item;

means for generating a first tabular data structure corresponding to the first data category and the first data item;

means for generating a second tabular data structure corresponding to the second data category and the second data item;

means for generating a third tabular data structure corresponding to the third data category and the third data item;

means for joining the first tabular data structure with the second tabular data structure to generate a first joined tabular data structure; and

means for joining the third tabular data structure with the joined tabular data structure, thereby creating a second joined tabular data structure.

25. The system of claim 24, further comprising:

a data source recognizing means for receiving the second joined tabular data structure.

26. The system of claim 24, further comprising:

a programming interface means for receiving the second joined tabular data structure.

27. The system of claim 24, further comprising:

a data storage means for receiving the second joined tabular data structure.

28. A system for managing a data item, the system comprising:

a plurality of data source recognizers;

a plurality of content recognizers;

a data flattener; and

a message bus connected with the plurality of data source recognizers, the plurality of content recognizers, and the data flattener.