Method and apparatus for implementing container managed batch jobs in an enterprise java bean environment
An improved method, apparatus, and computer instructions for creating and running batch jobs in an object oriented environment, such as a J2EE environment. A request to execute a batch job is received. A deployment descriptor file is processed to identify a batch bean to be invoked. This batch job session bean processes the request, parses deployment descriptor file that comprises definitions of relationships between other helper classes, entity and session beans. The identified batch bean is invoked to execute the batch job step in the order described in the deployment descriptor applying checkpoints at intervals specified in the descriptor.
The present invention is related to the following applications entitled METHOD AND APPARATUS FOR IMPLEMENTING CONTAINER MANAGED USES, OWNERSHIPS, AND REFERENCES IN AN ENTERPRISE JAVABEAN ENVIRONMENT, Ser. No. ______ attorney docket no. AUS920040979US1 filed on ______; SYSTEM AND METHOD TO IMPLEMENT CONTAINER MANAGED STREAMS IN J2EE ENVIRONMENTS, Ser. No. ______ attorney docket no. AUS920040980US1 filed on ______. All of the above related applications are assigned to the same assignee, and incorporated herein by reference.
BACKGROUND OF THE INVENTION1. Technical Field
The present invention relates to an improved data processing system. In particular, the present invention relates to implementing enterprise JavaBeans™ development environment in a data processing system. Still more particularly, the present invention relates to implementing container managed uses, ownerships, reference only, batch processing, and sequential stream relationships in an enterprise JavaBeans™ development environment.
2. Description of Related Art
In most enterprise application development environments, developers often use enterprise JavaBeans™ objects for modeling interactions between components and for managing data persistence in their applications. J2EE Enterprise JavaBeans™ is a specification available from Sun Microsystems, Inc. Examples of enterprise JavaBeans™ (EJB) objects include ‘Entity’ beans and ‘Session’ beans. Entity beans model the persistent data used by the EJB application and application clients. An Entity bean is a type of enterprise JavaBeans™ that persists data in a data source beyond the lifetime of the client application. Session beans are designed to encapsulate the functions of a given task (or session) as requested by clients of the EJB interfaces. Often Session beans themselves are used to interact with the modeled Entity beans to perform some business logic.
Currently, relationships between entity beans in an enterprise application may be maintained by using container managed relationship (CMR) fields. Developers implement CMR fields by specifying the desired relationship between entity beans in the entity bean definition and adding the EJB relationships to a deployment descriptor file. When the enterprise application is deployed, the EJB container automatically enforces referential integrity of the relationships. Thus, when the code of an entity bean is updated, the container automatically updates the related entity bean. In this way, business methods may use CMR accessor methods to manipulate container managed relationships.
While relationships between entity beans are maintained, there is no existing mechanism that represents ‘uses’ relationship between an entity bean and a session bean. Thus, if an entity bean uses a session bean that encapsulates a related but reusable task, developers have to manually perform a lookup of a session bean home interface using Java Native Directory Interface (JNDI) passing a name made available to the entity bean code and then create an instance of the session bean. The container does not manage the relationship or generate relevant code. This manual operation of session bean lookup can be error prone, time-consuming, and inconsistent with the manner in which other container managed relationships are handled. Therefore, it would be advantageous to have an improved method for the container to maintain a ‘uses’ relationship between enterprise JavaBeans™, particularly between entity and session beans, such that implementation of JavaBeans™ may be more simple and more consistent.
In addition to uses relationship, there is also no existing mechanism that represents an ‘owns’ relationship between entity beans. Ownership is a particular type of directional relationship that indicates all access to the target entity of the relationship is implied to pass through the source entity. Thus, the “key” of the target always includes the key of the source. For example, if an order entity bean with an order number as the key “owns” a number of line items, the implicit key of an order line item is the order number plus any other properties of line items that ensure uniqueness, such as the product key or a sequence number.
Currently, an entity definition must explicitly specify the key fields of the owner entity as well the others specific to the target owned entity that insure uniqueness in any context. The disadvantage of this approach is that the owned entity is only usable in the limited context of the ownership relationship that exists at the moment. But that relationship may change. For example, if the developer subsequently decides that an order entity is owned by a customer entity, then the order entity needs to be changed to include the customer key fields as part of its essential state. Because the ownership relationship propagates to all other entities owned by order, the line item entity needs to be modified to include the customer key fields as part of its essential state as well.
Therefore, it would be advantageous to allow a developer to specify only the “key” properties of an entity and assume that when a container managed “ownership” or “owns” relationship is specified between two entities, that the key of the “owner” is propagated to the “owned” entity for purposes of deployment to the underlying database technology. Such an approach will enable an entity definition to be reused in many contexts without changes to the basic “type” definition.
For example, the same line item entity could be used in orders whether owned by customers or not. Further, an entity like an Address could be owned by multiple types of entities simultaneously and even more than once by the same one (such as a shipping and billing address for a customer). Such a mechanism enables all persistent objects to be declared as entity EJBs, with the understanding that the “owns” relationship will make the distinction between deployment options.
With CMR fields, if a relationship is established between entity beans, a method is generated to reference an entity bean from another entity bean. In cases of a one-to-many relationship, the entity bean on one side retrieves a list of related entity beans using the method. Currently, in order to extract primary key values from the list of entity beans returned, developers have to manually write additional code to extract the values. This manual process is costly, since the non-key essential state of the entity (the CMP fields) is typically needed to instantiate the EJB in memory. Furthermore, a getPrimaryKey method has to be invoked on the objects. In addition, this manual process is redundant because the container already has access to the key values when the list is built. Therefore, it would also be advantageous to have an improved method that enables automatically generating a method that performs the manual process to return a list of simple primary key values instead of the list of entity EJBs.
Furthermore, in current J2EE application servers, there is no existing mechanism that supports efficient batch computations. Batch computations are operations that execute highly repetitive tasks according to a preconfigured schedule. For example, a “transfer funds” batch process in a banking application may be needed to repeatedly transfer funds from one account to another as specified by an input file or database that contains the from and to account numbers, along with the amount to transfer. Other examples of batch computations include cases where the account has insufficient funds, a request is then written to an “overdraft” output file, where a final summary record is logged describing the number of funds transferred and total amount of funds transferred successfully as well as the number and total amount of overdrafts.
Typically, an enterprise batch job is scheduled manually by a user using non J2EE solutions. However, the non J2EE solutions do not allow reuse of online J2EE application logic, such as transferring funds from one account to another. Thus, a user has to re-implement the online application logic every time a batch job is run by the non J2EE solutions.
Further, enabling many of the expected qualities of service, such as the ability to checkpoint and restart, requires manual coding to persist the “essential state” of the batch job. Making the checkpoint interval configurable also requires manual coding. In case of a fund transfer batch process, the essential states include the position in the transfer request file, the current sum of funds transferred successfully, and the total number of accounts involved. Requiring manual coding of this restart logic is likewise tedious and error prone.
Therefore, it would be advantageous to have an improved method to support batch computations in J2EE application servers, such that batch applications may be built, deployed, and run in a J2EE environment enabling reuse of application logic.
In addition, even though CMR fields allow establishing multi-cardinality relationships between two entities, the assumption is that a list is returned including all the related entities. There is no existing mechanism for specifying that the multi-cardinality relationship is primarily “one at a time” for the purpose of either reading or writing. This relationship is known as a “streaming” relationship. A fund transfer batch job is one example where a streaming relationship between two entities is especially important. The fund transfer batch job may require reading one fund transfer request at a time or writing one overdraft record at a time.
Typically, because the batch data is associated with file data streams, the code for reading records is manually coded in a non-standard way that prevents the container from managing the efficiency or establishing relationships with other EJBs. Even if such a streaming relationship is coded with standard entity EJBs, the developer must manually code and use a special kind of finder method that returns a single entity when given a current position, possibly among others, for propagation of keys.
Therefore, it would be advantageous to have an improved method that recognizes a relationship between entity beans as an input or output data streaming relationship, such that relevant code may be automatically generated to manage the relationship. In addition, it would be advantageous to have an improved method that automatically generates a method for input stream relationships that returns the last record in the data stream, such that no explicit lookup is necessary in the source code; and a method for output stream relationships that appends a record to the end of the stream.
Furthermore, for batch processing in a J2EE environment, there is no existing mechanism that provides a generic definition for batch processing, including job steps and other configuration information for a batch job. Currently, developers have to customize each batch job step separately, which requires significant development effort. Therefore, it would be advantageous to have a definition language integrated with the EJB container that allows developers to specify batch processing information.
SUMMARY OF THE INVENTIONThe present invention provides an improved method, apparatus, and computer instructions for creating and running batch jobs in an object oriented environment, such as a J2EE environment. A request to execute a batch job is received. A deployment descriptor file is processed to identify a batch bean to be invoked. This batch job session bean processes the request, parses deployment descriptor file that comprises definitions of relationships between other helper classes, entity and session beans. The identified batch bean is invoked to execute the batch job step in the order described in the deployment descriptor applying checkpoints at intervals specified in the descriptor.
BRIEF DESCRIPTION OF THE DRAWINGSThe novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further objectives and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings, wherein:
With reference now to the figures,
In the depicted example, server 104 is connected to network 102 along with storage unit 106. In addition, clients 108, 110, and 112 are connected to network 102. These clients 108, 110, and 112 may be, for example, personal computers or network computers. In the depicted example, server 104 provides data, such as boot files, operating system images, and applications to clients 108-112. Clients 108, 110, and 112 are clients to server 104. Network data processing system 100 may include additional servers, clients, and other devices not shown. In the depicted example, network data processing system 100 is the Internet with network 102 representing a worldwide collection of networks and gateways that use the Transmission Control Protocol/Internet Protocol (TCP/IP) suite of protocols to communicate with one another. At the heart of the Internet is a backbone of high-speed data communication lines between major nodes or host computers, consisting of thousands of commercial, government, educational, and other computer systems that route data and messages. Of course, network data processing system 100 also may be implemented as a number of different types of networks, such as for example, an intranet, a local area network (LAN), or a wide area network (WAN).
Referring to
Peripheral component interconnect (PCI) bus bridge 214 connected to I/O bus 212 provides an interface to PCI local bus 216. A number of modems may be connected to PCI local bus 216. Typical PCI bus implementations will support four PCI expansion slots or add-in connectors. Communications links to clients 108-112 in
Additional PCI bus bridges 222 and 224 provide interfaces for additional PCI local buses 226 and 228, from which additional modems or network adapters may be supported. In this manner, data processing system 200 allows connections to multiple network computers. A memory-mapped graphics adapter 230 and hard disk 232 may also be connected to I/O bus 212 as depicted, either directly or indirectly.
Those of ordinary skill in the art will appreciate that the hardware depicted in
The data processing system depicted in
With reference now to
An operating system runs on processor 302 and is used to coordinate and provide control of various components within data processing system 300 in
Those of ordinary skill in the art will appreciate that the hardware in
As another example, data processing system 300 may be a stand-alone system configured to be bootable without relying on some type of network communication interfaces. As a further example, data processing system 300 may be a personal digital assistant (PDA) device, which is configured with ROM and/or flash ROM in order to provide non-volatile memory for storing operating system files and/or user-generated data.
The depicted example in
The present embodiment provides an improved method, apparatus, and computer instructions for implementing container managed uses relationships between enterprise JavaBeans™ in a J2EE development environment. The present embodiment may be implemented in an application server, such as data processing system 200 in
With the present embodiment, a user may specify a ‘uses’ relationship between source enterprise JavaBeans™ and target enterprise JavaBeans™ in a container managed uses (CMU) section of the deployment descriptor file. The source and target EJBs may be defined using a key, such as the namespace and the home interface name of the bean. When the deployment description file is processed to generate EJBs, the container recognizes which EJB uses which EJB based on the relationship specified in the CMU section of the deployment descriptor file. In turn, a getter method on the session bean is generated and associated with the entity bean. The getter method performs a manual operation to obtain a reference to the session bean through a home interface. The method finds a home interface by performing a JNDI lookup of the session bean and creates a reference to the session bean instance.
For example, a user may specify in the CMU section of the deployment descriptor file that a customer source entity bean uses a credit card target session bean. When the deployment descriptor file is processed, a getCreditCard( ) method is generated by the present embodiment and associated with the customer entity bean. The getCreditCard( ) method performs a manual lookup of the credit card session bean using JNDI based on a key, such as the home interface name and namespace of the credit card session bean, and provides a reference to the credit card target session bean instance.
Thus, with the present embodiment, container managed uses relationships may be maintained between entity beans and session beans. A service locator pattern may be used by entities that are extended to session beans. A user may specify relationships in the deployment descriptor file along with CMR fields. In this way, manual lookup of session bean home interface by developers are eliminated and replaced with a getter method that is automatically generated to perform the session bean lookup based on the specified relationship in the deployment descriptor file.
In addition, by maintaining CMU relationships between EJBs, not only local relationships are maintained, remote EJB relationships are also maintained with handling of remote exceptions. The present embodiment includes an attribute in the CMU descriptor for a remote target session that indicates whether standard remote exceptions are to be passed through to the client or the source entity EJB.
Turning now to
Source EJB 410 may be a subtype of an entity EJB or session EJB. The mechanism of the present embodiment generates a getter method and associates it with source EJB 410 for each session bean it uses. In this example, getAccountAccessSL method 412 provides a reference to SLSB 402, getAccountAccessSF method 414 provides a reference to SFSB 404, getAccountC method 416 provides a reference to CMP EJB 406, and getAccountB method 418 provides a reference to BMP EJB 408. However, it is noted that the names of the methods need not include the types and do not need to be explicitly known to the client code in the source EJB.
Turning now to
Next, the mechanism of the present embodiment processes the deployment descriptor file (step 504) and generates a getter method on the entity bean that provides a reference to the session bean (step 506). Later, when the user invokes the getter method on the entity bean for the session bean (step 508), the getter method performs a JNDI lookup to either create or find a home interface of the session bean (step 510). Then, the mechanism of the present embodiment returns the created or found session bean instance to the user (step 512) and the process terminates thereafter.
In addition to ‘uses’ relationships, the present embodiment also extends the CMR methodology to include ‘owns’ relationship between entity beans, such that the key fields of the owner in the relationship may be propagated to the owned entity transparently. Currently, developers use CMP fields to specify keys and CMR fields to specify relationships between EJBs. However, CMR fields only provide general relationships between two EJBs and are not intended for use in keys. In particular, there is no way for a user to specify a key on one EJB as being part of the key for another EJB. Nor is there a way to specify a directional relationship like ownership that indicates propagation of key fields.
For example, the current system may maintain a relationship between an order bean and a line item bean using CMR fields. The key of a particular line item of an order may be specified using an order id in combination with a product id. The order id is part of the order bean. Thus, part of the key of the particular line item is related to the owner of line item bean, the order bean. There is currently no way to specify both in the context of a single directional relationship.
With the present embodiment, a user may specify a ‘owns’ relationship between the order bean and the line item in a container managed ‘own’ section of the deployment descriptor file. When the mechanism of the present embodiment processes the deployment descriptor file, the container recognizes which EJB owns which EJB.
In addition, the container recognizes that the key field of the owner bean propagates transparently and automatically to the owned bean, providing both key fields and the benefit of a CMR through a <OwnerType> get<OwnerType>( ) method. In the above example, the mechanism of the present embodiment propagates the order id of the order bean to the key of the line item. If an update is made to the specification of key fields of the order, then upon re-deployment, the mechanism of the present embodiment modifies the line item deployment to reflect the new fields that now make up part of its key. Further, if the order is subsequently deployed as being “owned” by another entity, like a customer entity, the mechanism of the present embodiment propagates the key fields of the customer to order and to line item where CMR methods can be automatically generated to getcustomer( ) and getorder( ).
One advantage of the present embodiment is that the data associated with instances owned by another can be stored in the same table as the owning entity. In this case, the key fields of the owning entity need only be stored once per record regardless of the number of owned entities stored in the same record, reducing the amount of data that need be stored and retrieved. The other CMP fields of the owned entities can be mapped to columns that encode both the role and CMP field name. For example, suppose a user uses an ORDER table record to store an order entity and its billing and shipping addresses related through CMO. In this case, the ORDERID field only need be stored in the record once. The CMP fields of the shipping and billing addresses, like the street, city, state and postal code, can be stored in columns with names like SHIPPINGSTREET, BILLINGSTREET, SHIPPINGCITY, BILLINGCITY and so on.
Another advantage of the present embodiment is if each entity type is stored in its own table regardless of role or owner, then the mechanism of the present embodiment may map the key fields of the owner and the role name to columns transparently and automatically to uniquely identify an instance. The mechanism of the present embodiment may map the CMP fields of the type to columns without regard to their role in any owning entity. For example, if all order shipping and billing addresses entities are stored in an ADDRESS table separate from the ORDER table, then the serialized key of the order is stored in a general OWNER column. The ROLE column for a given role would store values like “ORDERSHIPPING” or “ORDERBILLING”. Both OWNER and ROLE would uniquely identify the instance. The CMP fields for an address type would be mapped to their own columns, like STREET, CITY, STATE and POSTALCODE.
Still another advantage of the present embodiment is if each entity associated with a given owner is stored in its own table, then the key fields of the owner type can be automatically and transparently mapped by the mechanism of the present embodiment to their own columns to enable indexing and queries on the owner key fields. The role of the instance, if the same type plays more than one role with respect to the owning entity, is indicated through a ROLE column as described above. And also similar to when entity types are mapped to their own table, the CMP fields can be mapped by the mechanism of the present embodiment to columns without regard to their role in any owning entity. For example, if a user uses an ORDERADDRESS table to store addresses owned by orders, then a column for ORDERID would maintain the key of the owning order, the ROLE column would indicate whether the address is a shipping or billing address through a string or encoded value, and the CMP fields for address would be maintained in columns like STREET, CITY, STATE and POSTALCODE as above.
And still yet another advantage of the present embodiment is if a user stores each ownership relationship in its own table, then the mechanism of the present embodiment maps the key fields of the owner type automatically and transparently to their own columns to enable indexing and queries on the owner key fields. The role of the instance, if the same type plays more than one role with respect to the owning entity, is indicated through the table name and need not be stored at all. As mentioned above, the mechanism of the present embodiment maps the CMP fields to columns without regard to their role in any owning entity. For example, if a user uses ORDERBILLINGADDRESS table to store billing addresses owned by orders, then a column for ORDERID would maintain the key of the owning order, and the CMP fields for address would be maintained in columns like STREET, CITY, STATE and POSTALCODE as above.
It is entirely possible to persistently store the data associated with CMO in files and other non-relational table mechanisms following a similar mapping approach to those described above.
In addition to enabling multiple approaches to reducing redundancy of data, CMO also makes modifying the relationships between entities more efficient. For example, an order may be initially considered “unowned”, and has no owner key to propagate into the tables or other persistence mechanisms. Later, if a customer owns an order and the order owns line items and shipping and billing addresses, the present embodiment makes it possible to transparently and automatically propagate the new ownership relationship into the underlying persistence mechanism when the mechanism of the present embodiment re-deploys the entities after the changes.
Furthermore, the present embodiment of CMO enhances reuse of EJBs as types. For example, a customer may include information such as a preferred home address, billing address, and shipping address. A developer may reuse the address EJB type simply by specifying new CMO relationships between customer and address EJBs for the home, billing and shipping roles Subsequently, a new entity EJB, such as shipper may be developed to fulfill a given application function. It can reuse the address type again by declaring a CMO. The reuses of the address EJB can exploit any of the approaches outlined above to persistently store the data.
Turning now to
With current CMR methodology, a relationship may be specified between order 600 and line items 602. In this illustrative example implementation, a user may store order and line items data in two separate tables of a database, order table 604 and line item table 606. Order table 604 includes a list of order ids, for example, order id 1. Line item table 606 includes a list of line items for an order. In this example, order id 1 has a first line item identified by a product id 1 and a quantity 1. Order id also has a second line item identified by a product id 2 and a quantity 2.
With CMR fields, a relationship is maintained between order id 1 and the two line items using the order id field in line item table 606. Thus, the order id field is duplicated in the database. The mechanism of the present embodiment eliminates the duplicated order ids. Similar to order table 604, order table 608 also has a list of order ids, such as order id 1. However, unlike Line item table 606, line item table 610 uses a single record for the two line items of order id 1. The record includes a product id 1, quantity 1, product id 2, and quantity 2. Since a ‘owns’ relationship is maintained between the order instance and line item instances, there is no need for an order id field in line item table 610. Therefore, the mechanism of the present embodiment eliminates redundant data from the database.
Turning now to
Next, the mechanism of the present embodiment processes the deployment descriptor file (step 704) and generates relationships and keys to the line items and associated them with the order to reference corresponding line items (step 706). When the user later invokes a method on the order to retrieve its line items (step 708), the container recognizes the relationship between the order and its line items (step 710). In turn, the mechanism of the present embodiment returns corresponding line items to the user using the relationships and keys generated (step 712) and the process terminates thereafter.
The present embodiment also provides a lightweight “reference only” relationship between entity beans by extending the CMR methodology to specify a CM Ref relationship for an entity bean, such that only a single primary key or a list of primary keys for the multi-cardinality relationship is returned as an alternative to a list of EJB objects. With CMR fields, when a user specifies a relationship between entity beans, the current system returns only a single or list of EJB objects. In turn, developers have to manually invoke a getPrimaryKey method on each object in the list to determine the primary key value of each object.
With the present embodiment, a user may specify a CMRef relationship between two entity beans in the deployment descriptor file. When the mechanism of the present embodiment processes the deployment descriptor file, the mechanism of the present embodiment generates a get<Role> method that returns a key or list of keys of the related EJB type with the number of keys depending on the cardinality of the CMRef. This get<Role> method is more efficient than the alternative that returns one or more EJBS, since only the key fields need to be retrieved, and no EJB need be instantiated and cached.
For example, a customer entity bean may be related to a list of account entity beans. By declaring a CMR field with a role name of accounts, a getAccounts method is specified. When a user invokes the getAccounts method on or within the customer object, the current system returns a list of account EJB objects. In turn, developers iterate the list of accounts and retrieve each corresponding primary key. This operation is expensive because it requires significant development efforts.
With the present embodiment, a user may specify CMRef relationship with a role of accountIDs in the deployment descriptor file between the customer and the account entity beans. When the mechanism of the present embodiment processes the deployment descriptor file, in addition or alternative to the getAccounts method as discussed above, the mechanism of the present embodiment generates a getAccountIds method to return a list of account ids. In this way, no EJBs are maintained in memory while the primary key values are still accessible to the user.
In another example, a user may specify a CMR relationship between an order and line items with a role name of lineItems. A generated getLineItems method in the order entity bean generally returns a list of line item EJBs. In turn, developers iterate through the list and invoke a getPrimaryKey method on each of the line items to retrieve the primary key value. With CMRef, suppose that a user specifies a CMRef relationship between the order and line items with a role named lineItemKeys. In this case, a getLineItemKeys method in the order entity bean is generated to return only a list of line item keys, not a list of line items. Thus, the mechanism of the present embodiment provides a lightweight solution to the users who only want to access the primary keys of other entity beans.
Turning now to
With the CMR fields, after the current system returns a list of accounts 804, developers have to manually invoke a getPrimaryKey method on each account object in order to retrieve a primary key for each account. However, with CMRef of the present embodiment, along with a getAccounts method, the mechanism of the present embodiment generates a getAccountIds method 806 upon processing the Ref relationships in the deployment description file. The getAccountIds method returns a list of primary keys instead of a list of account objects.
In addition, customer bean 801 shows a customer bean that only specifies a CMRef relationship between customer and accounts. Therefore, customer bean 801 only includes a getAccountKeys method 810, which returns a list of keys for each account object 812. In this way, a ‘lite’ relationship between the customer bean and the account bean is represented by the keys returned by the mechanism of the present embodiment using the getAccountKeys method 810. This difference shows that the name of the CMRef method is specifiable by the developer.
Turning now to
Later, when the user invokes the get<EJB>Keys method on the entity bean (step 908), the mechanism of the present embodiment returns a list of primary keys for the session bean to the user (step 910) and the process terminates thereafter.
In addition to maintaining uses, owns, and ref relationships between entity and session beans, the mechanism of the present embodiment provides a container managed (CM) batch programming model for building batch applications in a J2EE environment. The programming model automatically generates code necessary for building the batch jobs in a J2EE environment.
In a preferred embodiment, when a job scheduler starts a batch job, the present embodiment accesses an XJCL definition to determine one or more batch beans to invoke. XJCL definition is an extensible markup language (XML) definition provided by the present embodiment that includes a number of job steps, classes, conditions, checkpoints, and other configuration information necessary for a batch job. From the XJCL definition, the present embodiment creates a list of batch beans needed for the batch job, for example, to calculate interests for a list of bank accounts. The present embodiment then creates a global transaction that invokes the batch beans inside a loop until a job completion indication is received, for example, until all of the interests for the list of bank accounts are calculated.
Each time the batch bean returns, the batch container determines if a checkpoint should be performed. The batch bean then uses a checkpoint to determine whether a condition is met during the process of a batch job. If a checkpoint should be performed, the batch container extracts the current cursor and commits the global transaction. The batch container then starts a new global transaction for the next job step. If additional job steps are present for the current batch job, the mechanism of the present embodiment continues to access the XJCL definition to invoke the next batch bean until all job steps are processed.
In processing job steps, the present embodiment takes a data stream as an input to a batch job. A user may specify how an entity bean can be associated with the data stream in a container managed stream (CMS) section of the deployment descriptor file. When the mechanism of the present embodiment processes the deployment descriptor file, the mechanism of the present embodiment creates a stream object for the data stream.
In addition, the mechanism of the present embodiment generates a get<Stream> and a put<Stream> method and associates them with the entity bean, such that the user may manipulate attributes of the data stream directly. From a batch perspective, several input and output streams may now be handled in a batch job using the present embodiment. Furthermore, since a CMS is a special form of CMO relationship (ownership), the get<Stream> method takes no parameters and simply returns the last stream object processed for the owning entity. In this way, a user may directly access the last record processed from thousands of records stored within the stream without a key.
Similarly, a user may specify the relationship between an owning entity bean and an owned data stream for an output data stream. In the bank statement example above, if an account requires a special statement to be printed, the batch bean may mark the account and reroute the account to a different file, such as an output data stream. Using a put<Stream> method, the mechanism of the present embodiment appends the special account to the next record in the stream object without having to perform a lookup of the last record with a key.
It should be noted that the data representing the persistent data of an output stream can be mapped to another input stream, enabling the developer to create chains of entities linked together by container managed streams. A preferred embodiment of these entities is described in CMBatch below.
In addition to getting input data streams or putting output data streams, the mechanism of the present embodiment allows developers to implement container managed uses, refs, and owns relationships between the batch bean and other necessary components in order to perform the batch job. For example, the batch bean may use a number of batchable components to access data from a database. The batch bean may also own a number of data record beans that store the records of the input data stream.
Turning now to
External components include XJCL 1002, check point cursor and job status 1004, and external database 1006. Internal components of CM batch include input batch data stream 1008, batch job stateless session bean 1010, batch bean 1012, data record beans 1014, batchable components 1016, and output batch data streams 1018.
Typically, when a batch job scheduler starts a batch job, the batch container accesses an XJCL definition to determine the appropriate batch bean to be invoked. If the job scheduler starts multiple batch jobs, the batch container accesses multiple XJCL definitions to determine a number of batch beans to be invoked. In addition to the batch bean, XJCL definition also defines a sequence of job steps to be taken by each batch bean, the server or pool to handle the batch job, and other configuration information for the batch job.
After the batch container invokes the batch bean, the batch container may invoke batch job stateless session bean 1010 to dispatch the job to batch bean 1012.
Batch bean 1012 takes input batch data stream 1008 generated by the get<Stream> method using CM stream and performs individual job step for each record in the input data stream. For each record in the data stream, batch bean 1012 uses data record bean 1014 for storing data. In addition to input batch data stream 1008, batch bean 1012 may use batchable components 1016 to retrieve data from external database 1006.
Each time batch bean 1012 returns, the batch container determines whether a checkpoint should be performed using a checkpoint cursor and job status 1004. If checkpoint should be performed, the batch container extracts the current cursor and commits the global transaction. The batch container starts a new global transaction. If a user specifies additional job steps in the XJCL definition, the batch container accesses XJCL 1002 repeatedly to determine the next batch bean to be invoked.
During the batch process, if a record has to be written to a different location, for example, for a special account, batch bean 1012 may invoke put<Stream> method generated by CM stream and puts the record in output batch data stream 1018.
Turning now to
This detailed illustrative example shows that a posting step class 1100 includes a number of methods, including createJobstep 1102, destroyJobStep 1104, and processJobStep 1106. ProcessJobStep 1106 gets the next posting object from an input data stream using getNextPosting method 1108 and identifies the transaction key of the next posting. If the transaction key is zero, ProcessJobStep 1106 calls a getAccountAccess( ).creditAccount( ) 1110 to set up a credit account for the transaction based on the account number and amount of the next posting.
GetAccountAccess( ).creditAccount( ) 1110 illustrates an example of CMU relationship specified between the AccountAccess entity bean and the CreditAccount session bean. In this case, since a relationship between AccountAccess and CreditAccount is specified, CreditAccount may be directly accessed from the Account access entity bean. Similarly, ProcessJobStep 1106 may call getAccountAccess( ).debitAccount( ) 1112 to set up a debit account for the transaction based on the account number and amount of the next posting. However, if no transaction key is found, an overdraft exception is thrown and ProcessJobStep 1106 calls a putNextOverdraft method 1113 to handle the overdraft error.
At the end of posting step class 1100, posting step class 1100 includes a number of sections defining a CMB section with a getTotalPosting method 1114, setTotalPosting method 1116; a first CMS input section with a getNextPosting method 1118; a CMS output section with a putNextOverdraft method 1120; and a CMU section with a getAccountAccess method 1122. A user may use these sections to implement container managed uses, stream, and batch in an enterprise JavaBeans™ environment.
Turning now to
Posting step entity bean class 1200 includes implementations for methods defined in the CMU, CMB, and CMS sections of posting step abstract class 1100 in
Another example implementation of CMS in posting step entity bean class 1200 is getNextPosting method 1204, which is a CMS method. GetNextPosting 1204 takes an input batch data stream from a posting stream config using a getBatchDataStream method 1206 and returns the next record in the posting stream to the user using a getNextRecord method 1207. The mechanism of the present embodiment generates a GetBatchDataStreamPosting method 1206 to retrieve an input stream from a data stream.
Posting stream is described in further detail in
Another example implementation of CMS in posting step entity bean class 1200 is a putNextOverdraft method 1208, which gets an overdraft stream from an overdraft stream config using a getBatchDataStream method 1209 and puts an overdraft value into the next record of the overdraft using a putNextRecord method 1210. Overdraft stream is described in further detail in
Turning now to
Turning now to
Turning now to
In this example implementation, the name of the batch job is ‘Testjob’ 1502 and ‘jndi-name’ 1504 describes the location of the home interface for ‘TestJob’ 1502. In addition to job name and location, XJCL definition 1500 uses a ‘job-scheduling-criteria’ 1506 to specify which pool of application servers is used to process the batch job. Next, XJCL definition 1500 uses a step-scheduling-criteria 1508 to specify the order in which job steps are scheduled. For example, job steps may be scheduled to execute sequentially or in parallel. Sequentially means that one step is scheduled after a previous step is complete. In parallel means that two job steps may be executed at the same time.
In addition, XJCL definition 1500 uses a checkpoint-algorithm 1510 to specify a checkpoint algorithm to be used during batch processing. A checkpoint algorithm may be time-based with a preset time interval or record-based. With time-based checkpoint, the batch bean may perform a checkpoint every 5 seconds. If 1000 records may be processed in 2 seconds and a failure occurs in 6 seconds, a checkpoint at 5 seconds has already committed the first 2000 records before the failure occurs. When the batch job is restarted, the batch process may begin at record 2001 instead of record 1. With record-based checkpoint, the batch bean may perform a checkpoint every 1000 records and every 1000 records is committed. Thus, if a failure occurs at 3100, the batch process may begin at record 3001 instead of record 1.
XJCL definition 1500 uses a results-algorithm 1512 to specify a function to perform after the batch job is completed. For example, the result algorithm may calculate a job sum at the end of the batch job for a total number. Subsequently, XJCL definition 1500 includes a number of job steps, which identifies each job step to be performed for the batch job. Job-step ‘Step1’ 1514 includes a jndi-name, which identifies the location of the job step. Similar to TestJob 1500, job-step ‘Step1’ 1514 also includes a checkpoint algorithm and a result algorithm.
In addition, job-step ‘Step1’ 1514 includes a batch-data-stream, which specifies an input data stream that the batch bean takes as an input or an output data stream that the batch bean writes to as an output. In this example, the logical name of the output stream is ‘myoutput’ and the name of the class is ‘PostingOutputStream.’
Job-step ‘Step2’ 1516 is similar to job-step ‘Step1’ 1514, except that job-step ‘Step2’ 1516 includes a step-scheduling condition 1518, which evaluates a resource-expression ‘ReturnCode_Step1’ to determine if it equals to 0 before scheduling job-step ‘Step2’ 1516.
In addition, job-step ‘Step2’ 1516 takes an input data stream named ‘myinput’ 1520 in
In summary, the present embodiment enables developers to implement container managed uses, owns, and reference relationships between entity beans and session beans without having to manually perform a lookup of the session bean home interfaces. In addition, the present embodiment provides a generic programming model for batch processing in a J2EE application environment, with the capability of inputting or outputting a data stream. Furthermore, the present embodiment provides a generic definition, XJCL definition, for specifying configuration information of a batch job, such that a number of job steps, conditions, and other necessary information for a batch job may be specified.
It is important to note that while the present embodiment has been described in the context of a fully functioning data processing system, those of ordinary skill in the art will appreciate that the processes of the present embodiment are capable of being distributed in the form of a computer readable medium of instructions and a variety of forms and that the present embodiment applies equally regardless of the particular type of signal bearing media actually used to carry out the distribution. Examples of computer readable media include recordable-type media, such as a floppy disk, a hard disk drive, a RAM, CD-ROMs, DVD-ROMs, and transmission-type media, such as digital and analog communications links, wired or wireless communications links using transmission forms, such as, for example, radio frequency and light wave transmissions. The computer readable media may take the form of coded formats that are decoded for actual use in a particular data processing system.
The description of the present embodiment has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the embodiment in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiment was chosen and described in order to best explain the principles of the embodiment, the practical application, and to enable others of ordinary skill in the art to understand the embodiment for various embodiments with various modifications as are suited to the particular use contemplated.
Claims
1. A method in a data processing system for generating batch jobs in an object oriented environment, the method comprising:
- receiving a request to execute a batch job;
- responsive to receiving the request, processing a deployment descriptor file to identify a batch bean for invocation to form an identified batch bean, wherein the deployment descriptor file comprises definitions of relationships between a plurality of entity and session beans; and
- invoking the identified batch bean to execute the batch job.
2. The method of claim 1, wherein the invoking step includes:
- repeatedly invoking the identified batch bean until an indication is returned, indicating that a step for the identified batch bean is completed or an exception is thrown that prevents further processing.
3. The method of claim 2, wherein each invocation of the identified batch bean is used to process a record in a set of records for the batch job.
4. The method of claim 2, wherein the identified batch bean returns on each invocation of the identified batch bean.
5. The method of claim 4 further comprising:
- responsive to the identified batch bean returning, determining whether a check point should be performed.
6. The method of claim 2, wherein identified batch bean receives the set of records in an input batch data stream.
7. The method of claim 1, wherein the object oriented environment is an enterprise JavaBeans environment.
8. A data processing system for generating batch jobs in an object oriented environment, the data processing system comprising:
- receiving means for receiving a request to execute a batch job;
- processing means, responsive to receiving the request, for processing a deployment descriptor file to identify a batch bean for invocation to form an identified batch bean, wherein the deployment descriptor file comprises definitions of relationships between a plurality of entity and session beans; and
- invoking means for invoking the identified batch bean to execute the batch job.
9. The data processing system of claim 8, wherein the invoking step includes:
- repeating means for repeatedly invoking the identified batch bean until an indication is returned, indicating that a step for the identified batch bean is completed or an exception is thrown that prevents further processing.
10. The data processing system of claim 9, wherein each invocation of the identified batch bean is used to process a record in a set of records for the batch job.
11. The data processing system of claim 9, wherein the identified batch bean returns on each invocation of the identified batch bean.
12. The data processing system of claim 11 further comprising:
- determining means, responsive to the identified batch bean returning, for determining whether a check point should be performed.
13. The data processing system of claim 9, wherein identified batch bean receives the set of records in an input batch data stream.
14. A computer program product in a data processing system for generating batch jobs in an object oriented environment, the computer program product comprising:
- first instructions for receiving a request to execute a batch job;
- second instructions, responsive to receiving the request, for processing a deployment descriptor file to identify a batch bean for invocation to form an identified batch bean, wherein the deployment descriptor file comprises definitions of relationships between a plurality of entity and session beans; and
- third instructions for invoking the identified batch bean to execute the batch job.
15. The computer program product of claim 14, wherein the third instructions includes:
- sub instructions for repeatedly invoking the identified batch bean until an indication is returned, indicating that a step for the identified batch bean is completed or an exception is thrown that prevents further processing.
16. The computer program product of claim 15, wherein each invocation of the identified batch bean is used to process a record in a set of records for the batch job.
17. The computer program product of claim 15, wherein the identified batch bean returns on each invocation of the identified batch bean.
18. The method of claim 17 further comprising:
- first instructions, responsive to the identified batch bean returning, for determining whether a check point should be performed.
19. The computer program product of claim 15, wherein identified batch bean receives the set of records in an input batch data stream.
20. The computer program product of claim 14, wherein the object oriented environment is an enterprise JavaBeans environment.
Type: Application
Filed: Jan 7, 2005
Publication Date: Jul 13, 2006
Inventors: Geoffrey Hambrick (Round Rock, TX), Robert High (Round Rock, TX), Rodney Little (Wappingers Falls, NY), Sridhar Sudarsan (Austin, TX)
Application Number: 11/030,778
International Classification: G06F 7/00 (20060101);