Information processing system and method of controlling information processing system
The present invention provides an information processing system which comprises multiple processor nodes creating parallel computers, an information transmission line for connecting the processor nodes, and a separation switch provided on the information transmission line for separating the information transmission line so that the processor nodes create multiple parallel computers independent of one another.
Latest Patents:
1. Field of the Invention
The present invention relates to an information processing system and a method of controlling the information processing system, and particularly to a technique that can be effectively applied to an information processing technique or the like such as the parallel processing based on load balancing by using a plurality of processor nodes and the like.
2. Description of the Related Art
High performance computers that are capable of simulating various physical phenomena in the natural field or in manufactured objects have been used for designs and simulations in the fields of nuclear utilization, automobiles, ships, aircrafts, high-rise buildings and the like. Also, recently, high performance computers have been used in the field of biotechnology and chemistry, for such purposes as molecular design, genetic analysis and the like, in order to be utilized not only in universities or research laboratories, but also in business enterprises. High performance computers are used for various processor applications and the processing time thereof varies very much.
As methods for realizing the above high performance computer, the techniques below are used to increase the speed of recursive computations whose computation amount is large (e.g., the matrix computation which is conducted frequently in the sci-tech fields).
Specifically, there is (1) a technique for enhancing the performance of each processor itself, (2) a technique for enhancing the degree of the parallelism of multiple parallel-connected processors and (3) a technique for realizing the parallel processing by using a special computation device called a vector processor.
Usually, a high performance computer is realized by selecting one or more of the above techniques (1) to (3). In the case of the parallel computer whose processors exhibit a high degree of parallelism, enhancement of the performance of the network connecting the various processors becomes more important.
In many cases, when an application requires a large scale computation to be executed, the computation is conducted with a priority on reducing the computation time by occupying the entire high performance computer.
Conversely, when an application requires just a small or middle scale computation to be executed, it is advantageous to execute simultaneously different programs to promote efficient utilization of the above high performance computer. In this case, a consideration for security is necessary so that there is no leakage of information among users between the execution of each application's computations.
Especially in the latter case, it is important to heighten security among the various users. A software counter measure is possible to implement, in which, for example, user specific information is included in the information transmitted among processor nodes so that processes may discriminate by user on what information is used. However, this presents a new problem in that the delay time of the information transmission is increased by the overhead of the transmitted information that the user discrimination process entails.
Accordingly, in a conventional high performance computer, there is a tendency to simplify the processes used to transmit information between processor nodes to increase the performance of the information transmission speed and the transmission capacity among processor nodes. Therefore, a software counter measure is difficult to employ because it causes deterioration of performance and more complexity in configuration.
In Japanese Patent Application Publication No. 2004-532447, a technique is disclosed in which, in a parallel computer a group consisting of redundant spare processors is provided in order to realize fault tolerance by controlling the computer with software to replace a group that has experienced hardware failure. However, in the above Japanese Patent Application Publication No. 2004-532447, the above technical problem regarding securing the parallel computer when the parallel computer is simultaneously used by a plurality of different users is not recognized.
SUMMARY OF THE INVENTIONOne objective of the present invention is to provide an information processing technique that realizes a simultaneous use of a parallel computer by a plurality of users with a high level of security and without deteriorating the computation performance.
Another object of the present invention is to provide an information processing technique that realizes a parallel execution of multiple application programs with a variety of computation scales while maintaining the security among application programs.
Another object of the present invention is to realize an enhancement in the rate of operation of a parallel computer comprised of multiple processor nodes.
Another object of the present invention is to provide an information processing technique that realizes an enhancement in the failure resistance of a parallel computer by the separation of processor nodes in a unit of each processor node.
The first aspect of the present invention provides an information processing system, comprised of a plurality of processor nodes for creating parallel computers, an information transmission line for connecting the processor nodes, and a separation switch provided on the information transmission line for separating the information transmission line so that the multiple processor nodes may create multiple parallel computers that are independent of one another.
A second aspect of the present invention provides the information processing system as detailed in the first aspect, wherein the separation switch physically separates the information transmission line.
A third aspect of the present invention provides the information processing system as detailed in the first aspect, wherein each of the processor nodes comprises an input port and an output port, the separation switch holds a switch matrix for connecting multiple connection ports to which the input port and the output port are connected and the arbitrary connection port via the information transmission line, the switch matrix connects the input ports and the output ports of the plurality of the processor nodes to constitute loops, and the switch matrix skips and excludes a pair of the input port and the output port of each of the processor nodes from the loops so that the disconnection from the parallel computer is conducted in a unit of each of the processor nodes.
A fourth aspect of the present invention provides an information processing system, comprising first switches for binding grouped processor nodes and controlling routes of information transmitted among the processor nodes, a second switch for controlling routes of information transmitted among the processor nodes among the groups that the processor nodes are connected in a unit of the group via the first switches, and a third switch provided between the first switches and the second switch for controlling the presence and the absence of the connection with the second switch regarding each of the groups.
A fifth aspect of the present invention provides a method of controlling an information processing system to connect multiple processor nodes via an optical transmission line and cause the processor nodes to operate as parallel computers, establishing a step of arranging an optical switch on the optical transmission line, and a step of causing the processor nodes to operate as parallel computers that are independent of one another by separating the optical transmission line by the optical switch as occasion demands.
A sixth aspect of the present invention provides a method of controlling an information processing system in which a fat tree is established by binding multiple grouped processor nodes by first switches for controlling routes of information transmitted among the processor nodes in the groups and by binding multiple first switches by a second switch for controlling routes of information transmitted among the processor nodes among the various groups, creating a step of arranging a third switch on an information transmission line between the first switches and the second switch, and a step of constructing a plurality of parallel computers which are independent of one another in a unit of the group as a minimal unit by controlling the presence and the absence of the connection with the second switch regarding each of the groups by the third switch.
BRIEF DESCRIPTION OF THE DRAWINGS
Hereinafter, embodiments of the present invention will be explained in detail with reference to the drawings.
As shown in
Each processor node 11 comprises a computation core 11a, a communication interface 11d for connecting the computation core 11a to an optical signal-outputting unit 21 and an optical signal-inputting unit 22. The computation core 11a comprises, for example, a CPU 11c for executing computation processes, and memory, 11b in which information such as a program or data and the like for controlling the CPU 11c is stored.
In the optical signal-outputting unit 21 connected to the communication interface 11d, there is a plurality of fixed wavelength optical transmitters 21a, and each of the fixed wavelength optical transmitters 21a is connected to the vertical loop optical fibers 12 and the horizontal loop optical fibers 13.
In the optical signal-inputting unit 22 connected to the communication interface 11d, there is a plurality of fixed wavelength optical receivers 22a, and the vertical loop optical fibers 12 and the horizontal loop optical fibers 13 are connected to each of the fixed wavelength optical receivers 22a.
A message to be transmitted from one processor node 11 to other processor nodes is converted into an optical signal by the optical signal outputting unit 21, output to the vertical loop optical fibers 12 and the horizontal loop optical fibers 13, received by the optical signal-inputting unit 22, and transmitted to the computation core 11a after being converted into an electrical signal as occasion demands.
When the information processing system 10 in the two dimensional torus connection is realized as shown in
In the present embodiment, an optical switch 14 (a separation switch) is provided on the route of the horizontal loop optical fiber. The optical switch 14 is comprised of multiple connection ports 14a, a switch matrix 14b for setting a connection route among the connection ports 14a and a separation control-inputting unit 14c for externally controlling the setting operation of the connection route by the switch matrix 14b.
In the configuration of
The switch matrix 14b is comprised of, for example, a plurality of micro movable mirrors each in a size of a cross section of an optical fiber created by employing a micro processing technique such as a micro electro mechanical system (MEMS) or the like, a driving circuit system for driving the movable mirrors by the static electricity or the like, and further, an optical system or the like for converging the optical signals which are incident and reflected on/by the movable mirrors. Accordingly, the transmission delay in the connected state and the separated state of the horizontal loop optical fiber 13 in the optical switch 14 is generally very small.
Each of the horizontal loop optical fibers 13 is divided by the optical switch 14 and each of the ends by the division is connected to its corresponding connection port 14a.
In the default state of the separation control-inputting unit 14c, the connection among the connection ports 14a is set so that each horizontal loop optical fiber 13 constitutes one loop. In this state, one parallel computer C0 is created by all of the processor nodes 11 (eleven in this case).
Also, when a separation instruction of the horizontal loop optical fibers 13 is made by the separation control-inputting unit 14c, in the switch matrix 14b, the connection routes via the separation control-inputting unit 14c are set so that a couple of connection ends of each of the horizontal loop optical fibers 13 create a loop as shown in
In this way are the multiple processor nodes 11 that were connected via the horizontal loop optical fibers 13 in the row direction separated into a group of separated loop optical fibers 13a (eight nodes in this case) and a group of separated loop optical fibers 13b (eight nodes in this case) to create a parallel computer C1 and a parallel computer C2 respectively, each being independent of the other.
In this case, in the separated state as shown in
Therefore, the leakage of information can be certainly avoided between a simulation program executed by a user who uses the parallel computer C1 and another simulation program executed by another user who uses the parallel computer C2.
Further, complex software processes such as inserting special information in messages transmitted and received among the processor nodes 11 or reading the inserted special information from the message to sort it or the like for the sake of avoiding the leakage of information in the case that the parallel computer C0 is to be operated as parallel computers C1 and C2 is not necessary at all. Accordingly, the overhead due to the above software processes does not occur.
Therefore, one parallel computer C0 can operate as the independent parallel computers C1 and C2 without the occurrence of the overhead, while at the same time maintaining the high level security.
Also, when the parallel computer C0 created by causing the optical switch 14 to be in a connecting state and connecting all of the processor nodes 11 in the row direction by one horizontal loop optical fiber 13 is used, the overhead of the transmissions of the optical signals in the optical switch 14 does not occur so that the high performance of the parallel computer C0 is not deteriorated.
Also in this case, an optical switch 14-1 uses the horizontal loop optical fibers 13 as connection means so that the parallel computer C0 is realized by the parallel connection of all of the processor nodes 11 (sixteen nodes in this case) A parallel computation of maximum performance is therefore realized.
Also, the parallel computer C1 and the parallel computer C2 which are physically independent from each other can be realized by setting the optical switch 14-1 in the separating state and separating the horizontal loop optical fibers 13 into a plurality of the separated loop optical fibers 13 and the separated loop optical fibers 13b. In the case of this separating state, the separated ends (connection ends with respect to the optical switch 14) of each optical fiber of the multiplexed horizontal loop optical fibers 13 are loop-connected in the switch matrix 14b similarly as the case in
In the above multiplexing of the vertical loop optical fibers 12 and the horizontal loop optical fibers 13, a larger number of the optical fibers are required. Therefore, an example in which the number of the optical fibers constituting the vertical loop optical fibers 12 and the horizontal loop optical fibers 13 is reduced by using the optical signals of the wavelength division multiplexing (WDM) in
Further, in the connecting state of the optical switch 14-1, the parallel computer C0 is created by the parallel operations of all of the processor nodes 11-1. Also, as occasion demands, the parallel computer C1 and the parallel computer C2 which are independent from each other can be realized when the optical switch 14-1 separates the horizontal wavelength multiplex optical fibers 16 into two groups of separated wavelength multiplex fibers 16a and separated wavelength multiplex fibers 16b.
Also, on the side of the optical signal-inputting unit 22, a branching filter 24 is provided for branching the wavelength division multiplexed optical signals coming from the vertical wavelength multiplex optical fiber 15 and the horizontal wavelength multiplex optical fiber 16 for each wavelength and for inputting to the fixed wavelength optical receivers 22a.
There is merit in the fact that, by wavelength division multiplexing the vertical wavelength multiplex optical fibers 15 and the horizontal wavelength multiplex optical fibers 16 as shown in
In the above case, the optical signal-outputting unit 21 and the optical signal-inputting unit 22 of the processor nodes 11-2 connected to the horizontal loop optical fiber 13 (horizontal wavelength multiplex optical fiber 13c) are also of the intermediate configuration between the configurations of
Generally, because an optical module comprising the coupler 23 and the branching filter 24 and the like used for the wavelength division multiplexing is expensive, the example in which the wavelength multiplexing is conducted only on the routes between the processor nodes between which the optical switch 14-1 is provided is explained in
In addition, in
Similarly as in the case of the two dimensional torus connection of
Specifically, multiple processor nodes 11 are each connected to the electric switches 31 (first switch) via optical fibers 34 for a group including the nodes in a prescribed number, and accommodated in cases 30 in a unit of the group.
The electric switch 31 connects the processor nodes 11 in the case 30 by a full mesh connection by a packet routing technique, for example.
The electric switch 31 of each of the cases 30 is further connected to the higher electric switch 32 (second switch) via the optical fiber 35. The optical fiber 35 transmits, for example, the wavelength division multiplexed optical signals between the electric switch 31 and the electric switch 32.
The electric switch 32 connects the cases 30 (electric switches 31) in the full mesh connection by, for example, a packet routing technique among multiple ports 32a to which the optical fibers 35 are connected from the side of the electric switches 31.
In the above case, on the routes of the optical fibers 35, an optical switch 33 (third switch) is provided. This optical switch 33 is comprised of multiple connection ports 33b to which the optical fibers 35 are connected from both of the electric switches 31 and the electric switch 32, a switch matrix 33a for disconnecting and setting of the connection routes regarding the optical transmission lines between these connection ports 33b and a separation control-inputting unit 33c for externally controlling the operation of the switch matrix 33a.
Further, in the present embodiment, as shown in
Specifically, in the example of
Accordingly, by setting one parallel computer C0 of the information processing system 10 to the separated state or the connected state among the cases 30 as shown in
In the configuration examples of
In the configuration example of
The connection configuration of the logical three dimensional torus connection of
Specifically, in each of the processor nodes 11, three fixed wavelength optical transmitters 21a and three fixed wavelength optical receivers 22a are provided in each of the optical signal-outputting units 21 and the optical signal inputting units 22 for receiving and transmitting the optical signals in the directions of the three dimensions of x, y and z. And each of the fixed wavelength optical transmitters 21a and three fixed wavelength optical receivers 22a are connected to the connection ports 41b of the optical switches 41 via the optical fibers 42.
Each of the optical switches 41 comprises a switch matrix 41a for connecting the input and output of each of the multiple connection ports 42b to other arbitrary connection ports 41b respectively corresponding to the axes of x, y and z. The above connection setting among the connection ports 41b by the switch matrix 41a can be externally controlled by a separation control-inputting unit 41c.
For example, in the example of
Further, when, for example, a failure has occurred in one processor node (processor node N5) or the processor node N5 has become dispensable, only the processor node N5 has to be disconnected from the three dimensional torus connection. In the above case, as shown in
Thereby, only the processor node N5 with the failure or the dispensability can be disconnected from the three dimensional torus connection.
Additionally, when all of the transmission lines are connected in the three dimensional torus connection, some of the transmission lines are looped and the traffic becomes excessive so that the transmission amount overflows the transmission capacity and the link may go down. However, in the present embodiment, the optical switch 41 can suitably disconnect the link in accordance with the connection configuration of the processor nodes 11 which is required for each application so that the link going down due to the loop configuration of the transmission lines can be avoided.
Also, in
For example, in
As described above, according to the respective embodiments of the present invention, it is possible that the high performance computer comprised of multiple processor nodes 11 can be physically separated by the optical switch as occasion demands, that the parallel computers C1 to C3 that can be independently operated in parallel can be created, and the parallel computers C1 to C3 are used by different users for different applications (application programs). Also, the leakage of information among the application programs executed by respective users can be certainly avoided.
Accordingly, large-scale application programs that require high performance can be executed and parallel operations by multiple users can be realized with a high level of security so that the rate of operation can be enhanced by the use of the parallel computers in various tasks.
In addition, the present invention is not limited to the configurations explained in the above embodiments and it is obvious that various modifications are possible without departing from the spirit of the present invention.
For example, in the above respective embodiments, examples in which an optical fiber is used for the information transmission line have been offered. However, a metal cable can be used instead of an optical fiber, for example.
Utilizing the present invention, multiple users can use parallel computers simultaneously with a high level of security without deteriorating the computation performance.
Also, multiple application programs with a variety of computation scales can be executed in parallel while maintaining the security among the application programs.
Also, the enhancement of the rate of operation of the parallel computer created by multiple processor nodes is realized.
Also, the enhancement of failure resistance of a parallel computer by the disconnection of the processor nodes in a unit of each processor node is realized.
Claims
1. An information processing system, comprising:
- a plurality of processor nodes for constituting parallel computers;
- an information transmission line for connecting the processor nodes; and
- a separation switch provided on the information transmission line for separating the information transmission line so that multiple processor nodes may create a plurality of parallel computers which are independent of one another.
2. The information processing system according to claim 1, wherein:
- the separation switch physically separates the information transmission line.
3. The information processing system according to claim 1, wherein:
- the information transmission line consists of an optical transmission line which creates a loop;
- communications among multiple processor nodes are conducted by optical signals transmitted on the optical transmission line; and
- the separation switch is an optical switch for separating the optical transmission line multiple loops and for uniting these loops.
4. The information processing system according to claim 1, wherein:
- the information transmission line consists of a single optical transmission line;
- communications among multiple processor nodes are conducted by wavelength division multiplexed optical signals transmitted on the optical transmission line; and
- the separation switch is an optical switch for separating and connecting the optical transmission line which transmits the wavelength division multiplexed optical signals.
5. The information processing system according to claim 1, wherein:
- the information transmission line consists of a plurality of optical transmission lines;
- the optical transmission line for connecting the separation switch and the processor node that is the closest to the separation switch is selectively unified; and
- wavelength division multiplexed optical signals are transmitted on the selectively unified optical transmission line.
6. The information processing system according to claim 1, wherein:
- multiple processor nodes are connected via the information transmission line so that the two dimensional torus or the three dimensional torus is created; and
- said processor nodes are separated, by the separation switch, to belong to the multiple two dimensional tori or the three dimensional tori which are different from the multiple processor nodes.
7. The information processing system according to claim 1, wherein:
- each of the processor nodes is comprised of an input port and an output port;
- the separation switch is comprised of a switch matrix for connecting multiple connection ports to which the input port and the output port are connected and the arbitrary connection port via the information transmission line;
- the switch matrix connects the input ports and the output ports of the multiple processor nodes to create loops; and
- the switch matrix skips and excludes a pair of the input port and the output port of each of the processor nodes from the loops so that the disconnection from the parallel computer is conducted in a unit of each of the processor nodes.
8. An information processing system, comprising:
- first switches for binding multiple processor nodes constituting groups and controlling routes of information transmitted among the processor nodes in the groups;
- a second switch for controlling routes of information transmitted among the processor nodes among the multiple groups that processor nodes are connected in a unit of the group via the multiple first switches; and
- a third switch provided between the first switches and the second switch for controlling the presence and the absence of the connection with the second switch regarding each of the groups.
9. The information processing system according to claim 8, wherein:
- the first switches and the second switch are connected via optical transmission lines which transmit wavelength division multiplexed optical signals; and
- the third switch is an optical switch provided on the optical transmission lines.
10. The information processing system according to claim 8, wherein:
- the third switch includes a function to connect the first switches disconnected from the second switch; and
- the processor nodes respectively constitute a plurality of parallel computers which are independent of one another in a unit of the group as a minimal unit.
11. A method of controlling an information processing system to connect a plurality of processor nodes via an optical transmission line and cause the processor nodes to operate as parallel computers, comprising:
- a step of arranging an optical switch on the optical transmission line; and
- a step of causing the multiple processor nodes to operate as parallel computers that are independent of one another by separating the optical transmission line by the optical switch as occasion demands.
12. The method of controlling an information processing system according to claim 11, wherein:
- information among the various processor nodes is transmitted and received by wavelength division multiplexed optical signals transmitted on the optical transmission line.
13. A method of controlling an information processing system in which a fat tree is created by binding multiple processor nodes constituting groups by first switches for controlling routes of information transmitted among the processor nodes in the groups and by binding multiple first switches by a second switch for controlling routes of information transmitted among the processor nodes among multiple groups, comprising:
- a step of arranging a third switch on an information transmission line between the first switches and the second switch; and
- a step of creating multiple parallel computers which are independent of one another in a unit of the group as a minimal unit by controlling the presence and the absence of the connection with the second switch regarding each of the groups by the third switch.
14. The method of controlling an information processing system according to claim 13, wherein:
- the first switches and the second switch are connected via optical transmission lines which transmit wavelength division multiplexed optical signals; and
- the presence and the absence of connection with the second switch regarding each of the groups is controlled by using an optical switch as the third switch based on the presence and the absence of separation of the optical transmission lines.
Type: Application
Filed: Jun 7, 2005
Publication Date: Aug 3, 2006
Applicant:
Inventors: Takao Naito (Kawasaki), Toshiki Tanaka (Kawasaki), Kouichiro Amemiya (Kawasaki)
Application Number: 11/145,971
International Classification: G06F 15/173 (20060101);