Server cluster and control mechanism thereof

- Quanta Computer Inc.

A server cluster including a network switch and multiple server nodes is provided. The network switch is connected to an external network. Each server node performs an operation system and respectively includes a network port, a network chip and a south bridge chip. The network port is connected to the network switch via a cable. The network chip outputs a power-off signal according to a received power-off packet after the network switch is started. The south bridge chip outputs a shutdown signal to shut down the server node according to the power-off signal when the server node is turned on and the operation system is working normally.

Skip to: Description  ·  Claims  ·  References Cited  · Patent History  ·  Patent History
Description

This application claims the benefit of Taiwan application Serial No. 100136173, filed Oct. 5, 2011, the subject matter of which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The invention relates in general to a server cluster and a control mechanism thereof.

2. Description of the Related Art

The blade server is optimized with modular design to reduce the physical space and energy to a minimum and further simplify the server configuration. The operation of the blade server relies on manual operation, or, the administrator can turn off the power with a baseboard management controller (BMC) operated by a remote-end management server. The aforementioned mechanism must use a remote-end management server and a baseboard management controller, so the cost of the blade serve cannot be effectively reduced.

SUMMARY OF THE INVENTION

The invention is directed to a server cluster and a control mechanism thereof. The power of the server node is turned off according to a detected power-off packet of a network without using any baseboard management controller (BMC), so that the cost of the blade serve can be effectively reduced.

According to a first aspect of the present invention, a server cluster including a network switch and multiple server nodes is provided. The network switch is connected to an external network. Each server node performs an operation system and respectively includes a network port, a network chip and a south bridge chip. The network port is connected to the network switch via a cable. The network chip outputs a power-off signal according to a received power-off packet after the network switch is started. The south bridge chip outputs a shutdown signal to shut down the server node according to the power-off signal when the server node is turned on and the operation system is working normally.

According to a second first aspect of the present invention, a control mechanism of a server cluster is provided. The server cluster includes a network switch and multiple server nodes. Each server node performs an operation system and respectively includes a network port, a network chip and a south bridge chip. The network switch is connected to an external network. The network port is connected to the network switch via a cable. The control mechanism of a server cluster includes the following steps. A network switch is started. A network chip is used for outputting a power-off signal according to a received power-off packet. When the server node is turned on and the operation system is working normally, the south bridge chip is used for outputting a shutdown signal to shut down the server node according to the power-off signal.

The above and other aspects of the invention will become better understood with regard to the following detailed description of the preferred but non-limiting embodiment(s). The following description is made with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a server cluster according to one embodiment of the invention;

FIG. 2 shows a flowchart of a control mechanism of a server cluster according to one embodiment of the invention;

FIG. 3 shows a circuit diagram of an example of a pulse delay circuit according to one embodiment of the invention.

DETAILED DESCRIPTION OF THE INVENTION

The invention relates to a server cluster and a control mechanism thereof. The power of the server node is turned off according to a detected power-off packet of a network without using any baseboard management controller (BMC), so that the cost of the blade serve can be effectively reduced.

Referring to FIG. 1, a server cluster according to one embodiment of the invention is shown. The server cluster 100 includes a network switch 110 and multiple server nodes 120. The network switch 110 is connected to an external network such as an Internet. Each server node 120 includes a network port 124, a network chip 126, a south bridge chip 128 and a pulse delay circuit 130. The network port 124 is connected to the network switch 110 via a network link 122 such as a cable. The network chip 126 outputs a power-off signal according to a received power-off packet after the network switch 110 is turned on. The power-off packet is such as but not limited to a network reboot on LAN (ROL) packet or a network wake on LAN (WOL) packet. The south bridge chip 128 is electrically connected to the network chip 126. The pulse delay circuit 130 is electrically connected to the network chip 126 and the south bridge chip 128.

Referring to FIG. 2, a flowchart of a control mechanism of a server cluster according to one embodiment of the invention is shown. In step S200, a network switch 110 is started. In step S210, a power-off packet is received by a network chip 126, and a power-off signal PWR_OFF_L is outputted accordingly. In step S220, whether the server node 120 is turned on is judged. If the server node 120 is turned off, then none of the south bridge chip 128 and the pulse delay circuit 130 needs to generate action, and the process returns to step S200. If the server node 120 is turned on, then the process proceeds to step S230. In step S230, whether the operation system of the server node 120 is abnormal such as crash or hang-up is judged.

When the operation system of the server node 120 is normal or is working normally, then the process proceeds to step S240. In step S240, the pin of a system management interrupt (SMI) of the south bridge chip 128 is enabled by the power-off signal PWR_OFF_L to generate a shutdown signal SCI to notify the operation system to shut down the server node 120. In step S250, the server node 120 is shut down. If the operation system of the server node 120 is abnormal such as hang-up, then the process proceeds to step S260. In step S260, the power-off signal PWR_OFF_L is transformed into a delay signal and further transmitted the delay signal to the south bridge chip 128 by the pulse delay circuit 130, such that the south bridge chip 128 outputs a power signal PWR_BUT_L to shut down the server node 120.

In the aforementioned mechanism, the pulse delay circuit 130 substantially detects an on/off state of the server node 120 according to a signal SLP_S5_L of the south bridge chip 128. The pulse delay circuit 130 transforms a power-off signal PWR_OFF_L into a low level delay signal whose pulse width amounts to 4 seconds when the server node 120 is turned on and the operation system is abnormal. Under the current protocol, the low level delay signal whose pulse width amounts to 4 seconds enables the south bridge chip 128 to force the operation system to shut down the server node 120.

Referring to FIG. 3, a circuit diagram of an example of a pulse delay circuit according to one embodiment of the invention is shown. In the pulse delay circuit 130, the power-off signal PWR_OFF_L whose pulse width is merely a few micro-seconds is inverted first. Then, the pulse width is delayed by an RC circuit. Lastly, the signal SLP_S5_L is outputted when the server node 120 is turned on and inverted as a low level delay signal whose pulse width amounts to 4 seconds.

According to the server cluster and the control mechanism thereof disclosed in the above embodiments of the invention, a power-off packet of the network, such as a network reboot on LAN (ROL) packet or a network wake on LAN (WOL) packet, is detected with a simple logic circuit to turn off the power of the server node without using a baseboard management controller, such that the cost of the server cluster can be effectively reduced.

While the invention has been described by way of example and in terms of the preferred embodiment (s), it is to be understood that the invention is not limited thereto. On the contrary, it is intended to cover various modifications and similar arrangements and procedures, and the scope of the appended claims therefore should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements and procedures.

Claims

1. A server cluster, comprising:

a network switch connected to an external network; and
a plurality of server nodes, wherein each server node performs an operation system and respectively comprises: a network port connected to the network switch via a cable; a network chip used for outputting a power-off signal according to a received power-off packet after the network switch is started; a south bridge chip used for outputting a shutdown signal to shut down the server node according to the power-off signal when the server node is turned on and the operation system is working normally; and a pulse delay circuit that transforms the power-off signal into a delay signal and transmits the delay signal to the south bridge chip when the operation system is abnormal and the server node, is turned on, such that the south bridge chip shuts down the server node; wherein the pulse delay circuit judges whether the server node is turned on through the south bridge chip.

2. The server cluster according to claim 1, wherein the power-off packet is a network power-on reboot on LAN packet or a network wake on LAN packet.

3. The server cluster according to claim 1, wherein the delay signal has a low level pulse whose pulse width amounts to 4 seconds.

4. A control mechanism method of a server cluster, wherein the server cluster comprises a network switch and a plurality of server nodes, each server node performs an operation system and respectively comprises a network port, a network chip and a south bridge chip, the network switch is connected to an external network, the network port is connected to the network switch via a cable, and the control mechanism method comprises:

starting the network switch;
outputting a power-off signal by the network chip according to a received power-off packet; and
outputting a shutdown signal by the south bridge chip to shut down the server node according to the power-off signal when the server node is turned on and the operation system is working normally,
wherein each server node further comprises a pulse delay circuit, and the control mechanism method further comprises judging whether the server node is abnormal when the operation system is turned on; and transforming the power-off signal into a delay signal and further transmitting the delay signal to the south bridge chip by the pulse delay circuit when the operation system is abnormal, such that the south bridge chip shuts down the server node;
wherein the pulse delay circuit judges whether the server node is turned on through the south bridge chip.

5. The control mechanism method of a server cluster according to claim 4, wherein the power-off packet is a network power-on reboot on LAN packet or a network wake on LAN packet.

6. The control mechanism method of a server cluster according to claim 4, wherein the delay signal has a low level pulse whose pulse width amounts to 4 seconds.

7. A server cluster, comprising:

a network switch connected to an external network; and
a plurality of server nodes, wherein each server node performs an operation system and comprises:
a network port connected to the network switch;
a network chip used for outputting a power-off signal according to a received power-off packet;
a south bridge chip used for outputting a shutdown signal to shut down the server node according to the power-off signal when the server node is on and the operation system is working; and
a pulse delay circuit that transforms the power-off signal into a delay signal and transmits the delay signal to the south bridge chip when the operation system is abnormal and the server node is on, such that the south bridge chip shuts down the server node.

8. The server cluster according to claim 7, wherein the pulse delay circuit judges whether the server node is turned on through the south bridge chip.

9. The server cluster according to claim 7, wherein the power-off packet is a network reboot on LAN packet or a network wake on LAN packet.

10. The server cluster according to claim 7, wherein the delay signal has a low level pulse whose pulse width amounts to 4 seconds.

11. A control method for a server cluster, wherein the server cluster comprises a network switch and a plurality of server nodes, each server node performs an operation system and comprises a network port, a network chip, a pulse delay circuit, and a south bridge chip, the network switch is connected to an external network, the network port is connected to the network switch, and the control method comprising:

starting the network switch;
outputting a power-off signal by the network chip according to a received power-off packet;
judging whether the server node is abnormal when the operation system is on;
outputting a shutdown signal by the south bridge chip to shut down the server node according to the power-off signal when the server node is on and the operation system is working; and
transforming the power-off signal into a delay signal and further transmitting the delay signal to the south bridge chip by the pulse delay circuit when the operation system is abnormal, such that the south bridge chip shuts down the server node.

12. The control method of claim 11, wherein the pulse delay circuit judges whether the server node is turned on through the south bridge chip.

13. The control method according to claim 11, wherein the power-off packet is a network reboot on LAN packet or a network wake on LAN packet.

14. The control method according to claim 11, wherein the delay signal has a low level pulse whose pulse width amounts to 4 seconds.

Referenced Cited
U.S. Patent Documents
5696895 December 9, 1997 Hemphill et al.
6269288 July 31, 2001 Smith
6473865 October 29, 2002 Kamoshida
6654896 November 25, 2003 Saunders
6859882 February 22, 2005 Fung
6968465 November 22, 2005 Freevol
7069317 June 27, 2006 Colrain et al.
7352289 April 1, 2008 Harris
7380144 May 27, 2008 Green
7472179 December 30, 2008 Tarui et al.
7573832 August 11, 2009 Kenghe
7664991 February 16, 2010 Gunda et al.
8212396 July 3, 2012 Chiou
8271632 September 18, 2012 Kobayashi et al.
20080183880 July 31, 2008 Sasage et al.
20110161695 June 30, 2011 Okita et al.
20120226918 September 6, 2012 Rallo
Foreign Patent Documents
102144374 August 2011 CN
200509593 March 2005 TW
200709609 March 2007 TW
200828887 July 2008 TW
Other references
  • Second Office Action mailed on Aug. 4, 2015 in Chinese Application No. 201110315416.9.
  • Summary of Second Office Action mailed on Aug. 4, 2015 in Chinese Application No. 201110315416.9.
  • First Office Action mailed on Jan. 12, 2015 in Chinese Application No. 201110315416.9.
  • Summary of First Office Action mailed on Jan. 12, 2015 in Chinese Application No. 201110315416.9.
Patent History
Patent number: RE46520
Type: Grant
Filed: Apr 14, 2015
Date of Patent: Aug 22, 2017
Assignee: Quanta Computer Inc. (Taoyuan)
Inventors: Le-Sheng Chou (Zhongli), Sz-Chin Shih (New Taipei)
Primary Examiner: Kenneth J Whittington
Application Number: 14/686,458
Classifications
Current U.S. Class: Repair Failed Node Without Replacement (i.e., On-line Repair) (714/4.3)
International Classification: G06F 1/00 (20060101); G06F 11/00 (20060101); H04L 12/24 (20060101); H04L 12/26 (20060101); H04L 12/56 (20060101); G06F 1/32 (20060101); G06F 11/30 (20060101); H04L 12/12 (20060101); G06F 11/07 (20060101); H04L 12/931 (20130101);