Computer chip heat responsive method and apparatus
Disclosed is an apparatus incorporating hardware based logic and a predetermined default list of software affecting responses to be taken in connection with temperatures sensed by thermal sensors checking the temperature of portions of computer logic. At the time application software is loaded, the software can modify the default response list. The list of responses to be taken and the over temperature conditions at which they are to be activated are stored in hardware directly accessible by hardware based thermal sensor monitoring logic for direct control of the hardware. The control can alter conditions such as clock frequency, stopping use of application software, interrupting OS functionality, removing power from components and so forth.
Latest IBM Patents:
The invention relates to a PU (processing unit) control and, more particularly, to a method of controlling chip temperature in an operating PU.
BACKGROUNDWhen a program is running in a CPU (Central Processing Unit) or other PU, local hot spots can develop on an integrated circuit chip. Present day chips are configured with islands of circuitry where different islands perform different functions in the PU operation. On a multiprocessor chip, each island may be a separate PU. In either event, each island is likely to experience different workloads and thus reach different “hot spot” temperatures. Known prior art responses comprise shutting down all processing operations of the processor until the chip is adequately cool, decreasing operational workload in a standardized manner for all software used on the PU or increasing the cooling air flow used to lower the temperature of the chip.
It would be desirable if a method could be devised whereby the PU could be allowed to continue to operate, but the hot spot islands of the PU would be substantially immediately required to run at a reduced capacity while the hot spot cools. It would be further desirable to reduce the load on the hot spot island as a function of the amount of overheating of the island in question.
SUMMARY OF THE INVENTIONThe present invention comprises a software modifiable list of over temperature responses in hardware for direct action access by hardware based monitoring circuitry.
For a more complete understanding of the present invention, and its advantages, reference will now be made in the following Detailed Description to the accompanying drawings, in which:
In the remainder of this description, a processing unit (PU) may be a sole processor of computations in a device. In such a situation, the PU is typically referred to as a CPU (central processing unit). The invention, however, may also be readily practiced on a multiprocessor chip. While the explanation following is directed to sensing a single hot spot of an integrated circuit chip, the invention as practiced will typically have a multiplicity of sensors, each monitoring the temperature of different logic circuits, only some of which may overheat for certain types of computer program operations. As an example, if a program entailed the computation of an extremely large number of consecutively occurring floating point arithmetic operations, the floating point arithmetic logic circuit portion might overheat without causing any other portions to overheat. It should further be noted that the explanation of operation is provided for a single PU controlling operations as concerns only thermal sensors monitoring logic of that PU. Similar logic would be used in a multiprocessor situation where thermal units measuring the temperature of circuitry of multiple PUs are all controlled by a single piece of hardware-based monitoring logic.
In
In
In
In
In
The flowchart of
The flow of the hardware-based logic shows the thermal sensors being checked by a decision block 90. While a separate thermal monitor may be used for each thermal sensor, a more typical situation would be for a thermal monitor to check the temperature of a plurality of sensors on a multiplexed basis. If any change is noted from one temperature indicated level to another, such as from less than TT Lv2 to greater than TT Lv2, a decision block 92 will determine if the default responses have been modified by the running software. If not, the default response will be followed as set forth in block 94. If there has been a modification, the response suggested by the running application software will be followed as set forth in a block 96. Once this action is taken, the hardware logic will check the next sensor, such as one of those between 40 and 52, to determine if its temperature has altered since the last check.
An example of how dynamic response alteration would be useful could be as follows. For the purpose of this explanation, it may be assumed that the example is directed to multiple programs running on a single PU. The operating system of this PU may use time slicing between two, or more, running applications, such as programs #1 and #2. In this example, each program runs on this PU for 2 ms, then switches to the next, or other, one. It may be further assumed that program #1 is real-time, and program #2, and others, is not. In other words, program #2, and/or others, may be considered to be a background or less important program(s). The temperature control software in such a situation of a TT Lv2 temperature sensed might set the response for a thermal event while P1 is running as “do nothing,” while the response for P2, and other background programs, may be set as “pause” or “halt.” Thus the circuitry would only be active one-half, or less, of the time. This reduced operation of the overheated circuitry may be entirely sufficient to drop the temperature of the affected circuitry down below the level of TT Lv2, thereby allowing background program(s) to proceed with their operation. If the pause action includes shutting down the clocks and removing power to the affected circuit during the pause operation, the time for a significant reduction in temperature will be even further reduced.
In summary, the present invention comprises having direct hardware control of temperature reducing actions affecting circuitry of an integrated circuit to quickly provide a temperature reducing response. This is accomplished by having the hardware have default responses to over temperature conditions, which, at the time of loading application software to be run on the PU, may be stored as a preferred alternate to the default condition response actions of the thermal sensor monitoring logic. The information stored may be for any sensed portion of a chip that is determined to be overheated or may be different for each different sensor or some intermediate combination. The response actions may be altered by the temperature control software dynamically during the running of a loaded software program in response to other sensors or due to a change in the operating environment, such as running additional programs simultaneous with the first loaded program.
It should be noted that the portions of circuitry sensed may be an entire integrated circuit as well as smaller portions. The smaller portions may be described as islands of circuitry and may comprise action specific logic, such as a floating point math unit. In the case of a multiprocessor chip, these islands may again comprise an entire PU.
Although the invention has been described with reference to a specific embodiment, the description is not meant to be construed in a limiting sense. Various modifications of the disclosed embodiment, as well as alternative embodiments of the invention, will become apparent to persons skilled in the art upon reference to the description of the invention. It is therefore contemplated that the claims will cover any such modifications or embodiments that fall within the true scope and spirit of the invention.
Claims
1. A method of reducing hot spot temperatures in specific logic defined circuitry islands of an integrated circuit chip, comprising:
- providing a temperature sensor for each of a plurality of logically definable islands of circuitry on a chip;
- incorporating an action list of possible responses to given hot spot temperatures in software to be run on a PU (processing unit) utilizing said chip;
- storing the selected response(s) in hardware for direct access by action selecting logic circuitry;
- selecting at least one of the possible responses to be used as a function of a detected specific island hot spot temperature; and
- activating an action in accordance with a selected response.
2. A method for providing a programmable response for thermal sensor implementation on a chip substrate defining a plurality of islands, comprising:
- generating an action list of different possible responses associated with said islands to hot spots on said substrate;
- storing the action list in hardware for direct access;
- detecting one of said hot spots and parameters associated with said detected one of said hot spots;
- selecting one of said possible responses from said action list functionally related to said parameters; and
- activating said response in response to said selecting.
3. A method of responding to integrated chip hot spots in computer apparatus, comprising:
- incorporating at least one thermal sensor in computer circuitry likely to overheat;
- monitoring said at least one thermal sensor whereby at least one default response stored in hardware is provided upon detection of a thermal sensor traversing a predetermined temperature; and
- accepting inputs from application software to modify said at least one default response.
4. The method of claim 3, comprising in addition a list of default responses that are to be taken for each of a plurality of predetermined temperatures traversed by said at least one thermal sensor.
5. The method of claim 4, comprising in addition means for dynamically modifying any part of a list of default responses that are to be taken for each of a plurality of predetermined temperatures traversed by said at least one thermal sensor via inputs from running application software.
6. Computer apparatus, comprising:
- at least one thermal sensor circuit;
- thermal sensor hardware based monitoring logic circuit including response action storage means;
- control means operable to alter computer operational attributes in accordance with instructions from said thermal sensor monitoring logic circuit; and
- code imbedded in software being run on the computer which may dynamically update potential response actions in said response action storage means of said thermal sensor monitoring logic circuit in accordance with requirements of the computer program being run.
7. A method of modifying computer hardware operating parameters as a function of sensed chip operating temperatures where the computer includes at least one thermal sensor and associated temperature monitoring hardware logic having response action storage capability, comprising:
- generating a hardware based default action list of responses to be taken by the monitoring hardware with detected sensor temperature changes, wherein the default action list is at least stored in the monitoring hardware; and
- changing the action taken by the hardware logic as a function of running computer software inputs to said hardware logic.
8. Computer apparatus, comprising:
- at least one thermal sensor circuit; and
- thermal sensor monitoring logic circuit including software modifiable default response action storage means, wherein the thermal sensor monitoring logic is at least configured to select a potential response from an action list of potential responses stored in the software modifiable response action storage means.
9. Apparatus as claimed in claim 8, comprising, in addition;
- means, responsive to software supplied inputs, operable to modify at least one of the default responses to be taken by said monitoring logic circuit.
10. A computer program product for providing a programmable response for thermal sensor implementation on a chip substrate defining a plurality of islands, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
- computer code for generating an action list of different possible responses associated with said islands to hot spots on said substrate; and
- computer code for replacing default hardwired responses in a software modifiable response action storage means with said action list of defined responses in the software modifiable response action storage means.
11. A computer program product for redefining a hardwired response for hot spots on a chip substrate defining a plurality of islands, the computer program product having a medium with a computer program embodied thereon, the computer program comprising computer code, imbedded in software being run on the computer, for redefining and/or dynamically updating the hardwired response actions in response action storage means of thermal sensor monitoring logic circuit in accordance with requirements of the computer program being run, wherein the computer code for redefining and/or dynamically updating at least comprises having computer code for storing an action list of potential responses in the response storage means.
12. A computer program product for redefining a hardwired default response for hot spots on a chip substrate defining a plurality of islands, the computer program product having a medium with a computer program embodied thereon, the computer program comprising computer code for changing the hardwired default action taken by hardware logic as a function of running computer software inputs to said hardware logic, wherein the computer code for changing comprises at least computer code for storing an action list of potential responses in a response storage means located in said hardware logic.
13. A computer program product for redefining a hardwired default response for hot spots on a chip substrate defining a plurality of islands, the computer program product having a medium with a computer program embodied thereon, the computer program comprising:
- computer code for retrieving an action list of different possible responses associated with said islands to hot spots on said substrate from a running application program; and
- computer code for replacing default hardwired responses with said application program action list of defined responses, wherein the computer code for replacing comprises at least computer code for storing an action list of potential responses in a response storage means located in said hardware logic.
5994752 | November 30, 1999 | Sander et al. |
6308311 | October 23, 2001 | Carmichael et al. |
6442735 | August 27, 2002 | Joshi et al. |
20020173928 | November 21, 2002 | Willner et al. |
Type: Grant
Filed: Mar 27, 2003
Date of Patent: Aug 23, 2005
Patent Publication Number: 20040193383
Assignee: International Business Machines Corporation (Armonk, NY)
Inventors: Joachim Gerhard Clabes (Austin, TX), Lawrence Joseph Powell, Jr. (Round Rock, TX), Daniel Lawrence Stasiak (Rochester, MN), Michael Fan Wang (Austin, TX)
Primary Examiner: John Barlow
Assistant Examiner: Meagan S Walling
Attorney: Carr LLP
Application Number: 10/401,410