METHOD AND APPARATUS FOR ON-DEMAND POWER MANAGEMENT
A method for on-demand power management is described. The method includes detecting a plurality of processing events on a system bus within a processing system, recognizing a processing event pattern, and correlating the processing event pattern with a processing demand.
This application is a continuation of U.S. application Ser. No. 12/004,593 filed Dec. 20, 2007, which is a divisional of application Ser. No. 11/019,804, filed Dec. 21, 2004, now U.S. Pat. No. 7,337,335, which are hereby incorporated by reference.
TECHNICAL FIELDThe present invention relates generally to power management and in particular to managing voltages and frequencies in response to application processing demands.
BACKGROUNDAs digital electronic processing systems trend toward higher operating frequencies and smaller device geometries, power management has become increasingly important to prevent thermal overload while maintaining system performance and prolonging battery life in portable systems.
The two principal sources of power dissipation in digital logic circuits are static power dissipation and dynamic power dissipation. Static power dissipation is dependent on temperature, device technology and processing variables, and is composed primarily of leakage currents. Dynamic power dissipation is the predominant loss factor in digital circuitry and is proportional to the operating clock frequency, the square of the operating voltage and the capacitive load. Capacitive load is highly dependent on device technology and processing variables, so most approaches to dynamic power management focus on frequency and voltage control.
One conventional approach to power management halts the processing system to adjust core clock frequencies and voltages, during which time the processor does not execute operating system code or application code, and then restarts the system after the new frequencies and voltages have stabilized. Such an approach is described in U.S. Pat. No. 6,754,837, as illustrated in
Another conventional approach to power management, described in U.S. Pat. No. 6,788,156, changes the clock frequency of a processor while the processor is operating, but requires the frequency changes to be made in small increments to avoid processing errors that large frequency steps would cause. As a result, this approach may require a significant time period to achieve a desired operating frequency.
Yet another conventional approach to power management, described in U.S. Pat. No. 6,778,418, employs a fixed relationship between voltage and frequency, either through a lookup table or by use of a frequency to voltage converter. In this approach, a frequency increase is always preceded by a voltage increase and a frequency decrease always precedes a voltage decrease. In addition, a frequency increase is delayed while the voltage is ramped up to a corresponding voltage. The new frequency and voltage are not scaled independently, and the new operating point may not be optimum with respect to an application's processing demand.
The present invention is illustrated by way of example, and not of limitation, in the figures of the accompanying drawings in which:
In the following description, numerous specific details are set forth such as examples of specific components, devices, methods, etc., in order to provide a thorough understanding of embodiments of the present invention. It will be apparent, however, to one skilled in the art that these specific details need not be employed to practice embodiments of the present invention. In other instances, well-known materials or methods have not been described in detail in order to avoid unnecessarily obscuring embodiments of the present invention. It should be noted that the “line” or “lines” discussed herein, that connect elements, may be single lines or multiple lines. It will also be understood by one having ordinary skill in the art that lines and/or other coupling elements may be identified by the nature of the signals they carry (e.g., a “clock line” may implicitly carry a “clock signal”) and that input and output ports may be identified by the nature of the signals they receive or transmit (e.g., “clock input” may implicitly receive a “clock signal”).
A method and apparatus for on-demand power management is described. In one embodiment, the method includes monitoring a processing demand in a processing system operating at a first one or more voltages and a first one or more clock frequencies phase-locked to a reference frequency. The method also includes generating a second one or more clock frequencies in response to the processing demand, wherein the second one or more clock frequencies is phase-locked to the reference frequency and phase-matched to the first one or more clock frequencies. The method also includes switching from the first one or more clock frequencies to the second one or more clock frequencies without halting the processing system. In one embodiment, the method further includes generating a second one or more voltages in response to the processing demand, and switching from the first one or more voltages to the second one or more voltages without halting the processing system.
In one embodiment, the apparatus includes a system controller to monitor an application processing demand on a processing system and to determine one or more clock frequencies and one or more voltages at which the processing system operates. The apparatus also includes a power distribution manger, coupled with the system controller, to provide one or more operating voltages to the processing system and to switch between a first one or more voltages and a second one or more voltages without halting the processing system. The apparatus also includes a clock domain manager, coupled with the system controller, to provide one or more clock signals to the processing system and to switch between a first one or more clock frequencies and a second one or more clock frequencies without halting the processing system. The first one or more clock frequencies and the second one or more clock frequencies are phase-locked to a common reference frequency and the second one or more clock frequencies are phase-matched to the first one or more clock frequencies. In one embodiment, the apparatus also includes a compensation engine coupled with the system controller, the power distribution manager and the clock domain manager, to receive voltage and frequency commands from the system controller and to compensate the voltage and frequency commands for temperature and processing variables.
Processing system 100 may also include power manager 105, which may be coupled to system bus 102, frequency source 108 and voltage source 109. Power manager 105 may also be coupled to system processor 101 and peripherals 104-1 through 104-k via a clock bus 106 and voltage bus 107. In one embodiment, as illustrated in
With reference to
In one embodiment, power manager 105 may be configured to monitor processing activity on system bus 102 while supplying the first one or more clock frequencies f1′-fm′ and the first one or more voltages V1′-Vn′ to system processor 101 and peripherals 104-1 through 104-k. Power manager 105 may also be configured to determine a processing demand based on the monitored processing activity and to generate the second one or more clock frequencies f1″-fm″ and the second one or more voltages V1″-Vn″ in response to the processing demand. Power manager 105 may also be configured to switch from the first one or more voltages to the second one or more voltages without halting the processing system 100, and to switch from the first one or more clock frequencies to the second one or more clock frequencies without halting the processing system 100.
System controller 201 may include a bus interface unit 205 to monitor processing activity on system bus 102 and to select a new operating point for the processing system 100. System controller 201 may also include a programmable memory 206 coupled with the bus interface unit 205. Programmable memory 206 may include programmed information to enable the bus interface unit 205 to correlate activity on the system bus 102 with the application processing demand in processing system 100.
In one embodiment, bus interface unit 205 may be configured to detect a plurality of commands on the system bus 102 and to recognize a command pattern, programmed in programmable memory 206, associated with a change in the application processing demand. The command pattern may be a generic processing command pattern, or a command pattern and bus transaction cycles associated with a specific system processor 101 or a processor family of which system processor 101 may be a member. In response to recognizing the command pattern, bus interface unit 205 may select the new operating point for the processing system 100. The new operating point may include a new set of operating voltages V1″-Vn″ which are different from a current set of operating voltages V1′-Vn″, and/or a new set of clock frequencies f1″-fm″ which are different from a current set of operating clock frequencies f1′-fm′. In one embodiment, the current sets of operating voltages and clock frequencies and the new sets of operating voltages and clock frequencies may be written to hardware registers (not shown) within system controller 201 or software defined registers (e.g., memory locations in programmable memory 206).
Alternatively, bus interface unit 205 may be configured to detect an average number of processing events per unit time on system bus 102 and to compare the average number of processing events with one or more current clock frequencies 112. Based on the comparison, bus interface unit 205 may select a new operating point as described above.
As described in greater detail below, system controller 201 may also include a state machine 207, coupled with the bus interface unit 206 and a command bus 208, to control the provision of voltages V1-Vn in the power distribution manager 202 and the provision of clock frequencies f1-fm in the clock domain manager 203.
It will be appreciated by one having ordinary skill in the art that system controller 201 may be configured to automatically monitor the processing activity on system bus 102 and to and autonomously command the one or more voltages V1-Vn and the one or more clock frequencies f1-fm to select a new operating point as the application processing demand in processing system 100 changes. However, system controller 201 may also include a command interrupt line 209, coupled with state machine 207, to override the automatic control of the one or more voltages V1-Vn and the one or more clock frequencies f1-fm (e.g., in response to a critical power demand from the system processor 101 or one or more of peripherals 104-1 through 104-n). Command interrupt line 209 may be used to set processing system 100 to a predetermined operating point wherein the system controller 201 commands the power distribution manager 202 to provide one or more predetermined voltages to the processing system 100 and wherein the system controller commands the clock domain manager to provide one or more predetermined clock frequencies to the processing system 100.
It will be appreciated by one of ordinary skill in the art that all clock frequencies f1-fm will be harmonically related because all are phase-locked to the common reference frequency 110. In particular, any two clock frequencies in a single frequency control channel (e.g., clock frequencies f1′ and f1″ in frequency control channel 501-1) will be harmonically related.
As noted above, the ping-pong controllers 402 in the power distribution manager 202 may receive commands from state machine 207 in system controller 201 to control the dual voltage regulators 403, and the ping-pong controllers 502 in the clock domain manger 203 may receive commands from the state machine 207 in the system controller 201 to control the dual PLL's 503.
In one embodiment, when a new operating voltage and/or a new clock frequency is commanded by the system controller, state machine 207 may operate in a ping-pong mode or a steady-state mode. Ping-pong mode is a symmetric mode where a new steady-state operating voltage is provided alternately by VR1 and VR2 with each change, and where the new steady-state clock frequency is provided alternately by PLL1 and PLL2 with each change. Steady-state mode is an asymmetrical mode where a new steady-state voltage is always provided by one voltage regulator (e.g., VR1) after a transient change is provided by the other voltage regulator (e.g., VR2) and where a new steady-state clock frequency is always provided by one PLL (e.g., PLL1) after a transient change is provided by the other PLL (e.g., PLL2). Table 1 defines the state variables used in
In an initial state (701), VR1 is set to a first voltage, which is selected by multiplexer 404 and provided to processing system 100. In the initial state PLL1 is set to a first clock frequency, which is selected by multiplexer 504 and provided to processing system 100. Bus interface unit 205 periodically checks the system bus 102 for processing activity. If bus interface unit 205 does not detect a change in processing activity, the change frequency flag is cleared (cmd_fc=0) and the change voltage flag is cleared (cmd_vc=0). If bus interface unit 205 detects a change in processing activity on system bus 102 that warrants a change in the operating point of processing system 100, bus interface unit 205 will select the new operating point from programmable memory 206, which may require a new voltage and/or new clock frequency.
If a new voltage is required (cmd_vc=1), VR2 is commanded to the new voltage (702). After the new voltage is stabilized (chk_st=1), the output of VR2 is selected (703). In ping-pong mode (mode_pp=1), VR2 continues to be selected while the voltage requirement does not change (cmd_vc=0). If the voltage requirement changes (cmd_vc=1), VR1 is commanded to the new voltage (704a). After the new voltage is stabilized (chk_st=1), the output of VR1 is selected (705) and the system returns to the initial state with the new voltage. In steady-state mode (mode_ss=1) at 703, the output of VR1 is commanded to equal the output of VR2 (704b) and the output of VR1 is selected (705) when VR1 is stabilized (chk_st=1) and the system returns to the initial state with the new voltage.
If a new clock frequency is required (cmd_fc=1), PLL2 is commanded to the new frequency (706). After the new frequency is stabilized (chk_st=1), the output of PLL2 is selected (707). In ping-pong mode (mode_pp=1), PLL2 continues to be selected while the frequency requirement does not change (cmd_fc=0). If the frequency requirement changes (cmd_fc=1), PLL1 is commanded to the new frequency (708a).
After the new frequency is stabilized (chk_st=1), the output of PLL1 is selected (709) and the system returns to the initial state (701) with the new frequency. In steady-state mode (mode_ss=1) at 707, the output of PLL1 is commanded to equal the output of PLL2 (708b) and the output of PLL1 is selected (709) when PLL1 is stabilized (chk_st=1) and the system returns to the initial state (701) with the new frequency.
In one embodiment, as illustrated in
In one embodiment, as illustrated in
In one embodiment, as illustrated in
Thus, a method and apparatus for on-demand power management has been described. It will be apparent from the foregoing description that aspects of the present invention may be embodied, at least in part, in software. That is, the techniques may be carried out in a computer system or other data processing system in response to its processor, such as system controller 201, executing sequences of instructions contained in a memory, such as programmable memory 206. In various embodiments, hardwired circuitry may be used in combination with software instructions to implement the present invention. Thus, the techniques are not limited to any specific combination of hardware circuitry and software or to any particular source for the instructions executed by the data processing system. In addition, throughout this description, various functions and operations may be described as being performed by or caused by software code to simplify description. However, those skilled in the art will recognize what is meant by such expressions is that the functions result from execution of the code by a processor or controller, such as system controller 201.
A machine-readable medium can be used to store software and data which when executed by a data processing system causes the system to perform various methods of the present invention. This executable software and data may be stored in various places including, for example, memory 103 and programmable memory 206 or any other device that is capable of storing software programs and/or data.
Thus, a machine-readable medium includes any mechanism that provides (i.e., stores and/or transmits) information in a form accessible by a machine (e.g., a computer, network device, personal digital assistant, manufacturing tool, any device with a set of one or more processors, etc.). For example, a machine-readable storage medium includes recordable/non-recordable media (e.g., read only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; etc.); etc.
It should be appreciated that references throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Therefore, it is emphasized and should be appreciated that two or more references to “an embodiment” or “one embodiment” or “an alternative embodiment” in various portions of this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures or characteristics may be combined as suitable in one or more embodiments of the invention. In addition, while the invention has been described in terms of several embodiments, those skilled in the art will recognize that the invention is not limited to the embodiments described. The embodiments of the invention can be practiced with modification and alteration within the scope of the appended claims. The specification and the drawings are thus to be regarded as illustrative instead of limiting on the invention.
Claims
1. A method, comprising:
- detecting a plurality of processing events on a system bus within a processing system;
- recognizing a processing event pattern; and
- correlating the processing event pattern with a processing demand.
2. The method of claim 1, wherein the plurality of processing events comprises a plurality of system commands.
3. The method of claim 1, wherein the processing event pattern comprises an average number of processing events per unit time.
4. The method of claim 1, wherein correlating the processing event pattern with a processing demand comprises comparing the average number of processing events per unit time with an operating clock frequency.
5. The method of claim 1, further comprising:
- switching from a first one or more clock frequencies to a second one or more clock frequencies without halting the processing system, wherein the second one or more clock frequencies are generated in response to the processing demand.
6. The method of claim 1, further comprising:
- switching from a first plurality of voltages to a second plurality of voltages in the processing system without halting the processing system, wherein the second plurality of voltages are generated in response to the processing demand.
7. An apparatus, comprising:
- a bus interface unit coupled to a system bus within a processing system, to detect a plurality of processing events on the system bus within the processing system; and
- a programmable storage device coupled to the bus interface unit, wherein the bus interface unit recognizes a processing event pattern programmed in the programmable storage device and correlates the processing event pattern with a processing demand.
8. The apparatus of claim 7, wherein the plurality of processing events comprises a plurality of system commands.
9. The apparatus of claim 7, wherein the processing event pattern comprises an average number of processing events per unit time.
10. The apparatus of claim 7, wherein the bus interface unit compares the average number of processing events per unit time with an operating clock frequency.
11. The apparatus of claim 7, further comprising a system controller comprising the bus interface unit.
12. The apparatus of claim 7, further comprising:
- a clock domain manager coupled to the system controller, the clock domain manager to switch from a first one or more clock frequencies to a second one or more clock frequencies without halting the processing system, wherein the second one or more clock frequencies are generated in response to the processing demand.
13. The apparatus of claim 11, further comprising:
- a power distribution manager coupled to the system controller, the power distribution manager to switch from a first plurality of voltages to a second plurality of voltages in the processing system without halting the processing system, wherein the second plurality of voltages are generated in response to the processing demand.
14. The apparatus of claim 11, further comprising:
- a compensation engine coupled to the system controller, the compensation engine to compensate voltage and frequency commands from the bus interface unit for temperature dependent processing and device variables.
15. The apparatus of claim 14, wherein the compensation engine comprises a temperature sensor to measure and report a temperature having an effect on an operating point of the apparatus.
Type: Application
Filed: Apr 14, 2010
Publication Date: Aug 5, 2010
Inventors: Joel A. Jorgenson (Fargo, ND), Divyata Kakumanu (Moorhead, MN), Brian M. Morlock (Fargo, ND)
Application Number: 12/760,130
International Classification: G06F 1/26 (20060101); G06F 9/46 (20060101); G06F 1/08 (20060101);