Patents by Inventor Doron Orenstein

Doron Orenstein has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20060149931
    Abstract: According to one embodiment, a method is disclosed. The method includes detecting a load miss at a central processing unit (CPU), stalling a read only buffer (ROB), speculatively retiring an instruction causing the ROB stall and subsequent instructions, keeping registers that have not been renamed in the ROB upon retirement, and flushing the CPU pipeline upon receiving data from the load miss.
    Type: Application
    Filed: December 28, 2004
    Publication date: July 6, 2006
    Inventors: Akkary Haitham, Doron Orenstein, Ravi Rajwar, Srikanth Srinivasan
  • Publication number: 20060129860
    Abstract: A clock frequency control unit for an integrated circuit (IC) includes a clock generator, a finite state machine (FSM), and a gating circuit (GC). The FSM has at least first and second states corresponding to non-low workload low workload states, respectively. In the first state, the GC provides a clock signal to functional units of the IC with the same frequency as the clock generator output. In the second state, the GC reduces the frequency of the clock signal. In one embodiment, the GC masks out selected cycles of the clock generator output to reduce the clock signal frequency. The FSM monitors the operation of the IC to transition from the first state to the second state when selected “low workload” conditions are detected (e.g., long latency cache miss). Similarly, the FSM transitions from the second state to the first state when selected “non-low workload” conditions are detected.
    Type: Application
    Filed: January 12, 2006
    Publication date: June 15, 2006
    Inventors: Itamar Kazachinsky, Doron Orenstein
  • Patent number: 7051227
    Abstract: A clock frequency control unit for an integrated circuit (IC) includes a clock generator, a finite state machine (FSM), and a gating circuit (GC). The FSM has at least first and second states corresponding to non-low workload low workload states, respectively. In the first state, the GC provides a clock signal to functional units of the IC with the same frequency as the clock generator output. In the second state, the GC reduces the frequency of the clock signal. In one embodiment, the GC masks out selected cycles of the clock generator output to reduce the clock signal frequency. The FSM monitors the operation of the IC to transition from the first state to the second state when selected “low workload” conditions are detected (e.g., long latency cache miss). Similarly, the FSM transitions from the second state to the first state when selected “non-low workload” conditions are detected.
    Type: Grant
    Filed: September 30, 2002
    Date of Patent: May 23, 2006
    Assignee: Intel Corporation
    Inventors: Itamar S. Kazachinsky, Doron Orenstein
  • Publication number: 20060095807
    Abstract: A method and apparatus for changing the configuration of a multi-core processor is disclosed. In one embodiment, a throttle module (or throttle logic) may determine the amount of parallelism present in the currently-executing program, and change the execution of the threads of that program on the various cores. If the amount of parallelism is high, then the processor may be configured to run a larger amount of threads on cores configured to consume less power. If the amount of parallelism is low, then the processor may be configured to run a smaller amount of threads on cores configured for greater scalar performance.
    Type: Application
    Filed: September 28, 2004
    Publication date: May 4, 2006
    Inventors: Edward Grochowski, John Shen, Hong Wang, Doron Orenstein, Gad Sheaffer, Ronny Ronen, Murali Annavaram
  • Patent number: 6944720
    Abstract: A memory system is provided for storing multiple data types. The memory system includes a main memory, a local cache, and a translation unit. The local cache has multiple entries, each of which includes a data field to store data and a status field to indicate a storage state for the stored data. The translation unit includes a translation lookaside buffer (TLB) and a status-cache (STC). The TLB stores address translations for data in the main memory, and the STC stores storage states for data indicated by the address translations.
    Type: Grant
    Filed: March 27, 2003
    Date of Patent: September 13, 2005
    Assignee: Intel Corporation
    Inventors: Zeev Sperber, Guy Peled, Doron Orenstein, Ehud Cohen, Gabi Malka
  • Patent number: 6886105
    Abstract: A method and apparatus for resuming operations from a low latency wake-up low power state. One embodiment provides a system including a processor, an operating system, and a memory subsystem that requires initialization commands to exit a memory low power state. Control logic detects exit from an operating system low latency low power state and responsively generates a plurality of initialization commands to remove the memory subsystem from the memory low power state prior to deasserting a stop clock signal and allowing execution to resume.
    Type: Grant
    Filed: February 14, 2000
    Date of Patent: April 26, 2005
    Assignee: Intel Corporation
    Inventors: Opher Kahn, Doron Orenstein
  • Publication number: 20050027941
    Abstract: Apparatus, system and methods are provided for performing speculative data prefetching in a chip multiprocessor (CMP). Data is prefetched by a helper thread that runs on one core of the CMP while a main program runs concurrently on another core of the CMP. Data prefetched by the helper thread is provided to the helper core. For one embodiment, the data prefetched by the helper thread is pushed to the main core. It may or may not be provided to the helper core as well. A push of prefetched data to the main core may occur during a broadcast of the data to all cores of an affinity group. For at least one other embodiment, the data prefetched by a helper thread is provided, upon request from the main core, to the main core from the helper core's local cache.
    Type: Application
    Filed: July 31, 2003
    Publication date: February 3, 2005
    Inventors: Hong Wang, Perry Wang, Jeffery Brown, Per Hammarlund, George Chrysos, Doron Orenstein, Steve Liao, John Shen
  • Patent number: 6724391
    Abstract: The present invention provides a mechanism for implementing z-compression in a manner that is transparent to the user. Blocks of z-data are associated with storage locations in a z-data buffer in cleared, compressed, or uncompressed data states. Operations to the z-data buffer are monitored for selected operations. These operations may include clear or lock operations. If a selected operation is detected, a modified version of the selected operation is implemented to mask differences between the storage states of the data blocks.
    Type: Grant
    Filed: June 30, 2000
    Date of Patent: April 20, 2004
    Assignee: Intel Corporation
    Inventors: Guy Peled, Zeev Sperber, Doron Orenstein, Guiliermo Savranski
  • Publication number: 20040064752
    Abstract: A clock frequency control unit for an integrated circuit (IC) includes a clock generator, a finite state machine (FSM), and a gating circuit (GC). The FSM has at least first and second states corresponding to non-low workload low workload states, respectively. In the first state, the GC provides a clock signal to functional units of the IC with the same frequency as the clock generator output. In the second state, the GC reduces the frequency of the clock signal. In one embodiment, the GC masks out selected cycles of the clock generator output to reduce the clock signal frequency. The FSM monitors the operation of the IC to transition from the first state to the second state when selected “low workload” conditions are detected (e.g., long latency cache miss). Similarly, the FSM transitions from the second state to the first state when selected “non-low workload” conditions are detected.
    Type: Application
    Filed: September 30, 2002
    Publication date: April 1, 2004
    Inventors: Itamar S. Kazachinsky, Doron Orenstein
  • Publication number: 20040003215
    Abstract: A method and apparatus for executing low power validations for high confidence predictions. More particularly, the present invention pertains to using confidence levels of speculative executions to decrease power consumption of a processor without affecting its performance. Non-critical instructions, or those instructions whose prediction, rather than verification, lie on the critical path, can thus be optimized to consume less power.
    Type: Application
    Filed: June 28, 2002
    Publication date: January 1, 2004
    Inventors: Evgeni Krimer, Bishara Shomar, Ronny Ronen, Doron Orenstein
  • Publication number: 20030191903
    Abstract: A memory system is provided for storing multiple data types. The memory system includes a main memory, a local cache, and a translation unit. The local cache has multiple entries, each of which includes a data field to store data and a status field to indicate a storage state for the stored data. The translation unit includes a translation lookaside buffer (TLB) and a status-cache (STC). The TLB stores address translations for data in the main memory, and the STC stores storage states for data indicated by the address translations.
    Type: Application
    Filed: March 27, 2003
    Publication date: October 9, 2003
    Inventors: Zeev Sperber, Guy Peled, Doron Orenstein, Ehud Cohen, Gabi Malka
  • Publication number: 20030188212
    Abstract: A method and apparatus for resuming operations from a low latency wake-up low power state. One embodiment provides a system including a processor, an operating system, and a memory subsystem that requires initialization commands to exit a memory low power state. Control logic detects exit from an operating system low latency low power state and responsively generates a plurality of initialization commands to remove the memory subsystem from the memory low power state prior to deasserting a stop clock signal and allowing execution to resume.
    Type: Application
    Filed: February 14, 2000
    Publication date: October 2, 2003
    Inventors: Opher Kahn, Doron Orenstein
  • Patent number: 6580427
    Abstract: A graphics system is provided to implement compression of depth or z-data. The graphic system includes a buffer, a status table, and a read/write unit. The buffer stores depth data for multiple blocks of pixels in associated buffer entries. The status table stores status values for the entries of the buffer. The status value for a given entry indicates an access mode for the corresponding depth data according to whether the data is compressed, uncompressed or in a reference state. The read/write unit implements data accesses for a given entry responsive to the status value associated with the entry.
    Type: Grant
    Filed: June 30, 2000
    Date of Patent: June 17, 2003
    Assignee: Intel Corporation
    Inventors: Doron Orenstein, Guy Peled, Zeev Sperber, Ehud Cohen, Gabi Malka
  • Patent number: 6557083
    Abstract: A memory system is provided for storing multiple data types. The memory system includes a main memory, a local cache, and a translation unit. The local cache has multiple entries, each of which includes a data field to store data and a status field to indicate a storage state for the stored data. The translation unit includes a translation lookaside buffer (TLB) and a status-cache (STC). The TLB stores address translations for data in the main memory, and the STC stores storage states for data indicated by the address translations.
    Type: Grant
    Filed: June 30, 2000
    Date of Patent: April 29, 2003
    Assignee: Intel Corporation
    Inventors: Zeev Sperber, Guy Peled, Doron Orenstein, Ehud Cohen, Gabi Malka
  • Publication number: 20020083353
    Abstract: A system and corresponding method use a PAUSE instruction as a low power hint in a single threaded or multithreaded environment using “processor slow mode.” One embodiment actually lowers the frequency of the processor clock. Another embodiment virtually lowers the frequency of the processor clock by gating M clock cycles out of every N clock cycles. When all threads have issued a PAUSE instruction, the processor enters slow mode and remains there for a while. After this while, the processor returns to normal mode. Alternatively, an event, such as an interrupt or an exception, can cause the processor to return to normal mode from slow mode.
    Type: Application
    Filed: December 7, 2000
    Publication date: June 27, 2002
    Inventors: Doron Orenstein, Ronny Ronen
  • Patent number: 5835748
    Abstract: A method and apparatus for executing different sets of instructions that cause a processor to perform different data type operations on different physical registers files that logically appear to software as a single aliased register file. According to one aspect of the invention, a processor is provided that includes at least two physical register files--one for executing scalar data type operations and the other for executing packed data type operations. In addition, the processor includes a transition unit that is configured to cause the two physical register files to logically appear to software executing on the processor as a single logical register file.
    Type: Grant
    Filed: December 19, 1995
    Date of Patent: November 10, 1998
    Assignee: Intel Corporation
    Inventors: Doron Orenstein, Ofri Wechsler, Millind Mittal, Andrew F. Glew, Larry M. Mennemeier, Alexander D. Peleg, David Bistry, Carole Dulong, Eiichi Kowashi, Benny Eitan, Derrick Lin, Ramamohan R. Vakkalagadda
  • Patent number: 5787026
    Abstract: The invention provides a method and apparatus for providing operand reads in a processor pipeline. According to one aspect of the invention, a method is described for executing an instruction in a computer pipeline that requires different operands be read from the same register file in different stages of the computer pipeline. According to another aspect of the invention, a method is described for executing an instruction in a processor pipeline. According to this method, at least a first operand is read from a register file in a first stage of the processor pipeline. If execution of the instruction causes the processor to place the first operand in a storage area other than the register file, then the first operand in written to that storage area in a subsequent stage of the processor pipeline. Otherwise, one or more ALU operations are performed on the first operand and at least a second operand in a different subsequent stage of the processor pipeline.
    Type: Grant
    Filed: December 20, 1995
    Date of Patent: July 28, 1998
    Assignee: Intel Corporation
    Inventors: Doron Orenstein, Millind Mittal, Ofri Wechsler
  • Patent number: 5450605
    Abstract: The specification discloses a method and apparatus for determining the length of variable-length instructions that appear sequentially in an instruction stream without differentiation. The apparatus may be used to facilitate parallel processing of such variable-length instructions by a computer system.
    Type: Grant
    Filed: January 28, 1993
    Date of Patent: September 12, 1995
    Assignee: Intel Corporation
    Inventors: Edward Grochowski, Kenneth Shoemaker, Uri Weiser, Doron Orenstein