Patents by Inventor Stephan Jourdan

Stephan Jourdan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20070283129
    Abstract: According to one embodiment, a method is disclosed. The method includes receiving a value at a vector length (VL) tracker and establishing a VL for subsequent micro-operations (?ops) that are to be executed corresponding to the value.
    Type: Application
    Filed: December 28, 2005
    Publication date: December 6, 2007
    Inventors: Stephan Jourdan, Avinash Sodani, Michael Fetterman, Per Hammarlund, Glenn Hinton
  • Publication number: 20070157188
    Abstract: The present subject matter relates to operation frame filtering, building, and execution. Some embodiments include identifying a frame signature, counting a number of execution occurrences of the frame signature, and building a frame of operations to execute instead of operations identified by the frame signature.
    Type: Application
    Filed: December 30, 2005
    Publication date: July 5, 2007
    Inventors: Stephan Jourdan, Per Hammarlund, Alexandre Farcy, John Miller
  • Publication number: 20070156990
    Abstract: A method is disclosed. The method includes scheduling a load operation at least twice the size of a maximum access supported by a memory device, dividing the load operation into a plurality of separate load operation segments having a size equivalent to the maximum access supported by the memory device, and performing each of the plurality of load operation segments. A further method is disclosed where a temporary register is used to minimize the number of memory accesses to support unaligned accesses.
    Type: Application
    Filed: December 30, 2005
    Publication date: July 5, 2007
    Inventors: Per Hammarlund, Stephan Jourdan, Michael Fetterman, Glenn Hinton, Sebastien Hily, Ronak Singhal
  • Publication number: 20070157008
    Abstract: Microarchitecture policies and structures to predict execution clusters and facilitate inter-cluster communication are disclosed. In disclosed embodiments, sequentially ordered instructions are decoded into micro-operations. Execution of one set of micro-operations is predicted to involve execution resources to perform memory access operations and inter-cluster communication, but not to perform branching operations. Execution of a second set of micro-operations is predicted to involve execution resources to perform branching operations but not to perform memory access operations. The micro-operations are partitioned for execution in accordance with these predictions, the first set of micro-operations to a first cluster of execution resources and the second set of micro-operations to a second cluster of execution resources. The first and second sets of micro-operations are executed out of sequential order and are retired to represent their sequential instruction ordering.
    Type: Application
    Filed: December 30, 2005
    Publication date: July 5, 2007
    Inventors: Avinash Sodani, Alexandre Farcy, Stephan Jourdan, Per Hammarlund, Mark Davis
  • Publication number: 20070157007
    Abstract: Apparatuses and methods for dead instruction identification are disclosed. In one embodiment, an apparatus includes an instruction buffer and a dead instruction identifier. The instruction buffer is to store an instruction stream having a single entry point and a single exit point. The dead instruction identifier is to identify dead instructions based on a forward pass through the instruction stream.
    Type: Application
    Filed: December 29, 2005
    Publication date: July 5, 2007
    Inventors: Stephan Jourdan, Matthew Merten, Alexandre Farcy
  • Publication number: 20070157006
    Abstract: Microarchitecture policies and structures partition execution resource clusters. In disclosed microarchitecture embodiments, micro-operations representing a sequential instruction ordering are partitioned into a two sets. To one set of micro-operations execution resources are allocated from a cluster of execution resources that can perform memory access operations but not branching operations. To the other set of micro-operations execution resources are allocated from a cluster of execution resources that can perform branching operations but not memory access operations. The first and second sets of micro-operations may be executed out of sequential order but are retired to represent their sequential instruction ordering.
    Type: Application
    Filed: December 30, 2005
    Publication date: July 5, 2007
    Inventors: Stephan Jourdan, Avinash Sodani, Alexandre Farcy, Per Hammarlund, Sebastien Hily, Mark Davis
  • Publication number: 20070130448
    Abstract: Methods and apparatus to identify memory communications are described. In one embodiment, an access to a stack pointer is monitored, e.g., to maintain a stack tracker structure. The information stored in the stack tracker structure may be utilized to generate a distance value corresponding to a relative distance between a load instruction and a previous store instruction.
    Type: Application
    Filed: December 1, 2005
    Publication date: June 7, 2007
    Inventors: Stephan Jourdan, Mark Davis, Sebastien Hily
  • Publication number: 20070079179
    Abstract: In one embodiment, the present invention includes a method for executing an operation on low order portions of first and second source operands using a first execution stack of a processor and executing the operation on high order portions of the first and second source operands using a second execution stack of the processor, where the operation in the second execution stack is staggered by one or more cycles from the operation in the first execution stack. Other embodiments are described and claimed.
    Type: Application
    Filed: September 30, 2005
    Publication date: April 5, 2007
    Inventors: Stephan Jourdan, Avinash Sodani, Michael Fetterman, Per Hammarlund, Ronak Singhal, Glenn Hinton
  • Publication number: 20070061555
    Abstract: Method, apparatus, and system for tracking call returns. At least one embodiment maps the locations of a return instruction pointer within a speculative return stack buffer and a committed return stack buffer to determine a return stack buffers from which the return instruction pointer should be retrieved.
    Type: Application
    Filed: September 15, 2005
    Publication date: March 15, 2007
    Inventors: Michael St. Clair, Boyd Phelps, Stephan Jourdan
  • Patent number: 7181597
    Abstract: A system and method of managing processor instructions provides enhanced performance. The system and method provide for decoding a first instruction into a plurality of operations with a decoder. A first copy of the operations is passed from the decoder to a build engine associated with a trace cache. The system and method further provide for passing a second copy of the operation from the decoder directly to a back end allocation module such that the operations bypass the build engine and the allocation module is in a decoder reading state.
    Type: Grant
    Filed: August 31, 2005
    Date of Patent: February 20, 2007
    Assignee: Intel Corporation
    Inventors: John Alan Miller, Stephan Jourdan
  • Publication number: 20070005940
    Abstract: Embodiments of the present invention provide an apparatus, system, and method of routing a source operand. Some demonstrative embodiments my include replacing a source operand of a micro operation to be executed by an execution unit with a value type representing a source value, e.g., if the source operand corresponds to the source value. Other embodiments are described and claimed.
    Type: Application
    Filed: June 30, 2005
    Publication date: January 4, 2007
    Inventors: Zeev Sperber, Guillermo Savransky, Sagi Lahav, Thierry Pons, Stephan Jourdan
  • Patent number: 7136992
    Abstract: A method and apparatus for a loop predictor for predicting the end of a loop is disclosed. In one embodiment, the loop predictor may have a predict counter to hold a predict count representing the expected number of times that a predictor stew value will repeat during the execution of a given loop. The loop predictor may also have one or more running counters to hold a count of the times that the stew value has repeated during the execution of the present loop. When the counter values match the predictor may issue a prediction that the loop will end.
    Type: Grant
    Filed: December 17, 2003
    Date of Patent: November 14, 2006
    Assignee: Intel Corporation
    Inventors: Subramaniam Maiyuran, Peter J. Smith, Stephan Jourdan
  • Patent number: 7130965
    Abstract: Embodiments of the present invention relate to a memory management scheme and apparatus that enables efficient cache memory management. The method includes writing an entry to a store buffer at execute time; determining if the entry's address is in a first-level cache associated with the store buffer before retirement; and setting a status bit associated with the entry in said store buffer, if the address is in the cache in either exclusive or modified state. The method further includes immediately writing the entry to the first-level cache at or after retirement when the status bit is set; and de-allocating the entry from said store buffer at retirement. The method further may comprise resetting the status bit if the cacheline is allocated over or is evicted from the cache before the store buffer entry attempts to write to the cache.
    Type: Grant
    Filed: December 23, 2003
    Date of Patent: October 31, 2006
    Assignee: Intel Corporation
    Inventors: Per H. Hammarlund, Stephan Jourdan, Sebastien Hily, Aravindh Baktha, Hermann Gartler
  • Publication number: 20060123219
    Abstract: Fusing micro-operations (uops) together. Intra-instruction fusing can increase cache memory storage efficiency and computer instruction processing bandwidth within a microprocessor without incurring significant computer system cost. Uops are fused, stored in a cache memory, un-fused, executed in parallel, and retired in order to optimize cost and performance.
    Type: Application
    Filed: January 24, 2006
    Publication date: June 8, 2006
    Inventors: Nicholas Samra, Stephan Jourdan
  • Publication number: 20060098518
    Abstract: Embodiments of the present invention provide a method and system for staging the data output from an addressable memory location as a plurality of fields. In embodiments, each field of a data item that is stored at an address may be output during a different clock cycle. In further embodiments, the most time critical field may be output first.
    Type: Application
    Filed: December 23, 2005
    Publication date: May 11, 2006
    Inventors: Stephan Jourdan, Boyd Phelps, Chris Yuker
  • Publication number: 20050289324
    Abstract: A system and method of managing processor instructions provides enhanced performance. The system and method provide for decoding a first instruction into a plurality of operations with a decoder. A first copy of the operations is passed from the decoder to a build engine associated with a trace cache. The system and method further provide for passing a second copy of the operation from the decoder directly to a back end allocation module such that the operations bypass the build engine and the allocation module is in a decoder reading state.
    Type: Application
    Filed: August 31, 2005
    Publication date: December 29, 2005
    Inventors: John Miller, Stephan Jourdan
  • Patent number: 6952764
    Abstract: A method for stopping replay tornadoes in a processor. The method of one embodiment comprises scheduling an instruction for execution speculatively. A determination is made whether the instruction executed correctly. The instruction is routed to a replay mechanism if the instruction did not execute correctly. A determination is made whether a replay tornado exists. The instruction is routed for re-execution if the instruction executed incorrectly and no replay tornado exists. Breaking the replay tornado if the replay tornado exists. Replay safe instructions in the pipeline are retired. Non-replay safe instructions in the pipeline are marked for re-execution. The non-replay safe instructions are rescheduled for re-execution.
    Type: Grant
    Filed: December 31, 2001
    Date of Patent: October 4, 2005
    Assignee: Intel Corporation
    Inventors: David J. Sager, Stephan Jourdan, Per Hammarlund
  • Patent number: 6950924
    Abstract: A system and method of managing processor instructions provides enhanced performance. The system and method provide for decoding a first instruction into a plurality of operations with a decoder. A first copy of the operations is passed from the decoder to a build engine associated with a trace cache. The system and method further provide for passing a second copy of the operation from the decoder directly to a back end allocation module such that the operations bypass the build engine and the allocation module is in a decoder reading state.
    Type: Grant
    Filed: January 2, 2002
    Date of Patent: September 27, 2005
    Assignee: Intel Corporation
    Inventors: John Alan Miller, Stephan Jourdan
  • Publication number: 20050193278
    Abstract: Systems and methods of managing threads provide for supporting a plurality of logical threads with a plurality of simultaneous physical threads in which the number of logical threads may be greater than or less than the number of physical threads. In one approach, each of the plurality of logical threads is maintained in one of a wait state, an active state, a drain state, and a stall state. A state machine and hardware sequencer can be used to transition the logical threads between states based on triggering events and whether or not an interruptible point has been encountered in the logical threads. The logical threads are scheduled on the physical threads to meet, for example, priority, performance or fairness goals. It is also possible to specify the resources that are available to each logical thread in order to meet these and other, goals. In one example, a single logical thread can speculatively use more than one physical thread, pending a selection of which physical thread should be committed.
    Type: Application
    Filed: December 29, 2003
    Publication date: September 1, 2005
    Inventors: Per Hammarlund, Stephan Jourdan, Pierre Michaud, Alexandre Farcy, Morris Marden, Robert Hinton, Douglas Carmean
  • Publication number: 20050149707
    Abstract: Systems and methods of processing branch instructions provide for a bimodal predictor and a plurality of global predictors. The bimodal predictor is coupled to a prediction selector, where the bimodal predictor generates a bimodal prediction for branch instructions. The plurality of global predictors is coupled to the prediction selector, where each global predictor generates a corresponding global prediction for a branch instruction using different history or stew lengths. The prediction selector selects branch predictions from the bimodal prediction and the global predictions in order to arbitrate between predictors. The arbitration, update, and allocation schemes are designed to choose the most accurate predictor for each branch. Lower level predictors are used as filters to increase effective predictor capacity. Allocate and update schemes minimize aliasing between predictors.
    Type: Application
    Filed: December 24, 2003
    Publication date: July 7, 2005
    Inventors: Stephan Jourdan, Mark Davis, Pierre Michaud