Including Coprocessor Patents (Class 712/34)
  • Patent number: 8127113
    Abstract: System and method for generating hardware accelerators and processor offloads. System for hardware acceleration. System and method for implementing an asynchronous offload. Method of automatically creating a hardware accelerator. Computerized method for automatically creating a test harness for a hardware accelerator from a software program. System and method for interconnecting hardware accelerators and processors. System and method for interconnecting a processor and a hardware accelerator. Computer implemented method of generating a hardware circuit logic block design for a hardware accelerator automatically from software. Computer program and computer program product stored on tangible media implementing the methods and procedures of the invention.
    Type: Grant
    Filed: December 1, 2006
    Date of Patent: February 28, 2012
    Assignee: Synopsys, Inc.
    Inventors: Navendu Sinha, William Charles Jordan, Bryon Irwin Moyer, Stephen John Joseph Fricke, Roberto Attias, Akash Renukadas Deshpande, Vineet Gupta, Shobhit Sonakiya
  • Patent number: 8122156
    Abstract: A method is provided for processing operation command in a computer that has a display and a host which includes a first display processing unit for local displaying and a second display processing unit for remote displaying. The operation command is from a remote data processing terminal. The method includes: receiving a first operation command from the data processing terminal, the first operation command being a power-on command; performing power-on of the computer, shielding the first display processing unit and loading only a driver of the second display processing unit according to first operation command; receiving a second operation command from the data processing terminal, the second operation command being not a power-on command; executing the second operation command to obtain operation results, the operation results being image data processed by the second display processing unit, and sending the operation results to the remote data processing terminal, for remote displaying.
    Type: Grant
    Filed: January 13, 2009
    Date of Patent: February 21, 2012
    Assignee: Lenovo (Beijing) Limited
    Inventors: Yiqiang Yan, Shaoping Peng, Bo Liu, Xiaohua Jiang, Chengkun Sun
  • Patent number: 8122229
    Abstract: A dispatch mechanism is provided for dispatching instructions of an executable from a host processor to a heterogeneous co-processor. According to certain embodiments, cache coherency is maintained between the host processor and the heterogeneous co-processor, and such cache coherency is leveraged for dispatching instructions of an executable that are to be processed by the co-processor. For instance, in certain embodiments, a designated portion of memory (e.g., “UCB”) is utilized, wherein a host processor may place information in such UCB and the co-processor can retrieve information from the UCB (and vice-versa). The UCB may thus be used to dispatch instructions of an executable for processing by the co-processor. In certain embodiments, the co-processor may comprise dynamically reconfigurable logic which enables the co-processor's instruction set to be dynamically changed, and the dispatching operation may identify one of a plurality of predefined instruction sets to be loaded onto the co-processor.
    Type: Grant
    Filed: September 12, 2007
    Date of Patent: February 21, 2012
    Assignee: Convey Computer
    Inventors: Steven J. Wallach, Tony Brewer
  • Patent number: 8095934
    Abstract: In a first data delivery apparatus, a user-input receiving unit receives data and a request for executing a workflow, a first data processing unit processes the data based on the workflow, a destination obtaining unit obtains a destination from the workflow, and a transferring unit transfers the data, the workflow, and a progress of the workflow to the destination. In a second data delivery apparatus, a transfer receiving unit receives the data, the workflow, and the progress of the workflow, a workflow executing unit executes the workflow from a non-executed part based on the progress of the workflow, and a data delivery unit delivers data to other destination.
    Type: Grant
    Filed: March 7, 2008
    Date of Patent: January 10, 2012
    Assignee: Ricoh Company, Ltd.
    Inventor: Tetsuhiko Omori
  • Patent number: 8085273
    Abstract: A multi-mode parallel 3-D graphics system having multiple graphics processing pipelines with multiple GPUs supporting a parallel graphics rendering process having time, frame and object division modes of operation, wherein each GPU comprises video memory, a geometry processing subsystem and a pixel processing subsystem, and wherein 3D scene profiling is performed in real-time, and the parallelization state/modes of the system are dynamically controlled to meet graphics application requirements. The multiple modes of parallel graphics rendering use real-time graphics application profiling, and dynamic control over time-division, frame-division, and object-division modes of parallel operation, within the same parallel graphics platform, which can be realized on PC-based computing system architectures.
    Type: Grant
    Filed: January 18, 2007
    Date of Patent: December 27, 2011
    Assignee: Lucid Information Technology, Ltd
    Inventors: Reuven Bakalash, Yaniv Leviathan
  • Patent number: 8082429
    Abstract: An information processing apparatus performs switching between an exception handler and normal processing. The information processing apparatus includes a processor; a data processing unit that performs particular processing upon receiving a processing request from the processor; an interrupt controller that issues an interrupt request to the processor; and an exception control unit that controls the interrupt controller, wherein the data processing unit is connected with the exception control unit via a dedicated line. The data processing unit includes a notification unit that notifies, via the dedicated line, the exception control unit of status information indicating current status of the data processing unit, and based on the notified status information and setup information set by the processor, the exception control unit judges whether to cause the interrupt controller to issue an interrupt request to execute an exception handler to the processor.
    Type: Grant
    Filed: March 21, 2011
    Date of Patent: December 20, 2011
    Assignee: Panasonic Corporation
    Inventors: Hideshi Nishida, Takeshi Furuta, Tetsuya Tanaka, Kozo Kimura, Tokuzo Kiyohara
  • Patent number: 8078838
    Abstract: A multiport semiconductor memory device having a processor wake-up function and multiprocessor system, the multiprocessor system including a first processor configured to perform a first predetermined task; a second processor configured to perform a second predetermined task; and a multiport semiconductor memory device coupled to the first and second processors. The multiport semiconductor memory device includes a memory cell array having at least one shared memory area; a first port coupled to the at least one shared memory area; a second port coupled to the at least one shared memory area; and a wake-up signal generator. The first processor is coupled to the at least one shared memory area via the first port, the second processor is coupled to the at least one shared memory area via the second port, and the wake-up signal generator is coupled to the first processor and the second processor.
    Type: Grant
    Filed: September 23, 2008
    Date of Patent: December 13, 2011
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jin-Hyoung Kwon, Han-Gu Sohn, Kwang-Myeong Jang
  • Patent number: 8074224
    Abstract: Embodiments of the present invention facilitate dynamically adapting to state information changes in a graphics processing environment. In one embodiment, a master register holds state information corresponding to units of work (threads) to be performed. The state information in the master register is copied to a per-group state register when a group of threads is to be launched. The per-group state register is coupled to processing engines configured to process the threads, so that the processing engines read state information from the per-group state register rather than the master register. In another embodiment, a number of master registers may be used to store state information for different types of threads.
    Type: Grant
    Filed: December 19, 2005
    Date of Patent: December 6, 2011
    Assignee: NVIDIA Corporation
    Inventors: Bryon S. Nordquist, Brett W. Coon
  • Patent number: 8069338
    Abstract: A data processing device includes a program execution section to supply an operation direction signal to a peripheral device based on an executed program and execute a branch operation in response to a branch direction signal, and a branch wait operation section to receive the branch direction signal and a peripheral device status notification signal indicating whether an operation performed in the peripheral device is being executed. The branch wait operation section outputs an instruction issue stop signal directing waiting of the branch operation to the program execution section if the branch direction signal is input during a period when the peripheral device status notification signal is active indicating that the operation in the peripheral device is being executed.
    Type: Grant
    Filed: November 14, 2008
    Date of Patent: November 29, 2011
    Assignee: Renesas Electronics Corporation
    Inventors: Hitoshi Suzuki, Yukihiko Akaike
  • Patent number: 8060727
    Abstract: There is provided a novel microprogrammed processor (100) by combining two or more processor cores (10) in such a way that the processor cores can share the special microprogram memory resource (20) that is located deep inside the processor architecture. In other words, the novel microprogrammed processor (100) basically includes at least two processor cores (10), and a common internal microprogram control store (20) including microcode instructions for controlling at least the internal standard operation of the multiple processor cores, and suitable elements (30) for providing time-shared access to the microprogram control store by the processor cores.
    Type: Grant
    Filed: June 20, 2008
    Date of Patent: November 15, 2011
    Assignees: Conemtech AB, IMSYS AB
    Inventor: Stefan Blixt
  • Patent number: 8055882
    Abstract: Disclosed is a multiprocessor apparatus including a plurality of processors connected to a common bus, a co-processor provided in common to the processors, an arbitration circuit that arbitrates contention among the processors with respect to use of a resource in the co-processor through a tightly coupled bus by the processors and a multiplexer coupled to the arbitration circuit, coupled to the processors through a local buses, and coupled to the co-processor through the local buses to transfer the commands received from the respective processors to the co-processor in accordance with a permission signal output by the arbitration circuit.
    Type: Grant
    Filed: July 18, 2008
    Date of Patent: November 8, 2011
    Assignee: Renesas Electronics Corporation
    Inventors: Shinji Kashiwagi, Hiroyuki Nakajima
  • Patent number: 8046565
    Abstract: Systems and methods for accelerators which may execute a program in conjunction with a PC are disclosed. In one embodiment, an accelerator includes a plurality of calculation units, each calculation unit operable to execute the program in parallel, an operation control unit configured to control an operation capability or a processing capability of at least one of the plurality of calculation units, and a control unit configured to determine a corresponding operation capability or processing capability for the plurality of calculation units based on load information on the program to be executed and control the operation control unit accordingly.
    Type: Grant
    Filed: December 5, 2007
    Date of Patent: October 25, 2011
    Assignee: Kabushiki Kaisha Toshiba
    Inventor: Hideki Yasukawa
  • Patent number: 8036243
    Abstract: A single chip protocol converter integrated circuit (IC) capable of receiving packets generating according to a first protocol type and processing said packets to implement protocol conversion and generating converted packets of a second protocol type for output thereof, the process of protocol conversion being performed entirely within the single integrated circuit chip. The single chip protocol converter can be further implemented as a macro core in a system-on-chip (SoC) implementation, wherein the process of protocol conversion is contained within a SoC protocol conversion macro core without requiring the processing resources of a host system. Packet conversion may additionally entail converting packets generated according to a first protocol version level and processing the said packets to implement protocol conversion for generating converted packets according to a second protocol version level, but within the same protocol family type.
    Type: Grant
    Filed: August 11, 2008
    Date of Patent: October 11, 2011
    Assignee: International Business Machines Corporation
    Inventors: Christos J. Georgiou, Victor L. Gregurick, Indira Nair, Valentina Salapura
  • Patent number: 8032734
    Abstract: A coprocessor interface unit for interfacing a coprocessor to an out-of-order execution pipeline, and applications thereof. In an embodiment, the coprocessor interface unit includes an in-order instruction queue, a coprocessor load data queue, and a coprocessor store data queue. Instructions are written into the in-order instruction queue by an instruction dispatch unit. Instructions exit the in-order instruction queue and enter the coprocessor. In the coprocessor, the instructions operate on data read from the coprocessor load data queue. Data is written back, for example, to memory or a register file by inserting the data into the out-of-order execution pipeline, either directly or via the coprocessor store data queue, which writes back the data.
    Type: Grant
    Filed: September 6, 2006
    Date of Patent: October 4, 2011
    Assignee: MIPS Technologies, Inc.
    Inventors: Kjeld Svendsen, Maria Ukanwa
  • Patent number: 8006069
    Abstract: Inter-processor communication systems and methods that define within the instruction set of the microprocessor a command for directing the microprocessor to relinquish control over at least one of the microprocessor's internal registers. The microprocessor may then signal a communication interface that collects data from external sources. The communication interface takes control over the internal register released by the microprocessor and inputs the collected external data directly into the internal register of the microprocessor. Once data is place into the internal register, control of that register may be returned to the microprocessor.
    Type: Grant
    Filed: October 5, 2007
    Date of Patent: August 23, 2011
    Assignee: Synopsys, Inc.
    Inventors: Simon Jones, Carl Norman Graham, Kar-Lik Wong
  • Patent number: 8006068
    Abstract: Access to data storage is described. A general-purpose processor and an auxiliary processing unit (APU) interface coupled to the general-purpose processor are provided. Data storage coupled to the general-purpose processor via the APU interface is provided for a fixed or low variable read latency access and a fixed write latency access to the data storage. A first instruction is passed to the general-purpose processor and to the APU interface. The first instruction is identified as part of a set of instructions accessible by the APU interface. The first instruction is used to write data into the data storage. A second instruction is passed to the general-purpose processor and to the APU interface. The second instruction is identified as part of the set of instructions accessible by the APU interface. The second instruction is used to read the data from the data storage, and the data is then output.
    Type: Grant
    Filed: April 18, 2007
    Date of Patent: August 23, 2011
    Assignee: Xilinx, Inc.
    Inventor: Glenn C. Steiner
  • Patent number: 8006074
    Abstract: Methods and apparatus are provided for efficiently executing extended custom instructions on a programmable chip. Components of a processor core such as arithmetic logic units, program sequencer units, and address generation units are integrated with customizable logic blocks. Various customizable logic blocks can be invoked in a pipelined manner using an available customized instruction set while allowing a processor to continue simultaneous operation. Program counter snooping is also provided to add custom instruction functionality to a processor with no additional provisions for adding custom instructions.
    Type: Grant
    Filed: December 24, 2003
    Date of Patent: August 23, 2011
    Assignee: Altera Corporation
    Inventors: Tracy Miranda, Steven Perry
  • Publication number: 20110191539
    Abstract: A data processing apparatus is provided, configured to carry out data processing operations on behalf of a main data processing apparatus, comprising a coprocessor core configured to perform the data processing operations and a reset controller configured to cause the coprocessor core to reset. The coprocessor core performs its data processing in dependence on current configuration data stored therein, the current configuration data being associated with a current processing session. The reset controller is configured to receive pending configuration data from the main data processing apparatus, the pending configuration data associated with a pending processing session, and to store the pending configuration data in a configuration data queue. The reset controller is configured, when the coprocessor core resets, to transfer the pending configuration data from the configuration data queue to be stored in the coprocessor core, replacing the current configuration data.
    Type: Application
    Filed: February 3, 2010
    Publication date: August 4, 2011
    Applicant: ARM Limited
    Inventors: Ola Hugosson, Erik Persson, Pontus Borg
  • Patent number: 7987341
    Abstract: A computing machine includes a first buffer and a processor coupled to the buffer. The processor executes an application, a first data-transfer object, and a second data-transfer object, publishes data under the control of the application, loads the published data into the buffer under the control of the first data-transfer object, and retrieves the published data from the buffer under the control of the second data-transfer object. Alternatively, the processor retrieves data and loads the retrieved data into the buffer under the control of the first data-transfer object, unloads the data from the buffer under the control of the second data-transfer object, and processes the unloaded data under the control of the application. Where the computing machine is a peer-vector machine that includes a hardwired pipeline accelerator coupled to the processor, the buffer and data-transfer objects facilitate the transfer of data between the application and the accelerator.
    Type: Grant
    Filed: October 9, 2003
    Date of Patent: July 26, 2011
    Assignee: Lockheed Martin Corporation
    Inventors: Chandan Mathur, Scott Hellenbach, John W. Rapp
  • Patent number: 7973804
    Abstract: A circuit arrangement and method support a multithreaded rendering architecture capable of dynamically routing pixel fragments from a pixel fragment generator to any pixel shader from among a pool of pixel shaders. The pixel fragment generator is therefore not tied to a specific pixel shader, but is instead able to utilize multiple pixel shaders in a pool of pixel shaders to minimize bottlenecks and improve overall hardware utilization and performance during image processing.
    Type: Grant
    Filed: March 11, 2008
    Date of Patent: July 5, 2011
    Assignee: International Business Machines Corporation
    Inventors: Eric Oliver Mejdrich, Paul Emery Schardt, Robert Allen Shearer
  • Patent number: 7969187
    Abstract: A hardware interface in an integrated circuit is disclosed. The hardware interface comprises data storage coupled to store and provide data; a data shifter coupled to the data storage to at least bit shift the data obtained from the data storage; and a control circuit coupled to the data storage and the data shifter for controlling a transfer of the data from the data storage and the data shifter. The control circuit comprises a state machine for controlling operation of the data storage and the data shifter; and the state machine is programmable responsive to code executable by a processor coupled to an auxiliary processing unit to adapt to the auxiliary processing unit.
    Type: Grant
    Filed: August 6, 2010
    Date of Patent: June 28, 2011
    Assignee: Xilinx, Inc.
    Inventors: Stephen A. Neuendorffer, Paul M. Hartke, Paul R. Schumacher
  • Patent number: 7971030
    Abstract: An apparatus, method, and system for synchronicity independent, resource delegating, power and instruction optimizing processor is provided where instructions are delegated between various processing resources of the processor. An Integer Processing Unit (IPU) of the processor delegates complicated mathematical instructions to a Mathematical Processing Unit (MPU) of the processor. Furthermore, the processor puts underutilized processing resources to sleep thereby increasing power usage efficiency. A cache of the processor is also capable of accepting delegated operations from the IPU. As such, the cache performs various logical operations on delegated requests allowing it to lock and share memory without requiring extra processing cycles by the entire processor.
    Type: Grant
    Filed: August 6, 2003
    Date of Patent: June 28, 2011
    Assignee: MMAGIX Technology Limited
    Inventor: Daniel Shane O'Sullivan
  • Patent number: 7962553
    Abstract: A method for distributing low-priority maintenance tasks in a computer system employing a plurality of central processing units. In an illustrative embodiment, an activity flag is assigned to each of the plurality of central processing units. After a network packet is received, it is processed in one of the plurality of central processing units and the maintenance tasks are assigned to a central processing unit based on the status of its activity flag.
    Type: Grant
    Filed: July 19, 2007
    Date of Patent: June 14, 2011
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: Venkataraman Kamalaksha
  • Patent number: 7962721
    Abstract: There is provided an information processing apparatus. The apparatus comprises: a processor; at least one I2C device; and a processor support chip. The processor support chip comprises a local service controller and a jointly addressable memory space and has an interface to the processor for the transfer of information therebetween. The processor support chip also has an interface for communication with a service processor; and an I2C interface for communication with the at least one I2C device. The local service controller has exclusive read and write access to the I2C interface; and is operable to maintain a data structure indicating a current value associated with the I2C device in the jointly addressable memory space for access by the processor and the service processor.
    Type: Grant
    Filed: May 4, 2004
    Date of Patent: June 14, 2011
    Assignee: Oracle America, Inc.
    Inventors: James E. King, Rhod J. Jones, Paul J. Garnett
  • Patent number: 7961194
    Abstract: A method of controlling the mode of parallel operation of a multi-mode parallel graphics processing system (MMPGPS) embodied within a host computing system having (i) host memory space (HMS) for storing one or more graphics-based applications and a graphics library for generating graphics commands and data (GCAD) during the run-time (i.e. execution) of the graphics-based application, (ii) one or more CPUs for executing said graphics-based applications, (iii) a display device for displaying images containing graphics during the execution of said graphics-based applications, and (iv) a multi-mode parallel graphics rendering subsystem supporting multiple modes of parallel operation selected from the group consisting of object division, image division, and time division and having a plurality of graphic processing pipelines (GPPLs) supporting a parallel graphics rendering process that employs one of the object division, image division and/or time division modes of parallel operation.
    Type: Grant
    Filed: August 30, 2007
    Date of Patent: June 14, 2011
    Assignee: Lucid Information Technology, Ltd.
    Inventors: Reuven Bakalash, Yaniv Leviathan
  • Patent number: 7944450
    Abstract: A computing system capable of parallelizing the operation of multiple graphics processing units (GPUs) supported on a hybrid CPU/GPU fusion-architecture chip and/or on an external graphics card, and employing a multi-mode parallel graphics rendering subsystem having software and hardware implemented components. The computing system includes (i) CPU memory space for storing one or more graphics-based applications, (ii) one or more CPUs for executing the graphics-based applications, (iii) a multi-mode parallel graphics rendering subsystem supporting multiple modes of parallel operation, (iv) a plurality of graphic processing pipelines (GPPLs), implemented using the GPUs, and (vi) an automatic mode control module. During the run-time of the graphics-based application, the automatic mode control module automatically controls the mode of parallel operation of the multi-mode parallel graphics rendering subsystem so that the GPUs are driven in a parallelized manner.
    Type: Grant
    Filed: September 26, 2007
    Date of Patent: May 17, 2011
    Assignee: Lucid Information Technology, Ltd.
    Inventors: Reuven Bakalash, Yaniv Leviathan
  • Patent number: 7940274
    Abstract: A computing system capable of parallelizing the operation of multiple graphics processing units (GPUs) supported on an integrated graphic device (IGD) embodied within a bridge circuit, and employing a multi-mode parallel graphics rendering subsystem having software and hardware implemented components. The computing system includes (i) CPU memory space for storing one or more graphics-based applications, (ii) one or more CPUs for executing the graphics-based applications, (iii) an external graphics card supporting at least one GPU and being connected to the bridge circuit by way of a data communication interface, (iv) a multi-mode parallel graphics rendering subsystem supporting multiple modes of parallel operation, (v) a plurality of graphic processing pipelines (GPPLs), implemented using the GPUs, and (vi) an automatic mode control module.
    Type: Grant
    Filed: September 25, 2007
    Date of Patent: May 10, 2011
    Assignee: Lucid Information Technology, Ltd
    Inventors: Reuven Bakalash, Yaniv Leviathan
  • Patent number: 7934082
    Abstract: An information processing apparatus performs switching between an exception handler and normal processing. The information processing apparatus includes a processor a data processing unit that performs a particular processing upon receiving a processing request from the processor; an interrupt controller that issues an interrupt request to the processor; and an exception control unit that controls the interrupt controller, wherein the data processing unit is connected with the exception control unit via a dedicated line. The data processing unit includes a notification unit that notifies, via the dedicated line, the exception control unit of status information indicating current status of the data processing unit, and based on the notified status information and setup information set by the processor, the exception control unit judges whether to cause the interrupt controller to issue to the processor an interrupt request to execute an exception handler.
    Type: Grant
    Filed: August 19, 2005
    Date of Patent: April 26, 2011
    Assignee: Panasonic Corporation
    Inventors: Hideshi Nishida, Takeshi Furuta, Tetsuya Tanaka, Kozo Kimura, Tokuzo Kiyohara
  • Patent number: 7930519
    Abstract: A processor unit and a coprocessor unit are disclosed. In one embodiment, the processor unit includes a functional unit that receives a set of instructions in an instruction stream and provides the set of instructions to the coprocessor unit. The coprocessor executes the instructions and initiates transmission of a set of execution results corresponding to the set of instructions to the processor unit's functional unit. The processor functional unit may be coupled to the coprocessor unit through a shared bus circuit implementing a packet-based protocol. The processor unit and the coprocessor unit may share a coherent view of system memory. In various embodiments, the functional unit may alter entries in a translation lookaside buffer (TLB) located in the coprocessor unit, resume and suspend a thread executing on the coprocessor unit, etc.
    Type: Grant
    Filed: December 17, 2008
    Date of Patent: April 19, 2011
    Assignee: Advanced Micro Devices, Inc.
    Inventor: Michael Frank
  • Patent number: 7925862
    Abstract: A coprocessor (14) may be used to perform one or more specialized operations that can be off-loaded from a primary or general purpose processor (12). It is important to allow efficient communication and interfacing between the processor (12) and the coprocessor (14). In one embodiment, a coprocessor (14) generates and provides instructions (200, 220) to an instruction pipe (20) in the processor (12). Because the coprocessor (14) generated instructions are part of the standard instruction set of the processor (12), cache (70) coherency is easy to maintain. Also, circuitry (102) in coprocessor (14) may perform an operation on data while circuitry (106) in coprocessor (14) is concurrently generating processor instructions (200, 220).
    Type: Grant
    Filed: June 27, 2006
    Date of Patent: April 12, 2011
    Assignee: Freescale Semiconductor, Inc.
    Inventors: William C. Moyer, Kevin B. Traylor
  • Patent number: 7925863
    Abstract: Multiple hardware accelerators can be used to efficiently perform processes that would otherwise be performed by general purpose hardware running software. The software overhead and bus bandwidth associated with running multiple hardware acceleration processes can be reduced by chaining multiple independent hardware acceleration operations within a circuit card assembly. Multiple independent hardware accelerators can be configured on a single circuit card assembly that is coupled to a computing device. The computing device can generate a playlist of hardware acceleration operations identifying hardware accelerators and associated accelerator options. A task management unit on the circuit card assembly receives the playlist and schedules the hardware acceleration operations such that multiple acceleration operations may be successively chained together without intervening data exchanges with the computing device.
    Type: Grant
    Filed: June 9, 2008
    Date of Patent: April 12, 2011
    Assignee: LSI Corporation
    Inventor: Douglas Edward Hundley
  • Patent number: 7917907
    Abstract: Techniques for processing transmissions in a communications (e.g., CDMA) system. An aspect of the disclosed subject matter includes a method for processing instructions on a multithreaded processor. The multithreaded processor processes a plurality of threads via a plurality of processor pipelines. The method includes the step determining the operating frequency, F, at which the multithreaded processor operates. Then, the method determines a variable thread switch timeout state for triggering the switching of the processing among the plurality of active threads. The variable thread switch timeout state varies so that each of the plurality of active threads operates at a frequency of an allocated portion of the frequency, F. The allocated portion at which the active threads operate is determined at least in part in order to optimize the operation of the multithreaded processor.
    Type: Grant
    Filed: March 23, 2005
    Date of Patent: March 29, 2011
    Assignee: QUALCOMM Incorporated
    Inventors: Muhammad Ahmed, Sujat Jamil, Erich Plondke, Lucian Codrescu, William C. Anderson
  • Patent number: 7908462
    Abstract: The current invention provides a virtual world simulation system capable of hosting with massive amount of concurrent players by integrating commodity parallel co-processors into servers. The current invention proposes novel parallel processing algorithms to make use of commodity parallel co-processors like a graphic processing unit (GPU) or any specialized hardware with parallel architecture design like a field-programmable gate array (FPGA), to accelerate virtual world simulation.
    Type: Grant
    Filed: June 9, 2010
    Date of Patent: March 15, 2011
    Assignee: Zillians Incorporated
    Inventor: Mu Chi Sung
  • Patent number: 7904838
    Abstract: An integrated circuit includes a core-logic providing a core-logic output, a latch in communication with the core-logic to store a state of the core-logic output, and an isolation circuit for selectively interconnecting the core-logic output to an input of the latch. The circuit also includes and a power consumption controller in communication with the core-logic, the latch and the isolation circuit, for controlling the latch to store a state of the core-logic output, and output a corresponding signal. The controller is further operable to signal the isolation circuit to isolate the core-logic output from the latch by providing an output corresponding to predetermined value and transition the core-logic from a high power state and a low power state. This prevents transient signals from propagating to interconnected circuit blocks and external devices.
    Type: Grant
    Filed: August 15, 2007
    Date of Patent: March 8, 2011
    Assignee: ATI Technologies ULC
    Inventors: Aris Balatsos, Charles Leung, Siva Raghu Ram Voleti
  • Patent number: 7900021
    Abstract: An image processing apparatus for storing taken image data into a memory and for reading the stored image data to effect image processing thereof, including: a storage section for storing sequence codes that indicate processing procedures of a series of image processing for the image data; an image processing section for effecting the series of image processing on the read out image data; and a sequencer for controlling the image processing section based on the sequence codes read out from the storage section in effecting the series of image processing at the image processing section, wherein the sequence codes are composed of data coding all sequence instructions corresponding to the series of image processing and are tranferred and stored collectively from CPU to the storage section before starting the series of image processing every time when the series of image processing is to be effected.
    Type: Grant
    Filed: August 30, 2004
    Date of Patent: March 1, 2011
    Assignee: Olympus Corporation
    Inventors: Keisuke Nakazono, Akira Ueno, Motoo Azuma
  • Patent number: 7886129
    Abstract: A configurable coprocessor interface between a central processing unit (CPU) and a coprocessor is provided. The coprocessor interface has an instruction transfer signal group for transferring different instruction types from the CPU to the coprocessor, sequentially or in parallel, a busy signal group, for allowing the coprocessor to signal the CPU that it cannot receive a transfer of one or more of the different instruction types, and an instruction order signal group for indicating to the coprocessor a relative execution order for multiple instructions that are transferred in parallel. In addition, the coprocessor interface includes separate data transfer signal groups for data being transferred from the CPU to the coprocessor, and for data being transferred from the coprocessor to the CPU, along with a data order signal group for indicating a relative order of data (if transferred out-of-order).
    Type: Grant
    Filed: August 20, 2004
    Date of Patent: February 8, 2011
    Assignee: MIPS Technologies, Inc.
    Inventors: Lawrence Henry Hudepohl, Darren Miller Jones, Radhika Thekkath, Franz Treue
  • Publication number: 20110029757
    Abstract: A stream processor includes a programmable main processor MP, and a coprocessor CP that executes an extension instruction, the extension instruction being different from a basic instruction executed by the main processor MP. The main processor MP includes a coprocessor controller CPC outputting the extension instruction to the coprocessor CP, and the coprocessor CP includes a task controller TC, the task controller controlling a task performed based on the extension instruction and outputting status information ST of the task on every clock. The coprocessor controller CPC controls the coprocessor CP based on the status information ST and a basic instruction executed by the main processor MP in background in advance.
    Type: Application
    Filed: June 29, 2010
    Publication date: February 3, 2011
    Applicant: NEC ELECTRONICS CORPORATION
    Inventor: Hiroyuki NAKAJIMA
  • Publication number: 20110023040
    Abstract: A technique for processing instructions in an electronic system is provided. In one embodiment, a processor of the electronic system may submit a unit of work to a queue accessible by a coprocessor, such as a graphics processing unit. The coprocessor may process work from the queue, and write a completion record into a memory accessible by the processor. The electronic system may be configured to switch between a polling mode and an interrupt mode based on progress made by the coprocessor in processing the work. In one embodiment, the processor may switch from an interrupt mode to a polling mode upon completion of a threshold amount of work by the coprocessor. Various additional methods, systems, and computer program products are also provided.
    Type: Application
    Filed: July 24, 2009
    Publication date: January 27, 2011
    Applicant: APPLE INC.
    Inventors: Ian Hendry, Anthony G. Sumpter
  • Patent number: 7877576
    Abstract: When an interruption instruction occurs in an information processing apparatus including a CPU and a coprocessor, execution of a single dedicated instruction “GETACX Dm,Dn” performs saving of necessary data from all registers. “Dm” is a value output from a general register group 104 to a first data input bus 120. Each of calculation units implemented in a coprocessor 110 recognizes a value stored therein. If a value “Dm” specifies one of the calculation units, the specified calculation unit outputs, to a selector 116, data stored in a register included in the specified calculation unit. An implemented calculation unit information output circuit 117 stores therein the count of the calculation units implemented in the coprocessor 110. If a value of the first data input bus 120 is greater than the count of the calculation units, the implemented calculation unit information output circuit 117 outputs a value “1” to a flag register 102.
    Type: Grant
    Filed: February 19, 2008
    Date of Patent: January 25, 2011
    Assignee: Panasonic Corporation
    Inventors: Toru Morikawa, Jiro Miyake, Hiroyuki Mizohata
  • Patent number: 7873785
    Abstract: A processor is provided. The processor includes at least two cores. The at least two cores have a first level cache memory and are multi-threaded. A crossbar is included. A plurality of cache bank memories in communication with the at least two cores through the crossbar is provided. Each of the plurality of cache bank memories communicates with a main memory interface. A plurality of input/output interface modules in communication with the main memory interface and providing a link to the at least two cores are included. The link bypasses the plurality of cache bank memories and the crossbar. Threading hardware configured to enable the at least two cores to switch from a first thread to a second thread in a manner hiding delays caused by cache accesses is included. A server and a method for determining when to switch threads in a multi-core multi-thread environment are included.
    Type: Grant
    Filed: May 26, 2004
    Date of Patent: January 18, 2011
    Assignee: Oracle America, Inc.
    Inventor: Kunle A. Olukotun
  • Patent number: 7865698
    Abstract: A method for decoding, including: obtaining an op-code from a master device; setting a mode to mask a first portion of the bits of the op-code, where the first portion of the bits are for being treated as a wildcard value; and decoding a second portion of the op-code that is not masked to determine whether the op-code is for a slave device. The decoding of the second portion is performed by a controller having a decoder, and the controller bridges the master device for communication with the slave device. The decoding of the first portion of the bits is performed by the slave device. The first portion of the bits identifies an instruction from a group of instructions, and the group of instructions uses a single configuration register of registers of the controller.
    Type: Grant
    Filed: March 27, 2008
    Date of Patent: January 4, 2011
    Assignee: Xilinix, Inc.
    Inventors: Kathryn S. Purcell, Ahmad R. Ansari
  • Patent number: 7865675
    Abstract: A data processing apparatus 2 includes a programmable general purpose processor 10 coupled to a hardware accelerator 12. A memory system 14, 6, 8 is shared by the processor 10 and the hardware accelerator 12. Memory system monitoring circuitry 16 is responsive to one or more predetermined operations performed by the processor 10 upon the memory system 14, 6, 8 to generate a trigger to the hardware accelerator 12 for it to halt its processing operations and clean any data values held as temporary variables within registers 20 of the hardware accelerator back to the memory system 14, 6, 8.
    Type: Grant
    Filed: December 6, 2007
    Date of Patent: January 4, 2011
    Assignee: ARM Limited
    Inventors: Nigel Charles Paver, Stuart David Biles
  • Patent number: 7865697
    Abstract: A mechanism enabling a processor in a multiprocessor complex to function as a coprocessor to execute a specific function. The method includes a mechanism for activating a coprocessor to function as a coprocessor as well as a mechanism to execute a coprocessor request on the system. The present invention also provides a mechanism for efficient processor to processor communication for processors coupled to a common bus. Overall system performance is enhanced by significantly reducing the use of hardware interrupts for processor to processor communication.
    Type: Grant
    Filed: February 27, 2008
    Date of Patent: January 4, 2011
    Assignee: International Business Machines Corporation
    Inventors: Zorik Machulsky, Julian Satran, Leah Shalev, Michael Steven Siegel, Gregory Scott Still, James Xenidis
  • Patent number: 7856545
    Abstract: A co-processor module for accelerating computational performance includes a Field Programmable Gate Array (“FPGA”) and a Programmable Logic Device (“PLD”) coupled to the FPGA and configured to control start-up configuration of the FPGA. A non-volatile memory is coupled to the PLD and configured to store a start-up bitstream for the start-up configuration of the FPGA. A mechanical and electrical interface is for being plugged into a microprocessor socket of a motherboard for direct communication with at least one microprocessor capable of being coupled to the motherboard. After completion of a start-up cycle, the FPGA is configured for direct communication with the at least one microprocessor via a microprocessor bus to which the microprocessor socket is coupled.
    Type: Grant
    Filed: July 27, 2007
    Date of Patent: December 21, 2010
    Assignee: DRC Computer Corporation
    Inventor: Steven Casselman
  • Patent number: 7847800
    Abstract: Disclosed is a system for producing images including emulation techniques. The system provides for emulation of graphics processing resources such that a central processing unit may provide graphics support. Disclosed embodiments include emulation of selected graphics calls as well as emulation of a programmable graphics processor for compatibility with systems having no compatible GPU.
    Type: Grant
    Filed: April 16, 2004
    Date of Patent: December 7, 2010
    Assignee: Apple Inc.
    Inventor: John Harper
  • Patent number: 7844805
    Abstract: A processor for a portable electronic device. The processor includes a RISC (reduced instruction set computing) core a CISC (complex instruction set computing) core, a video accelerator circuit and an audio accelerator circuit. Each of the video and audio accelerator circuits are coupled to both the RISC and CISC cores, with both cores and both accelerator circuit being incorporated into a single integrated circuit. In a first plurality of operational modes, the RISC core is active, while the CISC core is in one of a sleep state or a power off state. In a second plurality of modes, both the RISC and CISC cores are active.
    Type: Grant
    Filed: June 6, 2007
    Date of Patent: November 30, 2010
    Assignee: VIA Technologies, Inc.
    Inventor: Chi Chang
  • Patent number: 7840781
    Abstract: Various approaches for profiling a target system are described. In one approach, a uni-directional, point-to-point bus has a single input port and a single output port. A target processor has a trace port coupled to the input port of the bus and is configured to execute a plurality of instructions one or more times. The target processor provides state data at the trace port and to the input port of the bus. A profile circuit arrangement is coupled to the output port of the first bus, and a memory is coupled to the profile circuit arrangement. The profile circuit arrangement is configured to read data from the output port of the first bus and write the data to the memory.
    Type: Grant
    Filed: April 4, 2007
    Date of Patent: November 23, 2010
    Assignee: Xilinx, Inc.
    Inventors: Sivakumar Velusamy, Navaneethan Sundaramoorthy, Raj Kumar Nagarajan, Satish R. Ganesan
  • Patent number: 7836316
    Abstract: A network device may comprise an auxiliary processor to conserve the power of the network device. The auxiliary processor may modify one or more definition parameters of the programmable processing unit based on determining that the load value of the programmable processing unit is lower than a threshold value. The modifying of the definition parameters may comprise reducing an operating frequency of the programmable processing unit, reducing a number of a micro-programmable units resident on the programmable processing unit, or both.
    Type: Grant
    Filed: March 5, 2007
    Date of Patent: November 16, 2010
    Assignee: Intel Corporation
    Inventors: Udaya Shankara, Veluchamy Dinakaran
  • Patent number: 7834658
    Abstract: Method and apparatus for communication of data is described. More particularly, generation of an interface for coupling to an auxiliary processor unit for communication of data in an integrated circuit is described. Programmable logic is programmed to provide a hardware interface for communicating the data between memory and a user-defined circuit. The data is communicated at least in part via an auxiliary processor unit coupled to the hardware interface. The programming includes configuring the programmable logic to use the auxiliary processor unit to respond to coded instructions executed by a central processing unit through the provided hardware interface.
    Type: Grant
    Filed: April 18, 2006
    Date of Patent: November 16, 2010
    Assignee: Xilinx, Inc.
    Inventors: Stephen A. Neuendorffer, Paul M. Hartke, Paul R. Schumacher
  • Patent number: 7831805
    Abstract: Provides methods, systems and apparatus for coupling a general purpose processor (GPP) to an application specific instruction set processor (ASIP) in such a manner that the GPP can include execute instructions that do not normally comprise part of its instruction set architecture (ISA). The GPP is coupled to the ASIP via a coprocessor port such that instructions issued by the GPP to the port are conveyed to a novel pre-decoder module of the ASIP. The pre-decoder module translates the GPP instruction into operation codes for ASIP instructions to be executed in the ASIP or to an address in the ASIP instruction memory that identifies a start address for a plurality of ASIP instructions defining a complex application specific function. Once the ASIP has executed the instructions it shares the result of the execution with the GPP. In this way, the GPP takes advantage of the ASIP in its ability to more quickly execute an application specific program/procedure.
    Type: Grant
    Filed: October 28, 2007
    Date of Patent: November 9, 2010
    Assignee: International Business Machines Corporation
    Inventors: Andreas C. Doering, Silvio Dragone