Vector Processor Patents (Class 712/2)

Scalar/vector processor interface (Class 712/3)

Distributing of vector data to vector registers (Class 712/4)

Masking to control an access to data in vector register (Class 712/5)

Controlling access to external vector data (Class 712/6)

Vector processor operation (Class 712/7)

Read XF instruction for processing vectors

Patent number: 8862932

Abstract: The described embodiments include a processor that handles faults. The processor first receives a first input vector, a control vector, and a predicate vector, each vector comprising a plurality of elements. For each element in the first input vector for which a corresponding element in the control vector and the predicate vector are active, the processor then performs a read operation using an address from the element of the first input vector. When a fault condition is encountered while performing the read operation, the processor determines if the element is a first element where a corresponding element of the control vector is active. If so, the processor handles/processes the fault. Otherwise, the processor masks the fault for the element.

Type: Grant

Filed: July 18, 2012

Date of Patent: October 14, 2014

Assignee: Apple Inc.

Inventor: Jeffry E. Gonion
COLLAPSING OF MULTIPLE NESTED LOOPS, METHODS AND INSTRUCTIONS

Publication number: 20140189287

Abstract: In an embodiment, the present invention is directed to a processor including a decode logic to receive a multi-dimensional loop counter update instruction and to decode the multi-dimensional loop counter update instruction into at least one decoded instruction, and an execution logic to execute the at least one decoded instruction to update at least one loop counter value of a first operand associated with the multi-dimensional loop counter update instruction by a first amount. Methods to collapse loops using such instructions are also disclosed. Other embodiments are described and claimed.

Type: Application

Filed: December 27, 2012

Publication date: July 3, 2014

Inventors: Mikhail Plotnikov, Andrey Naraikin, Elmoustapha Ould-Ahmed-Vall
INSTRUCTION AND LOGIC TO PROVIDE VECTOR COMPRESS AND ROTATE FUNCTIONALITY

Publication number: 20140122831

Abstract: Instructions and logic provide vector compress and rotate functionality. Some embodiments, responsive to an instruction specifying: a vector source, a mask, a vector destination and destination offset, read the mask, and copy corresponding unmasked vector elements from the vector source to adjacent sequential locations in the vector destination, starting at the vector destination offset location. In some embodiments, the unmasked vector elements from the vector source are copied to adjacent sequential element locations modulo the total number of element locations in the vector destination. In some alternative embodiments, copying stops whenever the vector destination is full, and upon copying an unmasked vector element from the vector source to an adjacent sequential element location in the vector destination, the value of a corresponding field in the mask is changed to a masked value. Alternative embodiments zero elements of the vector destination, in which no element from the vector source is copied.

Type: Application

Filed: October 30, 2012

Publication date: May 1, 2014

Inventors: Tal Uliel, Elmoustapha Ould-Ahmed-Vall, Robert Valentine
Latency tolerant system for executing video processing operations

Patent number: 8687008

Abstract: A latency tolerant system for executing video processing operations. The system includes a host interface for implementing communication between the video processor and a host CPU, a scalar execution unit coupled to the host interface and configured to execute scalar video processing operations, and a vector execution unit coupled to the host interface and configured to execute vector video processing operations. A command FIFO is included for enabling the vector execution unit to operate on a demand driven basis by accessing the memory command FIFO. A memory interface is included for implementing communication between the video processor and a frame buffer memory. A DMA engine is built into the memory interface for implementing DMA transfers between a plurality of different memory locations and for loading the command FIFO with data and instructions for the vector execution unit.

Type: Grant

Filed: November 4, 2005

Date of Patent: April 1, 2014

Assignee: NVIDIA Corporation

Inventors: Ashish Karandikar, Shirish Gadre, Stephen D. Lew
Sharing a fault-status register when processing vector instructions

Patent number: 8683178

Abstract: The described embodiments provide a processor that executes vector instructions. In the described embodiments, the processor initializes an architectural fault-status register (FSR) and a shadow copy of the architectural FSR by setting each of N bit positions in the architectural FSR and the shadow copy of the architectural FSR to a first predetermined value. The processor then executes a first first-faulting or non-faulting (FF/NF) vector instruction. While executing the first vector instruction, the processor also executes one or more subsequent FF/NF instructions. In these embodiments, when executing the first vector instruction and the subsequent vector instructions, the processor updates one or more bit positions in the shadow copy of the architectural FSR to a second predetermined value upon encountering a fault condition.

Type: Grant

Filed: April 20, 2011

Date of Patent: March 25, 2014

Assignee: Apple Inc.

Inventor: Jeffry E. Gonion
Specifications-driven platform for analog, mixed-signal, and radio frequency verification

Patent number: 8682631

Abstract: A design specifications-driven platform (100) for analog, mixed-signal and radio frequency verification with one embodiment comprising a client (160) and server (150) is presented. The server comprises an analog verification database (110), a code and document generator (1020), a design to specifications consistency checker (103), a symbol generator (104), a coverage analyzer (105), a server interface (106), a web server (111), and an analog verification server application (101). The client comprises a web browser (130), generated datasheets and reports (120), generated models, regression tests, netlists, connect modules, and symbols (121), generated simulation scripts (122), a client interface (124), design data (131), simulators (132), and a design data extractor (123).

Type: Grant

Filed: August 31, 2010

Date of Patent: March 25, 2014

Inventors: Henry Chung-herng Chang, Kenneth Scott Kundert
Load/move and duplicate instructions for a processor

Patent number: 8650382

Abstract: A method includes, in a processor, loading/moving a first portion of bits of a source into a first portion of a destination register and duplicate that first portion of bits in a subsequent portion of the destination register.

Type: Grant

Filed: September 14, 2012

Date of Patent: February 11, 2014

Assignee: Intel Corporation

Inventor: Patrice Roussel
System and method for implementing elliptic curve scalar multiplication in cryptography

Patent number: 8649508

Abstract: A system and method for implementing the Elliptic Curve scalar multiplication method in cryptography, where the Double Base Number System is expressed in decreasing order of exponents and further on using it to determine Elliptic curve scalar multiplication over a finite elliptic curve.

Type: Grant

Filed: September 29, 2008

Date of Patent: February 11, 2014

Assignee: Tata Consultancy Services Ltd.

Inventor: Natarajan Vijayarangan
Non-faulting and first faulting instructions for processing vectors

Patent number: 8578209

Abstract: The described embodiments include a processor that handles faults during execution of a vector instruction. The processor starts by receiving a vector instruction that uses at least one vector of values that includes N elements as an input. In addition, the processor optionally receives a predicate vector that includes N elements. The processor then executes the vector instruction. In the described embodiments, when executing the vector instruction, if the predicate vector is received, for each element in the vector of values for which a corresponding element in the predicate vector is active, otherwise, for each element in the vector of values, the processor performs an operation for the vector instruction for the element in the vector of values. While performing the operation, the processor conditionally masks faults encountered (i.e., faults caused by an illegal operation).

Type: Grant

Filed: August 17, 2012

Date of Patent: November 5, 2013

Assignee: Apple Inc.

Inventors: Jeffry E. Gonion, Keith E. Diefendorff
Dynamics control

Patent number: 8554421

Abstract: A vehicle having a dynamics control system, the vehicle comprising: a first set comprising multiple adjustable sub-systems that affect the performance of the vehicle's powertrain; a second set comprising multiple adjustable sub-systems that affect the vehicle's handling; a dynamics user interface including a first input device and a second input device; and a dynamics controller coupled to the user interface and configured to adjust the operation of the sub-systems of the first set in dependence on the first input device and to adjust the operation of the sub-systems of the second set in dependence on the second input device.

Type: Grant

Filed: September 8, 2010

Date of Patent: October 8, 2013

Assignee: McLaren Automotive Limited

Inventors: Antony Sheriff, Mark Vinnels, Richard Felton
OS-mediated launch of OS-independent application

Patent number: 8539200

Abstract: A system, method, and computer readable medium for an operating system (OS) mediated launch of an OS dependent application is disclosed. An application running within an OS may operate outside an OS environment by constructing for example a capsule file, passing the capsule file to firmware interface, and restarting the system. The firmware interface may load various drivers and applications contained within the capsule file and execute them to perform a task. Upon completion of the task, the OS is booted again and the original application may resume control, making use of any information stored by the firmware interface in a dedicated status table or file. Other embodiments may be employed, and other embodiments are described and claimed.

Type: Grant

Filed: April 23, 2008

Date of Patent: September 17, 2013

Assignee: Intel Corporation

Inventors: David H. Huang, Xin Li, Ruth Li, Vincent J. Zimmer
VECTOR PROCESSING APPARATUS AND VECTOR PROCESSING METHOD

Publication number: 20130232317

Abstract: A vector processing apparatus includes a storage pointer generation unit and an instruction execution unit including a plurality of vector pipeline units. The storage pointer generation unit receives the vector instruction and range information thereof and generates the storage pointer value. When receiving a succeeding vector instruction being able to be processed in parallel together with a preceding vector instruction, the storage pointer generation unit updates the storage pointer value based on the range information so as to input each element of the succeeding vector instruction into a vector pipeline unit that is unused by the preceding vector instruction, and the instruction execution unit processes in parallel the preceding vector instruction and the succeeding vector instruction according to the storage pointer value.

Type: Application

Filed: March 1, 2013

Publication date: September 5, 2013

Inventor: Masao YASUDA
PROCESSING VECTORS USING A WRAPPING ROTATE PREVIOUS INSTRUCTION IN THE MACROSCALAR ARCHITECTURE

Publication number: 20130024651

Abstract: Embodiments of a system and a method in which a processor may execute instructions that cause the processor to receive an operand vector, a selection vector, and a control vector are disclosed. The executed instructions may also cause the processor to perform a wrapping rotate previous operation dependent upon the input vectors.

Type: Application

Filed: September 28, 2012

Publication date: January 24, 2013

Applicant: APPLE INC.

Inventor: Apple Inc.
Break, pre-break, and remaining instructions for processing vectors

Patent number: 8356159

Abstract: The described embodiments provide a system that sets elements in a result vector based on an input vector. During operation, the system determines a location of a key element within the input vector. Next, the system generates a result vector. When generating the result vector, the system sets one or more elements of the result vector based on the location of the key element in the input vector.

Type: Grant

Filed: April 7, 2009

Date of Patent: January 15, 2013

Assignee: Apple Inc.

Inventors: Jeffry E. Gonion, Keith E. Diefendorff, Jr.
Non-faulting and first-faulting instructions for processing vectors

Patent number: 8271832

Abstract: The described embodiments include a processor that handles faults during execution of a vector instruction. The processor starts by receiving a vector instruction that uses at least one vector of values that includes N elements as an input. In addition, the processor optionally receives a predicate vector that includes N elements. The processor then executes the vector instruction. In the described embodiments, when executing the vector instruction, if the predicate vector is received, for each element in the vector of values for which a corresponding element in the predicate vector is active, otherwise, for each element in the vector of values, the processor performs an operation for the vector instruction for the element in the vector of values. While performing the operation, the processor conditionally masks faults encountered (i.e., faults caused by an illegal operation).

Type: Grant

Filed: August 31, 2010

Date of Patent: September 18, 2012

Assignee: Apple Inc.

Inventors: Jeffry E. Gonion, Keith E. Diefendorff
Methods, apparatus and systems to improve security in computer systems

Patent number: 8261085

Abstract: According to some implementations methods, apparatus and systems are provided involving the use of processors having at least one core with a security component, the security component adapted to read and verify data within data blocks stored in a L1 instruction cache memory and to allow the execution of data block instructions in the core only upon the instructions being verified by the use of a cryptographic algorithm.

Type: Grant

Filed: September 26, 2011

Date of Patent: September 4, 2012

Assignee: Media Patents, S.L.

Inventor: Álvaro Fernández Gutiérrez
Digital media processor

Patent number: 8253750

Abstract: Circuits, methods, and apparatus that provide highly integrated digital media processors for digital consumer electronics applications. These digital media processors are capable of performing the parallel processing of multiple format audio, video, and graphics signals. In one embodiment, audio and video signals may be received from a variety of input devices or appliances, such as antennas, VCRs, DVDs, and networked devices such as camcorders and modems, while output audio and video signals may be provided to output devices such as televisions, monitors, and networked devices such as printers and networked video recorders. Another embodiment of the present invention interfaces with a variety of devices such as navigation, entertainment, safety, memory, and networking devices. This embodiment can also be configured for use in a digital TV, set-top box, or home server. In this configuration, video and audio streams may be received from a number of cable, satellite, Internet, and consumer devices.

Type: Grant

Filed: July 8, 2010

Date of Patent: August 28, 2012

Assignee: NVIDIA Corporation

Inventors: Jen-Hsun Huang, Gerrit A. Slavenburg, Stephen D. Lew, John C. Schafer, Thomas F. Fox, Taner E. Ozcelik
System, method, and computer program product for converting a scan algorithm to a segmented scan algorithm in an operator-independent manner

Patent number: 8243083

Abstract: A system, method, and computer program product are provided for converting a scan algorithm to a segmented scan algorithm in an operator independent manner. In operation, a scan algorithm and a limit index data structure are identified. Utilizing the limit index data structure, the scan algorithm is converted to a segmented scan algorithm in an operator-independent manner. Additionally, the segmented scan algorithm is performed to produce an output.

Type: Grant

Filed: December 11, 2008

Date of Patent: August 14, 2012

Assignee: NVIDIA Corporation

Inventors: Michael J. Garland, Shubhabrata Sengupta
Apparatus interoperable with backward compatible optical USB device

Patent number: 8234416

Abstract: An apparatus configured to couple to a universal serial bus (USB) 3.0 connector. The apparatus includes a management controller configured to couple to the USB 3.0 connector. The management controller is configured to detect from behavior on the D+ and D? pins of the USB 3.0 connector whether a device plugged into the USB 3.0 connector is a conventional USB 3.0 device or an optical USB device, where the optical USB device employs one of the D+ and D? pins of the USB 3.0 connector to transmit a data signal and the other to transmit a clock signal.

Type: Grant

Filed: June 18, 2010

Date of Patent: July 31, 2012

Assignee: VIA Technologies, Inc.

Inventor: Jiin Lai
Method and apparatus for executing program code

Patent number: 8209525

Abstract: The described embodiments provide a system that executes program code. While executing program code, the processor encounters at least one vector instruction and at least one vector-control instruction. The vector instruction includes a set of elements, wherein each element is used to perform an operation for a corresponding iteration of a loop in the program code. The vector-control instruction identifies elements in the vector instruction that may be operated on in parallel without causing an error due to a runtime data dependency between the iterations of the loop. The processor then executes the loop by repeatedly executing the vector-control instruction to identify a next group of elements that can be operated on in the vector instruction and selectively executing the vector instruction to perform the operation for the next group of elements in the vector instruction, until the operation has been performed for all elements of the vector instruction.

Type: Grant

Filed: April 7, 2009

Date of Patent: June 26, 2012

Assignee: Apple Inc.

Inventors: Jeffry E. Gonion, Keith E. Diefendorff, Jr.
Graphics processing method and apparatus implementing window system

Patent number: 8203567

Abstract: A graphics processing method and apparatus described herein is capable of converting graphics processing of a window system into a vector-based application program interface (API) format usable in the GPU and performing the converted graphics processing in the GPU. For example, the vector-based API may be based on an OpenVG standard or an EGL standard.

Type: Grant

Filed: July 3, 2009

Date of Patent: June 19, 2012

Assignee: Samsung Electronics Co., Ltd.

Inventors: Dong-kyun Jeong, Soo-chan Lim, Na-min Kim
Scalar precision float implementation on the “W” lane of vector unit

Patent number: 8169439

Abstract: Embodiments of the invention are generally related to image processing, and more specifically to vector units for supporting image processing. A combined vector/scalar unit is provided wherein one or more processing lanes of the vector unit are used for performing scalar operations. An integrated register file is also provided for storing vector and scalar data. Therefore, the transfer of data to memory to exchange data between independent vector and scalar units is obviated and a significant amount of chip area is saved.

Type: Grant

Filed: October 23, 2007

Date of Patent: May 1, 2012

Assignee: International Business Machines Corporation

Inventors: David Arnold Luick, Eric Oliver Mejdrich, Adam James Muff
Vector computer system with cache memory and operation method thereof

Patent number: 8151058

Abstract: A vector computer system includes a vector processor configured to issue a vector store instruction which includes a plurality of store requests; a cache memory of a write back system provided between the vector processor and a main memory; and a write allocate determining section configured to generate an allocation control signal which specifies whether the cache memory operates based on a write allocate system or a non-write allocate system. When the vector processor issues the vector store instruction, the write allocate determining section generates the allocation control signal to each of the plurality of store requests based on a write pattern as a pattern of target addresses of the plurality of store requests. The cache memory executes each store request based on one of the write allocate system and the non-write allocate system which is specified based on the allocation control signal.

Type: Grant

Filed: October 5, 2009

Date of Patent: April 3, 2012

Assignee: NEC Corporation

Inventor: Koji Kobayashi
Executing a gather operation on a parallel computer

Patent number: 8140826

Abstract: Methods, apparatus, and computer program products are disclosed for executing a gather operation on a parallel computer according to embodiments of the present invention. Embodiments include configuring, by the logical root, a result buffer or the logical root, the result buffer having positions, each position corresponding to a ranked node in the operational group and for storing contribution data gathered from that ranked node.

Type: Grant

Filed: May 29, 2007

Date of Patent: March 20, 2012

Assignee: International Business Machines Corporation

Inventors: Charles J. Archer, Joseph D. Ratterman
SYSTEMS AND METHODS FOR PROCESSING COMMUNICATIONS SIGNALS fUSING PARALLEL PROCESSING

Publication number: 20110302390

Abstract: Systems and methods for performing processing of communications signals on multi-processor architectures. The system consists of a digital interface that translate numbers that represent a waveform in some format to analog signals for use in transmission and translating analog signals to numbers representing those waveforms in some format that can be processed by the commodity digital hardware and software combination. The digital hardware and software incorporates parallel hardware and software that can process single or multiple streams and multiple processing steps as required for the communications system in any combination. In the examples, the use of general purpose graphics processing units (GPGPUs) is illustrated, but the system is not necessarily limited to such an implementation. The system is highly scalable and modular for addressing a wide range of radio requirements, preferably using commodity components.

Type: Application

Filed: June 5, 2010

Publication date: December 8, 2011

Inventors: Greg Copeland, Shehrzad Qureshi
Initialisation of a pipelined processor

Patent number: 8055888

Abstract: A data processing apparatus is disclosed that comprises a pipelined processor, said pipelined processor comprising a processing pipeline for processing instructions in a plurality of stages, at least some of said plurality of stages each comprising storage elements for storing an instruction or decoded instruction being processed in said stage, said storage elements in at least one of said stages comprising settable elements, each of said settable elements being adapted to store a predetermined value in response to a wake up event, said settable elements being arranged such that in response to said wake up event said values stored in said settable elements form an instruction or decoded instruction.

Type: Grant

Filed: February 28, 2008

Date of Patent: November 8, 2011

Assignee: ARM Limited

Inventors: Chiloda Ashan Senerath Pathirane, David Michael Gilday
Motherboard device and computing apparatus capable of selectively controlling electric power supplied to one of two graphics cards

Patent number: 8051305

Abstract: A motherboard device includes a first connecting interface coupled to a first graphics card, a second connecting interface coupled to a second graphics card, a power source connected electrically to the first connecting interface for supplying electric power to the first graphics card via the first connecting interface, and a switch unit interconnecting electrically the power source and the second connecting interface, and operable so as to switch between an ON-state, where the power source supplies electric power to the second graphics card via the second connecting interface, and an OFF-state, where the electric power from the power source is not supplied to the second graphics card.

Type: Grant

Filed: June 4, 2008

Date of Patent: November 1, 2011

Assignees: Micro-Star International Co., Ltd., MSI Electronic (Kun Shan) Co., Ltd.

Inventor: Wen-Jie Zhu
Method of operation for parallel LCP solver

Patent number: 7937359

Abstract: A method of operating a Linear Complementarity Problem (LCP) solver is disclosed, where the LCP solver is characterized by multiple execution units operating in parallel to implement a competent computational method adapted to resolve physics-based LCPs in real-time.

Type: Grant

Filed: April 27, 2009

Date of Patent: May 3, 2011

Assignee: NVIDIA Corporation

Inventors: Lihua Zhang, Richard Tonge, Dilip Sequeira, Monier Maher
PXE booting a storage processor from a peer storage processor

Patent number: 7930533

Abstract: A system for pre-execution environment (PXE) booting a storage processor from a peer storage processor allows for the ability to reboot and/or restart the storage processor without an externally connected PXE server. In response to a reboot request of the storage processor, the peer storage processor pushes an operating system boot image and/or other information to the storage processor for PXE booting the storage processor, and vice versa. The system may also operate with multiple coupled computers.

Type: Grant

Filed: September 26, 2007

Date of Patent: April 19, 2011

Assignee: EMC Corporation

Inventors: Ying Guo, Qing Liu, Kevin Richards
Multiprocessor Computer System and Method Having at Least One Processor with a Dynamically Reconfigurable Instruction Set

Publication number: 20110055516

Abstract: An innovative realization of computer hardware, software and firmware comprising a multiprocessor system wherein at least one processor can be configured to have a fixed instruction set and one or more processors can be statically or dynamically configured to implement a plurality of processor states in a plurality of technologies. The processor states may be instructions sets for the processors. The technologies may include programmable logic arrays.

Type: Application

Filed: August 20, 2010

Publication date: March 3, 2011

Applicant: FTL Systems Technology Corporation

Inventor: John C. Willis
Method and system for efficient matrix multiplication in a SIMD processor architecture

Patent number: 7873812

Abstract: The new system provides for efficient implementation of matrix multiplication in a SIMD processor. The new system provides ability to map any element of a source vector register to be paired with any element of a second source vector register for vector operations, and specifically vector multiply and vector-multiply-accumulate operations to implement a variety of matrix multiplications without the additional permute or data re-ordering instructions. Operations such as DCT and Color-space transformations for video processing could be very efficiently implemented using this system.

Type: Grant

Filed: April 5, 2004

Date of Patent: January 18, 2011

Inventor: Tibet Mimar
Aligning precision converted vector data using mask indicating offset relative to element boundary corresponding to precision type

Patent number: 7865693

Abstract: Mechanisms for aligning enhanced precision vectors based on reduced precision data values are provided. At least one data value, having a first precision type, is received for storing in a vector register. The vector register stores data as a vector having a plurality of vector elements. The first precision type is modified to have a second precision type different in precision than the first precision type to thereby generate at least one modified data value. The at least one modified data value is stored in at least one vector element of the plurality of vector elements. An alignment of the at least one modified data value is determined relative to a boundary of a vector element of the vector register. An alignment operation to re-align the at least one modified data value based on the boundary of the vector element of the vector register is performed.

Type: Grant

Filed: October 14, 2008

Date of Patent: January 4, 2011

Assignee: International Business Machines Corporation

Inventors: Alexandre E. Eichenberger, Bruce M. Fleischer, Michael K. Gschwind
Compilation for a SIMD RISC processor

Patent number: 7840954

Abstract: A computer implemented method, data processing system, and computer usable code are provided for generating code to perform scalar computations on a Single-Instruction Multiple-Data (SIMD) Reduced Instruction Set Computer (RISC) architecture. The illustrative embodiments generate code directed at loading at least one scalar value and generate code using at least one vector operation to generate a scalar result, wherein all scalar computation for integer and floating point data is performed in a SIMD vector execution unit.

Type: Grant

Filed: November 29, 2005

Date of Patent: November 23, 2010

Assignee: International Business Machines Corporation

Inventor: Michael Karl Gschwind
USE OF VECTORIZATION INSTRUCTION SETS

Publication number: 20100293534

Abstract: In one embodiment, the invention is a method and apparatus for use of vectorization instruction sets. One embodiment of a method for generating vector instructions includes receiving source code written in a high-level programming language, wherein the source code includes at least one high-level instruction that performs multiple operations on a plurality of vector operands, and compiling the high-level instruction(s) into one or more low-level instructions, wherein the low-level instructions are in an instruction set of a specific computer architecture.

Type: Application

Filed: May 15, 2009

Publication date: November 18, 2010

Inventors: HENRIQUE ANDRADE, Bugra Gedik, Hua Yong Wang, Kun-Lung Wu
INTERLEAVED MULTI-THREADED VECTOR PROCESSOR

Publication number: 20100281234

Abstract: A method includes providing a processor configured to execute instructions. The method may further include providing a first set of registers in the processor to store first data and first instructions associated with a first thread, and providing a second set of registers in the processor to store second data and second instructions associated with a second thread. The method may further include transmitting the first data and first instructions associated with the first thread to the first set of registers, and executing the first instructions in order to process the first data. The method may further include transmitting the second data and second instructions to the second set of registers while executing the first instructions and processing the first data. A corresponding apparatus is also disclosed and claimed herein.

Type: Application

Filed: April 30, 2009

Publication date: November 4, 2010

Applicant: Novafora, Inc.

Inventors: MUHAMMAD AHMED, Marc Schaub, Shlomo Selim Rakib
Arrangement, system and method for vector permutation in single-instruction multiple-data mircoprocessors

Patent number: 7809931

Abstract: A vector permutation system (100) for a single-instruction multiple-data microprocessor has a set of vector registers (110) which feed vectors to permutation logic (120) and then to a negate block (130) where they are permuted and selectively negated according to control parameters received from a selected one of a set of control registers (140). A control arrangement (145, 150) selects which control register is to provide the control parameters. In this way no separate permutation instructions are necessary or need to be executed, and no permutation parameters need to be stored in the vector registers (10). This leads to higher performance, a smaller vector registers file and hence a smaller size of the microprocessor and better program code density.

Type: Grant

Filed: October 6, 2003

Date of Patent: October 5, 2010

Assignee: Freescale Semiconductor, Inc.

Inventor: Martin Raubuch
Power management method and system based on data unit running cycle and performance requirement

Patent number: 7793133

Abstract: Power management methods and systems. First, a running cycle of a processing unit processing a data unit is recorded. A gating signal is generated according to the running cycle and a performance requirement, and a working clock is adjusted according to the gating signal. Thereafter, the adjusted working signal is provided to the processing unit.

Type: Grant

Filed: June 6, 2007

Date of Patent: September 7, 2010

Assignee: Via Technologies, Inc.

Inventor: Ko-Fang Wang
Data processor and methods thereof

Patent number: 7788471

Abstract: A system and method for performing vector arithmetic is disclosed. The method includes loading two operand vectors, each composed of a number of vector elements, into two storage locations. A selected arithmetic operation is performed on the operand vectors to produce a result vector having the number of vector elements. Each vector element of the result vector is associated with an arithmetic logic cell that has a first input that can receive any vector element from the first vector and a second input that can receive any vector element from the second vector. Accordingly each vector element of the result vector is a function of any two individual vector elements of the operand vectors. By applying the operand vector elements to the appropriate arithmetic logic cells, and by selecting the appropriate arithmetic operation, complex vector operations can be performed efficiently.

Type: Grant

Filed: September 18, 2006

Date of Patent: August 31, 2010

Assignee: Freescale Semiconductor, Inc.

Inventor: Chengke Sheng
Scalable parallel pipeline floating-point unit for vector processing

Patent number: 7765386

Abstract: An embodiment of the present invention is a technique to perform floating-point operations for vector processing. An input queue captures a plurality of vector inputs. A scheduler dispatches the vector inputs. A plurality of floating-point (FP) pipelines generates FP results from operating on scalar components of the vector inputs dispatched from the scheduler. An arbiter and assembly unit arbitrates use of output section and assembles the FP results to write to the output section.

Type: Grant

Filed: September 28, 2005

Date of Patent: July 27, 2010

Assignee: Intel Corporation

Inventors: David D. Donofrio, Michael Dwyer
Cipher message assist instruction

Patent number: 7720220

Abstract: A method, system and program product for executing a cipher message assist instruction in a computer system by specifying, via the cipher message assist instruction, either a capability query installed function or execution of a selected function of one or more optional functions, wherein the selected function is an installed optional function, wherein the capability query determines which optional functions of the one or more optional functions are installed on the computer system.

Type: Grant

Filed: February 27, 2009

Date of Patent: May 18, 2010

Assignee: International Business Machines Corporation

Inventors: Shawn D. Lundvall, Ronald M. Smith, Sr., Phil Chi-Chung Yeh
Dynamically Aligning Enhanced Precision Vectors Based on Addresses Corresponding to Reduced Precision Vectors

Publication number: 20100095086

Abstract: Mechanisms for aligning enhanced precision vectors based on reduced precision data values are provided. At least one data value, having a first precision type, is received for storing in a vector register. The vector register stores data as a vector having a plurality of vector elements. The first precision type is modified to have a second precision type different in precision than the first precision type to thereby generate at least one modified data value. The at least one modified data value is stored in at least one vector element of the plurality of vector elements. An alignment of the at least one modified data value is determined relative to a boundary of a vector element of the vector register. An alignment operation to re-align the at least one modified data value based on the boundary of the vector element of the vector register is performed.

Type: Application

Filed: October 14, 2008

Publication date: April 15, 2010

Applicant: International Business Machines Corporation

Inventors: Alexandre E. Eichenberger, Bruce M. Fleischer, Michael K. Gschwind
VECTOR COMPUTER SYSTEM WITH CACHE MEMORY AND OPERATION METHOD THEREOF

Publication number: 20100088473

Abstract: A vector computer system includes a vector processor configured to issue a vector store instruction which includes a plurality of store requests; a cache memory of a write back system provided between the vector processor and a main memory; and a write allocate determining section configured to generate an allocation control signal which specifies whether the cache memory operates based on a write allocate system or a non-write allocate system. When the vector processor issues the vector store instruction, the write allocate determining section generates the allocation control signal to each of the plurality of store requests based on a write pattern as a pattern of target addresses of the plurality of store requests. The cache memory executes each store request based on one of the write allocate system and the non-write allocate system which is specified based on the allocation control signal.

Type: Application

Filed: October 5, 2009

Publication date: April 8, 2010

Inventor: Koji KOBAYASHI
System and method of processing data using scalar/vector instructions

Patent number: 7676647

Abstract: A processor device is disclosed that includes a register file with a combined condition code register for scalar and vector operations. The processor device utilizes the combined condition code register for scalar and vector operations. Further, a compare operation can store resulting bits in the combined condition code register and a conditional operation can utilize the combined condition code register bits for evaluating a condition.

Type: Grant

Filed: August 18, 2006

Date of Patent: March 9, 2010

Assignee: QUALCOMM Incorporated

Inventors: Lucian Codrescu, Erich Plondke, Taylor Simpson
INCREMENT-PROPAGATE AND DECREMENT-PROPAGATE INSTRUCTIONS FOR PROCESSING VECTORS

Publication number: 20100042807

Abstract: The described embodiments provide a processor for generating a result vector with incremented or decremented values from an input vector. During operation, the processor receives an input vector and a control vector. The processor then copies a value contained in a selected element of the input vector. The processor next generates the result vector, which involves writing an incremented or decremented value to the result vector, depending on the value of the control vector and the embodiment. In addition, a predicate vector can be used to control the values that are written to the result vector.

Type: Application

Filed: June 30, 2009

Publication date: February 18, 2010

Applicant: APPLE INC.

Inventors: Jeffry E. Gonion, Keith E. Diefendorff, JR.
Method and apparatus for self-healing symmetric multi-processor system interconnects

Patent number: 7661006

Abstract: A computer implemented method, apparatus, and computer program product for managing symmetric multiprocessor interconnects. The process identifies functional communication connections between each processor in a plurality of processors on a multiprocessor to form identified functional communication connections. The process maps every functional communication connection between any two processors in the plurality of processors, based on the identified functional communication connections, to form an interconnect matrix. The process creates a path map using the interconnect matrix. The path map comprises a sequence of communication connections between the plurality of processors. The process initializes the plurality of processors using the path map.

Type: Grant

Filed: January 9, 2007

Date of Patent: February 9, 2010

Assignee: International Business Machines Corporation

Inventors: Luai A. Abou-Emara, Mark David McLaughlin, Jorge N. Yanez
Extensible memory architecture and communication protocol for supporting multiple devices in low-bandwidth, asynchronous applications

Patent number: 7640155

Abstract: A target interface system for interfacing selected components of a communication system and methods for manufacturing and using same. The target interface system includes target interface logic that is distributed among a plurality of reconfigurable logic devices. Being coupled via a serial link, the reconfigurable logic devices each have an input connection for receiving incoming data packets and an output connection for providing outgoing data packets. The serial link couples the input and output connections of successive reconfigurable logic devices to form a dataring structure for distributing the data packets among the reconfigurable logic devices. Thereby, the dataring structure maintains data synchronization among the reconfigurable logic devices such that the distribution of the target interface logic among the reconfigurable logic devices is transparent to software.

Type: Grant

Filed: May 31, 2005

Date of Patent: December 29, 2009

Assignee: QuickTurn Design Systems, Inc.

Inventors: Mitchell G. Poplack, John A. Maher
Remote sensor processing system and method

Patent number: 7619541

Abstract: A sensor assembly includes a sensor operable to sense a physical parameter and generate an electrical signal responsive to the sensed physical parameter. Local processing circuitry is physically positioned proximate the sensor and is electrically coupled to the sensor. The local processing circuitry includes an output port adapted to be coupled to a communications channel and the local processing circuitry is operable to process data from the sensor to generate processed sensor data and to provide the processed data on the output port.

Type: Grant

Filed: October 3, 2005

Date of Patent: November 17, 2009

Assignee: Lockheed Martin Corporation

Inventors: Kenneth R. Schulz, Andrew Hamm, John Rapp
Method and apparatus for controlling heat generation in a multi-core processor

Patent number: 7617403

Abstract: The disclosed methodology and apparatus may reduce heat generation in a multi-core processor. In one embodiment, a multi-core processor cycles selected processor cores off in a predetermined pattern across the processor die over time to reduce the average heat generation by the processor. The disclosed multi-core processor may reduce or avoid undesirable hot spots that impact processor life.

Type: Grant

Filed: July 26, 2006

Date of Patent: November 10, 2009

Assignee: International Business Machines Corporation

Inventors: Louis Bennie Capps, Jr., Warren D. Dyckman, Michael Jay Shapiro
Automatic generation of streaming data interface circuit

Patent number: 7603492

Abstract: A streaming data interface device (700) of a streaming processing system (200) is automatically generated by selecting a set of circuit parameters (610) consistent with a set of circuit constraints and generating (612, 614) a representation of a candidate memory interface device based upon a set of stream descriptors. The candidate streaming data interface device is evaluated (616) with respect to one or more quality metrics and the representation of the candidate streaming processor circuit is output (622) if the candidate memory interface device satisfies a set of processing system constraints and is better in at least one of the one or more quality metrics than other candidate memory interface devices.

Type: Grant

Filed: September 20, 2005

Date of Patent: October 13, 2009

Assignee: Motorola, Inc.

Inventors: Sek M. Chai, Nikos Bellas, Malcolm R. Dwyer, Erica M. Lau, Zhiyuan Li, Daniel A. Linzmeier
Method and system for parallel vector data processing of vector data having a number of data elements including a defined first bit-length

Patent number: 7600104

Abstract: System and method are provided for parallel vector data processing having a data processor capable of vector data having a defined first bit-length. In one embodiment, at least one of first and second operand registers is used for storing operands, and an additional data storage element is used to have a size to store a number of bits corresponding to the first bit-length, and the storage element is segmented into a defined number of segments. An instruction set storage unit is used for storing a set of instructions for the data processor to process a set of data in parallel by use of the additional storage element.

Type: Grant

Filed: August 15, 2006

Date of Patent: October 6, 2009

Inventor: Peter Neumann

prev 1 2 3 4 next