Operation Patents (Class 712/30)

Master/slave (Class 712/31)

CORE AFFINITY BITMASK TRANSLATION

Publication number: 20140344550

Abstract: Technologies are generally described for systems methods, and devices related to core affinity bitmask translation. An example system may include first, second, third and fourth cores, and a dispatcher. The dispatcher may be configured to receive a first request where the first request include a core affinity bitmask and instructions. The core affinity bitmask can identify at least the first core and the second core. The dispatcher may be configured to determine a first affinity between the first core and the second core. The dispatcher may then identify the third core and the fourth core as having similar affinity to achieve a substantially similar performance. The dispatcher may also be configured to generate a second request that includes a translated core affinity bitmask. The translated core affinity bitmask may be effective to identify the third core and the fourth core as appropriate cores to execute the instructions.

Type: Application

Filed: May 15, 2013

Publication date: November 20, 2014

Inventor: Ezekiel Kruglick
INTEGRATED CIRCUIT DEVICE, ASYMMETRIC MULTI-CORE PROCESSING MODULE, ELECTRONIC DEVICE AND METHOD OF MANAGING EXECUTION OF COMPUTER PROGRAM CODE THEREFOR

Publication number: 20140325183

Abstract: An asymmetric multi-core processing module is described. The asymmetric multi-core processing module comprises at least one processing core of a first type, at least one processing core of at least one further type, and at least one core identifier configuration component. The at least one core identifier configuration component is arranged to enable dynamic configuration of a value of a core identifier of at least one of the processing cores of the first and at least one further types.

Type: Application

Filed: November 28, 2011

Publication date: October 30, 2014

Applicant: FREESCALE SEMICONDUCTOR, INC.

Inventors: Anton Rozen, Dan Kuzmin, Michael Priel, Leonid Smolyansky
System and Method for Creating Highly Scalable High Availability Cluster in a Massively Parallel Processing Cluster of Machines in a Network

Publication number: 20140325182

Abstract: System and method embodiments are provided to implement highly scalable and high availability (HA) clusters in massively parallel processing (MPP) systems. The embodiments include a method to build a highly scalable MPP HA cluster, which provides HA to the cluster while allowing it to scale to relatively larger number of nodes. An embodiment apparatus includes a plurality of data processing nodes distributed in a plurality of corresponding sub-clusters and configured to exchange heart-beat messages between each other within limit of each of the corresponding sub-clusters to maintain sub-cluster membership integrity and detect failures in the corresponding sub-clusters. The sub-clusters are arranged in a fan-out tree hierarchy and configured to prevent heart-beat messaging between each other.

Type: Application

Filed: April 26, 2013

Publication date: October 30, 2014

Applicant: FutureWei Technologies, Inc.

Inventors: Gangavara Prasad Varakur, Anil Chillarige
Thread synchronization in a multi-thread, multi-flow network communications processor architecture

Patent number: 8874878

Abstract: Described embodiments provide a packet classifier for a network processor that generates tasks corresponding to each received packet. The packet classifier includes a scheduler to generate contexts corresponding to tasks received by the packet classifier from processing modules of the network processor. The packet classifier processes threads of instructions, each thread of instructions corresponding to a context received from the scheduler, and each thread associated with a data flow. A thread status table has N entries to track up to N active threads. Each status entry includes a valid status indicator, a sequence value, a thread indicator and a flow indicator. A sequence counter generates a sequence value for each data flow of each thread and is incremented when processing of a thread is started, and is decremented when a thread is completed. Instructions are processed in the order in which the threads were started for each data flow.

Type: Grant

Filed: November 28, 2012

Date of Patent: October 28, 2014

Assignee: LSI Corporation

Inventors: Deepak Mital, James Clee, Jerry Pirog
Task queuing in a multi-flow network processor architecture

Patent number: 8873550

Abstract: Described embodiments generate tasks corresponding to each packet received by a network processor. A destination processing module receives a task and determines, based on the task size, a queue in which to store the task, and whether the task is larger than space available within a current memory block of the queue. If the task is larger, an address of a next memory block in a memory is determined, and the address is provided to a source processing module of the task. The source processing module writes the task to the memory based on a provided offset address and the address of the next memory block, if provided. If a task is written to more than one memory block, the destination processing module preloads the address of the next memory block to a local memory to process queued tasks without stalling to retrieve the address of the next memory block.

Type: Grant

Filed: November 28, 2012

Date of Patent: October 28, 2014

Assignee: LSI Corporation

Inventors: Deepak Mital, William Burroughs, Michael R. Betker, Joseph R. Hasting
MULTI-CORE PROCESSOR

Publication number: 20140317380

Abstract: A multi-core processor includes a plurality of former-stage cores that perform parallel processing using a plurality of pipelines covering a plurality of stages. In the pipelines, the former-stage cores perform stages ending with an instruction decode stage; stages starting with an instruction execution stage are executed by a latter-stage core. A dynamic load distribution block refers to decode results in the instruction decode stage and controls to assign the latter-stage core with a latter-stage-needed decode result being a decode result whose processing needs to be executed in the latter-stage core.

Type: Application

Filed: April 4, 2014

Publication date: October 23, 2014

Applicant: DENSO CORPORATION

Inventors: Hirofumi YAMAMOTO, Takeshi KONDO, Shinichirou TAGUCHI, Takatoshi NOMURA, Daihan WANG, Tomoyoshi FUNAZAKI, Yukoh MATSUMOTO
INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD

Publication number: 20140304490

Abstract: An application process is switched between asymmetric processor cores having no compatibility in instruction set architectures so that the process can be continuously executed. In an information processing device, when a request to switch an execution subject is generated while a first processor core is executing an application program, a switch process code makes the first processor core specify a basic block being executed at present. The switch process code makes the first processor core execute a first execution code until a branch instruction at the end of the specified basic block, and makes a second processor core execute a second execution code from an instruction at the head of a basic block to be executed next to the specified basic block.

Type: Application

Filed: April 2, 2014

Publication date: October 9, 2014

Applicant: Renesas Electronics Corporation

Inventors: Masakatsu TOYAMA, Masanori HAYASHIKOSHI
EXECUTION CONTEXT SWAP BETWEEN HETEROGENOUS FUNCTIONAL HARDWARE UNITS

Publication number: 20140281380

Abstract: Remapping technologies for execution context swap between heterogeneous functional hardware units are described. A computing system includes multiple registers configured to store remote contexts of functional units. A mapping table maps the remote context to the functional units. An execution unit is configured to execute a remapping tool that intercepts an operation to access a remote context of a first functional unit of the plurality of functional units that is taken offline. The remapping tool determines that the first functional unit is remapped to a second functional unit using the mapping table. The operation is performed to access the remote context that is remapped to the second functional unit. The first functional unit and the second functional unit may be heterogeneous functional units.

Type: Application

Filed: March 12, 2013

Publication date: September 18, 2014

Inventors: Inder M. Sodhi, Marc Torrant, Zeev Offen, Michael Mishaeli, Ashish V. Choubal, Jason W. Brandt
Direct scatter loading of executable software image from a primary processor to one or more secondary processor in a multi-processor system

Patent number: 8838949

Abstract: In a multi-processor system, an executable software image including an image header and a segmented data image is scatter loaded from a first processor to a second processor. The image header contains the target locations for the data image segments to be scatter loaded into memory of the second processor. Once the image header has been processed, the data segments may be directly loaded into the memory of the second processor without further CPU involvement from the second processor.

Type: Grant

Filed: March 21, 2011

Date of Patent: September 16, 2014

Assignee: QUALCOMM Incorporated

Inventors: Nitin Gupta, Daniel H. Kim, Igor Malamant, Steve Haehnichen
Characterization and validation of processor links

Patent number: 8832513

Abstract: A processor link that couples a first processor and a second processor is selected for validation and a plurality of communication parameter settings associated with the first and the second processors is identified. The first and the second processors are successively configured with each of the communication parameter settings. One or more test data pattern(s) are provided from the first processor to the second processor in accordance with the communication parameter setting. Performance measurements associated with the selected processor link and with the communication parameter setting are determined based, at least in part, on the test data pattern as received at the second processor. One of the communication parameter settings that is associated with the highest performance measurements is selected. The selected communication parameter setting is applied to the first and the second processors for subsequent communication between the first and the second processors via the processor link.

Type: Grant

Filed: November 20, 2013

Date of Patent: September 9, 2014

Assignee: International Business Machines Corporation

Inventors: Robert W. Berry, Anand Haridass, Prasanna Jayaraman
Characterization and validation of processor links

Patent number: 8826092

Abstract: A processor link that couples a first processor and a second processor is selected for validation and a plurality of communication parameter settings associated with the first and the second processors is identified. The first and the second processors are successively configured with each of the communication parameter settings. One or more test data pattern(s) are provided from the first processor to the second processor in accordance with the communication parameter setting. Performance measurements associated with the selected processor link and with the communication parameter setting are determined based, at least in part, on the test data pattern as received at the second processor. One of the communication parameter settings that is associated with the highest performance measurements is selected. The selected communication parameter setting is applied to the first and the second processors for subsequent communication between the first and the second processors via the processor link.

Type: Grant

Filed: October 25, 2011

Date of Patent: September 2, 2014

Assignee: International Business Machines Corporation

Inventors: Robert W. Berry, Jr., Anand Haridass, Prasanna Jayaraman
Background Collective Operation Management In A Parallel Computer

Publication number: 20140244974

Abstract: Background collective operation management in a parallel computer, the parallel computer including one or more compute nodes operatively coupled for data communications over one or more data communications networks, including: determining, by a management availability module, whether a compute node in the parallel computer is available to perform a background collective operation management task; responsive to determining that the compute node is available to perform the background collective operation management task, determining, by the management availability module, whether the compute node has access to sufficient resources to perform the background collective operation management task; and responsive to determining that the compute node has access to sufficient resources to perform the background collective operation management task, initiating, by the management availability module, execution of the background collective operation management task.

Type: Application

Filed: March 5, 2013

Publication date: August 28, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: INTERNATIONAL BUSINESS MACHINES CORPORATION
Parallel processing using plural processing modules when processing time including parallel control overhead time is determined to be less than serial processing time

Patent number: 8819396

Abstract: A data processing apparatus includes an output unit. The output unit determines, when parallel control is performed in a data processor created in the data processing apparatus so that plural processing modules forming the data processor perform data processing in parallel, on the basis of a value representing a parallel-processing time for which at least two processing modules are operated in parallel and a value representing a control time, which is not necessary when serial control is performed so that the processing modules serially perform data processing but which is necessary when the parallel control is performed so that the processing modules perform data processing in parallel, whether a time necessary to complete data processing performed by the data processor under the parallel control would be shorter than a time necessary to complete data processing performed by the data processor under the serial control, and outputs a determination result.

Type: Grant

Filed: September 1, 2011

Date of Patent: August 26, 2014

Assignee: Fuji Xerox Co., Ltd.

Inventor: Ryoko Usuba
Precise-Restartable Parallel Execution of Programs

Publication number: 20140229704

Abstract: Interrupt handling on a multiprocessor computer executing multiple computational operations in parallel is provided by establishing a total ordering of the multiple computational operations and defining an architectural state at the time of the interrupt as if the computational operations executed in the total ordering.

Type: Application

Filed: February 13, 2013

Publication date: August 14, 2014

Applicant: WISCONSIN ALUMNI RESEARCH FOUNDDATION

Inventor: WISCONSIN ALUMNI RESEARCH FOUNDATION
ANALOG PROCESSOR COMPRISING QUANTUM DEVICES

Publication number: 20140229705

Abstract: Analog processors for solving various computational problems are provided. Such analog processors comprise a plurality of quantum devices, arranged in a lattice, together with a plurality of coupling devices. The analog processors further comprise bias control systems each configured to apply a local effective bias on a corresponding quantum device. A set of coupling devices in the plurality of coupling devices is configured to couple nearest-neighbor quantum devices in the lattice. Another set of coupling devices is configured to couple next-nearest neighbor quantum devices. The analog processors further comprise a plurality of coupling control systems each configured to tune the coupling value of a corresponding coupling device in the plurality of coupling devices to a coupling. Such quantum processors further comprise a set of readout devices each configured to measure the information from a corresponding quantum device in the plurality of quantum devices.

Type: Application

Filed: February 7, 2014

Publication date: August 14, 2014

Applicant: D-Wave Systems Inc.

Inventors: Alexander Maassen van den Brink, Peter Love, Mohammad H. S. Amin, Geordie Rose, David Grant, Miles F. H. Steininger, Paul I. Bunyk, Andrew J. Berkley
MEMORY MANAGEMENT IN A STREAMING APPLICATION

Publication number: 20140215184

Abstract: One embodiment is directed to a method for processing a stream of tuples. The method may include receiving a stream of tuples to be processed by a plurality of processing elements operating on one or more computer processors. Each of the processing elements has an associated memory space. In addition, the method may include monitoring the plurality of processing elements. The monitoring may include identifying a first performance metric for a first processing element. The method may include modifying the first processing element based on the first performance metric. The modifying of the first processing element may include employing memory management of the associated memory space.

Type: Application

Filed: February 22, 2013

Publication date: July 31, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: INTERNATIONAL BUSINESS MACHINES CORPORATION
DYNAMICALLY CONTROLLING INTERCONNECT FREQUENCY IN A PROCESSOR

Publication number: 20140208141

Abstract: In one embodiment, the present invention includes a method for determining whether a number of stalled cores of a multicore processor is greater than a stall threshold. If so, a recommendation may be made that an operating frequency of system agent circuitry of the processor be increased. Then based on multiple recommendations, a candidate operating frequency of the system agent circuitry can be set. Other embodiments are described and claimed.

Type: Application

Filed: March 13, 2012

Publication date: July 24, 2014

Inventors: Malini K. Bhandaru, Ankush Varma, James R. Vash, Monica Wong-Chan, Eric J. Dehaemer, Christopher Allan Poirier, SR., Scott P. Bobholz
OPTIMAL LOGICAL PROCESSOR COUNT AND TYPE SELECTION FOR A GIVEN WORKLOAD BASED ON PLATFORM THERMALS AND POWER BUDGETING CONSTRAINTS

Publication number: 20140189302

Abstract: A processor includes multiple physical cores that support multiple logical cores of different core types, where the core types include a big core type and a small core type. A multi-threaded application includes multiple software threads are concurrently executed by a first subset of logical cores in a first time slot. Based on data gathered from monitoring the execution in the first time slot, the processor selects a second subset of logical cores for concurrent execution of the software threads in a second time slot. Each logical core in the second subset has one of the core types that matches the characteristics of one of the software threads.

Type: Application

Filed: December 28, 2012

Publication date: July 3, 2014

Inventors: Dheeraj R. Subbareddy, Ganapati N. Srinivasa, David A. Koufaty, Scott D. Hahn, Mishali Naik, Paolo Narvaez, Abirami Prabhakaran, Eugene Gorbatov, Alon Naveh, Inder M. Sodhi, Eliezer Weissmann, Paul Brett, Gaurav Khanna, Russell J. Fenger
Processing Core Having Shared Front End Unit

Publication number: 20140189300

Abstract: A processor having one or more processing cores is described. Each of the one or more processing cores has front end logic circuitry and a plurality of processing units. The front end logic circuitry is to fetch respective instructions of threads and decode the instructions into respective micro-code and input operand and resultant addresses of the instructions. Each of the plurality of processing units is to be assigned at least one of the threads, is coupled to said front end unit, and has a respective buffer to receive and store microcode of its assigned at least one of the threads.

Type: Application

Filed: December 28, 2012

Publication date: July 3, 2014

Inventors: Name ILAN PARDO, DROR MARKOVICH, OREN BEN-KIKI, YUVAL YOSEF
HIGH DYNAMIC RANGE SOFTWARE-TRANSPARENT HETEROGENEOUS COMPUTING ELEMENT PROCESSORS, METHODS, AND SYSTEMS

Publication number: 20140189301

Abstract: A processor of an aspect includes at least one lower processing capability and lower power consumption physical compute element and at least one higher processing capability and higher power consumption physical compute element. Migration performance benefit evaluation logic is to evaluate a performance benefit of a migration of a workload from the at least one lower processing capability compute element to the at least one higher processing capability compute element, and to determine whether or not to allow the migration based on the evaluated performance benefit. Available energy and thermal budget evaluation logic is to evaluate available energy and thermal budgets and to determine to allow the migration if the migration fits within the available energy and thermal budgets. Workload migration logic is to perform the migration when allowed by both the migration performance benefit evaluation logic and the available energy and thermal budget evaluation logic.

Type: Application

Filed: December 28, 2012

Publication date: July 3, 2014

Inventors: Eugene Gorbatov, Alon Naveh, Inder M. Sodhi, Ganapati N. Srinivasa, Eliezer Weissmann, Guarav Khanna, Mishali Naik, Russell J. Fenger, Andrew D. Henroid, Dheeraj R. Subbareddy, David A. Koufaty, Paolo Narvaez
CONFIGURABLE RING NETWORK

Publication number: 20140189298

Abstract: A apparatus and computing device for providing a configurable ring network are provided herein. The apparatus includes logic to configure a ring processor for each of a plurality of processing elements, and logic to network each ring processor, wherein each ring processor communicates with other ring processors using a set of commands.

Type: Application

Filed: December 27, 2012

Publication date: July 3, 2014

Inventors: Teresa Morrison, Scott A. Krig
HETERGENEOUS PROCESSOR APPARATUS AND METHOD

Publication number: 20140189299

Abstract: A heterogeneous processor architecture is described. For example, a processor according to one embodiment of the invention comprises: a set of large physical processor cores; a set of small physical processor cores having relatively lower performance processing capabilities and relatively lower power usage relative to the large physical processor cores; virtual-to-physical (V-P) mapping logic to expose the set of large physical processor cores to software through a corresponding set of virtual cores and to hide the set of small physical processor core from the software.

Type: Application

Filed: December 28, 2012

Publication date: July 3, 2014

Inventors: Paolo Narvaez, Ganapati N. Srinivasa, Eugene Gorbatov, Dheeraj R. Subbareddy, Mishali Naik, Alon Naveh, Abirami Prabhakaran, Eliezer Weissmann, David A. Koufaty, Paul Brett, Scott D. Hahn, Andrew J. Herdrich, Ravishankar Iyer, Nagabhushan Chitlur, Inder M. Sodhi, Gaurav Khanna, Russell J. Fenger
PARALLEL PROCESSING USING MULTI-CORE PROCESSOR

Publication number: 20140181470

Abstract: Disclosed are methods, systems, paradigms and structures for processing data packets in a communication network by a multi-core network processor. The network processor includes a plurality of multi-threaded core processors and special purpose processors for processing the data packets atomically, and in parallel. An ingress module of the network processor stores the incoming data packets in the memory and adds them to an input queue. The network processor processes a data packet by performing a set of network operations on the data packet in a single thread of a core processor. The special purpose processors perform a subset of the set of network operations on the data packet atomically. An egress module retrieves the processed data packets from a plurality of output queues based on a quality of service (QoS) associated with the output queues, and forwards the data packets towards their destination addresses.

Type: Application

Filed: December 2, 2013

Publication date: June 26, 2014

Applicant: Unbound Networks, Inc.

Inventors: Damon Finney, Ashok Mathur
Adaptive Data Collection Practices in a Multi-Processor Device

Publication number: 20140181471

Abstract: Adaptive data collection practices in a multi-processor device. The device may include a first processor and a second processor. The first processor may operate in any of a plurality of power states. The first processor may indicate to the second processor when it transitions to a different power state. The second processor may collect information relating to its operation. The second processor may collect the information according to different information collecting modes depending on in which power state the first processor is operating. Less information may be collected in an information collecting mode corresponding to a lower power state of the first processor than in an information collecting mode corresponding to a higher power state of the first processor.

Type: Application

Filed: December 20, 2013

Publication date: June 26, 2014

Applicant: APPLE INC.

Inventors: Ben-Heng Juang, Arjuna Sivasithambaresan, Jesus A. Gutierrez Gomez, Karthik Anantharaman, Srinivasan Nimmala
SELECTION OF A PRIMARY MICROPROCESSOR FOR INITIALIZATION OF A MULTIPROCESSOR SYSTEM

Publication number: 20140173250

Abstract: Embodiments of the present invention provide a method for initializing a plurality of processors of a multi-processor system by executing, at each respective processor of the plurality of processors, at least a portion of local initialization code stored on the respective processor. Receiving, at a designated processor of the plurality of processors, external initialization code stored in external memory, wherein the remainder of the plurality of processors do not have access to the external initialization code stored in external memory. Determining, the designated processor, send at least a portion of the external initialization code to a processor of the remainder of the plurality of processors.

Type: Application

Filed: December 19, 2012

Publication date: June 19, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Van H. Lee, David D. Sanner, Thi N. Tran
Performing Frequency Coordination In A Multiprocessor System Based On Response Timing Optimization

Publication number: 20140173248

Abstract: In an embodiment, a processor includes a core to execute instructions and a logic to receive memory access requests from the core and to route the memory access requests to a local memory and to route snoop requests corresponding to the memory access requests to a remote processor. The logic is configured to maintain latency information regarding a difference between receipt of responses to the snoop requests from the remote processor and receipt of responses to the memory access requests from the local memory. Other embodiments are described and claimed.

Type: Application

Filed: December 17, 2012

Publication date: June 19, 2014

Inventors: Ankush Varma, Krishnakanth V. Sistla
SYSTEM AND METHOD FOR CONNECTING A SYSTEM ON CHIP PROCESSOR AND AN EXTERNAL PROCESSOR

Publication number: 20140173249

Abstract: A system and method are provided for connecting a system on chip (SoC) processor and an external processor. The SoC processor receives as input a content stream, and processes the content stream. Further, the application processor that is connected to the SoC processor receives the processed content stream, performs further processing on the processed content stream, and outputs the further processed content stream hack to the SoC processor.

Type: Application

Filed: December 18, 2012

Publication date: June 19, 2014

Applicant: NVIDIA CORPORATION

Inventors: Thomas F. Fox, Gerrit A. Slavenburg
SELECTION OF A PRIMARY MICROPROCESSOR FOR INITIALIZATION OF A MULTIPROCESSOR SYSTEM

Publication number: 20140173251

Abstract: Embodiments of the present invention provide a method for initializing a plurality of processors of a multi-processor system by executing, at each respective processor of the plurality of processors, at least a portion of local initialization code stored on the respective processor. Receiving, at a designated processor of the plurality of processors, external initialization code stored in external memory, wherein the remainder of the plurality of processors do not have access to the external initialization code stored in external memory. Determining, the designated processor, send at least a portion of the external initialization code to a processor of the remainder of the plurality of processors.

Type: Application

Filed: January 9, 2014

Publication date: June 19, 2014

Applicant: International Business Machines Corporation

Inventors: Van H. Lee, David D. Sanner, Thi N. Tran
Collective acceleration unit tree structure

Patent number: 8756270

Abstract: A mechanism is provided in a collective acceleration unit for performing a collective operation to distribute or collect data among a plurality of participant nodes. The mechanism receives an input collective packet for a collective operation from a neighbor node within a collective tree. The input collective packet comprises a tree identifier and an input data field and wherein the collective tree comprises a plurality of sub trees. The mechanism maps the tree identifier to an index within the collective acceleration unit. The index identifies a portion of resources within the collective acceleration unit and is associated with a set of neighbor nodes in a given sub tree within the collective tree. For each neighbor node the collective acceleration unit stores destination information. The collective acceleration unit performs an operation on the input data field using the portion of resources to effect the collective operation.

Type: Grant

Filed: April 24, 2012

Date of Patent: June 17, 2014

Assignee: International Business Machines Corporation

Inventors: Lakshminarayana B. Arimilli, Bernard C. Drerup, Paul F. Lecocq, Hanhong Xue
PROCESSING SYSTEM WITH SYNCHRONIZATION INSTRUCTION

Publication number: 20140164735

Abstract: Embodiments of a multi-processor array are disclosed that may include a plurality of processors, and controllers. Each processor may include a plurality of processor ports and a sync adapter. Each sync adapter may include a plurality of adapter ports. Each controller may include a plurality of controller ports, and a configuration port. The plurality of processors and the plurality of controllers may be coupled together in an interspersed arrangement, and the controllers may be distinct from the processors. Each processor may be configured to send a synchronization signal through its adapter ports to one or more controllers, and to pause execution of program instructions while waiting for a response from the one or more controllers.

Type: Application

Filed: October 10, 2013

Publication date: June 12, 2014

Applicant: COHERENT LOGIX, INCORPORATED

Inventors: Carl S. Dobbs, Afzal M. Malik, Kenneth R. Faulkner, Michael B. Solka
Collective acceleration unit tree structure

Patent number: 8751655

Abstract: A mechanism is provided in a collective acceleration unit for performing a collective operation to distribute or collect data among a plurality of participant nodes. The mechanism receives an input collective packet for a collective operation from a neighbor node within a collective tree. The input collective packet comprises a tree identifier and an input data field and wherein the collective tree comprises a plurality of sub trees. The mechanism maps the tree identifier to an index within the collective acceleration unit. The index identifies a portion of resources within the collective acceleration unit and is associated with a set of neighbor nodes in a given sub tree within the collective tree. For each neighbor node the collective acceleration unit stores destination information. The collective acceleration unit performs an operation on the input data field using the portion of resources to effect the collective operation.

Type: Grant

Filed: March 29, 2010

Date of Patent: June 10, 2014

Assignee: International Business Machines Corporation

Inventors: Lakshminarayana B. Arimilli, Bernard C. Drerup, Paul F. Lecocq, Hanhong Xue
Scheduler of reconfigurable array, method of scheduling commands, and computing apparatus

Patent number: 8745608

Abstract: A scheduler of a reconfigurable array, a method of scheduling commands, and a computing apparatus are provided. To perform a loop operation in a reconfigurable array, a recurrence node, a producer node, and a predecessor node are detected from a data flow graph of the loop operation such that resources are assigned to such nodes so as to increase the loop operating speed. Also, a dedicated path having a fixed delay may be added to the assigned resources.

Type: Grant

Filed: February 1, 2010

Date of Patent: June 3, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Won-sub Kim, Tae-wook Oh, Bernhard Egger
Transferring data in a parallel processing environment

Patent number: 8745604

Abstract: An integrated circuit includes a plurality of tiles. Each tile includes a processor, a switch including switching circuitry to forward data over data paths from other tiles to the processor and to switches of other tiles, and a switch memory that stores instruction streams that are able to operate independently for respective output ports of the switch.

Type: Grant

Filed: February 25, 2008

Date of Patent: June 3, 2014

Assignee: Massachusetts Institute of Technology

Inventor: Anant Agarwal
SYSTEMS AND METHOD FOR ASSIGNING EXECUTABLE FUNCTIONS TO AVAILABLE PROCESSORS IN A MULTIPROCESSING ENVIRONMENT

Publication number: 20140149716

Abstract: A computer implemented method for assigning executable functions to available processors in a Multiprocessor environment comprising (as an example of a collection of different processing architectures) one or more CPUs and one or more GPUs, the method comprising: providing an input source comprising instructions and data; breaking the input source into data oriented cell and interface objects with processing attributes; assigning the cell and interface objects to one or more of the GPUs and CPUs based on processing attributes and the multiprocessor environment; and producing output data.

Type: Application

Filed: December 23, 2011

Publication date: May 29, 2014

Applicant: University of New Brunswick

Inventor: Andrew Gerber
Processing System With Interspersed Processors With Multi-Layer Interconnect

Publication number: 20140143520

Abstract: Embodiments of a multi-processor array are disclosed that may include a plurality of processors and configurable communication elements coupled together in a interspersed arrangement. Each configurable communication element may include a local memory and a plurality of routing engines. The local memory may be coupled to a subset of the plurality of processors. Each routing engine may be configured to receive one or more messages from a plurality of sources, assign each received message to a given destination of a plurality of destinations dependent upon configuration information, and forward each message to assigned destination. The plurality of destinations may include the local memory, and routing engines included in a subset of the plurality of configurable communication elements.

Type: Application

Filed: March 27, 2013

Publication date: May 22, 2014

Applicant: Coherent Logix, Incorporated

Inventors: Carl S. Dobbs, Michael R. Trocino, Michael B. Solka
EXPOSING HOST OPERATING SYSTEM SERVICES TO AN AUXILLARY PROCESSOR

Publication number: 20140136817

Abstract: Aspect methods, systems and devices may be configured to perform two-way and/or reverse procedure calls in a computing device or across a network to offload the bulk of processing operations from a general purpose processor to an auxiliary processor, while perform operations that require access to context information locally on the general purpose processor (e.g., application processor, CPU, etc.). The two-way and/or reverse procedure calls allow an auxiliary processor to perform operations that include subroutines that require access to an application processor's or a calling process's context information, without requiring the calling process to send the context information to the auxiliary processor (e.g., as part of the procedure call/method invocation, etc.).

Type: Application

Filed: December 20, 2012

Publication date: May 15, 2014

Applicant: QUALCOMM INCORPORATED

Inventors: Anatoly E. Yakovenko, Ramesh Chandrasekhar
TRACKING AND RECLAIMING PHYSICAL REGISTERS

Publication number: 20140129804

Abstract: A method and apparatus for tracking and reclaiming physical registers is presented. Some embodiments of the apparatus include rename logic configurable to map architectural registers to physical registers. The rename logic is configurable to bypass allocation of a physical register to an architectural register when information to be written to the architectural register satisfies a bypass condition. Some embodiments of the apparatus also include a plurality of first bits associated with the architectural registers. The rename logic is configurable to set one of the first bits to indicate that allocation of a physical register to the corresponding architectural register has been bypassed.

Type: Application

Filed: November 2, 2012

Publication date: May 8, 2014

Inventor: John M. King
Generating And Communicating Platform Event Digests From A Processor Of A System

Publication number: 20140122834

Abstract: In an embodiment, a processor includes a plurality of counters each to provide a count of a performance metric of at least one core of the processor, a plurality of threshold registers each to store a threshold value with respect to a corresponding one of the plurality of counters, and an event logic to generate an event digest packet including a plurality of indicators each to indicate whether an event occurred based on a corresponding threshold value and a corresponding count value. Other embodiments are described and claimed.

Type: Application

Filed: October 30, 2012

Publication date: May 1, 2014

Inventors: Mrittika Ganguli, Tessil Thomas, Vinila Rose, Hussam Mousa, Moham J. Kumar
Storage system comprising multiple microprocessors and method for sharing processing in this storage system

Patent number: 8713288

Abstract: The present invention provides a storage system in which each microprocessor is able to execute synchronous processing and asynchronous processing in accordance with the operating status of the storage system. Any one attribute, from among multiple attributes (operating modes) prepared beforehand, is set in each microprocessor in accordance with the operating status of the storage system. The attribute that is set in each microprocessor is regularly reviewed and changed.

Type: Grant

Filed: June 17, 2010

Date of Patent: April 29, 2014

Assignee: Hitachi, Ltd.

Inventors: Tomohiro Yoshihara, Shintaro Kudo, Norio Shimozono
Starting other processing elements (OPEs) responsive to a determination that currently-running OPEs are inadequate to invoke a library for a multi-library application

Patent number: 8694756

Abstract: A mechanism is provided for invoking multi-library application on a multiple processor system, wherein the multiple processor system comprises a Power Processing Element (PPE) and a plurality of Synergistic Processing Element (SPE). Applications including multi-libraries run in the memory of the PPEs. The mechanism comprises maintaining the status of each SPE in the applications running on the PPE, where there are SPE agents for capturing the instructions from the PPE in the SPEs that have been started. In response to a request for invoking a library, the PPE determines whether the number of available SPEs for invoking the library is adequate based on the current status of SPEs. If the number of available SPEs is adequate, the PPE sends a run instruction to selected SPEs. After finishing the invocation of all libraries, the PPE sends termination instructions to all started SPEs.

Type: Grant

Filed: April 20, 2012

Date of Patent: April 8, 2014

Assignee: International Business Machines Corporation

Inventors: Hui Li, Hong Bo Peng, Bai Ling Wang
System for data collection from processing elements in a SIMD processor

Patent number: 8688958

Abstract: A processor has a plurality of PEs (processing elements) that operate in parallel based on operation commands and an information collection unit that collects the data of the plurality of PEs, wherein each of the plurality of PEs holds data and a condition flag, supplies the data and the condition flag to the information collection unit upon receiving an operation command, and upon receiving an update request for updating the condition flag, updates the condition flag in accordance with the update request that was received; and the information collection unit, upon receiving the data and the condition flags, selects one PE based on a predetermined order of priority from among the PEs for which the received condition flags are active and both supplies the data of the selected PE as collection result data and supplies an update request for updating the condition flag of the PE that was selected.

Type: Grant

Filed: January 14, 2010

Date of Patent: April 1, 2014

Assignee: NEC Corporation

Inventor: Shohei Nomoto
PROCESSOR HAVING MULTIPLE CORES, SHARED CORE EXTENSION LOGIC, AND SHARED CORE EXTENSION UTILIZATION INSTRUCTIONS

Publication number: 20140089635

Abstract: An apparatus of an aspect includes a plurality of cores and shared core extension logic coupled with each of the plurality of cores. The shared core extension logic has shared data processing logic that is shared by each of the plurality of cores. Instruction execution logic, for each of the cores, in response to a shared core extension call instruction, is to call the shared core extension logic. The call is to have data processing performed by the shared data processing logic on behalf of a corresponding core. Other apparatus, methods, and systems are also disclosed.

Type: Application

Filed: September 27, 2012

Publication date: March 27, 2014

Inventors: Eran Shifer, Mostafa Hagog, Eliyahu Turiel
OPERATING METHOD FOR A COMPUTER SYSTEM

Publication number: 20140075155

Abstract: A method for operating a computer system, in which a plurality of processes run, within the scope of which a multiplicity of objects are accessed. The method initially predefines, for the subsequent continued operation, which of the processes can be used to access which of the objects. The processes may be distributed among a plurality of processors of a multiprocessor system, and it is initially predefined which of the processors can address which objects.

Type: Application

Filed: February 23, 2012

Publication date: March 13, 2014

Applicant: HighTec EDV-Systeme GmbH

Inventor: Horst Lehser
Group based allocation of network bandwidth

Patent number: 8661138

Abstract: The present invention extends to methods, systems, and computer program products for group based allocation of terminal server network bandwidth. Output packets are classified into groups based on classification criteria. Output packets for each group are queue into a corresponding queue. During a queue flush cycle each queue containing data is flushed for an essentially equal amount of time. Flushing each queue essentially equally reduces the negative impact that can otherwise result when a subset of sessions (or even a single session) request(s) a disproportional share of terminal server network bandwidth. Responsiveness can be further increased by distributing the essentially equal amount for each queue across the queue flush cycle.

Type: Grant

Filed: January 9, 2012

Date of Patent: February 25, 2014

Assignee: Microsoft Corporation

Inventors: Ara Bernardi, Nelamangal Krishnaswamy Srinivas, Ashwin Palekar
Application-based specialization for computing nodes within a distributed processing system

Patent number: 8656355

Abstract: A distributed processing system is described that employs “application-based” specialization. In particular, the distributed processing system is constructed as a collection of computing nodes in which each computing node performs a particular processing role within the operation of the overall distributed processing system. Each of the computing nodes includes an operating system, such as the Linux operating system, and includes a plug-in software module to provide a distributed memory operating system that employs the role-based computing techniques. An administration node maintains a database that defines a plurality of application roles. Each role is associated with a software application, and specifies a set of software components necessary for execution of the software application. The administration node deploys the software components to the application nodes in accordance with the application roles associates with each of the application nodes.

Type: Grant

Filed: April 2, 2012

Date of Patent: February 18, 2014

Assignee: CA, Inc.

Inventors: Steven M. Oberlin, David W. McAllister
MOBILE TERMINAL AND CONTROL METHOD THEREOF

Publication number: 20140032880

Abstract: A mobile terminal and a method for controlling the mobile terminal are disclosed. A mobile terminal according to one embodiment of the present invention comprises at least one sensor; a first processor for controlling operation of the at least one sensor; a second processor for controlling an application; and a vibration unit detecting a force applied by the user, where the vibration unit is woken up when a force applied by the user exceeds a predetermined magnitude while the at least one sensor, the first processor, the second processor, and the vibration unit are all in a sleep state; and if the first processor is woken up by the vibration unit, the first processor wakes up the second processor based on sensing data collected by the at least one sensor.

Type: Application

Filed: July 23, 2013

Publication date: January 30, 2014

Applicant: LG ELECTRONICS INC.

Inventor: Hokyung KA
PROCESSOR AND CONTROL METHOD THEREOF

Publication number: 20140025925

Abstract: An processor includes: multiple arithmetic processing sections to execute arithmetic processing; and multiple registers provided for the multiple arithmetic processing sections. A register value of a register of the multiple registers corresponding to a given one of the multiple arithmetic processing sections is changed if program execution by the given one of the multiple arithmetic processing sections reaches a predetermined location in a program, and priorities of the arithmetic processing sections are dynamically determined in response to register values of the registers.

Type: Application

Filed: June 3, 2013

Publication date: January 23, 2014

Inventor: YUJI KONDO
Source Code Level Multistage Scheduling Approach for Software Development and Testing for Multi-Processor Environments

Publication number: 20140006751

Abstract: In one embodiment, a heterogeneous multi-processor computer system includes (i) a plurality of dedicated processors (DPs), each DP configured to implement one or more program modules during runtime operations; (ii) two or more control processors (CPs), each CP configured to run scheduling software for controlling the runtime operations by a corresponding subset of DPs; and (iii) one or more buses interconnecting the DPs and CPs. Each CP is configured to vary timing of implementation of the program modules for the corresponding subset of DPs based on resource availability, and each CP is configured to vary timing of data transfers by the corresponding subset of DPs based on resource availability.

Type: Application

Filed: January 24, 2013

Publication date: January 2, 2014

Applicant: LSI CORPORATION

Inventors: Pavel Aleksandrovich Aliseychik, Petrus Sebastiaan Andrianus Daniel Evers, Denis Vasilevich Parfenov, Alexander Nikolaevich Filippov, Denis Vladimirovich Zaytsev
MANAGING COMPUTING RESOURCES THROUGH AGGREGATED CORE MANAGEMENT

Publication number: 20140006749

Abstract: Systems, methods and computer program products manage computing resources for a system. A system includes a set of processors having multiple processor cores present in the computer system, where the processor cores form an aggregated set of processor cores for the system. A dependency analyzer determines dependencies among a set of workload components executing on the set of processor cores. A policy includes rules associated with managing one or more of power consumption, heat production, operating cost or workload balancing for the set of aggregated processor cores. In response to a workload event, a management component sets a state of one or more of the processor cores in accordance with the workload event, the policy and the set of dependencies.

Type: Application

Filed: June 29, 2012

Publication date: January 2, 2014

Applicant: International Business Machines Corporation

Inventors: James C. Fletcher, Bala Rajaraman
3-D STACKED MULTIPROCESSOR STRUCTURES AND METHODS TO ENABLE RELIABLE OPERATION OF PROCESSORS AT SPEEDS ABOVE SPECIFIED LIMITS

Publication number: 20140006750

Abstract: A three-dimensional (3-D) processor system includes a first processor chip and a second processor chip in a stacked configuration. The first processor chip includes a first processor having a first set of state registers. The second processor chip includes a second processor having a second set of state registers that corresponds to the first set of state registers. The first and second processors are connected through vertical connections between the first and second processor chips. A mode control circuit operates the processor system in one of a plurality of operating modes. In one mode of operation, the first processor is active and the second processor is inactive, and the first processor operates at a speed greater than a maximum safe speed of the first processor, and the first processor uses the second set of state registers of the second processor to checkpoint a state of the first processor.

Type: Application

Filed: September 4, 2012

Publication date: January 2, 2014

Applicant: International Business Machines Corporation

Inventors: Alper Buyuktosunoglu, Philip G. Emma, Allan M. Hartstein, Michael B. Healy, Krishnan K. Kailas

prev 1 2 3 4 5 6 7 … next