Magnetoresistive memory for a complex programmable logic device
The present invention is directed to magnetoresistive memory and data storage devices. A system for providing distributed functionality in an electronic environment includes a plurality of platforms suitable for providing a logic function. The platforms include embedded programmable logic, and MRAM memory, the logic and MRAM memory communicatively coupled via an interconnect.
[0001] The present application is a divisional of U.S. patent application Ser. No. 10/061,660, filed Feb. 1, 2002, which is herein incorporated by reference in its entirety.
FIELD OF THE INVENTION[0002] The present invention generally relates to the field of semiconductors and semiconductor design, and particularly to an architecture for a configurable platform including magnetoresistive memory.
BACKGROUND OF THE INVENTION[0003] Integrated circuits have become a necessary part of everyday modem society. From wireless phones and information handling systems, to household appliances and data storage systems, a wide range of integrated circuits are utilized to provide a broad range of functionality. To provide this functionality, integrated circuits may need to be specialized to have the functions necessary to achieve the desired results, such as through the provision of an application specific integrated circuit (ASIC). An ASIC is typically optimized for a given function set, thereby enabling the circuit to perform the functions in an optimized manner. However, there may be a wide variety of end-users desiring such targeted functionality, with each user desiring different functionality for different uses.
[0004] Additionally, more and more functions are being included within each integrated circuit. While providing a semiconductor device that includes a greater range of functions supported by the device, inclusion of this range further complicates the design and increases the complexity of the manufacturing process. Further, such targeted functionality may render the device suitable for a narrow range of consumers, thereby at least partially removing an “economy of scale” effect that may be realized by selling greater quantities of the device.
[0005] Thus, the application specific integrated circuit business is confronted by the contradiction that the costs of design and manufacture dictate high volumes of complex designs. Because of this, the number of companies fielding such custom designs is dwindling in the face of those rapidly escalating costs.
[0006] Further, there are limited options, previously, for storing state information on a platform device. In order to give the platform device state, a platform utilizes memory that may be written to and read from to configure a platform or groups of platforms.
[0007] Therefore, it would be desirable to provide a platform architecture of the present invention.
SUMMARY OF THE INVENTION[0008] Accordingly, the present invention is directed to an architecture. In a first aspect of the present invention, a system for providing distributed functionality in an electronic environment includes a plurality of platforms suitable for providing a logic function. The platforms include embedded programmable logic and MRAM memory, the logic and MRAM memory communicatively coupled via an interconnect.
[0009] In a second aspect of the present invention, a data storage system includes an electronic data storage device suitable for the storage of electronic data, an MRAM memory operable for the storage of electronic data utilizing a magnetoresistive effect and a controller communicatively coupled to the data storage device and the MRAM memory. The controller is suitable for controlling operations of the data storage device, wherein the controller receives data for writing to the electronic data storage device, the data is stored utilizing the MRAM memory before writing to the electronic data storage
[0010] It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention as claimed. The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate an embodiment of the invention and together with the general description, serve to explain the principles of the invention.
BRIEF DESCRIPTION OF THE DRAWINGS[0011] The numerous advantages of the present invention may be better understood by those skilled in the art by reference to the accompanying figures in which:
[0012] FIG. 1 is an illustration of an embodiment of the present invention wherein a platform operable to embodiment embody the present invention is shown;
[0013] FIG. 2 is an illustration of an embodiment of the present invention wherein a plurality of platforms as shown in FIG. 1 are provided in a fabric;
[0014] FIG. 3 is an illustration of an embodiment of the present invention wherein a plurality of platforms incorporating MRAM in a sea of platforms architecture is shown; and
[0015] FIG. 4 is an illustration of an electronic data storage system including MRAM.
DETAILED DESCRIPTION OF THE INVENTION[0016] Reference will now be made in detail to the presently preferred embodiments of the invention, examples of which are illustrated in the accompanying drawings.
[0017] Referring generally now to FIGS. 1 through 4, exemplary embodiments of the present invention are shown. The present invention provides an architecture for an element for use as a design component in application specific integrated circuit (ASIC) or semiconductor design. Technical questions involving complex device design may be thought of in two general areas, metamethodology and platform architecture. Metamethodology is a formal organizing architecture for defining and managing arbitrary semiconductor design flows in predictable, efficient ways, which may be tailored to specific product and process characteristics. For instance, different flows may have different combinations of tools in them, and thus, the successful operations may require the imposition of automated rule-based assessment of progress and error in order to allow designers to work productively at a higher level of abstraction than is available today.
[0018] As semiconductor designs progress along the deep sub-micron roadmap, and as the challenge of managing complexity becomes ever greater, the need to define building blocks of designs at higher levels of abstraction becomes even more pressing. Such a higher-level building block includes a “platform.”
[0019] Referring now to FIG. 1, an embodiment 100 of the present invention is shown wherein elements of an exemplary platform are provided. A platform may include a combination of the following elements: (1) embedded programmable logic 102, which in some contemplated embodiments is analogous to field programmable gate array (FPGA) or complex programmable logic device (CPLD) cores that FPGA companies sell as complete devices; (2) reconfigurable cores 104 such as a findamental processor elements to which may be added instruction-specialized, application-specific instruction set extensions; (3) an advanced interconnect 106, which in contemplated embodiments is scalable, and may be isochronous; (4) software models and heuristics 108; and (5) specialized memories 110, which may include nonvolatile structures like MRAM, which is a memory that is based on the magneto-resistive effect, as well as other memories as contemplated by a person of ordinary skill in the art. Specialized one-time programmable flash memory may also be included.
[0020] Programmable logic 102 components may include blocks of programmable gate arrays, “seas of adders”, CPLD structures, and other suitable programmable circuit elements definable from a stored representation, either at power-on, dynamically while in operation, and the like.
[0021] Reconfigurable cores 104 may include a base processor design, plus instruction set extensions designed to carry out function-specific logical and arithmetic operations with optimal efficiency. For instance, such reconfigurable cores may implement digital signal processing instruction set enhancements.
[0022] An interconnect 106 architecture is provided to allow the programmable logic and the reconfigurable cores to communicate with one another and with associated memory blocks. Such an architecture may define a transport which is scalable in bandwidth and is inherently isochronous. Further, it may be realizable within a switching fabric, which permits complex, adaptive, interconnect and access paths to be defined on the fly. Isochrony, through a universal time base, may simplify the problems of closing timing in complex designs.
[0023] The software for the platform may include the development environment and its interface to the metamethodology, as well as software IP cores, which may be implemented on the platform components, and the customer-developed custom code implementing proprietary or functionally specific routines.
[0024] There are numerous ways of structuring logic blocks: such as the number of logic elements in an organization; the kinds of cores and the numbers of cores; whether or not DSP-specific characteristics are included, and the like. Additionally, the characteristics of the interconnect, software, and memory may all be varied in innumerable different ways without departing from the spirit and scope of the present invention.
[0025] The present invention provides a platform, which may be embedded in a methodological framework that allows designers to work with the platform at a high level of abstraction. The complexity of the interactions between elements may so great that by providing abstractions, the interactions may be rendered manageable and tractable for designers that will use them practically.
[0026] Referring now to FIG. 2, an embodiment 200 of the present invention is shown wherein an architecture is configured as a sea of platforms. In this embodiment, a fabric is shown including a plurality of platforms. Preferably, the platforms are connected utilizing similar protocols, interconnect technology and interconnect architecture to unify the platforms in a single fabric.
[0027] Thus, a structure of the resources may be provided including the memory, reconfigurable cores, embedded programmable blocks, software and interconnect, which communicates and intercommunicates coherently on an isochronous fabric. Such a structure may be suitable for providing a “programmable ASIC”. For instance, an application-neutral device may be constructed with potential for accepting complex logic and definitions that are programmed completely independently of the fabrication of the device. This may be thought of as a decoupled model, in which the contemplated implementation of the device as used by an end-user is decoupled from the process of designing and manufacturing the physical device itself.
[0028] Each of these intersections on this fabric, may contain one or more processors, embedded programmable logic, memory, software capabilities and its own interconnect internally, as described in relation to FIG. 1. By providing this fabric including reconfigurable cores, the present invention may provide a targeted semiconductor providing desired functionality without requiring specialized design and manufacturing processes as previously required in ASIC manufacture.
[0029] For example, a manufacturer may receive a register transfer level (RTL) definition of a solution to a problem from a customer for a specification. A customer may want to create, for instance, a communications device, a storage device, a controller, a switching product, a game controller for consumer application, a satellite TV set-up box, and the like, and supply an RTL specification for the desired device to the manufacturer.
[0030] By utilizing the present invention, the RTL may be mapped into the platform of the present invention to provide the desired functionality as indicated by the RTL. For instance, instruction set architecture extensions may be utilized for mapping to the reconfigurable core the desired functionality. The extensions may be crafted to solve efficiently and specifically problems in encryption, in encoding or decoding, in modulation, in signal processing, in data transformation of various kinds, and the like. Additionally, abstract logic functions may be implemented, such as specialized shift registers, multiplexers (MUX), and the like as contemplated by a person of ordinary skill in the art. Thus, an instruction set extension may be affiliated with a wide range of functionality.
[0031] An embedded programmable logic core (EPLC) block may be tied to a set of instruction set extensions such that the EPLC block would have responsibility under software control for invoking any of several extensions to be active in a particular temporal episode. In this way, the “personality” of a reconfigurable core may be changed dynamically under the control of this EPLC mechanism. Thus, in an aspect of the present invention, this may enable the mapping of an RTL efficiently into a sea of platforms. The constituents of one platform may allow an EPLC block to play a role when choosing appropriate instruction set extensions as needed, given the temporal evolution of the function that the block is fulfilling.
[0032] Isochronous Functionality
[0033] Referring again to FIG. 2, by providing an isochronous fabric, the register files of multiple processors in multiple platforms may be utilized as dynamically extensible. In other words, it is possible because of the isochronous characteristic of the fabric, without additional software or additional overhead, to synchronize and coordinate the instruction-set extension operations on these register files over as many platforms as needed, which may be thought of as extending horizontally across the fabric as desired to achieve the necessary resources, such as processor power and the like, to fulfill a particular complex logic function.
[0034] Arbitrary sets of logic functions may be deployed across register files, and the instruction set extensions treated as general logic engines that are reconfigurable “on-the-fly,” on a cycle-by-cycle basis. For instance, given, (a), the provision of a proper instruction set extension or extensions to coordinate discrete, distinctive, different instruction set extensions on a cycle-by-cycle basis; and (b), that the execution is synchronized, for instance, to ensure that the right instruction set extension is invoked in the right set of reconfigurable cores at the right cycle, the functions may be deployed across platforms, operating as logic engines, as needed. Thus, by knowing the functionality of a register, where that functionality is located, and the function of a register at a given point in time, logic functions may be targeted to provide the functionality.
[0035] For example, in terms of actual behavior of applications in the real world, loads vary, and functions vary as loads vary. By utilizing the present invention, spatial distribution of functions across the array may vary as a function of the dynamic changes in the functional load that is actually being asked of a particular device.
[0036] To track these changes and provide the functionality, a map may be maintained indicating the functionality of the platforms. For instance, in an aspect of the present invention, a master map is maintained of the instantaneous distribution of functions across the platforms. Such a map may be thought of as a functional virtualization, in which the map indicating corresponding functions and locations is fully virtualized. Thus, functional virtualization may be provided in addition to a general logic capability previously discussed.
[0037] Because the isochronous foundation of this embodiment of the present invention, the isochronous fabric provides coordinated synchronization without the bookkeeping or overhead which may be associated utilizing other methods. By providing a mapping of particular components, i.e. what the particular components are set up to do what particular component of a function at what point in time, desired functionality may be achieved in a coordinated fashion.
[0038] Compiler
[0039] In an additional aspect of the present invention, a smart compiler is provided which “understands” how to manage and develop a binary executable for a particular instruction set extension. Further, the compiler technology may be generalized so that it has the property of extending this understanding, so that it may track which extensions are mapped to which particular set of processors, and understands temporally the load value, i.e. the cycle by cycle availability of a resource of a particular kind.
[0040] In effect, the compiler technology extends horizontally across processor function sets, so that the compiler, when an application, methodology and like program of instructions is expressed and translated to the compiler, the compiler may determine availability of the resources. Additionally, through the use of an isochronous fabric, there is no overhead associated with altering the connections. Reconfiguring the functionality of a device employing the platforms may be accomplished through changing the map.
[0041] The compiler technology of the present invention may implement this space/time view through an arbitrarily extensible very large instruction word (VLIW) architecture that is variable. For example, although the architecture has been used in multimedia engines, the width is cycle-by-cycle variable according to this aspect of the present invention.
[0042] The advantages of such a “smart” compiler are numerous. For instance, in cache management, a variety of considerations may be accounted for, such as latency and the resultant performance penalty, associated overhead of flushing the cache versus maintaining a function in place, and the like. Thus, a compiler of the present invention may optimize operations performed by the platforms.
[0043] Therefore, in an embodiment of the present invention, a compiler is provided that is capable of maintaining space/time mapping of the instruction set extensions over an isochronous fabric so that cycle-by-cycle ability is maintained to affiliate objects and communicatively couple them as desired.
[0044] Mapping
[0045] To coordinate and provide desired functionality, a map of the present invention is provided. For instance, in an aspect of the present invention, RTL is expressed, in terms of combinations of instruction set architecture extensions and embedded programmable logic core (EPLC) blocks.
[0046] For example, in an aspect of the present invention, a map is provided for describing functions of platforms of the present invention expressed in a graph-theoretic manner. A map may be provided as a graph, for instance, employing graph coloring and with efficient graph-traversal algorithms to describe the interaction and functionality of the components.
[0047] Formalisms may be employed for expressing functions, such as MUXs, latches, codecs, and like logic functions and re-expressing the functions in terms of efficient instruction set architecture (ISA) extensions. Preferably, the extensions take into account that some components of the ISA may be modified on a cycle-by-cycle basis on the one hand and may be varied in width on the other hand.
[0048] For instance, standard library functions, such as concrete, practical, standard functions in an ASIC, may be expressed as mathematical abstracts. These mathematical abstracts may be expressed as instruction set extensions and EPLC adjuncts that will allow these instructions to be manipulated rapidly. In one contemplated embodiment, ASIC library functions are implemented with minimal overhead penalties and space penalties.
[0049] In this way, the present invention provides two degrees of freedom in finding optimal expressions of logic functions that can be deployed across the sea of platforms of the present invention.
[0050] Magnetoresistive Memory for a Complex Programmable Logic Device
[0051] The present invention may also include nonvolatile memories embedded in platforms. Traditionally, there were limited options for storing state information on a platform device, such as a device including a microcontroller, a reconfigurable core, a DSP, embedded programmable logic, and the like, which is communicatively coupled via a scalable interconnect. In order to give the platform device state, the platform needs some kind of memory that may be written to and read from to configure a chip. The state may be defined at manufacturing time, in the field, such as sending the chip out into the field, and then configuring the chip once it arrives, so that the state may be contextual, and the like as contemplated by a person of ordinary skill in the art.
[0052] In addition to defining state, it is also desirable that the platform support repair functionality. For example, in an instance of chip malfunction, a method of introducing a bug fix may be implemented, a self-healing mechanism employed that will allow the chip to reconfigure itself based on a previous definition, redefine the configuration of the chip based on changing needs, and the like. For instance, a new protocol standard may be encountered, and thus the chip may need to be updated with the new standard in the field.
[0053] Traditionally, one way of providing this platform support has been to include flash memories within a device. Flash memory is a form of writable nonvolatile memory that may be upgraded in the field. However, flash memory has several disadvantages. Flash memory is asymmetric and requires wear leveling.
[0054] For example, flash cells may wear from repeated writings. Therefore, although flash memory may be read repetitively with no harm, a memory location of flash memory may not be written a great multitude of times. There is a wear phenomenon in which voltage required to successfully write to a location gradually increases over time, and as soon as the voltage required to perform a write passes a certain threshold, the device is no longer reliably writable at that location. Thus, the pattern of writes is distributed in a random fashion, i.e. wear leveling, to reduce hot spots that become unwritable. Flash memory is also slow, expensive and is complicated to embed in an ASIC process, and thus, has severe limitations.
[0055] Thus, the provision of a reliable memory device may have a significant impact on platform architecture. For example, by providing a reliable memory, platform structures may have the ability to reliably retain state even in the absence of power, which alters how designing a platform and getting the platform to function is approached, because of the ability to have a complex instantaneously recorded state that is preserved through random power interruptions.
[0056] By not having these advantages, a variety of considerations must be addressed. For instance, a platform may be provided that has a simpler functionality because the functionality has to be 100 percent defined in the silicon, or the platform must have a secure way of returning to its desired configuration due to malfunction or other functionality interruption, the platform has to be constructed as a “perfect” device because the platform does not have the ability to fix itself, and other considerations as contemplated by a person of ordinary skill in the art. All of those constraints are lifted if a writable nonvolatile memory of the present invention is provided.
[0057] Additionally, it is expected that the proportion of a die consumed by memory will continue to increase. In other words, memory is going to occupy more and more of the die area and which is true whether utilizing a sea of platforms architecture or the more traditional processor and IP cores organization is utilized. This argues in favor of the emergence of a nonvolatile form that may retain its content even though the content may become quite large, such as multiple megabytes of data.
[0058] Therefore, the present invention provides MRAM for support of the desired functionality. MRAM is a way of applying the magnetoresitive effect, including giant magnetoresistive effect and the like. MRAM includes a set of characteristics that may have the effect of changing how platform architectures are conceived and put together, because MRAM may supply large, symmetrical, cheap, long-lasting nonvolatile memory. Symmetrical and asymmetrical may refer to reading and writing, as well as access time and energy.
[0059] GMR technology, such as the technology used in heads of disk drives, exploits a tunneling phenomenon in quantum electronics. Although there is some debate about the theoretical underpinnings of GMR, from a pragmatic point of view, it is a well understood issue.
[0060] The inclusion of MRAM in a platform of the present invention may provide a wide range of desired behaviors. For example, if a memory block included in a platform is all or largely MRAM, a variety of beneficial behaviors may be provided; very large memory areas may be provided; a plurality of megabytes may be written arbitrarily; the memory is reliable; the content is symmetrical (in other words, the amount of energy required to write is essentially equivalent to the amount of energy required to read, which is not true with flash); and the like. These behaviors allow definition of a state of high complexity that may be completely independent in the device itself and may be interrogated and updated remotely with little or no risk. Because flash is slow to write, requires wear leveling and has a finite lifetime, state information may not be updated in the device in the nonvolatile memory space utilizing flash memory a multitude of times, and thus, requires the additional considerations previously described to be addressed.
[0061] The use of MRAM may have a profound impact on how platforms are structured in the future. The MRAM approach provides the capability of accommodating a vastly greater range of states in a platform device's state space which may be reliably and dependably updated and interrogated. Secure encryption may also be utilized so the content may be nonvolatile but still secure and interrogatable only from a remote trusted authority.
[0062] This functionality changes platform character because now the control codes that define the state of the platform, (such as a sea of platforms, which may be an immensely complex system environment) may be written into a symmetrical low-cost, high-density writable nonvolatile memory that may be upgraded and interrogated at will. The state memory may be updated rapidly and securely in order to make sure that the device stays on-line.
[0063] For example, referring now to the embodiment shown in FIG. 3, a sea of platforms architecture may be provided in which each of the platforms has a corresponding MRAM for storage of state information. By including state information within the platforms themselves, the state information may be accessible even in the case of power failure, and may be written and accessed a multitude of times without the problems of wear-leveling as required in utilizing flash memory. Although MRAM is shown within each platform, MRAM may be distributed through the “sea of platforms” utilizing a variety of methods without departing from the spirit and scope thereof, such as in platform groupings and the like.
[0064] The use of MRAM may also result in increased yields due to the repair capability. For instance, repair capabilities may include complex repair algorithms, redundant resources on the device that may be invoked remotely when malfunction of the device is detected, and the like. Thus, the ability to repair a platform including this architecture may decrease production costs, improve device operability and increase performance of the platform incorporating this functionality.
[0065] Genetic programming may also be greatly improved, such as in instances in which a chip computes among a multitude of possibilities, MRAM provides the memory characteristics needed to facilitate this access. For instance, where the patterns of access are heavily concentrated in a very small number of locations for intense periods of time, flash is ill-suited for computationally intensive environments where writes need to be performed into the nonvolatile space. Additionally, many algorithms either require such access or are heavily optimized on the assumption of the ability to write into a nonvolatile space repetitively and intensively.
[0066] Further, this nonvolatile, as opposed to volatile, memory may have improved power consumption, power management functionality, as well as support disaster recovery as described previously.
[0067] In another embodiment of the present invention shown in FIG. 4, MRAM is incorporated into a data storage system. For instance, the control of large repositories, such as a RAID array including multiple data storage devices 402, 404 & 406, disk farms and the like, may be provided in which a write operation needs to be performed for storage of data. However, the disk may have high latency, delayed operation in the instance of a swappable disk, and the like.
[0068] To provide reliable data storage, an area of nonvolatile memory, in this case MRAM 410, is provided so that in the case of power failure, device malfunction, and the like, that state information is captured even before it has been committed to the very high-latency medium that is represented by a disk drive and the like. For example, the MRAM repository 410 may be accessible by a controller 408 of a RAID array, in which data is first received by the controller and written to the MRAM 410 before being transferred to the data storage devices 402, 404 & 406. Thus, the design of controllers 408 for large repositories will be heavily affected by the availability of these symmetrical nonvolatile memories.
[0069] It is believed that the architecture of the present invention and many of its attendant advantages will be understood by the foregoing description. It is also believed that it will be apparent that various changes may be made in the form, construction and arrangement of the components thereof without departing from the scope and spirit of the invention or without sacrificing all of its material advantages. The form herein before described being merely an explanatory embodiment thereof. It is the intention of the following claims to encompass and include such changes.
Claims
1. A system for providing distributed dynamic functionality in an electronic environment comprising:
- a plurality of platforms, the platforms suitable for providing a logic function, the platforms including embedded programmable logic and MRAM memory, the logic and MRAM memory communicatively coupled via an interconnect.
2. The system as described in claim 1, further comprising a reconfigurable core.
3. The system as described in claim 1, further comprising a map expressing logic function of the plurality of platforms.
4. A data storage system, comprising:
- an electronic data storage device suitable for the storage of electronic data;
- an MRAM memory, the MRAM memory operable for the storage of electronic data utilizing a magnetoresistive effect; and
- a controller communicatively coupled to the data storage device and the MRAM memory; the controller suitable for controlling operations of the data storage device, wherein the controller receives data for writing to the electronic data storage device; the data is stored utilizing the MRAM memory before writing to the electronic data storage device.
5. The data storage system as described in claim 4, wherein the electronic data storage device is a disk drive.
6. The data storage system as described in claim 4, further comprising a second electronic data storage device, the electronic data storage device and the second electronic data storage device arranged with the controller as a RAID system.
7. The data storage system as described in claim 4, wherein the MRAM memory is utilized as a buffer by the controller.
8. The data storage system as described in claim 4, wherein the MRAM memory is symmetrical.
9. The data storage system as described in claim 8, wherein an amount of energy required to write to the MRAM memory is generally equivalent to an amount of energy required to read the MRAM memory.
10. A data storage system, comprising:
- a means for storing electronic data;
- a means for storing electronic data utilizing a magnetoresistive effect; and
- a means for controlling operations of the data storage means, the controlling means communicatively coupled to the data storage means and the magnetoresistive data storage means; wherein the controlling receives data for writing to the data storage means, the data is stored utilizing the magnetoresistive data storage means before writing to the data storage means.
11. The data storage system as described in claim 10, wherein the data storage means is a disk drive.
12. The data storage system as described in claim 10, further comprising a second means for storing electronic data, the data storage means and the second data storage means arranged with the controlling means as a RAID system.
13. The data storage system as described in claim 10, wherein the magnetoresistive data storage means is utilized as a buffer by the controller.
14. The data storage system as described in claim 10, wherein the magnetoresistive data storage means is symmetrical.
Type: Application
Filed: May 21, 2004
Publication Date: Nov 4, 2004
Inventor: Christopher L. Hamlin (Los Gatos, CA)
Application Number: 10851004