Array transformation in a behavioral synthesis tool
A behavioral synthesis tool for generating an integrated circuit design is described. The behavioral synthesis tool allows a designer to interactively allocate variables or arrays to memory resources without having to modify a source code description of the integrated circuit. The behavioral synthesis tool reads the source code description and generates a synthesis intermediate format stored in memory. The synthesis tool searches the in-memory synthesis intermediate format to find arrays for each process. The arrays are then listed in a graphical user interface (GUI). The GUI allows the designer to create memory resources, specifying the type of memory, the packing mode, etc. The designer is also provided the ability to vary the format among a plurality of formats used to pack arrays to memory during the memory packing process. Upon completion of modifying the memory allocation, the designer saves the changes and such changes are effectuated by automatically updating the synthesis intermediate format.
The present invention relates generally to behavioral synthesis tools for creating integrated circuits, and more particularly relates to behavioral synthesis tools that provide for improved packing of arrays to memory.
BACKGROUNDWith the proliferation of data-intensive applications, such as sound, image and video processing, the memory subsystem has become an important focus of electronic system design. More than three-quarters of a data-intensive system can be made up of storage components, making the memory subsystem the most crucial part of the design of an integrated circuit. Most of these systems need to be high-speed due to the large amounts of data involved and must be designed carefully to avoid a solution that is larger than expected.
The design of an integrated circuit no longer begins with a circuit diagram. Instead, it begins with a software program that describes the behavior or functionality of a circuit. This software program is a source code description that defines an algorithm to be performed with limited implementation details. Designers direct behavioral synthesis tools to convert the source code description into a register transfer level (RTL) description. The RTL description is used to ultimately generate a netlist that includes a list of components in the circuit and the interconnections between the components. This netlist is used to create the physical integrated circuit.
Arrays provide a powerful and convenient method for modeling the behavior of memories in source code descriptions. That is, behavioral descriptions are used to manipulate groups of data in an abstract manner using arrays. These arrays are, under the control of the designer, packed to memory. Behavioral synthesis tools automatically construct the logic to control the memory, freeing the designer to explore architectures using different memories with different characteristics (e.g., synchronous versus asynchronous, single port versus dual port), and make intelligent decisions about an appropriate implementation for a design.
To pack arrays to a memory, the designer must specifically assign the variables representing the arrays to a memory in source code and specify the type of memory and other memory parameters. This is accomplished using a set of attributes or directives. For example, Synopsis® tools use a “pragma” statement.
After the designer designates the details of memory allocation in the source code description (using pragma statements or other directives), the designer runs the source code description through the synthesis tool. The synthesis tool generates a report that the designer can use to analyze the performance of the circuit. For example, the user can examine the speed and area of the circuit to determine whether the current memory allocation is acceptable. If the memory allocation is not acceptable, the designer must return to an editor, re-edit the source code description to change the details of memory allocation, and run the source code description through the synthesis tool again. Such a technique for modifying the memory allocation is time consuming and inefficient and gives the designer only a limited amount of options for designating how memory will be allocated.
It is desirable, therefore, to provide a method and synthesis tool that allows a designer to modify memory resources more quickly and simply as well as provide the designer with more advanced options for specifying how arrays will be packed to memory.
SUMMARYMethods, systems, and behavioral synthesis tools are provided that allow a designer to change the format of how an array is packed to memory during the memory packing process when converting a source code description of a circuit to an RTL description. The designer can write a source code description at the algorithmic level describing the behavior of the circuit to be designed. The designer then uses a behavioral synthesis tool to generate a number of different architectural designs using synthesis techniques. Each design can implement differing memory allocation by allowing the designer to change a number of different constraints such as whether to use RAMs vs. registers, which type of RAM to use, how many memories to use, whether to use on or off-chip memory, etc. The designer is also provided the ability to transform the layout format of the arrays in the source code description such that the designer can quickly and easily control the packing of the arrays into the chosen memories. These constraints can be changed either using a graphical user interface, changing constraints within the behavioral synthesis tool, or by manually manipulating the source code description. The behavioral synthesis tool then creates a report for each design to analyze the performance of the circuit. For example, the designer can examine and compare the speed and area of the circuits created from each of the designs to determine whether the memory performance and size are acceptable.
A source code file having a description of the hardware is read into a database within the behavioral synthesis tool. The behavioral synthesis tool analyzes the source code description and generates a data structure associated with the source code description. The designer can then modify a number of constraints dictating the details of memory allocation such as type of memory, number of memories, memory size, etc. Thus, rather than having to re-edit the source code description, the designer can change these memory constraints interactively and dynamically during the memory packing process to control how arrays are packed into memory. Once the designer is satisfied with the design, an RTL description is produced from the data structure.
A number of additional options for packing arrays into memory are provided by the behavioral synthesis tool to the designer so that the designer may select one of a plurality of array layout formats during the memory packing process. Providing the designer the ability to transform the layout of arrays dynamically allows the designer a fast, efficient method of customizing the memory allocation of the circuit to be synthesized.
Further features and advantages of the invention will become apparent with reference to the following detailed description and accompanying drawings.
At this point in the flow, the memory allocation information is stored in the intermediate database independently, meaning the information is not yet fully integrated into the SIF. This allows the designer to interactively change the memory allocation and control when such changes are applied to the SIF. The memory allocation information is stored as a set of resources for the design, wherein each resource represents a memory resource that may potentially be present in the final integrated circuit design. Additionally, arrays in the source code description are also stored in the data structure. The data structure may be organized in many different ways. One possible organization is to have various fields associated with the resources and arrays stored in the data structure. For instance, in one embodiment the following fields could be associated with a resource stored in the data structure:
A unique identification for the resource
A type of memory to be used for the resource
A list of variables packed to the resource
A packing mode for the resource
A flag indicating whether the resource is external to the design
A format for packing variables to the resource
Likewise, in one embodiment the following fields could be associated with arrays stored in the data structure:
A unique identification for the resource
An array length
An array width
A start address in a resource
A start bit in the resource
A format for packing the array to the resource
In process block 34, the designer can interactively change the memory allocation for variables that are in the source code description. That is, the source code description includes variables representing arrays that are associated with memory storage (although in some cases arrays are not packed to memory). The memory allocation for variables representing the arrays is modified by allowing the designer to manipulate a number of constraints relating to memory allocation. For instance, a first constraint may allow the designer to choose from the many different memories available in circuit design. Additional constraints may vary the size and type of memory selected (e.g., dual port memory, single port memory, etc.). Thus, in process block 34, the designer can choose which memories to use, the variables associated with those memories, the packing mode, etc.
In order to give the designer more flexibility in design, the designer is also provided the ability in process block 34 to transform the layout of the arrays into a variety of formats. The format used to pack an array to memory can have a significant impact on the overall memory performance of the circuit. Often the format used will dictate the number of reads and/or writes necessary to access a given number of words of an array or memory. Various formats can be chosen that emphasize keeping the size of the circuit within given specifications, while other formats can be chosen that emphasize the speed of the circuit (reducing the number of clock cycles/reads/writes). Specific array layout formats will be discussed at length with respect to
The designer can modify any of the memory allocation constraints discussed above (i.e. size, number, and type of memories, packing mode, formats) in a number of ways. The designer can use the GUI 18 shown in
After memory is properly allocated, the designer can perform an initial check to analyze the area and timing of the circuit (process block 36). This may be done, for example, by displaying an area versus latency graph that gives the designer some immediate feedback on whether the memory allocation is satisfactory. If the designer is unhappy with the area or timing, the designer can return to process block 34 to further modify the memory allocation, as shown by arrow 38. On the other hand, if the designer is satisfied with the memory allocation, the RTL code can be generated and simulated (process blocks 40 and 42). Finally, an RTL synthesis tool can perform RTL synthesis (process block 44) and the gate level design can be verified and analyzed (process block 46).
For each array or memory, there is a fixed data block, which is read from or written into it at the same time. This fixed data block is called a word. An array or memory can be characterized by the size and number of words. The first parameter of a memory or array is the word size, which is the number of bits read or written in a single access. The second parameter is the number of words a memory can store. Each word has a unique address under which it can be read or written, which is usually a number from 0 to the wordcount-1. Arrays can usually be indexed by an expression which is computed during runtime. Memories as hardware units can have additional parameters such as delays and number of ports.
Further details are now given on how the memory allocation is applied to the SIF. After the designer is satisfied with the memory allocation, the designer directs the GUI to apply the memory allocation information to the SIF. Thus, at this point any changes the designer has made to the memory allocation are reflected in the database by modifying the independent set of resources and their field information. Applying this information to SIF involves several steps:
1. Create a memory data object representing each memory resource. This involves determining the size of the memory based on the variables packed to the memory and the size of the variables, along with the packing mode. Also, since the original variable index may get packed to a different memory address (based on the packing mode and the order of variables on the memory), offsets are computed for each array/index.
2. Transform all read/write accesses to any of the variables packed to the memory resource into corresponding read/write(s) to the created memory data object in step 1. Transforming includes modifying the SIF appropriately to reflect the packing of the original array index to the actual memory address location. This packing is dependent on the ordering of the variables and the packing mode used. An appropriate offset is inserted for each array index access. This offset insertion is also known as address calculation.
Referring to
Assume for purposes of this example that the designer has read code segment 160 into a behavioral synthesis tool and now wishes to transform the format of this array such that it is only 50 words in width, and 64 bits wide. At this point using prior methods, the designer would have to re-edit source code segment 160 to reflect the change, both when declaring the array and whenever the array is accessed, as shown in code segment 162. Code segment 162 shows the declaration of array A changed to reflect the new size, but also the changes that must be made to the subsequent accesses to the array. In order to ensure the array accesses will still assign the correct word of array A after the size change, a new address must be calculated for the desired word. Depending on the changes made, this calculation could be quite difficult. Thus, manually producing this new code requires substantial editing time on the part of the designer and is error prone even for the simplest code such as that shown in
However, the behavior synthesis tool of the current embodiment allows the designer to simply change a constraint to transform the layout of array A from 100 by 32 to 50 by 64. Since the source code segment 160 has already been read into the SIF, a provided GUI will display the array A along with its current size, 100 by 32. The designer then changes a size constraint for the array indicating the array should be transformed to a 50 by 64 format. Once confirmed, the constraint within the SIF for the size of array A is changed to reflect the designer's desired change, and the behavioral synthesis tool creates RTL code reflecting any changes in the constraints. In other words, even though code segment 160 was read into the SIF, due to the designer's configuration of the constraints through the GUI, the RTL code is created as if the code segment 162 had been originally read in. This can be accomplished by the behavioral synthesis tool by actually creating and editing the code segment 160 to reflect the functionality of code segment 162, editing the SIF of code 160 to reflect the functionality of code segment 162, or leave code segment 160 unchanged yet still create RTL code to reflect the functionality of code segment 162.
Alternatively, in another embodiment, the designer could have manually added a size constraint to the source code segment 160, such as that shown in code segment 164, and then read in code segment 164 to the SIF. In yet another embodiment, the size constraint within the SIF could be changed manually by the designer without the assistance of a GUI.
Allowing a designer control over how arrays are packed to memory during the memory packing process in synthesizing an integrated circuit is desirable because the chosen architecture will have a significant impact on the performance of the memory system. For example, depending upon how the array words are accessed, it may be more effective to have the RAM word size different from the array word size. If two consecutive words from the array are accessed together, it may be efficient to pack two array words into a single RAM word. Or, conversely, if half of the array word is accessed often, it may be effective to split the array word into two RAM words.
Consider two arrays A (word size: 16) and B (word size: 32) are to be packed together. The RAM word size could be chosen to be, for instance, 16 or 32. If a designer chooses 32, the array A words could be zero-padded to make all A and B words the same size. The solution just suggested is obviously quite wasteful in terms of memory usage. Another solution would be to pack two array A words into a single RAM word. A third solution would be to choose 16 as the RAM word size, and split each array B word into two RAM words. A choice between the second and third formats would be dictated by word access patterns for the two arrays. Often there are numerous formats available for the same design description, each having advantages and disadvantages compared to other formats.
In many cases, access to memory elements is the bottleneck of a design, because it limits the maximum parallelism of the performed functionality. One option is to increase the width of a memory word, so each memory access reads or writes multiple array words. Packing an array to a memory with larger word size is a typical transformation to increase the throughput of the memory port as an alternative to memories with multiple ports.
Packing multiple array words to a single word of memory requires each array access to be transformed into a memory access, where certain address calculations and sub-selections of memory words have to be performed. Writing an element of the array requires reading the memory word first or the array element sharing the same memory word is not overwritten. In other words, each array read transforms into a memory read and an array write transforms into multiple memory accesses. A designer would therefore find this format useful if multiple array accesses can be packed to a single memory access. This may be the case when the source code description contains numerous loops that access the array elements in sequential order, or if the source code description always, or often, accesses the array words in pairs.
Interlacing is a format which skips a specified number of words when reading from an array in order to pack the words to memory in a given order. A typical application would be if the odd addressed and even addressed words of the array are used in separate parts of the algorithm.
In this case each array access results in multiple memory accesses. This decreases the performance of the design but might be a design constraint, e.g. when only certain memory word sizes are available.
The packing shown in
Alternatively, array 110 could also be packed such that the first 5 words of array 110 are packed sequentially into memory 112 and the second 5 words of array 110 are packed into memory 114, as shown in
Array 122 is packed so that it occupies space in memory 124 immediately adjacent array 120. Thus, word 0 of array 122 is packed to word 2 of memory 124, as was word 0 of array 120. However, word 0 of array 122 is displaced such that the least significant bit is located at bit 12 of the memory word. The subsequent words of array 122 are then packed to the subsequent words of memory 124, with the least significant bit of each array word located at bit 12 of the memory word.
A designer is thus given the alternative selection of increasing the speed of the circuit by packing one array onto multiple memories as shown in
The formats described with respect to
It is also noteworthy that the described formats can be used either in conjunction with resolution modes.
For example, the array 154 selected in GUI 150 is a size 21 by 64, meaning it has words 64 bits in width, and length of 21 words. The display pane 152 provides a symbolic name of a resource associated with the variables. In this case, “dct_temp_rsc” is a memory resource used for the array 154, named “dct_temp”. The GUI 150 shows a settings tab 156 used by the designer to change the memory allocation of the circuit using the provided constraints. The setting tab 156 in this example shows only one of the many constraints that can be provided for customizing the memory allocation of a circuit design. Here, the array word width constraint can be changed via the input window 158 to a desired width. Once a new width in entered, the display pane 154 will reflect the new width for array dct_temp. At this point the designer may make other changes such as the format of array dct_temp, other arrays, change the memory type, etc. Or, if he is satisfied, he can then select the “OK” button to apply the changes to the SIF.
Any of the aspects of the methods, systems and behavioral synthesis tools described above may be performed in a distributed computer network.
Having illustrated and described the principles of the illustrated embodiments, it will be apparent to those skilled in the art that the embodiments can be modified in arrangement and detail without departing from such principles.
For example, although a particular GUI was shown, the user interface can be modified to illustrate the variables and resources in a different way. The particular user interface used is not important to the invention.
Also, the transformation of arrays from one format to another during the packing process can be accomplished in a variety of ways. For instance, an array of a first format can be transformed into an intermediate array in a second format, and then directly packed into a memory. Alternatively, the array of the first format can be transformed as it is being packed into the memory, such that the array is then stored in memory in the desired second format.
Although the formats and methods described herein are generally illustrated in isolation for purposes of clarity, one of skill in the art will recognize that the methods and formats for transforming array layout described herein may be combined or modified. For instance, an array being packed into a customized location, such as shown in
Additionally, the data structure stored in database 16 can be structured in a wide variety of ways. The particular data structure used is not important to the invention.
Moreover, although the description focuses on arrays, variables that are associated with single registers in an integrated circuit may also be modified using the formats or tools described. Furthermore, any single variable or non-array data can be represented as an array and processed by embodiments of the described behavioral synthesis tool.
Although a particular data structure is illustrated, those skilled in the art recognize that a wide variety of data structures may be used.
In view of the many possible embodiments to which the principles of our invention may be applied, it should be recognized that the illustrated embodiment is only a preferred example of the invention and should not be taken as a limitation on the scope of the invention. Rather, the scope of the invention is defined by the following claims. We therefore claim as our invention all that comes within the scope of these claims.
Claims
1.-51. (canceled)
52. A method of interactively allocating memory in a behavioral synthesis tool, comprising:
- analyzing a source code description describing behavior of an integrated circuit, the source code description including one or more array variables;
- representing the source code description in an intermediate format, the intermediate format including memory allocation information for the one or more array variables; and
- allowing a user to change the memory allocation information for at least one of the one or more array variables while the source code description is represented in the intermediate format and before a register-transfer-level (RTL) description of the integrated circuit is generated using the memory allocation information.
53. The method of claim 52, wherein the method further comprises providing a graphical user interface through which the user can indicate desired changes to the memory allocation information.
54. The method of claim 52, further comprising modifying the memory allocation information in the intermediate format in response to changes input from the user.
55. The method of claim 52, further comprising modifying the source code description in response to changes input from the user.
56. The method of claim 52, wherein the memory allocation information for one of the one or more array variables includes an array length, and wherein the method further comprises changing the array length of the one of the one or more array variables in response to input from the user.
57. The method of claim 52, wherein the memory allocation information for one of the one or more array variables includes an array width, and wherein the method further comprises changing the array width of the one of the one or more array variables in response to input from the user.
58. The method of claim 52, wherein the memory allocation information for one of the one or more array variables includes a memory type, and wherein the method further comprises changing the memory type for the one of the one or more array variables in response to input from the user.
59. The method of claim 52, wherein the memory allocation information for one of the one or more array variables includes a memory packing format, and wherein the method further comprises changing the memory packing format for the one of the one or more array variables in response to input from the user.
60. The method of claim 52, further comprising displaying an area versus latency graph for at least one of the one or more array variables using current memory allocation information and before the RTL description of the integrated circuit is generated using the memory allocation information.
61. The method of claim 52, further comprising generating an RTL description of the integrated circuit using the memory allocation information.
62. A system, comprising:
- means for representing a source code description describing behavior of an integrated circuit in an intermediate format, the source code description including one or more array variables, and the intermediate format including memory allocation information for the one or more array variables; and
- means for allowing a user to change the memory allocation information for at least one of the one or more array variables while the source code description is represented in the intermediate format and before a register-transfer-level (RTL) description of the integrated circuit is generated using the memory allocation information.
63. The system of claim 62, further comprising means for modifying the memory allocation information in the intermediate format in response to changes input from the user.
64. The system of claim 62, further comprising means for modifying the source code description in response to changes input from the user.
65. A synthesis system, comprising:
- a memory storing an intermediate format representation of a source code description, the source code description and the intermediate format representation describing behavior of an integrated circuit and including one or more array variables; and
- a graphical user interface that allows a user to modify memory allocation information for at least one of the one or more array variables before a register-transfer-level (RTL) description of the integrated circuit is generated using the memory allocation information.
66. The synthesis system of claim 65, wherein the graphical user interface displays a list of processes associated with the source code description and array variables accessed with the processes.
67. The synthesis system of claim 65, wherein the graphical user interface allows a user to modify an array length of at least one of the one or more array variables.
68. The synthesis system of claim 65, wherein the graphical user interface allows a user to modify a memory type to allocated to at least one of the one or more array variables.
69. The synthesis system of claim 65, wherein the graphical user interface allows a user to modify a memory packing format of at least one of the one or more array variables.
70. The synthesis system of claim 65, further comprising a display configured to display an area versus latency graph for at least one of the one or more array variables using current memory allocation information and before the RTL description of the integrated circuit is generated using the memory allocation information.
71. The synthesis system of claim 65, further comprising an RTL generator configured to generate an RTL description of the integrated circuit using the memory allocation information.
Type: Application
Filed: Dec 17, 2007
Publication Date: Jul 17, 2008
Inventors: Shiv Prakash (Tualatin, OR), Bryan Darrell Bowyer (Newberg, OR), Peter Pius Gutberlet (Wilsonville, OR)
Application Number: 12/002,617
International Classification: G06F 17/50 (20060101);