Full-system ISA Emulating System and Process Recognition Method
Disclosed is a method of recognizing a process in a full-system Industry Standard Architecture (ISA) emulator, comprising the steps of: recognizing a process based on a base address of a page table thereof, recognizing the switch between the processes when said base address of the page table has changed, recognizing the termination of a recorded process when the base address of the page table of the process which tries to modify the page table is not equal to the base address of the page table of the recorded process in the page table. With the recognized process, the binary translation results indexed based on content can be saved into a corresponding process repository, thereby achieving the permanent saving of the translation results and the reuse of translation and optimization on the basis of a previously executed program. Consequently, the overall performance of the full-system Industry Standard Architecture emulator is enhanced.
This application claims priority under 35 U.S.C. §119 from Chinese Patent Application No. 200710104743.3 filed Apr. 25, 2007, the entire contents of which are incorporated herein by reference.
FIELD OF THE INVENTIONThe invention generally relates to a full-system Industry Standard Architecture (ISA) emulating system and more particularly, to process recognition in the full-system ISA emulating system.
BACKGROUND OF THE INVENTIONA full-system ISA emulator is a tool which supports running operating systems and applications of one ISA platform on another ISA platform, for example, running x86 Linux and corresponding applications on a PowerPC machine. It is an important method to extend the usage of an existing platform, to migrate servers between different ISA platforms, to develop and debug software of a pre-hardware system on an existing platform, etc.
Currently, full-system ISA emulators can be categorized into two classes: 1) interpretation-based ones, which sequentially interpret and emulate each instruction of a target ISA on a host platform, and; 2) binary-translation-based ones, which when encountering a section of non-translated target binary codes, translate them to host binary codes, then store the translated codes into the translation cache, and finally execute the translated codes directly. Usually the binary-translation-based emulators achieve higher performance than the interpretation-based ones, since they interpret and emulate many target instructions in a batch, instead of only one instruction each time.
Typically, binary-translation-based emulators check if a section of target binary codes has been translated in the following way: 1) calculating the starting physical address of the target binary codes, and; 2) searching the translation cache for the address. If found it means the code section has already been translated. Therefore, the translation results are indexed by the physical address of target binary codes. Since the physical address of binary codes may change after the program exists and re-executes, the translation results cannot be stored in permanent media, such as a hard-disk. So in the case of cache replacement, current binary-translation-based emulators have to discard some translated codes, and re-translate them when encountering them again. Another common way to improve the performance of an emulator is to dynamically optimize the translated codes based on, for example, profiling. The optimized codes cannot be saved in the permanent media due to the above-mentioned reason about index by physical address. Since both optimization and translation are time consuming, re-translating and re-optimizing the codes will degrade the overall performance of an emulator.
Consequently, there is a need for a new way to index the translation results and to store the translation results in the permanent media in this way. It is known that the code page of a program is the only part which needs to be translated and optimized. That is to say, the code page of a program is the unique feature thereof. If the content of the code page can be used, i.e., translation and optimization results can be indexed based on the content, then these translation results can be stored in the permanent media.
Although the translation results can be stored permanently using the aforesaid idea, since all applications in a full-system ISA emulator are run on an emulated OS (operating system) instead of the emulator directly, the full-system ISA emulator can only see the instruction flows, the machine states, and the memory mapping, but cannot differentiate different programs and recognize separate processes. Since the processes cannot be recognized separately by the full-system ISA emulator, even though the translation results indexed by content are stored permanently using the aforesaid idea, they can only be put into a huge translation repository instead of many application-dedicated process repositories. Obviously, since the process cannot be recognized, the full-system ISA emulator will look up the stored translation results in the huge translation repository, which is time-consuming. And maintenance of a huge translation repository results in much more overhead and cost than a small process repository.
Additionally, full-system ISA emulators are also often used to generate the profiling data for applications. However, full-system ISA emulators can hardly generate the profiling data dedicated for target applications without interleaving of other concurrent applications, since they cannot recognize the processes separately.
Therefore, there is a need for a method of recognizing a process in a full-system ISA emulator and saving the content-based dynamic binary translation in the full-system ISA emulator by the recognized process.
SUMMARY OF THE INVENTIONIt is an object of the present invention to recognize a process in a full-system ISA emulator and save the content-based dynamic binary translation results by the recognized process.
To achieve the above object, the present invention provides a method of recognizing a process in a full-system ISA emulator, comprising the steps of: recognizing a process based on a base address of page table of the process; recognizing the switch between processes when said base address of page table changes; and recognizing the termination of a recorded process when the base address of a page table of a process which tries to modify the page table is not equal to the base address of the page table of the recorded process in the page table.
The present invention also proposes saving, by an emulator, of content-based dynamic binary translation based on the recognized process, the saving comprising the steps of: calculating the digestion of a code page; and looking up a process repository of the code page to determine whether it has been translated based on the digestion; if the code page has not been translated, then said emulator translates the code page and saves the translation results in the process repository.
The present invention also provides a full-system ISA emulating system, comprising: process recognition means for recognizing a process based on the base address of a page table of the process, recognizing the switch between processes when the base address of a page table changes, and recognizing the termination of a recorded process when the base address of a page table of a process which tries to modify a page table is not equal to the base address of the page table of the recorded process in the page table; translation code lookup means for calculating the digestion of a code page, and looking up a process repository of the code page to determine whether it has been translated based on the digestion; and translation code saving means for translating said code page and then saving the translation results in the process repository of said code page when said code page has not been translated.
In accordance with the method and means of the present invention, the permanent saving of translation results and reuse of the translation and optimization of the previously executed program can be achieved based on the recognized process. Consequently, the overall performance and efficiency of the full-system ISA emulator is increased, the overhead during the course of emulation is decreased, and code is optimized.
As the present invention is better understood, other objects and effects of the present invention will become more apparent and easy to understand from the following description, taken in conjunction with the accompanying drawings wherein:
Like reference numerals designate the same, similar, or corresponding features or functions throughout the drawings.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTSReferring to the accompanying drawings, the present invention will now be described in detail.
It should be noted that the above description omits some more specific technical details which may be well-known to those skilled in the art and essential for implementing the present invention for the purpose of easy understanding.
The specification of the present invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art.
Therefore, the embodiments were chosen and described in order to best explain the principles of the invention, the practical application, and to enable others of ordinary skill in the art to understand that all modifications and alterations made without departing from the spirit of the present invention fall into the protection scope of the present invention as defined in the appended claims.
Claims
1. A method of recognizing a process in a full-system Industry Standard Architecture emulator, comprising the steps of:
- recognizing a process based on a base address of a page table of the process;
- recognizing a switch between the processes when said base address of page table changes; and
- recognizing termination of a recorded process when the base address of a page table of a process which tries to modify the page table is not equal to the base address of the page table of the recorded process in the page table.
2. The method according to claim 1, wherein the page table of said process is recorded in a process database by a value of the base address of said process.
3. The method according to claim 1, wherein said base address of the page table is stored in a register and the changes of said base address of the page table are determined by monitoring the register.
4. The method according to claim 1, wherein said page table is stored in the memory area of a host platform and the memory area is set as a read-only area, exception handler is set by using said emulator, and upon an attempt to modify said page table in said read-only area, the exception handler is triggered to recognize whether the process is terminated.
5. The method according to claim 1, wherein said emulator saves content-based dynamic binary translation based on the method, said saving of translation comprising:
- a translation code lookup step, further comprising the steps of: calculating the digestion of a code page, and looking up a process repository of the code page to determine whether the code page has been translated based on the digestion;
- and a translation code saving step, further comprising the steps of translating said code page, and saving the translation results in said process repository by said emulator when said code page has not been translated.
6. The method according to claim 5, wherein said process repository is stored in a fast translation cache of a memory of a host platform, said translation results are only executed in the fast translation cache, said process repository is further stored in a permanent cache of the memory of the host platform, and the process repository stored in the permanent cache is a superset of process repositories stored in said fast translation cache.
7. The method according to claim 6, wherein the translation code lookup step further comprises the step of looking up said process repository in said fast translation cache by the base address of the current process repository.
8. The method according to claim 7, further comprising the step of loading said process repository from the permanent cache to the fast translation cache if said process repository has not been found in the fast translation cache.
9. The method according to claim 8, wherein the step of loading the process repository to the fast translation cache comprises the step of looking up the process repository in the permanent cache by the digestion of the code page.
10. The method according to claim 9, wherein the step of looking up the process repository in the permanent cache comprises:
- marking all the process repositories to be looked up in the permanent cache as candidate repositories, and determining whether there is a candidate process repository containing the code page by said digestion of the code page,
- when there is no process repository containing the code page in the candidate repositories, creating a new process repository for the code page if no temporary process repository has been created for the code page, otherwise, marking the temporary process repository as a new process repository and then loading it to the fast cache,
- when there is more than one process repository containing the code page in the candidate repositories, creating a temporary process repository, loading the translation code to the temporary process repository thereby to be loaded to the fast cache, then marking all the process repositories containing the code page as the candidate repositories for the subsequent code page,
- when the number of process repository containing code page is one, loading the process repository containing the code page to the fast cache.
11. A full-system Industry Standard Architecture emulating system, comprising:
- means for process recognition for recognizing a process based on a base address of a page table of the process;
- means for recognizing a switch between the processes when said base address of the page table changes;
- means for recognizing the termination of a recorded process when the base address of a page table of a process which tries to modify the page table is not equal to the base address of the page table of the recorded process in the page table;
- means for translation code look-up for calculating the digestion of a code page, and looking up a process repository of the code page to determine whether it has been translated based on the digestion; and,
- means for translating saved code for translating the code page and then saving the translation results in the process repository when the code page has not been translated.
12. The system according to claim 11, wherein said base address of the page table is saved in a register and the changes of said base address of the page table are determined by monitoring the register.
13. The system according to claim 11, wherein said page table is saved in the storage area of a host platform and the storage area is set as a read-only area, said process recognition means sets an exception handler, and upon an attempt to modify said page table in said read-only area, the exception handler is triggered to recognize whether the process is terminated.
14. The system according to claim 11, wherein said process repository is saved in a fast translation cache of a memory of a host platform, said translation results are only executed in the fast translation cache, said process repository is also saved in a permanent cache of the memory of the host platform, and the process repositories saved in the permanent cache are the superset of those saved in the fast translation cache.
15. The system according to claim 14, further comprising:
- means for translation code look-up; and,
- means for process repository look-up for looking up the process repository in the permanent cache to load the process repository into the fast translation cache when the process repository has not been found in the fast translation cache.
16. The system according to claim 15, wherein the lookup of the process repository in the permanent cache comprises:
- marking all the process repositories to be looked up in the permanent cache as candidate repositories, determining whether there is a candidate repository containing the code page by said digestion of the code page,
- when there is no process repository containing the code page in the candidate repositories, creating a new process repository for the code page if no temporary process repository has been created for the code page, otherwise, marking the temporary process repository as a new process repository and then loading it to the fast cache,
- when there is more than one process repository containing the code page in the candidate repositories, creating a temporary process repository, loading the translation code to the temporary process repository to be loaded into fast cache, then marking all the process repositories containing the code page as the candidate repositories for the subsequent code page,
- when the number of process repositories containing a code page is one, loading the process repository containing the code page to the fast cache.
Type: Application
Filed: Apr 23, 2008
Publication Date: Oct 30, 2008
Patent Grant number: 8255201
Inventors: Hua Yong Wang (Beijing), Kun Wang (Beijing), Honesty Young (Saratoga, CA)
Application Number: 12/107,835
International Classification: G06F 12/10 (20060101);