DEVICE FOR VIDEO DECODING
A device for video decoding is disclosed. It includes at least a pipeline scheduler, a decoding core, a segmented context memory and a context cache. The pipeline scheduler and the decoding core could decrease the time taken in a code-decoding period. The segmented context memory and the context cache could reduce accessing time of reading and writing context values.
Latest Patents:
1. Field of Invention
The present invention relates to a device for video decoding. More particularly, the present invention relates to a device for decreasing the calculated amount of entropy coding in H.264/AVC video decoding standard.
2. Description of Related Art
There are two kinds of new H.264/AVC video decoding standard Entropy Coding algorithms. One is suitable for Context Adaptive Variable Length Coding (CAVLC) in Baseline Profile. The other is suitable for Context-based Adaptive Binary Arithmetic Coding (CABAC) in main profile.
Because CABAC refers to the context in the content, it is a more suitable way to code and enhance the efficiency of coding according to the probability distribution. CABAC uses the just coded symbols to estimate the probability for the next symbol to show up, and tremendously reduces the burdens between the symbols and with mobility adjusts every code of symbol to match the probability distribution. Therefore the compression rate could be very high.
The above-mentioned algorithm can save 50% more bit-rate than the known Variable Length Coding (VLC) in Entropy Coding. And CABAC can save 9% to 14% more bit-rate than CAVLC, but the price is that CABAC has at least 10% more calculating complexity than CAVLC. That is to say, if the two coding systems decode a symbol at the same time, CABAC wastes at least 10% more calculating complexity than CAVLC.
To achieve the goal of application on HDTV and spontaneous decoding in CABAC, the issue about how to enhance the whole efficiency and overall throughput rate when there are a tremendous amount of calculations is the problem that needs a solution.
In the main profile, except the header of the slice is decoded by the slice parser, macroblock (MB) and residual data are decoded by CABAC. The so-called slice is composed of numerous macroblocks, and an image can be composed of one or more slices. The slice is the smallest self-decodable unit in H.264/AVC video compression standard. That is, a slice can be decoded merely by self-decodable data instead of relying on another slice. The benefit is that when it is transmitted to the far end, after receiving the compressed data of the slice, it can be decoded at once and does not have to wait until the all the data is received completely. What's more, once the data is missing or mistaken in the transmitting process, it only affects the slice concerned and not the others.
Besides, CABAC defines a probability model by regularly updating the showing probability of 0 and 1 to decode the symbol directly, but increases higher probability dependence between every symbol. That is to say, the next symbol must wait. Not until the last symbol finishes updating the probability model can it proceed decoding. This process limits the data throughput in the unit time. This limitation is a bottleneck in calculating operations and a problem must to be solved in CABAC decoding design.
Refer to
Refer to
It is therefore an objective of the present invention to provide a video decoding device which adopts the pipeline scheduler in order to arrange the whole process of decoding, reduce the time of decoding and enhance the efficiency.
It is another an objective of the present invention to provide a video decoding device which adopts a context table in order to divide the context table into numerous sections, avoid wasting time waiting for the memory cache and improve the efficiency of context data access.
A video device adopting the pipeline scheduler is provided. In one embodiment of the present invention, the video device in accordance with the present invention includes three sets of decoding engines, a context model and probability model. The three sets of decoding engines are decode decision, decode bypass and decode terminate. When a symbol is decoded, only one set of decoding engines is used for each time according to the different needs of syntax elements. And when the decoding is finished, it is necessary to update the values in the context model and the probability model. Besides, this invention provides a look ahead parsing (LAP) technology to the decode decision in the three sets of engines in order to decode one more symbol in a period of time.
According to another objective of the present invention, a video decoding device is disclosed to efficiently control the memory by segmented context tables and context caches. In one embodiment of the present invention, the context tables can provide for accessing plenty of context data simultaneously in a period of time. The context caches previously read the context data that is soon used. When the interior decoding syntax circuit occurs, because only some of the specific context data are used, context caches can be saved in the context data that are used constantly in a short time. After the circuit is finished, the updated context data are written in the memory.
In conclusion, the invention efficiently enhances the overall throughput rate in the unit time and offers a method to shut down part of the decode engine to lower the waste of power consumption. Besides, this invention doesn't need to access the memory continually and reduces the waste of power.
It is to be understood that both the foregoing general description and the following detailed description are by examples, and are intended to provide further explanation of the invention as claimed.
The accompanying drawings are included to provide a further understanding of the invention, and are incorporated in and constitute a part of this specification. The drawings illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention. In the drawings,
Reference will now be made in detail to the present preferred embodiments of the invention, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers are used in the drawings and the description to refer to the same or like parts.
While the specification concludes with claims defining the features of the invention that are regarded as novel, it is believed that the invention will be better understood from a consideration of the following description in conjunction with the figures, in which like reference numerals are carried forward.
Refer to
At first, the bit-stream manager 110 transmits bit-stream to syntax parser 120 in order to analyze the header of the slice, and then transmit the bit-stream to CABAC decoding core 100. System controller 130 prepares syntax info of the top macroblock 121 and the left macroblock 122 in syntax info memory of the macroblock 140. Two parts of the data will be generated after decoding operations of CABAC decoding core 100. One is the syntax data of the very macroblock, which is written into syntax info memory of the macroblock 140 through the system controller 130. The other is the remaining data, which is written into coefficient memory of the macroblock 150 to offer an inverse quantization (IQ) model and an inverse transform (IT) model to rebuild the data.
Refer to
Meanwhile, when the probability model 240 is updated, it might need to re-normalize the probability numerals from the bit-stream data 260. Every generated bit (or symbol) is checked by the bit-stream analysis 270 to make sure whether the decoded syntax bit-stream is finished or not. If the syntax is decoded completely, there are three situations. First, if the syntax 281 is “mb-type” and the acquired syntax value is “I_PCM”, it is necessary to initialize the probability model 210 and go on to decode the next syntax. Second, if the syntax 282 is “end_of_slice_flag” and the acquired syntax is 1, the slice is decoded completely. It is necessary to initialize the context table 200 and initialize the probability model 210, and to repeat the process described in the beginning of this paragraph. Third, it is the remaining situation, it is only necessary to go on to decode the next syntax. If the syntax is not decoded completely yet it decodes the next bit in the decoding syntax.
Refer to
Because each time only one decode engine is used, this embodiment in accordance with the present invention discloses a method to shut down the other two decode engines to reduce the waste of power. And according to this method, it can be sure that only one output is generated. Hence, when the last output is generated, only one OR logic gate 350 is used and multiplexers are not needed in order to reduce the cost of the hardware circuit area.
Refer to
Due to the fact that there is only a shift state instead of other operations, the hardware cost and time influence of this part is quite limited, but it is able to increase the overall throughput rate efficiently. And the context value of look ahead parsing detector 440 may come from the result of maximum probability symbol 410 and from the other context value. The last written-back context value might come from the maximum probability symbol 410 or maximum probability symbol 411. MPS 431 chooses the last saved context value.
Refer to
Refer to
Refer to
Refer to
Refer to
If the analysis conditions in the look ahead parsing detector are met, one more symbol is decoded in a period of time, enhancing the throughput rate of this video decoding. Segmented context memory 720 can make the pipeline scheduler and binarization controller 740 proceed reading, decoding symbol and writing in the same memory in the same period of time. Cache context 730 can previously read the context data in order to avoid unnecessary waiting time and apply to the segmented context memory 720 to manage the context memory effectively. The pipeline scheduler and the binarization controller 740 control the inner part of decoding syntax or schedules of multiple decoding syntaxes.
This overall output of the CABAC decoding core can support the compressive format of 30 frames in one second in the HD1080 standard.
According to the above-mentioned preferred embodiment of the present invention, there are the following advantages if this invention is applied to:
1. The embodiment in accordance with the present invention offers a highly efficient context decode construction of video compressive standard. The Pipeline Scheduler can optimize the period of time for decoding a symbol.
2. The embodiment in accordance with the present invention applies to the look ahead parsing analysis technique and can decode one more symbol in each period of time. The technology of the segmented context table cooperating with the cache context can efficiently manage and reduce the times of accessing the memory.
3. The embodiment in accordance with the present invention can effectively reduce one period of time for decoding and reducing the times of accessing the memory enormously, and therefore reduce the waste of power in the whole decoding system.
It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the present invention without departing from the scope or spirit of the invention. In view of the foregoing, it is intended that the present invention cover modifications and variations of this invention provided they fall within the scope of the following claims and their equivalents.
Claims
1. A device for video decoding, comprising:
- a pipeline scheduler selectively scheduling among multiple decoding syntaxes and controlling an inner part of one of the decoding syntaxes;
- a decoding core, comprising a look ahead parsing detector with analysis conditions, if the analysis conditions are met, one more symbol is decoded during one period;
- a segmented context memory, making the pipeline scheduler read and write in a memory at the same time; and
- a cache context register, previously reading a context data, and cooperating with the segmented context memory to manage a context memory.
2. The device for video decoding of claim 1, wherein the pipeline scheduler combines with a binarization controller.
3. The device for video decoding of claim 1, wherein a plurality of actions of the pipeline scheduler in the same period of time comprise reading memory, decoding symbol and writing memory.
4. The device for video decoding of claim 1, wherein the decoding core comprises at least one decoding engine.
5. The device for video decoding of claim 4, wherein the decoding engine is a decode-decision.
6. The device for video decoding of claim 5, wherein the decode-decision comprises a condition.
7. The device for video decoding of claim 6, wherein the condition is a codiRange greater than 256.
8. The device for video decoding of claim 6, wherein the condition is the codiRange equal to 256.
9. The device for video decoding of claim 6, wherein the condition is the codiRange greater than a codiOffset.
10. The device for video decoding of claim 6, wherein the condition is the codiRange equal to the codiOffset.
11. The device for video decoding of claim 6, wherein the condition is satisfied, the decode-decision decodes one more symbol during one period.
12. The device for video decoding of claim 4, wherein the decoding engine is a decode-bypass.
13. The device for video decoding of claim 4, wherein the decoding engine is a decode-terminal.
14. The device for video decoding of claim 4, wherein the decoding core decodes symbols, at least one decoding engine could be shut down.
15. The device for video decoding of claim 4, wherein the decoding core decodes symbols, opens only one decoding engine.
16. The device for video decoding of claim 1, wherein the decoding core comprises a logic gate.
17. The device for video decoding of claim 16, wherein the logic gate is an OR logic gate.
Type: Application
Filed: Sep 13, 2006
Publication Date: Mar 13, 2008
Applicant:
Inventors: Jiun-In Guo (Ming-Hsiung), Yao-Chang Yang (Ming-Hsiung)
Application Number: 11/519,922
International Classification: H03M 7/00 (20060101);