Transcoder and imaging apparatus for converting an encoding system of video signal
The time required to transcode an encoded image signal to an image signal encoded by a different encoding method is reduced. Disclosed herein is a transcoder comprising: a picture selector 4 which extracts specific types of pictures (I- and P- pictures) from a first image signal (MPEG2 stream) and generates a subset of the first image signal; a first decoder 5 which decodes the subset image signal; and a first encoder 7 which encodes the decoded image signal to a second image signal (MPEG4 stream). The picture selector 4 uses the extracted pictures to generate the subset image signal with a reduced effective length.
Latest Hitachi, Ltd. Patents:
- STORAGE SYSTEM
- Timetable creation apparatus, timetable creation method, and automatic vehicle control system
- Delivery date answering apparatus and delivery date answering method
- Microstructural image analysis device and microstructural image analysis method
- Beam monitoring system, particle therapy system, and beam monitoring method
1. Field of the Invention
The present invention relates to a transcoder for converting the coding format of an image signal and to an imaging apparatus and signal processor using this transcoder.
2. Description of the Related Art
3. Prior Art
To record moving image signals on recording media or transmitting them over the Internet, radio and other communication media, techniques for compressing and encoding moving image signals by MPEG and other compressing algorithms have recently been used in various fields. For example, MPEG2-Video (ISO/IEC13818-2) is used for recording to and playing back from DVDs. In addition, MPEG4-Video (ISO/IEC14496-2) is used in radio communication and Internet applications. Including them, a variety of encoding methods have been proposed. With the diversification of encoding methods, however, it has become necessary to transcode data among these formats and improve the transcoding capability. In particular, if the time required for the transcoding process is reduced, the user side is given raised convenience.
A technique disclosed in Japanese Patent Laid-open No. 2002-152755 concerns improvement in the transcoding speed. This technique “reduces the temporal resolution of an input video stream by thinning out bidirectional predictive frames and reduces the spatial resolution by reducing the picture size of the temporal resolution-lowered video stream without inversely transforming transform coefficients” aimed at “shortening the transcoding time required to convert the video data to a different kind of video data by entirely eliminating the necessity of inverse transformation of coded transform coefficients in the frequency domain”.
BRIEF SUMMARY OF THE INVENTIONAccording to the technique described in Japanese Patent Laid-open No. 2002-152755, the transcoding time can be reduced by eliminating the decoding process (i.e., inverse transformation of transform coefficients). However, in such a transcoder as to be configured to include a decoder for decoding an MPEG stream for display as an image signal, combined with an encoder for encoding/compressing the input image signal into an MPEG stream, eliminating the inverse transformation of transform coefficients does not substantially reduce the time required for transcoding if transcoding is made at a rate of one picture per frame period.
It is an object of the present invention to solve the above-mentioned problem and provide improved usability to the user by reducing the time required for transcoding.
To solve the above-mentioned problem, according to an aspect of the present invention, there is provided a transcoder which inputs a first image signal encoded by a first compressing and encoding method and transcodes the first image signal to a second image signal encoded by a second compressing and encoding method. The transcoder comprises: a picture selector which generates a subset image signal of the first image signal by extracting pictures of one ore more specific types in frames or fields from the first image signal; a first decoder which decodes the subset image signal generated by the picture selector; and a first encoder which encodes the decoded image signal by a second compressing and encoding method. The picture selector uses the extracted pictures to generate the subset image signal with a shorter effective length.
According to another aspect of the present invention, there is provided a transcoder which reads out a first image signal encoded by a first compressing and encoding method from a recoding medium and transcodes the first image signal to a second image signal encoded by a second compressing and encoding method. The transcoder comprises: an interface section which generates a subset image signal of the first image signal by extracting pictures of one or more specific types in frames or fields from the first image signal; a first decoder which decodes the subset image signal generated by the interface section; and a first encoder which encodes the decoded image signal by a second compressing and encoding method. The interface section uses the extracted pictures to generate the subset image signal with a reduced effective length.
BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGThese and other objects, features and advantages of the present invention will become more readily apparent from the following detailed description when taken in conjunction with the accompanying drawing wherein:
In the transcoder shown in
Specifically, it is assumed in the following operational description that a 30 frames/sec MPEG2/MP@ML stream is transcoded to a 10 frames/sec MPEG4/SP stream. Firstly, an MPEG2 stream stored on the recording medium 1 such as a DVD is read out by an interface 2 and supplied to the picture selector 4. The picture selector 4 searches the incoming MPEG2 stream for picture headers and extracts pictures of a specific kind in frames or fields so as to generate an MPEG2 stream whose picture population is reduced to a third. How the picture selector 4 is configured will be described later in detail. The MPEG2 stream, after given extraction processing, is output and supplied from the picture selector 4 to the MPEG2 decoder 5. The MPEG2 decoder 5 converts the input MPEG stream to a digital video signal containing pixel value data by decoding the input MPEG stream. Pixel value data is accumulated in the frame memory and, according to a display sync signal, read out to the display section for output on the display 9. Concurrently, the digital video signal is supplied from the frame memory 6 to the MPEG4 encoder 7 which in turn encodes the signal to an MPEG4 stream and outputs it to the output terminal 51.
Firstly, the buffer memory readout circuit 43 detects a picture header 302 in the video ES 300, judges that the picture is a I-picture which is to be extracted, and starts reading out data. Then, the readout circuit 43 detects the next picture header 303, judges that the picture is a B-picture which is to be removed, and stops reading out data just before the picture header 303. The readout circuit 43 resumes header search and detects picture headers 304 and 305. The readout circuit 43 detects that the picture following the picture header 305 is a P-picture which is to be extracted, and starts reading out data. By repeating this procedure, only I-pictures and P-pictures are extracted from the video ES 300 to generate the video ES 301.
As apparent in
The MPEG2 decoder decodes the supplied MPEG2 stream and stores it in the frame memory 6 so that the decoded video signal may be displayed. The stored video signal is retained until displayed. In order to prevent the frame memory from overflowing or underflowing, the decoding speed must be equal to the display speed. Accordingly, increasing the total number of pictures increases the number of pictures to be displayed. The time required for recording becomes longer in proportion to the total number of pictures.
Assume that the video ES 300 is to be transcoded in a conventional method. In this case, after the non-thinned out picture sequence <IBBPBBPBBPBBPBBPBBI . . . > is decoded, B-pictures are removed to supply a stream <IPPPPI . . . > to the MPEG4 encoder 7 for encoding. Accordingly, the time required for transcoding equals the time required to decode and display the non-thinned out video ES 300 which contains all pictures.
According to the present invention, the above-mentioned transcoding is performed after a video ES 301, a subset of the video signal ES 300, is generated by removing B-pictures from the video ES 300. The MPEG decoder 5 decodes the video ES 301, that is, only the pictures <IPPPPI . . . > and directly supplies the decoded video signal to the MPEG4 encoder. The total number of pictures contained in the video ES 301 is reduced to a third by the picture extraction circuit 4 as compared with the total number of pictures contained in the video ES 300. Thus, since the time required for transcoding is reduced to a third as compared with that required to decode and display the video ES 300, the transcoding time can be reduced remarkably. In this case, pictures are displayed at the triple speed via the display unit 8 as compared with the display speed taken when the stream is played back without removing pictures.
As other preferable extraction schemes, extracting only I-pictures from a video stream comprising I-, P- and B-pictures, extracting I-pictures from a video stream comprising I- and P-pictures and extracting specific I-pictures from a video stream comprising I-pictures are also applicable. Further, such a configuration is possible as to allow the user to specify what pictures are to be extracted.
The aforementioned embodiment is also characterized by the processing speed of the picture selector 4. As described earlier, when the stream (video ES) 300 in
As an example, assume that the recording medium 1 is a DVD-ROM. In this case, since data can be input to the picture selector 4 via the interface section 2 at a bit rate of up to 160 Mbits/sec, it is possible to supply MPEG2 stream data to the picture selector 4 at a sufficiently high bit rate. In addition, since the readout circuit 43 can perform header search processing at a maximum bit rate of about 1 Gbits/sec if it operates in 16 bits at 81 MHz, the readout circuit 43 can easily afford to perform header search processing on the data which is input at the above-mentioned bit rate. The same header search processing can also be implemented by software. For example, if the CPU operates at 200 MHz and 30 cycles are spent to process each word, header search processing can be executed at a maximum bit rate of 106 Mbits/sec, which is sufficiently high for the input bit rate.
Then,
If the recording medium 1 is a DVD or the like, a management information file is stored together with an MPEG2 stream. A management information file is created by the interface 2 when a stream file, entered from the input terminal 51, is to be stored on the recording medium 1. In the management information file, the picture type, storage location and size of each picture are recorded. By referring to this management information file, the interface 2 identifies the locations of the respective pictures to be extracted. Accordingly the interface 2 extracts these pictures from the stream and supplies them to an MPEG2 decoder 5 where transcoding is done by the same process as in the first embodiment.
According to this embodiment, since the video signal is downscaled to a subset in the stage of extraction/readout from the recording medium 1, the time required for the subsequent processing can be shortened.
Although it is assumed in the above-mentioned example that the recoding medium conforms to the DVD Video Recording specifications, this embodiment is also similarly applicable to other recording media if management information indicating the type and location of each picture is recorded thereon.
Note that if the recording medium 1 is a removal recording medium and a stream is written onto the medium by using another apparatus, there is a possibility that the management file may not exist thereon. By taking into such a case into consideration, this embodiment may also be provided with such a function as to analyze the stored stream read out by the interface block 2 and create a management information file for storage on the recording medium.
If it is demanded by the user to record MPEG2 moving images on the recording medium 1, the user interface 20 issues a recording instruction to start the image pickup block 11 in the camera unit 10 generating a digital video signal. The MPEG2 encoder 12 generates an MPEG2 stream by encoding the digital video signal. Then, the MPEG2 stream is recorded on the recording medium 1 via the interface 2.
In addition, if it is demanded to transcode the MPEG2 stream recorded on the recording medium 1 to an MPEG4 stream, the user interface 20 issues a transcode instruction. The same transcoding process as in the first embodiment is started by this instruction to output an MPEG4 stream.
That is, in this imaging apparatus described as the third embodiment, it is possible not only to pick up images and record them as a high image quality MPEG2 stream but also to convert it to a low bit rate MPEG4 stream for output to the outside.
Although the imaging apparatus in
Although the first to third embodiments have been described on the assumption that the compressing/encoding format is transcoded from MPEG2 to MPEG4, the present invention is not limited to this transcoding scheme but can also be applied to between other compressing/encoding formats.
Further, a transcoder according to the present invention can be applied not only to such an imaging apparatus as mentioned above but also to such apparatus as a receiver (set-top box) which receives a video signal (stream) and transcodes it before outputting it to a display, an image output apparatus (display) which transcodes an input image signal before displaying it, an image recording apparatus which transcodes an input image signal before recoding it onto a recording medium (hard disk, optical disk or the like) and a home server which is connected to various image apparatus to supply image signals.
According to the present invention, it is possible to provide a transcoder and imaging apparatus improved in usability for the user.
The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The presented embodiments are therefore to be considered in all respects as illustrative and not restrictive. The scope of the invention being indicated by the appended claims rather than by the following description and range of equivalency of the claims are therefore intended to be embraced therein.
Claims
1. A transcoder which inputs a first image signal encoded by a first compressing and encoding method and transcodes the first image signal to a second image signal encoded by a second compressing and encoding method, said transcoder comprising:
- a picture selector which generates a subset image signal of the first image signal by extracting pictures of one or more specific types in frames or fields from the first image signal;
- a first decoder which decodes the subset image signal generated by the picture selector; and
- a first encoder which encodes the decoded image signal by a second compressing and encoding method;
- wherein the picture selector uses the extracted pictures to generate the subset image signal whose effective length is reduced.
2. A transcoder which reads out a first image signal encoded by a first compressing and encoding method from a recoding medium and transcodes the first image signal to a second image signal encoded by a second compressing and encoding method, said transcoder comprising:
- an interface section which generates a subset image signal of the first image signal by extracting pictures of one or more specific types in frames or fields from the first image signal;
- a first decoder which decodes the subset image signal generated by the interface section; and
- a first encoder which encodes the decoded image signal by a second compressing and encoding method;
- wherein the interface section uses the extracted pictures to generate the subset image signal whose effective length is reduced.
3. A transcoder according to claim 1, wherein the subset image signal has the extracted pictures arranged sequentially therein and the effective length of the subset image signal is reduced.
4. A transcoder according to claim 2, wherein the interface section extracts and reads out pictures of one or more specific types by referring to management information recorded along with the first image signal on the recording medium.
5. A transcoder according to claim 1, wherein the picture selector performs picture extraction in such a manner that each extracted picture can refer to another extracted picture for motion compensation.
6. A transcoder according to claim 1, wherein:
- the first compressing and encoding method is an MPEG2 method and the second compressing and encoding method is an MPEG4 method; and
- the picture selector generates the subset image signal by extracting I-pictures and P-pictures.
7. A transcoder according to claim 1, wherein the picture selector allows the user to specify what types of pictures are to be extracted.
8. A transcoder according to claim 1, wherein a bit rate of the first image signal supplied to the picture selector is set so as to compensate for an amount of code of the pictures which are not extracted when the subset image signal is generated.
9. A transcoder according to claim 1, further comprising:
- a frame memory for storing the image signal decoded by the first decoder; and
- a display section which reads out the image signal from the frame memory and outputs the image signal to a display unit;
- wherein images being transcoded are displayed on the display unit.
10. An imaging apparatus using the transcoder according to claim 1, said imaging apparatus comprising:
- an image pickup section which picks up an object;
- a second encoder which, by the first compressing and encoding method, encodes the first image signal supplied from the image pickup section; and
- a recording and reproducing section which records and reproduces the first image signal encoded by the second encoder to and from a recording medium,
- wherein the first image signal reproduced from the recording medium is supplied to the transcoder.
11. An imaging apparatus according to claim 10, further comprising:
- a receiver which receives an image signal from the outside;
- wherein the second encoder encodes the image signal supplied from the receiver by the first compressing and encoding method.
12. An imaging apparatus according to claim 10, further comprising:
- a receiver which receives the first image signal encoded by the first compressing and encoding method from the outside;
- wherein the recording and reproducing section records and reproduces the first image signal supplied from the receiver to and from the recording medium.
13. An imaging apparatus according to claim 10, wherein the recording and reproducing apparatus generates management information from the first image signal recorded on the recording medium and records the management information on the recording medium.
14. A signal processor using the transcoder according to claim 1, wherein said signal processor inputs a first image signal encoded by a first compressing and encoding method, transcodes the first image signal to a second image signal encoded by a second compressing and encoding method and outputs the second image signal to external equipment.
Type: Application
Filed: Jan 30, 2004
Publication Date: Mar 3, 2005
Applicant: Hitachi, Ltd. (Tokyo)
Inventors: Daisuke Yoshida (Yokohama), Hironori Komi (Tokyo), Yusuke Yatabe (Yokohama)
Application Number: 10/768,757