Abstract: An image processing device 20 acquires captured images obtained by imaging a sample including a target cell, performs machine learning based on a first image feature quantity, sets a plurality of object regions for detecting the target cell in the captured images, and displays the plurality of object regions in an order determined based on the first image feature quantity of each of the plurality of object regions. The image processing device 20 calculates a second image feature quantity in each of the plurality of object regions, sorts the plurality of displayed object regions in an order of the second image feature quantity similar to the second image feature quantity of a reference object region selected from among the plurality of object regions, and displays the plurality of object regions.
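As a rough illustration of the re-sorting step in the abstract above, the following sketch orders object regions by how close their second image feature quantity is to that of a selected reference region. The feature representation (a tuple of floats), the use of Euclidean distance, and the names `sort_by_similarity`, `features`, and `reference_id` are assumptions made for illustration; the abstract does not specify them.

```python
import math

def sort_by_similarity(features, reference_id):
    """Return region ids ordered from most to least similar to the reference.

    `features` maps a region id to its (hypothetical) second feature vector.
    Similarity is assumed to be Euclidean distance; smaller means more similar.
    """
    reference = features[reference_id]
    return sorted(features, key=lambda rid: math.dist(features[rid], reference))

# Hypothetical feature vectors for three object regions.
regions = {"r1": (0.0, 0.0), "r2": (3.0, 4.0), "r3": (1.0, 0.0)}
print(sort_by_similarity(regions, "r1"))
```

The reference region always sorts first because its distance to itself is zero.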
Abstract: According to exemplary embodiments of the present invention, references to a motion vector in a neighboring block of an enhancement layer or in a block of a previously coded picture, and a motion vector of a base layer are equally handled. Accordingly, a motion_prediction_flag with identification information can be integrated for further improved coding efficiency.
Abstract: In one example, a device for coding video data includes a video coder configured to determine a first target reference index representing a reference picture to which a candidate motion vector predictor for a current motion vector refers, determine, for the candidate motion vector predictor and during temporal motion vector prediction (TMVP) of the current motion vector, a value for an additional target reference index that is equal to a predetermined value, and code the current motion vector based at least in part on the value of the additional reference index.
Abstract: A method and system for reduced complexity motion compensated temporal processing for pre-analysis purposes. The processing complexity is reduced by reducing the number of pictures processed for analysis, reducing the number of references used for analysis, and/or spatially subsampling input pictures.
Abstract: Systems and methods for multimedia encoding and decoding are disclosed. The systems and methods include multimedia format detection systems, decoder functionality generation systems, decoder instantiation systems, and multimedia processing engines which are capable of selecting a decoder or playback mechanism for each input encoded multimedia stream. The functionality of the decoder or playback mechanism is represented as syntax elements which may be further encoded. The functionality for decoding or playback is then stored or transmitted with the multimedia bitstream. Alternatively, the functionality and multimedia bitstream can be embedded in or associated with a second digital bitstream. Further, the functionality associated with an encoded multimedia stream can be used to instantiate a decoder or playback mechanism and the encoded multimedia stream decoded with the instantiated decoder or mechanism.
Abstract: Overlapped block disparity estimation and compensation is described. Compensating for images with overlapped block disparity compensation (OBDC) involves determining if OBDC is enabled in a video bit stream, and determining if OBDC is enabled for one or more macroblocks that neighbor a first macroblock within the video bit stream. The neighboring macroblocks may be transform coded. If OBDC is enabled in the video bit stream and for the one or more neighboring macroblocks, predictions may be made for a region of the first macroblock that has an edge adjacent with the neighboring macroblocks. OBDC can be causally applied. Disparity compensation parameters or modes may be shared amongst views or layers. A variety of predictions may be used with causally-applied OBDC.
Abstract: In one example, an apparatus for encoding video data includes a video encoder configured to scan a two-dimensional block of transform coefficients to produce a one-dimensional vector of the transform coefficients, determine values indicative of whether the transform coefficients in the one-dimensional vector are significant, and entropy encode at least one of the values using a context model selected based on at least a percentage of significant coefficients in a predetermined number of the values encoded before the at least one of the values.
Type:
Grant
Filed:
July 7, 2011
Date of Patent:
May 23, 2017
Assignee:
QUALCOMM Incorporated
Inventors:
Marta Karczewicz, Muhammed Zeyd Coban, Peisong Chen, Hsiao-Chiang Chuang, Rajan L. Joshi
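The context selection described in the preceding abstract (scan, significance flags, then a context chosen from the share of significant flags among the most recently coded values) might be sketched as follows. The zigzag scan order, the window of 4 flags, the 3 contexts, and all function names are assumptions for illustration only; the abstract fixes none of them, and the entropy coder itself is omitted.

```python
def zigzag_scan(block):
    """Flatten a square 2-D block into a 1-D list in zigzag order (assumed scan)."""
    n = len(block)
    positions = [(i, j) for i in range(n) for j in range(n)]
    # Walk anti-diagonals; alternate direction on each diagonal.
    positions.sort(key=lambda p: (p[0] + p[1], p[0] if (p[0] + p[1]) % 2 else -p[0]))
    return [block[i][j] for i, j in positions]

def significance_flags(coeffs):
    """1 where a coefficient is nonzero (significant), else 0."""
    return [1 if c != 0 else 0 for c in coeffs]

def select_context(prev_flags, window=4, num_contexts=3):
    """Choose a context index from the share of significant flags in the
    last `window` previously coded flags (window and count are hypothetical)."""
    recent = prev_flags[-window:]
    if not recent:
        return 0  # nothing coded yet: fall back to the first context
    # Integer arithmetic avoids floating-point rounding at bucket edges.
    return min(sum(recent) * num_contexts // len(recent), num_contexts - 1)

def contexts_for_block(block):
    """Context index that would be used for each significance flag, in scan order."""
    flags = significance_flags(zigzag_scan(block))
    return [select_context(flags[:i]) for i in range(len(flags))]
```

Each flag's context depends only on flags coded before it, so a decoder can reproduce the same choice without side information.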
Abstract: A video processing system is provided to create quantization data parameters based on human eye attraction and to provide them to an encoder, enabling the encoder to compress data while taking human perceptual guidance into account. The system includes a perceptual video processor (PVP) to generate a perceptual significance pixel map for data to be input to the encoder. Companding is provided to reduce the pixel values to values ranging from zero to one, and decimation is performed to match the pixel values to the spatial resolution of quantization parameter (QP) values in a look-up table (LUT). The LUT values then provide the metadata supplied to the encoder, so that the encoder compresses the original picture with bits allocated to pixels in a macroblock according to the predictions of eye tracking.
Type:
Grant
Filed:
March 15, 2013
Date of Patent:
May 23, 2017
Assignee:
ARRIS Enterprises, Inc.
Inventors:
Sean T. McCarthy, Peter A. Borgwardt, Vijay Kamarshi, Shiv Saxena
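The companding, decimation, and LUT steps named in the preceding abstract could look roughly like the sketch below. The power-law companding curve, the block-average decimation, the 2×2 block size, and the `QP_LUT` values are all hypothetical; the abstract does not give the actual curve, filter, or table.

```python
# Hypothetical table: higher perceptual significance maps to a lower QP (finer quantization).
QP_LUT = [34, 30, 26, 22]

def compand(value, max_value=255, gamma=0.5):
    """Map a pixel value into [0, 1] with an assumed power-law curve."""
    return (value / max_value) ** gamma

def decimate(pixel_map, block=2):
    """Average each block x block tile so the map matches the QP grid resolution."""
    rows, cols = len(pixel_map) // block, len(pixel_map[0]) // block
    out = []
    for r in range(rows):
        row = []
        for c in range(cols):
            tile = [pixel_map[r * block + i][c * block + j]
                    for i in range(block) for j in range(block)]
            row.append(sum(tile) / len(tile))
        out.append(row)
    return out

def qp_map(significance_map, block=2):
    """Compand, decimate, then look up one QP per tile."""
    companded = [[compand(v) for v in row] for row in significance_map]
    reduced = decimate(companded, block)
    last = len(QP_LUT) - 1
    return [[QP_LUT[min(int(v * len(QP_LUT)), last)] for v in row] for row in reduced]

# Left tile has no perceptual significance, right tile has maximum significance.
print(qp_map([[0, 0, 255, 255], [0, 0, 255, 255]]))
```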
Abstract: An approach is provided in which a rear view system provides a rear view video feed to a vehicle operator. The rear view system determines that an eye focus point of a vehicle operator is positioned in a camera angle change region for a predetermined amount of time and performs adjustments based upon the vehicle operator's eye focus point accordingly.
Type:
Grant
Filed:
May 28, 2013
Date of Patent:
May 16, 2017
Assignee:
International Business Machines Corporation
Abstract: In one example, an apparatus includes a video encoder configured to partition a block of video data into a first geometric partition and a second geometric partition using a geometric motion partition line, wherein the block comprises N×N pixels, divide the block of video data into four equally-sized, non-overlapping (N/2)×(N/2) sub-blocks, and encode at least one of the sub-blocks through which the geometric motion partition line passes using a transform size smaller than (N/2)×(N/2). The video encoder may determine transform sizes for the sub-blocks based on whether the geometric motion partition line passes through the sub-blocks. In one example, a video decoder may inverse transform the sub-blocks, and may determine transform sizes for the sub-blocks based on whether the geometric motion partition line passes through the sub-blocks.
Type:
Grant
Filed:
January 25, 2011
Date of Patent:
May 16, 2017
Assignee:
QUALCOMM Incorporated
Inventors:
Peisong Chen, Xianglin Wang, Marta Karczewicz
Abstract: A splicing device for replacing video frames in a transport stream is presented. The splicing device comprises a processor and a memory storing instructions that, when executed by the processor, cause the splicing device to: receive the transport stream comprising frames of a first program stream and frames of a second program stream, and replace at least one of the frames of the second program stream with frames of a replacement program stored in a storage encoded at a plurality of different bit rates, wherein the frames of the replacement program are selected from the plurality of different bit rates to comply with a maximum bandwidth of the transport stream.
Type:
Grant
Filed:
September 3, 2014
Date of Patent:
May 16, 2017
Assignee:
VIGOR SYSTEMS INC.
Inventors:
Frederick Kokaska, Magnus Sörlander, Janno Ossaar
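One simple way to read the bit-rate selection in the preceding abstract is to pick the highest stored rendition of the replacement program that still fits within the transport stream's bandwidth once the first program stream is accounted for. The sketch below assumes exactly that policy; the function name, the kbit/s units, and the idea of a fixed first-program rate are illustrative assumptions, not details from the abstract.

```python
def pick_replacement_bitrate(available_bitrates, max_bandwidth, first_program_bitrate):
    """Return the highest stored bit rate that keeps the spliced transport
    stream within `max_bandwidth`, or None if no rendition fits.

    All rates share one unit (kbit/s assumed here).
    """
    budget = max_bandwidth - first_program_bitrate
    fitting = [rate for rate in available_bitrates if rate <= budget]
    return max(fitting) if fitting else None

# Replacement program stored at three hypothetical bit rates.
print(pick_replacement_bitrate([2000, 4000, 6000], max_bandwidth=10000,
                               first_program_bitrate=5000))
```

Returning `None` when nothing fits leaves the fallback (for example, skipping the splice) to the caller.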
Abstract: A computer-implemented system and method are described for performing video compression. For example, a method according to one embodiment comprises: encoding a plurality of video frames or portions thereof according to a first encoding format; transmitting the plurality of encoded video frames or portions to a client device; receiving feedback information from the client device, the feedback information usable to determine whether data contained in the video frames or portions has not been successfully received and/or decoded; in response to detecting that a video frame or portion thereof has not been successfully received and/or decoded, encoding a video frame or portion thereof according to a second encoding format; and transmitting the video frames or portions thereof to the client device.
Type:
Grant
Filed:
April 12, 2016
Date of Patent:
May 9, 2017
Assignee:
Sony Interactive Entertainment America LLC
Inventors:
Stephen G. Perlman, Roger van der Laan, Timothy Cotter, Scott Furman, Robert McCool, Ian Buckley
Abstract: System, apparatus, method, and computer readable media for on-the-fly captured video summarization. A video stream is incrementally summarized in concurrence with generation of the stream by a camera module (CM). Saliency of the video stream summary is maintained as the stream evolves by updating the summary to include only the most significant frames. In one exemplary embodiment, saliency is determined by optimizing an objective function including terms that are indicative of both the diversity of a selection, and how representative the selection is of the processed portion of the video data corpus. A device platform including a CM and comporting with the exemplary architecture may provide video camera functionality at ultra-low power, and/or with ultra-low storage resources, and/or with ultra-low communication channel bandwidth.
Abstract: A coding format converter for converting an input stream of data of pictures of a first coding format into a data stream of data of pictures with a second coding format includes a decoder decoding the input stream with the first coding format, an encoder encoding the data of the picture thus decoded with the second coding format, and a coding candidate search controller using first information on prediction residual availability indicative of whether or not residual information is encoded for each block in the decoded picture. When a search for coding candidates reveals a result that the first information for a coding target region of the picture is indicative of the absence of a residual, the controller refrains from searching for coding candidates which depend on second information on residual availability of the coding with the second coding format.
Abstract: In one method embodiment, partitioning a block matched reference frame into plural n×m non-overlapping pixel superblocks, where n and m are non-negative integer numbers; designating each of the n×m pixel superblocks as field or frame; and field processing by overlapped block processing logic two n×n blocks of an n×m overlapped superblock if one of first plural n×m superblocks intersected by the overlapped superblock has a field designation, otherwise frame processing, by the overlapped block processing logic, the two n×n blocks of the overlapped superblock.
Abstract: A camera module suitable for use for a vision system of a vehicle includes a camera housing portion having a generally cylindrical portion extending outwardly from a base portion. An imaging sensor is disposed at the base portion and a lens system is received at the cylindrical portion of the camera housing portion. The camera housing portion and a connector portion are configured to join together to encase the imaging sensor and circuitry. The connector portion may include a multi-pin connector extending in a direction away from the camera housing portion when the camera housing portion and the connector portion are joined together. The multi-pin connector may include a plurality of terminals, each of which has a first end having a pin configured to electrically connect with a connector of a vehicle and an opposing second end configured for electrical connection with the circuitry.
Type:
Grant
Filed:
March 22, 2016
Date of Patent:
April 18, 2017
Assignee:
MAGNA ELECTRONICS INC.
Inventors:
Robert L. Bingle, Joseph Camilleri, Peter J. Whitehead, Kenneth Schofield
Abstract: A method and system are disclosed for stereoscopic three dimensional (S3D) display of synthetic environment (SE) data simultaneously with operational symbology data to the operator of a vehicle. The method receives a plurality of data objects and places the objects within a plurality of levels of a presentation hierarchy and further within a plurality of S3D depth layers based on a set of data and operator focused rules. The method simultaneously displays to the operator each of the plurality of objects in the plurality of S3D depth layers according to the placement. The method transitions an object from one depth layer to a more proximal or distal depth layer to indicate relative motion or urgency. The method reserves a depth layer most proximal to the operator for a warning while a most distal set of depth layers is reserved for SE objects.
Type:
Grant
Filed:
December 19, 2012
Date of Patent:
April 11, 2017
Assignee:
Rockwell Collins, Inc.
Inventors:
Alex Postnikov, Tracy J. Barnidge, Timothy J. Etherington, Jason C. Wenger, Joseph L. Tchon
Abstract: A method and apparatus for emulation of bi-predicted B pictures is described where a B picture is constructed only from information from one preceding frame and one successive frame, without pixel averaging or motion vector computation. In the case of H.264 or analogous codecs, a top field of a B picture is formed from the successive I or P picture top field, and the bottom field is formed from the previous P picture bottom field. B picture emulation may reverse the previous and successive pictures. For interlaced pictures, the odd and even interlacing fields are treated as top and bottom fields, and also use information from both one preceding and one successive frame. The resulting bi-predicted B pictures reduce inter-GOP flicker.
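The field assembly in the abstract above can be pictured with a small sketch: the emulated B picture takes its top field from the successive picture and its bottom field from the previous picture, with no averaging. Representing a frame as a list of rows, and treating even-indexed rows as the top field, are assumptions for illustration; the `reverse` flag mirrors the abstract's note that the two source pictures may be swapped.

```python
def emulate_b_picture(prev_frame, next_frame, reverse=False):
    """Build an emulated B picture by interleaving fields of two frames.

    Frames are lists of rows; even-indexed rows are taken as the top field
    and odd-indexed rows as the bottom field (an assumed convention).
    Top field comes from `next_frame`, bottom field from `prev_frame`,
    unless `reverse` swaps the two sources.
    """
    if len(prev_frame) != len(next_frame):
        raise ValueError("frames must have the same number of rows")
    top_src, bottom_src = (prev_frame, next_frame) if reverse else (next_frame, prev_frame)
    return [list(top_src[r]) if r % 2 == 0 else list(bottom_src[r])
            for r in range(len(prev_frame))]

# Tiny 4-row frames whose values identify their source.
prev = [["p0"], ["p1"], ["p2"], ["p3"]]
nxt = [["n0"], ["n1"], ["n2"], ["n3"]]
print(emulate_b_picture(prev, nxt))
```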
Abstract: Methods and apparatuses for encoding and decoding mode information of a block of an image. The method of decoding the mode information includes decoding information representing whether a current block is encoded in a first mode, and hierarchically decoding information representing an encoding mode of the current block from among a second mode and a third mode.
Type:
Grant
Filed:
September 17, 2010
Date of Patent:
April 11, 2017
Assignee:
SAMSUNG ELECTRONICS CO., LTD.
Inventors:
Sung-bum Park, Jung-woo Kim, Dai-woong Choi, Jae-won Yoon, Jun-ho Cho
Abstract: Traffic signals include displays and controllers for displaying information relating to the general condition of the traffic signal. The information can include a time remaining in seconds until the next change in the general condition. Traffic signal systems are adaptable to facilitate passage of emergency vehicles through intersections, including traffic signal systems with flat panel screens. The displays can be flat panel screens displaying information in multiple information formats.