Shape, Icon, Or Feature-based Compression Patents (Class 382/243)
-
Patent number: 12225193Abstract: A method for information compression/decompression, apparatuses and a non-transitory computer-readable storage medium are disclosed. The method for information compression may include: clustering text blocks to be processed into respective-text areas according to pixel distribution information of the text blocks to be processed; acquiring text row distribution information of each of the text areas according to foreground pixels of each text row in each of the text areas; scanning each text row in each of the text areas according to the acquired text row distribution information to acquire original pixel information of each text row; and performing lossless compression on the text row distribution information of a plurality of the text areas and the original pixel information of each text row of the plurality of the text areas.Type: GrantFiled: April 30, 2020Date of Patent: February 11, 2025Assignee: ZTE CORPORATIONInventors: Junping Gao, Zhenfeng Cui, Zhen Hu
-
Patent number: 12218689Abstract: Length-adapter input parameters for length-adaptive encoding include a data word length and a length-adapted codeword length, which are positive integers. Length-adapter output parameters include a primary data word length, a secondary data word length, a primary codeword length, and a secondary codeword length. A received data word is split according to splitter parameters into a primary data word based on the primary data word length and a secondary data word based on the secondary data word length. The primary data word is encoded in accordance with primary encoder parameters to generate a primary codeword from a primary code. The secondary data word is encoded in accordance with secondary encoder parameters to generate a secondary codeword from a secondary code. The primary and secondary codewords are combined in accordance with combiner parameters to generate a length-adapted codeword transmitted via a channel to a decoder.Type: GrantFiled: July 18, 2023Date of Patent: February 4, 2025Assignee: Polaran Haberlesme Teknolojileri Anonim SirketiInventor: Erdal Arikan
-
Patent number: 12176922Abstract: Length-adapter input parameters for length-adaptive encoding include a data word length and a length-adapted codeword length, which are positive integers. Length-adapter output parameters include a primary data word length, a secondary data word length, a primary codeword length, and a secondary codeword length. A received data word is split according to splitter parameters into a primary data word based on the primary data word length and a secondary data word based on the secondary data word length. The primary data word is encoded in accordance with primary encoder parameters to generate a primary codeword from a primary code. The secondary data word is encoded in accordance with secondary encoder parameters to generate a secondary codeword from a secondary code. The primary and secondary codewords are combined in accordance with combiner parameters to generate a length-adapted codeword transmitted via a channel to a decoder.Type: GrantFiled: July 18, 2023Date of Patent: December 24, 2024Assignee: Polaran Haberlesme Teknolojileri Anonim SirketiInventor: Erdal Arikan
-
Patent number: 12154302Abstract: Disclosed herein are a method, an apparatus and a storage medium for image encoding/decoding using a binary mask. An encoding method includes generating a latent vector using an input image, generating a selected latent vector component set using a binary mask, and generating a main bitstream by performing entropy encoding on the selected latent vector component set. A decoding method includes generating a selected latent vector component set including one or more selected latent vector components by performing entropy decoding on a main bitstream and generating the latent vector in which the one or more selected latent vector components are relocated by relocating the selected latent vector component set in the latent vector.Type: GrantFiled: December 3, 2021Date of Patent: November 26, 2024Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTEInventors: Joo-Young Lee, Se-Yoon Jeong, Hyoung-Jin Kwon, Dong-Hyun Kim, Youn-Hee Kim, Jong-Ho Kim, Ji-Hoon Do, Jin-Soo Choi, Tae-Jin Lee
-
Patent number: 12149262Abstract: Length-adapter input parameters for length-adaptive encoding include a data word length and a length-adapted codeword length, which are positive integers. Length-adapter output parameters include a primary data word length, a secondary data word length, a primary codeword length, and a secondary codeword length. A received data word is split according to splitter parameters into a primary data word based on the primary data word length and a secondary data word based on the secondary data word length. The primary data word is encoded in accordance with primary encoder parameters to generate a primary codeword from a primary code. The secondary data word is encoded in accordance with secondary encoder parameters to generate a secondary codeword from a secondary code. The primary and secondary codewords are combined in accordance with combiner parameters to generate a length-adapted codeword transmitted via a channel to a decoder.Type: GrantFiled: July 18, 2023Date of Patent: November 19, 2024Assignee: Polaran Haberlesme Teknolojileri Anonim SirketiInventor: Erdal Arikan
-
Patent number: 12095981Abstract: A data compression method is provided for compressing an image. A coding module may select a plurality of pixels with a sequence order from the image, and compress the plurality of pixels to generate a plurality of compressed pixels. For a current pixel p[i] having a previous pixel p[i?1] and a next pixel p[i+1], the coding module generates a coding mode M[i+1] configured for compressing the p[i+1], and generates a fixed-rate compressed value c[i] corresponding to the p[i]. The coding module stores the c[i] in a compressed pixel, and c[i] encapsulates the coding mode M[i+1]. The coding module then stores the plurality of compressed pixels into a compressed image corresponding to the image.Type: GrantFiled: January 30, 2023Date of Patent: September 17, 2024Assignees: VeriSilicon Microelectronics (Shanghai) Co., Ltd., VeriSilicon Holdings Co., Ltd.Inventors: Lefan Zhong, Mankit Lo, Wei Miao
-
Patent number: 12056902Abstract: A device includes a memory and at least one processor coupled to the memory. The at least one processor is configured to obtain an image and determine a parent cluster of pixels of an image having a centroid. The at least one processor is also configured to split the parent cluster into at least a first child cluster and a second child cluster and assign a pixel of the image to the first child cluster. Additionally, the at least one processor is configured to update a centroid of the first child cluster based at least in part on the pixel, replace the pixel in the image with the centroid of the first child cluster to produce a compressed image, and store the compressed image in the memory.Type: GrantFiled: April 11, 2023Date of Patent: August 6, 2024Assignee: TEXAS INSTRUMENTS INCORPORATEDInventors: Jeffrey Kempf, Jonathan Andrew Lucas
-
Patent number: 12026920Abstract: Disclosed are a point cloud encoding and decoding method, an encoder and a decoder. The method comprises: acquiring geometric information and attribute information of an input point cloud; determining a maximum allowable value of a sampling period when the input point cloud is subjected to level-of-detail (LOD) division; determining a preset value of the sampling period on the basis of the maximum allowable value of the sampling period; processing the input point cloud according to the preset value of the sampling period and the geometric information, so as to obtain at least one refinement layer and at least one detail layer; and encoding the attribute information by using the at least one refinement layer and the at least one detail layer, so as to generate a code stream, and writing the preset value of the sampling period into the code stream.Type: GrantFiled: June 28, 2023Date of Patent: July 2, 2024Assignee: GUANGDONG OPPO MOBILE TELECOMMUNICATIONS CORP., LTD.Inventors: Shuai Wan, Lei Wei, Fuzheng Yang
-
Patent number: 12028788Abstract: An object is to provide a communication system and a base station capable of predicting future communication quality in order to enable variations in communication quality due to variations in environment to be addressed. A communication system and a base station according to the invention learn an input and output relationship from surrounding environment information of the base station that can be acquired by a camera, a sensor, or the like, terminal information such as position information of a terminal and current communication quality to generate a learning model, and predict future communication quality using the learning model, the surrounding environment information, and the terminal information.Type: GrantFiled: April 26, 2019Date of Patent: July 2, 2024Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Riichi Kudo, Takeru Inoue, Atsushi Taniguchi, Kohei Mizuno
-
Patent number: 12008311Abstract: Systems and methods of improving the operation of a transaction network and transaction network devices are disclosed. An online purchase autofill plugin includes various modules and engines. The fields of online forms may be identified and the fields of online forms may be automatically filled. The user experience may be improved, and data security enhanced so that the transaction network more properly functions according to approved parameters, such as protecting the integrity of sensitive data.Type: GrantFiled: February 16, 2022Date of Patent: June 11, 2024Assignee: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.Inventor: Hans-Jurgen Greiner
-
Patent number: 11997307Abstract: A method of decoding video data in merge mode can include constructing a merge candidate list using available spatial and temporal merge candidates; deriving motion information using a merge index and the merge candidate list; generating a prediction block using the motion information; generating a residual block by inverse-quantizing a quantized block using a quantization parameter and a quantization matrix and by inverse-transforming the inverse quantized block; and generating a reconstructed block using the residual block and the prediction block, wherein the quantization parameter is generated per quantization unit and a minimum size of the quantization unit is adjusted per picture, and the quantization parameter is generated using a quantization parameter predictor and a differential quantization parameter.Type: GrantFiled: July 20, 2021Date of Patent: May 28, 2024Assignee: GENSQUARE LLCInventors: Soo Mi Oh, Moonock Yang
-
Patent number: 11922549Abstract: Text is generated from an object. Text is generated from a first object. The first object includes a second object and a third object. A step of detecting coordinate data of the second object is included. A step of detecting coordinate data of the third object is included. A step of extracting positional relation between the second object and the third object from coordinate data is included. A step of converting the extracted positional relation into graph data is included. A step of generating text about the positional relation between the second object and the third object from graph data is included.Type: GrantFiled: July 9, 2020Date of Patent: March 5, 2024Assignee: Semiconductor Energy Laboratory Co., Ltd.Inventors: Kengo Akimoto, Junpei Momo, Takahiro Fukutome
-
Patent number: 11861884Abstract: Certain aspects of the disclosure provide systems and methods for training an information extraction transformer model architecture directed to pre-training a first multimodal transformer model on an unlabeled dataset, training a second multimodal transformer model on a first labeled dataset to perform a key information extraction task processing the unlabeled dataset with the second multimodal transformer model to generate pseudo-labels for the unlabeled dataset, training the first multimodal transformer model based on a second labeled dataset comprising one or more labels, the pseudo-labels generated, or combinations thereof to generate a third multimodal transformer model, generating updated pseudo-labels based on label completion predictions from the third multimodal transformer model, and training the third multimodal transformer model using a noise-aware loss function and the updated pseudo-labels to generate an updated third multimodal transformer model.Type: GrantFiled: April 10, 2023Date of Patent: January 2, 2024Assignee: Intuit, Inc.Inventors: Karelia Del Carmen Pena Pena, Tharathorn Rimchala, Peter Lee Frick, Tak Yiu Daniel Li
-
Patent number: 11810252Abstract: The present disclosure relates to methods, devices, and systems for blending geographic data when combining geographic data sources. The methods, devices, and systems identify a blend region for transitioning between a first dataset and a second dataset. The methods, devices, and systems extrapolate geographic data from the second dataset to blend with the geographic data from the first dataset to create blended elevation data in the blend region. The methods, devices, and systems may generate an image for a geographic region with the first set of geographic data, the second set of geographic data, and the blended elevation data.Type: GrantFiled: April 9, 2021Date of Patent: November 7, 2023Assignee: Microsoft Technology Licensing, LLCInventor: Duncan Murray Lawler
-
Patent number: 11734486Abstract: Aspects of the invention include systems and methods for implementing a sweepline triangulation technique to optimize spanning graphs for circuit routing. A non-limiting example computer-implemented method includes receiving an unrouted net having a plurality of elements. The elements can include pins, vias, and wires. A sweepline is passed across the unrouted net until the sweepline intersects an element of the plurality of elements. In response to the sweepline intersecting the element, the sweepline is stopped and one or more nodes on the sweepline and one or more previous nodes are identified. A connectivity graph is built from the one or more nodes and the one or more previous nodes. The connectivity graph includes one or more arcs and one or more guides. A minimum spanning tree is built by removing one or more guides from the connectivity graph and the unrouted net is routed based on the minimum spanning tree.Type: GrantFiled: September 7, 2021Date of Patent: August 22, 2023Assignee: International Business Machines CorporationInventors: Diwesh Pandey, Gustavo Enrique Tellez
-
Patent number: 11722682Abstract: A method of entropy coding in a video encoder is provided that includes assigning a first bin to a first single-probability bin encoder based on a probability state of the first bin, wherein the first single-probability bin encoder performs binary arithmetic coding based on a first fixed probability state, assigning a second bin to a second single-probability bin encoder based on a probability state of the second bin, wherein the second single-probability bin encoder performs binary arithmetic coding based on a second fixed probability state different from the first fixed probability state, and coding the first bin in the first single-probability bin encoder and the second bin in the second single-probability bin encoder in parallel, wherein the first single-probability bin encoder uses a first rLPS table for the first fixed probability state and the second single-probability bin encoder uses a second rLPS table for the second fixed probability state.Type: GrantFiled: December 30, 2020Date of Patent: August 8, 2023Assignee: Texas Instruments IncorporatedInventors: Vivienne Sze, Madhukar Budagavi
-
Patent number: 11709798Abstract: An example method is provided in according with one implementation of the present disclosure. The method comprises generating, via a processor, a set of hashes for each of a plurality of objects. The method also comprises computing, via the processor, a high-dimensional sparse vector for each object, where the vector represents the set of hashes for each object. The method further comprises computing, via the processor, a combined high-dimensional sparse vector from the high-dimensional sparse vectors for all objects and computing a hash suppression threshold. The method also comprises determining, via the processor, a group of hashes to be suppressed by using the hash suppression threshold, and suppressing, via the processor, the group of selected hashes when performing an action.Type: GrantFiled: October 13, 2021Date of Patent: July 25, 2023Assignee: Hewlett Packard Enterprise Development LPInventors: Mehran Kafai, Kave Eshghi, Omar Aguilar Macedo
-
Patent number: 11657541Abstract: A method to compress an image includes assigning each pixel of the image to a cluster based on a red-green-blue (RGB) location of the pixel. The method also includes updating a centroid of the cluster after each pixel is assigned, based at least in part on the RGB location of the pixel, where the centroid is an RGB location. The method includes replacing each pixel in the image with an RGB value of the centroid of the cluster to which the pixel is assigned. The method also includes instructing a display to display a compressed image where, in the compressed image, each pixel in the image is replaced with the RGB value of the centroid of the cluster to which the pixel is assigned.Type: GrantFiled: December 31, 2020Date of Patent: May 23, 2023Assignee: TEXAS INSTRUMENTS INCORPORATEDInventors: Jeffrey Matthew Kempf, Jonathan Andrew Lucas
-
Patent number: 11657540Abstract: A device and method used to image cylindrical fluid conduits, such as pipes, wellbores and tubulars, with ultrasound transducers then compress that data for storage or visualization. The compressed images may be stored on the tool and/or transmitted over telemetry, enabling the device to inspect and record long pipes or wells in high resolution on a single trip. This allow the ultrasound imaging tool to record much longer wells in higher resolution than would otherwise be possible. An outward-facing radial array of ultrasound transducers captures cross-sectional slices of the conduit to create frames from scan lines. The frames are compressed by applying a demodulation process and spatial conversion process to the scan lines. Video compression is applied to the to the demodulated, spatially converted ultrasound images to return compressed images.Type: GrantFiled: June 24, 2020Date of Patent: May 23, 2023Assignee: DarkVision Technologies IncInventor: Steven Wrinch
-
Patent number: 11601713Abstract: A system and method for identifying media segments using audio augmented image cross-comparison is disclosed, in which a media segment identifying system analyses both audio and video content, producing a unique identifier to compare with previously identified media segments in a media segment database. The characteristic landmark-linked-image-comparisons are constructed by first identifying an audio landmark. The audio landmark is an audio peak that exceeds a predetermined threshold. Two digital images are then obtained, one associated directly with the audio landmark, and one obtained a predetermined landmark time removed from the first image. The two images are then used to provide a characteristic landmark-linked-image-comparison. The pair of images are reduced in pixel size and converted to gray scale. Corresponding pixels are compared to form a numeric comparison. One image is mirrored before comparison to reduce the possibility of null comparisons.Type: GrantFiled: December 9, 2020Date of Patent: March 7, 2023Inventors: Oran Gilad, Samuel Chenillo, Oren Steinfeld
-
Patent number: 11587281Abstract: A graphics processor architecture provides for scan conversion and ray tracing approaches to visible surface determination as concurrent and separate processes. Surfaces can be identified for shading by scan conversion and ray tracing. Data produced by each can be normalized, so that instances of shaders, being executed on a unified shading computation resource, can shade surfaces originating from both ray tracing and rasterization. Such resource also may execute geometry shaders. The shaders can emit rays to be tested for intersection by the ray tracing process. Such shaders can complete, without waiting for those emitted rays to complete. Where scan conversion operates on tiles of 2-D screen pixels, the ray tracing can be tile aware, and controlled to prioritize testing of rays based on scan conversion status. Ray population can be controlled by feedback to any of scan conversion, and shading.Type: GrantFiled: January 11, 2021Date of Patent: February 21, 2023Assignee: Imagination Technologies LimitedInventors: John W. Howson, Luke Tilman Peterson, Steven J. Clohset
-
Patent number: 11582463Abstract: A method, computer program, and computer system is provided for aligning across layers in a coded video stream. A video bitstream having multiple layers is decoded. One or more subpicture regions are identified from among the multiple layers of the decoded video bitstream, the subpicture regions including a background region and one or more foreground subpicture regions. An enhanced subpicture is decoded and displayed based on a determination that a foreground subpicture region is selected. The background region is decoded and displayed based on a determination that a foreground subpicture region was not selected.Type: GrantFiled: October 5, 2020Date of Patent: February 14, 2023Assignee: TENCENT AMERICA LLCInventors: Byeongdoo Choi, Shan Liu, Stephan Wenger
-
Patent number: 11544943Abstract: A method includes executing an encoder machine learning model on multiple token values contained in a document to create an encoder hidden state vector. A decoder machine learning model executing on the encoder hidden state vector generates raw text comprising an entity value and an entity label for each of multiple entities. The method further includes generating a structural representation of the entities directly from the raw text and outputting the structural representation of the entities of the document.Type: GrantFiled: May 31, 2022Date of Patent: January 3, 2023Assignee: Intuit Inc.Inventors: Tharathorn Rimchala, Peter Frick
-
Patent number: 11501469Abstract: A data generation system for generating data representing content to be displayed includes: a content dividing unit operable to divide content to be displayed into a plurality of polyhedra and generate polyhedron position information, an intersection detecting unit operable to generate intersection information that describes the intersection of one or more surfaces within the content with the plurality of polyhedra, a polyhedron classifying unit operable to classify each of the polyhedra in dependence upon the intersection information, the classification indicating the properties of the surface within the respective polyhedra, and a data generating unit operable to generate data comprising the polyhedron position information and the polyhedron classification information.Type: GrantFiled: September 12, 2018Date of Patent: November 15, 2022Assignee: Sony Interactive Entertainment Inc.Inventor: Patrick John Connor
-
Patent number: 11477482Abstract: A three-dimensional data storage method includes: acquiring one or more units in which an encoded stream generated by encoding point cloud data is stored; and storing the one or more units into a file. The storing includes storing, in control information for the file, information indicating that data stored in the file is data generated by encoding the point cloud data.Type: GrantFiled: December 23, 2020Date of Patent: October 18, 2022Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICAInventors: Noritaka Iguchi, Toshiyasu Sugio
-
Patent number: 11475602Abstract: Initial low-quality images of a progressively-displayed high-definition image are masked with corresponding progressively-revealing mask filters or masking algorithms to realistically obscure such low quality and therefore to provide a realistically appearing progressive presentation of the high-definition image.Type: GrantFiled: October 31, 2020Date of Patent: October 18, 2022Assignee: PANAMORPH, INC.Inventor: Shawn L. Kelly
-
Patent number: 11423867Abstract: Provided are a signal processing device and an image display apparatus including the same. The signal processing device and the image display apparatus including the same include a scaler configured to scale input images of various resolutions to a first resolution and a resolution enhancement processor configured to perform learning on the input images having the first resolution and to generate a first image having a second resolution higher than the first resolution. Accordingly, resolution may be stably improved even if input images of various resolutions are input.Type: GrantFiled: May 29, 2019Date of Patent: August 23, 2022Assignee: LG Electronics Inc.Inventor: Jungeun Lim
-
Patent number: 11405624Abstract: A moving image decoder (1) includes an intermediate estimated prediction mode deriving section (124) for transforming a prediction mode of each neighbor partition into an intermediate prediction mode included in an intermediate prediction set which is a sum of prediction sets (PS); and an estimated prediction mode deriving section (125) for deriving an estimated prediction mode by estimating a prediction mode of a target partition based on the intermediate prediction mode of each neighbor partition which is obtained by the transform.Type: GrantFiled: April 19, 2021Date of Patent: August 2, 2022Assignee: SHARP KABUSHIKI KAISHAInventors: Tomoyuki Yamamoto, Tomohiro Ikai
-
Patent number: 11397810Abstract: An information handling system improves removal of steganography data embedded in a graphics file by processing graphics files stored in a file system or transmitted through a network by processing the graphics files in a steganalyzer. The steganalyzer converts the body segment of the graphics file into binary code, and then compresses the binary code into a graphics file. This process results in the removal of any potential malicious code. The body segment location can be determined by parsing the portable network graphics file to determine a location of a pre-fix graphics file signature and a post-fix graphics file signature, with the graphics files signatures being specific to a particular type of graphics file.Type: GrantFiled: August 5, 2019Date of Patent: July 26, 2022Assignee: Dell Products L.P.Inventors: Yevgeni Gehtman, Maxim Futerman
-
Patent number: 11386597Abstract: An information processing apparatus (10) is for supporting work by a user who uses drawings for a plant. The information processing apparatus (10) includes a controller (15). The controller (15) is configured to generate an intermediate model, for at least one of a first drawing and a second drawing that include elements configuring the plant and are judged to have different formats, such that the format of the first drawing and the format of the second drawing are matched. The controller (15) is configured to judge whether a difference exists between the first drawing and the second drawing based on the generated intermediate model.Type: GrantFiled: December 25, 2020Date of Patent: July 12, 2022Assignee: YOKOGAWA ELECTRIC CORPORATIONInventors: Takahiro Kambe, Tatenobu Seki, Nobuaki Ema, Masato Annen
-
Patent number: 11348285Abstract: A method of compressing meshes using a projection-based approach, and leveraging the tools and syntax already generated for projection-based point cloud compression is described herein. Similar to the V-PCC approach, the mesh is segmented into surface patches, only the difference is that the segments follow the connectivity of the mesh. Each surface patch (or 3D patch) is then projected to a 2D patch, whereby in the case of the mesh, the triangle surface sampling is similar to a common rasterization approach used in computer graphics. For each patch, the position of the projected vertices is kept in a list, along with the connectivity of those vertices. The sampled surface now resembles a point cloud, and is coded with the same approach used for point cloud compression. Additionally, the list of vertices and connectivity is encoded per patch, and this data is sent along with the coded point cloud data.Type: GrantFiled: April 27, 2020Date of Patent: May 31, 2022Assignee: Sony Group CorporationInventors: Danillo Graziosi, Ohji Nakagami, Alexandre Zaghetto, Ali Tabatabai
-
Patent number: 11266480Abstract: Technology is described for augmenting medical imaging for use in a medical procedure. The method can include the operation of receiving an image of patient anatomy captured by a visual image camera during the medical procedure. An acquired medical image associated with the patient anatomy can then be retrieved. Another operation can be associating the acquired medical image to the patient anatomy. An augmentation tag associated with a location in one layer of the acquired medical image can be retrieved. A further operation can be projecting the acquired medical image and the augmentation tag using an augmented reality headset to form a single graphical view as an overlay to the patient anatomy in either 2D, 3D or holographic form.Type: GrantFiled: March 15, 2021Date of Patent: March 8, 2022Assignee: Novarad CorporationInventors: Wendell Arlen Gibby, Steven Todd Cvetko
-
Patent number: 11265542Abstract: Method for decoding a picture, comprising: decoding information that the picture is partitioned into more than one segment based on one or more syntax elements in a bitstream; decoding information that the spatial segmentation is uniform based on the one or more syntax elements; determining a segment unit size based on the one or more syntax elements or based on a predefined segment unit size; decoding a first value indicating a segment width from one or more code words in the bitstream; decoding a second value indicating a segment height from the one or more code words; deriving segment column widths based on a picture width in number of segment units and the first value; deriving segment row heights based on a picture height in number of segment units and the second value; deriving a spatial location for a current block based on the derived segment column widths and the derived segment heights; and decoding the current block based on the derived spatial location.Type: GrantFiled: March 5, 2021Date of Patent: March 1, 2022Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)Inventors: Mitra Damghanian, Martin Pettersson, Rickard Sjöberg
-
Patent number: 11122280Abstract: An image data of pictures constituting moving image data is encoded to generate an encoded video stream. In this case, the image data of the pictures constituting the moving image data is classified into a plurality of levels and encoded to generate a video stream having the image data of the pictures at the respective levels. Hierarchical composition is equalized between a low-level side and a high-level side, and corresponding pictures on the low-level side and the high-level side are combined into one set and are sequentially encoded. This allows a reception side to decode the encoded image data of the pictures on the low-level side and the high-level side with a smaller buffer size and a reduced decoding delay.Type: GrantFiled: August 23, 2019Date of Patent: September 14, 2021Assignee: SONY CORPORATIONInventor: Ikuo Tsukagoshi
-
Patent number: 11107462Abstract: Exemplary embodiments relate to improvements in spoken language understanding (SLU) systems. Conventionally, SLU systems include an automatic speech recognition (ASR) component configured to receive an input of audio data and to generate a textual representation of the audio data. Conventional SLU systems also include a natural language understanding (NLU) component configured to receive a text-based transcript and perform language-based tasks such as domain classification, intent determination, and slot-filling. However, these two components are typically trained separately based on different metrics. In real-world situations, errors in the ASR component propagate to the NLU component, which degrades the performance of the overall system. Exemplary embodiments described herein perform SLU in an end-to-end manner that infers semantic meaning directly from audio features without an intermediate text representation.Type: GrantFiled: October 30, 2018Date of Patent: August 31, 2021Assignee: FACEBOOK, INC.Inventors: Christian Fuegen, Yongquiang Wang, Anuj Kumar, Baiyang Liu, Dmitrii Serdiuk
-
Patent number: 11095901Abstract: A picture is obtained. The picture is intended to be compressed in a video stream. The picture includes pixel data that depicts a scene. A subject in the pixel data that depicts the scene is identified. The identification is based on the pixel data included in the picture. A plurality of object data is generated. The generated object data is related to the subject. The generation is based on the identification of the subject. An intended compression format of the pictured to be compressed in the video stream is determined. A pixel operation on the picture is performed. The performance is based on the determination of the intended compression format and before compression of the picture. The generated plurality of object data is related with the picture after compression of the picture.Type: GrantFiled: September 23, 2019Date of Patent: August 17, 2021Assignee: International Business Machines CorporationInventors: En-Shuo Hsu, Po-Hsun Tseng, David Shao Chung Chen, Wei-Te Chiang, Hsiao-Yung Chen
-
Patent number: 11089322Abstract: A method of decoding video data in merge mode can include constructing a merge candidate list using available spatial and temporal merge candidates; deriving motion information using a merge index and the merge candidate list; generating a prediction block using the motion information; generating a residual block by inverse-quantizing a quantized block using a quantization parameter and a quantization matrix and by inverse-transforming the inverse quantized block; and generating a reconstructed block using the residual block and the prediction block, wherein the quantization parameter is generated per quantization unit and a minimum size of the quantization unit is adjusted per picture by using a parameter which specifies the depth between the quantization unit having the minimum size and a largest coding unit, and the quantization parameter is generated using a quantization parameter predictor and a differential quantization parameter.Type: GrantFiled: December 12, 2018Date of Patent: August 10, 2021Assignee: INFOBRIDGE PTE. LTD.Inventors: Soo Mi Oh, Moonock Yang
-
Patent number: 11049240Abstract: The present invention relates to a method and system for assessing bone age using deep neural network, more specifically, in which regions of interest (ROIs) even for rotated objects can be more precisely and accurately extracted from an image by a rotated object detection technique used in region proposal networks. Thereby bones with different angles in the image can be detected with excellent speed and accuracy.Type: GrantFiled: May 23, 2019Date of Patent: June 29, 2021Assignee: HealthHub Co., Ltd.Inventors: Byung Il Lee, Sung Hyun Kim
-
Patent number: 11044478Abstract: A system comprises an encoder configured to compress images, such as image frames comprising attribute information and/or spatial for a point cloud and/or an occupancy map for the point cloud. Also, a system includes a decoder configured to decompress compressed image frames, such as image frames comprising compressed attribute and/or spatial information for the point cloud or an occupancy map for the point cloud. Additionally, the encoder may map N-bit data to M-bit code words, where M is less than N. Alternatively the encoder may map N-bit data to M-bit code words, where M is greater than N. In a similar manner, a decoder may map the M-bit code words back to the N-bit data.Type: GrantFiled: July 1, 2019Date of Patent: June 22, 2021Assignee: Apple Inc.Inventors: Alexandros Tourapis, Jungsun Kim, Fabrice A. Robinet, Khaled Mammou, Valery G. Valentin, Yeping Su
-
Patent number: 11037331Abstract: A method for de-contouring a source image is provided. In the method, a reference image is obtained based on the source image. Multi-oriented gradient calculation is performed on each pixel of the reference image, so as to obtain, for each pixel of the reference image, multiple gradient features that respectively correspond to multiple directions. For each pixel of the reference image, a monotonicity index is determined based on the corresponding gradient features. Then, a detail-protecting and de-contour operation is performed on the source image based on the monotonicity indices determined for the pixels of the reference image.Type: GrantFiled: January 21, 2020Date of Patent: June 15, 2021Assignee: NOVATEK MICROELECTRONICS CORP.Inventors: Cong Zhang, Jian-Hua Liang, Yuan-Jia Du
-
Patent number: 11032557Abstract: A moving image decoder (1) includes an intermediate estimated prediction mode deriving section (124) for transforming a prediction mode of each neighbor partition into an intermediate prediction mode included in an intermediate prediction set which is a sum of prediction sets (PS); and an estimated prediction mode deriving section (125) for deriving an estimated prediction mode by estimating a prediction mode of a target partition based on the intermediate prediction mode of each neighbor partition which is obtained by the transform.Type: GrantFiled: April 11, 2019Date of Patent: June 8, 2021Assignee: SHARP KABUSHIKI KAISHAInventors: Tomoyuki Yamamoto, Tomohiro Ikai
-
Patent number: 10984593Abstract: Techniques for designing and optimization of solid/cellular structures are described using a modeling process referred to as high-definition cellular level set in B-splines (HD-CLIBS). With this process, the entire design domain for the solid/cellular structure in question is subdivided into a set of connected volumetric cells in three dimensions. An implicit trivariate B-spline function is defined on each subdomain cell. With this parameterization scheme, constraints can be imposed on the relevant B-spline coefficients to naturally maintain geometric continuities at the connection faces between neighboring cells. The method offers several useful properties and powerful functionalities to build and modify a solid/cellular structure in the modeling process and to conduct topology optimization by directly adjusting the B-spline coefficients. The model construction can be carried out using a fast B-spline interpolation, and the topology optimization can involve a sequence of discrete B-spline convolutions.Type: GrantFiled: October 28, 2019Date of Patent: April 20, 2021Assignee: THE HONG KONG UNIVERSITY OF SCIENCE AND TECHNOLOGYInventor: Michael Yu Wang
-
Patent number: 10945807Abstract: Technology is described for augmenting medical imaging for use in a medical procedure. The method can include the operation of receiving an image of patient anatomy captured by a visual image camera during the medical procedure. An acquired medical image associated with the patient anatomy can then be retrieved. Another operation can be associating the acquired medical image to the patient anatomy. An augmentation tag associated with a location in one layer of the acquired medical image can be retrieved. A further operation can be projecting the acquired medical image and the augmentation tag using an augmented reality headset to form a single graphical view as an overlay to the patient anatomy in either 2D, 3D or holographic form.Type: GrantFiled: February 21, 2018Date of Patent: March 16, 2021Assignee: Novarad CorporationInventors: Wendell Arlen Gibby, Steven Todd Cvetko
-
Patent number: 10924784Abstract: A receiving side can perform interactive processing based on information of an object. Image data is coded to obtain a video stream having coded image data. The video stream is transmitted in a state of being added with information of an object detected on the basis of image data. For example, information of an object includes coded data obtained by coding one-bit data showing a shape of the object, information of a region that is a rectangular area enclosing the object, display priority information of the region, text information that explains the object, and the like. The receiving side can acquire information of an object without the need of detecting an object by processing image data, and without depending on its own performance, and is allowed to perform interactive processing based on information of an object in an excellent manner.Type: GrantFiled: August 17, 2017Date of Patent: February 16, 2021Assignee: SONY CORPORATIONInventor: Ikuo Tsukagoshi
-
Patent number: 10869059Abstract: A system comprises an encoder configured to compress a point cloud comprising a plurality of points each point comprising spatial information for the point. The encoder is configured to sub-sample the points and determine subdivision locations for the subsampled points. Also, the encoder is configured to determine, for respective subdivision location, if a point is to be included, not included, or relocated relative to the subdivision location. The encoder encodes spatial information for the sub-sampled points and encodes subdivision location point inclusion/relocation information to generate a compressed point cloud. A decoder recreates an original or near replica of an original point cloud based on the spatial information and the subdivision location inclusion/relocation information included in the compressed point cloud.Type: GrantFiled: May 11, 2020Date of Patent: December 15, 2020Assignee: Apple Inc.Inventors: Khaled Mammou, Fabrice A. Robinet, Andrea Cremaschi, Alexandros Tourapis
-
Patent number: 10856001Abstract: A polygon unit-based image processing method, and a device for the same are disclosed. Specifically, a method for decoding an image on the basis of a polygon unit can comprise the steps of: deriving a motion vector predictor for a polygon apex forming the polygon unit; deriving a motion vector for the polygon apex on the basis of a motion vector difference for the polygon apex and the motion vector predictor; and deriving a prediction sample for the polygon unit from a division unit, which is specified by the motion vector, in a reference picture.Type: GrantFiled: February 15, 2016Date of Patent: December 1, 2020Assignee: LG ELECTRONICS INC.Inventors: Moonmo Koo, Sehoon Yea, Eunyong Son, Jin Heo
-
Patent number: 10778970Abstract: A method of decoding video data using quantized coefficient components and inter prediction information can include extracting quantized coefficient components and inter prediction information from a received bit stream; applying an inverse scan pattern to the quantized coefficient components to generate a quantized block having a size of a transform unit; generating a quantization parameter per quantization unit which is a unit for deriving a quantization parameter and inverse-quantizing the quantized block to generate a transformed block; generating a residual block by inverse-transforming the transformed block; deriving motion information and generating a prediction block; and generating a reconstructed block by using the residual block and the prediction block.Type: GrantFiled: January 8, 2019Date of Patent: September 15, 2020Assignee: INFOBRIDGE PTD. LTD.Inventors: Soo Mi Oh, Moonock Yang
-
Patent number: 10762407Abstract: A component incorporating a 3-D identification code includes: a component body having an interior bounded by an exterior surface; and an identification code formed as a part of at least one of the interior and the exterior surface, the identification code including a plurality of cells arranged in a three-dimensional space, wherein each of the cells is configured to encode more than two possible values.Type: GrantFiled: April 5, 2017Date of Patent: September 1, 2020Assignee: General Electric CompanyInventor: Scott Alan Gold
-
Patent number: 10764470Abstract: A system and method for tile type based color space transformation improves the performance and image quality of multifunction devices. The disclosed embodiments transform an input image on a tile by tile basis. If a tile is detected as neutral, simple 1D L* to CMYK Tone Reproduction Curves or Look-Up-Tables are used to convert the input pixels of the tile in L*a*b* to output pixels in CMYK. If a tile is detected as containing color content, then the input pixels are chrominance adjusted and subsequently converted to CMYK using regular tetrahedral interpolation.Type: GrantFiled: March 11, 2019Date of Patent: September 1, 2020Assignee: Xerox CorporationInventors: Xing Li, Peter McCandlish, Clara Cuciurean-Zapan
-
Patent number: 10706529Abstract: Digital image processing device and method for holistic evaluation of subtle irregularities in a digital image by using the scale space technique to identify irregularities of interest and by calculating a total irregularity score using a function of intensity, scale and optionally location of the identified irregularities of interest. Specifically, the digital image represents a liquid mixture formed by mixing two or more liquid compositions of different ingredients, colors, viscosities, and/or solubility; the subtle irregularities represent non-homogenous mixing spots or regions in such liquid mixture.Type: GrantFiled: June 7, 2018Date of Patent: July 7, 2020Assignee: The Procter & Gamble CompanyInventors: Fabio Zonfrilli, Qi Zhang