Patents by Inventor Xiyang Luo
Xiyang Luo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240346546Abstract: Systems, devices, methods, and computer readable medium for evaluating visual quality of digital content are disclosed. Methods can include identifying content assets including one or more images that are combined to create different digital components distributed to one or more client devices. A quality of each of the one or more images is evaluated using one or more machine learning models trained to evaluate one or more visual aspects that are deemed indicative of visual quality. An aggregate quality for the content assets is determined based, at least in part, on an output of the one or more machine learning models indicating the visual quality of each of the one or more images. A graphical user interface of a first computing device is updated to present a visual indication of the aggregate quality of the content assets.Type: ApplicationFiled: February 22, 2024Publication date: October 17, 2024Inventors: Catherine Shyu, Luying Li, Feng Yang, Junjie Ke, Xiyang Luo, Hao Feng, Chao-Hung Chen, Wenjing Kang, Zheng Xia, Shun-Chuan Chen, Yicong Tian, Xia Li, Han Ke
-
Patent number: 11960793Abstract: A method can perform a process with a method including capturing an image, determining an environment that a user is operating a computing device, detecting a hand gesture based on an object in the image, determining, using a machine learned model, an intent of a user based on the hand gesture and the environment, and executing a task based at least on the determined intent.Type: GrantFiled: December 30, 2022Date of Patent: April 16, 2024Assignee: GOOGLE LLCInventors: Archana Kannan, Roza Chojnacka, Jamieson Kerns, Xiyang Luo, Meltem Oktem, Nada Elassal
-
Publication number: 20240087075Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating and decoding watermarks. An image and a data item is received. The encoder generates a first watermark and then a second watermark is generated using multiple first watermarks. The second watermark is used to watermark the image by overlaying the second watermark over the image. To decode the watermark, presence of a watermark is determined on a portion of an image. A distortion model determines distortions in the image and modifies the portion of the image based on the predicted distortions. The modified portion is decoded using the decoder to obtain a predicted first data item that is further used to validate the watermark based on the first data item.Type: ApplicationFiled: January 11, 2022Publication date: March 14, 2024Inventors: Xiyang Luo, Feng Yang, Elnaz Barshan Tashnizi, Dake He, Ryan Matthew Haggarty, Michael Gene Goebel
-
Publication number: 20240020788Abstract: Systems and methods of the present disclosure are directed to a computing system. The computing system can obtain a message vector and video data comprising a plurality of video frames. The computing system can process the input video with a transformation portion of a machine-learned watermark encoding model to obtain a three-dimensional feature encoding of the input video. The computing system can process the three-dimensional feature encoding of the input video and the message vector with an embedding portion of the machine-learned watermark encoding model to obtain spatial-temporal watermark encoding data descriptive of the message vector. The computing system can generate encoded video data comprising a plurality of encoded video frames, wherein at least one of the plurality of encoded video frames includes the spatial-temporal watermark encoding data.Type: ApplicationFiled: March 24, 2021Publication date: January 18, 2024Inventors: Xiyang Luo, Feng Yang, Ce Liu, Huiwen Chang, Peyman Milanfar, Yinxiao Li
-
Publication number: 20240005563Abstract: Example embodiments allow for training of encoders (e.g., artificial neural networks (ANNs)) to facilitate dithering of images that have been subject to quantization in order to reduce the number of colors and/or size of the images. Such a trained encoder generates a dithering image from an input quantized image that can be combined, by addition or by some other process, with the quantized image to result in a dithered output image that exhibits reduced banding or is otherwise aesthetically improved relative to the un-dithered quantized image. The use of a trained encoder to facilitate dithering of quantized images allows the dithering to be performed in a known period of time using a known amount of memory, in contrast to alternative iterative dithering methods. Additionally, the trained encoder can be differentiable, allowing it to be part of a deep learning image processing pipeline or other machine learning pipeline.Type: ApplicationFiled: September 12, 2023Publication date: January 4, 2024Inventors: Innfarn Yoo, Xiyang Luo, Feng Yang
-
Publication number: 20230362399Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for jointly training an encoder that generates a watermark and a decoder that decodes a data item encoded within the watermark. The training comprises obtaining a plurality of training images and data items. For each training image, a first watermark is generated using an encoder and a subsequent second watermark is generated by tiling two or more first watermarks. The training image is watermarked using the second watermark to generate a first error value and distortions are added to the watermarked image. A distortion detector predicts the distortions based on which the distorted image is modified. The modified image is decoded by the decoder to generate a predicted data item and a second error value. The training parameters of the encoder and decoder are adjusted based on the first and the second error value.Type: ApplicationFiled: January 11, 2022Publication date: November 9, 2023Inventors: Xiyang Luo, Feng Yang, Elnaz Barshan Tashnizi, Dake He, Ryan Matthew Haggarty, Michael Gene Goebel
-
Patent number: 11790564Abstract: Example embodiments allow for training of encoders (e.g., artificial neural networks (ANNs)) to facilitate dithering of images that have been subject to quantization in order to reduce the number of colors and/or size of the images. Such a trained encoder generates a dithering image from an input quantized image that can be combined, by addition or by some other process, with the quantized image to result in a dithered output image that exhibits reduced banding or is otherwise aesthetically improved relative to the un-dithered quantized image. The use of a trained encoder to facilitate dithering of quantized images allows the dithering to be performed in a known period of time using a known amount of memory, in contrast to alternative iterative dithering methods. Additionally, the trained encoder can be differentiable, allowing it to be part of a deep learning image processing pipeline or other machine learning pipeline.Type: GrantFiled: March 30, 2020Date of Patent: October 17, 2023Assignee: Google LLCInventors: Innfarn Yoo, Xiyang Luo, Feng Yang
-
Publication number: 20230325961Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a visually imperceptible or a visually perceptible watermark and outputting a result based on the determination. A watermark decoder receives an input image. The watermark decoder applies a decoder machine learning model to decode a watermarks at different levels of zoom. The water mark decoder determines whether a watermark was decoded to obtain a decoded watermark. The watermark decoder outputs a result based on the determination whether the watermark was decoded through application of the decoder machine learning model to the input image that includes outputting a zoomed output decoded through application of the decoder machine learning model to the input image.Type: ApplicationFiled: June 21, 2021Publication date: October 12, 2023Inventors: Dake He, Tianhao Zhang, Elnaz Barshan Tashnizi, Xiyang Luo, Huiwen Chang, Feng Yang, Ryan Matthew Haggarty
-
Publication number: 20230325959Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting and decoding a visually imperceptible or perceptible watermark. A watermark detection apparatus determines whether the particular image includes a visually imperceptible or perceptible watermark using detector a machine learning model. If the watermark detection apparatus detects a watermark, the particular image is routed to a watermark decoder. If the watermark detection apparatus cannot detect a watermark in the particular image, the particular image is filtered from further processing. The watermark decoder decodes the visually imperceptible or perceptible watermark detected in the particular image. After decoding, an item depicted in the particular image is validated based data extracted from the decoded visually imperceptible or perceptible watermark.Type: ApplicationFiled: June 21, 2021Publication date: October 12, 2023Inventors: Dake He, Tianhao Zhang, Elnaz Barshan Tashnizi, Xiyang Luo, Huiwen Chang, Feng Yang, Ryan Matthew Haggarty
-
Publication number: 20230289134Abstract: A method can perform a process with a method including capturing an image, determining an environment that a user is operating a computing device, detecting a hand gesture based on an object in the image, determining, using a machine learned model, an intent of a user based on the hand gesture and the environment, and executing a task based at least on the determined intent.Type: ApplicationFiled: December 30, 2022Publication date: September 14, 2023Inventors: Archana Kannan, Roza Chojnacka, Jamieson Kerns, Xiyang Luo, Meltem Oktem, Nada Elassal
-
Publication number: 20230214953Abstract: Systems and methods are directed to a computing system. The computing system can include one or more processors, a message embedding model, a message extraction model, and a first set of instructions that cause the computing system to perform operations including obtaining the three-dimensional image data and the message vector. The operations can include inputting three-dimensional image data and a message vector into the message embedding model to obtain encoded three-dimensional image data. The operations can include using the message extraction model to extract an embedded message from the encoded three-dimensional image data to obtain a reconstructed message vector. The operations can include evaluating a loss function for a difference between the reconstructed message vector and the message vector and modifying values for parameters of at least the message embedding model based on the loss function.Type: ApplicationFiled: June 5, 2020Publication date: July 6, 2023Inventors: Innfarn Yoo, Xiyang Luo, Feng Yang, Ondrej Stava
-
Publication number: 20230130410Abstract: Methods, systems, and computer programs encoded on a computer storage medium, that relate to generating quantization tables that are used during digital image compression of a digital image. Multiple training images are obtained. A model can be trained using the training images to generate a quantization table that can be used during encoding of an input image. For each training image, a quantization table can be obtained using the model. Using the quantization table, an encoded digital image is obtained for the training image. Using the encoded digital image and the training image, an image quality loss and a compression loss can be determined. An overall loss of the model can be determined by combining the image quality loss and the compression loss for the training image. The model can be updated based on the overall loss.Type: ApplicationFiled: April 17, 2020Publication date: April 27, 2023Inventors: Xiyang Luo, Feng Yang, Hossein Talebi
-
Publication number: 20230111326Abstract: Methods, systems, and computer programs encoded on a computer storage medium, that relate to extracting digital watermarks from images, irrespective of distortions introduced into these images. Methods can include inputting a first data item into a channel encoder that can generate a first encoded data item that is greater in length than the first data item and that (1) includes the input data item and (2) new data this is redundant of the input data item. Based on the first encoded data item and a first image, an encoder model can generate a first encoded image into which the first encoded data is embedded as a digital watermark. A decoder model can decode the first encoded data item to generate a second data, which can be decoded by the channel decoder to generate data that is predicted to be the first data.Type: ApplicationFiled: January 13, 2020Publication date: April 13, 2023Inventors: Ruohan Zhan, Feng Yang, Xiyang Luo, Peyman Milanfar, Huiwen Chang, Ce Liu
-
Publication number: 20230053317Abstract: Example embodiments allow for training of encoders (e.g., artificial neural networks (ANNs)) to generate a color palette based on an input image. The color palette can then be used to generate, using the input image, a quantized, reduced color depth image that corresponds to the input image. Differences between a plurality of such input images and corresponding quantized images are used to train the encoder. Encoders trained in this manner are especially suited for generating color palettes used to convert images into different reduced color depth image file formats. Such an encoder also has benefits, with respect to memory use and computational time or cost, relative to the median-cut algorithm or other methods for producing reduced color depth color palettes for images.Type: ApplicationFiled: January 8, 2020Publication date: February 16, 2023Inventors: Xiyang LUO, Innfarn YOO, Feng YANG
-
Patent number: 11543888Abstract: A method can perform a process with a method including capturing an image, determining an environment that a user is operating a computing device, detecting a hand gesture based on an object in the image, determining, using a machine learned model, an intent of a user based on the hand gesture and the environment, and executing a task based at least on the determined intent.Type: GrantFiled: June 25, 2020Date of Patent: January 3, 2023Assignee: GOOGLE LLCInventors: Archana Kannan, Roza Chojnacka, Jamieson Kerns, Xiyang Luo, Meltem Oktem, Nada Elassal
-
Publication number: 20220358537Abstract: Systems, devices, methods, and computer readable medium for evaluating visual quality of digital content are disclosed. Methods can include identifying content assets including one or more images that are combined to create different digital components distributed to one or more client devices. A quality of each of the one or more images is evaluated using one or more machine learning models trained to evaluate one or more visual aspects that are deemed indicative of visual quality. An aggregate quality for the content assets is determined based, at least in part, on an output of the one or more machine learning models indicating the visual quality of each of the one or more images. A graphical user interface of a first computing device is updated to present a visual indication of the aggregate quality of the content assets.Type: ApplicationFiled: August 6, 2020Publication date: November 10, 2022Inventors: Catherine Shyu, Luying Li, Feng Yang, Junjie Ke, Xiyang Luo, Hao Feng, Chao-Hung Chen, Wenjing Kang, Zheng Xia, Shun-Chuan Chen, Yicong Tian, Xia Li, Han Ke
-
Publication number: 20220335560Abstract: A computer-implemented method that provides watermark-based image reconstruction to compensate for lossy encoding schemes. The method can generate a difference image describing the data loss associated with encoding an image using a lossy encoding scheme. The difference image can be encoded as a message and embedded in the encoded image using a watermark and later extracted from the encoded image. The difference image can be added to the encoded image to reconstruct the original image. As an example, an input image encoded using a lossy JPEG compression scheme can be embedded with the lost data and later reconstructed, using the embedded data, to a fidelity level that is identical or substantially similar to the original.Type: ApplicationFiled: May 12, 2019Publication date: October 20, 2022Inventors: Innfarn Yoo, Feng Yang, Xiyang Luo
-
Publication number: 20220301141Abstract: Systems, devices, methods, and computer readable medium for evaluating visual quality of digital content are disclosed. Methods can include training machine learning models on images. A request is received to evaluate quality of an image included in a current version of a digital component generated by the computing device. The machine learning models are deployed on the image to generate a score for each quality characteristic of the image. A weight is assigned to each score to generate weighted scores. The weighted scores are combined to generate a combined score for the image. The combined score is compared to one or more thresholds to generate a quality of the image.Type: ApplicationFiled: August 6, 2020Publication date: September 22, 2022Inventors: Catherine Shyu, Xiyang Luo, Feng Yang, Junjie Ke, Yicong Tian, Chao-Hung Chen, Xia Li, Luying Li, Wenjing Kang, Shun-Chuan Chen
-
Patent number: 11429894Abstract: Example aspects of the present disclosure are directed to systems and methods for learning classification models which satisfy constraints such as, for example, constraints that can be expressed as a predicted positive rate or negative rate on a subset of the training dataset. In particular, through the use of quantile estimators, the systems and methods of the present disclosure can transform a constrained optimization problem into an unconstrained optimization problem that is solved more efficiently and generally than the constrained optimization problem. As one example, the unconstrained optimization problem can include optimizing an objective function where a decision threshold of the classification model is expressed as an estimator of a quantile function on the classification scores of the machine-learned classification model for a subset of the training dataset at a desired quantile.Type: GrantFiled: February 28, 2019Date of Patent: August 30, 2022Assignee: GOOGLE LLCInventors: Elad Edwin Tzvi Eban, Alan Mackey, Xiyang Luo
-
Patent number: 11158128Abstract: A system and method may provide for spatial and semantic auto-completion of an augmented or mixed reality environment. The system may detect physical objects in a physical environment based on analysis of image frames captured by an image sensor of a computing device. The system may detect spaces in the physical environment that are occupied by the detected physical objects, and may detect spaces that are unoccupied in the physical environment. Based on the identification of the detected physical objects, the system may gain a semantic understanding of the physical environment, and may determine suggested objects for placement in the physical environment based on the semantic understanding. The system may place virtual representations of the suggested objects in a mixed reality scene of the physical environment for user consideration.Type: GrantFiled: April 26, 2019Date of Patent: October 26, 2021Assignee: GOOGLE LLCInventors: Roza Chojnacka, Meltem Oktem, Rajan Patel, Uday Idnani, Xiyang Luo