Patents by Inventor Shengyang Dai

Shengyang Dai has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Functional image archiving

Patent number: 11829404

Abstract: Some implementations related to archiving of functional images. In some implementations, a method includes accessing images and determining one or more functional labels corresponding to each of the images and one or more confidence scores corresponding to the functional labels. A functional image score is determined for each of the images based on the functional labels having a corresponding confidence score that meets a respective threshold for the functional labels. In response to determining that the functional image score meets a functional image score threshold, a functional image signal is provided that indicates that one or more of the images that meet the functional image score threshold are functional images. The functional images are determined to be archived, and are archived by associating an archive attribute with the functional images such that functional images having the archive attribute are excluded from display in views of the images.

Type: Grant

Filed: December 11, 2020

Date of Patent: November 28, 2023

Assignee: Google LLC

Inventors: Shinko Cheng, Eunyoung Kim, Shengyang Dai, Madhur Khandelwal, Kristina Eng, David Loxton
Identifying key-value pairs in documents

Patent number: 11816710

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method includes: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair including key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

Type: Grant

Filed: March 1, 2022

Date of Patent: November 14, 2023

Assignee: Google LLC

Inventors: Yang Xu, Jiang Wang, Shengyang Dai
RESOURCE CONSTRAINED NEURAL NETWORK ARCHITECTURE SEARCH

Publication number: 20220414425

Abstract: Methods, and systems, including computer programs encoded on computer storage media for neural network architecture search.

Type: Application

Filed: August 19, 2022

Publication date: December 29, 2022

Applicant: Google LLC

Inventors: Ming-Hsuan Yang, Xiaojie Jin, Joshua Foster Slocum, Shengyang Dai, Jiang Wang
IDENTIFYING KEY-VALUE PAIRS IN DOCUMENTS

Publication number: 20220309549

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method includes: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair including key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

Type: Application

Filed: March 1, 2022

Publication date: September 29, 2022

Applicant: Google LLC

Inventors: Yang Xu, Jiang Wang, Shengyang Dai
Resource constrained neural network architecture search

Patent number: 11443162

Abstract: Methods, and systems, including computer programs encoded on computer storage media for neural network architecture search.

Type: Grant

Filed: August 23, 2019

Date of Patent: September 13, 2022

Assignee: Google LLC

Inventors: Ming-Hsuan Yang, Xiaojie Jin, Joshua Foster Slocum, Shengyang Dai, Jiang Wang
Systems and Methods for Object Detection Using Image Tiling

Publication number: 20220254137

Abstract: A computing system for detecting objects in an image can perform operations including generating an image pyramid that includes a first level corresponding with the image at a first resolution and a second level corresponding with the image at a second resolution. The operations can include tiling the first level and the second level by dividing the first level into a first plurality of tiles and the second level into a second plurality of tiles; inputting the first plurality of tiles and the second plurality of tiles into a machine-learned object detection model; receiving, as an output of the machine-learned object detection model, object detection data that includes bounding boxes respectively defined with respect to individual ones of the first plurality of tiles and the second plurality of tiles; and generating image object detection output by mapping the object detection data onto an image space of the image.

Type: Application

Filed: August 5, 2019

Publication date: August 11, 2022

Inventors: Jilin Tu, Jiang Wang, Huizhong Chen, Xiangxin Zhu, Shengyang Dai
Noise Tolerant Ensemble RCNN for Semi-Supervised Object Detection

Publication number: 20220172456

Abstract: The present disclosure provides systems and methods that include or otherwise leverage an object detection training model for training a machine-learned object detection model. In particular, the training model can obtain first training data and train the machine-learned object detection model using the first training data. The training model can obtain second training data and input the second training data into the machine-learned object detection model, and receive as an output of the machine-learned object detection model, data that describes the location of a detected object of a target category within images from the second training data. The training model can determine mined training data based on the output of the machine-learned object detection model, and train the machine-learned object detection model based on the mined training data.

Type: Application

Filed: March 8, 2019

Publication date: June 2, 2022

Inventors: Jiang Wang, Jiyang Gao, Shengyang Dai
Identifying key-value pairs in documents

Patent number: 11288719

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

Type: Grant

Filed: February 27, 2020

Date of Patent: March 29, 2022

Assignee: Google LLC

Inventors: Yang Xu, Jiang Wang, Shengyang Dai
Sharing images and image albums over a communication network

Patent number: 11146520

Abstract: Implementations relate to sharing images and image albums over a communication network. In some implementations, a computer-implemented method includes determining that a shared album data structure is accessible by a device and includes references to album images. The device determines one or more suggested images from a collection of stored images associated with a user, based on determining sharing scores for the stored images. The sharing scores are based on comparing one or more characteristics of the stored images to one or more corresponding characteristics of the album images. The method includes causing display of the one or more suggested images by the device, receiving a selection of at least one selected image of the suggested images based on received user input, and causing an update of the shared album data structure with the at least one selected image to be accessible over a communication network by recipient users.

Type: Grant

Filed: November 6, 2019

Date of Patent: October 12, 2021

Assignee: Google LLC

Inventors: David Lieb, James Gallagher, Kedar Jayant Kanitkar, Teresa Ko, Loren Puchalla Fiore, Jason Chang, Nan Wang, Jingyu Cui, Shengyang Dai
FUNCTIONAL IMAGE ARCHIVING

Publication number: 20210097353

Abstract: Some implementations related to archiving of functional images. In some implementations, a method includes accessing images and determining one or more functional labels corresponding to each of the images and one or more confidence scores corresponding to the functional labels. A functional image score is determined for each of the images based on the functional labels having a corresponding confidence score that meets a respective threshold for the functional labels. In response to determining that the functional image score meets a functional image score threshold, a functional image signal is provided that indicates that one or more of the images that meet the functional image score threshold are functional images. The functional images are determined to be archived, and are archived by associating an archive attribute with the functional images such that functional images having the archive attribute are excluded from display in views of the images.

Type: Application

Filed: December 11, 2020

Publication date: April 1, 2021

Applicant: Google LLC

Inventors: Shinko CHENG, Eunyoung KIM, Shengyang DAI, Madhur KHANDELWAL, Kristina ENG, David LOXTON
RESOURCE CONSTRAINED NEURAL NETWORK ARCHITECTURE SEARCH

Publication number: 20210056378

Abstract: Methods, and systems, including computer programs encoded on computer storage media for neural network architecture search.

Type: Application

Filed: August 23, 2019

Publication date: February 25, 2021

Inventors: Ming-Hsuan Yang, Xiaojie Jin, Joshua Foster Slocum, Shengyang Dai, Jiang Wang
Functional image archiving

Patent number: 10891526

Abstract: Some implementations related to archiving of functional images. In some implementations, a method includes accessing images and determining one or more functional labels corresponding to each of the images and one or more confidence scores corresponding to the functional labels. A functional image score is determined for each of the images based on the functional labels having a corresponding confidence score that meets a respective threshold for the functional labels. In response to determining that the functional image score meets a functional image score threshold, a functional image signal is provided that indicates that one or more of the images that meet the functional image score threshold are functional images. The functional images are determined to be archived, and are archived by associating an archive attribute with the functional images such that functional images having the archive attribute are excluded from display in views of the images.

Type: Grant

Filed: December 21, 2018

Date of Patent: January 12, 2021

Assignee: Google LLC

Inventors: Shinko Cheng, Eunyoung Kim, Shengyang Dai, Madhur Khandelwal, Kristina Eng, David Loxton
IDENTIFYING KEY-VALUE PAIRS IN DOCUMENTS

Publication number: 20200273078

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

Type: Application

Filed: February 27, 2020

Publication date: August 27, 2020

Inventors: Yang Xu, Jiang Wang, Shengyang Dai
Generating videos of media items associated with a user

Patent number: 10685680

Abstract: A method includes grouping media items associated with a user into segments based on a timestamp associated with each media item and a total number of media items. The method also includes selecting target media from the media items for each of the segments based on media attributes associated with the media item. The method also includes generating a video that includes the target media for each of the segments by generating a first animation that illustrates a first transition from a first item from the target media to a second item from the target media with movement of the first item from an onscreen location to an offscreen location, wherein the first item is adjacent to the second item in the first animation and determining whether the target media includes one or more additional items. The method also includes adding a song to the video.

Type: Grant

Filed: March 20, 2019

Date of Patent: June 16, 2020

Assignee: Google LLC

Inventors: Shengyang Dai, Timothy Sepkoski St. Clair, Koji Ashida, Jingyu Cui, Jay Steele, Qi Gu, Erik Murphy-Chutorian, Ivan Neulander, Flavio Lerda, Eric Charles Henry, Shinko Yuanhsien Cheng, Aravind Krishnaswamy, David Cohen, Pardis Beikzadeh
SHARING IMAGES AND IMAGE ALBUMS OVER A COMMUNICATION NETWORK

Publication number: 20200076756

Abstract: Implementations relate to sharing images and image albums over a communication network. In some implementations, a computer-implemented method includes determining that a shared album data structure is accessible by a device and includes references to album images. The device determines one or more suggested images from a collection of stored images associated with a user, based on determining sharing scores for the stored images. The sharing scores are based on comparing one or more characteristics of the stored images to one or more corresponding characteristics of the album images. The method includes causing display of the one or more suggested images by the device, receiving a selection of at least one selected image of the suggested images based on received user input, and causing an update of the shared album data structure with the at least one selected image to be accessible over a communication network by recipient users.

Type: Application

Filed: November 6, 2019

Publication date: March 5, 2020

Applicant: Google LLC

Inventors: David LIEB, James GALLAGHER, Kedar Jayant KANITKAR, Teresa KO, Loren PUCHALLA FIORE, Jason CHANG, Nan WANG, Jingyu CUI, Shengyang DAI
Sharing images and image albums over a communication network

Patent number: 10476827

Abstract: Implementations relate to sharing images and image albums over a communication network. In some implementations, a computer-implemented method includes determining that a shared album data structure is accessible by a device and includes references to album images. The device determines one or more suggested images from a collection of stored images associated with a user, based on determining sharing scores for the stored images. The sharing scores are based on comparing one or more characteristics of the stored images to one or more corresponding characteristics of the album images. The method includes causing display of the one or more suggested images by the device, receiving a selection of at least one selected image of the suggested images based on received user input, and causing an update of the shared album data structure with the at least one selected image to be accessible over a communication network by recipient users.

Type: Grant

Filed: September 27, 2016

Date of Patent: November 12, 2019

Assignee: Google LLC

Inventors: David Lieb, James Gallagher, Kedar Jayant Kanitkar, Teresa Ko, Loren Puchalla Fiore, Jason Chang, Nan Wang, Jingyu Cui, Shengyang Dai
GENERATING VIDEOS OF MEDIA ITEMS ASSOCIATED WITH A USER

Publication number: 20190252001

Abstract: A method includes grouping media items associated with a user into segments based on a timestamp associated with each media item and a total number of media items. The method also includes selecting target media from the media items for each of the segments based on media attributes associated with the media item. The method also includes generating a video that includes the target media for each of the segments by generating a first animation that illustrates a first transition from a first item from the target media to a second item from the target media with movement of the first item from an onscreen location to an offscreen location, wherein the first item is adjacent to the second item in the first animation and determining whether the target media includes one or more additional items. The method also includes adding a song to the video.

Type: Application

Filed: March 20, 2019

Publication date: August 15, 2019

Applicant: Google LLC

Inventors: Shengyang DAI, Timothy Sepkoski ST. CLAIR, Koji ASHIDA, Jingyu CUI, Jay STEELE, Qi GU, Erik MURPHY-CHUTORIAN, Ivan NEULANDER, Flavio LERDA, Eric Charles HENRY, Shinko Yuanhsien CHENG, Aravind KRISHNASWAMY, David COHEN, Pardis BEIKZADEH
FUNCTIONAL IMAGE ARCHIVING

Publication number: 20190197364

Abstract: Some implementations related to archiving of functional images. In some implementations, a method includes accessing images and determining one or more functional labels corresponding to each of the images and one or more confidence scores corresponding to the functional labels. A functional image score is determined for each of the images based on the functional labels having a corresponding confidence score that meets a respective threshold for the functional labels. In response to determining that the functional image score meets a functional image score threshold, a functional image signal is provided that indicates that one or more of the images that meet the functional image score threshold are functional images. The functional images are determined to be archived, and are archived by associating an archive attribute with the functional images such that functional images having the archive attribute are excluded from display in views of the images.

Type: Application

Filed: December 21, 2018

Publication date: June 27, 2019

Applicant: Google LLC

Inventors: Shinko CHENG, Eunyoung KIM, Shengyang DAI, Madhur KHANDELWAL, Kristina ENG, David LOXTON
Generating videos of media items associated with a user

Patent number: 10242711

Abstract: A method includes grouping media items associated with a user into segments based on a timestamp associated with each media item and a total number of media items. The method also includes selecting target media from the media items for each of the segments based on media attributes associated with the media item. The method also includes generating a video that includes the target media for each of the segments by generating a first animation that illustrates a first transition from a first item from the target media to a second item from the target media with movement of the first item from an onscreen location to an offscreen location, wherein the first item is adjacent to the second item in the first animation and determining whether the target media includes one or more additional items. The method also includes adding a song to the video.

Type: Grant

Filed: June 26, 2017

Date of Patent: March 26, 2019

Assignee: Google LLC

Inventors: Shengyang Dai, Timothy Sepkoski St. Clair, Koji Ashida, Jingyu Cui, Jay Steele, Qi Gu, Erik Murphy-Chutorian, Ivan Neulander, Flavio Lerda, Eric Charles Henry, Shinko Yuanhsien Cheng, Aravind Krishnaswamy, David Cohen, Pardis Beikzadeh
Generating image compositions

Patent number: 9965882

Abstract: Implementations generally relate to generating image compositions. In some implementations, a method includes receiving a plurality of photos from a user and determining one or more composition types from the photos. The method further includes generating one or more compositions from the received photos based on the one or more determined composition types, where each composition is based on modified foregrounds of the photos. The method further includes providing the one or more generated compositions to the user.

Type: Grant

Filed: October 3, 2016

Date of Patent: May 8, 2018

Assignee: Google LLC

Inventors: Erik Murphy-Chutorian, Matthew Steiner, Vahid Kazemi, Shengyang Dai

1 2 3 4 next