Abstract: Generating a compound predictor block of a current block of video can include generating, for the current block, predictor blocks comprising a first predictor block including first predictor pixels and a second predictor block including second predictor pixels; using at least a subset of the first predictor pixels to determine a first weight for a first predictor pixel of the first predictor pixels; obtaining a second weight for a second predictor pixel of the second predictor pixels, where the second predictor pixel is co-located with the first predictor pixel; and generating the compound predictor block by combining the first predictor block and the second predictor block, where the compound predictor block includes a weighted pixel that is determined using a weighted sum of the first predictor pixel and the second predictor pixel using the first weight and the second weight, respectively.
Type:
Grant
Filed:
October 19, 2020
Date of Patent:
May 24, 2022
Assignee:
GOOGLE LLC
Inventors:
Debargha Mukherjee, James Bankoski, Yue Chen, Yuxin Liu, Sarah Parker
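The per-pixel weighted combination described in the compound-predictor abstract above can be illustrated with a minimal sketch. The abstract does not say how the weights are derived, so the rule used here (a weight that grows as the 3x3 neighbourhood around the pixel gets smoother) is purely an assumption for illustration, as are the function names `local_weight` and `compound_predictor`.

```python
def local_weight(block, r, c):
    """Hypothetical weighting rule (an assumption, not from the abstract):
    use the pixels around (r, c) in the same predictor block and return a
    larger weight where that neighbourhood has lower variance."""
    rows, cols = len(block), len(block[0])
    neigh = [block[i][j]
             for i in range(max(0, r - 1), min(rows, r + 2))
             for j in range(max(0, c - 1), min(cols, c + 2))]
    mean = sum(neigh) / len(neigh)
    var = sum((v - mean) ** 2 for v in neigh) / len(neigh)
    return 1.0 / (1.0 + var)


def compound_predictor(first, second):
    """Combine two equally sized predictor blocks pixel by pixel using a
    normalized weighted sum of co-located pixels."""
    rows, cols = len(first), len(first[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            w1 = local_weight(first, r, c)   # weight for the first predictor pixel
            w2 = local_weight(second, r, c)  # weight for the co-located pixel
            out[r][c] = round((w1 * first[r][c] + w2 * second[r][c]) / (w1 + w2))
    return out
```

Because the weights are normalized, each output pixel always lies between the two co-located predictor pixels; a real codec would more likely use integer weights and shifts rather than floating-point division.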
Abstract: Implementations relate to generating and/or executing a customized interactive dialog application. The customized interactive dialog application may be generated from a state mapping tool that allows a user to generate custom states and custom transitions between the custom states. A customized configuration description is then generated based on the generated custom states and custom transitions. Further, a default configuration description is identified that includes additional or alternative states and transitions. In executing the customized interactive dialog application, dialog turns are generated based on the states and transition information, with the customized configuration description taking precedence and the default configuration description being utilized for any undefined states and/or transitions. Implementations additionally or alternatively relate to generating and/or executing a custom agent based on generated custom states and custom transitions, and a default configuration description.
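The precedence rule in the abstract above (customized configuration first, default configuration for anything left undefined) can be sketched as a two-level lookup. The dictionary layout, the state names, and the helpers `prompt_for` and `next_state` are all hypothetical; the abstract does not specify a configuration format.

```python
# Hypothetical default configuration: states and transitions that exist
# for every dialog application.
DEFAULT_CONFIG = {
    "welcome": {"prompt": "Hello! How can I help?",
                "transitions": {"help": "help", "bye": "goodbye"}},
    "help": {"prompt": "Say 'bye' to exit.", "transitions": {"bye": "goodbye"}},
    "goodbye": {"prompt": "Goodbye!", "transitions": {}},
}

# Hypothetical customized configuration produced by a state mapping tool.
CUSTOM_CONFIG = {
    "welcome": {"prompt": "Welcome to the trivia game!",
                "transitions": {"play": "question"}},
    "question": {"prompt": "What is 2 + 2?", "transitions": {"4": "welcome"}},
}


def prompt_for(state, custom=CUSTOM_CONFIG, default=DEFAULT_CONFIG):
    """Return the prompt for a state, preferring the custom definition."""
    for config in (custom, default):
        if state in config:
            return config[state]["prompt"]
    raise KeyError(state)


def next_state(state, user_input, custom=CUSTOM_CONFIG, default=DEFAULT_CONFIG):
    """Follow a custom transition if one is defined for this input;
    otherwise fall back to the default transition; otherwise stay put."""
    for config in (custom, default):
        transitions = config.get(state, {}).get("transitions", {})
        if user_input in transitions:
            return transitions[user_input]
    return state
```

In this sketch, saying "bye" in the customized `welcome` state still reaches `goodbye`, because that transition is undefined in the custom configuration and is therefore taken from the default one.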
Abstract: Implementations provided herein relate to correlating available input gestures to recently created application functions, and adapting available input gestures, and/or user-created input gestures, to be correlated with existing application functions. Available input gestures (e.g., a hand wave) can be those that can be readily performed upon setup of a computing device. When a user installs an application that is not initially configured to handle the available input gestures, the available input gestures can be correlated to certain functions of the application. Furthermore, a user can create new gestures for application actions and/or modify existing gestures according to their own preferences and/or physical capabilities. When multiple users elect to modify an existing gesture in the same way, the modification can be made universal, with permission from the users, in order to eliminate latency when subsequently adapting to preferences of other users.
Abstract: Generating audio tracks is provided. The system selects a digital component object having a visual output format. The system determines to convert the digital component object into an audio output format. The system generates text for the digital component object. The system selects, based on context of the digital component object, a digital voice to render the text. The system constructs a baseline audio track of the digital component object with the text rendered by the digital voice. The system generates, based on the digital component object, non-spoken audio cues. The system combines the non-spoken audio cues with the baseline audio form of the digital component object to generate an audio track of the digital component object. The system provides the audio track of the digital component object to the computing device for output via a speaker of the computing device.
Abstract: A dual-screen client device includes a user interface for providing content and application tools to a user in an easy-to-use manner. Different inputs of the user interface are presented to the user on one or both of the screens depending on a current state of the device and in response to user operation. Content such as an item of interest in a particular application is presented on one screen while tools or other inputs for that application are provided on the other screen. Through user operation of the interface or physical manipulation of the client device, the content and tools are swapped between the dual screens. For instance, buttons and menus can be brought up front to a closer screen for quick input, while active content (e.g., video chat, full image, drawings) is presented on the farther screen.
Abstract: The present disclosure provides systems and methods that perform face reconstruction based on an image of a face. In particular, one example system of the present disclosure combines a machine-learned image recognition model with a face modeler that uses a morphable model of a human's facial appearance. The image recognition model can be a deep learning model that generates an embedding in response to receipt of an image (e.g., an uncontrolled image of a face). The example system can further include a small, lightweight, translation model structurally positioned between the image recognition model and the face modeler. The translation model can be a machine-learned model that is trained to receive the embedding generated by the image recognition model and, in response, output a plurality of facial modeling parameter values usable by the face modeler to generate a model of the face.
Type:
Grant
Filed:
April 24, 2020
Date of Patent:
May 17, 2022
Assignee:
GOOGLE LLC
Inventors:
Forrester H. Cole, Dilip Krishnan, William T. Freeman, David Benjamin Belanger
Abstract: A technique is provided to determine whether a region within a web page is viewable to a user through a browser window. Often, browsers will only show part of a web page at a given time, creating a difficulty in establishing whether a region of the web page, which may be an advertisement for example, is in view. This is addressed by providing one or more test features within the region, rendering the web page, monitoring a behavioural characteristic of the test features and determining whether the region is in view based on the monitored behavioural characteristic, wherein the behavioural characteristic varies according to whether the test feature is currently being displayed. One example of a behavioural characteristic is a frame progression rate. Browsers will typically redraw elements of a web page at a higher rate if they are currently in view through the browser window, and this characteristic can therefore be used to determine whether the test feature, and thus the region, is in view.
Type:
Grant
Filed:
February 17, 2017
Date of Patent:
May 17, 2022
Assignee:
GOOGLE LLC
Inventors:
Douglas De Jager, Simon Overell, Ben Hodgson
Abstract: Identifier dependent operation processing of packet based data communication is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A content selector component can select, based on the request or trigger keyword, a content item. A link generation component can determine whether the client computing device has an account or a record in a database associated with the service provider device. In the absence of the record or account, the link generation component generates and sends a virtual identifier to the service provider device with instructions to generate an account in the database using the virtual identifier. Once the account is created, the service provider device can communicate with the client computing device.
Abstract: Systems and techniques are provided for determination of user presence and absence using WiFi connections. Reports may be received from WiFi access points in an environment. The reports may include an identifier of a WiFi device, an indication of a connection to or disconnection from a WiFi access point, a time of the connection or disconnection, and an identifier of the WiFi access point. A connection sequence for the WiFi device may be generated from the reports. Whether the WiFi device is present in or absent from the environment as of a specified time may be determined based on the connection sequence. An indication of presence for a user associated with the WiFi device may be generated if the WiFi device is present in the environment. An indication of absence for the user associated with the WiFi device may be generated if the WiFi device is absent from the environment.
Type:
Grant
Filed:
April 30, 2020
Date of Patent:
May 17, 2022
Assignee:
GOOGLE LLC
Inventors:
Marci Meingast, Andrew Axley, Daniele Midi
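The report-to-presence flow in the WiFi abstract above can be sketched as follows. The `Report` fields mirror the four items the abstract lists; the decision rule (a device is present if, as of the specified time, it is still connected to at least one access point) is an assumption, since the abstract does not state how the connection sequence is evaluated.

```python
from typing import NamedTuple


class Report(NamedTuple):
    device_id: str        # identifier of the WiFi device
    connected: bool       # True for a connection, False for a disconnection
    time: float           # time of the connection or disconnection
    access_point_id: str  # identifier of the reporting access point


def connection_sequence(reports, device_id):
    """Build the time-ordered connection sequence for one device."""
    return sorted((r for r in reports if r.device_id == device_id),
                  key=lambda r: r.time)


def is_present(sequence, as_of):
    """Assumed rule: replay events up to `as_of` and report presence if
    the device remains connected to at least one access point."""
    connected_aps = set()
    for report in sequence:
        if report.time > as_of:
            break
        if report.connected:
            connected_aps.add(report.access_point_id)
        else:
            connected_aps.discard(report.access_point_id)
    return bool(connected_aps)
```

Tracking the set of connected access points, rather than only the most recent event, keeps a device marked present while it roams from one access point to another inside the environment.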
Abstract: Automated monitoring of a voice communication session, when the session is in an on hold status, to determine when the session is no longer in the on hold status. When it is determined that the session is no longer in the on hold status, user interface output is rendered that is perceptible to a calling user that initiated the session, and that indicates that the on hold status of the session has ceased. In some implementations, an audio stream of the session can be monitored to determine, based on processing of the audio stream, a candidate end of the on hold status. In response, a response solicitation signal is injected into an outgoing portion of the audio. The audio stream can be further monitored for a response (if any) to the response solicitation signal. The response (if any) can be processed to determine whether the end of the on hold status is an actual end of the on hold status.
Abstract: At least one aspect is directed to determining an estimate of an intersection of user identifiers in a first set of user identifiers and a second set of user identifiers. The first and second sets of user identifiers can be populated with user identifiers that have interacted with the same content item or content item campaign. Estimates of intersections of the first and the second sets can be determined based on a binomial vector approach, a vector of counts approach, or a hybrid approach. The binomial vector approach generates vectors based on k hashes of each user identifier in the first set and sums the vectors to generate a first vector. The intersection can be determined based on a dot product of the first vector and a second vector similarly generated from the second set of user identifiers.
Type:
Grant
Filed:
October 3, 2019
Date of Patent:
May 17, 2022
Assignee:
GOOGLE LLC
Inventors:
Michael Daub, Scott Schneider, Joseph Sean Cahill Goodknight Knightbrook, Sheng Ma, Laura Book
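One way to read the hash-sum-dot-product idea in the intersection abstract above is sketched below. This is an illustrative interpretation, not the construction specified in the patent: each identifier is mapped to k pseudo-random +1/-1 entries (treating each of k hash bits as one hash), the per-identifier vectors are summed, and the dot product of the two sums is divided by k. Shared identifiers contribute exactly k to the dot product, while unrelated pairs cancel out on average. The function names are hypothetical.

```python
import hashlib


def sign_vector(user_id, k):
    """Map an identifier to k pseudo-random +1/-1 entries derived from
    k bits of a SHAKE-256 digest (an assumed choice of hash)."""
    digest = hashlib.shake_256(user_id.encode("utf-8")).digest((k + 7) // 8)
    bits = int.from_bytes(digest, "big")
    return [1 if (bits >> j) & 1 else -1 for j in range(k)]


def sketch(user_ids, k):
    """Sum the per-identifier vectors into a single length-k vector."""
    total = [0] * k
    for uid in user_ids:
        for j, s in enumerate(sign_vector(uid, k)):
            total[j] += s
    return total


def estimate_intersection(sketch_a, sketch_b):
    """Estimate the intersection size as the dot product divided by k."""
    k = len(sketch_a)
    return sum(a * b for a, b in zip(sketch_a, sketch_b)) / k
```

Under this reading, the estimate is unbiased and its variance is roughly the product of the two set sizes divided by k, so a larger k gives a tighter estimate at the cost of longer vectors.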
Abstract: A computer-implemented method is disclosed. The method includes receiving from a remote device a search query, generating a plurality of different category-directed result sets for the search query, determining an order for the plurality of category-directed result sets based on the search query, and transmitting the plurality of category-directed result sets to the remote device, in a manner that the result sets are to be displayed in the remote device in the determined order.
Type:
Grant
Filed:
August 28, 2020
Date of Patent:
May 17, 2022
Assignee:
GOOGLE LLC
Inventors:
Yael Shacham Shazeer, Leland Rechis, Scott Jenson, Gabriel Wolosin
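The ordering step in the search abstract above can be sketched with a toy scoring rule. The abstract says only that the order is determined based on the search query; the keyword table `CATEGORY_HINTS`, the category names, and the overlap-count scoring below are all assumptions made for illustration.

```python
# Hypothetical hint words per category (not from the abstract).
CATEGORY_HINTS = {
    "images": {"photo", "picture", "wallpaper"},
    "news": {"latest", "today", "election"},
    "local": {"near", "restaurant", "open"},
}


def order_result_sets(query, result_sets):
    """Return (category, results) pairs, most query-relevant category first.

    `result_sets` maps a category name to its result list. Categories are
    scored by how many query terms match their hint words; the sort is
    stable, so ties keep their original order."""
    terms = set(query.lower().split())

    def score(item):
        category, _results = item
        return len(terms & CATEGORY_HINTS.get(category, set()))

    return sorted(result_sets.items(), key=score, reverse=True)
```

For a query such as "pizza restaurant near me", the `local` result set would be moved to the front while the remaining categories keep their original relative order.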