Abstract: A method for navigating a list of media items on a client device includes providing a playlist portion of one or more playlist portions of a user interface (UI) of an application, wherein the playlist portion of the UI provides a list of media items streamable to the application in a first playlist of one or more playlists, receiving an indication of a gesture made by a user with respect to the playlist, identifying a location associated with the gesture, determining that the gesture location is in an area covering the playlist portion, in response to determining that the gesture is a swipe left gesture, navigating media items of the playlist in a forward direction, in response to determining that the gesture is a swipe right gesture, navigating media items of the playlist in a backward direction, and updating a viewing state associated with the first playlist based on the navigating.
Abstract: The present disclosure relates generally to the processing of machine-readable visual encodings in view of contextual information. One embodiment of aspects of the present disclosure comprises obtaining image data descriptive of a scene that includes a machine-readable visual encoding; processing the image data with a first recognition system configured to recognize the machine-readable visual encoding; processing the image data with a second, different recognition system configured to recognize a surrounding portion of the scene that surrounds the machine-readable visual encoding; identifying a stored reference associated with the machine-readable visual encoding based at least in part on one or more first outputs generated by the first recognition system based on the image data and based at least in part on one or more second outputs generated by the second recognition system based on the image data; and performing one or more actions responsive to identification of the stored reference.
Type:
Grant
Filed:
September 18, 2020
Date of Patent:
August 30, 2022
Assignee:
GOOGLE LLC
Inventors:
Alexander James Faaborg, Brett Aladdin Barros
Abstract: Computer generated speech can be generated for cross-lingual natural language textual data streams by utilizing a universal phoneme set. In a variety of implementations, the natural language textual data stream includes a primary language portion in a primary language and a secondary language portion that is not in the primary language. Phonemes corresponding to the secondary language portion can be determined from a set of phonemes in a universal data set. These phonemes can be mapped back to a set of phonemes for the primary language. Audio data can be generated for these phonemes to pronounce the secondary language portion of the natural language textual data stream utilizing phonemes associated with the primary language.
Abstract: Systems and methods of invoking functions of agents via digital assistant applications are provided. Each action-inventory can have an address template for an action by an agent. The address template can include a portion having an input variable used to execute the action. A data processing system can parse an input audio signal from a client device to identify a request and a parameter to be executed by the agent. The data processing system can select an action-inventory for the action corresponding to the request. The data processing system can generate, using the address template, an address. The address can include a substring having the parameter used to control execution of the action. The data processing system can direct an action data structure including the address to the agent to cause the agent to execute the action and to provide output for presentation.
Abstract: Consolidation of disparate data source queries is provided. An application intake component can receive an application package file, and process the file to identify one or more actions and entities for the application. A natural language processor component can receive an input audio signal to identify an action and an entity on which to perform the action. An application delivery component can identify applications that can perform the action and applications that have access to the entity, and then select an application that is configured to perform the action on the entity. The application delivery component can provide an indication of the application selected by the system to a client device.
Type:
Grant
Filed:
February 6, 2019
Date of Patent:
August 30, 2022
Assignee:
GOOGLE LLC
Inventors:
Anuj Thakkar, Chaitrali Amrutkar, Ognjen Nikolic, Jason Douglas, Ilya Firman, Steve Chen
Abstract: The present disclosure provides systems and methods that enable map search recommendations based on a geo-semantic index developed using image content analysis. In one example, a computer-implemented method can include obtaining, by one or more computing devices, a vocabulary of image feature types associated with user activities. The method can include obtaining a collection of imagery. The method can include performing image content analysis on the collection of imagery based on the vocabulary of image feature types. The method can include generating at least one activity score for each of a plurality of location cells in a geo-semantic index based at least in part on the vocabulary of image feature types. The method can include populating the geo-semantic index of location cells, the geo-semantic index of location cells including data indicative of the at least one activity score for each location cell.
Abstract: Video coding using tiling may include encoding a current frame by identifying a tile-width for encoding a current tile of the current frame, the tile-width indicating a cardinality of horizontally adjacent blocks in the current tile, identifying a tile-height for encoding the current tile of the current frame, the tile-height indicating a cardinality of vertically adjacent block in the current tile, and generating an encoded tile by encoding the current tile, such that a row of the current tile includes tile-width horizontally adjacent blocks from the plurality of blocks, and a column of the current tile includes tile-height vertically adjacent blocks from the plurality of blocks. Encoding the current frame may include outputting the encoded tile, wherein outputting the encoded tile includes including an encoded-tile size in an output bitstream, the encoded-tile size indicating a cardinality of bytes for including the encoded tile in the output bitstream.
Type:
Grant
Filed:
January 4, 2019
Date of Patent:
August 23, 2022
Assignee:
GOOGLE LLC
Inventors:
Ronald Sebastiaan Bultje, Sami Aleksi Pietilä
Abstract: Learning to effectively imitate human teleoperators, even in unseen, dynamic environments is a promising path to greater autonomy, enabling robots to steadily acquire complex skills from supervision. Various motion generation techniques are described herein that are rooted in contraction theory and sum-of-squares programming for learning a dynamical systems control policy in the form of a polynomial vector field from a given set of demonstrations. Notably, this vector field is provably optimal for the problem of minimizing imitation loss while providing certain continuous-time guarantees on the induced imitation behavior. Techniques herein generalize to new initial and goal poses of the robot and can adapt in real time to dynamic obstacles during execution, with convergence to teleoperator behavior within a well-defined safety tube.
Type:
Grant
Filed:
January 31, 2020
Date of Patent:
August 23, 2022
Assignee:
GOOGLE LLC
Inventors:
Bachir El Khadir, Vikas Sindhwani, Jacob Varley
Abstract: A computer-implemented method is described. The method includes a computing system receiving a first voice input from a user for generating a bookmark. Responsive to receiving the voice input, the computing system obtains a resource identifier associated with a resource page. The method includes the computing system storing the resource identifier as a bookmark in a bookmark index that is accessible in response to a request from a user device. The method further includes the computing system using the resource identifier, stored as the bookmark, to access the resource page. The computing system uses the resource identifier to access the resource page based on the request from the user device. The request is generated by the user device in response to the user device receiving a second voice input from the user.
Type:
Grant
Filed:
October 19, 2020
Date of Patent:
August 23, 2022
Assignee:
GOOGLE LLC
Inventors:
Mugurel Ionut Andreica, Gokhan H. Bakir, Tak Keung Cheng, Marcin M. Nowak-Przygodzki
Abstract: Techniques are described herein for selectively processing a user's utterances captured prior to and after an event that invokes an automated assistant to determine the user's intent and/or any parameters required for resolving the user's intent. In various implementations, respective measures of fitness for triggering responsive action by the automated assistant may be determined for pre-event and a post-event input streams. Based on the respective measures of fitness, one or both of the pre-event input stream or post-event input stream may be selected and used to cause the automated assistant to perform one or more responsive actions.
Type:
Grant
Filed:
February 20, 2019
Date of Patent:
August 23, 2022
Assignee:
GOOGLE LLC
Inventors:
Matthew Sharifi, Tom Hume, Mohamad Hassan Mohamad Rom, Jan Althaus, Diego Melendo Casado
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for handing off a user conversation between computer-implemented agents. One of the methods includes receiving, by a computer-implemented agent specific to a user device, a digital representation of speech encoding an utterance, determining, by the computer-implemented agent, that the utterance specifies a requirement to establish a communication with another computer-implemented agent, and establishing, by the computer-implemented agent, a communication between the other computer-implemented agent and the user device.
Type:
Grant
Filed:
July 27, 2020
Date of Patent:
August 23, 2022
Assignee:
GOOGLE LLC
Inventors:
Johnny Chen, Thomas L. Dean, Qiangfeng Peter Lau, Sudeep Gandhe, Gabriel Schine
Inventors:
Roger Graves, Eunkyung Chung, Allen Huang, Michael Adam Cohen, Peter Andrew Schiller, Glen Murphy, Winson Wing Sin Chung, Sunny Goyal, Arif Huda, Ryan Takashi Chu, Anthony Sathyam Wickham, Remington Cougar McElhaney
Inventors:
Roger Graves, Eunkyung Chung, Allen Huang, Michael Adam Cohen, Peter Andrew Schiller, Glen Murphy, Winson Wing Sin Chung, Sunny Goyal, Arif Huda, Ryan Takashi Chu, Anthony Sathyam Wickham, Remington Cougar McElhaney