Abstract: At least one aspect of the present disclosure is directed to systems and methods of pruning retrieval tokens from sets of retrieval tokens based on criteria. The system can receive a plurality of retrieval tokens including a second retrieval token. The system can retrieve an indication of a first token and a plurality of requests. The system can construct a first bit string based on the predicted requests and the first retrieval token. The system can retrieve a second bit string corresponding to the second retrieval token. The system can compare the first bit string to the second bit string to determine a similarity value. The system can determine the similarity value is greater than a predetermined threshold. The system can remove the first and second retrieval tokens from the plurality to create a pruned set of retrieval tokens. The system can provide the pruned set to a content.
Type:
Grant
Filed:
April 28, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Mahesh Keralapura Manjunatha, Chiu Wah So
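The compare-and-prune step in the retrieval-token abstract above can be illustrated with a minimal sketch. The abstract does not say how the similarity value is computed, so the fraction of agreeing bit positions used here is an assumption, as are the names `similarity` and `prune`, the `width` parameter, and the representation of bit strings as Python integers.

```python
def similarity(a: int, b: int, width: int) -> float:
    """Fraction of the `width` bit positions where the two bit strings agree.

    Assumption: the abstract does not name a similarity measure; this uses
    1 - (Hamming distance / width) purely for illustration.
    """
    mask = (1 << width) - 1
    differing = bin((a ^ b) & mask).count("1")
    return (width - differing) / width


def prune(tokens: dict, first: str, second: str, width: int, threshold: float) -> dict:
    """Return a copy of `tokens` without `first` and `second` when their
    bit strings are more similar than `threshold`; otherwise an unchanged copy."""
    if similarity(tokens[first], tokens[second], width) > threshold:
        return {name: bits for name, bits in tokens.items()
                if name not in (first, second)}
    return dict(tokens)
```

With 4-bit strings `0b1111` and `0b1110`, three of the four positions agree, so a threshold of 0.5 would prune both tokens under this measure.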
Abstract: A trained model is retrained for video quality assessment and used to identify sets of adaptive compression parameters for transcoding user generated video content. Using transfer learning, the model, which is initially trained for image object detection, is retrained for technical content assessment and then again retrained for video quality assessment. The model is then deployed into a transcoding pipeline and used for transcoding an input video stream of user generated content. The transcoding pipeline may be structured in one of several ways. In one example, a secondary pathway for video content analysis using the model is introduced into the pipeline, which does not interfere with the ultimate output of the transcoding should there be a network or other issue. In another example, the model is introduced as a library within the existing pipeline, which would maintain a single pathway, but ultimately is not expected to introduce significant latency.
Abstract: Generating audio tracks is provided. The system selects a digital component object having a visual output format. The system determines to convert the digital component object into an audio output format. The system generates text for the digital component object. The system selects, based on context of the digital component object, a digital voice to render the text. The system constructs a baseline audio track of the digital component object with the text rendered by the digital voice. The system generates, based on the digital component object, non-spoken audio cues. The system combines the non-spoken audio cues with the baseline audio form of the digital component object to generate an audio track of the digital component object. The system provides the audio track of the digital component object to the computing device for output via a speaker of the computing device.
Abstract: Implementations described herein relate to training and refining robotic control policies using imitation learning techniques. A robotic control policy can be initially trained based on human demonstrations of various robotic tasks. Further, the robotic control policy can be refined based on human interventions while a robot is performing a robotic task. In some implementations, the robotic control policy may determine whether the robot will fail in performance of the robotic task, and prompt a human to intervene in performance of the robotic task. In additional or alternative implementations, a representation of the sequence of actions can be visually rendered for presentation to the human so that the human can proactively intervene in performance of the robotic task.
Type:
Grant
Filed:
August 11, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Seyed Mohammad Khansari Zadeh, Eric Jang, Daniel Lam, Daniel Kappler, Matthew Bennice, Brent Austin, Yunfei Bai, Sergey Levine, Alexander Irpan, Nicolas Sievers, Chelsea Finn
Abstract: Systems and methods of verifying trigger keywords in acoustic-based digital assistant applications are provided. A system can receive, from an application developer computing device, a request to generate a voice-based software application. The request can include a uniform resource locator (URL) associated with a service provided by the voice-based software application and an identifier corresponding to the URL. The system can identify a plurality of links that include the URL as a target. The system can determine a subset of the plurality of links having a respective character string that includes a reference to the identifier for the URL. The data processing system can compare the subset of the plurality of links to a threshold to determine a match level, and can determine a verification status of the request based on the match level.
Abstract: Implementations set forth herein relate to providing notifications regarding events that may interrupt content being rendered at an interface. The notifications can be preemptive and/or can indicate a predicted time and/or source for the events. The event can be, for example, a person attempting to contact a user who is viewing content at a display interface. The person can be associated with a food delivery that has been ordered by the user via a delivery application. An application, such as an automated assistant application, can predict when the person is expected to arrive with the food delivery, and generate a notification ahead of the person arriving. In some implementations, the notification can be rendered at a scrubber user interface (UI) at a location corresponding to the time that the food delivery is expected to arrive, thereby putting the user on notice of when the streaming content may be interrupted.
Type:
Grant
Filed:
January 17, 2024
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Cliff Kuang, Jesse Kaczmarek, Andy Gugel, Jonathan Lee
Abstract: Implementations set forth herein relate to generating a pre-call analysis for one or more users that are receiving and/or initializing a call with one or more other users, and/or prioritizing pre-call content according to whether security-related value was gleaned from provisioning certain pre-call content. One or more machine learning models can be employed for determining the pre-call content to be cached and/or presented prior to a user accepting a call from another user. Feedback provided before, during, and/or after the call can be used as a basis from which to prioritize certain content and/or sources of content when generating pre-call content for a subsequent call. Other information, such as contextual data (e.g., calendar entries, available peripheral devices, location, etc.) corresponding to the previous call and/or the subsequent call, can also be used as a basis from which to provide a pre-call analysis.
Abstract: The present disclosure provides computer-implemented methods, systems, and devices for enabling frictionless transactions at a merchant location using audio communication. A central hub device receives transaction notification data describing a transaction with a user computing device. The central hub device accesses one or more location determination signals to estimate a location of the user computing device within the merchant location. The central hub device transmits transaction data to the estimated location of the user computing device using audio-based communications, the transaction data including an audio key that, when detected by the user computing device, causes the user computing device to automatically activate an application for providing payment data. The central hub device receives transaction payment data from the user computing device. The central hub device executes the transaction by transmitting transaction data to a payment system.
Abstract: Various implementations relate to techniques, for controlling smart devices, that are low latency and/or that provide computational efficiencies (client and/or server) and/or network efficiencies. Those implementations relate to generating and/or utilizing cache entries, of a cache that is stored locally at an assistant client device, in control of various smart devices (e.g., smart lights, smart thermostats, smart plugs, smart appliances, smart routers, etc.). Each of the cache entries includes a mapping of text to one or more corresponding semantic representations.
Type:
Grant
Filed:
September 15, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
David Roy Schairer, Di Lin, Lucas Palmer
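The locally stored cache described in the smart-device abstract above, where each entry maps text to one or more semantic representations, can be sketched roughly as follows. Everything concrete here is hypothetical: the class name `LocalCommandCache`, the dictionary shape of a semantic representation, the whitespace and case normalization, and the `resolve_remotely` callback standing in for a server round trip.

```python
from typing import Callable, Dict, List


class LocalCommandCache:
    """Text -> semantic representations, kept on the assistant client device."""

    def __init__(self, resolve_remotely: Callable[[str], List[dict]]) -> None:
        self._entries: Dict[str, List[dict]] = {}
        self._resolve_remotely = resolve_remotely  # hypothetical server call
        self.remote_calls = 0

    def lookup(self, text: str) -> List[dict]:
        # Assumption: lowercase and collapse whitespace so trivially
        # different phrasings share one cache entry.
        key = " ".join(text.lower().split())
        if key not in self._entries:
            # Cache miss: pay the network cost once, then store the mapping.
            self.remote_calls += 1
            self._entries[key] = self._resolve_remotely(key)
        return self._entries[key]
```

A repeated command is then served from the local mapping without another remote call, which is where the latency and network savings in this sketch come from.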
Abstract: A user device (UE) for managing radio bearers communicates with a first base station over a first radio bearer associated with a dedicated control channel and configured to carry at least application-layer measurement reporting information, the radio bearer associated with a logical channel identity (2502); receives, from a radio access network (RAN) including the first base station and a second base station, a message related to (i) the first radio bearer or (ii) a second radio bearer having the logical channel identity and terminated at the second base station (2504); and releases or reconfigures the first radio bearer in response to the message (2506).
Abstract: Systems and methods for providing scene understanding can include obtaining a plurality of images, stitching images associated with the scene, detecting objects in the scene, and providing information associated with the objects in the scene. The systems and methods can include determining filter tags or query tags that can be selected to filter the plurality of objects, which can then be provided as information to the user to provide further insight on the scene. The information may be provided in an augmented-reality experience via text or other user-interface elements anchored to objects in the images.
Type:
Grant
Filed:
December 20, 2022
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Jessica Lee, Christopher James Kelley, Alok Aggarwal, Harshit Kharbanda
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving a query provided by a user and comprising one or more terms. Obtaining context data based on at least a portion of a first resource displayed to the user at a time that the query is received. Obtaining a revised query that is based on the query and the context data. Receiving a plurality of search results responsive to the revised query. Automatically selecting a search result that represents a second resource from the plurality of search results, and providing the second resource for display to the user.
Type:
Grant
Filed:
October 31, 2022
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Gokhan H. Bakir, Behshad Behzadi, Marcin M. Nowak-Przygodzki
Abstract: A processor executes program code that represents a portion of a video game and adds a sequence of text strings that represent game events to a text log during execution of the program code. The processor (or another processor that has access to the text log) performs a natural language processing (NLP) analysis of the text log to determine one or more characteristics of the portion of the video game. In some cases, the NLP analysis includes a sentiment analysis that attempts to determine characteristics of a player's experience while playing the video game, summarization technology that creates a human-readable summary of an aspect of the game or a portion of the video game, a semantic NLP ML algorithm in the semantic similarity modality to answer questions regarding the player's experience during the video game, or grouping players in a multiplayer game based on in-game behavior.
Abstract: Determining interest in promotional content to be displayed at a mobile communication device is described. The promotional content, including a first portion that is visible and a second portion that is hidden in a first state, may be received. The first and second portions are each selectable only when visible. A promotional display that includes the promotional content in the first state is generated. Upon receipt of a first input from the user in relation to the first portion, the promotional content may be transitioned from the first state to a second state such that the first portion becomes hidden and the second portion becomes visible. Upon failing to receive a second input from the user in relation to the second portion, the promotional content may be transitioned from the second state to the first state. Upon receipt of the second input, an interest in the promotional content may be indicated.
Type:
Grant
Filed:
August 2, 2021
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
James S. Kelm, Thompson Alexander Ivor Gawley, Yelena Nakhimovsky, Jonathan Yu
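The two-state behavior in the promotional-content abstract above reads naturally as a small state machine. The sketch below is one possible reading, not the patented implementation: the class name `PromoContent`, the string state labels, and the explicit `timeout()` event used to model "failing to receive a second input" are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class PromoContent:
    # "first": first portion visible, second hidden.
    # "second": first portion hidden, second visible.
    state: str = "first"
    interested: bool = False

    def visible_portion(self) -> str:
        return self.state

    def select(self, portion: str) -> None:
        # A portion is selectable only while it is visible.
        if portion != self.visible_portion():
            return
        if portion == "first":
            self.state = "second"      # first input: reveal the second portion
        else:
            self.interested = True     # second input: interest is indicated

    def timeout(self) -> None:
        # Assumption: the absence of a second input is signalled by a timer.
        if self.state == "second" and not self.interested:
            self.state = "first"
```

Requiring two deliberate inputs before recording interest means a single tap on the first portion, followed by no further action, leaves no interest signal in this model.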
Abstract: A method includes receiving from a client device a request for content, and transmitting to the client device a first content item, a second content item, and a script for displaying the first and second content items within an information resource. The script includes instructions that cause the client device to (1) display the first content item within a content slot having a first size occupying a first region of the information resource, (2) identify a user interaction associated with the first content item, (3) expand, responsive to the user interaction associated with the first content item, the content slot from the first size to a second size, and (4) display, responsive to the user interaction and in the expanded content slot, the first content item and the second content item and an actionable object configured to reduce the content slot from the second size to the first size.
Type:
Grant
Filed:
December 27, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Amy Wu, Brandon Murdock Pearcy, Nathan Peter Lucash, Jun Xu, Yi Zhang, Zhen Yu
Abstract: Systems and methods for generating min-increment counting bloom filters to determine count and frequency of device identifiers and attributes in a networking environment are disclosed. The system can maintain a set of data records including device identifiers and attributes associated with devices in a network. The system can generate a vector comprising coordinates corresponding to counter registers. The system can identify hash functions to update a counting bloom filter. The system can hash the data records to extract index values pointing to a set of counter registers. The system can increment the positions in the min-increment counting bloom filter corresponding to the minimum values of the counter registers. The system can obtain an aggregated public key comprising a public key. The system can encrypt the counter registers using the aggregated shared key to generate an encrypted vector. The system can transmit the encrypted vector to a networked worker computing device.
Type:
Grant
Filed:
June 13, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Craig Wright, Laura Grace Book, Sheng Ma, Jiayu Peng, Xichen Huang, Benjamin R. Kreuter, Evgeny Skvortsov, Arthur Asuncion, James Robert Koehler
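The min-increment update in the counting bloom filter abstract above can be sketched as follows: each record is hashed to a set of counter registers, and only the registers currently holding the minimum value among that set are incremented. This is an illustrative sketch only. The class name, the use of salted SHA-256 as the hash family, and the `estimate` method are assumptions, and the encryption and transmission steps from the abstract are deliberately left out.

```python
import hashlib
from typing import List


class MinIncrementCountingBloomFilter:
    def __init__(self, num_registers: int, num_hashes: int) -> None:
        self.registers = [0] * num_registers   # the vector of counter registers
        self.num_hashes = num_hashes

    def _indices(self, item: str) -> List[int]:
        # Assumption: one index per hash function, derived by salting
        # SHA-256 with the hash function's number.
        return [
            int.from_bytes(
                hashlib.sha256(f"{i}:{item}".encode()).digest()[:8], "big"
            ) % len(self.registers)
            for i in range(self.num_hashes)
        ]

    def add(self, item: str) -> None:
        indices = set(self._indices(item))
        lowest = min(self.registers[i] for i in indices)
        # Min-increment: bump only the registers that hold the minimum.
        for i in indices:
            if self.registers[i] == lowest:
                self.registers[i] += 1

    def estimate(self, item: str) -> int:
        # The smallest register is the frequency estimate; it never undercounts.
        return min(self.registers[i] for i in self._indices(item))
```

Compared with incrementing every hashed register, this update leaves registers that are already inflated by collisions untouched, which tends to reduce overestimation of frequencies.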
Abstract: Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, in an existing human-to-computer dialog session between a user and an automated assistant, it may be determined that the automated assistant has responded to all natural language input received from the user. Based on characteristic(s) of the user, information of potential interest to the user or action(s) of potential interest to the user may be identified. Unsolicited content indicative of the information of potential interest to the user or the action(s) may be generated and incorporated by the automated assistant into the existing human-to-computer dialog session.
Type:
Grant
Filed:
December 14, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Ibrahim Badr, Zaheed Sabur, Vladimir Vuskovic, Adrian Zumbrunnen, Lucas Mirelmann
Abstract: A system may receive an entity identifier for a user. The entity identifier may identify an entity that is associated with a category. The system may identify channel(s) for the category by obtaining data for the user, and analyzing the data to determine a score for each channel. The system may use the scores to select one or more of the channels. The system may provide content related to the selected channel(s) to a client device of the user, and the user may interact with the content via the client device.