Abstract: Provided are methods, systems, and devices for generating semantic objects and an output based on the detection or recognition of the state of an environment that includes objects. State data, based in part on sensor output, can be received from one or more sensors that detect a state of an environment including objects. Based in part on the state data, semantic objects are generated. The semantic objects can correspond to the objects and include a set of attributes. Based in part on the set of attributes of the semantic objects, one or more operating modes, associated with the semantic objects can be determined. Based in part on the one or more operating modes, object outputs associated with the semantic objects can be generated. The object outputs can include one or more visual indications or one or more audio indications.
Type:
Grant
Filed:
August 5, 2021
Date of Patent:
February 25, 2025
Assignee:
GOOGLE LLC
Inventors:
Tim Wantland, Donald A. Barnett, David Matthew Jones
Abstract: Embodiments are provided for receiving a request to output audio at a first speaker and a second speaker of an electronic device, determining that the electronic device is oriented in a portrait orientation or a landscape orientation, identifying, based on the determined orientation, a first equalization setting for the first speaker and a second equalization setting for the second speaker, providing, for output at the first speaker, a first audio signal with the first equalization setting, and providing, for output at the second speaker, a second audio signal with the second equalization setting.
Type:
Grant
Filed:
June 5, 2023
Date of Patent:
February 25, 2025
Assignee:
GOOGLE TECHNOLOGY HOLDINGS LLC
Inventors:
Adrian M. Schuster, Prabhu Annabathula, Kevin J. Bastyr, Andrew K. Wells, Wen Hao Zhang
Abstract: Systems and methods for generating a bias lighting effect are provided. A computer-implemented method can include obtaining a video comprising a plurality of video frames. For each of one or more video frames of the plurality of video frames, the method can include sampling an edge portion of the video frame. The edge portion can include a portion of the video frame adjacent to an edge of the video frame. The method can further include generating a bias lighting effect for the video frame. Generating the bias lighting effect can include inverting the edge portion across the edge and blurring the edge portion. The method can further include displaying the video frame concurrently with the bias lighting effect for the video frame. The bias lighting effect can be displayed adjacent to the edge of the video frame.
Type:
Grant
Filed:
May 1, 2019
Date of Patent:
February 25, 2025
Assignee:
GOOGLE LLC
Inventors:
Bryan Ku, Aileen Cheng, Rick Maria Frederikus Van Mook
Abstract: A processor executes program code that represents a portion of a video game and adds a sequence of text strings that represent game events to a text log during execution of the program code. The processor (or another processor that has access to the text log) performs a natural language processing (NLP) analysis of the text log to determine one or more characteristics of the portion of the video game. In some cases, the NLP analysis includes a sentiment analysis that attempts to determine characteristics of a player's experience while playing the video game, summarization technology that creates a human-readable summary of an aspect of the game or a portion of the video game, a semantic NLP ML algorithm in the semantic similarity modality to answer questions regarding the player's experience during the video game, or grouping players in a multiplayer game based on in-game behavior.
Abstract: Systems and methods of verifying trigger keywords in acoustic-based digital assistant applications are provided. A system can receive, from an application developer computing device, a request to generate a voice-based software application. The request can include a uniform resource locator (URL) associated with a service provided by the voice-based software application and an identifier corresponding to the URL. The system can identify a plurality of links that include the URL as a target. The system can determine a subset of the plurality of links having a respective character string that includes a reference to the identifier for the URL. The data processing system can compare the subset of the plurality of links to a threshold to determine a match level, and can determine a verification status of the request based on the match level.
Abstract: Implementations set forth herein relate to generating a pre-call analysis for one or more users that are receiving and/or initializing a call with one or more other users, and/or prioritizing pre-call content according to whether security-related value was gleaned from provisioning certain pre-call content. One or more machine learning models can be employed for determining the pre-call content to be cached and/or presented prior to a user accepting a call from another user. Feedback provided before, during, and/or after the call can be used as a basis from which to prioritize certain content and/or sources of content when generating pre-call content for a subsequent call. Other information, such as contextual data (e.g., calendar entries, available peripheral devices, location, etc.) corresponding to the previous call and/or the subsequent call, can also be used as a basis from which to provide a pre-call analysis.
Abstract: Various implementations relate to techniques, for controlling smart devices, that are low latency and/or that provide computational efficiencies (client and/or server) and/or network efficiencies. Those implementations relate to generating and/or utilizing cache entries, of a cache that is stored locally at an assistant client device, in control of various smart devices (e.g., smart lights, smart thermostats, smart plugs, smart appliances, smart routers, etc.). Each of the cache entries includes a mapping of text to one or more corresponding semantic representations.
Type:
Grant
Filed:
September 15, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
David Roy Schairer, Di Lin, Lucas Palmer
Abstract: An electronic game server receives a request from a client device to establish a real-time interactive gaming session, determines a device capability of an output device associated with the client device, determines a connection capability of the network connection, determines one or more target quality parameters for the real-time interactive gaming session based on the device capability and the connection capability, selects a first virtual machine of the plurality of virtual machines based on the one or more target quality parameters, establishes the real-time interactive gaming session with the client device, and provides to the real-time interactive gaming session, in accordance with the resource profile of the first virtual machine, resources for processing inputs from the client device and generating gameplay outputs in accordance with the processed inputs within the real-time interactive gaming session.
Type:
Grant
Filed:
December 15, 2022
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Dov Zimring, Paul Leventis, Benjamin Frenkel, Matthew Rodgers, Clinton Smullen, Robert McCool
Abstract: Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, in an existing human-to-computer dialog session between a user and an automated assistant, it may be determined that the automated assistant has responded to all natural language input received from the user. Based on characteristic(s) of the user, information of potential interest to the user or action(s) of potential interest to the user may be identified. Unsolicited content indicative of the information of potential interest to the user or the action(s) may be generated and incorporated by the automated assistant into the existing human-to-computer dialog session.
Type:
Grant
Filed:
December 14, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Ibrahim Badr, Zaheed Sabur, Vladimir Vuskovic, Adrian Zumbrunnen, Lucas Mirelmann
Abstract: Determining interest in promotional content to be displayed at a mobile communication device is described. The promotional content, including a first portion that is visible and a second portion that is hidden in a first state, may be received. The first and second portions are each selectable only when visible. A promotional display that includes the promotional content in the first state is generated. Upon receipt of a first input from the user in relation to the first portion, the promotional content may be transitioned from the first state to a second state such the first portion becomes hidden and the second portion becomes visible. Upon failing to receive a second input from the user in relation to the second portion, the promotional content may be transitioned from the second state to the first state. Upon receipt of the second input, an interest in the promotional content may be indicated.
Type:
Grant
Filed:
August 2, 2021
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
James S. Kelm, Thompson Alexander Ivor Gawley, Yelena Nakhimovsky, Jonathan Yu
Abstract: Systems and methods for providing scene understanding can include obtaining a plurality of images, stitching images associated with the scene, detecting objects in the scene, and providing information associated with the objects in the scene. The systems and methods can include determining filter tags or query tags that can be selected to filter the plurality of objects, which can then be provided as information to the user to provide further insight on the scene. The information may be provided in an augmented-reality experience via text or other user-interface elements anchored to objects in the images.
Type:
Grant
Filed:
December 20, 2022
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Jessica Lee, Christopher James Kelley, Alok Aggarwal, Harshit Kharbanda
Abstract: The present disclosure provides systems, methods, and computer program products for providing efficient embedding table storage and lookup in machine-learning models.
Abstract: A user device (UE) for managing radio bearers communicates, with a first base station over a first radio bearer associated with a dedicated control channel and configured to carry at least application-layer measurement reporting information, the radio bearer associated with a logical channel identity (2502); receives, from a radio access network (RAN) including the first base station and a second base station, a message related to (i) the first radio bearer or (ii) a second radio bearer having the logical channel identity and terminated at the second base station (2504); and release or reconfigure the first radio bearer in response to the message (2506).
Abstract: A method includes receiving from a client device a request for content, and transmitting to the client device a first content item, a second content item, and a script for displaying the first and second content items within an information resource. The script includes instructions that cause the client device to (1) display the first content item within a content slot having a first size occupying a first region of the information resource, (2) identify a user interaction associated with the first content item, (3) expand, responsive to the user interaction associated with the first content item, the content slot from a first size to a second size, and (4) display, responsive to the user interaction and in the expanded content slot, the first content item and the second content item and an actionable object configured to reduce the content slot from the second size to the first size.
Type:
Grant
Filed:
December 27, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Amy Wu, Brandon Murdock Pearcy, Nathan Peter Lucash, Jun Xu, Yi Zhang, Zhen Yu
Abstract: The present disclosure provides computer-implemented methods, systems, and devices for enabling frictionless transactions at a merchant location using audio communication. A central hub device receives transaction notification data describing a transaction with a user computing device. The central hub device accesses one or more location determination signals to estimate a location of the user computing device with the merchant location. The central hub device transmits transaction data to the estimated location of the user computing device using audio-based communications, the transaction data including an audio key that, when detected by the user computing device, causes the user computing device to automatically activate an application for providing payment data. The central hub device receiving transaction payment data from the user computing device. The central hub device executes the transaction by transmitting transaction data to a payment system.
Abstract: Implementations described herein relate to training and refining robotic control policies using imitation learning techniques. A robotic control policy can be initially trained based on human demonstrations of various robotic tasks. Further, the robotic control policy can be refined based on human interventions while a robot is performing a robotic task. In some implementations, the robotic control policy may determine whether the robot will fail in performance of the robotic task, and prompt a human to intervene in performance of the robotic task. In additional or alternative implementations, a representation of the sequence of actions can be visually rendered for presentation to the human can proactively intervene in performance of the robotic task.
Type:
Grant
Filed:
August 11, 2023
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Seyed Mohammad Khansari Zadeh, Eric Jang, Daniel Lam, Daniel Kappler, Matthew Bennice, Brent Austin, Yunfei Bai, Sergey Levine, Alexander Irpan, Nicolas Sievers, Chelsea Finn
Abstract: A trained model is retrained for video quality assessment and used to identify sets of adaptive compression parameters for transcoding user generated video content. Using transfer learning, the model, which is initially trained for image object detection, is retrained for technical content assessment and then again retrained for video quality assessment. The model is then deployed into a transcoding pipeline and used for transcoding an input video stream of user generated content. The transcoding pipeline may be structured in one of several ways. In one example, a secondary pathway for video content analysis using the model is introduced into the pipeline, which does not interfere with the ultimate output of the transcoding should there be a network or other issue. In another example, the model is introduced as a library within the existing pipeline, which would maintain a single pathway, but ultimately is not expected to introduce significant latency.