Abstract: A processor executes program code that represents a portion of a video game and adds a sequence of text strings that represent game events to a text log during execution of the program code. The processor (or another processor that has access to the text log) performs a natural language processing (NLP) analysis of the text log to determine one or more characteristics of the portion of the video game. In some cases, the NLP analysis includes a sentiment analysis that attempts to determine characteristics of a player's experience while playing the video game, summarization technology that creates a human-readable summary of an aspect of the game or a portion of the video game, a semantic NLP ML algorithm in the semantic similarity modality to answer questions regarding the player's experience during the video game, or grouping players in a multiplayer game based on in-game behavior.
Abstract: Generally, the present disclosure is directed to systems and methods that perform adaptive optimization with improved convergence properties. The adaptive optimization techniques described herein are useful in various optimization scenarios, including, for example, training a machine-learned model such as, for example, a neural network. In particular, according to one aspect of the present disclosure, a system implementing the adaptive optimization technique can, over a plurality of iterations, employ an adaptive learning rate while also ensuring that the learning rate is non-increasing.
Type:
Grant
Filed:
December 14, 2022
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Sashank Jakkam Reddi, Sanjiv Kumar, Satyen Chandrakant Kale
Abstract: The present disclosure provides systems, methods, and computer program products for providing efficient embedding table storage and lookup in machine-learning models.
Abstract: An electronic game server receives a request from a client device to establish a real-time interactive gaming session, determines a device capability of an output device associated with the client device, determines a connection capability of the network connection, determines one or more target quality parameters for the real-time interactive gaming session based on the device capability and the connection capability, selects a first virtual machine of the plurality of virtual machines based on the one or more target quality parameters, establishes the real-time interactive gaming session with the client device, and provides to the real-time interactive gaming session, in accordance with the resource profile of the first virtual machine, resources for processing inputs from the client device and generating gameplay outputs in accordance with the processed inputs within the real-time interactive gaming session.
Type:
Grant
Filed:
December 15, 2022
Date of Patent:
February 18, 2025
Assignee:
GOOGLE LLC
Inventors:
Dov Zimring, Paul Leventis, Benjamin Frenkel, Matthew Rodgers, Clinton Smullen, Robert McCool
Abstract: A computerized method, system for, and computer-readable medium operable to: retrieve program details data about at least one media program. Retrieve social media posts involving the at least one media program. Match the retrieved program details data with the retrieved social media posts to find matching locations. Store time offset data corresponding to the matched locations within the at least one media program into at least one micropost. Rank the at least one media program by the number of microposts corresponding to the at least one media program. Present the at least one media program, each of the at least one media program with at least one micropost, wherein the at least one micropost comprises a link to the original social media page hosting the social media post and a video deeplink that plays a time off-set video, the time off-set video starting from the stored time offset data.
Abstract: Implementations set forth herein relate to providing selectable autofill suggestions, which correspond to application actions that are at least partially fulfilled using server command data—prior to a user selecting a particular selectable autofill suggestion. Proactively fulfilling command data in this way mitigates latency between user selection of a suggestion and fulfillment of a particular action. Initially, a partial input can be processed to generate autofill suggestions, which can be communicated to a server device for further processing. The autofill suggestions can also be rendered for selection at a touch display interface, thereby allowing a user to select one of the autofill suggestions. As command fulfillment data is provided by the server, the command fulfillment data can be available to a corresponding application(s) in order that any corresponding actions can be at least partially fulfilled prior to user selection.
Abstract: A computer-implemented method includes receiving a document insight request that requests document insights for a corpus of documents. The document insight request includes the corpus of documents, a set of entities contained within each document of the corpus of documents, and document insight request parameters that includes a confidence value threshold. The method also includes generating the document insights for the corpus of documents based on the confidence value threshold. Here, the document insights include an accuracy target and a user review rate target. The method also includes transmitting the document insights to the user device causing a graphical user interface to display the document insights on the user device.
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.
Type:
Grant
Filed:
May 8, 2023
Date of Patent:
February 11, 2025
Assignee:
GOOGLE LLC
Inventors:
Marcin Nowak-Przygodzki, Nathan David Howard, Gabor Simko, Andrei Giurgiu, Behshad Behzadi
Abstract: Implementations relate to dynamically adapting a given assistant output based on a given persona, from among a plurality of disparate personas, assigned to an automated assistant. In some implementations, the given assistant output can be generated and subsequently adapted based on the given persona assigned to the automated assistant. In other implementations, the given assistant output can be generated specific to the given persona and without having to subsequently adapt the given assistant output to the given persona. Notably, the given assistant output can include a stream of textual content to be synthesized for audible presentation to the user, and a stream of visual cues utilized in controlling a display of a client device and/or in controlling a visualized representation of the automated assistant. Various implementations utilize large language models (LLMs), or output previously generated utilizing LLMs, to reflect the given persona in the given assistant output.
Type:
Grant
Filed:
May 13, 2022
Date of Patent:
February 11, 2025
Assignee:
GOOGLE LLC
Inventors:
Martin Baeuml, Thushan Amarasiriwardena, Roberto Pieraccini, Gianluca Martini
Abstract: Processor(s) of a client device of a user can receive a telephone call that is initiated by an additional user, and, in response to receiving the telephone call, identify an entity that is associated with the additional user, and determine, based on the entity that is associated with the additional user, whether to (1) fully automate the telephone call, or (2) partially automate the telephone call. In fully automating the telephone call, the processor(s) can cause a chatbot to engage in a corresponding conversation with the additional user and without prompting the user for any input. In partially automating the telephone call, the processor(s) can cause the chatbot to engage in a corresponding conversation with the additional user but with prompting the user for input(s) via suggestion chip(s). In some implementations, the processor(s) can further determine whether to (3) refrain from automating the telephone call entirely.
Type:
Grant
Filed:
December 15, 2022
Date of Patent:
February 11, 2025
Assignee:
GOOGLE LLC
Inventors:
Yoav Tzur, Yaniv Leviathan, Yossi Matias, Jan Jedrzejowicz
Abstract: On-device processor(s) of a client device may store, in on-device storage and in association with a time to live (TTL) in the on-device storage, a correction directed to ASR processing of audio data. The correction may include a portion of a given speech hypothesis that was modified to an alternate speech hypothesis. Further, the on-device processor(s) may cause an on-device ASR model to be personalized based on the correction. Moreover, and based on additional ASR processing of additional audio data, the on-device processor(s) may store, in the on-device storage and in association with an additional TTL in the on-device storage, a pseudo-correction directed to the additional ASR processing. Accordingly, the on-device processor(s) may cause the on-device ASR model to be personalized based on the pseudo-correction to prevent forgetting by the on-device ASR model.
Abstract: A reading assistant tool implemented in a browser application facilitates the presentation of text content in a manner and format that addresses the reading and/or comprehension capabilities of individual users, while maintaining the context of the original content. The reading assistant tool outputs a reader view pane alongside a main content pane. Original content is presented in the main content pane, and simplified/reformatted content corresponding to text content extracted from the original content, is presented in the reader view pane. Scrolling of the extracted text content in the reader view pane is synchronized with scrolling of the original content in the main content pane, so that context is maintained as the user moves through the content. The concurrent presentation of the original content and the extracted text content allows the user to more easily consume the extracted text content, while also maintaining context as originally intended.
Type:
Grant
Filed:
September 12, 2022
Date of Patent:
February 11, 2025
Assignee:
GOOGLE LLC
Inventors:
Abigail Beth Klein, Joel Christopher Riley
Abstract: Implementations described herein relate to causing certain reasoning with respect to why an automated assistant performed (or did not perform) certain fulfillment and/or alternate fulfillment of an assistant command. For example, implementations can receive user input that includes the assistant command, process the user input to determine data to be utilized in performance of the certain fulfillment or the alternate fulfillment of the assistant command, and cause the automated assistant to utilize the data to perform the certain fulfillment or the alternate fulfillment of the assistant command. In some implementations, output that includes the certain reasoning can be provided for presentation to a user in response to additional user input that requests the certain reasoning. In some implementations, a selectable element can be visually rendered and, when selected by the user, the output that includes the certain reasoning can be provided for presentation to the user.
Type:
Grant
Filed:
November 22, 2021
Date of Patent:
February 11, 2025
Assignee:
GOOGLE LLC
Inventors:
Felix Weissenberger, Alexander Froemmgen, Bogdan Prisacari
Abstract: Methods, apparatus, and computer readable media related to interaction between a user and an automated assistant during a dialog between the user and the automated assistant. Some implementations are directed to adapting a graphical and/or audible presentation of search results provided by the automated assistant for presentation to the user. The adaptation may be in response to attribute(s), of one or more of the search results, referenced in spoken and/or typed textual input provided by the user during the dialog. Some of those implementations may enable a user to provide textual input to navigate the search results within the dialog and within resource and/or interface constraints associated with the dialog. Some of those implementations may additionally and/or alternatively enable adapting, based on textual input provided by a user to the automated assistant, when and/or whether search results having certain attributes are provided to the user by the automated assistant.
Abstract: Implementations relate to an automated assistant that is capable of interacting with non-assistant applications that do not have functionality explicitly provided for interfacing with certain automated assistants. Application data, such as annotation data and/or GUI data, associated with a non-assistant application, can be processed to map such data into an embedding space. An assistant input command can then be processed and mapped to the same embedding space, and a distance from the assistant input command embedding and the non-assistant application data embedding can be determined. When the distance between the assistant input command embedding and the non-assistant application data embedding satisfies threshold(s), the automated assistant can generate instruction(s), for the non-assistant application, that correspond to the non-assistant application data.
Abstract: To select a lane in a multi-lane road segment for a vehicle travelling on the road segment, a system identifies, in multiple lanes and in a region ahead of the vehicle, another vehicle defining a target; the system applies an optical flow technique to track the target during a period of time, to generate an estimate of how fast traffic moves; and the system applies the estimate to machine learning (ML) model to generate a recommendation which one of the plurality of lanes the vehicle is to choose.
Abstract: A cyber-security analysis method uses machine learning (ML) technology to classify cyber-threat indicators, for example, as malicious or benign, by generating a threat score. The method includes receiving, at a compute device, a data set including cyber-threat indicators and verdicts serving as votes from each source in the set of sources. Each of the votes is associated with one of the cyber-threat indicators. An ML model is trained based on at least one of agreements among the sets of votes, and disagreements among the sets of votes to produce a trained ML model. In response to receiving a new cyber-threat indicator, votes are identified for each source from a subset of the sources, to define a second set of votes. The cyber-threat score is generated for the new cyber-threat indicator based on the trained ML model and the second set of votes.