Abstract: This application is directed to a speaker assembly in which a speaker is mounted in an enclosure structure. The enclosure structure exposes a speaker opening of the speaker and provides a sealed enclosure for a rear portion of the speaker, and further includes an electrically conductive portion. One or more electronic components are coupled to the electrically conductive portion of the enclosure structure (which is grounded in some implementations). The electrically conductive portion of the enclosure structure is configured to provide electromagnetic shielding for the electronic components and forms part of the sealed enclosure of the speaker. In some implementations, the electrically conductive portion of the enclosure structure is thermally coupled to the electronic components and acts as a heat sink that is configured to absorb heat generated by the electronic components and dissipate the generated heat away from the electronic components.
Abstract: A direct speech-to-speech translation (S2ST) model includes an encoder configured to receive an input speech representation that corresponds to an utterance spoken by a source speaker in a first language and encode the input speech representation into a hidden feature representation. The S2ST model also includes an attention module configured to generate a context vector that attends to the hidden representation encoded by the encoder. The S2ST model also includes a decoder configured to receive the context vector generated by the attention module and predict a phoneme representation that corresponds to a translation of the utterance in a second different language. The S2ST model also includes a synthesizer configured to receive the context vector and the phoneme representation and generate a translated synthesized speech representation that corresponds to a translation of the utterance spoken in the different second language.
Type:
Application
Filed:
April 4, 2024
Publication date:
August 15, 2024
Applicant:
Google LLC
Inventors:
Ye Jia, Michelle Tadmor Ramanovich, Tal Remez, Roi Pomerantz
Abstract: Various arrangements for performing the playback of media content are provided. A computing device can cause a media content item to be transmitted to a display device for presentation. An event that is associated with the presentation of the media content item can be detected. In response to the event, a mirroring session can be resumed on the computing device.
Type:
Application
Filed:
April 23, 2024
Publication date:
August 15, 2024
Applicant:
Google LLC
Inventors:
Stephen Konig, Yuri James Wiitala, Xiangjun Zhang, Chien-Jung Kung
Abstract: A method includes providing, via a user interface (UI) of a first application on a first screen device, a UI element prompting a user to connect the first screen device with a second screen device that is presenting a video to the user. The method further includes receiving, via the UI element, an indication of a user request to connect the first screen device with the second screen device. In response to receiving the indication of the user request to connect the first screen device with the second screen device, the method further includes causing the first screen device to be paired with the second screen device. The method further includes providing, for presentation in the UI of the first application of the first screen device, one or more comments provided by one or more other users for the video that is concurrently presented on the second screen device.
Type:
Grant
Filed:
May 16, 2022
Date of Patent:
August 13, 2024
Assignee:
Google LLC
Inventors:
Aditya Nag, Christopher David Patrick Cooke, Ken Hy Kha Thai, Sana Mithani, Aran Mun
Abstract: Described techniques enable displaying a plurality of positions arranged around a central location on a display, the plurality of positions including at least one empty position and filled positions displaying a first subset of list elements, the filled positions including an adjacent position that is adjacent to the at least one empty position. Movement of a gaze point may be tracked, relative to the filled positions and around the central location towards the at least one empty position. The at least one empty position may be filled with a list element of the list elements, and the adjacent position may be emptied to thereby display a second subset of the list elements.
Abstract: A system for passive alignment of fibers to an interface of a photonic integrated circuit (PIC) includes an input frame, an actuator, and an output frame. The actuator is arranged to apply force along a force axis to the input frame. The output frame includes a tip for picking up a plate and transferring the force thereto, the output frame being connected to the input frame such that the output frame may tilt relative to the input frame and the output frame is elastically biased relative to the input frame into a position wherein the tip is aligned on the force axis.
Type:
Grant
Filed:
May 30, 2023
Date of Patent:
August 13, 2024
Assignee:
Google LLC
Inventors:
Daoyi Wang, Ryohei Urata, Lieven Verslegers, Jan Petykiewicz
Abstract: Methods are implemented to provide a shortened textual summary that includes the status information most pertinent to the user for a plurality of connected smart devices. The method includes determining a list of current statuses for a plurality of enabled smart devices and filtering the list to remove statuses that may not be of interest to the user. The filtering of the list is based on a current context of the requesting user and one or more previous contexts of the user. The resulting filtered statuses are then converted to textual snippets, summarized, and provided to the user via one or more output devices.
Type:
Grant
Filed:
September 24, 2021
Date of Patent:
August 13, 2024
Assignee:
GOOGLE LLC
Inventors:
Yuzhao Ni, Ashwin Limaye, Cindy Tran, Thomas Clifton, David Roy Schairer
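The filter-then-summarize flow described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the `DeviceStatus` type, the `is_default` flag, the room-based notion of "context", and the `summarize` helper are all hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeviceStatus:
    device: str
    room: str
    state: str
    is_default: bool  # hypothetical flag: True when the device is in its usual state


def filter_statuses(statuses, current_room, previous_rooms):
    """Keep unusual statuses for rooms in the user's current or earlier context.

    Modeling "context" as a room name is an assumption made for this sketch.
    """
    relevant_rooms = {current_room, *previous_rooms}
    return [s for s in statuses if not s.is_default and s.room in relevant_rooms]


def to_snippet(status):
    """Convert one status into a short textual snippet."""
    return f"{status.device} is {status.state}"


def summarize(snippets, limit=3):
    """Join up to `limit` snippets and note how many were left out."""
    if not snippets:
        return "Nothing to report."
    shown = snippets[:limit]
    rest = len(snippets) - len(shown)
    text = "; ".join(shown)
    return f"{text}; and {rest} more." if rest else f"{text}."
```

Calling `filter_statuses` and then passing the resulting snippets to `summarize` yields a single sentence suitable for a speaker or display; a real system would presumably use a much richer context model than a room name.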
Abstract: Implementations set forth herein relate to an automated assistant that allows third party applications to inject dependencies to leverage automated assistant functions. Furthermore, enabling such dependency injections can allow third party applications to preserve privacy of any application content that is used during execution of automated assistant functions. In some implementations, a third party application can initialize a function with an assistant dependency using parameters that are tagged as private. Initializing a function in such a way can allow private content communicated between the third party application and the automated assistant to be abstracted for security purposes. The abstracted content can thereafter be communicated to a remote server, such as a server hosting an extensively trained machine learning model. Intelligent output provided by the server can then be incorporated into one or more processes of the third party application without compromising security.
Abstract: The present disclosure provides systems and methods that reduce vulnerability of software systems (e.g., machine-learned models) to adversarial attacks by increasing variety within the software system. In particular, a software system can include a number of subcomponents that interoperate using predefined interfaces. To increase variety within the software system, multiple, different versions of one or more of the subcomponents of the software system can be generated. In particular, the different versions of the subcomponent(s) can be different from each other in some way, while still remaining functionally equivalent (e.g., able to perform the same functions with comparable accuracy/success). A plurality of different variants of the software system can be constructed by mixing and matching different versions of the subcomponents. A large amount of variety can be exhibited by the variants of the software system deployed at a given time, thereby leading to increased robustness against adversarial attacks.
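The "mixing and matching" of subcomponent versions described in the abstract above amounts to taking a Cartesian product over the available versions. The sketch below shows that idea only; the function name `build_variants` and the example version labels are hypothetical, and nothing here checks functional equivalence of the versions.

```python
from itertools import product


def build_variants(versions_by_subcomponent):
    """Return every system variant as a dict mapping subcomponent -> chosen version.

    `versions_by_subcomponent` maps a subcomponent name to a list of
    interchangeable versions (assumed to share the same interface).
    """
    names = sorted(versions_by_subcomponent)
    combos = product(*(versions_by_subcomponent[name] for name in names))
    return [dict(zip(names, combo)) for combo in combos]


# Hypothetical example: 2 encoder versions x 3 classifier versions = 6 variants.
variants = build_variants(
    {"encoder": ["enc_a", "enc_b"], "classifier": ["clf_a", "clf_b", "clf_c"]}
)
```

Because the number of variants grows multiplicatively with the number of versions per subcomponent, even a few alternate versions give a deployment many distinct system configurations.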
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.
Type:
Grant
Filed:
May 8, 2023
Date of Patent:
August 13, 2024
Assignee:
Google LLC
Inventors:
Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
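The per-time-step decoding loop in the abstract above (compute a score distribution over sample values, conditioned on the conditioning sequence, then sample) can be illustrated with a toy stand-in. This is a sketch under heavy assumptions: `toy_score_distribution` is a hypothetical hand-written function, not a neural network, and the sketch assumes one conditioning value per decoder time step.

```python
import math
import random


def toy_score_distribution(reconstruction, conditioning, num_values):
    """Hypothetical stand-in for the generative network.

    Returns a probability for each possible sample value, favoring values
    close to the conditioning value and to the previous sample.
    """
    prev = reconstruction[-1] if reconstruction else 0
    logits = [-abs(v - conditioning) - 0.5 * abs(v - prev) for v in range(num_values)]
    top = max(logits)
    exps = [math.exp(logit - top) for logit in logits]  # numerically stable softmax
    total = sum(exps)
    return [e / total for e in exps]


def decode(conditioning_sequence, num_values=8, seed=0):
    """Generate one sample per time step, feeding earlier samples back in."""
    rng = random.Random(seed)
    reconstruction = []
    for conditioning in conditioning_sequence:
        probs = toy_score_distribution(reconstruction, conditioning, num_values)
        sample = rng.choices(range(num_values), weights=probs, k=1)[0]
        reconstruction.append(sample)
    return reconstruction
```

The key property shown is the auto-regressive feedback: each new sample is drawn from a distribution that depends on both the samples generated so far and the conditioning input for that step.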
Abstract: A computer-implemented method that includes receiving, by a processing unit, an instruction that specifies data values for performing a tensor computation. In response to receiving the instruction, the method may include performing, by the processing unit, the tensor computation by executing a loop nest comprising a plurality of loops, wherein a structure of the loop nest is defined based on one or more of the data values of the instruction. The tensor computation can be at least a portion of a computation of a neural network layer. The data values specified by the instruction may comprise a value that specifies a type of the neural network layer, and the structure of the loop nest can be defined at least in part by the type of the neural network layer.
Type:
Grant
Filed:
June 21, 2022
Date of Patent:
August 13, 2024
Assignee:
Google LLC
Inventors:
Ravi Narayanaswami, Dong Hyuk Woo, Olivier Temam, Harshit Khaitan
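One way to picture a loop nest whose structure is defined by an instruction's data values is sketched below. It is a software illustration only, not the hardware mechanism the patent describes: the instruction is modeled as a plain dict, and the field names (`layer_type`, `n_out`, `n_in`, `kernel_size`) are hypothetical.

```python
from itertools import product


def loop_bounds(instruction):
    """Derive the loop-nest bounds from the instruction's data values."""
    kind = instruction["layer_type"]
    if kind == "fully_connected":
        return (instruction["n_out"], instruction["n_in"])
    if kind == "conv1d":
        return (instruction["n_out"], instruction["kernel_size"])
    raise ValueError(f"unsupported layer type: {kind}")


def execute(instruction, inputs, weights):
    """Run the tensor computation by iterating the instruction-defined loop nest."""
    bounds = loop_bounds(instruction)
    out = [0.0] * bounds[0]
    is_conv = instruction["layer_type"] == "conv1d"
    # product(...) walks the nested loops; the extents come from the instruction.
    for o, k in product(*(range(b) for b in bounds)):
        if is_conv:
            out[o] += weights[k] * inputs[o + k]  # sliding-window sum
        else:
            out[o] += weights[o][k] * inputs[k]  # matrix-vector product
    return out
```

Here the same `execute` routine performs either a matrix-vector product or a 1-D convolution, with the loop extents and the indexing pattern chosen from the layer type carried in the instruction.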
Abstract: A method including determining a gaze direction of a user of a wearable device, capturing an image using a forward-looking camera of the wearable device, detecting surroundings of the user based on the image, determining whether or not the user is distracted based on the gaze direction and the surroundings, and in response to determining the user is distracted, causing an operation to be performed on the wearable device, the operation configured to cause the user to change the user's attention.
Abstract: Systems and methods for validation of modeling and simulation systems that provide for the virtual fitting of wearable devices, such as glasses, by a user. Three-dimensional modeling and simulation of a test subject both with and without fitting frames corresponding to a wearable device may be captured to validate the modeling and simulation modules and associated algorithms and machine learning modules used to simulate the fit of the wearable device on a user. Validation in this manner may provide for increased accuracy/realism of the modeling and simulation systems.
Type:
Grant
Filed:
February 9, 2022
Date of Patent:
August 13, 2024
Assignee:
Google LLC
Inventors:
Idris Syed Aleem, Rees Anwyl Samuel Simmons, Cory Stegelmeier, Philip Lindsley Davidson
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for analyzing images for generating query responses. One of the methods includes determining, using a textual query, an image category for images responsive to the textual query, and an output type that identifies a type of requested content; selecting, using data that associates a plurality of images with a corresponding category, a subset of the images that each belong to the image category, each image in the plurality of images belonging to one of the two or more categories; analyzing, using the textual query, data for the images in the subset of the images to determine images responsive to the textual query; determining a response to the textual query using the images responsive to the textual query; and providing, using the output type, the response to the textual query for presentation.
Type:
Grant
Filed:
February 16, 2023
Date of Patent:
August 13, 2024
Assignee:
GOOGLE LLC
Inventors:
Gokhan H. Bakir, Marcin Bortnik, Malte Nuhn, Kavin Karthik Ilangovan
Abstract: A mobile device includes a panel audio loudspeaker including a display panel and an actuator coupled to the display panel. The mobile device includes a temperature sensor arranged to sense a temperature of the display panel, and an electronic control module in communication with the actuator and the temperature sensor. The electronic control module is programmed to perform operations including: obtaining, from the temperature sensor, data indicating a temperature of the display panel; and based on the data indicating the temperature of the display panel, adjusting a power signal provided to the actuator to drive the panel audio loudspeaker. The power signal can be adjusted by selecting, based on the data indicating the temperature of the display panel, a target temperature of the display panel; mapping the target temperature to a target power level; and changing the power signal provided to the actuator to the target power level.
Type:
Grant
Filed:
December 14, 2020
Date of Patent:
August 13, 2024
Assignee:
Google LLC
Inventors:
Jennis Jose, Ian Peter Lewis, Chia Cheng Weng, TeYuan Wang, Dominic Todd Pinchott, Paul Brian Crosbie, Wael Essam Enan, Michael Scot Pate
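The three-step power adjustment in the abstract above (select a target temperature from the sensed temperature, map it to a target power level, then change the power signal) can be sketched as follows. All thresholds, target temperatures, and power levels are made-up illustrative values, and the function names are hypothetical.

```python
# Hypothetical mapping from target panel temperature (deg C) to actuator power (W).
POWER_FOR_TARGET_W = {45.0: 1.0, 42.0: 0.6, 38.0: 0.3}


def select_target_temperature(measured_c):
    """Pick a target panel temperature from the sensed temperature.

    The thresholds are assumptions for this sketch: a cool panel is allowed a
    higher target, while a hot panel is steered toward a lower one.
    """
    if measured_c < 35.0:
        return 45.0
    if measured_c < 45.0:
        return 42.0
    return 38.0


def adjust_power(measured_c):
    """Return the power level the actuator's power signal should be changed to."""
    target_c = select_target_temperature(measured_c)
    return POWER_FOR_TARGET_W[target_c]
```

With these illustrative values, a cool panel is driven at full power and a hot panel at reduced power; an actual device would presumably use calibrated tables rather than three fixed steps.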