Patents by Inventor Gaurav Bhaya

Gaurav Bhaya has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240119958
    Abstract: Coordinating signal processing among computing devices in a voice-driven computing environment is provided. A first and second digital assistant can detect an input audio signal, perform a signal quality check, and provide indications that the first and second digital assistants are operational to process the input audio signal. A system can select the first digital assistant for further processing. The system can receive, from the first digital assistant, data packets including a command. The system can generate, for a network connected device selected from a plurality of network connected devices, an action data structure based on the data packets, and transmit the action data structure to the selected network connected device.
    Type: Application
    Filed: October 17, 2023
    Publication date: April 11, 2024
    Inventors: Anshul Kothari, Gaurav Bhaya, Tarun Jain
  • Patent number: 11949733
    Abstract: Routing packetized actions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate an action data structure. The system can transmit the action data structure to a third party provider device. The system can receive an indication from the third party provider device that a communication session was established with the device.
    Type: Grant
    Filed: February 4, 2021
    Date of Patent: April 2, 2024
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Robert James Stets, Jr.
  • Patent number: 11948572
    Abstract: Modulating packetized audio signals in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate a first action data structure. The system can identify a content item object based on the trigger keyword, and generate an output signal comprising a first portion corresponding to the first action data structure and a second portion corresponding to the content item object. The system can apply a modulation to the first or second portion of the output signal, and transmit the modulated output signal to the device.
    Type: Grant
    Filed: October 24, 2022
    Date of Patent: April 2, 2024
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Robert Stets
  • Patent number: 11930050
    Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
    Type: Grant
    Filed: July 1, 2022
    Date of Patent: March 12, 2024
    Assignee: GOOGLE LLC
    Inventors: Justin Lewis, Richard Rapp, Gaurav Bhaya, Robert Stets
  • Patent number: 11880442
    Abstract: The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.
    Type: Grant
    Filed: December 6, 2021
    Date of Patent: January 23, 2024
    Assignee: GOOGLE LLC
    Inventors: Ken Krieger, Andrew Joseph Alexander Gildfind, Nicholas Salvatore Arini, Simon Michael Rowe, Raimundo Mirisola, Gaurav Bhaya, Robert Stets
  • Patent number: 11823704
    Abstract: Coordinating signal processing among computing devices in a voice-driven computing environment is provided. A first and second digital assistant can detect an input audio signal, perform a signal quality check, and provide indications that the first and second digital assistants are operational to process the input audio signal. A system can select the first digital assistant for further processing. The system can receive, from the first digital assistant, data packets including a command. The system can generate, for a network connected device selected from a plurality of network connected devices, an action data structure based on the data packets, and transmit the action data structure to the selected network connected device.
    Type: Grant
    Filed: March 18, 2021
    Date of Patent: November 21, 2023
    Assignee: GOOGLE LLC
    Inventors: Anshul Kothari, Gaurav Bhaya, Tarun Jain
  • Patent number: 11798555
    Abstract: A system of reducing transmissions of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify candidate interfaces and determine if prior instances of the packetized data was transmitted to the candidate interfaces. The interface management component can prevent the transmission of the packetized data if determined to be redundant, such as having previously received the data, and instead transmit it to a separate client device of a different device type.
    Type: Grant
    Filed: August 3, 2021
    Date of Patent: October 24, 2023
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Tarun Jain, Anshul Kothari
  • Publication number: 20230246936
    Abstract: A selective sensor polling system for a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request. The system can select a template for an action data structure with a plurality of fields. The system can determine to poll a first sensor for data for the first field. The system can determine to obtain data in memory previously collected by the second sensor. The system can generate and transmit the action data structure with the data from the sensor and memory, and transmit the action data structure to a third party device.
    Type: Application
    Filed: April 10, 2023
    Publication date: August 3, 2023
    Inventors: Gaurav Bhaya, Robert Stets
  • Patent number: 11705121
    Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
    Type: Grant
    Filed: July 23, 2020
    Date of Patent: July 18, 2023
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Robert Stets, Umesh Patil
  • Patent number: 11705127
    Abstract: Coordinating signal processing among computing devices in a voice-driven computing environment is provided. A first and second digital assistant can detect an input audio signal, perform a signal quality check, and provide indications that the first and second digital assistants are operational to process the input audio signal. A system can select the first digital assistant for further processing. The system can receive, from the first digital assistant, data packets including a command. The system can generate, for a network connected device selected from a plurality of network connected devices, an action data structure based on the data packets, and transmit the action data structure to the selected network connected device.
    Type: Grant
    Filed: June 11, 2021
    Date of Patent: July 18, 2023
    Assignee: GOOGLE LLC
    Inventors: Anshul Kothari, Gaurav Bhaya, Tarun Jain
  • Publication number: 20230186920
    Abstract: Processing stacked data structures is provided. A system receives an input audio signal detected by a sensor of a local computing device, identifies an acoustic signature, and identifies an account corresponding to the signature. The system establishes a session and a profile stack data structure including a first profile layer having policies configured by a third-party device. The system pushes, to the profile stack data structure, a second profile layer retrieved from the account. The system parses the input audio signal to identify a request and a trigger keyword. The system generates, based on the trigger keyword and the second profile layer, a first action data structure compatible with the first profile layer. The system provides the first action data structure for execution. The system disassembles the profile stack data structure to remove the first profile layer or the second profile layer from the profile stack data structure.
    Type: Application
    Filed: January 13, 2023
    Publication date: June 15, 2023
    Inventors: Anshul Kothari, Gaurav Bhaya, Tarun Jain
  • Patent number: 11664025
    Abstract: The present disclosure is generally directed to the generation of voice-activated data flows in interconnected network. The voice-activated data flows can include input audio signals that include a request and are detected at a client device. The client device can transmit the input audio signal to a data processing system, where the input audio signal can be parsed and passed to the data processing system of a service provider to fulfill the request in the input audio signal. The present solution is configured to conserve network resources by reducing the number of network transmissions needed to fulfill a request.
    Type: Grant
    Filed: May 28, 2021
    Date of Patent: May 30, 2023
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Ulas Kirazci, Bradley Abrams, Adam Coimbra, Ilya Firman, Carey Radebaugh
  • Publication number: 20230111040
    Abstract: Modulating packetized audio signals in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate a first action data structure. The system can identify a content item object based on the trigger keyword, and generate an output signal comprising a first portion corresponding to the first action data structure and a second portion corresponding to the content item object. The system can apply a modulation to the first or second portion of the output signal, and transmit the modulated output signal to the device.
    Type: Application
    Filed: October 24, 2022
    Publication date: April 13, 2023
    Inventors: Gaurav Bhaya, Robert Stets
  • Patent number: 11625402
    Abstract: Systems and methods of voice activated thread management in a voice activated data packet based environment are provided. A natural language processor (“NLP”) component can receive and parse data packets comprising a first input audio signal to identify a first request and a first trigger keyword. A direct action application programming interface (“API”) can generate a first action data structure with a parameter defining a first action. The NLP component can receive and parse a second input audio signal to identify a second request and a second trigger keyword, and can generate a second action data structure with a parameter defining a second action. A pooling component can generate the first and second action data structures into a pooled data structure, and can transmit the pooled data structure to a service provider computing device to cause it device to perform an operation defined by the pooled data structure.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: April 11, 2023
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Robert Stets
  • Patent number: 11627065
    Abstract: A selective sensor polling system for a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request. The system can select a template for an action data structure with a plurality of fields. The system can determine to poll a first sensor for data for the first field. The system can determine to obtain data in memory previously collected by the second sensor. The system can generate and transmit the action data structure with the data from the sensor and memory, and transmit the action data structure to a third party device.
    Type: Grant
    Filed: January 19, 2021
    Date of Patent: April 11, 2023
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Robert Stets
  • Patent number: 11610045
    Abstract: Systems and methods for automatically determining a content item size may be based on a size of a viewport and a width of a parent element. A script may be configured to determine a size of a viewport, determine a width of a parent element of a resource, and determine a content item size based, at least in part, on the size of the view port and the width of the parent element. A dimension of the determined content item size may be used by a content item selection system to determine a set of content items. A content item selection system may select a content item from the determined set of content items and serve data to effect display of the selected content item in the parent element with the resource.
    Type: Grant
    Filed: August 3, 2021
    Date of Patent: March 21, 2023
    Assignee: GOOGLE LLC
    Inventors: Graeme John Rimmer, Lewis Jay Hemens, Gaurav Bhaya, Robert Stets
  • Patent number: 11557302
    Abstract: Processing stacked data structures is provided. A system receives an input audio signal detected by a sensor of a local computing device, identifies an acoustic signature, and identifies an account corresponding to the signature. The system establishes a session and a profile stack data structure including a first profile layer having policies configured by a third-party device. The system pushes, to the profile stack data structure, a second profile layer retrieved from the account. The system parses the input audio signal to identify a request and a trigger keyword. The system generates, based on the trigger keyword and the second profile layer, a first action data structure compatible with the first profile layer. The system provides the first action data structure for execution. The system disassembles the profile stack data structure to remove the first profile layer or the second profile layer from the profile stack data structure.
    Type: Grant
    Filed: February 24, 2020
    Date of Patent: January 17, 2023
    Assignee: GOOGLE LLC
    Inventors: Anshul Kothari, Tarun Jain, Gaurav Bhaya
  • Patent number: 11514907
    Abstract: The present disclosure is generally directed to the generation of voice-activated data flows in interconnected network. The voice-activated data flows can include input audio signals that include a request and are detected at a client device. The client device can transmit the input audio signal to a data processing system, where the input audio signal can be parsed and passed to the data processing system of a service provider to fulfill the request in the input audio signal. The present solution is configured to conserve network resources by reducing the number of network transmissions needed to fulfill a request.
    Type: Grant
    Filed: April 28, 2020
    Date of Patent: November 29, 2022
    Assignee: GOOGLE LLC
    Inventors: Gaurav Bhaya, Ulas Kirazci, Bradley Abrams, Adam Coimbra, Ilya Firman, Carey Radebaugh
  • Patent number: 11508371
    Abstract: Processing stacked data structures is provided. A system receives an input audio signal detected by a sensor of a local computing device, identifies an acoustic signature, and identifies an account corresponding to the signature. The system establishes a session and a profile stack data structure including a first profile layer having policies configured by a third-party device. The system pushes, to the profile stack data structure, a second profile layer retrieved from the account. The system parses the input audio signal to identify a request and a trigger keyword. The system generates, based on the trigger keyword and the second profile layer, a first action data structure compatible with the first profile layer. The system provides the first action data structure for execution. The system disassembles the profile stack data structure to remove the first profile layer or the second profile layer from the profile stack data structure.
    Type: Grant
    Filed: May 14, 2020
    Date of Patent: November 22, 2022
    Assignee: GOOGLE LLC
    Inventors: Anshul Kothari, Tarun Jain, Gaurav Bhaya, Ruxandra Davies, Lisa Takehana
  • Publication number: 20220353303
    Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
    Type: Application
    Filed: July 1, 2022
    Publication date: November 3, 2022
    Inventors: Justin Lewis, Richard Rapp, Gaurav Bhaya, Robert Stets