Patents by Inventor Jonathan Huang

Jonathan Huang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12272370
    Abstract: Implementations of the subject technology provide systems and methods for providing audio source separation for audio input, such as for audio devices having limited power and/or computing resources. The subject technology may allow an audio device to leverage processing and/or power resources of a companion device that is communicatively coupled to the audio device. The companion device may identify a noise condition of the audio device, select a source separation model based on the noise condition, and provide the source separation model to the audio device. In this way, the audio device can provide audio source separation functionality using a relatively small footprint source separation model that is specific to the noise condition in which the audio device is operated.
    Type: Grant
    Filed: October 3, 2023
    Date of Patent: April 8, 2025
    Assignee: Apple Inc.
    Inventors: Carlos M. Avendano, John Woodruff, Jonathan Huang, Mehrez Souden, Andreas Koutrouvelis
  • Patent number: 12207074
    Abstract: A method performed by an electronic device in a room. The method performs an enrollment process in which a spatial profile of a location of an artificial sound source is created and performs an identification process that determines whether a sound event within the room is produced by the artificial sound source by 1) capturing the sound event using a microphone array and 2) determining a likelihood that the sound event occurred at the location of the artificial sound source.
    Type: Grant
    Filed: December 7, 2023
    Date of Patent: January 21, 2025
    Assignee: Apple Inc.
    Inventors: Hassan Taherian, Jonathan Huang, Carlos M. Avendano
  • Patent number: 12182674
    Abstract: The subject disclosure provides systems and methods for providing locally trained models for detecting individual sounds using electronic devices. Local detection of individual sounds with a detection model at an electronic device can be provided by obtaining training samples for the detection model with the electronic device, and generating additional negative and positive training samples based on the obtained training samples. A two-stage detection process may be provided, in which a trigger model at a device compares an audio input to a reference sound to trigger a detection model at the device. The detection of individual sounds with a detection model at an electronic device can also leverage audio capture capabilities of multiple devices in an acoustic scene to capture multiple concurrent training samples.
    Type: Grant
    Filed: May 4, 2022
    Date of Patent: December 31, 2024
    Assignee: Apple Inc.
    Inventors: Jonathan Huang, Miquel Espi Marques, Carlos M. Avendano, Kevin M. Durand, David Findlay, Vasudha Kowtha, Daniel C. Klingler, Yichi Zhang
  • Publication number: 20240363094
    Abstract: A conversation detector processes microphone signals and other sensor signals of a headphone to declare a conversation and configures a filter block to activate a transparency audio signal. It then declares an end to the conversation based on processing one or more of the microphone signals and the other sensor signals, and in response deactivates the transparency audio signal. The conversation detector monitors an idle duration in which an OVAD and a TVAD are both or simultaneously indicating no activity and declares the end to the conversation in response to the idle duration being longer than an idle threshold. Other aspects are also described and claimed.
    Type: Application
    Filed: March 29, 2024
    Publication date: October 31, 2024
    Inventors: Ashok Masilamani, Prateek Murgai, John Woodruff, David M. Fischer, Jonathan D. Sheaffer, Jonathan Huang, Sorin V. Dusan, Andrew W. Malta, Erik D. Hornberger, Yichi Zhang, Miquel Espi Marques, Carlos M. Avendano
  • Publication number: 20240107254
    Abstract: A method performed by an electronic device in a room. The method performs an enrollment process in which a spatial profile of a location of an artificial sound source is created and performs an identification process that determines whether a sound event within the room is produced by the artificial sound source by 1) capturing the sound event using a microphone array and 2) determining a likelihood that the sound event occurred at the location of the artificial sound source.
    Type: Application
    Filed: December 7, 2023
    Publication date: March 28, 2024
    Inventors: Hassan Taherian, Jonathan Huang, Carlos M. Avendano
  • Publication number: 20240029754
    Abstract: Implementations of the subject technology provide systems and methods for providing audio source separation for audio input, such as for audio devices having limited power and/or computing resources. The subject technology may allow an audio device to leverage processing and/or power resources of a companion device that is communicatively coupled to the audio device. The companion device may identify a noise condition of the audio device, select a source separation model based on the noise condition, and provide the source separation model to the audio device. In this way, the audio device can provide audio source separation functionality using a relatively small footprint source separation model that is specific to the noise condition in which the audio device is operated.
    Type: Application
    Filed: October 3, 2023
    Publication date: January 25, 2024
    Inventors: Carlos M. AVENDANO, John WOODRUFF, Jonathan HUANG, Mehrez SOUDEN, Andreas KOUTROUVELIS
  • Patent number: 11860288
    Abstract: Methods, apparatus, systems, and articles of manufacture to detect the location of sound sources external to computing devices are disclosed. An apparatus, to determine a direction of a source of a sound relative to a computing device, includes a cross-correlation analyzer to generate a vector of values corresponding to a cross-correlation of first and second audio signals corresponding to the sound. The first audio signal is received from a first microphone of the computing device. The second audio signal is received from a second microphone of the computing device. The apparatus also includes a location analyzer to use a machine learning model and a set of the values of the vector to determine the direction of the source of the sound.
    Type: Grant
    Filed: June 26, 2020
    Date of Patent: January 2, 2024
    Assignee: INTEL CORPORATION
    Inventors: Hector Cordourier Maruri, Adam Kupryjanow, Karol Duzinkiewicz, Jose Rodrigo Camacho Perez, Paulo Lopez Meyer, Julio Zamora Esquivel, Alejandro Ibarra Von Borstel, Jonathan Huang
  • Patent number: 11863961
    Abstract: A method performed by an electronic device in a room. The method performs an enrollment process in which a spatial profile of a location of an artificial sound source is created and performs an identification process that determines whether a sound event within the room is produced by the artificial sound source by 1) capturing the sound event using a microphone array and 2) determining a likelihood that the sound event occurred at the location of the artificial sound source.
    Type: Grant
    Filed: December 5, 2022
    Date of Patent: January 2, 2024
    Assignee: Apple Inc.
    Inventors: Hassan Taherian, Jonathan Huang, Carlos M. Avendano
  • Publication number: 20230419538
    Abstract: A method includes receiving video data that includes a series of frames of image data. Here, the video data is representative of an actor performing an activity. The method also includes processing the video data to generate a spatial input stream including a series of spatial images representative of spatial features of the actor performing the activity, a temporal input stream representative of motion of the actor performing the activity, and a pose input stream including a series of images representative of a pose of the actor performing the activity. Using at least one neural network, the method also includes processing the temporal input stream, the spatial input stream, and the pose input stream. The method also includes classifying, by the at least one neural network, the activity based on the temporal input stream, the spatial input stream, and the pose input stream.
    Type: Application
    Filed: September 11, 2023
    Publication date: December 28, 2023
    Applicant: Google LLC
    Inventors: Yinxiao Li, Zhichao Lu, Xuehan Xiong, Jonathan Huang
  • Publication number: 20230360641
    Abstract: The subject disclosure provides systems and methods for generating and storing learned embeddings of audio inputs to an electronic device. The electronic device may generate and store encoded versions of audio inputs and learned embeddings of the audio inputs. When a new audio input is obtained, the electronic device can generate an encoded version of the new audio input, compare the encoded version of the new audio input to the stored encoded versions of prior audio inputs, and if the encoded version of the new audio input matches one of the stored encoded versions of the prior audio inputs, the electronic device can provide a stored learned embedding that corresponds to the one of the stored encoded versions of the prior audio inputs to a detection model at the electronic device. The cached embeddings can be provided to locally trained models for detecting individual sounds using electronic devices.
    Type: Application
    Filed: May 4, 2022
    Publication date: November 9, 2023
    Inventors: Daniel C. KLINGLER, Carlos M. AVENDANO, Jonathan HUANG, Miquel ESPI MARQUES
  • Patent number: 11810588
    Abstract: Implementations of the subject technology provide systems and methods for providing audio source separation for audio input, such as for audio devices having limited power and/or computing resources. The subject technology may allow an audio device to leverage processing and/or power resources of a companion device that is communicatively coupled to the audio device. The companion device may identify a noise condition of the audio device, select a source separation model based on the noise condition, and provide the source separation model to the audio device. In this way, the audio device can provide audio source separation functionality using a relatively small footprint source separation model that is specific to the noise condition in which the audio device is operated.
    Type: Grant
    Filed: January 31, 2022
    Date of Patent: November 7, 2023
    Assignee: Apple Inc.
    Inventors: Carlos M. Avendano, John Woodruff, Jonathan Huang, Mehrez Souden, Andreas Koutrouvelis
  • Patent number: 11776156
    Abstract: A method includes receiving video data that includes a series of frames of image data. Here, the video data is representative of an actor performing an activity. The method also includes processing the video data to generate a spatial input stream including a series of spatial images representative of spatial features of the actor performing the activity, a temporal input stream representative of motion of the actor performing the activity, and a pose input stream including a series of images representative of a pose of the actor performing the activity. Using at least one neural network, the method also includes processing the temporal input stream, the spatial input stream, and the pose input stream. The method also includes classifying, by the at least one neural network, the activity based on the temporal input stream, the spatial input stream, and the pose input stream.
    Type: Grant
    Filed: June 11, 2021
    Date of Patent: October 3, 2023
    Assignee: Google LLC
    Inventors: Yinxiao Li, Zhichao Lu, Xuehan Xiong, Jonathan Huang
  • Patent number: 11684516
    Abstract: Hearing protection and communication apparatus using vibration sensors are disclosed. An example wearable electronic device includes means for transducing vibrations associated with speech into a first signal; means for transducing sound associated with ambient noise into a second signal; and means for processing to cause a speaker to output a signal to reduce the ambient noise; detect an identifier in the speech; and cause audio data representative of the speech to be transmitted to a second device associated with the identifier.
    Type: Grant
    Filed: May 17, 2021
    Date of Patent: June 27, 2023
    Assignee: Intel Corporation
    Inventors: Willem M. Beltman, Hector A. Cordourier Maruri, Paulo Lopez Meyer, Jonathan Huang
  • Patent number: 11676974
    Abstract: An electronic device has a display substrate including a display area, a driver area, and a fan-out area. The fan-out area has interconnects providing electrical accesses to display elements of the display area. The device has a driver chip disposed on the driver area. The driver chip includes a first edge adjacent to the display area and multiple pad groups, each pad group including a respective row of electronic pads that is (i) arranged substantially in parallel with the first edge and (ii) electrically coupled to a respective subset of display elements via respective interconnects routed on a respective region of the fan-out area. The pad groups include a first pad group and a second pad group. The first and second pad groups have two different distances from the first edge and correspond to two different subsets of interconnects routed on two non-overlapping regions of the fan-out area.
    Type: Grant
    Filed: September 28, 2021
    Date of Patent: June 13, 2023
    Assignee: PARADE TECHNOLOGIES, LTD.
    Inventors: Yueh-Lin Yang, Haijun Chen, Tatao Hsu, Jonathan Huang
  • Patent number: 11592715
    Abstract: An electronic device has a display screen and a driver chip disposed on a driver area of the display screen. A fan-out area of the display screen has interconnects configured to provide electrical accesses to display elements of the display area. The driver chip includes a first edge, a second edge, and a row of electronic pads proximate to the first edge. The electronic pads have a first subset of end pads at a first end of the first row, a second subset of end pads at a second opposite end of the first row, and a subset of intermediate pads located between the first subset and second subset of end pads. The first subset of end pads physically contact a first subset of interconnects from the first edge, and the subset of intermediate pads physically contact a second subset of interconnects from the one or more second edges.
    Type: Grant
    Filed: May 28, 2021
    Date of Patent: February 28, 2023
    Assignee: PARADE TECHNOLOGIES, LTD.
    Inventors: Yueh-Lin Yang, Quan Yu, Haijun Chen, Tatao Hsu, Jonathan Huang
  • Patent number: 11544463
    Abstract: An embodiment of a spoken intent detection device includes technology to detect a phrase in an electronic representation of an audio stream based on a pre-defined vocabulary, associate a time stamp with the detected phrase, and classify a spoken intent based on a sequence of detected phrases and the respective associated time stamps. Other embodiments are disclosed and claimed.
    Type: Grant
    Filed: May 9, 2019
    Date of Patent: January 3, 2023
    Assignee: Intel Corporation
    Inventors: Munir Georges, Wenda Chen, Tobias Bocklet, Jonathan Huang
  • Patent number: 11539935
    Abstract: In one embodiment, a computing system may receive, from a second computing system, video streams of a scene, the video streams including at least a first image and a second image that are simultaneously captured by a first camera and a second camera of the second computing system, respectively. The system may determine, using a sensor system, a viewpoint of a viewer with respect to a display region of a monoscopic display associated with the first computing system. The system may generate an output image of the scene by blending, according to blending proportions computed using the viewpoint of the viewer, corresponding portions of the first image and the second image. The system may display the output image in the display region of the monoscopic display.
    Type: Grant
    Filed: December 2, 2020
    Date of Patent: December 27, 2022
    Assignee: Meta Platforms Technologies, LLC
    Inventors: Andrew Garrod Bosworth, Timo Juhani Ahonen, Brian Keith Cabral, Ryan Cairns, Andrea Colaco, Jonathan Huang, Michael Fredrick Cohen
  • Patent number: 11533577
    Abstract: A method performed by an electronic device in a room. The method performs an enrollment process in which a spatial profile of a location of an artificial sound source is created and performs an identification process that determines whether a sound event within the room is produced by the artificial sound source by 1) capturing the sound event using a microphone array and 2) determining a likelihood that the sound event occurred at the location of the artificial sound source.
    Type: Grant
    Filed: May 20, 2021
    Date of Patent: December 20, 2022
    Assignee: APPLE INC.
    Inventors: Hassan Taherian, Jonathan Huang, Carlos M. Avendano
  • Publication number: 20220391758
    Abstract: The subject disclosure provides systems and methods for providing locally trained models for detecting individual sounds using electronic devices. Local detection of individual sounds with a detection model at an electronic device can be provided by obtaining training samples for the detection model with the electronic device, and generating additional negative and positive training samples based on the obtained training samples. A two-stage detection process may be provided, in which a trigger model at a device compares an audio input to a reference sound to trigger a detection model at the device. The detection of individual sounds with a detection model at an electronic device can also leverage audio capture capabilities of multiple devices in an acoustic scene to capture multiple concurrent training samples.
    Type: Application
    Filed: May 4, 2022
    Publication date: December 8, 2022
    Inventors: Jonathan HUANG, Miquel ESPI MARQUES, Carlos M. AVENDANO, Kevin M. DURAND, David FINDLAY, Vasudha KOWTHA, Daniel C. KLINGLER, Yichi ZHANG
  • Publication number: 20220384489
    Abstract: An electronic device has a display substrate including a display area, a driver area, and a fan-out area. The fan-out area has interconnects providing electrical accesses to display elements of the display area. The device has a driver chip disposed on the driver area. The driver chip includes a first edge adjacent to the display area and multiple pad groups, each pad group including a respective row of electronic pads that is (i) arranged substantially in parallel with the first edge and (ii) electrically coupled to a respective subset of display elements via respective interconnects routed on a respective region of the fan-out area. The pad groups include a first pad group and a second pad group. The first and second pad groups have two different distances from the first edge and correspond to two different subsets of interconnects routed on two non-overlapping regions of the fan-out area.
    Type: Application
    Filed: September 28, 2021
    Publication date: December 1, 2022
    Inventors: Yueh-Lin Yang, Haijun Chen, Tatao Hsu, Jonathan Huang