Patents Examined by Yosef K Laekemariam

Apparatus and method for screen related audio object remapping

Patent number: 12380903

Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
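Illustrative sketch (not taken from the patent): the routing described in the abstract can be pictured as a flag check followed by a position remap. The piecewise-linear mapping, the azimuth-only position, the names `AudioObject`, `remap_azimuth`, and `position_for_renderer`, and the default screen half-widths are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class AudioObject:
    azimuth_deg: float    # first position (azimuth only, for brevity)
    screen_related: bool  # flag carried in the object metadata


def remap_azimuth(az, ref_half_width_deg, screen_half_width_deg):
    """Assumed piecewise-linear map: reference screen edges land on local screen edges."""
    if abs(az) <= ref_half_width_deg:
        # On-screen region: scale linearly with the screen half-width.
        return az * screen_half_width_deg / ref_half_width_deg
    # Off-screen region: stretch or compress the remaining range up to 180 degrees.
    sign = 1.0 if az > 0 else -1.0
    frac = (abs(az) - ref_half_width_deg) / (180.0 - ref_half_width_deg)
    return sign * (screen_half_width_deg + frac * (180.0 - screen_half_width_deg))


def position_for_renderer(obj, ref_half_width_deg=29.0, screen_half_width_deg=15.0):
    """Feed the second (remapped) position only for screen-related objects."""
    if obj.screen_related:
        return remap_azimuth(obj.azimuth_deg, ref_half_width_deg, screen_half_width_deg)
    return obj.azimuth_deg  # not screen-related: first position passes through
```

With these assumed defaults, an object at the edge of the reference screen moves to the edge of the smaller local screen, while an object not flagged as screen-related keeps its original azimuth.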

Type: Grant

Filed: January 18, 2024

Date of Patent: August 5, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Simone Fueg, Jan Plogsties, Sascha Dick, Johannes Hilpert, Julien Robilliard, Achim Kuntz, Andreas Hoelzer

End-fire array microphone arrangements inside a vehicle

Patent number: 12363476

Abstract: The present application discloses exemplary arrangements of an end-fire microphone array inside a vehicle and exemplary digital signal processors configured for the end-fire in-car microphone array. The exemplary digital signal processors may be configured in two modes, a phone call mode and an automatic speech recognition mode. In each mode, the exemplary digital signal processors are improved for enhanced SNR. Different end-fire microphone array arrangements are also disclosed.

Type: Grant

Filed: August 15, 2022

Date of Patent: July 15, 2025

Assignee: VUI.AI Inc

Inventor: Dashen Fan

Colorless generation of elevation perceptual cues using all-pass filter networks

Patent number: 12363499

Abstract: A system includes one or more computing devices that encode spatial perceptual cues into a monaural channel to generate a plurality of output channels. A computing device determines a target amplitude response for the mid and side channels of the plurality of output channels, defining a spatial perceptual cue associated with one or more frequency-dependent phase shifts. The computing device determines a transfer function of a single-input, multi-output allpass filter based on the target amplitude response and determines coefficients of the allpass filter based on the transfer function, and processes the monaural channel with the coefficients of the allpass filter to generate the plurality of channels having the encoded spatial perceptual cues. The allpass filter is configured to be colorless with respect to the individual output channels, allowing for the placement of spatial cues into the audio stream to be decoupled from the overall coloration of the audio.

Type: Grant

Filed: July 7, 2022

Date of Patent: July 15, 2025

Assignee: Boomcloud 360 Inc.

Inventors: Zachary Seldess, Joseph Anthony Mariglio, III

Audio signal processing method and audio signal processing device

Patent number: 12363493

Abstract: The audio signal processing method in accordance with one embodiment receives an audio signal, obtains a first image, estimates room information based on the obtained first image, sets an acoustic parameter according to the estimated room information, applies sound processing to the audio signal according to the set acoustic parameter, and outputs the audio signal subjected to the sound processing.

Type: Grant

Filed: March 7, 2023

Date of Patent: July 15, 2025

Assignee: YAMAHA CORPORATION

Inventors: Satoshi Ukai, Masashi Suzuki

Transitioning of meeting engagement by transitioning viewing perspectives to a temporary viewing perspective showing clusters of group activity

Patent number: 12356117

Abstract: The disclosed techniques provide promotion of engagement by transitioning viewing perspectives to a temporary viewing perspective showing clusters of group activity. A system shows each person a view of a large virtual environment, e.g., in a stadium full of representations of meeting attendees. Each person sees the virtual environment from a point of view originating from each person's representation, e.g., a first-person avatar view. When a group activity meets one or more conditions, the system determines parameters of a temporary perspective view from a point-of-view of a virtual camera based on the positions of people in a cluster of representations. The parameters of the temporary perspective view can be selected to show a threshold number of representations found in a cluster in a virtual environment. The system can then transition the view of each member of the group from a first-person perspective to the point-of-view of the virtual camera.

Type: Grant

Filed: December 29, 2022

Date of Patent: July 8, 2025

Assignee: Microsoft Technology Licensing, LLC

Inventor: Jason Thomas Faulkner

Session description for communication session

Patent number: 12356105

Abstract: There is provided generating, at a control device of a media communication session between participant devices, a session description for the communication session, the session description including media stream identifiers and at least one element including an arrangement of the media stream identifiers indicating a background media stream and associated one or more overlay media streams, and sending, by the control device, the session description to the participant devices for indicating the participant devices to associate the background media stream with one or more overlay media streams for rendering a scene with at least three degrees of freedom.

Type: Grant

Filed: April 1, 2021

Date of Patent: July 8, 2025

Assignee: Nokia Technologies Oy

Inventors: Sujeet Mate, Saba Ahsan, Igor Curcio

Virtual privacy curtain

Patent number: 12356114

Abstract: Presented herein are techniques to process an image for a video conference. A method includes obtaining an image from a camera, identifying a first person and a second person in the image, labeling pixels in the image belonging to the first person and to the second person, determining a first distance between the first person and the camera and a second distance between the second person and the camera, and based on the first distance and the second distance, selectively removing or blurring at least one of the first person and the second person from the image, before transmitting the image to an endpoint of a video conference, by manipulating the pixels in the image belonging to the at least one of the first person and the second person.

Type: Grant

Filed: October 17, 2022

Date of Patent: July 8, 2025

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Dario Cazzani, William Reed, Mitchell Mosure

Rendering binaural audio over multiple near field transducers

Patent number: 12356164

Abstract: An apparatus and method of rendering audio. A binaural signal is split on an amplitude weighting basis into a front binaural signal and a rear binaural signal, based on perceived position information of the audio. In this manner, the front-back differentiation of the binaural signal is improved.
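Illustrative sketch (not taken from the patent): one way to picture an amplitude-weighted front/rear split is to derive two complementary gains from the perceived azimuth and apply them to both ears. The raised-cosine weighting and the name `split_front_rear` are assumptions; the abstract does not specify the weighting function.

```python
import math


def split_front_rear(left, right, azimuth_deg):
    """Split a binaural pair into front and rear pairs by amplitude weighting.

    Assumed weighting: 1.0 for a source straight ahead (0 degrees),
    0.0 for a source directly behind (180 degrees).
    """
    w_front = 0.5 * (1.0 + math.cos(math.radians(azimuth_deg)))
    w_rear = 1.0 - w_front  # gains are complementary, so front + rear = input
    front = ([w_front * s for s in left], [w_front * s for s in right])
    rear = ([w_rear * s for s in left], [w_rear * s for s in right])
    return front, rear
```

Because the two gains sum to one, adding the front and rear signals back together recovers the original binaural pair in this sketch.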

Type: Grant

Filed: February 29, 2024

Date of Patent: July 8, 2025

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Mark F. Davis, Nicolas R. Tsingos, C. Phillip Brown

Method, an apparatus and a computer program product for video encoding and video decoding

Patent number: 12341840

Abstract: Some embodiments relate to a method including obtaining information of available quality features of a sender device for providing omnidirectional visual media content; selecting one or more of the available quality features; and providing an indication of the selected one or more of the available quality features. There is also provided a method including obtaining information of available quality features of a receiver device for processing omnidirectional visual media content; providing an indication of the available quality features; receiving from a sender device indication of available quality features of the sender device among the available quality features of the receiver device; selecting one of the quality features, which are available both in the sender device and in the receiver device; and providing an indication of the selected quality feature.

Type: Grant

Filed: March 24, 2021

Date of Patent: June 24, 2025

Assignee: Nokia Technologies Oy

Inventors: Sujeet Shyamsundar Mate, Saba Ahsan, Igor Curcio

Capturing and presenting audience response at scale

Patent number: 12335322

Abstract: One example method includes receiving, by the video conference provider, a plurality of responses during a time period, the plurality of responses each provided by one of a plurality of participants in a video conference, the video conference associated with a host and hosted by a video conference provider, and each of the plurality of responses associated with a timestamp and one of a plurality of types of responses, wherein the one of the types of responses comprises an audible response, a visual response, or a user input response. The method also includes sampling, by the video conference provider, the plurality of responses to identify a plurality of response characteristics and generating, by the video conference provider, a consolidated response associated with each timestamp, the consolidated response based at least in part on the plurality of response characteristics.

Type: Grant

Filed: May 2, 2024

Date of Patent: June 17, 2025

Assignee: Zoom Communications, Inc.

Inventors: Brendan James Ittelson, Pankaj Prasad

Matching active speaker pose between two cameras

Patent number: 12335608

Abstract: Described are multiple cameras in a conference room, each pointed in a different direction. A primary camera includes a microphone array to perform sound source localization (SSL). The SSL is used in combination with a video image to identify the speaker from among multiple individuals that appear in the video image. Pose information of the speaker is developed. Pose information of each individual identified in each other camera is developed. The speaker pose information is compared to the pose information of the individuals from the other cameras. The best match for each other camera is selected as the speaker in that camera. The speaker views of each camera are compared to determine the speaker view with the most frontal view of the speaker. That camera is selected to provide the video for provision to the far end.

Type: Grant

Filed: April 12, 2024

Date of Patent: June 17, 2025

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Jian David Wang, Xiangdong Wang, Varun Ajay Kulkarni

Headphone interaction with media playback system

Patent number: 12328543

Abstract: Example techniques may involve headphone interaction. An implementation may include while headphones are disconnected from a control device, the control device receiving an indication of particular audio content being played back by a first zone of the media playback system. While the particular audio content is being played back by the one or more playback devices of the media playback system, detecting that headphones have been connected to the first control device. Headphones may be connectable to the control device via either (a) an analog headphone jack or (b) a point-to-point personal area network connection. In response to detecting that the headphones are connected to the control device: the control device (i) causing the one or more playback devices of the first zone to stop playback of the particular audio content; (ii) retrieving the particular audio content and (iii) resuming playback of the particular audio content via the connected headphones.

Type: Grant

Filed: January 26, 2024

Date of Patent: June 10, 2025

Assignee: Sonos, Inc.

Inventors: Eric Clayton, Charles Alessi

Wearable skin vibration or silent gesture detector

Patent number: 12320642

Abstract: Disclosed herein are wearable devices, their configurations, and methods of operation that use self-mixing interferometry signals of a self-mixing interferometry sensor to recognize user inputs. The user inputs may include voiced commands or silent gesture commands. The devices may be wearable on the user's head, with the self-mixing interferometry sensor configured to direct a beam of light toward a location on the user's head. Skin deformations or vibrations at the location may be caused by the user's speech or the user's silent gestures and recognized using the self-mixing interferometry signal. The self-mixing interferometry signals may be used for bioauthentication and/or audio conditioning of received sound or voice inputs to a microphone.

Type: Grant

Filed: January 16, 2024

Date of Patent: June 3, 2025

Assignee: Apple Inc.

Inventors: Mehmet Mutlu, Ahmet Fatih Cihan

Technologies for video conferencing

Patent number: 12323558

Abstract: Techniques for video conferencing are disclosed. In one embodiment, a mirror assembly can be used to position a mirror in front of a camera, reflecting a view of the camera downward towards a whiteboard or other drawing surface, allowing a user to show a drawing during a video conference. In another embodiment, video data and audio data of a user in a video conference can be analyzed to determine what portions of the video data and audio data should be transmitted to a remote participant in the video conference. Unintentional input supplied by the user in the video data and audio data may be used to determine which portions of the video data and audio data to transmit.

Type: Grant

Filed: March 26, 2021

Date of Patent: June 3, 2025

Assignee: Intel Corporation

Inventors: Aleksander Magi, Soethiha Soe, Marko Bartscherer, Prakash Kurma Raju, Arvind Sundaram, Ayeshwarya B. Mahajan, Akarsha Rajeshwar Kadadevaramath, Gokul V. Subramaniam, Raghavendra Doddi, Prasanna Pichumani

Methods and apparatus for compressing and decompressing a higher order ambisonics representation

Patent number: 12317055

Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore, compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional coefficient sequences of the ambient HOA component, depending on what will result in optimum perceptual quality. This processing can change on a frame-by-frame basis.

Type: Grant

Filed: February 2, 2024

Date of Patent: May 27, 2025

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Sven Kordon, Alexander Krueger

Method and system for audio bridging with an output device

Patent number: 12294851

Abstract: A method performed by a first electronic device that includes a first speaker, the method including receiving, via a network, a representation of audio content, while a second electronic device is playing back the audio content through a second speaker, determining that the first electronic device is moving away from the second electronic device, and, in response to determining that the first electronic device is moving away from the second electronic device, using the representation of the audio content to play back the audio content through the first speaker.

Type: Grant

Filed: October 3, 2022

Date of Patent: May 6, 2025

Assignee: Apple Inc.

Inventors: Christopher T. Eubank, Ronald J. Guglielmone, Jr.

False-positive filter for speaker tracking during video conferencing

Patent number: 12289560

Abstract: A method including receiving, at a camera, light comprising a visible spectrum and an infrared spectrum (200). The method also includes detecting, using the visible spectrum, a candidate image of a head (202). The method also includes determining an estimated true size of the head (204). The method also includes determining, using the infrared spectrum, a temperature of the head (206). The method also includes filtering the candidate image of the head with respect to an operation of a video controller configured to track one or more images of heads in the visible spectrum. Filtering is based on both the estimated true size of the head and the temperature of the head (208).
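Illustrative sketch (not taken from the patent): the two-criterion filter in the abstract can be pictured as a size check combined with a temperature check. The pinhole-camera size estimate, the use of a distance estimate, the names `HeadCandidate`, `estimated_true_height_m`, and `is_plausible_head`, and every threshold value below are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class HeadCandidate:
    pixel_height: float   # height of the detected head box, in pixels (visible spectrum)
    distance_m: float     # assumed distance estimate from the camera, in meters
    temperature_c: float  # temperature read from the infrared spectrum, in Celsius


def estimated_true_height_m(c, focal_length_px):
    # Pinhole model (assumption): real size = pixel size * distance / focal length.
    return c.pixel_height * c.distance_m / focal_length_px


def is_plausible_head(c, focal_length_px=1000.0,
                      size_range_m=(0.15, 0.35), temp_range_c=(30.0, 40.0)):
    """Keep a candidate only if both its size and its temperature look human."""
    size = estimated_true_height_m(c, focal_length_px)
    size_ok = size_range_m[0] <= size <= size_range_m[1]
    temp_ok = temp_range_c[0] <= c.temperature_c <= temp_range_c[1]
    return size_ok and temp_ok
```

Under these assumed thresholds, a face printed on a poster would pass the size check but fail the temperature check, so a tracker using this filter would ignore it.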

Type: Grant

Filed: August 29, 2022

Date of Patent: April 29, 2025

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: Fahri Yaras

Systems and methods for multi-user video communication with engagement detection and adjustable fidelity

Patent number: 12279065

Abstract: In one embodiment, a method includes maintaining a video communication between client devices with each being associated with a respective video stream, which is associated with a respective match scale measured based on a height of frames of the video stream and a depth of subjects within the frames, determining a respective scaling factor and cropping for each video stream, wherein the respective scaling factor is calculated based on the match scale associated with the corresponding video stream and a target match scale determined from the match scales associated with all video streams, and wherein the respective cropping is determined based on a uniformity of positions of the subjects within the frames across all video streams, generating a merged video stream from each video stream based on the respective scaling factor and cropping, and sending instructions for presenting the merged video stream to one or more of the client devices.

Type: Grant

Filed: April 20, 2022

Date of Patent: April 15, 2025

Assignee: Samsung Electronics Co., Ltd.

Inventors: Dean N. Reading, Marc Estruch Tena, Lin Sun, Fulya Yilmaz, Cathy Kim, Hanna Fuhrmann, Catherine S. Kim, Jun Yeon Cho, Imran Mohammed, Curtis D. Aumiller

Freeze frame video presentation during a video call

Patent number: 12279072

Abstract: An electronic device, a method, and a computer program product provide freeze frame videos in place of a live video feed to a video communication session. A processor of the electronic device determines that a local participant is connected via the electronic device to an ongoing video communication session with second participants having corresponding second participant devices. The processor captures, via an image capturing device, local video encompassing the local participant. The processor determines a video segment of the local video to identify as a freeze frame video. The processor presents the freeze frame video to the video communication session in response to a trigger condition that pauses a presentation of live video feed of the local participant to the video communication session. The processor loops the presentation of the freeze frame video until an expiration of a threshold maximum time established for presenting the freeze frame video.

Type: Grant

Filed: September 28, 2022

Date of Patent: April 15, 2025

Assignee: Motorola Mobility LLC

Inventors: Amit Kumar Agrawal, Jeffrey T Snow, Olivier D Meirhaeghe, Bill Ryan

Visual image management

Patent number: 12279071

Abstract: Disclosed aspects relate to visual image management associated with a set of visual images with respect to a set of computer users. A set of visual image features that relates to both the set of visual images and the set of computer users may be detected. Based on the set of visual image features that relates to both the set of visual images and the set of computer users, a set of display attributes may be determined to construct a presentation configuration associated with the set of visual images with respect to the set of computer users. The set of display attributes may be provided to construct the presentation configuration to display the set of visual images with respect to the set of computer users.

Type: Grant

Filed: May 11, 2022

Date of Patent: April 15, 2025

Inventor: Brian R. Muras
