Audio To Video Patents (Class 348/515)
  • Patent number: 11461072
    Abstract: A movie theater audio distribution system includes a visual display associated with a theater, the visual display having a transmitter; a server to control the visual display; a computer having an audio programming platform to command the server to control the visual display; a headphone device having a transceiver to communicate wirelessly with the server, the headphone device having a control system to receive commands from the server; the audio programming platform provides a way to command the server to transmit audio to the headphone device correlated to the visual display; and the transmitter is to wirelessly communicate with the transceiver, thereby activating the audio associated with the visual display to play through the headphone device when the headphone device is in close proximity to the visual display.
    Type: Grant
    Filed: September 10, 2020
    Date of Patent: October 4, 2022
    Inventor: Stacey Castillo
  • Patent number: 11443739
    Abstract: Coordinated operation of a voice-controlled device and an accessory device in an environment is described. A remote system processes audio data it receives from the voice-controlled device in the environment to identify a first intent associated with a first domain, a second intent associated with a second domain, and a named entity associated with the audio data. The remote system sends, to the voice-controlled device, first information for accessing main content associated with the named entity, and a first instruction corresponding to the first intent. The remote system also sends, to the accessory device, second information for accessing control information or supplemental content associated with the main content, and a second instruction corresponding to the second intent. The first and second instructions, when processed by the devices in the environment, cause coordinated operation of the voice-controlled device and the accessory device.
    Type: Grant
    Filed: October 30, 2019
    Date of Patent: September 13, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Derick Deller, Apoorv Naik, Zoe Adams, Aslan Appleman, Link Cornelius, Pete Klein
  • Patent number: 11436780
    Abstract: A method for matching mouth shape and movement in digital video to alternative audio includes deriving a sequence of facial poses including mouth shapes for an actor from a source digital video. Each pose in the sequence of facial poses corresponds to a middle position of each audio sample. The method further includes generating an animated face mesh based on the sequence of facial poses and the source digital video, transferring tracked expressions from the animated face mesh or the target video to the source video, and generating a rough output video that includes transfers of the tracked expressions. The method further includes generating a finished video at least in part by refining the rough video using a parametric autoencoder trained on mouth shapes in the animated face mesh or the target video. One or more computers may perform the operations of the method.
    Type: Grant
    Filed: November 23, 2020
    Date of Patent: September 6, 2022
    Assignee: WARNER BROS. ENTERTAINMENT INC.
    Inventors: Tom David Stratton, Shaun Lile
  • Patent number: 11403676
    Abstract: Provided herein are systems and methods of classifying video content. At least one server can identify a video content item identifying a plurality of segments to play primary video content. The at least one server can identify a set of words from a segment of the plurality of segments by using at least one of a transcription corresponding to the segment or speech recognition on audio content corresponding to the segment. The at least one server can determine a classification for the segment based on the set of words from the segment. The at least one server can store, in one or more data structures, an association between the video content item and the classification to categorize the segment of the video content item.
    Type: Grant
    Filed: June 8, 2020
    Date of Patent: August 2, 2022
    Assignee: GOOGLE LLC
    Inventors: Jason S. Bayer, Ronojoy Chakrabarti, Keval Desai, Manish P Gupta, Jill A Huchital, Willard V T Rusch, II
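    The classification step described in patent 11403676 can be illustrated with a minimal sketch: assuming a segment's transcript words have already been obtained (from captions or speech recognition), a simple bag-of-words scorer assigns the segment to whichever category its words match best. The categories, keywords, and function names below are illustrative assumptions, not taken from the patent.
    ```python
    from collections import Counter

    # Hypothetical category keyword lists; a real system would configure or
    # learn these rather than hard-code them.
    CATEGORY_KEYWORDS = {
        "sports": {"goal", "score", "team", "match", "player"},
        "cooking": {"recipe", "oven", "ingredient", "bake", "simmer"},
        "news": {"report", "government", "election", "statement"},
    }

    def classify_segment(transcript_words):
        """Return the category whose keywords overlap most with the segment's words."""
        words = Counter(w.lower() for w in transcript_words)
        scores = {
            category: sum(words[kw] for kw in keywords)
            for category, keywords in CATEGORY_KEYWORDS.items()
        }
        best = max(scores, key=scores.get)
        return best if scores[best] > 0 else "unclassified"

    # Example: words extracted from one segment's transcript.
    segment_words = "the team scored a late goal to win the match".split()
    print(classify_segment(segment_words))  # -> "sports"
    ```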
  • Patent number: 11343552
    Abstract: Systems and methods of the present disclosure provide for dynamic delay equalization of related media signals in a media transport system. Methods include receiving a plurality of related media signals, transporting the related media signals along different media paths, calculating uncorrected propagation delays for the media paths, and delaying each of the related media signals by an amount related to the difference between the longest propagation delay (of the uncorrected propagation delays) and the uncorrected propagation delay of the related media signal/media path. Calculating the uncorrected propagation delays and delaying the related media signals may be performed in response to a change to the propagation delay of at least one of the related media signals/media paths. Additionally or alternatively, calculating the uncorrected propagation delays and delaying the related media signals may be performed while transporting the related media signals.
    Type: Grant
    Filed: September 1, 2020
    Date of Patent: May 24, 2022
    Assignee: Biamp Systems, LLC
    Inventors: Eugene Gurfinkel, Michael K. Davis, Charles H. Van Dusen
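    The equalization rule described in patent 11343552 is simple arithmetic: each signal is padded so that every media path ends up with the same total latency as the slowest path. A minimal sketch, with made-up path delays in milliseconds:
    ```python
    def equalization_delays(path_delays_ms):
        """Return the extra delay to add to each path so all paths match the slowest one."""
        longest = max(path_delays_ms)
        return [longest - d for d in path_delays_ms]

    # Hypothetical uncorrected propagation delays for three media paths.
    uncorrected = [12.0, 37.5, 21.0]
    print(equalization_delays(uncorrected))  # -> [25.5, 0.0, 16.5]
    ```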
  • Patent number: 11342002
    Abstract: An automated solution to determine suitable time ranges or timestamps for captions is described. In one example, a content file includes subtitle data with captions for display over respective timeframes of video. Audio data is extracted from the video, and the audio data is compared against a sound threshold to identify auditory timeframes in which sound is above the threshold. The subtitle data is also parsed to identify subtitle-free timeframes in the video. A series of candidate time ranges is then identified based on overlapping ranges of the auditory timeframes and the subtitle-free timeframes. In some cases, one or more of the candidate time ranges can be merged together or omitted, and a final series of time ranges or timestamps for captions is obtained. The time ranges or timestamps can be used to add additional non-verbal and contextual captions and indicators, for example, or for other purposes.
    Type: Grant
    Filed: December 5, 2018
    Date of Patent: May 24, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Prabhakar Gupta, Shaktisingh P Shekhawat, Kumar Keshav
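    The core of patent 11342002 is an interval intersection: take the timeframes where audio is above a loudness threshold, take the timeframes not covered by subtitles, and keep the overlaps as candidate caption slots. A minimal sketch assuming both sets of timeframes have already been computed; the data and function name are illustrative.
    ```python
    def intersect_intervals(a, b):
        """Return overlapping ranges of two sorted lists of (start, end) intervals."""
        out, i, j = [], 0, 0
        while i < len(a) and j < len(b):
            start = max(a[i][0], b[j][0])
            end = min(a[i][1], b[j][1])
            if start < end:
                out.append((start, end))
            # Advance whichever interval ends first.
            if a[i][1] < b[j][1]:
                i += 1
            else:
                j += 1
        return out

    # Hypothetical timeframes (seconds): audio above threshold vs. subtitle-free gaps.
    auditory = [(0.0, 4.0), (10.0, 18.0), (25.0, 30.0)]
    subtitle_free = [(2.0, 12.0), (26.0, 40.0)]
    print(intersect_intervals(auditory, subtitle_free))
    # -> [(2.0, 4.0), (10.0, 12.0), (26.0, 30.0)]
    ```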
  • Patent number: 11330151
    Abstract: An apparatus, method and computer program product for receiving captured visual information comprising a representation of an object, receiving captured audio information associated with the object, determining a user awareness parameter indicating a level of user comprehension of a context of capturing the visual information and the audio information and selecting, based on the user awareness parameter, a type of synchronization of the captured audio information with respect to the captured visual information.
    Type: Grant
    Filed: April 10, 2020
    Date of Patent: May 10, 2022
    Assignee: Nokia Technologies Oy
    Inventors: Matti Kajala, Antero Tossavainen, Mikko Olavi Heikkinen, Miikka Tapani Vilermo
  • Patent number: 11328011
    Abstract: A method includes computing match scores for each portion of multiple portions of a first audio fingerprint. The match scores are based on a comparison of the portion with each of multiple portions of a second audio fingerprint. The method includes generating a list of runs based on the highest score for each portion of the multiple portions of the first audio fingerprint. The method includes determining, based on the list of runs, an unordered match between a set of consecutive portions of the first audio fingerprint and a set of non-consecutive portions of the second audio fingerprint. The method includes, in response to determining that a position threshold of the unordered match satisfies a position criterion, outputting an indicator that the first audio fingerprint matches the second audio fingerprint.
    Type: Grant
    Filed: October 1, 2020
    Date of Patent: May 10, 2022
    Assignee: iHeartMedia Management Services, Inc.
    Inventor: Dyon Anniballi
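    A much-simplified sketch of the matching idea in patent 11328011: score each portion of a query fingerprint against every portion of a reference fingerprint (here by bit agreement) and keep the best-scoring reference position for each query portion. The run-building and position-threshold logic of the patent is omitted; all names, data, and scoring choices are illustrative.
    ```python
    def portion_score(a, b):
        """Similarity of two equal-length fingerprint portions: fraction of matching bits."""
        return sum(x == y for x, y in zip(a, b)) / len(a)

    def best_matches(query_portions, reference_portions):
        """For each query portion, return (best reference index, best score)."""
        results = []
        for q in query_portions:
            scores = [portion_score(q, r) for r in reference_portions]
            best_idx = max(range(len(scores)), key=scores.__getitem__)
            results.append((best_idx, scores[best_idx]))
        return results

    # Hypothetical 8-bit fingerprint portions.
    query = [[1, 0, 1, 1, 0, 0, 1, 0], [0, 1, 1, 0, 1, 1, 0, 0]]
    reference = [[1, 0, 1, 1, 0, 0, 1, 1], [0, 1, 1, 0, 1, 1, 0, 0], [1, 1, 1, 1, 0, 0, 0, 0]]
    print(best_matches(query, reference))  # -> [(0, 0.875), (1, 1.0)]
    ```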
  • Patent number: 11271666
    Abstract: A networked system is provided for transporting digital media packets, such as audio and video. The network includes network devices interconnected to send and receive packets. Each network device can receive and transmit media signals from media devices. A master clock generates a system time signal that the network devices use, together with a network time protocol, to generate a local clock signal synchronised to the system time signal for both rate and offset. The local clock signal governs both the rate and offset of the received or transmitted media signals. The system, which can be implemented using conventional network equipment, enables media signals to be transported to meet quality and timing requirements for high quality audio and video reproduction.
    Type: Grant
    Filed: September 24, 2019
    Date of Patent: March 8, 2022
    Assignee: AUDINATE HOLDINGS PTY LIMITED
    Inventors: Aidan Williams, Varuni Witana
  • Patent number: 11244418
    Abstract: A method of encoding a watermark into a digital image is provided. The method includes partitioning an image into a plurality of blocks of a same size; accumulating the plurality of blocks of the same size into a single block image; performing a Fourier transformation on the single block image to obtain a two-dimensional Fourier spectrum defined by Fourier coefficients at different positions of a Fourier domain; inserting a watermark into a frequency domain of the two-dimensional Fourier spectrum by modifying the two-dimensional Fourier spectrum as a function of watermarking coefficients in the watermark, to obtain a modified Fourier spectrum; performing an inverse Fourier transformation on the modified Fourier spectrum to obtain a watermarked image; copying the watermarked image horizontally and vertically into a plurality of copied watermarked images; and splicing the plurality of copied watermarked images into a reconstituted watermark image.
    Type: Grant
    Filed: March 15, 2019
    Date of Patent: February 8, 2022
    Assignee: BOE Technology Group Co., Ltd.
    Inventor: Xiaojun Tang
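    The encoding pipeline in patent 11244418 maps naturally onto array operations. Below is a rough NumPy sketch that follows the listed steps: accumulate same-size blocks into one block image, modify its Fourier spectrum, invert, and tile the result back to full size. The additive spectrum modification, block size, accumulation by averaging, and watermark values are illustrative assumptions, not the patent's actual modification function.
    ```python
    import numpy as np

    def embed_watermark(image, watermark, block=64, strength=5.0):
        """Accumulate same-size blocks, modify the block's Fourier spectrum with the
        watermark coefficients, invert, then tile the watermarked block to full size."""
        h, w = image.shape
        # Partition into block x block tiles and accumulate them into one block image.
        tiles = [image[r:r + block, c:c + block]
                 for r in range(0, h - block + 1, block)
                 for c in range(0, w - block + 1, block)]
        accumulated = np.mean(tiles, axis=0)

        # Modify the Fourier spectrum as a function of the watermarking coefficients.
        spectrum = np.fft.fft2(accumulated)
        spectrum += strength * watermark  # illustrative additive embedding

        # Inverse transform gives one watermarked block; copy it horizontally and vertically.
        wm_block = np.real(np.fft.ifft2(spectrum))
        return np.tile(wm_block, (h // block, w // block))

    rng = np.random.default_rng(0)
    image = rng.integers(0, 256, size=(256, 256)).astype(float)
    watermark = rng.standard_normal((64, 64))
    print(embed_watermark(image, watermark).shape)  # -> (256, 256)
    ```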
  • Patent number: 11232805
    Abstract: The disclosure relates to methods, apparatus and systems for side load processing of packetized media streams. In an embodiment, the apparatus comprises: a receiver for receiving a bitstream, and a splitter for identifying a packet type in the bitstream and splitting the bitstream, based on the identified value of the packet type, into a main stream and an auxiliary stream.
    Type: Grant
    Filed: February 22, 2019
    Date of Patent: January 25, 2022
    Assignee: Dolby International AB
    Inventors: Stephan Schreiner, Christof Fersch
  • Patent number: 11234037
    Abstract: A projector includes: a projection unit which projects content in response to a playback instruction to play back the content; and a transmitting unit which transmits the playback instruction to another projector. The playback instruction includes specification information which specifies the content.
    Type: Grant
    Filed: July 18, 2018
    Date of Patent: January 25, 2022
    Assignee: SEIKO EPSON CORPORATION
    Inventors: Kazuyoshi Kitabayashi, Takahiro Otsu
  • Patent number: 11190806
    Abstract: A display apparatus is disclosed. The display apparatus includes a display, a communication interface, a receiver, and a processor configured to decode an encoded video frame and an encoded audio frame, received through the receiver, transmit information on decoding time of the decoded video frame to an audio apparatus through the communication interface, delay the decoded audio frame by a first time, and transmit information on decoding time of the decoded audio frame, information on the first time, and an audio frame delayed by the first time to the audio apparatus through the communication interface, in response to the transmission, receive information on a second time delayed in the audio apparatus to output the audio frame from the audio apparatus through the communication interface, and synchronize an audio frame output from the audio apparatus with a video frame output through the display based on the information on the second time.
    Type: Grant
    Filed: August 5, 2020
    Date of Patent: November 30, 2021
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jaecheol Lee, Haejong Kim
  • Patent number: 11119727
    Abstract: Digital tutorial generation techniques and systems are described in which a digital tutorial is generated automatically and without user intervention. History data is generated describing a sequence of user inputs provided as part of user interaction with an application and audio data is received capturing user utterances, e.g., speech, from a microphone of the computing device. A step-identification module of the tutorial generation system identifies a plurality of tutorial steps based on a sequence of user inputs described by the history data. A segmentation module of the tutorial generation system then generates a plurality of audio segments from the audio data corresponding to respective ones of the plurality of tutorial steps. The digital tutorial is then generated by a synchronization module of the tutorial generation system by synchronizing the plurality of audio segments as part of the plurality of tutorial steps, which is then output.
    Type: Grant
    Filed: June 25, 2020
    Date of Patent: September 14, 2021
    Assignee: Adobe Inc.
    Inventors: Subham Gupta, Sudhir Tubegere Shankaranarayana, Jaideep Jeyakar, Ashutosh Dwivedi
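    The segmentation step of patent 11119727 can be thought of as cutting a recorded narration at the timestamps of the identified tutorial steps, so each step gets the audio spoken while it was performed. A minimal sketch over an illustrative history log; the step timestamps and function name are made up for the example.
    ```python
    def segment_audio_by_steps(step_times, audio_duration):
        """Given step start times (seconds) and total audio length, return
        (step_index, start, end) audio segments, one per tutorial step."""
        segments = []
        for i, start in enumerate(step_times):
            end = step_times[i + 1] if i + 1 < len(step_times) else audio_duration
            segments.append((i, start, end))
        return segments

    # Hypothetical tutorial-step start times identified from the input-history data.
    step_start_times = [0.0, 12.4, 31.0, 58.7]
    print(segment_audio_by_steps(step_start_times, audio_duration=75.0))
    # -> [(0, 0.0, 12.4), (1, 12.4, 31.0), (2, 31.0, 58.7), (3, 58.7, 75.0)]
    ```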
  • Patent number: 11108935
    Abstract: In one example embodiment, a camera system includes a plurality of cameras, a camera controller configured to control the plurality of cameras, a control signal line configured to facilitate an exchange of at least one control signal between the camera controller and the plurality of cameras, and a synchronization signal line commonly connected to the plurality of cameras and configured to transmit at least one transmission synchronization signal for synchronizing at least two cameras among the plurality of cameras.
    Type: Grant
    Filed: September 18, 2020
    Date of Patent: August 31, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Il Joong Kim, Jeong A Jo
  • Patent number: 11102523
    Abstract: Systems and methods are disclosed herein for selective audio segment compression for accelerated playback of media assets by service providers. A playback speed of the video segment of a media asset is calculated based on the duration of the video segment and a received playback time period. The system receives audio segments and corresponding priority weights. The audio segments with the lowest priority weight are removed from the group of various audio segments. The system then determines whether the duration of the remaining audio segments exceeds the received playback time period. If so, the system modifies the remaining audio segments by removing another audio segment with the lowest priority weight from the remaining audio segments. The system then rechecks whether the received playback time period is exceeded. If not, the system generates for playback the video segment based on the video playback speed and the remaining audio segments.
    Type: Grant
    Filed: March 19, 2019
    Date of Patent: August 24, 2021
    Assignee: Rovi Guides, Inc.
    Inventors: Neeraj Kumar, Vishwas Sharadanagar Panchaksharaiah, Vikram Makam Gupta
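    The selection loop in patent 11102523 amounts to dropping the lowest-priority audio segments until what remains fits in the requested playback window, then speeding up the video to match that window. A minimal sketch; the segment data, field ordering, and threshold values are illustrative.
    ```python
    def fit_audio_to_window(segments, playback_window):
        """segments: list of (duration_seconds, priority_weight).
        Repeatedly drop the lowest-priority segment until the total duration fits."""
        remaining = sorted(segments, key=lambda s: s[1], reverse=True)  # high priority first
        while remaining and sum(d for d, _ in remaining) > playback_window:
            remaining.pop()  # remove the segment with the lowest priority weight
        return remaining

    video_duration = 120.0
    playback_window = 60.0
    video_speed = video_duration / playback_window  # accelerated playback factor

    segments = [(20.0, 0.9), (25.0, 0.7), (30.0, 0.4), (15.0, 0.2)]
    kept = fit_audio_to_window(segments, playback_window)
    print(video_speed, kept)  # -> 2.0 [(20.0, 0.9), (25.0, 0.7)]
    ```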
  • Patent number: 10992451
    Abstract: An audio and video playback system includes an audio and video playback device having a local audio device, and a secondary audio device. A method for playing audio data includes: allocating a local audio buffer space and a secondary audio buffer space to the local audio device and the secondary audio device, respectively; processing obtained multimedia data to generate local audio data and secondary audio data; writing the local audio data and the secondary audio data to the local audio buffer space and the secondary audio buffer space, respectively; reading the local audio data and the secondary audio data buffered in the local audio buffer space and the secondary audio buffer space to the local audio device and the secondary audio device, to have the local audio device and the secondary audio device play the local audio data and the secondary audio data, respectively.
    Type: Grant
    Filed: November 5, 2018
    Date of Patent: April 27, 2021
    Assignee: MEDIATEK INC.
    Inventor: Fu Jun Zhu
  • Patent number: 10986154
    Abstract: A method or system configured for receiving a first single data stream representing a first multimedia file, the first single data stream including an interleaved sequence of data elements of a plurality of media, and/or transmitting a second single data stream representing a second multimedia file, the second single data stream including an interleaved sequence of data elements of said plurality of media, where the second multimedia file differs from said first multimedia file by at least one data element of a selected medium extracted from said first multimedia file, and/or by at least one data element of a selected medium added to the first multimedia file, and/or by at least one data element of a selected medium added to the first multimedia file being a converted version of the at least one data element of a selected medium extracted from the first multimedia file.
    Type: Grant
    Filed: May 12, 2017
    Date of Patent: April 20, 2021
    Assignee: GLIDE TALK LTD.
    Inventors: Liron Hertz, Roi Ginat
  • Patent number: 10971121
    Abstract: A system for platform-independent visualization of audio content, in particular audio tracks, utilizing a central computer system in communication with user devices via a computer network. The central system utilizes various algorithms to identify spoken content from audio tracks and selects visual assets associated with the identified content. Thereafter, a visualized audio track is available for users to listen to and view. Audio tracks, for example Podcasts, may be segmented into topical audio segments based upon themes or topics, with segments from disparate podcasts combined into a single listening experience, based upon certain criteria, e.g., topics, themes, keywords, and the like.
    Type: Grant
    Filed: July 9, 2019
    Date of Patent: April 6, 2021
    Assignee: Tree Goat Media, Inc.
    Inventors: Michael Kakoyiannis, Sherry Mills, Christoforos Lambrou, Vladimir Canic, Srdjan Jovanovic
  • Patent number: 10904692
    Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent.
    Type: Grant
    Filed: November 11, 2019
    Date of Patent: January 26, 2021
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Charles Q. Robinson, Nicolas R. Tsingos, Christophe Chabanne
  • Patent number: 10880587
    Abstract: Virtual Reality (VR) processing devices and methods are provided for transmitting user feedback information comprising at least one of user position information and user orientation information, receiving encoded audio-video (A/V) data, which is generated based on the transmitted user feedback information, separating the A/V data into video data and audio data corresponding to a portion of a next frame of a sequence of frames of the video data to be displayed, decoding the portion of a next frame of the video data and the corresponding audio data, providing the audio data for aural presentation and controlling the portion of the next frame of the video data to be displayed in synchronization with the corresponding audio data.
    Type: Grant
    Filed: September 5, 2019
    Date of Patent: December 29, 2020
    Assignees: ATI TECHNOLOGIES ULC, ADVANCED MICRO DEVICES, INC.
    Inventors: Lei Zhang, Gabor Sines, Khaled Mammou, David Glen, Layla A. Mah, Rajabali M. Koduri, Bruce Montag
  • Patent number: 10880659
    Abstract: There is provided a system (100) comprising an audio streaming device (102) having an audio streaming device receiver (104) arranged for receiving a first audio signal (106) comprising a first audio content and a second audio signal (108) comprising a second audio content, the system furthermore comprising a memory device (110) arranged for storing a user defined setting (112), a processor (114) arranged for providing an output audio signal (116), said output audio signal comprising a combination of the first audio content, and the second audio content, wherein the output audio signal comprises a ratio of a level of the first audio content and a level of the second audio content, and the ratio is determined based on the user defined setting (112), and wherein the system is further comprising a system transmitter (118) arranged for wirelessly transmitting the output audio signal (116).
    Type: Grant
    Filed: April 10, 2020
    Date of Patent: December 29, 2020
    Assignee: OTICON A/S
    Inventors: Michael Syskind Pedersen, Povl Koch, David Thorn Blix, Matias Tofteby Bach
  • Patent number: 10848801
    Abstract: A reception side can easily recognize that metadata is inserted into an audio stream. A container of a predetermined format including an audio stream into which metadata is inserted is transmitted. Identification information indicating that the metadata is inserted into the audio stream is inserted into a layer of the container. At the reception side, it is possible to easily recognize that the metadata is inserted into the audio stream and acquire the metadata reliably without waste by performing the process of extracting the metadata inserted into the audio stream based on the recognition.
    Type: Grant
    Filed: April 23, 2019
    Date of Patent: November 24, 2020
    Assignee: Saturn Licensing LLC
    Inventor: Ikuo Tsukagoshi
  • Patent number: 10841359
    Abstract: A media application is disclosed. The media application provides a playback of a media item that includes a video portion and an audio portion. The media application stops the playback of the video portion of the media item while continuing to provide the audio portion of the media item. The media application resumes the playback of the video portion of the media item in synchronization with the audio portion being provided.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: November 17, 2020
    Assignee: GOOGLE LLC
    Inventors: Oliver John Woodman, Matt Doucleff
  • Patent number: 10805663
    Abstract: Systems, methods, and apparatuses are described for detecting synchronization errors between audio and video signals. Scene changes may be detected based on anchor frames. Offsets between a scene change in a video signal and a reduced audio level or burst of high audio level in the audio signal may indicate a synchronization error.
    Type: Grant
    Filed: July 13, 2018
    Date of Patent: October 13, 2020
    Assignee: Comcast Cable Communications, LLC
    Inventor: Michael Rekstad
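    The detection idea in patent 10805663 can be sketched as aligning two event lists: timestamps of detected scene changes in the video and timestamps of audio-level dips or bursts, then reporting the offset to the nearest audio event for each scene change. Everything below (thresholds, data, names) is illustrative.
    ```python
    def sync_offsets(scene_change_times, audio_event_times):
        """For each scene change, return the signed offset (seconds) to the nearest
        audio-level dip or burst; a consistently large offset suggests a sync error."""
        offsets = []
        for t in scene_change_times:
            nearest = min(audio_event_times, key=lambda a: abs(a - t))
            offsets.append(round(nearest - t, 3))
        return offsets

    # Hypothetical event timestamps (seconds) extracted from one clip.
    scene_changes = [10.0, 42.5, 77.0]
    audio_events = [10.8, 43.3, 77.9]
    offsets = sync_offsets(scene_changes, audio_events)
    print(offsets)                              # -> [0.8, 0.8, 0.9]
    print(max(abs(o) for o in offsets) > 0.5)   # illustrative 0.5 s flagging rule -> True
    ```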
  • Patent number: 10805665
    Abstract: A device configured to determine a time on a progress bar and to identify a timestamp in the video timing map based on the time on the progress bar. The device is further configured to identify a source scene identifier corresponding with the identified timestamp and to play a video scene corresponding with the identified source scene identifier. The device is further configured to identify a first animation identifier corresponding with the identified timestamp and to play a first animation associated with the first animation identifier. The device is further configured to determine that the first animation identifier is present in the audio sample buffer, to identify an audio sample associated with the first animation identifier, and to play the identified audio sample.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: October 13, 2020
    Assignee: Bank of America Corporation
    Inventor: Shankar Sangoli
  • Patent number: 10805658
    Abstract: Provided herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for synchronizing playback of audio and video associated with a content, such as a movie or TV show. Also provided herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for coordinating devices in a whole home entertainment system that includes a wireless network, to improve collective utilization of the wireless network and thereby enhance user experience.
    Type: Grant
    Filed: September 12, 2018
    Date of Patent: October 13, 2020
    Assignee: ROKU, INC.
    Inventors: Ilya Asnis, Anthony John Wood
  • Patent number: 10764620
    Abstract: Systems and methods of the present disclosure provide for dynamic delay equalization of related media signals in a media transport system. Methods include receiving a plurality of related media signals, transporting the related media signals along different media paths, calculating uncorrected propagation delays for the media paths, and delaying each of the related media signals by an amount related to the difference between the longest propagation delay (of the uncorrected propagation delays) and the uncorrected propagation delay of the related media signal/media path. Calculating the uncorrected propagation delays and delaying the related media signals may be performed in response to a change to the propagation delay of at least one of the related media signals/media paths. Additionally or alternatively, calculating the uncorrected propagation delays and delaying the related media signals may be performed while transporting the related media signals.
    Type: Grant
    Filed: August 18, 2019
    Date of Patent: September 1, 2020
    Assignee: Biamp Systems, LLC
    Inventors: Eugene Gurfinkel, Michael K. Davis, Charles H. Van Dusen
  • Patent number: 10748390
    Abstract: A method or system that receives input media including at least video data in which a video event within the video data is detected. Related data that is associated with the detected video event is collected and one or more feature parameters are configured based on the collected related data. The type of video event is determined and a set of feature parameters is selected based on the type of video event. A haptic effect is then automatically generated based on the selected set of feature parameters.
    Type: Grant
    Filed: October 12, 2018
    Date of Patent: August 18, 2020
    Assignee: Immersion Corporation
    Inventor: Liwen Wu
  • Patent number: 10694243
    Abstract: Methods and apparatus to identify media based on watermarks across different audio streams and/or different watermarking techniques are disclosed. An example apparatus includes a watermark detector to detect a first watermark embedded in a first audio stream associated with media and to detect a second watermark embedded in a second audio stream associated with the media. The second audio stream is different than the first audio stream. The example apparatus includes a watermark analyzer to compare first media identifying information in the first watermark with second media identifying information in the second watermark. The example apparatus also includes a media detection event controller to associate the first and second watermarks with a media detection event when the first media identifying information is consistent with the second media identifying information. The example apparatus further includes a transmitter to transmit the media detection event to a data collection facility.
    Type: Grant
    Filed: May 31, 2018
    Date of Patent: June 23, 2020
    Assignee: The Nielsen Company (US), LLC
    Inventors: Justin Fahnestock, Ronan Heffernan, Muhammad Amir, Wes Kercher, Scott Barraclough, John Kistenmacher
  • Patent number: 10650862
    Abstract: A system that incorporates teachings of the subject disclosure may include, for example, detecting a first action at a first time during a first presentation of video content of a multimedia stream. The first action is coincident with a visual aspect of an event observable in the video content. A second action is detected at a second time during a second presentation of audio content of an audio stream, wherein the second action is coincident with an audible aspect of the event observable in the second presentation of the audio content. A time difference is determined between the first time and the second time, wherein the first presentation of the video content and the second presentation of the audio content are synchronized based on the time difference. Other embodiments are disclosed.
    Type: Grant
    Filed: December 9, 2015
    Date of Patent: May 12, 2020
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Wayne R. Heinmiller, Carol S. Gruchala, Dianna Tiliks
  • Patent number: 10623789
    Abstract: Methods, systems, and computer programs for measuring quality of multimedia delivery to a client are presented. A method includes operations for embedding video markers in a video stream of a multimedia stream, and embedding audio markers in an audio stream of the multimedia stream. The video stream and the audio stream are then transmitted separately to the client. Further, video markers received at the client are extracted from the transmitted video stream, and audio markers received at the client are extracted from the transmitted audio stream. A measure of the audio-video synchronization quality is obtained by determining a quantifiable time difference between the video stream and the audio stream received at the client, where the quantifiable time difference is calculated based on the extracted video markers and the extracted audio markers.
    Type: Grant
    Filed: May 26, 2017
    Date of Patent: April 14, 2020
    Assignee: VMware, Inc.
    Inventors: Lawrence Andrew Spracklen, Banit Agrawal, Rishi Bidarkar
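    The measurement in patent 10623789 reduces to pairing embedded markers by identifier and differencing their arrival times on the client. A minimal sketch assuming each extracted marker is an (id, client receive time) pair; the names and data are illustrative.
    ```python
    def av_sync_skew(video_markers, audio_markers):
        """Pair video/audio markers by id and return per-marker time differences
        (video arrival minus audio arrival, seconds) observed at the client."""
        audio_by_id = dict(audio_markers)
        return {
            marker_id: round(v_time - audio_by_id[marker_id], 3)
            for marker_id, v_time in video_markers
            if marker_id in audio_by_id
        }

    # Hypothetical markers extracted from the received streams: (id, receive time).
    video = [(1, 100.02), (2, 101.05), (3, 102.04)]
    audio = [(1, 100.00), (2, 101.00), (3, 102.00)]
    print(av_sync_skew(video, audio))  # -> {1: 0.02, 2: 0.05, 3: 0.04}
    ```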
  • Patent number: 10607479
    Abstract: A remote control framework enables a plurality of target devices to be controlled by a plurality of remote control devices irrespective of bearer types. In a preferred embodiment any target device may also act as a control device and any control device may also act as a target device. The framework also enables any application running on any target device to be controlled by any controller device.
    Type: Grant
    Filed: February 28, 2018
    Date of Patent: March 31, 2020
    Assignee: Conversant Wireless Licensing S.a.r.l.
    Inventors: Sian James, Neal Harris, John Turner, Tim Howes
  • Patent number: 10535371
    Abstract: Techniques are provided for video summarization, based on speaker segmentation and clustering, to identify persons and scenes of interest. A methodology implementing the techniques according to an embodiment includes extracting audio content from a video stream and detecting one or more segments of the audio content that include the voice of a single speaker. The method also includes grouping the one or more detected segments into an audio cluster associated with the single speaker and providing a portion of the audio cluster to a user. The method further includes receiving an indication from the user that the single speaker is a person of interest. Segments of interest are then extracted from the video stream, where each segment of interest is associated with a scene that includes the person of interest. The extracted segments of interest are then combined into a summarization video.
    Type: Grant
    Filed: September 13, 2016
    Date of Patent: January 14, 2020
    Assignee: INTEL CORPORATION
    Inventors: Gokcen Cilingir, Narayan Biswal
  • Patent number: 10499178
    Abstract: There is provided a non-transitory memory storing an executable code, a hardware processor executing the executable code to receive a visualization of a three-dimensional (3D) position for each audio object of a plurality of audio objects in a first mix of an object-based audio of a media content, the visualization corresponding to a timeline of the media content, receive a second mix of the object-based audio of the media content, and play the second mix of the object-based audio of the media content using an audio playback system while displaying the visualization of the 3D position for each of the plurality of audio objects of the first mix of the object-based audio on a display.
    Type: Grant
    Filed: October 14, 2016
    Date of Patent: December 3, 2019
    Assignee: Disney Enterprises, Inc.
    Inventor: Mark Arana
  • Patent number: 10477339
    Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent.
    Type: Grant
    Filed: June 17, 2019
    Date of Patent: November 12, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Charles Q. Robinson, Nicolas R. Tsingos, Christophe Chabanne
  • Patent number: 10469750
    Abstract: A method is provided for embedding motion data of an object collected by an inertial measurement unit that is attached to the object into a video file that includes video frames of the object in motion captured by a video recording device. The video file has a predefined video file format that is configured to include metadata that is storable at predefined time intervals of the video file. The method operates by capturing video frames of an object in motion and simultaneously collecting motion data of the object, storing the captured video frames in the video file, and storing the collected motion data, converting the motion data to the metadata, and inserting the metadata into one or more time intervals of the video file, wherein the metadata in each time interval includes the metadata for a plurality of successive or preceding video frames.
    Type: Grant
    Filed: May 22, 2018
    Date of Patent: November 5, 2019
    Assignee: BioForce Analytics LLC
    Inventors: Eric L. Canfield, Scott J. Soma, Brandon T. Fanti, Vineeth Voruganti, Daniel J. Gao, Aron Sun, Ryan M. LaRue, Saahas S. Yechuri
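    The embedding scheme in patent 10469750 can be sketched as bucketing IMU samples into the video file's per-interval metadata slots. The record format below is purely illustrative; real containers (for example MP4 timed-metadata tracks) have their own structures.
    ```python
    def bucket_motion_samples(samples, interval_s=1.0):
        """samples: list of (timestamp_seconds, reading) from the inertial measurement unit.
        Group readings into metadata records keyed by the video time interval they fall in."""
        metadata = {}
        for t, reading in samples:
            slot = int(t // interval_s)  # index of the video time interval
            metadata.setdefault(slot, []).append(reading)
        return metadata

    # Hypothetical accelerometer magnitudes sampled during recording.
    imu_samples = [(0.1, 9.8), (0.4, 10.2), (0.9, 9.6), (1.2, 11.0), (2.6, 9.9)]
    print(bucket_motion_samples(imu_samples))
    # -> {0: [9.8, 10.2, 9.6], 1: [11.0], 2: [9.9]}
    ```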
  • Patent number: 10461872
    Abstract: A networked system is provided for transporting digital media packets, such as audio and video. The network includes network devices interconnected to send and receive packets. Each network device can receive and transmit media signals from media devices. A master clock generates a system time signal that the network devices use, together with a network time protocol, to generate a local clock signal synchronized to the system time signal for both rate and offset. The local clock signal governs both the rate and offset of the received or transmitted media signals. The system, which can be implemented using conventional network equipment, enables media signals to be transported to meet quality and timing requirements for high quality audio and video reproduction.
    Type: Grant
    Filed: August 31, 2018
    Date of Patent: October 29, 2019
    Assignee: Audinate Pty Limited
    Inventors: Aidan Williams, Varuni Witana
  • Patent number: 10455050
    Abstract: In one approach, a server computer receives a playlist from a first client computer, wherein the playlist identifies a plurality of media assets and includes synchronization information that specifies how to present the plurality of media assets as a synchronized media presentation. The server computer receives a request from the first client computer to share the playlist with a second client computer. The server computer causes the plurality of media assets to be deposited in a client storage accessible to the second client computer. The server computer sends the playlist to the second client computer. The second client computer presents the synchronized media presentation based on the plurality of media assets deposited in the client storage and the synchronization information of the playlist.
    Type: Grant
    Filed: April 24, 2017
    Date of Patent: October 22, 2019
    Assignee: QWIRE INC.
    Inventors: Leigh B. Roberts, Jr., Jonathan Louis Ehrlich, Scott Freiman
  • Patent number: 10425678
    Abstract: A system for providing group performance using a set of client devices. The time frame of reference of individual client devices of the set of client devices can be synchronized to the host time frame of reference. The individual client devices may effectuate the presentation of the content of the group performance at predetermined points in time with respect to the host time frame of reference. Audio and/or visual content of the content provided by the individual client devices is based on the real-world location of the individual client devices. The content provided can be monitored to check if the content is presented at the predetermined points in time. The content provided is adjusted such that the content is provided at the predetermined points in time. The presentation of the content in a synchronized manner results in the group performance.
    Type: Grant
    Filed: August 8, 2018
    Date of Patent: September 24, 2019
    Assignee: Disney Enterprises, Inc.
    Inventors: Taylor Hellam, Mohammad Poswal, Malcolm E. Murdock
  • Patent number: 10390061
    Abstract: Systems and methods of the present disclosure provide for dynamic delay equalization of related media signals in a media transport system. Methods include receiving a plurality of related media signals, transporting the related media signals along different media paths, calculating uncorrected propagation delays for the media paths, and delaying each of the related media signals by an amount related to the difference between the longest propagation delay (of the uncorrected propagation delays) and the uncorrected propagation delay of the related media signal/media path. Calculating the uncorrected propagation delays and delaying the related media signals may be performed in response to a change to the propagation delay of at least one of the related media signals/media paths. Additionally or alternatively, calculating the uncorrected propagation delays and delaying the related media signals may be performed while transporting the related media signals.
    Type: Grant
    Filed: October 30, 2018
    Date of Patent: August 20, 2019
    Assignee: BIAMP SYSTEMS, LLC
    Inventors: Eugene Gurfinkel, Michael K. Davis, Charles H. Van Dusen
  • Patent number: 10390109
    Abstract: An example method comprises receiving, at a first digital device, video data, scanning video content of the video data for visual transitions within the video content between consecutive frames of the video data, each transition indicating a significant visual change relative to other frames of the video data, timestamping each visual transition and creating a first set of temporal video fingerprints, identifying items of metadata to be associated with the video data, identifying a location within the video data using the temporal video fingerprints for the identified items of metadata, generating a metadata index identifying each item of metadata and a location for each item of metadata relative to the video data using at least one of the temporal video fingerprints, and transmitting, at the first digital device, the video data, the first set of temporal video fingerprints, and the metadata index to a different digital device.
    Type: Grant
    Filed: February 20, 2018
    Date of Patent: August 20, 2019
    Assignee: Crystal Computer Corporation
    Inventors: Alan David Young, Dimitri Felixovich Tarassenko, Roger James Franklin
  • Patent number: 10313722
    Abstract: Techniques are described for synchronizing audio content and video content when server-side fragment insertion techniques are used.
    Type: Grant
    Filed: September 27, 2017
    Date of Patent: June 4, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: Ron Searl, Yongjun Wu
  • Patent number: 10287032
    Abstract: The present application relates to a gimbal configured to mount a photographing device. The gimbal includes a main body, a mounting shaft connected to the main body and a base disposed on the mounting shaft. The base is configured to fasten the photographing device. The gimbal further includes a fixing plate disposed on the base and a circuit board mounted between the fixing plate and the base. The circuit board is provided with a USB input interface and a USB output interface. The USB input interface is configured to form a data transmission connection to the photographing device. The USB output interface is configured to be connected to an external storage device. The present application further relates to an image photographing apparatus including the foregoing gimbal and an unmanned aerial vehicle including the foregoing image photographing apparatus.
    Type: Grant
    Filed: May 4, 2018
    Date of Patent: May 14, 2019
    Assignee: AUTEL ROBOTICS CO., LTD.
    Inventors: Zhengli Zhang, Fazhan Chen
  • Patent number: 10290322
    Abstract: An audio and video synchronizing perceptual model is described that is based on how a person perceives audio and/or video (e.g., how the brain processes sound and/or visual content). The relative emotional impact associated with different audio portions may be employed to determine transition points to facilitate automatic synchronization of audio data to video data to create a production that achieves a particular overall emotional effect on the listener/viewer. Various processing techniques of the perceptual model may utilize perceptual characteristics within the audio portions to determine a transition point for automatic synchronization with video data.
    Type: Grant
    Filed: May 11, 2018
    Date of Patent: May 14, 2019
    Assignee: Adobe Inc.
    Inventor: Peter Merrill
  • Patent number: 10178345
    Abstract: Media device systems and methods synchronize video content with audio content presented by a plurality of wireless audio headsets. In an exemplary embodiment, a first time delay corresponds to a first duration of time between communication of the audio content from the media device and presentation of the audio content by a first wireless audio headset. A second time delay corresponds to a second duration of time between communication of the audio content from the media device and presentation of the audio content by a second wireless audio headset, wherein the first time delay is greater than the second time delay. Video content communicated to a display is delayed by the first time delay. Audio content communicated to the second wireless audio headset is delayed by a time delay difference between the first time delay and the second time delay.
    Type: Grant
    Filed: June 7, 2018
    Date of Patent: January 8, 2019
    Assignee: EchoStar Technologies L.L.C.
    Inventors: Gregory Greene, David Innes
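    The delay rule in patent 10178345 is a small piece of arithmetic: delay the display by the slowest headset's latency, and delay audio sent to each faster headset by the difference. A minimal sketch with made-up latencies:
    ```python
    def compute_delays(headset_latencies_ms):
        """Return (video_delay, per-headset audio delays) so the picture and all
        headsets line up with the slowest wireless audio path."""
        video_delay = max(headset_latencies_ms)
        audio_delays = [video_delay - latency for latency in headset_latencies_ms]
        return video_delay, audio_delays

    # Hypothetical latencies: headset 1 is slower than headset 2.
    latencies = [180.0, 40.0]
    print(compute_delays(latencies))  # -> (180.0, [0.0, 140.0])
    ```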
  • Patent number: 10171710
    Abstract: In order to eliminate timing offset between reproduction devices when a content transmitted from a distribution device is received and reproduced by a plurality of reproduction devices, data (SCR) indicating the elapsed time from the start of the content, generated by counting clock pulses, and data (FCR) indicating a frame number generated by counting the number of frames reproduced by a decoder (54) are transmitted by the distribution device, and a clock generation unit (103) in each reproduction device is controlled so that data (STC) indicating the elapsed time and data (FTC) indicating the frame number, which are generated in the same manner by each reproduction device, match the transmitted data (SCR, FCR). Synchronization between reproduction devices can thereby be established even when, in a state in which a content is being reproduced by one reproduction device, another reproduction device subsequently connects to the distribution device.
    Type: Grant
    Filed: March 6, 2013
    Date of Patent: January 1, 2019
    Assignee: MITSUBISHI ELECTRIC CORPORATION
    Inventor: Eiji Matsuo
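    The control loop in patent 10171710 can be sketched as a simple rate correction: each reproduction device compares its locally generated elapsed-time count (STC) against the distributed count (SCR) and nudges its clock rate to drive the error toward zero. The proportional gain, tick structure, and reference data below are illustrative assumptions, not details taken from the patent.
    ```python
    def run_clock_discipline(scr_samples, initial_rate=1.0, gain=0.1):
        """Toy proportional controller: adjust the local clock rate so the local
        elapsed-time counter (STC) tracks the received reference counter (SCR)."""
        rate, stc = initial_rate, 0.0
        for step, scr in enumerate(scr_samples):
            stc += rate            # local counter advances by the current rate each tick
            error = scr - stc      # positive error: local clock is behind the reference
            rate += gain * error   # illustrative proportional correction
            print(f"tick {step}: scr={scr:.2f} stc={stc:.2f} rate={rate:.3f}")

    # Hypothetical reference counter running 2% faster than the local nominal rate.
    run_clock_discipline([1.02 * (i + 1) for i in range(5)])
    ```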
  • Patent number: 10158906
    Abstract: System and method for flexible video construction, particularly of a personalized video clip which provides instructions to a viewer with regard to health and wellness. An ordered list of video input files is chained together, to create a single output video file using a chosen container. Timestamp values are tracked, to ensure synchronization of multiple joined clips, optionally using adjustments of the audio channel or the video channel. A video construction server utilizes information from multiple sources, to construct the video clip.
    Type: Grant
    Filed: February 12, 2018
    Date of Patent: December 18, 2018
    Assignee: TELESOFIA MEDICAL LTD.
    Inventors: Rami Cohen, Tzvi Rotshtein, Danny Ben Shitrit
  • Patent number: 10142585
    Abstract: System and methods for synchronizing supplemental media content to media content being provided by a different content source. The systems and methods may identify media content from user input, and determine a plurality of associated supplementary media. By monitoring and interpreting a data stream containing information about the primary media content, the systems and methods may interpret indicia of content ceasing and resuming and similarly cease and resume playback of the supplemental media in response, preserving the synchronicity of the multiple content sources.
    Type: Grant
    Filed: April 28, 2015
    Date of Patent: November 27, 2018
    Assignee: Rovi Guides, Inc.
    Inventor: David D. Chung
  • Patent number: 10142679
    Abstract: A content processing apparatus is provided. The content processing apparatus includes output circuitry configured to output a content, communication circuitry configured to communicate with a server, and a processor configured to extract, from the content, first characteristic information and second characteristic information of a different type from the first characteristic information, to control the communication circuitry to transmit the extracted first characteristic information to the server, and in response to receiving a plurality of matching information corresponding to the transmitted first characteristic information, to control the output circuitry to output matching information corresponding to the second characteristic information extracted from the content.
    Type: Grant
    Filed: December 7, 2016
    Date of Patent: November 27, 2018
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Kwang-hyun Koh