Abstract: A system for audio-visual multi-speaker speech separation. The system includes a processing circuitry and a memory containing instructions that, when executed by the processing circuitry, configure the system to: receive audio signals captured by at least one microphone; receive video signals captured by at least one camera; and apply audio-visual separation on the received audio signals and video signals to provide isolation of sounds from individual sources, wherein the audio-visual separation is based, in part, on angle positions of at least one speaker relative to the at least one camera. The system provides for reliable speech processing and separation in noisy environments and environments with multiple users.
Type:
Grant
Filed:
April 6, 2020
Date of Patent:
October 17, 2023
Assignee:
HI AUTO LTD.
Inventors:
Yaniv Shaked, Yoav Ramon, Eyal Shapira, Roy Baharav
Abstract: A system and method for audio-visual multi-speaker speech separation, including: receiving audio signals captured by at least one microphone; receiving video signals captured by at least one camera; and applying audio-visual separation on the received audio signals and video signals to provide isolation of sounds from individual sources, wherein the audio-visual separation is based, in part, on angle positions of at least one speaker relative to the at least one camera.
Type:
Application
Filed:
April 6, 2020
Publication date:
October 7, 2021
Applicant:
Hi Auto LTD.
Inventors:
Yaniv SHAKED, Yoav RAMON, Eyal SHAPIRA, Roy BAHARAV
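Illustrative note: both abstracts say the separation is based, in part, on the angle position of a speaker relative to the camera, but neither says how that angle is obtained. The sketch below shows one plausible way to estimate a horizontal angle from a detected face position, assuming an ideal pinhole camera with a known horizontal field of view. The function name, its parameters, and the pinhole assumption are hypothetical and are not taken from the patent.

```python
import math


def speaker_angle_deg(face_center_x: float, image_width: int, horizontal_fov_deg: float) -> float:
    """Estimate the horizontal angle of a speaker relative to the camera's optical axis.

    Assumption (not from the patent): an ideal pinhole camera with no lens
    distortion, whose optical axis passes through the image center.
    Negative angles are left of center, positive angles are right of center.
    """
    if image_width <= 0:
        raise ValueError("image_width must be positive")
    if not 0.0 < horizontal_fov_deg < 180.0:
        raise ValueError("horizontal_fov_deg must be in (0, 180)")

    half_width = image_width / 2.0
    # Focal length in pixels, derived from the horizontal field of view.
    focal_px = half_width / math.tan(math.radians(horizontal_fov_deg) / 2.0)
    # Pixel offset of the face center from the optical axis.
    offset_px = face_center_x - half_width
    return math.degrees(math.atan(offset_px / focal_px))


if __name__ == "__main__":
    # Hypothetical example: a 640-pixel-wide frame from a camera with a 90-degree FOV.
    for x in (0.0, 320.0, 640.0):
        print(x, speaker_angle_deg(x, 640, 90.0))
```

An angle estimated this way could then be passed to a separation model as an extra per-speaker input. How the patented system actually uses the angle is not described in the abstracts.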