Abstract: Hearing device, accessory device, and a method of operating a hearing system comprising a hearing device and an accessory device is disclosed, the method comprising obtaining, in the accessory device, an audio input signal representative of audio from one or more audio sources; obtaining image data with a camera of the accessory device; identifying one or more audio sources including a first audio source based on the image data; determining a first model comprising first model coefficients, wherein the first model is based on image data of the first audio source and the audio input signal; and transmitting a hearing device signal to the hearing device, wherein the hearing device signal is based on the first model.
Abstract: Apparatus for isolation of a media stream of a first modality from a complex media source having at least two media modality, and multiple objects, and events, comprises: recording devices for the different modalities; an associator for associating between events recorded in said first modality and events recorded in said second modality, and providing an association output; and an isolator that uses the association output for isolating those events in the first mode correlating with events in the second mode associated with a predetermined object, thereby to isolate a isolated media stream associated with said predetermined object. Thus it is possible to identify events such as hand or mouth movements, and associate these with sounds, and then produce a filtered track of only those sounds associated with the events. In this way a particular speaker or musical instrument can be isolated from a complex scene.
Type:
Grant
Filed:
April 6, 2008
Date of Patent:
February 25, 2014
Assignee:
Technion Research & Development Foundation Limited
Abstract: The subject matter of this specification can be embodied in, among other things, a computer-implemented method that includes receiving a plurality of images having human faces. The method further includes generating a data structure having representations of the faces and associations that link the representations based on similarities in appearance between the faces. The method further includes outputting a first gender value for a first representation of a first face that indicates a gender of the first face based on one or more other gender values of one or more other representations of one or more other faces that are linked to the first representation.
Abstract: The subject matter of this specification can be embodied in, among other things, a computer-implemented method that includes receiving a plurality of images having human faces. The method further includes generating a data structure having representations of the faces and associations that link the representations based on similarities in appearance between the faces. The method further includes outputting a first gender value for a first representation of a first face that indicates a gender of the first face based on one or more other gender values of one or more other representations of one or more other faces that are linked to the first representation.
Abstract: An information processing apparatus that includes an image acquisition unit to acquire a temporal sequence of frames of image data, a detecting unit to detect a lip area and a lip image from each of the frames of the image data, a recognition unit to recognize a word based on the detected lip images of the lip areas, and a controller to control an operation at the information processing apparatus based on the word recognized by the recognition unit.
Type:
Application
Filed:
June 15, 2010
Publication date:
December 30, 2010
Applicant:
Sony Corporation
Inventors:
Kazumi AOYAMA, Kohtaro SABE, Masato ITO
Abstract: Apparatus for isolation of a media stream of a first modality from a complex media source having at least two media modality, and multiple objects, and events, comprises: recording devices for the different modalities; an associator for associating between events recorded in said first modality and events recorded in said second modality, and providing an association output; and an isolator that uses the association output for isolating those events in the first mode correlating with events in the second mode associated with a predetermined object, thereby to isolate a isolated media stream associated with said predetermined object. Thus it is possible to identify events such as hand or mouth movements, and associate these with sounds, and then produce a filtered track of only those sounds associated with the events. In this way a particular speaker or musical instrument can be isolated from a complex scene.
Type:
Application
Filed:
April 6, 2008
Publication date:
November 25, 2010
Applicant:
Technion Research & Development Foundation Ltd.