Patents by Inventor Hans Peter Graf

Hans Peter Graf has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20110119467
    Abstract: Systems and methods for massively parallel processing on an accelerator that includes a plurality of processing cores. Each processing core includes multiple processing chains configured to perform parallel computations, each of which includes a plurality of interconnected processing elements. The cores further include multiple smart memory blocks configured to store and process data, each memory block accepting the output of one of the plurality of processing chains. The cores communicate with at least one off-chip memory bank.
    Type: Application
    Filed: July 26, 2010
    Publication date: May 19, 2011
    Applicant: NEC Laboratories America, Inc.
    Inventors: Srihari Cadambi, Abhinandan Majumdar, Michela Becchi, Srimat Chakradhar, Hans Peter Graf
  • Patent number: 7933772
    Abstract: A system and method for generating a video sequence having mouth movements synchronized with speech sounds are disclosed. The system utilizes a database of n-phones as the smallest selectable unit, wherein n is larger than 1 and preferably 3. The system calculates a target cost for each candidate n-phone for a target frame using a phonetic distance, coarticulation parameter, and speech rate. For each n-phone in a target sequence, the system searches for candidate n-phones that are visually similar according to the target cost. The system samples each candidate n-phone to get a same number of frames as in the target sequence and builds a video frame lattice of candidate video frames. The system assigns a joint cost to each pair of adjacent frames and searches the video frame lattice to construct the video sequence by finding the optimal path through the lattice according to the minimum of the sum of the target cost and the joint cost over the sequence.
    Type: Grant
    Filed: March 19, 2008
    Date of Patent: April 26, 2011
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Eric Cosatto, Hans Peter Graf, Fu Jie Huang
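The lattice search this abstract describes (a per-frame target cost plus a joint cost between adjacent candidate frames, minimized over the whole sequence) is a classic dynamic-programming problem. The sketch below is illustrative only: the function and parameter names (`best_path`, `target_costs`, `joint_cost`) are assumptions, not taken from the patent, and real target costs would come from phonetic distance, coarticulation, and speech rate as the abstract states.

```python
# Hedged sketch of a minimum-cost path search through a video frame lattice:
# target_costs[t][i] is the target cost of candidate i at frame t, and
# joint_cost(t, i, j) is the transition cost from candidate i at frame t
# to candidate j at frame t + 1. A Viterbi-style search minimizes the sum.

def best_path(target_costs, joint_cost):
    """Return (total_cost, path) minimizing summed target + joint cost."""
    cost = list(target_costs[0])   # best cumulative cost per candidate so far
    back = []                      # back-pointers for path reconstruction
    for t in range(1, len(target_costs)):
        new_cost, ptr = [], []
        for j, tc in enumerate(target_costs[t]):
            # pick the predecessor minimizing cumulative + transition cost
            i = min(range(len(cost)),
                    key=lambda i: cost[i] + joint_cost(t - 1, i, j))
            new_cost.append(cost[i] + joint_cost(t - 1, i, j) + tc)
            ptr.append(i)
        back.append(ptr)
        cost = new_cost
    # trace the optimal path backwards from the cheapest final candidate
    j = min(range(len(cost)), key=cost.__getitem__)
    path = [j]
    for ptr in reversed(back):
        j = ptr[j]
        path.append(j)
    path.reverse()
    return min(cost), path
```

In practice the joint cost would penalize visually discontinuous adjacent frames, so the search trades per-frame fidelity against smoothness of the assembled video.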
  • Patent number: 7921013
    Abstract: A system and method of providing sender-customization of multi-media messages through the use of emoticons is disclosed. The sender inserts the emoticons into a text message. As an animated face audibly delivers the text, emoticons associated with the message are started a predetermined period of time or number of words before the position of the emoticon in the message text and completed a predetermined length of time or number of words after that position. The sender may insert emoticons through emoticon buttons, which are icons available for choosing. Upon the sender's selection of an emoticon, an icon representing the emoticon is inserted into the text at the position of the cursor. Once an emoticon is chosen, the sender may also choose its amplitude, and the increased or decreased amplitude will be displayed in the icon inserted into the message text.
    Type: Grant
    Filed: August 30, 2005
    Date of Patent: April 5, 2011
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Joern Ostermann, Mehmet Reha Civanlar, Eric Cosatto, Hans Peter Graf, Yann Andre LeCun
  • Publication number: 20110002508
    Abstract: A method of improving the lighting conditions of a real scene or video sequence. Digitally generated light is added to a scene for video conferencing over telecommunication networks. A virtual illumination equation takes into account light attenuation, Lambertian reflection, and specular reflection. An image of an object is captured, and a virtual light source illuminates the object within the image. In addition, the object can be the head of the user. The position of the head of the user is dynamically tracked so that a three-dimensional model representative of the head of the user is generated. Synthetic light is applied to a position on the model to form an illuminated model.
    Type: Application
    Filed: September 8, 2010
    Publication date: January 6, 2011
    Applicant: AT&T Intellectual Property II, L.P.
    Inventors: Andrea Basso, Eric Cosatto, David Crawford Gibbon, Hans Peter Graf, Shan Liu
  • Patent number: 7844467
    Abstract: A system and method of controlling the movement of a virtual agent while the agent is listening to a human user during a conversation is disclosed. The method comprises receiving speech data from the user, performing a prosodic analysis of the speech data and controlling the virtual agent movement according to the prosodic analysis.
    Type: Grant
    Filed: January 25, 2008
    Date of Patent: November 30, 2010
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Eric Cosatto, Hans Peter Graf, Thomas M. Isaacson, Volker Franz Strom
  • Patent number: 7805017
    Abstract: A method of improving the lighting conditions of a real scene or video sequence. Digitally generated light is added to a scene for video conferencing over telecommunication networks. A virtual illumination equation takes into account light attenuation, Lambertian reflection, and specular reflection. An image of an object is captured, and a virtual light source illuminates the object within the image. In addition, the object can be the head of the user. The position of the head of the user is dynamically tracked so that a three-dimensional model representative of the head of the user is generated. Synthetic light is applied to a position on the model to form an illuminated model.
    Type: Grant
    Filed: May 8, 2007
    Date of Patent: September 28, 2010
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Andrea Basso, Eric Cosatto, David Crawford Gibbon, Hans Peter Graf, Shan Liu
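A virtual illumination equation of the kind this abstract mentions (attenuation plus Lambertian and specular terms) typically resembles classic local shading. The following is a minimal sketch under that assumption; the coefficient names (`k_diffuse`, `k_specular`, `shininess`, `k_atten`) and the exact falloff form are illustrative, not taken from the patent.

```python
import math

# Hedged sketch of a virtual illumination equation: the synthetic light
# added at a surface point is an attenuated sum of a Lambertian diffuse
# term and a specular highlight term.

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    n = math.sqrt(dot(v, v))
    return tuple(x / n for x in v)

def shade(point, normal, light_pos, view_dir,
          k_diffuse=0.7, k_specular=0.3, shininess=16, k_atten=0.1):
    """Return the synthetic light intensity added at a surface point."""
    to_light = tuple(l - p for l, p in zip(light_pos, point))
    dist = math.sqrt(dot(to_light, to_light))
    L = tuple(x / dist for x in to_light)          # unit vector toward the light
    N = normalize(normal)
    attenuation = 1.0 / (1.0 + k_atten * dist * dist)  # falls off with distance
    diffuse = k_diffuse * max(0.0, dot(N, L))          # Lambertian term
    # reflect L about N for the specular highlight: R = 2(N.L)N - L
    R = tuple(2 * dot(N, L) * n - l for n, l in zip(N, L))
    specular = k_specular * max(0.0, dot(R, normalize(view_dir))) ** shininess
    return attenuation * (diffuse + specular)
```

Applied per pixel over the tracked three-dimensional head model, such a term yields the "illuminated model" the abstract describes.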
  • Publication number: 20100114579
    Abstract: A computing device and computer-readable medium storing instructions for controlling a computing device to customize a voice in a multi-media message created by a sender for a recipient, the multi-media message comprising a text message from the sender to be delivered by an animated entity. The instructions comprise receiving voice emoticons, which may be repeated, that the sender inserts into the text message and that are associated with parameters of a voice used by an animated entity to deliver the text message; and transmitting the text message such that a recipient device can deliver the multi-media message at a variable level associated with the number of times a respective voice emoticon is repeated.
    Type: Application
    Filed: December 29, 2009
    Publication date: May 6, 2010
    Applicant: AT&T Corp.
    Inventors: Joern Ostermann, Mehmet Reha Civanlar, Hans Peter Graf, Thomas M. Isaacson
  • Patent number: 7697668
    Abstract: A computing device and computer-readable medium storing instructions for controlling a computing device to customize a voice in a multi-media message created by a sender for a recipient, the multi-media message comprising a text message from the sender to be delivered by an animated entity. The instructions comprise receiving voice emoticons, which may be repeated, that the sender inserts into the text message and that are associated with parameters of a voice used by an animated entity to deliver the text message; and transmitting the text message such that a recipient device can deliver the multi-media message at a variable level associated with the number of times a respective voice emoticon is repeated.
    Type: Grant
    Filed: August 3, 2005
    Date of Patent: April 13, 2010
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Joern Ostermann, Mehmet Reha Civanlar, Hans Peter Graf, Thomas M. Isaacson
  • Publication number: 20100076762
    Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data, second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.
    Type: Application
    Filed: November 30, 2009
    Publication date: March 25, 2010
    Applicant: AT&T Corp.
    Inventors: Eric Cosatto, Hans Peter Graf, Juergen Schroeter
  • Publication number: 20100076750
    Abstract: Methods and apparatus for rendering a talking head on a client device are disclosed. The client device has a client cache capable of storing audio/visual data associated with rendering the talking head. The method comprises storing sentences in a client cache of a client device that relate to bridging delays in a dialog, storing sentence templates to be used in dialogs, generating a talking head response to a user inquiry from the client device, and determining whether sentences or stored templates stored in the client cache relate to the talking head response. If the stored sentences or stored templates relate to the talking head response, the method comprises instructing the client device to use the appropriate stored sentence or template from the client cache to render at least a part of the talking head response and transmitting a portion of the talking head response not stored in the client cache, if any, to the client device to render a complete talking head response.
    Type: Application
    Filed: November 30, 2009
    Publication date: March 25, 2010
    Applicant: AT&T Corp.
    Inventors: Eric Cosatto, Hans Peter Graf, Joern Ostermann
  • Publication number: 20090304268
    Abstract: A method and system for training an apparatus to recognize a pattern includes providing the apparatus with a host processor executing steps of a machine learning process; providing the apparatus with an accelerator including at least two processors; inputting training pattern data into the host processor; determining coefficient changes in the machine learning process with the host processor using the training pattern data; transferring the training data to the accelerator; determining kernel dot-products with the at least two processors of the accelerator using the training data; and transferring the dot-products back to the host processor.
    Type: Application
    Filed: June 4, 2009
    Publication date: December 10, 2009
    Applicant: NEC Laboratories America, Inc.
    Inventors: Srihari Cadambi, Igor Durdanovic, Venkata Jakkula, Eric Cosatto, Murugan Sankaradass, Hans Peter Graf, Srimat T. Chakradhar
  • Patent number: 7630897
    Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data, second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.
    Type: Grant
    Filed: May 19, 2008
    Date of Patent: December 8, 2009
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Eric Cosatto, Hans Peter Graf, Juergen Schroeter
  • Publication number: 20090297007
    Abstract: An automated method and system for analyzing a digital image of a biopsy to determine whether the biopsy is normal or abnormal, i.e., exhibits some type of disease such as, but not limited to, cancer. In the method and system, a classifier is trained to recognize well formed nuclei outlines from imperfect nuclei outlines in digital biopsy images. The trained classifier may then be used to filter nuclei outlines from one or more digital biopsy images to be analyzed, to obtain the well formed nuclei outlines. The well formed nuclei outlines may then be used to obtain statistics on the size or area of the nuclei for use in determining whether the biopsy is normal or abnormal.
    Type: Application
    Filed: June 2, 2008
    Publication date: December 3, 2009
    Applicant: NEC Laboratories America, Inc.
    Inventors: Eric Cosatto, Hans-Peter Graf, Matthew L. Miller
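Once a trained classifier (not shown here) has filtered out imperfect outlines, the downstream step this abstract describes reduces to computing per-nucleus size statistics. The sketch below models each well-formed outline as a polygon and uses the shoelace formula; the names (`polygon_area`, `nuclei_area_stats`) and the choice of mean/max statistics are illustrative assumptions.

```python
# Hedged sketch of the statistics step: compute the area of each
# well-formed nucleus outline, then summarize the areas for use in a
# normal-vs-abnormal decision.

def polygon_area(outline):
    """Area of a closed polygon [(x, y), ...] via the shoelace formula."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(outline, outline[1:] + outline[:1]):
        area += x0 * y1 - x1 * y0
    return abs(area) / 2.0

def nuclei_area_stats(outlines):
    """Mean and maximum nucleus area over the well-formed outlines."""
    areas = [polygon_area(o) for o in outlines]
    return sum(areas) / len(areas), max(areas)
```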
  • Patent number: 7627478
    Abstract: Methods and apparatus for rendering a talking head on a client device are disclosed. The client device has a client cache capable of storing audio/visual data associated with rendering the talking head. The method comprises storing sentences in a client cache of a client device that relate to bridging delays in a dialog, storing sentence templates to be used in dialogs, generating a talking head response to a user inquiry from the client device, and determining whether sentences or stored templates stored in the client cache relate to the talking head response. If the stored sentences or stored templates relate to the talking head response, the method comprises instructing the client device to use the appropriate stored sentence or template from the client cache to render at least a part of the talking head response and transmitting a portion of the talking head response not stored in the client cache, if any, to the client device to render a complete talking head response.
    Type: Grant
    Filed: July 16, 2007
    Date of Patent: December 1, 2009
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Eric Cosatto, Hans Peter Graf, Joern Ostermann
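The core decision in this abstract, which parts of a talking-head response can be rendered from the client cache and which must be transmitted, can be sketched as a simple partition. The function name and the sentence-level granularity are assumptions for illustration; the patent's cache also holds bridging sentences and templates.

```python
# Hedged sketch of the server-side cache check: split a response into
# sentences already cached on the client (rendered locally) and sentences
# that must still be transmitted.

def plan_response(response_sentences, client_cache):
    """Partition a response into cached and to-be-transmitted sentences."""
    from_cache, to_transmit = [], []
    for sentence in response_sentences:
        (from_cache if sentence in client_cache else to_transmit).append(sentence)
    return from_cache, to_transmit
```

The client then stitches locally rendered and transmitted segments into a complete talking-head response, bridging network delays with the cached material.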
  • Publication number: 20080221904
    Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data, second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.
    Type: Application
    Filed: May 19, 2008
    Publication date: September 11, 2008
    Applicant: AT&T Corp.
    Inventors: Eric Cosatto, Hans Peter Graf, Juergen Schroeter
  • Publication number: 20080201281
    Abstract: Disclosed is an improved technique for training a support vector machine using a distributed architecture. A training data set is divided into subsets, and the subsets are optimized in a first level of optimizations, with each optimization generating a support vector set. The support vector sets output from the first level optimizations are then combined and used as input to a second level of optimizations. This hierarchical processing continues for multiple levels, with the output of each prior level being fed into the next level of optimizations. In order to guarantee a global optimal solution, a final set of support vectors from a final level of optimization processing may be fed back into the first level of the optimization cascade so that the results may be processed along with each of the training data subsets.
    Type: Application
    Filed: April 28, 2008
    Publication date: August 21, 2008
    Applicant: NEC Laboratories America, Inc.
    Inventors: Hans Peter Graf, Eric Cosatto, Leon Bottou, Vladimir N. Vapnik
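The hierarchical structure this abstract describes, optimizing subsets, merging the resulting support-vector sets pairwise, and re-optimizing level by level, can be sketched independently of any particular SVM solver. Below, `train` is a stand-in for a real optimizer that returns a support-vector set; the pairwise merge schedule and names are assumptions, and the feedback pass that guarantees a global optimum is omitted.

```python
# Hedged structural sketch of one pass of a cascade: each level trains on
# unions of the previous level's support-vector sets until one set remains.

def cascade(subsets, train):
    """Run one pass of the hierarchical cascade; return the final SV set."""
    level = [train(s) for s in subsets]      # first level: optimize each subset
    while len(level) > 1:
        merged = []
        # combine adjacent support-vector sets and re-optimize each union
        for i in range(0, len(level) - 1, 2):
            merged.append(train(level[i] + level[i + 1]))
        if len(level) % 2:                   # odd set out carries over
            merged.append(level[-1])
        level = merged
    return level[0]
```

Because each `train` call sees only support vectors from the level below, the working sets stay small even when the full training set is large, which is the point of the cascade.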
  • Patent number: 7406450
    Abstract: Disclosed is a parallel support vector machine technique for solving problems with a large set of training data where the kernel computation, as well as the kernel cache and the training data, are spread over a number of distributed machines or processors. A plurality of processing nodes are used to train a support vector machine based on a set of training data. Each of the processing nodes selects a local working set of training data based on data local to the processing node, for example a local subset of gradients. Each node transmits selected data related to the working set (e.g., gradients having a maximum value) and receives an identification of a global working set of training data. The processing node optimizes the global working set of training data and updates a portion of the gradients of the global working set of training data. The updating of a portion of the gradients may include generating a portion of a kernel matrix. These steps are repeated until a convergence condition is met.
    Type: Grant
    Filed: February 20, 2006
    Date of Patent: July 29, 2008
    Assignee: NEC Laboratories America, Inc.
    Inventors: Hans Peter Graf, Igor Durdanovic, Eric Cosatto, Vladimir Vapnik
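The step where each node nominates locally selected training data (e.g., its largest gradients) and a global working set is formed from the nominations can be sketched as a simple reduction. The selection-by-gradient-magnitude rule and all names below are illustrative assumptions, not the patent's exact criterion.

```python
# Hedged sketch of distributed working-set selection: each node proposes
# its k largest-magnitude gradients; the global working set is the k
# indices with the largest magnitude among all nominations.

def global_working_set(local_gradients, k):
    """local_gradients: {node: {index: gradient}}. Return sorted indices
    of the global working set of size k."""
    nominations = {}
    for grads in local_gradients.values():
        top = sorted(grads, key=lambda i: abs(grads[i]), reverse=True)[:k]
        for i in top:
            nominations[i] = abs(grads[i])
    return sorted(sorted(nominations, key=nominations.get, reverse=True)[:k])
```

Each node would then optimize this global working set, update its share of the gradients (generating its part of the kernel matrix as needed), and repeat until convergence, as the abstract outlines.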
  • Patent number: 7392190
    Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data, second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.
    Type: Grant
    Filed: August 24, 2006
    Date of Patent: June 24, 2008
    Assignee: AT&T Corp.
    Inventors: Eric Cosatto, Hans Peter Graf, Juergen Schroeter
  • Patent number: 7369992
    Abstract: A system and method for generating a video sequence having mouth movements synchronized with speech sounds are disclosed. The system utilizes a database of n-phones as the smallest selectable unit, wherein n is larger than 1 and preferably 3. The system calculates a target cost for each candidate n-phone for a target frame using a phonetic distance, coarticulation parameter, and speech rate. For each n-phone in a target sequence, the system searches for candidate n-phones that are visually similar according to the target cost. The system samples each candidate n-phone to get a same number of frames as in the target sequence and builds a video frame lattice of candidate video frames. The system assigns a joint cost to each pair of adjacent frames and searches the video frame lattice to construct the video sequence by finding the optimal path through the lattice according to the minimum of the sum of the target cost and the joint cost over the sequence.
    Type: Grant
    Filed: February 16, 2007
    Date of Patent: May 6, 2008
    Assignee: AT&T Corp.
    Inventors: Eric Cosatto, Hans Peter Graf, Fu Jie Huang
  • Patent number: 7353177
    Abstract: A system and method of controlling the movement of a virtual agent while the agent is listening to a human user during a conversation is disclosed. The method comprises receiving speech data from the user, performing a prosodic analysis of the speech data and controlling the virtual agent movement according to the prosodic analysis.
    Type: Grant
    Filed: September 28, 2005
    Date of Patent: April 1, 2008
    Assignee: AT&T Corp.
    Inventors: Eric Cosatto, Hans Peter Graf, Thomas M. Isaacson, Volker Franz Strom