Patents by Inventor Enrico Bocchieri
Enrico Bocchieri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10121468Abstract: Disclosed herein are systems, methods, and computer-readable storage media for a speech recognition application for directory assistance that is based on a user's spoken search query. The spoken search query is received by a portable device and portable device then determines its present location. Upon determining the location of the portable device, that information is incorporated into a local language model that is used to process the search query. Finally, the portable device outputs the results of the search query based on the local language model.Type: GrantFiled: June 15, 2016Date of Patent: November 6, 2018Assignee: NUANCE COMMUNICATIONS, INC.Inventors: Enrico Bocchieri, Diamantino Antonio Caseiro
-
Patent number: 9558738Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating an acoustic model for use in speech recognition. A system configured to practice the method first receives training data and identifies non-contextual lexical-level features in the training data. Then the system infers sentence-level features from the training data and generates a set of decision trees by node-splitting based on the non-contextual lexical-level features and the sentence-level features. The system decorrelates training vectors, based on the training data, for each decision tree in the set of decision trees to approximate full-covariance Gaussian models, and then can train an acoustic model for use in speech recognition based on the training data, the set of decision trees, and the training vectors.Type: GrantFiled: March 8, 2011Date of Patent: January 31, 2017Assignee: AT&T Intellectual Property I, L.P.Inventors: Enrico Bocchieri, Diamantino Antonio Caseiro, Dimitrios Dimitriadis
-
Patent number: 9484018Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for building an automatic speech recognition system through an Internet API. A network-based automatic speech recognition server configured to practice the method receives feature streams, transcriptions, and parameter values as inputs from a network client independent of knowledge of internal operations of the server. The server processes the inputs to train an acoustic model and a language model, and transmits the acoustic model and the language model to the network client. The server can also generate a log describing the processing and transmit the log to the client. On the server side, a human expert can intervene to modify how the server processes the inputs. The inputs can include an additional feature stream generated from speech by algorithms in the client's proprietary feature extraction.Type: GrantFiled: November 23, 2010Date of Patent: November 1, 2016Assignee: AT&T Intellectual Property I, L.P.Inventors: Enrico Bocchieri, Dimitrios Dimitriadis, Horst J. Schroeter
-
Publication number: 20160293161Abstract: Disclosed herein are systems, methods, and computer-readable storage media for a speech recognition application for directory assistance that is based on a user's spoken search query. The spoken search query is received by a portable device and portable device then determines its present location. Upon determining the location of the portable device, that information is incorporated into a local language model that is used to process the search query. Finally, the portable device outputs the results of the search query based on the local language model.Type: ApplicationFiled: June 15, 2016Publication date: October 6, 2016Inventors: Enrico BOCCHIERI, Diamantino Antonio CASEIRO
-
Patent number: 9373326Abstract: Disclosed herein are systems, methods, and computer-readable storage media for a speech recognition application for directory assistance that is based on a user's spoken search query. The spoken search query is received by a portable device and portable device then determines its present location. Upon determining the location of the portable device, that information is incorporated into a local language model that is used to process the search query. Finally, the portable device outputs the results of the search query based on the local language model.Type: GrantFiled: November 14, 2014Date of Patent: June 21, 2016Assignee: AT&T Intellectual Property I, L.P.Inventors: Enrico Bocchieri, Diamantino Antonio Caseiro
-
Publication number: 20150073793Abstract: Disclosed herein are systems, methods, and computer-readable storage media for a speech recognition application for directory assistance that is based on a user's spoken search query. The spoken search query is received by a portable device and portable device then determines its present location. Upon determining the location of the portable device, that information is incorporated into a local language model that is used to process the search query. Finally, the portable device outputs the results of the search query based on the local language model.Type: ApplicationFiled: November 14, 2014Publication date: March 12, 2015Inventors: Enrico BOCCHIERI, Diamantino Antonio Caseiro
-
Patent number: 8914510Abstract: A network communication system includes a connection server that assigns a network address within a data communication network to a subscriber terminal. The connection server receives outgoing communications from the subscriber terminal and transmits the outgoing communications to a network access point and receives incoming communications from the network access point and transmits the incoming communications to the subscriber terminal. The connection server intercepts a tracking cookie received from a remote server in the data communications network and intended for the subscriber terminal and stores the tracking cookie at the connection server so that the tracking cookie can be used to support a communication session between the subscriber terminal and the remote server without the tracking cookie being stored at the subscriber terminal.Type: GrantFiled: November 17, 2008Date of Patent: December 16, 2014Assignee: AT&T Intellectual Property I, L.P.Inventors: Enrico Bocchieri, Horst Schroeter
-
Patent number: 8892443Abstract: Disclosed herein are systems, methods, and computer-readable storage media for a speech recognition application for directory assistance that is based on a user's spoken search query. The spoken search query is received by a portable device and portable device then determines its present location. Upon determining the location of the portable device, that information is incorporated into a local language model that is used to process the search query. Finally, the portable device outputs the results of the search query based on the local language model.Type: GrantFiled: December 15, 2009Date of Patent: November 18, 2014Assignee: AT&T Intellectual Property I, L.P.Inventors: Enrico Bocchieri, Diamantino Antonio Caseiro
-
Patent number: 8862582Abstract: Disclosed are a system, method and computer-readable medium for organizing images. A method aspect relates to receiving an image into a device, receiving incidental information associated with the image, organizing the image and the incidental information into a data structure such as a sparse array, classifying the received image with an image classifier and storing the classified image in an image database, receiving a search query and responding to the search query by searching for and retrieving matching images in the image database based on a comparison of the image search query to the data structure.Type: GrantFiled: November 15, 2007Date of Patent: October 14, 2014Assignee: AT&T Intellectual Property I, L.P.Inventors: Charles Blewett, Enrico Bocchieri, Giuseppe Di Fabbrizio, Donnie Henderson, Thomas Killian, Thomas Kirk, David Kormann, Gregory T. Vesonder
-
Publication number: 20120232902Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating an acoustic model for use in speech recognition. A system configured to practice the method first receives training data and identifies non-contextual lexical-level features in the training data. Then the system infers sentence-level features from the training data and generates a set of decision trees by node-splitting based on the non-contextual lexical-level features and the sentence-level features. The system decorrelates training vectors, based on the training data, for each decision tree in the set of decision trees to approximate full-covariance Gaussian models, and then can train an acoustic model for use in speech recognition based on the training data, the set of decision trees, and the training vectors.Type: ApplicationFiled: March 8, 2011Publication date: September 13, 2012Applicant: AT&T Intellectual Property I, L.P.Inventors: Enrico BOCCHIERI, Diamantino Antonio Caseiro, Dimitrios Dimitriadis
-
Publication number: 20120130709Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for building an automatic speech recognition system through an Internet API. A network-based automatic speech recognition server configured to practice the method receives feature streams, transcriptions, and parameter values as inputs from a network client independent of knowledge of internal operations of the server. The server processes the inputs to train an acoustic model and a language model, and transmits the acoustic model and the language model to the network client. The server can also generate a log describing the processing and transmit the log to the client. On the server side, a human expert can intervene to modify how the server processes the inputs. The inputs can include an additional feature stream generated from speech by algorithms in the client's proprietary feature extraction.Type: ApplicationFiled: November 23, 2010Publication date: May 24, 2012Applicant: AT&T Intellectual Property I, L.P.Inventors: Enrico BOCCHIERI, Dimitrios Dimitriadis, Horst J. Schroeter
-
Publication number: 20110144973Abstract: Disclosed herein are systems, methods, and computer-readable storage media for a speech recognition application for directory assistance that is based on a user's spoken search query. The spoken search query is received by a portable device and portable device then determines its present location. Upon determining the location of the portable device, that information is incorporated into a local language model that is used to process the search query. Finally, the portable device outputs the results of the search query based on the local language model.Type: ApplicationFiled: December 15, 2009Publication date: June 16, 2011Applicant: AT&T Intellectual Property I, L.P.Inventors: Enrico Bocchieri, Diamantino Antonio Caseiro
-
Publication number: 20100125668Abstract: A network communication system includes a connection server that assigns a network address within a data communication network to a subscriber terminal. The connection server receives outgoing communications from the subscriber terminal and transmits the outgoing communications to a network access point and receives incoming communications from the network access point and transmits the incoming communications to the subscriber terminal. The connection server intercepts a tracking cookie received from a remote server in the data communications network and intended for the subscriber terminal and stores the tracking cookie at the connection server so that the tracking cookie can be used to support a communication session between the subscriber terminal and the remote server without the tracking cookie being stored at the subscriber terminal.Type: ApplicationFiled: November 17, 2008Publication date: May 20, 2010Inventors: Enrico Bocchieri, Horst Schroeter
-
Publication number: 20090132467Abstract: Disclosed are a system, method and computer-readable medium for organizing images. A method aspect relates to receiving an image into a device, receiving incidental information associated with the image, organizing the image and the incidental information into a data structure such as a sparse array, classifying the received image with an image classifier and storing the classified image in an image database, receiving a search query and responding to the search query by searching for and retrieving matching images in the image database based on a comparison of the image search query to the data structure.Type: ApplicationFiled: November 15, 2007Publication date: May 21, 2009Applicant: AT & T LabsInventors: Charles Blewett, Enrico Bocchieri, Giuseppe Di Fabbrizio, Donnie Henderson, Thomas Killian, Thomas Kirk, David Kormann, Gregory T. Vesonder
-
A SYSTEM AND METHOD FOR PROVIDING LARGE VOCABULARY SPEECH PROCESSING BASED ON FIXED-POINT ARITHMETIC
Publication number: 20070192104Abstract: Disclosed herein is a system, method and computer-readable medium storing instructions for controlling a computing device according to the method. The invention relates to a system, method and computer-readable medium storing instructions for controlling a computing device according to the method. As an example embodiment, the method uses a speech recognition decoder that operates or uses fixed point arithmetic. The exemplary method comprises representing arc costs associated with at least one finite state transducer (FST) in fixed point, representing parameters associated with a hidden Markov model (HMM) in fixed point and processing speech data in the speech recognition decoder using fixed point arithmetic for the fixed point FST arc costs and the fixed point HMM parameters. The method may also include computing at the decoder sentence hypothesis probabilities with fixed point arithmetic as type Q-2e numbers.Type: ApplicationFiled: February 16, 2006Publication date: August 16, 2007Applicant: AT&T Corp.Inventors: CHARLES BLEWETT, ENRICO BOCCHIERI -
Patent number: 4908865Abstract: Recognition of sound units is improved by comparing frame-pair feature vectors which helps compensate for context variations in the pronunciation of sound units. A plurality of reference frames are stored of reference feature vectors representing reference words. A linear predictive coder (10) generates a plurality of spectral feature vectors for each frame of the speech signals. A filter bank system (12) transforms the spectral feature vectors to filter bank representations. A principal feature vector transformer (14) transforms the filter bank representations to an identity matrix of transformed input feature vectors. A concatenate frame system (16) concatenates the input feature vectors of adjacent frames to form the feature vector of a frame-pair. A transformer (18) and a comparator (20) compute the likelihood that each input feature vector for a frame-pair was produced by each reference frame. This computation is performed individually and independently for each reference frame-pairs.Type: GrantFiled: December 22, 1988Date of Patent: March 13, 1990Assignee: Texas Instruments IncorporatedInventors: George R. Doddington, Enrico Bocchieri