Patents by Inventor Ronan Collobert

Ronan Collobert has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8874434
    Abstract: The method and apparatus for discriminative natural language parsing, uses a deep convolutional neural network adapted for text and a structured tag inference in a graph. In the method and apparatus, a trained recursive convolutional graph transformer network, formed by the deep convolutional neural network and the graph, predicts “levels” of a parse tree based on predictions of previous levels.
    Type: Grant
    Filed: June 1, 2011
    Date of Patent: October 28, 2014
    Assignee: NEC Laboratories America, Inc.
    Inventors: Ronan Collobert, Bing Bai
  • Patent number: 8504361
    Abstract: A method and system for labeling a selected word of a sentence using a deep neural network includes, in one exemplary embodiment, determining an index term corresponding to each feature of the word, transforming the index term or terms of the word into a vector, and predicting a label for the word using the vector. The method and system, in another exemplary embodiment, includes determining, for each word in the sentence, an index term corresponding to each feature of the word, transforming the index term or terms of each word in the sentence into a vector, applying a convolution operation to the vector of the selected word and at least one of the vectors of the other words in the sentence, to transform the vectors into a matrix of vectors, each of the vectors in the matrix including a plurality of row values, constructing a single vector from the vectors in the matrix, and predicting a label for the selected word using the single vector.
    Type: Grant
    Filed: February 9, 2009
    Date of Patent: August 6, 2013
    Assignee: NEC Laboratories America, Inc.
    Inventors: Ronan Collobert, Jason Weston
  • Patent number: 8392436
    Abstract: A method and system for searching for information contained in a database of documents each includes an offline part and an online part. The offline part includes predicting, in a first computer process, semantic data for sentences of the documents contained in the database and storing this data in a database. The online part includes querying the database for information with a semantically-sensitive query, predicting, in a real time computer process, semantic data for the query, and determining, in a second computer process, a matching score against all the documents in the database, which incorporates the semantic data for the sentences and the query.
    Type: Grant
    Filed: February 2, 2009
    Date of Patent: March 5, 2013
    Assignee: NEC Laboratories America, Inc.
    Inventors: Bing Bai, Jason Weston, Ronan Collobert
  • Patent number: 8341095
    Abstract: A system and method for determining a similarity between a document and a query includes building a weight vector for each of a plurality of documents in a corpus of documents stored in memory and building a weight vector for a query input into a document retrieval system. A weight matrix is generated which distinguishes between relevant documents and lower ranked documents by comparing document/query tuples using a gradient step approach. A similarity score is determined between weight vectors of the query and documents in a corpus by determining a product of a document weight vector, a query weight vector and the weight matrix.
    Type: Grant
    Filed: September 18, 2009
    Date of Patent: December 25, 2012
    Assignee: NEC Laboratories America, Inc.
    Inventors: Bing Bai, Jason Weston, Ronan Collobert, David Grangier
  • Patent number: 8266083
    Abstract: A method for training a learning machine for use in discriminative classification and regression includes randomly selecting, in a first computer process, an unclassified datapoint associated with a phenomenon of interest; determining, in a second computer process, a set of datapoints associated with the phenomenon of interest that is likely to be in the same class as the selected unclassified datapoint; predicting, in a third computer process, a class label for the selected unclassified datapoint in a third computer process; predicting a class label for the set of datapoints in a fourth computer process; combining the predicted class labels in a fifth computer process, to predict a composite class label that describes the selected unclassified datapoint and the set of datapoints; and using the combined class label to adjust at least one parameter of the learning machine in a sixth computer process.
    Type: Grant
    Filed: February 2, 2009
    Date of Patent: September 11, 2012
    Assignee: NEC Laboratories America, Inc.
    Inventors: Jason Weston, Ronan Collobert
  • Patent number: 8234228
    Abstract: The invention includes a method for training a learning machine having a deep multi-layered network, with labeled and unlabeled training data. The deep multi-layered network is a network having multiple layers of non-linear mapping. The method generally includes applying unsupervised embedding to any one or more of the layers of the deep network. The unsupervised embedding is operative as a semi-supervised regularizer in the deep network.
    Type: Grant
    Filed: February 6, 2009
    Date of Patent: July 31, 2012
    Assignee: NEC Laboratories America, Inc.
    Inventors: Jason Weston, Ronan Collobert
  • Patent number: 8180633
    Abstract: A system and method for semantic extraction using a neural network architecture includes indexing each word in an input sentence into a dictionary and using these indices to map each word to a d-dimensional vector (the features of which are learned). Together with this, position information for a word of interest (the word to labeled) and a verb of interest (the verb that the semantic role is being predicted for) with respect to a given word are also used. These positions are integrated by employing a linear layer that is adapted to the input sentence. Several linear transformations and squashing functions are then applied to output class probabilities for semantic role labels. All the weights for the whole architecture are trained by backpropagation.
    Type: Grant
    Filed: February 29, 2008
    Date of Patent: May 15, 2012
    Assignee: NEC Laboratories America, Inc.
    Inventors: Ronan Collobert, Jason Weston
  • Publication number: 20110301942
    Abstract: The method and apparatus for discriminative natural language parsing, uses a deep convolutional neural network adapted for text and a structured tag inference in a graph. In the method and apparatus, a trained recursive convolutional graph transformer network, formed by the deep convolutional neural network and the graph, predicts “levels” of a parse tree based on predictions of previous levels.
    Type: Application
    Filed: June 1, 2011
    Publication date: December 8, 2011
    Applicant: NEC LABORATORIES AMERICA, INC.
    Inventors: Ronan Collobert, Bing Bai
  • Patent number: 7778949
    Abstract: Disclosed is a method for training a transductive support vector machine. The support vector machine is trained based on labeled training data and unlabeled test data. A non-convex objective function which optimizes a hyperplane classifier for classifying the unlabeled test data is decomposed into a convex function and a concave function. A local approximation of the concave function at a hyperplane is calculated, and the approximation of the concave function is combined with the convex function such that the result is a convex problem. The convex problem is then solved to determine an updated hyperplane. This method is performed iteratively until the solution converges.
    Type: Grant
    Filed: March 21, 2007
    Date of Patent: August 17, 2010
    Assignee: NEC Laboratories America, Inc.
    Inventors: Ronan Collobert, Jason Weston, Leon Bottou
  • Publication number: 20100179933
    Abstract: A system and method for determining a similarity between a document and a query includes building a weight vector for each of a plurality of documents in a corpus of documents stored in memory and building a weight vector for a query input into a document retrieval system. A weight matrix is generated which distinguishes between relevant documents and lower ranked documents by comparing document/query tuples using a gradient step approach. A similarity score is determined between weight vectors of the query and documents in a corpus by determining a product of a document weight vector, a query weight vector and the weight matrix.
    Type: Application
    Filed: September 18, 2009
    Publication date: July 15, 2010
    Applicant: NEC Laboratories America, Inc.
    Inventors: BING BAI, Jason Weston, Ronan Collobert, David Grangier
  • Publication number: 20090210218
    Abstract: A method and system for labeling a selected word of a sentence using a deep neural network includes, in one exemplary embodiment, determining an index term corresponding to each feature of the word, transforming the index term or terms of the word into a vector, and predicting a label for the word using the vector. The method and system, in another exemplary embodiment, includes determining, for each word in the sentence, an index term corresponding to each feature of the word, transforming the index term or terms of each word in the sentence into a vector, applying a convolution operation to the vector of the selected word and at least one of the vectors of the other words in the sentence, to transform the vectors into a matrix of vectors, each of the vectors in the matrix including a plurality of row values, constructing a single vector from the vectors in the matrix, and predicting a label for the selected word using the single vector.
    Type: Application
    Filed: February 9, 2009
    Publication date: August 20, 2009
    Applicant: NEC Laboratories America, Inc.
    Inventors: Ronan Collobert, Jason Weston
  • Publication number: 20090204556
    Abstract: A method for training a learning machine for use in discriminative classification and regression includes randomly selecting, in a first computer process, an unclassified datapoint associated with a phenomenon of interest; determining, in a second computer process, a set of datapoints associated with the phenomenon of interest that is likely to be in the same class as the selected unclassified datapoint; predicting, in a third computer process, a class label for the selected unclassified datapoint in a third computer process; predicting a class label for the set of datapoints in a fourth computer process; combining the predicted class labels in a fifth computer process, to predict a composite class label that describes the selected unclassified datapoint and the set of datapoints; and using the combined class label to adjust at least one parameter of the learning machine in a sixth computer process.
    Type: Application
    Filed: February 2, 2009
    Publication date: August 13, 2009
    Applicant: NEC LABORATORIES AMERICA, INC.
    Inventors: Jason Weston, Ronan Collobert
  • Publication number: 20090204605
    Abstract: A method and system for searching for information contained in a database of documents each includes an offline part and an online part. The offline part includes predicting, in a first computer process, semantic data for sentences of the documents contained in the database and storing this data in a database. The online part includes querying the database for information with a semantically-sensitive query, predicting, in a real time computer process, semantic data for the query, and determining, in a second computer process, a matching score against all the documents in the database, which incorporates the semantic data for the sentences and the query.
    Type: Application
    Filed: February 2, 2009
    Publication date: August 13, 2009
    Applicant: NEC LABORATORIES AMERICA, INC.
    Inventors: Bing Bai, Jason Weston, Ronan Collobert
  • Publication number: 20090204558
    Abstract: A method for training a learning machine having a deep network with a plurality of layers, includes applying a regularizer to one or more of the layers of the deep network; training the regularizer with unlabeled data; and training the deep network with labeled data. Also, an apparatus for use in discriminative classification and regression, including an input device for inputting unlabeled and labeled data associated with a phenomenon of interest; a processor; and a memory communicating with the processor. The memory includes instructions executable by the processor for implementing a learning machine having a deep network structure and training the learning machine by applying a regularizer to one or more of the layers of the deep network; training the regularizer with unlabeled data; and training the deep network with labeled data.
    Type: Application
    Filed: February 6, 2009
    Publication date: August 13, 2009
    Applicant: NEC Laboratories America, Inc.
    Inventors: Jason Weston, Ronan Collobert
  • Publication number: 20090171868
    Abstract: Disclosed is a method for early termination in training support vector machines. A support vector machine is iteratively trained based on training examples using an objective function having primal and dual formulations. At each iteration, a termination threshold is calculated based on the current SVM solution. The termination threshold increases with the number of training examples. The termination threshold can be calculated based on the observed variance of the loss for the current SVM solution. The termination threshold is compared to a duality gap between primal and dual formulations at the current SVM solution. When the duality gap is less than the termination threshold, the training is terminated.
    Type: Application
    Filed: December 27, 2007
    Publication date: July 2, 2009
    Applicant: NEC LABORATORIES AMERICA, INC.
    Inventors: Leon Bottou, Ronan Collobert, Jason Edward Weston
  • Publication number: 20080221878
    Abstract: A system and method for semantic extraction using a neural network architecture includes indexing each word in an input sentence into a dictionary and using these indices to map each word to a d-dimensional vector (the features of which are learned). Together with this, position information for a word of interest (the word to labeled) and a verb of interest (the verb that the semantic role is being predicted for) with respect to a given word are also used. These positions are integrated by employing a linear layer that is adapted to the input sentence. Several linear transformations and squashing functions are then applied to output class probabilities for semantic role labels. All the weights for the whole architecture are trained by backpropagation.
    Type: Application
    Filed: February 29, 2008
    Publication date: September 11, 2008
    Applicant: NEC LABORATORIES AMERICA, INC.
    Inventors: Ronan Collobert, Jason Weston
  • Publication number: 20070265991
    Abstract: Disclosed is a method for training a transductive support vector machine. The support vector machine is trained based on labeled training data and unlabeled test data. A non-convex objective function which optimizes a hyperplane classifier for classifying the unlabeled test data is decomposed into a convex function and a concave function. A local approximation of the concave function at a hyperplane is calculated, and the approximation of the concave function is combined with the convex function such that the result is a convex problem. The convex problem is then solved to determine an updated hyperplane. This method is performed iteratively until the solution converges.
    Type: Application
    Filed: March 21, 2007
    Publication date: November 15, 2007
    Applicant: NEC LABORATORIES AMERICA, INC.
    Inventors: Ronan Collobert, Jason Weston, Leon Bottou