Patents by Inventor Ioan Calapodescu

Ioan Calapodescu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

END-TO-END NEURAL TEXT-TO-SPEECH MODEL WITH PROSODY CONTROL

Publication number: 20230215421

Abstract: Methods and systems for generating an end-to-end neural text-to-speech (TTS) model to process an input text to generate speech representations. An annotated set of text documents including annotations inserted therein to indicate prosodic features are input into the TTS model. The TTS model is trained using the annotated dataset and a corresponding dataset of speech representations of the text documents that include prosody associated with the indicated prosodic features. The trained TTS model learns to associate the prosody with the annotations.

Type: Application

Filed: September 23, 2022

Publication date: July 6, 2023

Inventors: Ioan CALAPODESCU, Inyoung KIM, Laurent BESACIER, Siddique LATIF
Method and system for training neural sequence-to-sequence models by incorporating global features

Patent number: 11681911

Abstract: Methods for training a neural sequence-to-sequence (seq2seq) model. A processor receives the model and training data comprising a plurality of training source sequences and corresponding training target sequences, and generates corresponding predicted target sequences. Model parameters are updated based on a comparison of predicted target sequences to training target sequences to reduce or minimize both a local loss in the predicted target sequences and an expected loss of one or more global or semantic features or constraints between the predicted target sequences and the training target sequences given the training source sequences. Expected loss is based on global or semantic features or constraints of general target sequences given general source sequences.

Type: Grant

Filed: October 15, 2019

Date of Patent: June 20, 2023

Assignee: NAVER CORPORATION

Inventors: Vu Cong Duy Hoang, Ioan Calapodescu, Marc Dymetman
Method and system for training document-level natural language processing models

Patent number: 11625544

Abstract: In methods for training a natural language generation (NLG) model using a processor a document-level machine translation (MT) model is provided by training an MT model to receive as input, token sequences in a first language, and to generate as output, token sequences in a second language. An augmented document-level MT model is provided by training the document-level MT model to receive as input, paired language-independent structured data and token sequences in the first language, and to generate as output, token sequences in the second language. The augmented document-level MT model is trained to receive as input, language-independent structured data, and to generate as output, token sequences in the second language.

Type: Grant

Filed: September 17, 2020

Date of Patent: April 11, 2023

Assignee: NAVER CORPORATION

Inventors: Ioan Calapodescu, Alexandre Berard, Fahimeh Saleh, Laurent Besacier
Computer-Implemented Method for Improving Classification of Labels and Categories of a Database

Publication number: 20230004581

Abstract: There is disclosed a method for using a computer to enable correction of misclassified labels in a database. The computer initially applies a dataset (including labels pointing respectively to categories) to a first classifier, which includes a first loss function. Pursuant to the initial application, the computer determines that one or more labels have been misclassified. Responsive to such determination, the computer changes the first loss function to a second loss function to form a second classifier including the second loss function. The computer then applies the dataset to the second classifier for enabling correction of the one or more misclassified labels.

Type: Application

Filed: May 2, 2022

Publication date: January 5, 2023

Applicant: Naver Corporation

Inventors: Nikolaos LAGOS, Ioan CALAPODESCU, JinHee LEE, Salah AIT-MOKHTAR
METHOD AND SYSTEM FOR TRAINING DOCUMENT-LEVEL NATURAL LANGUAGE PROCESSING MODELS

Publication number: 20220050973

Abstract: In methods for training a natural language generation (NLG) model using a processor a document-level machine translation (MT) model is provided by training an MT model to receive as input, token sequences in a first language, and to generate as output, token sequences in a second language. An augmented document-level MT model is provided by training the document-level MT model to receive as input, paired language-independent structured data and token sequences in the first language, and to generate as output, token sequences in the second language. The augmented document-level MT model is trained to receive as input, language-independent structured data, and to generate as output, token sequences in the second language.

Type: Application

Filed: September 17, 2020

Publication date: February 17, 2022

Inventors: Ioan CALAPODESCU, Alexandre BERARD, Fahimeh SALEH, Laurent BESACIER
SYSTEM AND METHOD FOR PROCESSING POINT-OF-INTEREST DATA

Publication number: 20210390392

Abstract: Methods employing temporal and/or cultural data are provided for automatically assigning one or more semantic tags to a point-of-interest (POI) using a processor. The POI is represented by attribute data. A dataset including temporal attribute data and/or cultural data is provided to a multilabel classifier comprising a neural network model. One or more predicted semantic tags for the POI are received from an output of the multilabel classifier. The predicted semantic tags are stored in a database as additional attribute data of the POI.

Type: Application

Filed: October 28, 2020

Publication date: December 16, 2021

Inventors: Nikolaos LAGOS, Salah AIT-MOKHTAR, Ioan CALAPODESCU
METHOD AND SYSTEM FOR TRAINING NEURAL SEQUENCE-TO-SEQUENCE MODELS BY INCORPORATING GLOBAL FEATURES

Publication number: 20210110254

Abstract: Methods for training a neural sequence-to-sequence (seq2seq) model. A processor receives the model and training data comprising a plurality of training source sequences and corresponding training target sequences, and generates corresponding predicted target sequences. Model parameters are updated based on a comparison of predicted target sequences to training target sequences to reduce or minimize both a local loss in the predicted target sequences and an expected loss of one or more global or semantic features or constraints between the predicted target sequences and the training target sequences given the training source sequences. Expected loss is based on global or semantic features or constraints of general target sequences given general source sequences.

Type: Application

Filed: October 15, 2019

Publication date: April 15, 2021

Inventors: Vu Cong Duy HOANG, Ioan CALAPODESCU, Marc DYMETMAN
System and method for entity extraction from semi-structured text documents

Patent number: 10489439

Abstract: A method for extracting entities from a text document includes, for at least a section of a text document, providing a first set of entities extracted from the at least a section, clustering at least a subset of the extracted entities in the first set into clusters, based on locations of the entities in the document. Complete ones of the clusters of entities are identified. Patterns for extracting new entities are learned based on the complete clusters. New entities are extracted from incomplete clusters based on the learned patterns.

Type: Grant

Filed: April 14, 2016

Date of Patent: November 26, 2019

Assignee: XEROX CORPORATION

Inventors: Ioan Calapodescu, Nicolas Guerin, Fanchon Jacques
SYSTEM AND METHOD FOR ENTITY EXTRACTION FROM SEMI-STRUCTURED TEXT DOCUMENTS

Publication number: 20170300565

Abstract: A method for extracting entities from a text document includes, for at least a section of a text document, providing a first set of entities extracted from the at least a section, clustering at least a subset of the extracted entities in the first set into clusters, based on locations of the entities in the document. Complete ones of the clusters of entities are identified. Patterns for extracting new entities are learned based on the complete clusters. New entities are extracted from incomplete clusters based on the learned patterns.

Type: Application

Filed: April 14, 2016

Publication date: October 19, 2017

Applicant: Xerox Corporation

Inventors: Ioan Calapodescu, Nicolas Guerin, Fanchon Jacques
Compact fuzzy private matching using a fully-homomorphic encryption scheme

Patent number: 9749128

Abstract: A method for data matching includes providing two sets of encrypted data elements by converting data elements to respective sets of vectors and encrypting each vector with a public key of a homomorphic encryption scheme. Each data element includes a sequence of characters drawn from an alphabet. For pairs of encrypted data elements, a comparison measure is computed between the sets of encrypted vectors. An obfuscated vector is generated for each encrypted data element in the first set, which renders the first encrypted data element indecipherable when the comparison measure does not meet a threshold for at least one of the pairs of data encrypted elements comprising that encrypted data element. The obfuscated vectors can be decrypted with a private key, allowing data elements in the first set to be deciphered if the comparison measure meets the threshold for at least one of the data elements in the second set.

Type: Grant

Filed: May 15, 2014

Date of Patent: August 29, 2017

Assignee: XEROX CORPORATION

Inventors: Ioan Calapodescu, Saghar Estehghari, Johan Clier
VIRTUAL REALITY METHOD AND SYSTEM FOR TEXT MANIPULATION

Publication number: 20170124762

Abstract: A Virtual Reality (VR) method and system for text manipulation. According to an exemplary embodiment, a VR method displays and provides a user with interaction with displayed documents in a Virtual Environment (VE), the VE including a VR head-mounted display and one or more gesture sensors. Text manipulation is performed using natural human body interactions with the VR system.

Type: Application

Filed: October 28, 2015

Publication date: May 4, 2017

Applicant: Xerox Corporation

Inventors: Caroline Privault, Fabien Guillot, Christophe Legras, Ioan Calapodescu
LARGE-SCALE BATCH ACTIVE LEARNING USING LOCALITY SENSITIVE HASHING

Publication number: 20160307113

Abstract: A system and method for selection of a batch of objects are provided. Each object in a pool is assigned to a subset of a set of buckets. The assignment is based on signatures, generated, for example, by LSH hashing object representations of the objects in the pool. The signatures are then segmented into bands which are each assigned to a respective bucket in the set, based on the elements of the band. An entropy value is computed for each of a set of objects remaining in the pool using a current classifier model. A batch of objects for retraining the model is selected. This includes selecting objects from the set of objects based on their computed entropy values and respective assigned buckets.

Type: Application

Filed: April 20, 2015

Publication date: October 20, 2016

Inventors: Ioan Calapodescu, Caroline Privault, Jean-Michel Renders
COMPACT FUZZY PRIVATE MATCHING USING A FULLY-HOMOMORPHIC ENCRYPTION SCHEME

Publication number: 20160119119

Abstract: A method for data matching includes providing two sets of encrypted data elements by converting data elements to respective sets of vectors and encrypting each vector with a public key of a homomorphic encryption scheme. Each data element includes a sequence of characters drawn from an alphabet. For pairs of encrypted data elements, a comparison measure is computed between the sets of encrypted vectors. An obfuscated vector is generated for each encrypted data element in the first set, which renders the first encrypted data element indecipherable when the comparison measure does not meet a threshold for at least one of the pairs of data encrypted elements comprising that encrypted data element. The obfuscated vectors can be decrypted with a private key, allowing data elements in the first set to be deciphered if the comparison measure meets the threshold for at least one of the data elements in the second set.

Type: Application

Filed: May 15, 2014

Publication date: April 28, 2016

Applicant: Xeror Corporation

Inventors: Ioan Calapodescu, Saghar Estehghari, Johan Clier
Homomorphic cryptography modeling in support of privacy policies

Patent number: 9313022

Abstract: Data privacy is becoming increasingly important and, in some jurisdictions, required. Access to private data can be controlled by forcing all access to go through minimizations services that allow only authorized access to private data. These minimization services can become processing bottlenecks if the only way to modify private data is by way of requests to the minimization service. Certain homomorphic operations allow for encrypted data to be modified without being first decrypted although other operands must be encrypted. Augmenting a minimization service to provide a public encryption key provides for encryption of the other operands. Providing a records manager that can take advantage of homomorphic operations allows certain data operations to be performed without compromising security and without accessing the minimization service.

Type: Grant

Filed: March 23, 2015

Date of Patent: April 12, 2016

Assignee: Xerox Corporation

Inventors: Thierry Jacquin, Johan Clier, Ioan Calapodescu
HOMOMORPHIC CRYPTOGRAPHY MODELING IN SUPPORT OF PRIVACY POLICIES

Publication number: 20150195083

Abstract: Data privacy is becoming increasingly important and, in some jurisdictions, required. Access to private data can be controlled by forcing all access to go through minimizations services that allow only authorized access to private data. These minimization services can become processing bottlenecks if the only way to modify private data is by way of requests to the minimization service. Certain homomorphic operations allow for encrypted data to be modified without being first decrypted although other operands must be encrypted. Augmenting a minimization service to provide a public encryption key provides for encryption of the other operands. Providing a records manager that can take advantage of homomorphic operations allows certain data operations to be performed without compromising security and without accessing the minimization service.

Type: Application

Filed: March 23, 2015

Publication date: July 9, 2015

Inventors: Thierry Jacquin, Johan Clier, Ioan Calapodescu