Abstract: The invention relates to a method for clustering the nodes of a network in which the edges between nodes are associated with text data, such as messages. The method comprises an initialization step, in which an initial clustering of the nodes is determined, and a step of iterative inference of a generative model of text documents. Edges are modeled with a Stochastic Block Model (SBM), and the sets of documents between and within clusters are modeled according to a generative model of documents. The inference step comprises iteratively modelling the text documents and the underlying topics of their textual content, and updating the clustering as a function of that modelling, until a convergence criterion is fulfilled; an optimized clustering and the corresponding optimized values of the model parameters are then output.
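The generative side of the model described in the abstract can be illustrated with a minimal sketch. It is not the claimed inference procedure. It only samples a small network under stated assumptions: edges are directed, an edge from node i to node j exists with a probability that depends only on the two nodes' clusters (the SBM part), and each edge carries one document whose words are drawn from a topic mixture specific to that cluster pair. The function name `sample_text_network`, the parameter names `rho`, `pi`, `theta`, `beta`, and all numeric values are hypothetical and chosen for illustration.

```python
import random

def sample_text_network(n_nodes, rho, pi, theta, beta, doc_len=10, seed=0):
    """Sample clusters, a directed adjacency matrix, and one document per edge.

    rho:   cluster proportions, length Q
    pi:    Q x Q edge probabilities between clusters (SBM part)
    theta: Q x Q x K topic proportions for each ordered cluster pair
    beta:  K x V word distribution of each topic
    All names and the directed-edge choice are assumptions of this sketch.
    """
    rng = random.Random(seed)
    n_clusters, n_topics, vocab = len(rho), len(beta), len(beta[0])
    # Draw a cluster label for every node.
    clusters = rng.choices(range(n_clusters), weights=rho, k=n_nodes)
    adjacency = [[0] * n_nodes for _ in range(n_nodes)]
    documents = {}
    for i in range(n_nodes):
        for j in range(n_nodes):
            if i == j:
                continue  # no self-loops
            q, r = clusters[i], clusters[j]
            # SBM: the edge probability depends only on the cluster pair.
            if rng.random() < pi[q][r]:
                adjacency[i][j] = 1
                # One topic per word, drawn from the pair's topic mixture.
                topics = rng.choices(range(n_topics), weights=theta[q][r], k=doc_len)
                # Each word is drawn from the vocabulary distribution of its topic.
                documents[(i, j)] = [
                    rng.choices(range(vocab), weights=beta[z])[0] for z in topics
                ]
    return clusters, adjacency, documents

# Two clusters, two topics, four-word vocabulary (illustrative values only).
rho = [0.5, 0.5]
pi = [[0.8, 0.1], [0.1, 0.8]]
theta = [[[0.9, 0.1], [0.5, 0.5]], [[0.5, 0.5], [0.1, 0.9]]]
beta = [[0.45, 0.45, 0.05, 0.05], [0.05, 0.05, 0.45, 0.45]]
clusters, adjacency, documents = sample_text_network(12, rho, pi, theta, beta)
```

In this sketch the inference step of the abstract would run in the opposite direction: given only `adjacency` and `documents`, it would alternate between estimating the topic parameters and reassigning nodes to clusters until the clustering stops changing.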
Type: Application
Filed: April 6, 2017
Publication date: October 11, 2018
Applicants: Universite Paris Descartes, Universite Paris 1 Pantheon-Sorbonne, Centre National de la Recherche Scientifique (CNRS)