Abstract: Disclosed herein are a method and a system for producing, on demand, a term association vector space for a client from a document set in electronic form. The method extracts terms from the document set, strips out words that do not convey meaning, and adds to the terms those phrases that are important within the context of the document set. Associations between terms are calculated, subjected to further analytical processes, and collected in a matrix whose rows are the vectors defining the vector space. Additional associational data can be added by matrix arithmetic, and documents can be rendered as further vectors in the space.
Type:
Application
Filed:
March 15, 2013
Publication date:
September 18, 2014
Applicant:
LUMINOSO TECHNOLOGIES, INC.
Inventors:
Robert Speer, Lance Nathan, Jason Alonso, Catherine Havasi, Kenneth Arnold
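The pipeline outlined in the abstract above (extract terms, drop words that carry no meaning, collect term associations in a matrix, then place documents in the same space) can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration and is not taken from the application: the stopword list, the use of document-level co-occurrence counts as the association measure, the summing of term rows to form a document vector, and all function names. Phrase detection and the "further analytical processes" are omitted.

```python
import re
from itertools import combinations

# Hypothetical stopword list; the application does not enumerate one.
STOPWORDS = {"the", "a", "an", "of", "and", "is", "to", "in", "on"}


def extract_terms(text):
    """Lowercase the text, split it into words, and drop stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def build_association_matrix(docs):
    """Return (vocab, matrix); matrix[i][j] counts the documents in which
    terms i and j both appear. Each row is one term's vector.
    Co-occurrence counting is an assumed stand-in for the association measure."""
    term_lists = [extract_terms(d) for d in docs]
    vocab = sorted({t for terms in term_lists for t in terms})
    index = {t: i for i, t in enumerate(vocab)}
    matrix = [[0.0] * len(vocab) for _ in vocab]
    for terms in term_lists:
        for a, b in combinations(sorted(set(terms)), 2):
            matrix[index[a]][index[b]] += 1.0
            matrix[index[b]][index[a]] += 1.0
    return vocab, matrix


def add_matrices(m1, m2):
    """Fold in extra associational data by element-wise matrix addition.
    Both matrices must share the same vocabulary ordering."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(m1, m2)]


def document_vector(text, vocab, matrix):
    """Render a document as a vector in the same space by summing the
    rows of the terms it contains (an assumed, simple choice)."""
    index = {t: i for i, t in enumerate(vocab)}
    vec = [0.0] * len(vocab)
    for term in extract_terms(text):
        if term in index:
            vec = [v + r for v, r in zip(vec, matrix[index[term]])]
    return vec
```

For example, calling `build_association_matrix(["The cat sat on the mat", "The cat chased a mouse"])` yields a five-term vocabulary in which the row for "cat" is associated with every other term, since "cat" is the only term shared by both documents.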
Abstract: A system and related method are disclosed for rendering a set of words linked to an n-dimensional vector space as a word cloud drawn from a two-dimensional projection of that space. The user can click and drag a word; the subspace and the projection onto it then shift so that the word lands where the user dragged it, and the other words in the cloud shift correspondingly, offering the user new insights. The importance of a word in a document set is represented by its size, and relatedness between words is shown by color similarity.
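The drag interaction described in the abstract above can be sketched as an update to the two projection axes. The update rule below is an assumption chosen for simplicity, not the method claimed in the application: each axis receives the smallest change (in the least-squares sense) that makes the dragged word project to the requested coordinate. The function names are hypothetical, and word sizing and coloring are left out.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def project(axes, vec):
    """Project an n-dimensional vector onto the two axes, giving (x, y)."""
    x_axis, y_axis = axes
    return (dot(x_axis, vec), dot(y_axis, vec))


def drag(axes, vec, target):
    """Return new axes under which `vec` projects exactly to `target`.

    Assumed rule: add to each axis a multiple of `vec` just large enough
    to close the gap between the current and requested coordinate. The
    resulting axes are not re-orthonormalized in this sketch.
    """
    norm_sq = dot(vec, vec)
    if norm_sq == 0:
        return axes  # a zero vector always projects to the origin
    new_axes = []
    for axis, goal in zip(axes, target):
        shift = (goal - dot(axis, vec)) / norm_sq
        new_axes.append([a + shift * v for a, v in zip(axis, vec)])
    return tuple(new_axes)


# Two hypothetical word vectors in a 3-dimensional space.
words = {"coffee": [1.0, 0.0, 1.0], "tea": [0.0, 1.0, 1.0]}
axes = ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0])

# Drag "coffee" from (1, 0) to (0, 2); "tea" moves too, because it
# shares a component with "coffee" along the third dimension.
new_axes = drag(axes, words["coffee"], (0.0, 2.0))
```

Because the axes themselves change, every word whose vector overlaps the dragged word's vector moves as well, which is the behavior the abstract describes for the rest of the cloud.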