Abstract: Methods, systems, and apparatuses, including computer programs encoded on computer-readable media, for tokenizing n-grams from a plurality of text units. A multi-dimensional array is created having a plurality of dimensions based upon the plurality of text units and the n-grams from the plurality of text units. The multi-dimensional array is normalized and its dimensionality is reduced. The reduced-dimensionality array is clustered to generate a plurality of clusters, each of which includes one or more of the plurality of text units.
Type:
Grant
Filed:
October 19, 2012
Date of Patent:
September 22, 2015
Assignee:
Networked Insights, LLC
Inventors:
Baoqiang Cao, T. Ryan Fitz-Gibbon, Lucas Forehand, Ryan McHale, Bradley Burke
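The pipeline summarized in the abstract above (n-gram tokenization, array construction, normalization, dimensionality reduction, clustering) can be illustrated with a minimal sketch. This is not the patented method: the abstract does not say which normalization, reduction, or clustering techniques are used, so the sketch assumes word bigrams, L2 row normalization, a simple keep-the-highest-variance-columns reduction (standing in for something like SVD or PCA), and plain k-means. All function names and parameters are hypothetical.

```python
from collections import Counter
import math


def ngrams(text, n=2):
    """Return the word n-grams (as tuples) of one text unit."""
    words = text.lower().split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def build_matrix(texts, n=2):
    """Rows are text units, columns are distinct n-grams, cells are counts."""
    counts = [Counter(ngrams(t, n)) for t in texts]
    vocab = sorted(set().union(*counts))
    return [[c[g] for g in vocab] for c in counts], vocab


def l2_normalize(matrix):
    """Scale each row to unit Euclidean length (assumed normalization)."""
    out = []
    for row in matrix:
        norm = math.sqrt(sum(v * v for v in row)) or 1.0
        out.append([v / norm for v in row])
    return out


def reduce_columns(matrix, k):
    """Keep the k highest-variance columns (a stand-in for SVD/PCA)."""
    n_rows = len(matrix)

    def variance(j):
        col = [row[j] for row in matrix]
        mean = sum(col) / n_rows
        return sum((v - mean) ** 2 for v in col) / n_rows

    ranked = sorted(range(len(matrix[0])), key=variance, reverse=True)
    keep = sorted(ranked[:k])
    return [[row[j] for j in keep] for row in matrix]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=20):
    """Plain k-means; the first k points seed the centroids (deterministic)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c, p=p: sq_dist(p, centroids[c]))
            for p in points
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_texts(texts, n=2, dims=4, k=2):
    """Run the whole sketch: tokenize, build, normalize, reduce, cluster."""
    matrix, _vocab = build_matrix(texts, n)
    reduced = reduce_columns(l2_normalize(matrix), dims)
    return kmeans(reduced, k)
```

Calling cluster_texts on a list of strings returns one cluster label per text unit, in input order. Seeding the centroids with the first k points keeps the sketch deterministic but is sensitive to input order; a real system would more likely use a randomized or k-means++ style initialization.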
Abstract: A system and a method of identifying information characterizing use of a website are provided. A plurality of user profiles is analyzed. Each user profile includes information associated with an interaction by a user with the website. A plurality of user comments associated with the website is analyzed. Characteristic information associated with use of the website is determined based on the analyzed user profiles and the analyzed user comments. The determined characteristic information is presented to a user.
Type:
Grant
Filed:
February 29, 2008
Date of Patent:
April 12, 2011
Assignee:
Networked Insights, LLC
Inventors:
Daniel Neely, Glenn Jenkins, Matthew Wulff, Michael Mitchell