Abstract: Embodiments of the present invention provide methods and apparatus for transcoding received text fragments and documents. A featurization configuration is produced to create token components for evaluating the content of each text fragment. Other embodiments may be described and claimed.
Abstract: Embodiments of the present invention provide methods and apparatus for determining languages of documents, including text messages and text fragments, generally sent via wireless communication devices. Other embodiments may be described and claimed.
Abstract: Embodiments of the present invention provide methods and apparatuses adapted to generate contextualized tokens to facilitate classification of text fragments.
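As an illustration only, one way a "contextualized token" might be formed is by attaching each token's immediate neighbors to it, so that a downstream classifier sees the token together with its surroundings. The abstract does not describe the actual method; the function name `contextualize`, the `window` and `pad` parameters, and the bracketed output format below are all assumptions made for this sketch.

```python
def contextualize(tokens, window=1, pad="<pad>"):
    """Return one string per token, combining it with its neighbors.

    Hypothetical sketch: the source does not specify how context is
    attached to a token. Here the focus token is wrapped in brackets
    and joined with `window` neighbors on each side.
    """
    # Pad both ends so edge tokens still get a full-width context.
    padded = [pad] * window + list(tokens) + [pad] * window
    contextualized = []
    for i in range(window, len(padded) - window):
        left = padded[i - window:i]
        right = padded[i + 1:i + 1 + window]
        contextualized.append("|".join(left + ["[" + padded[i] + "]"] + right))
    return contextualized


if __name__ == "__main__":
    # A short text fragment, already split on whitespace.
    for item in contextualize("see you soon".split()):
        print(item)
```

Each resulting string could then serve as a feature for a text-fragment classifier; a wider `window` captures more context at the cost of sparser features.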