Abstract: A system and method are provided for generating a translingual parsing model and for using the model to machine translate a source language into a destination language. The system and method include receiving a sentence in the source language and, with the translingual parsing model, searching among and statistically ranking candidate parses, each having elements labeled with destination language words, syntactic labels, and role labels indicating relationships between the elements. A statistically high-ranked parse is selected and rearranged, using the syntactic and role labels, in accordance with the word order conventions of the destination language to generate a translingual parse of the source language sentence.
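The select-and-rearrange step described in this abstract can be illustrated with a minimal sketch. It assumes the candidate parses have already been scored by some model; the `Node` class, the `ORDER` table, and the role names are hypothetical and are not taken from the abstract.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    syn: str                 # syntactic label, e.g. "S", "NP", "V"
    role: str                # role label relative to the parent element
    word: str = ""           # destination-language word (leaf elements)
    children: list = field(default_factory=list)

# Hypothetical word order convention of the destination language:
# for each syntactic label, the preferred order of child roles.
ORDER = {"S": ["subject", "head", "object"]}

def rearrange(node):
    """Return a copy of the parse with children reordered by role."""
    kids = [rearrange(c) for c in node.children]
    order = ORDER.get(node.syn)
    if order:
        # Stable sort: roles not listed keep their relative order at the end.
        kids.sort(key=lambda c: order.index(c.role) if c.role in order else len(order))
    return Node(node.syn, node.role, node.word, kids)

def leaves(node):
    """Collect destination-language words in left-to-right order."""
    if not node.children:
        return [node.word]
    return [w for c in node.children for w in leaves(c)]

def translate(candidates):
    """candidates: list of (score, parse); pick the top-scored parse and reorder it."""
    _, best = max(candidates, key=lambda pair: pair[0])
    return " ".join(leaves(rearrange(best)))

# Source sentence in subject-object-verb order, labeled with English words.
good = Node("S", "root", children=[
    Node("NP", "subject", "John"),
    Node("NP", "object", "Mary"),
    Node("V", "head", "sees"),
])
poor = Node("S", "root", children=[
    Node("NP", "object", "John"),
    Node("NP", "subject", "Mary"),
    Node("V", "head", "sees"),
])
result = translate([(0.9, good), (0.1, poor)])
```

Here the scores 0.9 and 0.1 stand in for the statistical ranking; how the model searches for and scores candidate parses is outside the scope of this sketch.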
Abstract: Computer-based systems and methods enable analysts to manage and explore the information that hard drives and other storage devices or data sources may contain, to extract forensic features, and to perform cross-drive analysis.
Abstract: An automated system and method are provided for debugging training data used to train an automated language identifier. The system and method collect texts written in a particular language, generate an occurrence count for the words in each text by counting the number of times each word is found within the text, and generate an occurrence ratio (OR) for each word by dividing its occurrence count by the total number of words in the text. Words are then filtered out of any text in which their occurrence ratios are substantially higher than their occurrence ratios in at least one of the other texts, to generate a clean text.
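The occurrence-ratio filter described in this abstract can be sketched as follows. The abstract does not say what "substantially higher" means, so the `factor` multiplier and the `floor` applied to words missing from another text are assumptions made for illustration, as are the function names.

```python
from collections import Counter

def occurrence_ratios(text):
    """Map each word to (occurrence count / total words in the text)."""
    words = text.lower().split()
    total = len(words)
    return {w: c / total for w, c in Counter(words).items()}

def filter_overrepresented(texts, factor=3.0, floor=0.05):
    """Drop words whose OR in a text exceeds factor times their OR in
    at least one other text. `factor` and `floor` are assumed values."""
    ratios = [occurrence_ratios(t) for t in texts]
    cleaned = []
    for i, text in enumerate(texts):
        kept = []
        for word in text.split():
            key = word.lower()
            own = ratios[i][key]
            # OR of the same word in every other text (0.0 if absent).
            others = (r.get(key, 0.0) for j, r in enumerate(ratios) if j != i)
            # The floor keeps a word from being dropped merely because
            # it is absent from another text.
            if any(own > factor * max(o, floor) for o in others):
                continue
            kept.append(word)
        cleaned.append(" ".join(kept))
    return cleaned

t1 = "the cat sat on the mat acme acme acme acme"
t2 = "the dog sat on the rug near the cat acme"
cleaned = filter_overrepresented([t1, t2])
```

In this toy input, "acme" makes up 40% of the first text but only 10% of the second, so it is removed from the first text and kept in the second.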
Abstract: A system and method for natural language processing comprise a blackboard data structure providing a shared knowledge repository over which a collection of natural language agents can execute processes on a processable data form, each agent being capable of providing a processing resource usable for serving requests to execute a natural language process on the processable data form, and of determining, based on its capabilities and an examination of the blackboard, which requests for processing it can best serve; and a dispatcher for coordinating the work of registered agents, maintaining a high-level description of the tasks to be completed to solve a given natural language engineering problem, and determining which registered agents best provide a solution to that problem.
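The blackboard, agent, and dispatcher roles described in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions: the class names, the numeric `bid` mechanism for deciding which agent best serves a request, and the two toy agents are all hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class Blackboard:
    data: dict = field(default_factory=dict)  # shared knowledge repository

class Agent:
    name = "agent"
    def bid(self, task, board):
        """Return how well this agent can serve the task (0.0 = cannot)."""
        return 0.0
    def run(self, task, board):
        raise NotImplementedError

class Tokenizer(Agent):
    name = "tokenizer"
    def bid(self, task, board):
        return 1.0 if task == "tokenize" and "text" in board.data else 0.0
    def run(self, task, board):
        board.data["tokens"] = board.data["text"].split()

class TokenCounter(Agent):
    name = "counter"
    def bid(self, task, board):
        # Can only serve the request once tokens are on the blackboard.
        return 1.0 if task == "count" and "tokens" in board.data else 0.0
    def run(self, task, board):
        board.data["token_count"] = len(board.data["tokens"])

class Dispatcher:
    def __init__(self, board):
        self.board = board
        self.agents = []
    def register(self, agent):
        self.agents.append(agent)
    def solve(self, tasks):
        """Work through the task list, giving each task to the best bidder."""
        log = []
        for task in tasks:
            best = max(self.agents, key=lambda a: a.bid(task, self.board), default=None)
            if best is None or best.bid(task, self.board) <= 0.0:
                raise RuntimeError(f"no registered agent can serve {task!r}")
            best.run(task, self.board)
            log.append((task, best.name))
        return log

board = Blackboard({"text": "a b c"})
dispatcher = Dispatcher(board)
dispatcher.register(TokenCounter())
dispatcher.register(Tokenizer())
log = dispatcher.solve(["tokenize", "count"])
```

The task list passed to `solve` plays the part of the high-level task description, and each agent inspects the blackboard before bidding, so registration order does not affect which agent is chosen.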