Abstract: A system for data entry by associating structured textual context to images comprising a computer apparatus having a display device to facilitate interaction with a user. The system has electronics, data, tokens, grammatical rules, and an interface. The data is comprised of records, images, and template images, the template images having hotspots and sub-template images. The hotspots have a selection, and the sub-template images have sub-template hotspots, with the sub-template hotspots having a selection. The selections are preferably associated with diagnostic images, of which there are both general and template-specific.