Abstract: Disclosed herein are systems, devices, and methods for contextualizing media. In some variations, a method of organizing audio may comprise generating first graph data nodes from structured text data comprising a predetermined audio data model and generating second graph data nodes from unstructured data. The first and second graph data nodes may be associated with the audio. The one or more first graph data nodes may be linked to the one or more corresponding second graph data nodes using a natural language processing model.