Abstract: Provided is a method including obtaining a prompt, determining a prompt embedding vector representing the prompt in an embedding space, modifying the prompt embedding vector using a trained model configured to adjust prompt embedding vectors to decrease proximity to vectors of blocks in a data set from which data is retrieved to augment generation by the generative AI model, determining that the modified prompt embedding vector is within a threshold distance to vectors in the embedding space corresponding to one or more blocks in the data set, selecting the one or more blocks in the data set, generating a response using the generative AI model based on the selected one or more blocks in the data set, quantifying an amount of influence of the respective block on corresponding text in the generated response, and providing the response and a representation of the quantified amount of influence as an output.