Text this: Building bridges for multimodal research :