Corpus data across languages and disciplines