Approaching language variation through corpora