Seol mar théacs é seo: Approaching language variation through corpora