Multilingual corpora and multilingual corpus analysis