Seol mar théacs é seo: Advances in corpus-based contrastive linguistics