Text variability measures in corpus design for Setswana lexicography