Spell this: Approaching language variation through corpora