SMS dit: Approaching language variation through corpora