Text this: Statistical analysis of massive data streams