The data contains time series of volume (the number of mentions per hour) for 1,000 Memetracker phrases and 1,000 Twitter hashtags. The Memetracker phrases are the 1,000 phrases with the highest total volume among 343 million phrases collected from Sep 2008 to Aug 2009. The Twitter hashtags are the 1,000 hashtags with the highest total volume among 6 million hashtags collected from Jun to Dec 2009.
Dataset statistic (per data set) | Value |
---|---|
Number of time series | 1,000 |
Length of time series | 128 |
Time unit | 1 hour |
File | Description |
---|---|
MemePhr.txt | Memetracker phrases |
TwtHtag.txt | Twitter hashtags |
Each item (a phrase or a hashtag) occupies two lines. The first line holds the meta-information about the item, and the second line lists the values of its time series, separated by tabs.
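The two-line layout described above can be read with a short script. The following is a minimal sketch in Python, not an official loader: the function names `read_volume_series` and `load_volume_file` are hypothetical, the meta line is kept as a raw string because its internal layout is not assumed here, blank lines are assumed to be ignorable, and UTF-8 encoding is an assumption.

```python
from typing import Iterable, Iterator, List, Tuple


def read_volume_series(lines: Iterable[str]) -> Iterator[Tuple[str, List[float]]]:
    """Yield (meta, values) pairs from lines laid out two per item."""
    # Strip line endings and skip blank lines (assumption: blank lines carry no data).
    rows = [ln.rstrip("\r\n") for ln in lines if ln.strip()]
    if len(rows) % 2 != 0:
        raise ValueError("expected an even number of non-empty lines")
    # Even-indexed rows are meta lines, odd-indexed rows are the series.
    for meta, series in zip(rows[0::2], rows[1::2]):
        # The meta line is returned unparsed; values are tab-separated numbers.
        values = [float(v) for v in series.split("\t") if v.strip()]
        yield meta, values


def load_volume_file(path: str) -> List[Tuple[str, List[float]]]:
    """Read a whole file such as MemePhr.txt or TwtHtag.txt into memory."""
    # Assumption: the file is UTF-8 text.
    with open(path, encoding="utf-8") as fh:
        return list(read_volume_series(fh))
```

Calling `load_volume_file("MemePhr.txt")` would return one `(meta, values)` pair per item. Given the statistics table, a reasonable sanity check is that the result has 1,000 items and that every `values` list has 128 entries.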
Example:
Description