Venue: Your Desktop
Social media platforms are a rich source for data. The University of Michigan collects data with the Twitter Decahose, maintaining an archive of 10% of tweets made from the past decade. This collection is maintained in collaboration by MIDAS, CSCAR and ARC.
This workshop covers what the Twitter Decahose is, the process to obtain access, and details on the data format and metadata included, with live examples to process a sample into a filtered set using Python and PySpark.
More information on research datasets from MIDAS can be found at: https://midas.umich.edu/research-datasets/
Please register at least 48 hours in advance.