FilesJanuary 12, 2022 Dataset Open Access
Dataset: "I can't keep it up anymore." The Voat.co dataset
Mekacher, Amin; Papasavva, Antonis
This is the dataset released with the paper titled: "I can't keep it up anymore." The Voat.co dataset.
The dataset consists of 15,133 Newline delimited JSON files (ndjson). More specifically, 7,616 files for submission data, 7,515 for comment data, 1 for user data, and 1 for subverse data. Each line in the ndjson files consists of a JSON object. The JSON objects contain all the key/values we collect through the Voat API and the custom parser of the Internet Archive Wayback Machine Voat snapshot release.
For the detailed description of every key in the JSON structure, along with the type of the value, please read the readme.pdf file provided with this dataset.
If you find our dataset useful, please cite our paper:
Code: Select all
@article{mekacher2022can, title={" I can't keep it up anymore." The Voat. co dataset}, author={Mekacher, Amin and Papasavva, Antonis}, journal={arXiv preprint arXiv:2201.05933}, year={2022} }
Readme.pdf
voat_dataset.zip (2.2 GB)
Source
viewtopic.php?f=50&t=6589
This joke is gonna get me banned.