The Web Archives for Historical Research (WAHR) group has the goal of linking history and big data to give historians the tools required to find and interpret digital sources from web archives.
Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

1 to 10 of 26 Results
Jan 10, 2022
Ruest, Nick, 2022, "#healthcanada #NACI #fordnation #medicalfreedom #covid19 #covid19vaccines #protectourfamilies #protectyourchildren #holdtheline tweets", https://doi.org/10.5683/SP3/QFISO4, Borealis, V1, UNF:6:sy3ljwP4IHuYHlWUJwlNzA== [fileUNF]
2,661,117 tweet ids for #healthcanada #NACI #fordnation #medicalfreedom #covid19 #covid19vaccines #protectourfamilies #protectyourchildren #holdtheline tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. tw...
Nov 8, 2021
Ruest, Nick, 2021, "#elxn44 tweets (44th Canadian Federal Election)", https://doi.org/10.5683/SP3/UY0YJ5, Borealis, V1, UNF:6:LjbT/ksxbwx+JBpiO6cJIA== [fileUNF]
2,075,645 tweet ids for #elxn44 tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate elxn44-tweet-ids.txt > elxn44.jsonl. Tweets were collected via the Standard Search API on a cron job every f...
Jan 23, 2021
Ruest, Nick, 2017, "Tweets to Donald Trump (@realDonaldTrump)", https://doi.org/10.5683/SP/8BAVQM, Borealis, V10
362,464,578 tweet ids for tweets directed at Donald Trump (@realDonaldTrump), collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate to_realdonaldtrump_20210120_ids.txt > to_realdonaldtrump_20210120.json...
Jan 15, 2021
Ruest, Nick, 2020, "Tyendinaga tweet ids", https://doi.org/10.5683/SP2/FQR2CK, Borealis, V2
80,264 tweet ids for Tyendinaga tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate tyendinaga-20210115-ids.txt > tyendinaga.jsonl. Tweets were collected via the Standard Search API on a cron...
Jan 15, 2021
Ruest, Nick, 2020, "Wet'suwet'en tweet ids", https://doi.org/10.5683/SP2/C0KFTF, Borealis, V2
425,227 tweet ids for Wet'suwet'en tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate wetsuweten-20210115-ids.txt > wetsuweten.jsonl Tweets were collected via the Standard Search API on a cro...
Apr 28, 2020
Ruest, Nick; Wilk, Jocelyn; Thurman, Alex, 2020, "University Archives web archive collection derivatives", https://doi.org/10.5683/SP2/FONRZU, Borealis, V1
Web archive derivatives of the University Archives collection from Columbia University Libraries. The derivatives were created with the Archives Unleashed Toolkit and Archives Unleashed Cloud. The cul-1914-parquet.tar.gz derivatives are in the Apache Parquet format, which is a co...
Feb 16, 2020
Ruest, Nick; Sala, Christine; Thurman, Alex, 2020, "Avery Library Historic Preservation and Urban Planning web archive collection derivatives", https://doi.org/10.5683/SP2/Z68EVJ, Borealis, V1
Web archive derivatives of the Avery Library Historic Preservation and Urban Planning collection from Columbia University Libraries. The derivatives were created with the Archives Unleashed Toolkit and Archives Unleashed Cloud. The cul-1757-parquet.tar.gz derivatives are in the A...
Nov 23, 2019
Ruest, Nick, 2019, "#elxn43 tweets (43rd Canadian Federal Election)", https://doi.org/10.5683/SP2/QAMPPI, Borealis, V1
2,944,525 tweet ids for #elxn43 tweets, collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate elxn43-ids.txt > elxn43.jsonl. Tweets were collected via the Standard Search API on a cron job every five da...
Dec 10, 2017
Ruest, Nick, 2017, "#JeffSessions tweets", https://doi.org/10.5683/SP/MQ3Y99, Borealis, V1
2,278,757 tweet ids for #JeffSessions collected with Documenting the Now's twarc. Tweets can be “rehydrated” with Documenting the Now’s twarc, or Hydrator. twarc hydrate to_realdonaldtrump_ids.txt > to_donaltrump.jsonl.
Nov 26, 2017
Ruest, Nick, 2017, "#paradisepapers tweets", https://doi.org/10.5683/SP/D7R2WB, Borealis, V1
1,797,260 tweet ids for #paradisepapers collected with Documenting the Now's twarc from November 5-26, 2017. Tweets can be “rehydrated” with Documenting the Now’s twarc (https://github.com/DocNow/twarc). twarc.py hydrate paradisepapers_ids.txt > paradisepapers.json. Or with Docum...
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.