Advertisement · 728 × 90
#
Hashtag
#warc2corpus
Advertisement · 728 × 90
Post image Post image Post image

On our way to Tartu, Estonia, to present the #warc2corpus pipeline! Look forward to discuss how web archive collections can be turned into corpus data for distant reading.
#DHNB2025 #DigitalHumanities
#OpenScience #WebArchiving
#ComputationalSocialSciences
@dhnb.bsky.social @netpreserve.bsky.social

8 3 1 0