Advertisement · 728 × 90
#
Hashtag
#webarchives
Advertisement · 728 × 90
Preview
Preserving The Web Is Not The Problem. Losing It Is. Recent reporting by Nieman Lab describes how some major news organizations—including The Guardian, The New York Times, and Reddit—are limiting or blocking access to their content in the Internet Ar…

Preserving The Web Is Not The Problem. Losing It Is. | Techdirt www.techdirt.com/2026/02/17/p... #archives #webarchives

0 0 0 0
But in recent months The New York Times began blocking the Archive from crawling its website, using technical measures that go beyond the web's traditional robots.txt rules. That risks cutting off a record that historians and journalists have relied on for decades. Other newspapers, including The Guardian, seem to be following suit. 

For nearly three decades, historians, journalists, and the public have relied on the Internet Archive to preserve news sites as they appeared online. Those archived pages are often the only reliable record of how stories were originally published. In many cases, articles get edited, changed, or removed—sometimes openly, sometimes not. The Internet Archive often becomes the only source for seeing those changes. When major publishers block the Archive’s crawlers, that historical record starts to disappear.

But in recent months The New York Times began blocking the Archive from crawling its website, using technical measures that go beyond the web's traditional robots.txt rules. That risks cutting off a record that historians and journalists have relied on for decades. Other newspapers, including The Guardian, seem to be following suit. For nearly three decades, historians, journalists, and the public have relied on the Internet Archive to preserve news sites as they appeared online. Those archived pages are often the only reliable record of how stories were originally published. In many cases, articles get edited, changed, or removed—sometimes openly, sometimes not. The Internet Archive often becomes the only source for seeing those changes. When major publishers block the Archive’s crawlers, that historical record starts to disappear.

Blocking the Internet Archive Won't Stop AI, But It Will Erase the Web's Historical Record www.eff.org/deeplinks/20... from @eff.org

#webarchives #webhistory #fairuse #openweb #techpolicy

3 1 0 1
Post image

"A Glimpse into How #AI Tools Can Enhance the Way We Study Web Archive Content: Challenges and Opportunities" (by Jhon G. Botello, via Web Science and Digital Libraries Research Group, ODU) ws-dl.blogspot.com/2026/03/2026... #archives #webarchives #webarchiving #GenAI #LLMs #libraries

3 2 0 0
Post image

Brewster Kahle (Internet Archive Founder and Director) is the Guest on the Latest Episode of ATG The Podcast www.infodocket.com/2026/03/10/i... @archive.org @chashub.bsky.social #libraries #oa #archives #webarchives

3 1 1 1
Post image

Report: The DOJ Has Been Taking Down Epstein Files. Here’s What Remains (via @cbsnews.com) www.cbsnews.com/projects/202... #archives #webarchives

0 0 0 0
Video

Feb 2026 information labs recap
🔗 www.linkedin.com/pulse/februa...
⇒ New #HumansOfAI Perspectives report from the AI lab
⇒ Reflections on #SocialMediaBans, #copyright, #WebArchives & EU #AI policy
#DigitalNetworksAct feedback open
⇒ New 3 THINGS series

0 0 0 0
Two of  one hundred college drawings by Cindy Rehm as part of The Seers, a project using images largely sourced from historic books at the Internet Archive. The right featues a feather, a seated woman's bare back, a statue of a nude woman, and the Moon, all arranged over the features of a cat's face. The left depicts a statue of a nude woman, a vase, shells, and a dress.

Two of one hundred college drawings by Cindy Rehm as part of The Seers, a project using images largely sourced from historic books at the Internet Archive. The right featues a feather, a seated woman's bare back, a statue of a nude woman, and the Moon, all arranged over the features of a cat's face. The left depicts a statue of a nude woman, a vase, shells, and a dress.

Post image

Enter a world of mystic cats & feminist histories 🐈✨

Internet Archive’s latest Artist in Residence, Cindy Rehm, brings the Archive’s collections to life in THE SEERS.

Learn more ⤵️
blog.archive.org/2026/02/19/t...

@internetarchive #ArtistInResidence #WebArchives #FeministArt #CollageArt

3 0 1 0
Promotional slide captioned “Ways of Working with the Wayback Machine, 9 February 2026,  Internet Archive HQ,” with Internet Archive Europe and Wayback Machine logos at the top. The slide includes a photo of about ten adults seated in a bright room with large windows and wooden floors, arranged in a casual circle discussion. To the right is a graphic of a magnifying glass over a document icon.

Promotional slide captioned “Ways of Working with the Wayback Machine, 9 February 2026, Internet Archive HQ,” with Internet Archive Europe and Wayback Machine logos at the top. The slide includes a photo of about ten adults seated in a bright room with large windows and wooden floors, arranged in a casual circle discussion. To the right is a graphic of a magnifying glass over a document icon.

🧵 Curious how researchers, journalists, and artists dig into the web’s past? 🌐

WAYS OF WORKING WITH THE WAYBACK MACHINE shows how the #WaybackMachine opens up a world of discovery and creativity.

Learn more ⤵️
www.internetarchive.eu/2026/02/18/w...

1️⃣/5️⃣

#WebArchives #DigitalResearch

164 43 3 4
Post image

Ways of Working With The Wayback Machine (via @internetarchive.eu)
www.internetarchive.eu/2026/02/18/w... #webarchives #archives @archive.org

6 2 0 0
Wayback Machine Director Pushes Back on AI Scraping Fears Driving Archive Blocks | Internet Archive Blogs

Wayback Machine Director Pushes Back on #AI Scraping Fears Driving Archive Blocks (via @archive.org) blog.archive.org/2026/02/18/w... #libraries #webarchives

2 1 0 0
The CIA's "World Factbook" has been around for over 60 years. In regularly updated editions, it summarizes basic information and statistical data for regions of the world and all countries. Initially, it was classified and intended for US government employees, but versions for the public have existed since the 1970s. The Factbook went online in 1994, making the data collection significantly older than Wikipedia, than Google, and than most of the internet offerings still accessible today – including heise online. It most recently included over 5000 public domain photos, many of them vacation pictures from CIA employees. According to the announcement, the Factbook will now be discontinued entirely, meaning the printed editions as well as the internet version.

The CIA's "World Factbook" has been around for over 60 years. In regularly updated editions, it summarizes basic information and statistical data for regions of the world and all countries. Initially, it was classified and intended for US government employees, but versions for the public have existed since the 1970s. The Factbook went online in 1994, making the data collection significantly older than Wikipedia, than Google, and than most of the internet offerings still accessible today – including heise online. It most recently included over 5000 public domain photos, many of them vacation pictures from CIA employees. According to the announcement, the Factbook will now be discontinued entirely, meaning the printed editions as well as the internet version.

"The World Factbook": CIA unexpectedly closes an internet veteran www.heise.de/en/news/The-...

No real explanation given www.cia.gov/stories/stor...

Wayback Machine's full-text search of archived pages web.archive.org/collection-s...

#openweb #webarchives #censorship

1 0 0 0
Post image

Internet Archive Adds Searchable Access to Archived Pages From the #CIA World Factbook www.infodocket.com/2026/02/08/i... #webarchives #govdocs #ciaworldfactbook @archive.org

9 5 0 0
Post image

The Wayback Machine is Looking to Make Deadlinks a Thing of the Past (via BetaNews) betanews.com/article/the-... #archives #webarchives #publishing #linkrot

1 1 0 0
Post image

Report: "The Long Now of the Web: Inside the Internet Archive’s Fight Against Forgetting" (via @hackernoon.com) hackernoon.com/the-long-now... #archives #webarchives @archive.org

4 4 0 0
Post image

Milestones: Web #Archiving at the Library After 25 Years (via @librarycongress.bsky.social) blogs.loc.gov/thesignal/20... #archives #webarchives #libraries

2 0 0 0
Post image

Fun Library Kiosk and Novel Web-Based Display of Millions of Web Pages (via @archive.org) blog.archive.org/2025/12/16/f... #webarchives #archives

3 1 0 0

She jumped into an impromptu discussion about connecting wider research communities to historical Internet data as part of the Internet Data Trust work. #InternetArchive #WebArchives #InternetData #Research #DataTrust #DigitalPreservation /end

4 0 0 0
Post image

Many thanks to @internetsociety.bsky.social for hosting a great workshop where @murchstudio.com shared lessons from @internetarchive.eu. #InternetArchive #WebArchives #InternetData #Research #DataTrust #DigitalPreservation /1

7 1 1 0

PS: If you do not have access to one of these documents, send us a message 🥸

#lostmedia #lostwave #research #webarchives #internetarchives

1 0 0 0
Post image

The Case For Preserving Scholarly Blogs (via @lseimpactblog.bsky.social) blogs.lse.ac.uk/impactofsoci... #scholcomm #libraries #webarchives #archives

1 0 0 0
Post image

Wonderful and Exciting News!!! Meet Merrilee Proffitt, Director of Democracy’s Library (via @internetarchive)blog.archive.org/2025/11/18/meet-merrilee... #archives #archives #webarchives #democracy

0 1 0 0
Post image

Wonderful and Exciting News!!! Meet Merrilee Proffitt, Director of Democracy’s Library (via @archive.org) blog.archive.org/2025/11/18/m... #archives #archives #webarchives #democracy

4 0 0 0
Post image

Inside the Old Church Where One Trillion Webpages are Being Saved (via @cnn.com) edition.cnn.com/2025/11/16/t... #archives #internetarchive #webarchives #libraries

0 0 0 0
Internet Archive: Digital Library of Free & Borrowable Texts, Movies, Music & Wayback Machine

archive.org down?

(weirdly whenever I see it may be down I think... okay, this might just be the beginning of the end...)

#Archives #WebArchives #digipres #IsThisTheEnd

0 0 0 0
Post image

One Trillion Web Pages Archived: Internet Archive Celebrates a Civilization-Scale Milestone (via @archive.org) blog.archive.org/2025/10/31/o... & 'The Truth is Paywalled.' Internet Vets Lament the State of the 'Open' Web (via @pcmag.com) www.pcmag.com/news/the-tru... #web #webarchives

2 0 0 0
Post image

Federal Agencies Post Incendiary Banners About the Government Shutdown envirodatagov.org/federal-agencies-post-in... @internetarchive #usgov #webarchives

0 0 0 0
Post image

Federal Agencies Post Incendiary Banners About the Government Shutdown envirodatagov.org/federal-agen... @envirodgi.bsky.social @archive.org #usgov #webarchives

2 0 0 0
Post image

Milestones: Internet Archive Celebrates One Trillion Web Pages Preserved www.infodocket.com/2025/10/20/m... #webarchives #archives #digipres #libraries
@brewster.kahle.org @archive.org

1 0 0 0
Post image

Milestones: Internet Archive Celebrates One Trillion Web Pages Preserved www.infodocket.com/2025/10/20/milestones-in... #webarchives #archives #digipres #libraries […]

[Original post on newsie.social]

0 0 0 0
Post image

‼️ ATTENTION ‼️

⏰ Proposals for #iipcWAC26, "Sustainable #WebArchiving," in Brussels are due in 1 WEEK (15 OCT): netpreserve.org/ga2026/CfP

First-time submitters encouraged! Need inspiration?
www.youtube.com/@iipc8855/fe...

#WebArchives #WebArchiveWednesday #DigitalPreservation #DigitalHumanities

0 2 0 0