Anyone interested in govt transparency and public access should check out GovScape from @bcgl.bsky.social and his team👇
It's an incredibly powerful tool that allows visual, semantic text, and keywords search of 10 million U.S. government PDFs (70 million pages!) and counting: www.govscape.net
Posts by Kyle Deeds
New Research Tool: GovScape (US Gov PDFs)
govscape.net ||| Research Paper (preprint) About #GovScape arxiv.org/abs/2511.11010 #govdocs @eotarchive.org
We’re live! Search 10 million+ U.S. government PDFs (70 million pages)! GovScape offers visual search, semantic text search, and keyword search. Explore below:
Website: govscape.net
ArXiv link: arxiv.org/abs/2511.11010
Huge step forward in enabling access and use of content archived from government websites!
My favorite use of govscape.net is to search "[INSERT COMPANY] lawsuit". See: the time that Skippy sued customs to demand that their fake peanut butter be recognized as "peanut butter or peanut slurry".
govscape.net/preview/KBVJ...
@bcgl.bsky.social @govscape.bsky.social
Link rot on the government web is actually one of the things we’re hoping to look into with this dataset! But, if you just want the PDF, you can use the “download PDF” button to grab it from the archive instead
1/ Announcing GovScape – a public search system for 10 million U.S. government PDFs (70 million pages)! GovScape offers visual search, semantic text search, and keyword search. Explore below:
Website: www.govscape.net
ArXiv link: arxiv.org/abs/2511.11010