#

web-archive

Here are 11 public repositories matching this topic...

devanshbatham / ArchiveFuzz

Hunt down the secrets from the WebArchives for Fun and Profit

osint bughunting security-tools web-archive subdomain-scanner subdomain-enumeration email-enumeration

Updated Dec 8, 2022
Python

hoardy-web

Own-Data-Privateer / hoardy-web

Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.

cli backups internet archiving snapshot self-hosted archive browser-extension archiver web-archiving wayback-machine web-browsing web-archive website-archive auto-save offline-reading internet-archiving

Updated Jul 25, 2025
Python

cdx-summary

internetarchive / cdx-summary

Summarize web archive capture index (CDX) files.

nodejs python statistics collection webcomponents archive report summary warc cdx web-archive

Updated Jul 29, 2022
Python

TarekJor / bookmark-archiver

🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...

Updated Aug 12, 2018
Python

anjackson / sliver

A tool for collection archival slivers of the web and web archives

web-archiving web-archives web-archive

Updated Feb 18, 2025
Python

MementoMap

oduwsdl / MementoMap

A Tool to Summarize Web Archive Holdings

python memento profiling web-archive mementomap ukvs

Updated Jun 15, 2021
Python

thiagolopes / alexandria

Backup and save websites

web-archive

Updated Jul 30, 2025
Python

ibnesayeed / utils

Miscellaneous utility scripts

python linux shell utilities scripts archiving hacktoberfest web-archive

Updated Apr 9, 2025
Python

india-ultimate / the-huddle

A mirror of The Huddle magazine

static-site web-archive ultimate-frisbee

Updated Aug 10, 2022
Python

ArtificialOSS / WebCrawl

Crawls the web to generate a huge dataset for training

crawler ai artificial-intelligence dataset-generation commoncrawl web-archive

Updated Jan 24, 2024
Python

KaineRecycler / YouTube-Content-Archive

YouTube Content Archive Database

youtube youtube-api web-archive

Updated Mar 18, 2024
Python

Improve this page

Add a description, image, and links to the web-archive topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the web-archive topic, visit your repo's landing page and select "manage topics."