Web archiving - Claude skills for journalism

When to use

Service	Best for	API	Deletions
Wayback Machine	Historical research	Yes (free)	On request
Archive.today	Paywall bypass, quick saves	No	Never
Perma.cc	Legal citations	Yes (free tier)	By creator
ArchiveBox	Self-hosted, privacy	Local	Never
Conifer	Interactive content	Yes	By creator

Python code for checking availability, saving pages, and retrieving historical snapshots via CDX API.

Archive to Wayback, Archive.today, and Perma.cc simultaneously for maximum preservation.

Chain of custody documentation, content hashing, and timestamped capture records.

Self-hosted archiving setup, Python integration, and scheduled archiving workflows.

Try services in this order for maximum coverage:

916B+ pages, historical depth, API access

On-demand snapshots, paywall bypass

Recent pages, search: cache:url

Click dropdown arrow in search results

Searches multiple archives simultaneously

# Clone the repository

git clone https://github.com/jamditis/claude-skills-journalism.git

# Copy the skill to your Claude config

cp -r claude-skills-journalism/web-archiving ~/.claude/skills/

Or download just this skill from the GitHub repository.

Source verification Web scraping Digital archive

Wayback Machine API, multi-archive redundancy, and legal evidence preservation in one skill.