Data Rescue 2025

Tools

Information about tools needed for tasks or other important digital preservation work


Last updated 4 months ago

For Authenticity and Verification

  • Making signed BagIt files: https://github.com/harvard-lil/bag-nabit

  • Make Bags: https://github.com/WeAreAVP/fixity or https://github.com/LibraryOfCongress/bagger

  • Create checksums: https://corz.org/windows/software/checksum/

Metadata Creation and Description

  • Analyze file & produce basic metadata: https://coptr.digipres.org/index.php/NARA_File_Analyzer_and_Metadata_Harvester

  • Index web archive files:

For Data and Web Archive Capturing and Harvesting

  • Conifer tool (website interactions): https://conifer.rhizome.org/_faq

  • Browser extension web crawl (single page): https://warcreate.com/

  • Browser extension to add webpage to Internet Archive: https://web.archive.org/

  • Copy websites (HTTrack): http://www.httrack.com/

  • Crawl website (Heritrix): https://sourceforge.net/projects/heritrix.mirror/

  • Capture backend of websites: https://deeparc.sourceforge.net/

For Website Monitoring and Assessments

  • Estimate website size: https://github.com/izkreny/website-size

  • Monitor websites in bulk (thousands): https://github.com/edgi-govdata-archiving/web-monitoring

  • Monitor websites (single or small batch): https://distill.io/

  • Detect website changes: https://github.com/openpreserve/pagelyzer

  • Assess websites (note differences in stories): https://github.com/DocNow/diffengine

  • Assess websites (compare two pages): http://pagelyzer.openpreservation.org/

General Lists of Digital Preservation Tools

Community Owned Digital Preservation Tool Registry (COPTR): https://www.digipres.org/tools/by-function/#createorreceive(acquire):webcrawl
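
As an illustration of the "Create checksums" task listed under For Authenticity and Verification, here is a minimal Python sketch that records a SHA-256 checksum for every file in a folder and later reports which files have changed or gone missing. It uses only the Python standard library. The function names (sha256_of, build_manifest, verify_manifest) are hypothetical and are not part of any tool listed on this page, and the result is a plain path-to-checksum mapping, not a complete BagIt bag.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file path (relative to root, POSIX style) to its digest."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def verify_manifest(root: Path, manifest: dict[str, str]) -> list[str]:
    """Return relative paths that are missing or whose digest has changed."""
    changed = []
    for rel, expected in manifest.items():
        target = root / rel
        if not target.is_file() or sha256_of(target) != expected:
            changed.append(rel)
    return changed
```

A typical use would be to call build_manifest right after a download, store the result alongside the data, and run verify_manifest against the same folder before sharing or depositing it. Files added after the manifest was built are not reported by this sketch.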