Introduction
ArchiveBox is an open‑source self‑hosted web archiving tool that saves snapshots of websites—HTML, PDFs, screenshots, media files, and more—for long‑term preservation. Written in Python and driven by a simple CLI, it’s perfect for researchers, journalists, librarians, or anyone who wants to maintain an offline archive of web content.
By using ClawCloud Run’s free tier, you can deploy ArchiveBox in minutes with one click, bind your own domain, get automatic HTTPS, and scale resources as needed.
Prerequisites
- ClawCloud Account
Sign up or log in at ClawCloud Console. - Real‑Name Verification
Complete any required identity checks. - (Optional) Domain Name
For binding your ArchiveBox interface to a custom URL.
Step 1: Log In to ClawCloud Run
- Navigate to the ClawCloud Run Console and sign in.
- You’ll land on the Run dashboard.
Step 2: Install ArchiveBox from the App Store
- From the dashboard, click “APP Store”.
- In the search bar, type “ArchiveBox”.
- Click “Deploy APP” on the ArchiveBox listing.
- Wait a moment—ClawCloud will provision an
archivebox-xxxxxxxx
application, complete with its own database.
Step 3: Access the ArchiveBox UI
- Under APP Launchpad, find and click Details on your newly created
archivebox-xxxxxxxx
app. - Scroll to Network and click the Public Address URL (e.g.
https://abc123.ap-northeast-1.clawcloudrun.com
). - You’ll see the ArchiveBox web interface. From here, you can start adding URLs to archive:
- Paste URLs into the “Add URL” field
- Choose which archiving backends to use (e.g. Chrome, wget, pdf-export)
- Click “Add” to queue jobs
Managing & Scaling Your ArchiveBox Deployment
Whenever you need to modify settings—such as updating ArchiveBox to a newer image, resizing compute resources, binding domains, or expanding storage—follow these steps:
- In APP Launchpad, click your
archivebox-xxxxxxxx
app. - Click the Update button in the top‑right corner.
1. Upgrading ArchiveBox Version
- In the Image section, change the Image Name to the desired tag:
archivebox/archivebox:0.6.3
2. Adjusting CPU & Memory
- Under Usage, drag the CPU and Memory sliders to allocate more resources.
3. Binding a Custom Domain
- CNAME your domain to the ClawCloud Run subdomain.
- In Network, click Custom Domain, enter your domain, and submit.
- Wait a few minutes for DNS propagation and automatic HTTPS certificate issuance.
4. Expanding Storage
- Under Local Storage, select your archive volume, input the new size, and save.
Note: After any change, click Update and allow a few minutes for the platform to apply your modifications.