⬛️ CLI tool for saving complete web pages as a single HTML file
_____ ______________ __________ ___________________ ___ | \ / \ | | | | | | | \_/ __ \_| __ | | ___ ___ |__| | | | | | | | | | | | | | | |\ /| |__| _ |__| |____| | | | | __ | | | \___/ | | \ | | | | | | | |___| |__________| \_____________________| |___| |___| |___|
A data hoarder’s dream come true: bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive.
Unlike the conventional “Save page as”,
monolithnot only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share.
If compared to saving websites with
wget -mpk, this tool embeds all assets as data URLs and therefore lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.
$ cargo install monolith
$ brew install monolith
$ snap install monolith
$ cd /usr/ports/www/monolith/ $ make install clean
Every release contains pre-built binaries for Windows, GNU/Linux, as well as platforms with non-standart CPU architecture.
Dependency:
libssl-dev
$ git clone https://github.com/Y2Z/monolith.git $ cd monolith $ make install
The guide can be found here
$ monolith https://lyrics.github.io/db/P/Portishead/Dummy/Roads/ -o portishead-roads-lyrics.html
or
$ cat index.html | monolith -aIiFfcMv - > index-processed.html
-a: Exclude audio sources
-b: Use custom
base URL
-c: Exclude CSS
-e: Ignore network errors
-f: Omit frames
-F: Exclude web fonts
-i: Remove images
-I: Isolate the document
-j: Exclude JavaScript
-k: Accept invalid X.509 (TLS) certificates
-M: Don't add timestamp and URL information
-o: Write output to
file
-s: Be quiet
-t: Adjust
network request timeout
-u: Provide
custom User-Agent
-v: Exclude videos
Please set
https_proxy,
http_proxy, and
no_proxyenvironment variables.
Please open an issue if something is wrong, that helps make this project better.
Monolith Chrome Extension: https://github.com/rhysd/monolith-of-web
Pagesaver: https://github.com/distributed-mind/pagesaver
Personal WayBack Machine: https://github.com/popey/pwbm
Hako: https://github.com/dmpop/hako
The Unlicense
Keep in mind that
monolithis not aware of your browser’s session