Data Deduplication and Compression

This courselet explores two key techniques for minimizing storage usage in Linux using the Btrfs file system: data deduplication and compression. Learners will mount Btrfs volumes, use duperemove to identify and eliminate duplicate data, and verify space savings with disk usage tools. The courselet also covers enabling compression options in Btrfs and observing their effects on data storage. By the end, learners will gain hands-on experience in configuring efficient storage systems with Btrfs using standard Linux tools and commands.

- - - 1 Aug. 7, 2025, 6:19 PM

Authors

Digital Object Identifier (DOI)

10.5281/zenodo.16763363 (2025-08-01T18:12UTC)
Launch on Chameleon

Launching this artifact will open it within Chameleon’s shared Jupyter experiment environment, which is accessible to all Chameleon users with an active allocation.

Download Archive

Download an archive containing the files of this artifact.

Version Stats

- - -