WIP: Hard-Drive Visualizations

Some work-in-progress visualizations of the physical location of specific files on my hard-drive, being made as part of my residency at Bell Labs. The above two images are details, showing files (in red) and empty space (in gray); each little square is one 512-byte sector.

This current version is visualizing an 8GB thumb drive. The prints are approximately 36×60″ (91×152 cm) so each individual sector can be seen. The hope is to do a version of my laptop’s hard-drive, which will require a much larger print or set of prints.

The goal of these visualizations is not analytics or troubleshooting – instead, I’m very interested in the abstraction between us, our computers, and our data. We take a photograph of something in the world and store it on a physical drive (even the end-point of the cloud is still a physical object). Once stored on a hard-drive, digital objects, however ephemera-seeming, continue to exist as physical ones and zeroes, magnetic charges of a specific size and intensity. These visualizations are about trying to unpack and see my personal data on that level.

But there is a significant disconnect between myself and my computer, and the actual data on my hard-drive. It turns out that drives contain a specialized chip that handles requests to read and write data; the algorithms that actually retrieve and set data are a closely guarded industrial secret.  These chips act as a black box between the user and their data, rendering a detailed picture of the exact location of the bits that make up our digital lives almost impossible. This obscurity is especially prevalent in SSDs, due to the way they store data. This post on Aleratec puts it a finer point on it:

…unlike traditional hard-drives where the physical location of each bit of data is known and constant, the physical location of data in an SSD is highly abstracted from the outside world.  Whereas each Logical Block Address (LBA) on a [hard-drive] always points to the same physical location, the physical location to which an SSD LBA points changes often.

My images then are a best-guess of where the actual data is stored, extracted using digital forensics tools. Below is the full image of the drive – click on it for a full-sized view.

Visualizing Mac OS Commands

A early test for a series of images visualizing the fundamental commands (essentially tiny programs) that run the Mac OS, things like ls, cd, and zsh (above). Bits from the binary files (found in usr/bin) are shown as black or white pixels. Click on the image for a higher-res version, where you can more easily see individual bits.

Random Walk: Square Root of Two

Following yesterday’s experiments with the Random Pi Walk (hat tip Alex Bellos), I’ve upped the ante.  The above image is the decimal expansion of the square root of two, following the first one million digits.  The data is thanks to Stan Kerr via Project Gutenberg.  Each decimal digit 0-9 results in a change of direction of 36 degrees and, in this case, travels 3 pixels in that direction.

The resulting image is MUCH larger than the previous visualizations – click here for the full resolution version.


The above image is the result of a Processing sketch that takes an audio file, sorts its sample values, and visualizes the current and original position of each sample.  At 44,100 samples per second, the above 10-second audio file has 441,000 arcs.  Height and line color are proportionate to the distance the line travels, otherwise the image would basically be a big block.

Still in refinement, source code forthcoming…