NAME
Apophis - Content-addressable storage with deterministic UUID v5 identifiers
VERSION
Version 0.02
SYNOPSIS
use Apophis;
my $ca = Apophis->new(
namespace => 'myapp-files',
store_dir => '/var/store',
);
# Identify content — deterministic UUID v5
my $id = $ca->identify(\$content);
my $id = $ca->identify_file('/path/to/large-file'); # streaming, O(1) memory
# Store (atomic write, CAS dedup)
my $id = $ca->store(\$content);
# Retrieve
my $data = $ca->fetch($id);
# Check / remove
if ($ca->exists($id)) { ... }
$ca->remove($id);
# Integrity verification
my $ok = $ca->verify($id); # re-hash and compare
# Sharded path
my $path = $ca->path_for($id);
# /var/store/a3/bb/a3bb189e-8bf9-5f18-b3f6-1b2f5f5c1e3a
# Bulk operations
my @ids = $ca->store_many(\@content_refs);
my @missing = $ca->find_missing(\@ids);
# Metadata
$ca->store(\$content, meta => { mime_type => 'image/png' });
DESCRIPTION
Apophis is a 100% XS content-addressable storage library built on the Horus UUID library (RFC 9562). It generates deterministic UUID v5 identifiers for arbitrary content using SHA-1 namespace hashing.
Same content always produces the same UUID. Different namespaces produce different UUIDs for the same content.
Stored objects are sharded across a 2-level hex directory tree (65,536 directories) for efficient filesystem access at scale. Writes are atomic (temp + rename). CAS is naturally idempotent — no locking required.
All logic is implemented in C for maximum performance. The Perl layer is just XSLoader — every method is an XSUB.
METHODS
new
my $ca = Apophis->new(
namespace => 'myapp-files', # required
store_dir => '/var/store', # optional default store path
);
Creates a new Apophis instance. The namespace string is hashed via UUID v5 (using DNS as root namespace) to produce a proper namespace UUID.
identify
my $id = $ca->identify(\$content);
Returns a deterministic UUID v5 string for the given content.
identify_file
my $id = $ca->identify_file('/path/to/file');
Streaming identification — reads the file in 64KB chunks via SHA-1. Uses O(1) memory regardless of file size. Returns the same UUID that identify() would for the same content.
store
my $id = $ca->store(\$content);
my $id = $ca->store(\$content, store_dir => '/other');
my $id = $ca->store(\$content, meta => { mime_type => 'image/png' });
Identifies the content and writes it to the sharded store. Returns the UUID. If the content already exists, returns immediately (CAS dedup). Writes are atomic via temp file + rename.
fetch
my $data_ref = $ca->fetch($id);
Returns a scalar reference to the stored content, or undef if not found.
exists
if ($ca->exists($id)) { ... }
Returns true if the content exists in the store.
remove
$ca->remove($id);
Removes the content and its metadata sidecar (if any) from the store.
path_for
my $path = $ca->path_for($id);
Returns the 2-level sharded filesystem path for the given UUID:
a3bb189e-8bf9-... → /store/a3/bb/a3bb189e-8bf9-...
verify
my $ok = $ca->verify($id);
Re-reads the stored content, re-identifies it, and compares the UUID. Returns true if the content is intact.
store_many
my @ids = $ca->store_many(\@content_refs);
Stores multiple content items. Returns a list of UUIDs.
find_missing
my @missing = $ca->find_missing(\@ids);
Returns the subset of IDs that do not exist in the store.
namespace
my $ns = $ca->namespace();
Returns the namespace UUID string.
DEPENDENCIES
Horus — pure C UUID library (header-only, RFC 9562).
AUTHOR
LNATION <email@lnation.org>
LICENSE
This module is free software; you may redistribute and/or modify it under the same terms as Perl itself.