Changes for version v0.10.001 - 2016-05-12
- added -log-level option to dcdb-info.perl
- removed dates from generic term-tuple vocabulary ("x-tuples" -> "t-tuples"), a la tdf relation
- changed db structure for more efficient 2-pass Cofreqs queries (f2 bug-fix)
- Cofreqs now 3-level (id1 -> (date -> (id2->f)))
- Unigrams now 2-level (id1 -> (date -> f))
- Relation::subprofile1() and subprofile2() calling conventions changed
- changed temporary file format for "tokens.dat" used by DiaColloDB::create(): added dates
- changed export text file formats
- Unigrams: added dates
- Cofreqs: added dates and un-collocated f1 lines
- "x-tuple" exports replaced by corresponding "t-tuple" exports xenum->tenum, ATTR_2x.*->ATTR_2t, etc.
- added upgrade package v0_10_x2t
- added compatibility wrappers Compat::v0_09::* for transparent use of old indices
- added auto-backup of changed files to upgrade framework
- upgraders are now instantiated as objects, not just packages: cache header & options
- added DiaColloDB::Upgrade::Base::revert() method and -revert option to dcdb-upgrade.perl
- default implementation relies on subclass revert_created() and revert_updated() methods
- added dcdb-upgrade.perl options -keep, -[no]backup
- added DiaColloDB::Utils functions copyto(), moveto(), copyto_a(), cp_a()
- added DiaColloDB::Persistent method-wrappers copyto(), moveto(), copyto_a()
- added optimized PackedFile::MMap::bsearch() method
- for faster v0.10.x Cofreqs 'onepass' mode; still not as fast as v0.09.x 1-pass but it's incorrect anyways
- removed unused methods Cofreqs::f1(), Cofreqs::f12()
- removed obsolete method DiaColloDB::xidsByDate()
- re-factored compatibility wrappers into DiaColloDB::Compat::vX_Y_Z::*
Modules
diachronic collocation database, top-level
diachronic collocation db, top-level client API
diachronic collocation db: client: local dbdir
diachronic collocation db: client: remote http server
diachronic collocation db: client: distributed
DiaColloDB utilities: compatibility modules: v0.08.x
diachronic collocation db, integer-integer* multimap file, backwards-compatible (v0.08.x)
DiaColloDB utilities: compatibility modules: v0.09.x
collocation db, top-level: backwards-compatible (v0.09.x)
collocation db, relation API: backwards-compatible (v0.09.x)
collocation db, profiling relation: co-frequency database (v0.9x)
collocation db, profiling relation: unigram database (v0.09.x)
diachronic collocation db, source corpous
diachronic collocation db, source document (base class)
diachronic collocation db, source document, DDC tab-dump
diachronic collocation db, source document, raw JSON
diachronic collocation db, source document, TCF format
diachronic collocation db, source document, TEI format
diachronic collocation db, symbollt-gtinteger enum
diachronic collocation db, symbollt-gtinteger enum, fixed-length symbols
diachronic collocation db, symbollt-gtinteger enum, fixed-length symbols, mmap
diachronic collocation db, symbollt-gtinteger enum, mmap
diachronic collocation db: symbollt-gtinteger enum: tied interface
DiaColloDB logging (using Log::Log4perl)
diachronic collocation db, integer->integer* multimap file, e.g. for expansion indices
collocation db, integer->integer* multimap file, using mmap
DiaColloDB utilities: (temporary) mmaped PDLs
diachronic collocation db: flat fixed-length record-oriented files
collocation db: flat fixed-length record-oriented files; mmap variant
diachronic collocation db, generic persistent objects
diachronic collocation db, (co-)frequency profile
diachronic collocation db, diff profiles
diachronic collocation db, (co-)frequency profile, by date-slice
diachronic collocation db, (co-)frequency profile diffs, by date
diachronic collocation db, relation API (abstract & utilities)
diachronic collocation db, profiling relation: native fixed-window co-frequency index
diachronic collocation db, profiling relation: ddc client
collocation db, profiling relation: (term x document) raw-frequency matrix
collocation db, profiling relation: PDL: query hacks
diachronic collocation db, profiling relation: native unigram index
Descript: DiaColloDB: temporary data structures: common base class
DiaColloDB: temporary arrays
DiaColloDB: temporary hashes
DiaColloDB: temporary mmaped vec() buffers
diachronic collocation db, timer
DiaColloDB utilities: auto-magic upgrades: top level
DiaColloDB utilities: auto-magic upgrade: base class / API
DiaColloDB utilities: auto-magic upgrade: v0.04: date limits
DiaColloDB utilities: auto-magic upgrade: v0.09.x: MultiMapFile format
DiaColloDB utilities: auto-magic upgrade: v0.10.x: x-tuples (+date) to t-tuples (-date)
diachronic collocation database, generic utilities
Provides
in DiaColloDB/Relation/Cofreqs.pm
in DiaColloDB/Compat.pm
in DiaColloDB/Compat/v0_09/Relation/Cofreqs.pm
in DiaColloDB/Compat/v0_09/Relation/Unigrams.pm
in DiaColloDB/Relation/DDC.pm
in DiaColloDB/EnumFile/Tied.pm
in DiaColloDB/EnumFile/FixedMap.pm
in DiaColloDB/EnumFile/Tied.pm
in DiaColloDB/EnumFile/Tied.pm
in DiaColloDB/Profile/MultiDiff.pm
in DiaColloDB/Profile/MultiDiff.pm
in DiaColloDB/Relation/Unigrams.pm