Revision history for Parse-MediaWikiDump
1.0.6 Dec 04, 2010
* Fix for bug #58196 - error "not a MediaWiki link dunp file" due
to absence of 'LOCK TABLES ...' line in link dump file?
* Added option to pass input to ::Pages constructor via named parameters so
that MediaWiki::DumpFile::Compat and ::Pages share the same API
* Software is nearly retired, only need more testing on MediaWiki::DumpFile::Compat
1.0.5 Apr 21, 2010
* Updated docs pointing people to MediaWiki::DumpFile::Compat
1.0.4 Jan 04, 2010
* Fixed bug #53361: Incorrectly assigned usernames
with anon edits
* Added support for getting access to IP of anonymous
edits
1.0.3 Nov 21, 2009
* Fixed bug #51607 "Build failed CPAN smoke test for
for i686pc solaris" by specifying minimum version
numbers for all dependencies in Makefile.PL.
1.0.2 Nov 15, 2009
* Fixed bug #51461 "Warnings During Build" - now
the test process squelches the harmless
warning from Test::Memory::Cycle.
1.0.1 Nov 11, 2009
* CPAN indexer did not like previous version number
1.0.0 Nov 11, 2009
* Stable status achieved
* Slight speed tweak on character handler for XML
parser
* Added dependency on Devel::Cycle 1.11 as 1.10
causes a false positive to be thrown on the
memory leak test
0.98 Oct 28, 2009
* Bumped processing speed back up
* Fixed possible infinite loop scenario
* Ordered tests
* Added test to find circular references
0.97 Oct 23, 2009
* Fixed all known memory leaks
* No more Object::Destroyer
* Cleaned out some old cruft
0.96 Oct 22, 2009
* Allowed parsing of 0.4 version XML dump files but not
support for new features
* Added in a method to retrieve the version number of
the XML dump file
0.95 Oct 14, 2009
* Found and removed a circular reference but it did not
stop the memory leak
* Fixed bug 50092 - some times $page->text would return
a reference to an undefined value
* Implemented support for compressed file GLOB objects per
bug 50241
0.94 Sep 28, 2009
* Fix bug 49979 - "redirect in newer Wikipedia dumps" by allowing
unknown tag names to exist
0.93 Sep 15, 2009
* Made ::Pages a subclass of ::Revisions
* Discovered a bug regression: ::Pages and ::Revisions leak
memory/are not properly garbage collected
0.92 Apr 15, 2009
* Completed documentation for all modules
* Added test for backwards compatibility to the pre-factory
Parse::MediaWikiDump interface
0.91 May 13, 2009
* Updated documentation to more explicitly list what kind of
dump files each parser object can deal with.
* Added dependency on perl 5.8.8 for :utf8 compatibility.
* Split up lib/ into multiple files.
* Fix for bug #46054 - using categories method of
Parse::MediaWikiDump::page object causes script to crash.
0.90 May 07, 2009
* Implemented new parsing engine and called it
Parse::MediaWikiDump::Revisions. Soon it will be replacing
Parse::MediaWikiDump::Pages as a base engine. It is fully backwards
compatible so please feel free to test it in your existing utilities
and report success and failure to the author.
* Moved namespace logic into Parse::MediaWikiDump::page and updated
Parse::MediaWikiDump::Pages to support it.
0.51 May 31, 2008
* Fix for bug 36255 "Parse::MediaWikiDump::page::namespace may return
a string which is not really a namespace" provided by Amir E. Aharoni.
* Moved test data into t/ and moved speed_test.pl into examples/
* Exceedingly complicated functions (parse_head() and parse_page()) are
not funny. Added some comments on how to rectify that situation.
* Tightened up the tests a little bit.
0.50 Jun 27, 2006
* Added category links parser.
* Removed all instances of shift() from the code.
0.40 Jun 21, 2006
* Increased processing speed by around 40%! Thank you
Andrew Rodland.
0.33 Jun 18, 2006
* Added current_byte and size methods to page dumper.
0.32 Feb 25, 2006
* Added a line to create a package named Parse::MediaWikiDump so
the module will get listed on CPAN search and the cpan command
line tool.
0.31 Jan 10, 2006
* Fix bug 16981 - Parse::MediaWikiDump::page->redirect does not work
with redirects that have a : in them.
* Fix bug 16981 part two: title with a non-breaking space in it would
come through as undefined.
0.30 December 23, 2005
* the Pages and Links class now both use a method named next() to get
the next record. The old link() and page() methods are retained for
now but should be migrated away from as soon as is convenient.
* Added list of dump files that this module can process to the README
file.
0.24 December 19, 2005
* Fixed bug #16616 - the category method only works properly on English
language dump files.
0.23 December 19, 2005
* Fixed email address for author.
* Fixed omission of namespace method for pages objects in the
documentation.
* Added limitations section to README.
* Fixed http://rt.cpan.org bug #16583 - Module dies when parsing
the 20051211 German Wikipedia dump.
* Added some comments to the source code.
0.22 September 15, 2005
* Created some new and more comprehensive examples.
* Parse::MediaWikiDump::Pages now dies with a specific error
if it is asked to parse a comprehensive (full pages) dump file.
* Updated Parse::MediaWikiDump::Links to new dump file format.
* Added tests for Parse::MediaWikiDump::Links.
* Solved a bug: Expat's current_byte method returns a 32 bit
signed integer and the english Wikipedia dumps cause the number to
wrap; implemented a work around for this.
0.21 September 10, 2005
* Improve testing of Parse::MediaWikiDump::Pages
* Fix silly bug related to opening file handle references
* Found new bug: The links dump format has been changed and the
existing code can not parse the new format
* Found new bug: comprehensive dump files like
20050909_pages_full.xml.gz cause the stack to grow too large and
the module to abort early.
0.2 September 9, 2005
* Add tests and test data
0.1 September 6, 2005
* First version, released on an unsuspecting world.