NAME
perl5137delta - what is new for perl v5.13.7
DESCRIPTION
This document describes differences between the 5.13.6 release and the 5.13.7 release.
If you are upgrading from an earlier release such as 5.13.5, first read perl5136delta, which describes differences between 5.13.5 and 5.13.6.
Core Enhancements
Single term prototype
The + prototype is a special alternative to $ that will act like \[@%] when given a literal array or hash variable, but will otherwise force scalar context on the argument. This is useful for functions that should accept either a literal array or an array reference as the argument:
sub smartpush (+@) {
    my $aref = shift;
    die "Not an array or arrayref" unless ref $aref eq 'ARRAY';
    push @$aref, @_;
}
When using the + prototype, your function must check that the argument is of an acceptable type.
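As a rough illustration of the calling side, the sketch below repeats the smartpush definition and calls it with a literal array, an array reference, and a plain string. The variable names (@list, $ref, $ok) are invented for this example.

```perl
use strict;
use warnings;

# Same definition as in the text; it must be visible before the calls
# so that the prototype is applied at compile time.
sub smartpush (+@) {
    my $aref = shift;
    die "Not an array or arrayref" unless ref $aref eq 'ARRAY';
    push @$aref, @_;
}

my @list = (1, 2);
my $ref  = [10];

smartpush @list, 3, 4;    # literal array: the sub receives \@list
smartpush $ref, 20;       # array reference: passed through in scalar context

# A plain string is accepted by the prototype, so the sub's own check
# is what rejects it.
my $ok = eval { smartpush 'oops', 1; 1 };

print "@list\n";          # contents of @list after the call
print "@$ref\n";          # contents of the referenced array
print defined $ok ? "accepted\n" : "rejected: $@";
```

The last call shows why the type check inside the function matters: the prototype alone does not stop a non-reference scalar from being passed.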
use re '/flags';
The re pragma can now turn on regular expression flags until the end of the lexical scope:
use re '/x';
"foo" =~ / (.+) /; # /x implied
See "'/flags' mode" in re for details.
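The lexical scoping can be seen in a small sketch like the one below. The variable names are made up for illustration; the same pattern is written once inside a block that enables /x and once outside it.

```perl
use strict;
use warnings;

my ($inside, $no_match);
{
    use re '/x';                      # /x applies to every regex in this block
    ($inside) = "foo" =~ / (.+) /;    # spaces in the pattern are ignored
}

# Outside the block the spaces are literal again, so "foo" does not match.
$no_match = ("foo" =~ / (.+) /) ? 0 : 1;

print "inside block captured: $inside\n";
print "outside block matched: ", ($no_match ? "no" : "yes"), "\n";
```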
Statement labels can appear in more places
Statement labels can now occur before any type of statement or declaration, such as package.
use feature "unicode_strings" now applies to more regex matching
Another chunk of "The "Unicode Bug"" in perlunicode is fixed in this release. Now, regular expressions compiled within the scope of the "unicode_strings" feature (or under the "u" regex modifier, currently specifiable only with the infix notation (?u:...) or via use re '/u') will match the same whether or not the target string is encoded in utf8, with regard to [[:posix:]] character classes.
Work is underway to bring case-sensitive matching under the control of this feature, but it was not complete in time for this dot release.
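A minimal sketch of the behaviour described above, assuming a perl that includes this change: the same Latin-1 character is tested against a POSIX class once as a byte string and once after being upgraded to the internal UTF-8 encoding. The variable names are illustrative only.

```perl
use strict;
use warnings;
use feature 'unicode_strings';

my $native   = "\xE9";        # LATIN SMALL LETTER E WITH ACUTE, stored as one byte
my $upgraded = "\xE9";
utf8::upgrade($upgraded);     # same character, now UTF-8 encoded internally

# Both patterns are compiled within the scope of unicode_strings, so the
# internal encoding of the target should not affect the result.
my $native_is_alpha   = $native   =~ /[[:alpha:]]/ ? 1 : 0;
my $upgraded_is_alpha = $upgraded =~ /[[:alpha:]]/ ? 1 : 0;

print "native: $native_is_alpha, upgraded: $upgraded_is_alpha\n";
```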
Array and hash container functions accept references
All built-in functions that operate directly on array or hash containers now also accept hard references to arrays or hashes:
|----------------------------+---------------------------|
| Traditional syntax | Terse syntax |
|----------------------------+---------------------------|
| push @$arrayref, @stuff | push $arrayref, @stuff |
| unshift @$arrayref, @stuff | unshift $arrayref, @stuff |
| pop @$arrayref | pop $arrayref |
| shift @$arrayref | shift $arrayref |
| splice @$arrayref, 0, 2 | splice $arrayref, 0, 2 |
| keys %$hashref | keys $hashref |
| keys @$arrayref | keys $arrayref |
| values %$hashref | values $hashref |
| values @$arrayref | values $arrayref |
| ($k,$v) = each %$hashref | ($k,$v) = each $hashref |
| ($k,$v) = each @$arrayref | ($k,$v) = each $arrayref |
|----------------------------+---------------------------|
This allows these built-in functions to act on long dereferencing chains or on the return value of subroutines without needing to wrap them in @{} or %{}:
push @{$obj->tags}, $new_tag; # old way
push $obj->tags, $new_tag; # new way
for ( keys %{$hoh->{genres}{artists}} ) {...} # old way
for ( keys $hoh->{genres}{artists} ) {...} # new way
For push, unshift and splice, the reference will auto-vivify if it is not defined, just as if it were wrapped with @{}.
Calling keys or values directly on a reference gives a substantial performance improvement over explicit dereferencing.
For keys, values and each, when overloaded dereferencing is present, the overloaded dereference is used instead of dereferencing the underlying reftype. Warnings are issued about assumptions made in the following three ambiguous cases:
(a) If both %{} and @{} overloading exists, %{} is used
(b) If %{} overloading exists on a blessed arrayref, %{} is used
(c) If @{} overloading exists on a blessed hashref, @{} is used
y///r
The /r flag, which was added to s/// in 5.13.2, has been extended to the y/// operator.
It causes y/// to perform the transliteration on a copy of its operand, returning that copy instead of a character count.
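The difference between the two return values can be sketched as follows. The variable names are invented for the example.

```perl
use strict;
use warnings;

my $orig  = "hello";

# With /r the operand is left alone and the transliterated copy is returned.
my $upper = $orig =~ y/a-z/A-Z/r;

# Without /r the return value is a character count. An empty replacement
# list maps each character to itself, so $orig is not changed here either.
my $count = $orig =~ y/a-z//;

print "$orig $upper $count\n";
```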
New global variable ${^GLOBAL_PHASE}
A new global variable, ${^GLOBAL_PHASE}, has been added to allow introspection of the current phase of the perl interpreter. It's explained in detail in "${^GLOBAL_PHASE}" in perlvar and "BEGIN, UNITCHECK, CHECK, INIT and END" in perlmod.
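A small sketch of how the variable might be inspected, assuming the code is run as a stand-alone script (inside a string eval at run time the BEGIN block would see a later phase). The @phases array is just a name chosen for this example.

```perl
use strict;
use warnings;

our @phases;

# Recorded while the main program is still being compiled.
BEGIN { push @phases, ${^GLOBAL_PHASE} }

# Recorded during ordinary execution.
push @phases, ${^GLOBAL_PHASE};

print "@phases\n";
```

perlvar lists the full set of phase names and the order in which they occur.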
Unicode Version 6.0 is now supported (mostly)
Perl comes with the Unicode 6.0 database updated with Corrigendum #8, with one exception noted below. See http://unicode.org/versions/Unicode6.0.0 for details on the new release. Perl does not support any Unicode provisional properties, including the new ones for this release, but their database files are packaged with Perl.
Unicode 6.0 has chosen to use the name BELL for the character at U+1F514, which is a symbol that looks like a bell and is used in Japanese cell phones. This conflicts with the long-standing Perl usage of having BELL mean the ASCII BEL character, U+0007. In Perl 5.14, \N{BELL} will continue to mean U+0007, but its use will generate a deprecation warning, unless such warnings are turned off. The new name for U+0007 in Perl will be ALERT, which corresponds nicely with the existing shorthand sequence for it, "\a". \N{BEL} will mean U+0007, with no warning given. The character at U+1F514 will not have a name in 5.14, but can be referred to by \N{U+1F514}. The plan is that in Perl 5.16, \N{BELL} will refer to U+1F514, and so all code that uses \N{BELL} should convert by then to using \N{ALERT}, \N{BEL}, or "\a" instead.
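In line with the migration advice above, the following sketch uses only the spellings that are meant to stay stable and deliberately avoids \N{BELL}. It assumes a perl recent enough to know the ALERT and BEL names; the variable names are illustrative.

```perl
use strict;
use warnings;
use charnames ':full';

# Three spellings that are all meant to denote U+0007.
my @spellings = ("\N{ALERT}", "\N{BEL}", "\a");
my @codes     = map { ord($_) } @spellings;

# The bell symbol is addressed by code point rather than by name.
my $bell_symbol = "\N{U+1F514}";

printf "U+%04X\n", $_ for @codes, ord($bell_symbol);
```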
Improved support for custom OPs
Custom ops can now be registered with the new custom_op_register C function and the XOP structure. This will make it easier to add new properties of custom ops in the future. Two new properties have been added already, xop_class and xop_peep.
xop_class is one of the OA_*OP constants, and allows B and other introspection mechanisms to work with custom ops that aren't BASEOPs. xop_peep is a pointer to a function that will be called for ops of this type from Perl_rpeep.
See "Custom Operators" in perlguts and "Custom Operators" in perlapi for more detail.
The old PL_custom_op_names/PL_custom_op_descs interface is still supported but discouraged.
Incompatible Changes
Dereferencing typeglobs
If you assign a typeglob to a scalar variable:
$glob = *foo;
the glob that is copied to $glob is marked with a special flag indicating that the glob is just a copy. This allows subsequent assignments to $glob to overwrite the glob. The original glob, however, is immutable.
Many Perl operators did not distinguish between these two types of globs. This would result in strange behaviour in edge cases: untie $scalar would do nothing if the last thing assigned to the scalar was a glob (because it treated it as untie *$scalar, which unties a handle). Assignment to a glob slot (e.g., (*$glob) = \@some_array) would simply assign \@some_array to $glob.
To fix this, the *{} operator (including the *foo and *$foo forms) has been modified to make a new immutable glob if its operand is a glob copy. Various operators that make a distinction between globs and scalars have been modified to treat only immutable globs as globs.
This causes an incompatible change in code that assigns a glob to the return value of *{} when that operator was passed a glob copy. Take the following code, for instance:
$glob = *foo;
*$glob = *bar;
The *$glob on the second line returns a new immutable glob. That new glob is made an alias to *bar. Then it is discarded. So the second assignment has no effect.
It also means that tie $handle will now tie $handle as a scalar, even if it has had a glob assigned to it.
The upside to this incompatible change is that bugs [perl #77496], [perl #77502], [perl #77508], [perl #77688], and [perl #77812], and maybe others, too, have been fixed.
See http://rt.perl.org/rt3/Public/Bug/Display.html?id=77810 for even more detail.
Clearing stashes
Stash list assignment %foo:: = () used to make the stash anonymous temporarily while it was being emptied. Consequently, any of its subroutines referenced elsewhere would become anonymous (showing up as "(unknown)" in caller). Now they retain their package names, such that caller will return the original sub name if there is still a reference to its typeglob, or "foo::__ANON__" otherwise [perl #79208].
Deprecations
\N{BELL} is deprecated
This is because Unicode is using that name for a different character. See "Unicode Version 6.0 is now supported (mostly)" for more explanation.
Performance Enhancements
When an object has many weak references to it, freeing that object can under some circumstances take O(N^2) time (where N is the number of references). The number of such circumstances has been reduced [perl #75254].
Modules and Pragmata
New Modules and Pragmata
The following modules were added by the Unicode::Collate upgrade from 0.63 to 0.67. See below for details.
Unicode::Collate::CJK::Big5
Unicode::Collate::CJK::GB2312
Unicode::Collate::CJK::JISX0208
Unicode::Collate::CJK::Korean
Unicode::Collate::CJK::Pinyin
Unicode::Collate::CJK::Stroke
Updated Modules and Pragmata
Archive::Extract has been upgraded from 0.44 to 0.46. Resolves an issue with NetBSD-current and its new unzip executable.
Archive::Tar has been upgraded from 1.68 to 1.72. This adds the ptargrep utility for using regular expressions against the contents of files in a tar archive.
B has been upgraded from 1.24 to 1.26. It no longer crashes when taking apart a y/// containing characters outside the octet range or compiled in a use utf8 scope. The size of the shared object has been reduced by about 40%, with no reduction in functionality.
B::Deparse has been upgraded from 0.99 to 1.01. It fixes deparsing of our followed by a variable with funny characters (as permitted under the utf8 pragma) [perl #33752].
CGI has been upgraded from 3.49 to 3.50. This provides the following security fixes: the MIME boundary in multipart_init is now random, and the handling of newlines embedded in header values has been improved. The documentation for param_fetch() has been corrected and clarified.
CPAN has been upgraded from 1.94_61 to 1.94_62.
CPANPLUS has been upgraded from 0.9007 to 0.9010. Fixes for the SQLite source engine and resolving of issues with the testsuite when run under local::lib and/or cpanminus.
CPANPLUS::Dist::Build has been upgraded from 0.48 to 0.50.
Data::Dumper has been upgraded from 2.129 to 2.130_01.
DynaLoader has been upgraded from 1.10 to 1.11. It fixes a buffer overflow when passed a very long file name.
ExtUtils::Constant has been upgraded from 0.22 to 0.23. The AUTOLOAD helper code generated by ExtUtils::Constant::ProxySubs can now croak for missing constants, or generate a complete AUTOLOAD subroutine in XS, allowing simplification of many modules that use it (Fcntl, File::Glob, GDBM_File, I18N::Langinfo, POSIX, Socket). ExtUtils::Constant::ProxySubs can now optionally push the names of all constants onto the package's @EXPORT_OK. This has been used to replace less space-efficient code in B, helping considerably shrink the size of its shared object.
Fcntl has been upgraded from 1.09 to 1.10.
File::Fetch has been upgraded from 0.24 to 0.28. HTTP::Lite is now supported for the 'http' scheme. The fetch utility is supported on FreeBSD, NetBSD and Dragonfly BSD for the http and ftp schemes.
File::Glob has been upgraded from 1.09 to 1.10.
File::stat has been upgraded from 1.03 to 1.04. The -x and -X file test operators now work correctly under the root user.
GDBM_File has been upgraded from 1.11 to 1.12. This fixes a memory leak when DBM filters are used.
Hash::Util has been upgraded from 0.09 to 0.10.
Hash::Util::FieldHash has been upgraded from 1.05 to 1.06.
I18N::Langinfo has been upgraded from 0.06 to 0.07.
Locale::Maketext has been upgraded from 1.16 to 1.17.
Math::BigInt has been upgraded from 1.97 to 1.99_01.
Math::BigRat has been upgraded from 0.26 to 0.26_01.
Math::BigInt::FastCalc has been upgraded from 0.22 to 0.24_01.
MIME::Base64 has been upgraded from 3.09 to 3.10. Includes new functions to calculate the length of encoded and decoded base64 strings.
mro has been upgraded from 1.04 to 1.05.
NDBM_File has been upgraded from 1.09 to 1.10. This fixes a memory leak when DBM filters are used.
ODBM_File has been upgraded from 1.08 to 1.09. This fixes a memory leak when DBM filters are used.
Opcode has been upgraded from 1.16 to 1.17.
parent has been upgraded from 0.223 to 0.224.
Pod::Simple has been upgraded from 3.14 to 3.15. Includes various fixes to HTML and XHTML handling.
POSIX has been upgraded from 1.21 to 1.22.
re has been upgraded from 0.13 to 0.14, for the sake of the new use re "/flags" pragma.
Safe has been upgraded from 2.28 to 2.29. It adds &version::vxs::VCMP to the default share.
SDBM_File has been upgraded from 1.07 to 1.08.
SelfLoader has been upgraded from 1.17 to 1.18. It now works in taint mode [perl #72062].
Socket has been upgraded from 1.90 to 1.91.
Storable has been upgraded from 2.22 to 2.24. Includes performance improvement for overloaded classes.
Sys::Hostname has been upgraded from 1.13 to 1.14.
Unicode::Collate has been upgraded from 0.63 to 0.67. This release adds the locales ja, ko and zh and its variants (zh__big5han, zh__gb2312han, zh__pinyin, zh__stroke). UCA_Version 22 for Unicode 6.0.0 is now supported. The following modules have been added:
Unicode::Collate::CJK::Big5 for zh__big5han, which tailors CJK Unified Ideographs in the order of CLDR's big5han ordering.
Unicode::Collate::CJK::GB2312 for zh__gb2312han, which tailors CJK Unified Ideographs in the order of CLDR's gb2312han ordering.
Unicode::Collate::CJK::JISX0208, which tailors 6355 kanji (CJK Unified Ideographs) in the JIS X 0208 order.
Unicode::Collate::CJK::Korean, which tailors CJK Unified Ideographs in the order of CLDR's Korean ordering.
Unicode::Collate::CJK::Pinyin for zh__pinyin, which tailors CJK Unified Ideographs in the order of CLDR's pinyin ordering.
Unicode::Collate::CJK::Stroke for zh__stroke, which tailors CJK Unified Ideographs in the order of CLDR's stroke ordering.
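The MIME::Base64 entry in the list above mentions new length functions. A minimal sketch of how they might be used is shown here, assuming they are the encoded_base64_length and decoded_base64_length functions documented by MIME::Base64 and imported on request; the sample string and variable names are invented.

```perl
use strict;
use warnings;
use MIME::Base64 qw(encode_base64 decode_base64
                    encoded_base64_length decoded_base64_length);

my $plain   = "Hello, world!";
my $encoded = encode_base64($plain);

# Lengths computed without building the encoded or decoded string.
my $enc_len = encoded_base64_length($plain);
my $dec_len = decoded_base64_length($encoded);

print "encoded length: $enc_len\n";
print "decoded length: $dec_len\n";
```

Comparing the results against length(encode_base64(...)) and length(decode_base64(...)) is an easy way to check them in a test.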
Documentation
perlvar reorders the variables and groups them by topic. Each variable introduced after Perl 5.000 notes the first version in which it is available. perlvar also has a new section for deprecated variables to note when they were removed.
New Documentation
perlpodstyle
New style guide for POD documentation, split mostly from the NOTES section of the pod2man man page. (This was added to v5.13.6 but was not documented with that release.)
Changes to Existing Documentation
Array and hash slices in scalar context are now documented in perldata.
perlform and perllocale have been corrected to state that use locale affects formats.
Diagnostics
New Diagnostics
"Using !~ with %s doesn't make sense": This message was actually added in 5.13.2, but was omitted from perldelta. It now applies also to the y/// operator, and has been documented.
Utility Changes
ptargrep
ptargrep is a utility to apply pattern matching to the contents of files in a tar archive. It comes with Archive::Tar.
Testing
t/mro/isa_aliases.t has been added, which tests that *Foo::ISA = *Bar::ISA works properly.
t/mro/isarev.t has been added, which tests that PL_isarev (accessible at the Perl level via mro::get_isarev) is updated properly.
t/run/switchd-78586.t has been added, which tests that [perl #78586] has been fixed (related to line numbers in the debugger).
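The kind of aliasing that the first of these tests exercises can be sketched roughly as follows. This is not taken from the test file; the package and method names (Base, Other, hello, extra) are invented for illustration.

```perl
use strict;
use warnings;
no warnings 'once';

sub Base::hello  { 'hello from Base' }
sub Other::extra { 'extra from Other' }

@Bar::ISA = ('Base');
*Foo::ISA = *Bar::ISA;          # Foo and Bar now share one @ISA

# Method lookup on Foo goes through the shared array.
my $greeting = Foo->hello;

# A later change made through Bar's name should also be seen by Foo.
push @Bar::ISA, 'Other';
my $extra = Foo->can('extra') ? Foo->extra : undef;

print "$greeting\n";
print defined $extra ? "$extra\n" : "extra not found\n";
```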
Platform Support
Platform-Specific Notes
Windows
Directory handles are now properly cloned when threads are created. In perl 5.13.6, child threads simply stopped inheriting directory handles. In previous versions, threads would share handles, resulting in crashes.
Support for building with Visual C++ 2010 is now underway, but is not yet complete. See README.win32 for more details.
VMS
Record-oriented files (record format variable or variable with fixed control) opened for write by the perlio layer will now be line buffered to prevent the introduction of spurious line breaks whenever the perlio buffer fills up.
Internal Changes
lex_start has been added to the API, but is considered experimental.
A new parse_block function has been added to the API [perl #78222].
A new, experimental API has been added for accessing the internal structure that Perl uses for %^H. See the functions beginning with cophh_ in perlapi.
A stash can now have a list of effective names in addition to its usual name. The first effective name can be accessed via the HvENAME macro, which is now the recommended name to use in MRO linearisations (HvNAME being a fallback if there is no HvENAME). These names are added and deleted via hv_ename_add and hv_ename_delete. These two functions are not part of the API.
The way the parser handles labels has been cleaned up and refactored. As a result, the newFOROP() constructor function no longer takes a parameter stating what label is to go in the state op.
The newWHILEOP() and newFOROP() functions no longer accept a line number as a parameter.
A new parse_barestmt() function has been added, for parsing a statement without a label.
A new parse_label() function has been added, that parses a statement label, separate from statements.
The CvSTASH() macro can now only be used as an rvalue. CvSTASH_set() has been added to replace assignment to CvSTASH(). This is to ensure that backreferences are handled properly. These macros are not part of the API.
The op_scope() and op_lvalue() functions have been added to the API, but are considered experimental.
Selected Bug Fixes
The parse_stmt C function added earlier in the 5.13.x series has been fixed to work with statements ending with } [perl #78222].
The parse_fullstmt C function added in 5.13.5 has been fixed to work when called while an expression is being parsed.
Characters in the Latin-1 non-ASCII range (0x80 to 0xFF) used not to match themselves if the string happened to be UTF8-encoded internally, the regular expression was not, and the character in the regular expression was inside a repeated group (e.g., Encode::decode_utf8("\303\200") =~ /(\xc0)+/) [perl #78464].
The (?d) regular expression construct now overrides a previous (?u) or use feature "unicode_strings" [perl #78508].
A memory leak in do "file", introduced in perl 5.13.6, has been fixed [perl #78488].
Various bugs related to typeglob dereferencing have been fixed. See "Dereferencing typeglobs", above.
The SvPVbyte function available to XS modules now calls magic before downgrading the SV, to avoid warnings about wide characters [perl #72398].
The = operator used to ignore magic (e.g., tie methods) on its right-hand side if the scalar happened to hold a typeglob. This could happen if a typeglob was the last thing returned from or assigned to a tied scalar [perl #77498].
sprintf was ignoring locales when called with constant arguments [perl #78632].
A non-ASCII character in the Latin-1 range could match both a Posix class, such as [[:alnum:]], and its inverse [[:^alnum:]]. This is now fixed for regular expressions compiled under the "u" modifier. See "use feature "unicode_strings" now applies to more regex matching" [perl #18281].
Concatenating long strings under use encoding no longer causes perl to crash [perl #78674].
Typeglob assignments would crash if the glob's stash no longer existed, so long as the glob assigned to was named 'ISA' or the glob on either side of the assignment contained a subroutine.
Calling ->import on a class lacking an import method could corrupt the stack, resulting in strange behaviour. For instance, push @a, "foo", $b = bar->import; would assign 'foo' to $b [perl #63790].
Creating an alias to a package when that package had been detached from the symbol table would result in corrupted isa caches [perl #77358].
.= followed by <> or readline would leak memory if $/ contained characters beyond the octet range and the scalar assigned to happened to be encoded as UTF8 internally [perl #72246].
The recv function could crash when called with the MSG_TRUNC flag [perl #75082].
Evaluating a simple glob (like *a) was calling get-magic on the glob, even when its contents were not being used [perl #78580]. This bug was introduced in 5.13.2 and did not affect earlier perl versions.
Matching a Unicode character against an alternation containing characters that happened to match continuation bytes in the former's UTF8 representation (qq{\x{30ab}} =~ /\xab|\xa9/) would cause erroneous warnings [perl #70998].
s///r (added in 5.13.2) no longer leaks.
The trie optimisation was not taking empty groups into account, preventing 'foo' from matching /\A(?:(?:)foo|bar|zot)\z/ [perl #78356].
A pattern containing a + inside a lookahead would sometimes cause an incorrect match failure in a global match (e.g., /(?=(\S+))/g) [perl #68564].
Iterating with foreach over an array returned by an lvalue sub now works [perl #23790].
$@ is now localised during calls to binmode to prevent action at a distance [perl #78844].
PL_isarev, which is accessible to Perl via mro::get_isarev, is now updated properly when packages are deleted or removed from the @ISA of other classes. This allows many packages to be created and deleted without causing a memory leak [perl #75176].
undef *Foo:: and undef *Foo::ISA and delete $package::{ISA} used not to update the internal isa caches if the stash or @ISA array had a reference elsewhere. In fact, undef *Foo::ISA would stop a new @Foo::ISA array from updating caches.
@ISA arrays can now be shared between classes via *Foo::ISA = \@Bar::ISA or *Foo::ISA = *Bar::ISA [perl #77238].
The parser no longer hangs when encountering certain Unicode characters, such as U+387 [perl #74022].
formline no longer crashes when passed a tainted format picture. It also taints $^A now if its arguments are tainted [perl #79138].
A signal handler called within a signal handler could cause leaks or double-frees. Now fixed [perl #76248].
When trying to report Use of uninitialized value $Foo::BAR, crashes could occur if the GLOB of the global variable causing the warning has been detached from its original stash by, for example, delete $::{'Foo::'}. This has been fixed by disabling the reporting of variable names in the warning in those cases.
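Two of the regular expression fixes listed above come with one-line reproductions, which can be turned into a small regression check along these lines. The variable names are invented; on a perl that includes the fixes both expressions should match.

```perl
use strict;
use warnings;
use Encode ();

# [perl #78464]: a UTF8-encoded Latin-1 character against a repeated
# group in a pattern that is not UTF8-encoded.
my $latin1_in_group =
    Encode::decode_utf8("\303\200") =~ /(\xc0)+/ ? 1 : 0;

# [perl #78356]: an empty group inside an alternation that is handled
# by the trie optimisation.
my $trie_empty_group =
    'foo' =~ /\A(?:(?:)foo|bar|zot)\z/ ? 1 : 0;

print "repeated group matched: $latin1_in_group\n";
print "trie alternation matched: $trie_empty_group\n";
```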
Obituary
Randy Kobes, creator of the kobesearch alternative to search.cpan.org and contributor/maintainer to several core Perl toolchain modules, passed away on September 18, 2010 after a battle with lung cancer. His contributions to the Perl community will be missed.
Acknowledgements
Perl 5.13.7 represents approximately one month of development since Perl 5.13.6 and contains 73100 lines of changes across 518 files from 39 authors and committers:
Abhijit Menon-Sen, Abigail, Ben Morrow, Chas. J. Owens IV, Chris 'BinGOs' Williams, Craig A. Berry, David Golden, David Mitchell, Father Chrysostomos, Fingle Nark, Florian Ragwitz, George Greer, Grant McLean, H.Merijn Brand, Ian Goodacre, Jan Dubois, Jerry D. Hedden, Jesse Vincent, Karl Williamson, Lubomir Rintel, Marty Pauley, Moritz Lenz, Nicholas Clark, Nicolas Kaiser, Niko Tyni, Peter John Acklam, Rafael Garcia-Suarez, Shlomi Fish, Steffen Mueller, Steve Hay, Tatsuhiko Miyagawa, Tim Bunce, Todd Rinaldo, Tom Christiansen, Tom Hukins, Tony Cook, Yves Orton, Zefram and brian d foy
Many of the changes included in this version originated in the CPAN modules included in Perl's core. We're grateful to the entire CPAN community for helping Perl to flourish.
Reporting Bugs
If you find what you think is a bug, you might check the articles recently posted to the comp.lang.perl.misc newsgroup and the perl bug database at http://rt.perl.org/perlbug/ . There may also be information at http://www.perl.org/ , the Perl Home Page.
If you believe you have an unreported bug, please run the perlbug program included with your release. Be sure to trim your bug down to a tiny but sufficient test case. Your bug report, along with the output of perl -V, will be sent off to perlbug@perl.org to be analysed by the Perl porting team.
If the bug you are reporting has security implications, which make it inappropriate to send to a publicly archived mailing list, then please send it to perl5-security-report@perl.org. This points to a closed subscription unarchived mailing list, which includes all the core committers, who will be able to help assess the impact of issues, figure out a resolution, and help co-ordinate the release of patches to mitigate or fix the problem across all platforms on which Perl is supported. Please only use this address for security issues in the Perl core, not for modules independently distributed on CPAN.
SEE ALSO
The Changes file for an explanation of how to view exhaustive details on what changed.
The INSTALL file for how to build Perl.
The README file for general stuff.
The Artistic and Copying files for copyright information.