lib/Perl6/Doc/Bible/A02.pod


            
              —
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
1145
1146
1147
1148
1149
1150
1151
1152
1153
1154
1155
1156
1157
1158
1159
1160
1161
1162
1163
1164
1165
1166
1167
1168
1169
1170
1171
1172
1173
1174
1175
1176
1177
1178
1179
1180
1181
1182
1183
1184
1185
1186
1187
1188
1189
1190
1191
1192
1193
1194
1195
1196
1197
1198
1199
1200
1201
1202
1203
1204
1205
1206
1207
1208
1209
1210
1211
1212
1213
1214
1215
1216
1217
1218
1219
1220
1221
1222
1223
1224
1225
1226
1227
1228
1229
1230
1231
1232
1233
1234
1235
1236
1237
1238
1239
1240
1241
1242
1243
1244
1245
1246
1247
1248
1249
1250
1251
1252
1253
1254
1255
1256
1257
1258
1259
1260
1261
1262
1263
1264
1265
1266
1267
1268
1269
1270
1271
1272
1273
1274
1275
1276
1277
1278
1279
1280
1281
1282
1283
1284
1285
1286
1287
1288
1289
1290
1291
1292
1293
1294
1295
1296
1297
1298
1299
1300
1301
1302
1303
1304
1305
1306
1307
1308
1309
1310
1311
1312
1313
1314
1315
1316
1317
1318
1319
1320
1321
1322
1323
1324
1325
1326
1327
1328
1329
1330
1331
1332
1333
1334
1335
1336
1337
1338
1339
1340
1341
1342
1343
1344
1345
1346
1347
1348
1349
1350
1351
1352
1353
1354
1355
1356
1357
1358
1359
1360
1361
1362
1363
1364
1365
1366
1367
1368
1369
1370
1371
1372
1373
1374
1375
1376
1377
1378
1379
1380
1381
1382
1383
1384
1385
1386
1387
1388
1389
1390
1391
1392
1393
1394
1395
1396
1397
1398
1399
1400
1401
1402
1403
1404
1405
1406
1407
1408
1409
1410
1411
1412
1413
1414
1415
1416
1417
1418
1419
1420
1421
1422
1423
1424
1425
1426
1427
1428
1429
1430
1431
1432
1433
1434
1435
1436
1437
1438
1439
1440
1441
1442
1443
1444
1445
1446
1447
1448
1449
1450
1451
1452
1453
1454
1455
1456
1457
1458
1459
1460
1461
1462
1463
1464
1465
1466
1467
1468
1469
              
=encoding utf-8
=head1 NAME
Apocalypse_2 - Bits and Pieces
=head1 AUTHOR
Larry Wall <larry@wall.org>
=head1 VERSION
  Maintainer: Larry Wall <larry@wall.org>
  Date: 3 May 2001
  Last Modified: 18 May 2006
  Number: 2
  Version: 6
Here's Apocalypse 2, meant to be read in conjunction with Chapter 2 of
the Camel Book. The basic assumption is that if Chapter 2 talks about
something that I don't discuss here, it doesn't change in Perl 6. (Of
course, it could always just be an oversight. One might say that people
who oversee things have a gift of oversight.)
Before I go further, I would like to thank all the victims, er,
participants in the RFC process. (I beg special forgiveness from those
whose brains I haven't been able to get inside well enough to
incorporate their ideas). I would also like to particularly thank
Damian Conway, who will recognize many of his systematic ideas here,
including some that have been less than improved by my meddling.
Here are the RFCs covered:
    RFC  PSA  Title
    ---  ---  -----
      Textual
    005  cdr  Multiline Comments for Perl
    102  dcr  Inline Comments for Perl
      Types
    161  adb  Everything in Perl Becomes an Object
    038  bdb  Standardise Handling of Abnormal Numbers Like Infinities and NaNs
    043  bcb  Integrate BigInts (and BigRats) Support Tightly With the Basic Scalars
    192  ddr  Undef Values ne Value
    212  rrb  Make Length(@array) Work
    218  bcc  C<my Dog $spot> Is Just an Assertion
      Variables
    071  aaa  Legacy Perl $pkg'var Should Die
    009  bfr  Highlander Variable Types
    133  bcr  Alternate Syntax for Variable Names
    134  bcc  Alternative Array and Hash Slicing
    196  bcb  More Direct Syntax for Hashes
    201  bcr  Hash Slicing
      Strings
    105  aaa  Remove "In string @ must be \@" Fatal Error
    111  aaa  Here Docs Terminators (Was Whitespace and Here Docs)
    162  abb  Heredoc Contents
    139  cfr  Allow Calling Any Function With a Syntax Like s///
    222  abb  Interpolation of Object Method Calls
    226  acr  Selective Interpolation in Single Quotish Context
    237  adc  Hashes Should Interpolate in Double-Quoted Strings
    251  acr  Interpolation of Class Method Calls
    252  abb  Interpolation of Subroutines
    327  dbr  C<\v> for Vertical Tab
    328  bcr  Single Quotes Don't Interpolate \' and \\
      Files
    034  aaa  Angle Brackets Should Not Be Used for File Globbing
    051  ccr  Angle Brackets Should Accept Filenames and Lists
      Lists
    175  rrb  Add C<list> Keyword to Force List Context (like C<scalar>)
      Retracted
    010  rr  Filehandles Should Use C<*> as a Type Prefix If Typeglobs Are Eliminated
    103  rr  Fix C<$pkg::$var> Precedence Issues With Parsing of C<::>
    109  rr  Less Line Noise - Let's Get Rid of @%
    245  rr  Add New C<empty> Keyword to DWIM for Clearing Values
    263  rr  Add Null() Keyword and Fundamental Data Type
=head1 Atoms
Perl 6 programs are notionally written in Unicode, and assume Unicode
semantics by default even when they happen to be processing other
character sets behind the scenes. Note that when we say that Perl is
written in Unicode, we're speaking of an abstract character set, not
any particular encoding. (The typical program will likely be written in
UTF-8 in the West, and in some 16-bit character set in the East.)
=head1 Molecules
=head2 RFC 005: Multiline Comments for Perl
I admit to being prejudiced on this one -- I was unduly influenced at a
tender age by the rationale for the design of Ada, which made a good
case, I thought, for leaving multiline comments out of the language.
But even if I weren't blindly prejudiced, I suspect I'd look at the
psychology of the thing, and notice that much of the time, even in
languages that have multiline comments, people nevertheless tend to use
them like this:
    /*
     *  Natter, natter, natter.
     *  Gromish, gromish, gromish.
     */
The counterargument to that is, of course, that people don't I<always>
do that in C, so why should they have to do it in Perl? And if there
were no other way to do multiline comments in Perl, they'd have a
stronger case. But there already is another way, albeit one rejected by
this RFC as "a workaround."
But it seems to me that, rather than adding another kind of comment or
trying to make something that looks like code behave like a comment,
the solution is simply to fix whatever is wrong with POD so that its
use for commenting can no longer be considered a workaround. Actual
design of POD can be put off till Apocalypse 26, but we can speculate
at this point that the rules for switching back and forth between POD
and Perl are suboptimal for use in comments. If so, then it's likely
that in Perl 6 we'll have a rule like this: If a C<=begin MUMBLE>
transitions from Perl to POD mode then the corresponding C<=end MUMBLE>
should transition back (without a C<=cut> directive).
Note that we haven't defined our C<MUMBLE>s yet, but they can be set up
to let our program have any sort of programmatic access to the data
that we desire. For instance, it is likely that comments of this kind
could be tied in with some sort of literate (or at least, semiliterate)
programming framework.
=head2 RFC 102: Inline Comments for Perl
I have never much liked inline comments -- as commonly practiced they
tend to obfuscate the code as much as they clarify it. That being said,
"All is fair if you predeclare." So there should be nothing
preventing someone from writing a lexer regex that handles them,
provided we make the lexer sufficiently mutable. Which we will. (As it
happens, the character sequence "C</*>" will be unlikely to occur in
standard Perl 6. Which I guess means it I<is> likely to occur in
nonstandard Perl 6. C<:-)>
A pragma declaring nonstandard commenting would also allow people to
use C</* */> for multiline comments, if they like. (But I still think
it'd be better to use POD directives for that, just to keep the text
accessible to the program.)
[Update: It eventually became apparent (after five years!) that we
could simplify the distinction between postfix and infix operators
if we had a general way to embed comments, so we now have a general
quote-like mechanism for embedded comments such that you can say
C<$foo\#(bar).baz> to mean C<$foo.baz>.  Basically, if the C<#>
character is immediately followed by a bracket, that bracket pair
determines the scope of the comment.  (If you're wondering how the
backslash/dot become one dot in the example, see the explanation of the
"long dot" in S02.)]
=head1 Built-In Data Types
The basic change here is that, rather than just supporting scalars,
arrays and hashes, Perl 6 supports opaque objects as a fourth
fundamental data type. (You might think of them as pseudo-hashes done
right.) While a class can access its object attributes any way it
likes, all external access to opaque objects occurs through methods,
even for attributes. (This guarantees that attribute inheritance works
correctly.)
While Perl 6 still defaults to typeless scalars, Perl will be able to
give you more performance and safety as you give it more type
information to work with. The basic assumption is that homogenous data
structures will be in arrays and hashes, so you can declare the type of
the scalars held in an array or hash. Heterogenous structures can still
be put into typeless arrays and hashes, but in general Perl 6 will
encourage you to use classes for such data, much as C encourages you to
use structs rather than arrays for such data.
One thing we'll be mentioning before we discuss it in detail is the
notion of "properties." (In Perl 5, we called these "attributes,"
but we're reserving that term for actual object attributes these days,
so we'll call these things "properties.") Variables and values can
have additional data associated with them that is "out of band" with
respect to the ordinary typology of the variable or value. For now,
just think of properties as a way of adding ad hoc attributes to a
class that doesn't support them. You could also think of it as a form
of class derivation at the granularity of the individual object,
without having to declare a complete new class.
[Update: We're now calling compile-time properties "traits".  And objects
don't really have properties separate from their attributes--this is now
handled with a mixin mechanism.]
=head2 RFC 161: Everything in Perl Becomes an Object.
This is essentially a philosophical RFC that is rather short on detail.
Nonetheless, I agree with the premise that all Perl objects should act
like objects if you choose to treat them that way. If you choose not to
treat them as objects, then Perl will try to go along with that, too.
(You may use hash subscripting and slicing syntax to call attribute
accessors, for instance, even if the attributes themselves are not
stored in a hash.) Just because Perl 6 is more object-oriented
internally, does not mean you'll be forced to think in object-oriented
terms when you don't want to. (By and large, there will be a few places
where OO-think is more required in Perl 6 than in Perl 5. Filehandles
are more object-oriented in Perl 6, for instance, and the special
variables that used to be magically associated with the currently
selected output handle are better specified by association with a
specific filehandle.)
=head2 RFC 038: Standardise Handling Of Abnormal Numbers Like
Infinities and NaNs
This is likely to slow down numeric processing in some locations.
Perhaps it could be turned off when desirable. We need to be careful
not to invent something that is guaranteed to run slower than IEEE
floating point. We should also try to avoid defining a type system that
makes translation of numeric types to Java or C# types problematic.
That being said, standard semantics are a good thing, and should be the
default behavior.
=head2 RFC 043: Integrate BigInts (and BigRats) Support Tightly With
the Basic Scalars
This RFC suggests that a pragma enables the feature, but I think it
should probably be tied to the run-time type system, which means it's
driven more by how the data is created than by where it happens to be
stored or processed. I don't see how we can make it a pragma, except
perhaps to influence the meaning of "int" and "num" in actual
declarations further on in the lexical scope:
    use bigint;
    my int $i;
might really mean
    my bigint $i;
or maybe just
    my int $i is bigint;
since representation specifications might just be considered part of
the "fine print." But the whole subject of lexically scoped variable
properties specifying the nature of the objects they contain is a bit
problematic. A variable is a sort of mini-interface, a contract if you
will, between the program and the object in question. Properties that
merely influence how the program sees the object are not a problem --
when you declare a variable to be constant, you're promising not to
modify the object through that variable, rather than saying something
intrinsically true about the object. (Not that there aren't objects
that are intrinsically constant.)
Other property declarations might need to have some say in how
constructors are called in order to guarantee consistency between the
variable's view of the object, and the nature of the object itself. In
the worst case we could try to enforce consistency at run time, but
that's apt to be slow. If every assignment of a C<Dog> object to a
C<Mammal> variable has to check to see whether C<Dog> is a C<Mammal>,
then the assignment is going to be a dog.
So we'll have to revisit this when we're defining the relationship
between variable declarations and constructors. In any event, if we
don't make Perl's numeric types automatically promote to big
representations, we should at least make it easy to specify it when you
I<want> that to happen.
[Update: The C<Int> type automatically upgrades to arbitrary precision
internally.  The C<int> type does not.]
=head2 RFC 192: Undef Values ne Value
I've rejected this one, because I think something that's undefined
should be considered just that, undefined. I think the standard
semantics are useful for catching many kinds of errors.
That being said, it'll hopefully be easy to modify the standard
operators within a particular scope, so I don't think we need to think
that our way to think is the only way to think, I think.
=head2 RFC 212: Make C<length(@array)> Work
Here's an oddity, an RFC that the author retracted, but that I accept,
more or less. I think C<length(@array)> should be equivalent to
C<@array.length()>, so if there's a C<length> method available, it
should be called.
The question is whether there should be a C<length> method at all, for
strings or arrays. It almost makes more sense for arrays than it does
for strings these days, because when you talk about the length of a
string, you need to know whether you're talking about byte length or
character length. So we may split up the traditional length function
into two, in which case we might end up with:
    $foo.chars
    $foo.bytes
    @foo.elems
Or some such. Whatever the method names we choose, differentiating them
would be more powerful in supplying context. For instance, one could
envision calling C<@foo.bytes> to return the byte length of all the
strings. That wouldn't fly if we overloaded the method name.
Even C<chars($foo)> might not be sufficiently precise, since, depending
on how you're processing Unicode, you might want to know how long the
string is in actual characters, not counting combining characters that
don't take extra space. But that's a topic for later.
[Update: There is no C<length> function.  There are C<bytes>,
C<codes>, C<graphs>, and C<langs> methods for the various Unicode
support levels.  (The C<chars> method returns one of those values
depending on the current Unicode support level.)  Arrays and hashes
report number of elements with the C<elems> method.]
=head2 RFC 218: C<my Dog $spot> Is Just an Assertion
I expect that a declaration of the form:
    my Dog $spot;
is merely an assertion that you will not use C<$spot> inconsistently
with it being a C<Dog>. (But I mean something different by
"assertion" than this RFC does.) This assertion may or may not be
tested at every assignment to C<$spot>, depending on pragmatic context.
This bare declaration does not call a constructor; however, there may
be forms of declaration that do. This may be necessary so that the
variable and the object can pass properties back and forth, and in
general, make sure they're consistent with each other. For example, you
might declare an array with a multidimensional shape, and this shape
property needs to be visible to the constructor, if we don't want to
have to specify it redundantly.
On the other hand, we might be able to get assignment sufficiently
overloaded to accomplish the same goal, so I'm deferring judgment on
that. All I'm deciding here is that a bare declaration without
arguments as above does not invoke a constructor, but merely tells the
compiler something.
[Update: The constructor may be called using the C<.=new()> construct.]
=head2 Other Decisions About Types
Built-in object types will be in all uppercase: C<INTEGER>, C<NUMBER>,
C<STRING>, C<REF>, C<SCALAR>, C<ARRAY>, C<HASH>, C<REGEX> and C<CODE>.
Corresponding to at least some of these, there will also be lowercase
intrinsic types, such as C<int>, C<num>, C<str> and C<ref>. Use of the
lowercase typename implies you aren't intending to do anything fancy
OO-wise with the values, or store any run-time properties, and thus
Perl should feel free to store them compactly. (As a limiting case,
objects of type C<bit> can be stored in one bit.) This distinction
corresponds roughly to the boxed/unboxed distinction of other computer
languages, but it is likely that Perl 6 will attempt to erase the
distinction for you to the extent possible. So, for instance, an C<int>
may still be used in a string context, and Perl will convert it for
you, but it won't cache it, so the next time you use it as a string, it
will have to convert again.
[Update: The object types are no longer all caps, but C<Int>, C<Num>,
C<Str>, etc.]
The declared type of an array or hash specifies the type of each
element, not the type of an array or hash as a whole. This is justified
by the notion that an array or hash is really just a strange kind of
function that (typically) takes a subscript as an argument and returns
a value of a particular type. If you wish to associate a type with the
array or hash as a whole, that involves setting a C<tie> property. If
you find yourself wishing to declare different types on different
elements, it probably means that you should either be using a class for
the whole heterogenous thing, or at least declare the type of array or
hash that will be a base class of all the objects it will contain.
Of course, untyped arrays and hashes will be just as acceptable as they
are currently. But a language can only run so fast when you force it to
defer all type checking and method lookup till run time.
The intent is to make use of type information where it's useful, and
not require it where it's not. Besides performance and safety, one
other place where type information is useful is in writing interfaces
to other languages. It is postulated that Perl 6 will provide enough
optional type declaration syntax that it will be unnecessary to write
XS-style glue in most cases.
[Update: Turns out one of the most important reasons for adding type
information is that it allows for multimethod dispatch.]
=head1 Variables
=head2 RFC 071: Legacy Perl $pkg'var Should Die
I agree. I was unduly influenced by Ada syntax here, and it was a
mistake. And although we're adding a properties feature into Perl 6
that is much like Ada's attribute feature, we won't make the mistake of
reintroducing a syntax that drives highlighting editors nuts. We'll try
to make different mistakes this time.
=head2 RFC 009: Highlander Variable Types
I basically agree with the problem this RFC is trying to solve, but I
disagree with the proposed solution. The basic problem is that, while
the idiomatic association of C<$foo[$bar]> with C<@foo> rather than
C<$foo> worked fine in Perl 4, when we added recursive data structures
to Perl 5, it started getting in the way notationally, so that initial
funny character was trying to do too much in both introducing the
"root" of the reference, as well as the context to apply to the final
subscript. This necessitated odd looking constructions like:
    $foo->[1][2][3]
This RFC proposes to solve the dilemma by unifying scalar variables
with arrays and hashes at the name level. But I think people like to
think of C<$foo>, C<@foo> and C<%foo> as separate variables, so I don't
want to break that. Plus, the RFC doesn't unify C<&foo>, while it's
perfectly possible to have a reference to a function as well as a
reference to the more ordinary data structures.
So rather than unifying the names, I believe all we have to do is unify
the treatment of variables with respect to references. That is, all
variables may be thought of as references, not just scalars. And in
that case, subscripts always dereference the reference implicit in the
array or hash named on the left.
This has two major implications, however. It means that Perl
programmers must learn to write C<@foo[1]> where they used to write
C<$foo[1]>. I think most Perl 5 people will be able to get used to
this, since many of them found the current syntax a bit weird in the
first place.
The second implication is that slicing needs a new notation, because
subscripts no longer have their scalar/list context controlled by the
initial funny character. Instead, the context of the subscript will
need to be controlled by some combination of:
=over
=item 1. Context of the entire term.
=item 2. Appearance of known list operators in the subscript, such as
comma or range.
=item 3. Explicit syntax casting the inside of the subscript to list or
scalar context.
=item 4. Explicit declaration of default behavior.
=back
One thing that probably shouldn't enter into it is the run-time type of
the array object, because context really needs to be calculated at
compile time if at all possible.
In any event, it's likely that some people will want subscripts to
default to scalars, and other people will want them to default to
lists. There are good arguments for either default, depending on
whether you think more like an APL programmer or a mere mortal.
[Update: Rvalue subscripts are always list context, but it's trivial to
force scalar context with either of the C<+> or C<~> unary operators.
Lvalue subscripts are scalar context unless the lvalue is in parentheses.]
There are other larger implications. If composite variables are thought
of as scalar references, then the names C<@foo> and C<%foo> are really
scalar variables unless explicitly dereferenced. That means that when
you mention them in a scalar context, you get the equivalent of Perl
5's C<\@foo> and C<\%foo>. This simplifies the prototyping system
greatly, in that an operator like C<push> no longer needs to specify
some kind of special reference context for its first argument -- it can
merely specify a scalar context, and that's good enough to assume the
reference generation on its first argument. (Of course, the function
signature can always be more specific if it wants to. More about that
in future installments.)
There are also implications for the assignment operator, in that it has
to be possible to assign array references to array variables without
accidentally invoking list context and copying the list instead of the
reference to the list. We could invent another assignment operator to
distinguish the two cases, but at the moment it looks as though bare
variables and slices will behave as lvalues just as they do in Perl 5,
while lists in parentheses will change to a binding of the right-hand
arguments more closely resembling the way Perl 6 will bind formal
arguments to actual arguments for function calls. That is to say,
    @foo = (1,2,3);
will supply an unbounded list context to the right side, but
    (@foo, @bar) = (@bar, @foo)
will supply a context to the right side that requests two scalar values
that are array references. This will be the default for unmarked
variables in an lvalue list, but there will be an easy way to mark
formal array and hash parameters to slurp the rest of the arguments
with list context, as they do by default in Perl 5.
(Alternately, we might end up leaving the ordinary list assignment
operator with Perl 5 semantics, and define a new assignment operator
such as C<:=> that does signatured assignment. I can argue that one
both ways.)
[Update: We ended up with a C<:=> binding operator.]
Just as arrays and hashes are explicitly dereferenced via subscripting
(or implicitly dereferenced in list context), so too functions are
merely named but not called by C<&foo>, and explicitly dereferenced
with parentheses (or by use as a bare name without the ampersand (or
both)). The Perl 5 meanings of the ampersand are no longer in effect,
in that ampersand will no longer imply that signature matching is
suppressed -- there will be a different mechanism for that. And since
C<&foo> without parens doesn't do a call, it is no longer possible to
use that syntax to automatically pass the C<@_> array -- you'll have to
do that explicitly now with C<foo(@_)>.
Scalar variables are special, in that they may hold either references
or actual "native" values, and there is no special dereference syntax
as there is for other types. Perl 6 will attempt to hide the
distinction as much as possible. That is, if C<$foo> contains a native
integer, calling the C<$foo.bar> method will call a method on the
built-in type. But if C<$foo> contains a reference to some other
object, it will call the method on that object. This is consistent with
the way we think about overloading in Perl 5, so you shouldn't find
this behavior surprising. It may take special syntax to get at any
methods of the reference variable itself in this case, but it's OK if
special cases are special.
[Update: The C<variable($foo)> pseudo-function allows you to specify the
container rather than the contained object.]
=head2 RFC 133: Alternate Syntax for Variable Names
This RFC has a valid point, but in fact we're going to do just the
opposite of what it suggests. That is, we'll consider the funny
characters to be part of the name, and use the subscripts for context.
This works out better, because there's only one funny character, but
many possible forms of dereferencing.
[Update: Nowadays we call those funny characters I<sigils>.  And for weirdly
scoped variables there's a second character called a I<twigil>.]
=head2 RFC 134: Alternative Array and Hash Slicing
We're definitely killing Perl 5's slice syntax, at least as far as
relying on the initial character to determine the context of the
subscript. There are many ways we could reintroduce a slicing syntax,
some of which are mentioned in this RFC, but we'll defer the decision
on that till Apocalypse 9 on Data Structures, since the interesting
parts of designing slice syntax will be driven by the need to slice
multidimensional arrays.
[Update: There is no Apocalypse 9, but there is a Synopsis 9 that
covers these matters.]
For now we'll just say that arrays can have subscript signatures much
like functions have parameter signatures. Ordinary one-dimensional
arrays (and hashes) can then support some kind of simple slicing syntax
that can be extended for more complicated arrays, while allowing
multidimensional arrays to distinguish between simple slicing and
complicated mappings of lists and functions onto subscripts in a manner
more conducive to numerical programming.
On the subject of hash slices returning pairs rather than values, we
could distinguish this with special slice syntax, or we could establish
the notion of a hashlist context that tells the slice to return pairs
rather than just values. (We may not need a special slice syntax for
that if it's possible to typecast back and forth between pair lists and
ordinary lists.)
[Update: Slicing to get a pairlist can be done by attaching a C<:p>
modifier to the subscript.  In general though there's no such thing
as a hashlist context.  It's just that the list context supplied by
assignment to a hash happens to know how to deal with pairs.]
=head2 RFC 196: More Direct Syntax for Hashes
This RFC makes three proposals, which we'll consider separately.
Proposal 1 is "that a hash in scalar context evaluate to the number of
keys in the hash." (You can find that out now, but only by using the
C<keys()> function in scalar context.) Proposal 1 is OK if we change
"scalar context" to "numeric context," since in scalar context a
hash will produce a reference to the hash, which just happens to numify
to the number of entries.
We must also realize that some implementations of hash might have to go
through and count all the entries to return the actual number.
Fortunately, in boolean context, it suffices to find a single entry to
determine whether the hash contains anything. However, on hashes that
don't keep track of the number of entries, finding even one entry might
reset any active iterator on the hash, since some implementations of
hash (in particular, the ones that don't keep track of the number of
entries) may only supply a single iterator.
[Update: You may also call C<.elems> to be more explicit.]
Proposal 2 is "that the iterator in a hash be reset through an
explicit call to the C<reset()> function." That's fine, with the
proviso that it won't be a function, but rather a I<method> on the HASH
class.
[Update: all list contexts in Perl 6 are lazy by default, and different
list contexts generate their own iterators, so all you have to do to
"reset" and iterator is stop reading from the list in question.]
Proposal 3 is really about C<sort> recognizing pairs and doing the
right thing. Defaulting to sorting on C<$^a[0] cmp $^b[0]> is likely to
be reasonable, and that's where a pair's key would be found. However,
it's probable that the correct solution is simply to provide a default
string method for anonymous lists that happens to produce a decent key
to sort on when C<cmp> requests a string representation of either of
its arguments. The C<sort> itself should probably just concentrate on
memoizing the returned strings so they don't have to be recalculated.
[Update: The C<sort> interface has been completely revamped since this
was written.  This will eventually appear in S29, but as of now it's
just in the perl6-language archives.]
=head2 RFC 201: Hash Slicing
This RFC proposes to use C<%> as a marker for special hash slicing in
the subscript. Unfortunately, the C<%> funny character will not be
available for this use, since all hash refs will start with C<%>.
Concise list comprehensions will require some other syntax within the
subscript, which will hopefully generalize to arrays as well.
=head2 Other Decisions About Variables
Various special punctuation variables are gone in Perl 6, including all
the deprecated ones. (Non-deprecated variables will be replaced by some
kind of similar functionality that is likely to be invoked through some
kind of method call on the appropriate object. If there is no
appropriate object, then a named global variable might provide similar
functionality.)
Freeing up the various bracketing characters allows us to use them for
other purposes, such as interpolation of expressions:
    "$(expr)"           # interpolate a scalar expression
    "@(expr)"           # interpolate a list expression
[Update: Those forms mean something else now (casting).  Expression
interpolation is normally done via closure.]
C<$#foo> is gone. If you want the final subscript of an array, and
C<[-1]> isn't good enough, use C<@foo.end> instead.
Other special variables (such as the regex variables) will change from
dynamic scoping to lexical scoping. It is likely that even C<$_> and
C<@_> will be lexically scoped in Perl 6.
[Update: And indeed they are.  But they happen to be a special kind of
lexical variable called an "environment" variable, modeled on Unix
environment variables.  This allows subroutines to get at them and use
them as defaults, in a pronominal sort of way.]
=head1 Names
In Perl 5, lexical scopes are unnamed and unnameable. In Perl 6, the
current lexical scope will have a name that is visible within the
lexical scope as the pseudo class C<MY>, so that such a scope can, if
it so chooses, delegate management of its lexical scope to some other
module at compile time. In normal terms, that means that when you use a
module, you can let it import things lexically as well as packagely.
[Update: The currently compiling lexical scope may also be named from anywhere
as the C<COMPILING> pseudopackage these days.]
Typeglobs are gone. Instead, you can get at a variable object through
the symbol table hashes that are structured much like Perl 5's. The
variable object for C<$MyPackage::foo> is stored in:
    %MyPackage::{'$foo'}
Note that the funny character is part of the name. There is no longer
any structure in Perl that associates everything with the name
"C<foo>".
[Update: The right way to say that now is "C<< MyPackage::<$foo> >>".
Hence the C<$foo> variable in the scope currently being compiled is
known as C<< COMPILING::<$foo> >>.]
Perl's special global names are stored in a special package named
"C<*>" because they're logically in every scope that does not hide
them. So the unambiguous name of the standard input filehandle is
C<$*STDIN>, but a package may just refer to C<$STDIN>, and it will
default to C<$*STDIN> if no package or lexical variable of that name
has been declared.
[Update: We did s/STD// on those, so standard input is now just C<$*IN>.]
Some of these special variables may actually be cloned for each lexical
scope or each thread, so just because a name is in the special global
symbol table doesn't mean it always behaves as a global across all
modules. In particular, changes to the symbol table that affect how the
parser works must be lexically scoped. Just because I install a special
rule for my cool new hyperquoting construct doesn't mean everyone else
should have to put up with it. In the limiting case, just because I
install a Python parser, it shouldn't force other modules into a maze
of twisty little whitespace, all alike.
Another way to look at it is that all names in the "C<*>" package are
automatically exported to every package and/or outer lexical scope.
[Update: The names are no longer automatically exported, but you can import
them from the global namespace via "C<use GLOBALS '$IN', '$OUT';>" and such.]
=head1 Literals
=head2 Underscores in Numeric Literals
Underscores will be allowed between any two digits within a number.
=head2 RFC 105: Remove "In string @ must be \@" Fatal Error
Fine.
[Update: The interpolation rules for arrays have been completely revised.
A bare array name no longer interpolates--you have to say C<@foo[]>.]
=head2 RFC 111: Here Docs Terminators (Was Whitespace and Here Docs)
Fine.
=head2 RFC 162: Heredoc contents
I think I like option (e) the best: remove whitespace equivalent to the
terminator.
By default, if it has to dwim, it should dwim assuming that hard tabs
are 8 spaces wide. This should not generally pose a problem, since most
of the time the tabbing will be consistent throughout anyway, and no
dwimming will be necessary. This puts the onus on people using
nonstandard tabs to make sure they're consistent so that Perl doesn't
have to guess.
Any additional mangling can easily be accomplished by a user-defined
operator.
[Update: Here docs are now just a :to variant on extensible quotes, so
any customization you can do to C<q/foo/> you can also do to C<q:to/END/>.
=head2 RFC 139: Allow Calling Any Function With a Syntax Like s///
Creative quoting will be allowed with lexical mutataion, but we can't
parse C<foo(bar)> two different ways simultaneously, and I'm unwilling
to prevent people from using parens as quote characters. I don't see
how we can reasonably have new quote operators without explicit
declaration. And if the utility of a quote-like operator is sufficient,
there should be little relative burden in requiring such a declaration.
The form of such a declaration is left to the reader as an exercise in
function property definition. We may revisit the question later in this
series. It's also possible that a quote operator such as C<qx//> could
have a corresponding function name like C<quote:qx> that could be
invoked as a function.
=head2 RFC 222: Interpolation of Object Method Calls
I've been hankering for methods to interpolate for a long time, so I'm
in favor of this RFC. And it'll become doubly important as we move
toward encouraging people to use accessor methods to refer to object
attributes outside the class itself.
I have one "but," however. Since we'll switch to using C&lt;.> instead
of C<< -> >>, I think for sanity's sake we may have to require the
parentheses, or "C<$file.$ext>" is going to give people fits. Not to
mention "C<$file.ext>".
[Update: Nowadays we also require brackets on array interpolations
and braces on hash interpolations.  See S03 for more.]
=head2 RFC 226: Selective Interpolation in Single Quotish Context.
This proposal has much going for it, but there are also difficulties,
and I've come close to rejecting it outright simply because the
single-quoting policy of Perl 5 has been successful. And I think the
proposal in this RFC for C<\I>...C<\E> is ugly. (And I'd like to kill
C<\E> anyway, and use bracketed scopings.)
However, I think there is a major "can't get there from here" that we
could solve by treating interpolation into single quotes as something
hard, not something easy. The basic problem is that it's too easy to
run into a C<\$> or C<\@> (or a C<\I> for that matter) that wants to be
taken literally. I think we could allow the interpolation of arbitrary
expressions into single-quoted strings, but only if we limit it to an
unlikely sequence where three or more characters are necessary for
recognition. The most efficient mental model would seem to be the idea
of embedding one kind of quote in another, so I think this:
    \q{stuff}
will embed single-quoted stuff, while this:
    \qq{stuff}
will embed double-quoted stuff. A variable could then be interpolated
into a single-quoted string by saying:
    \qq{$foo}
=head2 RFC 237: Hashes Should Interpolate in Double-Quoted Strings
I agree with this RFC in principle, but we can't define the default
hash stringifier in terms of variables that are going away in Perl 6,
so the RFC's proposal of using C<$"> is right out.
All objects should have a method by which they produce readable output.
How this may be overridden by user preference is open to debate.
Certainly, dynamic scoping has its problems. But lexical override of an
object's preferences is also problematic. Individual object properties
appear to give a decent way out of this. More on that below.
[Update: Hash values by default interpolate with tabs between key
and value, and with newline between pairs.  But you can give it a specific
format with the C<.as> method.]
On C<printf> formats, I don't see any way to dwim that C<%d> isn't an
array, so we'll just have to put formats into single quotes in general.
Those format strings that also interpolate variables will be able to
use the new C<\qq{$var}> feature.
[Update: Since hash interpolations require braces now, C<printf> formats
are safe again (unless they happen to be followed by curlies).]
Note for those who are thinking we should just stick with Perl 5
interpolation rules: We have to allow C<%> to introduce interpolation
now because individual hash values are no longer named with
C<$foo{$bar}>, but rather C<%foo{$bar}>. So we might as well allow
interpolation of complete hashes.
=head2 RFC 251: Interpolation of Class Method Calls
Class method calls are relatively rare (except for constructors, which
will be rarely interpolated). So rather than scanning for identifiers
that might introduce a class, I think we should just depend on
expression interpolation instead:
    "There are $(Dog.numdogs) dogs."
[Update: That's now done with closure interpolation.]
=head2 RFC 252: Interpolation of Subroutines
I think subroutines should interpolate, provided they're introduced
with the funny character. (On the other hand, how hard is 
C<$(sunset $date)> or C<@(sunset $date)>? On the gripping hand, I 
like the consistency of C<&> with C<$>, C<@> and C<%>.)
I think the parens are required, since in Perl 6, scalar C<&sub> will
just return a reference, and require parens if you really want to deref
the sub ref. (It's true that a subroutine can be called without parens
when used as a list operator, but you can't interpolate those without a
funny character.)
For those worried about the use of C<&> for signature checking
suppression, we should point out that C<&> will no longer be the way to
suppress signature checking in Perl 6, so it doesn't matter.
=head2 RFC 327: C<\v> for Vertical Tab
I think the opportunity cost of not reserving C<\v> for future use is
too high to justify the small utility of retaining compatibility with a
feature virtually nobody uses anymore. For instance, I almost used
C<\v> and C<\V> for switching into and out of verbatim (single-quote)
mode, until I decided to unify that with quoting syntax and use
C<\qq{}> and C<\q{}> instead.
[Update: Turns out that C<\v> matches vertical whitespace in patterns,
which conveniently includes vertical tab--whatever that is...  Also we
now have C<\h> for horizontal whitespace.]
=head2 RFC 328: Single quotes don't interpolate \' and \\
I think hyperquotes will be possible with a declaration of your quoting
rules, so we're not going to change the basic single-quote rules
(except for supporting C<\q>).
[Update: There are adverbial modifiers now that can do hyperquoting. See S02.]
=head2 Other Decisions About Literals
=head3 Scoping of \L et al.
I'd like to get rid of the gratuitously ugly C&lt;\E> as an end-of-scope
marker. Instead, if any sequence such as C<\L>, C<\U> or C<\Q> wishes
to impose a scope, then it must use curlies around that scope:
C<\L{I<stuff>}>, C<\U{I<stuff>}> or C<\Q{I<stuff>}>. Any literal
curlies contained in I<stuff> must be backslashed. (Curlies as syntax
(such as for subscripts) should nest correctly.)
[Update: Those constructs are now gone entirely.  Use closure interpolation
to interpolate the value of an expression.]
=head3 Bareword Policy
There will be no barewords in Perl 6. Any bare name that is a declared
package name will be interpreted as a class object that happens to
stringify to the package name. All other bare names will be interpreted
as subroutine or method calls. For nonstrict applications, undefined
subroutines will autodefine themselves to return their own name. Note
that in C<${name}> and friends, the name is considered autoquoted, not
a bareword.
[Update: The C<${name}> construct is gone.  Use closure interpolation
to disambiguate expression interpolations: C<"{$name}text">.
Use C<$($ref)> or C<$$ref> for hard dereferences.
Use C<$::($name)> for symbolic dereferences.]
=head3 Weird brackets
Use of brackets to disambiguate
    "${foo[bar]}"
from
    "${foo}[bar]"
will no longer be supported. Instead, the expression parser will always
grab as much as it can, and you can make it quit at a particular point
by interpolating a null string, specified by C<\Q>:
    "$foo\Q[bar]"
[Update: That's gone too.  Just use closure interpolation to disambiguate.]
=head3 Special tokens
Special tokens will turn into either POD directives or lexically scoped
OO methods under the C<MY> pseudo-package:
    Old                 New
    ---                 ---
    __LINE__            MY.line
    __FILE__            MY.file
    __PACKAGE__         MY.package
    __END__             =begin END      (or remove)
    __DATA__            =begin DATA
[Update: The first three are now C<$?LINE>, C<$?FILE>, and C<$?PACKAGE>.
There are other such variables too.  See S02.]
=head3 Heredoc Syntax
I think heredocs will require quotes around any identifier, and we need
to be sure to support C<< << qq(END) >> style quotes. Space is
now allowed before the (required) quoted token. Note that custom
quoting is now possible, so if you define a fancy C<qh> operator for
your fancy hyperquoting algorithm, then you could say C<< <<qh(END) >>.
It is still the case that you can say C<< <<"" >> to grab
everything up to the next blank line. However, Perl 6 will consider any
line containing only spaces, tabs, etc., to be blank, not just the ones
that immediately terminate with newline.
[Update: C<q:to/END/> is now how you form a here doc.]
=head1 Context
In Perl 5, a lot of contextual processing was done at run-time, and
even then, a given function could only discover whether it was in void,
scalar or list context. In Perl 6, we will extend the notion of context
to be more amenable to both compile-time and run-time analysis. In
particular, a function or method can know (theoretically even at
compile time) when it is being called in:
    Void context
    Scalar context
        Boolean context
        Integer context
        Numeric context
        String context
        Object context
    List context
        Flattening list context (true list context).
        Non-flattening list context (list of scalars/objects)
        Lazy list context (list of closures)
        Hash list context (list of pairs)
(This list isn't necessarily exhaustive.)
Each of these contexts (except maybe void) corresponds to a way in
which you might declare the parameters of a function (or the left side
of a list assignment) to supply context to the actual argument list (or
right side of a list assignment). By default, parameters will supply
object context, meaning individual parameters expect to be aliases to
the actual parameters, and even arrays and hashes don't do list context
unless you explicitly declare them to. These aren't cast in stone yet
(or even Jello), but here are some ideas for possible parameter
declarations corresponding to those contexts:
    Scalar context
        Boolean context                 bit $arg
        Integer context                 int $arg
        Numeric context                 num $arg
        String context                  str $arg
        Object context                  $scalar, %hash, Dog @canines, &foo
    List context
        Flattening list context         *@args
        Non-flattening list context     $@args
        Lazy list context               &@args
        Hash list context               *%args
[Update: Some of these are bogus.  See S06 for the most recent
formulation.  In general, most functions should not be asking for
their context, but should return something that works in any context.]
(I also expect unary * to force flattening of arrays in rvalue
contexts. This is how we defeat the type signature in Perl 6, instead
of relying on the initial ampersand. So instead of Perl 5's
C<&push(@list)>, you could just say C<push *@list>, and it wouldn't
matter what C<push>'s parameter signature said.)
[Update: Lazy flattening is now done with the C<[,]> reduce operator.
Eager flattening is done with the C<eager> list operator.]
It's also possible to define properties to modify formal arguments,
though that can get clunky pretty quickly, and I'd like to have a
concise syntax for the common cases, such as the last parameter
slurping a list in the customary fashion. So the signature for the
built-in C<push> could be
    sub push (@array, *@pushees);
Actually, the signature might just be C<(*@pushees)>, if C<push> is
really a method in the C<ARRAY> class, and the object is passed
implicitly:
    class ARRAY;
    sub .push (*@pushees);
    sub .pop (;int $numtopop);
    sub .splice (int $offset, int $len, *@repl);
But we're getting ahead of ourselves.
[Update: Indeed, methods are declared with a C<method> keyword, and
without the dot.  Also the built-in class is C<Array>, not C<ARRAY>.
See A12 and S12 for much more about classes and methods.]
By the way, all function and method parameters (other than the object
itself) will be considered read-only unless declared with the C<rw>
property. (List assignments will default the other way.) This will
prevent a great deal of the wasted motion current Perl implementations
have to go through to make sure all function arguments are valid
lvalues, when most of them are in fact never modified.
Hmm, we're still getting ahead of ourselves. Back to contexts.
=head2 References are now transparent to boolean context
References are no longer considered to be "always true" in Perl 6.
Any type can overload its C<bit()> casting operator, and any type that
hasn't got a C<bit()> of its own inherits one from somewhere else, if
only from class UNIVERSAL. The built-in bit methods have the expected
boolean semantics for built-in types, so arrays are still true if they
have something in them, strings are true if they aren't C<""> or 
C<"0">, etc.
[Update: The universal object class is nowadays called "C<Object>".  The
universal type is called "C<Any>".  And references aren't really references
anymore.  But it's still true that the individual object decides whether
to be true or false according to its type.]
=head1 Lists
=head2 RFC 175: Add C<list> keyword to force list context (like
C<scalar>)
Another RFC rescued from the compost pile. In Perl 6, type names will
identify casting functions in general. (A casting function merely
forces context -- it's a no-op unless the actual context is different.)
In Perl 6, a list used in a scalar context will automatically turn
itself into a reference to the list rather than returning the last
element. (A subscript of C<[-1]> can always be used to get the last
element explicitly, if that's actually desired. But that's a rarity, in
practice.) So it works out that the explicit list composer:
    [1,2,3]
is syntactic sugar for something like:
    scalar(list(1,2,3));
Depending on whether we continue to make a big deal of the list/array
distinction, that might actually be spelled:
    scalar(array(1,2,3));
Other casts might be words like C<hash> (supplying a pairlist context)
and C<objlist> (supplying a scalar context to a list of expressions).
Maybe even the optional C<sub> keyword could be considered a cast on a
following block that might not otherwise be considered a closure in
context. Perhaps C<sub> is really spelled C<lazy>. In which case, we
might even have a C<lazylist> context to supply a lazy context to a
list of expressions.
[Update: Most of these aren't necessary.  Just use the appropriate method.]
And of course, you could use standard casts like C<int()>, C<num()>,
and C<str()>, when you want to be explicit about such contexts at
compile time. (Perl 5 already has these contexts, but only at run
time.) Note also that, due to the relationship between unary functions
and methods, C<$foo.int>, C<$foo.num>, and C<$foo.str> will be just a
different way to write the same casts.
Lest you worry that your code is going to be full of casts, I should
point out that you won't need to use these casts terribly often because
each of these contexts will typically be implied by the signature of
the function or method you're calling. (And Perl will still be
autoconverting for you whenever it makes sense.) More on that in
Apocalypse 6, Subroutines. If not sooner.
So, while boolean context might be explicitly specified by writing:
    if (bit $foo)
or
    if ($foo.bit)
you'd usually just write it as in Perl 5:
    if ($foo)
[Update: The parens are now optional, so that's just "C<if $foo>".]
=head2 Other Decisions about Lists
Based on some of what we've said, you can see that we'll have the
ability to define various kinds of lazily generated lists. The specific
design of these operators is left for subsequent Apocalypses, however.
I will make one observation here, that I think some of the proposals
for how array subscripts are generated should be generalized to work
outside of subscripts as well. This may place some constraints on the
general use of the C<:> character in places where an operator is
expected, for instance.
[Update: And indeed, we now have adverbs that look like C<:foo(1)>.]
As mentioned above, we'll be having several different kinds of list
context. In particular, there will be a hash list context that assumes
you're feeding it pairs, and if you don't feed it pairs, it will assume
the value you feed it is a key, and supply a default value. There will
likely be ways to get hashes to default to interesting values such as 0
or 1.
[Update: there isn't really a hash list context.]
In order to do this, the C<< => >> operator has to at least mark its
left operand as a key. More likely, it actually constructs a pair
object in Perl 6. And the C<< { foo => $bar } >> list composer will be
required to use C<< => >> (or be in a hashlist context), or it will
instead be interpreted as a closure without a C<sub>. (You can always
use an explicit C<sub> or C<hash> to cast the brackets to the proper
interpretation.)
I've noticed how many programs use C<qw()> all over the place (much
more frequently than the input operator, for instance), and I've always
thought C<qw()> was kind of ugly, so I'd like to replace it with
something prettier. Since the input operator is using up a pair of
perfectly good bracketing characters for little syntactic gain, we're
going to steal those and make them into a qw-like list composer. In
ordinary list context, the following would be identical:
    @list = < foo $bar %baz blurch($x) >;
    @list = qw/ foo $bar %baz blurch($x) /;                     # same as this
    @list = ('foo', '$bar', '%baz', 'blurch($x)');              # same as this
But in hashlist context, it might be equivalent to this:
    %list = < foo $bar %baz blurch($x) >;
    %list = (foo => 1, '$bar' => 1, '%baz' = 1, blurch => $x);  # same as this
[Update: There is also a version with French quotes C<«»> that does
interpolation before splitting into words.  Either can be used as a
hash subscript slice.]
=head1 Files
Basically, file handles are just objects that can be used as iterators,
and don't belong in this chapter anymore.
=head2 RFC 034: Angle Brackets Should Not Be Used for File Globbing
Indeed, they won't be. In fact, angle brackets won't be used for input
at all, I suspect. See below. Er, above.
[Update: Angle brackets don't directly do input.  There is a unary C<=>
that does the equivalent.  For an argument it takes a filehandle,
interator, filename, or list of any of those, and returns lines.
There is a corresponding C<lines> list operator.]
=head2 RFC 051: Angle Brackets Should Accept Filenames and Lists
There is likely to be no need for an explicit input operator in Perl 6,
and I want the angles for something else. I/O handles are a subclass of
iterators, and I think general iterator variables will serve the
purpose formerly served by the input operator, particularly since they
can be made to do the right Thing in context. For instance, to read
from standard input, it will suffice to say
    while ($STDIN) { ... }
and the iterator will know it should assign to C<$_>, because it's in a
Boolean context.
[Update: No, this confuses two different meanings of boolean. Use
C<for =$*IN {...}> instead.]
I read this RFC more as requesting a generic way to initialize an
iterator according to the type of the iterator. The trick in this case
is to prevent the re-evaluation of the spec every time -- you don't
want to reopen the file every time you read a line from it, for
instance. There will be standard ways to suppress evaluation in Perl 6,
both from the standpoint of the caller and the callee. In any case, the
model is that an anonymous subroutine is passed in, and called only
when appropriate. So an iterator syntax might prototype its argument to
be an anonymous sub, or the user might explicitly pass an anonymous
sub, or both. In any event, the C<sub> keyword will be optional in Perl
6, so things like:
    while (file {LIST}) { ... }
can be made to defer evaluation of LIST to the appropriate moment (or
moments, if LIST is in turn generating itself on the fly). For
appropriate parameter declarations I suppose even the brackets could be
scrapped.
=head1 Properties
Variables and values of various types have various kinds of data
attributes that are naturally associated with them by virtue of their
type. You know a dog comes equipped with a wag, hopefully attached to a
tail. That's just part of doghood.
Many times, however, you want the equivalent of a Post-It(r) note, so
you can temporarily attach bits of arbitrary information to some
unsuspecting appliance that (though it wasn't designed for it) is
nevertheless the right place to put the note. Similarly, variables and
values in Perl 6 allow you to attach arbitrary pieces of information
known as "properties." In essence, any object in Perl can have an
associated hash containing these properties, which are named by the
hash key.
Some of these properties are known at compile time, and don't actually
need to be stored with the object in question, but can actually be
stored instead in the symbol table entry for the variable in question.
(Perl still makes it appear as though these values are attached to the
object.) Compile-time properties can therefore be attached to variables
of any type.
[Update: Compile-time properties are now known as "traits".]
Run-time properties really are associated with the object in question,
which implies some amount of overhead. For that reason, intrinsic data
types like C<int> and C<num> may or may not allow run-time properties.
In cases where it is allowed, the intrinsic type must generally be
promoted to its corresponding object type (or wrapped in an object that
delegates back to the original intrinsic for the actual value). But you
really don't want to promote an array of a million bits to an array of
a million objects just because you had the hankering to put a sticky
note on one of those bits, so in those cases it's likely to be
disallowed, or the bit is likely to be cloned instead of referenced, or
some such thing.
Properties may also be attached to subroutines.
In general, you don't set or clear properties directly -- instead you
call an accessor method to do it for you. If there is no method of that
name, Perl will assume there was one that just sets or clears a
property with the same name as the method. However, using accessor
methods to set or clear properties allows us to define synthetic
properties. For instance, there might be a real C<constant> property
that you could attach to a variable. Certain variables (such as those
in a function prototype) might have C<constant> set by default. In that
case, setting a synthetic property such as C<rw> might clear the
underlying C<constant> property.
A property may be attached to the foregoing expression by means of the
"is" keyword. Here's a compile-time property set on a variable:
    my int $pi is constant = 3;
Here's a run-time property set on a return value:
    return 0 is true;
[Update: Run-time properties are set with C<but> rather than C<is>.
They're really just mixins.  And there is no "constant" trait, but
rather a "readonly" trait.  Use the special declarator
    constant int $pi = 3;
to declare a compile-time constant.]
Whether a property is applied to a variable at compile time or a value
at run-time depends on whether it's in lvalue or rvalue context.
(Variable declarations are always in lvalue context even when you don't
assign anything to them.)
The "C<is>" works just like the "C<.>" of a method call, except
that the return value is the object on the left, not the return value
of the method, which is discarded.
As it happens, the "C<is>" is optional in cases where an operator is
already expected. So you might see things like:
    my int $pi constant = 3;
    return 0 true;
In this case, the methods are actually being parsed as postfix
operators. (However, we may make it a stricture that you may omit the
C<is> only for predeclared property methods.)
[Update: the C<is> or C<but> is never optional.]
Since these actually are method calls, you can pass arguments in
addition to the object in question:
    my int @table is dim(366,24,60);
[Update: that should be C<my int @table[^366;^24;^60];> according to S09...]
Our examples above are assuming an argument of C<(1)>:
    my int $pi is constant(1) = 3;
    return 0 is true(1);
Since the "C<is>" is optional in the common cases, you can stack
multiple properties without repeating the "C<is>."
    my int $pi is shared locked constant optimize($optlevel) = 3;
[Update: the C<is> is no longer optional.]
(Note that these methods are called on the C<$pi> variable at compile
time, so it behooves you to make sure everything you call is defined.
For instance, C<$optlevel> needs to be known at compile-time.)
Here are a list of property ideas stolen from Damian. (I guess that
makes it intellectual property theft.) Some of the names have been
changed to protect the (CS) innocent.
    # Subroutine attributes...
    sub name is rw { ... }                      # was lvalue
    my sub rank is same { ... }                 # was memoized
    $snum = sub is optimize(1) { ... };         # "is" required here
    # Variable attributes...
    our $age is constant = 21;                  # was const
    my %stats is private;
    my int @table is dim(366,24,60);
    $arrayref = [1..1000000] is computed Purpose('demo of anon var attrs');
    sub choose_rand (@list is lazy) { return $list[rand @list] }
                                                # &@list notation is likely
    $self = $class.bless( {name=>$name, age=>$age} is Initialized );
    # Reference attributes...
    $circular = \$head is weak;
    # Literal attributes...
    $name = "Damian" is Note("test data only");
    $iohandle = open $filename is dis(qw/para crlf uni/) or die;
    $default = 42 is Meaning(<<OfLife);
                             The Answer
                             OfLife
    package Pet is interface;
    class Dog inherits('Canine') { ... }
    print $data{key is NoteToSelf('gotta get a better name for this key')};
(I don't agree with using properties for all of these things, but it's
pretty amazing how far into the ground you can drive it.)
Property names should start with an identifier letter (which includes
Unicode letters and ideographs). The parsing of the arguments (if any)
is controlled by the signature of the method in question. Property
method calls without a "." always modify their underlying property.
[Update: the signature has nothing to do with how they're parsed.  If
a property has an argument it must follow without an intervening space.
However, other brackets are allowed, and using C<will> rather than C<is>
allows you to specify a closure with an intervening space.]
If called as an ordinary method (with a "."), the property value is
returned without being modified. That value could then be modified by a
run-time property. For instance, C<$pi.constant> would return C<1>
rather than the value of C<$pi>, so we get:
    return $pi.constant is false;       # "1 but false" (not possible in Perl 5)
On the other hand, if you omit the dot, something else happens:
    return $pi constant is false;       # 3 but false (and 3 is now very constant)
[Update: The dot form is only for access of properties and traits that
are already there.]
Here are some more munged Damian examples:
    if (&name.rw) { ... }
    $age++ unless $age.constant;
    $elements = return reduce $^ * $^, *@table.dim;
    last if ${self}.Initialized;
    print "$arrayref.Purpose() is not $default.Meaning()\n";
    print %{$self.X};    # print hash referred to by X attribute of $self
    print %{$self}.X;    # print X attribute of hash referred to by $self
    print %$self.X;      # print X attribute of hash referred to by $self
As with the dotless form, if there is no actual method corresponding to
the property, Perl pretends there's a rudimentary one returning the
actual property.
[Update: Actually, they are real method calls now, and properties and
traits are really mixins.  See A12.]
Since these methods return the properties (except when overridden by
dotless syntax), you can temporize a property just as you can any
method, provided the method itself allows writing:
    temp $self.X = 0;
Note that
    temp $self is X = 0;
would assign to 0 to C<$self> instead. (Whether it actually makes sense
to set the compile-time X property at run time on the C<$self> variable
is anybody's guess.)
Note that by virtue of their syntax, properties cannot be set by
interpolation into a string. So, happily:
    print "My $variable is foobar\n";
does not attempt to set the C<foobar> property on C<$variable>.
The "C<is>" keyword binds with the same precedence as ".", even
when it's not actually there.
[Update: Actually, C<but> now has the precedence of other non-chaining
operators, such as C<..>.  Since C<is> occurs only in declarations, it
is not subject to operator precedence at all.]
Note that when you say C<$foo.bar>, you get C<$foo>'s compile-time
property if there is one (which is known at compile time, duh).
Otherwise it's an ordinary method call on the value (which looks for a
run-time property only if a method can't be found, so it shouldn't
impact ordinary method call overhead.)
[Update: No, that's always an ordinary method call now.  Use
C<variable($foo).bar> to get at a property of the variable, or to call
any other method on the variable object rather than the referent
of C<$foo>.]
To get to the properties directly without going through the method
interface, use the special C<btw> method, which returns a hash ref to
the properties hash.
    $foo.btw{constant}
Note that synthetic properties won't show up there!
[Update: there is no such hash anymore, and synthetic properties
are simply synthetic methods.]
None of the property names in this Apocalypse should be taken as final.
We will decide on actual property names as we proceed through the
series.
Well, that's it for Apocalypse 2. Doubtless there are some things I
should have decided here that I didn't yet, but at least we're making
progress. Well, at least we're moving in some direction or other. Now
it's time for us to dance the Apocalypso, in honor of Jon Orwant and
his new wife.
	Global
`s`	Focus search bar
`?`	Bring up this help dialog
	GitHub
`g` `p`	Go to pull requests
`g` `i`	go to github issues (only if github is preferred repository)
	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse
	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)