NAME

PDLA::FAQ - Frequently asked questions about PDLA

VERSION

Current FAQ version: 1.007

DESCRIPTION

This is version 1.007 of the PDLA FAQ, a collection of frequently asked questions about PDLA - the Perl Data Language.

ABOUT THIS DOCUMENT

Q: 1.1 Where to find this document

You can find the latest version of this document at http://pdl.perl.org/?docs=FAQ&title=Frequently%20Asked%20Questions .

Q: 1.2 How to contribute to this document

This is a considerably reworked version of the PDLA FAQ. As such, errors may have crept in and some updates may not have made it in. You are explicitly encouraged to let us know about questions which you think should be answered in this document but currently aren't.

Similarly, if you think parts of this document are unclear, please tell the FAQ maintainer about it. Where a specific answer is taken in full from someone's posting, the authorship should be indicated; let the FAQ maintainer know if it isn't. For more general information, explicit acknowledgment is not made in the text; instead there is an incomplete list of contributors at the end of this document. Please contact the FAQ maintainer if you feel hard done by.

Send your comments, additions, suggestions or corrections to the PDLA mailing list at pdl-general@lists.sourceforge.net. See Q: 3.2 below for instructions on how to join the mailing lists.

GENERAL QUESTIONS

Q: 2.1 What is PDLA ?

PDLA stands for Perl Data Language. In the words of Karl Glazebrook, initiator of the PDLA project:

The PDLA concept is to give standard perl5 the ability
to COMPACTLY store and SPEEDILY manipulate the large
N-dimensional data sets which are the bread and butter
of scientific computing. e.g. $a=$b+$c can add two
2048x2048 images in only a fraction of a second.

It provides tons of useful functionality for scientific and numeric analysis.

For readers familiar with other scientific data evaluation packages it may be helpful to add that PDLA is in many respects similar to IDL, MATLAB and similar packages. However, it tries to improve on a number of issues which were perceived (by the authors of PDLA) as shortcomings of those existing packages.

Q: 2.2 Who supports PDLA? Who develops it?

PDLA is supported by its users. General informal support for PDLA is provided through the PDLA mailing list (pdl-general@lists.sourceforge.net , see below).

As a Perl extension (see Q: 2.5 below) it is devoted to the idea of free and open development put forth by the Perl community. PDLA was and is being actively developed by a loosely knit group of people around the world who coordinate their activities through the PDLA development mailing list (pdl-devel@lists.sourceforge.net , see Q: 3.2 below). If you would like to join in the ongoing efforts to improve PDLA please join this list.

Q: 2.3 Why yet another Data Language ?

There are actually several reasons and everyone should decide for himself which are the most important ones:

  • PDLA is "free software". The authors of PDLA think that this concept has several advantages: everyone has access to the sources -> better debugging, easily adaptable to your own needs, extensible for your purposes, etc... In comparison with commercial packages such as MATLAB and IDL this is of considerable importance for workers who want to do some work at home and cannot afford the considerable cost to buy commercial packages for personal use.

  • PDLA is based on a powerful and well designed scripting language: Perl. In contrast to other scientific/numeric data analysis languages, it has been designed using the features of a proven language instead of having grown into existence from scratch. Defining control structures piecemeal while features are added during development leads to languages that often appear clumsy and badly planned, as is the case for most existing packages with a scope similar to PDLA's.

  • With Perl as the basis, a PDLA programmer has all the powerful features of Perl at hand, right from the start. This includes regular expressions, associative arrays (hashes), well designed interfaces to the operating system, network, etc. Experience has shown that even in mainly numerically oriented programming it is often extremely handy to have easy access to powerful semi-numerical or completely non-numerical functionality as well. For example, you might want to offer the results of a complicated computation as a server process to other processes on the network, perhaps directly accepting input from other processes on the network. Using Perl and existing Perl extension packages, things like this are no problem at all (and it all will fit into your "PDLA script").

  • Extremely easy extensibility and interoperability as PDLA is a Perl extension; development support for Perl extensions is an integral part of Perl and there are already numerous extensions to standard Perl freely available on the network.

  • Integral language features of Perl (regular expressions, hashes, object modules) immensely facilitated development and implementation of key concepts of PDLA. One of the most striking examples for this point is probably PDLA::PP (see Q: 6.16 below), a code generator/parser/pre-processor that generates PDLA functions from concise descriptions.

  • None of the existing data languages follow the Perl language rules, which the authors firmly believe in:

    • TIMTOWTDI: There is more than one way to do it. Minimalist languages are interesting for computer scientists, but for users, a little bit of redundancy makes things wildly easier to cope with and allows individual programming styles - just as people speak in different ways. For many people this will undoubtedly be a reason to avoid PDLA ;)

    • Simple things are simple, complicated things possible: Things that are often done should be easy to do in the language, whereas seldom done things shouldn't be too cumbersome.

    All existing languages violate at least one of these rules.

  • As a project for the future PDLA should be able to use super computer features, e.g. vector capabilities/parallel processing, GPGPU acceleration. This will probably be achieved by having PDLA::PP (see Q: 6.16 below) generate appropriate code on such architectures to exploit these features.

  • [ fill in your personal 111 favourite reasons here...]

Q: 2.4 What is PDLA good for ?

Just in case you do not yet know what the main features of PDLA are and what one could do with them, here is a (necessarily selective) list of key features:

PDLA is well suited for matrix computations, general handling of multidimensional data, image processing, general scientific computation, numerical applications. It supports I/O for many popular image and data formats, 1D (line plots), 2D (images) and 3D (volume visualization, surface plots via OpenGL - for instance implemented using Mesa or video card OpenGL drivers), graphics display capabilities and implements many numerical and semi-numerical algorithms.

Through the powerful pre-processor it is also easy to interface Perl to your favorite C routines, more of that further below.

Q: 2.5 What is the connection between PDLA and Perl ?

PDLA is a Perl5 extension package. As such it needs an existing Perl5 installation (see below) to run. Furthermore, much of PDLA is written in Perl (+ some core functionality that is written in C). PDLA programs are (syntactically) just Perl scripts that happen to use some of the functionality implemented by the package "PDLA".

Q: 2.6 What do I need to run PDLA on my machine ?

Since PDLA is just a Perl5 package you need first of all an installation of Perl5 on your machine. As of this writing PDLA requires version 5.10.x of perl, or higher. More information on where and how to get a Perl installation can be found at the Perl home page http://www.perl.org and at many CPAN sites (if you do not know what CPAN is, check the answer to the next question).

To build PDLA you also need a working C compiler, support for XSUBs, and the package ExtUtils::MakeMaker. If you don't have a compiler there might be a binary distribution available; see "Binary distributions" below.

If you can (or cannot) get PDLA working on a new (previously unsupported) platform we would like to hear about it. Please report your success or failure to the PDLA mailing list at pdl-general@lists.sourceforge.net . We will do our best to assist you in porting PDLA to a new system.

Q: 2.7 Where do I get it?

PDLA is available as a source distribution in the Comprehensive Perl Archive Network (or CPAN) and from the sourceforge.net project page at https://sourceforge.net/projects/pdl/files/. The CPAN archive contains not only the PDLA distribution but also just about everything else that is Perl-related. CPAN is mirrored by dozens of sites all over the world. The main site is http://www.cpan.org, and local CPAN sites (mirrors) can be found there. Within CPAN you find the latest released version of PDLA in the directory /modules/by-module/PDLA/. PDLA's homepage is at http://pdl.perl.org and the latest version can also be downloaded from there.

Q: 2.8 What do I have to pay to get PDLA?

We are delighted to be able to give you the nicest possible answer to a question like this: PDLA is *free software* and all sources are publicly available. But still, there are some copyrights to comply with. So please, try to be as nice as we (the PDLA authors) are and try to comply with them.

Oh, before you think it is *completely* free: you have to invest some time to pull the distribution from the net, compile and install it and (maybe) read the manuals.

GETTING HELP/MORE INFORMATION

Q: 3.1 Where can I get information on PDLA?

The complete PDLA documentation is available with the PDLA distribution. Use the command perldoc PDLA to start learning about PDLA.

The easiest way by far, however, to get familiar with PDLA is to use the PDLA on-line help facility from within the PDLA shell, pdla2. Just type pdla2 at your system prompt. Once you are inside the pdla2 shell, type help . Using the help and apropos commands inside the shell you should be able to find your way around the documentation.

Even better, you can immediately try your newly acquired knowledge about PDLA by issuing PDLA/Perl commands directly at the command line. To illustrate this process, here is the record of a typical pdla2 session of a PDLA beginner (lengthy output is only symbolically reproduced in angle brackets, < ... > ):

unix> pdla2
pdla> help
< ... help output ... >
pdla> help PDLA::QuickStart
< ... perldoc page ... >
pdla> $a = pdl (1,5,7.3,1.0)
pdla> $b = sequence float, 4, 4
pdla> help inner
< ... help on the 'inner' function ... >
pdla> $c = inner $a, $b
pdla> p $c
[22.6 79.8 137 194.2]

For further sources of information that are accessible through the Internet see next question.

Q: 3.2 Are there other PDLA information sources on the Internet?

First of all, for all purely Perl-related questions there are tons of sources on the net. Good points to start are http://www.perl.com and http://www.perl.org .

The PDLA home site can be accessed by pointing your web browser to http://pdl.perl.org . It has tons of goodies for anyone interested in PDLA:

  • PDLA distributions

  • On-line documentation

  • Pointers to an HTML archive of the PDLA mailing lists

  • A list of platforms on which PDLA has been successfully tested.

  • News about recently added features, ported libraries, etc.

  • Names of the current pumpkin holders for the different PDLA modules (if you want to know what that means, you had better have a look at the web pages).

If you are interested in PDLA in general you can join the PDLA mailing list pdl-general@lists.sourceforge.net. This is a forum to discuss programming issues in PDLA, report bugs, seek assistance with PDLA related problems, etc. To subscribe, fill out the form at http://lists.sourceforge.net/lists/listinfo/pdl-general .

A searchable archive and a hypertext version of the traffic on this list (1997-2004) can be found at http://www.xray.mpe.mpg.de/mailing-lists/perldl/ . More recent messages (since June 2005) can be found at TBD.

If you are interested in all the technical details of the ongoing PDLA development you can join the PDLA developers mailing list pdl-devel@lists.sourceforge.net . To subscribe, fill out the form at http://lists.sourceforge.net/lists/listinfo/pdl-devel .

A searchable archive and a hypertext version of the traffic on this list (1997-2004) can be found at http://www.xray.mpe.mpg.de/mailing-lists/pdl-porters/ . More recent messages (since June 2005) can be found at TBD

Cross-posting between these lists should be avoided unless there is a very good reason for doing that.

Q: 3.3 What is the current version of PDLA ?

As of this writing (FAQ version 1.007 of 08 Oct 2016) the latest stable version is 2.017. The latest stable version should always be available from a CPAN mirror site near you (see Question 2.7 for info on where to get PDLA).

The most current (possibly unstable) version of PDLA can be obtained from the Git repository (see the question "How do I get PDLA via Git?" below). Periodic CPAN developer releases of the Git code will also be made for testing purposes and more general availability.

Q: 3.4 How can PDLA-2.2 be older than PDLA-2.007?

Over its development, PDLA has used two version numbering schemes. Versions 1.x through 2.005 used a single floating point version number. PDLA then switched to a dotted triple version for 2.1.1 onward, EXCEPT for version 2.2, which should have been released as 2.2.0. To simplify and unify things, PDLA reverted to a single float version representation with PDLA-2.006. This can cause dependency problems for modules that set a minimum PDLA version of 2.2, because 2.2 compares as greater than 2.007 when both are read as decimal numbers. To work around it, note that all extant PDLA releases have version numbers greater than 2.2.1 so that using 0 as the minimum version will work.
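The ordering problem can be reproduced with Perl's core version module. This is only an illustrative sketch: the version strings are the ones discussed above, and the comparison follows the rules of version.pm, which may differ in detail from what a particular installer does.

```perl
use strict;
use warnings;
use version;

# A decimal version such as 2.2 is read as 2.200, so it sorts above 2.007.
my $old_style     = version->parse('2.2');
my $new_style     = version->parse('2.007');
my $decimal_order = $old_style > $new_style ? 'newer' : 'older';
print "2.2 looks $decimal_order than 2.007\n";

# A dotted triple such as 2.1.1 is compared component by component,
# so it sorts below 2.007 (which is equivalent to v2.7.0).
my $dotted       = version->parse('v2.1.1');
my $dotted_order = $dotted < $new_style ? 'older' : 'newer';
print "2.1.1 looks $dotted_order than 2.007\n";
```

A module that asks for at least 2.2 would therefore reject a 2.007 installation under these rules, even though 2.007 was released later.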

Q: 3.5 I want to contribute to the further development of PDLA. How can I help?

Two ways that you could help almost immediately are (1) participating in CPAN Testers for PDLA and related modules, and (2) proofreading and clarifying the PDLA documentation so that it is as usable as possible for PDLA users, especially new users.

To participate in CPAN Testers and contribute test reports, see http://wiki.cpantesters.org/wiki/QuickStart , which has instructions for getting started with either CPAN or CPANPLUS.

If you have a certain project in mind you should check if somebody else is already working on it or if you could benefit from existing modules. Do so by posting your planned project to the PDLA developers mailing list at pdl-devel@lists.sourceforge.net . See the subscription instructions in Question 3.2. We are always looking for people to write code and/or documentation ;).

Q: 3.6 I think I have found a bug in the current version of PDLA. What shall I do?

First, make sure that the bug/problem you came across has not already been dealt with somewhere else in this FAQ. Secondly, you can check the searchable archive of the PDLA mailing lists to find whether this bug has already been discussed. If you still haven't found any explanations you can post a bug report to pdl-general@lists.sourceforge.net , or through the Bugs link on http://pdl.perl.org . See the BUGS file in the PDLA distribution for what information to include. If you are unsure, discussions via the perldl mailing list can be most helpful.

INSTALLATION

Q: 4.1 I have problems installing PDLA. What shall I do?

First make sure you have read the file INSTALL in the distribution. It contains a list of common problems which need not be repeated here.

Next, check the file perldl.conf to see if by editing the configuration options in that file you will be able to successfully build PDLA. Some of the modules need additional software installed; please refer to the file DEPENDENCIES for further details. Make sure to edit the location of these packages in perldl.conf if you have them in non-standard locations.

N.B. Unix shell specific: If you would like to save an edited perldl.conf for future builds just copy it as ~/.perldl.conf into your home directory where it will be picked up automatically during the PDLA build process.

Also, check for another, pre-existing version of PDLA on the build system. Multiple PDLA installs in the same PATH or @INC can cause puzzling test or build failures.

If you still can't make it work properly please submit a bug report including detailed information on the problems you encountered to the perldl mailing list ( pdl-general@lists.sourceforge.net , see also above). Response is often rapid.

Q: 4.2 Are there configuration files for PDLA I have to edit?

Most users should not have to edit any configuration files manually. However, in some cases you might have to supply some information about awkwardly placed include files/libraries or you might want to explicitly disable building some of the optional PDLA modules. Check the files INSTALL and perldl.conf for details.

If you had to manually edit perldl.conf and are happy with the results you can keep the file handy for future reference. Place it in ~/.perldl.conf where it will be picked up automatically or use perl Makefile.PL PDLACONF=your_file_name next time you build PDLA.

Q: 4.3 Do I need other software for successful operation?

For the basic PDLA functionality you don't need any additional software. However, some of the optional PDLA modules included in the distribution (notably most graphics and some I/O modules) require certain other libraries/programs to be installed. Check the file DEPENDENCIES in the distribution for details and directions on how to get these.

Q: 4.4 How can I install PDLA in a non-standard location?

To install PDLA in a non-standard location, use the INSTALL_BASE option in the perl Makefile.PL configure step. For example, perl Makefile.PL INSTALL_BASE=/mydir/perl5 will configure PDLA to install into the tree rooted at /mydir/perl5. For more details see "How do I keep my own module/library directory?" in perlfaq8 and subsequent sections. Another alternative is to use local::lib to do the heavy lifting for the needed configuration.
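Once PDLA is installed under a private prefix, scripts have to be told where to look. A minimal sketch, assuming the /mydir/perl5 prefix from the example above (INSTALL_BASE places modules under lib/perl5 below the prefix):

```perl
use strict;
use warnings;

# Hypothetical prefix, matching the INSTALL_BASE example above.
# Modules installed with INSTALL_BASE end up under <prefix>/lib/perl5.
use lib '/mydir/perl5/lib/perl5';

# From here on, "use PDLA;" would be searched for in the private tree first.
my $found = grep { $_ eq '/mydir/perl5/lib/perl5' } @INC;
print "private tree is on \@INC\n" if $found;
```

Setting the PERL5LIB environment variable to the same directory has a similar effect without editing each script.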

Q: 4.5 How can I force a completely clean installation?

To guarantee a completely clean installation of PDLA, you will need to first delete the current installation files and folders. These will be all directories named PDLA in the Perl @INC path, files named *Pdlapp* in any Inline directories, and the programs pdl, pdladoc, pdla2, perldla, and pptemplate. Then just build and install as usual. This is much easier to keep track of if you always install PDLA into a non-standard location. See Q: 4.4 above.

BINARY DISTRIBUTIONS

Q: 4.5 What binary distributions are available?

Information about binary distributions of PDLA can be found on http://pdl.perl.org . At present there are binary distributions of PDLA for Linux (RedHat and Debian), FreeBSD, Mac OS X and Windows, though they might not be the most recent version.

If someone is interested in providing binary distributions for other architectures, that would be very welcome. Let us know on the pdl-devel@lists.sourceforge.net mailing list. Also check your Linux distribution's package manager as many now include PDLA. PPMs for win32 versions (both 32bit and 64bit) are also available.

Q: 4.6 Does PDLA run on Linux? (And what about packages?)

Yes, PDLA does run on Linux and indeed much of the development has been done under Linux. On http://pdl.perl.org you can find links to packages for some of the major distributions. Also check your distribution's package manager (yum, apt, urpmi, ...) as many of these now include PDLA.

Q: 4.7 Does PDLA run under Windows?

PDLA builds fine on Win32 using MinGW or Microsoft compilers. See the win32/INSTALL file in the PDLA source distribution for details. Other compilers have not been tested--input is welcome. There is also a distribution of PDLA through ActiveState's ppm, though it might not always be the latest version. PDLA-2.017 builds out of the box on Strawberry Perl and ActiveState Perl and there are distributions of Strawberry Perl with bundled PDLA (see http://strawberryperl.com/releases.html).

CVS, GIT, AND ON-GOING DEVELOPMENT

Q: 4.8 Can I get PDLA via CVS?

No. PDLA development was conducted with a CVS repository from December 1999 to April 2009. In April 2009 the project switched to the Git version control system (see http://git-scm.com).

Q: 4.9 How do I get PDLA via Git?

Assume you have Git installed on your system and want to download the project source code into the directory pdl-code. To get read-only access to the repository, type at the command line

git clone git://git.code.sf.net/p/pdl/code pdl-code

For official PDLA developers, to get read/write access to the repository type at the command line

git clone ssh://USERNAME@git.code.sf.net/p/pdl/code pdl-code

Q: 4.10 I had a problem with the Git version, how do I check if someone has submitted a patch?

The Sourceforge system contains a patch-manager which contains patches that have not yet been applied to the distribution. This can be accessed via the Tickets menu at PDLA's Sourceforge project page http://sourceforge.net/projects/pdl .

In addition, if you are not subscribed to the mailing lists, check the archives of the pdl-porters and perldl mailing lists. See Question 3.2 for details.

Q: 4.11 I have gotten developer access to Git, how do I upload my changes?

The first thing you should do is to read the Git documentation and learn the basics about Git. There are many sources available online. But here are the basics:

Before you upload your changes, commit them to YOUR repository

git add <file1> <file2> ...
git commit

or combine these two with

git commit -a

Then pull in any changes others have made

git pull origin

Test PDLA before you push your changes to the main repository. If the code is broken for you, then it is most likely broken for others and they won't be happy to have their recent PDLA fail to build! NOTE: git makes it very easy to maintain a separate branch of development. [ TBD, provide information on how ].

Then update the shared repository (at SF.net) with your changes

git push origin master

PDLA JARGON

Q: 5.1 What is threading (is PDLA a newsreader) ?

Unfortunately, in the context of PDLA the term threading can have two different (but related) meanings:

  • When mentioned in the INSTALL directions and possibly during the build process, we have the usual computer science meaning of multi-threading in mind (useful mainly on multiprocessor machines or clusters).

  • PDLA threading of operations on piddles (as mentioned in the indexing docs) is the iteration of a basic operation over appropriate sub-slices of piddles, e.g. the inner product inner $a, $b of a (3) pdl $a and a (3,5,4) pdl $b results in a (5,4) piddle where each value is the result of an inner product of the (3) pdl with a (3) sub-slice of the (3,5,4) piddle. For details check PDLA::Indexing .

PDLA threading leads naturally to potentially parallel code which can make use of multi threading on multiprocessor machines/networks; there you have the connection between the two types of use of the term.
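The inner product example from the list above can be tried directly. This is a sketch that assumes PDLA is installed and that use PDLA exports pdl, sequence and inner; the variable names are made up for illustration.

```perl
use strict;
use warnings;
use PDLA;   # assumption: PDLA is installed and exports pdl, sequence, inner

my $vec  = pdl(1, 2, 3);        # dims (3)
my $cube = sequence(3, 5, 4);   # dims (3,5,4), values 0..59

# inner() consumes the first dimension; the remaining (5,4) dims are threaded over.
my $result = inner($vec, $cube);
my $shape  = join ',', $result->dims;
print "result dims: $shape\n";

# Each element is one inner product, e.g. element (0,0) is 1*0 + 2*1 + 3*2.
my $first = $result->at(0, 0);
print "first value: $first\n";
```

No explicit loop over the 5x4 sub-slices is written; the iteration is done by the threading engine.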

Q: 5.2 What is a piddle?

Well, PDLA scalar variables (which are instances of a particular class of Perl objects, i.e. blessed thingies (see perldoc perlobj )) are in common PDLA parlance often called piddles (for example, check the mailing list archives). Err, clear? If not, simply use the term piddle when you refer to a PDLA variable (an instance of a PDLA object as you might remember) regardless of what actual data the PDLA variable contains.

TECHNICAL QUESTIONS

Q: 6.1 What is perldla? What is pdla2?

Sometimes perldla (pdla2) is used as a synonym for PDLA. Strictly speaking, however, the name perldla (pdla2) is reserved for the little shell that comes with the PDLA distribution and is supposed to be used for the interactive prototyping of PDLA scripts. For details check perldla or pdla2.

Q: 6.2 How do I get on-line help for PDLA?

Just type help (shortcut = "?") at the pdla2 shell prompt and proceed from there. Another useful command is the apropos (shortcut = "??") command. Also try the demo command in the perldla or pdla2 shell if you are new to PDLA.

MANIPULATION OF PIDDLES

Q: 6.3 I want to access the third element of a pdl but $a[2] doesn't work ?!

See answer to the next question why the normal Perl array syntax doesn't work for piddles.

Q: 6.4 The docs say piddles are some kind of array. But why doesn't the Perl array syntax work with piddles then ?

OK, you are right in a way. The docs say that piddles can be thought of as arrays. More specifically, they say ( PDLA::QuickStart ):

I find when using the Perl Data Language it is most useful
to think of standard Perl @x variables as "lists" of generic
"things" and PDLA variables like $x as "arrays" which can be
contained in lists or hashes.

So, while piddles can be thought of as some kind of multi-dimensional array they are not arrays in the Perl sense. Rather, from the point of view of Perl they are some special class (which is currently implemented as an opaque pointer to some stuff in memory) and therefore need special functions (or 'methods' if you are using the OO version) to access individual elements or a range of elements. The functions/methods to check are at / set (see the section 'Sections' in PDLA::QuickStart ) or the powerful slice function and friends (see PDLA::Slices and PDLA::Indexing and especially PDLA::NiceSlice ).

Finally, to confuse you completely, you can have Perl arrays of piddles, e.g. $spec[3] can refer to a pdl representing, e.g., a spectrum, where $spec[3] is the fourth element of the Perl list (or array ;) @spec . This may be confusing but is very useful !
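The difference between element access on a piddle and indexing a Perl array of piddles can be sketched as follows. It assumes PDLA is installed and that at, set and slice behave as described in PDLA::QuickStart; the data values are arbitrary.

```perl
use strict;
use warnings;
use PDLA;   # assumption: PDLA is installed

my $x = pdl(10, 20, 30, 40);

# at() returns an ordinary Perl scalar; this is the "third element".
my $third = $x->at(2);

# set() changes a single element of the piddle.
$x->set(2, 99);

# slice() returns a piddle covering elements 1 and 2.
my $middle = $x->slice('1:2');

# A Perl array whose elements happen to be piddles.
my @spec  = (pdl(1, 2, 3), pdl(4, 5, 6));
my $value = $spec[1]->at(0);    # Perl index first, then piddle index

print "third=$third middle=$middle value=$value\n";
```

Here $spec[1] uses Perl's array syntax to pick a piddle, and at(0) then picks an element inside that piddle.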

Q: 6.5 How do I concatenate piddles?

Most people will try to form new piddles from old piddles using some variation on the theme $a = pdl([$b, 0, 2]) , but this does not work. The way to concatenate piddles is to use the function cat (see also append and glue). Similarly, you can split piddles using the command dog .
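A short sketch of the three functions, assuming PDLA is installed and exports cat, append and dog. Note that cat stacks its arguments along a new dimension, while append joins them along the first one.

```perl
use strict;
use warnings;
use PDLA;   # assumption: PDLA is installed

my $p = pdl(1, 2, 3);
my $q = pdl(4, 5, 6);

# cat: stack along a new last dimension, giving dims (3,2).
my $stacked       = cat($p, $q);
my $stacked_shape = join ',', $stacked->dims;

# append: join along the first dimension, giving dims (6).
my $joined       = append($p, $q);
my $joined_shape = join ',', $joined->dims;

# dog: the opposite of cat, returns a Perl list of piddles.
my @parts = dog($stacked);

print "cat: $stacked_shape, append: $joined_shape, dog: ", scalar(@parts), " parts\n";
```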

Q: 6.6 Sometimes I am getting these strange results when using inplace operations?

This question is related to the inplace function. From the documentation (see PDLA::QuickStart):

Most functions, e.g. log(), return a result which is a
transformation of their argument. This makes for good
programming practice. However many operations can be done
"in-place" and this may be required when large arrays are in
use and memory is at a premium. For these circumstances the
operator inplace() is provided which prevents the extra copy
and allows the argument to be modified. e.g.:

$x = log($array);          # $array unaffected
log( inplace($bigarray) ); # $bigarray changed in situ

And also from the doc !!:

Obviously when used with some functions which can not be
applied in situ (e.g. convolve()) unexpected effects may
occur!
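To see the difference for yourself, compare a normal call with an inplace call. This sketch assumes PDLA is installed and uses sqrt, which is taken here as an example of a function that supports inplace operation.

```perl
use strict;
use warnings;
use PDLA;   # assumption: PDLA is installed

my $data = pdl(1, 4, 9);

# Normal call: a new piddle is returned, $data is left alone.
my $roots  = sqrt($data);
my $before = $data->at(2);      # still the original value

# Inplace call: the result overwrites the contents of $data.
$data->inplace->sqrt;
my $after = $data->at(2);

print "before=$before after=$after\n";
```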

Q: 6.7 What is this strange usage of the string concatenation operator .= in PDLA scripts?

See next question on assignment in PDLA.

Q: 6.8 Why are there two different kinds of assignment in PDLA ?

This is caused by the fact that Perl currently allows only restricted overloading of the assignment operator = . For some purposes of PDLA it turned out to be necessary to have more control over the overloading of an assignment operator. Therefore, PDLA uses the operator .= for certain types of assignments.
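The practical difference can be sketched like this, assuming PDLA is installed; the variable names are made up. Plain = makes a Perl variable refer to a different piddle, while .= writes new values into the piddle the variable already refers to.

```perl
use strict;
use warnings;
use PDLA;   # assumption: PDLA is installed

my $orig  = sequence(4);        # 0 1 2 3
my $alias = $orig;              # second name for the same piddle

# .= copies values into the existing piddle, so $orig sees the change.
$alias .= pdl(9, 9, 9, 9);
my $seen_through_orig = $orig->at(0);

# Plain = only rebinds the Perl variable; $orig is not touched.
my $other = $orig;
$other = pdl(1, 2, 3, 4);
my $orig_after_rebind = $orig->at(0);

print "after .= : $seen_through_orig, after = : $orig_after_rebind\n";
```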

Q: 6.9 How do I set a set of values in a piddle?

In Perl 5.6.7 and higher this assignment can be made using lvalue subroutines:

pdla> $a = sequence(5); p $a
[0 1 2 3 4]
pdla> $a->slice('1:2') .= pdl([5,6])
pdla> p $a
[0 5 6 3 4]

see PDLA::Lvalue for more info. PDLA also supports a more matrix-like slice syntax via the PDLA::NiceSlice module:

pdla> $a(1:2) .= pdl([5,6])
pdla> p $a
[0 5 6 3 4]

With versions of Perl prior to 5.6.7 or when running under the perl debugger this has to be done using a temporary variable:

pdla> $a = sequence(5); p $a
[0 1 2 3 4]
pdla> $tmp = $a->slice('1:2'); p $tmp;
[1 2]
pdla> $tmp .= pdl([5, 6]);    # Note .= !!
pdla> p $a
[0 5 6 3 4]

This can also be made into one expression, which is often seen in PDLA code:

pdla> ($tmp = $a->slice('1:2')) .= pdl([5,6])
pdla> p $a
[0 5 6 3 4]

Q: 6.10 Can I use a piddle in a conditional expression?

Yes you can, but not in the way you probably tried first. It is not possible to use a piddle directly in a conditional expression since the truth value of a multi-element piddle is usually poorly defined. Instead PDLA has two very useful functions: any and all . Use these to test if any or all elements in a piddle fulfill some criterion:

pdla> $a=pdl ( 1, -2, 3);
pdla> print '$a has at least one element < 0' if (any $a < 0);
$a has at least one element < 0

pdla> print '$a is not positive definite' unless (all $a > 0);
$a is not positive definite

Q: 6.11 Logical operators and piddles - '||' and '&&' don't work!

It is a common problem that you try to make a mask array or something similar using a construct such as

$mask = which($piddle > 1 && $piddle < 2);   # incorrect

This does not work! What you are looking for are the bitwise operators '|' and '&', which work on an element-by-element basis. So it is really very simple: do not use the logical operators '||' and '&&' on multi-element piddles since that really doesn't make sense. Instead write the example as:

$mask = which($piddle > 1 & $piddle < 2);

which works correctly.

ADVANCED TOPICS

Q: 6.12 What is a null pdl ?

null is a special token for 'empty piddle'. A null pdl can be used to flag to a PDLA function that it should create an appropriately sized and typed piddle. Null piddles can be used in places where a PDLA function expects an output or temporary argument. Output and temporary arguments are flagged in the signature of a PDLA function with the [o] and [t] qualifiers (see next question if you don't know what the signature of a PDLA function is). For example, you can invoke the sumover function as follows:

sumover $a, $b=null;

which is equivalent to

$b = sumover $a;

If this seems still a bit murky check PDLA::Indexing and PDLA::PP for details about calling conventions, the signature and threading (see also below).

Q: 6.13 What is the signature of a PDLA function ?

The signature of a function is an important concept in PDLA. Many (but not all) PDLA functions have a signature which specifies the arguments and their (minimal) dimensionality. As an example, look at the signature of the maximum function:

'a(n); [o] b;'

This says that maximum takes two arguments, the first of which is (at least) one-dimensional while the second one is zero-dimensional and an output argument (flagged by the [o] qualifier). If the function is called with piddles of higher dimension, the function will be repeatedly called with slices of these piddles of appropriate dimension (this is called threading in PDLA).

For details and further explanations consult PDLA::Indexing and PDLA::PP .
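A small sketch of how the maximum signature plays out in practice, assuming PDLA is installed and exports sequence, maximum and null. The 3x2 input is arbitrary.

```perl
use strict;
use warnings;
use PDLA;   # assumption: PDLA is installed

# dims (3,2): row 0 holds 0 1 2, row 1 holds 3 4 5
my $img = sequence(3, 2);

# maximum consumes the first dimension (n); the extra dimension is threaded over,
# so one maximum per row comes back.
my $rowmax = maximum($img);
my $shape  = join ',', $rowmax->dims;

# The [o] argument can also be passed explicitly as a null piddle.
my $out = null();
maximum($img, $out);

print "dims: $shape, maxima: $rowmax\n";
```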

Q: 6.14 How can I subclass (inherit from) piddles?

The short answer is: read PDLA::Objects (e.g. type help PDLA::Objects in the perldla or pdla2 shell).

The longer answer (extracted from PDLA::Objects ): Since a PDLA object is an opaque reference to a C struct, it is not possible to extend the PDLA class by e.g. extra data via sub-classing (as you could do with a hash based Perl object). To circumvent this problem PDLA has built-in support to extend the PDLA class via the has-a relation for blessed hashes. You can get the HAS-A to behave like IS-A simply in that you assign the PDLA object to the attribute named PDLA and redefine the method initialize(). For example:

package FOO;

@FOO::ISA = qw(PDLA);
sub initialize {
   my $class = shift;
   my $self = {
      creation_time => time(),  # necessary extension :-)
      PDLA => PDLA->null,         # used to store PDLA object
   };
   bless $self, $class;
}

For another example check the script t/subclass.t in the PDLA distribution.

Q: 6.15 What on earth is this dataflow stuff ?

Dataflow is an experimental project that you don't need to concern yourself with (it should not interfere with your usual programming). However, if you want to know, have a look at PDLA::Dataflow . There are applications which will benefit from this feature (and it is already at work behind the scenes).

Q: 6.16 What is PDLA::PP?

Simple answer: PDLA::PP is both a glue between external libraries and PDLA and a concise language for writing PDLA functions.

Slightly longer answer: PDLA::PP is used to compile very concise definitions into XSUB routines implemented in C that can easily be called from PDLA and which automatically support threading, dataflow and other things without you having to worry about it.

For further details check PDLA::PP and the section below on Extensions of PDLA.

Q: 6.17 What happens when I have several references to the same PDLA object in different variables (cloning, etc?) ?

Piddles behave like Perl references in many respects. So when you say

$a = pdl [0,1,2,3];
$b = $a;

then both $b and $a point to the same object. For example, saying

$b++;

will *not* create a copy of the original piddle but just increment in place, of which you can convince yourself by saying

print $a;
[1 2 3 4]

This should not be mistaken for dataflow which connects several *different* objects so that data changes are propagated between the so linked piddles (though, under certain circumstances, dataflown piddles can share physically the same data).

It is important to keep the "reference nature" of piddles in mind when passing piddles into subroutines. If you modify the input piddles you modify the original argument, not a copy of it. This is different from some other array processing languages but makes for very efficient passing of piddles between subroutines. If you do not want to modify the original argument but rather a copy of it just create a copy explicitly (this example also demonstrates how to properly check for an explicit request to process inplace, assuming your routine can work inplace):

sub myfunc {
   my $pdl = shift;
   if ($pdl->is_inplace) {
      $pdl->set_inplace(0)
   } else {
      # modify a copy by default
      $pdl = $pdl->copy
   }
   $pdl->set(0,0);
   return $pdl;
}
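A short usage sketch of the routine above, assuming PDLA is installed. The sub is repeated so that the snippet runs on its own; the variables $x and $y are made up for this illustration:

```perl
use strict;
use warnings;
use PDLA;

# myfunc as defined above: zero the first element, working on a copy
# unless the caller explicitly requested inplace processing.
sub myfunc {
   my $pdl = shift;
   if ($pdl->is_inplace) {
      $pdl->set_inplace(0);
   } else {
      $pdl = $pdl->copy;
   }
   $pdl->set(0, 0);
   return $pdl;
}

my $x = pdl [1, 2, 3];

my $y = myfunc($x);       # default: $x is left untouched
print "$x $y\n";          # prints [1 2 3] [0 2 3]

myfunc($x->inplace);      # explicit request: $x itself is modified
print "$x\n";             # prints [0 2 3]
```

Calling inplace on the argument sets the flag that is_inplace checks, so the caller decides whether the original data may be overwritten.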

MISCELLANEOUS

Q: 6.18 What I/O formats are supported by PDLA ?

The current versions of PDLA already support quite a number of different I/O formats. However, it is not always obvious which module implements which formats. To help you find the right module for the format you require, here is a short list of the currently supported I/O formats with a hint as to which module contains the implementation:

  • A home-brew fast raw (binary) I/O format for PDLA is implemented by the FastRaw module.

  • The FlexRaw module implements generic methods for the input and output of `raw' data arrays. In particular, it is designed to read output from FORTRAN 77 UNFORMATTED files and the low-level C write function, even if the files are compressed or gzipped.

    It is possible that the FastRaw functionality will be included in the FlexRaw module at some time in the future.

  • FITS I/O is implemented by the wfits/rfits functions in PDLA::IO::FITS .

  • ASCII file I/O in various formats can be achieved by using the rcols and rgrep functions, both in PDLA::IO::Misc .

  • PDLA::IO::Pic implements an interface to the NetPBM/PBM+ filters to read/write several popular image formats; also supported is output of image sequences as MPEG movies, animated GIFs and a wide variety of other video formats.

  • On CPAN you can find the PDLA::NetCDF module that works with PDLA 2.007.

For further details consult the more detailed list in the PDLA::IO documentation or the documentation for the individual modules.
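As a rough illustration of two of the formats above, the following sketch writes a pair of piddles to an ASCII table and to a FastRaw file and reads them back. It assumes PDLA is installed and that wcols (the writing counterpart of rcols), writefraw and readfraw are available from PDLA::IO::Misc and PDLA::IO::FastRaw; the file names are arbitrary:

```perl
use strict;
use warnings;
use PDLA;
use PDLA::IO::Misc;       # rcols, wcols
use PDLA::IO::FastRaw;    # writefraw, readfraw
use File::Temp qw(tempdir);

my $dir = tempdir(CLEANUP => 1);   # scratch directory, removed at exit

my $x = sequence(5);               # [0 1 2 3 4]
my $y = $x ** 2;                   # [0 1 4 9 16]

# ASCII: one line per element, one column per piddle.
wcols($x, $y, "$dir/table.dat");
my ($x2, $y2) = rcols("$dir/table.dat", 0, 1);
print "$y2\n";                     # prints [0 1 4 9 16]

# FastRaw: binary data plus a small header file next to it.
writefraw($y, "$dir/y.raw");
my $y3 = readfraw("$dir/y.raw");
print "$y3\n";                     # prints [0 1 4 9 16]
```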

Q: 6.19 How can I stack a set of 2D arrays (images) into a 3D piddle?

Assuming all arrays are of the same size and in some format recognized by rpic (see PDLA::IO::Pic ) you could say:

use PDLA::IO::Pic;
@names = qw/name1.tif .... nameN.tif/;  # some file names
$dummy = PDLA->rpic($names[0]);
$cube = PDLA->zeroes($dummy->type,$dummy->dims,$#names+1); # make 3D piddle
for (0..$#names) {
    # this is the slice assignment
    ($tmp = $cube->slice(":,:,($_)")) .= PDLA->rpic($names[$_]);
}

or

$cube(:,:,($_)) .= PDLA->rpic($names[$_]);

for the slice assignment using the new PDLA::NiceSlice syntax and Lvalue assignments.

The for loop reads the actual images into a temporary 2D piddle whose values are then assigned (using the overloaded .= operator) to the appropriate slices of the 3D piddle $cube .
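The same pattern can be tried without any image files. The sketch below assumes PDLA is installed and uses three synthetic 4x3 "images" (made up for this illustration) in place of the rpic calls:

```perl
use strict;
use warnings;
use PDLA;

# Three fake 4x3 images; image number $_ has $_ added to every pixel.
my @images = map { sequence(4, 3) + $_ } 0 .. 2;

# 3D piddle with the type and 2D size of the first image.
my $cube = zeroes($images[0]->type, $images[0]->dims, scalar @images);

for my $i (0 .. $#images) {
    # slice assignment, as in the rpic example above
    (my $tmp = $cube->slice(":,:,($i)")) .= $images[$i];
}

print join('x', $cube->dims), "\n";   # prints 4x3x3
print $cube->at(1, 0, 2), "\n";       # prints 3
```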

Q: 6.20 Where are test files for the graphics modules?

This answer applies mainly to PDLA::Graphics::TriD (PDLA's device independent 3D graphics model) which is the trickiest one in this respect. You can find some test scripts in Demos/TriD in the distribution. There are also 3dtest.pl and line3d.pl in the PDLA/Example/TriD directory. After you have built PDLA you can do:

perl -Mblib Example/TriD/3dtest.pl

perl -Mblib Example/TriD/line3d.pl

to try the two TriD test programs. They only exercise one TriD function each but their simplicity makes it easy to debug if needed with the Perl debugger, see perldbug.

The programs in the Demo directory can be run most easily from the perldla or pdla2 interactive shell:

perl -Mblib perldla  or  perl -Mblib Perldl2/pdla2

followed by demo 3d or demo 3d2 at the prompt. demo by itself will give you a list of the available PDLA demos.

You can run the test scripts in the Demos/TriD directory manually by changing to that directory and running

perl -Mblib <testfile>

where <testfile> should match the pattern test[3-9].p, and watch the results. Some of the tests should bring up a window where you can control (twiddle) the 3D objects with the mouse. Try using mouse button 1 for turning the objects in 3D space, mouse button 3 to zoom in and out, and 'q' to advance to the next stage of the test.

Q: 6.21 What is TriD or PDLA::TriD or PDLA::Graphics::TriD?

Questions like this should be a thing of the past with the PDLA on-line help system in place. Just try (after installation):

un*x> pdla2
pdla> apropos trid

Check the output for promising hits and then try to look up some of them, e.g.

pdla> help PDLA::Graphics::TriD

Note that case matters with help but not with apropos .

Q: 6.22 PGPLOT does not write out PNG files.

There are a few sources of trouble with PGPLOT and PNG files. First, when compiling the pgplot libraries, make sure you uncomment the PNG entries in the drivers.list file. Then when running 'make' you probably got an error like

make: *** No rule to make target `png.h', needed by `pndriv.o'.  Stop.

To fix this, find the line in the 'makefile' that starts with 'pndriv.o:' (it's near the bottom). Change, for example, ./png.h to /usr/include/png.h, if that is where your header files are (you do have the libpng and libz devel packages, don't you?). Do this for all four entries on that line, then go back and run make.

Second, if you already have the PGPLOT Perl module and PDLA installed, you probably tried to write out a PNG file and got a fatal error message like:

undefined symbol: png_create_write_struct

This is because the PGPLOT Perl module does not automatically link against the png and z libraries. So when you are installing the PGPLOT Perl module (version 2.19) from CPAN, don't do install PGPLOT, but just do get PGPLOT. Then exit from CPAN and manually install PGPLOT, calling the makefile thusly:

perl Makefile.PL EXLIB=png,z EXDIR=/usr/lib

assuming that there exist files such as /usr/lib/libpng.so.*, /usr/lib/libz.so.*. Then do the standard make;make test;make install; sequence. Now you can write png files from PDLA!

EXTENSIONS OF PDLA

Q: 7.1 I am looking for a package to do XXX in PDLA. Where shall I look for it?

The first stop is again perldla or pdla2 and the on-line help or the PDLA documentation. There is already a lot of functionality in PDLA which you might not be aware of. The easiest way to look for functionality is to use the apropos command:

pdla> apropos 'integral'
ceil            Round to integral values in floating-point format
floor           Round to integral values in floating-point format
intover         Project via integral to N-1 dimensions
rint            Round to integral values in floating-point format

Since the apropos command is not a sophisticated search engine, make sure that you search on a couple of related topics and use short phrases.

However, there is a good chance that what you need is not part of the PDLA distribution. You are then well advised to check out http://pdl.perl.org where there is a list of packages using PDLA. If that does not solve your problem, ask on the mailing list. If nothing else, you might get assistance which will let you interface your package with PDLA yourself; see also the next question.

Q: 7.2 Can I access my C/FORTRAN library routines in PDLA?

Yes, you can. In fact, it is very simple for many straightforward applications. What you want is the PDLA pre-processor PP (PDLA::PP ). This will allow you to make a simple interface to your C routine.

The two functions you need to learn (at least at first) are pp_def and pp_done. pp_def defines the calling interface to the function, specifying input and output parameters, and contains the code that links to the external library. pp_done finishes the PP definitions. For details see the PDLA::PP man-page, but we also have a worked example here.

double eight_sum(int n)
{
     int i;
     double sum, x;

     sum = 0.0; x=0.0;
     for (i=1; i<=n; i++) {
       x++;
       sum += x/((4.0*x*x-1.0)*(4.0*x*x-1.0));
     } 
     return 1.0/sum;
}

Here we show an example of how to interface C code with PDLA. This is the first example and shows how to approximate the number 8...

The C code is shown above and is a simple function returning a double, and expecting an integer - the number of terms in the sum - as input. This function could be defined in a library or, as we do here, as an inline function.
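Before wrapping the routine it can be reassuring to check the arithmetic. The following plain Perl sketch (the Perl sub eight_sum is a stand-in written for this illustration and is not part of PDLA) evaluates the same sum. Each term x/(4x^2-1)^2 equals (1/8)*(1/(2x-1)^2 - 1/(2x+1)^2), so the sum telescopes and the returned value is 8/(1 - 1/(2n+1)^2), which approaches 8 as n grows:

```perl
use strict;
use warnings;

# Plain Perl version of the C function eight_sum shown above.
sub eight_sum {
    my ($n) = @_;
    my $sum = 0.0;
    for my $x (1 .. $n) {
        my $d = 4.0 * $x * $x - 1.0;
        $sum += $x / ($d * $d);
    }
    return 1.0 / $sum;
}

printf "%.6f\n", eight_sum(1);      # prints 9.000000
printf "%.6f\n", eight_sum(1000);   # prints 8.000002
```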

We will postpone the writing of the Makefile till later. First we will construct the .pd file. This is the file containing PDLA::PP code. We call this eight.pd .

# 
# pp_def defines a PDLA function. 
#
pp_addhdr (
'
double eight_sum(int n)
{
  int i;
  double sum, x;

  sum = 0.0; x=0.0;
  for (i=1; i<=n; i++) {
   x++; 
   sum += x/((4.0*x*x-1.0)*(4.0*x*x-1.0));
  }
 return 1.0/sum; 

}  
'); 

pp_def (
        'eight',
        Pars => 'int a(); double [o]b();',
        Code => '$b()=eight_sum($a());'
       );

# Always make sure that you finish your PP declarations with
# pp_done

pp_done();

A peculiarity of our example is that we have included the entire code with pp_addhdr instead of linking it in. This is only for the purposes of the example; in a typical application you will use pp_addhdr to include header files. Note that the argument to pp_addhdr is enclosed in quotes.

What is most important in this example is however the pp_def command. The first argument to this is the name of the new function eight , then comes a hash which contains the real meat:

  • Pars: This gives the input parameters (here a) and the output parameters (here b). The latter are indicated by the [o] specifier. Both arguments can have a type specification as shown here.

    Many variations and further flexibility in the interface can be specified. See perldoc PDLA::PP for details.

  • Code: This switch contains the code that should be executed. As you can see this is a rather peculiar mix of C and Perl, but essentially it is just as you would write it in C, except that the variables that are passed from PDLA are treated differently and have to be referred to with a preceding '$'.

    There are also simple macros to pass pointers to data and to obtain the values of other Perl quantities, see the manual page for further details.

Finally note the call to pp_done() at the end of the file. This is necessary in all PP files.

OK. So now we have a file with code that we dearly would like to use in Perl via PDLA. To do this we need to compile the function, and to do that we need a Makefile.

use PDLA::Core::Dev;
use ExtUtils::MakeMaker;
PDLA::Core::Dev->import();

$package = ["eight.pd", "Eight", "PDLA::Eight"];
%hash = pdlpp_stdargs($package);

WriteMakefile( %hash );

sub MY::postamble {pdlpp_postamble($package)};

The code above should go in a file called Makefile.PL, which should subsequently be called in the standard Perl way: perl Makefile.PL . This should give you a Makefile and running make should compile the module for you and make install will install it for you.

Q: 7.3 How can I interface package XXX in PDLA?

This question is closely related to the previous one, and as we said there, the PDLA::PP pre-processor is the standard way of interfacing external packages with PDLA. The most usual way to use PDLA::PP is to write a short interface routine, see the PDLA::PP perldoc page and the answer to the previous question for examples.

However it is also possible to interface a package to PDLA by re-writing your function in PDLA::PP directly. This can be convenient in certain situations, in particular if you have a routine that expects a function as input and you would like to pass the function a Perl function for convenience.

The PDLA::PP perldoc page is the main source of information for writing PDLA::PP extensions, but it is very useful to look for files in the distribution of PDLA as many of the core functions are written in PDLA::PP. Look for files that end in .pd which is the generally accepted suffix for PDLA::PP files. But we also have a simple example here.

The following example will show you how to write a simple function that automatically allows threading. To make this concise the example is of an almost trivial function, but the intention is to show the basics of writing a PDLA::PP interface.

We will write a simple function that calculates the minimum, maximum and average of a piddle. On my machine the resulting function is 8 times faster than the built-in function stats (of course the latter also calculates the median).

Let's jump straight in. Here is the code (from a file called quickstats.pd )

#
pp_def('quickstats',
     Pars => 'a(n); [o]avg(); [o]max(); [o]min()',
     Code => '$GENERIC(a) curmax, curmin;
              $GENERIC(a) tmp=0;
              loop(n) %{
                tmp += $a();
                if (!n || $a() > curmax) { curmax = $a();}
                if (!n || $a() < curmin) { curmin = $a();}
              %}
              $avg() = tmp/$SIZE(n);
              $max() = curmax;
              $min() = curmin;
             '
     );

pp_done();

The above might look like a confusing mixture of C and Perl, but behind the peculiar syntax lies a very powerful language. Let us take it line by line.

The first line declares that we are starting the definition of a PDLA::PP function called quickstats .

The second line is very important as it specifies the input and output parameters of the function. a(n) tells us that there is one input parameter that we will refer to as a which is expected to be a vector of length n (likewise matrices, both square and rectangular, would be written as a(n,n) and a(n,m) respectively). To indicate that something is an output parameter we put [o] in front of its name, so referring back to the code we see that avg, max and min are three output parameters, all of which are scalar (since they have no dimensional size indicated).

The third line starts the code definition which is essentially pure C but with a couple of convenient functions. $GENERIC is a function that returns the C type of its argument - here the input parameter a. Thus the first two lines of the code section are variable declarations.

The loop(n) construct is a convenience function that loops over the dimension called n in the parameter section. Inside this loop we calculate the cumulative sum of the input vector and keep track of the maximum and minimum values. Finally we assign the resulting values to the output parameters.

Finally we finish our function declaration with pp_done() .

To compile our new function we need to create a Makefile, which we will just list since its creation is discussed in an earlier question.

use PDLA::Core::Dev;
use ExtUtils::MakeMaker;
PDLA::Core::Dev->import();

$package = ["quickstats.pd", "Quickstats", "PDLA::Quickstats"];
%hash = pdlpp_stdargs($package);

WriteMakefile( %hash );

sub MY::postamble {pdlpp_postamble($package)};

An example Makefile.PL

Our new statistic function should now compile using the tried and tested Perl way: perl Makefile.PL; make .

You should experiment with this function, changing the calculations and input and output parameters. In conjunction with the PDLA::PP perldoc page this should allow you to quickly write more advanced routines directly in PDLA::PP.
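When experimenting, it helps to have reference values to compare against. The sketch below does not call quickstats itself; it assumes only that PDLA is installed and computes the same three quantities with the built-in reductions average, maximum and minimum, which thread over rows in the same way a function with the signature above would:

```perl
use strict;
use warnings;
use PDLA;

# Two rows of four values; each row is treated as one vector a(n).
my $data = pdl([ [1, 2, 3, 6],
                 [4, 0, 8, 4] ]);

my $avg = $data->average;   # mean of each row
my $max = $data->maximum;   # largest value of each row
my $min = $data->minimum;   # smallest value of each row

print "$avg $max $min\n";   # prints [3 4] [6 8] [1 0]
```

A compiled quickstats called on the same piddle should agree with these values for floating-point input.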

BUGS

If you find any inaccuracies in this document (or dysfunctional URLs) please report them to the PDLA mailing list pdl-general@lists.sourceforge.net.

ACKNOWLEDGMENTS

Achim Bohnet (ach@mpe.mpg.de ) for suggesting CoolHTML as a prettypodder (although we have switched to XML now) and various other improvements. Suggestions for some questions were taken from Perl FAQ and adapted for PDLA.

CONTRIBUTORS

Many people have contributed or given feedback on the current version of the FAQ. Here is an incomplete list of individuals whose contributions or posts to the mailing list have improved this FAQ at some point in time, listed alphabetically by first name: Christian Soeller, Chris Marshall, Doug Burke, Doug Hunt, Frank Schmauder, Jarle Brinchmann, John Cerney, Karl Glazebrook, Kurt Starsinic, Thomas Yengst, Tuomas J. Lukka.

AUTHOR AND COPYRIGHT

This document emerged from a joint effort of several PDLA developers (Karl Glazebrook, Tuomas J. Lukka, Christian Soeller) to compile a list of the most frequently asked questions about PDLA with answers. Permission is granted for verbatim copying (and formatting) of this material as part of PDLA.

Permission is explicitly not granted for distribution in book or any corresponding form. Ask on the PDLA mailing list pdl-general@lists.sourceforge.net if some of the issues covered in here are unclear.