NAME

PDL::NetCDF - Object-oriented interface between NetCDF files and PDL objects.

Perl extension to allow interface to NetCDF portable binary gridded files via PDL objects.

SYNOPSIS

use PDL;
use PDL::NetCDF;
use PDL::Char;

my $ncobj = PDL::NetCDF->new ("test.nc", {REVERSE_DIMS => 1, PDL_BAD => 1});  # New file
my $pdl = pdl [[1, 2, 3], [4, 5, 6]];

# Specify variable name to put PDL in, plus names of the dimensions.  Dimension         
# lengths are taken from the PDL, in this case, dim1 = 2 and dim2 = 3.      
$ncobj->put ('var1', ['dim1', 'dim2'], $pdl);
# or for netcdf4 files
# $ncobj->put ('var1', ['dim1', 'dim2'], $pdl, {DEFLATE => 9, _FillValue => -999});

# get the deflate level (for any fileformat)
my ($deflate, $shuffle) = $ncobj->getDeflateShuffle('var1');

# $pdlout = [[1, 2, 3], [4, 5, 6]]
my $pdlout = $ncobj->get ('var1');

# Store textual NetCDF arrays using perl strings:  (This is a bit primitive, but works)
my $str = "Station1  Station2  Station3  ";
$obj->puttext('textvar', ['n_station', 'n_string'], [3,10], $str);
my $outstr = $obj->gettext('textvar');
# $outstr = "Station1  Station2  Station3  "

# Now textual NetCDF arrays can be stored with PDL::Char style PDLs.  This is much
# more natural and flexible than the above method.
$str = PDL::Char->new (['Station1', 'Station2', 'Station3']);
$obj->put ('stations', ['dim_station', 'dim_charlen'], $str);
$outstr = $obj->get('stations');
print $outstr;
# Prints: ['Station1', 'Station2', 'Station3']
# For more info on PDL::Char variables see PDL::Char(3), or perldoc PDL::Char

# $dim1size = 2
my $dim1size = $ncobj->dimsize('dim1');

# A slice of the netCDF variable.
# [0,0] is the starting point, [1,2] is the count.
# $slice = [1,2]
my $slice  = $ncobj->get ('var1', [0,0], [1,2]);

# Attach a double attribute of size 3 to var1
$ncobj->putatt (double([1,2,3]), 'double_attribute', 'var1');

# $attr1 = [1,2,3]
my $attr1 = $ncobj->getatt ('double_attribute', 'var1');

# $type = PDL::double
my $type = $ncobj->getvariabletype('var1');

# Write a textual, global attribute.  'attr_name' is the attribute name.
$ncobj->putatt ('The text of the global attribute', 'attr_name');          

# $attr2 = 'The text of the global attribute'
my $attr2 = $ncobj->getatt ('attr_name');

# Close the netCDF file.  The file is also automatically closed in a DESTROY block
# when it passes out of scope.  This just makes is explicit.
$ncobj->close;

For (much) more information on NetCDF, see

http://www.unidata.ucar.edu/packages/netcdf/index.html

Also see the test file, test.pl in this distribution for some working examples.

DESCRIPTION

This is the PDL interface to the Unidata NetCDF library. It uses the netCDF version 3 library to make a subset of netCDF functionality available to PDL users in a clean, object-oriented interface.

Another NetCDF perl interface, which allows access to the entire range of netCDF functionality (but in a non-object-oriented style which uses perl arrays instead of PDLs) is available through Unidata at http://www.unidata.ucar.edu/packages/netcdf/index.html).

The NetCDF standard allows N-dimensional binary data to be efficiently stored, annotated and exchanged between many platforms.

When one creates a new netCDF object, this object is associated with one netCDF file.

FUNCTIONS

isNetcdf4

Check if compiled against netcdf4

Arguments: none

if (PDL::NetCDF::isNetcdf4) {
	# open netcdf4 file
}

defaultFormat

Get or change the default format when creating a netcdf-file. This can be overwritten by the NC_FORMAT option for new. Possible values are: PDL::NetCDF::NC_FORMAT_CLASSIC, PDL::NetCDF::NC_FORMAT_64BIT, PDL::NetCDF::NC_FORMAT_NETCDF4, PDL::NetCDF::NC_FORMAT_NETCDF4_CLASSIC

Arguments:
1) new format (constant)
Return:
old format as one of the NC_FORMAT_* constants

new

Create an object representing a netCDF file.

Arguments:  
1) The name of the file.
2) optional:  A hashref containing options.  Currently defined are:
   TEMPLATE:
   An existing netCDF object for a file with
   identical layout.  This allows one to read in many similar netCDF
   files without incurring the overhead of reading in all variable
   and dimension names and IDs each time.  Caution:  Undefined
   weirdness may occur if you pass the netCDF object from a dissimilar
   file!
   MODE:
   use sysopen file-opening arguments, O_RDONLY, O_RDWR, O_CREAT, O_EXCL
   when used, this will overwrite the '>file.nc' type of opening
   see L<perlopentut> for usage of O_RDONLY...
   REVERSE_DIMS:
   this will turn the order of the dimension-names of
   netcdf-files. Even with this option the 'put' function will write
   variables in FORTRAN order (as before) and will reverse the
   dimension names so they fit this order.  With this option, the
   'putslice' function will write variables in the same way as 'put'.
   You should use this option if your planning to work with other
   netcdf-programs (ncview, NCL) or if you are planning to combine
   putslice and slice.  You should _not_ use this option, if you need
   compatibility to older versions of PDL::NetCDF.
   NC_FORMAT:
   set the file format for a new netcdf file, see defaultFormat()
   SLOW_CHAR_FETCH:
   If this option is set, then a 'get' into a PDL::Char will be done
   one string at a time instead of all text data at once.  This
   is necessary if there are NULLs (hex 0) values embedded in the string
   arrays.  This takes longer, but gives the correct results.  If
   the fetch of a string array yields only the first element, try setting
   this option.
   PDL_BAD:
   _FillValue's or missing_values are translated to bad-pdls.

Example:

  my $nc = PDL::NetCDF->new ("file1.nc", {REVERSE_DIMS => 1, PDL_BAD => 1});
  ...
  foreach my $ncfile (@a_bunch_of_similar_format_netcdf_files) {
    $nc = PDL::NetCDF->new("file2.nc", {TEMPLATE => $nc});  # These calls to 'new' are *much* faster
    ...
  }

  # opening using MODE
  use Fcntl; # define O_CREAT...
  # opening a completely new file (deleting if it exists!)
  my $newnc = PDL::NetCDF->new ("file2.nc", {MODE => O_CREAT|O_RDWR,
					     REVERSE_DIMS => 1, NC_FORMAT => PDL::NetCDF::NC_FORMAT_NETCDF4});
  # opening existing file for reading and writing
  $nc = PDL::NetCDF->new ("file2.nc", {MODE => O_RDWR}
			REVERSE_DIMS => 1});
  # opening existing file for reading only
  $nc = PDL::NetCDF->new ("file2.nc", {MODE => O_RDONLY,
				       REVERSE_DIMS => 1});

If this file exists and you want to write to it, prepend the name with the '>' character: ">name.nc"

Returns: The netCDF object. Barfs if there is an error.

getFormat

Get the format of a netcdf file

Arguments: none

Returns: @ integer equal to one of the PDL::NetCDF::NC_FORMAT_* constants.

put

Put a PDL matrix to a netCDF variable.

Arguments:

1) The name of the variable to create

2) A reference to a list of dimension names for this variable

3) The PDL to put. It must have the same number of dimensions as specified in the dimension name list.

4) Optional options hashref: {SHUFFLE => 1, DEFLATE => 7, COMPRESS => 0, _FillValue => -32767}

Returns: None.

my $pdl = pdl [[1, 2, 3], [4, 5, 6]];

# Specify variable name to put PDL in, plus names of the dimensions.  Dimension         
# lengths are taken from the PDL, in this case, dim1 = 2 and dim2 = 3.      
$ncobj->put ('var1', ['dim1', 'dim2'], $pdl);                                               
                                          
# Now textual NetCDF arrays can be stored with PDL::Char style PDLs.  
$str = PDL::Char->new (['Station1', 'Station2', 'Station3']);
$obj->put ('stations', ['dim_station', 'dim_charlen'], $str);
$outstr = $obj->get('stations');
print $outstr;
# Prints: ['Station1', 'Station2', 'Station3']
# For more info on PDL::Char variables see PDL::Char(3), or perldoc PDL::Char

putslice

Put a PDL matrix to a slice of a NetCDF variable

Arguments:

1) The name of the variable to create

2) A reference to a list of dimension names for this variable

3) A reference to a list of dimensions for this variable

4) A reference to a list which specifies the N dimensional starting point of the slice.

5) A reference to a list which specifies the N dimensional count of the slice.

6) The PDL to put. It must conform to the size specified by the 4th and 5th arguments. The 2nd and 3rd argument are optional if the variable is already defined in the netcdf object.

7) Optional options: {DEFLATE => 7, SHUFFLE => 0/1, _FillValue => -32767} will use gzip compression (level 7) on that variable and shuffle will not/will use the shuffle filter. These options are only valid for netcdf4 files. If you are unsure, test with ($nc->getFormat >= PDL::NetCDF::NC_FORMAT::NC_FORMAT_NETCDF4)

In addition, netcdf4 does not allow changing the _FillValue attribute
after the variable has been put/putslice'd. Therefore, the _FillValue
can be set with an option to put/putslice.

Returns: None.

 my $pdl = pdl [[1, 2, 3], [4, 5, 6]];

 # Specify variable name to put PDL in, plus names of the dimensions.  Dimension         
 # lengths are taken from the PDL, in this case, dim1 = 2 and dim2 = 3.      
 $ncobj->putslice ('var1', ['dim1', 'dim2', 'dim3'], [2,3,3], [0,0,0], [2,3,1], $pdl);                                               
 $ncobj->putslice ('var1', [], [], [0,0,2], [2,3,1], $pdl);                                               

 my $pdl2 = $ncobj->get('var1');

 print $pdl2;

 [
[
 [          1 9.96921e+36           1]
 [          2 9.96921e+36           2]
 [          3 9.96921e+36           3]
]
[
 [          4 9.96921e+36           4]
 [          5 9.96921e+36           5]
 [          6 9.96921e+36           6]
]
]

note that the netcdf missing value (not 0) is filled in.    

sync

Synchronize the data to the disk. Use this if you want to read the file from another process without closing the file. This makes only sense after put, puttext, putslice, putatt operations

Returns: nothing. Barfs on error.

$ncobj->sync

get

Get a PDL matrix from a netCDF variable.

Arguments:

1) The name of the netCDF variable to fetch. If this is the only argument, then the entire variable will be returned.

To fetch a slice of the netCDF variable, optional 2nd and 3rd arguments must be specified:

2) A pdl which specifies the N dimensional starting point of the slice.

3) A pdl which specifies the N dimensional count of the slice.

Also, an options hashref may be passed. The option 'NOCOMPRESS' tells PDL::NetCDF to *not* try to uncompress an compressed variable. See the COMPRESS option on 'put' and 'putslice' for more info. The option 'PDL_BAD' tells PDL::NetCDF to translate _FillValue or missing_value attributes to bad-values, e.g. NaN's.

Returns: The PDL representing the netCDF variable. Barfs on error.

# A slice of the netCDF variable.
# [0,0] is the starting point, [1,2] is the count.
my $slice  = $ncobj->get ('var1', [0,0], [1,2], {NOCOMPRESS => 1, PDL_BAD => 1});

# If var1 contains this:  [[1, 2, 3], [4, 5, 6]]
# Then $slice contains: [1,2] (Size '1' dimensions are eliminated).

putatt

putatt -- Attach a numerical or textual attribute to a NetCDF variable or the entire file.

Arguments:

1) The attribute. Either: A one dimensional PDL (perhaps containing only one number) or a string.

2) The name to give the attribute in the netCDF file. Many attribute names have pre-defined meanings. See the netCDF documentation for more details.

3) Optionally, you may specify the name of the pre-defined netCDF variable to associate this attribute with. If this is left off, the attribute is a global one, pertaining to the entire netCDF file.

Returns: Nothing. Barfs on error.

# Attach a double attribute of size 3 to var1
$ncobj->putatt (double([1,2,3]), 'double_attribute', 'var1');

# Write a textual, global attribute.  'attr_name' is the attribute name.
$ncobj->putatt ('The text of the global attribute', 'attr_name');          

getatt

Get an attribute from a netCDF object.

Arguments:

1) The name of the attribute (a text string).

2) The name of the variable this attribute is attached to. If this argument is not specified, this function returns a global attribute of the input name.

# Get a global attribute
my $attr2 = $ncobj->getatt ('attr_name');

# Get an attribute associated with the variable 'var1'
my $attr1 = $ncobj->getatt ('double_attribute', 'var1');

getDeflateShuffle

Get the deflate level and the shuffle flag for a variable.

Can be called on all files, although only netcdf4 files support shuffle and deflate.

Arguments:

1) The name of the variable.

Returns:

($deflate, $shuffle)

my ($deflate, $shuffle) = $nc->getDeflateShuffle('varName');

getvariabletype

Get a type of a variable from a netCDF object.

Arguments:

1) The name of the variable.

Returns: PDL::type or undef, when variable not defined

# Get a type
my $type = $ncobj->getvariabletype ('var1');

puttext

Put a perl text string into a multi-dimensional NetCDF array.

Arguments:

1) The name of the variable to be created (a text string).

2) A reference to a perl list of dimension names to use in creating this NetCDF array.

3) A reference to a perl list of dimension lengths.

4) A perl string to put into the netCDF array. If the NetCDF array is 3 x 10, then the string must have 30 charactars.

5) Optional nc4 options: {DEFLATE => 7, SHUFFLE => 0}

my $str = "Station1  Station2  Station3  ";
$obj->puttext('textvar', ['n_station', 'n_string'], [3,10], $str);

gettext

Get a multi-dimensional NetCDF array into a perl string.

Arguments:

1) The name of the NetCDF variable.

my $outstr = $obj->gettext('textvar');

dimsize

Get the size of a dimension from a netCDF object.

Arguments:

1) The name of the dimension.

Returns: The size of the dimension.

my $dim1size = $ncobj->dimsize('dim1');

close

Close a NetCDF object, writing out the file.

Arguments: None

Returns: Nothing

This closing of the netCDF file can be done explicitly though the 'close' method. Alternatively, a DESTROY block does an automatic close whenever the netCDF object passes out of scope.

$ncobj->close();

getdimensionnames ([$varname])

Get all the dimension names from an open NetCDF object. If a variable name is specified, just return dimension names for *that* variable.

Arguments: none

Returns: An array reference of dimension names

my $varlist = $ncobj->getdimensionnames();
foreach(@$varlist){
  print "Found dim $_\n";
}

getattributenames

Get the attribute names for a given variable from an open NetCDF object.

Arguments: Optional variable name, with no arguments it will return the objects global netcdf attributes.

Returns: An array reference of attribute names

my $attlist = $ncobj->getattributenames('var1');

getvariablenames

Get all the variable names for an open NetCDF object.

Arguments: none.

Returns: An array reference of variable names

my $varlist = $ncobj->getvariablenames();

setrec

Set up a 'record' of several 1D netCDF variables with the same dimension. Once this is set up, quick reading/writing of one element from all variables can be put/get from/to a perl list.

Arguments:

1) The names of all the netCDF variables to group into a record

Returns: A record name to use in future putrec/getrec calls

my $rec = $ncobj->setrec('var1', 'var2', 'var3');

getrec

Gets a 'record' (one value from each of several 1D netCDF variables) previously set up using 'setrec'. These values are returned in a perl list.

Arguments:

1) The name of the record set up in 'setrec'.
2) The index to fetch.

Returns: A perl list of all values. Note that these variables can be of different types: float, double, integer, string.

my @rec = $ncobj->getrec($rec, 5);

putrec

Puts a 'record' (one value from each of several 1D netCDF variables) previously set up using 'setrec'. These values are supplied as a perl list reference.

Arguments:

1) The name of the record set up in 'setrec'.
2) The index to set.
3) A perl list ref containing the values.

Returns: None.

$ncobj->putrec($rec, 5, \@values);

WRITING NetCDF-FILES EFFICIENTLY

Writing several variables to NetCDF-files can take a long time. When a new variable is attached by put to a file, the attribute header has to be written. This might force the internal netcdf-library to restructure the complete file, and thus might take very much IO-resources. By pre-defining the dimensions, attributes, and variables, much time can be saved. Essentially the rule of thumb is to define and write the data in the order it will be laid out in the file. Talking PDL::NetCDF, this means the following:

Open the netcdf file
    my $nc = new PDL::NetCDF('test.nc', {MODE => O_CREAT|O_RDWR,
					 REVERSE_DIMS => 1});
Write the global attributes
$nc->putatt (double([1,2,3]), 'double_attribute');
Define all variables, make use of the NC_UNLIMITED dimension
   # here it is possible to choose float/double/short/long
   $pdl_init = long ([]);  
   for (my $i=0; $i<$it; $i++) {
       my $out2 = $nc->putslice("VAR$i",
	   		     ['x','y','z','t'],
		 	     [150,100,20,PDL::NetCDF::NC_UNLIMITED()],
			     [0,0,0,0],[1,0,0,0],$pdl_init);
   }
Write the variable-attributes
$nc->putatt ("var-attr", 'attribute', 'VAR0'); 
Write data with putslice
$nc->putslice("VAR5",[],[],[0,0,0,0],[$datapdl->dims],$datapdl);

AUTHOR

Doug Hunt, dhunt\@ucar.edu.

CONTRIBUTORS

Heiko Klein, heiko.klein\@met.no Edward Baudrez, Royal Meteorological Institute of Belgium, edward.baudrez\@meteo.be Ed J (mohawk2), etj@cpan.org

SEE ALSO

perl(1), PDL(1), netcdf(3).