NAME
Spreadsheet::WriteExcel - Write text and numbers to a cross-platform Excel binary file.
VERSION
This document refers to version 0.21 of Spreadsheet::WriteExcel, released October 1, 2000.
SYNOPSIS
To write a string and a number to the first worksheet in an Excel workbook called perl.xls:
use Spreadsheet::WriteExcel;
$row1 = $col1 = 0;
$row2 = 1;
my $workbook = Spreadsheet::WriteExcel->new("perl.xls");
$worksheet = $workbook->addworksheet();
$worksheet->write($row1, $col1, "Hi Excel!");
$worksheet->write($row2, $col1, 1.2345);
DESCRIPTION
The Spreadsheet::WriteExcel module can be used to write numbers and text in the native Excel binary file format. Multiple worksheets can be added to a workbook. Formatting of cells is not yet supported.
The Excel file produced by this module is compatible with Excel 5, 95, 97 and 2000.
The module will work on the majority of Windows, UNIX and Macintosh platforms. Generated files are also compatible with the Linux/UNIX spreadsheet applications Star Office, Gnumeric and XESS. The generated files are not compatible with MS Access.
WORKBOOK METHODS
The Spreadsheet::WriteExcel module provides an object oriented interface to a new Excel workbook.The following methods are available through a new workbook.
new()
A new Excel workbook is created using the new()
constructor as follows:
my $workbook = Spreadsheet::WriteExcel->new("filename.xls");
Note my
is required to allocate a new workbook regardless of whether the strict
pragma is in operation or not.
You can redirect the output to STDOUT using the special Perl filehandle "-"
. This can be useful for CGIs which have a Content-type of application/vnd.ms-excel
, for example:
#!/usr/bin/perl -w
use strict;
use Spreadsheet::WriteExcel;
print "Content-type: application/vnd.ms-excel\n\n";
my $workbook = Spreadsheet::WriteExcel->new("-");
$workbook->write(0, 0, "Hi Excel!");
addworksheet($sheetname)
At least one worksheet should be added to a new workbook:
$worksheet1 = $workbook->addworksheet(); # Sheet1
$worksheet2 = $workbook->addworksheet('Foglio2'); # Foglio2
$worksheet3 = $workbook->addworksheet('Data'); # Data
$worksheet4 = $workbook->addworksheet(); # Sheet4
If $sheetname
is not specified the default Excel convention will be followed, i.e. Sheet1, Sheet2, etc.
close()
The close()
method can be called to explicitly close an Excel file. Otherwise the file will be closed automatically when the object reference goes out of scope or when the program ends.
$workbook->close();
In general it is only necessary to explicitly close a file if you want to perform some other operation on it such as copying or checking the size.
worksheets()
The worksheets()
method returns a reference to the array of worksheets in a workbook. This can be useful if you want to repeat an operation on each worksheet in a workbook or where you wish to refer to a worksheet by its index:
foreach $worksheet (@{$workbook->worksheets()}) {
$worksheet->write(0, 0, "Hello");
}
# or:
$worksheets = $workbook->worksheets();
@$worksheets[0]->write(0, 0, "Hello");
References are explained in detail in perlref
and perlreftut
in the main Perl documentation.
WORKSHEET METHODS
The following methods are available through to a new worksheet.
write($row, $column, $token)
The write()
method calls write_number()
if $token
matches the following regex:
$token =~ /^([+-]?)(?=\d|\.\d)\d*(\.\d*)?([Ee]([+-]?\d+))?$/
Otherwise it calls write_string()
:
$worksheet->write(0, 0, "Hello" ); # write_string()
$worksheet->write(1, 0, "One" ); # write_string()
$worksheet->write(2, 0, 2 ); # write_number()
$worksheet->write(3, 0, 3.00001); # write_number()
It should be noted that $row
and $column
are zero indexed cell locations for the write
methods. Thus, Cell A1 is (0, 0) and Cell AD2000 is (1999, 29). Cells can be written to in any order. They can also be overwritten.
The write
methods return:
0 for success
-1 for insufficient number of arguments
-2 for row or column out of bounds
-3 for string too long.
write_number($row, $column, $number)
Write an integer or a float to the cell specified by $row
and $column
:
$worksheet->write_number(0, 0, 1 );
$worksheet->write_number(1, 0, 2.3451);
write_string($row, $column, $string)
Write a string to the cell specified by $row
and $column
:
$worksheet->write_string(0, 0, "Your text here" );
The maximum string size is 255 characters.
activate()
The activate()
method is used to specify which worksheet is initially selected in a multi-sheet workbook:
$worksheet1 = $workbook->addworksheet('To');
$worksheet2 = $workbook->addworksheet('the');
$worksheet3 = $workbook->addworksheet('wind');
$worksheet3->activate();
This is similar to the Excel VBA activate method. The default value is the first worksheet.
set_first_sheet()
The activate()
method determines which worksheet is initially selected. However, if there are a large number of worksheets the selected worksheet may not appear on the screen. To avoid this you can select which is the leftmost visible worksheet using set_first_sheet()
:
for (1..20) {
$workbook->addworksheet;
}
$worksheet21 = $workbook->addworksheet();
$worksheet22 = $workbook->addworksheet();
$worksheet21->set_first_sheet();
$worksheet22->activate();
This method is not required very often. The default value is the first worksheet.
set_selection($first_row, $first_col, $last_row, $last_col);
This method can be used to specify which cell or cells are selected in a worksheet. The most common requirement is to select a single cell, in which case $last_row
and $last_col
are not required. The active cell within a selected range is determined by the order in which $first
and $last
are specified:
$worksheet1->set_selection(3, 3);
$worksheet2->set_selection(3, 3, 6, 6);
$worksheet3->set_selection(6, 6, 3, 3);
The default is cell (0, 0).
set_col_width($first_col, $last_col, $width);
This method can be used to specify the width of a single column or a range of columns. If the method is applied to a single column the value of $first_col
and $last_col
should be the same:
$worksheet->set_col_width(0, 0, 20);
$worksheet->set_col_width(1, 3, 30);
The width corresponds to the column width value that is specified in Excel. It is approximately equal to the length of a string in the default font of Arial 10.
EXAMPLES
The following is a general example which demonstrates most of the features of the Spreadsheet::WriteExcel module:
#!/usr/bin/perl -w
use strict;
use Spreadsheet::WriteExcel;
# Create a new Excel workbook
my $workbook = Spreadsheet::WriteExcel->new("regions.xls");
# Add some worksheets
my $north = $workbook->addworksheet("North");
my $south = $workbook->addworksheet("South");
my $east = $workbook->addworksheet("East");
my $west = $workbook->addworksheet("West");
# Add a caption to each worksheet
foreach my $worksheet (@{$workbook->worksheets()}) {
$worksheet->write(0, 0, "Sales");
}
# Write some data
$north->write(0, 1, 200000);
$south->write(0, 1, 100000);
$east->write (0, 1, 150000);
$west->write (0, 1, 100000);
# Set the active worksheet
$south->activate();
# Set the width of the first column
$south->set_col_width(0, 0, 20);
# Set the active cell
$south->set_selection(0, 1);
The following example converts a tab separated file called tab.txt
into an Excel file called tab.xls
.
#!/usr/bin/perl -w
use strict;
use Spreadsheet::WriteExcel;
open (TABFILE, "tab.txt") or die "tab.txt: $!";
my $workbook = Spreadsheet::WriteExcel->new("tab.xls");
my $worksheet = $workbook->addworksheet();
# Row and column are zero indexed
my $row = 0;
while (<TABFILE>) {
chomp;
# Split on single tab
my @Fld = split('\t', $_);
my $col = 0;
foreach my $token (@Fld) {
$worksheet->write($row, $col, $token);
$col++;
}
$row++;
}
LIMITATIONS
The following limits are imposed by Excel or the version of the BIFF file that has been implemented:
Description Limit Source
----------------------------------- ------ -------
Maximum number of chars in a string 255 Excel 5
Maximum number of columns 256 Excel 5, 97
Maximum number of rows in Excel 5 16,384 Excel 5
Maximum number of rows in Excel 97 65,536 Excel 97
The minimum file size is 6K due to the OLE overhead. The maximum file size is approximately 7MB (7087104 bytes) of BIFF data.
PORTABILITY
WriteExcel.pm will only work on systems where perl packs floats in 64 bit IEEE format. The float must also be in little-endian format but WriteExcel.pm will reverse it as necessary. Thus:
print join(" ", map { sprintf "%#02x", $_ } unpack("C*", pack "d", 1.2345)), "\n";
should give (or in reverse order):
0x8d 0x97 0x6e 0x12 0x83 0xc0 0xf3 0x3f
In general, if you don't know whether your system supports a 64 bit IEEE float or not, it probably does. If your system doesn't WriteExcel will croak()
with the message given in the Diagnostics section.
DIAGNOSTICS
- Filename required in WriteExcel('Filename')
-
A filename must be given in the constructor.
- Can't open filename. It may be in use.
-
The file cannot be opened for writing. It may be protected or already in use.
- Required floating point format not supported on this platform.
-
Operating system doesn't support 64 bit IEEE float or it is byte-ordered in a way unknown to WriteExcel.
- Maximum file size, 7087104, exceeded.
-
The current OLE implementation only supports a maximum BIFF file of this size.
THE EXCEL BINARY FORMAT
Excel data is stored in the "Binary Interchange File Format" (BIFF) file format. Details of this format are given in the Excel SDK, the "Excel Developer's Kit" from Microsoft Press. It is also included in the MSDN CD library but is no longer available on the MSDN website. An older version of the BIFF documentation is available at http://www.cubic.org/source/archive/fileform/misc/excel.txt
Issues relating to the Excel SDK are discussed, occasionally, at news://microsoft.public.excel.sdk
The BIFF portion of the Excel file is comprised of contiguous binary records that have different functions and that hold different types of data. Each BIFF record is comprised of the following three parts:
Record name; Hex identifier, length = 2 bytes
Record length; Length of following data, length = 2 bytes
Record data; Data, length = variable
The BIFF data is stored along with other data in an OLE Compound File. This is a structured storage which acts like a file system within a file. A Compound File is comprised of storages and streams which, to follow the file system analogy, are like directories and files.
The documentation for the OLE::Storage module, http://user.cs.tu-berlin.de/~schwartz/pmh/guide.html , contains one of the few descriptions of the OLE Compound File in the public domain.
Another useful source is the filters project http://arturo.directmail.org/filtersweb/
The source code for the Excel plugin of the Gnumeric spreadsheet also contains information relevant to the Excel BIFF format and the OLE container, http://www.gnumeric.org/
The soon to be GPLed source code for Star Office should also be of interest, http://www.openoffice.org/
Please note that the provision of this information does not constitute an invitation to start hacking at the BIFF or OLE file formats. There are more interesting ways to waste your time. ;)
WRITING EXCEL FILES
Depending on your requirements, background and general sensibilities you may prefer one of the following methods of getting data into Excel:
* CSV, comma separated variables or text. If the file extension is csv
, Excel will open and convert this format automatically.
* DBI, ADO or ODBC. Connect to an Excel file as a database. Using the appropriate driver Excel will behave like a database.
* HTML tables. This is an easy way of adding formatting.
* Win32::OLE module and office automation. See, the section "Reading Excel Files".
READING EXCEL FILES
Despite the title of this module the most commonly asked questions are in relation to reading Excel files. To read data from Excel files try:
* DBI, ADO or ODBC. Connect to an Excel file as a database. Using the appropriate driver Excel will behave like a database.
* HTML tables. If the files are saved from Excel in a HTML format the data can be accessed using HTML::TableExtract http://search.cpan.org/search?dist=HTML-TableExtract
* OLE::Storage, aka LAOLA. This is a Perl interface to OLE file formats. In particular, the distro contains an Excel to HTML converter called Herbert, http://user.cs.tu-berlin.de/~schwartz/pmh/ There is also an open source C/C++ project based on the LAOLA work. Try the Filters Project at http://arturo.directmail.org/filtersweb/ and the xlHtml Project at http://www.xlhtml.org/ The xlHtml filter is more complete than Herbert.
* Win32::OLE module and office automation. This requires a Windows platform and an installed copy of Excel. This is the most powerful and complete method for interfacing with Excel. See http://www.activestate.com/Products/ActivePerl/docs/faq/Windows/ActivePerl-Winfaq12.html and http://www.activestate.com/Products/ActivePerl/docs/site/lib/Win32/OLE.html
If your main platform is UNIX but you have the resources to set up a separate Win32/MSOffice server, you can convert office documents to text, postscript or PDF using Win32:OLE. For a demonstration of how to do this using Perl see Docserver: http://search.cpan.org/search?mode=module&query=docserver
If you wish to view Excel files on a UNIX/Linux platform check out the excellent Gnumeric spreadsheet application at http://www.gnumeric.org/gnumeric
If you wish to view Excel files on Windows platforms which don't have Excel installed you can use the free Microsoft Excel Viewer http://officeupdate.microsoft.com/downloadDetails/xlviewer.htm
BUGS
The Excel files that are produced by this module are not compatible with MS Access. Use DBI or ODBC instead.
The lack of a portable way of writing a little-endian 64 bit IEEE float.
QuickView: If you wish to write files are fully compatible with QuickView it is necessary to write the cells in a sequential row by row order.
TO DO
This module will be extended to include the following, probably in this order:
1. Cell and font formatting
2. Row and column formatting
3. Unlimited file size
4. Document summary information
5. Formulas (hopefully)
Items 1. and 2. should be ready be Nov 1 2000 unless you are reading this in December 2000.
ACKNOWLEDGEMENTS
The following people contributed to the debugging and testing of WriteExcel.pm:
Arthur@ais, Artur Silveira da Cunha, CPAN testers, Daniel Gardner, Harold Bamford, Johan Ekenberg, John Wren, Michael Buschauer, Mike Blazer, Paul J. Falbe.
AUTHOR
John McNamara jmcnamara@cpan.org
I have eaten
the plums
that were in
the icebox
and which
you were probably
saving
for breakfast
Forgive me
they were delicious
so sweet
and so cold
- William Carlos Williams
COPYRIGHT
Copyright (c) 2000, John McNamara. All Rights Reserved. This module is free software. It may be used, redistributed and/or modified under the same terms as Perl itself.