NAME

SQL::Statement::Functions - built-in & user-defined SQL functions

SYNOPSIS

SELECT Func(args);
SELECT * FROM Func(args);
SELECT * FROM x WHERE Funcs(args);
SELECT * FROM x WHERE y < Funcs(args);

DESCRIPTION

This module contains the built-in functions for SQL::Parser and SQL::Statement. All of the functions are also available in any DBDs that subclass those modules (e.g. DBD::CSV, DBD::DBM, DBD::File, DBD::AnyData, DBD::Excel, etc.).

This documentation covers built-in functions and also explains how to create your own functions to supplement the built-in ones. It's easy. If you create one that is generally useful, see below for how to submit it to become a built-in function.

Function syntax

When using SQL::Statement/SQL::Parser directly to parse SQL, functions (either built-in or user-defined) may occur anywhere in a SQL statement that values, column names, table names, or predicates may occur. When using the modules through a DBD or in any other context in which the SQL is both parsed and executed, functions can occur in the same places except that they can not occur in the column selection clause of a SELECT statement that contains a FROM clause.

# valid for both parsing and executing

    SELECT MyFunc(args);
    SELECT * FROM MyFunc(args);
    SELECT * FROM x WHERE MyFuncs(args);
    SELECT * FROM x WHERE y < MyFuncs(args);

# valid only for parsing (won't work from a DBD)

    SELECT MyFunc(args) FROM x WHERE y;

User-Defined Functions

Loading User-Defined Functions

In addition to the built-in functions, you can create any number of your own user-defined functions (UDFs). In order to use a UDF in a script, you first have to create a perl subroutine (see below), then you need to make the function available to your database handle with the CREATE FUNCTION or LOAD commands:

# load a single function "foo" from a subroutine
# named "foo" in the current package

     $dbh->do(" CREATE FUNCTION foo EXTERNAL ");

# load a single function "foo" from a subroutine
# named "bar" in the current package

     $dbh->do(" CREATE FUNCTION foo EXTERNAL NAME bar");


# load a single function "foo" from a subroutine named "foo"
# in another package

     $dbh->do(' CREATE FUNCTION foo EXTERNAL NAME "Bar::Baz::foo" ');

# load all the functions in another package

     $dbh->do(' LOAD "Bar::Baz" ');

Functions themselves should follow SQL identifier naming rules. Subroutines loaded with CREATE FUNCTION can have any valid perl subroutine name. Subroutines loaded with LOAD must start with SQL_FUNCTION_ and then the actual function name. For example:

package Qux::Quimble;
sub SQL_FUNCTION_FOO { ... }
sub SQL_FUNCTION_BAR { ... }
sub some_other_perl_subroutine_not_a_function { ... }
1;

# in another package
$dbh->do("LOAD Qux::Quimble");

# This loads FOO and BAR as SQL functions.

Creating User-Defined Functions

User-defined functions (UDFs) are perl subroutines that return values appropriate to the context of the function in a SQL statement. For example the built-in CURRENT_TIME returns a string value and therefore may be used anywhere in a SQL statement that a string value can. Here' the entire perl code for the function:

# CURRENT_TIME
#
# arguments : none
# returns   : string containing current time as hh::mm::ss
#
sub SQL_FUNCTION_CURRENT_TIME {
    sprintf "%02s::%02s::%02s",(localtime)[2,1,0]
}

More complex functions can make use of a number of arguments always passed to functions automatically. Functions always receive these values in @_:

sub FOO {
    my($self,$sth,@params);
}

The first argument, $self, is whatever class the function is defined in, not generally useful unless you have an entire module to support the function.

The second argument, $sth is the active statement handle of the current statement. Like all active statement handles it contains the current database handle in the {Database} attribute so you can have access to the database handle in any function:

sub FOO {
    my($self,$sth,@params);
    my $dbh = $sth->{Database};
    # $dbh->do( ...), etc.
}

In actual practice you probably want to use $sth->{Database} directly rather than making a local copy, so $sth->{Database}->do(...).

The remaining arguments, @params, are arguments passed by users to the function, either directly or with placeholders; another silly example which just returns the results of multiplying the arguments passed to it:

sub MULTIPLY {
    my($self,$sth,@params);
    return $params[0] * $params[1];
}

# first make the function available
#
$dbh->do("CREATE FUNCTION MULTIPLY");

# then multiply col3 in each row times seven
#
my $sth=$dbh->prepare("SELECT col1 FROM tbl1 WHERE col2 = MULTIPLY(col3,7)");
$sth->execute;
#
# or
#
my $sth=$dbh->prepare("SELECT col1 FROM tbl1 WHERE col2 = MULTIPLY(col3,?)");
$sth->execute(7);

Creating In-Memory Tables with functions

A function can return almost anything, as long is it is an appropriate return for the context the function will be used in. In the special case of table-returning functions, the function should return a reference to an array of array references with the first row being the column names and the remaining rows the data. For example:

1. create a function that returns an AoA,

sub Japh {[
    [qw( id word   )],
    [qw( 1 Hacker  )],
    [qw( 2 Perl    )],
    [qw( 3 Another )],
    [qw( 4 Just    )],
]}

2. make your database handle aware of the function

$dbh->do("CREATE FUNCTION 'Japh');

3. Access the data in the AoA from SQL

$sth = $dbh->prepare("SELECT word FROM Japh ORDER BY id DESC");

Or here's an example that does a join on two in-memory tables:

sub Prof  {[ [qw(pid pname)],[qw(1 Sue )],[qw(2 Bob)],[qw(3 Tom )] ]}
sub Class {[ [qw(pid cname)],[qw(1 Chem)],[qw(2 Bio)],[qw(2 Math)] ]}
$dbh->do("CREATE FUNCTION $_) for qw(Prof Class);
$sth = $dbh->prepare("SELECT * FROM Prof NATURAL JOIN Class");

The "Prof" and "Class" functions return tables which can be used like any SQL table.

More complex functions might do something like scrape an RSS feed, or search a file system and put the results in AoA. For example, to search a directory with SQL:

sub Dir {
    my($self,$sth,$dir)=@_;
    opendir D, $dir or die "'$dir':$!";
    my @files = readdir D;
    my $data = [[qw(fileName fileExt)]];
    for (@files) {
        my($fn,$ext) = /^(.*)(\.[^\.]+)$/;
        push @$data, [$fn,$ext];
    }
    return $data;
}
$dbh->do("CREATE FUNCTION Dir");
printf "%s\n", join'   ',@{ $dbh->selectcol_arrayref("
    SELECT fileName FROM Dir('./') WHERE fileExt = '.pl'
")};

Obviously, that function could be expanded with File::Find and/or stat to provide more information and it could be made to accept a list of directories rather than a single directory.

Table-Returning functions are a way to turn *anything* that can be modeled as an AoA into a DBI data source.

Built-in Functions

SQL-92/ODBC Compatibility

All ODBC 3.0 functions are available except for the following:

### SQL-92 / ODBC Functions

# CONVERT / CAST - Complex to implement, but a draft is in the works.
# DIFFERENCE     - Function is not clearly defined in spec and has very limited applications
# EXTRACT        - Contains a FROM keyword and requires rather freeform datetime/interval expression

### ODBC 3.0 Time/Date Functions only

# DAYOFMONTH, DAYOFWEEK, DAYOFYEAR, HOUR, MINUTE, MONTH, MONTHNAME, QUARTER, SECOND, TIMESTAMPDIFF, 
#    WEEK, YEAR - Requires freeform datetime/interval expressions.  In a later release, these could
#                    be implemented with the help of Date::Parse.

ODBC 3.0 functions that are implemented with differences include:

# SOUNDEX  - Returns true/false, instead of a SOUNDEX code
# RAND     - Seed value is a second parameter with a new first parameter for max limit
# LOG      - Returns base X (or 10) log of number, not natural log.  LN is used for natural log, and
#               LOG10 is still available for standards compatibility.
# POSITION - Does not use 'IN' keyword; cannot be fixed as previous versions of SQL::Statement defined
#               the function as such.
# REPLACE / SUBSTITUTE - Uses a regular expression string for the second parameter, replacing the last two
#                           parameters of the typical ODBC function

Aggregate Functions

MIN, MAX, AVG, SUM, COUNT

Aggregate functions are handled elsewhere, see SQL::Parser for documentation.

Date and Time Functions

These functions can be used without parentheses.

CURRENT_DATE aka CURDATE

# purpose   : find current date
# arguments : none
# returns   : string containing current date as yyyy-mm-dd

CURRENT_TIME aka CURTIME

# purpose   : find current time
# arguments : optional seconds precision
# returns   : string containing current time as hh:mm:ss (or ss.sss...)

CURRENT_TIMESTAMP aka NOW

# purpose   : find current date and time
# arguments : optional seconds precision
# returns   : string containing current timestamp as yyyy-mm-dd hh:mm:ss (or ss.sss...)

UNIX_TIMESTAMP

# purpose   : find the current time in UNIX epoch format
# arguments : optional seconds precision (unlike the MySQL version)
# returns   : a (64-bit) number, possibly with decimals

String Functions

ASCII & CHAR

# purpose   : same as ord and chr, respectively (NULL for any NULL args)
# arguments : string or character (or number for CHAR); CHAR can have any amount of numbers for a string

BIT_LENGTH

# purpose   : length of the string in bits
# arguments : string

CHARACTER_LENGTH aka CHAR_LENGTH

# purpose   : find length in characters of a string
# arguments : a string
# returns   : a number - the length of the string in characters

COALESCE aka NVL aka IFNULL

# purpose   : return the first non-NULL value from a list
# arguments : 1 or more expressions
# returns   : the first expression (reading left to right)
#             which is not NULL; returns NULL if all are NULL
#

CONCAT

# purpose   : concatenate 1 or more strings into a single string;
#			an alternative to the '||' operator
# arguments : 1 or more strings
# returns   : the concatenated string
#
# example   : SELECT CONCAT(first_string, 'this string', ' that string')
#              returns "<value-of-first-string>this string that string"
# note      : if any argument evaluates to NULL, the returned value is NULL

CONV

# purpose   : convert a number X from base Y to base Z (from base 2 to 92)
# arguments : X (can by a number or string depending on the base), Y, Z (Z defaults to 10)
# returns   : either a string or number, in base Z
# notes     : 
#    * Supports negative and decimal numbers
#    * Will use big numbers if it has to, so accuracy is at near absolute levels
#    * Letters are case-sensitive after base 36
#    * Base character sets are: (second set is for compatibility with base 64)
#          2 to 62 = 0-9, A-Z, a-z
#         62 to 92 = A-Z, a-z, 0-9, +/_=~|,;:?!@#$%^&*()<>{}[]\`'"

DECODE

# purpose   : compare the first argument against
#             succeding arguments at position 1 + 2N
#             (N = 0 to (# of arguments - 2)/2), and if equal,
#				return the value of the argument at 1 + 2N + 1; if no
#             arguments are equal, the last argument value is returned
# arguments : 4 or more expressions, must be even # of arguments
# returns   : the value of the argument at 1 + 2N + 1 if argument 1 + 2N
#             is equal to argument1; else the last argument value
#
# example   : SELECT DECODE(some_column,
#                    'first value', 'first value matched'
#                    '2nd value', '2nd value matched'
#                    'no value matched'
#                    )

INSERT

# purpose   : string where L characters have been deleted from STR1, beginning at S,
#             and where STR2 has been inserted into STR1, beginning at S.  NULL for any NULL args.
# arguments : STR1, S, L, STR2

HEX & OCT & BIN

# purpose   : convert number X from decimal to hex/octal/binary; equiv. to CONV(X, 10, 16/8/2)
# arguments : X

LEFT & RIGHT

# purpose   : leftmost or rightmost L characters in STR, or NULL for any NULL args
# arguments : STR1, L

LOCATE aka POSITION

# purpose   : starting position (one-based) of the first occurrence of STR1
              within STR2; 0 if it doesn't occur and NULL for any NULL args
# arguments : STR1, STR2, and an optional S (starting position to search)

LOWER & UPPER aka LCASE & UCASE

# purpose   : lower-case or upper-case a string
# arguments : a string
# returns   : the sting lower or upper cased

LTRIM & RTRIM

# purpose   : left/right counterparts for TRIM
# arguments : string

OCTET_LENGTH

# purpose   : length of the string in bytes (not characters)
# arguments : string

REGEX

# purpose   : test if a string matches a perl regular expression
# arguments : a string and a regex to match the string against
# returns   : boolean value of the regex match
#
# example   : ... WHERE REGEX(col3,'/^fun/i') ... matches rows
#             in which col3 starts with "fun", ignoring case

REPEAT

# purpose   : string composed of STR1 repeated C times, or NULL for any NULL args
# arguments : STR1, C

REPLACE aka SUBSTITUTE

# purpose   : perform perl subsitution on input string
# arguments : a string and a substitute pattern string
# returns   : the result of the substitute operation
#
# example   : ... WHERE REPLACE(col3,'s/fun(\w+)nier/$1/ig') ... replaces
#			all instances of /fun(\w+)nier/ in col3 with the string
#			between 'fun' and 'nier'

SOUNDEX

# purpose   : test if two strings have matching soundex codes
# arguments : two strings
# returns   : true if the strings share the same soundex code
#
# example   : ... WHERE SOUNDEX(col3,'fun') ... matches rows
#             in which col3 is a soundex match for "fun"

SPACE

# purpose   : a string of spaces
# arguments : number of spaces

SUBSTRING

SUBSTRING( string FROM start_pos [FOR length] )

Returns the substring starting at start_pos and extending for "length" character or until the end of the string, if no "length" is supplied. Examples:

SUBSTRING( 'foobar' FROM 4 )       # returns "bar"

SUBSTRING( 'foobar' FROM 4 FOR 2)  # returns "ba"

Note: The SUBSTRING function is implemented in SQL::Parser and SQL::Statement and, at the current time, can not be over-ridden.

SUBSTR

# purpose   : same as SUBSTRING, except with comma-delimited params, instead of
              words (NULL for any NULL args)
# arguments : string, start_pos, [length]

TRANSLATE

# purpose   : transliteration; replace a set of characters in a string with another
              set of characters (a la tr///), or NULL for any NULL args
# arguments : string, string to replace, replacement string

TRIM

TRIM ( [ [LEADING|TRAILING|BOTH] ['trim_char'] FROM ] string )

Removes all occurrences of <trim_char> from the front, back, or both sides of a string.

BOTH is the default if neither LEADING nor TRAILING is specified.

Space is the default if no trim_char is specified.

Examples:

TRIM( string )
  trims leading and trailing spaces from string

TRIM( LEADING FROM str )
  trims leading spaces from string

TRIM( 'x' FROM str )
  trims leading and trailing x's from string

Note: The TRIM function is implemented in SQL::Parser and SQL::Statement and, at the current time, can not be over-ridden.

UNHEX

# purpose   : convert each pair of hexadecimal digits to a byte (or a Unicode character)
# arguments : string of hex digits, with an optional encoding name of the data string

Numeric Functions

ABS

# purpose   : find the absolute value of a given numeric expression
# arguments : numeric expression

CEILING (aka CEIL) & FLOOR

# purpose   : rounds up/down to the nearest integer
# arguments : numeric expression

EXP

# purpose   : raise e to the power of a number
# arguments : numeric expression

LOG

# purpose   : base B logarithm of X
# arguments : B, X or just one argument of X for base 10

LN & LOG10

# purpose   : natural logarithm (base e) or base 10 of X
# arguments : numeric expression

MOD

# purpose   : modulus, or remainder, left over from dividing X / Y
# arguments : X, Y

POWER aka POW

# purpose   : X to the power of Y
# arguments : X, Y

RAND

# purpose   : random fractional number greater than or equal to 0 and less than the value of X
# arguments : X (with optional seed value of Y)

ROUND

# purpose   : round X with Y number of decimal digits (precision)
# arguments : X, optional Y defaults to 0

SIGN

# purpose   : returns -1, 0, 1, NULL for negative, 0, positive, NULL values, respectively
# arguments : numeric expression

SQRT

# purpose   : square root of X
# arguments : X

TRUNCATE aka TRUNC

# purpose   : similar to ROUND, but removes the decimal
# arguments : X, optional Y defaults to 0

Trigonometric Functions

All of these functions work exactly like their counterparts in Math::Trig; go there for documentation.

ACOS
ACOSEC
ACOSECH
ACOSH
ACOT
ACOTAN
ACOTANH
ACOTH
ACSC
ACSCH
ASEC
ASECH
ASIN
ASINH
ATAN
ATANH
COS
COSEC
COSECH
COSH
COT
COTAN
COTANH
COTH
CSC
CSCH
SEC
SECH
SIN
SINH
TAN
TANH

Takes a single parameter. All of Math::Trig's aliases are included.

ATAN2

The y,x version of arc tangent.

DEG2DEG
DEG2GRAD
DEG2RAD

Converts out-of-bounds values into its correct range.

GRAD2DEG
GRAD2GRAD
GRAD2RAD
RAD2DEG
RAD2GRAD
RAD2RAD

Like their Math::Trig's counterparts, accepts an optional 2nd boolean parameter (like TRUE) to keep prevent range wrapping.

DEGREES
RADIANS

DEGREES and RADIANS are included for SQL-92 compatibility, and map to RAD2DEG and DEG2RAD, respectively.

PI

PI can be used without parentheses.

System Functions

DBNAME & USERNAME (aka USER)

# purpose   : name of the database / username
# arguments : none

Special Utility Functions

IMPORT

CREATE TABLE foo AS IMPORT(?)    ,{},$external_executed_sth
CREATE TABLE foo AS IMPORT(?)    ,{},$AoA

RUN

Takes the name of a file containing SQL statements and runs the statements; see SQL::Parser for documentation.

Submitting built-in functions

If you make a generally useful UDF, why not submit it to me and have it (and your name) included with the built-in functions? Please follow the format shown in the module including a description of the arguments and return values for the function as well as an example. Send them to the dbi-dev@perl.org mailing list (see http://dbi.perl.org).

Thanks in advance :-).

ACKNOWLEDGEMENTS

Dean Arnold supplied DECODE, COALESCE, REPLACE, many thanks! Brendan Byrd added in the Numeric/Trig/System functions and filled in the SQL92/ODBC gaps for the date/string functions.

AUTHOR & COPYRIGHT

Copyright (c) 2005 by Jeff Zucker: jzuckerATcpan.org Copyright (c) 2009,2010 by Jens Rehsack: rehsackATcpan.org

All rights reserved.

The module may be freely distributed under the same terms as Perl itself using either the "GPL License" or the "Artistic License" as specified in the Perl README file.