NAME

Chemistry::Smiles - SMILES parser

SYNOPSYS

#!/usr/bin/perl
use Chemistry::Smiles;

my $s = 'C1C[13C]1(=O)[O-]';

# Default use - Requires Chemistry::Mol
my $default_parser = new Chemistry::Smiles;
my $mol = $default_parser->parse($s, new Chemistry::Mol);
print $mol->print;

# Callback use
my $i = 0;
my $callback_parser = new Chemistry::Smiles(
    add_atom => sub {print "ATOM(@_)\n"; ++$i},
    add_bond => sub {print "BOND(@_)\n"}
);
$callback_parser->parse($s, 'mol');

DESCRIPTION

This object-oriented module parses a SMILES (Simplified Molecular Input Line Entry Specification) string. It can either return the molecule as a Chemistry::Mol object or be used via callback functions.

METHODS

Chemistry::Smiles->new([add_atom => \&sub1, add_bond => \&sub2])

Create a SMILES parser. If the add_atom and add_bond subroutine references are given, they will be called whenever an atom or a bond needs to be added to the molecule. If they are not specified, default methods, which create a Chemistry::Mol object, will be used.

$obj->parse($string, $mol)

Parse a Smiles $string. $mol is a "molecule state object". It can be anything; the parser doesn't do anything with it except sending it as the first parameter to the callback functions. If callback functions were not provided when constructing the parser object, $mol must be a Chemistry::Mol object, because that's what the default callback functions require.

CALLBACK FUNCTIONS

$atom = add_atom($mol, $iso, $sym, $chir, $hcount, $chg)

Called by the parser whenever an atom is found. The first parameter is the state object given to $obj->parse(). The other parameters are the isotope, symbol, chirality, hydrogen count, and charge of the atom. Only the symbol is guaranteed to be defined. Mnemonic: the parameters are given in the same order that is used in a SMILES string (such as [18OH-]). This callback is expected to return something that uniquely identifies the atom that was created (it might be a number, a string, or an object).

add_bond($mol, $type, $a1, $a2)

Called by the parser whenever an bond needs to be created. The first parameter is the state object given to $obj->parse(). The other parameters are the bond type and the two atoms that need to be bonded. The atoms are identified using the return values from the add_atom() callback.

SEE ALSO

The SMILES Home Page at http://www.daylight.com/dayhtml/smiles/ The Daylight Theory Manual at http://www.daylight.com/dayhtml/doc/theory/theory.smiles.html

BUGS

The SMILES specification is not fully implemented yet. For example, branches that start before an atom (such as (OC)C, which should be equivalent to C(CO) and COC).

AUTHOR

Ivan Tubert <itub@cpan.org>

COPYRIGHT

Copyright (c) 2004 Ivan Tubert. All rights reserved. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.