NAME

Text::Treesitter::Node - an element of a tree-sitter parse result

SYNOPSIS

Usually accessed indirectly, via Text::Treesitter::Tree.

use Text::Treesitter;

my $ts = Text::Treesitter->new(
   lang_name => "perl",
);

my $tree = $ts->parse_string( $input );

my $root = $tree->root_node;

foreach my $node ( $root->child_nodes ) {
   next if $node->is_extra;
   my $name = $node->is_named ? $node->type : '"' . $node->text . '"';

   printf "Node %s extends from line %d to line %d\n",
      $name,
      ( $node->start_point )[0] + 1,
      ( $node->end_point )[0] + 1;
}

DESCRIPTION

The result of a parse operation is a tree of nodes represented by instances of this class, which are all stored in an instance of Text::Treesitter::Tree. Most of the work of handling the result of a parse operation is done by operating on these tree nodes.

Note that tree-sitter's struct TSNode type is a plain structure that is passed around by value, not a pointer to one. The Perl binding therefore has to create a new object instance every time it wraps one, so two calls that return the same tree node yield distinct Perl objects. You cannot rely on the identity of these objects to keep track of a particular tree node.
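Because object identity is not stable, code that needs to recognise a node it has seen before has to compare something else. The following is a minimal sketch, assuming that byte extent plus type is a good enough key for the use case; same_node and node_key are hypothetical helper names, not part of this module, and the comparison is a heuristic rather than an exact identity test.

```perl
use strict;
use warnings;

# Heuristic comparison (an assumption for illustration): two wrapper objects
# are treated as the same node when they cover the same byte range and
# report the same type.
sub same_node {
   my ( $x, $y ) = @_;
   return $x->start_byte == $y->start_byte
       && $x->end_byte   == $y->end_byte
       && $x->type       eq $y->type;
}

# A string key built from the same three values, suitable for use in a hash
# of already-visited nodes.
sub node_key {
   my ( $node ) = @_;
   return join ":", $node->start_byte, $node->end_byte, $node->type;
}
```

A parent and its only child can share a byte range, which is why the type is included in the comparison.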

METHODS

tree

$tree = $node->tree;

Returns the Text::Treesitter::Tree instance from which this node was obtained.

text

$text = $node->text;

Returns the substring of the tree's stored text that is covered by this node.

type

$type = $node->type;

Returns a string describing the node's type: the name of the grammar rule for named nodes, or the literal input string for anonymous nodes.

start_byte

$pos = $node->start_byte;

Returns the offset into the input string where this node's extent begins.

end_byte

$pos = $node->end_byte;

Returns the offset into the input string just past where this node's extent finishes (i.e. the first byte of the input string that is not part of this node).

start_char

end_char

$pos = $node->start_char;

$pos = $node->end_char;

Return the start and end offsets counted in characters (suitable for use with substr, length, etc.) rather than in bytes.
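The difference between the two kinds of offset only shows up when the text contains non-ASCII characters. The following standalone sketch does not use this module at all; it assumes the parser sees the text in UTF-8 encoded form, and simply shows how a character offset and a byte offset for the same position can diverge.

```perl
use strict;
use warnings;
use Encode qw( encode_utf8 );

# A source string containing one non-ASCII character (U+00E9), which takes
# two bytes when encoded as UTF-8.
my $text = "my \$caf\x{e9} = 1;";

# Character offset of the "=" sign, as index() and substr() count it.
my $char_pos = index( $text, "=" );

# Byte offset of the same position in the UTF-8 encoded form.
my $byte_pos = length encode_utf8( substr( $text, 0, $char_pos ) );

printf "char offset %d, byte offset %d\n", $char_pos, $byte_pos;
```

Every multi-byte character before a position pushes its byte offset further ahead of its character offset, so only the character-based values are safe to pass to substr on a decoded Perl string.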

start_point

( $line, $col ) = $node->start_point;

Returns the position in the input text where this node's extent begins, split into a line and column number (both 0-based; the string is considered to start at position (0, 0)). Note that the column is counted in bytes, not characters.

end_point

( $row, $col ) = $node->end_point;

Returns the position in the input text just past where this node's extent finishes, split into a row (line) and column number (both 0-based).
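Since both points are 0-based, code that reports positions to a user usually adds one to each value. A minimal sketch of such a formatter follows; format_extent is a hypothetical helper name, and the output format is only an illustration.

```perl
use strict;
use warnings;

# Formats a node's extent as "L:C-L:C" using 1-based line and column numbers.
# The end position is the one just past the node, and columns are counted in
# bytes, so they may not match what an editor displays on lines containing
# multi-byte characters.
sub format_extent {
   my ( $node ) = @_;
   my ( $start_row, $start_col ) = $node->start_point;
   my ( $end_row,   $end_col   ) = $node->end_point;
   return sprintf "%d:%d-%d:%d",
      $start_row + 1, $start_col + 1,
      $end_row   + 1, $end_col   + 1;
}
```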

start_row

start_column

end_row

end_column

$row = $node->start_row;
$row = $node->end_row;

$col = $node->start_column;
$col = $node->end_column;

Since version 0.11.

Returns individual fields of the start or end position of the node's extent, all as 0-based indexes.

These are more efficient if you only need the row or column; use "start_point" or "end_point" if you need both.

is_named

$bool = $node->is_named;

Returns true if the node represents a named rule in the grammar.

is_missing

$bool = $node->is_missing;

Returns true if the node was inserted by the parser to recover from certain kinds of syntax error.

is_extra

$bool = $node->is_extra;

Returns true if the node represents something which is not required by the grammar but could appear anywhere (for example, a comment).

has_error

$bool = $node->has_error;

Returns true if the node or any of its descendants represents a syntax error.
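The flag methods above can be combined to locate the problem spots in a parse result. The sketch below is one possible approach, not something provided by this module: find_error_nodes is a hypothetical helper, and it assumes the usual tree-sitter convention that error nodes report the type "ERROR".

```perl
use strict;
use warnings;

# Collects nodes that are either missing or (by assumption) of type "ERROR".
# Children are only visited when has_error is true, so clean subtrees are
# skipped without being walked.
sub find_error_nodes {
   my ( $node ) = @_;

   my @found;
   push @found, $node if $node->is_missing || $node->type eq "ERROR";

   if ( $node->has_error ) {
      push @found, find_error_nodes( $_ ) for $node->child_nodes;
   }
   return @found;
}
```

It would typically be called as find_error_nodes( $tree->root_node ), with the returned nodes then reported using their start and end positions.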

parent

$parent = $node->parent;

Returns the node's immediate parent, that is, the node of which this one is a child. Returns undef on the root node.

child_count

$count = $node->child_count;

Returns the number of child nodes contained by this one.

child_nodes

@nodes = $node->child_nodes;

Returns a list of child nodes. The length of the returned list will be the size given by "child_count".
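Walking a whole tree is a matter of calling child_nodes recursively. As a minimal sketch (outline is a hypothetical helper name), the following builds an indented listing, labelling named nodes by type and anonymous nodes by their quoted text in the same way as the SYNOPSIS example.

```perl
use strict;
use warnings;

# Returns one line per node, indented by two spaces per level of depth.
sub outline {
   my ( $node, $depth ) = @_;
   $depth //= 0;

   my $label = $node->is_named ? $node->type : '"' . $node->text . '"';
   my @lines = ( ( "  " x $depth ) . $label );

   push @lines, outline( $_, $depth + 1 ) for $node->child_nodes;
   return @lines;
}
```

The result could be printed with something like print "$_\n" for outline( $root ).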

field_names_with_child_nodes

@kvlist = $node->field_names_with_child_nodes;

Returns an even-length key/value list containing field names associated with child nodes. The list will be twice as long as the size given by "child_count" and consist of pairs. In each pair, the first value is either a field name or undef if the node has no field name, and the second is the child node itself.

On Perl version 5.36 or above, the multi-variable foreach list syntax may be useful to handle these:

foreach my ($name, $child) ($node->field_names_with_child_nodes) {
   ...
}

On earlier versions, the List::Util pair functions such as pairs might be used instead:

use List::Util 'pairs';

foreach (pairs $node->field_names_with_child_nodes) {
   my ($name, $child) = @$_;
   ...
}
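In tree-sitter grammars a field name may apply to more than one child, so assigning the list directly to a hash keeps only one child per name. The following sketch shows one way to keep them all; group_children_by_field is a hypothetical helper, and skipping children with no field name is a choice made for this illustration.

```perl
use strict;
use warnings;
use List::Util qw( pairs );

# Takes the key/value list and returns a hash reference mapping each field
# name to an array reference of the children carrying that name, in order.
sub group_children_by_field {
   my @kvlist = @_;

   my %by_field;
   foreach my $pair ( pairs @kvlist ) {
      my ( $name, $child ) = @$pair;
      next unless defined $name;    # skip children without a field name
      push @{ $by_field{$name} }, $child;
   }
   return \%by_field;
}
```

It would be called as group_children_by_field( $node->field_names_with_child_nodes ).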

child_by_field_name

$child = $node->child_by_field_name( $field_name );

Since version 0.07.

Returns the child node associated with the given field name. This would be the same as the value found by

my %children = $node->field_names_with_child_nodes;
$child = $children{ $field_name };

If the node does not have a child with the given field name, an exception is thrown.

try_child_by_field_name

$child = $node->try_child_by_field_name( $field_name );

Since version 0.07.

Similar to "child_by_field_name" but returns undef if there is no such child rather than throwing an exception.
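The non-throwing form is convenient when a field is optional in the grammar. A minimal sketch, in which field_text is a hypothetical helper and the field name passed in depends entirely on the grammar in use:

```perl
use strict;
use warnings;

# Returns the text of the child with the given field name, or the supplied
# default when the node has no such child.
sub field_text {
   my ( $node, $field_name, $default ) = @_;

   my $child = $node->try_child_by_field_name( $field_name );
   return defined $child ? $child->text : $default;
}
```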

debug_sprintf

$str = $node->debug_sprintf();

Returns a debugging text string that represents the node and all its child nodes, in a format similar to tree-sitter's usual S-expr notation.

Basic named nodes are printed with their type name in parens: (type). Anonymous nodes have their text string in quotes: "text". Child nodes of named nodes are included within the parens of the type name. Field names are printed as prefixes with a colon.

(node)

(node (children) (go) "here")

(node left: (node) right: (node))

TODO

The following C library functions are currently unhandled:

ts_node_child_by_field_id
ts_node_next_sibling
ts_node_prev_sibling
ts_node_next_named_sibling
ts_node_prev_named_sibling
ts_node_first_child_for_byte
ts_node_first_named_child_for_byte
ts_node_descendant_for_byte_range
ts_node_descendant_for_point_range
ts_node_named_descendant_for_byte_range
ts_node_named_descendant_for_point_range
ts_node_edit
ts_node_eq

AUTHOR

Paul Evans <leonerd@leonerd.org.uk>