NAME

txdrows - split a section of sets out of a single data file

SYNOPSIS

pipe | txdrows -b=3 -e=10 | pipe

DESCRIPTION

pipe | txdrows 'near([1], 3.1)' | pipe

This program extracts rows (data sets, sections, records, ...) from a numerical text file. The selection is either a configured range of lines, given via begin/end indices or a decimation factor, or the set of rows matching an expression given on the command line. The last example employs such an expression to match the first row that has a value near 3.1 in the first column. You can specify a third argument to near() to change the default allowed deviation. If you deal with integer values, using

pipe | txdrows '[1] == 3' | pipe

is fine, too, for selecting value 3.
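To illustrate what such a match expression selects, here is a minimal Python sketch that mimics the idea outside of txdrows. The function names near and first_match and the tolerance of 0.05 are assumptions made for this example only; they do not reproduce the program's own near() implementation or its default deviation.

```python
def near(value, target, eps=0.05):
    """Return True if value deviates from target by at most eps.

    eps is an assumed tolerance for this sketch, not the txdrows default.
    """
    return abs(value - target) <= eps


def first_match(rows, column, target, eps=0.05):
    """Return the first row whose 1-based column is near target, or None."""
    for row in rows:
        if near(row[column - 1], target, eps):
            return row
    return None


rows = [
    [1.0, 10.0],
    [2.0, 20.0],
    [3.12, 30.0],
    [3.09, 40.0],
]

# Counterpart of 'near([1], 3.1)': only the first matching row is reported.
print(first_match(rows, 1, 3.1))

# Counterpart of '[1] == 3': exact comparison is safe for integer columns.
exact = [row for row in [[3, 1.5], [4, 2.5]] if row[0] == 3]
print(exact)
```

Exact comparison on floating point columns can miss values that differ only by rounding, which is the reason a tolerance-based test is preferable there.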

PARAMETERS

These are the general rules for specifying parameters to this program:

txdrows -s -xyz -s=value --long --long=value [--] [files/stuff]

You can mention the options to change parameters in any order or even multiple times. They are processed in the order given, later operations overriding/extending earlier settings. Using the separator "--" stops option parsing. Mentioning only a short/long name (no "=value") sets the parameter to 1, which is true in the logical sense. Prepending + instead of the usual - negates this, setting the value to 0 (false). Specifying "-s" and "--long" is the same as "-s=1" and "--long=1", while "+s" and "++long" is the same as "-s=0" and "--long=0".
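The rules above can be sketched in a few lines of Python. This is a simplified model, not the parser txdrows actually uses: the function name parse_options is made up for this example, combined short options such as "-xyz" and the additional operators are not modeled, and ending option parsing at the first non-option argument is an assumption.

```python
def parse_options(args):
    """Parse -s, -s=value, --long, --long=value, +s and ++long.

    Returns (settings, remaining_arguments). Values are kept as strings,
    so "-s" and "-s=1" produce the same setting.
    """
    settings = {}
    for index, arg in enumerate(args):
        if arg == "--":
            # The separator stops option parsing; the rest is passed through.
            return settings, list(args[index + 1:])
        if arg[:2] in ("--", "++"):
            sign, body = arg[0], arg[2:]
        elif arg[:1] in ("-", "+"):
            sign, body = arg[0], arg[1:]
        else:
            # Assumption: the first non-option argument ends option parsing.
            return settings, list(args[index:])
        if "=" in body:
            name, value = body.split("=", 1)
        else:
            # A bare name means true (1); the + prefix negates it to false (0).
            name = body
            value = "1" if sign == "-" else "0"
        # Later mentions override earlier ones.
        settings[name] = value
    return settings, []


settings, rest = parse_options(
    ["-b=3", "--verbose", "+S", "++text", "-b=5", "--", "-e=10", "file.dat"]
)
print(settings)
print(rest)
```

In this model, "-e=10" after the separator is left untouched and ends up among the remaining arguments.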

Operators other than "=" are available as well, notably ".=", "+=", "-=", "*=" and "/=", for concatenation / appending array/hash elements and for scalar arithmetic operations on the value. Arrays are appended to via "array.=element", hash elements are set via "hash.=name=value". You can also set several array/hash elements at once by specifying a separator after the long parameter name, like this for comma separation:

--array/,/=1,2,3  --hash/,/=name=val,name2=val2
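As an illustration of how the separator form breaks down into a name, a separator and the elements, here is a small Python sketch. The function name parse_separated, the regular expression and the is_hash switch are assumptions made for this example; the real parser knows the parameter type from its definition and may treat edge cases differently.

```python
import re

# Long name, then /separator/, then "=", then the element list.
SEPARATOR_FORM = re.compile(r"^--(?P<name>[^/=]+)/(?P<sep>[^/]+)/=(?P<values>.*)$")


def parse_separated(option, is_hash=False):
    """Split an option like '--array/,/=1,2,3' into (name, elements)."""
    match = SEPARATOR_FORM.match(option)
    if match is None:
        raise ValueError("not in --name/separator/=values form: " + option)
    parts = match.group("values").split(match.group("sep"))
    if is_hash:
        # Each element is itself a name=value pair.
        return match.group("name"), dict(part.split("=", 1) for part in parts)
    return match.group("name"), parts


print(parse_separated("--array/,/=1,2,3"))
print(parse_separated("--hash/,/=name=val,name2=val2", is_hash=True))
```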

The available parameters are these, default values (in Perl-compatible syntax) at the time of generating this document following the long/short names:

begin, b (scalar)
1

begin of section (above all other criteria)

black (scalar)
0

ignore whitespace at beginning and end of line (disables strict mode) (from Text::NumericData)

comchar (scalar)
undef

comment character (if not set, deduce from data or use #) (from Text::NumericData)

comregex (scalar)
'[#%]*[^\\S\\015\\012]*'

regex for matching comments (from Text::NumericData)

config, I (array)
[]

Which configfile(s) to use (overriding automatic search in likely paths); special: just -I or --config causes printing a current config file to STDOUT

empty (scalar)
0

treat empty lines as empty data sets, preserving them in output (from Text::NumericData)

end, e (scalar)
-1

end of section (when negative: until end; above all other criteria)

fill (scalar)
undef

fill value for undefined data (from Text::NumericData)

help, h (scalar)
0

Show the help message. Value 1..9: help level, par: help for parameter par (long name) only.

Additional fun with negative values, optionally followed by comma-separated list of parameter names: -1: list par names, -2: list one line per name, -3: -2 without builtins, -10: dump values (Perl style), -11: dump values (lines), -100: print POD.

justmatch, j (scalar)
1

if an expression to match is given, select what to print out: 0 means all matches including header, >0 means just the first n matches, <0 means all matches, but no header

lineend (scalar)
undef

line ending to use: (DOS, MAC, UNIX or be explicit if you can, taken from data if undefined, finally resorting to UNIX) (from Text::NumericData)

numformat, N (array)
[]

printf formats to use (if there is no "%" present at all, one will be prepended) (from Text::NumericData)

numregex (scalar)
'[\\+\\-]?\\d*\\.?\\d*[eE]?\\+?\\-?\\d*'

regex for matching numbers (from Text::NumericData)

outsep (scalar)
undef

use this separator for output (leave undefined to use input separator, fallback to TAB) (from Text::NumericData)

quote (scalar)
undef

quote titles (from Text::NumericData)

quotechar (scalar)
undef

quote character to use (derived from input or ") (from Text::NumericData)

ranges, R (array)
[]

give multiple ranges (format: "from:to", either may be omitted or set to negative to extend to beginning/end) inside the main range given by begin and end indices, this does not duplicate or rearrange data, just specifies an additional condition to include a record or not, in input order

reduce, r (scalar)
'1'

Reduce row count by a certain factor: Only include every ...th one. A value of 2 means rows 1,3,5... , a value of 10 means rows 1,11,21... (from the input).

separator (scalar)
undef

use this separator for input (otherwise deduce from data; TAB is another way to say "tabulator", fallback is ) (from Text::NumericData)

strict, S (scalar)
0

strictly split data lines at configured separator (otherwise more fuzzy logic is involved) (from Text::NumericData)

text, T (scalar)
1

allow text as data (not first column) (from Text::NumericData)

verbose, v (scalar)
0

be verbose about things

version (scalar)
0

print out the program version
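The row selection parameters begin, end, ranges and reduce act as conditions that a row has to satisfy together. The following Python sketch models that reading of the descriptions above. It is not the program's implementation: the function names are made up for this example, and two points are assumptions, namely that range indices refer to input row numbers and that reduce counts input rows starting at row 1.

```python
def parse_range(text):
    """Turn 'from:to' into (first, last); omitted or negative parts become None."""
    first_text, last_text = text.split(":", 1)
    first = int(first_text) if first_text else -1
    last = int(last_text) if last_text else -1
    return (first if first >= 0 else None, last if last >= 0 else None)


def in_range(number, bounds):
    """Check a 1-based row number against (first, last), None meaning open."""
    first, last = bounds
    return (first is None or number >= first) and (last is None or number <= last)


def select_rows(rows, begin=1, end=-1, ranges=(), reduce=1):
    """Keep rows that pass the begin/end window, any given range, and the reduce factor."""
    selected = []
    for number, row in enumerate(rows, start=1):
        if number < begin or (end >= 0 and number > end):
            continue  # outside the main section
        if ranges and not any(in_range(number, bounds) for bounds in ranges):
            continue  # ranges only add a condition, they never reorder rows
        if (number - 1) % reduce != 0:
            continue  # assumption: every reduce-th input row, starting at row 1
        selected.append(row)
    return selected


rows = list(range(1, 22))  # stand-in data: row n simply holds the value n

print(select_rows(rows, reduce=10))
print(select_rows(rows, begin=3, end=10))
print(select_rows(rows, begin=3, end=10,
                  ranges=[parse_range("4:5"), parse_range("9:")]))
```

With 21 input rows, a reduce factor of 10 keeps rows 1, 11 and 21, matching the description of the reduce parameter.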

AUTHOR

Thomas Orgis <thomas@orgis.org>

LICENSE AND COPYRIGHT

Copyright (c) 2005-2023 Thomas Orgis, Free Software licensed under the same terms as Perl 5.10