NAME
subst - Greple module for text search and substitution
VERSION
Version 2.21
SYNOPSIS
greple -Msubst --dict dictionary [ options ]
--dict dictionary file
--dictdata dictionary data
--check=[ng,ok,any,outstand,all,none]
--select=N
--linefold
--stat
--with-stat
--stat-style=[default,dict]
--diff
--diffcmd command
--create
--replace
--overwrite
--[no-]warn-overlap
--[no-]warn-include
DESCRIPTION
This greple module supports check and substitution of text files based on dictionary data.
Dictionary file is given by --dict option and each line contains pattern and expected string pairs.
greple -Msubst --dict DICT
If the dictionary file contains following data:
colou?r color
cent(er|re) center
above command find the first pattern which does not match the second string, that is "colour" and "centre" in this case.
Field "//" in dictionary data is ignored, so this file can be written like this:
colou?r // color
cent(er|re) // center
You can use same file by greple's -f option and string after "//" is ignored as a comment in that case.
greple -f DICT ...
Option --dictdata can be used to provide dictionary data in command line.
greple --dictdata $'colou?r color\ncent(er|re) center\n'
Overlapped pattern
When the matched string is same or shorter than previously matched string by another pattern, it is simply ignored (--no-warn-include by default). So, if you have to declare conflicted patterns, put the longer pattern in front.
If the matched string overlaps with previously matched string, it is warned (--warn-overlap by default) and ignored.
Terminal color
This version uses Getopt::EX::termcolor module. It sets option --light-screen or --dark-screen depending on the terminal on which the command run, or TERM_BGCOLOR environment variable.
Some terminals (eg: "Apple_Terminal" or "iTerm") are detected automatically and no action is required. Otherwise set TERM_BGCOLOR environment to #000000 (black) to #FFFFFF (white) digit depending on terminal background color.
OPTIONS
- --check=outstand|ng|ok|any|all|none
-
Option --check takes argument from ng, ok, any, outstand, all and none.
With default value outstand, command will show information about both expected and unexpected words only when unexpected word was found in the same file.
With value ng, command will show information about unexpected words. With value ok, you will get information about expected words. Both with value any.
Value all and none make sense only when used with --stat option, and display information about never matched pattern.
- --select=N
-
Select Nth entry from the dictionary. Argument is interpreted by Getopt::EX::Numbers module. Range can be defined like --select=1:3,7:9. You can get numbers by --stat option.
- --linefold
-
If the target data is folded in the middle of text, use --linefold option. It creates regex patterns which matches string spread across lines. Substituted text does not include newline, though. Because it confuses regex behavior somewhat, avoid to use if possible.
- --stat
- --with-stat
-
Print statistical information. Works with --check option.
Option --with-stat print statistics after normal output, while --stat print only statistics.
- --stat-style=[default|dict]
-
Using --stat-style=dict option with --stat and --check=any, you can get dictionary style output for your working document.
- --subst
-
Substitute unexpected matched pattern to expected string. Newline character in the matched string is ignored. Pattern without replacement string is not changed.
- --diff
- --diffcmd=command
-
Option -diff produce diff output of original and converted text.
Specify diff command name used by --diff option. Default is "diff -u".
- --create
-
Create new file and write the result. Suffix ".new" is appended to original filename.
- --replace
-
Replace the target file by converted result. Original file is renamed to backup name with ".bak" suffix.
- --overwrite
-
Overwrite the target file by converted result with no backup.
- --[no-]warn-overlap
-
Warn overlapped pattern. Default on.
- --[no-]warn-include
-
Warn included pattern. Default off.
DICTIONARY
This module includes example dictionaries. They are installed share directory and accessed by --exdict option.
greple -Msubst --exdict jtca-katakana-guide-3.dict
- --exdict dictionary
-
Use dictionary flie in the distribution as a dictionary file.
- --exdictdir
-
Show dictionary directory.
- --exdict jtca-katakana-guide-3.dict
- --jtca-katakana-guide
-
Created from following guideline document.
外来語(カタカナ)表記ガイドライン 第3版 制定:2015年8月 発行:2015年9月 一般財団法人テクニカルコミュニケーター協会 Japan Technical Communicators Association https://www.jtca.org/standardization/katakana_guide_3_20171222.pdf
- --exdict jtf-style-guide-3.dict
- --jtf-style-guide
-
Created from following guideline document.
JTF日本語標準スタイルガイド(翻訳用) 第3.0版 2019年8月20日 一般社団法人 日本翻訳連盟(JTF) 翻訳品質委員会 https://www.jtf.jp/jp/style_guide/pdf/jtf_style_guide.pdf
- --exdict sccc2.dict
- --sccc2
-
Dictionary used for "C/C++ セキュアコーディング 第2版" published in 2014.
https://www.jpcert.or.jp/securecoding_book_2nd.html
- --exdict microsoft.dict
- --microsoft
-
Dictionary generated from Microsoft localization style guide.
https://www.microsoft.com/ja-jp/language/styleguides
Data is generated from this article:
https://www.atmarkit.co.jp/news/200807/25/microsoft.html
INSTALL
CPANMINUS
$ cpanm App::Greple::subst
or
$ curl -sL http://cpanmin.us | perl - App::Greple::subst
SEE ALSO
https://github.com/kaz-utashiro/greple
https://github.com/kaz-utashiro/greple-subst
https://www.jtca.org/standardization/katakana_guide_3_20171222.pdf
https://www.jtf.jp/jp/style_guide/styleguide_top.html, https://www.jtf.jp/jp/style_guide/pdf/jtf_style_guide.pdf
https://www.microsoft.com/ja-jp/language/styleguides, https://www.atmarkit.co.jp/news/200807/25/microsoft.html
https://qiita.com/kaz-utashiro/items/85add653a71a7e01c415
AUTHOR
Kazumasa Utashiro
LICENSE
Copyright 2017-2021 Kazumasa Utashiro.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.