NAME
ebook - create and manipulate e-books from the command line
SYNOPSIS
ebook COMMAND arg1 arg2 --opt1 --opt2
See also "EXAMPLES".
COMMANDS
adddoc
Adds a documents to both the book manifest and spine.
Options
--opffile
--opf
-
The OPF file to modify. If not specified one will be searched for in the current directory.
--identifier
--id
-
The ID attribute to use for the added manifest item. This is required, and ebook will abort if it is not specified.
--mimetype
--mtype
-
The mime type string to use for the added manifest item. If not specified, it will be autodetected via File::Mimeinfo::Magic. This may not result in an optimal string.
Example
ebook adddoc --opf mybook.opf --id 'text-ch1' chapter1.html
additem
Add an item to the book manifest, but not the spine.
Note that the "fix" command will automatically insert manifest items for any local files referenced by existing manifest items.
Options
--opffile
--opf
-
The OPF file to modify. If not specified one will be searched for in the current directory.
--identifier
--id
-
The ID attribute to use for the added manifest item. This is required, and ebook will abort if it is not specified.
--mimetype
--mtype
-
The mime type string to use for the added manifest item. If not specified, it will be autodetected via File::Mimeinfo::Magic. This may not result in an optimal string.
Example
ebook additem --opf mybook.opf --id 'illus-ch1' chapter1-illus.jpg
bisac
Search for a BISAC code matching a case-insensitive regular expression.
Options
regexp
-
The first argument is taken as a regular expression to use for the search. If this is either '.' or not specified, the entire list of valid codes is printed.
This requires that the BISAC codes be downloaded ahead of time. (See
dlbisac
.)
blank
Create a blank e-book structure.
Options
--opffile filename.opf
--opf filename.opf
-
Use the specified OPF file. This can also be specified as the first non-option argument, which will override this option if it exists. If no file is specified, the program will abort with an error. =item
--author
"Author Name"The author of the book. If not specified, defaults to "Unknown Author".
--title
"Title Name"-
The title of the book. If not specified, defaults to "Unknown Title".
--dir directory
-d directory
-
Output the OPF file in this directory, creating it if necessary.
Example
ebook blank newfile.opf --author "Me Myself" --title "New File"
ebook blank --opffile newfile.opf --author "Me Myself" --title "New File"
Both of those commands have the same effect.
config
Make changes to the EBook::Tools configuration file.
The configuration file itself is located as either $ENV{HOME}/.ebooktools/config.ini
or as $ENV{USERPROFILE}\Application Data\EBook-Tools
, depending on platform and which directory is found first. See "userconfigdir()" in EBook::Tools for details.
Arguments / Subcommands
Configuration is always handled in the format of:
ebook config subcommand value
default
Replace any existing configuration file with a default template. This creates the file if it does not exist. This should be done once before any other configuration manipulation is done, unless a configuration file has been manually created ahead of time.
debug
Sets the default debugging level when no verbosity is specified. Note that verbosity can only be increased, not decreased, with the
-v
option.tidysafety
Sets the default safety level when tidy is used. Valid values are from 0-4. See "unpack" for details on what each value means.
mobipids
A comma-separated list of Mobipocket PIDs to try to use to decrypt e-books. This value is only used if the appropriate plug-in modules or helper applications are available, as DRM is not supported natively by EBook::Tools. Note that if the PID includes a $ character, the entire PID string has to be enclosed in single quotes.
Examples
ebook config default
ebook config debug 2
ebook config mobipids '1234567890,2345678$90'
convert
Unpacks the ebook specified as the first argument, runs standard fixes on the contents, and repacks it into a new format in the output file specified as the second argument. Currently the only supported output format is epub, which is the format you will get irrespective of the extension you actually give the output file.
Options
All options from unpack
and fix
are technically valid here as well, though of course some options are nonsensical in this context and will likely break the conversion (e.g. --nosave).
Example
ebook convert MyBook.prc MyBook.epub
ebook convert --name MyBook.lit /home/myname/MyBook.epub
dlbisac
Downloads and caches the Book Industry Study Group BISAC codes into a local database. This will destroy the existing contents of that table if this has been done previously.
downconvert
dc
If the appropriate helpers or plugins are available, write a copy of the input file with the DRM restrictions removed.
NOTE: no actual DRM-removal code is present in this package. This is just presents a unified interface to other programs that have that capability.
Arguments
infile
The first non-option argument is taken to be the input file. If not specified, the program exits with an error.
outfile
The second non-option argument is taken to be the output file. If not specified, the program will use a name based on the input file, appending '-nodrm' to the basename and keeping the extension. In the special case of Mobipocket files ending in '-sm', the '-sm' portion of the basename is simply removed, and nothing else is appended.
key
The third non-option argument is taken to be either the decryption key/PID, or in the case of Microsoft Reader (.lit) files, the
keys.txt
file containing the decryption keys.If not specified, this will be looked up from the configuration file. Convertlit keyfiles will be looked for in standard locations. If no key is found, the command aborts and exits with an error.
Example
ebook downconvert NewBook.lit NewBook-readable.lit mykeys.txt
ebook dc MyBook-sm.prc
fix
Find and fix problems with an e-book, including enforcing a standard specification and ensuring that all linked objects are present in the manifest.
Options
--opffile filename.opf
--opf filename.opf
-
Use the specified OPF file. This can also be specified as the first non-option argument, which will override this option if it exists. If no file is specified, one will be searched for.
--oeb12
-
Force the OPF to conform to the OEB 1.2 standard. This is the default.
--opf20
-
Force the OPF to conform to the OPF 2.0 standard. If both this and
--oeb12
are specified, the program will abort with an error. --mobi
-
Correct Mobipocket-specific elements, creating an output element to force UTF-8 output if one does not yet exist.
--erotic
or--sex
-
Enable special handling for erotic fiction (most notably special subject normalization rules).
--names
-
Normalize names to standard capitalization and format (primary name display is "First Middle Last", but file-as is "Last, First Middle".
This is not done by default as it can damage unusual but correct names.
genepub
Generate a .epub book from existing OPF data.
Options
--input filename.opf
--i filename.opf
--opffile filename.opf
--opf filename.opf
-
Use the specified OPF file. If no file is specified, one will be searched for.
--output bookname.epub
-o bookname.epub
-
Use the specified name for the final output file. This can also be specified as the first non-option argument, which will override this option if it exists. If not specified, the book will have the same filename as the OPF file, with the extension changed to
.epub
. --dir directory
-d directory
-
Output the final .epub book into the specified directory. The default is to use the current working directory.
Example
ebook genepub mybook.opf -o my_special_book.epub -d ../epubbooks
or in the simplest case:
ebook genepub
genimp
Generate a eBookwise .imp book from a .RES directory
Options
--input DIRNAME.RES
-i DIRNAME.RES
-
Specifies the resource directory to use for input. A valid resource directory will contain at least a
RSRC.INF
file, aDATA.FRK
file, and several other files with four-capital-letter filenames.This can also be specified as the first non-option argument, which will override this option if it exists. If not specified, the current directory will be used.
--output bookname.epub
-o bookname.epub
-
Use the specified name for the final output file. If not specified, the book will have the same filename as the input, with the extension changed to
.imp
.
Examples
ebook genimp MyUnpackedBook.RES MyBook.imp
ebook genimp --resdir ../MyUnpackedBook.RES -f imp/MyBook.imp
genmobi
Generate a Mobipocket .mobi/.prc book from OPF, HTML, or ePub input.
Options
--input filename
--i filename
-
Use the specified file for input. Valid formats are OPF, HTML, and ePub. This can also be specified as the first non-option argument, which will override this option if it exists. If no file is specified, an OPF file in the current directory will be searched for.
--output bookname.prc
-o bookname.prc
-
Use the specified name for the final output file. If not specified, the book will have the same filename as the input file, with the extension changed to
.mobi
(this file is always created bymobigen
, specifying a different filename only causes it to be renamed afterwards).This can also be specified as the second non-option argument, which will override this option if it exists.
--dir directory
-d directory
-
Output the final book into the specified directory. The default is to use the current working directory, which is where
mobigen
will always place it initially; if specified this only forces the file to be moved after generation. --compression x
-c x
-
Use the specified compression level
x
, where 0 is no compression, 1 is PalmDoc compression, and 2 is HUFF/CDIC compression. If not specified, defaults to 1 (PalmDoc compression).
Example
ebook genmobi mybook.opf -o my_special_book.prc -d ../mobibooks
ebook genmobi mybook.html mybook.prc -c2
or in the simplest case:
ebook genmobi
genncx
Given an OPF file, creates a NCX-format table of contents from the package unique-identifier, the dc:title, dc:creator, and spine elements, and then add the NCX entry to the manifest if it is not already referenced.
The OPF file will be cleaned to OPF20 format before this happens.
Options
--opffile filename.opf
--opf filename.opf
-
Use the specified OPF file. This can also be specified as the first non-option argument, which will override this option if it exists. If no file is specified, one will be searched for.
impmeta
Set specific metadata values in an ETI .imp file.
Options
--input filename.imp
-i filename.imp
Specify the input filename. This can also be specified as the first argument, in which case the -i option will be ignored.
--output modified.imp
-o modified.imp
Specify the output filename. If not specified, the input file will be overwritten.
--identifier
Specify the identifier metadata.
--category
--cat
Specify the category metadata.
--subcategory
--subcat
Specify the subcategory metadata.
--title
Specify the title metadata.
--lastname
Specify the author last name metadata.
--middlename
Specify the author middle name metadata.
--firstname
Specify the author first name metadata. Note that IMP files commonly place the full name in this component, and leave the middlename and lastname entries blank.
Examples
ebook impmeta mybook.imp --title 'Fixed Title' --lastname 'John Q. Brandy'
ebook impmeta -i mybook.imp -o fixed.imp --title 'Fixed Title'
setcover
Sets the cover image
Takes as a single argument the href of the file to use.
Options
--opffile
--opf
Specifies the OPF file to modify. If not specified, the script will attempt to find one
--identifier
--id
Specifies the ID to assign to the associated manifest item.
setmeta
Set specific metadata values on an OPF file, creating a new entry if none exists.
Both the element to set and the value are specified as additional arguments, not as options.
The elements that can be set are currently:
- date
- description
- publisher
- rights
- series
- subject
- title
- type
The 'series' values can take an extra argument containing the series index position.
The 'subject' elements can be added multiple times (including in a single command-line, though --id will only set the ID on the first one specified). Other entries will be overwritten.
The element argument can be shortened to the minimum number of letters necessary to uniquely identify it.
Options
--opffile
=item *--opf
Specifies the OPF file to modify. If not specified, the script will attempt to find one in the current directory.
--delete
Allows the deletion of subject and series metadata. Has no effect on other elements.
--fileas
Specifies the 'file-as' attribute when setting an author. Has no effect on other elements.
--identifier
--id
Specifies the ID to assign to the element.
Examples
ebook setmeta series 'Some Other Series' 03
ebook setmeta title 'My Great Title'
ebook setmeta t 'My Great Title'
ebook --opf newfile.opf setmeta author 'John Smith' --fileas 'Smith, John' --id mainauthor
splitmeta
Split the <metadata>...</metadata> block out of a pseudo-HTML file that contains one.
splitpre
Split <pre>...</pre> blocks out of an existing HTML file, wrapping each one found into a separate HTML file.
The first non-option argument is taken to be the input file. The second non-option argument is taken to be the basename of the output files.
stripscript
Strips <script>...</script> blocks out of a HTML file.
The first non-option argument is taken to be the input file. The second non-option argument is taken to be the output file. If the latter is not specified, the input file will be overwritten.
Options
--noscript
Strips <noscript>...</noscript> blocks as well.
tidyxhtml
Run tidy on a HTML file to enforce valid XHTML output (required by the OPF 2.0 specification).
tidyxml
Run tidy an a XML file (for neatness).
unpack_ebook
Unpacks an ebook into its component parts, creating an OPF for them if necessary.
Options
--input
=item-i
-
The filename of the ebook to unpack. This can also be specified as the first non-option argument, in which case it will override the option if it exists.
--dir
=item-d
-
The directory to unpack into, which will be created if it does not exist, defaulting to the filename with the extension removed. This can also be specified as the second non-option argument, in which case it will override the option if it exists.
--format
-
The unpacking routines should autodetect the type of book under normal conditions. If autodetection fails, a format can be forced here. See EBook::Tools::Unpack for a list of available formats.
--htmlconvert
-
Attempt to convert the extracted text to HTML. This is obviously only of value if the format doesn't use HTML normally.
--raw
-
This causes a lot of raw, unparsed, unmodified data to be dumped into the directory along with everything else. It's useful for debugging exactly what was in the file being unpacked, but not for much else.
-
Set the primary author of the unpacked e-book, overriding what is detected. Not all e-book formats contain author metadata, and if none is found and this is not specified the primary author will be set to 'Unknown Author'.
--title
-
Set the title of the unpacked e-book, overriding what is detected. A title will always be detected in some form from the e-book, but the exact text can be overridden here.
--opffile
--opf
-
The filename of the OPF metadata file that will be generated. If not specified, defaults to
content.opf
. --tidy
-
Run tidy on any HTML output files to convert them to valid XHTML. Be warned that this can occasionally change the formatting, as Tidy isn't very forgiving on certain common tricks (such as empty <pre> elements with style elements) that abuse the standard.
--tidycmd
-
The tidy executable name. This has to be a fully qualified pathname if tidy isn't on the path. Defaults to 'tidy'.
--tidysafety
-
The safety level to use when running tidy (default is 1). Potential values are:
$tidysafety < 1
:-
No checks performed, no error files kept, works like a clean tidy -m
This setting is DANGEROUS!
$tidysafety == 1
:-
Overwrites original file if there were no errors, but even if there were warnings. Keeps a log of errors, but not warnings.
$tidysafety == 2
:-
Overwrites original file if there were no errors, but even if there were warnings. Keeps a log of both errors and warnings.
$tidysafety == 3
:-
Overwrites original file only if there were no errors or warnings. Keeps a log of both errors and warnings.
$tidysafety
= 4>:-
Never overwrites original file. Keeps a log of both errors and warnings.
Examples
ebook unpack mybook.pdb My_Book --author "By Me"
ebook unpack -i mybook.pdb -d My_Book --author "By Me"
Both of the above commands do the same thing
EXAMPLES
ebook splitmeta book.html mybook.opf
ebook tidyxhtml book.html
ebook tidyxml mybook.opf
ebook fix mybook.opf --oeb12 --mobi
ebook genepub
ebook blank newbook.opf --title "My Title" --author "My Name"
ebook adddoc myfile.html
ebook fix newbook.opf --opf20 -v
ebook genepub
ebook unpack mybook.pdb my_book
cd my_book
ebook addoc new_document.html
ebook fix
ebook genepub
BUGS/TODO
Need to implement a one-pass conversion from one format to another. This will wait until more formats are supported by the underlying modules, however.
Documentation is incomplete
Not all configuration file options are actually used
COPYRIGHT
Copyright 2008 Zed Pobre
LICENSE
Licensed to the public under the terms of the GNU GPL, version 2.