NAME
Image::Leptonica::Func::list
VERSION
version 0.04
list.c
list.c
Inserting and removing elements
void listDestroy()
DLLIST *listAddToHead()
l_int32 listAddToTail()
l_int32 listInsertBefore()
l_int32 listInsertAfter()
void *listRemoveElement()
void *listRemoveFromHead()
void *listRemoveFromTail()
Other list operations
DLLIST *listFindElement()
DLLIST *listFindTail()
l_int32 listGetCount()
l_int32 listReverse()
DLLIST *listJoin()
Lists are much harder to handle than arrays. There is
more overhead for the programmer, both cognitive and
codewise, and more likelihood that an error can be made.
For that reason, lists should only be used when it is
inefficient to use arrays, such as when elements are
routinely inserted or deleted from inside arrays whose
average size is greater than about 10.
A list of data structures can be implemented in a number
of ways. The two most popular are:
(1) The list can be composed of a linked list of
pointer cells ("cons cells"), where the data structures
are hung off the cells. This is more difficult
to use because you have to keep track of both
your hanging data and the cell structures.
It requires 3 pointers for every data structure
that is put in a list. There is no problem
cloning (using reference counts) for structures that
are put in such a list. We implement lists by this
method here.
(2) The list pointers can be inserted directly into
the data structures. This is easy to implement
and easier to use, but it adds 2 ptrs of overhead
to every data structure in which the ptrs are embedded.
It also requires special care not to put the ptrs
in any data that is cloned with a reference count;
else your lists will break.
Writing C code that uses list pointers explicitly to make
and alter lists is difficult and prone to error.
Consequently, a generic list utility that handles lists
of arbitrary objects and doesn't force the programmer to
touch the "next" and "prev" pointers, is quite useful.
Such functions are provided here. However, the usual
situation requires traversing a list and applying some
function to one or more of the list elements. Macros
for traversing the list are, in general, necessary, to
achieve the goal of invisibly handling all "next" and "prev"
pointers in generic lists. We provide macros for
traversing a list in both forward and reverse directions.
Because of the typing in C, implementation of a general
list utility requires casting. If macros are used, the
casting can be done implicitly; otherwise, using functions,
some of the casts must be explicit. Fortunately, this
can be implemented with void* so the programmer using
the library will not have to make any casts! (Unless you
compile with g++, in which case the rules on implicit
conversion are more strict.)
For example, to add an arbitrary data structure foo to the
tail of a list, use
listAddToTail(&head, &tail, pfoo);
where head and tail are list cell ptrs and pfoo is
a pointer to the foo object.
And to remove an arbitrary data structure foo from a
list, when you know the list cell element it is hanging from,
use
pfoo = listRemoveElement(&head, elem)
where head and elem are list cell ptrs and pfoo is a pointer
to the foo object. No casts are required for foo in
either direction in ANSI C. (However, casts are
required for ANSI C++).
We use lists that are composed of doubly-linked
cells with data structures hanging off the cells.
We use doubly-linked cells to simplify insertion
and deletion, and to allow operations to proceed in either
direction along the list. With doubly-linked lists,
it is tempting to make them circular, by setting head->prev
to the tail of the list and tail->next to the head.
The circular list costs nothing extra in storage, and
allows operations to proceed from either end of the list
with equal speed. However, the circular link adds
cognitive overhead for the application programmer in
general, and it greatly complicates list traversal when
arbitrary list elements can be added or removed as you
move through. It can be done, but in the spirit of
simplicity, we avoid the temptation. The price to be paid
is the extra cost to find the tail of a list -- a full
traversal -- before the tail can be used. This is a
cheap price to pay to avoid major headaches and buggy code.
When you are only applying some function to each element
in a list, you can go either forwards or backwards.
To run through a list forwards, use:
for (elem = head; elem; elem = nextelem) {
nextelem = elem->next; (in case we destroy elem)
<do something with elem->data>
}
To run through a list backwards, find the tail and use:
for (elem = tail; elem; elem = prevelem) {
# prevelem = elem->prev; (in case we destroy elem)
<do something with elem->data>
}
Even though these patterns are very simple, they are so common
that we've provided macros for them in list.h. Using the
macros, this becomes:
L_BEGIN_LIST_FORWARD(head, elem)
<do something with elem->data>
L_END_LIST
L_BEGIN_LIST_REVERSE(tail, elem)
<do something with elem->data>
L_END_LIST
Note again that with macros, the application programmer does
not need to refer explicitly to next and prev fields. Also,
in the reverse case, note that we do not explicitly
show the head of the list. However, the head of the list
is always in scope, and functions can be called within the
iterator that change the head.
Some special cases are simpler. For example, when
removing all items from the head of the list, you can use
while (head) {
obj = listRemoveFromHead(&head);
<do something with obj>
}
Removing successive elements from the tail is equally simple:
while (tail) {
obj = listRemoveFromTail(&head, &tail);
<do something with obj>
}
When removing an arbitrary element from a list, use
obj = listRemoveElement(&head, elem);
All the listRemove*() functions hand you the object,
destroy the list cell to which it was attached, and
reset the list pointers if necessary.
Several other list operations, that do not involve
inserting or removing objects, are also provided.
The function listFindElement() locates a list pointer
by matching the object hanging on it to a given
object. The function listFindTail() gets a handle
to the tail list ptr, allowing backwards traversals of
the list. listGetCount() gives the number of elements
in a list. Functions that reverse a list and concatenate
two lists are also provided.
These functions can be modified for efficiency in the
situation where there is a large amount of creation and
destruction of list cells. If millions of cells are
made and destroyed, but a relatively small number are
around at any time, the list cells can be stored for
later re-use in a stack (see the generic stack functions
in stack.c).
FUNCTIONS
listAddToHead
l_int32 listAddToHead ( DLLIST **phead, void *data )
listAddToHead()
Input: &head (<optional> input head)
data (void* ptr, to be added)
Return: 0 if OK; 1 on error
Notes:
(1) This makes a new cell, attaches the data, and adds the
cell to the head of the list.
(2) When consing from NULL, be sure to initialize head to NULL
before calling this function.
listAddToTail
l_int32 listAddToTail ( DLLIST **phead, DLLIST **ptail, void *data )
listAddToTail()
Input: &head (<may be updated>, head can be null)
&tail (<updated>, tail can be null)
data (void* ptr, to be hung on tail cons cell)
Return: 0 if OK; 1 on error
Notes:
(1) This makes a new cell, attaches the data, and adds the
cell to the tail of the list.
(2) &head is input to allow the list to be "cons'd" up from NULL.
(3) &tail is input to allow the tail to be updated
for efficient sequential operation with this function.
(4) We assume that if *phead and/or *ptail are not NULL,
then they are valid addresses. Therefore:
(a) when consing from NULL, be sure to initialize both
head and tail to NULL.
(b) when tail == NULL for an existing list, the tail
will be found and updated.
listDestroy
void listDestroy ( DLLIST **phead )
listDestroy()
Input: &head (<to be nulled> head of list)
Return: void
Notes:
(1) This only destroys the cons cells. Before destroying
the list, it is necessary to remove all data and set the
data pointers in each cons cell to NULL.
(2) listDestroy() will give a warning message for each data
ptr that is not NULL.
listFindElement
DLLIST * listFindElement ( DLLIST *head, void *data )
listFindElement()
Input: head (list head)
data (void* address, to be searched for)
Return: cell (the containing cell, or null if not found or on error)
Notes:
(1) This returns a ptr to the cell, which is still embedded in
the list.
(2) This handle and the attached data have not been copied or
reference counted, so they must not be destroyed. This
violates our basic rule that every handle returned from a
function is owned by that function and must be destroyed,
but if rules aren't there to be broken, why have them?
listFindTail
DLLIST * listFindTail ( DLLIST *head )
listFindTail()
Input: head
Return: tail, or null on error
listGetCount
l_int32 listGetCount ( DLLIST *head )
listGetCount()
Input: head (of list)
Return: number of elements; 0 if no list or on error
listInsertAfter
l_int32 listInsertAfter ( DLLIST **phead, DLLIST *elem, void *data )
listInsertAfter()
Input: &head (<optional> input head)
elem (list element to be inserted after;
must be null if head is null)
data (void* ptr, to be added)
Return: 0 if OK; 1 on error
Notes:
(1) This can be called on a null list, in which case both
head and elem must be null. The head is included
in the call to allow "consing" up from NULL.
(2) If you are searching through a list, looking for a condition
to add an element, you can do something like this:
L_BEGIN_LIST_FORWARD(head, elem)
<identify an elem to insert after>
listInsertAfter(&head, elem, data);
L_END_LIST
listInsertBefore
l_int32 listInsertBefore ( DLLIST **phead, DLLIST *elem, void *data )
listInsertBefore()
Input: &head (<optional> input head)
elem (list element to be inserted in front of;
must be null if head is null)
data (void* address, to be added)
Return: 0 if OK; 1 on error
Notes:
(1) This can be called on a null list, in which case both
head and elem must be null.
(2) If you are searching through a list, looking for a condition
to add an element, you can do something like this:
L_BEGIN_LIST_FORWARD(head, elem)
<identify an elem to insert before>
listInsertBefore(&head, elem, data);
L_END_LIST
listJoin
l_int32 listJoin ( DLLIST **phead1, DLLIST **phead2 )
listJoin()
Input: &head1 (<may be changed> head of first list)
&head2 (<to be nulled> head of second list)
Return: 0 if OK, 1 on error
Notes:
(1) The concatenated list is returned with head1 as the new head.
(2) Both input ptrs must exist, though either can have the value NULL.
listRemoveElement
void * listRemoveElement ( DLLIST **phead, DLLIST *elem )
listRemoveElement()
Input: &head (<can be changed> input head)
elem (list element to be removed)
Return: data (void* struct on cell)
Notes:
(1) in ANSI C, it is not necessary to cast return to actual type; e.g.,
pix = listRemoveElement(&head, elem);
but in ANSI C++, it is necessary to do the cast:
pix = (Pix *)listRemoveElement(&head, elem);
listRemoveFromHead
void * listRemoveFromHead ( DLLIST **phead )
listRemoveFromHead()
Input: &head (<to be updated> head of list)
Return: data (void* struct on cell), or null on error
Notes:
(1) in ANSI C, it is not necessary to cast return to actual type; e.g.,
pix = listRemoveFromHead(&head);
but in ANSI C++, it is necessary to do the cast; e.g.,
pix = (Pix *)listRemoveFromHead(&head);
listRemoveFromTail
void * listRemoveFromTail ( DLLIST **phead, DLLIST **ptail )
listRemoveFromTail()
Input: &head (<may be changed>, head must NOT be null)
&tail (<always updated>, tail may be null)
Return: data (void* struct on cell) or null on error
Notes:
(1) We include &head so that it can be set to NULL if
if the only element in the list is removed.
(2) The function is relying on the fact that if tail is
not NULL, then is is a valid address. You can use
this function with tail == NULL for an existing list, in
which case the tail is found and updated, and the
removed element is returned.
(3) In ANSI C, it is not necessary to cast return to actual type; e.g.,
pix = listRemoveFromTail(&head, &tail);
but in ANSI C++, it is necessary to do the cast; e.g.,
pix = (Pix *)listRemoveFromTail(&head, &tail);
listReverse
l_int32 listReverse ( DLLIST **phead )
listReverse()
Input: &head (<may be changed> list head)
Return: 0 if OK, 1 on error
Notes:
(1) This reverses the list in-place.
AUTHOR
Zakariyya Mughal <zmughal@cpan.org>
COPYRIGHT AND LICENSE
This software is copyright (c) 2014 by Zakariyya Mughal.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.