Take me over?
NAME
Catalyst::Plugin::Session::Tutorial - Understanding and using sessions.
ASSUMPTIONS
This tutorial assumes that you are familiar with web applications in general and Catalyst specifically (up to models and configuration), and that you know what HTTP is.
WHAT ARE SESSIONS
When users use a site, especially one that knows who they are (sites you log in to, sites which let you keep a shopping cart, etc.), the server preparing the content has to know that request X comes from client A while request Y comes from client B, so that each user gets the content meant for it.
The problem is that HTTP is a stateless protocol. This means that every request is distinct, and even if it comes from the same client, it's difficult to know that.
The way sessions are maintained between distinct requests is that the client says, for every request, "I'm client A" or "I'm client B".
This piece of data that tells the server "I'm X" is called the session ID, and the threading of several requests together is called a session.
HOW SESSIONS WORK
Cookies
HTTP has a feature that lets this become easier, called cookies. A cookie is something the server asks the client to save somewhere, and resend every time a request is made.
The way they work is that the server sends the Set-Cookie
header, with a cookie name, a value, and some metadata (like when it expires, what paths it applies to, etc.). The client saves this.
Then, on every subsequent request the client will send a Cookie
header, with the cookie name and value.
Cookie Alternatives
Another way is to make sure that the session ID is repeated is to include it in every URI.
This can be as either a part of the path, or as a query parameter.
This technique has several issues which are discussed in "CAVEATS" in Catalyst::Plugin::Session::State::URI.
Server-Side Behavior
When the server receives the session ID it can then look this key up in a database of some sort. For example the database can contain a shopping cart's contents, user preferences, etc.
USING SESSIONS
In Catalyst, the Catalyst::Plugin::Session plugin provides an API for convenient handling of session data. This API is based on the older, less flexible and less reliable Catalyst::Plugin::Session::FastMmap.
The plugin is modular, and requires backend plugins to be used.
State Plugins
State plugins handle session ID persistence. For example Catalyst::Plugin::Session::State::Cookie creates a cookie with the session ID in it.
These plugins will automatically set $c->sessionid
at the begining of the request, and automatically cause $c->sessionid
to be saved by the client at the end of the request.
Store Plugins
The backend into which session data is stored is provided by these plugins. For example, Catalyst::Plugin::Session::Store::DBI uses a database table to store session data, while Catalyst::Plugin::Session::Store::FastMmap uses Cache::FastMmap.
Configuration
First you need to load the appropriate plugins into your Catalyst application:
package MyApp;
use Catalyst qw/
Session
Session::State::Cookie
Session::Store::File
/;
This loads the session API, as well as the required backends of your choice.
After the plugins are loaded they need to be configured. This is done according to "Configure_your_application" in Catalyst::Manual::Cookbook.
Each backend plugin requires its own configuration options (with most plugins providing sensible defaults). The session API itself also has configurable options listed in "CONFIGURATION" in Catalyst::Plugin::Session.
For the plugins above we don't need any configuration at all - they should work out of the box, but suppose we did want to change some things around, it'll look like this:
MyApp->config( session => {
cookie_name => "my_fabulous_cookie",
storage => "/path/to/store_data_file",
});
Usage
Now, let's say we have an online shop, and the user is adding an item to the shopping cart.
Typically the item the user was viewing would have a form or link that adds the item to the cart.
Suppose this link goes to /cart/add/foo_baz/2
, meaning that we want two units of the item foo_baz
to be added to the cart.
Our add
action should look something like this:
package MyApp::Controller::Cart;
sub add : Local {
my ( $self, $c, $item_id, $quantity ) = @_;
$quantity ||= 1;
if ( $c->model("Items")->item_exists($item_id) ) {
$c->session->{cart}{$item_id} += $quantity;
} else {
die "No such item";
}
}
The way this works is that $c->session
always returns a hash reference to some data which is stored by the storage backend plugin. The hash reference returned always contains the same items that were in there at the end of the last request.
All the mishmash described above is done automatically. First, the method looks to see if a session ID is set. This session ID will be set by the State plugin if appropriate, at the start of the request (e.g. by looking at the cookies sent by the client).
If a session ID is set, the store will be asked to retrieve the session data for that specific session ID, and this is returned from $c->session
. This retrieval is cached, and will only happen once per request, if at all.
If a session ID is not set, a new one is generated, a new anonymous hash is created and saved in the store with the session ID as the key, and the reference to the hash is returned.
The action above takes this hash reference, and updates a nested hash within it, that counts quantity of each item as stored in the cart.
Any cart-listing code can then look into the session data and use it to display the correct items, which will, of course, be remembered across requests.
Here is an action some Template Toolkit example code that could be used to generate a cart listing:
sub list_cart : Local {
my ( $self, $c ) = @_;
# get the cart data, that maps from item_id to quantity
my $cart = $c->session->{cart} || {};
# this is our abstract model in which items are stored
my $storage = $c->model("Items");
# map from item_id to item (an object or hash reference)
my %items = map { $_ => $storage->get_item($_) } keys %$cart;
# put the relevant info on the stash
$c->stash->{cart}{items} = \%items;
$c->stash->{cart}{quantity} = $cart;
}
And [a part of] the template it forwards to:
<table>
<thead>
<tr>
<th>Item</th>
<th>Quantity</th>
<th>Price</th>
<th>remove</th>
</tr>
</thead>
<tbody>
[%# the table body lists all the items in the cart %]
[% FOREACH item_id = cart.items.keys %]
[%# each item has its own row in the table %]
[% item = cart.items.$item_id %]
[% quantity = cart.quantity.$item_id %]
<tr>
<td>
[%# item.name is an attribute in the item
# object, as loaded from the store %]
[% item.name %]
</td>
<td>
[%# supposedly this is part of a form where you
# can update the quantity %]
<input type="text" name="[% item_id %]_quantity"
value="[% quantity %]" />
</td>
<td> $ [% item.price * quantity %] </td>
<td>
<a href="[% c.uri_for('/cart/remove') %]/[% item_id %]">
<img src="/static/trash_can.png" />
</a>
</td>
[% END %]
<tbody>
<tfoot>
<tr>
<td colspan="2"> Total: </td>
<td>
[%# calculate sum in this cell - too
# much headache for a tutorial ;-) %]
</td>
<td>
<a href="[% c.uri_for('/cart/empty') %]">Empty cart</a>
</td>
</tr>
</tfoot>
</table>
As you can see the way that items are added into $c->session->{cart}
is pretty simple. Since $c->session
is restored as necessary, and contains data from previous requests by the same client, the cart can be updated as the user navigates the site pretty transparently.
SECURITY ISSUES
These issues all relate to how session data is managed, as described above. These are not issues you should be concerned about in your application code, but are here for their educational value.
(Not) Trusting the Client
In order to avoid the overhead of server-side data storage, the session data can be included in the cookie itself.
There are two problems with this:
The user can change the data.
Cookies have a 4 kilobyte size limit.
The size limit is of no concern in this section, but data changing is. In the database scheme the data can be trusted, since the user can neither read nor write it. However, if the data is delegated to the user, then special measures have to be added for ensuring data integrity, and perhaps secrecy too.
This can be implemented by encrypting and signing the cookie data, but this is a big headache.
Session Hijacking
What happens when client B says "I'm client A"? Well, basically, the server buys it. There's no real way around it.
The solution is to make "I'm client A" a difficult thing to say. This is why session IDs are randomized. If they are properly randomized, session IDs are so hard to guess that they must be stolen instead.
This is called session hijacking. There are several ways one might hijack another user's session.
Cross Site Scripting
One is by using cross site scripting attacks to steal the cookie data. In community sites, where users can cause the server to display arbitrary HTML, they can use this to put JavaScript code on the server.
If the server does not enforce a strict subset of tags that may be used, the malicious user could use this code to steal the cookies (there is a JavaScript API that lets cookies be accessed, but this code has to be run on the same website that the cookie came from).
Social Engineering
By tricking a user into revealing a URI with session data embedded in it (when cookies are not used), the session ID can also be stolen.
Also, a naive user could be tricked into showing the cookie data from the browser to a malicious user.
AUTHOR
Yuval Kogman <nothingmuch@woobling.org>