NAME
MongoDBx::Class::Tutorial - Tutorial for using MongoDBx::Class
VERSION
version 0.3
INTRODUCTION
MongoDBx::Class is a flexible object relational mapper (ORM) for MongoDB databases. Before reading this tutorial, please read "DESCRIPTION" in MongoDBx::Class.
CREATING DOCUMENT CLASSES
The first step towards using MongoDBx::Class is creating document classes. Lets say your application is called MyApp, and that it reviews novels by popular authors. Every novel has a title (its name), the full name of its author, its release year, a synopsis, a list of tags categorizing the novel, a list of related novels, and zero or more reviews.
MongoDBx::Class provides you with a flexible foundation for building classes that represent our proposed database schema and the relationships (or "joins") between these classes. This foundation is provided by three modules:
MongoDBx::Class::Moose - intended to be used by document classes in place of Moose. It provides Moose, and all needed relationship types, in the form of Moose attributes.
MongoDBx::Class::Document - a Moose role intended to be consumed by document classes. When documents are expanded from the database, this role is applied to them and thus provides them some common attributes and methods.
MongoDBx::Class::EmbeddedDocument - a Moose role intended to be consumed by embedded document classes. When embedded documents are found inside other documents and expanded, this role is applied to them and thus provides them some common attributes and methods.
Before actually creating document classes, we need to design our schema. The most common question to tackle when designing a schema is when and how to use references, and when to simply embed documents. Refer to http://www.mongodb.org/display/DOCS/Schema+Design#SchemaDesign-Embedvs.Reference for some information about schema design in MongoDB.
Our database schema will be like this:
Every novel will be represented by a document in the 'novels' collection.
An author's name will be stored in the novel's document as an embedded document that has first and last names, and possibly a middle name.
The synopsis of the novel will be stored as a standalone document in the 'synopsis' collection, and will simply reference the novel with a 'novel' attribute.
The list of tags will be stored in the novel document as an array of embedded documents.
The list of related novels will be stored in the novel document as an array of database references to other novels.
The reviews for every novel are stored as standalone documents in the 'reviews' collection, and will simply reference the novel with a 'novel' attribute.
Let's look at a possible JSON representation of a novel - "The Valley of Fear" by Sir Arthur Conan Doyle:
# in the 'novels' collection
{
"_id": ObjectId("4cbca90d576fad5916790100"),
"title": "The Valley of Fear",
"year": 1914,
"author": {
"first_name": "Arthur",
"middle_name": "Conan",
"last_name": "Doyle"
},
"tags": [
{ "category": "mystery", "subcategory": "thriller" },
{ "category": "mystery", "subcategory": "detective" },
{ "category": "crime", "subcategory": "fiction" }
],
"related_novels": [
{ "$ref": "novels", "$id": ObjectId("4cbca90d3a41e35916720100") },
{ "$ref": "novels", "$id": ObjectId("4cbca90d44d8c959167a0100") },
{ "$ref": "novels", "$id": ObjectId("4cbca90d486bbf59166f0100") }
]
}
# in the 'synopsis' collection
{
"_id": ObjectId("4cbca90d699e9a5916670100"),
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"text": "The Valley of Fear is the final Sherlock Holmes novel by Sir Arthur Conan Doyle. The story was first published in the Strand Magazine between September 1914 and May 1915. The first book edition was published in New York on 27 February 1915."
}
# in the 'reviews' collection
{
"_id": ObjectId("4cbca90dfbb2945916740100"),
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"reviewer": "Some Guy",
"text": "I really liked it!",
"score": 5
},
{
"_id": ObjectId("4cbca90e0ad57b5916f50100"),
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"reviewer": "Some Other Guy",
"text": "It was okay.",
"score": 3
},
{
"_id": ObjectId("4cbca90e0b9c175916c60100"),
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"reviewer": "Totally Different Guy",
"text": "Man, that just sucked!",
"score": 1
}
We now need to translate this structure to MongoDBx::Class document classes. As mentioned before, document classes are Moose classes, but instead of using Moose, they use MongoDBx::Class::Moose. We'll start with the document class representing novels:
package MyApp::Schema::Novel;
use MongoDBx::Class::Moose;
use namespace::autoclean;
with 'MongoDBx::Class::Document';
has 'title' => (is => 'ro', isa => 'Str', required => 1, writer => 'set_title');
holds_one 'author' => (is => 'ro', isa => 'MyApp::Schema::PersonName', required => 1, writer => 'set_author');
has 'year' => (is => 'ro', isa => 'Int', predicate => 'has_year', writer => 'set_year');
holds_many 'tags' => (is => 'ro', isa => 'MyApp::Schema::Tag', predicate => 'has_tags');
joins_one 'synopsis' => (is => 'ro', isa => 'Synopsis', coll => 'synopsis', ref => 'novel');
has_many 'related_novels' => (is => 'ro', isa => 'Novel', predicate => 'has_related_novels', writer => 'set_related_novels', clearer => 'clear_related_novels');
joins_many 'reviews' => (is => 'ro', isa => 'Review', coll => 'reviews', ref => 'novel');
sub print_related_novels {
my $self = shift;
foreach my $other_novel ($self->related_novels) {
print $other_novel->title, ', ',
$other_novel->year, ', ',
$other_novel->author->name, "\n";
}
}
around 'reviews' => sub {
my ($orig, $self) = (shift, shift);
my $cursor = $self->$orig;
return $cursor->sort([ year => -1, title => 1, 'author.last_name' => 1 ]);
};
__PACKAGE__->meta->make_immutable;
Aside from standard Moose usage, we've used some of MongoDBx::Class' built-in relationship types. Head over to MongoDBx::Class::Moose for a list of provided relationship types and how they translate to database entries.
In this example, we've used the holds_one
relationship to denote that a Novel document entirely holds one embedded PersonName sub-document. We've used holds_many
to have Novel documents entirely hold Tag documents. We've also used joins_one
to easily find a novel's synopsis, located in the synopsis collection. joins_many
was used to easily find all Review documents located in the reviews collection. Finally, we've used has_many
for the list of references to related novels.
You will notice that when using the holds_one
and holds_many
relationship types, we've given the full package names to the isa
option (such as MyApp::Schema::PersonName), while in other relationship types, we've given the class names only (such as 'Snyopsis'). This inconsistency is only temporary, and will change in future versions, so keep your finger on the pulse.
Lets create our two embedded document classes. We'll start with PersonName:
package MyApp::Schema::PersonName;
use MongoDBx::Class::Moose;
use namespace::autoclean;
with 'MongoDBx::Class::EmbeddedDocument';
has 'first_name' => (is => 'ro', isa => 'Str', required => 1, writer => 'set_first_name');
has 'middle_name' => (is => 'ro', isa => 'Str', predicate => 'has_middle_name', writer => 'set_middle_name');
has 'last_name' => (is => 'ro', isa => 'Str', required => 1, writer => 'set_last_name');
sub name {
my $self = shift;
my $name = $self->first_name;
$name .= ' '.$self->middle_name.' ' if $self->has_middle_name;
$name .= $self->last_name;
return $name;
}
__PACKAGE__->meta->make_immutable;
This is a very simple class, with no relationships. We use the name()
method to easily print a person's full name.
On to the Tag document class:
package MyApp::Schema::Tag;
use MongoDBx::Class::Moose;
use namespace::autoclean;
with 'MongoDBx::Class::EmbeddedDocument';
has 'category' => (is => 'ro', isa => 'Str', required => 1);
has 'subcategory' => (is => 'ro', isa => 'Str', required => 1);
__PACKAGE__->meta->make_immutable;
Again, this is a very simple example. Embedded documents will often be as simple as that.
We have two document classes left. The first is the Synopsis class:
package MyApp::Schema::Synopsis;
use MongoDBx::Class::Moose;
use namespace::autoclean;
with 'MongoDBx::Class::Document';
belongs_to 'novel' => (is => 'ro', isa => 'Novel', required => 1);
has 'text' => (is => 'ro', isa => 'Str', writer => 'set_text', required => 1);
__PACKAGE__->meta->make_immutable;
Here, we've used belongs_to
to signify a Synopsis document belongs to a Novel document. Every Synopsis document has a 'novel' attribute which references the parent Novel document.
Only the Review class is left:
package MyApp::Schema::Review;
use MongoDBx::Class::Moose;
use namespace::autoclean;
with 'MongoDBx::Class::Document';
belongs_to 'novel' => (is => 'ro', isa => 'Novel', required => 1);
has 'reviewer' => (is => 'ro', isa => 'Str', required => 1);
has 'text' => (is => 'ro', isa => 'Str', required => 1);
has 'score' => (is => 'ro', isa => 'Int', predicate => 'has_score');
__PACKAGE__->meta->make_immutable;
That wraps up our document classes. Before we continue, it is important to note that MongoDBx::Class' ability to identify the class of a document is currently reliant on the existance of a '_class' attribute in every document in the database (as described in "CAVEATS AND THINGS TO CONSIDER" in MongoDBx::Class). So, looking at the JSON representations from before, we need to modify the representations like so:
# in the 'novels' collection
{
"_id": ObjectId("4cbca90d576fad5916790100"),
"_class": "Novel",
"title": "The Valley of Fear",
"year": 1914,
"author": {
"first_name": "Arthur",
"middle_name": "Conan",
"last_name": "Doyle"
},
"tags": [
{ "category": "mystery", "subcategory": "thriller" },
{ "category": "mystery", "subcategory": "detective" },
{ "category": "crime", "subcategory": "fiction" }
],
"related_novels": [
{ "$ref": "novels", "$id": ObjectId("4cbca90d3a41e35916720100") },
{ "$ref": "novels", "$id": ObjectId("4cbca90d44d8c959167a0100") },
{ "$ref": "novels", "$id": ObjectId("4cbca90d486bbf59166f0100") }
]
}
# in the 'synopsis' collection
{
"_id": ObjectId("4cbca90d699e9a5916670100"),
"_class": "Synopsis",
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"text": "The Valley of Fear is the final Sherlock Holmes novel by Sir Arthur Conan Doyle. The story was first published in the Strand Magazine between September 1914 and May 1915. The first book edition was published in New York on 27 February 1915."
}
# in the 'reviews' collection
{
"_id": ObjectId("4cbca90dfbb2945916740100"),
"_class": "Review",
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"reviewer": "Some Guy",
"text": "I really liked it!",
"score": 5
},
{
"_id": ObjectId("4cbca90e0ad57b5916f50100"),
"_class": "Review",
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"reviewer": "Some Other Guy",
"text": "It was okay.",
"score": 3
},
{
"_id": ObjectId("4cbca90e0b9c175916c60100"),
"_class": "Review",
"novel": { "$ref": "novels", "$id": ObjectId("4cbca90d576fad5916790100") },
"reviewer": "Totally Different Guy",
"text": "Man, that just sucked!",
"score": 1
}
You will notice that it is not required to add the '_class' attribute to embedded documents, only to standalone documents. The reason for the '_class' requirement is the fact that MongoDBx::Class doesn't enforce one collection for every document class. Every collection can have documents of one or more classes, and documents of the same class can be stored in one or more collections, even databases.
LOADING MongoDBx::Class AND CONNECTING TO A MongoDB SERVER
The next step is loading the schema we've just created, and connecting to a MongoDB server:
my $dbx = MongoDBx::Class->new(namespace => 'MyApp::Schema');
We need to pass the namespace of our schema to MongoDBx::Class. It will attempt to automatically load every document class under that namespace, and will return a MongoDBx::Class object back.
We then initiate a connection to a server:
$dbx->connect();
We don't pass anything to the connect()
method, so it attempts to connect to a MongoDB server running on 'localhost', on the default 27017 port. We can connect to a specific server like so:
$dbx->connect(host => $host, port => $port);
While the connect()
method returns a MongoDBx::Class::Connection object, we do not use it directly. The method also stores that object in the MongoDBx::Class object ($dbx
above) itself as the 'conn attribute, and we're supposed to use it from there.
$dbx->conn->get_database('whatever');
INSERTING DOCUMENTS
Now the we've loaded our schema and connected to a server, we can start using MongoDBx::Class. Basically, our usage will not differ greatly from direct MongoDB usage, as MonogDBx::Class simply extends MongoDB. The biggest difference between directly using MongoDB and using MongoDBx::Class, is the automatic expanding and collapsing of documents. Documents are automatically expanded when inserting documents only if the safe
option is on. By default, MongoDB performs unsafe operations. MongoDBx::Class adds the ability to use safe operations by default. Lets activate it first:
$dbx->conn->safe(1);
Now we don't need to pass <{ safe =
1 }>> to our insert
s, update
s and other operations. If you're doing mostly safe operations, but need to do unsafe operations from time to time, you can pass <{ safe =
0 }>>.
Lets create a novel document:
my $db = $dbx->conn->get_database('myapp');
my $novels_coll = $db->get_collection('novels');
my $novel = $novels_coll->insert({
_class => 'Novel',
title => 'The Valley of Fear',
year => 1914,
author => {
first_name => 'Arthur',
middle_name => 'Conan',
last_name => 'Doyle',
},
tags => [
{ category => 'mystery', subcategory => 'thriller' },
{ category => 'mystery', subcategory => 'detective' },
{ category => 'crime', subcategory => 'fiction' },
],
});
Notice that when inserting the novel document, we've directly inserted the PersonName embedded document and the Tags embedded documents as hash-refs.
This insert, which was safe (since the 'safe' attribute of our connection object had a true value), returns the novel document after expansion:
$novel->author->name; # prints 'Arthur Conan Doyle'
If the insert was unsafe, we'd just get the MongoDB::OID of the document back. But note that you can't get the OID object and immediately attempt to load the document with it, as you can't predict the order in which MongoDB will perform asynchronous operations.
Lets insert our synopsis now:
my $synopsis = $db->synopsis->insert({
_class => 'Synopsis',
novel => $novel,
text => "The Valley of Fear is the final Sherlock Holmes novel by Sir Arthur Conan Doyle. The story was first published in the Strand Magazine between September 1914 and May 1915. The first book edition was published in New York on 27 February 1915.",
});
Notice how we've passed the $novel
object directly as the 'novel' attribute. When inserting, MongoDBx::Class will automatically save it as a DBRef object for us.
Now for our reviews:
my @reviews = $dbx->conn->get_collection('reviews')->batch_insert([
{
_class => 'Review',
novel => $novel,
reviewer => 'Some Guy',
text => 'I really liked it!',
score => 5,
},
{
_class => 'Review',
novel => $novel,
reviewer => 'Some Other Guy',
text => 'It was okay.',
score => 3,
},
{
_class => 'Review',
novel => $novel,
reviewer => 'Totally Different Guy',
text => 'Man, that just sucked!',
score => 1,
}
]);
my ($total_score, $avg_score) = (0, 0);
foreach (@reviews) {
$total_score += $_->score || 0;
}
$avg_score = $total_score / scalar(@reviews);
print $avg_score; # prints 3
If we now run <$novel-
reviews()>>, we'd get a MongoDBx::Class::Cursor back. And since we've created a method modification in the Novel class on this method, this cursor will also be sorted.
foreach ($novel->reviews) {
# do stuff
}
SEARCHING DOCUMENTS
MongoDBx::Class adds automatic expansion to document searching, plus some convenient shortcuts. Say we've received the ID of a novel document as input, and we want to load it. With MongoDB, we'd probably do:
$db->novels->find_one({ _id => MongoDB::OID->new(value => $oid) })
With MongoDBx::Class, we can just do:
$db->novels->find_one($oid);
And $oid
can either be the string ID, or a MongoDB::OID object.
If the document is found, and has the '_class' attribute, then it will be automatically expanded.
my $novel = $db->novels->find_one($oid);
print $novel->author->name; # again, prints 'Arthur Conan Doyle'
Lets search for reviews written by a certain reviewer:
my $cursor = $db->reviews->find({ reviewer => 'Some Guy' })->sort({ score => -1 }); # we can also use C<query()> or C<search()>, they're the same method
This gives a MongoDBx::Class::Cursor back:
print $cursor->count; # prints 1
while (my $review = $cursor->next) {
print $review->novel->title, "\n";
}
Sorting documents is easier. We can sort by a list of ordered attributes like so:
$cursor->sort([ attr1 => 1, attr2 => 1, attr3 => -1 ]);
UPDATING DOCUMENTS
Updating documents is much easier with MongoDBx::Class. There are two ways to update documents in MongoDBx::Class:
The older, MongoDB way of using the
update()
method in MongoDBx::Class::Collection. This is now mostly used to update multiple documents at once.The new, MongoDBx::Class way of using the
update()
method provided to document classes by MongoDBx::Class::Document. This is used to update a specific document.
First, lets take a look at the first way. Suppose we want to cheat and update all reviews for "The Valley of Fear" with a score of five:
$db->reviews->update({ 'novel.$id' => $novel->_id }, { '$set' => { score => 5 } }, { multiple => 1 });
This is exactly like using MongoDB::Collection directly. If we're updating a specific document we've already loaded, however, MongoDBx::Class provides a much more comfortable way. Instead of doing:
$db->novels->update({ _id => $novel->_id }, { ... })
We can do:
$novel->update({ year => 1915 });
This will effectively invoke a '$set' update like this:
$db->novels->update({ _id => $novel->_id }, { '$set' => { year => 1915 } });
But this isn't really the Moose way of doing things, so MongoDBx::Class gives us yet another way of updating a document:
$novel->set_year(1915);
$novel->update;
When invoking update()
on a document object with no arguments, a "snapshot" of the document is taken, and the following update is effectively performed:
$db->novels->update({ _id => $novel->_id }, { title => "The Valley of Fear", year => 1915, ... });
When we pass arguments to the update()
method (there are two arguments we can pass, the hash-ref of updates to perform, and the standard options hash-ref we know from the original update()
method in MongoDB::Collection), MongoDBx::Class simply performs a '$set' update on the passed hash-ref attributes only. So doing this:
$novel->set_year(1915);
$novel->update({ title => 'The Valley of Fearrrrrr' });
Will only result in the 'title' attribute being updated, not the year attribute.
Updating embedded documents is similarly easy:
$novel->author->set_first_name('Sir Arthur');
$novel->update;
REMOVING DOCUMENTS
Removing documents with MongoDBx::Class is very easy. Having the document object, simply call:
$novel->remove;
Or:
$novel->delete;
And this novel document will be removed from the database. Note, however, that the delete operation does not cascade, so only the novel document is deleted. The synopsis and reviews are not deleted, and you have to do so manually.
You can still use the original remove()
method on collections, now mostly to remove multiple documents:
$db->get_collection('reviews')->remove({ 'novel.$id' => $novel->_id });
FAQ
Can I use more than one database?
Yes, you can use as many databases as you like and use the same document classes across them:
my $data_db = $dbx->conn->get_database('myapp_data');
my $user_db = $dbx->conn->get_database('myapp_users');
Can I define different document classes to different databases?
There currently isn't a way to define individual schemas for different databases. You can, however, "split" your schema. For example, if your application has a data DB and a user DB, you can put all the document classes of the data DB under MyApp::Schema::Data, and all the document classes of the user DB under MyApp::Schema::User.
What if I want to use the asynchronous AnyMongo driver instead?
Currently, MongoDBx::Class only supports the official MongoDB driver, but support for AnyMongo is planned.
What if I have attributes I don't want to save in the database?
MongoDBx::Class does not provide an option like that yet, but will probably do so in upcoming versions.
Does MongoDBx::Class provide any automatic object inflations/expansions for common classes?
No, at least not yet. Have a look at MongoDB::DataTypes for a list of current inflations the MongoDB driver natively supports. I do plan to provide some inflations in future versions.
Who framed Roger Rabbit?
I framed Roger Rabbit!
WRAP UP
You now know how you can use MongoDBx::Class in your applications, and make MongoDB act like a relational database, without sacrificing its NoSQL nature and special features.
Before using MongoDBx::Class, please take into account that its alpha software, and is not ready yet for production use. If you find bugs, please report them in the usual channels (see "BUGS" in MongoDBx::Class).
AUTHOR
Ido Perlmuter, <ido at ido50.net>
SUPPORT
You can find documentation for this module with the perldoc command.
perldoc MongoDBx::Class::Tutorial
SEE ALSO
MongoDBx::Class, MongoDB, http://www.mongodb.org/.
LICENSE AND COPYRIGHT
Copyright 2010 Ido Perlmuter.
This program is free software; you can redistribute it and/or modify it under the terms of either: the GNU General Public License as published by the Free Software Foundation; or the Artistic License.
See http://dev.perl.org/licenses/ for more information.