Version 4 (modified by jkocherhans, 18 years ago) ( diff )

updated to use option 3 for modeling parent relations in sql

The previous version of this page is obsolete due to the model syntax change. Additionally, there's now a better reference for this topic:

http://www.djangoproject.com/documentation/models/subclassing/

magic-removal: Model Inheritance

This is a proposal for how subclassing should work in Django. There a lot of details to get right, so this proposal should be very specific and detailed. Most of the ideas here come from the thread linked below:

http://groups.google.com/group/django-developers/browse_frm/thread/ea5e0bf903058fac/9a68ac0d99cb6d7d?q=semantics&rnum=1#9a68ac0d99cb6d7d

Subclassing

For subclassing, there are 3 main issues:

  1. How do we model the relations in SQL?
  2. How do joins work?
  3. How does the API work?

Note that I have only provided examples for single inheritance here. Is multiple inheritance worth supporting?

The examples will use the following models:

class Place(models.Model):
    name = models.CharField(maxlength=50)

class Restaurant(Place):
    description = models.TextField()

class ItalianRestaurant(Restaurant):
    has_decent_gnocci = models.BooleanField()

1. Modeling parent relations in SQL?

The general consesus seems to be this:

CREATE TABLE "myapp_place" (
    "id" integer NOT NULL PRIMARY KEY,
    "name" varchar(50) NOT NULL
);

CREATE TABLE "myapp_restaurant" (
    /* PRIMARY KEY REFERENCES "myapp_places" ("id") works for postgres, what about others? */
    "id" integer NOT NULL PRIMARY KEY REFERENCES "myapp_places" ("id"), 
    "description" text NOT NULL
);

CREATE TABLE "myapp_italianrestaurant" (
    "id" integer NOT NULL PRIMARY KEY REFERENCES "myapp_restaurant" ("id"),
    "has_decent_gnocci" bool NOT NULL
);

2. Modeling joins in SQL

When we want a list of ItalianRestaurants, we obviously need all the fields from myapp_restaurant and myapp_place as well. This could be accomplished by left joins. It would look something like this:

SELECT ...
FROM myapp_italianrestaurant as ir
LEFT JOIN myapp_restaurant as r
ON ir.restaurant_id=r.id
LEFT JOIN myapp_place as p
ON r.place_id=p.id

But what if we want a list of Places, what should we do? We can either just get the places:

SELECT ...
FROM myapp_place

Or we can get everything with left joins (this allows the iterator to return objects of the appropriate type, rather than just a bunch of Places):

SELECT ...
FROM myapp_place as p
LEFT JOIN myapp_restaurant as r
ON r.place_id=p.id
LEFT JOIN myapp_italianrestaurant as ir
ON ir.restaurant_id=r.id

Imagine we have more than one subclass of Place though. The join clause and the column list would get pretty hefty. This could obviously get unmanageable pretty quickly.

Another option is to lazily load objects like Restaurant and ItalianRestaurant while we're iterating over Place.objects.all(), but that requires a lot of database queries. Either way, doing this will be expensive, and api should reflect that. You're much better off just using Places fields if you are going to iterate over Place.objects.all()

3. API

The following API examples assume we have created these objects:

p = Place(name='My House')
r = Restaurant(name='Road Kill Cafe', description='Yuck!')
ir = ItalianRestaurant(name='Luigis', description='Something', has_decent_gnocci=False)

For the following examples, assume Place.objects.get(2) returns r and Place.objects.get(3) returns ir.

python result
A. Place.objects.count() 3
B. Restaurant.objects.count() 2
C. ItalianRestaurant.objects.count() 1
D. Place.objects.get(2).description 'Yuck!' or AttributeError?
E. Restaurant.objects.get(2).description 'Yuck!'

Change the current usage of subclassing

class MyArticle(Article):
  ...fields...
  class META:
        module_name = 'my_articles'
        remove_fields = ...some fields...

would change to:

class MyArticle(meta.Model):
   ...fields...
   class META:
        copy_from = Article
        remove_fields = ...some fields...
Note: See TracWiki for help on using the wiki.
Back to Top