Opened 14 years ago
Closed 14 years ago
#15748 closed Bug (invalid)
QuerySet subscripting won't work as expected
Description ¶
when got a QuerySet by using filter and it's easy to use the following code to do the change and save operation:
qs = SomeModel.objects.filter(owner_id=123) # suppose qs has 1 or many elements last_login_time = qs[0].last_login_time qs[0].last_login_time = datetime.now() # I expect it can assign the new value, but it won't assertEquals(qs[0].last_login_time, old_login_time) # YES, it doesn't change qs[0].save() #So it won't update the old record
And after figuring this out, the following code will be used instead and it works:
qs = SomeModel.objects.filter(owner_id=123) # suppose qs has 1 or many elements obj = qs[0] last_login_time = obj.last_login_time obj.last_login_time = datetime.now() # I expect it can assign the new value, but it will assertNotEquals(obj.last_login_time, old_login_time) # YES, it does change obj.save() #So it will update the old record as expected
And I have met some of my friends/colleagues use the first approach to do the record updating. And IMO, it's
natural and prone to use. (when you *type qs[0]* and *type obj* , they have the same type)
After reading the code(db.models.query), it can be figured out why.(when you subscript the QuerySet it will use the qs = self._clone() and assigning a value won't change at all)
Possible solutions:
- make the assigning work for the subscripting QuerySet
- announce the above first approach is wrong and let the users know it
The code posted is really not idiomatic - if you are expecting there to be one item, you should be using 'get' as documented. If you expecting more than one item and need to deal with them all you should be iterating over the queryset. Both of these ways will be efficient - we have optimized the API to make idiomatic usage efficient, and we cannot optimize for all possible ways of accessing querysets.
Regarding 1:
If you look at implementing this, you'll see why it is a bad idea. It would mean that if I start to index a queryset, the queryset has to cache the values returned. We then have to decide how to issue the SQL query:
We already have a result cache that kicks in if you start iterating (rather than indexing), but in that context it is sensible because it is sensible to anticipate that if the user is iterating over the results, they will probably want all of them eventually.
Regarding 2:
This behaviour could be guessed from the documented way that slicing/indexing querysets work (i.e. that they are lazy queries and not lists) and the fact that the ORM is not an identity mapper.
We can't enumerate every wrong way to use an API, and adding docs that mention this won't help - if it is an issue of what people expect to work, they won't scour docs to find out if it is going to work or not, they will just try it.
Our documentation does in fact contain all this in enough detail:
http://docs.djangoproject.com/en/1.3/ref/models/querysets/#when-querysets-are-evaluated
http://docs.djangoproject.com/en/1.3/topics/db/queries/#caching-and-querysets