#22349 closed Cleanup/optimization (fixed)
RawQuerySet doesn't implement __len__, __bool__
| Reported by: | Craig de Stigter | Owned by: | |
|---|---|---|---|
| Component: | Documentation | Version: | dev |
| Severity: | Normal | Keywords: | |
| Cc: | loic@… | Triage Stage: | Accepted |
| Has patch: | yes | Needs documentation: | no |
| Needs tests: | no | Patch needs improvement: | no |
| Easy pickings: | no | UI/UX: | no |
Description
RawQuerySet should mirror QuerySet behaviour where possible.
However, RawQuerySet.__bool__ always returns True, instead of evaluating the query results and checking the length is nonzero.
RawQuerySet.__len__ doesn't work at all:
* TypeError: object of type 'RawQuerySet' has no len()
This is not intuitive since it's not how QuerySet works. It basically means that you have to cast RawQuerySet to a list before doing much with its results.
Change History (14)
comment:1 by , 12 years ago
| Triage Stage: | Unreviewed → Accepted |
|---|
comment:2 by , 12 years ago
| Owner: | changed from to |
|---|---|
| Status: | new → assigned |
comment:3 by , 12 years ago
| Owner: | removed |
|---|---|
| Status: | assigned → new |
comment:4 by , 12 years ago
Dunno if it's by design, but RawQuerySet and QuerySet operate vastly differently as there is no such thing as an "evaluated" RawQuerySet:
list(queryset)
with assertNumQueries(1)
for i in range(10):
queryset[0]
list(raw_queryset)
with assertNumQueries(10)
for i in range(10):
raw_queryset[0]
This fundamental difference makes having a shared ancestor impractical.
With that in mind, I think adding __len__ would be a nasty footgun, if you only want the number of rows, you can write a COUNT SQL query, if you want the count and the result, then you should definitely store the result in a list to avoid a second query.
Of course we could make RawQuerySet maintain an internal cache akin to QuerySet, but that'd have to be a ticket in its own right.
comment:5 by , 12 years ago
Wow, really? That's crazy. I had no idea. Perhaps the docs should make mention of that? I have been using RawQueryset for UPDATE ... RETURNING * queries and I'll need to check my code for bugs if that's the case.
Either we should add the internal cache, or at least the docs should add some big huge scary warning about that inconsistency. https://docs.djangoproject.com/en/dev/topics/db/sql/#django.db.models.Manager.raw
comment:6 by , 12 years ago
| Cc: | added |
|---|
comment:7 by , 12 years ago
We discussed this with Loic on IRC. Introducing __len__ and friends without caching the results is a bad idea. One possibility considered was adding caching to RawQuerySet so that it works like QuerySet, and also add .iterator() method to RawQuerySet so that when iterating over large queryset one can avoid the performance and memory penalty of caching.
There doesn't seem to be a way to introduce this change using deprecation path or check framework, so we would need to just do the change and warn about the change in release notes. As the current behavior doesn't seem to be a big pain point for users, and some users could be relying on current behavior, I think it is better to leave things as they are. However, we should add some documentation about the evaluation time differences between qs and rawqs, so I'll leave this ticket open for that.
comment:8 by , 12 years ago
| Component: | Database layer (models, ORM) → Documentation |
|---|---|
| Needs documentation: | set |
| Type: | Bug → Cleanup/optimization |
Moving back to documentation based on the provided feedback.
comment:10 by , 11 years ago
| Needs documentation: | unset |
|---|
comment:11 by , 11 years ago
| Owner: | set to |
|---|---|
| Resolution: | → fixed |
| Status: | new → closed |
For the sake of consistency and DRYness it might be worth creating common ancestor class for
QuerySetandRawQuerySet(sayBaseQuerySet) that implements common operations.