Opened 4 years ago
Closed 3 years ago
#32704 closed Bug (fixed)
QuerySet.defer() doesn't clear deferred field when chaining with only().
Description ¶
Considering a simple Company
model with four fields: id
, name
, trade_number
and country
. If we evaluate a queryset containing a .defer()
following a .only()
, the generated sql query selects unexpected fields. For example:
Company.objects.only("name").defer("name")
loads all the fields with the following query:
SELECT "company"."id", "company"."name", "company"."trade_number", "company"."country" FROM "company"
and
Company.objects.only("name").defer("name").defer("country")
also loads all the fields with the same query:
SELECT "company"."id", "company"."name", "company"."trade_number", "company"."country" FROM "company"
In those two cases, i would expect the sql query to be:
SELECT "company"."id" FROM "company"
In the following example, we get the expected behavior:
Company.objects.only("name", "country").defer("name")
only loads "id" and "country" fields with the following query:
SELECT "company"."id", "company"."country" FROM "company"
Change History (10)
comment:1 by , 4 years ago
Cc: | added |
---|
comment:2 by , 4 years ago
Summary: | Unexpected behavior of .defer() and .only() → QuerySet.defer() doesn't clear deferred field when chaining with only(). |
---|---|
Triage Stage: | Unreviewed → Accepted |
follow-up: 4 comment:3 by , 4 years ago
After reading the documentation carefully, i cannot say that it is clearly stated that deferring all the fields used in a previous .only()
call performs a reset of the deferred set.
Moreover, in the .defer()
section, we have:
You can make multiple calls to defer(). Each call adds new fields to the deferred set:
and this seems to suggest that:
- calls to
.defer()
cannot remove items from the deferred set and evaluatingqs.defer("some_field")
should never fetch the column"some_field"
(since this should add"some_field"
to the deferred set) - the querysets
qs.defer("field1").defer("field2")
andqs.defer("field1", "field2")
should be equivalent
IMHO, there is a mismatch between the doc and the actual implementation.
comment:4 by , 4 years ago
- calls to
.defer()
cannot remove items from the deferred set and evaluatingqs.defer("some_field")
should never fetch the column"some_field"
(since this should add"some_field"
to the deferred set)
Feel-free to propose a docs clarification (see also #24048).
- the querysets
qs.defer("field1").defer("field2")
andqs.defer("field1", "field2")
should be equivalent
That's why I accepted Company.objects.only("name").defer("name").defer("country")
as a bug.
follow-up: 6 comment:5 by , 4 years ago
I think that what is described in the documentation is what users are expected and it is the implementation that should be fixed!
With your patch proposal, i do not think that:
Company.objects.only("name").defer("name").defer("country")
is equivalent to Company.objects.only("name").defer("name", "country")
follow-up: 8 comment:6 by , 4 years ago
Replying to Manuel Baclet:
I think that what is described in the documentation is what users are expected and it is the implementation that should be fixed!
I don't agree, and there is no need to shout. As documented: "The only() method is more or less the opposite of defer(). You call it with the fields that should not be deferred ...", so .only('name').defer('name')
should return all fields. You can start a discussion on DevelopersMailingList if you don't agree.
With your patch proposal, i do not think that:
Company.objects.only("name").defer("name").defer("country")
is equivalent toCompany.objects.only("name").defer("name", "country")
Did you check this? with proposed patch country
is the only deferred fields in both cases. As far as I'm aware that's an intended behavior.
comment:7 by , 4 years ago
With the proposed patch, I think that:
Company.objects.only("name").defer("name", "country")
loads all fields whereas:
Company.objects.only("name").defer("name").defer("country")
loads all fields except "country"
.
Why is that?
In the first case:
existing.difference(field_names) == {"name"}.difference(["name", "country"]) == empty_set
and we go into the if
branch clearing all the deferred fields and we are done.
In the second case:
existing.difference(field_names) == {"name"}.difference(["name"]) == empty_set
and we go into the if
branch clearing all the deferred fields. Then we add "country"
to the set of deferred fields.
comment:8 by , 3 years ago
Cc: | added |
---|---|
Has patch: | set |
Owner: | changed from | to
Status: | new → assigned |
Hey all,
Replying to Mariusz Felisiak:
Replying to Manuel Baclet:
With your patch proposal, i do not think that:
Company.objects.only("name").defer("name").defer("country")
is equivalent toCompany.objects.only("name").defer("name", "country")
Did you check this? with proposed patch
country
is the only deferred fields in both cases. As far as I'm aware that's an intended behavior.
I believe Manuel is right. This happens because the set difference in one direction gives you the empty set that will clear out the deferred fields - but it is missing the fact that we might also be adding more defer
fields than we had only
fields in the first place, so that we actually switch from an .only()
to a .defer()
mode.
See the corresponding PR that should fix this behaviour https://github.com/django/django/pull/14667
Replying to Manuel Baclet:
This is an expected behavior,
defer()
removes fields from the list of fields specified by theonly()
method (i.e. list of fields that should not be deferred). In this exampleonly()
addsname
to the list,defer()
removesname
from the list, so you have empty lists and all fields will be loaded. It is also documented:I agree you shouldn't get all field, but only
pk
,name
, andtrade_number
:this is due to the fact that
defer()
doesn't clear the list of deferred field when chaining withonly()
. I attached a proposed patch.