#36342 closed Cleanup/optimization (invalid)
Slicing a QuerySet with a result cache results in a list?
| Reported by: | Willem Van Onsem | Owned by: | |
|---|---|---|---|
| Component: | Database layer (models, ORM) | Version: | 5.2 |
| Severity: | Normal | Keywords: | |
| Cc: | | Triage Stage: | Unreviewed |
| Has patch: | no | Needs documentation: | no |
| Needs tests: | no | Patch needs improvement: | no |
| Easy pickings: | no | UI/UX: | no |
Description
As per the code, slicing a QuerySet that already has a result cache returns the sliced result cache, which is a plain list. This means that for a QuerySet:
    from django.contrib.auth.models import User
    qs = User.objects.all()
    bool(qs)  # enable/disable
    qs[:3].values('id')
will raise an AttributeError because qs[:3] returns a list, whereas if we comment out the bool(qs) call, it works.
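The state dependence can be reproduced without a database. The following is a toy model, not Django's code: `ToyQuerySet`, its `_fetch_all` method, and the `rows` argument are hypothetical names that only mimic the result-cache shortcut described above.

```python
class ToyQuerySet:
    """Hypothetical stand-in for a lazy QuerySet with a result cache."""

    def __init__(self, rows, start=None, stop=None):
        self._rows = list(rows)      # pretend this is the database table
        self._start, self._stop = start, stop
        self._result_cache = None    # filled on first evaluation

    def _fetch_all(self):
        if self._result_cache is None:
            self._result_cache = self._rows[self._start:self._stop]

    def __iter__(self):
        self._fetch_all()
        return iter(self._result_cache)

    def __bool__(self):
        self._fetch_all()
        return bool(self._result_cache)

    def __getitem__(self, k):
        if not isinstance(k, slice):
            raise TypeError("this sketch only supports slices")
        if self._result_cache is not None:
            # Shortcut mirrored from the ticket: evaluated -> plain list.
            return self._result_cache[k]
        # Unevaluated: return another lazy object (nested limits are ignored).
        return ToyQuerySet(self._rows, k.start, k.stop)


lazy = ToyQuerySet(range(10))
lazy_slice = lazy[:3]            # still a ToyQuerySet

evaluated = ToyQuerySet(range(10))
bool(evaluated)                  # populates the result cache
evaluated_slice = evaluated[:3]  # now a plain list

print(type(lazy_slice).__name__, type(evaluated_slice).__name__)
```

The same slice expression yields two different types, and only the earlier `bool()` call decides which one.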
This is probably done for performance reasons: if the QuerySet already holds its results, we can work with those results directly.
But I'm wondering if we can "have our cake and eat it too". We could, for example, create a sliced copy of the queryset and populate its result cache from the original. Something along the lines of:
    def __getitem__(self, k):
        """Retrieve an item or slice from the set of results."""
        if not isinstance(k, (int, slice)):
            raise TypeError(
                "QuerySet indices must be integers or slices, not %s."
                % type(k).__name__
            )
        if (isinstance(k, int) and k < 0) or (
            isinstance(k, slice)
            and (
                (k.start is not None and k.start < 0)
                or (k.stop is not None and k.stop < 0)
            )
        ):
            raise ValueError("Negative indexing is not supported.")
        # remove below
        # if self._result_cache is not None:
        #     return self._result_cache[k]
        if isinstance(k, slice):
            qs = self._chain()
            if k.start is not None:
                start = int(k.start)
            else:
                start = None
            if k.stop is not None:
                stop = int(k.stop)
            else:
                stop = None
            qs.query.set_limits(start, stop)
            if self._result_cache is not None:
                # populate the QuerySet; slice without the step here,
                # so the step is applied only once, in the return below
                qs._result_cache = self._result_cache[start:stop]
            return list(qs)[:: k.step] if k.step else qs
This means that (a) unless we use a step, slicing always returns a QuerySet (it is not always known in advance *if* the QuerySet has a result cache, since that can be the result of complicated code flows); and (b) if the result cache was present, the queryset we generate does not have to fetch the data again, as long as we make no further QuerySet calls on it.
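Points (a) and (b) can be illustrated with a small database-free sketch. It is a toy model under stated assumptions, not Django's implementation: `CachedSliceQuerySet`, `rows`, and the `fetches` counter are hypothetical names, and nested limits are deliberately ignored to keep it short.

```python
class CachedSliceQuerySet:
    """Hypothetical lazy sequence whose slices keep their type."""

    def __init__(self, rows, start=None, stop=None):
        self._rows = list(rows)      # stand-in for the database table
        self._start, self._stop = start, stop
        self._result_cache = None
        self.fetches = 0             # counts simulated database hits

    def _fetch_all(self):
        if self._result_cache is None:
            self.fetches += 1
            self._result_cache = self._rows[self._start:self._stop]

    def __iter__(self):
        self._fetch_all()
        return iter(self._result_cache)

    def __getitem__(self, k):
        if not isinstance(k, slice):
            raise TypeError("this sketch only supports slices")
        # Always build a new lazy object first (nested limits are ignored).
        sliced = CachedSliceQuerySet(self._rows, k.start, k.stop)
        if self._result_cache is not None:
            # Reuse the parent's rows so the slice needs no fetch of its own.
            # The step is left out here and applied once, below.
            sliced._result_cache = self._result_cache[k.start:k.stop]
        # As in the proposal, a step still forces a list.
        return list(sliced)[::k.step] if k.step else sliced


qs = CachedSliceQuerySet(range(10))
list(qs)                 # evaluate: one simulated fetch on qs
head = qs[:3]            # same type as qs, cache already filled
head_rows = list(head)   # served from the cache
stepped = qs[:6:2]       # step given, so a list comes back
```

In this sketch `head` is a `CachedSliceQuerySet` whether or not `qs` was evaluated first, and iterating it triggers no fetch of its own when the parent's cache was available.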
Change History (2)
comment:1 by , 7 months ago
| Resolution: | → invalid |
|---|---|
| Status: | new → closed |
comment:2 by , 7 months ago
I think it is indeed not a bug but more of a feature request: the current behavior is unpredictable, since you don't necessarily know whether the queryset has been evaluated. The question is whether, when slicing an evaluated queryset, we could return a queryset with cached results instead. Then the outcome would not differ depending on the *state* of the QuerySet.
Slicing with a step is more predictable, since it *always* returns a list, whereas the result of slicing without a step depends on the state of the QuerySet, which is harder to determine.
Thank you Willem Van Onsem for taking the time to create this report. I created a test for this:
tests/queries/tests.py
If self.assertIs(bool(qs), True) is commented out, the test passes. If it's not, the test fails with:
    ======================================================================
    ERROR: test_slice_after_result_cached (queries.tests.WeirdQuerysetSlicingTests.test_slice_after_result_cached)
    ----------------------------------------------------------------------
    Traceback (most recent call last):
      File "/home/nessita/fellowship/django/tests/queries/tests.py", line 2979, in test_slice_after_result_cached
        self.assertQuerySetEqual(qs[:3].values("id"), Article.objects.filter(id__lt=4).values("id"))
                                 ^^^^^^^^^^^^^
    AttributeError: 'list' object has no attribute 'values'
I couldn't find a dupe, and I was inclined to accept this until I found the explicit docs about this. From https://docs.djangoproject.com/en/5.2/ref/models/querysets/:
Slicing a QuerySet that has been evaluated also returns a list.
Closing as invalid given the above.