Opened 13 hours ago
Last modified 11 hours ago
#36683 assigned Bug
Update with Distinct Produces Unintentional Results
Reported by: | Matt Shirley | Owned by: | Matt Shirley |
---|---|---|---|
Component: | Database layer (models, ORM) | Version: | 5.2 |
Severity: | Normal | Keywords: | orm, distinct, update |
Cc: | Triage Stage: | Accepted | |
Has patch: | yes | Needs documentation: | no |
Needs tests: | no | Patch needs improvement: | no |
Easy pickings: | no | UI/UX: | no |
Description
Similar to #32433.
The ORM permits calling update() on a QuerySet after distinct() has been applied. This produces unexpected results: more rows are updated than the developer intended. Similar to the example provided in #32433:
Comment.objects.order_by('post_id', 'created_at').distinct('post_id').update(deleted=True)
The developer may assume that this will mark only one comment per post_id as deleted. However, the distinct() is ignored (UPDATE has no DISTINCT clause), so the entire table is updated:
UPDATE "post_comment" SET "deleted" = true
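To make the difference concrete, here is a minimal sketch using Python's built-in sqlite3 module rather than Django. The table layout, the sample rows, and the MIN(created_at) subquery (standing in for PostgreSQL's DISTINCT ON) are assumptions for illustration only; the point is just to compare the row count of the intended update with that of an UPDATE lacking a WHERE clause.

```python
import sqlite3

# Hypothetical schema and data, loosely mirroring the "post_comment" table above.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE post_comment ("
    "id INTEGER PRIMARY KEY, post_id INTEGER, "
    "created_at INTEGER, deleted INTEGER DEFAULT 0)"
)
conn.executemany(
    "INSERT INTO post_comment (id, post_id, created_at) VALUES (?, ?, ?)",
    [(1, 1, 10), (2, 1, 20), (3, 2, 10), (4, 2, 30)],
)

# What the developer probably meant: flag only the earliest comment per post.
intended_count = conn.execute(
    "UPDATE post_comment SET deleted = 1 WHERE id IN ("
    "  SELECT c.id FROM post_comment AS c"
    "  WHERE c.created_at = ("
    "    SELECT MIN(created_at) FROM post_comment WHERE post_id = c.post_id"
    "  )"
    ")"
).rowcount

# Reset the flag before running the second statement.
conn.execute("UPDATE post_comment SET deleted = 0")

# What is effectively executed once the distinct() is dropped: no WHERE clause.
actual_count = conn.execute("UPDATE post_comment SET deleted = 1").rowcount

print(intended_count, actual_count)
```

With two posts and four comments, the first statement touches one row per post, while the second touches every row in the table.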
Since delete() already guards against this case, I believe update() should behave consistently to protect the developer from mistakes, even though applying a distinct() before an update() is generally unusual.
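A rough sketch of the kind of guard being proposed, written as plain Python so it runs without Django. SketchQuerySet, its attributes, and the error message are all hypothetical and are not taken from the patch or from Django's source; the sketch only shows the idea of refusing update() once distinct fields have been set.

```python
class SketchQuerySet:
    """Hypothetical stand-in for a QuerySet; not Django's implementation."""

    def __init__(self, rows, distinct_fields=()):
        self.rows = rows  # list of dicts standing in for table rows
        self.distinct_fields = tuple(distinct_fields)

    def distinct(self, *fields):
        # Return a new object that remembers the distinct fields.
        return SketchQuerySet(self.rows, fields)

    def update(self, **values):
        # Proposed behavior: fail loudly instead of silently ignoring distinct().
        # The message text is an assumption, not the wording used in the patch.
        if self.distinct_fields:
            raise TypeError("Cannot call update() after .distinct(*fields).")
        for row in self.rows:
            row.update(values)
        return len(self.rows)


comments = [
    {"post_id": 1, "deleted": False},
    {"post_id": 1, "deleted": False},
]
qs = SketchQuerySet(comments)

try:
    qs.distinct("post_id").update(deleted=True)
    guard_raised = False
except TypeError:
    guard_raised = True
```

Raising before any row is touched means a mistaken call leaves the data unchanged, which is the main benefit over silently updating the whole table.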
Change History (3)
comment:1 by , 12 hours ago
Has patch: | set |
---|
comment:2 by , 12 hours ago
Triage Stage: | Unreviewed → Accepted |
---|
comment:3 by , 11 hours ago
Owner: | set to |
---|---|
Status: | new → assigned |
https://github.com/django/django/pull/19997 (PR)