Opened 6 years ago

Last modified 6 years ago

#29244 closed Cleanup/optimization

Passing dict to a Func subclass fails silently on hash step — at Initial Version

Reported by: Rob Jauquet Owned by: nobody
Component: Core (Other) Version: 2.0
Severity: Normal Keywords:
Cc: Triage Stage: Accepted
Has patch: yes Needs documentation: no
Needs tests: no Patch needs improvement: no
Easy pickings: no UI/UX: no

Description

With this code:

from django.contrib.postgres.search import SearchQuery, Value, Func
from django.core.paginator import Paginator
from django.db.models import F, TextField, DateTimeField, Models
from django.utils import timezone

class Entry(Model):
    text = TextField()
    published_on = DateTimeField(default=timezone.now())

class Headline(Func):
    function = 'ts_headline'
    output_field = TextField()

    def __init__(self, text, query, options=None):
        extra = {}
        expressions = [text, query]
        if options:
            opt_str = ''
            for key, value in options.items():
                opt_str += f'{key}={value},'
            expressions.append(Value(opt_str[:-1]))
        super().__init__(*expressions, **extra)

entries = Entry.objects.annotate(
    text_highlighted=Headline(
        F('text'), 
        SearchQuery('house'), 
        options={'HighlightAll': False},
    )
).order_by('-published_on')

paginator = Paginator(entries, 20)

# evaluates the queryset
paginator.page(1)

Passing options by dict to the Func subclass causes the count step for pagination to fail silently in paginator.py line 85 (1) which is in turn caused by a silenced TypeError during the hash in expressions.py line 372 (2).

Then, because of the fallback to len on paginator.py line 90 (3) the entire queryset is evaluated instead of evaluating just the specified page (and in this specific case, calling ts_headline on every row instead of just the first page).

The correct way of passing options using **options instead of options=None fixes this case, but the silent error and fallback to calling len on a queryset causes an unexpected expensive query when using pagination.

(1) https://github.com/django/django/blob/master/django/core/paginator.py#L85
(2) https://github.com/django/django/blob/master/django/db/models/expressions.py#L372
(3) https://github.com/django/django/blob/master/django/core/paginator.py#L90

Change History (0)

Note: See TracTickets for help on using tickets.
Back to Top