Opened 18 years ago

Closed 10 years ago

Last modified 2 years ago

#2594 closed Bug (wontfix)

Template system should handle whitespace better

Reported by: Gary Wilson <gary.wilson@…> Owned by: jshedd
Component: Template system Version: dev
Severity: Normal Keywords:
Cc: mccutchen@…, gary.wilson@…, cgrady@…, Tai Lee, kmike84@…, andrep, Stephane "Twidi" Angel, Florian.Sening@…, Waldir Pimenta, ldiqual@…, python@…, ua_django_bugzilla@…, s.shanabrook@…, chipx86@…, tomasbabej@…, trbs@…, adehnert Triage Stage: Accepted
Has patch: yes Needs documentation: no
Needs tests: no Patch needs improvement: yes
Easy pickings: no UI/UX: no

Description

So, I think that the template system should not leave blank lines
behind if the tag(s) on that line evaulated to the empty string. To
make sure we are all on the same page, here is an example:

This template code:

<h1>My list</h1>

<ul>
{% for item in items %}
    <li>{{ item }}</li>
{% endfor %}
</ul>

Would normally evaluate to:

<h1>My list</h1>

<ul>

    <li>item 1</li>

    <li>item 2</li>

    <li>item 3</li>

</ul>

And with the {% spaceless %} tag evaluates to:

<h1>My list</h1> <ul> <li>item 1</li> <li>item 2</li> <li>item 3</li>
</ul>

And with the StripWhitespaceMiddleware evaluates to

<h1>My list</h1>
<ul>
    <li>item 1</li>
    <li>item 2</li>
    <li>item 3</li>
</ul>

But what I think should evaluate to:

<h1>My list</h1>

<ul>
    <li>item 1</li>
    <li>item 2</li>
    <li>item 3</li>
</ul>

In other words, leave lines that I add, but remove the empty lines that
had template tags on them.

Attachments (13)

template_whitespace-2006-08-22.diff (5.3 KB ) - added by Gary Wilson <gary.wilson@…> 18 years ago.
template_whitespace.patch (11.9 KB ) - added by Chris Beaven 18 years ago.
Alternate version
template_whitespace_2.patch (10.8 KB ) - added by Chris Beaven 18 years ago.
Optimized version of my alternate patch
template_whitespace_2.2.patch (10.8 KB ) - added by Chris Beaven 18 years ago.
oops, this one i meant :P
template_whitespace_ticket_2594.diff (3.3 KB ) - added by jshedd 15 years ago.
Causes template tag blocks to collapse the whitespace around them when they are on their own line.
template_whitespace_ticket_2594_tests.diff (2.1 KB ) - added by jshedd 15 years ago.
Tests for whitespace collapsing.
template_whitespace_ticket_2594_with_tests.diff (5.5 KB ) - added by jshedd 15 years ago.
Everything in one diff.
whitespace_benchmarks.zip (2.4 KB ) - added by jshedd 15 years ago.
Benchmarks from unladen-swallow for the template system.
2594.diff (14.2 KB ) - added by Rob Hudson 14 years ago.
Patched against trunk (and template shuffling), removed trailing white space, and adjusted to add comments and make pass with steveire's tests.
whitespace.patch (24.8 KB ) - added by Stephen Kelly 14 years ago.
0001-Commiting-whitespace-patch-from-django-ticket-2594.patch (53.0 KB ) - added by fahhem 14 years ago.
Modifies DebugLexer too, not just Lexer
0001-Commiting-whitespace-patch-from-django-ticket-2594.2.patch (53.0 KB ) - added by fahhem 14 years ago.
Same as previous, with fixed whitespace issue
ticket-2594-template-strip-leading-whitespace-2012-06-28.diff (26.8 KB ) - added by Leon Matthews 12 years ago.
Update patch against 1.5-dev

Download all attachments as: .zip

Change History (74)

by Gary Wilson <gary.wilson@…>, 18 years ago

comment:2 by mccutchen@…, 18 years ago

Cc: mccutchen@… added
Summary: template system should handle whitespace better[patch] template system should handle whitespace better

(just indicating that this ticket has a patch)

comment:3 by anonymous, 18 years ago

Cc: gary.wilson@… added

comment:4 by Gary Wilson <gary.wilson@…>, 18 years ago

Although, this patch does not work as desired in all cases. See the google group link above.

comment:5 by anonymous, 18 years ago

Summary: [patch] template system should handle whitespace betterTemplate system should handle whitespace better

The speed issue is the killer here. We can't have a 100% performance hit on template rendering times -- speed in that area is one of Django's selling points. I'm going to remove the "patch" keyword for now, Gary, since I don't think we can take this patch. Leaving the ticket open, though, since the idea is a good one and if we can fix it, we should.

I'll try to work up something a bit faster; I had something that was almost right a while back, how hard can it be? :-(

comment:6 by Gary Wilson <gary.wilson@…>, 18 years ago

Here are some performance measures I got when I tested a few weeks ago:

Using the command:
$ python timeit.py -n 50 -s "from test.views import my_render" "my_render()"

without patch (r3735):
50 loops, best of 3: 6.34 msec per loop
50 loops, best of 3: 6.23 msec per loop
50 loops, best of 3: 6.24 msec per loop

with patch:
50 loops, best of 3: 8.67 msec per loop
50 loops, best of 3: 8.46 msec per loop
50 loops, best of 3: 8.46 msec per loop

So about 33-39% slower. And it doesn't work in all cases. :(

The test code looked like:

from django.template import Context, Template

def my_render():
    s = """\
<html>
<body>
    <div id="content">
    {% if items %}
    <ul>
        <li>Single item:</li>
        <li>
        {{ item.0.name }}
        </li>
    {% for item in items %}
    
        {% if item.name %}
        <li>{{ item.name }}</li>
        {% endif %}
        {% if not item.name %}
        <li>No name item</li>
        {% endif %}
        {% for item in fooitems %}
        <li>{{ item.name }}</li>
        {% endfor %}
        {% if item.des %}
        <li>{{ item.des }}</li>
        {% endif %}
    {% endfor %}
    {% if items %}
        <li>Spaceless items to follow:</li>
        {% spaceless %}
        {% for item in items %}
        
        <li>{{ item.name }}</li>
        {% endfor %}
        {% endspaceless %}
    {% endif %}
    </ul>
    {% endif %}
    {% if items %}
    <ul>
    {% for item in items %}
        {{ item.name }}
    {% endfor %}
    {% for foo in foos %}
    
        {{ foo }}
    {% endfor %}
    {% for item in items %}
        <li>huh{{ foo }}</li>
    {% endfor %}
    </ul>
    {% endif %}
    {% if items %}<p>We have items.</p>{% endif %}
    {% if items %}
    <p>We have items.</p>
    {% endif %}
    {% if foo %}<p>We have foo.</p>{% endif %}
    {% if foo %}
    <p>We have foo.</p>
    {% endif %}
    </div>
</body>
</html>
"""
    class Item:
        pass
    items = []
    for x in range(4):
        item = Item()
        item.name = 'item' + str(x)
        items.append(item)
    t = Template(s)
    c = Context({'items': items})
    return t.render(c)

by Chris Beaven, 18 years ago

Attachment: template_whitespace.patch added

Alternate version

comment:7 by Chris Beaven, 18 years ago

I've had a long ongoing discussion with Gary and came up with a completely different version to solve the same problem (actually, a couple of different versions, but this one is the best).

We did some profiling of this one too, and it turned out to be about 50% slower. If someone wants to have a look at optimization, I'd be most appreciative.

comment:8 by Chris Beaven, 18 years ago

I'm happy to talk about my patch with anyone interested - smileychris at gmail.

How it works:

  1. When a tag is found on its own line, the lexer adds a token before and after containing the relevant whitespace to hold on to.
  2. The parser turns these into nodes.
  3. The nodelist works out at render time whether or not the whitespace nodes should be used.
    • If an EndNode is reached before a StartNode is found, the EndNode is discarded.
    • If an EndNode is reached and the previous rendered block ended in a new line, both nodes are discarded.
    • Otherwise, the whitespace nodes are rendered.

comment:9 by Malcolm Tredinnick, 18 years ago

Owner: changed from Adrian Holovaty to Malcolm Tredinnick

by Chris Beaven, 18 years ago

Attachment: template_whitespace_2.patch added

Optimized version of my alternate patch

comment:10 by Chris Beaven, 18 years ago

This one doesn't cause anywhere near as huge of a performance hit as the last one. And it's patched versus some recent changes.

by Chris Beaven, 18 years ago

oops, this one i meant :P

comment:11 by Simon G. <dev@…>, 18 years ago

Has patch: set
Patch needs improvement: set
Triage Stage: UnreviewedDesign decision needed

comment:12 by Collin Grady <cgrady@…>, 17 years ago

Cc: cgrady@… added

+1 to cleaner template output :)

comment:13 by Jacob, 17 years ago

Triage Stage: Design decision neededSomeday/Maybe

I like the idea, but there's no way we can make the template engine 50% slower for this.

comment:14 by korpios, 17 years ago

Cc: korpios@… added

comment:15 by korpios, 16 years ago

Cc: korpios@… removed

comment:16 by jshedd, 15 years ago

Owner: changed from nobody to jshedd

by jshedd, 15 years ago

Causes template tag blocks to collapse the whitespace around them when they are on their own line.

by jshedd, 15 years ago

Tests for whitespace collapsing.

by jshedd, 15 years ago

Everything in one diff.

by jshedd, 15 years ago

Attachment: whitespace_benchmarks.zip added

Benchmarks from unladen-swallow for the template system.

comment:17 by jshedd, 15 years ago

Patch needs improvement: unset

For the patch, I ran a copy of unladen-swallow's django template benchmark, attached as a zip to the ticket.

RunDjango 1.1.1Django TrunkDjango Trunk with patch
11.05046296121.208662986761.20275402069
21.070580959321.209710121151.20344805717
31.14760208131.20852494241.20287799835
41.065352916721.210731983181.20276403427
51.054219961171.211869001391.20313000679

Best I can tell, my patch is actually slightly faster, so yay!

comment:18 by jshedd, 15 years ago

As requested by Alex_Gaynor, running Gary Wilson's benchmark:

python -m timeit -n 50 -s "from views import my_render" "my_render()"

Django 1.1.1:
50 loops, best of 3: 3.64 msec per loop
50 loops, best of 3: 3.66 msec per loop
50 loops, best of 3: 3.65 msec per loop

Trunk, without patch:
50 loops, best of 3: 4.12 msec per loop
50 loops, best of 3: 4.11 msec per loop
50 loops, best of 3: 4.14 msec per loop

Trunk, with patch:
50 loops, best of 3: 4.13 msec per loop
50 loops, best of 3: 4.11 msec per loop
50 loops, best of 3: 4.13 msec per loop

Script looked like:

from django.template import Context, Template
from django.conf import settings
settings.configure()

def my_render():
    s = """\
<html>
<body>
    <div id="content">
    {% if items %}
    <ul>
        <li>Single item:</li>
        <li>
        {{ item.0.name }}
        </li>
    {% for item in items %}
    
        {% if item.name %}
        <li>{{ item.name }}</li>
        {% endif %}
        {% if not item.name %}
        <li>No name item</li>
        {% endif %}
        {% for item in fooitems %}
        <li>{{ item.name }}</li>
        {% endfor %}
        {% if item.des %}
        <li>{{ item.des }}</li>
        {% endif %}
    {% endfor %}
    {% if items %}
        <li>Spaceless items to follow:</li>
        {% spaceless %}
        {% for item in items %}
        
        <li>{{ item.name }}</li>
        {% endfor %}
        {% endspaceless %}
    {% endif %}
    </ul>
    {% endif %}
    {% if items %}
    <ul>
    {% for item in items %}
        {{ item.name }}
    {% endfor %}
    {% for foo in foos %}
    
        {{ foo }}
    {% endfor %}
    {% for item in items %}
        <li>huh{{ foo }}</li>
    {% endfor %}
    </ul>
    {% endif %}
    {% if items %}<p>We have items.</p>{% endif %}
    {% if items %}
    <p>We have items.</p>
    {% endif %}
    {% if foo %}<p>We have foo.</p>{% endif %}
    {% if foo %}
    <p>We have foo.</p>
    {% endif %}
    </div>
</body>
</html>
"""
    class Item:
        pass
    items = []
    for x in range(4):
        item = Item()
        item.name = 'item' + str(x)
        items.append(item)
    t = Template(s)
    c = Context({'items': items})
    return t.render(c)

comment:19 by jshedd, 15 years ago

Triage Stage: Someday/MaybeUnreviewed

comment:20 by Russell Keith-Magee, 15 years ago

Triage Stage: UnreviewedAccepted

The idea has already been notionally accepted by Jacob - patch now needs review and triage.

comment:21 by Russell Keith-Magee, 14 years ago

Needs documentation: set

On first inspection it looks reasonably good. The performance stats are promising, too.

I can see a couple of minor areas for improvement:

  • Trailing whitespace - a lot of the diff is actually just your editor adding spaces to the end of lines.
  • Edge cases in tests - futzing with regular expressions makes me nervous, so I'd like to see a lot more tests to verify that nothing weird is going on. In particular, the test cases are fairly minimal - most of them start with the newline that is going to be stripped. I'd be interested in seeing more verification that if there is text/newlines *before* the newline that is to be stripped, that text won't be affected adversely.
  • Documentation. This behaviour needs to be documented -- nothing major; a quick note in the description of the template language should do.

The other thing holding up this ticket is the start of feature development for 1.3. We can't apply this until be branch for 1.2 support.

comment:22 by Russell Keith-Magee, 14 years ago

Patch needs improvement: set

comment:23 by Joe, 14 years ago

Could you just do something like this?

t = Template(s)
c = Context({'items': items})
result = t.render(c)
return os.linesep.join([s for s in result.splitlines() if s.strip()])

comment:24 by Stephen Kelly, 14 years ago

I wrote some more tests for this and it turned out that the presence of a tag would strip out all whitespace occurring before it, not just one line. I don't know if that was intentional, but I thought it surprising.

Another thing I noticed is that trailing whitespace is not stripped. I don't know how relevant that is, but I mimiced that behavior in this patch.

BEFORE PATCH:
kde-devel@bishop:~/Random/djtbenchmarks$ python -m timeit -n 50 -s "from bm import my_render" "my_render()"
50 loops, best of 3: 3.29 msec per loop

AFTER PATCH:
kde-devel@bishop:~/Random/djtbenchmarks$ python -m timeit -n 50 -s "from bm import my_render" "my_render()"
50 loops, best of 3: 3.51 msec per loop

comment:25 by Tai Lee, 14 years ago

Cc: Tai Lee added

comment:26 by Rob Hudson, 14 years ago

The patch attached brings in jshedd's work and steveire's tests. I updated to trunk, removed trailing white space, and tweaked regex slightly.

Pre regex tweaking had failures on templatetag-whitespace numbers 14-17, 21-22, 25-30, 32, 34.
Tweaked regex fails now on 26-30, 32, 34.

I plan on breaking out the regular expression with comments to better understand it, and try to fix the remaining failures.

by Rob Hudson, 14 years ago

Attachment: 2594.diff added

Patched against trunk (and template shuffling), removed trailing white space, and adjusted to add comments and make pass with steveire's tests.

comment:27 by Rob Hudson, 14 years ago

My local testing shows:

python -m timeit -n 50 -s "from views import my_render" "my_render()"

Django 1.2.3
50 loops, best of 3: 3.75 msec per loop
50 loops, best of 3: 3.74 msec per loop
50 loops, best of 3: 3.76 msec per loop

Django trunk without patch
50 loops, best of 3: 3.72 msec per loop
50 loops, best of 3: 3.73 msec per loop
50 loops, best of 3: 3.73 msec per loop

Django trunk with patch
50 loops, best of 3: 3.73 msec per loop
50 loops, best of 3: 3.73 msec per loop
50 loops, best of 3: 3.72 msec per loop

comment:28 by Malcolm Tredinnick, 14 years ago

I seriously doubt anybody has used vertical tabs (\v) or form feeds (\f) in Django templates ever. There's no need to put them in the reg-exp.

Also, realise that this is backwards incompatible and not in a trivial way. Anybody using templates to generate output in a format where whitespace is always significant (plain text email is one very big case) would have their output broken by this (unclear if Jacob and Russell both thought this was an acceptable sacrifice -- I definitely don't -- or if it was overlooked). It has to be optional and off by default in the rendering process. (The other case where I can immediately think of this breaking something is Django's debug page's "paste traceback" button -- I remember Erik Karulf spending ages to make sure the whitespace in what we POST'ed to dpaste was sensible.

comment:29 by Carl Meyer, 14 years ago

I agree with Malcolm that outright changing this is a backwards-compatibility issue. Unfortunately.

On the other hand, I think the new behavior is clearly superior in pretty much every case, and a major win, so I think it's worth finding a way to do it via an optional flag (an argument to Template.init? Would then need to be added to several shortcuts as well...), and then consider (later?) possible deprecation paths to migrate towards making it the default.

comment:30 by Tai Lee, 14 years ago

I'd like to see this enabled with a template tag, as it is template authors who will be needing to use it most. I'd also like to see a deprecation path so that this can become the default behaviour.

Similar to when autoescaping was introduced, we could add a template tag {% stripwhitespace on|off %}{% endstripwhitespace %} that people can put in their base templates to preserve the existing behaviour when the default changes or enable the new behaviour before it becomes the default. We could raise a deprecation warning whenever a template is rendered without this tag.

comment:31 by Stephen Kelly, 14 years ago

I'm not very familiar with the behavior of the blocktrans tag, but I guess there would need to be some handling of the whitespace at the end of a msgid being present in the po file, but not in the compiled template. I don't know if gettext is that strict. I guess the message extractor script should handle template syntax on its own line in the same way that the latest patch here does.

If the django message extractor script already uses the same classes, then there may be no problem, but it's something to look into.

This is related to #5849.

by Stephen Kelly, 14 years ago

Attachment: whitespace.patch added

comment:32 by Stephen Kelly, 14 years ago

Needs documentation: unset
Patch needs improvement: unset

by fahhem, 14 years ago

Modifies DebugLexer too, not just Lexer

by fahhem, 14 years ago

Same as previous, with fixed whitespace issue

comment:33 by Łukasz Rekucki, 14 years ago

Type: defectBug

comment:34 by Łukasz Rekucki, 14 years ago

Severity: normalNormal

comment:35 by patchhammer, 14 years ago

Easy pickings: unset
Patch needs improvement: set

0001-Commiting-whitespace-patch-from-django-ticket-2594.2.patch fails to apply cleanly on to trunk

comment:36 by Aymeric Augustin, 13 years ago

UI/UX: unset

Change UI/UX from NULL to False.

comment:37 by tobia, 13 years ago

I just started a thread about an alternative solution to this problem.

Newline stripping in templates: the dnl way

http://groups.google.com/group/django-developers/browse_thread/thread/8d6b6a3aae82ee03

comment:38 by Mikhail Korobov, 13 years ago

Cc: kmike84@… added

comment:39 by andrep, 13 years ago

Cc: andrep added

comment:40 by Stephane "Twidi" Angel, 13 years ago

Cc: Stephane "Twidi" Angel added

comment:41 by anonymous, 13 years ago

Please someone from django core sacrifice a day, and test and ad this to 1.4.x. Thanks a lot!

comment:42 by Florian Sening, 13 years ago

Cc: Florian.Sening@… added

comment:43 by Waldir Pimenta, 13 years ago

Cc: Waldir Pimenta added

comment:44 by ldiqual@…, 12 years ago

Cc: ldiqual@… added


comment:45 by Leon Matthews, 12 years ago

Cc: python@… added

by Leon Matthews, 12 years ago

Update patch against 1.5-dev

comment:46 by Leon Matthews, 12 years ago

I've spent the morning updating the patch against the latest development version.

It was fairly straightforward, except for the Lexer.create_token() method, which took some time. It was already fairly hairy in 1.4, and the addition of the verbatim tag in 1.5-dev had made it even more so. I've added comments to make it easier for the next person to unravel the logic.

All tests pass, pre- and post-patch, and the results of Gary Wilson's benchmark remain good.

python -m timeit -n500 -s 'from bm import my_render' 'my_render()'

Without patch:
500 loops, best of 3: 1.96 msec per loop
500 loops, best of 3: 1.95 msec per loop
500 loops, best of 3: 1.96 msec per loop

With patch, TEMPLATE_STRIP_LEADING_WHITESPACE = True:
500 loops, best of 3: 1.96 msec per loop
500 loops, best of 3: 1.96 msec per loop
500 loops, best of 3: 1.97 msec per loop

With patch, TEMPLATE_STRIP_LEADING_WHITESPACE = False:
500 loops, best of 3: 1.99 msec per loop
500 loops, best of 3: 2 msec per loop
500 loops, best of 3: 2.01 msec per loop

comment:47 by Matthias Dahl, 12 years ago

Cc: ua_django_bugzilla@… added

comment:48 by s.shanabrook@…, 12 years ago

Cc: s.shanabrook@… added

What needs to be done to get this merged still?

comment:49 by Tai Lee, 12 years ago

You could try reading through the ticket and django-developers group history so that you're familiar with all the options and opinions that have been presented. Then outline a proposal on the django-developers group to move the ticket forward by choosing a preferred option (probably the one which had the most support), summarise why it has been chosen over any others, update any patches so that they apply cleanly and ask for a code review and any final objections that would delay this being committed.

comment:50 by anonymous, 12 years ago

Why not implement the {%- tag -%} support in Jinja2, and let people choose themselves?

comment:51 by Christian Hammond, 11 years ago

Cc: chipx86@… added

comment:52 by tomasbabej@…, 11 years ago

Cc: tomasbabej@… added

comment:53 by FunkyBob, 11 years ago

There's no way I'd want to have _only_ a global flag to enable this behaviour. Certainly, there are cases where I'd rather avoid leading spaces and blank lines, but equally there are cases where it's important.

Not all the world is HTML. Django templates are frequently used for content types where the leading whitespace may be important, in the same projects where it's used for HTML, where it's less important.

Typically to avoid these problems I've started using constructs like:

<h1>The title</h1>
<ul>{% for item in somelist %}
    <li>{{ item }}</li>{% endfor %}
</ul>

Perhaps not as obvious with the template tags, but clearer with the HTML.

comment:54 by Florian Sening, 11 years ago

Just wanted to note that this is exactly the problem I'm facing. Not everything is HTML and sometimes whitespace matters. The templates for these cases are very hard to read. :/

comment:55 by trbs, 11 years ago

Cc: trbs@… added

comment:56 by Aymeric Augustin, 10 years ago

Resolution: wontfix
Status: newclosed

As far as I know, the consensus among the core team is that:

  • the Django template language is good enough for generating HTML, which isn't sensitive to whitespace;
  • the long term plan is to replace it with a better engine (most likely Jinja), not to keep working on it;
  • if you have more specialized needs (want to generate RTF?) just use another template engine, there are several to choose from.

Hence, closing wontfix. Thanks everyone who tried contributing to this ticket, and sorry it didn't happen.

comment:57 by Tai Lee, 10 years ago

Is there an open ticket for tracking the progress of integrating Django's newly blessed template engine (Jinja)? Or any indication on how "long term" that plan is? Was there a google groups discussion about this long term plan?

I did a quick search and couldn't find either. That being the case, it seems that if no new improvements will be made to Django's template engine because it might be replaced in the next 5-10 years (seems like a big job), that's simply leaving everyone who finds the current engine lacking in any way, stuck in limbo indefinitely.

Is the existing template engine in permanent feature freeze now? Only bug fixes allowed?

comment:58 by Aymeric Augustin, 10 years ago

Let me restate this with more plainly: we're not interested in this feature and we don't provide an alternative.

comment:59 by anonymous, 10 years ago

I think that what @aaugustin is trying to say is:
Google.com/search?q=how+do+i+use+jinja2+with+django
There, another bug fixed; right?

comment:60 by Andrew Farrell, 10 years ago

To achieve this from jinja2, pass both trim_blocks=True and lstrip_blocks=True when you instantiate either your Environment or Template.

see: http://jinja.pocoo.org/docs/dev/templates/#whitespace-control

comment:61 by adehnert, 2 years ago

Cc: adehnert added
Note: See TracTickets for help on using tickets.
Back to Top