Version 63 (modified by jbronn, 19 years ago)

distance function documentation update.

The GIS branch intends to be a world-class geographic web framework. Our goal is to make it as easy as possible to build GIS web applications and harness the power of spatially enabled data.

Background

Note: The content herein is a loosely structured collection of notes and links that we have found useful, not necessarily what will be supported in the future.

What's GIS?

  • Series of blog posts giving intro to GIS; choice quote from an early post: "If you feel like ending a conversation with a developer then simply bring up the topic of character encodings ... [o]r ... coordinate systems. ... So in the spirit of Tim Bray's and Joel Spolsky's wonderful writeups of character encodings, I thought I'd put together a basic survival guide to coordinate systems over my next few posts and then tie it back to Google Maps."
  • More on map projections, including why people can't agree on just one (utf-8).
  • Geodesy: the field of science for this stuff.

Useful Code

  • PostGIS, the OpenGIS SQL Types (pdf) implementation for PostgreSQL
  • GEOS, low-level C++ port of Java Topology Suite, used by PostGIS
  • GeoTypes is a type (and conversion) library for PostGIS via psycopg.
  • Geopy
    • Calculates distances using (very accurate) Vincenty, and uses the WGS 84 datum by default.
    • Has utility functions for unit of measurement (UOM) conversions (e.g. meters -> kilometers, kilometers -> miles, etc.)
    • Excellent GeoCoding capabilities. Has interfaces for Google, Yahoo, Microsoft Live, MediaWiki, and Geocoder.us.
  • GDAL/OGR, a library for fiddling with raster geo images.
    • Has a Python interface. A SWIG interface is in development, but not yet stable (no access to full API).
    • shapelib and ogr2ogr are useful for ESRI shapefile manipulations. ESRI shapefiles are a lingua franca GIS format.
  • Geo::Coder::US An excellent Perl library for GeoCoding that powers Geocoder.us. Users can create their own Geographic databases using the Census Bureau's TIGER/Line data (see below).
  • GeoRosetta, CC-BY-SA licensed, quality-controlled, collection of geocoding data. Not yet released to public(?).
  • MapServer: University of Minnesota (UMN) "open source development environment for building spatially-enabled internet applications."
  • MapNik: C++ and Python toolkit for developing mapping applications. Claimed benefits over MapServer: "It uses the AGG library and offers world class anti-aliasing rendering with subpixel accuracy for geographic data. It is written from scratch in modern C++ and doesn't suffer from design decisions made a decade ago." See MapNik FAQ.
  • Ruby on Rails
    • IvyGIS: Google-maps type displays with RoR and UMN's MapServer
    • Spatial Adapter for Rails: A plugin for Rails which manages the MySQL Spatial and PostGIS geometric columns in a transparent way (that is, like the other base data type columns). This might have some useful techniques for when we try to support spatial extensions other than PostGIS.
    • Cartographer GMaps plugin

Useful Data

  • TIGER/Line: "The TIGER/Line files are extracts of selected geographic and cartographic information from the Census Bureau's TIGER® (Topologically Integrated Geographic Encoding and Referencing) database." This data is useful in creating your own geocoding database service. Currently 2006 Second Edition is the latest. Note: The Census Bureau will be providing SHP files in Fall, 2007.

FAQ

  • Place your questions here.
  • Q: When dealing with points (say, in degrees), do they need to be converted to be useful with the back-end data, assuming that data is also in degrees? Is it enough to have the same datum and origin? (Reading the intro above is likely to answer the question.)
    • My (JDunck) reading indicates yes. Given the same coordinate system (i.e. datum, origin, and axes), degrees are useful without conversion.
  • Q: Can this implementation work with the MySQL spatial extensions? If not, is it planned?
    • No. It is (now) planned, see Phase 3 below. From the last time I (jbronn) checked, MySQL's spatial capabilities have improved. However, we're going to focus our efforts on PostGIS until things are worked out a bit more. As a spatial database, PostGIS is more standards compliant (OpenGIS consortium), more widely used, and has more features (e.g. coordinate transformation, geometry_columns and spatial_ref_sys tables). MySQL support is definitely something I would want to implement in the future since I do like MySQL.

Implementation

Phase 1

  • Create Geometry-enabled fields and manager. Status: complete as of r4788.
  • Allow for Geometry-enabled queries. Status: complete as of r4788.

Phase 2

  • Pending
    • Add geometry-enabled routines to the fields that call directly on GEOS routines -- like area(), centroid(), etc. (partially complete as of r4884. See Extra Instance Methods section below)
    • Add as much from the PostGIS API as possible.
    • Support for a mapping framework (e.g. Google Maps/Earth, Yahoo Maps, MS Live, etc.)
      • Admin fields and forms (WKT field currently as of r4884, but we want widgets to view and manipulate geographic objects).
    • Utilities for importing vector data (SHP files first) directly into Django models.
  • Complete
    • PostGIS indexing capability

Phase 3

  • Support MySQL databases.
  • Geocoding framework.

Design Issues

  • Client JS/Flash framework, i.e., do we want to support OpenLayers, the Google Maps API, the Yahoo API?
    • So far, Google Maps looks the most promising for being supported first (people are familiar with it, and it's more stable than OpenLayers).
    • Yahoo! has a really slick flash interface, I'd like to support this eventually.
  • Mapping Framework (generating custom tiles, layers, labels, etc.)
    • Mapnik is modern, but very early on in development and completely lacks documentation. However, the code is elegant and clean, and it was designed for integration with Python -- we're leaning towards this right now.
    • MapServer has been around for a while and has strong backing in the community (e.g. native support in QGIS). Even with documentation, the code looks less inviting than Mapnik (all in C); it also has archaic text-based configuration files (pre-dating markup languages).
  • GEOS
    • GEOS is no longer maintained by Sean Gillies. See Sean Gillies, Geometries for Python (blog post explaining rationale for abandoning GEOS support); see also Sean's message on the GEOS-Devel Mailing List (Mar. 5, 2007).
    • Might consider either using PCL or implementing a ctypes wrapper for the routines that we need -- can't really port PCL code here because it is GPL (Django is licensed under BSD).
  • WMS Server
    • I'm not satisfied with any of the current WMS/WFS implementations. One implemented in Django would be desirable, e.g., django.contrib.gis.wms. Thoughts anyone? (OWSLib looks good, see below)

Collaboration

  • PCL (Python Cartographic Library), now part of GIS Python, has done a lot of good work already. Let's apply the DRY principle.
  • Strong opportunities for collaboration with regards to:
    • Mapping framework
    • WMS/WFS Framework -- OWSLib looks excellent for this (BSD licensed and has unit tests!)
    • Utilities
    • Database representation ideas
    • GEOS support: Sean Gillies (lead developer of PCL) is looking for help maintaining the Python/SWIG interface to GEOS. If the SWIG interface is no longer maintained, we might have to move to PCL for up-to-date GEOS library support.

Example

Geographic Models

Here is an example of how the model API currently works (assume this example is in geo_app/models.py):

from django.contrib.gis.db import models

class District(models.Model, models.GeoMixin):
    name = models.CharField(maxlength=35)
    num  = models.IntegerField()
    poly = models.PolygonField()

    objects = models.GeoManager()

class School(models.Model, models.GeoMixin):
    name  = models.CharField(maxlength=35)
    point = models.PointField(index=True)

    objects = models.GeoManager()

Notes: The GeoMixin class allows for extra instance methods. The index keyword indicates that a GiST index should be created for the School model's PointField.

Using syncdb

Use manage.py to invoke syncdb as you normally would (sqlall is shown first to preview the SQL that will be run):

$ python manage.py sqlall geo_app
BEGIN;
CREATE TABLE "geo_app_school" (
    "id" serial NOT NULL PRIMARY KEY,
    "name" varchar(35) NOT NULL
);
CREATE TABLE "geo_app_district" (
    "id" serial NOT NULL PRIMARY KEY,
    "name" varchar(35) NOT NULL,
    "num" integer NOT NULL
);
SELECT AddGeometryColumn('geo_app_school', 'point', 4326, 'POINT', 2);
CREATE INDEX "geo_app_school_point_id" ON "geo_app_school" USING GIST ( "point" GIST_GEOMETRY_OPS );
SELECT AddGeometryColumn('geo_app_district', 'poly', 4326, 'POLYGON', 2);
COMMIT;
$ python manage.py syncdb geo_app

Note: The geometry columns are created outside of the CREATE TABLE statements by the AddGeometryColumn function. This is done according to the OpenGIS specification. See Open GIS Consortium, Inc., OpenGIS Simple Feature Specification For SQL, Document 99-049 (May 5, 1999), at Ch. 2.3.8 (Geometry Values and Spatial Reference Systems, pg. 39).

Spatial Queries

After a geographic model has been created, the PostGIS additions to the API may be used. Geographic queries are normally done by using filter() and exclude() on geometry-enabled models with geographic lookup types (see the Database API below for lookup types). In the following example, the first query uses the bbcontains lookup type, which is the same as the PostGIS ~ operator: it checks whether the bounding box of the polygon contains the specific point. The second query uses the PostGIS Contains() function, which calls the GEOS library to test whether the polygon actually contains the specific point, not just its bounding box.

>>> from geo_app.models import District, School
>>> qs1 = District.objects.filter(poly__bbcontains='POINT(-95.362293 29.756539)') 
>>> qs2 = District.objects.filter(poly__contains='POINT(-95.362293 29.756539)') 

Both spatial queries and normal queries using filter() may be used in the same query. For example, the following query set will only show school districts that have 'Houston' in their name and contain the given point within their polygon boundary:

>>> qs = District.objects.filter(name__contains='Houston').filter(poly__contains='POINT(-95.362293 29.756539)')

Or combine the bounding box routines (less accurate, faster) with the GEOS routines (more accurate, slower) to get a query that is both fast and accurate:

>>> qs = District.objects.filter(poly__bbcontains='POINT(-95.362293 29.756539)').filter(poly__contains='POINT(-95.362293 29.756539)')
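
The idea behind chaining the two lookups can be sketched outside the database. The following is only an illustration in plain Python, not what PostGIS or GEOS actually run: the function names and the L-shaped test polygon are made up for this sketch, and the exact test here is a simple ray-casting check on a single ring with no holes.

```python
def bbox(ring):
    """Return (min_x, min_y, max_x, max_y) for a list of (x, y) vertices."""
    xs = [x for x, _ in ring]
    ys = [y for _, y in ring]
    return min(xs), min(ys), max(xs), max(ys)


def bbox_contains(ring, point):
    """Cheap test: is the point inside the ring's bounding box?"""
    min_x, min_y, max_x, max_y = bbox(ring)
    x, y = point
    return min_x <= x <= max_x and min_y <= y <= max_y


def ring_contains(ring, point):
    """Exact test (ray casting): count edges crossed by a ray going right."""
    x, y = point
    inside = False
    # Pair each vertex with the next one, wrapping around to close the ring.
    for (x1, y1), (x2, y2) in zip(ring, ring[1:] + ring[:1]):
        if (y1 > y) != (y2 > y):
            # x coordinate where this edge crosses the horizontal line at y.
            cross_x = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < cross_x:
                inside = not inside
    return inside


def contains(ring, point):
    """Run the cheap bounding-box test first, then the exact test."""
    return bbox_contains(ring, point) and ring_contains(ring, point)


# An L-shaped polygon: its bounding box is the square (0, 0)-(10, 10).
l_shape = [(0, 0), (10, 0), (10, 4), (4, 4), (4, 10), (0, 10)]
in_notch = (8, 8)   # inside the bounding box, outside the polygon
in_shape = (2, 8)   # inside both

print(bbox_contains(l_shape, in_notch), contains(l_shape, in_notch))
print(bbox_contains(l_shape, in_shape), contains(l_shape, in_shape))
```

The point in the notch passes the bounding-box test but fails the exact one, which is why the bounding-box lookup alone is only an approximation.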

Installation

Installing the GeoDjango module also requires installing existing open source geographic libraries and a spatial database (currently only PostGIS). This section describes the installation process for these libraries. Initially, these instructions pertain only to a Linux platform (particularly Debian or Ubuntu). Mac and Windows support will be considered later; however, these instructions will most likely work through the Mac shell. Windows support was originally not expected any time soon, but community support for the prerequisites is better than previously believed, so it will come much earlier than expected.

Django

  • GeoDjango exists in the gis branch from SVN:
    $ svn co http://code.djangoproject.com/svn/django/branches/gis django_gis
    $ ln -s `pwd`/django_gis/django /path/to/site-packages/django
    

GEOS

  • Latest GEOS version is 3.0.0RC4
    • Also requires SWIG >= 1.3.28. (Ubuntu Dapper comes with 1.3.27.)
    • If there's trouble locating your python, include PYTHON=/path/to/python, or --enable-python=/path/to/python.
  • Configure (enabling python), make, and install.
    $ ./configure --enable-python
    $ make
    # make install
    

PROJ.4

  • Latest PROJ.4 version is 4.5.0
  • First, download the PROJ datum shifting files. These will come in handy for coordinate transformations when other programs (like Mapserver or Mapnik) are not able to cope with EPSG transformations (I learned the hard way). Untar/unzip these in the nad subdirectory of the PROJ source. For example, if PROJ was unzipped in a directory named proj, then untar these files in proj/nad. Do this before you do the configure/make/install dance.
  • Next, configure, make and install.
    $ ./configure
    $ make
    # make install 
    

PostGIS

  • Latest PostGIS version is 1.2.1
  • First build & install PostGIS. We are currently using v8.1 of PostgreSQL.
    $ ./configure --with-geos --with-proj
    $ make
    # make install
    
  • Next, create a role and database for your application, and allow it to access PostGIS functionality:
    # su - postgres
    $ psql
    postgres=# CREATE ROLE <user> LOGIN;
    postgres=# \q
    $ createdb -O <user> <db_name>
    $ createlang plpgsql <db_name>
    $ psql -d <db_name> -f /usr/local/share/lwpostgis.sql
    $ psql -d <db_name> -f /usr/local/share/spatial_ref_sys.sql
    $ psql <db_name>
    <db_name>=# GRANT SELECT, UPDATE, INSERT, DELETE ON geometry_columns TO <user>;
    <db_name>=# GRANT SELECT ON spatial_ref_sys TO <user>;
    

  • Finally, update your settings.py to reflect the name and user for the spatially enabled database. So far, we only plan to support the psycopg2 backend, thus: DATABASE_ENGINE='postgresql_psycopg2'.

GDAL

  • Optional, but highly useful for coordinate transformations and reading/writing both vector (e.g. SHP) and raster (e.g. TIFF) geographic data.
    • For example, the following command will convert your SHP file into WGS84 (standard lat/lon). Then you can import directly into your database using shp2pgsql (utility from PostGIS):
      ogr2ogr -t_srs WGS84 output.shp input.shp
      
  • Latest GDAL version is 1.4.0. Configure with GEOS and Python support, then make and install:
    $ ./configure --with-geos --with-python
    $ make
    # make install
    
  • Note: This is done without the 'next generation' SWIG Python bindings. I've had trouble getting them to work, and the rumor is this only works on Windows. The compilation flag to enable these is --with-ngpython, but our packages currently only use the old bindings.

Model API

Fields

The following geometry-enabled fields are available:

  • PointField
  • LineStringField
  • PolygonField
  • MultiPointField
  • MultiLineStringField
  • MultiPolygonField
  • GeometryCollectionField

Field Keywords

  • Field keywords are used during model creation, for example:
    from django.contrib.gis.db import models
    
    class Zip(models.Model, models.GeoMixin):
      code = models.IntegerField()
      poly = models.PolygonField(srid=-1, index=True)
    
      objects = models.GeoManager()
    
  • srid
    • Sets the SRID (Spatial Reference System Identifier) of the geometry to the given value. Defaults to 4326 (WGS84). See Open GIS Consortium, Inc., OpenGIS Simple Feature Specification For SQL, Document 99-049 (May 5, 1999), at Ch. 2.3.8 (Geometry Values and Spatial Reference Systems, pg. 39).
  • index
    • If set to True, will create a GiST index for the given geometry. Update the index with the PostgreSQL command VACUUM ANALYZE (may take a while to execute depending on how large your geographic-enabled tables are).

Creating and Saving Models with Geometry Fields

Here is an example of how to create a geometry object (assuming the Zip model example above):

>>> from zipcode.models import Zip
>>> z = Zip(code=77096, poly='POLYGON(( 10 10, 10 20, 20 20, 20 15, 10 10))')
>>> z.save()

Geometries are represented as strings in either of the formats WKT (Well Known Text) or HEXEWKB (PostGIS specific, essentially a WKB geometry in hexadecimal).
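
To make the two formats concrete, here is a minimal sketch that writes the same point both ways using only the Python standard library. The helper names are hypothetical, and the byte layout (byte-order flag, type code with an SRID flag, SRID, then the coordinates) is my understanding of the PostGIS extended WKB format; treat PostGIS itself as the authority on the encoding.

```python
import binascii
import struct

EWKB_SRID_FLAG = 0x20000000  # assumed PostGIS flag: "an SRID follows the type"
WKB_POINT = 1                # WKB geometry type code for a point


def point_wkt(x, y):
    """Return the Well Known Text form of a 2D point."""
    return "POINT(%s %s)" % (x, y)


def point_hexewkb(x, y, srid=4326):
    """Return a hex-encoded, little-endian EWKB string for a 2D point."""
    raw = struct.pack(
        "<BIIdd",                    # little-endian, no padding
        1,                           # byte order flag: 1 = little-endian
        WKB_POINT | EWKB_SRID_FLAG,  # geometry type with the SRID flag set
        srid,                        # spatial reference system identifier
        x, y,                        # coordinates as 8-byte doubles
    )
    return binascii.hexlify(raw).decode("ascii").upper()


print(point_wkt(-95.362293, 29.756539))
print(point_hexewkb(-95.362293, 29.756539))
```

Either string form could then be passed where the examples on this page pass a WKT string, assuming the field accepts both formats as described above.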

Database API

Note: The following database lookup types can only be used on geographic fields with filter() and exclude(). Filters on 'normal' fields (e.g. CharField) may be chained with those on geographic fields. Thus, geographic queries take the following form (assuming the Zip model used in the Model API section above):

>>> qs = Zip.objects.filter(<geo field A>__<geo lookup type>=<geo string B>)
>>> qs = Zip.objects.exclude(...)

PostGIS Operator Field Lookup Types

  • See generally, "Operators", PostGIS Documentation at Ch. 6.2.2
  • Note: This API is subject to some change -- we're open to suggestions.
  • overlaps_left
    • Returns true if A's bounding box overlaps or is to the left of B's bounding box.
    • PostGIS equivalent "&<"
  • overlaps_right
    • Returns true if A's bounding box overlaps or is to the right of B's bounding box.
    • PostGIS equivalent "&>"
  • left
    • Returns true if A's bounding box is strictly to the left of B's bounding box.
    • PostGIS equivalent "<<"
  • right
    • Returns true if A's bounding box is strictly to the right of B's bounding box.
    • PostGIS equivalent ">>"
  • overlaps_below
    • Returns true if A's bounding box overlaps or is below B's bounding box.
    • PostGIS equivalent "&<|"
  • overlaps_above
    • Returns true if A's bounding box overlaps or is above B's bounding box.
    • PostGIS equivalent "|&>"
  • strictly_below
    • Returns true if A's bounding box is strictly below B's bounding box.
    • PostGIS equivalent "<<|"
  • strictly_above
    • Returns true if A's bounding box is strictly above B's bounding box.
    • PostGIS equivalent "|>>"
  • same_as
    • The "same as" operator. It tests actual geometric equality of two features. So if A and B are the same feature, vertex-by-vertex, the operator returns true.
    • PostGIS equivalent "~="
  • contained
    • Returns true if A's bounding box is completely contained by B's bounding box.
    • PostGIS equivalent "@"
  • bbcontains
    • Returns true if A's bounding box completely contains B's bounding box.
    • PostGIS equivalent "~"
  • bboverlaps
    • Returns true if A's bounding box overlaps B's bounding box.
    • PostGIS equivalent "&&"
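
All of these operators compare bounding boxes only, never the geometries themselves. A rough sketch of a few of them in plain Python follows. The Box type and function bodies are illustrative assumptions, not the PostGIS implementation; in particular, how boxes that merely touch along an edge are treated is a guess here.

```python
from collections import namedtuple

# Hypothetical bounding-box type used only for this sketch.
Box = namedtuple("Box", "xmin ymin xmax ymax")


def left(a, b):
    """<< : A's box is strictly to the left of B's box."""
    return a.xmax < b.xmin


def strictly_below(a, b):
    """<<| : A's box is strictly below B's box."""
    return a.ymax < b.ymin


def bboverlaps(a, b):
    """&& : the two boxes share at least one point."""
    return (a.xmin <= b.xmax and b.xmin <= a.xmax and
            a.ymin <= b.ymax and b.ymin <= a.ymax)


def bbcontains(a, b):
    """~ : A's box completely contains B's box."""
    return (a.xmin <= b.xmin and a.ymin <= b.ymin and
            a.xmax >= b.xmax and a.ymax >= b.ymax)


def contained(a, b):
    """@ : A's box is completely contained by B's box."""
    return bbcontains(b, a)


small = Box(0, 0, 2, 2)
east = Box(3, 0, 5, 2)
big = Box(-1, -1, 10, 10)

print(left(small, east), bboverlaps(small, east))
print(bbcontains(big, small), contained(small, big))
```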

PostGIS GEOS Function Field Lookup Types

  • See generally "Geometry Relationship Functions", PostGIS Documentation at Ch. 6.1.2.
  • This documentation will be updated completely with the content from the aforementioned PostGIS docs.
  • distance
    • Warning: This function lookup type does not work, and will be moved to a routine as part of GeoManager.
    • Return the cartesian distance between two geometries in projected units.
    • PostGIS equivalent Distance(geometry, geometry)
  • equals
    • Requires GEOS
    • Returns 1 (TRUE) if the given Geometries are "spatially equal".
    • Use this for a 'better' answer than '='. equals('LINESTRING(0 0, 10 10)','LINESTRING(0 0, 5 5, 10 10)') is true.
    • PostGIS equivalent Equals(geometry, geometry), OGC SPEC s2.1.1.2
  • disjoint
    • Requires GEOS
    • Returns 1 (TRUE) if the Geometries are "spatially disjoint".
    • PostGIS equivalent Disjoint(geometry, geometry)
  • intersects
    • PostGIS equivalent Intersects(geometry, geometry)
  • touches
    • PostGIS equivalent Touches(geometry, geometry)
  • crosses
    • PostGIS equivalent Crosses(geometry, geometry)
  • overlaps
    • PostGIS equivalent Overlaps(geometry, geometry)
  • contains
    • PostGIS equivalent Contains(geometry, geometry)
  • relate
    • PostGIS equivalent Relate(geometry, geometry)
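
On the distance entry above: a cartesian distance is a straight-line distance computed directly on the stored coordinates, so the result is in whatever units the coordinate system uses. A minimal sketch for two points (the function name is made up for illustration, and only the point-to-point case is covered):

```python
import math


def cartesian_distance(p, q):
    """Straight-line distance between two (x, y) points, in coordinate units."""
    return math.hypot(q[0] - p[0], q[1] - p[1])


# In a projected system measured in meters, this is 5 meters.
print(cartesian_distance((0, 0), (3, 4)))

# With SRID 4326 the coordinates are degrees, so the result is in degrees,
# not a distance on the ground.
print(cartesian_distance((-95.362293, 29.756539), (-95.460822, 29.745463)))
```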

Extra Instance Methods

A model with geometry fields will get the following methods:

get_GEOM_wkt

For every geometry field, the model object will have a get_GEOM_wkt method, where GEOM is the name of the geometry field. For example (using the School model from above):

>>> skool = School.objects.get(name='PSAS')
>>> print skool.get_point_wkt()
POINT(-95.460822 29.745463)

get_GEOM_centroid

For every geometry field, the model object will have a get_GEOM_centroid method, where GEOM is the name of the geometry field. This routine will return the centroid of the geometry. For example (using the District model from above):

>>> dist = District.objects.get(name='Houston ISD')
>>> print dist.get_poly_centroid()
POINT(-95.231713 29.723235)
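
For intuition, the centroid of a simple polygon can be computed from its vertices with the standard area-weighted formula. This is a sketch of that formula for a single ring with no holes, not the GEOS routine itself, and the function name is hypothetical.

```python
def ring_centroid(ring):
    """Return the (x, y) centroid of a simple polygon ring.

    The ring is a list of (x, y) vertices; it may or may not repeat the
    first vertex at the end. A ring with zero area raises ZeroDivisionError.
    """
    area2 = 0.0  # twice the signed area
    cx = 0.0
    cy = 0.0
    for (x1, y1), (x2, y2) in zip(ring, ring[1:] + ring[:1]):
        cross = x1 * y2 - x2 * y1
        area2 += cross
        cx += (x1 + x2) * cross
        cy += (y1 + y2) * cross
    # Centroid = sum / (6 * area) = sum / (3 * area2).
    return cx / (3.0 * area2), cy / (3.0 * area2)


print(ring_centroid([(0, 0), (2, 0), (2, 2), (0, 2)]))
```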

get_GEOM_area

For every geometry field, the model object will have a get_GEOM_area method, where GEOM is the name of the geometry field. This routine will return the area of the geometry.

>>> dist = District.objects.get(name='Houston ISD')
>>> print dist.get_poly_area()
0.08332

Note: The units system needs to be figured out here, since I don't know what these units represent.
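
One way to reason about the units: if the area is a planar area computed directly on the stored coordinates (which is how I understand GEOS to work), then the result is in squared coordinate units. For the default SRID of 4326 that would mean square degrees rather than a ground area. The shoelace formula below is a sketch of that kind of planar calculation, with a made-up function name, using the polygon from the Zip example above.

```python
def ring_area(ring):
    """Planar area of a simple polygon ring, in squared coordinate units."""
    total = 0.0
    # Pair each vertex with the next one, wrapping around to close the ring.
    for (x1, y1), (x2, y2) in zip(ring, ring[1:] + ring[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


# The ring from POLYGON((10 10, 10 20, 20 20, 20 15, 10 10)).
zip_ring = [(10, 10), (10, 20), (20, 20), (20, 15), (10, 10)]
print(ring_area(zip_ring))

# A box one degree on each side has an area of one "square degree".
print(ring_area([(0, 0), (1, 0), (1, 1), (0, 1)]))
```

Getting an area in square meters or square miles would require transforming the geometry to a suitable projected coordinate system first.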
