Supported spatial data types include:
Geographic objects, which are also made up of points, lines, polygons, etc., in 2-dimensional space. They are projected onto the surface of a sphere and are represented in SQL by the
GEOGRAPHYdata type. (Technically, they are projected onto a spheroid: "a sphere with a bulge"). The spheroid projection means that:
- The X and Y coordinates of 2-dimensional points are actually Longitude and Latitude values.
- The paths between geographic objects are not straight lines; they are curves, and so the distances between objects are calculated using great circle math.
In this section, we'll describe how to:
- Install CockroachDB with spatial support.
- Use it with a real-world spatial data set.
Install a build of CockroachDB with support for spatial data by following the instructions at Install CockroachDB.
cockroach sql --insecure --host=localhost --port=26257
Leave this shell open for use in the examples below.
Example 1. Load NWS Tornado data
In this example we will import a specific data set that is available as a Shapefile.
Step 1. Convert the shapefile to SQL
In this example, we will import the US National Weather Service's Tornadoes (1950-2018) data set (specifically the tornado start points data set).
First, unzip the tornado data:
Next, convert the Shapefile data to SQL using the following command:
shp2pgsql 1950-2018-torn-initpoint.shp > tornado-points.sql &
Step 2. Create the tornadoes database
CREATE DATABASE tornadoes; USE tornadoes;
IMPORT the tornado data
Start a local file server in the directory where the file
IMPORT PGDUMP 'http://localhost:3000/tornado-points.sql';
job_id | status | fraction_completed | rows | index_entries | bytes ---------------------+-----------+--------------------+-------+---------------+----------- 604425003155914753 | succeeded | 1 | 63645 | 0 | 18287213 (1 row)
Step 4. Query the tornado data
Once the data has finished importing, we can query the tornado data from SQL.
For example, we may be interested in the 1999 Oklahoma tornado outbreak, which is described by Wikipedia as:
a significant tornado outbreak that affected much of the Central and parts of the Eastern United States, with the highest record-breaking wind speeds of 302 ± 22 mph (486 ± 35 km/h). During this week-long event, 154 tornadoes touched down (including one in Canada), more than half of them on May 3 and 4 when activity reached its peak over Oklahoma, Kansas, Nebraska, Texas, and Arkansas.
According to the wiki page linked above, there were 152 tornadoes confirmed between May 2-8, 1999 (one in Canada).
We can try to verify this number against the NWS's tornado data set with the following query:
SELECT COUNT(*) FROM "1950-2018-torn-initpoint" WHERE yr = 1999 AND mo = 5 AND dy >= 02 AND dy <= 08;
count --------- 150 (1 row)
It might be interesting to look into why these numbers are different!
Next, let's get a list of starting points for all of the tornadoes in the outbreak that started in Oklahoma:
SELECT ST_AsText(geom) FROM "1950-2018-torn-initpoint" WHERE yr = 1999 AND st = 'OK' AND mo = 5 AND dy > 02 AND dy <= 08;
st_astext -------------------------------------------------- POINT (-98.379999999999995 34.770000000000003) POINT (-98.329999999999998 34.780000000000001) POINT (-98.319999999999993 34.880000000000003) POINT (-98.230000000000004 34.920000000000002) POINT (-99.019999999999996 34.799999999999997) POINT (-98.25 35.030000000000001) POINT (-98.120000000000005 34.969999999999999) POINT (-98.030000000000001 35.049999999999997) POINT (-97.980000000000004 35.079999999999998) POINT (-98.569999999999993 34.950000000000003) POINT (-97.849999999999994 35.130000000000003) POINT (-98.430000000000007 34.979999999999997) POINT (-98.329999999999998 35.07) POINT (-98.019999999999996 35.719999999999999) POINT (-97.980000000000004 35.719999999999999) POINT (-97.599999999999994 35.299999999999997) POINT (-98.280000000000001 35.119999999999997) POINT (-98.200000000000003 35.170000000000002) POINT (-97.400000000000006 35.399999999999999) POINT (-98.099999999999994 35.18) POINT (-98.069999999999993 35.270000000000003) POINT (-98.129999999999995 35.270000000000003) POINT (-98.019999999999996 35.32) POINT (-97.299999999999997 35.469999999999999) POINT (-98 35.270000000000003) POINT (-97.969999999999999 35.399999999999999) POINT (-97.219999999999999 35.549999999999997) POINT (-97.920000000000002 35.420000000000002) POINT (-97.900000000000006 35.43) POINT (-97.230000000000004 35.579999999999998) POINT (-98.370000000000005 35.880000000000003) POINT (-97.920000000000002 35.520000000000003) POINT (-98.280000000000001 35.649999999999999) POINT (-97.849999999999994 35.530000000000001) POINT (-97.200000000000003 35.130000000000003) POINT (-97.780000000000001 35.649999999999999) POINT (-98.030000000000001 35.850000000000001) POINT (-97.719999999999999 35.700000000000003) POINT (-98.030000000000001 35.880000000000003) POINT (-97 35.369999999999997) POINT (-97.680000000000007 35.780000000000001) POINT (-97.950000000000003 35.93) POINT (-98.170000000000002 35.850000000000001) POINT (-97.680000000000007 35.880000000000003) POINT (-97.879999999999995 36.020000000000003) POINT (-97.950000000000003 36.020000000000003) POINT (-98 35.5) POINT (-97.879999999999995 36.100000000000001) POINT (-97.969999999999999 35.549999999999997) POINT (-96.799999999999997 35.649999999999999) POINT (-97.650000000000006 36.119999999999997) POINT (-98.25 36.299999999999997) POINT (-97.719999999999999 35.780000000000001) POINT (-97.780000000000001 35.850000000000001) POINT (-97.599999999999994 35.920000000000002) POINT (-97.420000000000002 36.030000000000001) POINT (-96.129999999999995 35.979999999999997) POINT (-96.069999999999993 36.020000000000003) POINT (-95.650000000000006 35.630000000000003) POINT (-95.180000000000007 35.950000000000003) POINT (-94.730000000000004 36) POINT (-97.400000000000006 35.32) POINT (-96.400000000000006 36.469999999999999) POINT (-95.579999999999998 34.579999999999998) POINT (-95.219999999999999 34.880000000000003) POINT (-95 35.130000000000003) POINT (-94.780000000000001 35.299999999999997) POINT (-94.700000000000003 35.43) POINT (-94.549999999999997 35.57) (69 rows)
We can see that almost half of all of the tornadoes in this outbreak began in Oklahoma.
It might be interesting to draw these points on a map. The image below shows the points from the query above drawn as a simple polygon on a map of Oklahoma. The boxes around the polygon show the spatial index coverings for the polygon.
(Map data © 2020 Google)
Example 2. Load NYC data for the PostGIS tutorial
Follow the steps below to load the SQL for the NYC data used in the Introduction to PostGIS tutorial.
CockroachDB can work with the tutorial up to Chapter 22, with the following exceptions:
- Do not try to load Shapefiles via the GUI as shown in the tutorial. Instead, follow the steps below to load the SQL data directly into CockroachDB. (We have already converted the tutorial Shapefiles to SQL for you.)
- We do not support GML or KML data.
- We do not support SVG.
Step 1. Load the NYC data
Clone the data set:
git clone https://github.com/otan-cockroach/otan-scripts
Load the SQL files into your CockroachDB cluster:
cat otan-scripts/geospatial_sql/*.sql | cockroach sql --insecure --host=localhost --port=26257
The command above will take a few minutes to run.
Step 2. Follow the PostGIS tutorial
When the cluster is finished loading the data, open a SQL shell and start working through the Introduction to PostGIS tutorial:
cockroach sql --insecure --host=localhost --port=26257
However, we do not yet implement the full list of PostGIS built-in functions and operators. Also, our spatial indexing works differently (see the Performance section below). For a list of the spatial functions we support, see Geospatial functions.
If your application needs support for functions that are not yet implemented, please check out our meta-issue for built-in function support on GitHub, which describes how to find an issue for the built-in function(s) you need.
For a list of other known limitations, see Known Limitations.
For general CockroachDB troubleshooting information, see this troubleshooting overview.
If you need help troubleshooting an issue with our spatial support, please get in touch using our Support resources.
In order to avoid full table scans, make sure to add indexes to any columns that are accessed as part of a predicate in the
WHERE clause. For geospatial columns, the index will only be used if the column is accessed using an index-accelerated geospatial function from the list below (all of these functions work on
GEOMETRY data types; a
* means that a function also works on
GEOGRAPHY data types):
To use a version of a function from the list above that will explicitly not use the index, add an underscore (
_) to the beginning of the function name, e.g.,
The syntax for adding an index to a geometry column is:
CREATE INDEX tornado_geom_idx ON tornadoes USING GIST (geom);
This creates a (spatial) inverted index on the
Because CockroachDB is a scale-out, multi-node database, our spatial indexing strategy is based on a space-filling curve/quad-tree design (also known as "divide the space"), rather than the R-Tree data structure used by some other spatial databases (also known as "divide the objects"). Other databases that use a "divide the space" strategy include Microsoft SQL Server and MongoDB.
For more detailed information about how CockroachDB's spatial indexes work, see Spatial indexes.
If you encounter behavior that you think is due to a performance issue, please get in touch using our Support resources.
- Install CockroachDB
- Spatial Features
- Spatial indexes
- Spatial & GIS Glossary of Terms
- Working with Spatial Data
- Migrate from Shapefiles
- Migrate from GeoJSON
- Migrate from GeoPackage
- Migrate from OpenStreetMap
- Spatial functions
- Well known text
- Well known binary
- SRID 4326 - longitude and latitude
- Introducing Distributed Spatial Data in Free, Open Source CockroachDB (blog post)
- Troubleshooting overview
- Support resources