From: Regina Obe Date: Tue, 22 May 2012 13:34:11 +0000 (+0000) Subject: Add new functions and notes that 2.1 is tiger_2011 instead of tiger_2010. More on... X-Git-Tag: 2.1.0beta2~1003 X-Git-Url: https://granicus.if.org/sourcecode?a=commitdiff_plain;h=496692243db8b0eb6e663732639c577df0b387ae;p=postgis Add new functions and notes that 2.1 is tiger_2011 instead of tiger_2010. More on upgrade instructions from tiger_2010 to tiger_2011 git-svn-id: http://svn.osgeo.org/postgis/trunk@9780 b70326c6-7e19-0410-871a-916f4a2858ee --- diff --git a/doc/extras_tigergeocoder.xml b/doc/extras_tigergeocoder.xml index 0ef4e86a5..e9df246c1 100644 --- a/doc/extras_tigergeocoder.xml +++ b/doc/extras_tigergeocoder.xml @@ -5,20 +5,20 @@ A plpgsql based geocoder written to work with the TIGER (Topologically Integrated Geographic Encoding and Referencing system ) / Line and Master Address database export released by the US Census Bureau. In prior versions the TIGER files were released in ASCII format. The older geocoder used to work with that format is in extras/tiger_geocoder/tiger_2006andbefore. - There are four components to the geocoder: the data loader functions, the address normalizer, the address geocoder, and the reverse geocoder. The latest version updated to use the TIGER 2010 census data is located in the extras/tiger_geocoder/tiger_2010 folder. + There are four components to the geocoder: the data loader functions, the address normalizer, the address geocoder, and the reverse geocoder. The latest version updated to use the TIGER 2011 census data is located in the extras/tiger_geocoder/tiger_2011 folder. Although it is designed specifically for the US, a lot of the concepts and functions are applicable and can be adapted to work with other country address and road networks. The script builds a schema called tiger to house all the tiger related functions, reusable lookup data such as road type prefixes, suffixes, states, various control tables for managing data load, and skeleton base tables from which all the tiger loaded tables inherit from. Another schema called tiger_data is also created which houses all the census data for each state that the loader downloads from Census site and loads into the database. In the current model, each set of state tables is - prefixed with the state code e.g ma_addr, ca_edges etc with constraints to enforce only that state data. Each of these tables inherits from the base addr, faces, edges, etc located in the tiger schema. + prefixed with the state code e.g ma_addr, ma_edges etc with constraints to enforce only that state data. Each of these tables inherits from the tables addr, faces, edges, etc located in the tiger schema. All the geocode functions only reference the base tables, so there is no requirement that the data schema be called tiger_data or that data can't be further partitioned into other schemas -- e.g a different schema for each state, as long as all the tables inherit from the tables in the tiger schema. -If you are using a prerelease version of PostGIS 2.0.0 tiger geocoder, +If you are using tiger geocoder (tiger_2010), you can upgrade the scripts using the accompanying upgrade_geocoder.bat -/ .sh scripts in tiger_2010. We'll be refining the upgrade scripts -until release. +/ .sh scripts in tiger_2011. One major change between tiger_2010 and tiger_2011 is that the county and county and state tables are no longer broken out by state. We'll be refining the upgrade scripts +until release. If you have data from tiger_2010 and want replace with tiger_2011 refer to Design: @@ -115,6 +115,53 @@ DROP INDEX tiger_data.idx_tiger_data_ma_county_snd_name; + + + Drop_Nation_Tables_Generate_Script + + Generates a script that drops all tables in the specified schema that start with county_all, state_all or stae code followed by county or state. + + + + + + text Drop_State_Tables_Generate_Script + text param_state + text param_schema=tiger_data + + + + + + Description + + Generates a script that drops all tables in the specified schema that start with county_all, state_all or stae code followed by county or state. This is needed if you are upgrading from tiger_2010 to tiger_2011 data. + + + Availability: 2.1.0 + + + + + + + Examples + SELECT drop_nation_tables_generate_script(); +DROP TABLE tiger_data.county_all; +DROP TABLE tiger_data.county_all_lookup; +DROP TABLE tiger_data.state_all; +DROP TABLE tiger_data.ma_county; +DROP TABLE tiger_data.ma_state; + + + + + + See Also + + + + Drop_State_Tables_Generate_Script @@ -666,6 +713,150 @@ for z in *.zip; do $UNZIPTOOL -o -d $TMPDIR $z; done + + + See Also + + + + + + + Loader_Generate_Script + + Generates a shell script for the specified platform for the specified states that will download Tiger data, stage and load into tiger_data schema. Each state script is returned as a separate record. Latest version supports Tiger 2010 structural changes and also loads census tract, block groups, and blocks tables. + + + + + + setof text loader_generate_script + text[] param_states + text os + + + + + + Description + + Generates a shell script for the specified platform for the specified states that will download Tiger data, stage and load into tiger_data schema. Each state script is returned as a separate record. + It uses unzip on Linux (7-zip on Windows by default) and wget to do the downloading. It uses to load in the data. Note the smallest unit it does is a whole state, but you can overwrite this by downloading the files yourself. It will only + process the files in the staging and temp folders. + It uses the following control tables to control the process and different OS shell syntax variations. + + + + loader_variables keeps track of various variables such as census site, year, data and staging schemas + + + loader_platform profiles of various platforms and where the various executables are located. Comes with windows and linux. More can be added. + + + loader_lookuptables each record defines a kind of table (state, county), whether to process records in it and how to load them in. Defines the steps to import data, stage data, add, removes columns, indexes, and constraints for each. Each table is prefixed with the state and inherits from a table in the tiger schema. e.g. creates tiger_data.ma_faces which inherits from tiger.faces + + + Availability: 2.0.0 to support Tiger 2010 structured data and load census tract (tract), block groups (bg), and blocks (tabblocks) tables . + + + + + + + Examples + Generate script to load up data for 2 states in Windows shell script format. + SELECT loader_generate_script(ARRAY['MA','RI'], 'windows') AS result; +-- result -- +set STATEDIR="\gisdata\www2.census.gov\geo\pvs\tiger2010st\44_Rhode_Island" +set TMPDIR=\gisdata\temp\ +set UNZIPTOOL="C:\Program Files\7-Zip\7z.exe" +set WGETTOOL="C:\wget\wget.exe" +set PGBIN=C:\Program Files\PostgreSQL\8.4\bin\ +set PGPORT=5432 +set PGHOST=localhost +set PGUSER=postgres +set PGPASSWORD=yourpasswordhere +set PGDATABASE=geocoder +set PSQL="%PGBIN%psql" +set SHP2PGSQL="%PGBIN%shp2pgsql" + +%WGETTOOL% http://www2.census.gov/geo/pvs/tiger2010st/44_Rhode_Island/ --no-parent --relative --recursive --level=2 --accept=zip,txt --mirror --reject=html +: +: + Generate sh script + SELECT loader_generate_script(ARRAY['MA','RI'], 'sh') AS result; +-- result -- +STATEDIR="/gisdata/www2.census.gov/geo/pvs/tiger2010st/44_Rhode_Island" +TMPDIR="/gisdata/temp/" +UNZIPTOOL=unzip +PGPORT=5432 +PGHOST=localhost +PGUSER=postgres +PGPASSWORD=yourpasswordhere +PGDATABASE=geocoder +PSQL=psql +SHP2PGSQ=shp2pgsql + +wget http://www2.census.gov/geo/pvs/tiger2010st/44_Rhode_Island/ --no-parent --relative --recursive --level=2 --accept=zip,txt --mirror --reject=html +: +: + + + + + + See Also + + + + + + + Loader_Generate_Nation_Script + + Generates a shell script for the specified platform that loads in the county and state lookup tables. + + + + + + text loader_generate_nation_script + text os + + + + + + Description + + Generates a shell script for the specified platform that loads in the county_all, county_all_lookup, state_all tables into tiger_data schema. These inherit respectively from the county, county_lookup, state tables in tiger schema. + It uses unzip on Linux (7-zip on Windows by default) and wget to do the downloading. It uses to load in the data. + It uses the following control tables tiger.loader_platform, tiger.loader_variables, and tiger.loader_lookuptables to control the process and different OS shell syntax variations. + + + + loader_variables keeps track of various variables such as census site, year, data and staging schemas + + + loader_platform profiles of various platforms and where the various executables are located. Comes with windows and linux/unix. More can be added. + + + loader_lookuptables each record defines a kind of table (state, county), whether to process records in it and how to load them in. Defines the steps to import data, stage data, add, removes columns, indexes, and constraints for each. Each table is prefixed with the state and inherits from a table in the tiger schema. e.g. creates tiger_data.ma_faces which inherits from tiger.faces + + + Availability: 2.1.0 + If you were running tiger_2010 version and you want to reload as state with tiger_2011, you'll need to for the very first load generate and run drop statements before you run this script. + + + + + Examples + Generate script script to load nation data Windows. + SELECT loader_generate_nation_script('windows'); + Generate script to load up data for Linux/Unix systems. + SELECT loader_generate_nation_script('sh'); + + See Also