Import data
Import paths
Danger
With dynamic segmentation, importing paths is very risky if paths are already present in the same area in Geotrek: it is only safe for an area where no path has been created yet.
Indeed, if you import paths where paths, treks, POIs or trails already exist, the linked topologies might be impacted.
Before importing a paths layer, it is important to prepare it. Paths must:
- have a valid geometry
- have a simple geometry (no self-intersection)
- be cut at every intersection
- not duplicate or cover other paths
We use QGIS to clean a path layer, with the GRASS plugin. Here are the operations (a quick validity check sketch follows the list):
- check the SRID (must be the same as in Geotrek)
- vectors → geometric tools → “collect geometries”
- vectors → geometric tools → “group”
- clean geometries
  - search “v_clean” in the “Processing toolbox”
  - select the following options in the cleaning tool: break, snap, duplicate (or rmdup), rmline, rmdangle, chdangle, bpol, prune
  - as threshold enter 2,2,2,2,2,2,2,2 (2 meters for each option)
- delete duplicate geometries
  - search “duplicate” in the “Processing toolbox”
- regroup lines
  - search “v.build.polyline” in the “Processing toolbox”
  - select “first” in “Category number mode”
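After cleaning, you can quickly verify the layer with a short Python script. This is a minimal sketch, assuming the optional fiona and shapely packages are installed and that the layer is a shapefile; the file name is only an example.

import fiona
from shapely.geometry import shape

# Minimal check of a cleaned path layer: report invalid or self-intersecting geometries.
with fiona.open("Troncons.shp") as layer:
    print("Layer CRS:", layer.crs)  # should match the SRID used in Geotrek
    for feature in layer:
        geom = shape(feature["geometry"])
        if not geom.is_valid:
            print("Invalid geometry, feature id:", feature["id"])
        elif not geom.is_simple:
            print("Self-intersecting geometry, feature id:", feature["id"])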
There are two ways to import paths: importing your shapefile from the command line, or via QGIS following this blog post.
To import a shapefile containing your paths, use the command loadpaths:
sudo geotrek loadpaths {Troncons.shp} \
    --srid=2154 --comments-attribute IT_VTT IT_EQ IT_PEDEST \
    --encoding latin9 -i
Import data from touristic data systems (SIT)
Configure APIDAE (ex-SITRA) import
To import touristic content from APIDAE (ex-SITRA), edit /opt/geotrek-admin/var/conf/parsers.py file with the following content:
from geotrek.tourism.parsers import TouristicContentApidaeParser

class HebergementParser(TouristicContentApidaeParser):
    label = "Hébergements"
    api_key = 'xxxxxxxx'
    project_id = 9999
    selection_id = 99999
    category = "Hébergement"
    type1 = ["Camping"]
    type2 = ["3 étoiles", "Tourisme et Handicap"]  # just remove this line if no type2
Then set up appropriate values:
- label at your convenience
- api_key, project_id and selection_id according to your APIDAE (ex-SITRA) configuration
- category, type1 and type2 (optional) to select in which Geotrek category/type imported objects should go

You can add delete = True in your class if you want to delete objects in the Geotrek database that have been deleted in your APIDAE selection. It will only delete objects that match your class settings (category, types, portal, provider…).

You can also use the class HebergementParser if you only import accommodations.

See https://github.com/GeotrekCE/Geotrek-admin/blob/master/geotrek/tourism/parsers.py for details about parsers.
You can duplicate the class. Each class must have a different name. Don’t forget the u character before strings if they contain non-ascii characters.
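For example, a second class can sit next to HebergementParser in the same parsers.py; the class name, label, selection id and category below are only placeholders:

class ActivitesParser(TouristicContentApidaeParser):
    # hypothetical second APIDAE selection, imported into another Geotrek category
    label = "Activités de pleine nature"
    api_key = 'xxxxxxxx'
    project_id = 9999
    selection_id = 88888
    category = "Activité"
    delete = True  # optional: remove objects deleted from this APIDAE selection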
To apply changes, you may have to run sudo service geotrek restart.
Configure Marque Esprit Parc import
To import touristic content from Esprit Parc national database, create (or update) /opt/geotrek-admin/var/conf/parsers.py file with the following content:
from geotrek.tourism.parsers import EspritParcParser

class XXXEspritParcParser(EspritParcParser):
    label = "Marque Esprit Parc"
    url = "https://gestion.espritparcnational.com/ws/?f=getProduitsSelonParc&codeParc=XXX"
Then set up appropriate values:
- XXX: the unique national park code (ex: PNE)
You can duplicate the class. Each class must have a different name. Don’t forget the u character before strings if they contain non-ascii characters.
In this case, categories and types in the Geotrek database have to be the same as in the Esprit Parc database. Otherwise, missing categories and types will be created in the Geotrek database.
Imported contents will be automatically published and approved.
If you use a URL that filters a single category, you can name the class after it. Example to get only honey products, and to set the Geotrek category and type into which to import them:
class MielEspritParcParser(EspritParcParser):
    label = "Miel Esprit Parc national"
    url = "https://gestion.espritparcnational.com/ws/?f=getProduitsSelonParc&codeParc=XXX&typologie=API"
    constant_fields = {
        'category': "GeotrekCategoryName",
        'published': True,
        'approved': True,
        'deleted': False,
    }
    m2m_constant_fields = {
        'type1': "GeotrekTypeName",
    }
URL to get Esprit parc types: https://gestion.espritparcnational.com/ws/?f=getTypologieProduits.
Sensitive areas import
When the sensitive areas module is enabled, Geotrek provides 3 parsers to import data:
- Import sensitive areas from http://biodiv-sports.fr (geotrek.sensitivity.parsers.BiodivParser). By default this parser imports all sensitive areas within the configured spatial extent.
- Import species sensitive areas from a zipped shapefile. Imported field names are: espece (required), contact and descriptio. Species with corresponding names have to be created manually before import.
- Import regulatory sensitive areas from a zipped shapefile. Imported field names are: nom (required), contact, descriptio, periode (month numbers separated with commas), pratiques (separated with commas), and url. Practices with corresponding names have to be created manually before import.
You can start imports from the “Import” menu or from the command line. You can override these parsers in your var/conf/parsers.py file.
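For instance, here is a minimal override sketch for var/conf/parsers.py, assuming you only want to change the label shown in the import menu and the source url (both are generic parser settings, listed further below); the values are examples only:

from geotrek.sensitivity.parsers import BiodivParser

class MyBiodivParser(BiodivParser):
    # label and url are generic parser settings; values are examples only
    label = "Biodiv'Sports"
    url = "https://biodiv-sports.fr"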
Multiple imports
When you need to import data for the same object from 2 different parsers, you can force the aggregation of both values in the many-to-many relationship case. It can be interesting with portals, for example.
Param for the aggregation: m2m_aggregate_fields
Here is an example with 2 parsers:
class Portal_1Parser(XXXParser):
    portal = "portal_1"

class AggregateParser(XXXParser):
    portal = "portal_2"
    m2m_aggregate_fields = ["portal"]
Then, when you import the first parser Portal_1Parser, you get multiple objects with portal_1 as portal.
If any object of the Portal_1Parser is also in AggregateParser, the values of the fields listed in m2m_aggregate_fields are not replaced but aggregated.
Your object, present in both sources, will then have both portals: portal_1, portal_2.
In this example, whenever you re-import the first parser Portal_1Parser, portals are replaced because m2m_aggregate_fields is not set on it. So be careful to import parsers in the right order, or add the param m2m_aggregate_fields on all parsers (see the sketch below).
If you need to cancel the aggregation of portals, remove the param m2m_aggregate_fields.
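A sketch of the order-independent variant, with the aggregation param declared on both parsers (same placeholder class and portal names as above):

class Portal_1Parser(XXXParser):
    portal = "portal_1"
    m2m_aggregate_fields = ["portal"]

class Portal_2Parser(XXXParser):
    portal = "portal_2"
    m2m_aggregate_fields = ["portal"]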
Importing from multiple sources with deletion
When importing data for the same model using two (or more) different sources, the provider field should be used to differentiate between sources, allowing object deletion to be enabled with delete = True without causing the last parser to delete objects created by preceding parsers.
In the following example, Provider_1Parser and Provider_2Parser will each import their objects, set the provider field on these objects, and only delete objects that disappeared from their respective source since last parsing.
class Provider_1Parser(XXXXParser):
    delete = True
    provider = "provider_1"

class Provider_2Parser(XXXParser):
    delete = True
    provider = "provider_2"
Danger
It is recommended to use provider from the first import. Do not add a provider field to preexisting parsers that have already imported objects, or you will have to manually set the same value for provider on all objects already created by this parser.
Danger
If a parser does not have a provider value, it will not take providers into account, meaning that it could delete objects from preceding parsers even if these other parsers do have a provider themselves.
The following example would cause NoProviderParser to delete objects from Provider_2Parser and Provider_1Parser.
class Provider_1Parser(XXXXParser):
    delete = True
    provider = "provider_1"

class Provider_2Parser(XXXParser):
    delete = True
    provider = "provider_2"

class NoProviderParser(XXXParser):
    delete = True
    provider = None  # (default)
Generic settings for your parser
These settings may be overridden when you define a new parser (a short example follows the list):
- label: parser display name (default: None)
- model: import content with this model (default: None)
- filename: file imported if no url (default: None)
- url: flow url imported from if no filename (default: None)
- simplify_tolerance: in meters (default: 0)
- update_only: don't create new contents (default: False)
- delete (default: False)
- duplicate_eid_allowed: if True, allows different contents with the same eid (default: False)
- fill_empty_translated_fields: if True, fills empty translated fields with the same value (default: False)
- warn_on_missing_fields (default: False)
- warn_on_missing_objects (default: False)
- separator (default: '+')
- eid: field name for eid (default: None)
- provider (default: None)
- fields (default: None)
- m2m_fields (default: {})
- constant_fields (default: {})
- m2m_constant_fields (default: {})
- m2m_aggregate_fields (default: [])
- non_fields (default: {})
- natural_keys (default: {})
- field_options (default: {})
- default_language: use another default language for this parser (default: None)
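For illustration, here is a minimal sketch combining a few of these settings on an APIDAE touristic content parser; the class name, label, ids, category and provider values are placeholders:

from geotrek.tourism.parsers import TouristicContentApidaeParser

class RestaurationParser(TouristicContentApidaeParser):
    # hypothetical parser illustrating a few generic settings
    label = "Restaurants"
    api_key = 'xxxxxxxx'
    project_id = 9999
    selection_id = 77777
    category = "Restauration"
    delete = True  # remove objects that disappeared from the selection
    provider = "apidae_restaurants"  # keeps deletion scoped to this parser's objects
    fill_empty_translated_fields = True  # copy values into empty translations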
Start import from command line
Just run:
sudo geotrek import HebergementParser
Change HebergementParser to match one of the class names in var/conf/parsers.py file.
You can add -v2 parameter to make the command more verbose (show progress).
Thanks to the cron utility you can configure automatic imports.
Start import from Geotrek-admin UI
Open the top right menu and click on Imports.
Import data from a remote Geotrek instance
Importing from a Geotrek instance works the same way as from a SIT. A use case for this is to aggregate data from several Geotrek-admin instances.
Danger
Importing data from a remote Geotrek instance does not work with dynamic segmentation: the instance where you import data must have dynamic segmentation disabled.
For example, to import treks from another instance, edit the /opt/geotrek-admin/var/conf/parsers.py file with the following content:
from geotrek.trekking.parsers import BaseGeotrekTrekParser  # adjust this import to your Geotrek version if needed

class DemoGeotrekTrekParser(BaseGeotrekTrekParser):
    url = "https://remote-geotrek-admin.net"  # replace url with remote instance url
    delete = False
    field_options = {
        'difficulty': {'create': True, },
        'route': {'create': True, },
        'themes': {'create': True},
        'practice': {'create': True},
        'accessibilities': {'create': True},
        'networks': {'create': True},
        'geom': {'required': True},
        'labels': {'create': True},
    }
Then run in command line
sudo geotrek import DemoGeotrekTrekParser
Treks are now imported into your own instance.
Import data from a file
You can also use some Geotrek commands to import data from a vector file handled by GDAL (https://gdal.org/drivers/vector/index.htm), e.g. ESRI Shapefile, GeoJSON, GeoPackage, etc.
Possible data include POIs, infrastructures, signages, cities, districts and restricted areas.
You must use these commands to import spatial data because of the dynamic segmentation, which will not be computed if you enter the data manually.
Here are the Geotrek commands available to import data from file:
- loadinfrastructure
- loadsignage
- loadpoi
- loadcities
- loaddistricts
- loadrestrictedareas
Usually, these commands come with the ability to match file attributes to model fields.
To get help about a command:
sudo geotrek help <subcommand>
Example: sudo geotrek help loadpoi
usage: manage.py loadpoi [-h] [--encoding ENCODING] [--name-field NAME_FIELD] [--type-field TYPE_FIELD] [--description-field DESCRIPTION_FIELD]
[--name-default NAME_DEFAULT] [--type-default TYPE_DEFAULT] [--version] [-v {0,1,2,3}] [--settings SETTINGS] [--pythonpath PYTHONPATH]
[--traceback] [--no-color] [--force-color] [--skip-checks]
point_layer
Load a layer with point geometries in a model
positional arguments:
point_layer
optional arguments:
-h, --help show this help message and exit
--encoding ENCODING, -e ENCODING
File encoding, default utf-8
--name-field NAME_FIELD, -n NAME_FIELD
Name of the field that contains the name attribute. Required or use --name-default instead.
--type-field TYPE_FIELD, -t TYPE_FIELD
Name of the field that contains the POI Type attribute. Required or use --type-default instead.
--description-field DESCRIPTION_FIELD, -d DESCRIPTION_FIELD
Name of the field that contains the description of the POI (optional)
--name-default NAME_DEFAULT
Default value for POI name. Use only if --name-field is not set
--type-default TYPE_DEFAULT
Default value for POI Type. Use only if --type-field is not set
--version show program's version number and exit
-v {0,1,2,3}, --verbosity {0,1,2,3}
Verbosity level; 0=minimal output, 1=normal output, 2=verbose output, 3=very verbose output
--settings SETTINGS The Python path to a settings module, e.g. "myproject.settings.main". If this isn't provided, the DJANGO_SETTINGS_MODULE environment variable will
be used.
--pythonpath PYTHONPATH
A directory to add to the Python path, e.g. "/home/djangoprojects/myproject".
--traceback Raise on CommandError exceptions
--no-color Don't colorize the command output.
--force-color Force colorization of the command output.
--skip-checks Skip system checks.
Delete attachment from disk
When an attachment (e.g. pictures) is removed, its file is not automatically removed from disk.
You have to run sudo geotrek clean_attachments manually or in a cron to remove old files.
After that, you should run sudo geotrek thumbnail_cleanup to remove old thumbnails.