-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Showing
5 changed files
with
13 additions
and
141 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,22 +1,26 @@ | ||
This docker image is used to generate TDB2 datasets used by Fuseki. | ||
## Overview | ||
This repository contains a Dockerfile to build an image which generates TDB2 datasets used by Fuseki. | ||
It includes: | ||
|
||
1. Validation of RDF files using Apache Jena RIOT | ||
Files that fail validation are renamed with the suffix `.error` - this prevents tdbloader attempting to load them. | ||
2. Creation of TDB2 datasets using `tdb2.tdbloader` or `tdb2.xloader` (for large datasets) | ||
3. Creation of a Spatial Index for use with Apache Jena GeoSPARQL | ||
4. Addition of Feature counts (via. a tdb2.update SPARQL update) - this is specific to OGC conformant datasets which contain geo:Features, and will be made optional in future versions (though the command will run harmlessly otherwise!). | ||
|
||
An additional set of instructions is also provided for running this Dockerfile on an EC2 instance - note this has only been necessary for very large datasets. | ||
|
||
Example command to build this image: | ||
`docker build -t tdb-generation .` | ||
`docker build -t tdb-generation:<tag> .` | ||
|
||
Example command to run this image locally. | ||
``` | ||
docker run -v $(pwd)/output:/databases -v $(pwd)/data:/rdf tdb2-generation | ||
docker run -v $(pwd)/output:/databases -v $(pwd)/data:/rdf tdb2-generation:<tag> | ||
``` | ||
|
||
Where: | ||
- `$(pwd)/output` is the directory where the TDB2 databases will be created | ||
- `$(pwd)/data` is the directory containing the RDF files to be loaded | ||
- `$(pwd)/data` is the directory containing the RDF files to be loaded | ||
|
||
## Text indexing | ||
|
||
Not currently supported - can be supported by adding functionality to optionally include a mounted config.ttl file. This is required as the text index is not configurable via the command line. |
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.