Nanopublications are small units of publishable information, used for scientific results and more. This dataset is based on a subset of a dump of all available nanopublications as of April 5, 2018. Only the first 5M of freely-licensed nanopubs were included. Each nanopub consists of several RDF graphs and thus is an RDF dataset. The included data is primarily from the biomedical domain. More information: paper, website.
This README is a snapshot of documentation for the latest development version of the dataset. Full documentation for all versions can be found on the website.
- Title: Nanopublications (en)
- Identifier:
nanopubs
- Has version:
dev
- Theme:
- Metadata (eurovoc:c_40f54e0c)
- Open data (eurovoc:c_5ea6e5c4)
- Open science (eurovoc:c_99a79cea)
- Research results (eurovoc:6306)
- Scientific research (eurovoc:2924)
- Creator:
- Authors of the included nanopublications (cited within the dataset) (1)
- Name: Authors of the included nanopublications (cited within the dataset)
- Tobias Kuhn (2)
- Name: Tobias Kuhn
- Comment: Author of the nanopublications dump (en)
- Homepage: https://orcid.org/0000-0002-1267-0234
- Piotr Sowiński (3)
- Name: Piotr Sowiński
- Nickname: Ostrzyciel
- Homepage:
- Authors of the included nanopublications (cited within the dataset) (1)
- License: https://spdx.org/licenses/CC-BY-SA-3.0
- Rights: This dataset only includes freely-licensed publications (CC BY, CC BY-SA, or ODbL licenses). Each nanopublication includes information about its original authors and is self-citing. The dataset is marked as under CC BY-SA, as this is the most restrictive license in the dataset. (en)
- Source: https://doi.org/10.5281/zenodo.1213293
- Date Issued: 2023-04-30
- Date Modified: 2024-08-29
- Landing page: nanopubs (dev)
- Conforms To: Metadata (https://w3id.org/riverbench/schema/metadata)
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- RDF stream type usage (1)
- Has stream element count: 5,000,000
- Has stream element split:
- Type: Stream elements split by topic (rb:TopicStreamElementSplit)
- Comment: Each stream element is one nanopublication. (en)
- Uses vocabulary:
- Conforms to W3C RDF 1.1 specification: yes
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: no
- Title: Full stream distribution
- Identifier:
stream-full
- Has file name:
stream_full.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 5,000,000
- Byte size: 1.0 GB
- Media type: application/trig
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/stream_full.tar.gz
- Title: Full Jelly distribution
- Identifier:
jelly-full
- Has file name:
jelly_full.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- RDF stream type usage (1)
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Jelly distribution (rb:jellyDistribution)
- Has stream element count: 5,000,000
- Byte size: 1.5 GB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/jelly_full.jelly.gz
- Title: Full flat distribution
- Identifier:
flat-full
- Has file name:
flat_full.nq.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Full distribution (rb:fullDistribution)
- Has stream element count: 5,000,000
- Byte size: 1.7 GB
- Media type: application/n-quads
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/flat_full.nq.gz
- Title: 1M elements stream distribution
- Identifier:
stream-1m
- Has file name:
stream_1M.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 1,000,000
- Byte size: 277.0 MB
- Media type: application/trig
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/stream_1M.tar.gz
- Title: 1M elements Jelly distribution
- Identifier:
jelly-1m
- Has file name:
jelly_1M.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- RDF stream type usage (1)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 332.7 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/jelly_1M.jelly.gz
- Title: 1M elements flat distribution
- Identifier:
flat-1m
- Has file name:
flat_1M.nq.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 384.6 MB
- Media type: application/n-quads
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/flat_1M.nq.gz
- Title: 100K elements stream distribution
- Identifier:
stream-100k
- Has file name:
stream_100K.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 100,000
- Byte size: 25.6 MB
- Media type: application/trig
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/stream_100K.tar.gz
- Title: 100K elements Jelly distribution
- Identifier:
jelly-100k
- Has file name:
jelly_100K.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- RDF stream type usage (1)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 29.6 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/jelly_100K.jelly.gz
- Title: 100K elements flat distribution
- Identifier:
flat-100k
- Has file name:
flat_100K.nq.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 35.7 MB
- Media type: application/n-quads
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/flat_100K.nq.gz
- Title: 10K elements stream distribution
- Identifier:
stream-10k
- Has file name:
stream_10K.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 10,000
- Byte size: 2.6 MB
- Media type: application/trig
- Packaging format: application/tar
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/stream_10K.tar.gz
- Title: 10K elements Jelly distribution
- Identifier:
jelly-10k
- Has file name:
jelly_10K.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of RDF datasets. Each RDF dataset corresponds to one nanopublication. (en)
- Has stream type: RDF dataset stream (stax:datasetStream)
- RDF stream type usage (1)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 2.9 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/jelly_10K.jelly.gz
- Title: 10K elements flat distribution
- Identifier:
flat-10k
- Has file name:
flat_10K.nq.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of quads. (en)
- Has stream type: Flat RDF quad stream (stax:flatQuadStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 3.5 MB
- Media type: application/n-quads
- Compression format: application/gzip
- Download URL: https://w3id.org/riverbench/datasets/nanopubs/dev/files/flat_10K.nq.gz