An LSTM-based Machine Translation Approach for Question Answering over Knowledge Graphs.
Install git-lfs on your machine, then fetch all files and submodules.
git lfs fetch
git lfs checkout
git submodule update --init
pip install -r requirements.txt
You can extract pre-generated data from data/monument_300.zip and data/monument_600.zip into folders with the respective names.
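If you prefer to script this step, here is a minimal Python sketch; it assumes the archives unpack directly into flat file lists (no nested top-level folder), which may differ from the actual zip layout.

```python
# Sketch: extract the pre-generated datasets into folders named after the
# archives. Assumes the zips do not already contain a top-level folder.
import zipfile

for name in ("monument_300", "monument_600"):
    with zipfile.ZipFile(f"data/{name}.zip") as archive:
        archive.extractall(f"data/{name}")
```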
The template used in the paper can be found in a file such as annotations_monument.tsv. data/monument_300 will be the ID of the working dataset used throughout this tutorial. To generate the training data, launch the following commands.
mkdir data/monument_300
python generator.py --templates data/annotations_monument.csv --output data/monument_300
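As a rough illustration of what the generator does, each template pairs a natural-language question with a SPARQL query, and both contain a placeholder that is filled with entities from the knowledge graph to produce training pairs. The sketch below is illustrative only: the `<A>` placeholder token and the entity binding are assumptions, not the actual template format or generator.py logic.

```python
# Illustrative sketch of template expansion (not the actual generator.py code).
# The "<A>" placeholder and the example binding below are assumptions.

def expand(question_template, query_template, bindings):
    """Yield one (English, SPARQL) training pair per (label, URI) binding."""
    for label, uri in bindings:
        yield (question_template.replace("<A>", label),
               query_template.replace("<A>", uri))

pairs = expand("where is <A> located in?",
               "SELECT ?x WHERE { <A> dbo:location ?x }",
               [("edward vii monument", "dbr:Edward_VII_Monument")])  # hypothetical URI
for english, sparql in pairs:
    print(english, "\t", sparql)
```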
Launch the following command to build the vocabularies for the two languages (i.e., English and SPARQL) and to split the data into train, dev, and test sets.
./generate.sh data/monument_300
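Conceptually, this step collects the token vocabulary of each language and partitions the parallel corpus. The Python sketch below only illustrates that idea; the whitespace tokenisation and the split ratios are simplifying assumptions, and generate.sh remains the actual implementation.

```python
# Simplified sketch of vocabulary building and data splitting (illustrative
# only; generate.sh is the real implementation). Ratios are assumptions.
import random

def build_vocab(sentences):
    """Collect the set of whitespace-separated tokens."""
    return sorted({tok for line in sentences for tok in line.split()})

def split(pairs, dev_frac=0.1, test_frac=0.1, seed=42):
    """Shuffle (English, SPARQL) pairs and return (train, dev, test)."""
    pairs = list(pairs)
    random.Random(seed).shuffle(pairs)
    n_dev, n_test = int(len(pairs) * dev_frac), int(len(pairs) * test_frac)
    return (pairs[n_dev + n_test:],        # train
            pairs[:n_dev],                 # dev
            pairs[n_dev:n_dev + n_test])   # test
```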
Now go back to the initial directory and launch train.sh
to train the model. The first parameter is the prefix of the data directory and the second parameter is the number of training epochs.
./train.sh data/monument_300 12000
This command will create a model directory called data/monument_300_model.
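Under the hood, the approach is a sequence-to-sequence LSTM that translates English token sequences into SPARQL token sequences. The Keras sketch below is only a generic illustration of that architecture, not the network trained by train.sh; the vocabulary sizes and layer widths are placeholder assumptions.

```python
# Generic LSTM encoder-decoder sketch (illustrative only; not the model
# trained by train.sh). All sizes below are placeholder assumptions.
import tensorflow as tf

SRC_VOCAB, TGT_VOCAB, EMBED, UNITS = 3000, 1500, 128, 256

# Encoder: embed the English question and keep the final LSTM states.
enc_in = tf.keras.Input(shape=(None,), dtype="int32")
enc_emb = tf.keras.layers.Embedding(SRC_VOCAB, EMBED)(enc_in)
_, state_h, state_c = tf.keras.layers.LSTM(UNITS, return_state=True)(enc_emb)

# Decoder: generate SPARQL tokens conditioned on the encoder states.
dec_in = tf.keras.Input(shape=(None,), dtype="int32")
dec_emb = tf.keras.layers.Embedding(TGT_VOCAB, EMBED)(dec_in)
dec_out, _, _ = tf.keras.layers.LSTM(
    UNITS, return_sequences=True, return_state=True)(
    dec_emb, initial_state=[state_h, state_c])
logits = tf.keras.layers.Dense(TGT_VOCAB)(dec_out)

model = tf.keras.Model([enc_in, dec_in], logits)
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True))
```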
Predict the SPARQL query for a given question with a given model.
./ask.sh data/monument_300 "where is edward vii monument located in?"
Tests can be run, but only from within the root directory.
py.test *.py
- Components of the Adam Medical platform, partly developed by Jose A. Alvarado at Graphen (including a humanoid robot called Dr Adam), rely on NSpM technology.
- The Telegram NSpM chatbot offers an integration of NSpM with the Telegram messaging platform.
- Since 2018, the Google Summer of Code program has supported 6 students working on the NSpM-backed project "A neural question answering model for DBpedia".
- A question answering system was implemented on top of NSpM by Muhammad Qasim.
@inproceedings{soru-marx-2017,
  author    = "Tommaso Soru and Edgard Marx and Diego Moussallem and Gustavo Publio and Andr\'e Valdestilhas and Diego Esteves and Ciro Baron Neto",
  title     = "{SPARQL} as a Foreign Language",
  year      = "2017",
  booktitle = "13th International Conference on Semantic Systems (SEMANTiCS 2017) - Posters and Demos",
  url       = "https://arxiv.org/abs/1708.07624",
}
- NAMPI Website: https://uclnlp.github.io/nampi/
- arXiv: https://arxiv.org/abs/1806.10478
@inproceedings{soru-marx-nampi2018,
  author    = "Tommaso Soru and Edgard Marx and Andr\'e Valdestilhas and Diego Esteves and Diego Moussallem and Gustavo Publio",
  title     = "Neural Machine Translation for Query Construction and Composition",
  year      = "2018",
  booktitle = "ICML Workshop on Neural Abstract Machines \& Program Induction (NAMPI v2)",
  url       = "https://arxiv.org/abs/1806.10478",
}
@inproceedings{panchbhai-2020,
  author    = "Anand Panchbhai and Tommaso Soru and Edgard Marx",
  title     = "Exploring Sequence-to-Sequence Models for {SPARQL} Pattern Composition",
  year      = "2020",
  booktitle = "First Indo-American Knowledge Graph and Semantic Web Conference",
  url       = "https://arxiv.org/abs/2010.10900",
}
- Primary contacts: Tommaso Soru and Edgard Marx.
- Neural SPARQL Machines mailing list.
- Join the conversation on Gitter.
- Follow the project on ResearchGate.
- Follow Liber AI Research on Twitter.