Source code accompanying our NAACL 2018 paper: TypeSQL: Knowledge-based Type-Aware Neural Text-to-SQL Generation
- The code uses Python 2.7 and PyTorch 0.2.0 with GPU support.
- Install the Python dependencies:
pip install -r requirements.txt
- Install PyTorch 0.2.0:
conda install pytorch=0.2.0 cuda91 -c pytorch
Replace cuda91 with whichever CUDA version you have installed.
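A quick sanity check that the install can see the GPU (this snippet is only an illustration, not part of the repo):

```python
# Check that PyTorch 0.2.0 is installed and CUDA is visible.
import torch

print(torch.__version__)           # expect 0.2.0
print(torch.cuda.is_available())   # expect True on a CUDA-enabled machine
```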
- Download the zipped data file from Google Drive and put it in the root directory.
- Download the pretrained GloVe embeddings and the paraphrase embeddings
para-nmt-50m/data/paragram_sl999_czeng.txt
Put the unzipped glove and para-nmt-50m folders in the root directory.
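Both files use the plain text embedding layout (one token per line followed by its vector values). If you want to inspect them, here is a minimal loader sketch; the file name in the example is an assumption, so point it at whichever file you unzipped:

```python
# Minimal sketch: load a word-embedding text file (token followed by floats on each line).
import numpy as np

def load_embeddings(path):
    vectors = {}
    with open(path) as f:
        for line in f:
            parts = line.rstrip().split(' ')
            vectors[parts[0]] = np.array(parts[1:], dtype=np.float32)
    return vectors

# Example (file name is an assumption):
# glove = load_embeddings('glove/glove.42B.300d.txt')
```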
- To use knowledge graph types:
mkdir saved_model_kg
python train.py --sd saved_model_kg
- To use DB content types:
mkdir saved_model_con
python train.py --sd saved_model_con --db_content 1
- Test the model with knowledge graph types:
python test.py --sd saved_model_kg
- Test the model with DB content types:
python test.py --sd saved_model_con --db_content 1
- Get a Google Knowledge Graph Search API Key by following the link
- Search knowledge graph to get entities:
python get_kg_entities.py [Google freebase API Key] [input json file] [output json file]
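To verify your key works before running the script, a single Knowledge Graph Search API request looks roughly like this (independent of get_kg_entities.py; the query and limit are placeholder values):

```python
# Minimal sketch: one Google Knowledge Graph Search API request to check the API key.
import json
import requests

API_KEY = 'YOUR_API_KEY'  # replace with your key
params = {'query': 'new york city', 'key': API_KEY, 'limit': 3}
resp = requests.get('https://kgsearch.googleapis.com/v1/entities:search', params=params)
print(json.dumps(resp.json(), indent=2))
```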
- Use the detected knowledge graph entities and DB content to group questions and create type attributes in the data files:
python data_process_test.py --tok [output json file generated by get_kg_entities.py] --table TABLE_FILE --out OUTPUT_FILE [--data_dir DATA_DIRECTORY] [--out_dir OUTPUT_DIRECTORY]
python data_process_train_dev.py --tok [output json file generated by get_kg_entities.py] --table TABLE_FILE --out OUTPUT_FILE [--data_dir DATA_DIRECTORY] [--out_dir OUTPUT_DIRECTORY]
The implementation is based on SQLNet. Please cite it as well if you use this code.