Skip to content

Latest commit

 

History

History
159 lines (134 loc) · 6.77 KB

File metadata and controls

159 lines (134 loc) · 6.77 KB

BERT Large FP32 inference

This document has instructions for running BERT FP32 inference using Intel-optimized TensorFlow.

Datasets

BERT Large Data

Download and unzip the BERT Large uncased (whole word masking) model from the google bert repo. Then, download the Stanford Question Answering Dataset (SQuAD) dataset file dev-v1.1.json into the wwm_uncased_L-24_H-1024_A-16 directory that was just unzipped.

If you run on Windows, please use a browser to download and extract the dataset files. For Linux, run:

wget https://storage.googleapis.com/bert_models/2019_05_30/wwm_uncased_L-24_H-1024_A-16.zip
unzip wwm_uncased_L-24_H-1024_A-16.zip

wget https://rajpurkar.github.io/SQuAD-explorer/dataset/dev-v1.1.json -P wwm_uncased_L-24_H-1024_A-16

Set the DATASET_DIR to point to that directory when running BERT Large inference using the SQuAD data.

Quick Start Scripts

Script name Description
fp32_benchmark.sh This script runs bert large fp32 inference.
fp32_profile.sh This script runs fp32 inference in profile mode.
fp32_accuracy.sh This script is runs bert large fp32 inference in accuracy mode.
multi_instance_batch_inference.sh A multi-instance run that uses all the cores for each socket for each instance with a batch size of 128.
multi_instance_online_inference.sh A multi-instance run that uses 4 cores for each instance with a batch size of 1.

Run the model

Setup your environment using the instructions below, depending on if you are using AI Kit:

Setup using AI Kit on Linux Setup without AI Kit on Linux Setup without AI Kit on Windows

To run using AI Kit on Linux you will need:

  • numactl
  • unzip
  • wget
  • Activate the `tensorflow` conda environment
    conda activate tensorflow

To run without AI Kit on Linux you will need:

  • Python 3
  • intel-tensorflow>=2.5.0
  • git
  • numactl
  • unzip
  • wget
  • A clone of the Model Zoo repo
    git clone https://github.com/IntelAI/models.git

To run without AI Kit on Windows you will need:

After your setup is done, download and unzip the pretrained model. If you run on Windows, please use a browser to download and extract the checkpoint files and pretrained model using the links below. The path to this directory should be set as the CHECKPOINT_DIR before running quickstart scripts. For Linux, run:

wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v1_8/bert_large_checkpoints.zip
unzip bert_large_checkpoints.zip
CHECKPOINT_DIR=$(pwd)/bert_large_checkpoints

Download the frozen graph. The path to this file should be set in the PRETRAINED_MODEL environment variable before running the model.

wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v2_7_0/fp32_bert_squad.pb
PRETRAINED_MODEL=$(pwd)/fp32_bert_squad.pb

Next, set environment variables with paths to the dataset, checkpoint files, pretrained model, and an output directory, then run a quickstart script on either Linux or Windows systems. See the list of quickstart scripts for details on the different options.

Run on Linux

The snippet below shows how to run a quickstart script on Linux:

# cd to your model zoo directory
cd models

export DATASET_DIR=<path to the dataset being used>
export OUTPUT_DIR=<path to the directory where log files will be saved>
export CHECKPOINT_DIR=<path to the pretrained model checkpoints>
export PRETRAINED_MODEL=<path to the frozen graph>
# For a custom batch size, set env var `BATCH_SIZE` or it will run with a default value.
export BATCH_SIZE=<customized batch size value>

# Run a script for your desired usage
./quickstart/language_modeling/tensorflow/bert_large/inference/cpu/fp32/<script name>.sh

Run on Windows

The snippet below shows how to run a quickstart script on Windows systems:

Note: You may use cygpath to convert the Windows paths to Unix paths before setting the environment variables. As an example, if the dataset location on Windows is D:\user\wwm_uncased_L-24_H-1024_A-16, convert the Windows path to Unix as shown:

cygpath D:\user\wwm_uncased_L-24_H-1024_A-16
/d/user/wwm_uncased_L-24_H-1024_A-16

Then, set the DATASET_DIR environment variable set DATASET_DIR=/d/user/wwm_uncased_L-24_H-1024_A-16.

# cd to your model zoo directory
cd models

set DATASET_DIR=<path to the dataset being used>
set OUTPUT_DIR=<directory where log files will be saved>
set CHECKPOINT_DIR=<path to the pretrained model checkpoints>
set PRETRAINED_MODEL=<path to the frozen graph>
# For a custom batch size, set env var `BATCH_SIZE` or it will run with a default value.
set BATCH_SIZE=<customized batch size value>

# Run a script for your desired usage (`fp32_benchmark.sh`, `fp32_accuracy.sh`, or `fp32_profile.sh`)
bash quickstart\language_modeling\tensorflow\bert_large\inference\cpu\fp32\<script name>.sh

Additional Resources