Python module of substring_intersect
, to match a dataset of substrings against a datasets of search strings, using the ahocorasick
lib.
Spike.py uses the input file name as the search strings dataset, and additionally grabs substrings from it for testing.
time python spike.py lots_zips.csv