Skip to content

NYPL/research-catalog-indexer

Repository files navigation

Purpose

This app manipulates MarcInJson records, writing the result to the Elastic Search index that powers the Discovery API. It was built to replace the trifecta Discovery Store Poster (aka PCDM store updater)/Discovery API Indexer/Discovery Hybrid Indexer.

Contributing

This repo uses the Main-QA-Production

  1. Create feature branch off main
  2. Compute next logical version and update README.md, CHANGELOG.md, package.json, etc.
  3. Create PR against main
  4. After merging the PR, git tag main with new version number.
  5. Merge main > qa
  6. After QC signoff, merge qa > production

Deployment

This app uses Travis-CI and terraform for deployment. Code pushed to qa and main trigger deployments to qa and production, respectively.

Testing

To run several tests of this app's ability to handle various Kinesis events and write relevant ES documents:

nvm use
npm test

Local invocation

To run lambda locally, use this script:

sam local invoke --profile nypl-digital-dev -t sam.local.yml -e test/sample-events/b10001936.json

Running

Bulk Indexing

The bulk-indexer can be used to index lots of records at once by direct BibService SQL query.

For example, to reindex the first 10K NYPL bibs with marc 001 in QA:

node scripts/bulk-index.js --envfile config/qa-bulk-index.env --type bib --hasMarc 001 --nyplSource sierra-nypl --limit 10000

To enable NewRelic reporting during bulk-indexing:

  • Install the NR agent:
    • Follow the "guided install" link on this page
    • Follow instructions for your OS (e.g. for MacOS, the procedure is a long curl-to-bash command)
  • Invoke like: source ./scripts/decrypt-newrelic-key.sh; DISABLE_CIRC_DELETE=true node scripts/bulk-index.js --envfile config/qa-bulk-index.env --type bib --hasMarc 001 --nyplSource sierra-nypl --limit 10000 (Note that NR breaks deletes in the current elasticsearch client, so for now we have to add DISABLE_CIRC_DELETE=true as above.)
  • While running, view live data at NR > RC > Transactions

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages