Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reconfigure current VA sitemap to support multiple sitemap files #15333

Closed
1 of 8 tasks
Tracked by #17673 ...
timcosgrove opened this issue Sep 20, 2023 · 0 comments
Closed
1 of 8 tasks
Tracked by #17673 ...

Reconfigure current VA sitemap to support multiple sitemap files #15333

timcosgrove opened this issue Sep 20, 2023 · 0 comments
Assignees

Comments

@timcosgrove
Copy link
Contributor

timcosgrove commented Sep 20, 2023

Requirements

We should set up our primary sitemap file as an index pointing to other sitemap files, so that different systems can manage their own sitemaps without interfering with each other.

Acceptance criteria

Preview Give feedback
  1. 0 of 3
    Accelerated Publishing
    timcosgrove tjheffner

Background & implementation details

This will be a production change to Content Build and it should be tested and deployed with the care that entails.

Currently both Content Build and Next Build generate sitemaps for the content they each manage. However, they have no awareness of each other's content, so in order to generate a complete sitemap, we need to combine them.

Currently both Content Build and Next Build generate their sitemap to sitemap.xml, which contains a large number of URLs. This should be changed such that both systems generate their large sitemap file to a specifically named sitemap file:

  • Content Build generates sitemap-cb.xml
  • Next Build generates sitemap-nb.xml

Then, additionally, each system must generate a new sitemap.xml that points to each of these. This sitemap will be a 'sitemap index' that points to the two files above.

See documentation on Sitemap indexes: https://developers.google.com/search/docs/crawling-indexing/sitemaps/large-sitemaps

We should confirm that a sitemap index can contain a reference to a file that is not available and that this does not cause problems. Initially, va.gov will not have the Next Build version of the file available for loading.

Since there's not really a decision yet on which system will initially own the sitemap index at sitemap.xml, it is probably easiest for both systems to output the same index file.

Implementation tasks

Preview Give feedback
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants