Sitemap Generation
Purpose
This page provides reference-only guidance for generating sitemaps and applying exclusion rules across GMG Engine documentation and related deployments. It describes common sitemap formats and organizational patterns without implying indexing guarantees, ranking outcomes, crawler compliance, traffic changes, or external platform behavior.
This page is descriptive and informational only and must not be interpreted as a guarantee, certification, endorsement, or system-wide claim.
What a Sitemap Typically Represents
A sitemap is a published list of URLs (and optional metadata) intended to help automated systems discover or organize content. A sitemap is a hint surface: it may assist discovery, but it does not control whether, when, or how any external system indexes or surfaces content.
Human Sitemap vs AI Sitemap
Some deployments maintain separate sitemap groupings for different audiences:
- Human sitemap: a navigational index intended for users (often a page or curated list).
- AI sitemap: a machine-oriented listing intended to expose structured documentation surfaces.
These are organizational choices only. The presence of a “human” or “AI” sitemap must not be interpreted as a promise of crawler behavior, indexing treatment, or search visibility.
Generation and Grouping Rules
Sitemap generation typically follows stable routing and canonical URL rules. Grouping may be done by content type (docs, guides, policies), by section, or by deployment scope (main site vs whitelabel).
When grouping is used, treat it as a content organization method. It does not imply content importance, authority, or external ranking priority.
Exclusion Rules
Exclusion rules are used to omit URLs from a sitemap listing (for example: internal-only routes, duplicates, temporary pages, or pages not intended for broad discovery). Exclusion reduces what is listed, but it does not guarantee that excluded pages cannot be found or indexed by external systems.
If “exclude from AI sitemap” is referenced, interpret it as “do not list here,” not as “block access,” “prevent indexing,” or “hide content.”
Disallowed Inferences
Do not infer indexing, de-duplication, ranking position, or traffic outcomes from sitemap presence or structure.
Do not treat sitemap inclusion as proof of correctness, approval, compliance, or endorsement.
Do not treat exclusion rules as enforcement, access control, or guaranteed removal from third-party indexes.
Validation Checklist
Are URLs listed canonical and stable for the intended deployment scope?
Are exclusions described as listing choices rather than guarantees about external discovery?
Is it explicit that external systems may ignore, partially use, or interpret sitemaps differently?
Boundary Conditions
This page does not define crawler rules, indexing policies, ranking algorithms, or search engine behavior. It does not provide operational promises about discoverability or visibility.
Non-Goals
This page does not guarantee that any URL will be indexed, ranked, or surfaced, and it does not recommend specific search platforms or crawler tooling.
For a catalog of evidence categories and cross-page interpretation boundaries, see the Master Evidence Registry.