Glossary
Technical SEO Terms
69 essential terms covering crawling, indexing, site architecture, Core Web Vitals, structured data, and server configuration.
13Crawling & Indexing
- Crawling
- The process by which search engine bots (spiders/crawlers) discover and download web pages by following links. Googlebot is Google's primary crawler.
- Indexing
- The process of storing and organizing crawled content in a search engine's database so it can be retrieved for relevant queries. A page must be indexed to appear in search results.
- Crawl Budget
- The number of pages a search engine will crawl on your site within a given timeframe. Affected by site size, page speed, server capacity, and URL importance.
- Crawl Rate
- The speed at which search engine bots request pages from your server. Can be adjusted in Google Search Console to prevent server overload.
- Crawl Depth
- How many levels deep into your site architecture a crawler will go. Important pages should be within 3-4 clicks of the homepage.
- Robots.txt
- A text file at your domain's root that tells search engine crawlers which pages or sections they can or cannot access. Uses Disallow and Allow directives.
- XML Sitemap
- An XML file listing important URLs on your site to help search engines discover and crawl content efficiently. Should be submitted to Google Search Console.
- Sitemap Index
- A sitemap that references multiple sitemap files. Required when you have more than 50,000 URLs or your sitemap exceeds 50MB.
- Noindex
- A meta robots directive telling search engines not to include a page in their index. Use for thank you pages, admin areas, or duplicate content.
- Nofollow
- A link attribute or meta directive telling search engines not to pass PageRank through a link or follow links on a page.
- Index Bloat
- When search engines index too many low-value or duplicate pages, diluting your site's overall quality signals and wasting crawl budget.
- Orphan Pages
- Pages that have no internal links pointing to them, making them difficult for crawlers and users to discover.
- Crawl Errors
- Issues that prevent search engines from accessing pages, including 404 errors, server errors, redirect errors, and blocked resources.
10URL & Redirect Management
- 301 Redirect
- A permanent redirect that passes 90-99% of link equity to the destination URL. Use when pages are permanently moved or domains are changed.
- 302 Redirect
- A temporary redirect that may not pass full link equity. Use only for genuinely temporary situations like A/B testing or maintenance.
- 307 Redirect
- An HTTP status code for temporary redirects that preserves the request method. The HTTP/1.1 equivalent of 302.
- 308 Redirect
- A permanent redirect that preserves the request method. The permanent equivalent of 307.
- Redirect Chain
- When one redirect leads to another redirect, creating a chain. Each hop in the chain can slow page load and dilute link equity. Keep chains under 3 hops.
- Redirect Loop
- When redirects create a circular pattern (A→B→A), causing pages to never load. A critical error that must be fixed immediately.
- Canonical URL
- The preferred version of a page when multiple URLs contain similar content. Specified with rel="canonical" link element to consolidate ranking signals.
- Self-Referencing Canonical
- A canonical tag that points to the same URL it's on. Considered a best practice to explicitly declare the preferred version.
- URL Parameter Handling
- Configuration telling search engines how to handle URL parameters (sorting, filtering, tracking). Set in Google Search Console or via robots.txt.
- Clean URLs
- URLs without excessive parameters, session IDs, or unnecessary characters. Improves user experience and click-through rates.
11Core Web Vitals & Performance
- Core Web Vitals
- Google's set of user experience metrics: LCP (loading), FID/INP (interactivity), and CLS (visual stability). Part of page experience ranking signals.
- Largest Contentful Paint (LCP)
- Measures loading performance—the time it takes for the largest content element to become visible. Target: under 2.5 seconds.
- First Input Delay (FID)
- Measures interactivity—the time from when a user first interacts to when the browser responds. Target: under 100ms. Being replaced by INP.
- Interaction to Next Paint (INP)
- Measures overall page responsiveness throughout the user's visit, not just first interaction. Replaced FID in March 2024. Target: under 200ms.
- Cumulative Layout Shift (CLS)
- Measures visual stability—unexpected layout shifts during page load. Caused by images without dimensions, dynamic content, web fonts. Target: under 0.1.
- Time to First Byte (TTFB)
- The time from the user's request to receiving the first byte of response. Affected by server performance, CDN, and backend processing. Target: under 800ms.
- First Contentful Paint (FCP)
- The time when the first content element (text, image) is rendered. An early indicator of perceived loading speed.
- Total Blocking Time (TBT)
- The total time between FCP and Time to Interactive when the main thread was blocked. Lab metric correlating with FID.
- Speed Index
- Measures how quickly content is visually displayed during page load. Lower scores indicate faster perceived performance.
- Critical Rendering Path
- The sequence of steps the browser takes to convert HTML, CSS, and JavaScript into pixels on screen. Optimizing this improves perceived performance.
- Render-Blocking Resources
- CSS and JavaScript files that prevent the page from rendering until they're loaded. Should be minimized or deferred.
7Site Architecture
- Site Architecture
- The hierarchical structure of pages and how they're organized and linked together. Affects crawlability, user experience, and link equity distribution.
- Flat Architecture
- A site structure where most pages are few clicks from the homepage. Ensures important pages are easily discoverable and receive more link equity.
- Silo Structure
- Organizing content into themed categories with strong internal linking within each silo and limited cross-linking between silos.
- Hub and Spoke Model
- A content architecture with a central pillar page (hub) linking to related subtopic pages (spokes), which link back to the hub.
- Faceted Navigation
- Navigation allowing users to filter results by multiple attributes (size, color, price). Can cause index bloat if not properly managed.
- Pagination
- Splitting content across multiple pages. Use rel="next"/rel="prev" (deprecated but still useful) or load more/infinite scroll with proper implementation.
- JavaScript Navigation
- Navigation built with JavaScript that may not be crawlable by search engines. Ensure important links are in the HTML or use server-side rendering.
11Structured Data & Rich Results
- Structured Data
- Standardized code (usually JSON-LD) that helps search engines understand page content and enables rich results in SERPs.
- Schema.org
- A collaborative vocabulary for structured data markup, supported by Google, Bing, Yahoo, and Yandex.
- JSON-LD
- JavaScript Object Notation for Linked Data—Google's preferred format for structured data. Placed in a <script> tag in the page head or body.
- Rich Results
- Enhanced search results with additional visual elements like stars, images, prices, or FAQs. Enabled by structured data.
- Featured Snippets
- Highlighted boxes at the top of search results showing extracted content that directly answers a query. Position zero.
- Knowledge Panel
- Information boxes appearing on the right side of search results for entities (people, places, organizations, things).
- FAQ Schema
- Structured data markup for frequently asked questions, enabling expandable Q&A in search results.
- How-To Schema
- Structured data for step-by-step instructions, enabling rich results with images and steps.
- Product Schema
- Structured data for products including price, availability, and reviews. Enables rich product snippets.
- Article Schema
- Structured data for news articles, blog posts, and other article content. Can enable rich results and Google News inclusion.
- Breadcrumb Schema
- Structured data that shows page hierarchy in search results, improving click-through rates and user understanding.
9Server & Hosting
- HTTP Status Codes
- Three-digit codes indicating the result of a server request. 200 (OK), 301 (Permanent Redirect), 404 (Not Found), 500 (Server Error), etc.
- HTTPS
- Secure HTTP using SSL/TLS encryption. A ranking factor since 2014. All sites should use HTTPS for security and SEO.
- SSL Certificate
- A digital certificate that enables HTTPS encryption. Available free from Let's Encrypt or paid from certificate authorities.
- CDN (Content Delivery Network)
- A network of servers distributed globally to deliver content from locations closer to users. Improves page speed and reliability.
- Server Response Time
- How quickly your server responds to requests. Affected by hosting quality, database optimization, and code efficiency.
- Edge Computing
- Processing data closer to the user at the network edge, reducing latency. Used by modern CDNs and serverless platforms.
- Caching
- Storing copies of pages or resources for faster retrieval. Includes browser caching, server caching, and CDN caching.
- Gzip/Brotli Compression
- Server-side compression that reduces file sizes for faster transfer. Brotli typically achieves 20-26% better compression than Gzip.
- HTTP/2 and HTTP/3
- Modern HTTP protocols enabling faster page loads through multiplexing, header compression, and (HTTP/3) improved connection handling.
8Mobile & International
- Mobile-First Indexing
- Google primarily uses the mobile version of content for indexing and ranking. All sites are now on mobile-first indexing.
- Responsive Design
- A design approach where layouts adapt to different screen sizes using CSS media queries. Google's recommended mobile configuration.
- AMP (Accelerated Mobile Pages)
- A Google-backed framework for fast-loading mobile pages. No longer required for Top Stories but still useful for specific use cases.
- Mobile Usability
- How well a site functions on mobile devices. Issues include small tap targets, horizontal scrolling, and unplayable content.
- Hreflang
- HTML attribute telling search engines which language and regional targeting a page uses. Essential for international SEO.
- ccTLD
- Country Code Top-Level Domain (e.g., .uk, .de, .fr). Provides strong geographic signals but requires separate domain management.
- Subdirectory vs Subdomain
- International content can use subdirectories (example.com/de/) or subdomains (de.example.com). Subdirectories consolidate domain authority.
- Geotargeting
- Targeting content to users in specific geographic locations. Set in Google Search Console for generic TLDs.
Related resources
Tools, checklists, and glossaries to strengthen your technical SEO skills.
Technical SEO guide
Complete guide to crawling, indexing, and site architecture.
Robots.txt validator
Validate and test your robots.txt file for crawl issues.
Page indexing checker
Check if your pages are indexed by Google.
Site audit tool
Comprehensive SEO audit with performance and technical analysis.
Broken link checker
Find and fix broken links that hurt your crawlability.
SEO audit checklist
Step-by-step checklist for a thorough technical audit.
On-page SEO glossary
HTML elements, content optimization, and UX terms.
Off-page SEO glossary
Backlinks, link building, and domain authority terms.