Skip to content

Add robots.txt file to project #35

@Pojzo

Description

@Pojzo

The site currently does not serve a robots.txt file. When crawlers such as Googlebot and Bingbot visit the domain, they receive a 404 for https://numli.net/robots.txt. While this does not break the site, it is best practice to provide a robots.txt to:
• Explicitly allow or restrict crawler access to certain paths.
• Prevent unnecessary 404s from automated crawler requests.
• Provide the location of the sitemap to improve SEO.

Expected behavior

A robots.txt file should be available at the root of the domain (/robots.txt) with sensible defaults. For example:

User-agent: *
Disallow:
Sitemap: https://<domain>/sitemap.xml

Benefits
• Improves SEO and site discoverability.
• Reduces noise in server logs from missing file requests.
• Provides flexibility to disallow crawling of private or irrelevant paths in the future.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions