Skip to content

Normalize online location variants to a canonical value #17

@Metrohan

Description

@Metrohan

What to build / Ne yapılacak

EN: Events that take place online currently have inconsistent location values such as "Zoom", "online", "Microsoft Teams", "Webinar", etc. A normalizer should be added inside process_scraped_events (or at scraper output time) that detects known online platform names and keywords, and collapses them all into a single canonical value (e.g. "Online").

TR: Çevrimiçi etkinliklerin konum alanına şu an "Zoom", "online", "Microsoft Teams", "Webinar" gibi tutarsız değerler yazılıyor. process_scraped_events içine (ya da scraper çıktısında) bilinen platform adlarını ve anahtar kelimeleri tespit eden bir normalizer eklenmeli; tüm bu varyantlar tek bir standart değere (örn. "Online") indirgenmeli.

Acceptance criteria

  • A reusable normalize_location(raw: str) -> str helper (or equivalent inline logic) maps known online keywords/platform names to "Online".
  • All scrapers pass their location output through this normalizer before persisting.
  • Unit tests cover the main variants: "Zoom", "zoom", "Microsoft Teams", "Webinar", "online""Online".
  • Physical venue strings (e.g. "İstanbul", "Ankara ODTÜ") are left unchanged.

Blocked by

None — can start immediately.

Metadata

Metadata

Assignees

No one assigned

    Labels

    data-qualityData accuracy and consistency issuesenhancementNew feature or requestscraperRelated to scraper modules

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions