Add geospatial sample provenance guard #379

Open
taherdhanera wants to merge 1 commit into SCIBASE-AI:main from taherdhanera:taherdhanera/scibase-17-geospatial-sample-provenance-guard

@taherdhanera taherdhanera commented May 22, 2026

/claim #17

Summary

Adds geospatial-sample-provenance-guard, a self-contained Scientific Knowledge Graph Integration slice that validates field-sample and specimen location graph edges before they appear on entity pages or public discovery recommendations.

The guard evaluates:

  • latitude/longitude ranges
  • CRS normalization to graph policy
  • country/coordinate consistency
  • sensitive-site coordinate precision and redaction needs
  • sample voucher/specimen accession links
  • dataset DOI resolution and sample-to-dataset edge alignment
  • collection-date plausibility
  • public recommendation safety for sample-location graph paths
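
The first and fourth checks in this list can be sketched as follows. This is a minimal illustration only: the function names, issue codes, and the two-decimal precision cap for sensitive sites are assumptions, not the module's actual API or policy values.

```javascript
// Hypothetical sketch: names, issue codes, and thresholds are assumptions.

// Assumed policy: sensitive sites are published with at most 2 decimal places.
const MAX_SENSITIVE_DECIMALS = 2;

// Latitude must lie in [-90, 90] and longitude in [-180, 180].
function checkCoordinateRange(lat, lon) {
  const issues = [];
  if (!Number.isFinite(lat) || lat < -90 || lat > 90) {
    issues.push("latitude_out_of_range");
  }
  if (!Number.isFinite(lon) || lon < -180 || lon > 180) {
    issues.push("longitude_out_of_range");
  }
  return issues;
}

// Round a coordinate to a fixed number of decimal places.
function roundTo(value, places) {
  const factor = 10 ** places;
  return Math.round(value * factor) / factor;
}

// Reject out-of-range coordinates; coarsen coordinates of sensitive sites.
function evaluateSampleLocation(sample) {
  const issues = checkCoordinateRange(sample.lat, sample.lon);
  if (issues.length > 0) {
    return { publishable: false, issues, location: null };
  }
  let location = { lat: sample.lat, lon: sample.lon };
  if (sample.sensitiveSite) {
    const redacted = {
      lat: roundTo(sample.lat, MAX_SENSITIVE_DECIMALS),
      lon: roundTo(sample.lon, MAX_SENSITIVE_DECIMALS),
    };
    if (redacted.lat !== sample.lat || redacted.lon !== sample.lon) {
      issues.push("sensitive_site_precision_redacted");
    }
    location = redacted;
  }
  return { publishable: true, issues, location };
}

// Synthetic example record, not a real field location.
const result = evaluateSampleLocation({
  lat: 12.345678,
  lon: 98.765432,
  sensitiveSite: true,
});
console.log(result.location, result.issues);
```

Returning the coarsened location together with an issue code, rather than rejecting the edge outright, is one possible design; it keeps the sample visible while recording that redaction was applied.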

Non-overlap

This is not a broad graph extractor/navigator, link audit, ontology drift/alias/synonym guard, relationship conflict arbiter, author-affiliation disambiguation module, artifact lineage guard, evidence freshness checker, instrument-method compatibility guard, reproducibility route module, recommendation visibility/diversity guard, negative-result replication guard, measurement harmonization guard, claim qualifier guard, ethics provenance guard, funder award lineage guard, clinical trial registry guard, or software/runtime compatibility guard. It focuses specifically on geospatial field-sample provenance and safe location-edge publication.

Local validation

Run from geospatial-sample-provenance-guard/:

npm run check
npm test
npm run demo
npm run demo:video

All four commands passed locally.

Reviewer artifacts

  • reports/summary.json
  • reports/reviewer-packet.md
  • reports/summary.svg
  • reports/demo.webm

Safety

All data is synthetic. The module does not call geocoders, repositories, GIS systems, ontology services, specimen registries, journal systems, or external APIs. It does not include private field locations, real endangered species data, credentials, or live graph mutations.
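
Because the module makes no external calls, checks such as dataset DOI resolution and collection-date plausibility presumably run against local synthetic fixtures. The sketch below shows one way that could look; the registry contents, function names, and issue codes are made up for illustration and are not taken from the PR.

```javascript
// Hypothetical sketch: the registry, names, and issue codes are assumptions.

// In-memory stand-in for synthetic fixtures; no network lookup is performed.
const SYNTHETIC_DOI_REGISTRY = new Map([
  ["10.5555/synthetic.dataset.001", { sampleIds: ["S-001", "S-002"] }],
]);

// Loose syntactic check: "10.", a 4-9 digit registrant code, "/", a suffix.
const DOI_PATTERN = /^10\.\d{4,9}\/\S+$/;

// Verify that a sample-to-dataset edge points at a known DOI that lists the sample.
function checkDatasetEdge(edge, registry = SYNTHETIC_DOI_REGISTRY) {
  if (!DOI_PATTERN.test(edge.doi)) return "doi_malformed";
  const record = registry.get(edge.doi.toLowerCase());
  if (!record) return "doi_unresolved";
  if (!record.sampleIds.includes(edge.sampleId)) return "sample_dataset_mismatch";
  return "ok";
}

// A collection date is treated as plausible if it parses and is not in the future.
function checkCollectionDate(isoDate, now = new Date()) {
  const parsed = new Date(isoDate);
  if (Number.isNaN(parsed.getTime())) return "date_unparseable";
  if (parsed > now) return "date_in_future";
  return "ok";
}

console.log(checkDatasetEdge({ doi: "10.5555/synthetic.dataset.001", sampleId: "S-001" }));
console.log(checkCollectionDate("2025-01-15", new Date("2026-05-22T00:00:00Z")));
```

Passing the registry and the reference time as parameters keeps both checks deterministic, which fits a test suite that runs on synthetic data only.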

Current status - 2026-05-29

Verified after newer same-issue #17 activity: this PR remains open, non-draft, CLEAN/mergeable, and distinct from KoiosSG PR #422. PR #379 is the geospatial field-sample provenance guard; PR #422 is a separate multilingual entity-alias guard.

Current status - 2026-05-30 01:03 IST

Visibility refresh after KoiosSG updated same-issue #17 PR #422. PR #379 remains open, non-draft, CLEAN/mergeable, Algora-labeled, and distinct: it validates geospatial field-sample/specimen provenance, coordinate policy, country consistency, sensitive-site redaction, voucher/specimen links, DOI edge alignment, collection-date plausibility, and safe location-edge publication. PR #422 remains a separate multilingual entity-alias normalization and confidence-evidence guard.

@taherdhanera
Author

Reviewer-ready checkpoint for /claim #17. This PR is open, non-draft, mergeable/CLEAN, Bounty claim labeled, and the body contains /claim #17. Scope remains geospatial sample provenance: coordinate/CRS validation, country consistency, sensitive-site redaction, voucher/accession links, DOI alignment, collection-date plausibility, and recommendation-safe graph edges with synthetic data only.

@taherdhanera
Author

Status refresh after issue #17 PR #422 was submitted: PR #379 remains open, non-draft, mergeable, bounty-labeled, and claim-marked for the existing geospatial sample provenance graph guard submission. Scope remains distinct from multilingual entity alias handling and no code changes are needed unless reviewers request revisions.

@taherdhanera
Author

Status refresh after the latest PR #422 activity on issue #17: PR #379 remains open, non-draft, mergeable/CLEAN, bounty-labeled, and claim-marked for the existing geospatial sample provenance graph guard submission. Scope remains distinct from multilingual entity alias handling; no code changes are needed unless reviewers request revisions.

@taherdhanera
Author

Status refresh after the newer same-issue PR #447 activity: PR #379 remains open, non-draft, mergeable/CLEAN, bounty-labeled, and claim-marked for issue #17.

The submitted scope remains the geospatial sample provenance graph guard: coordinate ranges, CRS normalization, country/coordinate consistency, sensitive-site coordinate precision and redaction, sample voucher/specimen accession links, dataset DOI alignment, collection-date plausibility, and public recommendation safety for sample-location graph paths.

This is distinct from PR #447's null-result/knowledge-gap signal guard, multilingual entity alias handling, temporal validity, biological accession crosswalk, software compatibility, funder/clinical lineage, recommendation diversity, ethics provenance, graph conflict arbitration, and the other #17 slices.

@taherdhanera
Author

Status refresh after the newer same-issue PR #422 hardening activity: PR #379 remains open, non-draft, mergeable/CLEAN, bounty-labeled, and claim-marked for issue #17.

The submitted scope remains the geospatial sample provenance graph guard: coordinate ranges, CRS normalization, country/coordinate consistency, sensitive-site coordinate precision and redaction, sample voucher/specimen accession links, dataset DOI alignment, collection-date plausibility, and public recommendation safety for sample-location graph paths.

This is distinct from PR #422's multilingual entity alias guard and hardening update, PR #447's null-result/knowledge-gap signal guard, temporal validity, biological accession crosswalk, software compatibility, funder/clinical lineage, recommendation diversity, ethics provenance, graph conflict arbitration, and other #17 slices. No contributor-side code changes are pending unless reviewers request revisions.

@taherdhanera
Author

Status refresh after newer same-issue PR #451 graph ingestion auditor activity. PR #379 remains open, non-draft, mergeable/CLEAN, bounty-labeled, and claim-marked for issue #17.

Scope remains unchanged and distinct: geospatial sample provenance graph guard for Scientific Knowledge Graph Integration. It covers coordinate ranges, CRS normalization, country/coordinate consistency, sensitive-site redaction, voucher/accession linkage, provenance evidence, and reviewer-ready graph export artifacts for sample-location knowledge graph data.

This is separate from the newer graph ingestion auditor and other adjacent #17 slices. No implementation changes are needed unless reviewers request them.

@taherdhanera
Author

Status refresh after the newer same-issue PR #465 activity: PR #379 remains open, non-draft, mergeable/CLEAN, bounty-labeled, and claim-marked for issue #17.

The submitted scope remains the geospatial sample provenance graph guard: coordinate ranges, CRS normalization, country/coordinate consistency, sensitive-site redaction, voucher/accession DOI links, specimen/source lineage, and deterministic reviewer audit artifacts before geospatial sample nodes are exposed in entity pages or recommendations.

PR #465 appears to add a separate recommendation path auditor for private node leaks, broken explanation paths, user filter violations, retracted evidence dependencies, weak evidence scores, stale trend features, duplicate recommendations, missing schema metadata, and unexplained personalization. That is adjacent, but PR #379 is still the prior geospatial sample provenance layer for this issue.

@taherdhanera
Author

Status refresh after newer same-issue PR #488 activity: PR #379 remains open, non-draft, mergeable/CLEAN, bounty-labeled, and claim-marked for issue #17.

The submitted scope remains the geospatial sample provenance graph guard: coordinate ranges, CRS normalization, country/coordinate consistency, sensitive-site redaction, voucher/accession DOI links, specimen/source lineage, dataset DOI alignment, collection-date plausibility, and deterministic reviewer audit artifacts before geospatial sample nodes are exposed in entity pages or recommendations. It is tied to the Pending USD 475 Algora claim: https://algora.io/claims/jrCcpWx7sq762TVd

This remains separate from PR #488's entity alias disambiguation guard for ontology candidate IDs, alias collisions, context windows, confidence thresholds, and blocked edge publication. No contributor-side code changes are pending unless reviewers request them.

@taherdhanera
Author

Visibility refresh after PR #422's newer hardening pass for language-tag normalization in multilingual alias lookup.

My existing issue #17 submission remains PR #379.

Current status re-verified now: PR #379 is open, non-draft, CLEAN/mergeable, bounty-labeled, includes /claim #17, and its Algora claim remains Pending for USD 475: https://algora.io/claims/jrCcpWx7sq762TVd

Scope reminder for review: PR #379 is the geospatial sample provenance graph guard for Scientific Knowledge Graph Integration. It validates coordinate ranges, CRS normalization, country/coordinate consistency, sensitive-site coordinate precision and redaction, sample voucher/specimen accession links, dataset DOI alignment, collection-date plausibility, and public recommendation safety for sample-location graph paths.

This remains separate from PR #422's multilingual entity alias guard and language-tag normalization hardening, PR #488's entity alias disambiguation guard, PR #447's null-result/knowledge-gap signal guard, and other adjacent #17 graph slices. No contributor-side code changes are pending unless maintainers request revisions.

@taherdhanera
Author

Status refresh after newer same-issue #17 PR activity, including KoiosSG PR #422.

This PR #379 remains open, non-draft, CLEAN/mergeable, and claims #17. It focuses specifically on geospatial field-sample provenance: coordinate validity, CRS normalization, country/coordinate consistency, sensitive-site precision/redaction, voucher/specimen accession links, DOI alignment, collection-date plausibility, and safe location-edge publication.

Non-overlap: PR #422 focuses on multilingual entity alias handling. That is a separate alias-disambiguation slice and does not replace this geospatial sample-provenance guard. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

Visibility refresh after newer same-issue #17 competitor PR #422 activity. This PR #379 remains open, non-draft, CLEAN/mergeable, bounty-listed, and claim-marked for issue #17. Scope remains distinct: it is the geospatial field-sample provenance guard, covering coordinate policy, country consistency, sensitive-site redaction, voucher/specimen links, DOI edge alignment, and safe location-edge publication. No contributor-side code changes are pending unless maintainers request revisions.

@taherdhanera
Author

Visibility refresh after KoiosSG updated same-issue #17 PR #422 following my last status update.

This PR #379 remains open, non-draft, CLEAN/mergeable, bounty-labeled, and claim-marked for issue #17. It is tied to the Pending USD 475 Algora claim: https://algora.io/claims/jrCcpWx7sq762TVd

Scope remains distinct: PR #379 is the geospatial field-sample provenance guard covering coordinate policy, country consistency, sensitive-site redaction, voucher/specimen links, DOI edge alignment, collection-date plausibility, and safe location-edge publication. PR #422 remains a separate multilingual entity-alias guard. No contributor-side code changes are pending unless maintainers request revisions.

@taherdhanera
Author

Visibility refresh after newer same-issue #17 competitor PR activity from yunrongy424-oss PR #447.

This PR #379 remains open, non-draft, MERGEABLE, bounty-labeled, and claim-marked for issue #17. It is tied to the Pending USD 475 Algora claim: https://algora.io/claims/jrCcpWx7sq762TVd

Scope remains distinct: geospatial field-sample provenance guard covering coordinate policy, country consistency, sensitive-site redaction, voucher/specimen links, DOI edge alignment, collection-date plausibility, and safe location-edge publication. PR #447 and PR #422 are separate knowledge-graph slices. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

Visibility refresh after KoiosSG updated same-issue #17 PR #422 again with the lowercase Greek confusable hardening pass.

This PR #379 remains open, non-draft, MERGEABLE, bounty-labeled, and claim-marked for issue #17. It is tied to the Pending USD 475 Algora claim: https://algora.io/claims/jrCcpWx7sq762TVd

Scope remains distinct: geospatial field-sample provenance guard covering coordinate policy, country consistency, sensitive-site redaction, voucher/specimen links, DOI edge alignment, collection-date plausibility, and safe location-edge publication. PR #422 remains a separate multilingual entity-alias guard focused on script-confusable alias handling. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after KoiosSG updated same-issue #17 PR #422 at 2026-05-30T10:13:58Z.

This PR #379 remains open, non-draft, MERGEABLE/CLEAN, bounty-labeled, and claim-marked for issue #17. It is tied to the Pending USD 475 Algora claim: https://algora.io/claims/jrCcpWx7sq762TVd

Scope remains distinct: geospatial field-sample provenance guard covering coordinate policy, CRS normalization, country/coordinate consistency, sensitive-site redaction, voucher/specimen links, DOI edge alignment, collection-date plausibility, and safe location-edge publication. PR #422 remains the separate multilingual entity-alias guard. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @KoiosSG in PR #422 (multilingual entity alias guard).

PR #379 remains open, non-draft, MERGEABLE/CLEAN, bounty-labeled, and claim-marked for #17. Algora reward link remains indexed: https://algora.io/claims/jrCcpWx7sq762TVd

Scope remains the geospatial sample provenance guard, separate from PR #422's multilingual entity alias guard. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @keon0711: #17 (comment)

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from the newer temporal-consistency attempt. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @keon0711: #17 (comment)

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from the newer organism/strain boundary attempt. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @keon0711 in PR #515.

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from PR #515's organism/strain boundary guard. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @KoiosSG / PR #422: #422 (comment)

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from PR #422's multilingual entity-alias guard and latest malformed alias/mention hardening. No contributor-side code changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @orenodinner: #17 (comment)

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from the newer funding-award provenance graph attempt. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @orenodinner in PR #521.

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from PR #521's funding-award provenance graph guard. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 follow-up activity from @alexgduarte on PR #98: #98 (comment)

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from PR #98's broader scientific knowledge graph MVP module. No contributor-side changes are pending unless maintainers request revisions.

@taherdhanera
Author

Visibility and merge/reward readiness refresh after newer same-issue #17 activity from @goodgoodclaw on PR #522.

My active #17 submission remains PR #379.

Current status re-verified now:

Scope remains distinct: PR #379 is the geospatial field-sample provenance guard for Scientific Knowledge Graph Integration. It validates coordinate ranges, CRS normalization, country/coordinate consistency, sensitive-site coordinate precision/redaction, sample voucher/specimen accession links, dataset DOI alignment, collection-date plausibility, and public recommendation safety for sample-location graph paths.

PR #522 is a separate scientific image metadata provenance graph guard for microscopy/gel/figure image nodes, source artifact linkage, checksums, acquisition chronology, channel/pixel metadata, license/access state, and PHI/private identifier leakage. No contributor-side changes are pending for PR #379 unless maintainers request revisions. Could a maintainer please review PR #379 for merge/reward decision, or point me to the exact change needed to unblock it?

@taherdhanera
Author

PR-side visibility refresh after newer same-issue #17 activity from @jaxassistant55 on PR #526:

This PR #379 remains my active issue #17 submission.

Current status re-verified now:

Scope remains the geospatial field-sample provenance guard, separate from PR #526's scientific software dependency provenance graph guard. No contributor-side changes are pending unless maintainers request revisions.

1 participant