Born of discussion at CL2017, the germ of an idea for a WaC namespace storing the origin(s) for the data in a given file.
Suggested storage outline is as follows:
{
"__version__":"1.0.0",
"tool":"CoolWebScraper",
"sources":[
{ "url":"...", "date":"..." },
{ "url":"...", "date":"..." },
... etc
]
}
__version__ - Semver for this namespace
tool - Optional field if a specific tool has been used for this data
sources - An array of source references
Then for each source:
url - The originating URL for this source
date - When the URL was retrieved (ISO datespec, as per main standard)
Born of discussion at CL2017, the germ of an idea for a WaC namespace storing the origin(s) for the data in a given file.
Suggested storage outline is as follows:
__version__- Semver for this namespacetool- Optional field if a specific tool has been used for this datasources- An array of source referencesThen for each source:
url- The originating URL for this sourcedate- When the URL was retrieved (ISO datespec, as per main standard)