Currently, the kaggle datasets metadata <dataset> command outputs limited dataset metadata. For example, running:
kaggle datasets metadata mlg-ulb/creditcardfraud
returns JSON that includes fields such as datasetId, title, description, usabilityRating, totalViews, totalDownloads, etc.
However, it does not include the lastUpdateTime field, even though these are available from the web API endpoint:
https://www.kaggle.com/api/i/datasets.DatasetDetailService/GetDatasetBasics
That endpoint returns richer metadata, including:
lastUpdateTime
creatorDisplayName
It would be extremely useful if the CLI’s datasets metadata command exposed at least the lastUpdateTime field, so that workflows depending on dataset freshness can be automated without having to call the API separately.
This would make it much easier to build workflows around dataset versioning and updates directly from the CLI.
Thanks for considering!
Currently, the
kaggle datasets metadata <dataset>command outputs limited dataset metadata. For example, running:returns JSON that includes fields such as
datasetId,title,description,usabilityRating,totalViews,totalDownloads, etc.However, it does not include the
lastUpdateTimefield, even though these are available from the web API endpoint:That endpoint returns richer metadata, including:
lastUpdateTimecreatorDisplayNameIt would be extremely useful if the CLI’s
datasets metadatacommand exposed at least thelastUpdateTimefield, so that workflows depending on dataset freshness can be automated without having to call the API separately.This would make it much easier to build workflows around dataset versioning and updates directly from the CLI.
Thanks for considering!