Commit 1c24ea3

Fixed caching
1 parent c697aec commit 1c24ea3

2 files changed

Lines changed: 9 additions & 1 deletion

CHANGELOG.md

Lines changed: 4 additions & 0 deletions
@@ -1,3 +1,7 @@
+## 0.6.4 (unreleased)
+
+- Fixed caching
+
 ## 0.6.3 (2026-01-05)
 
 - Updated Tokenizers to 0.22.2

lib/tokenizers/from_pretrained.rb

Lines changed: 5 additions & 1 deletion
@@ -55,7 +55,7 @@ def cached_path(cache_dir, url, headers, options)
       resource_path = File.join(cache_dir, "#{fsum}.#{esum}")
       if File.exist?(resource_path)
         res = head_request(url, headers, options)
-        if res["etag"] == etag
+        if normalize_etag(res["etag"]) == normalize_etag(etag)
           return resource_path
         end
       end
@@ -106,6 +106,10 @@ def head_request(url, headers, options, redirects = 0)
       res
     end
 
+    def normalize_etag(etag)
+      etag.delete_prefix("W/").delete_prefix('"').delete_suffix('"') if etag
+    end
+
     def cache_dir
       cache_dir =
         if ENV["TOKENIZERS_CACHE"]
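
The effect of the new helper can be checked in isolation. The sketch below copies `normalize_etag` from the diff into a standalone script. The sample ETag values are hypothetical. The scenario, where one side carries a weak validator (`W/"..."`) and the other a strong one, is an assumption about the kind of mismatch this commit addresses; the commit message does not say so.

```ruby
# Standalone copy of the helper added in this commit.
def normalize_etag(etag)
  etag.delete_prefix("W/").delete_prefix('"').delete_suffix('"') if etag
end

# Hypothetical sample values, not taken from a real response.
stored = '"abc123"'    # strong validator
weak   = 'W/"abc123"'  # weak validator for the same content

# The old comparison (plain ==) treats these as different.
puts stored == weak                                  # false

# The new comparison strips the W/ prefix and the quotes first.
puts normalize_etag(stored) == normalize_etag(weak)  # true

# A missing ETag header stays nil instead of raising NoMethodError.
p normalize_etag(nil)                                # nil
```

With the old check, a mismatch of this kind would cause the existing cached file to be ignored even though its content is unchanged.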