19 changes: 19 additions & 0 deletions .github/ISSUE_TEMPLATE/BUG_ISSUE_TEMPLATE
@@ -0,0 +1,19 @@
---
name: BUG FIX REQUEST
about: Please record bugs that occur in the project
title: "[BUG]"
labels: ''
assignees: ''

---

**Tell us what bug occurred**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

**Describe the solution you think fits best**
A clear and concise description of what you want to happen.

**If there are alternatives for this feature, please describe them as well**
A clear and concise description of any alternative solutions or features you've considered.

**Additional notes**
38 changes: 38 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.md
@@ -0,0 +1,38 @@
---
name: Bug report
about: Create a report to help us improve
title: "[BUG]"
labels: bug
assignees: ''

---

**Describe the bug**
A clear and concise description of what the bug is.

**To Reproduce (when the bug occurs)**
Steps to reproduce the behavior:
1. Go to '...'
2. Click on '....'
3. Scroll down to '....'
4. See error

**Expected behavior (the normal result)**
A clear and concise description of what you expected to happen.

**Screenshots**
If applicable, add screenshots to help explain your problem.

**Desktop (please complete the following information):**
- OS: [e.g. iOS]
- Browser [e.g. chrome, safari]
- Version [e.g. 22]

**Smartphone (please complete the following information):**
- Device: [e.g. iPhone6]
- OS: [e.g. iOS8.1]
- Browser [e.g. stock browser, safari]
- Version [e.g. 22]

**Additional context**
Add any other context about the problem here.
19 changes: 19 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.md
@@ -0,0 +1,19 @@
---
name: Feature request
about: Suggest a feature the project needs
title: "[FEAT]"
labels: enhancement
assignees: ''

---

**Tell us what feature is needed**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

**Describe the solution you think fits best**
A clear and concise description of what you want to happen.

**If there are alternatives for this feature, please describe them as well**
A clear and concise description of any alternative solutions or features you've considered.

**Additional notes**
12 changes: 12 additions & 0 deletions .github/ISSUE_TEMPLATE/task-issue-template.md
@@ -0,0 +1,12 @@
---
name: task issue template
about: Template to use when a task comes up
title: "[TASK]"
labels: ''
assignees: ''

---

**Describe what the task is**

**Notes**
25 changes: 25 additions & 0 deletions .gitignore
@@ -0,0 +1,25 @@
# Exclude model files
model/eng_hin.model
model/hin_eng.model

# Exclude bounding-box JSON files
engBB.json
hinBB.json

source_eng/*.jpg
source_hin/*.jpg

# Ignore the Diff_SceneTextEraser folder
Diff_SceneTextEraser/

# Ignore the IndicTransToolkit folder
IndicTransToolkit/
model/

__pycache__/
cuda-keyring_1.0-1_all.deb
output/
tmp/

*.ipynb
first_test/
103 changes: 103 additions & 0 deletions =2.1
@@ -0,0 +1,103 @@
Collecting bitsandbytes
Using cached bitsandbytes-0.45.2-py3-none-manylinux_2_24_x86_64.whl.metadata (5.8 kB)
Collecting scipy
Using cached scipy-1.13.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (60 kB)
Collecting accelerate
Using cached accelerate-1.3.0-py3-none-any.whl.metadata (19 kB)
Collecting datasets
Using cached datasets-3.2.0-py3-none-any.whl.metadata (20 kB)
Collecting flash-attn
Using cached flash_attn-2.7.4.post1.tar.gz (6.0 MB)
Preparing metadata (setup.py): started
Preparing metadata (setup.py): finished with status 'done'
Requirement already satisfied: torch<3,>=2.0 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from bitsandbytes) (2.6.0+cu118)
Requirement already satisfied: numpy>=1.17 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from bitsandbytes) (2.0.2)
Requirement already satisfied: packaging>=20.0 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from accelerate) (24.2)
Collecting psutil (from accelerate)
Using cached psutil-6.1.1-cp36-abi3-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (22 kB)
Requirement already satisfied: pyyaml in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from accelerate) (6.0.2)
Requirement already satisfied: huggingface-hub>=0.21.0 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from accelerate) (0.28.1)
Requirement already satisfied: safetensors>=0.4.3 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from accelerate) (0.5.2)
Requirement already satisfied: filelock in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from datasets) (3.17.0)
Collecting pyarrow>=15.0.0 (from datasets)
Using cached pyarrow-19.0.0-cp39-cp39-manylinux_2_28_x86_64.whl.metadata (3.3 kB)
Collecting dill<0.3.9,>=0.3.0 (from datasets)
Using cached dill-0.3.8-py3-none-any.whl.metadata (10 kB)
Requirement already satisfied: pandas in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from datasets) (2.2.3)
Requirement already satisfied: requests>=2.32.2 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from datasets) (2.32.3)
Requirement already satisfied: tqdm>=4.66.3 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from datasets) (4.67.1)
Collecting xxhash (from datasets)
Using cached xxhash-3.5.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (12 kB)
Collecting multiprocess<0.70.17 (from datasets)
Using cached multiprocess-0.70.16-py39-none-any.whl.metadata (7.2 kB)
Collecting fsspec<=2024.9.0,>=2023.1.0 (from fsspec[http]<=2024.9.0,>=2023.1.0->datasets)
Using cached fsspec-2024.9.0-py3-none-any.whl.metadata (11 kB)
Collecting aiohttp (from datasets)
Using cached aiohttp-3.11.12-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (7.7 kB)
Collecting einops (from flash-attn)
Using cached einops-0.8.0-py3-none-any.whl.metadata (12 kB)
Collecting aiohappyeyeballs>=2.3.0 (from aiohttp->datasets)
Using cached aiohappyeyeballs-2.4.6-py3-none-any.whl.metadata (5.9 kB)
Collecting aiosignal>=1.1.2 (from aiohttp->datasets)
Using cached aiosignal-1.3.2-py2.py3-none-any.whl.metadata (3.8 kB)
Collecting async-timeout<6.0,>=4.0 (from aiohttp->datasets)
Using cached async_timeout-5.0.1-py3-none-any.whl.metadata (5.1 kB)
Collecting attrs>=17.3.0 (from aiohttp->datasets)
Using cached attrs-25.1.0-py3-none-any.whl.metadata (10 kB)
Collecting frozenlist>=1.1.1 (from aiohttp->datasets)
Using cached frozenlist-1.5.0-cp39-cp39-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (13 kB)
Collecting multidict<7.0,>=4.5 (from aiohttp->datasets)
Using cached multidict-6.1.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (5.0 kB)
Collecting propcache>=0.2.0 (from aiohttp->datasets)
Using cached propcache-0.2.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (9.2 kB)
Collecting yarl<2.0,>=1.17.0 (from aiohttp->datasets)
Using cached yarl-1.18.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (69 kB)
Requirement already satisfied: typing-extensions>=3.7.4.3 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from huggingface-hub>=0.21.0->accelerate) (4.12.2)
Requirement already satisfied: charset-normalizer<4,>=2 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from requests>=2.32.2->datasets) (3.4.1)
Requirement already satisfied: idna<4,>=2.5 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from requests>=2.32.2->datasets) (3.10)
Requirement already satisfied: urllib3<3,>=1.21.1 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from requests>=2.32.2->datasets) (2.3.0)
Requirement already satisfied: certifi>=2017.4.17 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from requests>=2.32.2->datasets) (2025.1.31)
Requirement already satisfied: networkx in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (3.2.1)
Requirement already satisfied: jinja2 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (3.1.5)
Requirement already satisfied: nvidia-cuda-nvrtc-cu11==11.8.89 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (11.8.89)
Requirement already satisfied: nvidia-cuda-runtime-cu11==11.8.89 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (11.8.89)
Requirement already satisfied: nvidia-cuda-cupti-cu11==11.8.87 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (11.8.87)
Requirement already satisfied: nvidia-cudnn-cu11==9.1.0.70 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (9.1.0.70)
Requirement already satisfied: nvidia-cublas-cu11==11.11.3.6 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (11.11.3.6)
Requirement already satisfied: nvidia-cufft-cu11==10.9.0.58 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (10.9.0.58)
Requirement already satisfied: nvidia-curand-cu11==10.3.0.86 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (10.3.0.86)
Requirement already satisfied: nvidia-cusolver-cu11==11.4.1.48 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (11.4.1.48)
Requirement already satisfied: nvidia-cusparse-cu11==11.7.5.86 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (11.7.5.86)
Requirement already satisfied: nvidia-nccl-cu11==2.21.5 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (2.21.5)
Requirement already satisfied: nvidia-nvtx-cu11==11.8.86 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (11.8.86)
Requirement already satisfied: triton==3.2.0 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (3.2.0)
Requirement already satisfied: sympy==1.13.1 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from torch<3,>=2.0->bitsandbytes) (1.13.1)
Requirement already satisfied: mpmath<1.4,>=1.1.0 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from sympy==1.13.1->torch<3,>=2.0->bitsandbytes) (1.3.0)
Requirement already satisfied: python-dateutil>=2.8.2 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from pandas->datasets) (2.9.0.post0)
Requirement already satisfied: pytz>=2020.1 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from pandas->datasets) (2025.1)
Requirement already satisfied: tzdata>=2022.7 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from pandas->datasets) (2025.1)
Requirement already satisfied: six>=1.5 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from python-dateutil>=2.8.2->pandas->datasets) (1.17.0)
Requirement already satisfied: MarkupSafe>=2.0 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from jinja2->torch<3,>=2.0->bitsandbytes) (3.0.2)
Using cached bitsandbytes-0.45.2-py3-none-manylinux_2_24_x86_64.whl (69.7 MB)
Using cached scipy-1.13.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (38.6 MB)
Using cached accelerate-1.3.0-py3-none-any.whl (336 kB)
Using cached datasets-3.2.0-py3-none-any.whl (480 kB)
Using cached dill-0.3.8-py3-none-any.whl (116 kB)
Using cached fsspec-2024.9.0-py3-none-any.whl (179 kB)
Using cached aiohttp-3.11.12-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.6 MB)
Using cached multiprocess-0.70.16-py39-none-any.whl (133 kB)
Using cached pyarrow-19.0.0-cp39-cp39-manylinux_2_28_x86_64.whl (42.1 MB)
Using cached einops-0.8.0-py3-none-any.whl (43 kB)
Using cached psutil-6.1.1-cp36-abi3-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (287 kB)
Using cached xxhash-3.5.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (193 kB)
Using cached aiohappyeyeballs-2.4.6-py3-none-any.whl (14 kB)
Using cached aiosignal-1.3.2-py2.py3-none-any.whl (7.6 kB)
Using cached async_timeout-5.0.1-py3-none-any.whl (6.2 kB)
Using cached attrs-25.1.0-py3-none-any.whl (63 kB)
Using cached frozenlist-1.5.0-cp39-cp39-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (242 kB)
Using cached multidict-6.1.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (124 kB)
Using cached propcache-0.2.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (208 kB)
Using cached yarl-1.18.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (321 kB)
Building wheels for collected packages: flash-attn
Building wheel for flash-attn (setup.py): started
Building wheel for flash-attn (setup.py): finished with status 'canceled'
88 changes: 88 additions & 0 deletions =4.33.2
@@ -0,0 +1,88 @@
Collecting nltk
Using cached nltk-3.9.1-py3-none-any.whl.metadata (2.9 kB)
Collecting sacremoses
Using cached sacremoses-0.1.1-py3-none-any.whl.metadata (8.3 kB)
Collecting pandas
Using cached pandas-2.2.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (89 kB)
Collecting regex
Using cached regex-2024.11.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (40 kB)
Collecting mock
Using cached mock-5.1.0-py3-none-any.whl.metadata (3.0 kB)
Collecting transformers
Using cached transformers-4.48.3-py3-none-any.whl.metadata (44 kB)
Collecting mosestokenizer
Using cached mosestokenizer-1.2.1-py3-none-any.whl
Collecting click (from nltk)
Using cached click-8.1.8-py3-none-any.whl.metadata (2.3 kB)
Collecting joblib (from nltk)
Using cached joblib-1.4.2-py3-none-any.whl.metadata (5.4 kB)
Collecting tqdm (from nltk)
Using cached tqdm-4.67.1-py3-none-any.whl.metadata (57 kB)
Collecting numpy>=1.22.4 (from pandas)
Using cached numpy-2.0.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (60 kB)
Collecting python-dateutil>=2.8.2 (from pandas)
Using cached python_dateutil-2.9.0.post0-py2.py3-none-any.whl.metadata (8.4 kB)
Collecting pytz>=2020.1 (from pandas)
Using cached pytz-2025.1-py2.py3-none-any.whl.metadata (22 kB)
Collecting tzdata>=2022.7 (from pandas)
Using cached tzdata-2025.1-py2.py3-none-any.whl.metadata (1.4 kB)
Requirement already satisfied: filelock in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from transformers) (3.17.0)
Collecting huggingface-hub<1.0,>=0.24.0 (from transformers)
Using cached huggingface_hub-0.28.1-py3-none-any.whl.metadata (13 kB)
Collecting packaging>=20.0 (from transformers)
Using cached packaging-24.2-py3-none-any.whl.metadata (3.2 kB)
Collecting pyyaml>=5.1 (from transformers)
Using cached PyYAML-6.0.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (2.1 kB)
Collecting requests (from transformers)
Using cached requests-2.32.3-py3-none-any.whl.metadata (4.6 kB)
Collecting tokenizers<0.22,>=0.21 (from transformers)
Using cached tokenizers-0.21.0-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.7 kB)
Collecting safetensors>=0.4.1 (from transformers)
Using cached safetensors-0.5.2-cp38-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (3.8 kB)
Collecting docopt (from mosestokenizer)
Using cached docopt-0.6.2-py2.py3-none-any.whl
Collecting openfile (from mosestokenizer)
Using cached openfile-0.0.7-py3-none-any.whl.metadata (1.7 kB)
Collecting uctools (from mosestokenizer)
Using cached uctools-1.3.0-py3-none-any.whl
Collecting toolwrapper (from mosestokenizer)
Using cached toolwrapper-2.1.0-py3-none-any.whl
Requirement already satisfied: fsspec>=2023.5.0 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from huggingface-hub<1.0,>=0.24.0->transformers) (2025.2.0)
Requirement already satisfied: typing-extensions>=3.7.4.3 in /home/han/anaconda3/envs/itv2_hf/lib/python3.9/site-packages (from huggingface-hub<1.0,>=0.24.0->transformers) (4.12.2)
Collecting six>=1.5 (from python-dateutil>=2.8.2->pandas)
Using cached six-1.17.0-py2.py3-none-any.whl.metadata (1.7 kB)
Collecting charset-normalizer<4,>=2 (from requests->transformers)
Using cached charset_normalizer-3.4.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (35 kB)
Collecting idna<4,>=2.5 (from requests->transformers)
Using cached idna-3.10-py3-none-any.whl.metadata (10 kB)
Collecting urllib3<3,>=1.21.1 (from requests->transformers)
Using cached urllib3-2.3.0-py3-none-any.whl.metadata (6.5 kB)
Collecting certifi>=2017.4.17 (from requests->transformers)
Using cached certifi-2025.1.31-py3-none-any.whl.metadata (2.5 kB)
Using cached nltk-3.9.1-py3-none-any.whl (1.5 MB)
Using cached sacremoses-0.1.1-py3-none-any.whl (897 kB)
Using cached pandas-2.2.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (13.1 MB)
Using cached regex-2024.11.6-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (780 kB)
Using cached mock-5.1.0-py3-none-any.whl (30 kB)
Using cached transformers-4.48.3-py3-none-any.whl (9.7 MB)
Using cached huggingface_hub-0.28.1-py3-none-any.whl (464 kB)
Using cached numpy-2.0.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (19.5 MB)
Using cached packaging-24.2-py3-none-any.whl (65 kB)
Using cached python_dateutil-2.9.0.post0-py2.py3-none-any.whl (229 kB)
Using cached pytz-2025.1-py2.py3-none-any.whl (507 kB)
Using cached PyYAML-6.0.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (737 kB)
Using cached safetensors-0.5.2-cp38-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (461 kB)
Using cached tokenizers-0.21.0-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.0 MB)
Using cached tqdm-4.67.1-py3-none-any.whl (78 kB)
Using cached tzdata-2025.1-py2.py3-none-any.whl (346 kB)
Using cached click-8.1.8-py3-none-any.whl (98 kB)
Using cached joblib-1.4.2-py3-none-any.whl (301 kB)
Using cached openfile-0.0.7-py3-none-any.whl (2.4 kB)
Using cached requests-2.32.3-py3-none-any.whl (64 kB)
Using cached certifi-2025.1.31-py3-none-any.whl (166 kB)
Using cached charset_normalizer-3.4.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (146 kB)
Using cached idna-3.10-py3-none-any.whl (70 kB)
Using cached six-1.17.0-py2.py3-none-any.whl (11 kB)
Using cached urllib3-2.3.0-py3-none-any.whl (128 kB)
Installing collected packages: toolwrapper, pytz, openfile, docopt, urllib3, uctools, tzdata, tqdm, six, safetensors, regex, pyyaml, packaging, numpy, mock, joblib, idna, click, charset-normalizer, certifi, sacremoses, requests, python-dateutil, nltk, mosestokenizer, pandas, huggingface-hub, tokenizers, transformers
Successfully installed certifi-2025.1.31 charset-normalizer-3.4.1 click-8.1.8 docopt-0.6.2 huggingface-hub-0.28.1 idna-3.10 joblib-1.4.2 mock-5.1.0 mosestokenizer-1.2.1 nltk-3.9.1 numpy-2.0.2 openfile-0.0.7 packaging-24.2 pandas-2.2.3 python-dateutil-2.9.0.post0 pytz-2025.1 pyyaml-6.0.2 regex-2024.11.6 requests-2.32.3 sacremoses-0.1.1 safetensors-0.5.2 six-1.17.0 tokenizers-0.21.0 toolwrapper-2.1.0 tqdm-4.67.1 transformers-4.48.3 tzdata-2025.1 uctools-1.3.0 urllib3-2.3.0
Binary file added OCR/ENG_OCR/data/data_answer/answer_texts.xlsx
Binary file not shown.
5 changes: 5 additions & 0 deletions OCR/ENG_OCR/data/data_answer/gt_100.txt
@@ -0,0 +1,5 @@
158 128 411 181 "Footpath"
443 128 501 169 "To"
64 200 363 243 "Colchester"
394 199 487 239 "and"
72 271 382 312 "Greenstead"
1 change: 1 addition & 0 deletions OCR/ENG_OCR/data/data_answer/gt_101.txt
@@ -0,0 +1 @@
176 189 456 274 "DELL"
7 changes: 7 additions & 0 deletions OCR/ENG_OCR/data/data_answer/gt_102.txt
@@ -0,0 +1,7 @@
602 292 610 302 "R"
27 467 128 479 "FOSTER'S"
279 468 380 479 "FOSTER'S"
524 461 621 478 "FOSTER'S"
272 120 415 256 "O"
321 157 363 215 "F"
82 284 577 368 "FOSTER'S"
4 changes: 4 additions & 0 deletions OCR/ENG_OCR/data/data_answer/gt_103.txt
@@ -0,0 +1,4 @@
112 111 332 143 "STIRLING"
350 107 524 138 "CASTLE"
68 281 298 315 "ARGYLL'S"
318 275 573 312 "LODGINGS"
8 changes: 8 additions & 0 deletions OCR/ENG_OCR/data/data_answer/gt_104.txt
@@ -0,0 +1,8 @@
86 33 376 138 "GOOD"
79 155 217 202 "HOME"
242 157 382 205 "MADE"
87 221 372 324 "FOOD"
74 375 155 405 "CASK"
170 375 387 405 "CONDITIONED"
129 419 329 491 "ALES"
100 540 356 598 "GARDEN"
4 changes: 4 additions & 0 deletions OCR/ENG_OCR/data/data_answer/gt_105.txt
@@ -0,0 +1,4 @@
425 0 934 129 "SCOTTISH"
400 430 882 527 "NATIONAL"
389 837 783 941 "GALLERY"
376 1158 789 1274 "MODERN"
3 changes: 3 additions & 0 deletions OCR/ENG_OCR/data/data_answer/gt_106.txt
@@ -0,0 +1,3 @@
199 175 477 206 "Information"
296 233 484 267 "Tickets"
331 298 492 335 "Trains"
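Reviewer note: the `gt_*.txt` files above appear to hold one text box per line, as four integers followed by a double-quoted transcription. A minimal sketch of how such a file could be read is below. It assumes the integers are `x1 y1 x2 y2` corner coordinates (the diff does not state the column meaning), and `Box`, `parse_gt_line`, and `parse_gt` are hypothetical names, not code from this PR.

```python
from typing import NamedTuple


class Box(NamedTuple):
    # Assumed column meaning: two corners of the box, then the transcription.
    x1: int
    y1: int
    x2: int
    y2: int
    text: str


def parse_gt_line(line: str) -> Box:
    """Parse one ground-truth line such as: 158 128 411 181 "Footpath"."""
    # Split off the four leading integers; the remainder is the quoted text.
    parts = line.strip().split(maxsplit=4)
    if len(parts) != 5:
        raise ValueError(f"expected 4 integers and a quoted string: {line!r}")
    x1, y1, x2, y2 = (int(p) for p in parts[:4])
    text = parts[4]
    # Strip the surrounding double quotes only; apostrophes inside are kept.
    if len(text) >= 2 and text[0] == '"' and text[-1] == '"':
        text = text[1:-1]
    return Box(x1, y1, x2, y2, text)


def parse_gt(content: str) -> list:
    """Parse the whole content of a gt_*.txt file, skipping blank lines."""
    return [parse_gt_line(ln) for ln in content.splitlines() if ln.strip()]
```

Splitting with `maxsplit=4` keeps the transcription intact even if it contains spaces, which none of the samples shown here do.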