Add single-page Image & PDF OCR extractor (Tesseract.js + PDF.js) by maddix123 · Pull Request #1 · maddix123/today

maddix123 · 2026-02-08T14:28:48Z

Motivation

Replace the placeholder JSON file with a usable single-file web app that extracts text from images and PDFs entirely in the browser.
Provide a simple UX with drag-and-drop / file browse, preview, status updates, and copy-to-clipboard for quick manual OCR tasks.
Keep all processing client-side, with no server involved, by using Tesseract.js for OCR and pdfjs-dist for PDF rendering.

Description

Replaced index.html contents with a complete HTML/CSS/JS single-page app that implements the OCR UI and flows.
Added CDN imports for pdfjs-dist and tesseract.js and implemented extractFromImage, extractFromPdf, and extractText functions to perform OCR on images and each PDF page.
Implemented drag-and-drop and file input handling, image and first-page PDF preview rendering via canvas or img, and UI controls for Extract, Clear, and Copy to Clipboard.
Added status messaging and progress updates, and stitched per-page PDF results with separators for multi-page PDFs.
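
The dispatch and stitching steps listed above could look roughly like the sketch below. This is not the PR's actual code: pickExtractor, stitchPages, and the "--- Page N ---" separator format are hypothetical names and choices made for illustration. Only extractFromImage and extractFromPdf, mentioned in the comments, come from the description.

```javascript
// Hypothetical helper: decide which extractor a file should go to,
// based on its MIME type, falling back to the file extension for PDFs.
function pickExtractor(file) {
  const type = (file.type || "").toLowerCase();
  const name = (file.name || "").toLowerCase();
  if (type === "application/pdf" || name.endsWith(".pdf")) return "pdf"; // -> extractFromPdf
  if (type.startsWith("image/")) return "image"; // -> extractFromImage
  return null; // unsupported file type
}

// Hypothetical helper: join per-page OCR text with a labeled separator.
// The separator format is an assumption, not taken from the PR.
function stitchPages(pageTexts) {
  if (pageTexts.length === 1) return pageTexts[0].trim();
  return pageTexts
    .map((text, i) => `--- Page ${i + 1} ---\n${text.trim()}`)
    .join("\n\n");
}
```

Keeping both helpers free of DOM and library calls means they could be unit-tested without a browser, which may help given that no tests accompany this change.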

Testing

Launched a local static server with python -m http.server 8000, which started serving the app successfully.
Attempted an automated screenshot run with Playwright (mcp__browser_tools__run_playwright_script), but the Chromium process crashed so the capture failed.
No unit tests were added or run for this change; manual/interactive testing is expected in a browser environment.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8207f7c5a3

chatgpt-codex-connector · 2026-02-08T14:30:06Z

+      const renderPdfPreview = async (file) => {
+        const arrayBuffer = await file.arrayBuffer();
+        const pdf = await pdfjsLib.getDocument({ data: arrayBuffer }).promise;


Configure pdf.js worker before calling getDocument

The page loads pdfjs-dist from a CDN but never sets pdfjsLib.GlobalWorkerOptions.workerSrc. In common browser setups, pdfjsLib.getDocument(...) can then fail with the “No GlobalWorkerOptions.workerSrc specified” error or a worker load error, so PDF previews and OCR never run. The problem affects only PDF uploads (images still work), but in that case the promises in renderPdfPreview and extractFromPdf reject before any page renders, leaving users without an actionable status update.
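
One possible shape for the fix is sketched below, under stated assumptions: configurePdfWorker, loadPdf, the workerUrlFor callback, and the setStatus callback are hypothetical names, and the status message wording is invented. The sketch relies only on pdfjsLib.version, pdfjsLib.GlobalWorkerOptions.workerSrc, and pdfjsLib.getDocument(...).promise.

```javascript
// Sketch: point pdf.js at a worker script whose version matches the loaded
// library. `workerUrlFor` is a hypothetical callback that maps a version
// string to a worker URL; an already configured workerSrc is left untouched.
function configurePdfWorker(pdfjsLib, workerUrlFor) {
  if (!pdfjsLib.GlobalWorkerOptions.workerSrc) {
    pdfjsLib.GlobalWorkerOptions.workerSrc = workerUrlFor(pdfjsLib.version);
  }
  return pdfjsLib.GlobalWorkerOptions.workerSrc;
}

// Sketch: load a PDF and report failures through a status callback instead of
// letting the rejection escape silently. Returns null when loading fails.
async function loadPdf(pdfjsLib, arrayBuffer, setStatus) {
  try {
    return await pdfjsLib.getDocument({ data: arrayBuffer }).promise;
  } catch (err) {
    setStatus(`Could not open PDF: ${err && err.message ? err.message : err}`);
    return null;
  }
}
```

In the page, configurePdfWorker would run once before the first getDocument call, with a callback that returns the worker file from the same CDN and version as the main script. The exact worker path is an assumption to verify against the CDN listing, since the worker file name differs between pdfjs-dist major versions.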

Add single-page OCR extractor

8207f7c

maddix123 added the codex label Feb 8, 2026 — with ChatGPT Codex Connector

chatgpt-codex-connector Bot reviewed Feb 8, 2026

maddix123 wants to merge 1 commit into main from codex/create-single-file-text-extraction-app
