🕷️ Xcrap Image Text Extractor

Xcrap Image Text Extractor is a package of the Xcrap framework that abstracts text extraction from images using the node-tesseract-ocr library.

📦 Installation

Installation is straightforward: use your preferred dependency manager. Here is an example using NPM:

npm i @xcrap/image-text-extractor

🚀 Usage

Xcrap Image Text Extractor provides an async extractor that can be used in an HTML parsing model like any other extractor:

import { extractImageText } from "@xcrap/image-text-extractor"
import { HtmlParsingModel } from "@xcrap/parser"

const parsingModel = new HtmlParsingModel({
	imageTexts: {
		query: "img",
		multiple: true,
		extractor: extractImageText({ lang: "eng" })
	}
})

To transform the src of each image before extraction, for example to resolve relative paths, pass the transformSrc option:

const parsingModel = new HtmlParsingModel({ 
    imageTexts: {
        query: "img",
        multiple: true,
        extractor: extractImageText({
            lang: "eng",
            transformSrc: (originalSrc) => {...}
        })
    }
})
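
As a minimal sketch of what a transformSrc function might look like when resolving relative paths: the helper below is hypothetical (resolveImageSrc and PAGE_URL are not part of the package), and it assumes transformSrc receives the original src as a string and returns the transformed string. It relies only on the standard URL class.

```typescript
// Hypothetical URL of the page being scraped (an assumption for this sketch).
const PAGE_URL = "https://example.com/gallery/index.html"

// Resolves a possibly relative image src against the page URL.
// Absolute URLs pass through unchanged, because the URL parser
// ignores the base when the input already carries a scheme.
function resolveImageSrc(originalSrc: string, pageUrl: string = PAGE_URL): string {
	return new URL(originalSrc, pageUrl).href
}

console.log(resolveImageSrc("images/photo.png"))
// → https://example.com/gallery/images/photo.png
console.log(resolveImageSrc("/static/logo.png"))
// → https://example.com/static/logo.png
console.log(resolveImageSrc("https://cdn.example.org/banner.png"))
// → https://cdn.example.org/banner.png
```

Under the assumption above, a function like this could be passed as transformSrc: (originalSrc) => resolveImageSrc(originalSrc).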

Check out more options at node-tesseract-ocr.
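
For illustration, here is a sketch of an options object using option names documented by node-tesseract-ocr (lang, oem, psm). Whether extractImageText forwards every one of these options unchanged is an assumption based on the sentence above, so check the package source before relying on it.

```typescript
// Option names follow the node-tesseract-ocr README; forwarding by
// extractImageText is an assumption, not confirmed by this README.
const ocrOptions = {
	lang: "eng", // language of the text to recognize
	oem: 1,      // OCR engine mode (1 = LSTM neural net engine)
	psm: 3       // page segmentation mode (3 = fully automatic, Tesseract's default)
}

console.log(ocrOptions)
```

Under that assumption, the object would be passed as extractImageText(ocrOptions).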

🤝 Contributing

Want to contribute? Follow these steps:

1. Fork the repository.
2. Create a new branch (git checkout -b feature-new).
3. Commit your changes (git commit -m 'Add new feature').
4. Push to the branch (git push origin feature-new).
5. Open a Pull Request.

📝 License

This project is licensed under the MIT License.
