Skip to content

NikolaiHerrmann/deep-dating

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

292 Commits
 
 
 
 
 
 
 
 

Repository files navigation

A Generalized Deep Learning-Based Pipeline for Historical Manuscript Dating

In this work, a deep learning pipeline is proposed that learns to date manuscripts based on handwriting.

To run scripts install all Python requirements first:

pip install -r requirements.txt

Go to the src folder. The following command then runs the pipeline for an image (e.g., test.ppm):

python dating_pipeline.py --img test.ppm --dataset mps

--img specifies the path of the image and --dataset the dataset weights to use (can be mps, clamm, scribble, or himanis). Weights will automatically be downloaded from huggingface (https://huggingface.co/nikolai40/deep-dating).

To remove comments from the CLaMM dataset see: https://github.com/NikolaiHerrmann/comment-remover

The pipeline will produce a plot similar to this:

Datasets

The datasets used in this project:

Code overview

Directories

Main files

Set the DATASETS_PATH variable in dating_util.py for the location of the datasets. Downloading the datasets, unzipping them, and then putting them into a datasets folder should be enough. See dating_dataset.py on how datasets are loaded, individual paths can also be changed there.

Debug

RasterFairy

Package may error out due to using np.float which is deprecated. In order to fix change line 134 in rasterfairy.py from gridPoints2d[q['indices'][0]] = np.array(q['grid'][0:2], dtype=np.float) to gridPoints2d[q['indices'][0]] = np.array(q['grid'][0:2], dtype=float).

About

A deep learning pipeline is proposed that learns to date historical manuscripts based on handwriting

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors