Skip to content
This repository was archived by the owner on Jan 20, 2021. It is now read-only.
This repository was archived by the owner on Jan 20, 2021. It is now read-only.

The output csv file using command line with the option 'spreadsheet'('lattice') is not well formated #110

@LittleLakeFish

Description

@LittleLakeFish

Hi jeremybmerrill,

I used the cmd line with option 'spreadsheet'('lattice') to extract the table from the PDF file which I sent to your mailbox before.(the cmd line is "tabula --spreadsheet -a 171.743,81.473,409.658,611.618 -p 1") .
And I found the output csv file is not well formatted(there is reduplicative data, blank row between table data columns ect.). If I do this *_without *_the option 'spreadsheet'('lattice'), the csv file format is very nice just like the result of tabula executable.
Another issue is, the table extracted via the cmd line with option 'spreadsheet'('lattice') missed the last column of the original pdf table. (Just same as the The data of the last column was missing with Lattice algorithm)

tabula-extractor version is 0.8.0
Platform windows 7
JRuby 9.0.5.0
Could you please help to solve this?
Thanks a lot for your kindly help!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions