Skip to content

Encoding of the expenses/revenue data #59

@Johannesderuig

Description

@Johannesderuig

Not sure if this is the right place for this "issue", but nonetheless here we go:

I am have downloaded some of the e-filer datasets and am currently in the process of putting them together and cleaning them. I noticed that a lot of the (financial) variables are automatically put into strings due to a lot of missing values noted as "NA".
Especially within the expenses and revenue datasets I am not sure how to deal with these "NA" observation.
It seems to me that sometimes these are, as to be expected, just missing values and can be treated as such.
In other cases however, it seems they are a way of denoting 0.
How can/should I distinguish between the "true" NA's and the NA's that are actually just 0? Or are they all just 0?
Any help would be greatly appreciated.

Kind regards,
Johannes de Ruig.

NB: I am not much of a coding expert and work in Stata

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions