Tidy the dataset for regression use (or merging), operating year by year to keep memory use manageable.

tidy_qcew_year(dt, frequency = "all", industry = "naics", verbose = TRUE)

Arguments

dt

input dataset from get_qcew_cut

frequency

download either quarterly, monthly or all data

industry

download naics or sic data

verbose

whether to print each year as it is processed, to monitor progress

Value

data.table dt_res

Note

Returns a data.table formatted according to the tidy standard. Typically this will be year x sub_year (quarter or month) x size x own_code x industry, and the file can be aggregated along those dimensions. Not all of the information is downloaded (some location quotients and taxes are omitted).
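As a rough illustration of aggregating a table in this layout, the sketch below builds a small toy data.table and averages over the sub_year dimension. The column names (including the measure column emp) and all values are assumptions made for the example; they are not taken from the actual output of tidy_qcew_year.

```r
library(data.table)

# Toy stand-in for a tidy result; column names and values are illustrative only.
dt_toy <- data.table(
  year     = c(2015, 2015, 2015, 2015),
  sub_year = c(1, 1, 2, 2),
  size     = c(0, 0, 0, 0),
  own_code = c(5, 5, 5, 5),
  industry = c("11", "21", "11", "21"),
  emp      = c(100, 200, 110, 190)   # hypothetical employment measure
)

# Collapse the sub_year dimension: average employment by year and industry.
dt_agg <- dt_toy[, .(emp = mean(emp)), by = .(year, industry)]
```

The same by = pattern would extend to size or own_code if those dimensions need to be kept in the aggregate.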

Examples

# NOT RUN {
dt_tidy <- tidy_qcew_year(dt, frequency = "all", industry = "naics")
# }
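The year-by-year idea can be sketched on toy data as follows. This is not the package's implementation: the wide input layout, the column names, and the helper tidy_one_year are all hypothetical, and serve only to show reshaping one year at a time before stacking the results.

```r
library(data.table)

# Hypothetical wide input: one row per year x industry, one column per month.
dt_wide <- data.table(
  year     = c(2015, 2015, 2016, 2016),
  industry = c("11", "21", "11", "21"),
  emp_m1   = c(10, 20, 11, 21),
  emp_m2   = c(12, 22, 13, 23)
)

# Hypothetical helper: reshape a single year's rows to long (tidy) form.
tidy_one_year <- function(dt_year) {
  dt_long <- melt(dt_year,
                  id.vars = c("year", "industry"),
                  measure.vars = c("emp_m1", "emp_m2"),
                  variable.name = "sub_year",
                  value.name = "emp")
  # Turn "emp_m1" / "emp_m2" into the integer month index 1 / 2.
  dt_long[, sub_year := as.integer(sub("emp_m", "", sub_year))]
  dt_long[]
}

# Process each year separately, then stack the pieces into one table.
years  <- sort(unique(dt_wide$year))
dt_res <- rbindlist(lapply(years, function(y) tidy_one_year(dt_wide[year == y])))
```

Working on one year at a time means only that year's reshaped copy is built in each step, which is the memory motivation given in the title.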