Add do file for estimating financial distress parameters by igelstorm · Pull Request #315 · simpaths/SimPaths

igelstorm · 2026-01-20T12:05:17Z

What

This PR:

Adds Stata code for estimating parameters for the financial distress process
Replaces the existing financial distress parameters (reg_financial_distress.xlsx) with the output from this script based on the current initial population (this is slightly different from what was there before)

And, unrelatedly (but necessary for the above):

Fixes an issue that caused all of the existing estimation scripts to error when run with the initial population built using the current Stata code:
- The UKHLS variables fihhmnnet1_dv and ieqmoecd_dv are used in input\InitialPopulations\compile\RegressionEstimates\variable_update.do, but weren't being kept in the intermediate data files used (ukhls_pooled_all_obs_09.dta) - now they are
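
A guard along the following lines could catch this kind of omission early. This is a minimal sketch, not part of the PR: it assumes the global $dir_ukhls_data is already set, as it is in the estimation scripts, and the error message and exit code are illustrative choices.

```stata
* Sketch only: stop with a clear error if the pooled file lacks the
* variables that variable_update.do relies on.
use "$dir_ukhls_data/ukhls_pooled_all_obs_09.dta", clear

capture confirm variable fihhmnnet1_dv ieqmoecd_dv
if _rc {
    display as error "fihhmnnet1_dv and/or ieqmoecd_dv missing from the pooled dataset"
    exit 111
}
```

Placing a check like this before the call to variable_update.do would make the failure point at the missing inputs rather than at whichever later command first uses them.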

Why

We want the code used to estimate model parameters to be stored in this repo. This wasn't the case for the financial distress process.

Validation

I've manually verified that the row and column labels are correct.
I've compared the old and the new coefficients and they are broadly similar: reg_financial_distress_CHANGES.xlsx
- Minor differences are expected because the data used to estimate the previous version was different from the current data
- There are non-trivial differences in the coefficients for number of children (dnc) and a few of the UK regions - I'm guessing these might be due to changes in the estimation data since these were previously estimated

dkopasker

Just two comments: one about duplication of a variable, and one about compliance with the variable naming policy (which perhaps happened while you were on leave).

dkopasker · 2026-01-20T13:24:48Z

input/InitialPopulations/compile/02_create_UKHLS_variables.do

 	dimxwt dhhwt jbhrs jshrs j2hrs jbstat les_c3 les_c4 lessp_c3 lessp_c4 lesdf_c4 ydses_c5 month scghq2_dv ydisp ///
 	ypnbihs_dv yptciihs_dv yplgrs_dv ynbcpdf_dv ypncp ypnoab swv sedex ssscp sprfm sedag stm dagsp lhw l1_lhw pno ppno hgbioad1 hgbioad2 der adultchildflag ///
        econ_benefits econ_benefits_nonuc econ_benefits_uc ///
+	fihhmnnet1_dv ieqmoecd_dv ///


The equivalence scale is generated later in the do file (see gen moecd_eq = . //Modified OECD equivalence scale). For consistency, it may be best to use the generated variable.

That's good to know. Currently, ieqmoecd_dv is used to calculate equivalised income in input\InitialPopulations\compile\RegressionEstimates\variable_update.do, which is used by most of the estimation files.

It sounds like a sensible idea to replace this with moecd_eq, but this would affect all estimation scripts and ideally would require all of the scripts to be rerun (i.e. the parameters reestimated). @dav-sonn, @dariaple or others in Essex might have a view on this.
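
One way to gauge how much such a switch would matter, before rerunning everything, is to compare the two scales directly. The sketch below is hypothetical: it assumes both ieqmoecd_dv and moecd_eq are available in the pooled dataset (moecd_eq may not currently be kept there), and eq_diff and the 0.001 tolerance are illustrative choices.

```stata
* Sketch only: compare the UKHLS-supplied scale with the generated one.
use "$dir_ukhls_data/ukhls_pooled_all_obs_09.dta", clear

* Stops with an error if either variable is absent from the file
confirm variable ieqmoecd_dv moecd_eq

* Report where the two variables agree, differ, or are missing
compare ieqmoecd_dv moecd_eq

* Summarise the size of any discrepancy (eq_diff is a hypothetical name)
generate double eq_diff = ieqmoecd_dv - moecd_eq
summarize eq_diff, detail
count if abs(eq_diff) > 0.001 & !missing(eq_diff)
```

If the count is zero or close to it, swapping the variable should change the estimates very little; a large count would suggest a full re-estimation is needed.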

dkopasker · 2026-01-20T13:26:39Z

input/InitialPopulations/compile/RegressionEstimates/reg_financial_distress.do

+* HM1_L: GHQ12 score 0-36 of all working-age adults - baseline effects *
+**********************************************************************
+
+logit financial_distress ///


Does the variable named "financial_distress" comply with the new variable naming structure and does it need to?

Good question. I understand that @dav-sonn's open PR #313 will rename it to "yFinDstrssFlag" in the initial population CSV files. That said, the regression estimation scripts all use the pooled dataset ukhls_pooled_all_obs_09.dta, which is created before this renaming takes place, so all the variables here (exp_emp, lhw_c5, and so on) will continue to have their "old" names.

I think this is a question that goes beyond this PR and should probably be resolved separately (although @dav-sonn might have a view?).

Yeah, it looks like #313 will add the code to change the name in the CSV file after the initial population has been created, independently of the regressions. So this change should be compatible with the current state and should sync up nicely with the refactoring!

As Matteo pointed out below, we haven't refactored the regressors in the processes' estimation scripts. Refactored variables are those in the initial populations, the output CSV files, and the Person, BenefitUnit, and Household classes.
I hope this clarifies!

andrewbaxter439 · 2026-01-20T15:18:32Z

input/InitialPopulations/compile/RegressionEstimates/reg_financial_distress.do

+
+
+**********************************************************************
+* HM1_L: GHQ12 score 0-36 of all working-age adults - baseline effects *


Update heading, along lines of:

Suggested change

-* HM1_L: GHQ12 score 0-36 of all working-age adults - baseline effects *
+* Financial Distress (binary) - estimated log odds of experiencing financial distress *

matteorichiardi · 2026-01-20T15:25:33Z

For the other processes we have agreed not to refactor the estimation scripts, but only refactor the final names...

On Tue, 20 Jan 2026 at 15:19, Andrew Baxter wrote:

In input/InitialPopulations/compile/RegressionEstimates/reg_financial_distress.do:

+*******************************************************************
+
+use "$dir_ukhls_data/ukhls_pooled_all_obs_09.dta", clear
+do "$dir_do/variable_update"
+
+
+
+* Sample selection
+drop if dag < 16
+
+
+xtset idperson swv
+
+
+**********************************************************************
+* HM1_L: GHQ12 score 0-36 of all working-age adults - baseline effects *

Update heading, along lines of:

Suggested change

-* HM1_L: GHQ12 score 0-36 of all working-age adults - baseline effects *
+* Financial Distress (binary) - estimated log odds of experiencing financial distress *

igelstorm and others added 5 commits January 8, 2026 17:11

WIP: duplicate reg_health_mental do file for financial distress

07ad677

WIP: update reg_financial_distress.do

970a01f

re-introducing missing variables for calculating regressions

e1de468

Update labels

8b7d281

Update financial distress coefficients

7c4b83a

igelstorm requested review from andrewbaxter439, dav-sonn and dkopasker January 20, 2026 12:05

Update integration test output

3f87c20

dkopasker reviewed Jan 20, 2026

andrewbaxter439 reviewed Jan 20, 2026

justin-ven added 2 commits January 21, 2026 09:33

Merge branch 'develop' into financial-distress-estimation-files

6a3bad0

update of validation statistics

bf034b5

justin-ven merged commit d99f968 into simpaths:develop Jan 21, 2026
6 checks passed

igelstorm deleted the financial-distress-estimation-files branch January 26, 2026 09:28
