Every phase of this study maps directly to the skills healthcare analytics organizations depend on. Defining a testable hypothesis from a complex federal dataset mirrors the clinical question framing required for HEDIS measure analysis and CMS quality reporting. Building ETL pipelines to resolve schema drift across NSDUH survey years is the same discipline applied to normalizing inconsistent EHR exports, payer claim formats, and ICD-10/CPT code structures across systems. Chi-Square independence testing is directly applicable to denial pattern analysis—determining whether denial rate is statistically dependent on payer type, service line, or coding practice. Translating codebook variables into readable nomenclature for non-technical stakeholders is exactly what RCM teams need when surfacing A/R aging trends or clean claim rates to CFO-level audiences. The full arc demonstrated here—question, pipeline, validation, insight, communication—is the complete workflow of a productive healthcare data analyst.
This analysis approaches a deceptively simple question with rigorous statistical methodology: does failing to treat mental health needs result in measurable workforce detachment? Using the 2019 SAMHSA NSDUH dataset, the study isolates 15,780 adults aged 18–25 and tests for statistical dependence between unmet mental health need and employment status.
The hypothesis is that adults reporting an unmet need will have statistically higher odds of unemployment or labor force detachment—controlling for age, sex, race/ethnicity, and education level.
Unmet mental health need stratified by sex and employment outcome — Chi-Square p < 0.001
| Raw Syntax | Cleaned Output | Data Dictionary Rationale |
|---|---|---|
| AMHTXND2 | unmet_need | Perceived unmet need for mental health treatment. |
| IRWRKSTAT | employment_status | Revised imputed employment status. |
| IRWRKSTAT18 | employment_status_18_plus | Age-restricted cohort analysis base. |
| AMHSVTYP | mental_health_treatment_type | Type of mental health treatment received. |
| SPDYRADJ | serious_psychological_distress | Standardized variable mapping across years. |
| catage / AGE | age_group | Unified age categorization across TEDS-A and NSDUH pipelines. |
| PSYPROB | cooccurring_mental_substance_disorder | Makes clinical correlation explicit. |
| EDUC | education_level | Education at time of admission/survey. |
| PRIMINC | primary_income_source | Principal source of income/support. |
| DETNLF | not_in_labor_force_detail | Detail matrix for labor force detachments. |
When people do not have an unmet mental health need, the vast majority are securely engaged in full-time work. When people do suffer from an unmet mental health need, the rate of total unemployment and severe workforce detachment spikes substantially relative to population size.