BACKGROUND: This study aimed to validate self-reported medical conditions in the Taiwan Biobank (TWBB), in which participants were inquired about 30 disease conditions, by comparing them with claims records from Taiwan's National Health Insurance (NHI) claims database. METHODS: We identified 30 clinical diagnoses using ICD-CM codes from ambulatory and hospital claims within the NHI claims database, matching diseases included in the TWBB. The concordance between self-reports and claims records was evaluated using tetrachoric correlation to assess the correlation between binary variables. RESULTS: A total of 131,834 participants aged 30-70 years with data from the TWBB and NHI records were included. Concordance analysis revealed tetrachoric correlations ranged from 0.420 (chronic obstructive pulmonary disease) to 0.970 (multiple sclerosis). However, several disorders exhibited lower tetrachoric correlations. The concordance was higher among those with higher education attainment, and lower among married individuals. CONCLUSION: The concordance between self-reports in the TWBB and NHI claims records varied across clinical diagnoses, showing inconsistencies depending on participant characteristics. These findings underscore the need for further investigation, especially when these variables are crucial to research objectives. Integrating complementary databases such as clinical diagnoses, prescription records, and medical procedures can enhance accuracy through customized algorithms based on disease categories and participant characteristics and optimize sensitivity or positive predictive values to align with specific research objectives.
Date:
2024-07-20
Relation:
Journal of Epidemiology. 2024 Jul 20;Article in Press.