Psychometric Validation of the Ultrashort Slovenian (OHIP-SVN5) and Croatian (OHIP-CRO5) Oral Health Impact Profile Questionnaires
Artikel-Kategorie: Original scientific article
Online veröffentlicht: 01. Sept. 2025
Seitenbereich: 167 - 177
Eingereicht: 19. Mai 2025
Akzeptiert: 25. Juli 2025
DOI: https://doi.org/10.2478/sjph-2025-0022
Schlüsselwörter
© 2025 Ksenija Rener-Sitar et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
Dental patient-reported outcome measures are important psychometric instruments that enable patients to self-assess the impacts of oral diseases and evaluate dental treatment modalities. Among several psychometrically validated questionnaires assessing self-perceived oral health, the Oral Health Impact Profile (OHIP) is the most frequently used and methodologically investigated instrument. Oral Health-Related Quality of Life (OHRQoL) is typically assessed nowadays using one version of the OHIP questionnaire, which consists of 49, 14, or 5 items ( 1,2,3). Additionally, disease-specific OHIP instruments have also been developed, such as OHIP-EDENT for edentulous patients (4), OHIP-TMD (5) for patients whose problems originate from the temporomandibular joints (6), or OHIP-ESTHET for assessment of orofacial aesthetics (7). The questionnaires, consisting of multiple items, place a significant burden on the person completing them because a lot of time is required, and some items may be unintentionally omitted. However, the 5-item OHIP version represents a questionnaire with the lowest burden for the patient, as five responses can be chosen on a 5-point ordinal rating scale, still capturing at least one question from each of the four dimensions of OHRQoL, specifically, Oral Function, Orofacial Pain, Orofacial Appearance, and Psychosocial Impact (8,9,10,11). Recently, it was proposed to use the ultra-short version, i.e., the OHIP-5, to replace the more extended versions, because it has been proven that summary scores of the 5-, 14-, and 49-item versions are highly correlated (12, 13). Using the shortest version should have the greatest potential to facilitate the future clinical use of the OHIP instrument (14).
In Slovenia, the OHIP-49 and OHIP-14 have already been validated (15, 16). In Croatia, the OHIP-49, OHIP-14 and OHIP-EDENT have been validated through psychometric testing ( 16,17,18). However, the shorter, that is, the 14-item OHIP version, has been used more often in research and clinical settings than the original, longer 49-item version ( 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32). To reduce respondents’ and examiners’ burden and time consumption, and to ensure that all questions are answered, many countries have already developed and validated ultra-short OHIP-5 versions ( 3, 33, 34, 35, 36, 37, 38, 39, 40). In Chile, the ultra-short OHIP 7-item version was also validated (41).
This study aimed to validate the Slovenian and Croatian versions of the OHIP-5 questionnaire in a target population. We hypothesised that the Slovenian and Croatian versions of the OHIP-5 would demonstrate adequate internal consistency, construct validity, test-retest reliability and responsiveness.
The Slovenian and Croatian versions were translated from the English 5-item version (3) using established standards (42). In each country, a professional translator and a dentist fluent in English, with international experience in English-speaking countries, performed the initial translation into Slovenian or Croatian. After review by two English-fluent dentists, the final versions were back-translated into English by another professional translator in collaboration with another dentist fluent in English, separately in each country. Both back-translations were then independently evaluated and compared with the original English version by two native English speakers.
The institutional ethics committees in Slovenia and Croatia approved the study under reference numbers 0120-219/2017-3 and 05-PA-26-6/2015, respectively. The participants were selected from different populations. Their mean age, gender, age range and research purposes for psychometric validation of the OHIP-5 questionnaire in Slovenia and Croatia are presented in Table 1.
Age, gender and research purpose of the respondents answering five questions of the Oral Health Impact Profile (OHIP-5).
400 (72.8) | 53.8 (9.7) | 28–70 | Internal consistency, EFA, Concurrent validity, Known group validity | |
223 (65.5) | 59.96 (14.6) | 28–82 | Internal consistency, EFA, Concurrent validity, Known group validity | |
38 (68.4) | 22.6 (1.7) | 20–26 | Test-retest reliability | |
30 (70.0) | 22.0 (1.4) | 19–25 | Test-retest reliability | |
30 (54.0) | 50.8 (14.2) | 18–70 | Responsiveness (sensitivity to change) | |
30 (65.0) | 65.5 (10.4) | 37–81 | Responsiveness (sensitivity to change) |
Legend:
EFA - exploratory factor analysis
Educational and research institutions in Ljubljana, Slovenia
People in the general practice doctor’s waiting room who came for a referral, examination, or due to sick leave for themselves or somebody else
University of Ljubljana, Medical Faculty, Department of Prosthodontics, Slovenia.
University of Zagreb, School of Dental Medicine, Department of Prosthodontics, Croatia.
University Medical Center of Ljubljana, Department of Oral and Maxillofacial Surgery, Slovenia.
A total of 663 participants from the general population (400 in Slovenia and 223 in Croatia) were involved in the research, as well as 68 students (38 in Slovenia and 30 in Croatia), and 60 dental patients with treatment needs (30 in each country). Each participant received a thorough written explanation of the study’s purposes and procedures and was assured of anonymity. In Slovenia, participants were recruited consecutively from among employees in educational or research institutions, while in Croatia, participants were recruited consecutively from among patients and their accompanying persons in the waiting room of their family practice doctors (Table 1). The responses to the OHIP-5 questions were made on a 5-point Likert scale (0=never; 1=hardly ever; 2=occasionally; 3=fairly often; and 4=very often) and referred to the period of the last 7 days (43). In addition to the OHIP-5 questions, the participants recruited from the Slovenian general population also answered questions about their age, gender, denture wear (either fixed or removable), and self-assessed their oral health and oral appearance on a 4-point scale (0=excellent; 1=good; 2=fair; 3=poor). In Croatia, the participants from the general population also answered questions about their age, gender and whether they wore a removable denture (yes or no), and assessed their self-perceived oral health, which was reported on a 5-point Likert scale as 0=excellent; 1=very good; 2=good; 3=fair; 4=poor. All participants completed the written forms, and no questionnaire had missing data in either country. Dental students in both countries also participated and completed the OHIP-5 questionnaire twice within 2-week intervals without any changes that could influence their OHRQoL. For recruited patients who required treatment, a trained dentist in each country conducted an oral examination before the first administration of the OHIP-5 questionnaire.
Two types of reliability were assessed: internal consistency and test-retest reliability. The internal consistency was evaluated by calculating the Cronbach’s α reliability coefficient, the Cronbach’s α reliability coefficient if the item was deleted, and the average inter-item correlation for the OHIP scores (44). The Cronbach’s α values ≥0.70 were considered satisfactory (45), while values >0.20 were considered acceptable for the inter-item correlation (46). For the test-retest reliability of the OHIP-5 Questionnaire, a convenience sample of 38 dental students was selected in Slovenia, and a sample of 30 was chosen in Croatia. Subjects were not allowed to undergo oral/dental treatments for two weeks; if any oral problems occurred, they would be excluded. The intraclass correlation coefficients (ICC) were calculated based on the one-way repeated-measures analysis of variance (ANOVA) from the repeated administration of the same questionnaires within a two-week period (47). The ICC>0.80 indicated excellent agreement, 0.61–0.80 good agreement, 0.41–0.60 moderate agreement and <0.40 poor agreement.
Exploratory factor analysis (EFA) was performed to test the dimensionality of the OHIP-SVN5 and OHIP-CRO5, aiming to determine whether the data fit a unidimensional model. Before the analysis, the Kaiser-Meyer-Olkin (KMO) statistic of sampling adequacy and Bartlett’s test of sphericity were made. The Kaiser-Meyer-Olkin values should be above 0.6, and Bartlett’s test of sphericity should have a significance of <0.0001 to perform the EFA. A minimum eigenvalue of 1 was assigned as the factor extraction criterion, and item loadings ≥0.4 were considered sufficient. A scree-plot was also designed, and the total variance was calculated.
Two types of validity were assessed: concurrent validity and known-group validity.
The association between self-reported oral health and the OHIP-5 summary scores was assessed by calculating the Spearman rank correlation. In the Slovenian sample from the general population, concurrent validity was evaluated using the Spearman rank correlation between the OHIPSVN5 summary scores and self-perceived oral health, as well as between the OHIP-SVN5 summary scores and self-perceived orofacial aesthetics. In Slovenian participants, self-perceived oral health and self-perceived orofacial aesthetics were rated on a 4-point scale, ranging from zero to three (0=excellent; 1=good; 2=fair; 3=poor). Self-perceived oral health in Croatia was assessed on a 5-point Likert scale (0=excellent; 1=very good; 2=good; 3=fair; 4=poor).
Tests were conducted to determine whether the OHIP-5 scores discriminate between two groups known to differ. This was assessed by testing the differences of each item and the OHIP-5 summary scores between the groups expected to have differences in OHRQoL impairment. In the Slovenian general population, validity was tested between participants with a denture (fixed or removable) and participants with natural teeth. In contrast, the validity of the Croatian general population sample was tested by comparing participants with removable dentures to those without them. The non-parametric Mann-Whitney U test was used for that purpose. Based on our previous research (15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32), we assumed that individuals with dentures would have worse oral health than those with natural teeth.
Responsiveness of the OHIP-SVN5 and the OHIP-CRO5 was tested in 30 Slovenian and 30 Croatian patients with treatment needs (Table 1). Slovenian patients required tooth extraction following unsuccessful endodontic treatment in the posterior jaw regions, while Croatian patients needed new complete dentures. All patients completed the OHIP-5 questionnaires twice: first, immediately before treatment, and second, one month after treatment. We assumed that OHRQoL would improve after treatment, compared to the status before treatment. The significance of the differences in the OHIP-5 scores between the baseline and the follow-up administrations was tested using the Wilcoxon Rank Sign non-parametric test and by calculating the standardised effect size (48) using the formula: (Baseline OHIP score - follow-up OHIP score)/(Standard deviation of baseline OHIP score). According to Cohen, effect sizes are classified as small (0.2), medium (0.5), or large (0.8) (49).
The Cronbach’s α coefficients were 0.766 for the OHIPSVN5 and 0.706 for the OHIP-CRO5 questionnaire. Table 2 presents the mean values, standard deviations, and Cronbach’s alpha coefficients for the Slovenian and Croatian OHIP-5 versions after one item was deleted. Table 3 shows inter-item correlation matrices of the Slovenian and Croatian OHIP-5 Questionnaires. All inter-item correlations were greater than 0.20 in the Slovenian and Croatian OHIP-5 questionnaires and were considered acceptable (46).
Means, standard deviations and Cronbach’s alpha when the item was deleted from the OHIP-SVN5 and OHIP-CRO5 questionnaires.
0.97 | 1.10 | 0.69 | 0.87 | 1.04 | 0.60 | |
0.67 | 0.85 | 0.74 | 0.41 | 0.76 | 0.67 | |
0.70 | 1.08 | 0.71 | 0.82 | 1.03 | 0.70 | |
0.24 | 0.64 | 0.73 | 0.25 | 0.53 | 0.64 | |
0.18 | 0.56 | 0.74 | 0.17 | 0.49 | 0.68 |
Inter-item correlation matrix of the OHIP-SVN5 and the OHIP-CRO5.
1.000 | |||||
0.464 | 1.000 | ||||
0.536 | 0.361 | 1.000 | |||
0.447 | 0.280 | 0.428 | 1.000 | ||
0.409 | 0.356 | 0.388 | 0.556 | 1.000 | |
1.000 | |||||
0.424 | 1.000 | ||||
0.412 | 0.231 | 1.000 | |||
0.469 | 0.339 | 0.351 | 1.000 | ||
0.369 | 0.315 | 0.240 | 0.516 | 1.000 |
The Kaiser-Meyer-Olkin (KMO) statistic for sampling adequacy was 0.783 for the OHIP-SVN5 and 0.768 for the OHIP-CRO5. The Bartlett’s test for sphericity was 537.76, with p<0.0001 in the Slovenian OHIP-5 version, while the Bartlett’s test for sphericity was 232.68, with p<0.0001 in the Croatian OHIP-5 version. The results of the EFA for the Slovenian and Croatian versions of the OHIP-5 questionnaire revealed a one-dimensional model, as all items loaded onto a single latent factor. The OHIP-SVN5 explained 52% of the variance, while the OHIP-CRO5 explained 50%. The scree plot also shows a one-factorial model for both language versions. The results of factor loadings are presented in Table 4. The Scree plots are presented in Figure 1.

Scree-plots of the OHIP-SVN5 and OHIP-CRO5.
Factor loadings of one dimension of the OHIP-5 questionnaire.
0.787 | 0.774 | |
0.651 | 0.642 | |
0.744 | 0.611 | |
0.744 | 0.780 | |
0.740 | 0.699 |
Extraction Method: Principal Component Analysis. One component extracted.
The test-retest reliability results of the Slovenian and Croatian students are presented in Table 5. The ICCs indicated very good to excellent reliability. There were no significant differences between the questionnaires completed within two weeks, either for each item or for the OHIP summary scores (p>0.05). In the Slovenian version, the ICC was not computed for the item: ‘Less flavour in food’, as the difference was zero.
Test-retest reliability measured by intraclass correlation coefficients (ICC) of the Slovenian and Croatian versions of the 5-item Oral Health Impact Profile (OHIP-SVN5 and OHIP-CRO5).
Difficulty chewing | 0.74 | −0.03 (0.43) | −0.169 – 0.12 | 0.711 N.S. |
Painful aching | 0.65 | 0.105 (0.61) | −0.093 – 0.30 | 0.291 N.S. |
Uncomfortable about appearance | 0.82 | 0.105 (0.45) | −0.043 – 0.25 | 0.160 N.S. |
Less flavour in food | N.C. | 0 (0) | N.C. | N.C. |
Difficulty doing usual jobs | 0.70 | −0.026 (0.60) | −.080 – 0.03 | 0.324 N.S. |
Difficulty chewing | 0.84 | 0.033 (0.32) | −0.086 – 0.153 | 0.537 N.S. |
Painful aching | 0.70 | 0.033 (0.18) | −0.035 – 0.10 | 0.326 N.S. |
Uncomfortable about appearance | 0.76 | 0.067 (0.37) | −0.070 – 0.20 | 0.326 N.S. |
Less flavour in food | 0.70 | 0.033 (0.18) | −0.035 – 0.10 | 0.326 N.S. |
Difficulty doing usual jobs | 0.70 | 0.033 (0.18) | −0.035 – 0.10 | 0.326 N.S. |
Legend:
ICC=Intraclass correlation coefficient
p=p-value
N.S.=p>0.05
N.C.=not computed because the standard error of the difference was 0
The results of the concurrent validity of the Slovenian and Croatian versions of the OHIP-5 questionnaires are presented in Table 6. The associations between self-reported oral health and the OHIP-5 summary scores, as assessed using the Spearman rank correlation, were positive and significant in both versions of the OHIP-5. Additionally, a significant association was found between self-reported orofacial aesthetics and the OHIP-5 summary scores in the Slovenian version.
Concurrent validity of the Slovenian and Croatian OHIP-5 Questionnaires.
excellent | 51 | 0.69 (1.16) | 0.601** |
good | 224 | 1.86 (2.06) | |
fair | 94 | 4.33 (3.38) | |
poor | 31 | 7.77 (3.81) | |
excellent | 52 | 1.12 (1.71) | 0.510** |
good | 213 | 1.83 (2.12) | |
fair | 106 | 3.99 (3.17) | |
poor | 29 | 7.90 (4.29) |
excellent | 85 | 0.31 (0.64) | 0.892** |
very good | 48 | 1.93 (1.11) | |
good | 53 | 4.36 (1.06) | |
fair | 23 | 6.61 (2.27) | |
poor | 4 | 10.25 (0.96) |
Legend:
p<0.01
The known-groups validity (i.e., divergent validity) was assessed by testing the significance of the differences in the OHIP-SVN5 and OHIP-CRO5 summary scores between groups, which were expected to have differences in OHRQoL impairment, and are presented in Table 7. The non-parametric Mann-Whitney U test was applied. It revealed significantly more impaired OHRQoL, as indicated by higher scores on each of the OHIP-5 items and higher summary scores for both OHIP-5 versions, in individuals wearing dentures.
Known-groups validity, assessed by the Mann-Whitney U test.
Difficulty chewing | no | 304 | 0.74 | 0.93 | −7.020 | <0.001** |
yes | 96 | 1.70 | 1.24 | |||
Painful aching | no | 304 | 0.57 | 0.79 | −4.174 | <0.001** |
yes | 96 | 0.98 | 0.94 | |||
Uncomfortable about appearance | no | 304 | 0.53 | 0.93 | −4.910 | <0.001** |
yes | 96 | 1.23 | 1.34 | |||
Less flavour in food | no | 304 | 0.14 | 0.49 | −5.262 | <0.001** |
yes | 96 | 0.54 | 0.92 | |||
Difficulty doing usual jobs | no | 304 | 0.12 | 0.46 | −3.214 | <0.001** |
yes | 96 | 0.34 | 0.80 | |||
no | 304 | 2.10 | 2.56 | −7.080 | <0.001** | |
yes | 96 | 4.80 | 3.86 |
Difficulty chewing | no | 128 | 0.30 | 0.57 | −10.031 | <0.001** |
yes | 95 | 1.65 | 1.03 | |||
Painful aching | no | 128 | 0.20 | 0.55 | −4.754 | <0.001** |
yes | 95 | 0.68 | 0.90 | |||
Uncomfortable about appearance | no | 128 | 0.54 | 0.79 | −4.534 | <0.001** |
yes | 95 | 1.20 | 1.18 | |||
Less flavour in food | no | 128 | 0.07 | 0.31 | −6.153 | <0.001** |
yes | 95 | 0.48 | 0.67 | |||
Difficulty doing usual jobs | no | 128 | 0.05 | 0.29 | −4.476 | <0.001** |
yes | 95 | 0.32 | 0.64 | |||
no | 128 | 1.17 | 1.67 | −9.103 | <0.001** | |
yes | 95 | 4.34 | 2.84 |
Legend:
p<0.01
Means and standard deviations of the pre- and post-treatment scores, mean differences, Z values, significance of the differences, effect sizes for each item, and OHIP-5 summary scores in Slovenian and Croatian patients are presented in Table 8.
Sensitivity to change (responsiveness) of the OHIP-SVN5 (N=30) and OHIP-CRO5 (N=30), pre- versus post-treatment.
Difficulty chewing | 2.33±0.96 | 1.87±0.82 | 0.47±0.68 | −2.64 | 0.008** | 0.48 |
Painful aching | 2.70±1.02 | 1.90±0.71 | 0.80±0.99 | −3.67 | 0.001** | 0.78 |
Uncomfortable about appearance | 1.90±0.84 | 1.96±0.98 | 0.06±0.69 | −0.58 | 0.564 N.S. | 0.07 |
Less flavour in food | 1.43±0.73 | 1.23±0.50 | 0.20±0.55 | −1.90 | 0.056 N.S. | 0.28 |
Difficulty doing usual jobs | 1.47±0.63 | 1.20±0.41 | 0.27±0.58 | −2.31 | 0.021* | 0.43 |
Difficulty chewing | 2.67±1.23 | 1.57±1.33 | 1.10±1.44 | −3.35 | 0.001** | 0.89 |
Painful aching | 0.53±1.04 | 0.17±0.38 | 0.37±0.89 | −2.31 | 0.021* | 0.35 |
Uncomfortable about appearance | 2.60±1.40 | 0.87±1.17 | 1.73±1.55 | −4.10 | <0.001** | 1.24 |
Less flavour in food | 0.80±1.19 | 0.40±0.89 | 0.40±0.81 | −2.44 | 0.015* | 0.34 |
Difficulty doing usual jobs | 0.77±1.10 | 0.20±0.61 | 0.57±0.94 | −2.85 | 0.004** | 0.52 |
Legend:
p<0.05;
p<0.01; N.S. p>0.05
This study assessed the psychometric properties of the two ultrashort OHIP versions, specifically the OHIP-5 instrument for the Slovenian and Croatian language-speaking populations. The results revealed satisfactory psychometric characteristics for both instruments (OHIPSVN5 and OHIP-CRO5), which may be used in clinical and research settings to assess OHRQoL and to distinguish between individuals with different levels of perceived oral health. Although other versions of the OHIP instrument already existed in Slovenia and Croatia (15,16,17,18), the 5-item OHIP had to be psychometrically validated for both language versions, since it has the least burden for the respondents among the various OHRQoL questionnaires. Internal consistency of the Slovenian and Croatian OHIP-5 instruments was confirmed by the Cronbach’s α coefficients >0.7, and the test-retest reliability had excellent ICC values. No significant difference was observed in the two-week period between the completion of the same questionnaire (p<0.05).
The factor analysis of the Slovenian and Croatian versions of the OHIP-5 questionnaire showed a one-dimensional model and good factor loadings. In clinical and public health studies, a unidimensional structure simplifies the use of the ultrashort OHIP version, as it allows simple scoring and interpretation. For this reason, the ultrashort 5-item OHIP version has gained increasing popularity worldwide over the past five years. Scree plots also indicated a one-factor solution, as all items loaded onto a single latent factor. The one-dimensional model of the OHIP-5 questionnaire was also determined in other language versions where factor analysis was performed (34, 35, 37).
Concurrent validity was confirmed by significant associations between self-reported oral health, self-reported orofacial aesthetics and the OHIP-SVN5 summary scores, as well as between self-reported oral health and OHIP-5 summary scores in the Croatian population. Both the OHIP-SVN5 and OHIP-CRO5 distinctly and significantly distinguished between different groups with differences in oral health impairment, i.e., between denture wearers and individuals with teeth. It is well-known that individuals wearing dentures have more problems during masticatory function, perceive more pain in function, have more psycho-social problems due to denture instability than individuals with natural teeth (23, 26, 50,51,52,53,54), and even cognitive decline has been attributed to lower chewing forces (55, 56).
Responsiveness to change of the OHIP-5 questionnaires was investigated to find out whether the instrument is sensitive to measuring changes elicited by a treatment. In Slovenian patients whose therapy involved a posterior tooth extraction after unsuccessful endodontic treatment, only the items ‘Difficulty chewing’, ‘Painful aching’, and ‘Difficulty doing usual jobs’ decreased, as well as the OHIP-5 summary score, indicating less impaired oral health after the treatment. The effect size of the treatment of 0.78 was medium (almost large). However, orofacial appearance did not change significantly after tooth extractions, as all were done in the posterior regions. Additionally, the item ‘Less flavour in food’ did not change significantly, which was an expected result following such treatment. In Croatia, the scores of all OHIP-5 items and the summary score were significantly reduced after treatment, indicating improvement in OHRQoL one month after the delivery of new complete dentures, with a large treatment effect size. The improvement observed reflects an intra-individual change, that is, a patient’s OHRQoL improves after receiving new dentures. This does not contradict our previous assumption that, on an inter-individual level, people with complete dentures generally report poorer OHRQoL compared to those with natural teeth. All listed results indicated that the OHIP-5 questionnaire, although one-dimensional, is consistent with the four-dimensional model of oral health, similar to other ultrashort OHIP versions (34, 35, 37, 57).
This study also has some limitations that should be acknowledged. The test-retest reliability was assessed in student subsamples, which may limit the generalisability of our reliability findings to the broader adult population. In addition, although our patient samples provided valuable insights into the responsiveness of the two validated OHIP-5 language versions in patients with oral health problems, they may not fully represent the general population in terms of sociodemographic and health-related characteristics. Future studies could address these aspects by including more diverse study samples and by establishing normative values for the OHIP-SVN5 and OHIP-CRO5.
The results of this study revealed good psychometric properties for both the Slovenian and Croatian ultrashort OHIP versions (i.e., the OHIP-SVN5 and OHIP-CRO5), with minimal burden on the respondents. This contributes to the international alignment of OHRQoL measurement by supporting the use of a standardised, cross-culturally adapted OHIP-5 instrument in Slovenian and Croatian populations. The one-dimensional structures of both language versions are similar to other ultrashort OHIP versions, consisting of five items.