|
|
||||||||
1 Wellington Asthma Research Group, Dept of Medicine, School of Medicine, Wellington, New Zealand. 2 Dept of Pediatric Respiratory Medicine, Faculty of Medical Sciences, Hospital Centro de Referencia de Salud El Pino, University of Santiago de Chile, Santiago, Chile. 3 Dept of Community Health, and 4 Dept of Paediatrics, Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
CORRESPONDENCE: J. Crane, Dept of Medicine, School of Medicine, PO Box 7343, Wellington, New Zealand. Fax: 64 43895427. E-mail: crane@wnmeds.ac.nz
Keywords: asthma, International Study of Asthma and Allergies in Childhood, prevalence, video questionnaire, wheezing, written questionnaire
Received: May 18, 2002
Accepted November 14, 2002
| Abstract |
|---|
|
|
|---|
In general, responses to the video questionnaire gave a lower prevalence than the written questionnaire and responses were closely correlated. The overall proportion of agreement was high, mean 0.89, but unbalanced, with good negative agreement but poor positive agreement. Chance corrected agreement using Cohen's kappa coefficient, was generally low, with only 20 centres with kappa >0.4. The contribution of each questionnaire to wheezing prevalence also varied between centres and suggests that written questions about wheezing are variably understood and interpreted by 1314 yr olds.
International comparisons of wheezing and its audiovisual presentation suggest that adolescents interpret a written question about wheezing differently from its audiovisual presentation and that this interpretation shows variation between centres. This relationship and the interpretation of both written and audiovisual presentation of symptoms requires further study in order to better predict asthma.
Epidemiological surveys of asthma have relied largely on questionnaire responses to questions about asthma symptoms, a diagnosis of asthma or drug treatment used for asthma management. Each of these has attendant problems. Symptoms may not be specific to asthma and may be difficult for the patient to interpret 1, while a diagnosis of asthma or drug therapy may be biased by the availability of healthcare services 2. These issues become particularly important when measuring asthma prevalence over time. Magnus and Jaakkola 3 have reviewed repeat asthma prevalence studies and concluded that diagnostic fashion and criteria may have changed with time and that the everyday perceptions of terms such as wheezing may well have changed, leading to an increased awareness, and hence to increased reporting. This review underlines the importance of understanding responses to questionnaires over time, but it is equally important to understand how their interpretation differs between countries, languages and cultures.
More recently, simple field methods for measuring bronchial hyperresponsiveness (BHR) have been developed and refined 4, 5, and have been used, often in conjunction with skin-prick tests, in an attempt to differentiate a group with atopy who have greater clinical severity and physiological abnormality 6. While BHR is useful as a marker of an airway abnormality strongly associated with asthma, it is not specific for asthma, may not accurately reflect asthma severity 7 and is too time consuming and expensive for large-scale surveys. These difficulties and the problems associated with questionnaire responses across languages and cultures 8 led the authors of this paper to develop and validate a video questionnaire for measuring asthma symptoms in children (a shortened version of the video questionnaire can be viewed at www.wnmeds.ac.nz/academic/med/warg/ADV.html) 9, 10. Following various refinements, this questionnaire has been used as an optional part of International Study of Asthma and Allergies in Childhood (ISAAC) for self-completion by 1314 yr olds. ISAAC provides a unique opportunity to compare the audiovisual presentation of asthma symptoms and signs with written equivalents, and to examine the interpretation of asthma symptoms when presented in a variety of languages and cultures.
| Materials and methods |
|---|
|
|
|---|
The aims, methods and structure of the ISAAC project have been described previously 11. Briefly, ISAAC is an international collaboration with three aims: 1) to compare the international variation in the prevalence and severity of asthma, rhinitis and eczema in children using simple self-completed questionnaires; 2) to provide a baseline for assessment of trends in the prevalence and severity of these diseases; and 3) to provide a framework for further aetiological research into their causes.
The sampling frame was schools containing children aged 67 and 1314 yrs, within a specified geographical area (ISAAC centre). Children aged 1314 yrs were selected because they were able to self-complete the written and video questionnaires. This paper concerns only 1314-yr-old children and only those centres that completed both the written and video questionnaires. The recommended sample size was 3,000, used to ensure good prevalence estimates for severe asthma. The ISAAC core questionnaires have been published previously 11, and both the written and video questionnaires were piloted prior to the main study 12. Most of the questions used for asthma symptoms are based on questions used in previous respiratory epidemiological studies and include both sensitive and specific indicators of asthma 1.
Translation
In 71% of centres the written questionnaire was translated from English into one (86%) or more local languages. The method of translation was standardised, according to guidelines developed in Germany 13 and adapted for ISAAC (ISAAC document 41). These guidelines included the use of translators familiar with asthma terminology (98% of centres that used translations), consultation with the local community (67%) and back translation to English by an independent translator (95%).
Video questionnaire
Two versions of the video questionnaire, the "European" version (AVQ 2.0) and the International version (AVQ 3.0), were used in the ISAAC Phase One study. Each contains five short sequences of asthma symptoms and signs. Each sequence is followed by three questions asking the respondent if their breathing has ever been like the person's in the video, if "yes" "in the last 12 months?" and if "yes" again "in the last month?". The first sequence shows a young person seated with clearly audible wheezing, but without breathlessness and no evidence of airway obstruction. Four further sequences are shown in the video: 1) exercise-induced wheezing, 2) waking at night with wheezing, 3) nocturnal coughing and 4) a final sequence showing a severe attack of asthma.
The International version was developed to make the exercise sequence more universally applicable, by showing running rather than indoor aerobic exercise, and also to provide ethnic diversity amongst the subjects in the video. In the European version all subjects are Caucasian, in the international version subjects include two Caucasians, two Maori males, one Chinese male and one Indian female. In the European version the text of the questions appears on the screen translated into the local language, while in the international version no written text appears on screen and a field researcher reads the same instructions to the subjects. The first video sequence, showing wheezing at rest, and the fourth sequence, showing coughing at night, are identical in both versions. Only the first video sequence has been compared with its written counterpart in this paper.
The video questionnaire was encouraged but not a mandatory part of ISAAC Phase One. All children were shown the video after they had completed the written questionnaire. Both questionnaires have been validated against nonspecific BHR among children in New Zealand, Australia and Hong Kong; the European version in 1214 yr olds in New Zealand 9, 10, the international version amongst 1218-yr-old Chinese children in Hong Kong 14 and among 1314-yr-old students from mixed ethnic backgrounds in Australia 15. This latter validation was against a challenge with hypertonic saline. Each of these studies showed that video questions overall had similar sensitivity and specificity for BHR as the written questions.
Data management and analysis
Each collaborating centre or a national centre entered information on the questionnaires exactly as recorded by the child (except five centres where data was re-coded to eliminate inconsistent responses). Thus apparent inconsistencies between responses to the stem and branch questions were generally accepted and not re-coded. Children in some centres answered the written questionnaire but did not see the video, they have been excluded from this analysis.
Regional classification
Centres have been classified by World Health Organisation (WHO) regions. For the purposes of this comparison, regional inclusions of five centres have been changed from the previous WHO classification outlined in the published methods 11. Karachi has been included in Southeast Asia instead of the Eastern Mediterranean and the four Finnish centres have been included in Western Europe rather than North Eastern Europe. These changes were made a priori before any analysis had been undertaken. Pakistan is geographically part of Southeast Asia and shares closer cultural and linguistic ties with India than the Eastern Mediterranean. The Finnish centres appeared to be more closely aligned with Western Europe economically and culturally than with the recently independent Baltic States, Russia and Poland.
Measures of agreement
A variety of measures of agreement have been used in studies comparing categorical and numerical measurements. Cohen's kappa has often been used for measuring reliability between observers, as it compares the observed agreement with perfect agreement while correcting for chance. Landis and Koch 16 have suggested kappa values
0.4 indicate poor agreement, values of 0.40.6 moderate agreement and
0.8 excellent agreement. However, as an omnibus index, kappa suffers from a number of problems that need to be taken into account when trying to understand the relationships between observations. The behaviour of this coefficient with varying prevalence was originally explored by Kraemer 17 and subsequently the paradox of high proportions of agreement (Po) but low kappa values was reviewed by Feinstein and Cicchetti 18, 19. Their observations on kappa are relevant to the comparisons of the ISAAC questionnaires. They have argued that in examining agreement, four measures should be presented, kappa, Po and the proportion of positive and negative agreement (Ppos, Pneg). This is because kappa will tend to diverge from the proportion of agreement in relation to prevalence and to an even greater extent when there is a large disparity in the observer's agreement for positive or negative ratings. Proportions of positive and negative agreement illustrate this disparity.
These four measures have been calculated. The proportions of agreement were calculated, as suggested by Feinstein and Cicchetti 18, 19.
| Results |
|---|
|
|
|---|
Comparison of written and video questionnaire responses
In all but nine centres the frequency of positive responses to the written question on wheezing in the last 12 months was greater than for the video question. Figure 1
shows the correlation between the written and video responses to the first video sequence and the written question on wheezing in the last 12 months. The other four video sequences were also highly correlated with their written counterparts with R2 values of 0.52 for exercise-induced wheezing, 0.74 for waking with wheezing, 0.57 for waking with coughing and 0.71 for the severe wheezing sequence (not shown).
|
|
|
|
|
|
| Discussion |
|---|
|
|
|---|
In examining agreement across the 99 centres, the generally low level of agreement also shows considerable variation between centres and to a lesser extent variation by region and amongst different languages. Why is chance corrected agreement low and why does it vary between centres and what does it tell us about reported wheezing in 1314 yr olds? There are a number of reasons for low levels of agreement, all or any of which are likely to be operative both within and between centres and between language groups.
First, kappa suffers from inherent problems as an omnibus measure of agreement, as do all such measures. Thus whenever the prevalence varies between the two questionnaires, kappa can never attain a value of 1, perfect agreement is impossible. This becomes more marked at the extremes of prevalence and with very high degrees of discordance 23. Various strategies have been employed to try and correct these difficulties using different forms of kappa but none have proved particularly useful 18. Kappa is most often used to compare rater responses for a particular outcome where the prevalence is unknown and where there is no gold standard. In this study, use of kappa is somewhat unusual. The current study compares subject responses to two different forms of a question designed to capture similar information. The study is inherently less interested in the level of agreement per se but rather in what agreement or lack of agreement tells us about the interpretation of wheezing and asthma in different populations.
Secondly, the comparison was undertaken by 1314 yr olds in very different environments and their understanding of the term wheezing is likely to differ from the present authors' understanding, and may vary by region and centre. English-speaking children have difficulty in understanding and explaining the term wheezing when asked by a physician 24. Responses of 1314 yr olds to the written ISAAC questionnaire give symptom prevalence rates as much as two-fold higher than their parents 25. This may be because parents do not appreciate the frequency of symptoms that their children actually experience or that the 1314 yr olds misinterpret symptom questions.
Evidence of differences in interpretation are apparent from the ISAAC study. For example, the written question about wheezing in the last 12 months was designed to collect information on any episode of wheezing during that period. Amongst 1314 yr olds, in every centre except two, children reported higher levels of wheezing after exercise in the previous 12 months than of any wheezing in the past 12 months. In contrast, amongst 67 yr olds, where the identical questions were completed by the children's parents, exercise wheezing in the last 12 months was reported at a lower frequency than any wheezing in every centre 26. Thus the written question to 1314 yr olds has not captured all wheezing in the previous 12 months and was generally interpreted by almost all children to exclude exercise wheezing. The context of the written question on wheezing in the past 12 months has been differently interpreted by 1314 yr olds compared with what was expected.
Thirdly, the video question may have been interpreted as illustrating more severe wheezing than that experienced by individuals. This is supported by the higher prevalence of wheezing from the written question compared to the video in 90 of the 99 centres. The video questionnaire was designed to illustrate wheezing. The subject in the video sequence had no objective evidence of airflow obstruction at the time of the filming. Validation exercises in Australia and Hong Kong suggested similar sensitivity and specificity for BHR as the written question suggesting that they were measuring wheezing of similar severity, at least similar severity as judged by BHR 14, 27. However, some children have interpreted the video sequence as being more severe than their experience of wheezing, by a positive response to the written question and a negative response to the video.
Lastly, there were respondents in every centre who responded negatively to the written questionnaire but positively to the video. This is illustrated by language group in figure 5
. Presumably these children were uncertain about the meaning of the written term wheezing but recognised the symptom when it was shown to them audiovisually.
Agreement by region
Mean values for kappa are similar between most regions, with the exception of North East Europe, but with considerable variation between centres within each region. Centres in North America, Western Europe and Oceania tended to show less variation than other regions. For Oceania, which comprises Australia and New Zealand, this likely reflects their common language, the common historical cultural heritage of the majority of the immigrant population and a longstanding high level of awareness of asthma as a major health problem in the community.
Agreement by language
Comparison of agreement by language groups shows more variation. Again, English and Russian showed the greatest difference. Mean scores for agreement in Spanish-speaking centres in Spain and Latin America were similar. However kappa varied from 0.27 to 0.44 amongst Spanish centres in Spain and to a greater extent in Spanish-speaking centres in Latin America, which included different countries and where kappa varied from 0.19 to 0.58. Chinese speakers showed lower levels of agreement than English or Spanish speakers, but agreement varied from 0.13 to 0.33. Russian speakers showed the lowest level of agreement ranging from 0.080.32.
By employing two questionnaires asking similar questions in different ways it is possible to examine their combined and separate contributions to wheezing prevalence in three distinct groups, a positive response to the written question only, positive response to both or to the video only.
Figure 5
examines the contribution of positive responses to each questionnaire separately and a combined response as a proportion of all wheezing. The written questionnaire was administered before the video in every centre. Thus for English and Spanish speakers almost half of the children (48% and 45%, respectively) who reported any wheezing on either questionnaire did not agree when they saw and heard wheezing in the video question. For Russian speakers this was 84% and Chinese 59%. These respondents would have included some children who did not understand what was meant by the term wheezing or considered that their own experience of wheezing was less severe than the video depiction. Conversely, only 10% of all Russian wheezing children and 38% of English-speaking wheezing children agreed when they saw the video. These children presumably interpreted the video as representing their understanding and experience of wheezing. These children may be a more severe wheezing group and the proportion of children in this group would then be a marker of asthma severity. A variable proportion of children, 6% of Russian-speaking wheezing children and 28% of Spanish, having responded negatively to the written question then responded positively to the video. These children presumably did not understand the written question but recognised the symptom when they saw and heard it. These children would not be included in a survey of wheezing prevalence based on written questions alone.
A large proportion of 1314-yr-old children who responded to a written question about wheezing, either did not understand the meaning of the term and its intended context or have wheezing that is not distinctly audible and is considered by them to be less severe than a video sequence depicting audible wheeze without breathlessness. Further study of children in this group is clearly warranted. Children who respond only to the video, after responding negatively to the written question, may represent a group with previously unrecognised symptoms who also did not understand the term wheezing but recognised it when they saw and heard it. The proportions of children in these groups varies considerably between centres and by language groups. Study of children separated by their responses to written and or video questionnaires would help clarify what exactly is being measured and how it relates to episodic airflow obstruction. Similar studies amongst adults would also be of interest.
The use of a video questionnaire, in addition to the traditional written format, raises questions about questionnaire interpretation, wheezing severity and unrecognised symptoms, which warrants clarification in order to better define asthma in population surveys. This present analysis of the two questions does not significantly affect the broad conclusions about international variations in reported wheezing prevalence between centres and regions in the International Study of Asthma and Allergies in Childhood, since the two questionnaires are highly correlated. It does, however, raise questions about exactly what is being measured by either questionnaire in different centres and, in turn, how these responses might relate to episodic airflow obstruction.
| Acknowledgements |
|---|
|
|
|---|
ISAAC Steering Committee: N. Aït-Khaled, G. Anabwani, H.R. Anderson, M.I. Asher, R. Beasley, B. Björkstén, M. Burr, J. Crane, P.E. Ellwood, U. Keil, C.K.W. Lai, J. Mallol, F.D. Martinez, E.A. Mitchell, S. Montefort, N. Pearce, C.F. Robertson, J.R. Shah, D. Strachan, A.W. Stewart, E. von Mutius, S.K. Weiland, H.C. Williams.
ISAAC International Data Centre: M.I. Asher (Director), T.O. Clayton, P.E. Ellwood, E.A. Mitchell, A.W. Stewart. Principal Investigators are listed in 26.
| References |
|---|
|
|
|---|
as a coefficient of reliability. Psychmetrica 1979;44:461472.[CrossRef]
This article has been cited by other articles:
![]() |
G. Netuveli, B. Hurwitz, and A. Sheikh Lineages of language and the diagnosis of asthma J R Soc Med, January 1, 2007; 100(1): 19 - 24. [Abstract] [Full Text] [PDF] |
||||
![]() |
M Nickmilder and A Bernard Ecological association between childhood asthma and availability of indoor chlorinated swimming pools in Europe Occup. Environ. Med., January 1, 2007; 64(1): 37 - 46. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Michel, M. Silverman, M-P. F. Strippoli, M. Zwahlen, A. M. Brooke, J. Grigg, and C. E. Kuehni Parental understanding of wheeze and its impact on asthma prevalence estimates Eur. Respir. J., December 1, 2006; 28(6): 1124 - 1130. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. O.M. Van De Ven, R. J.J.M. Van Den Eijnden, and R. C.M.E. Engels Atopic diseases and related risk factors among Dutch adolescents Eur J Public Health, October 1, 2006; 16(5): 549 - 558. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. C. Smeeton, R. J. Rona, M. Oyarzun, and P. V. Diaz Agreement between Responses to a Standardized Asthma Questionnaire and a Questionnaire following a Demonstration of Asthma Symptoms in Adults Am. J. Epidemiol., February 15, 2006; 163(4): 384 - 391. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. M. Hunninghake, S. T. Weiss, and J. C. Celedon Asthma in Hispanics Am. J. Respir. Crit. Care Med., January 15, 2006; 173(2): 143 - 163. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Nystad, E. Roysamb, P. Magnus, K. Tambs, and J. R Harris A comparison of genetic and environmental variance structures for asthma, hay fever and eczema with symptoms of the same diseases: a study of Norwegian twins Int. J. Epidemiol., December 1, 2005; 34(6): 1302 - 1309. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |