Abstract
The development of an integrated, internet-based data visualization system for health examination
survey data extends the presentation and utilization of data from the 4th national health examination
survey (2008-2009). It increases the variety of variables available in the system to meet wider user
demand and makes the data more accessible via the internet.
Development began with a review of the variables collected in the survey for adults aged
>= 15 years and children aged 1-14 years, followed by the selection and grouping of variables to
prepare the datasets used in the subsequent steps. The design of data presentation and visualization comprised 3
models. The first model presents outcome measures (proportions, multinomial proportions, and
means) broken down by 1-2 categorical variables, which can be either characteristic or
outcome data. The second model presents the individual-level correlation pattern between
a pair of continuous variables, which can be either characteristic or outcome data; the display can be
filtered by one categorical variable and disaggregated by another. The third model
presents the distribution of the sample across 1-2 categorical variables, which can be either
characteristic or outcome data. The number of outcome measures for the first model is 103 in 21
groups for adults and 30 in 10 groups for children. The number of categorical variables for all models is
106 in 27 groups for adults and 63 in 19 groups for children. The number of continuous variables for the
second model is 35 in 12 groups for adults and 23 in 8 groups for children. The next step was data
processing for each data presentation model. This process produced the summary data or corresponding
data required by each model. Finally, the web-based data visualization was designed and developed using
appropriate data visualization tools. Users can select variables to generate graphical
presentations on demand and can export each graph as an image file and an Excel table for further use.
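The data processing step for the first two models can be illustrated with a minimal sketch in Python.
It is not the system's implementation: the records, the field names (sex, region, hypertension, sbp,
bmi) and the function names are hypothetical. The estimates are unweighted and ignore any survey
weights a real analysis might need. Multinomial proportions are not covered.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical records: field names and values are illustrative, not survey data.
records = [
    {"sex": "male", "region": "north", "hypertension": 1, "sbp": 140.0, "bmi": 27.0},
    {"sex": "male", "region": "north", "hypertension": 0, "sbp": 120.0, "bmi": 23.0},
    {"sex": "male", "region": "south", "hypertension": 0, "sbp": 118.0, "bmi": 22.0},
    {"sex": "female", "region": "north", "hypertension": 1, "sbp": 150.0, "bmi": 30.0},
    {"sex": "female", "region": "south", "hypertension": 0, "sbp": 110.0, "bmi": 21.0},
    {"sex": "female", "region": "south", "hypertension": 1, "sbp": 135.0, "bmi": 26.0},
]


def group_by(rows, keys):
    """Group rows by the tuple of values of the given categorical variables."""
    groups = defaultdict(list)
    for row in rows:
        groups[tuple(row[k] for k in keys)].append(row)
    return dict(groups)


def mean_by(rows, outcome, keys):
    """Model 1 sketch: unweighted mean of an outcome by 1-2 categorical variables.

    For a 0/1 outcome the mean is the proportion of rows with the outcome.
    """
    return {
        group: sum(r[outcome] for r in members) / len(members)
        for group, members in group_by(rows, keys).items()
    }


def pearson(xs, ys):
    """Pearson correlation; None with fewer than 2 points or zero variance."""
    n = len(xs)
    if n < 2:
        return None
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return None
    return sxy / sqrt(sxx * syy)


def correlation_by(rows, x, y, filter_var, filter_value, split_var):
    """Model 2 sketch: filter by one categorical variable, split by another."""
    kept = [r for r in rows if r[filter_var] == filter_value]
    return {
        group[0]: pearson([r[x] for r in members], [r[y] for r in members])
        for group, members in group_by(kept, [split_var]).items()
    }


proportion_by_sex = mean_by(records, "hypertension", ["sex"])
mean_sbp_by_sex_region = mean_by(records, "sbp", ["sex", "region"])
bmi_sbp_corr_males = correlation_by(records, "bmi", "sbp", "sex", "male", "region")
```

Each function returns a small dictionary keyed by category, which is the kind of summary table a
charting tool could plot directly. Groups too small to yield a correlation are reported as None.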
The development of data presentation and visualization for the 4th national health
examination survey data showed that the system can be extended to the 5th national health examination
survey data, with one additional model that compares the 4th and 5th survey data to show
trends over time. Moreover, given the variety of outcome measures and variables included in the system,
it can support various data utilization objectives, including targeting populations and
areas, monitoring disparities in health and health care among population groups, exploring relationships
among risks and between risks and outcomes, and generating new research questions and hypotheses.