which is a psychometric method. Based on the results,
we then conduct a cluster analysis of sample videos
and classify the videos into a number of groups. In the
present paper, our goal is to clarify the effects of the
parameters on the subjective video quality for each
of the respective groups. We therefore analyze these
main factors. Here, we define the subjective video
quality as the level of user satisfaction with respect to
video quality.
3.1 Subjective Assessment
3.1.1 Experimental Methods
In this section, we investigate the degree of the ef-
fects of the quality parameters on subjective video
quality. Several evaluation methods of user satisfac-
tion have been proposed, including a subjective qual-
ity estimation method using neural networks(Lin and
Mersereau, 1999). However, the evaluation accuracy
of audio and video quality is low because determina-
tion of the absolute degree of audio or video quality
is difficult. In order to obtain reliable results for such
items, a method of paired comparisons can be used.
Using this method, test subjects can easily evaluate
video quality. Therefore, the present paper applies the
method for subjective assessment.
Methods of paired comparisons can be classi-
fied into two types. The first type, which includes
Beadley’s method and Thurstone’s method, expresses
the comparison results by ranking. The second type is
Scheffe’s method, in which the results are expressed
as scores. In this present paper, we express the com-
parison results as scores and analyze the results by
assuming various structures of these scores, i.e. we
adopt Scheffe’s method. In addition, we use Nakaya’s
modified method(JUSE, 1973), in which each test
subject is assigned all combinations of trials.
The experimental procedure is described in detail
below. We provided 30 sample videos having vari-
ous program features, i.e., various themes, styles and
content, for example. A list of these sample videos
is shown in Table 2. We selected the sample videos
to reflect various viewing purposes in order to ana-
lyze the relationship between the video content and
subjective video quality. The video features of these
sample videos are described by the feature descrip-
tion parameters defined in Section 2 These parameter
values are also listed in Table 2. Motion intensity is
represented on a scale from 1 to 5 (in which larger
numbers indicate more intensive motion). The num-
ber of objects is represented on a scale from 1 to 3
(in which larger numbers indicate a greater number
of objects). Finally, the importance of semantic para-
meters is represented on a scale from 1 to 3 (in which
larger numbers indicate higher importance).
Table 3: Quality parameter values for the experiment.
Quality Rate Size
Original 100 30 fps 640×480
Slight degradation
27 10 fps 384×256
Medium degradation
15 3 fps 192×128
Severe degradation
10 1 fps 96×94
Next, we prepared 10 test videos for the exper-
iment by changing the three quality parameters in
three grades. In general, degradation appearances of
digital video quality by the same factor (e.g. packet
loss) differ among different codec types and it is nec-
essary to consider these effects on subjective evalua-
tion values. In the present paper, in order to eliminate
these effects, we prepared the video under the con-
dition that sufficient system resources exist. Quality
parameter values for the experiment are listed in Ta-
ble 3. In the table, video quality is denoted as image
degradation ratios.
The number of test video is 10 for each sample
program. In this experiment, we made pairs of test
videos for each program and had test subjects evaluate
which video was preferable and the degree of prefer-
ability with respect to subjective video quality. The
subjects rated 45 pairs, such as a video with slight
degradation of frame size and a video with a medium
degradation of image quality, on a scale raging from
−3 to 3 (i.e. seven-grade) for each program (See Ta-
ble 4). Lower grades denote lower subjective evalua-
tion values. Each test video was made in advance and
was watched by the subjects for approximately twenty
seconds on a 19-inch LCD monitor connected to a PC.
We determined the experimental conditions based on
ITU-R Recommendation BT.500(ITU-R, 2000). The
distance between the test subject and the monitor was
60 cm, and the number of light sources in the experi-
mental room was one. The measured intensity of illu-
mination was 20 lux at the viewing point. The experi-
mental procedure for comparing video streams, which
consisted of watching and assessing streams, is shown
in Fig. 1. In this figure, A and B compared video
streams, and gray images are displayed for one sec-
ond between A and B and for three seconds at the end
of the sequence.
In this case, we did not consider the comparison or-
der and asked all subjects to compare ordered pairs
once, because spatial comparisons, for example of
colors and shapes, were performed in this experiment,
and afterimages may not have existed. The number of
subjects was six (five male and one female), and 9,450
sets of data were obtained.
3.1.2 Experimental Results
This evaluation results can be summarized in a table
having 10 degradation parameters as the vertical and
horizontal rows, and the grades as its elements. An
SIGMAP 2006 - INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA
APPLICATIONS
202