Figure 4 shows how many web pages have their ob-
jects scattered on at least n hosts. One notable point
is that more than half of the pages are scattered on at
least three hosts, while more than 10 % are on at least
seven hosts. We have not tried to estimate what im-
pact the use of multiple hosts can precisely have on
the perceived quality, but we’re really akin to think
that more than a few hosts can’t do any good.
Web page content is thus generally spread among
several hosts. We now try to find out how well this
content is spread. We first define as “the biggest host”
for each site the host serving the biggest part (in bytes)
of the page. We define its weight as the percentage of
the page it owns. Figure 5 shows what the weight of
the biggest host is for sites with 2 to 5 hosts.
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
weight of biggest host (bytes)
sites
2 hosts
3 hosts
4 hosts
5 hosts
Figure 5: Weight of biggest host (in bytes).
It seems web page content is not well balanced
among hosts for most sites. For example, the biggest
host owns more than 80 % of the content for 35 % of
the three hosts sites.
We don’t show them here for lack of space, but
roughly similar (slightly better) curves are obtained
if we find the biggest host and its weight based on the
number of files rather than the number of bytes.
5 CONCLUSION
As a conclusion, we have found that most web pages
have lots of embedded objects, and that the mean size
of such embedded objects is rather small. This seems
to be mainly because of small pictures used exten-
sively in the web pages. Because of the way HTTP
has been designed, such small objects can have signif-
icant negative effects on the time necessary to down-
load the whole web page. Features such as persistent
connections and request pipelining have been intro-
duced in HTTP to mitigate these negative effects, but
we found that half of the studied sites do not support
them, which is rather interesting.
Finally, our results show that many sites use objects
scattered on a large number of hosts. This again can
significantly impact performance, mainly by forcing
clients to perform many DNS resolutions. Moreover,
for most sites the content is not well balanced among
hosts.
Another issue we have not considered so far is the
effect of server load on TCP persistent connections
behavior. As we said, idle connections go in the slow
start phase when transmission restarts. Thus, a slight
overload on the server, by introducing small delays
between responses, may result in much more delays
as perceived by the client.
Finally, two points that seem worth investigating
are whether correct use of style-sheets can reduce the
number of small pictures in the examined web sites,
and why persistent connections are so unpopular.
REFERENCES
Alexa (2005). web search. Web site. www.alexa.com.
Allman, M., Paxson, V., and Stevens, W. (1999). TCP con-
gestion control. RFC 2581, IETF.
Deleuze, C. (2004). Content networks. Inter-
net Protocol Journal, 7(2):2–11. Available at
http://www.cisco.com/ipj.
Fielding, R., Gettys, J., Mogul, J., Frystyk, H., Masinter,
L., Leach, P., and Berners-Lee, T. (1999). Hypertext
transfer protocol – HTTP/1.1.
Jacobson, V. (1995). How to kill the internet. In SIGCOMM
95 middleware workshop.
Krishnamurthy, B., Mogul, J. C., and Kristol, D. M. (1999).
Key differences between HTTP/1.0 and HTTP/1.1. In
Proceedings of the WWW-8 Conference, Toronto.
Mogul, J. C. (1999). What’s wrong with HTTP (and why it
doesn’t matter). In Proceedings of the 1999 USENIX
Annual Technical Conference, Monterey, CA.
Nielsen, H. F. and al. (1997). Network performance ef-
fects of HTTP/1.1, CSS1, and PNG. Technical Report
NOTE-pipelining-970624, W3C.
Padmanabhan, V. N. and Mogul, J. (1994). Improving
HTTP latency. In Proceedings of the 2nd World Wide
Web Conference, pages 995–1005, Chicago, IL.
W3C (2001). Final HTTP-NG activity statement.
Web page. http://www.w3.org/Protocols/HTTP-
NG/Activity.html.
ZDNet (2005). Impots en ligne : Bercy ac-
corde un délai supplémentaire. Web page.
http://www.zdnet.fr/actualites/internet/
0,39020774,39216771,00.htm.
SOME POINTS AFFECTING WEB PERFORMANCE
245