Table 1: Computation times of different collaborative filtering algorithms.
Items 100 150 1000
Users FRAC CorrCF Item-Item FRAC CorrCF Item-Item FRAC CorrCF Item-Item
400 1”84 6”09 3”87 2”09 7”62 5”29 8”58 32”24 1’22”
800 7”03 19”98 7”23 7”34 25”67 10”53 30”17 1’52” 2’33”
1.400 11”21 1’00” 11”50 12”81 1’17” 18”10 49”47 6’04” 4’29”
10.000 6’50” 7h30’ 1’22” 9’12” - 2’05” 14’22” - 49’28”
can compute similarities. On the contrary, our algo-
rithm needs less votes and recomputations because
correlations are made between users.
Of course, the more the offline computations of our
algorithm take time, the more it can augur for slight
differences between the updated votes and the prefer-
ences really taken into account during the clustering
process. But these differences should be minimal be-
cause of the great number of users.
5 CONCLUSION AND
PERSPECTIVES
The novelty of our model relies on the fact that
we have mixed a distributed collaborative filtering
method with a behavior modeling technique. The
main advantage of this combination is to take into ac-
count in an overall way the strong constraints due to
an industrial context such as the privacy of users and
the sparsity of the matrix of votes. Moreover, thanks
to the new version of the clustering algorithm, we use
the matrix of votes to divide all the users into com-
munities. This new version has been especially de-
signed to treat a high quantity of information (Castag-
nos et al., 2005) and allows the scalability to real com-
mercial applications by dealing with time constraints.
We have implemented our architecture in a satellite
broadcasting software with about 120.000 users in or-
der to highlight the benefits of such a system.
We are now considering the possibilities to com-
bine our model with additional item-based filters in
order to sort items in increasing order of importance
for the active user on client side. In particular, we are
studying the added value of bayesian networks and
content-based filtering techniques in our architecture.
ACKNOWLEDGEMENTS
We want to thank SES ASTRA and the CRPHT which
have encouraged this work.
REFERENCES
Bradley, P. S. and Fayyad, U. M. (1998). Refining initial
points for k-means clustering. In Proceedings of the
15th International Conference on Machine Learning
(ICML98), pages 91–99, San Francisco, USA. Mor-
gan Kaufmann.
Breese, J. S., Heckerman, D., and Kadie, C. (1998). Em-
pirical analysis of predictive algorithms for collabo-
rative filtering. In Proceedings of the fourteenth An-
nual Conference on Uncertainty in Artificial Intelli-
gence (UAI-98), pages 43–52, San Francisco, CA.
Castagnos, S., Boyer, A., and Charpillet, F. (2005). A
distributed information filtering: Stakes and solution
for satellite broadcasting. In Proceedings 2005 Int.
Conf. on Information Systems and Technologies (We-
bIST05), Miami, USA.
Chan, P. (1999). A non-invasive learning approach to build-
ing web user profiles. In Workshop on Web usage
analysis and user profiling, Fifth International Con-
ference on Knowledge Discovery and Data Mining,
San Diego.
Chee, S. H. S., Han, J., and Wang, K. (2001). Rectree : An
efficient collaborative filtering method’. In Proceed-
ings 2001 Int. Conf. on Data Warehouse and Knowl-
edge Discovery (DaWaK’01), Munich, Germany.
Miller, B. N., Konstan, J. A., and Riedl, J. (2004). Pock-
etlens: Toward a personal recommender system.
In ACM Transactions on Information Systems, vol-
ume 22, pages 437–476.
Pennock, D. M., Horvitz, E., Lawrence, S., and Giles, C. L.
(2000). Collaborative filtering by personality diag-
nosis: a hybrid memory- and model-based approach.
In Proceedings of the sixteenth Conference on Uncer-
tainty in Artificial Intelligence (UAI-2000), San Fran-
cisco, USA. Morgan Kaufmann Publishers.
Resnick, P., Iacovou, N., Suchak, M., Bergstorm, P., and
Riedl, J. (1994). Grouplens: An open architecture for
collaborative filtering of netnews. In Proceedings of
ACM 1994 Conference on Computer Supported Co-
operative Work, pages 175–186, Chapel Hill, North
Carolina. ACM.
Sarwar, B. M., Karypis, G., Konstan, J. A., and Reidl, J.
(2001). Item-based collaborative filtering recommen-
dation algorithms. In World Wide Web, pages 285–
295.
WEBIST 2006 - WEB INTERFACES AND APPLICATIONS
440