such an open market setting, the proposed RTRM
matches the resource requests from the buyers to
providers using a double-auction paradigm. Specif-
ically, RTRM implements a multi-agent environment
that optimises the offered selling prices for all the in-
dependent providers based on the online pricing algo-
rithm. On the other hand, RTRM implements a fair
matching algorithm to dynamically match the buy-
ers’ resource demands in the open market. In this re-
gard, the proposed approach enables both participants
to maximise their utilities and the participation rate.
Besides, the proposed mechanism enhances resource
utilisation to minimise the bidder drop problem based
on the novel fairness mechanism. The experimental
results evaluate the efficiency of the RTRM by com-
paring the utilities of both the participants. In the fu-
ture, we aim to develop a mechanism that encourages
cooperative behaviour in such a competitive market
by designing a resource sharing mechanism among
the different providers to fulfil resource requests.
REFERENCES
Buchbinder, N., Jain, K., and Naor, J. S. (2007). Online
primal-dual algorithms for maximizing ad-auctions
revenue. In European Symposium on Algorithms,
pages 253–264. Springer.
Cai, H., Ren, K., Zhang, W., Malialis, K., Wang, J., Yu, Y.,
and Guo, D. (2017). Real-time bidding by reinforce-
ment learning in display advertising. In Proceedings
of the Tenth ACM International Conference on Web
Search and Data Mining, pages 661–670. ACM.
Charpentier, A., Elie, R., and Remlinger, C. (2021). Rein-
forcement learning in economics and finance. Com-
putational Economics, pages 1–38.
Chen, Y., Ganapathi, A. S., Griffith, R., and Katz, R. H.
(2010). Analysis and lessons from a publicly available
google cluster trace. EECS Department, University of
California, Berkeley, Tech. Rep. UCB/EECS-2010-95,
94.
Fink, A. M. (1964). Equilibrium in a stochastic n-person
game. Journal of science of the hiroshima university,
series ai (mathematics), 28(1):89–93.
Hassanzadeh, R., Movaghar, A., and Hassanzadeh, H. R.
(2016). A multi-dimensional fairness combinatorial
double-sided auction model in cloud environment. In
2016 8th International Symposium on Telecommuni-
cations (IST), pages 672–677. IEEE.
Hu, J., Wellman, M. P., et al. (1998). Multiagent rein-
forcement learning: theoretical framework and an al-
gorithm. In ICML, volume 98, pages 242–250. Cite-
seer.
Konda, V. R. and Tsitsiklis, J. N. (2000). Actor-critic algo-
rithms. In Advances in neural information processing
systems, pages 1008–1014.
Kong, Y., Zhang, M., and Ye, D. (2015). An auction-based
approach for group task allocation in an open network
environment. The Computer Journal, 59(3):403–422.
Krishna, V. (2009). Auction theory. Academic press.
Kumar, D., Baranwal, G., Raza, Z., and Vidyarthi, D. P.
(2019). Fair mechanisms for combinatorial reverse
auction-based cloud market. In Information and Com-
munication Technology for Intelligent Systems, pages
267–277. Springer.
Lee, C., Wang, P., and Niyato, D. (2013). A real-time group
auction system for efficient allocation of cloud inter-
net applications. IEEE Transactions on Services Com-
puting, 8(2):251–268.
Li, X., Ding, R., Liu, X., Liu, X., Zhu, E., and Zhong, Y.
(2016). A dynamic pricing reverse auction-based re-
source allocation mechanism in cloud workflow sys-
tems. Scientific Programming, 2016:17.
Murillo, J., Mu
˜
noz, V., L
´
opez, B., and Busquets, D. (2008).
A fair mechanism for recurrent multi-unit auctions. In
German Conference on Multiagent System Technolo-
gies, pages 147–158. Springer.
Myerson, R. B. (1981). Optimal auction design. Mathemat-
ics of operations research, 6(1):58–73.
Prasad, G. V., Prasad, A. S., and Rao, S. (2016). A combi-
natorial auction mechanism for multiple resource pro-
curement in cloud computing. IEEE Transactions on
Cloud Computing, 6(4):904–914.
Samimi, P., Teimouri, Y., and Mukhtar, M. (2016). A com-
binatorial double auction resource allocation model
in cloud computing. Information Sciences, 357:201–
216.
Shen, W., Feng, Y., and Lopes, C. V. (2019). Multi-winner
contests for strategic diffusion in social networks. In
Proceedings of the AAAI Conference on Artificial In-
telligence, volume 33, pages 6154–6162.
Skaperdas, S. (1996). Contest success functions. Economic
theory, 7(2):283–290.
Sutton, R. S., Barto, A. G., et al. (1998). Introduction to
reinforcement learning, volume 2. MIT press Cam-
bridge.
Toosi, A. N., Vanmechelen, K., Khodadadi, F., and Buyya,
R. (2016). An auction mechanism for cloud spot mar-
kets. ACM Transactions on Autonomous and Adaptive
Systems (TAAS), 11(1):2.
Wilkes, J. (2011). More Google cluster data. Google re-
search blog. Posted at http://googleresearch.blogspot.
com/2011/11/more-google-cluster-data.html.
Yuan, S., Wang, J., and Zhao, X. (2013). Real-time bidding
for online advertising: measurement and analysis. In
Proceedings of the Seventh International Workshop on
Data Mining for Online Advertising, page 3. ACM.
Zaman, S. and Grosu, D. (2013). Combinatorial auction-
based allocation of virtual machine instances in
clouds. Journal of Parallel and Distributed Comput-
ing, 73(4):495–508.
Zeng, L., Benatallah, B., Dumas, M., Kalagnanam, J., and
Sheng, Q. Z. (2003). Quality driven web services
composition. In Proceedings of the 12th interna-
tional conference on World Wide Web, pages 411–421.
ACM.
Reinforcement Learning-based Real-time Fair Online Resource Matching
41