87.69%, the second-best accuracy after the HPTGBC
classifier (88.16%).
The models’ accuracy is better than the
PCA+SVM (84.92%), GBC (86.92%), and
XGBOOST (87.53%).
The DBLOSUM in Negative Selection classifier
appears to be best suited to classification tasks where
false negatives pose a major risk, such as in medical
screening and diagnosis. It is more likely than most
techniques to result in false positives, but it is as
accurate, if not more accurate than most other
techniques.
ACKNOWLEDGEMENTS
This work was supported in part by a bursary from the
Prisoners of Conscience Appeal Fund and in part by
a scholarship from Heriot-Watt University, UK.
We thank the Department of Computer Science at
the University of Wisconsin - Madison, USA, for the
opportunity to attend talks, seminars, and discussions
in the field of AI and ML.
The Breast Cancer database was obtained from
the University of Wisconsin Hospitals, Madison,
from Dr William H. Wolberg. The Pima Indians
Diabetes and Adult Income datasets can be accessed
from the Machine Learning Repository at the
University of California, Irvine.
REFERENCES
Chakrabarty, N. and Biswas, S. (2018). A statistical
approach to adult census income level prediction. In
2018 International Conference on Advances in
Computing, Communication Control and Networking
(ICACCCN), pages 207–212. IEEE.
Dasgupta, D. (2006). Advances in artificial immune
systems. In IEEE computational intelligence magazine.
IEEE.
de Castro, L. N. and Timmis, J. (2002). Artificial immune
systems: A novel approach to pattern recognition. In
Artificial Neural Network in Pattern Recognition.
University of Paisley.
Dua, D. and Graff, C. (2019). UCI machine learning repos-
itory
Ed-daoudy, A. and Maalmi, K. (2020). Breast cancer
classification with reduced feature set using association
rules and support vector machine. In Network Modeling
Analysis in Health Informatics and Bioinformatics,
volume 9, pages 1–10. Springer.
Eddy, S. R. (2004). Where did the blosum62 alignment
score matrix come from? In Nature biotechnology,
volume 22, pages 1035–1036. Nature Publishing
Group.
Forrest, S., Perelson, A., Allen, L., and Cherukuri, R.
(1994). Self-nonself discrimination in a computer. In
Proceedings of 1994 IEEE Computer Society
Symposium on Research in Security and Privacy. IEEE
Comput. Soc. Press.
Freitas, A. and Timmis, J. (2007). Revisiting the
foundations of artificial immune systems for data
mining. In IEEE transactions on evolutionary
computation. IEEE.
Hamaker, J. and Boggess, L. (2004). Non-euclidean
distance measures in airs, an artificial immune
classification system. In Proceedings of the 2004
Congress on Evolutionary Computation (IEEE
Cat.No.04TH8753). IEEE.
Henikoff, S. and Henikoff, J. G. (1992). Amino acid
substitution matrices from protein blocks. In
Proceedings of the National Academy of Sciences -
PNAS, volume 89, pages 10915–10919,
WASHINGTON. National Academy of Sciences of the
United States of America.
Mangasarian, O. L., Setiono, R., and Wolberg, W. H.
(1990). Pattern recognition via linear programming:
Theory and application to medical diagnosis. In Large-
Scale Numerical Optimization, pages 22–30. SIAM
Publications.
Mangasarian, O. L. and Wolberg, W. H. (1990). Cancer
diagnosis via linear programming. Technical report,
University of Wisconsin-Madison Department of
Computer Sciences.
Pearson, W. R. (2013). Selecting the right similarity-
scoring matrix. In Current protocols in bioinformatics,
volume 43, pages 3–5. Wiley Online Library.
Qiu, T., Xiao, H., Qingchen, Z., Qiu, J., Yang, Y., Wu, D.,
Cao, Z., and Zhu, R. (2015). Proteochemo-metric
modeling of the antigen-antibody interaction: New
fingerprints for antigen, antibody and epitope-paratope
interaction. In PloS one. doi = 10.1371/jour-
nal.pone.0122416.
Raudys, S., Arasimavicius, J., and Biziuleviciene, G.
(2010). Learning the affinity measure in real and
artificial immune systems. In 2010 3rd International
Conference on Biomedical Engineering and
Informatics. IEEE.
Timmis, J. (2006). Challenges for artificial immune
systems. In Neural Nets. Springer Berlin Heidelberg.